Overview

Dataset statistics

Number of variables5
Number of observations931
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory37.4 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description인천광역시 폐의약품 수거함 현황 데이터로(연번,관할,기관명칭,소재지(도로명주소),세부위치)등의 항목에 대한 정보를 제공합니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15091229/fileData.do

Alerts

연번 is highly overall correlated with 관할 and 1 other fieldsHigh correlation
관할 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
세부위치 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
세부위치 is highly imbalanced (53.0%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:10:03.801228
Analysis finished2024-04-06 08:10:05.185225
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct931
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean466
Minimum1
Maximum931
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.3 KiB
2024-04-06T17:10:05.327955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile47.5
Q1233.5
median466
Q3698.5
95-th percentile884.5
Maximum931
Range930
Interquartile range (IQR)465

Descriptive statistics

Standard deviation268.90085
Coefficient of variation (CV)0.57704045
Kurtosis-1.2
Mean466
Median Absolute Deviation (MAD)233
Skewness0
Sum433846
Variance72307.667
MonotonicityStrictly increasing
2024-04-06T17:10:05.593610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
627 1
 
0.1%
615 1
 
0.1%
616 1
 
0.1%
617 1
 
0.1%
618 1
 
0.1%
619 1
 
0.1%
620 1
 
0.1%
621 1
 
0.1%
622 1
 
0.1%
Other values (921) 921
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
931 1
0.1%
930 1
0.1%
929 1
0.1%
928 1
0.1%
927 1
0.1%
926 1
0.1%
925 1
0.1%
924 1
0.1%
923 1
0.1%
922 1
0.1%

관할
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
부평구
265 
서구
218 
미추홀구
195 
중구
83 
연수구
55 
Other values (5)
115 

Length

Max length4
Median length3
Mean length2.8721805
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
부평구 265
28.5%
서구 218
23.4%
미추홀구 195
20.9%
중구 83
 
8.9%
연수구 55
 
5.9%
강화군 52
 
5.6%
계양구 27
 
2.9%
남동구 21
 
2.3%
동구 13
 
1.4%
옹진군 2
 
0.2%

Length

2024-04-06T17:10:05.850280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:10:06.089952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부평구 265
28.5%
서구 218
23.4%
미추홀구 195
20.9%
중구 83
 
8.9%
연수구 55
 
5.9%
강화군 52
 
5.6%
계양구 27
 
2.9%
남동구 21
 
2.3%
동구 13
 
1.4%
옹진군 2
 
0.2%
Distinct828
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2024-04-06T17:10:06.550946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length6.1385607
Min length3

Characters and Unicode

Total characters5715
Distinct characters347
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique753 ?
Unique (%)80.9%

Sample

1st row중구보건소
2nd row국제도시보건과
3rd row용유보건지소
4th row무의보건진료소
5th row중구1청사 민원지적과
ValueCountFrequency (%)
행정복지센터 83
 
8.0%
정서진 6
 
0.6%
새마을금고 6
 
0.6%
행복한약국 4
 
0.4%
미소약국 4
 
0.4%
중앙약국 4
 
0.4%
천사약국 4
 
0.4%
희망약국 4
 
0.4%
백제약국 4
 
0.4%
굿모닝약국 4
 
0.4%
Other values (830) 917
88.2%
2024-04-06T17:10:07.328550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
743
 
13.0%
743
 
13.0%
170
 
3.0%
166
 
2.9%
160
 
2.8%
136
 
2.4%
134
 
2.3%
134
 
2.3%
134
 
2.3%
110
 
1.9%
Other values (337) 3085
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5437
95.1%
Decimal Number 135
 
2.4%
Space Separator 110
 
1.9%
Uppercase Letter 11
 
0.2%
Lowercase Letter 9
 
0.2%
Close Punctuation 4
 
0.1%
Open Punctuation 4
 
0.1%
Other Punctuation 3
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
743
 
13.7%
743
 
13.7%
170
 
3.1%
166
 
3.1%
160
 
2.9%
136
 
2.5%
134
 
2.5%
134
 
2.5%
134
 
2.5%
94
 
1.7%
Other values (307) 2823
51.9%
Decimal Number
ValueCountFrequency (%)
1 33
24.4%
2 32
23.7%
3 28
20.7%
5 14
10.4%
6 13
 
9.6%
4 11
 
8.1%
0 2
 
1.5%
7 1
 
0.7%
8 1
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
S 2
18.2%
K 2
18.2%
H 1
9.1%
D 1
9.1%
Y 1
9.1%
W 1
9.1%
V 1
9.1%
I 1
9.1%
P 1
9.1%
Lowercase Letter
ValueCountFrequency (%)
e 3
33.3%
n 1
 
11.1%
w 1
 
11.1%
o 1
 
11.1%
r 1
 
11.1%
s 1
 
11.1%
t 1
 
11.1%
Space Separator
ValueCountFrequency (%)
110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5437
95.1%
Common 258
 
4.5%
Latin 20
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
743
 
13.7%
743
 
13.7%
170
 
3.1%
166
 
3.1%
160
 
2.9%
136
 
2.5%
134
 
2.5%
134
 
2.5%
134
 
2.5%
94
 
1.7%
Other values (307) 2823
51.9%
Latin
ValueCountFrequency (%)
e 3
15.0%
S 2
 
10.0%
K 2
 
10.0%
H 1
 
5.0%
D 1
 
5.0%
Y 1
 
5.0%
W 1
 
5.0%
n 1
 
5.0%
w 1
 
5.0%
V 1
 
5.0%
Other values (6) 6
30.0%
Common
ValueCountFrequency (%)
110
42.6%
1 33
 
12.8%
2 32
 
12.4%
3 28
 
10.9%
5 14
 
5.4%
6 13
 
5.0%
4 11
 
4.3%
) 4
 
1.6%
( 4
 
1.6%
. 3
 
1.2%
Other values (4) 6
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5437
95.1%
ASCII 278
 
4.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
743
 
13.7%
743
 
13.7%
170
 
3.1%
166
 
3.1%
160
 
2.9%
136
 
2.5%
134
 
2.5%
134
 
2.5%
134
 
2.5%
94
 
1.7%
Other values (307) 2823
51.9%
ASCII
ValueCountFrequency (%)
110
39.6%
1 33
 
11.9%
2 32
 
11.5%
3 28
 
10.1%
5 14
 
5.0%
6 13
 
4.7%
4 11
 
4.0%
) 4
 
1.4%
( 4
 
1.4%
e 3
 
1.1%
Other values (20) 26
 
9.4%
Distinct925
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
2024-04-06T17:10:07.887673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length47
Mean length28.306122
Min length9

Characters and Unicode

Total characters26353
Distinct characters393
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique919 ?
Unique (%)98.7%

Sample

1st row인천광역시 중구 참외전로72번길 21(전동, 중구보건소)
2nd row인천광역시 중구 운남서로 100-1
3rd row인천광역시 중구 마시란로 308-13
4th row인천광역시 중구 대무의로 310-11(무의동 246-7)
5th row인천광역시 중구 신포로27번길 80
ValueCountFrequency (%)
인천광역시 896
 
16.9%
부평구 264
 
5.0%
서구 218
 
4.1%
미추홀구 197
 
3.7%
1층 155
 
2.9%
부평동 98
 
1.9%
중구 83
 
1.6%
주안동 78
 
1.5%
연수구 55
 
1.0%
강화군 52
 
1.0%
Other values (1338) 3197
60.4%
2024-04-06T17:10:08.687001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4387
 
16.6%
1 1072
 
4.1%
1023
 
3.9%
962
 
3.7%
927
 
3.5%
924
 
3.5%
907
 
3.4%
902
 
3.4%
900
 
3.4%
886
 
3.4%
Other values (383) 13463
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15578
59.1%
Space Separator 4387
 
16.6%
Decimal Number 4037
 
15.3%
Open Punctuation 789
 
3.0%
Close Punctuation 788
 
3.0%
Other Punctuation 599
 
2.3%
Dash Punctuation 109
 
0.4%
Uppercase Letter 44
 
0.2%
Lowercase Letter 17
 
0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1023
 
6.6%
962
 
6.2%
927
 
6.0%
924
 
5.9%
907
 
5.8%
902
 
5.8%
900
 
5.8%
886
 
5.7%
526
 
3.4%
426
 
2.7%
Other values (338) 7195
46.2%
Uppercase Letter
ValueCountFrequency (%)
S 6
13.6%
B 5
11.4%
E 4
9.1%
M 4
9.1%
A 4
9.1%
K 3
 
6.8%
I 3
 
6.8%
L 2
 
4.5%
C 2
 
4.5%
Y 2
 
4.5%
Other values (7) 9
20.5%
Decimal Number
ValueCountFrequency (%)
1 1072
26.6%
2 473
11.7%
0 450
11.1%
3 433
10.7%
4 375
 
9.3%
5 280
 
6.9%
6 260
 
6.4%
7 251
 
6.2%
8 243
 
6.0%
9 200
 
5.0%
Lowercase Letter
ValueCountFrequency (%)
e 5
29.4%
s 2
 
11.8%
d 2
 
11.8%
a 2
 
11.8%
r 2
 
11.8%
y 1
 
5.9%
t 1
 
5.9%
i 1
 
5.9%
c 1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 595
99.3%
' 2
 
0.3%
/ 1
 
0.2%
· 1
 
0.2%
Space Separator
ValueCountFrequency (%)
4387
100.0%
Open Punctuation
ValueCountFrequency (%)
( 789
100.0%
Close Punctuation
ValueCountFrequency (%)
) 788
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 109
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15578
59.1%
Common 10714
40.7%
Latin 61
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1023
 
6.6%
962
 
6.2%
927
 
6.0%
924
 
5.9%
907
 
5.8%
902
 
5.8%
900
 
5.8%
886
 
5.7%
526
 
3.4%
426
 
2.7%
Other values (338) 7195
46.2%
Latin
ValueCountFrequency (%)
S 6
 
9.8%
e 5
 
8.2%
B 5
 
8.2%
E 4
 
6.6%
M 4
 
6.6%
A 4
 
6.6%
K 3
 
4.9%
I 3
 
4.9%
s 2
 
3.3%
d 2
 
3.3%
Other values (16) 23
37.7%
Common
ValueCountFrequency (%)
4387
40.9%
1 1072
 
10.0%
( 789
 
7.4%
) 788
 
7.4%
, 595
 
5.6%
2 473
 
4.4%
0 450
 
4.2%
3 433
 
4.0%
4 375
 
3.5%
5 280
 
2.6%
Other values (9) 1072
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15578
59.1%
ASCII 10774
40.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4387
40.7%
1 1072
 
9.9%
( 789
 
7.3%
) 788
 
7.3%
, 595
 
5.5%
2 473
 
4.4%
0 450
 
4.2%
3 433
 
4.0%
4 375
 
3.5%
5 280
 
2.6%
Other values (34) 1132
 
10.5%
Hangul
ValueCountFrequency (%)
1023
 
6.6%
962
 
6.2%
927
 
6.0%
924
 
5.9%
907
 
5.8%
902
 
5.8%
900
 
5.8%
886
 
5.7%
526
 
3.4%
426
 
2.7%
Other values (338) 7195
46.2%
None
ValueCountFrequency (%)
· 1
100.0%

세부위치
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct44
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size7.4 KiB
약국 내
485 
약국 내부
179 
약국
68 
센터 내
 
46
대기실
 
26
Other values (39)
127 

Length

Max length24
Median length4
Mean length4.7035446
Min length2

Unique

Unique25 ?
Unique (%)2.7%

Sample

1st row보건소 내 1층
2nd row국제도시보건과 내
3rd row보건지소 내
4th row보건진료소 내
5th row민원지적과 내

Common Values

ValueCountFrequency (%)
약국 내 485
52.1%
약국 내부 179
 
19.2%
약국 68
 
7.3%
센터 내 46
 
4.9%
대기실 26
 
2.8%
행정복지센터 민원실 내 23
 
2.5%
1층 민원실내 13
 
1.4%
행정복지센터 내부(1층 민원실) 11
 
1.2%
1층 민원실 내 11
 
1.2%
1층 입구 10
 
1.1%
Other values (34) 59
 
6.3%

Length

2024-04-06T17:10:09.050114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
약국 732
40.2%
591
32.4%
내부 179
 
9.8%
민원실 54
 
3.0%
센터 48
 
2.6%
1층 47
 
2.6%
행정복지센터 34
 
1.9%
대기실 26
 
1.4%
민원실내 19
 
1.0%
입구 12
 
0.7%
Other values (31) 81
 
4.4%

Interactions

2024-04-06T17:10:04.710885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:10:09.198249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관할세부위치
연번1.0000.9670.877
관할0.9671.0000.968
세부위치0.8770.9681.000
2024-04-06T17:10:09.463469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세부위치관할
세부위치1.0000.779
관할0.7791.000
2024-04-06T17:10:09.645170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관할세부위치
연번1.0000.6870.532
관할0.6871.0000.779
세부위치0.5320.7791.000

Missing values

2024-04-06T17:10:04.924593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:10:05.114803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번관할기관명칭소재지(도로명주소)세부위치
01중구중구보건소인천광역시 중구 참외전로72번길 21(전동, 중구보건소)보건소 내 1층
12중구국제도시보건과인천광역시 중구 운남서로 100-1국제도시보건과 내
23중구용유보건지소인천광역시 중구 마시란로 308-13보건지소 내
34중구무의보건진료소인천광역시 중구 대무의로 310-11(무의동 246-7)보건진료소 내
45중구중구1청사 민원지적과인천광역시 중구 신포로27번길 80민원지적과 내
56중구신흥동 행정복지센터인천광역시 중구 제물량로80번길 3-14센터 내
67중구도원동 행정복지센터인천광역시 중구 도원로 42센터 내
78중구율목동 행정복지센터인천광역시 중구 서해대로483번길 3센터 내
89중구동인천동 행정복지센터인천광역시 중구 참외전로72번길 25센터 내
910중구개항동 행정복지센터인천광역시 중구 차이나타운로44번길 13-2센터 내
연번관할기관명칭소재지(도로명주소)세부위치
921922강화군서검보건진료소인천광역시 강화군 삼산면 서검길153번길 8대기실
922923강화군상용보건진료소인천광역시 강화군 교동면 교동동로 208-3대기실
923924강화군난정보건진료소인천광역시 강화군 교동면 교동서로378번길 56-28대기실
924925강화군삼선보건진료소인천광역시 강화군 교동면 교동북로 220대기실
925926옹진군옹진군보건소인천시 미추홀구 매소홀로 120보건소 민원실
926927강화군보배약국인천광역시 강화군 강화읍 중앙로 17-9<NA>
927928강화군교동약국인천광역시 강화군 교동면 대룡안길 54-62<NA>
928929강화군강화건강약국인천광역시 강화군 강화읍 강화대로312번길 12약국 내
929930강화군서울약국인천광역시 강화군 강화읍 강화대로404번길 4약국 내
930931옹진군옹진군보건소인천시 미추홀구 매소홀로 120보건소 민원실