Overview

Dataset statistics

Number of variables13
Number of observations586
Missing cells842
Missing cells (%)11.1%
Duplicate rows7
Duplicate rows (%)1.2%
Total size in memory61.4 KiB
Average record size in memory107.2 B

Variable types

Text3
Numeric3
DateTime2
Categorical5

Dataset

Description먹는물 공동시설 현황(약수터)(제공표준)
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=22E03L96RU925X8OH4WH21542624&infSeq=1

Alerts

Dataset has 7 (1.2%) duplicate rowsDuplicates
관리기관전화번호 is highly overall correlated with 위도 and 4 other fieldsHigh correlation
부적합항목 is highly overall correlated with 위도 and 5 other fieldsHigh correlation
관리기관명 is highly overall correlated with 위도 and 4 other fieldsHigh correlation
수질검사결과구분 is highly overall correlated with 부적합항목High correlation
데이터기준일자 is highly overall correlated with 위도 and 4 other fieldsHigh correlation
위도 is highly overall correlated with 부적합항목 and 3 other fieldsHigh correlation
경도 is highly overall correlated with 부적합항목 and 3 other fieldsHigh correlation
부적합항목 is highly imbalanced (53.3%)Imbalance
소재지도로명주소 has 406 (69.3%) missing valuesMissing
지정일자 has 435 (74.2%) missing valuesMissing
일평균이용인구수 has 22 (3.8%) zerosZeros

Reproduction

Analysis started2024-03-23 02:00:10.985994
Analysis finished2024-03-23 02:00:18.908594
Duration7.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct309
Distinct (%)52.7%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
2024-03-23T02:00:19.606363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length4.8617747
Min length2

Characters and Unicode

Total characters2849
Distinct characters240
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique285 ?
Unique (%)48.6%

Sample

1st row새미약수터
2nd row설월리약수터
3rd row애기능약수터
4th row양묘장약수터
5th row영회원약수터
ValueCountFrequency (%)
상패7통약수터 20
 
3.2%
만수약수터 20
 
3.2%
창말약수터 20
 
3.2%
시민약수터 20
 
3.2%
옥녀봉약수터 20
 
3.2%
소요산광장약수터 20
 
3.2%
보은약수터 20
 
3.2%
무지개약수터 20
 
3.2%
소요산일주문약수터 20
 
3.2%
보문사약수터 20
 
3.2%
Other values (315) 423
67.9%
2024-03-23T02:00:21.007846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
408
 
14.3%
375
 
13.2%
373
 
13.1%
79
 
2.8%
56
 
2.0%
44
 
1.5%
44
 
1.5%
43
 
1.5%
42
 
1.5%
40
 
1.4%
Other values (230) 1345
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2692
94.5%
Decimal Number 89
 
3.1%
Space Separator 37
 
1.3%
Close Punctuation 13
 
0.5%
Open Punctuation 13
 
0.5%
Other Punctuation 2
 
0.1%
Uppercase Letter 2
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
408
 
15.2%
375
 
13.9%
373
 
13.9%
79
 
2.9%
56
 
2.1%
44
 
1.6%
44
 
1.6%
43
 
1.6%
42
 
1.6%
40
 
1.5%
Other values (214) 1188
44.1%
Decimal Number
ValueCountFrequency (%)
8 40
44.9%
7 20
22.5%
2 11
 
12.4%
1 10
 
11.2%
4 3
 
3.4%
6 2
 
2.2%
5 2
 
2.2%
3 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
· 1
50.0%
. 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
50.0%
L 1
50.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2692
94.5%
Common 154
 
5.4%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
408
 
15.2%
375
 
13.9%
373
 
13.9%
79
 
2.9%
56
 
2.1%
44
 
1.6%
44
 
1.6%
43
 
1.6%
42
 
1.6%
40
 
1.5%
Other values (214) 1188
44.1%
Common
ValueCountFrequency (%)
8 40
26.0%
37
24.0%
7 20
13.0%
) 13
 
8.4%
( 13
 
8.4%
2 11
 
7.1%
1 10
 
6.5%
4 3
 
1.9%
6 2
 
1.3%
5 2
 
1.3%
Other values (3) 3
 
1.9%
Latin
ValueCountFrequency (%)
1
33.3%
G 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2692
94.5%
ASCII 155
 
5.4%
None 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
408
 
15.2%
375
 
13.9%
373
 
13.9%
79
 
2.9%
56
 
2.1%
44
 
1.6%
44
 
1.6%
43
 
1.6%
42
 
1.6%
40
 
1.5%
Other values (214) 1188
44.1%
ASCII
ValueCountFrequency (%)
8 40
25.8%
37
23.9%
7 20
12.9%
) 13
 
8.4%
( 13
 
8.4%
2 11
 
7.1%
1 10
 
6.5%
4 3
 
1.9%
6 2
 
1.3%
5 2
 
1.3%
Other values (4) 4
 
2.6%
None
ValueCountFrequency (%)
· 1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct118
Distinct (%)65.6%
Missing406
Missing (%)69.3%
Memory size4.7 KiB
2024-03-23T02:00:21.766037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length21.55
Min length14

Characters and Unicode

Total characters3879
Distinct characters180
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)62.2%

Sample

1st row경기도 안산시 상록구 동막길 94
2nd row경기도 안산시 상록구 구룡체육관로 21
3rd row경기도 군포시 속달로110번길 25
4th row경기도 의왕시 부곡공원길 6
5th row경기도 안산시 상록구 정재초교길 11
ValueCountFrequency (%)
경기도 180
22.9%
동두천시 60
 
7.6%
평화로2910번길 40
 
5.1%
406-65 40
 
5.1%
상패로65번길 20
 
2.5%
113 20
 
2.5%
남양주시 20
 
2.5%
파주시 11
 
1.4%
양주시 10
 
1.3%
김포시 8
 
1.0%
Other values (281) 377
48.0%
2024-03-23T02:00:22.959789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
606
 
15.6%
189
 
4.9%
184
 
4.7%
183
 
4.7%
173
 
4.5%
1 173
 
4.5%
6 138
 
3.6%
136
 
3.5%
125
 
3.2%
0 121
 
3.1%
Other values (170) 1851
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2211
57.0%
Decimal Number 955
24.6%
Space Separator 606
 
15.6%
Dash Punctuation 89
 
2.3%
Close Punctuation 7
 
0.2%
Open Punctuation 7
 
0.2%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
189
 
8.5%
184
 
8.3%
183
 
8.3%
173
 
7.8%
136
 
6.2%
125
 
5.7%
100
 
4.5%
85
 
3.8%
72
 
3.3%
65
 
2.9%
Other values (155) 899
40.7%
Decimal Number
ValueCountFrequency (%)
1 173
18.1%
6 138
14.5%
0 121
12.7%
2 114
11.9%
4 104
10.9%
5 97
10.2%
9 72
7.5%
3 60
 
6.3%
8 38
 
4.0%
7 38
 
4.0%
Space Separator
ValueCountFrequency (%)
606
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 89
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2211
57.0%
Common 1668
43.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
189
 
8.5%
184
 
8.3%
183
 
8.3%
173
 
7.8%
136
 
6.2%
125
 
5.7%
100
 
4.5%
85
 
3.8%
72
 
3.3%
65
 
2.9%
Other values (155) 899
40.7%
Common
ValueCountFrequency (%)
606
36.3%
1 173
 
10.4%
6 138
 
8.3%
0 121
 
7.3%
2 114
 
6.8%
4 104
 
6.2%
5 97
 
5.8%
- 89
 
5.3%
9 72
 
4.3%
3 60
 
3.6%
Other values (5) 94
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2211
57.0%
ASCII 1668
43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
606
36.3%
1 173
 
10.4%
6 138
 
8.3%
0 121
 
7.3%
2 114
 
6.8%
4 104
 
6.2%
5 97
 
5.8%
- 89
 
5.3%
9 72
 
4.3%
3 60
 
3.6%
Other values (5) 94
 
5.6%
Hangul
ValueCountFrequency (%)
189
 
8.5%
184
 
8.3%
183
 
8.3%
173
 
7.8%
136
 
6.2%
125
 
5.7%
100
 
4.5%
85
 
3.8%
72
 
3.3%
65
 
2.9%
Other values (155) 899
40.7%
Distinct290
Distinct (%)49.5%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
2024-03-23T02:00:23.577874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length18.945392
Min length14

Characters and Unicode

Total characters11102
Distinct characters180
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique261 ?
Unique (%)44.5%

Sample

1st row경기도 광명시 노온사동 산141-1
2nd row경기도 광명시 소하2동 산58
3rd row경기도 광명시 노온사동 산141-11
4th row경기도 광명시 하안1동 산69-4
5th row경기도 광명시 노온사동 산141-11
ValueCountFrequency (%)
경기도 586
23.0%
동두천시 280
 
11.0%
산70번지 100
 
3.9%
생연동 80
 
3.1%
상패동 60
 
2.4%
안흥동 60
 
2.4%
상봉암동 40
 
1.6%
산1번지 40
 
1.6%
35
 
1.4%
양주시 26
 
1.0%
Other values (572) 1236
48.6%
2024-03-23T02:00:24.530046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1957
17.6%
830
 
7.5%
590
 
5.3%
589
 
5.3%
587
 
5.3%
573
 
5.2%
505
 
4.5%
1 398
 
3.6%
392
 
3.5%
384
 
3.5%
Other values (170) 4297
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7372
66.4%
Space Separator 1957
 
17.6%
Decimal Number 1536
 
13.8%
Dash Punctuation 237
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
830
 
11.3%
590
 
8.0%
589
 
8.0%
587
 
8.0%
573
 
7.8%
505
 
6.9%
392
 
5.3%
384
 
5.2%
354
 
4.8%
302
 
4.1%
Other values (158) 2266
30.7%
Decimal Number
ValueCountFrequency (%)
1 398
25.9%
7 247
16.1%
0 158
 
10.3%
3 143
 
9.3%
5 120
 
7.8%
4 111
 
7.2%
2 106
 
6.9%
6 103
 
6.7%
8 83
 
5.4%
9 67
 
4.4%
Space Separator
ValueCountFrequency (%)
1957
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 237
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7372
66.4%
Common 3730
33.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
830
 
11.3%
590
 
8.0%
589
 
8.0%
587
 
8.0%
573
 
7.8%
505
 
6.9%
392
 
5.3%
384
 
5.2%
354
 
4.8%
302
 
4.1%
Other values (158) 2266
30.7%
Common
ValueCountFrequency (%)
1957
52.5%
1 398
 
10.7%
7 247
 
6.6%
- 237
 
6.4%
0 158
 
4.2%
3 143
 
3.8%
5 120
 
3.2%
4 111
 
3.0%
2 106
 
2.8%
6 103
 
2.8%
Other values (2) 150
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7372
66.4%
ASCII 3730
33.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1957
52.5%
1 398
 
10.7%
7 247
 
6.6%
- 237
 
6.4%
0 158
 
4.2%
3 143
 
3.8%
5 120
 
3.2%
4 111
 
3.0%
2 106
 
2.8%
6 103
 
2.8%
Other values (2) 150
 
4.0%
Hangul
ValueCountFrequency (%)
830
 
11.3%
590
 
8.0%
589
 
8.0%
587
 
8.0%
573
 
7.8%
505
 
6.9%
392
 
5.3%
384
 
5.2%
354
 
4.8%
302
 
4.1%
Other values (158) 2266
30.7%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct287
Distinct (%)49.1%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean37.715758
Minimum36.941205
Maximum38.218721
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-03-23T02:00:24.972073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.941205
5-th percentile37.286732
Q137.463513
median37.899604
Q337.918715
95-th percentile37.946073
Maximum38.218721
Range1.2775162
Interquartile range (IQR)0.45520176

Descriptive statistics

Standard deviation0.25487682
Coefficient of variation (CV)0.0067578338
Kurtosis-0.64520048
Mean37.715758
Median Absolute Deviation (MAD)0.046469
Skewness-0.79764801
Sum22063.718
Variance0.064962194
MonotonicityNot monotonic
2024-03-23T02:00:25.525900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.899604 60
 
10.2%
37.921994 40
 
6.8%
37.946073 40
 
6.8%
37.911061 20
 
3.4%
37.913589 20
 
3.4%
37.933087 20
 
3.4%
37.908502 20
 
3.4%
37.906331 20
 
3.4%
37.918715 20
 
3.4%
37.920463 20
 
3.4%
Other values (277) 305
52.0%
ValueCountFrequency (%)
36.94120466 1
0.2%
36.99489689 1
0.2%
37.01197937 1
0.2%
37.01636925 1
0.2%
37.0166174 1
0.2%
37.068127 2
0.3%
37.071135 1
0.2%
37.1127078 1
0.2%
37.1421447 1
0.2%
37.1453332 1
0.2%
ValueCountFrequency (%)
38.21872085 1
 
0.2%
38.17587754 1
 
0.2%
38.15078762 1
 
0.2%
38.093227 1
 
0.2%
38.004191 1
 
0.2%
37.97672111 1
 
0.2%
37.946073 40
6.8%
37.933087 20
3.4%
37.929499 1
 
0.2%
37.928797 1
 
0.2%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct286
Distinct (%)48.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.05537
Minimum126.54161
Maximum127.74825
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-03-23T02:00:25.967785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.54161
5-th percentile126.79417
Q1127.01861
median127.05094
Q3127.07469
95-th percentile127.33744
Maximum127.74825
Range1.206639
Interquartile range (IQR)0.056076325

Descriptive statistics

Standard deviation0.15410379
Coefficient of variation (CV)0.0012128869
Kurtosis3.8395478
Mean127.05537
Median Absolute Deviation (MAD)0.02375425
Skewness0.85868401
Sum74454.448
Variance0.023747978
MonotonicityNot monotonic
2024-03-23T02:00:26.508165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.074691 60
 
10.2%
127.048461 40
 
6.8%
127.070144 40
 
6.8%
127.043281 20
 
3.4%
127.064892 20
 
3.4%
127.043063 20
 
3.4%
127.041801 20
 
3.4%
127.034151 20
 
3.4%
127.068687 20
 
3.4%
127.072736 20
 
3.4%
Other values (276) 306
52.2%
ValueCountFrequency (%)
126.541608 1
0.2%
126.5865494 1
0.2%
126.5912583 1
0.2%
126.6188739 1
0.2%
126.6382017 1
0.2%
126.6866586 1
0.2%
126.697065 1
0.2%
126.7043048 1
0.2%
126.712285 1
0.2%
126.7127094 1
0.2%
ValueCountFrequency (%)
127.748247 1
0.2%
127.700061 1
0.2%
127.6490836 1
0.2%
127.646428 1
0.2%
127.6434122 1
0.2%
127.608277 1
0.2%
127.6016991 1
0.2%
127.597573 1
0.2%
127.5705125 1
0.2%
127.549078 1
0.2%

지정일자
Date

MISSING 

Distinct83
Distinct (%)55.0%
Missing435
Missing (%)74.2%
Memory size4.7 KiB
Minimum1962-07-09 00:00:00
Maximum2015-12-09 00:00:00
2024-03-23T02:00:27.112597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:27.591984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

일평균이용인구수
Real number (ℝ)

ZEROS 

Distinct47
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107.33276
Minimum0
Maximum2750
Zeros22
Zeros (%)3.8%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2024-03-23T02:00:28.027413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile20
Q130
median65
Q3150
95-th percentile295
Maximum2750
Range2750
Interquartile range (IQR)120

Descriptive statistics

Standard deviation149.53129
Coefficient of variation (CV)1.3931561
Kurtosis169.95384
Mean107.33276
Median Absolute Deviation (MAD)35
Skewness10.449041
Sum62897
Variance22359.607
MonotonicityNot monotonic
2024-03-23T02:00:28.441410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
30 80
13.7%
60 75
12.8%
100 49
 
8.4%
20 46
 
7.8%
180 45
 
7.7%
50 44
 
7.5%
80 37
 
6.3%
150 29
 
4.9%
130 24
 
4.1%
200 23
 
3.9%
Other values (37) 134
22.9%
ValueCountFrequency (%)
0 22
 
3.8%
10 4
 
0.7%
15 2
 
0.3%
20 46
7.8%
30 80
13.7%
40 1
 
0.2%
50 44
7.5%
52 1
 
0.2%
55 1
 
0.2%
60 75
12.8%
ValueCountFrequency (%)
2750 1
 
0.2%
1040 1
 
0.2%
700 2
0.3%
680 1
 
0.2%
500 3
0.5%
480 1
 
0.2%
460 1
 
0.2%
450 2
0.3%
375 1
 
0.2%
350 2
0.3%
Distinct63
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
Minimum2017-03-02 00:00:00
Maximum2024-03-04 00:00:00
2024-03-23T02:00:28.838650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:29.511964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수질검사결과구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
적합
389 
부적합
197 

Length

Max length3
Median length2
Mean length2.3361775
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적합
2nd row적합
3rd row적합
4th row부적합
5th row적합

Common Values

ValueCountFrequency (%)
적합 389
66.4%
부적합 197
33.6%

Length

2024-03-23T02:00:29.912185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T02:00:30.359890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
적합 389
66.4%
부적합 197
33.6%

부적합항목
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct31
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
<NA>
374 
미검사(수원고갈)
56 
총대장균군(검출)
 
23
없음
 
20
총대장균군
 
20
Other values (26)
93 

Length

Max length33
Median length4
Mean length5.4146758
Min length1

Unique

Unique10 ?
Unique (%)1.7%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row채수량부족으로 수질검사 불가
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 374
63.8%
미검사(수원고갈) 56
 
9.6%
총대장균군(검출) 23
 
3.9%
없음 20
 
3.4%
총대장균군 20
 
3.4%
시설폐쇄 18
 
3.1%
총대장균균 15
 
2.6%
해당없음 7
 
1.2%
분원성 대장균군(검출) ,총대장균군(검출) 6
 
1.0%
수원고갈 5
 
0.9%
Other values (21) 42
 
7.2%

Length

2024-03-23T02:00:30.740448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 374
59.7%
미검사(수원고갈 56
 
8.9%
총대장균군(검출 32
 
5.1%
총대장균군 30
 
4.8%
없음 20
 
3.2%
시설폐쇄 18
 
2.9%
총대장균균 15
 
2.4%
분원성 10
 
1.6%
대장균군(검출 8
 
1.3%
해당없음 7
 
1.1%
Other values (21) 56
 
8.9%

관리기관전화번호
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
031-860-2471
280 
031-8082-6832
 
26
031-729-4094
 
21
031-590-4618
 
20
031-870-6335
 
19
Other values (35)
220 

Length

Max length13
Median length12
Mean length12.119454
Min length12

Unique

Unique7 ?
Unique (%)1.2%

Sample

1st row02-2680-2962
2nd row02-2680-2962
3rd row02-2680-2962
4th row02-2680-2962
5th row02-2680-2962

Common Values

ValueCountFrequency (%)
031-860-2471 280
47.8%
031-8082-6832 26
 
4.4%
031-729-4094 21
 
3.6%
031-590-4618 20
 
3.4%
031-870-6335 19
 
3.2%
031-8075-4550 17
 
2.9%
031-228-5348 15
 
2.6%
02-2680-2962 14
 
2.4%
031-8045-4512 14
 
2.4%
02-2150-3722 13
 
2.2%
Other values (30) 147
25.1%

Length

2024-03-23T02:00:31.150326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
031-860-2471 280
47.8%
031-8082-6832 26
 
4.4%
031-729-4094 21
 
3.6%
031-590-4618 20
 
3.4%
031-870-6335 19
 
3.2%
031-8075-4550 17
 
2.9%
031-228-5348 15
 
2.6%
02-2680-2962 14
 
2.4%
031-8045-4512 14
 
2.4%
02-2150-3722 13
 
2.2%
Other values (30) 147
25.1%

관리기관명
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
경기도 동두천시청
280 
경기도 양주시청
 
26
경기도 성남시 맑은물관리사업소
 
21
경기도 남양주시청
 
20
경기도 의정부시청
 
19
Other values (31)
220 

Length

Max length27
Median length9
Mean length9.4180887
Min length3

Unique

Unique4 ?
Unique (%)0.7%

Sample

1st row광명시청 공원녹지과(시설관련), 수도과(수질관련)
2nd row광명시청 공원녹지과(시설관련), 수도과(수질관련)
3rd row광명시청 공원녹지과(시설관련), 수도과(수질관련)
4th row광명시청 공원녹지과(시설관련), 수도과(수질관련)
5th row광명시청 공원녹지과(시설관련), 수도과(수질관련)

Common Values

ValueCountFrequency (%)
경기도 동두천시청 280
47.8%
경기도 양주시청 26
 
4.4%
경기도 성남시 맑은물관리사업소 21
 
3.6%
경기도 남양주시청 20
 
3.4%
경기도 의정부시청 19
 
3.2%
고양시 상하도사업소 17
 
2.9%
경기도 수원시 장안구청 15
 
2.6%
경기도 안양시 14
 
2.4%
광명시청 공원녹지과(시설관련), 수도과(수질관련) 14
 
2.4%
맑은물사업소 13
 
2.2%
Other values (26) 147
25.1%

Length

2024-03-23T02:00:31.598302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 469
39.6%
동두천시청 280
23.7%
양주시청 26
 
2.2%
성남시 21
 
1.8%
맑은물관리사업소 21
 
1.8%
남양주시청 20
 
1.7%
의정부시청 19
 
1.6%
고양시 17
 
1.4%
상하도사업소 17
 
1.4%
수원시 17
 
1.4%
Other values (38) 276
23.3%

데이터기준일자
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
2019-06-30
280 
2020-05-30
 
26
2018-06-30
 
21
2020-01-24
 
20
2019-04-01
 
19
Other values (25)
220 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row2022-04-04
2nd row2022-04-04
3rd row2022-04-04
4th row2022-04-04
5th row2022-04-04

Common Values

ValueCountFrequency (%)
2019-06-30 280
47.8%
2020-05-30 26
 
4.4%
2018-06-30 21
 
3.6%
2020-01-24 20
 
3.4%
2019-04-01 19
 
3.2%
2019-02-20 17
 
2.9%
2019-04-17 17
 
2.9%
2022-04-04 14
 
2.4%
2023-06-19 14
 
2.4%
2023-08-08 13
 
2.2%
Other values (20) 145
24.7%

Length

2024-03-23T02:00:31.960537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2019-06-30 280
47.8%
2020-05-30 26
 
4.4%
2018-06-30 21
 
3.6%
2020-01-24 20
 
3.4%
2019-04-01 19
 
3.2%
2019-02-20 17
 
2.9%
2019-04-17 17
 
2.9%
2022-04-04 14
 
2.4%
2023-06-19 14
 
2.4%
2023-08-08 13
 
2.2%
Other values (20) 145
24.7%

Interactions

2024-03-23T02:00:16.189815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:14.290645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:15.339651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:16.574043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:14.589481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:15.625956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:16.851316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:14.962497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T02:00:15.910337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T02:00:32.225551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도지정일자일평균이용인구수수질검사일자수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자
위도1.0000.8120.9650.3420.9700.3340.9450.9730.9750.987
경도0.8121.0000.9900.2140.9560.3100.8800.9750.9590.974
지정일자0.9650.9901.0000.9270.9910.6780.9291.0001.0001.000
일평균이용인구수0.3420.2140.9271.0000.6240.1050.0000.6990.6770.716
수질검사일자0.9700.9560.9910.6241.0000.4890.9680.9980.9990.999
수질검사결과구분0.3340.3100.6780.1050.4891.0000.9990.4910.4830.489
부적합항목0.9450.8800.9290.0000.9680.9991.0000.9670.9620.965
관리기관전화번호0.9730.9751.0000.6990.9980.4910.9671.0001.0001.000
관리기관명0.9750.9591.0000.6770.9990.4830.9621.0001.0001.000
데이터기준일자0.9870.9741.0000.7160.9990.4890.9651.0001.0001.000
2024-03-23T02:00:32.563570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관전화번호부적합항목관리기관명수질검사결과구분데이터기준일자
관리기관전화번호1.0000.6500.9960.3800.991
부적합항목0.6501.0000.6490.9120.681
관리기관명0.9960.6491.0000.3740.995
수질검사결과구분0.3800.9120.3741.0000.381
데이터기준일자0.9910.6810.9950.3811.000
2024-03-23T02:00:32.925213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도일평균이용인구수수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자
위도1.0000.133-0.2590.2550.6180.7970.8010.804
경도0.1331.000-0.0710.2360.5640.7480.7480.740
일평균이용인구수-0.259-0.0711.0000.1280.0000.3730.3820.390
수질검사결과구분0.2550.2360.1281.0000.9120.3800.3740.381
부적합항목0.6180.5640.0000.9121.0000.6500.6490.681
관리기관전화번호0.7970.7480.3730.3800.6501.0000.9960.991
관리기관명0.8010.7480.3820.3740.6490.9961.0000.995
데이터기준일자0.8040.7400.3900.3810.6810.9910.9951.000

Missing values

2024-03-23T02:00:17.272966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T02:00:18.117566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-23T02:00:18.652366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

약수터명소재지도로명주소소재지지번주소위도경도지정일자일평균이용인구수수질검사일자수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자
0새미약수터<NA>경기도 광명시 노온사동 산141-137.442326126.8623992008-01-21752022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
1설월리약수터<NA>경기도 광명시 소하2동 산5837.414352126.8716572011-03-04202022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
2애기능약수터<NA>경기도 광명시 노온사동 산141-1137.439552126.8629012008-01-21752022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
3양묘장약수터<NA>경기도 광명시 하안1동 산69-437.460881126.8578772011-03-04152022-03-04부적합채수량부족으로 수질검사 불가02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
4영회원약수터<NA>경기도 광명시 노온사동 산141-1137.439552126.8629012011-03-04102022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
5절골약수터<NA>경기도 광명시 소하2동 산137-937.426385126.8762122008-01-213752022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
6진달래약수터<NA>경기도 광명시 노온사동 산141-1137.439552126.8629012008-01-211502022-03-04부적합채수량부족으로 수질검사 불가02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
7참샘물1호약수터<NA>경기도 광명시 소하2동 359-137.440106126.8749572008-01-212252022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
8참샘물2호약수터<NA>경기도 광명시 소하2동 359-137.440106126.8749572008-01-212252022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
9천연약수터<NA>경기도 광명시 노온사동 산141-1137.439552126.8629012008-01-21752022-03-04적합<NA>02-2680-2962광명시청 공원녹지과(시설관련), 수도과(수질관련)2022-04-04
약수터명소재지도로명주소소재지지번주소위도경도지정일자일평균이용인구수수질검사일자수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자
576간촌2<NA>경기도 안양시 동안구 관양동 177637.417139126.964<NA>1002021-12-14부적합<NA>031-8045-4512경기도 안양시2023-06-19
577충의<NA>경기도 안양시 동안구 비산동 산 13937.406551126.938897<NA>3502021-12-14적합<NA>031-8045-4512경기도 안양시2023-06-19
578갈산<NA>경기도 안양시 동안구 호계동 111237.379981126.964023<NA>3002021-12-14적합<NA>031-8045-4512경기도 안양시2023-06-19
579청용경기도 시흥시 소래산길 41경기도 시흥시 대야동 305-9번지37.45177126.785576<NA>802020-08-06부적합총대장균군031-310-2342경기도 시흥시청2020-12-29
580황고개<NA>경기도 시흥시 장곡동 산15-537.371328126.790719<NA>302020-08-06부적합일반세균, 총대장균군, 탁도031-310-6135경기도 시흥시청2020-12-29
581목감경기도 시흥시 동서로 989-67경기도 시흥시 조남동 산19-1번지37.384639126.851159<NA>302020-08-06부적합일반세균 총대장균군031-310-6135경기도 시흥시청2020-12-29
582생금우물경기도 시흥시 서해안로 277경기도 시흥시 정왕동 2138번지37.355475126.712285<NA>1002020-08-06부적합총대장균군031-310-3863경기도 시흥시청2020-12-29
583능곡경기도 시흥시 능골길 26경기도 시흥시 능곡동 617번지37.369419126.817988<NA>502020-08-06적합<NA>031-310-3865경기도 시흥시청2020-12-29
584비둘기공원<NA>경기도 시흥시 은행동 55137.441405126.794047<NA>1002020-08-06적합<NA>031-310-3865경기도 시흥시청2020-12-29
585산신<NA>경기도 시흥시 하중동 산2-337.399135126.807434<NA>502020-08-21부적합일반세균, 총대장균군, 분원성대장균군/대장균, 탁도031-310-2243경기도 시흥시청2020-12-29

Duplicate rows

Most frequently occurring

약수터명소재지도로명주소소재지지번주소위도경도지정일자일평균이용인구수수질검사일자수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자# duplicates
0간촌1<NA>경기도 안양시 동안구 관양동 177637.417139126.964<NA>3002021-12-14적합<NA>031-8045-4512경기도 안양시2023-06-192
1간촌2<NA>경기도 안양시 동안구 관양동 177637.417139126.964<NA>1002021-12-14부적합<NA>031-8045-4512경기도 안양시2023-06-192
2갈산<NA>경기도 안양시 동안구 호계동 111237.379981126.964023<NA>3002021-12-14적합<NA>031-8045-4512경기도 안양시2023-06-192
3관상<NA>경기도 과천시 갈현동 산 106-237.422689126.962642<NA>2002021-12-14부적합<NA>031-8045-4512경기도 안양시2023-06-192
4매천경기도 안양시 동안구 평촌대로476번길 30-2경기도 안양시 동안구 비산동 산22번지37.414736126.955627<NA>1002021-12-14부적합<NA>031-8045-4512경기도 안양시2023-06-192
5청심<NA>경기도 안양시 동안구 관양동 176237.412578126.958727<NA>7002021-12-14적합<NA>031-8045-4512경기도 안양시2023-06-192
6충의<NA>경기도 안양시 동안구 비산동 산 13937.406551126.938897<NA>3502021-12-14적합<NA>031-8045-4512경기도 안양시2023-06-192