Overview

Dataset statistics

Number of variables13
Number of observations453
Missing cells736
Missing cells (%)12.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory48.3 KiB
Average record size in memory109.3 B

Variable types

Numeric4
Text1
Categorical8

Dataset

Description부산광역시_먹는물공동시설(약수터)수질검사결과정보_09/30/2021
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15083355

Alerts

미검사사유 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
총대장균군 is highly overall correlated with 검사여부 and 4 other fieldsHigh correlation
분원성대장균군 is highly overall correlated with 검사여부 and 4 other fieldsHigh correlation
검사여부 is highly overall correlated with 일반세균 중온 and 8 other fieldsHigh correlation
암모니아성 질소 is highly overall correlated with 번호 and 8 other fieldsHigh correlation
검사일자 is highly overall correlated with 번호 and 5 other fieldsHigh correlation
검사결과 is highly overall correlated with 검사여부 and 4 other fieldsHigh correlation
번호 is highly overall correlated with 미검사사유 and 2 other fieldsHigh correlation
일반세균 중온 is highly overall correlated with 검사여부 and 1 other fieldsHigh correlation
질산성 질소 is highly overall correlated with 검사여부 and 1 other fieldsHigh correlation
과망간산칼륨소비량 is highly overall correlated with 검사여부 and 1 other fieldsHigh correlation
암모니아성 질소(검출) is highly imbalanced (97.7%)Imbalance
일반세균 중온 has 237 (52.3%) missing valuesMissing
질산성 질소 has 243 (53.6%) missing valuesMissing
과망간산칼륨소비량 has 256 (56.5%) missing valuesMissing
번호 has unique valuesUnique
일반세균 중온 has 67 (14.8%) zerosZeros

Reproduction

Analysis started2024-03-13 13:15:09.842419
Analysis finished2024-03-13 13:15:13.070944
Duration3.23 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct453
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean227
Minimum1
Maximum453
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2024-03-13T22:15:13.193462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile23.6
Q1114
median227
Q3340
95-th percentile430.4
Maximum453
Range452
Interquartile range (IQR)226

Descriptive statistics

Standard deviation130.91409
Coefficient of variation (CV)0.57671407
Kurtosis-1.2
Mean227
Median Absolute Deviation (MAD)113
Skewness0
Sum102831
Variance17138.5
MonotonicityStrictly increasing
2024-03-13T22:15:13.376560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
285 1
 
0.2%
311 1
 
0.2%
310 1
 
0.2%
309 1
 
0.2%
308 1
 
0.2%
307 1
 
0.2%
306 1
 
0.2%
305 1
 
0.2%
304 1
 
0.2%
Other values (443) 443
97.8%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
453 1
0.2%
452 1
0.2%
451 1
0.2%
450 1
0.2%
449 1
0.2%
448 1
0.2%
447 1
0.2%
446 1
0.2%
445 1
0.2%
444 1
0.2%
Distinct151
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2024-03-13T22:15:13.673381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length7.1721854
Min length5

Characters and Unicode

Total characters3249
Distinct characters163
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서구 서대신동 꽃마을
2nd row서구 중앙공원민속관
3rd row서구 중앙공원석탑
4th row서구 중앙공원구이
5th row서구 중앙공원옥천
ValueCountFrequency (%)
부산진구 72
 
7.9%
사상구 54
 
5.9%
금정구 45
 
4.9%
사하구 45
 
4.9%
북구 39
 
4.3%
남구 36
 
3.9%
영도구 30
 
3.3%
해운대구 30
 
3.3%
동래구 30
 
3.3%
서구 21
 
2.3%
Other values (151) 510
55.9%
2024-03-13T22:15:14.229594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
459
 
14.1%
453
 
13.9%
135
 
4.2%
132
 
4.1%
108
 
3.3%
78
 
2.4%
78
 
2.4%
72
 
2.2%
69
 
2.1%
66
 
2.0%
Other values (153) 1599
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2727
83.9%
Space Separator 459
 
14.1%
Open Punctuation 24
 
0.7%
Close Punctuation 24
 
0.7%
Decimal Number 15
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
453
 
16.6%
135
 
5.0%
132
 
4.8%
108
 
4.0%
78
 
2.9%
78
 
2.9%
72
 
2.6%
69
 
2.5%
66
 
2.4%
57
 
2.1%
Other values (147) 1479
54.2%
Decimal Number
ValueCountFrequency (%)
1 6
40.0%
2 6
40.0%
4 3
20.0%
Space Separator
ValueCountFrequency (%)
459
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2727
83.9%
Common 522
 
16.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
453
 
16.6%
135
 
5.0%
132
 
4.8%
108
 
4.0%
78
 
2.9%
78
 
2.9%
72
 
2.6%
69
 
2.5%
66
 
2.4%
57
 
2.1%
Other values (147) 1479
54.2%
Common
ValueCountFrequency (%)
459
87.9%
( 24
 
4.6%
) 24
 
4.6%
1 6
 
1.1%
2 6
 
1.1%
4 3
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2727
83.9%
ASCII 522
 
16.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
459
87.9%
( 24
 
4.6%
) 24
 
4.6%
1 6
 
1.1%
2 6
 
1.1%
4 3
 
0.6%
Hangul
ValueCountFrequency (%)
453
 
16.6%
135
 
5.0%
132
 
4.8%
108
 
4.0%
78
 
2.9%
78
 
2.9%
72
 
2.6%
69
 
2.5%
66
 
2.4%
57
 
2.1%
Other values (147) 1479
54.2%

검사여부
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
미검사
237 
검사
216 

Length

Max length3
Median length3
Mean length2.5231788
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row검사
2nd row검사
3rd row검사
4th row검사
5th row검사

Common Values

ValueCountFrequency (%)
미검사 237
52.3%
검사 216
47.7%

Length

2024-03-13T22:15:14.356952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:14.451016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미검사 237
52.3%
검사 216
47.7%

미검사사유
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
216 
관리등급 안심/양호
171 
관리등급 양호/안심
52 
수원고갈
 
10
안심등급
 
3

Length

Max length14
Median length4
Mean length6.9757174
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 216
47.7%
관리등급 안심/양호 171
37.7%
관리등급 양호/안심 52
 
11.5%
수원고갈 10
 
2.2%
안심등급 3
 
0.7%
시설보수 공사로 채수 불가 1
 
0.2%

Length

2024-03-13T22:15:14.545701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:14.677058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
관리등급 223
32.8%
na 216
31.8%
안심/양호 171
25.2%
양호/안심 52
 
7.7%
수원고갈 10
 
1.5%
안심등급 3
 
0.4%
시설보수 1
 
0.1%
공사로 1
 
0.1%
채수 1
 
0.1%
불가 1
 
0.1%

검사일자
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
237 
2021-09-09
25 
2021-07-14
 
18
2021-09-28
 
18
2021-08-10
 
15
Other values (23)
140 

Length

Max length10
Median length4
Mean length6.8609272
Min length4

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row2021-07-29
2nd row2021-07-29
3rd row2021-07-29
4th row2021-07-29
5th row2021-07-29

Common Values

ValueCountFrequency (%)
<NA> 237
52.3%
2021-09-09 25
 
5.5%
2021-07-14 18
 
4.0%
2021-09-28 18
 
4.0%
2021-08-10 15
 
3.3%
2021-07-22 15
 
3.3%
2021-09-06 15
 
3.3%
2021-09-13 12
 
2.6%
2021-09-23 10
 
2.2%
2021-09-08 10
 
2.2%
Other values (18) 78
 
17.2%

Length

2024-03-13T22:15:15.191966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 237
52.3%
2021-09-09 25
 
5.5%
2021-07-14 18
 
4.0%
2021-09-28 18
 
4.0%
2021-08-10 15
 
3.3%
2021-07-22 15
 
3.3%
2021-09-06 15
 
3.3%
2021-09-13 12
 
2.6%
2021-09-23 10
 
2.2%
2021-09-08 10
 
2.2%
Other values (18) 78
 
17.2%

검사결과
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
237 
적합
121 
부적합
95 

Length

Max length4
Median length4
Mean length3.2560706
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적합
2nd row적합
3rd row적합
4th row적합
5th row적합

Common Values

ValueCountFrequency (%)
<NA> 237
52.3%
적합 121
26.7%
부적합 95
21.0%

Length

2024-03-13T22:15:15.340437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:15.482257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 237
52.3%
적합 121
26.7%
부적합 95
21.0%

일반세균 중온
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct61
Distinct (%)28.2%
Missing237
Missing (%)52.3%
Infinite0
Infinite (%)0.0%
Mean29.685185
Minimum0
Maximum1500
Zeros67
Zeros (%)14.8%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2024-03-13T22:15:15.614905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q322
95-th percentile109.25
Maximum1500
Range1500
Interquartile range (IQR)22

Descriptive statistics

Standard deviation114.21568
Coefficient of variation (CV)3.847565
Kurtosis130.69171
Mean29.685185
Median Absolute Deviation (MAD)4
Skewness10.608721
Sum6412
Variance13045.221
MonotonicityNot monotonic
2024-03-13T22:15:15.806360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 67
 
14.8%
1 16
 
3.5%
2 11
 
2.4%
3 10
 
2.2%
4 7
 
1.5%
5 6
 
1.3%
25 5
 
1.1%
7 5
 
1.1%
11 4
 
0.9%
50 4
 
0.9%
Other values (51) 81
 
17.9%
(Missing) 237
52.3%
ValueCountFrequency (%)
0 67
14.8%
1 16
 
3.5%
2 11
 
2.4%
3 10
 
2.2%
4 7
 
1.5%
5 6
 
1.3%
6 2
 
0.4%
7 5
 
1.1%
8 2
 
0.4%
9 3
 
0.7%
ValueCountFrequency (%)
1500 1
 
0.2%
544 1
 
0.2%
364 1
 
0.2%
200 1
 
0.2%
185 1
 
0.2%
169 1
 
0.2%
149 1
 
0.2%
134 1
 
0.2%
110 3
0.7%
109 1
 
0.2%

총대장균군
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
237 
불검출
121 
검출
95 

Length

Max length4
Median length4
Mean length3.3134658
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row불검출

Common Values

ValueCountFrequency (%)
<NA> 237
52.3%
불검출 121
26.7%
검출 95
21.0%

Length

2024-03-13T22:15:15.971263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:16.073867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 237
52.3%
불검출 121
26.7%
검출 95
21.0%

분원성대장균군
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
237 
불검출
169 
검출
46 
음성
 
1

Length

Max length4
Median length4
Mean length3.419426
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row불검출

Common Values

ValueCountFrequency (%)
<NA> 237
52.3%
불검출 169
37.3%
검출 46
 
10.2%
음성 1
 
0.2%

Length

2024-03-13T22:15:16.192203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:16.305217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 237
52.3%
불검출 169
37.3%
검출 46
 
10.2%
음성 1
 
0.2%

암모니아성 질소
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
242 
불검출
211 

Length

Max length4
Median length4
Mean length3.5342163
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row불검출

Common Values

ValueCountFrequency (%)
<NA> 242
53.4%
불검출 211
46.6%

Length

2024-03-13T22:15:16.427647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:16.544549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 242
53.4%
불검출 211
46.6%

암모니아성 질소(검출)
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
<NA>
452 
0.11
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 452
99.8%
0.11 1
 
0.2%

Length

2024-03-13T22:15:16.662506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:15:16.790141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 452
99.8%
0.11 1
 
0.2%

질산성 질소
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct55
Distinct (%)26.2%
Missing243
Missing (%)53.6%
Infinite0
Infinite (%)0.0%
Mean2.267619
Minimum0.4
Maximum8.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2024-03-13T22:15:16.987346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.4
5-th percentile0.7
Q11.2
median1.9
Q32.8
95-th percentile5.21
Maximum8.6
Range8.2
Interquartile range (IQR)1.6

Descriptive statistics

Standard deviation1.4851414
Coefficient of variation (CV)0.65493427
Kurtosis3.4746826
Mean2.267619
Median Absolute Deviation (MAD)0.7
Skewness1.7198732
Sum476.2
Variance2.205645
MonotonicityNot monotonic
2024-03-13T22:15:17.201485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.7 14
 
3.1%
0.8 12
 
2.6%
1.0 11
 
2.4%
1.2 11
 
2.4%
1.1 9
 
2.0%
1.4 9
 
2.0%
2.3 9
 
2.0%
1.9 9
 
2.0%
2.4 9
 
2.0%
2.0 8
 
1.8%
Other values (45) 109
24.1%
(Missing) 243
53.6%
ValueCountFrequency (%)
0.4 2
 
0.4%
0.5 1
 
0.2%
0.6 3
 
0.7%
0.7 6
1.3%
0.8 12
2.6%
0.9 2
 
0.4%
1.0 11
2.4%
1.1 9
2.0%
1.2 11
2.4%
1.3 5
1.1%
ValueCountFrequency (%)
8.6 1
0.2%
7.8 1
0.2%
7.6 1
0.2%
7.3 1
0.2%
6.9 1
0.2%
6.8 1
0.2%
6.6 1
0.2%
6.1 1
0.2%
5.9 1
0.2%
5.4 1
0.2%

과망간산칼륨소비량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct6
Distinct (%)3.0%
Missing256
Missing (%)56.5%
Infinite0
Infinite (%)0.0%
Mean0.38883249
Minimum0
Maximum2.5
Zeros1
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2024-03-13T22:15:17.348357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.3
Q10.3
median0.4
Q30.4
95-th percentile0.52
Maximum2.5
Range2.5
Interquartile range (IQR)0.1

Descriptive statistics

Standard deviation0.17431252
Coefficient of variation (CV)0.44829721
Kurtosis110.42731
Mean0.38883249
Median Absolute Deviation (MAD)0.1
Skewness9.1593009
Sum76.6
Variance0.030384854
MonotonicityNot monotonic
2024-03-13T22:15:17.477371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0.3 81
 
17.9%
0.4 81
 
17.9%
0.5 24
 
5.3%
0.6 9
 
2.0%
2.5 1
 
0.2%
0.0 1
 
0.2%
(Missing) 256
56.5%
ValueCountFrequency (%)
0.0 1
 
0.2%
0.3 81
17.9%
0.4 81
17.9%
0.5 24
 
5.3%
0.6 9
 
2.0%
2.5 1
 
0.2%
ValueCountFrequency (%)
2.5 1
 
0.2%
0.6 9
 
2.0%
0.5 24
 
5.3%
0.4 81
17.9%
0.3 81
17.9%
0.0 1
 
0.2%

Interactions

2024-03-13T22:15:11.898206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:10.728242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.092094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.516703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.996706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:10.813437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.191131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.607419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:12.096706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:10.918923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.305240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.717741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:12.185446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.000161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.409164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:15:11.803899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T22:15:17.589613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호검사여부미검사사유검사일자검사결과일반세균 중온총대장균군분원성대장균군질산성 질소과망간산칼륨소비량
번호1.0000.6000.8440.9630.3640.0000.3640.4110.2580.154
검사여부0.6001.000NaNNaNNaNNaNNaNNaNNaNNaN
미검사사유0.844NaN1.000NaNNaNNaNNaNNaNNaNNaN
검사일자0.963NaNNaN1.0000.6900.1560.6900.7930.0490.245
검사결과0.364NaNNaN0.6901.0000.1011.0000.3720.3310.149
일반세균 중온0.000NaNNaN0.1560.1011.0000.1010.2110.0000.137
총대장균군0.364NaNNaN0.6901.0000.1011.0000.3720.3310.149
분원성대장균군0.411NaNNaN0.7930.3720.2110.3721.0000.0000.000
질산성 질소0.258NaNNaN0.0490.3310.0000.3310.0001.0000.000
과망간산칼륨소비량0.154NaNNaN0.2450.1490.1370.1490.0000.0001.000
2024-03-13T22:15:17.747399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
미검사사유총대장균군분원성대장균군검사여부암모니아성 질소(검출)암모니아성 질소검사일자검사결과
미검사사유1.000NaNNaN1.000NaNNaNNaNNaN
총대장균군NaN1.0000.5891.000NaN1.0000.5680.991
분원성대장균군NaN0.5891.0001.000NaN1.0000.5040.589
검사여부1.0001.0001.0001.000NaN1.0001.0001.000
암모니아성 질소(검출)NaNNaNNaNNaN1.000NaNNaNNaN
암모니아성 질소NaN1.0001.0001.000NaN1.0001.0001.000
검사일자NaN0.5680.5041.000NaN1.0001.0000.568
검사결과NaN0.9910.5891.000NaN1.0000.5681.000
2024-03-13T22:15:17.901753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호일반세균 중온질산성 질소과망간산칼륨소비량검사여부미검사사유검사일자검사결과총대장균군분원성대장균군암모니아성 질소암모니아성 질소(검출)
번호1.000-0.024-0.1640.1840.4600.5020.7650.2740.2740.2661.000NaN
일반세균 중온-0.0241.000-0.1160.0941.0000.0000.0670.1230.1230.1611.000NaN
질산성 질소-0.164-0.1161.0000.0331.0000.0000.0000.2600.2600.0001.000NaN
과망간산칼륨소비량0.1840.0940.0331.0001.0000.0000.1390.1460.1460.0001.000NaN
검사여부0.4601.0001.0001.0001.0001.0001.0001.0001.0001.0001.000NaN
미검사사유0.5020.0000.0000.0001.0001.0000.0000.0000.0000.0000.0000.000
검사일자0.7650.0670.0000.1391.0000.0001.0000.5680.5680.5041.000NaN
검사결과0.2740.1230.2600.1461.0000.0000.5681.0000.9910.5891.000NaN
총대장균군0.2740.1230.2600.1461.0000.0000.5680.9911.0000.5891.000NaN
분원성대장균군0.2660.1610.0000.0001.0000.0000.5040.5890.5891.0001.000NaN
암모니아성 질소1.0001.0001.0001.0001.0000.0001.0001.0001.0001.0001.0000.000
암모니아성 질소(검출)NaNNaNNaNNaNNaN0.000NaNNaNNaNNaN0.0001.000

Missing values

2024-03-13T22:15:12.338841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T22:15:12.676713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-13T22:15:12.906250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호공동시설명검사여부미검사사유검사일자검사결과일반세균 중온총대장균군분원성대장균군암모니아성 질소암모니아성 질소(검출)질산성 질소과망간산칼륨소비량
01서구 서대신동 꽃마을검사<NA>2021-07-29적합4불검출불검출불검출<NA>2.80.3
12서구 중앙공원민속관검사<NA>2021-07-29적합0불검출불검출불검출<NA>1.10.4
23서구 중앙공원석탑검사<NA>2021-07-29적합0불검출불검출불검출<NA>1.10.4
34서구 중앙공원구이검사<NA>2021-07-29적합0불검출불검출불검출<NA>1.10.4
45서구 중앙공원옥천검사<NA>2021-07-29적합0불검출불검출불검출<NA>1.40.5
56서구 중앙공원팔각정검사<NA>2021-07-29적합0불검출불검출불검출<NA>1.70.3
67서구 남부민동 천마정검사<NA>2021-07-29적합0불검출불검출불검출<NA>4.50.4
78동구 만리산미검사관리등급 안심/양호<NA><NA><NA><NA><NA><NA><NA><NA><NA>
89동구 청조미검사관리등급 안심/양호<NA><NA><NA><NA><NA><NA><NA><NA><NA>
910동구 수정샘미검사관리등급 안심/양호<NA><NA><NA><NA><NA><NA><NA><NA><NA>
번호공동시설명검사여부미검사사유검사일자검사결과일반세균 중온총대장균군분원성대장균군암모니아성 질소암모니아성 질소(검출)질산성 질소과망간산칼륨소비량
443444사상구 불심검사<NA>2021-09-09부적합48검출검출불검출<NA>1.80.5
444445사상구 승학(엄궁)검사<NA>2021-09-09부적합18검출검출불검출<NA>2.90.4
445446사상구 거북검사<NA>2021-09-09부적합19검출불검출불검출<NA>0.70.3
446447사상구 삼운정검사<NA>2021-09-09적합0불검출불검출불검출<NA>4.70.4
447448사상구 운수사검사<NA>2021-09-09부적합35검출불검출불검출<NA>1.10.4
448449사상구 백수검사<NA>2021-09-09부적합25검출검출불검출<NA>5.30.4
449450사상구 청수검사<NA>2021-09-09부적합25검출검출불검출<NA>2.30.6
450451사상구 승학(학장)검사<NA>2021-09-09부적합32검출불검출불검출<NA>0.60.4
451452기장군 용소검사<NA>2021-09-15부적합0검출불검출불검출<NA>0.80.3
452453기장군 불광산검사<NA>2021-09-15적합0불검출불검출<NA><NA><NA>0.3