Overview

Dataset statistics

Number of variables12
Number of observations142
Missing cells147
Missing cells (%)8.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.9 KiB
Average record size in memory99.9 B

Variable types

Numeric3
Text2
Categorical7

Dataset

Description부산광역시_먹는물공동시설(약수터)수질검사결과정보_20221231
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15083355

Alerts

총대장균군 is highly overall correlated with 검사여부 and 4 other fieldsHigh correlation
암모니아성 질소 is highly overall correlated with 번호 and 8 other fieldsHigh correlation
분원성대장균군 is highly overall correlated with 일반세균 중온 and 4 other fieldsHigh correlation
검사일자 is highly overall correlated with 번호 and 4 other fieldsHigh correlation
과망간산칼륨소비량 is highly overall correlated with 검사여부 and 1 other fieldsHigh correlation
검사여부 is highly overall correlated with 일반세균 중온 and 6 other fieldsHigh correlation
검사결과 is highly overall correlated with 검사일자 and 3 other fieldsHigh correlation
번호 is highly overall correlated with 검사일자 and 1 other fieldsHigh correlation
일반세균 중온 is highly overall correlated with 검사여부 and 2 other fieldsHigh correlation
질산성 질소 is highly overall correlated with 검사여부 and 1 other fieldsHigh correlation
검사여부 is highly imbalanced (82.3%)Imbalance
검사결과 is highly imbalanced (52.9%)Imbalance
총대장균군 is highly imbalanced (50.8%)Imbalance
분원성대장균군 is highly imbalanced (68.4%)Imbalance
암모니아성 질소 is highly imbalanced (78.0%)Imbalance
미검사사유 has 137 (96.5%) missing valuesMissing
일반세균 중온 has 5 (3.5%) missing valuesMissing
질산성 질소 has 5 (3.5%) missing valuesMissing
번호 has unique valuesUnique
공동시설명 has unique valuesUnique
일반세균 중온 has 91 (64.1%) zerosZeros

Reproduction

Analysis started2024-04-17 12:08:21.737481
Analysis finished2024-04-17 12:08:23.166799
Duration1.43 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct142
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71.5
Minimum1
Maximum142
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-04-17T21:08:23.222947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.05
Q136.25
median71.5
Q3106.75
95-th percentile134.95
Maximum142
Range141
Interquartile range (IQR)70.5

Descriptive statistics

Standard deviation41.135953
Coefficient of variation (CV)0.57532802
Kurtosis-1.2
Mean71.5
Median Absolute Deviation (MAD)35.5
Skewness0
Sum10153
Variance1692.1667
MonotonicityStrictly increasing
2024-04-17T21:08:23.345241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
99 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
96 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
100 1
 
0.7%
91 1
 
0.7%
Other values (132) 132
93.0%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%
138 1
0.7%
137 1
0.7%
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%

공동시설명
Text

UNIQUE 

Distinct142
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-04-17T21:08:23.606823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length7.056338
Min length5

Characters and Unicode

Total characters1002
Distinct characters155
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)100.0%

Sample

1st row서구 서대신동 꽃마을
2nd row서구 중앙공원민속관
3rd row서구 중앙공원석탑
4th row서구 중앙공원구이
5th row서구 중앙공원옥천
ValueCountFrequency (%)
사상구 19
 
6.6%
금정구 15
 
5.2%
사하구 15
 
5.2%
부산진구 13
 
4.5%
북구 13
 
4.5%
남구 12
 
4.2%
해운대구 10
 
3.5%
영도구 10
 
3.5%
동래구 10
 
3.5%
동구 8
 
2.8%
Other values (143) 161
56.3%
2024-04-17T21:08:23.960448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
144
 
14.4%
142
 
14.2%
46
 
4.6%
34
 
3.4%
33
 
3.3%
24
 
2.4%
23
 
2.3%
22
 
2.2%
21
 
2.1%
18
 
1.8%
Other values (145) 495
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 841
83.9%
Space Separator 144
 
14.4%
Close Punctuation 6
 
0.6%
Open Punctuation 6
 
0.6%
Decimal Number 5
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
16.9%
46
 
5.5%
34
 
4.0%
33
 
3.9%
24
 
2.9%
23
 
2.7%
22
 
2.6%
21
 
2.5%
18
 
2.1%
17
 
2.0%
Other values (139) 461
54.8%
Decimal Number
ValueCountFrequency (%)
1 2
40.0%
2 2
40.0%
4 1
20.0%
Space Separator
ValueCountFrequency (%)
144
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 841
83.9%
Common 161
 
16.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
16.9%
46
 
5.5%
34
 
4.0%
33
 
3.9%
24
 
2.9%
23
 
2.7%
22
 
2.6%
21
 
2.5%
18
 
2.1%
17
 
2.0%
Other values (139) 461
54.8%
Common
ValueCountFrequency (%)
144
89.4%
) 6
 
3.7%
( 6
 
3.7%
1 2
 
1.2%
2 2
 
1.2%
4 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 841
83.9%
ASCII 161
 
16.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
144
89.4%
) 6
 
3.7%
( 6
 
3.7%
1 2
 
1.2%
2 2
 
1.2%
4 1
 
0.6%
Hangul
ValueCountFrequency (%)
142
 
16.9%
46
 
5.5%
34
 
4.0%
33
 
3.9%
24
 
2.9%
23
 
2.7%
22
 
2.6%
21
 
2.5%
18
 
2.1%
17
 
2.0%
Other values (139) 461
54.8%

검사여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
검사
136 
미검사
 
5
<NA>
 
1

Length

Max length4
Median length2
Mean length2.0492958
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row검사
2nd row검사
3rd row검사
4th row검사
5th row검사

Common Values

ValueCountFrequency (%)
검사 136
95.8%
미검사 5
 
3.5%
<NA> 1
 
0.7%

Length

2024-04-17T21:08:24.077373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:08:24.166272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
검사 136
95.8%
미검사 5
 
3.5%
na 1
 
0.7%

미검사사유
Text

MISSING 

Distinct4
Distinct (%)80.0%
Missing137
Missing (%)96.5%
Memory size1.2 KiB
2024-04-17T21:08:24.264154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.6
Min length4

Characters and Unicode

Total characters23
Distinct characters10
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)60.0%

Sample

1st row수원고갈
2nd row수원고갈
3rd row유량부족
4th row수량 부족
5th row채수량 부족
ValueCountFrequency (%)
수원고갈 2
28.6%
부족 2
28.6%
유량부족 1
14.3%
수량 1
14.3%
채수량 1
14.3%
2024-04-17T21:08:24.489705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
17.4%
3
13.0%
3
13.0%
3
13.0%
2
8.7%
2
8.7%
2
8.7%
2
8.7%
1
 
4.3%
1
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21
91.3%
Space Separator 2
 
8.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
19.0%
3
14.3%
3
14.3%
3
14.3%
2
9.5%
2
9.5%
2
9.5%
1
 
4.8%
1
 
4.8%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21
91.3%
Common 2
 
8.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
19.0%
3
14.3%
3
14.3%
3
14.3%
2
9.5%
2
9.5%
2
9.5%
1
 
4.8%
1
 
4.8%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21
91.3%
ASCII 2
 
8.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
19.0%
3
14.3%
3
14.3%
3
14.3%
2
9.5%
2
9.5%
2
9.5%
1
 
4.8%
1
 
4.8%
ASCII
ValueCountFrequency (%)
2
100.0%

검사일자
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)16.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2022-10-19
16 
2022-10-26
14 
2022-11-17
10 
2022-11-07
10 
2022-12-05
Other values (19)
83 

Length

Max length10
Median length10
Mean length9.7887324
Min length4

Unique

Unique4 ?
Unique (%)2.8%

Sample

1st row2022-12-05
2nd row2022-12-05
3rd row2022-12-05
4th row2022-12-05
5th row2022-12-05

Common Values

ValueCountFrequency (%)
2022-10-19 16
 
11.3%
2022-10-26 14
 
9.9%
2022-11-17 10
 
7.0%
2022-11-07 10
 
7.0%
2022-12-05 9
 
6.3%
2022-11-22 9
 
6.3%
2022-11-14 9
 
6.3%
2022-11-28 8
 
5.6%
2022-12-16 8
 
5.6%
2022-11-10 7
 
4.9%
Other values (14) 42
29.6%

Length

2024-04-17T21:08:24.603761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-10-19 16
 
11.3%
2022-10-26 14
 
9.9%
2022-11-17 10
 
7.0%
2022-11-07 10
 
7.0%
2022-12-05 9
 
6.3%
2022-11-22 9
 
6.3%
2022-11-14 9
 
6.3%
2022-11-28 8
 
5.6%
2022-12-16 8
 
5.6%
2022-11-10 7
 
4.9%
Other values (14) 42
29.6%

검사결과
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
적합
119 
부적합
19 
<NA>
 
4

Length

Max length4
Median length2
Mean length2.1901408
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적합
2nd row적합
3rd row적합
4th row적합
5th row적합

Common Values

ValueCountFrequency (%)
적합 119
83.8%
부적합 19
 
13.4%
<NA> 4
 
2.8%

Length

2024-04-17T21:08:24.712928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:08:24.800919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
적합 119
83.8%
부적합 19
 
13.4%
na 4
 
2.8%

일반세균 중온
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct26
Distinct (%)19.0%
Missing5
Missing (%)3.5%
Infinite0
Infinite (%)0.0%
Mean18.218978
Minimum0
Maximum900
Zeros91
Zeros (%)64.1%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-04-17T21:08:24.879258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile48.2
Maximum900
Range900
Interquartile range (IQR)2

Descriptive statistics

Standard deviation103.81033
Coefficient of variation (CV)5.6979226
Kurtosis61.716508
Mean18.218978
Median Absolute Deviation (MAD)0
Skewness7.7938756
Sum2496
Variance10776.584
MonotonicityNot monotonic
2024-04-17T21:08:24.969090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
0 91
64.1%
2 8
 
5.6%
3 5
 
3.5%
1 4
 
2.8%
5 4
 
2.8%
8 3
 
2.1%
21 2
 
1.4%
9 2
 
1.4%
54 1
 
0.7%
7 1
 
0.7%
Other values (16) 16
 
11.3%
(Missing) 5
 
3.5%
ValueCountFrequency (%)
0 91
64.1%
1 4
 
2.8%
2 8
 
5.6%
3 5
 
3.5%
4 1
 
0.7%
5 4
 
2.8%
6 1
 
0.7%
7 1
 
0.7%
8 3
 
2.1%
9 2
 
1.4%
ValueCountFrequency (%)
900 1
0.7%
800 1
0.7%
192 1
0.7%
78 1
0.7%
68 1
0.7%
54 1
0.7%
49 1
0.7%
48 1
0.7%
44 1
0.7%
38 1
0.7%

총대장균군
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
불검출
118 
검출
19 
<NA>
 
5

Length

Max length4
Median length3
Mean length2.9014085
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row불검출

Common Values

ValueCountFrequency (%)
불검출 118
83.1%
검출 19
 
13.4%
<NA> 5
 
3.5%

Length

2024-04-17T21:08:25.069796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:08:25.158001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
불검출 118
83.1%
검출 19
 
13.4%
na 5
 
3.5%

분원성대장균군
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
불검출
130 
검출
 
7
<NA>
 
5

Length

Max length4
Median length3
Mean length2.9859155
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row불검출

Common Values

ValueCountFrequency (%)
불검출 130
91.5%
검출 7
 
4.9%
<NA> 5
 
3.5%

Length

2024-04-17T21:08:25.250098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:08:25.353046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
불검출 130
91.5%
검출 7
 
4.9%
na 5
 
3.5%

암모니아성 질소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
불검출
137 
<NA>
 
5

Length

Max length4
Median length3
Mean length3.0352113
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row불검출

Common Values

ValueCountFrequency (%)
불검출 137
96.5%
<NA> 5
 
3.5%

Length

2024-04-17T21:08:25.439539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:08:25.521255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
불검출 137
96.5%
na 5
 
3.5%

질산성 질소
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct30
Distinct (%)21.9%
Missing5
Missing (%)3.5%
Infinite0
Infinite (%)0.0%
Mean1.5175182
Minimum0.5
Maximum4.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-04-17T21:08:25.597092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.5
5-th percentile0.7
Q11
median1.3
Q31.8
95-th percentile3.2
Maximum4.1
Range3.6
Interquartile range (IQR)0.8

Descriptive statistics

Standard deviation0.763496
Coefficient of variation (CV)0.50312147
Kurtosis1.6157324
Mean1.5175182
Median Absolute Deviation (MAD)0.3
Skewness1.3712656
Sum207.9
Variance0.58292615
MonotonicityNot monotonic
2024-04-17T21:08:25.692488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1.0 12
 
8.5%
1.3 12
 
8.5%
1.1 12
 
8.5%
1.2 10
 
7.0%
0.9 9
 
6.3%
1.4 9
 
6.3%
0.8 9
 
6.3%
1.5 9
 
6.3%
2.2 5
 
3.5%
1.6 5
 
3.5%
Other values (20) 45
31.7%
(Missing) 5
 
3.5%
ValueCountFrequency (%)
0.5 4
 
2.8%
0.6 2
 
1.4%
0.7 4
 
2.8%
0.8 9
6.3%
0.9 9
6.3%
1.0 12
8.5%
1.1 12
8.5%
1.2 10
7.0%
1.3 12
8.5%
1.4 9
6.3%
ValueCountFrequency (%)
4.1 1
 
0.7%
4.0 1
 
0.7%
3.9 1
 
0.7%
3.3 3
2.1%
3.2 3
2.1%
3.1 2
1.4%
2.8 1
 
0.7%
2.7 2
1.4%
2.6 2
1.4%
2.5 1
 
0.7%

과망간산칼륨소비량
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
불검출
82 
0.3
43 
0.4
 
6
<NA>
 
5
0.5
 
3
Other values (2)
 
3

Length

Max length4
Median length3
Mean length3.0352113
Min length3

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row불검출
2nd row불검출
3rd row불검출
4th row불검출
5th row0.3

Common Values

ValueCountFrequency (%)
불검출 82
57.7%
0.3 43
30.3%
0.4 6
 
4.2%
<NA> 5
 
3.5%
0.5 3
 
2.1%
0.6 2
 
1.4%
0.7 1
 
0.7%

Length

2024-04-17T21:08:25.801889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:08:25.894309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
불검출 82
57.7%
0.3 43
30.3%
0.4 6
 
4.2%
na 5
 
3.5%
0.5 3
 
2.1%
0.6 2
 
1.4%
0.7 1
 
0.7%

Interactions

2024-04-17T21:08:22.597214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.214475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.416246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.665048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.283620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.479418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.727049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.344909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:08:22.533846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T21:08:25.966022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호검사여부미검사사유검사일자검사결과일반세균 중온총대장균군분원성대장균군질산성 질소과망간산칼륨소비량
번호1.0000.0001.0000.9510.5050.0000.5030.4460.2710.000
검사여부0.0001.000NaNNaN0.000NaNNaNNaNNaNNaN
미검사사유1.000NaN1.000NaNNaNNaNNaNNaNNaNNaN
검사일자0.951NaNNaN1.0000.7810.0000.7810.4510.4510.602
검사결과0.5050.000NaN0.7811.0000.5090.9990.7350.0000.000
일반세균 중온0.000NaNNaN0.0000.5091.0000.5090.7140.0000.531
총대장균군0.503NaNNaN0.7810.9990.5091.0000.7350.0000.000
분원성대장균군0.446NaNNaN0.4510.7350.7140.7351.0000.0000.285
질산성 질소0.271NaNNaN0.4510.0000.0000.0000.0001.0000.000
과망간산칼륨소비량0.000NaNNaN0.6020.0000.5310.0000.2850.0001.000
2024-04-17T21:08:26.079159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
총대장균군암모니아성 질소분원성대장균군검사일자과망간산칼륨소비량검사여부검사결과
총대장균군1.0001.0000.5250.6460.0001.0000.969
암모니아성 질소1.0001.0001.0001.0001.0001.0001.000
분원성대장균군0.5251.0001.0000.3600.2011.0000.525
검사일자0.6461.0000.3601.0000.3001.0000.646
과망간산칼륨소비량0.0001.0000.2010.3001.0001.0000.000
검사여부1.0001.0001.0001.0001.0001.0000.000
검사결과0.9691.0000.5250.6460.0000.0001.000
2024-04-17T21:08:26.169468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호일반세균 중온질산성 질소검사여부검사일자검사결과총대장균군분원성대장균군암모니아성 질소과망간산칼륨소비량
번호1.0000.054-0.1090.0000.7240.3770.3750.3321.0000.000
일반세균 중온0.0541.000-0.1501.0000.0000.3430.3430.5051.0000.367
질산성 질소-0.109-0.1501.0001.0000.2030.0000.0000.0001.0000.000
검사여부0.0001.0001.0001.0001.0000.0001.0001.0001.0001.000
검사일자0.7240.0000.2031.0001.0000.6460.6460.3601.0000.300
검사결과0.3770.3430.0000.0000.6461.0000.9690.5251.0000.000
총대장균군0.3750.3430.0001.0000.6460.9691.0000.5251.0000.000
분원성대장균군0.3320.5050.0001.0000.3600.5250.5251.0001.0000.201
암모니아성 질소1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
과망간산칼륨소비량0.0000.3670.0001.0000.3000.0000.0000.2011.0001.000

Missing values

2024-04-17T21:08:22.823130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T21:08:22.952713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-17T21:08:23.077404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호공동시설명검사여부미검사사유검사일자검사결과일반세균 중온총대장균군분원성대장균군암모니아성 질소질산성 질소과망간산칼륨소비량
01서구 서대신동 꽃마을검사<NA>2022-12-05적합0불검출불검출불검출2.2불검출
12서구 중앙공원민속관검사<NA>2022-12-05적합0불검출불검출불검출1.7불검출
23서구 중앙공원석탑검사<NA>2022-12-05적합0불검출불검출불검출1.0불검출
34서구 중앙공원구이검사<NA>2022-12-05적합0불검출불검출불검출1.3불검출
45서구 중앙공원옥천검사<NA>2022-12-05적합0불검출불검출불검출1.40.3
56서구 중앙공원팔각정검사<NA>2022-12-05적합0불검출불검출불검출1.1불검출
67서구 남부민동 천마정검사<NA>2022-12-05적합0불검출불검출불검출3.1불검출
78동구 만리산검사<NA>2022-12-16적합3불검출불검출불검출2.8불검출
89동구 청조검사<NA>2022-12-16적합0불검출불검출불검출2.60.3
910동구 수정샘검사<NA>2022-12-16적합5불검출불검출불검출1.00.3
번호공동시설명검사여부미검사사유검사일자검사결과일반세균 중온총대장균군분원성대장균군암모니아성 질소질산성 질소과망간산칼륨소비량
132133사상구 승학(엄궁)검사<NA>2022-10-19적합2불검출불검출불검출1.40.3
133134사상구 거북검사<NA>2022-10-19적합0불검출불검출불검출0.60.3
134135사상구 삼운정검사<NA>2022-12-07부적합68검출불검출불검출1.00.3
135136사상구 운수사검사<NA>2022-10-19적합0불검출불검출불검출0.7불검출
136137사상구 백수검사<NA>2022-12-06부적합21검출불검출불검출3.20.5
137138사상구 청수검사<NA>2022-11-01부적합1검출불검출불검출1.5불검출
138139사상구 승학(학장)검사<NA>2022-12-12부적합3검출불검출불검출0.50.3
139140사상구 사상<NA><NA>2022-10-19적합6불검출불검출불검출1.2불검출
140141기장군 용소검사<NA>2022-12-05적합7불검출불검출불검출1.0불검출
141142기장군 불광산검사<NA>2022-12-05적합9불검출불검출불검출0.8불검출