Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 519 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 21.4 KiB |
Average record size in memory | 42.3 B |
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Boolean | 1 |
Dataset
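Description | Data on the list of water quality test items from the Waterworks Headquarters (상수도사업본부), providing information such as the test item name, item definition, item source, and whether the item is a major pollutant. |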
---|---|
URL | https://www.data.go.kr/data/15118749/fileData.do |
Reproduction
Analysis started | 2023-12-12 02:09:46.786960 |
---|---|
Analysis finished | 2023-12-12 02:09:47.758118 |
Duration | 0.97 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
검사항목일련번호
Real number (ℝ)
UNIQUE
Distinct | 519 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 278.5896 |
Minimum | 1 |
---|---|
Maximum | 550 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 32.8 |
Q1 | 140.5 |
median | 280 |
Q3 | 414.5 |
95-th percentile | 523.2 |
Maximum | 550 |
Range | 549 |
Interquartile range (IQR) | 274 |
Descriptive statistics
Standard deviation | 158.05894 |
---|---|
Coefficient of variation (CV) | 0.56735407 |
Kurtosis | -1.1952264 |
Mean | 278.5896 |
Median Absolute Deviation (MAD) | 137 |
Skewness | -0.011380216 |
Sum | 144588 |
Variance | 24982.629 |
Monotonicity | Not monotonic |
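Several of the figures above are derived from one another, which allows a quick consistency check. The following is a minimal sketch that recomputes them from the values reported for 검사항목일련번호; the variable names are illustrative and are not part of the profiling tool.

```python
# Values copied from the 검사항목일련번호 tables above.
n = 519                  # number of observations
total = 144588           # Sum
q1, q3 = 140.5, 414.5    # Q1 and Q3
minimum, maximum = 1, 550
std = 158.05894          # Standard deviation

mean = total / n                   # Mean = Sum / n
value_range = maximum - minimum    # Range = Maximum - Minimum
iqr = q3 - q1                      # IQR = Q3 - Q1
cv = std / mean                    # CV = Standard deviation / Mean
variance = std ** 2                # Variance = Standard deviation squared

print(round(mean, 4), value_range, iqr, round(cv, 4), round(variance, 1))
```

Small differences in the last digits are expected because the inputs are already rounded in the report.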
Value | Count | Frequency (%) |
201 | 1 | 0.2% |
398 | 1 | 0.2% |
106 | 1 | 0.2% |
105 | 1 | 0.2% |
412 | 1 | 0.2% |
451 | 1 | 0.2% |
104 | 1 | 0.2% |
385 | 1 | 0.2% |
332 | 1 | 0.2% |
103 | 1 | 0.2% |
Other values (509) | 509 | 98.1% |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
2 | 1 | 0.2% |
3 | 1 | 0.2% |
4 | 1 | 0.2% |
6 | 1 | 0.2% |
9 | 1 | 0.2% |
10 | 1 | 0.2% |
12 | 1 | 0.2% |
13 | 1 | 0.2% |
14 | 1 | 0.2% |
Value | Count | Frequency (%) |
550 | 1 | 0.2% |
549 | 1 | 0.2% |
548 | 1 | 0.2% |
547 | 1 | 0.2% |
546 | 1 | 0.2% |
545 | 1 | 0.2% |
544 | 1 | 0.2% |
543 | 1 | 0.2% |
542 | 1 | 0.2% |
541 | 1 | 0.2% |
검사항목명
Text
Distinct | 508 |
---|---|
Distinct (%) | 97.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 17 |
Mean length | 6.5491329 |
Min length | 1 |
Characters and Unicode
Total characters | 3399 |
---|---|
Distinct characters | 336 |
Distinct categories | 12 |
Distinct scripts | 3 |
Distinct blocks | 5 |
Unique
Unique | 499 |
---|---|
Unique (%) | 96.1% |
Sample
1st row | 1,1-디클로로에탄 |
---|---|
2nd row | 1,1-디클로로프로펜 |
3rd row | 1,1,1-트리클로로아세톤 |
4th row | 1,1,1,2-테트라클로로에탄 |
5th row | 1,1,2-트리클로로에탄 |
Value | Count | Frequency (%) |
pacsⅱ | 3 | 0.6% |
불활성화비 | 3 | 0.6% |
지아디아 | 3 | 0.6% |
대장균 | 3 | 0.6% |
탁도 | 3 | 0.6% |
아조벤젠 | 2 | 0.4% |
잔류염소 | 2 | 0.4% |
과산화수소 | 2 | 0.4% |
브로모클로로아이요드메탄 | 2 | 0.4% |
크립토스포리디움 | 2 | 0.4% |
Other values (502) | 505 | 95.3% |
Most occurring characters
Value | Count | Frequency (%) |
로 | 217 | 6.4% |
- | 100 | 2.9% |
클 | 93 | 2.7% |
트 | 81 | 2.4% |
이 | 74 | 2.2% |
디 | 70 | 2.1% |
아 | 68 | 2.0% |
) | 62 | 1.8% |
( | 62 | 1.8% |
1 | 60 | 1.8% |
Other values (326) | 2512 | 73.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2498 | 73.5% |
Uppercase Letter | 226 | 6.6% |
Lowercase Letter | 203 | 6.0% |
Decimal Number | 156 | 4.6% |
Dash Punctuation | 100 | 2.9% |
Other Punctuation | 73 | 2.1% |
Close Punctuation | 62 | 1.8% |
Open Punctuation | 62 | 1.8% |
Space Separator | 13 | 0.4% |
Letter Number | 3 | 0.1% |
Other values (2) | 3 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 217 | 8.7% |
클 | 93 | 3.7% |
트 | 81 | 3.2% |
이 | 74 | 3.0% |
디 | 70 | 2.8% |
아 | 68 | 2.7% |
소 | 56 | 2.2% |
리 | 48 | 1.9% |
산 | 46 | 1.8% |
틸 | 39 | 1.6% |
Other values (263) | 1706 | 68.3% |
Lowercase Letter
Value | Count | Frequency (%) |
n | 34 | 16.7% |
e | 22 | 10.8% |
a | 22 | 10.8% |
c | 16 | 7.9% |
r | 13 | 6.4% |
o | 12 | 5.9% |
h | 10 | 4.9% |
l | 9 | 4.4% |
i | 9 | 4.4% |
s | 8 | 3.9% |
Other values (13) | 48 | 23.6% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 32 | 14.2% |
A | 31 | 13.7% |
P | 24 | 10.6% |
S | 23 | 10.2% |
B | 20 | 8.8% |
T | 17 | 7.5% |
D | 15 | 6.6% |
N | 12 | 5.3% |
M | 11 | 4.9% |
I | 7 | 3.1% |
Other values (10) | 34 | 15.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 60 | 38.5% |
2 | 39 | 25.0% |
4 | 26 | 16.7% |
3 | 17 | 10.9% |
6 | 7 | 4.5% |
0 | 4 | 2.6% |
5 | 2 | 1.3% |
7 | 1 | 0.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 45 | 61.6% |
. | 26 | 35.6% |
/ | 1 | 1.4% |
% | 1 | 1.4% |
Other Number
Value | Count | Frequency (%) |
₂ | 1 | 50.0% |
₃ | 1 | 50.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 100 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 62 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 62 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 13 | 100.0% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 3 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 | 100.0% |
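The category names in these tables correspond to Unicode general categories (Other Letter is `Lo`, Decimal Number is `Nd`, Dash Punctuation is `Pd`, and so on). As a rough illustration, the sketch below classifies the characters of the first sample row with Python's standard `unicodedata` module. Whether the profiling tool uses this module internally is an assumption; the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "1,1-디클로로에탄"  # first sample row of 검사항목명
counts = category_counts(sample)

# Hangul syllables fall under 'Lo' (Other Letter), digits under 'Nd',
# the comma under 'Po' and the hyphen under 'Pd'.
print(counts)
```

Applied to the whole column, the same counting would yield totals comparable to the "Most occurring categories" table.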
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2498 | 73.5% |
Common | 470 | 13.8% |
Latin | 431 | 12.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 217 | 8.7% |
클 | 93 | 3.7% |
트 | 81 | 3.2% |
이 | 74 | 3.0% |
디 | 70 | 2.8% |
아 | 68 | 2.7% |
소 | 56 | 2.2% |
리 | 48 | 1.9% |
산 | 46 | 1.8% |
틸 | 39 | 1.6% |
Other values (263) | 1706 |
Latin
Value | Count | Frequency (%) |
n | 34 | 7.9% |
C | 32 | 7.4% |
A | 31 | 7.2% |
P | 24 | 5.6% |
S | 23 | 5.3% |
e | 22 | 5.1% |
a | 22 | 5.1% |
B | 20 | 4.6% |
T | 17 | 3.9% |
c | 16 | 3.7% |
Other values (33) | 190 | 44.1% |
Common
Value | Count | Frequency (%) |
- | 100 | 21.3% |
) | 62 | 13.2% |
( | 62 | 13.2% |
1 | 60 | 12.8% |
, | 45 | 9.6% |
2 | 39 | 8.3% |
. | 26 | 5.5% |
4 | 26 | 5.5% |
3 | 17 | 3.6% |
(space) | 13 | 2.8% |
Other values (10) | 20 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2498 | 73.5% |
ASCII | 895 | 26.3% |
Number Forms | 3 | 0.1% |
None | 2 | 0.1% |
Letterlike Symbols | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
로 | 217 | 8.7% |
클 | 93 | 3.7% |
트 | 81 | 3.2% |
이 | 74 | 3.0% |
디 | 70 | 2.8% |
아 | 68 | 2.7% |
소 | 56 | 2.2% |
리 | 48 | 1.9% |
산 | 46 | 1.8% |
틸 | 39 | 1.6% |
Other values (263) | 1706 |
ASCII
Value | Count | Frequency (%) |
- | 100 | 11.2% |
) | 62 | 6.9% |
( | 62 | 6.9% |
1 | 60 | 6.7% |
, | 45 | 5.0% |
2 | 39 | 4.4% |
n | 34 | 3.8% |
C | 32 | 3.6% |
A | 31 | 3.5% |
. | 26 | 2.9% |
Other values (49) | 404 | 45.1% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 3 | 100.0% |
Letterlike Symbols
Value | Count | Frequency (%) |
ℓ | 1 | 100.0% |
None
Value | Count | Frequency (%) |
₂ | 1 | 50.0% |
₃ | 1 | 50.0% |
사용여부
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 651.0 B |
True | 509 |
---|---|
False | 10 |
Value | Count | Frequency (%) |
True | 509 | 98.1% |
False | 10 | 1.9% |
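The IMBALANCE flag reflects how unevenly the two classes are split. One common way to quantify this is an entropy-based score, where 0 means perfectly balanced and 1 means a single class. The sketch below applies that measure to the counts above; treating it as the score behind the flag is an assumption, and the tool's exact formula and threshold may differ.

```python
import math

counts = {"True": 509, "False": 10}  # from the frequency table above
n = sum(counts.values())

# Class shares, then Shannon entropy in bits.
shares = {label: count / n for label, count in counts.items()}
entropy = -sum(p * math.log2(p) for p in shares.values() if p > 0)

# Normalise by the maximum possible entropy for this number of classes.
imbalance = 1 - entropy / math.log2(len(counts))

print({k: round(v * 100, 1) for k, v in shares.items()}, round(imbalance, 3))
```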
수정자일련번호
Real number (ℝ)
Distinct | 13 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1015.6474 |
Minimum | 0 |
---|---|
Maximum | 9627 |
Zeros | 1 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 9332 |
Maximum | 9627 |
Range | 9627 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2600.7568 |
---|---|
Coefficient of variation (CV) | 2.5606887 |
Kurtosis | 5.6751341 |
Mean | 1015.6474 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.6468953 |
Sum | 527121 |
Variance | 6763935.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 426 | 82.1% |
2510 | 32 | 6.2% |
9627 | 20 | 3.9% |
9332 | 16 | 3.1% |
4518 | 11 | 2.1% |
4534 | 4 | 0.8% |
9294 | 3 | 0.6% |
748 | 2 | 0.4% |
716 | 1 | 0.2% |
0 | 1 | 0.2% |
Other values (3) | 3 | 0.6% |
Value | Count | Frequency (%) |
0 | 1 | 0.2% |
1 | 426 | 82.1% |
548 | 1 | 0.2% |
716 | 1 | 0.2% |
748 | 2 | 0.4% |
1539 | 1 | 0.2% |
2510 | 32 | 6.2% |
4508 | 1 | 0.2% |
4518 | 11 | 2.1% |
4534 | 4 | 0.8% |
Value | Count | Frequency (%) |
9627 | 20 | 3.9% |
9332 | 16 | 3.1% |
9294 | 3 | 0.6% |
4534 | 4 | 0.8% |
4518 | 11 | 2.1% |
4508 | 1 | 0.2% |
2510 | 32 | 6.2% |
1539 | 1 | 0.2% |
748 | 2 | 0.4% |
716 | 1 | 0.2% |
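Because 수정자일련번호 has only 13 distinct values, the three frequency tables above list all of them between them. The sketch below combines them into one value-to-count mapping and checks the result against the reported Sum, Mean, and number of observations. The dictionary is assembled by hand from the tables; nothing here comes from the tool's API.

```python
# value -> count, combined from the three frequency tables above
frequencies = {
    0: 1, 1: 426, 548: 1, 716: 1, 748: 2, 1539: 1, 2510: 32,
    4508: 1, 4518: 11, 4534: 4, 9294: 3, 9332: 16, 9627: 20,
}

n = sum(frequencies.values())                                   # observations
total = sum(value * count for value, count in frequencies.items())  # Sum
mean = total / n                                                # Mean
share_of_one = frequencies[1] / n   # share of rows where the value is 1

print(n, len(frequencies), total, round(mean, 4), round(share_of_one * 100, 1))
```

The dominance of the value 1 also explains why Q1, the median, and Q3 are all 1 and why the IQR and MAD are 0.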
수정시간
Text
Distinct | 144 |
---|---|
Distinct (%) | 27.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 21 |
Mean length | 21.198459 |
Min length | 21 |
Characters and Unicode
Total characters | 11002 |
---|---|
Distinct characters | 16 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 131 |
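The length figures for 수정시간 are tightly constrained: every value is either 21 or 22 characters long, so the total character count determines how many values have each length. The sketch below works through that arithmetic; the variable names are illustrative.

```python
n_rows = 519
total_chars = 11002
min_len, max_len = 21, 22

mean_len = total_chars / n_rows  # Mean length = Total characters / rows

# Each 22-character value contributes exactly one character beyond 21,
# so the surplus over 21 * n_rows is the number of 22-character values.
long_rows = total_chars - min_len * n_rows
short_rows = n_rows - long_rows

print(round(mean_len, 6), long_rows, short_rows)
```

A plausible reading, not confirmed by the report, is that the longer values are those with a two-digit hour such as 10:48:28.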
---|---|
Unique (%) | 25.2% |
Sample
1st row | 2011-07-20 오후 4:06:59 |
---|---|
2nd row | 2011-05-30 오후 5:44:11 |
3rd row | 2011-07-20 오후 4:06:59 |
4th row | 2011-07-20 오후 4:06:59 |
5th row | 2011-07-20 오후 4:06:59 |
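The column is profiled as Text because the timestamps use the Korean markers 오전 (AM) and 오후 (PM). To analyze it as a datetime, the values would need to be parsed first. The following is a minimal sketch using only the standard library; the function name `parse_korean_timestamp` is hypothetical, and it assumes the usual 12-hour convention in which 오전 12 is midnight and 오후 12 is noon.

```python
from datetime import datetime


def parse_korean_timestamp(text: str) -> datetime:
    """Parse 'YYYY-MM-DD 오전|오후 H:MM:SS' into a naive datetime."""
    date_part, meridiem, time_part = text.split()
    if meridiem not in ("오전", "오후"):
        raise ValueError(f"unexpected meridiem marker: {meridiem!r}")

    year, month, day = (int(x) for x in date_part.split("-"))
    hour, minute, second = (int(x) for x in time_part.split(":"))

    hour %= 12              # map 12 to 0 (assumed 12-hour convention)
    if meridiem == "오후":  # PM: shift into the second half of the day
        hour += 12
    return datetime(year, month, day, hour, minute, second)


parsed = parse_korean_timestamp("2011-07-20 오후 4:06:59")
print(parsed.isoformat())
```

Manual parsing is used here instead of `strptime` with `%p`, since `%p` depends on the active locale and does not recognize 오전/오후 in the default C locale.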
Value | Count | Frequency (%) |
오후 | 415 | 26.7% |
2011-07-20 | 178 | 11.4% |
4:06:59 | 177 | 11.4% |
2011-05-30 | 136 | 8.7% |
5:44:11 | 136 | 8.7% |
오전 | 104 | 6.7% |
2011-06-23 | 23 | 1.5% |
10:48:28 | 23 | 1.5% |
2018-02-13 | 21 | 1.3% |
2011-08-04 | 16 | 1.0% |
Other values (184) | 328 | 21.1% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 1716 | 15.6% |
1 | 1604 | 14.6% |
- | 1038 | 9.4% |
(space) | 1038 | 9.4% |
: | 1038 | 9.4% |
2 | 1016 | 9.2% |
4 | 603 | 5.5% |
5 | 590 | 5.4% |
오 | 519 | 4.7% |
후 | 415 | 3.8% |
Other values (6) | 1425 | 13.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 6850 | 62.3% |
Dash Punctuation | 1038 | 9.4% |
Space Separator | 1038 | 9.4% |
Other Punctuation | 1038 | 9.4% |
Other Letter | 1038 | 9.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 1716 | 25.1% |
1 | 1604 | 23.4% |
2 | 1016 | 14.8% |
4 | 603 | 8.8% |
5 | 590 | 8.6% |
3 | 334 | 4.9% |
9 | 309 | 4.5% |
6 | 275 | 4.0% |
7 | 244 | 3.6% |
8 | 159 | 2.3% |
Other Letter
Value | Count | Frequency (%) |
오 | 519 | 50.0% |
후 | 415 | 40.0% |
전 | 104 | 10.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1038 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1038 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
: | 1038 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 9964 | 90.6% |
Hangul | 1038 | 9.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 1716 | 17.2% |
1 | 1604 | 16.1% |
- | 1038 | 10.4% |
(space) | 1038 | 10.4% |
: | 1038 | 10.4% |
2 | 1016 | 10.2% |
4 | 603 | 6.1% |
5 | 590 | 5.9% |
3 | 334 | 3.4% |
9 | 309 | 3.1% |
Other values (3) | 678 | 6.8% |
Hangul
Value | Count | Frequency (%) |
오 | 519 | 50.0% |
후 | 415 | 40.0% |
전 | 104 | 10.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9964 | 90.6% |
Hangul | 1038 | 9.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 1716 | 17.2% |
1 | 1604 | 16.1% |
- | 1038 | 10.4% |
(space) | 1038 | 10.4% |
: | 1038 | 10.4% |
2 | 1016 | 10.2% |
4 | 603 | 6.1% |
5 | 590 | 5.9% |
3 | 334 | 3.4% |
9 | 309 | 3.1% |
Other values (3) | 678 | 6.8% |
Hangul
Value | Count | Frequency (%) |
오 | 519 | 50.0% |
후 | 415 | 40.0% |
전 | 104 | 10.0% |
| | 검사항목일련번호 | 사용여부 | 수정자일련번호 |
---|---|---|---|
검사항목일련번호 | 1.000 | 0.451 | 0.774 |
사용여부 | 0.451 | 1.000 | 0.228 |
수정자일련번호 | 0.774 | 0.228 | 1.000 |
| | 검사항목일련번호 | 수정자일련번호 | 사용여부 |
---|---|---|---|
검사항목일련번호 | 1.000 | 0.491 | 0.344 |
수정자일련번호 | 0.491 | 1.000 | 0.277 |
사용여부 | 0.344 | 0.277 | 1.000 |
| | 검사항목일련번호 | 검사항목명 | 사용여부 | 수정자일련번호 | 수정시간 |
---|---|---|---|---|---|
0 | 201 | 1,1-디클로로에탄 | Y | 1 | 2011-07-20 오후 4:06:59 |
1 | 315 | 1,1-디클로로프로펜 | Y | 1 | 2011-05-30 오후 5:44:11 |
2 | 206 | 1,1,1-트리클로로아세톤 | Y | 1 | 2011-07-20 오후 4:06:59 |
3 | 208 | 1,1,1,2-테트라클로로에탄 | Y | 1 | 2011-07-20 오후 4:06:59 |
4 | 207 | 1,1,2-트리클로로에탄 | Y | 1 | 2011-07-20 오후 4:06:59 |
5 | 313 | 1,1,2,2-테트라클로로에탄 | Y | 1 | 2011-05-30 오후 5:44:11 |
6 | 319 | 1,2-디클로로프로판 | Y | 1 | 2011-05-30 오후 5:44:11 |
7 | 269 | 1,2,3-트리클로로벤젠 | Y | 1 | 2011-05-30 오후 5:44:11 |
8 | 320 | 1,2,3-트리클로로프로판 | Y | 1 | 2011-05-30 오후 5:44:11 |
9 | 314 | 1,2,4-트리메틸벤젠 | Y | 1 | 2011-05-30 오후 5:44:11 |
| | 검사항목일련번호 | 검사항목명 | 사용여부 | 수정자일련번호 | 수정시간 |
---|---|---|---|---|---|
509 | 541 | MCPA | Y | 9332 | 2022-03-06 오후 2:41:10 |
510 | 542 | 아세나프텐 | Y | 9332 | 2022-05-10 오후 5:25:49 |
511 | 543 | 미세플라스틱 | Y | 9627 | 2023-01-02 오후 1:58:11 |
512 | 544 | DCAcAm | Y | 9627 | 2023-01-02 오후 2:29:46 |
513 | 545 | BCAcAm | Y | 9627 | 2023-01-02 오후 2:30:23 |
514 | 546 | TCAcAm | Y | 9627 | 2023-01-02 오후 2:30:50 |
515 | 547 | DBAcAm | Y | 9627 | 2023-01-02 오후 2:31:11 |
516 | 548 | BDCAcAm | Y | 9627 | 2023-01-02 오후 2:31:35 |
517 | 549 | DBCAcAm | Y | 9627 | 2023-01-02 오후 2:31:55 |
518 | 550 | TBAcAm | Y | 9627 | 2023-01-02 오후 2:32:15 |