Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 76 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 644.5 KiB |
Average record size in memory | 66.0 B |
Variable types
Categorical | 3 |
---|---|
Text | 3 |
Numeric | 1 |
Dataset
Description | 검사년도,검체번호,고유번호,구명,조사결과값,조사항목명,허가신고번호 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-22145/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 04:45:21.919953 |
---|---|
Analysis finished | 2024-05-11 04:45:23.907552 |
Duration | 1.99 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
검사년도
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2008 | |
---|---|
2011 | |
2009 | |
2010 | |
2012 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010 |
---|---|
2nd row | 2011 |
3rd row | 2008 |
4th row | 2009 |
5th row | 2008 |
Common Values
Value | Count | Frequency (%) |
2008 | 3284 | |
2011 | 2673 | |
2009 | 1936 | |
2010 | 1453 | |
2012 | 654 | 6.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2008 | 3284 | |
2011 | 2673 | |
2009 | 1936 | |
2010 | 1453 | |
2012 | 654 | 6.5% |
검체번호
Text
Distinct | 2535 |
---|---|
Distinct (%) | 25.4% |
Missing | 4 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 19 |
---|---|
Median length | 14 |
Mean length | 14.011004 |
Min length | 14 |
Characters and Unicode
Total characters | 140054 |
---|---|
Distinct characters | 17 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 180 ? |
---|---|
Unique (%) | 1.8% |
Sample
1st row | 2W10-06670-005 |
---|---|
2nd row | 2W11-07033-005 |
3rd row | 2W08-21171-002 |
4th row | 2W09-10650-003 |
5th row | 2W08-18635-001 |
Value | Count | Frequency (%) |
2w11-06223-002 | 11 | 0.1% |
2w11-04999-001 | 11 | 0.1% |
2w08-20634-010 | 10 | 0.1% |
2w08-19706-003 | 10 | 0.1% |
2w08-15145-003 | 10 | 0.1% |
2w08-10144-004 | 10 | 0.1% |
2w11-14611-003 | 10 | 0.1% |
2w11-05727-001 | 10 | 0.1% |
2w08-15612-001 | 10 | 0.1% |
2w11-16257-017 | 10 | 0.1% |
Other values (2525) | 9894 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 34789 | |
- | 19992 | |
1 | 18487 | |
2 | 17666 | |
W | 9996 | 7.1% |
8 | 6891 | 4.9% |
9 | 5863 | 4.2% |
4 | 5768 | 4.1% |
3 | 5709 | 4.1% |
5 | 5460 | 3.9% |
Other values (7) | 9433 | 6.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 109956 | |
Dash Punctuation | 19992 | 14.3% |
Uppercase Letter | 10062 | 7.2% |
Open Punctuation | 22 | < 0.1% |
Close Punctuation | 22 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 34789 | |
1 | 18487 | |
2 | 17666 | |
8 | 6891 | 6.3% |
9 | 5863 | 5.3% |
4 | 5768 | 5.2% |
3 | 5709 | 5.2% |
5 | 5460 | 5.0% |
6 | 5072 | 4.6% |
7 | 4251 | 3.9% |
Uppercase Letter
Value | Count | Frequency (%) |
W | 9996 | |
C | 22 | 0.2% |
L | 22 | 0.2% |
S | 22 | 0.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19992 |
Open Punctuation
Value | Count | Frequency (%) |
( | 22 |
Close Punctuation
Value | Count | Frequency (%) |
) | 22 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 129992 | |
Latin | 10062 | 7.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 34789 | |
- | 19992 | |
1 | 18487 | |
2 | 17666 | |
8 | 6891 | 5.3% |
9 | 5863 | 4.5% |
4 | 5768 | 4.4% |
3 | 5709 | 4.4% |
5 | 5460 | 4.2% |
6 | 5072 | 3.9% |
Other values (3) | 4295 | 3.3% |
Latin
Value | Count | Frequency (%) |
W | 9996 | |
C | 22 | 0.2% |
L | 22 | 0.2% |
S | 22 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 140054 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 34789 | |
- | 19992 | |
1 | 18487 | |
2 | 17666 | |
W | 9996 | 7.1% |
8 | 6891 | 4.9% |
9 | 5863 | 4.2% |
4 | 5768 | 4.1% |
3 | 5709 | 4.1% |
5 | 5460 | 3.9% |
Other values (7) | 9433 | 6.7% |
고유번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 2543 |
---|---|
Distinct (%) | 25.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1302.4044 |
Minimum | 1 |
---|---|
Maximum | 2591 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 131 |
Q1 | 655.75 |
median | 1308.5 |
Q3 | 1942 |
95-th percentile | 2465 |
Maximum | 2591 |
Range | 2590 |
Interquartile range (IQR) | 1286.25 |
Descriptive statistics
Standard deviation | 748.53109 |
---|---|
Coefficient of variation (CV) | 0.57473016 |
Kurtosis | -1.1896201 |
Mean | 1302.4044 |
Median Absolute Deviation (MAD) | 641.5 |
Skewness | -0.012355124 |
Sum | 13024044 |
Variance | 560298.79 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1058 | 11 | 0.1% |
783 | 11 | 0.1% |
1569 | 10 | 0.1% |
832 | 10 | 0.1% |
11 | 10 | 0.1% |
1339 | 10 | 0.1% |
763 | 10 | 0.1% |
918 | 10 | 0.1% |
786 | 10 | 0.1% |
2335 | 9 | 0.1% |
Other values (2533) | 9899 |
Value | Count | Frequency (%) |
1 | 2 | < 0.1% |
2 | 2 | < 0.1% |
3 | 5 | |
4 | 6 | |
5 | 4 | |
6 | 3 | < 0.1% |
7 | 5 | |
8 | 9 | |
9 | 6 | |
10 | 3 | < 0.1% |
Value | Count | Frequency (%) |
2591 | 2 | < 0.1% |
2590 | 4 | |
2589 | 4 | |
2588 | 3 | < 0.1% |
2587 | 3 | < 0.1% |
2586 | 9 | |
2585 | 3 | < 0.1% |
2584 | 2 | < 0.1% |
2583 | 3 | < 0.1% |
2582 | 4 |
구명
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
서초구 | |
---|---|
강동구 | |
관악구 | |
노원구 | 624 |
동작구 | 601 |
Other values (20) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0755 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강동구 |
---|---|
2nd row | 용산구 |
3rd row | 마포구 |
4th row | 은평구 |
5th row | 서초구 |
Common Values
Value | Count | Frequency (%) |
서초구 | 923 | 9.2% |
강동구 | 822 | 8.2% |
관악구 | 674 | 6.7% |
노원구 | 624 | 6.2% |
동작구 | 601 | 6.0% |
송파구 | 551 | 5.5% |
은평구 | 539 | 5.4% |
강남구 | 473 | 4.7% |
구로구 | 443 | 4.4% |
금천구 | 437 | 4.4% |
Other values (15) | 3913 |
Length
Value | Count | Frequency (%) |
서초구 | 923 | 9.2% |
강동구 | 822 | 8.2% |
관악구 | 674 | 6.7% |
노원구 | 624 | 6.2% |
동작구 | 601 | 6.0% |
송파구 | 551 | 5.5% |
은평구 | 539 | 5.4% |
강남구 | 473 | 4.7% |
구로구 | 443 | 4.4% |
금천구 | 437 | 4.4% |
Other values (15) | 3913 |
조사결과값
Text
Distinct | 404 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
4885 | ||
불검출 | 3799 | |
0 | 174 | 1.7% |
1.8미만 | 47 | 0.5% |
없음 | 41 | 0.4% |
무취 | 38 | 0.4% |
무미 | 33 | 0.3% |
6.7 | 28 | 0.3% |
2미만 | 21 | 0.2% |
7 | 21 | 0.2% |
Other values (394) | 913 | 9.1% |
Most occurring characters
Value | Count | Frequency (%) |
- | 4885 | |
불 | 3799 | |
검 | 3799 | |
출 | 3799 | |
. | 650 | 3.3% |
0 | 636 | 3.2% |
1 | 403 | 2.0% |
7 | 257 | 1.3% |
6 | 254 | 1.3% |
2 | 216 | 1.1% |
Other values (13) | 1251 | 6.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 11773 | |
Dash Punctuation | 4885 | |
Decimal Number | 2641 | 13.2% |
Other Punctuation | 650 | 3.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
불 | 3799 | |
검 | 3799 | |
출 | 3799 | |
미 | 101 | 0.9% |
무 | 71 | 0.6% |
만 | 68 | 0.6% |
음 | 41 | 0.3% |
없 | 41 | 0.3% |
취 | 38 | 0.3% |
이 | 8 | 0.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 636 | |
1 | 403 | |
7 | 257 | |
6 | 254 | 9.6% |
2 | 216 | 8.2% |
3 | 197 | 7.5% |
4 | 191 | 7.2% |
8 | 183 | 6.9% |
5 | 182 | 6.9% |
9 | 122 | 4.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4885 |
Other Punctuation
Value | Count | Frequency (%) |
. | 650 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 11773 | |
Common | 8176 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 4885 | |
. | 650 | 8.0% |
0 | 636 | 7.8% |
1 | 403 | 4.9% |
7 | 257 | 3.1% |
6 | 254 | 3.1% |
2 | 216 | 2.6% |
3 | 197 | 2.4% |
4 | 191 | 2.3% |
8 | 183 | 2.2% |
Other values (2) | 304 | 3.7% |
Hangul
Value | Count | Frequency (%) |
불 | 3799 | |
검 | 3799 | |
출 | 3799 | |
미 | 101 | 0.9% |
무 | 71 | 0.6% |
만 | 68 | 0.6% |
음 | 41 | 0.3% |
없 | 41 | 0.3% |
취 | 38 | 0.3% |
이 | 8 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 11773 | |
ASCII | 8176 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 4885 | |
. | 650 | 8.0% |
0 | 636 | 7.8% |
1 | 403 | 4.9% |
7 | 257 | 3.1% |
6 | 254 | 3.1% |
2 | 216 | 2.6% |
3 | 197 | 2.4% |
4 | 191 | 2.3% |
8 | 183 | 2.2% |
Other values (2) | 304 | 3.7% |
Hangul
Value | Count | Frequency (%) |
불 | 3799 | |
검 | 3799 | |
출 | 3799 | |
미 | 101 | 0.9% |
무 | 71 | 0.6% |
만 | 68 | 0.6% |
음 | 41 | 0.3% |
없 | 41 | 0.3% |
취 | 38 | 0.3% |
이 | 8 | 0.1% |
조사항목명
Categorical
Distinct | 43 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1,2-디브로모-3-클로로프로판 | 300 |
---|---|
대장균군수 | 297 |
카드뮴 | 283 |
1,1-디클로로에틸렌 | 279 |
분원성대장균군 | 277 |
Other values (38) |
Length
Max length | 17 |
---|---|
Median length | 11 |
Mean length | 4.6873 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1,2-디브로모-3-클로로프로판 |
---|---|
2nd row | 비소 |
3rd row | 크실렌 |
4th row | 시안 |
5th row | 디클로로메탄 |
Common Values
Value | Count | Frequency (%) |
1,2-디브로모-3-클로로프로판 | 300 | 3.0% |
대장균군수 | 297 | 3.0% |
카드뮴 | 283 | 2.8% |
1,1-디클로로에틸렌 | 279 | 2.8% |
분원성대장균군 | 277 | 2.8% |
벤젠 | 277 | 2.8% |
페놀 | 276 | 2.8% |
수은 | 276 | 2.8% |
비소 | 275 | 2.8% |
과망간산칼륨소비량 | 272 | 2.7% |
Other values (33) | 7188 |
Length
Value | Count | Frequency (%) |
1,2-디브로모-3-클로로프로판 | 300 | 3.0% |
대장균군수 | 297 | 3.0% |
카드뮴 | 283 | 2.8% |
1,1-디클로로에틸렌 | 279 | 2.8% |
분원성대장균군 | 277 | 2.8% |
벤젠 | 277 | 2.8% |
페놀 | 276 | 2.8% |
수은 | 276 | 2.8% |
비소 | 275 | 2.8% |
시안 | 272 | 2.7% |
Other values (33) | 7188 |
허가신고번호
Text
Distinct | 990 |
---|---|
Distinct (%) | 10.0% |
Missing | 72 |
Missing (%) | 0.7% |
Memory size | 156.2 KiB |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 9.6175463 |
Min length | 1 |
Characters and Unicode
Total characters | 95483 |
---|---|
Distinct characters | 35 |
Distinct categories | 4 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 23 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 1190100007 |
---|---|
2nd row | 2200600022 |
3rd row | 142120009 |
4th row | 120520003 |
5th row | 1190100004 |
Value | Count | Frequency (%) |
폐공 | 111 | 1.1% |
10320098 | 68 | 0.7% |
2200600013 | 60 | 0.6% |
2200600014 | 58 | 0.6% |
2200600001 | 54 | 0.5% |
1199700003 | 46 | 0.5% |
2197700003 | 45 | 0.5% |
2200100005 | 43 | 0.4% |
2200800043 | 42 | 0.4% |
2200900004 | 40 | 0.4% |
Other values (980) | 9361 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 37179 | |
1 | 17855 | |
2 | 14451 | 15.1% |
9 | 7371 | 7.7% |
3 | 3760 | 3.9% |
4 | 2937 | 3.1% |
8 | 2921 | 3.1% |
6 | 2688 | 2.8% |
7 | 2516 | 2.6% |
5 | 2468 | 2.6% |
Other values (25) | 1337 | 1.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 94146 | |
Dash Punctuation | 1009 | 1.1% |
Other Letter | 305 | 0.3% |
Lowercase Letter | 23 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
폐 | 119 | |
공 | 119 | |
수 | 5 | 1.6% |
하 | 5 | 1.6% |
지 | 5 | 1.6% |
출 | 5 | 1.6% |
유 | 5 | 1.6% |
예 | 4 | 1.3% |
정 | 4 | 1.3% |
처 | 4 | 1.3% |
Other values (12) | 30 | 9.8% |
Decimal Number
Value | Count | Frequency (%) |
0 | 37179 | |
1 | 17855 | |
2 | 14451 | 15.3% |
9 | 7371 | 7.8% |
3 | 3760 | 4.0% |
4 | 2937 | 3.1% |
8 | 2921 | 3.1% |
6 | 2688 | 2.9% |
7 | 2516 | 2.7% |
5 | 2468 | 2.6% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 12 | |
b | 11 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1009 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 95155 | |
Hangul | 305 | 0.3% |
Latin | 23 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
폐 | 119 | |
공 | 119 | |
수 | 5 | 1.6% |
하 | 5 | 1.6% |
지 | 5 | 1.6% |
출 | 5 | 1.6% |
유 | 5 | 1.6% |
예 | 4 | 1.3% |
정 | 4 | 1.3% |
처 | 4 | 1.3% |
Other values (12) | 30 | 9.8% |
Common
Value | Count | Frequency (%) |
0 | 37179 | |
1 | 17855 | |
2 | 14451 | 15.2% |
9 | 7371 | 7.7% |
3 | 3760 | 4.0% |
4 | 2937 | 3.1% |
8 | 2921 | 3.1% |
6 | 2688 | 2.8% |
7 | 2516 | 2.6% |
5 | 2468 | 2.6% |
Latin
Value | Count | Frequency (%) |
a | 12 | |
b | 11 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 95178 | |
Hangul | 305 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 37179 | |
1 | 17855 | |
2 | 14451 | 15.2% |
9 | 7371 | 7.7% |
3 | 3760 | 4.0% |
4 | 2937 | 3.1% |
8 | 2921 | 3.1% |
6 | 2688 | 2.8% |
7 | 2516 | 2.6% |
5 | 2468 | 2.6% |
Other values (3) | 1032 | 1.1% |
Hangul
Value | Count | Frequency (%) |
폐 | 119 | |
공 | 119 | |
수 | 5 | 1.6% |
하 | 5 | 1.6% |
지 | 5 | 1.6% |
출 | 5 | 1.6% |
유 | 5 | 1.6% |
예 | 4 | 1.3% |
정 | 4 | 1.3% |
처 | 4 | 1.3% |
Other values (12) | 30 | 9.8% |
검사년도 | 고유번호 | 구명 | 조사항목명 | |
---|---|---|---|---|
검사년도 | 1.000 | 0.483 | 0.609 | 0.053 |
고유번호 | 0.483 | 1.000 | 0.989 | 0.261 |
구명 | 0.609 | 0.989 | 1.000 | 0.228 |
조사항목명 | 0.053 | 0.261 | 0.228 | 1.000 |
조사항목명 | 검사년도 | 구명 | |
---|---|---|---|
조사항목명 | 1.000 | 0.024 | 0.054 |
검사년도 | 0.024 | 1.000 | 0.311 |
구명 | 0.054 | 0.311 | 1.000 |
고유번호 | 검사년도 | 구명 | 조사항목명 | |
---|---|---|---|---|
고유번호 | 1.000 | 0.221 | 0.898 | 0.093 |
검사년도 | 0.221 | 1.000 | 0.311 | 0.024 |
구명 | 0.898 | 0.311 | 1.000 | 0.054 |
조사항목명 | 0.093 | 0.024 | 0.054 | 1.000 |
검사년도 | 검체번호 | 고유번호 | 구명 | 조사결과값 | 조사항목명 | 허가신고번호 | |
---|---|---|---|---|---|---|---|
15442 | 2010 | 2W10-06670-005 | 286 | 강동구 | 불검출 | 1,2-디브로모-3-클로로프로판 | 1190100007 |
37789 | 2011 | 2W11-07033-005 | 2259 | 용산구 | 불검출 | 비소 | 2200600022 |
29331 | 2008 | 2W08-21171-002 | 1460 | 마포구 | - | 크실렌 | 142120009 |
48127 | 2009 | 2W09-10650-003 | 2330 | 은평구 | - | 시안 | 120520003 |
71993 | 2008 | 2W08-18635-001 | 1602 | 서초구 | - | 디클로로메탄 | 1190100004 |
87345 | 2011 | 2W11-09029-002 | 2207 | 영등포구 | - | 분원성대장균군 | 2200600001 |
18481 | 2010 | 2W10-02562-001 | 266 | 강동구 | 0 | 대장균군수 | 2200100097 |
1874 | 2009 | 2W09-03445-006 | 188 | 강동구 | 불검출 | 테트라클로로에틸렌 | 2200600037 |
67407 | 2011 | 2W11-04093-001 | 1857 | 성동구 | - | 망간 | 2200600009 |
81867 | 2010 | 2W10-03413-005 | 1131 | 도봉구 | 불검출 | 세제 | 2199400018 |
검사년도 | 검체번호 | 고유번호 | 구명 | 조사결과값 | 조사항목명 | 허가신고번호 | |
---|---|---|---|---|---|---|---|
71778 | 2009 | 2W09-04089-009 | 1886 | 성북구 | - | 맛 | 2199700033 |
83814 | 2009 | 2W09-04705-003 | 1659 | 서초구 | - | 사염화탄소 | 1190100238 |
79149 | 2011 | 2W11-14687-004 | 924 | 금천구 | - | 냄새 | 18-07-10729 |
55927 | 2009 | 2W09-03506-007 | 217 | 강동구 | - | 과망간산칼륨소비량 | 1190100053 |
69698 | 2008 | 2W08-12662-008 | 414 | 강서구 | 불검출 | 동 | 1199500025 |
35749 | 2008 | 2W08-15198-001 | 14 | 강남구 | 8.1 | 수소이온농도 | 23140143 |
77755 | 2012 | 2W12-04372-009 | 2591 | 중랑구 | - | 냄새 | 2200800011 |
16256 | 2011 | 2W11-05430-006 | 2363 | 은평구 | 불검출 | 페놀 | 121520011 |
24647 | 2010 | 2W10-04459-006 | 2354 | 은평구 | 불검출 | 6가크롬 | 121720010 |
75939 | 2010 | 2W10-12328-006 | 2454 | 종로구 | - | 냄새 | 10322251 |