Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 21 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 867.0 B |
Average record size in memory | 41.3 B |
Variable types
Text | 1 |
---|---|
Numeric | 3 |
Dataset
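Description | This file summarizes the test items and fees for river water and lake water posted on the website of the Jeollanam-do Institute of Health and Environment (전라남도 보건환경연구원). |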
---|---|
Author | Jeollanam-do (전라남도) |
URL | https://www.data.go.kr/data/15041960/fileData.do |
Reproduction
Analysis started | 2023-12-12 16:43:24.487346 |
---|---|
Analysis finished | 2023-12-12 16:43:25.926946 |
Duration | 1.44 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
검사항목 (test item)
Text
UNIQUE
 
Distinct | 21 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 300.0 B |
Length
Max length | 47 |
---|---|
Median length | 13 |
Mean length | 10.142857 |
Min length | 3 |
Characters and Unicode
Total characters | 213 |
---|---|
Distinct characters | 96 |
Distinct categories | 10 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 21 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 수소이온농도(pH) |
---|---|
2nd row | 화학적산소요구량(COD) |
3rd row | 생물화학적산소요구량(BOD) |
4th row | 부유물질량(SS) |
5th row | 용존산소량(DO) |
Value | Count | Frequency (%) |
수소이온농도(ph | 1 | 4.2% |
화학적산소요구량(cod | 1 | 4.2% |
클로로필a | 1 | 4.2% |
pce | 1 | 4.2% |
1,2디클로로에탄,디클로로메탄 | 1 | 4.2% |
클로로포름 | 1 | 4.2% |
휘발성저급탄화수소류(사염화탄소 | 1 | 4.2% |
음이온계면활성제(abs | 1 | 4.2% |
폴리크로리네이티드비페닐(pcb | 1 | 4.2% |
6가크롬(cr6 | 1 | 4.2% |
Other values (14) | 14 | 58.3% |
Most occurring characters
Value | Count | Frequency (%) |
( | 16 | 7.5% |
) | 16 | 7.5% |
로 | 9 | 4.2% |
소 | 8 | 3.8% |
C | 6 | 2.8% |
, | 5 | 2.3% |
클 | 4 | 1.9% |
P | 4 | 1.9% |
화 | 4 | 1.9% |
탄 | 4 | 1.9% |
Other values (86) | 137 | 64.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 128 | 60.1% |
Uppercase Letter | 31 | 14.6% |
Open Punctuation | 16 | 7.5% |
Close Punctuation | 16 | 7.5% |
Lowercase Letter | 7 | 3.3% |
Other Punctuation | 5 | 2.3% |
Decimal Number | 4 | 1.9% |
Space Separator | 3 | 1.4% |
Dash Punctuation | 2 | 0.9% |
Math Symbol | 1 | 0.5% |
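The category names in the table above correspond to Unicode general categories (for example "Other Letter" is `Lo`, "Uppercase Letter" is `Lu`, "Open Punctuation" is `Ps`). As a minimal sketch of how such counts can be obtained, the snippet below classifies the five sample values shown earlier in this report using Python's standard `unicodedata` module. It covers only those five strings, so its totals are not expected to match the full-column counts, and `category_counts` is a hypothetical helper name, not part of the profiler.

```python
import unicodedata
from collections import Counter

# The five sample values from the "Sample" table of this report.
samples = [
    "수소이온농도(pH)",
    "화학적산소요구량(COD)",
    "생물화학적산소요구량(BOD)",
    "부유물질량(SS)",
    "용존산소량(DO)",
]


def category_counts(text: str) -> Counter:
    """Count characters by two-letter Unicode general category (e.g. 'Lo', 'Lu')."""
    return Counter(unicodedata.category(ch) for ch in text)


# Counts for the first sample only, then summed over all five samples.
ph_counts = category_counts(samples[0])
total = sum((category_counts(s) for s in samples), Counter())

print(ph_counts)
print(total)
```

Hangul syllables fall under `Lo`, which is why "Other Letter" dominates the table.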
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 9 | 7.0% |
소 | 8 | 6.2% |
클 | 4 | 3.1% |
화 | 4 | 3.1% |
탄 | 4 | 3.1% |
량 | 4 | 3.1% |
성 | 3 | 2.3% |
총 | 3 | 2.3% |
수 | 3 | 2.3% |
산 | 3 | 2.3% |
Other values (59) | 83 | 64.8% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 6 | 19.4% |
P | 4 | 12.9% |
S | 3 | 9.7% |
B | 3 | 9.7% |
D | 3 | 9.7% |
O | 3 | 9.7% |
A | 2 | 6.5% |
N | 2 | 6.5% |
T | 2 | 6.5% |
H | 2 | 6.5% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 1 | 14.3% |
a | 1 | 14.3% |
d | 1 | 14.3% |
g | 1 | 14.3% |
b | 1 | 14.3% |
r | 1 | 14.3% |
s | 1 | 14.3% |
Decimal Number
Value | Count | Frequency (%) |
6 | 2 | 50.0% |
1 | 1 | 25.0% |
2 | 1 | 25.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 16 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 16 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 5 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 128 | 60.1% |
Common | 47 | 22.1% |
Latin | 38 | 17.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 9 | 7.0% |
소 | 8 | 6.2% |
클 | 4 | 3.1% |
화 | 4 | 3.1% |
탄 | 4 | 3.1% |
량 | 4 | 3.1% |
성 | 3 | 2.3% |
총 | 3 | 2.3% |
수 | 3 | 2.3% |
산 | 3 | 2.3% |
Other values (59) | 83 | 64.8% |
Latin
Value | Count | Frequency (%) |
C | 6 | 15.8% |
P | 4 | 10.5% |
S | 3 | 7.9% |
B | 3 | 7.9% |
D | 3 | 7.9% |
O | 3 | 7.9% |
A | 2 | 5.3% |
N | 2 | 5.3% |
T | 2 | 5.3% |
H | 2 | 5.3% |
Other values (8) | 8 | 21.1% |
Common
Value | Count | Frequency (%) |
( | 16 | 34.0% |
) | 16 | 34.0% |
, | 5 | 10.6% |
(space) | 3 | 6.4% |
6 | 2 | 4.3% |
- | 2 | 4.3% |
1 | 1 | 2.1% |
2 | 1 | 2.1% |
+ | 1 | 2.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 128 | 60.1% |
ASCII | 85 | 39.9% |
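The two blocks reported above can be told apart by code point range alone. The following is a rough sketch under that assumption: `block_of` is a hypothetical helper that recognizes only the ASCII range and the Hangul Syllables block (U+AC00 to U+D7AF), and labels everything else "Other". It is applied to one value from the sample rows, not to the full column.

```python
from collections import Counter


def block_of(ch: str) -> str:
    """Classify one character by code point range (only the blocks seen in this report)."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"   # Basic Latin, U+0000..U+007F
    if 0xAC00 <= cp <= 0xD7AF:
        return "Hangul"  # Hangul Syllables block
    return "Other"


value = "6가크롬(Cr6+)"  # row 15 of the sample rows in this report
blocks = Counter(block_of(ch) for ch in value)

print(blocks)
```

Digits, parentheses and the plus sign all count as ASCII here, which is why the ASCII share is larger than the Latin script share in the tables above.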
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 16 | 18.8% |
) | 16 | 18.8% |
C | 6 | 7.1% |
, | 5 | 5.9% |
P | 4 | 4.7% |
S | 3 | 3.5% |
B | 3 | 3.5% |
D | 3 | 3.5% |
O | 3 | 3.5% |
(space) | 3 | 3.5% |
Other values (17) | 23 | 27.1% |
Hangul
Value | Count | Frequency (%) |
로 | 9 | 7.0% |
소 | 8 | 6.2% |
클 | 4 | 3.1% |
화 | 4 | 3.1% |
탄 | 4 | 3.1% |
량 | 4 | 3.1% |
성 | 3 | 2.3% |
총 | 3 | 2.3% |
수 | 3 | 2.3% |
산 | 3 | 2.3% |
Other values (59) | 83 | 64.8% |
수수료(원) (fee, KRW)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 85.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13785.714 |
Minimum | 800 |
---|---|
Maximum | 125200 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 321.0 B |
Quantile statistics
Minimum | 800 |
---|---|
5-th percentile | 1300 |
Q1 | 3400 |
median | 6900 |
Q3 | 13200 |
95-th percentile | 20300 |
Maximum | 125200 |
Range | 124400 |
Interquartile range (IQR) | 9800 |
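The quantile rows above can be recomputed from the 21 fee values that appear in this report's value tables. This is a minimal sketch, assuming linear interpolation between order statistics (the default in pandas and NumPy, which may or may not be exactly what the profiler used). `percentile` is a hypothetical helper written for this example.

```python
import math

# Fee values (KRW) collected from the frequency and extreme-value tables of this report.
fees = sorted([
    800, 1300, 2800, 2800, 3000, 3400, 3700, 5800, 6900, 6900, 6900,
    7300, 10600, 11400, 13100, 13200, 13900, 14800, 15400, 20300, 125200,
])


def percentile(sorted_values, pct):
    """Linearly interpolated percentile for pct in [0, 100]; input must be sorted."""
    pos = pct * (len(sorted_values) - 1) / 100
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] * (1 - frac) + sorted_values[hi] * frac


p5, q1, median, q3, p95 = (percentile(fees, p) for p in (5, 25, 50, 75, 95))
iqr = q3 - q1
value_range = fees[-1] - fees[0]

print(p5, q1, median, q3, p95, iqr, value_range)
```

With 21 observations every requested percentile lands exactly on an order statistic, so no interpolation actually takes place for this column.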
Descriptive statistics
Standard deviation | 26091.92 |
---|---|
Coefficient of variation (CV) | 1.8926781 |
Kurtosis | 18.966082 |
Mean | 13785.714 |
Median Absolute Deviation (MAD) | 4100 |
Skewness | 4.2662449 |
Sum | 289500 |
Variance | 6.8078829 × 10⁸ |
Monotonicity | Not monotonic |
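The descriptive statistics above can be cross-checked with Python's standard `statistics` module. The sketch below assumes the sample (n - 1) definitions of variance and standard deviation and takes the 21 fee values from this report's value tables. Skewness and kurtosis are left out because their bias corrections depend on the library.

```python
import statistics

# Fee values (KRW) collected from the frequency and extreme-value tables of this report.
fees = [
    800, 1300, 2800, 2800, 3000, 3400, 3700, 5800, 6900, 6900, 6900,
    7300, 10600, 11400, 13100, 13200, 13900, 14800, 15400, 20300, 125200,
]

total = sum(fees)
mean = statistics.fmean(fees)
variance = statistics.variance(fees)  # sample variance, divides by n - 1
std = statistics.stdev(fees)          # square root of the sample variance
cv = std / mean                       # coefficient of variation
median = statistics.median(fees)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(x - median) for x in fees)

print(total, round(mean, 3), round(std, 2), round(cv, 4), mad)
```

The single 125200 fee drives most of the spread: the standard deviation is almost twice the mean, while the MAD stays at 4100.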
Common values
Value | Count | Frequency (%) |
6900 | 3 | 14.3% |
2800 | 2 | 9.5% |
800 | 1 | 4.8% |
10600 | 1 | 4.8% |
3000 | 1 | 4.8% |
1300 | 1 | 4.8% |
15400 | 1 | 4.8% |
13200 | 1 | 4.8% |
125200 | 1 | 4.8% |
20300 | 1 | 4.8% |
Other values (8) | 8 | 38.1% |
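A frequency table like the one above is a value count divided by the number of observations. A minimal sketch using `collections.Counter`, with the fee values taken from this report's tables:

```python
from collections import Counter

# Fee values (KRW) collected from the frequency and extreme-value tables of this report.
fees = [
    800, 1300, 2800, 2800, 3000, 3400, 3700, 5800, 6900, 6900, 6900,
    7300, 10600, 11400, 13100, 13200, 13900, 14800, 15400, 20300, 125200,
]

counts = Counter(fees)
n = len(fees)
distinct = len(counts)
distinct_pct = 100 * distinct / n

# Only the two repeated values have an unambiguous order; the rest are ties at count 1.
top_two = counts.most_common(2)
for value, count in top_two:
    print(f"{value} | {count} | {100 * count / n:.1f}% |")
print(f"Distinct | {distinct} | {distinct_pct:.1f}% |")
```

Values that occur once each account for 4.8% (1 of 21), which is the figure repeated throughout the tables.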
Minimum 10 values
Value | Count | Frequency (%) |
800 | 1 | 4.8% |
1300 | 1 | 4.8% |
2800 | 2 | 9.5% |
3000 | 1 | 4.8% |
3400 | 1 | 4.8% |
3700 | 1 | 4.8% |
5800 | 1 | 4.8% |
6900 | 3 | 14.3% |
7300 | 1 | 4.8% |
10600 | 1 | 4.8% |
Maximum 10 values
Value | Count | Frequency (%) |
125200 | 1 | 4.8% |
20300 | 1 | 4.8% |
15400 | 1 | 4.8% |
14800 | 1 | 4.8% |
13900 | 1 | 4.8% |
13200 | 1 | 4.8% |
13100 | 1 | 4.8% |
11400 | 1 | 4.8% |
10600 | 1 | 4.8% |
7300 | 1 | 4.8% |
하천수 (river water)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 28.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 785.71429 |
Minimum | 0 |
---|---|
Maximum | 5800 |
Zeros | 16 |
Zeros (%) | 76.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 321.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3700 |
Maximum | 5800 |
Range | 5800 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1649.3289 |
---|---|
Coefficient of variation (CV) | 2.0991458 |
Kurtosis | 3.5521224 |
Mean | 785.71429 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.0829061 |
Sum | 16500 |
Variance | 2720285.7 |
Monotonicity | Not monotonic |
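The ZEROS alert and the flat quartiles above follow directly from the share of zeros: once more than 75% of the values are zero, Q1, the median and Q3 all collapse to 0. A small sketch of that reasoning follows. Rows 0 to 9 are copied from the sample rows in this report, and the remaining 11 rows are assumed to be zero because the report counts 16 zeros and all five non-zero values already appear in rows 0 to 9.

```python
import statistics

# 하천수 column: rows 0-9 from the sample rows, remaining 11 rows assumed to be zero.
river = [800, 0, 5800, 2800, 0, 0, 0, 3700, 3400, 0] + [0] * 11

n = len(river)
zeros = river.count(0)
zero_pct = 100 * zeros / n
distinct = len(set(river))

# "inclusive" interpolates linearly between the observed minimum and maximum.
q1, q2, q3 = statistics.quantiles(river, n=4, method="inclusive")
iqr = q3 - q1

print(f"zeros: {zeros} ({zero_pct:.1f}%), distinct: {distinct}, Q3: {q3}, IQR: {iqr}")
```

Because the IQR and MAD are both 0 here, the mean and standard deviation describe only the five non-zero rows spread over 21 observations.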
Common values
Value | Count | Frequency (%) |
0 | 16 | 76.2% |
800 | 1 | 4.8% |
5800 | 1 | 4.8% |
2800 | 1 | 4.8% |
3700 | 1 | 4.8% |
3400 | 1 | 4.8% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 16 | 76.2% |
800 | 1 | 4.8% |
2800 | 1 | 4.8% |
3400 | 1 | 4.8% |
3700 | 1 | 4.8% |
5800 | 1 | 4.8% |
Maximum 10 values
Value | Count | Frequency (%) |
5800 | 1 | 4.8% |
3700 | 1 | 4.8% |
3400 | 1 | 4.8% |
2800 | 1 | 4.8% |
800 | 1 | 4.8% |
0 | 16 | 76.2% |
호소수 (lake water)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 28.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 857.14286 |
Minimum | 0 |
---|---|
Maximum | 7300 |
Zeros | 16 |
Zeros (%) | 76.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 321.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3700 |
Maximum | 7300 |
Range | 7300 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1891.9755 |
---|---|
Coefficient of variation (CV) | 2.2073048 |
Kurtosis | 6.2597453 |
Mean | 857.14286 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.4816238 |
Sum | 18000 |
Variance | 3579571.4 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 16 | 76.2% |
800 | 1 | 4.8% |
7300 | 1 | 4.8% |
2800 | 1 | 4.8% |
3700 | 1 | 4.8% |
3400 | 1 | 4.8% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 16 | 76.2% |
800 | 1 | 4.8% |
2800 | 1 | 4.8% |
3400 | 1 | 4.8% |
3700 | 1 | 4.8% |
7300 | 1 | 4.8% |
Maximum 10 values
Value | Count | Frequency (%) |
7300 | 1 | 4.8% |
3700 | 1 | 4.8% |
3400 | 1 | 4.8% |
2800 | 1 | 4.8% |
800 | 1 | 4.8% |
0 | 16 | 76.2% |
 | 검사항목 | 수수료(원) | 하천수 | 호소수 |
---|---|---|---|---|
검사항목 | 1.000 | 1.000 | 1.000 | 1.000 |
수수료(원) | 1.000 | 1.000 | 0.000 | 0.000 |
하천수 | 1.000 | 0.000 | 1.000 | 0.991 |
호소수 | 1.000 | 0.000 | 0.991 | 1.000 |
 | 수수료(원) | 하천수 | 호소수 |
---|---|---|---|
수수료(원) | 1.000 | -0.509 | -0.422 |
하천수 | -0.509 | 1.000 | 0.637 |
호소수 | -0.422 | 0.637 | 1.000 |
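The report does not say which coefficient the three-variable matrix above uses. Its values are consistent with Spearman rank correlation (Pearson correlation computed on tie-averaged ranks), which the sketch below computes with the standard library only. Rows 0 to 9 and 11 to 20 are copied from the sample rows in this report. Row 10 is not shown there, so its fee (13900) and its two zeros are inferred from the extreme-value tables and the zero counts and should be read as an assumption. `average_ranks`, `pearson` and `spearman` are hypothetical helper names.

```python
from math import sqrt

# Row-aligned columns; index 10 is inferred, not read from the sample rows.
fee = [800, 7300, 5800, 2800, 2800, 14800, 11400, 3700, 3400, 6900, 13900,
       13100, 10600, 20300, 6900, 6900, 125200, 13200, 15400, 1300, 3000]
river = [800, 0, 5800, 2800, 0, 0, 0, 3700, 3400, 0] + [0] * 11
lake = [800, 7300, 0, 2800, 0, 0, 0, 3700, 3400, 0] + [0] * 11


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


rho_fee_river = spearman(fee, river)
rho_fee_lake = spearman(fee, lake)
rho_river_lake = spearman(river, lake)
print(round(rho_fee_river, 3), round(rho_fee_lake, 3), round(rho_river_lake, 3))
```

Under these assumptions the three coefficients round to -0.509, -0.422 and 0.637, matching the matrix. The negative sign reflects that the items priced separately for river and lake water are the cheaper tests.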
First rows
 | 검사항목 | 수수료(원) | 하천수 | 호소수 |
---|---|---|---|---|
0 | 수소이온농도(pH) | 800 | 800 | 800 |
1 | 화학적산소요구량(COD) | 7300 | 0 | 7300 |
2 | 생물화학적산소요구량(BOD) | 5800 | 5800 | 0 |
3 | 부유물질량(SS) | 2800 | 2800 | 2800 |
4 | 용존산소량(DO) | 2800 | 0 | 0 |
5 | 총대장균군 | 14800 | 0 | 0 |
6 | 분원성대장균군 | 11400 | 0 | 0 |
7 | 총질소(T-N) | 3700 | 3700 | 3700 |
8 | 총인(T-P) | 3400 | 3400 | 3400 |
9 | 카드뮴(Cd) | 6900 | 0 | 0 |
Last rows
 | 검사항목 | 수수료(원) | 하천수 | 호소수 |
---|---|---|---|---|
11 | 시안(CN) | 13100 | 0 | 0 |
12 | 수은(Hg) | 10600 | 0 | 0 |
13 | 유기인 | 20300 | 0 | 0 |
14 | 납(Pb) | 6900 | 0 | 0 |
15 | 6가크롬(Cr6+) | 6900 | 0 | 0 |
16 | 폴리크로리네이티드비페닐(PCB) | 125200 | 0 | 0 |
17 | 음이온계면활성제(ABS) | 13200 | 0 | 0 |
18 | 휘발성저급탄화수소류(사염화탄소, 클로로포름, 1,2디클로로에탄,디클로로메탄, PCE) | 15400 | 0 | 0 |
19 | 클로로필a | 1300 | 0 | 0 |
20 | 전기전도도 | 3000 | 0 | 0 |