Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 2147 |
Missing cells | 2226 |
Missing cells (%) | 14.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 123.8 KiB |
Average record size in memory | 59.1 B |
Variable types
Text | 3 |
---|---|
Numeric | 3 |
Categorical | 1 |
Dataset
Description | 공공데이터 중장기 개방계획에 따라 공개하는 경상남도 하천관리 시스템의 데이터 입니다. 하천관리시스템의 무제부지적도 정보를 포함하고있습니다. |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15093555 |
해당도면_구분코드 has constant value "" | Constant |
일련번호 is highly overall correlated with 해당도면_일련번호 and 1 other fields | High correlation |
해당도면_일련번호 is highly overall correlated with 일련번호 and 1 other fields | High correlation |
파일 갯수 is highly overall correlated with 일련번호 and 1 other fields | High correlation |
파일 설명 has 409 (19.0%) missing values | Missing |
파일 갯수 has 1817 (84.6%) missing values | Missing |
파일명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 23:46:19.116218 |
---|---|
Analysis finished | 2023-12-10 23:46:20.482739 |
Duration | 1.37 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
하천관리코드
Text
Distinct | 212 |
---|---|
Distinct (%) | 9.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.9 KiB |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Characters and Unicode
Total characters | 40793 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20140402015F02Q0101 |
---|---|
2nd row | 20140402015F02Q0101 |
3rd row | 20140402015F02Q0101 |
4th row | 20140402015F02Q0101 |
5th row | 20140402015F02Q0101 |
Value | Count | Frequency (%) |
20268002012f02q0101 | 81 | 3.8% |
20272002012f02q0101 | 54 | 2.5% |
20272002012f02q0102 | 54 | 2.5% |
20227802010f01q0101 | 41 | 1.9% |
20243402014f02q0101 | 38 | 1.8% |
27209902014f02q0101 | 38 | 1.8% |
20249602010f02q0101 | 33 | 1.5% |
40226502008f01q0101 | 33 | 1.5% |
20140402015f02q0101 | 32 | 1.5% |
20142102012f02q0101 | 27 | 1.3% |
Other values (202) | 1716 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 14132 | |
2 | 8677 | |
1 | 7734 | |
F | 2147 | 5.3% |
Q | 2147 | 5.3% |
7 | 1269 | 3.1% |
4 | 1141 | 2.8% |
6 | 981 | 2.4% |
5 | 783 | 1.9% |
8 | 691 | 1.7% |
Other values (2) | 1091 | 2.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 36499 | |
Uppercase Letter | 4294 | 10.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 14132 | |
2 | 8677 | |
1 | 7734 | |
7 | 1269 | 3.5% |
4 | 1141 | 3.1% |
6 | 981 | 2.7% |
5 | 783 | 2.1% |
8 | 691 | 1.9% |
3 | 653 | 1.8% |
9 | 438 | 1.2% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 2147 | |
Q | 2147 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 36499 | |
Latin | 4294 | 10.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 14132 | |
2 | 8677 | |
1 | 7734 | |
7 | 1269 | 3.5% |
4 | 1141 | 3.1% |
6 | 981 | 2.7% |
5 | 783 | 2.1% |
8 | 691 | 1.9% |
3 | 653 | 1.8% |
9 | 438 | 1.2% |
Latin
Value | Count | Frequency (%) |
F | 2147 | |
Q | 2147 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40793 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 14132 | |
2 | 8677 | |
1 | 7734 | |
F | 2147 | 5.3% |
Q | 2147 | 5.3% |
7 | 1269 | 3.1% |
4 | 1141 | 2.8% |
6 | 981 | 2.4% |
5 | 783 | 1.9% |
8 | 691 | 1.7% |
Other values (2) | 1091 | 2.7% |
일련번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 81 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.030275 |
Minimum | 1 |
---|---|
Maximum | 81 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 19.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 12 |
95-th percentile | 33.7 |
Maximum | 81 |
Range | 80 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 11.589215 |
---|---|
Coefficient of variation (CV) | 1.1554234 |
Kurtosis | 9.0368662 |
Mean | 10.030275 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 2.6924226 |
Sum | 21535 |
Variance | 134.30989 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 212 | 9.9% |
1 | 211 | 9.8% |
3 | 209 | 9.7% |
4 | 194 | 9.0% |
5 | 171 | 8.0% |
6 | 141 | 6.6% |
7 | 116 | 5.4% |
8 | 102 | 4.8% |
9 | 77 | 3.6% |
10 | 71 | 3.3% |
Other values (71) | 643 |
Value | Count | Frequency (%) |
1 | 211 | |
2 | 212 | |
3 | 209 | |
4 | 194 | |
5 | 171 | |
6 | 141 | |
7 | 116 | |
8 | 102 | |
9 | 77 | 3.6% |
10 | 71 | 3.3% |
Value | Count | Frequency (%) |
81 | 1 | |
80 | 1 | |
79 | 1 | |
78 | 1 | |
77 | 1 | |
76 | 1 | |
75 | 1 | |
74 | 1 | |
73 | 1 | |
72 | 1 |
파일명
Text
UNIQUE
 
Distinct | 2147 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.9 KiB |
Length
Max length | 26 |
---|---|
Median length | 26 |
Mean length | 26 |
Min length | 26 |
Characters and Unicode
Total characters | 55822 |
---|---|
Distinct characters | 13 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2147 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 20140402015F02Q0101I060001 |
---|---|
2nd row | 20140402015F02Q0101I060002 |
3rd row | 20140402015F02Q0101I060003 |
4th row | 20140402015F02Q0101I060004 |
5th row | 20140402015F02Q0101I060005 |
Value | Count | Frequency (%) |
20140402015f02q0101i060001 | 1 | < 0.1% |
20272002012f02q0101i060032 | 1 | < 0.1% |
20272002012f02q0101i060046 | 1 | < 0.1% |
20272002012f02q0101i060045 | 1 | < 0.1% |
20272002012f02q0101i060044 | 1 | < 0.1% |
20272002012f02q0101i060043 | 1 | < 0.1% |
20272002012f02q0101i060042 | 1 | < 0.1% |
20272002012f02q0101i060041 | 1 | < 0.1% |
20272002012f02q0101i060040 | 1 | < 0.1% |
20272002012f02q0101i060039 | 1 | < 0.1% |
Other values (2137) | 2137 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 22116 | |
2 | 9114 | |
1 | 8481 | 15.2% |
6 | 3338 | 6.0% |
F | 2147 | 3.8% |
Q | 2147 | 3.8% |
I | 2147 | 3.8% |
7 | 1447 | 2.6% |
4 | 1437 | 2.6% |
5 | 1039 | 1.9% |
Other values (3) | 2409 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 49381 | |
Uppercase Letter | 6441 | 11.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 22116 | |
2 | 9114 | |
1 | 8481 | 17.2% |
6 | 3338 | 6.8% |
7 | 1447 | 2.9% |
4 | 1437 | 2.9% |
5 | 1039 | 2.1% |
3 | 1013 | 2.1% |
8 | 840 | 1.7% |
9 | 556 | 1.1% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 2147 | |
Q | 2147 | |
I | 2147 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 49381 | |
Latin | 6441 | 11.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 22116 | |
2 | 9114 | |
1 | 8481 | 17.2% |
6 | 3338 | 6.8% |
7 | 1447 | 2.9% |
4 | 1437 | 2.9% |
5 | 1039 | 2.1% |
3 | 1013 | 2.1% |
8 | 840 | 1.7% |
9 | 556 | 1.1% |
Latin
Value | Count | Frequency (%) |
F | 2147 | |
Q | 2147 | |
I | 2147 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 55822 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 22116 | |
2 | 9114 | |
1 | 8481 | 15.2% |
6 | 3338 | 6.0% |
F | 2147 | 3.8% |
Q | 2147 | 3.8% |
I | 2147 | 3.8% |
7 | 1447 | 2.6% |
4 | 1437 | 2.6% |
5 | 1039 | 1.9% |
Other values (3) | 2409 | 4.3% |
해당도면_구분코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.9 KiB |
I06 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | I06 |
---|---|
2nd row | I06 |
3rd row | I06 |
4th row | I06 |
5th row | I06 |
Common Values
Value | Count | Frequency (%) |
I06 | 2147 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
i06 | 2147 |
해당도면_일련번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 81 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.030275 |
Minimum | 1 |
---|---|
Maximum | 81 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 19.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 12 |
95-th percentile | 33.7 |
Maximum | 81 |
Range | 80 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 11.589215 |
---|---|
Coefficient of variation (CV) | 1.1554234 |
Kurtosis | 9.0368662 |
Mean | 10.030275 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 2.6924226 |
Sum | 21535 |
Variance | 134.30989 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 212 | 9.9% |
1 | 211 | 9.8% |
3 | 209 | 9.7% |
4 | 194 | 9.0% |
5 | 171 | 8.0% |
6 | 141 | 6.6% |
7 | 116 | 5.4% |
8 | 102 | 4.8% |
9 | 77 | 3.6% |
10 | 71 | 3.3% |
Other values (71) | 643 |
Value | Count | Frequency (%) |
1 | 211 | |
2 | 212 | |
3 | 209 | |
4 | 194 | |
5 | 171 | |
6 | 141 | |
7 | 116 | |
8 | 102 | |
9 | 77 | 3.6% |
10 | 71 | 3.3% |
Value | Count | Frequency (%) |
81 | 1 | |
80 | 1 | |
79 | 1 | |
78 | 1 | |
77 | 1 | |
76 | 1 | |
75 | 1 | |
74 | 1 | |
73 | 1 | |
72 | 1 |
파일 설명
Text
MISSING
 
Distinct | 257 |
---|---|
Distinct (%) | 14.8% |
Missing | 409 |
Missing (%) | 19.0% |
Memory size | 16.9 KiB |
Value | Count | Frequency (%) |
0001 | 103 | 5.9% |
01 | 103 | 5.9% |
0002 | 91 | 5.2% |
0003 | 89 | 5.1% |
0004 | 86 | 4.9% |
0005 | 76 | 4.4% |
0006 | 71 | 4.1% |
0007 | 59 | 3.4% |
0008 | 54 | 3.1% |
0009 | 38 | 2.2% |
Other values (247) | 968 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3203 | |
1 | 659 | 11.6% |
2 | 323 | 5.7% |
3 | 268 | 4.7% |
4 | 228 | 4.0% |
5 | 188 | 3.3% |
6 | 153 | 2.7% |
7 | 128 | 2.3% |
8 | 109 | 1.9% |
R | 99 | 1.7% |
Other values (24) | 316 | 5.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5343 | |
Uppercase Letter | 212 | 3.7% |
Other Letter | 119 | 2.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
현 | 15 | |
천 | 11 | |
신 | 11 | |
계 | 11 | |
도 | 9 | |
적 | 9 | |
지 | 9 | |
당 | 9 | |
동 | 9 | |
상 | 7 | 5.9% |
Other values (4) | 19 |
Decimal Number
Value | Count | Frequency (%) |
0 | 3203 | |
1 | 659 | 12.3% |
2 | 323 | 6.0% |
3 | 268 | 5.0% |
4 | 228 | 4.3% |
5 | 188 | 3.5% |
6 | 153 | 2.9% |
7 | 128 | 2.4% |
8 | 109 | 2.0% |
9 | 84 | 1.6% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 99 | |
H | 28 | 13.2% |
D | 20 | 9.4% |
C | 17 | 8.0% |
J | 10 | 4.7% |
B | 10 | 4.7% |
S | 9 | 4.2% |
M | 7 | 3.3% |
G | 7 | 3.3% |
Y | 5 | 2.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 5343 | |
Latin | 212 | 3.7% |
Hangul | 119 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
현 | 15 | |
천 | 11 | |
신 | 11 | |
계 | 11 | |
도 | 9 | |
적 | 9 | |
지 | 9 | |
당 | 9 | |
동 | 9 | |
상 | 7 | 5.9% |
Other values (4) | 19 |
Common
Value | Count | Frequency (%) |
0 | 3203 | |
1 | 659 | 12.3% |
2 | 323 | 6.0% |
3 | 268 | 5.0% |
4 | 228 | 4.3% |
5 | 188 | 3.5% |
6 | 153 | 2.9% |
7 | 128 | 2.4% |
8 | 109 | 2.0% |
9 | 84 | 1.6% |
Latin
Value | Count | Frequency (%) |
R | 99 | |
H | 28 | 13.2% |
D | 20 | 9.4% |
C | 17 | 8.0% |
J | 10 | 4.7% |
B | 10 | 4.7% |
S | 9 | 4.2% |
M | 7 | 3.3% |
G | 7 | 3.3% |
Y | 5 | 2.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 5555 | |
Hangul | 119 | 2.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3203 | |
1 | 659 | 11.9% |
2 | 323 | 5.8% |
3 | 268 | 4.8% |
4 | 228 | 4.1% |
5 | 188 | 3.4% |
6 | 153 | 2.8% |
7 | 128 | 2.3% |
8 | 109 | 2.0% |
R | 99 | 1.8% |
Other values (10) | 197 | 3.5% |
Hangul
Value | Count | Frequency (%) |
현 | 15 | |
천 | 11 | |
신 | 11 | |
계 | 11 | |
도 | 9 | |
적 | 9 | |
지 | 9 | |
당 | 9 | |
동 | 9 | |
상 | 7 | 5.9% |
Other values (4) | 19 |
파일 갯수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 14 |
---|---|
Distinct (%) | 4.2% |
Missing | 1817 |
Missing (%) | 84.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.115152 |
Minimum | 1 |
---|---|
Maximum | 20 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 19.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 5 |
median | 8 |
Q3 | 17 |
95-th percentile | 20 |
Maximum | 20 |
Range | 19 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 5.7112975 |
---|---|
Coefficient of variation (CV) | 0.56462797 |
Kurtosis | -1.3338807 |
Mean | 10.115152 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 0.30023305 |
Sum | 3338 |
Variance | 32.618919 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17 | 51 | 2.4% |
5 | 40 | 1.9% |
7 | 35 | 1.6% |
6 | 30 | 1.4% |
13 | 26 | 1.2% |
8 | 24 | 1.1% |
4 | 20 | 0.9% |
20 | 20 | 0.9% |
18 | 18 | 0.8% |
3 | 18 | 0.8% |
Other values (4) | 48 | 2.2% |
(Missing) | 1817 |
Value | Count | Frequency (%) |
1 | 12 | 0.6% |
3 | 18 | |
4 | 20 | |
5 | 40 | |
6 | 30 | |
7 | 35 | |
8 | 24 | |
10 | 10 | 0.5% |
11 | 11 | 0.5% |
13 | 26 |
Value | Count | Frequency (%) |
20 | 20 | 0.9% |
18 | 18 | 0.8% |
17 | 51 | |
15 | 15 | 0.7% |
13 | 26 | |
11 | 11 | 0.5% |
10 | 10 | 0.5% |
8 | 24 | |
7 | 35 | |
6 | 30 |
일련번호 | 해당도면_일련번호 | 파일 갯수 | |
---|---|---|---|
일련번호 | 1.000 | 1.000 | 0.588 |
해당도면_일련번호 | 1.000 | 1.000 | 0.588 |
파일 갯수 | 0.588 | 0.588 | 1.000 |
일련번호 | 해당도면_일련번호 | 파일 갯수 | |
---|---|---|---|
일련번호 | 1.000 | 1.000 | 0.577 |
해당도면_일련번호 | 1.000 | 1.000 | 0.577 |
파일 갯수 | 0.577 | 0.577 | 1.000 |
하천관리코드 | 일련번호 | 파일명 | 해당도면_구분코드 | 해당도면_일련번호 | 파일 설명 | 파일 갯수 | |
---|---|---|---|---|---|---|---|
0 | 20140402015F02Q0101 | 1 | 20140402015F02Q0101I060001 | I06 | 1 | <NA> | <NA> |
1 | 20140402015F02Q0101 | 2 | 20140402015F02Q0101I060002 | I06 | 2 | <NA> | <NA> |
2 | 20140402015F02Q0101 | 3 | 20140402015F02Q0101I060003 | I06 | 3 | <NA> | <NA> |
3 | 20140402015F02Q0101 | 4 | 20140402015F02Q0101I060004 | I06 | 4 | <NA> | <NA> |
4 | 20140402015F02Q0101 | 5 | 20140402015F02Q0101I060005 | I06 | 5 | <NA> | <NA> |
5 | 20140402015F02Q0101 | 6 | 20140402015F02Q0101I060006 | I06 | 6 | <NA> | <NA> |
6 | 20140402015F02Q0101 | 7 | 20140402015F02Q0101I060007 | I06 | 7 | <NA> | <NA> |
7 | 20140402015F02Q0101 | 8 | 20140402015F02Q0101I060008 | I06 | 8 | <NA> | <NA> |
8 | 20140402015F02Q0101 | 9 | 20140402015F02Q0101I060009 | I06 | 9 | <NA> | <NA> |
9 | 20140402015F02Q0101 | 10 | 20140402015F02Q0101I060010 | I06 | 10 | <NA> | <NA> |
하천관리코드 | 일련번호 | 파일명 | 해당도면_구분코드 | 해당도면_일련번호 | 파일 설명 | 파일 갯수 | |
---|---|---|---|---|---|---|---|
2137 | 40227902007F01Q0101 | 4 | 40227902007F01Q0101I060004 | I06 | 4 | 0004 | <NA> |
2138 | 40227902007F01Q0101 | 5 | 40227902007F01Q0101I060005 | I06 | 5 | 0005 | <NA> |
2139 | 40227902007F01Q0101 | 6 | 40227902007F01Q0101I060006 | I06 | 6 | 0006 | <NA> |
2140 | 40227902007F01Q0101 | 7 | 40227902007F01Q0101I060007 | I06 | 7 | 0007 | <NA> |
2141 | 40227902007F01Q0101 | 8 | 40227902007F01Q0101I060008 | I06 | 8 | 0008 | <NA> |
2142 | 40227902007F01Q0101 | 9 | 40227902007F01Q0101I060009 | I06 | 9 | 0009 | <NA> |
2143 | 40227902007F01Q0101 | 10 | 40227902007F01Q0101I060010 | I06 | 10 | 0010 | <NA> |
2144 | 40227902007F01Q0101 | 11 | 40227902007F01Q0101I060011 | I06 | 11 | 0011 | <NA> |
2145 | 40227902007F01Q0101 | 12 | 40227902007F01Q0101I060012 | I06 | 12 | 0012 | <NA> |
2146 | 40227902007F01Q0101 | 13 | 40227902007F01Q0101I060013 | I06 | 13 | 0013 | <NA> |