Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 566.4 KiB |
Average record size in memory | 58.0 B |
Variable types
Numeric | 2 |
---|---|
Text | 3 |
Categorical | 1 |
Dataset
Description | 공공데이터 중장기 개방계획에 따라 공개하는 경상남도 하천관리 시스템의 데이터 입니다. 하천관리시스템의 부속물 정보를 포함하고있습니다. |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15093560 |
Reproduction
Analysis started | 2024-04-21 02:04:48.049302 |
---|---|
Analysis finished | 2024-04-21 02:04:50.649707 |
Duration | 2.6 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
공간아이디
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11061.285 |
Minimum | 2 |
---|---|
Maximum | 22085 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 1181.85 |
Q1 | 5594.5 |
median | 11019 |
Q3 | 16577.25 |
95-th percentile | 20974.2 |
Maximum | 22085 |
Range | 22083 |
Interquartile range (IQR) | 10982.75 |
Descriptive statistics
Standard deviation | 6353.9472 |
---|---|
Coefficient of variation (CV) | 0.57443117 |
Kurtosis | -1.1966007 |
Mean | 11061.285 |
Median Absolute Deviation (MAD) | 5503 |
Skewness | 0.0069052034 |
Sum | 1.1061286 × 108 |
Variance | 40372644 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7169 | 1 | < 0.1% |
16577 | 1 | < 0.1% |
10701 | 1 | < 0.1% |
3803 | 1 | < 0.1% |
4586 | 1 | < 0.1% |
6364 | 1 | < 0.1% |
15539 | 1 | < 0.1% |
20055 | 1 | < 0.1% |
18767 | 1 | < 0.1% |
12593 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
9 | 1 | |
14 | 1 | |
16 | 1 | |
20 | 1 | |
22 | 1 | |
23 | 1 |
Value | Count | Frequency (%) |
22085 | 1 | |
22084 | 1 | |
22081 | 1 | |
22080 | 1 | |
22079 | 1 | |
22078 | 1 | |
22077 | 1 | |
22076 | 1 | |
22073 | 1 | |
22071 | 1 |
하천관리코드
Text
Distinct | 699 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Characters and Unicode
Total characters | 190000 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 27 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 20255602014F02Q0101 |
---|---|
2nd row | 20227502012F01Q0101 |
3rd row | 27209901994F02Q0101 |
4th row | 27202401995F01Q0101 |
5th row | 20241502016F01Q0101 |
Value | Count | Frequency (%) |
20246902020f02q0101 | 170 | 1.7% |
20228802004f01q0101 | 96 | 1.0% |
20231502019f02q0101 | 86 | 0.9% |
20249602010f02q0101 | 68 | 0.7% |
20248701986f01q0101 | 65 | 0.7% |
20250301997f01q0101 | 62 | 0.6% |
20234302019f02q0101 | 61 | 0.6% |
27209902014f02q0101 | 59 | 0.6% |
20265801995f02q0101 | 59 | 0.6% |
20233402019f02q0101 | 57 | 0.6% |
Other values (689) | 9217 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 63851 | |
1 | 35464 | |
2 | 35425 | |
F | 10000 | 5.3% |
Q | 10000 | 5.3% |
9 | 8283 | 4.4% |
7 | 5550 | 2.9% |
4 | 5324 | 2.8% |
5 | 4383 | 2.3% |
3 | 4358 | 2.3% |
Other values (2) | 7362 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 170000 | |
Uppercase Letter | 20000 | 10.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 63851 | |
1 | 35464 | |
2 | 35425 | |
9 | 8283 | 4.9% |
7 | 5550 | 3.3% |
4 | 5324 | 3.1% |
5 | 4383 | 2.6% |
3 | 4358 | 2.6% |
6 | 4333 | 2.5% |
8 | 3029 | 1.8% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 10000 | |
Q | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 170000 | |
Latin | 20000 | 10.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 63851 | |
1 | 35464 | |
2 | 35425 | |
9 | 8283 | 4.9% |
7 | 5550 | 3.3% |
4 | 5324 | 3.1% |
5 | 4383 | 2.6% |
3 | 4358 | 2.6% |
6 | 4333 | 2.5% |
8 | 3029 | 1.8% |
Latin
Value | Count | Frequency (%) |
F | 10000 | |
Q | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 190000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 63851 | |
1 | 35464 | |
2 | 35425 | |
F | 10000 | 5.3% |
Q | 10000 | 5.3% |
9 | 8283 | 4.4% |
7 | 5550 | 2.9% |
4 | 5324 | 2.8% |
5 | 4383 | 2.3% |
3 | 4358 | 2.3% |
Other values (2) | 7362 | 3.9% |
구분코드
Categorical
IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
S08 | |
---|---|
S09 | |
S10 | 145 |
K03 | 144 |
S11 | 121 |
Other values (4) | 167 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | S08 |
---|---|
2nd row | S09 |
3rd row | S08 |
4th row | S08 |
5th row | S08 |
Common Values
Value | Count | Frequency (%) |
S08 | 8379 | |
S09 | 1044 | 10.4% |
S10 | 145 | 1.5% |
K03 | 144 | 1.4% |
S11 | 121 | 1.2% |
S13 | 107 | 1.1% |
K02 | 25 | 0.2% |
S12 | 22 | 0.2% |
S99 | 13 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
s08 | 8379 | |
s09 | 1044 | 10.4% |
s10 | 145 | 1.5% |
k03 | 144 | 1.4% |
s11 | 121 | 1.2% |
s13 | 107 | 1.1% |
k02 | 25 | 0.2% |
s12 | 22 | 0.2% |
s99 | 13 | 0.1% |
일련번호
Real number (ℝ)
Distinct | 684 |
---|---|
Distinct (%) | 6.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 96.596 |
Minimum | 1 |
---|---|
Maximum | 1703 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 11 |
median | 24 |
Q3 | 51 |
95-th percentile | 354.15 |
Maximum | 1703 |
Range | 1702 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 269.56999 |
---|---|
Coefficient of variation (CV) | 2.7906952 |
Kurtosis | 17.340926 |
Mean | 96.596 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 4.2417926 |
Sum | 965960 |
Variance | 72667.979 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 269 | 2.7% |
1 | 263 | 2.6% |
4 | 261 | 2.6% |
5 | 260 | 2.6% |
6 | 258 | 2.6% |
2 | 246 | 2.5% |
7 | 244 | 2.4% |
8 | 237 | 2.4% |
9 | 227 | 2.3% |
13 | 227 | 2.3% |
Other values (674) | 7508 |
Value | Count | Frequency (%) |
1 | 263 | |
2 | 246 | |
3 | 269 | |
4 | 261 | |
5 | 260 | |
6 | 258 | |
7 | 244 | |
8 | 237 | |
9 | 227 | |
10 | 224 |
Value | Count | Frequency (%) |
1703 | 1 | |
1702 | 1 | |
1701 | 1 | |
1700 | 1 | |
1699 | 1 | |
1698 | 1 | |
1697 | 1 | |
1696 | 1 | |
1695 | 1 | |
1693 | 1 |
부속물명
Text
Distinct | 7472 |
---|---|
Distinct (%) | 74.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
배수통관 | 653 | 6.3% |
배수암거 | 89 | 0.9% |
취수문 | 87 | 0.8% |
죽천 | 59 | 0.6% |
제 | 55 | 0.5% |
배수통문 | 35 | 0.3% |
계획배수통관 | 35 | 0.3% |
제3배수통관 | 33 | 0.3% |
제1배수통관 | 32 | 0.3% |
제4배수통관 | 31 | 0.3% |
Other values (7360) | 9258 |
Most occurring characters
Value | Count | Frequency (%) |
수 | 9659 | |
배 | 9030 | 12.9% |
관 | 6991 | 10.0% |
통 | 6718 | 9.6% |
제 | 3486 | 5.0% |
1 | 3346 | 4.8% |
2 | 2028 | 2.9% |
암 | 1848 | 2.6% |
거 | 1679 | 2.4% |
3 | 1443 | 2.1% |
Other values (280) | 24006 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 57312 | |
Decimal Number | 12002 | 17.1% |
Space Separator | 370 | 0.5% |
Uppercase Letter | 183 | 0.3% |
Close Punctuation | 164 | 0.2% |
Open Punctuation | 164 | 0.2% |
Dash Punctuation | 25 | < 0.1% |
Other Punctuation | 13 | < 0.1% |
Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
수 | 9659 | |
배 | 9030 | |
관 | 6991 | |
통 | 6718 | |
제 | 3486 | 6.1% |
암 | 1848 | 3.2% |
거 | 1679 | 2.9% |
문 | 1083 | 1.9% |
천 | 752 | 1.3% |
산 | 605 | 1.1% |
Other values (252) | 15461 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3346 | |
2 | 2028 | |
3 | 1443 | |
4 | 1117 | 9.3% |
5 | 924 | 7.7% |
6 | 816 | 6.8% |
7 | 683 | 5.7% |
8 | 591 | 4.9% |
9 | 532 | 4.4% |
0 | 522 | 4.3% |
Uppercase Letter
Value | Count | Frequency (%) |
U | 47 | |
O | 43 | |
X | 43 | |
B | 43 | |
I | 2 | 1.1% |
C | 2 | 1.1% |
D | 1 | 0.5% |
V | 1 | 0.5% |
Y | 1 | 0.5% |
Other Punctuation
Value | Count | Frequency (%) |
. | 6 | |
@ | 3 | |
* | 3 | |
# | 1 | 7.7% |
Space Separator
Value | Count | Frequency (%) |
370 |
Close Punctuation
Value | Count | Frequency (%) |
) | 164 |
Open Punctuation
Value | Count | Frequency (%) |
( | 164 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 25 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 57312 | |
Common | 12739 | 18.1% |
Latin | 183 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
수 | 9659 | |
배 | 9030 | |
관 | 6991 | |
통 | 6718 | |
제 | 3486 | 6.1% |
암 | 1848 | 3.2% |
거 | 1679 | 2.9% |
문 | 1083 | 1.9% |
천 | 752 | 1.3% |
산 | 605 | 1.1% |
Other values (252) | 15461 |
Common
Value | Count | Frequency (%) |
1 | 3346 | |
2 | 2028 | |
3 | 1443 | |
4 | 1117 | 8.8% |
5 | 924 | 7.3% |
6 | 816 | 6.4% |
7 | 683 | 5.4% |
8 | 591 | 4.6% |
9 | 532 | 4.2% |
0 | 522 | 4.1% |
Other values (9) | 737 | 5.8% |
Latin
Value | Count | Frequency (%) |
U | 47 | |
O | 43 | |
X | 43 | |
B | 43 | |
I | 2 | 1.1% |
C | 2 | 1.1% |
D | 1 | 0.5% |
V | 1 | 0.5% |
Y | 1 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 57312 | |
ASCII | 12922 | 18.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
수 | 9659 | |
배 | 9030 | |
관 | 6991 | |
통 | 6718 | |
제 | 3486 | 6.1% |
암 | 1848 | 3.2% |
거 | 1679 | 2.9% |
문 | 1083 | 1.9% |
천 | 752 | 1.3% |
산 | 605 | 1.1% |
Other values (252) | 15461 |
ASCII
Value | Count | Frequency (%) |
1 | 3346 | |
2 | 2028 | |
3 | 1443 | |
4 | 1117 | 8.6% |
5 | 924 | 7.2% |
6 | 816 | 6.3% |
7 | 683 | 5.3% |
8 | 591 | 4.6% |
9 | 532 | 4.1% |
0 | 522 | 4.0% |
Other values (18) | 920 | 7.1% |
공간정보
Text
Distinct | 9700 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 45 |
---|---|
Median length | 45 |
Mean length | 44.5105 |
Min length | 40 |
Characters and Unicode
Total characters | 445105 |
---|---|
Distinct characters | 19 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 9400 ? |
---|---|
Unique (%) | 94.0% |
Sample
1st row | POINT (1068114.0508975058 1678896.8450204893) |
---|---|
2nd row | POINT (1060449.0227276098 1749069.2373822585) |
3rd row | POINT (1056820.565164969 1670808.3378589272) |
4th row | POINT (1098977.458962936 1693429.5799286035) |
5th row | POINT (1025278.9763804282 1734412.4653892987) |
Value | Count | Frequency (%) |
point | 10000 | |
1140418.8755580366 | 2 | < 0.1% |
1080934.427717068 | 2 | < 0.1% |
1734583.964913643 | 2 | < 0.1% |
1104549.7904665205 | 2 | < 0.1% |
1728178.8659452284 | 2 | < 0.1% |
1062277.4897165694 | 2 | < 0.1% |
1725296.5015104176 | 2 | < 0.1% |
1021494.0922068657 | 2 | < 0.1% |
1717981.1739145685 | 2 | < 0.1% |
Other values (19391) | 19982 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 50397 | |
7 | 35791 | 8.0% |
0 | 35642 | 8.0% |
6 | 33919 | 7.6% |
2 | 30456 | 6.8% |
5 | 30400 | 6.8% |
3 | 30334 | 6.8% |
4 | 29701 | 6.7% |
8 | 29366 | 6.6% |
9 | 29099 | 6.5% |
Other values (9) | 110000 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 335105 | |
Uppercase Letter | 50000 | 11.2% |
Other Punctuation | 20000 | 4.5% |
Space Separator | 20000 | 4.5% |
Open Punctuation | 10000 | 2.2% |
Close Punctuation | 10000 | 2.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 50397 | |
7 | 35791 | |
0 | 35642 | |
6 | 33919 | |
2 | 30456 | |
5 | 30400 | |
3 | 30334 | |
4 | 29701 | |
8 | 29366 | |
9 | 29099 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 10000 | |
O | 10000 | |
T | 10000 | |
N | 10000 | |
I | 10000 |
Other Punctuation
Value | Count | Frequency (%) |
. | 20000 |
Space Separator
Value | Count | Frequency (%) |
20000 |
Open Punctuation
Value | Count | Frequency (%) |
( | 10000 |
Close Punctuation
Value | Count | Frequency (%) |
) | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 395105 | |
Latin | 50000 | 11.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 50397 | |
7 | 35791 | |
0 | 35642 | |
6 | 33919 | |
2 | 30456 | |
5 | 30400 | |
3 | 30334 | |
4 | 29701 | |
8 | 29366 | |
9 | 29099 | |
Other values (4) | 60000 |
Latin
Value | Count | Frequency (%) |
P | 10000 | |
O | 10000 | |
T | 10000 | |
N | 10000 | |
I | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 445105 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 50397 | |
7 | 35791 | 8.0% |
0 | 35642 | 8.0% |
6 | 33919 | 7.6% |
2 | 30456 | 6.8% |
5 | 30400 | 6.8% |
3 | 30334 | 6.8% |
4 | 29701 | 6.7% |
8 | 29366 | 6.6% |
9 | 29099 | 6.5% |
Other values (9) | 110000 |
공간아이디 | 구분코드 | 일련번호 | |
---|---|---|---|
공간아이디 | 1.000 | 0.297 | 0.429 |
구분코드 | 0.297 | 1.000 | 0.261 |
일련번호 | 0.429 | 0.261 | 1.000 |
공간아이디 | 일련번호 | 구분코드 | |
---|---|---|---|
공간아이디 | 1.000 | 0.083 | 0.139 |
일련번호 | 0.083 | 1.000 | 0.131 |
구분코드 | 0.139 | 0.131 | 1.000 |
공간아이디 | 하천관리코드 | 구분코드 | 일련번호 | 부속물명 | 공간정보 | |
---|---|---|---|---|---|---|
7175 | 7169 | 20255602014F02Q0101 | S08 | 42 | 영오제10배수통관 | POINT (1068114.0508975058 1678896.8450204893) |
641 | 642 | 20227502012F01Q0101 | S09 | 13 | 월광제13배수통관 | POINT (1060449.0227276098 1749069.2373822585) |
16845 | 16838 | 27209901994F02Q0101 | S08 | 83 | 우천2취수문 | POINT (1056820.565164969 1670808.3378589272) |
14199 | 14192 | 27202401995F01Q0101 | S08 | 18 | 배수통관 | POINT (1098977.458962936 1693429.5799286035) |
3869 | 3868 | 20241502016F01Q0101 | S08 | 20 | 좌11배수통관 | POINT (1025278.9763804282 1734412.4653892987) |
311 | 311 | 20225602005F02Q0101 | S09 | 26 | 좌 | POINT (1095450.8159278487 1733662.4881836309) |
16878 | 16871 | 27209901994F02Q0101 | S08 | 130 | 종천7배수통관 | POINT (1058364.563886197 1667780.880478418) |
15306 | 15299 | 27205202014F01Q0101 | S13 | 32 | 여수토 | POINT (1083266.9717359336 1668948.1986530311) |
20122 | 20123 | 20233802019F02Q0101 | S08 | 35 | 황계제27배수통관 | POINT (1053561.459955034 1723850.994294676) |
4021 | 4020 | 20242902016F02Q0101 | S08 | 14 | 죽곡좌8배수통관 | POINT (1018609.189352776 1725115.2229685425) |
공간아이디 | 하천관리코드 | 구분코드 | 일련번호 | 부속물명 | 공간정보 | |
---|---|---|---|---|---|---|
10301 | 10302 | 20264901995F01Q0101 | S08 | 47 | 거문6배수통관 | POINT (1099067.820325124 1716103.5296267036) |
12878 | 12872 | 20276102018F02Q0101 | S08 | 4 | 제20배수통관 | POINT (1140700.125847342 1706977.634365025) |
16543 | 16536 | 27209401996F01Q0101 | S08 | 35 | 동림7배수암거 | POINT (1052796.9133151432 1660974.7634337025) |
11860 | 11855 | 20272202008F01Q0101 | S08 | 1 | 시전 제1배수통관 | POINT (1128586.1744087113 1726487.8868032238) |
12162 | 12157 | 20274201995F01Q0101 | S08 | 8 | 내포2배수암거 | POINT (1129683.837154323 1712743.898516166) |
16980 | 16973 | 27210002018F02Q0101 | S08 | 6 | 제4배수통문 | POINT (1058696.9743806804 1671987.3846174257) |
14476 | 14468 | 27202701999F01Q0101 | S08 | 22 | 교방1배수암거 | POINT (1097185.4108382112 1691315.403944658) |
7401 | 7395 | 20256002010F01Q0101 | S08 | 9 | 검암제3배수암거 | POINT (1063503.5778962586 1680856.526224242) |
1642 | 1643 | 20231102004F01Q0101 | S13 | 80 | 관정 | POINT (1038306.698614512 1739290.791807352) |
3960 | 3959 | 20242002008F01Q0101 | S08 | 5 | 공배5배수통관 | POINT (1024970.5325376411 1727317.4616374543) |