Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 30770 |
Missing cells (%) | 38.5% |
Duplicate rows | 25 |
Duplicate rows (%) | 0.2% |
Total size in memory | 722.7 KiB |
Average record size in memory | 74.0 B |
Variable types
Text | 6 |
---|---|
Numeric | 2 |
Dataset
Description | 경상남도 하천관리 시스템의 구조물현황 데이터로, 하천명, 부속물명, 부속물 주소, 부속물 측점번호, 부속물 구조 및 규모, 비고사항에 대한 정보를 제공합니다. |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15093546 |
Dataset has 25 (0.2%) duplicate rows | Duplicates |
부속물_기타주소 has 5425 (54.2%) missing values | Missing |
부속물_구조 has 8425 (84.2%) missing values | Missing |
부속물_규모 has 9566 (95.7%) missing values | Missing |
비고 has 7288 (72.9%) missing values | Missing |
Reproduction
Analysis started | 2023-12-11 00:41:12.987087 |
---|---|
Analysis finished | 2023-12-11 00:41:14.868141 |
Duration | 1.88 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
하천명
Text
Distinct | 459 |
---|---|
Distinct (%) | 4.6% |
Missing | 18 |
Missing (%) | 0.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
대산천 | 139 | 1.4% |
가천천 | 138 | 1.4% |
대곡천 | 120 | 1.2% |
동천 | 115 | 1.2% |
금양천 | 107 | 1.1% |
영천강 | 106 | 1.1% |
회야강 | 101 | 1.0% |
양천 | 101 | 1.0% |
석교천 | 91 | 0.9% |
금성천 | 91 | 0.9% |
Other values (449) | 8873 |
Most occurring characters
Value | Count | Frequency (%) |
천 | 10401 | |
곡 | 1141 | 3.8% |
산 | 823 | 2.8% |
양 | 593 | 2.0% |
대 | 477 | 1.6% |
계 | 468 | 1.6% |
성 | 419 | 1.4% |
강 | 404 | 1.4% |
가 | 384 | 1.3% |
신 | 333 | 1.1% |
Other values (196) | 14383 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 29718 | |
Decimal Number | 52 | 0.2% |
Close Punctuation | 28 | 0.1% |
Open Punctuation | 28 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
천 | 10401 | |
곡 | 1141 | 3.8% |
산 | 823 | 2.8% |
양 | 593 | 2.0% |
대 | 477 | 1.6% |
계 | 468 | 1.6% |
성 | 419 | 1.4% |
강 | 404 | 1.4% |
가 | 384 | 1.3% |
신 | 333 | 1.1% |
Other values (192) | 14275 |
Decimal Number
Value | Count | Frequency (%) |
1 | 38 | |
2 | 14 | 26.9% |
Close Punctuation
Value | Count | Frequency (%) |
) | 28 |
Open Punctuation
Value | Count | Frequency (%) |
( | 28 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29718 | |
Common | 108 | 0.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
천 | 10401 | |
곡 | 1141 | 3.8% |
산 | 823 | 2.8% |
양 | 593 | 2.0% |
대 | 477 | 1.6% |
계 | 468 | 1.6% |
성 | 419 | 1.4% |
강 | 404 | 1.4% |
가 | 384 | 1.3% |
신 | 333 | 1.1% |
Other values (192) | 14275 |
Common
Value | Count | Frequency (%) |
1 | 38 | |
) | 28 | |
( | 28 | |
2 | 14 | 13.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29718 | |
ASCII | 108 | 0.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
천 | 10401 | |
곡 | 1141 | 3.8% |
산 | 823 | 2.8% |
양 | 593 | 2.0% |
대 | 477 | 1.6% |
계 | 468 | 1.6% |
성 | 419 | 1.4% |
강 | 404 | 1.4% |
가 | 384 | 1.3% |
신 | 333 | 1.1% |
Other values (192) | 14275 |
ASCII
Value | Count | Frequency (%) |
1 | 38 | |
) | 28 | |
( | 28 | |
2 | 14 | 13.0% |
일련번호
Real number (ℝ)
Distinct | 636 |
---|---|
Distinct (%) | 6.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 79.3483 |
Minimum | 1 |
---|---|
Maximum | 2002 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 12 |
median | 26 |
Q3 | 51 |
95-th percentile | 267.05 |
Maximum | 2002 |
Range | 2001 |
Interquartile range (IQR) | 39 |
Descriptive statistics
Standard deviation | 218.65936 |
---|---|
Coefficient of variation (CV) | 2.7556905 |
Kurtosis | 27.152975 |
Mean | 79.3483 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 5.0992906 |
Sum | 793483 |
Variance | 47811.914 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 255 | 2.5% |
4 | 248 | 2.5% |
7 | 246 | 2.5% |
3 | 245 | 2.5% |
2 | 242 | 2.4% |
5 | 230 | 2.3% |
8 | 223 | 2.2% |
6 | 218 | 2.2% |
15 | 213 | 2.1% |
12 | 209 | 2.1% |
Other values (626) | 7671 |
Value | Count | Frequency (%) |
1 | 255 | |
2 | 242 | |
3 | 245 | |
4 | 248 | |
5 | 230 | |
6 | 218 | |
7 | 246 | |
8 | 223 | |
9 | 188 | |
10 | 202 |
Value | Count | Frequency (%) |
2002 | 2 | |
1704 | 1 | |
1703 | 1 | |
1702 | 1 | |
1701 | 1 | |
1698 | 1 | |
1697 | 1 | |
1691 | 1 | |
1688 | 1 | |
1680 | 1 |
부속물명
Text
Distinct | 7839 |
---|---|
Distinct (%) | 78.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
배수통관 | 312 | 3.0% |
호수,저수지 | 130 | 1.3% |
호수저수지 | 107 | 1.0% |
배수암거 | 46 | 0.4% |
취수문 | 36 | 0.4% |
저수지 | 31 | 0.3% |
제1낙차공 | 26 | 0.3% |
낙차보 | 25 | 0.2% |
제2낙차공 | 25 | 0.2% |
계획교량 | 24 | 0.2% |
Other values (7824) | 9495 |
Most occurring characters
Value | Count | Frequency (%) |
수 | 6068 | 9.8% |
배 | 4716 | 7.6% |
관 | 3741 | 6.0% |
통 | 3583 | 5.8% |
제 | 3205 | 5.2% |
1 | 3003 | 4.8% |
교 | 2035 | 3.3% |
2 | 1888 | 3.0% |
낙 | 1399 | 2.3% |
차 | 1392 | 2.2% |
Other values (386) | 31051 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 50449 | |
Decimal Number | 10528 | 17.0% |
Space Separator | 305 | 0.5% |
Uppercase Letter | 219 | 0.4% |
Open Punctuation | 200 | 0.3% |
Close Punctuation | 197 | 0.3% |
Other Punctuation | 142 | 0.2% |
Dash Punctuation | 28 | < 0.1% |
Connector Punctuation | 11 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
수 | 6068 | 12.0% |
배 | 4716 | 9.3% |
관 | 3741 | 7.4% |
통 | 3583 | 7.1% |
제 | 3205 | 6.4% |
교 | 2035 | 4.0% |
낙 | 1399 | 2.8% |
차 | 1392 | 2.8% |
공 | 1315 | 2.6% |
보 | 1188 | 2.4% |
Other values (352) | 21807 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 64 | |
X | 63 | |
O | 61 | |
U | 17 | 7.8% |
C | 4 | 1.8% |
I | 3 | 1.4% |
D | 1 | 0.5% |
Y | 1 | 0.5% |
T | 1 | 0.5% |
J | 1 | 0.5% |
Other values (3) | 3 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 3003 | |
2 | 1888 | |
3 | 1279 | |
4 | 968 | 9.2% |
5 | 812 | 7.7% |
6 | 704 | 6.7% |
7 | 556 | 5.3% |
8 | 497 | 4.7% |
0 | 421 | 4.0% |
9 | 400 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 131 | |
. | 6 | 4.2% |
@ | 2 | 1.4% |
: | 2 | 1.4% |
# | 1 | 0.7% |
Space Separator
Value | Count | Frequency (%) |
305 |
Open Punctuation
Value | Count | Frequency (%) |
( | 200 |
Close Punctuation
Value | Count | Frequency (%) |
) | 197 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 28 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 11 |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 50449 | |
Common | 11413 | 18.4% |
Latin | 219 | 0.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
수 | 6068 | 12.0% |
배 | 4716 | 9.3% |
관 | 3741 | 7.4% |
통 | 3583 | 7.1% |
제 | 3205 | 6.4% |
교 | 2035 | 4.0% |
낙 | 1399 | 2.8% |
차 | 1392 | 2.8% |
공 | 1315 | 2.6% |
보 | 1188 | 2.4% |
Other values (352) | 21807 |
Common
Value | Count | Frequency (%) |
1 | 3003 | |
2 | 1888 | |
3 | 1279 | |
4 | 968 | 8.5% |
5 | 812 | 7.1% |
6 | 704 | 6.2% |
7 | 556 | 4.9% |
8 | 497 | 4.4% |
0 | 421 | 3.7% |
9 | 400 | 3.5% |
Other values (11) | 885 | 7.8% |
Latin
Value | Count | Frequency (%) |
B | 64 | |
X | 63 | |
O | 61 | |
U | 17 | 7.8% |
C | 4 | 1.8% |
I | 3 | 1.4% |
D | 1 | 0.5% |
Y | 1 | 0.5% |
T | 1 | 0.5% |
J | 1 | 0.5% |
Other values (3) | 3 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 50449 | |
ASCII | 11632 | 18.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
수 | 6068 | 12.0% |
배 | 4716 | 9.3% |
관 | 3741 | 7.4% |
통 | 3583 | 7.1% |
제 | 3205 | 6.4% |
교 | 2035 | 4.0% |
낙 | 1399 | 2.8% |
차 | 1392 | 2.8% |
공 | 1315 | 2.6% |
보 | 1188 | 2.4% |
Other values (352) | 21807 |
ASCII
Value | Count | Frequency (%) |
1 | 3003 | |
2 | 1888 | |
3 | 1279 | |
4 | 968 | 8.3% |
5 | 812 | 7.0% |
6 | 704 | 6.1% |
7 | 556 | 4.8% |
8 | 497 | 4.3% |
0 | 421 | 3.6% |
9 | 400 | 3.4% |
Other values (24) | 1104 | 9.5% |
부속물_기타주소
Text
MISSING
 
Distinct | 724 |
---|---|
Distinct (%) | 15.8% |
Missing | 5425 |
Missing (%) | 54.2% |
Memory size | 156.2 KiB |
Length
Max length | 20 |
---|---|
Median length | 16 |
Mean length | 15.094645 |
Min length | 10 |
Characters and Unicode
Total characters | 69058 |
---|---|
Distinct characters | 243 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 123 ? |
---|---|
Unique (%) | 2.7% |
Sample
1st row | 경상남도 진해시 소사동 |
---|---|
2nd row | 함안군 대산면 옥렬리 |
3rd row | 경상남도 거창군 거창읍 김천리 |
4th row | 창원시 마산회원구 합성동 |
5th row | 경상남도 진해시 소사동 |
Value | Count | Frequency (%) |
경상남도 | 4175 | |
밀양시 | 352 | 2.0% |
마산시 | 348 | 2.0% |
고성군 | 338 | 1.9% |
진주시 | 317 | 1.8% |
창원시 | 307 | 1.8% |
함안군 | 306 | 1.8% |
창녕군 | 301 | 1.7% |
사천시 | 266 | 1.5% |
하동군 | 264 | 1.5% |
Other values (771) | 10437 |
Most occurring characters
Value | Count | Frequency (%) |
12836 | ||
남 | 4528 | 6.6% |
상 | 4388 | 6.4% |
도 | 4372 | 6.3% |
경 | 4220 | 6.1% |
리 | 3988 | 5.8% |
면 | 3356 | 4.9% |
군 | 2380 | 3.4% |
시 | 2243 | 3.2% |
산 | 1503 | 2.2% |
Other values (233) | 25244 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 56191 | |
Space Separator | 12836 | 18.6% |
Decimal Number | 31 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
남 | 4528 | 8.1% |
상 | 4388 | 7.8% |
도 | 4372 | 7.8% |
경 | 4220 | 7.5% |
리 | 3988 | 7.1% |
면 | 3356 | 6.0% |
군 | 2380 | 4.2% |
시 | 2243 | 4.0% |
산 | 1503 | 2.7% |
동 | 1385 | 2.5% |
Other values (228) | 23828 |
Decimal Number
Value | Count | Frequency (%) |
1 | 14 | |
3 | 11 | |
4 | 5 | 16.1% |
2 | 1 | 3.2% |
Space Separator
Value | Count | Frequency (%) |
12836 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 56191 | |
Common | 12867 | 18.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
남 | 4528 | 8.1% |
상 | 4388 | 7.8% |
도 | 4372 | 7.8% |
경 | 4220 | 7.5% |
리 | 3988 | 7.1% |
면 | 3356 | 6.0% |
군 | 2380 | 4.2% |
시 | 2243 | 4.0% |
산 | 1503 | 2.7% |
동 | 1385 | 2.5% |
Other values (228) | 23828 |
Common
Value | Count | Frequency (%) |
12836 | ||
1 | 14 | 0.1% |
3 | 11 | 0.1% |
4 | 5 | < 0.1% |
2 | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 56191 | |
ASCII | 12867 | 18.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
12836 | ||
1 | 14 | 0.1% |
3 | 11 | 0.1% |
4 | 5 | < 0.1% |
2 | 1 | < 0.1% |
Hangul
Value | Count | Frequency (%) |
남 | 4528 | 8.1% |
상 | 4388 | 7.8% |
도 | 4372 | 7.8% |
경 | 4220 | 7.5% |
리 | 3988 | 7.1% |
면 | 3356 | 6.0% |
군 | 2380 | 4.2% |
시 | 2243 | 4.0% |
산 | 1503 | 2.7% |
동 | 1385 | 2.5% |
Other values (228) | 23828 |
부속물_측점번호
Text
Distinct | 6580 |
---|---|
Distinct (%) | 66.1% |
Missing | 48 |
Missing (%) | 0.5% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 9 |
Mean length | 9.1359526 |
Min length | 2 |
Characters and Unicode
Total characters | 90921 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 4850 ? |
---|---|
Unique (%) | 48.7% |
Sample
1st row | 0008+0037 |
---|---|
2nd row | 0012+0035 |
3rd row | 0024+0003 |
4th row | 0000+0794 |
5th row | 0000+0632 |
Value | Count | Frequency (%) |
0000+0000 | 692 | 7.0% |
0008+0000 | 10 | 0.1% |
0007+0000 | 9 | 0.1% |
0000+0030 | 9 | 0.1% |
0021+0000 | 8 | 0.1% |
0005+0031 | 8 | 0.1% |
0016+0000 | 8 | 0.1% |
0000+0050 | 8 | 0.1% |
0002+0000 | 8 | 0.1% |
0004+0028 | 8 | 0.1% |
Other values (6570) | 9184 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 47774 | |
+ | 9948 | 10.9% |
1 | 6215 | 6.8% |
2 | 4522 | 5.0% |
3 | 3899 | 4.3% |
5 | 3549 | 3.9% |
4 | 3448 | 3.8% |
6 | 2948 | 3.2% |
7 | 2846 | 3.1% |
8 | 2761 | 3.0% |
Other values (4) | 3011 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80512 | |
Math Symbol | 9948 | 10.9% |
Other Punctuation | 458 | 0.5% |
Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 47774 | |
1 | 6215 | 7.7% |
2 | 4522 | 5.6% |
3 | 3899 | 4.8% |
5 | 3549 | 4.4% |
4 | 3448 | 4.3% |
6 | 2948 | 3.7% |
7 | 2846 | 3.5% |
8 | 2761 | 3.4% |
9 | 2550 | 3.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 456 | |
, | 2 | 0.4% |
Math Symbol
Value | Count | Frequency (%) |
+ | 9948 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 90921 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 47774 | |
+ | 9948 | 10.9% |
1 | 6215 | 6.8% |
2 | 4522 | 5.0% |
3 | 3899 | 4.3% |
5 | 3549 | 3.9% |
4 | 3448 | 3.8% |
6 | 2948 | 3.2% |
7 | 2846 | 3.1% |
8 | 2761 | 3.0% |
Other values (4) | 3011 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90921 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 47774 | |
+ | 9948 | 10.9% |
1 | 6215 | 6.8% |
2 | 4522 | 5.0% |
3 | 3899 | 4.3% |
5 | 3549 | 3.9% |
4 | 3448 | 3.8% |
6 | 2948 | 3.2% |
7 | 2846 | 3.1% |
8 | 2761 | 3.0% |
Other values (4) | 3011 | 3.3% |
부속물_구조
Text
MISSING
 
Distinct | 207 |
---|---|
Distinct (%) | 13.1% |
Missing | 8425 |
Missing (%) | 84.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
r.c | 335 | |
흄관 | 225 | 10.6% |
con'c | 183 | 8.6% |
146 | 6.9% | |
thp | 124 | 5.8% |
hp | 111 | 5.2% |
csp | 68 | 3.2% |
콘크리트 | 50 | 2.3% |
접수면적 | 48 | 2.3% |
중력식 | 43 | 2.0% |
Other values (286) | 798 |
Most occurring characters
Value | Count | Frequency (%) |
C | 962 | 10.2% |
. | 675 | 7.2% |
592 | 6.3% | |
: | 400 | 4.2% |
P | 352 | 3.7% |
R | 345 | 3.7% |
0 | 312 | 3.3% |
O | 300 | 3.2% |
관 | 269 | 2.9% |
' | 261 | 2.8% |
Other values (142) | 4951 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 3053 | |
Other Letter | 2817 | |
Other Punctuation | 1538 | |
Decimal Number | 925 | 9.8% |
Space Separator | 592 | 6.3% |
Lowercase Letter | 377 | 4.0% |
Other Symbol | 103 | 1.1% |
Modifier Symbol | 4 | < 0.1% |
Dash Punctuation | 4 | < 0.1% |
Close Punctuation | 2 | < 0.1% |
Other values (2) | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
관 | 269 | 9.5% |
흄 | 225 | 8.0% |
조 | 137 | 4.9% |
수 | 113 | 4.0% |
면 | 105 | 3.7% |
적 | 103 | 3.7% |
강 | 88 | 3.1% |
이 | 80 | 2.8% |
식 | 69 | 2.4% |
중 | 69 | 2.4% |
Other values (82) | 1559 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 962 | |
P | 352 | 11.5% |
R | 345 | 11.3% |
O | 300 | 9.8% |
N | 259 | 8.5% |
H | 252 | 8.3% |
T | 135 | 4.4% |
S | 109 | 3.6% |
B | 85 | 2.8% |
L | 56 | 1.8% |
Other values (10) | 198 | 6.5% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 172 | |
r | 40 | 10.6% |
h | 39 | 10.3% |
f | 39 | 10.3% |
o | 17 | 4.5% |
n | 11 | 2.9% |
c | 11 | 2.9% |
b | 10 | 2.7% |
x | 9 | 2.4% |
l | 8 | 2.1% |
Other values (7) | 21 | 5.6% |
Decimal Number
Value | Count | Frequency (%) |
0 | 312 | |
2 | 113 | 12.2% |
7 | 86 | 9.3% |
1 | 75 | 8.1% |
5 | 75 | 8.1% |
3 | 65 | 7.0% |
8 | 64 | 6.9% |
4 | 59 | 6.4% |
6 | 54 | 5.8% |
9 | 22 | 2.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 675 | |
: | 400 | |
' | 261 | 17.0% |
/ | 160 | 10.4% |
, | 40 | 2.6% |
" | 2 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
592 |
Other Symbol
Value | Count | Frequency (%) |
㎢ | 103 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 4 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Math Symbol
Value | Count | Frequency (%) |
= | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3430 | |
Common | 3172 | |
Hangul | 2817 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
관 | 269 | 9.5% |
흄 | 225 | 8.0% |
조 | 137 | 4.9% |
수 | 113 | 4.0% |
면 | 105 | 3.7% |
적 | 103 | 3.7% |
강 | 88 | 3.1% |
이 | 80 | 2.8% |
식 | 69 | 2.4% |
중 | 69 | 2.4% |
Other values (82) | 1559 |
Latin
Value | Count | Frequency (%) |
C | 962 | |
P | 352 | 10.3% |
R | 345 | 10.1% |
O | 300 | 8.7% |
N | 259 | 7.6% |
H | 252 | 7.3% |
m | 172 | 5.0% |
T | 135 | 3.9% |
S | 109 | 3.2% |
B | 85 | 2.5% |
Other values (27) | 459 |
Common
Value | Count | Frequency (%) |
. | 675 | |
592 | ||
: | 400 | |
0 | 312 | |
' | 261 | 8.2% |
/ | 160 | 5.0% |
2 | 113 | 3.6% |
㎢ | 103 | 3.2% |
7 | 86 | 2.7% |
1 | 75 | 2.4% |
Other values (13) | 395 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 6498 | |
Hangul | 2817 | |
CJK Compat | 103 | 1.1% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 962 | |
. | 675 | 10.4% |
592 | 9.1% | |
: | 400 | 6.2% |
P | 352 | 5.4% |
R | 345 | 5.3% |
0 | 312 | 4.8% |
O | 300 | 4.6% |
' | 261 | 4.0% |
N | 259 | 4.0% |
Other values (48) | 2040 |
Hangul
Value | Count | Frequency (%) |
관 | 269 | 9.5% |
흄 | 225 | 8.0% |
조 | 137 | 4.9% |
수 | 113 | 4.0% |
면 | 105 | 3.7% |
적 | 103 | 3.7% |
강 | 88 | 3.1% |
이 | 80 | 2.8% |
식 | 69 | 2.4% |
중 | 69 | 2.4% |
Other values (82) | 1559 |
CJK Compat
Value | Count | Frequency (%) |
㎢ | 103 |
None
Value | Count | Frequency (%) |
Ø | 1 |
부속물_규모
Real number (ℝ)
MISSING
 
Distinct | 182 |
---|---|
Distinct (%) | 41.9% |
Missing | 9566 |
Missing (%) | 95.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.409447 |
Minimum | 0 |
---|---|
Maximum | 1000 |
Zeros | 3 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3.1 |
Q1 | 7 |
median | 11 |
Q3 | 20 |
95-th percentile | 51.335 |
Maximum | 1000 |
Range | 1000 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 104.53639 |
---|---|
Coefficient of variation (CV) | 3.5545173 |
Kurtosis | 53.113622 |
Mean | 29.409447 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 7.255506 |
Sum | 12763.7 |
Variance | 10927.856 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8.0 | 22 | 0.2% |
12.0 | 17 | 0.2% |
7.0 | 15 | 0.1% |
10.0 | 12 | 0.1% |
6.0 | 10 | 0.1% |
3.1 | 9 | 0.1% |
3.0 | 9 | 0.1% |
18.0 | 8 | 0.1% |
4.0 | 8 | 0.1% |
5.0 | 8 | 0.1% |
Other values (172) | 316 | 3.2% |
(Missing) | 9566 |
Value | Count | Frequency (%) |
0.0 | 3 | < 0.1% |
1.0 | 1 | < 0.1% |
1.1 | 1 | < 0.1% |
1.2 | 1 | < 0.1% |
2.0 | 1 | < 0.1% |
2.7 | 1 | < 0.1% |
3.0 | 9 | |
3.1 | 9 | |
3.2 | 1 | < 0.1% |
3.4 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1000.0 | 1 | < 0.1% |
800.0 | 5 | |
600.0 | 2 | < 0.1% |
110.0 | 1 | < 0.1% |
98.3 | 1 | < 0.1% |
93.1 | 1 | < 0.1% |
75.0 | 1 | < 0.1% |
69.3 | 1 | < 0.1% |
64.3 | 1 | < 0.1% |
63.0 | 1 | < 0.1% |
비고
Text
MISSING
 
Distinct | 167 |
---|---|
Distinct (%) | 6.2% |
Missing | 7288 |
Missing (%) | 72.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
신설 | 1309 | |
재가설 | 373 | 12.8% |
충분 | 226 | 7.8% |
존치 | 148 | 5.1% |
부족 | 90 | 3.1% |
계획지구 | 87 | 3.0% |
증설 | 60 | 2.1% |
재설치 | 41 | 1.4% |
기존확장 | 37 | 1.3% |
철거 | 33 | 1.1% |
Other values (185) | 510 | 17.5% |
Most occurring characters
Value | Count | Frequency (%) |
설 | 1899 | |
신 | 1322 | |
재 | 452 | 5.3% |
가 | 413 | 4.8% |
충 | 229 | 2.7% |
분 | 229 | 2.7% |
223 | 2.6% | |
치 | 220 | 2.6% |
존 | 196 | 2.3% |
장 | 172 | 2.0% |
Other values (166) | 3192 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7408 | |
Decimal Number | 330 | 3.9% |
Space Separator | 225 | 2.6% |
Other Punctuation | 225 | 2.6% |
Close Punctuation | 85 | 1.0% |
Open Punctuation | 85 | 1.0% |
Uppercase Letter | 60 | 0.7% |
Lowercase Letter | 49 | 0.6% |
Math Symbol | 45 | 0.5% |
Dash Punctuation | 26 | 0.3% |
Other values (2) | 9 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
설 | 1899 | |
신 | 1322 | |
재 | 452 | 6.1% |
가 | 413 | 5.6% |
충 | 229 | 3.1% |
분 | 229 | 3.1% |
치 | 220 | 3.0% |
존 | 196 | 2.6% |
장 | 172 | 2.3% |
구 | 140 | 1.9% |
Other values (118) | 2136 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 9 | |
h | 7 | |
e | 6 | |
s | 6 | |
o | 5 | |
c | 5 | |
m | 4 | |
φ | 3 | 6.1% |
l | 1 | 2.0% |
p | 1 | 2.0% |
Other values (2) | 2 | 4.1% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 22 | |
H | 5 | 8.3% |
E | 4 | 6.7% |
U | 4 | 6.7% |
T | 4 | 6.7% |
B | 4 | 6.7% |
A | 4 | 6.7% |
L | 4 | 6.7% |
S | 3 | 5.0% |
N | 3 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
0 | 100 | |
1 | 81 | |
2 | 39 | 11.8% |
5 | 22 | 6.7% |
8 | 19 | 5.8% |
7 | 17 | 5.2% |
3 | 17 | 5.2% |
4 | 14 | 4.2% |
6 | 11 | 3.3% |
9 | 10 | 3.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 153 | |
. | 34 | 15.1% |
: | 30 | 13.3% |
/ | 7 | 3.1% |
* | 1 | 0.4% |
Space Separator
Value | Count | Frequency (%) |
223 | ||
2 | 0.9% |
Math Symbol
Value | Count | Frequency (%) |
× | 24 | |
+ | 21 |
Other Symbol
Value | Count | Frequency (%) |
㎥ | 5 | |
㈜ | 1 | 16.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 85 |
Open Punctuation
Value | Count | Frequency (%) |
( | 85 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 26 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7409 | |
Common | 1029 | 12.0% |
Latin | 103 | 1.2% |
Greek | 6 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
설 | 1899 | |
신 | 1322 | |
재 | 452 | 6.1% |
가 | 413 | 5.6% |
충 | 229 | 3.1% |
분 | 229 | 3.1% |
치 | 220 | 3.0% |
존 | 196 | 2.6% |
장 | 172 | 2.3% |
구 | 140 | 1.9% |
Other values (119) | 2137 |
Common
Value | Count | Frequency (%) |
223 | ||
, | 153 | |
0 | 100 | |
) | 85 | 8.3% |
( | 85 | 8.3% |
1 | 81 | 7.9% |
2 | 39 | 3.8% |
. | 34 | 3.3% |
: | 30 | 2.9% |
- | 26 | 2.5% |
Other values (14) | 173 |
Latin
Value | Count | Frequency (%) |
D | 22 | |
a | 9 | 8.7% |
h | 7 | 6.8% |
e | 6 | 5.8% |
s | 6 | 5.8% |
o | 5 | 4.9% |
c | 5 | 4.9% |
H | 5 | 4.9% |
E | 4 | 3.9% |
U | 4 | 3.9% |
Other values (11) | 30 |
Greek
Value | Count | Frequency (%) |
φ | 3 | |
Φ | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7408 | |
ASCII | 1101 | 12.9% |
None | 33 | 0.4% |
CJK Compat | 5 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
설 | 1899 | |
신 | 1322 | |
재 | 452 | 6.1% |
가 | 413 | 5.6% |
충 | 229 | 3.1% |
분 | 229 | 3.1% |
치 | 220 | 3.0% |
존 | 196 | 2.6% |
장 | 172 | 2.3% |
구 | 140 | 1.9% |
Other values (118) | 2136 |
ASCII
Value | Count | Frequency (%) |
223 | ||
, | 153 | |
0 | 100 | |
) | 85 | 7.7% |
( | 85 | 7.7% |
1 | 81 | 7.4% |
2 | 39 | 3.5% |
. | 34 | 3.1% |
: | 30 | 2.7% |
- | 26 | 2.4% |
Other values (32) | 245 |
None
Value | Count | Frequency (%) |
× | 24 | |
φ | 3 | 9.1% |
Φ | 3 | 9.1% |
2 | 6.1% | |
㈜ | 1 | 3.0% |
CJK Compat
Value | Count | Frequency (%) |
㎥ | 5 |
일련번호 | 부속물_규모 | |
---|---|---|
일련번호 | 1.000 | NaN |
부속물_규모 | NaN | 1.000 |
일련번호 | 부속물_규모 | |
---|---|---|
일련번호 | 1.000 | -0.003 |
부속물_규모 | -0.003 | 1.000 |
하천명 | 일련번호 | 부속물명 | 부속물_기타주소 | 부속물_측점번호 | 부속물_구조 | 부속물_규모 | 비고 | |
---|---|---|---|---|---|---|---|---|
19692 | 소사천 | 8 | 소사7취입보 | 경상남도 진해시 소사동 | 0008+0037 | R.C | <NA> | 신설 |
40592 | 검단천 | 4 | 검단1교 | <NA> | 0012+0035 | <NA> | <NA> | <NA> |
6575 | 우명천 | 33 | 우명7낙차공(철거) | <NA> | 0024+0003 | <NA> | <NA> | <NA> |
7048 | 성만천 | 18 | 성만9배수통관 | <NA> | 0000+0794 | <NA> | <NA> | <NA> |
1319 | 장자천 | 27 | 장자제2낙차공 | <NA> | 0000+0632 | <NA> | 19.3 | 재가설 |
41716 | 옥열천 | 82 | 옥열10낙차공 | 함안군 대산면 옥렬리 | 0039+0086 | <NA> | <NA> | <NA> |
8703 | 웅곡천 | 5 | 김천2교 | 경상남도 거창군 거창읍 김천리 | 0000+0165 | R.C | <NA> | <NA> |
22819 | 산호천 | 18 | 산호제1취입보 | 창원시 마산회원구 합성동 | 0013+0029 | <NA> | <NA> | <NA> |
3247 | 소사천 | 6 | 소사4배수암거 | 경상남도 진해시 소사동 | 0005+0093 | R.C | <NA> | <NA> |
33797 | 가좌천 | 38 | 가좌3교 | <NA> | 0053+0036 | R.C | <NA> | 재가설 |
하천명 | 일련번호 | 부속물명 | 부속물_기타주소 | 부속물_측점번호 | 부속물_구조 | 부속물_규모 | 비고 | |
---|---|---|---|---|---|---|---|---|
45081 | 덕곡천 | 16 | 덕곡제10배수통관 | <NA> | 0002+0228 | <NA> | <NA> | 재가설(통수단면적 부족) |
5275 | 가천천 | 1201 | 가천21배수암거 | <NA> | 0013+0330 | <NA> | <NA> | <NA> |
31246 | 영오천 | 123 | 좌연7배수통관 | 경상남도 고성군 개천면 좌연리 | 0070+0010 | HP | <NA> | 신설 |
42748 | 유곡천 | 55 | 판곡양수장 | 경상남도 의령군 유곡면 칠곡리 | 0046+0100 | R.C | <NA> | <NA> |
32393 | 내곡천 | 8 | 내곡제3배수통관 | <NA> | 0000+0767 | <NA> | <NA> | 기존확장 |
37845 | 영오천 | 120 | 좌연3취수보 | <NA> | 0000+0000 | <NA> | <NA> | <NA> |
3425 | 부목천 | 32 | 문12 | <NA> | 0015+0079 | <NA> | <NA> | 증설 |
13525 | 의령천 | 3 | 의령1배수관 | <NA> | 0003+0036 | <NA> | <NA> | <NA> |
15238 | 여좌천 | 4 | 여좌28배수통관 | <NA> | 0026+0024 | <NA> | <NA> | <NA> |
11205 | 양산천 | 84 | 배수암거 | <NA> | 0066+0072 | <NA> | <NA> | <NA> |
Most frequently occurring
하천명 | 일련번호 | 부속물명 | 부속물_기타주소 | 부속물_측점번호 | 부속물_구조 | 부속물_규모 | 비고 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|
0 | 단장천 | 27 | 단장4배수통관 | <NA> | 0008+0595 | <NA> | <NA> | <NA> | 2 |
1 | 단장천 | 37 | 사연2배수통관 | <NA> | 0011+0146 | <NA> | <NA> | <NA> | 2 |
2 | 단장천 | 44 | 범도5배수통관 | <NA> | 0015+0014 | <NA> | <NA> | <NA> | 2 |
3 | 대산천 | 4 | 호수저수지 | <NA> | 0000+0000 | <NA> | <NA> | <NA> | 2 |
4 | 방곡천 | 36 | 방곡제5보 | <NA> | 0008+0018 | 중력식 | <NA> | <NA> | 2 |
5 | 방곡천 | 62 | 신촌저수지 | <NA> | 0000+0000 | <NA> | <NA> | <NA> | 2 |
6 | 연초천 | 59 | 연초42배수통관 | <NA> | 0060+0095.50 | <NA> | <NA> | <NA> | 2 |
7 | 연초천 | 71 | 연초53배수통관 | <NA> | 0071+0097.70 | <NA> | <NA> | <NA> | 2 |
8 | 연초천 | 77 | 제1취입보 | <NA> | 0021+0096.60 | <NA> | <NA> | 재설치 | 2 |
9 | 영천강 | 17 | 봉발17교(재가설) | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |