Overview

Dataset statistics

Number of variables8
Number of observations237
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.2 KiB
Average record size in memory65.6 B

Variable types

Categorical3
Text3
Numeric1
DateTime1

Dataset

Description경상남도 거창군 하천에 대한 데이터로 하천구분(지방하천, 소하천), 하천명, 소재지, 기점, 종점, 길이 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15061749/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
구분 is highly overall correlated with 연장(m) and 1 other fieldsHigh correlation
관리주체(관리청) is highly overall correlated with 연장(m) and 1 other fieldsHigh correlation
연장(m) is highly overall correlated with 구분 and 1 other fieldsHigh correlation
기점 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:54:31.261356
Analysis finished2023-12-11 23:54:31.896684
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
소하천
195 
지방하천
42 

Length

Max length4
Median length3
Mean length3.1772152
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방하천
2nd row지방하천
3rd row지방하천
4th row지방하천
5th row지방하천

Common Values

ValueCountFrequency (%)
소하천 195
82.3%
지방하천 42
 
17.7%

Length

2023-12-12T08:54:31.994060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:54:32.078014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소하천 195
82.3%
지방하천 42
 
17.7%

명칭
Text

Distinct233
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T08:54:32.362895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.2109705
Min length2

Characters and Unicode

Total characters761
Distinct characters167
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)96.6%

Sample

1st row양항천
2nd row대곡천
3rd row정장천
4th row대산천
5th row무촌천
ValueCountFrequency (%)
황산천 2
 
0.8%
웅곡천 2
 
0.8%
동호천 2
 
0.8%
산포천 2
 
0.8%
월포천 1
 
0.4%
대곡소천 1
 
0.4%
아주천 1
 
0.4%
안흥천 1
 
0.4%
대촌천 1
 
0.4%
정골천 1
 
0.4%
Other values (223) 223
94.1%
2023-12-12T08:54:32.880890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
243
31.9%
25
 
3.3%
25
 
3.3%
15
 
2.0%
15
 
2.0%
11
 
1.4%
11
 
1.4%
10
 
1.3%
2 10
 
1.3%
9
 
1.2%
Other values (157) 387
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 743
97.6%
Decimal Number 18
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
243
32.7%
25
 
3.4%
25
 
3.4%
15
 
2.0%
15
 
2.0%
11
 
1.5%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (154) 371
49.9%
Decimal Number
ValueCountFrequency (%)
2 10
55.6%
1 7
38.9%
3 1
 
5.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 743
97.6%
Common 18
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
243
32.7%
25
 
3.4%
25
 
3.4%
15
 
2.0%
15
 
2.0%
11
 
1.5%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (154) 371
49.9%
Common
ValueCountFrequency (%)
2 10
55.6%
1 7
38.9%
3 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 743
97.6%
ASCII 18
 
2.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
243
32.7%
25
 
3.4%
25
 
3.4%
15
 
2.0%
15
 
2.0%
11
 
1.5%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (154) 371
49.9%
ASCII
ValueCountFrequency (%)
2 10
55.6%
1 7
38.9%
3 1
 
5.6%

소재지
Categorical

Distinct12
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
경상남도 거창군 가조면
30 
경상남도 거창군 신원면
23 
경상남도 거창군 고제면
23 
경상남도 거창군 남하면
21 
경상남도 거창군 북상면
21 
Other values (7)
119 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도 거창군 남하면
2nd row경상남도 거창군 남하면
3rd row경상남도 거창군 거창읍
4th row경상남도 거창군 남상면
5th row경상남도 거창군 남상면

Common Values

ValueCountFrequency (%)
경상남도 거창군 가조면 30
12.7%
경상남도 거창군 신원면 23
9.7%
경상남도 거창군 고제면 23
9.7%
경상남도 거창군 남하면 21
8.9%
경상남도 거창군 북상면 21
8.9%
경상남도 거창군 거창읍 20
8.4%
경상남도 거창군 가북면 20
8.4%
경상남도 거창군 웅양면 20
8.4%
경상남도 거창군 남상면 17
7.2%
경상남도 거창군 위천면 15
6.3%
Other values (2) 27
11.4%

Length

2023-12-12T08:54:33.273449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경상남도 237
33.3%
거창군 237
33.3%
가조면 30
 
4.2%
신원면 23
 
3.2%
고제면 23
 
3.2%
남하면 21
 
3.0%
북상면 21
 
3.0%
거창읍 20
 
2.8%
가북면 20
 
2.8%
웅양면 20
 
2.8%
Other values (4) 59
 
8.3%

기점
Text

UNIQUE 

Distinct237
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T08:54:33.673902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length21.105485
Min length18

Characters and Unicode

Total characters5002
Distinct characters117
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique237 ?
Unique (%)100.0%

Sample

1st row경상남도 거창군 남하면 둔마리 744-2
2nd row경상남도 거창군 남하면 양항리 72
3rd row경상남도 거창군 거창읍 장팔리 1462-1
4th row경상남도 거창군 남상면 춘전리 311-3
5th row경상남도 거창군 남상면 무촌리 126
ValueCountFrequency (%)
경상남도 237
19.4%
거창군 237
19.4%
37
 
3.0%
가조면 30
 
2.5%
고제면 23
 
1.9%
신원면 22
 
1.8%
북상면 21
 
1.7%
남하면 21
 
1.7%
거창읍 20
 
1.6%
가북면 20
 
1.6%
Other values (322) 555
45.4%
2023-12-12T08:54:34.280431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
987
19.7%
290
 
5.8%
280
 
5.6%
259
 
5.2%
258
 
5.2%
252
 
5.0%
244
 
4.9%
240
 
4.8%
237
 
4.7%
217
 
4.3%
Other values (107) 1738
34.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3108
62.1%
Space Separator 987
 
19.7%
Decimal Number 827
 
16.5%
Dash Punctuation 80
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
290
 
9.3%
280
 
9.0%
259
 
8.3%
258
 
8.3%
252
 
8.1%
244
 
7.9%
240
 
7.7%
237
 
7.6%
217
 
7.0%
80
 
2.6%
Other values (95) 751
24.2%
Decimal Number
ValueCountFrequency (%)
1 188
22.7%
2 102
12.3%
6 83
10.0%
3 81
9.8%
4 71
 
8.6%
8 68
 
8.2%
5 67
 
8.1%
9 62
 
7.5%
7 56
 
6.8%
0 49
 
5.9%
Space Separator
ValueCountFrequency (%)
987
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3108
62.1%
Common 1894
37.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
290
 
9.3%
280
 
9.0%
259
 
8.3%
258
 
8.3%
252
 
8.1%
244
 
7.9%
240
 
7.7%
237
 
7.6%
217
 
7.0%
80
 
2.6%
Other values (95) 751
24.2%
Common
ValueCountFrequency (%)
987
52.1%
1 188
 
9.9%
2 102
 
5.4%
6 83
 
4.4%
3 81
 
4.3%
- 80
 
4.2%
4 71
 
3.7%
8 68
 
3.6%
5 67
 
3.5%
9 62
 
3.3%
Other values (2) 105
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3108
62.1%
ASCII 1894
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
987
52.1%
1 188
 
9.9%
2 102
 
5.4%
6 83
 
4.4%
3 81
 
4.3%
- 80
 
4.2%
4 71
 
3.7%
8 68
 
3.6%
5 67
 
3.5%
9 62
 
3.3%
Other values (2) 105
 
5.5%
Hangul
ValueCountFrequency (%)
290
 
9.3%
280
 
9.0%
259
 
8.3%
258
 
8.3%
252
 
8.1%
244
 
7.9%
240
 
7.7%
237
 
7.6%
217
 
7.0%
80
 
2.6%
Other values (95) 751
24.2%

종점
Text

Distinct235
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T08:54:34.677557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length21.49789
Min length18

Characters and Unicode

Total characters5095
Distinct characters119
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)98.3%

Sample

1st row경상남도 거창군 남하면 양항리 963-2
2nd row경상남도 거창군 남하면 양항리 1121
3rd row경상남도 거창군 거창읍 대평리 1452-8
4th row경상남도 거창군 남상면 대산리 101-2
5th row경상남도 거창군 남상면 무촌리 1768
ValueCountFrequency (%)
경상남도 237
20.0%
거창군 236
19.9%
가조면 30
 
2.5%
거창읍 23
 
1.9%
고제면 22
 
1.9%
남하면 22
 
1.9%
신원면 22
 
1.9%
북상면 20
 
1.7%
가북면 19
 
1.6%
웅양면 19
 
1.6%
Other values (319) 537
45.2%
2023-12-12T08:54:35.350188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
959
18.8%
290
 
5.7%
280
 
5.5%
262
 
5.1%
260
 
5.1%
253
 
5.0%
244
 
4.8%
239
 
4.7%
237
 
4.7%
1 223
 
4.4%
Other values (109) 1848
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3066
60.2%
Space Separator 959
 
18.8%
Decimal Number 934
 
18.3%
Dash Punctuation 136
 
2.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
290
 
9.5%
280
 
9.1%
262
 
8.5%
260
 
8.5%
253
 
8.3%
244
 
8.0%
239
 
7.8%
237
 
7.7%
214
 
7.0%
52
 
1.7%
Other values (97) 735
24.0%
Decimal Number
ValueCountFrequency (%)
1 223
23.9%
2 107
11.5%
3 101
10.8%
4 90
9.6%
7 79
 
8.5%
6 72
 
7.7%
8 71
 
7.6%
5 70
 
7.5%
0 63
 
6.7%
9 58
 
6.2%
Space Separator
ValueCountFrequency (%)
959
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 136
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3066
60.2%
Common 2029
39.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
290
 
9.5%
280
 
9.1%
262
 
8.5%
260
 
8.5%
253
 
8.3%
244
 
8.0%
239
 
7.8%
237
 
7.7%
214
 
7.0%
52
 
1.7%
Other values (97) 735
24.0%
Common
ValueCountFrequency (%)
959
47.3%
1 223
 
11.0%
- 136
 
6.7%
2 107
 
5.3%
3 101
 
5.0%
4 90
 
4.4%
7 79
 
3.9%
6 72
 
3.5%
8 71
 
3.5%
5 70
 
3.4%
Other values (2) 121
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3066
60.2%
ASCII 2029
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
959
47.3%
1 223
 
11.0%
- 136
 
6.7%
2 107
 
5.3%
3 101
 
5.0%
4 90
 
4.4%
7 79
 
3.9%
6 72
 
3.5%
8 71
 
3.5%
5 70
 
3.4%
Other values (2) 121
 
6.0%
Hangul
ValueCountFrequency (%)
290
 
9.5%
280
 
9.1%
262
 
8.5%
260
 
8.5%
253
 
8.3%
244
 
8.0%
239
 
7.8%
237
 
7.7%
214
 
7.0%
52
 
1.7%
Other values (97) 735
24.0%

연장(m)
Real number (ℝ)

HIGH CORRELATION 

Distinct203
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2169.6709
Minimum441
Maximum30120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T08:54:35.526175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum441
5-th percentile549
Q1796
median1160
Q32036
95-th percentile5732
Maximum30120
Range29679
Interquartile range (IQR)1240

Descriptive statistics

Standard deviation3712.9147
Coefficient of variation (CV)1.7112801
Kurtosis36.585001
Mean2169.6709
Median Absolute Deviation (MAD)481
Skewness5.6671206
Sum514212
Variance13785736
MonotonicityNot monotonic
2023-12-12T08:54:35.675533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
600 4
 
1.7%
1400 4
 
1.7%
800 3
 
1.3%
950 3
 
1.3%
650 3
 
1.3%
700 3
 
1.3%
1000 3
 
1.3%
1013 2
 
0.8%
743 2
 
0.8%
3200 2
 
0.8%
Other values (193) 208
87.8%
ValueCountFrequency (%)
441 1
0.4%
465 1
0.4%
469 1
0.4%
473 1
0.4%
475 1
0.4%
500 1
0.4%
520 1
0.4%
531 2
0.8%
537 1
0.4%
543 1
0.4%
ValueCountFrequency (%)
30120 1
0.4%
29370 1
0.4%
28220 1
0.4%
19000 1
0.4%
13410 1
0.4%
11500 1
0.4%
10110 1
0.4%
10000 1
0.4%
7110 1
0.4%
7000 1
0.4%

관리주체(관리청)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
거창군
195 
경상남도(거창군)
42 

Length

Max length9
Median length3
Mean length4.0632911
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도(거창군)
2nd row경상남도(거창군)
3rd row경상남도(거창군)
4th row경상남도(거창군)
5th row경상남도(거창군)

Common Values

ValueCountFrequency (%)
거창군 195
82.3%
경상남도(거창군) 42
 
17.7%

Length

2023-12-12T08:54:35.865733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:54:35.996035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거창군 195
82.3%
경상남도(거창군 42
 
17.7%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2023-07-05 00:00:00
Maximum2023-07-05 00:00:00
2023-12-12T08:54:36.076458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:54:36.163403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T08:54:31.601894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:54:36.229733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분소재지연장(m)관리주체(관리청)
구분1.0000.0000.7231.000
소재지0.0001.0000.0000.000
연장(m)0.7230.0001.0000.723
관리주체(관리청)1.0000.0000.7231.000
2023-12-12T08:54:36.329274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분소재지관리주체(관리청)
구분1.0000.0000.985
소재지0.0001.0000.000
관리주체(관리청)0.9850.0001.000
2023-12-12T08:54:36.407452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연장(m)구분소재지관리주체(관리청)
연장(m)1.0000.7740.0000.774
구분0.7741.0000.0000.985
소재지0.0000.0001.0000.000
관리주체(관리청)0.7740.9850.0001.000

Missing values

2023-12-12T08:54:31.733263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:54:31.842935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분명칭소재지기점종점연장(m)관리주체(관리청)데이터기준일자
0지방하천양항천경상남도 거창군 남하면경상남도 거창군 남하면 둔마리 744-2경상남도 거창군 남하면 양항리 963-24940경상남도(거창군)2023-07-05
1지방하천대곡천경상남도 거창군 남하면경상남도 거창군 남하면 양항리 72경상남도 거창군 남하면 양항리 11212330경상남도(거창군)2023-07-05
2지방하천정장천경상남도 거창군 거창읍경상남도 거창군 거창읍 장팔리 1462-1경상남도 거창군 거창읍 대평리 1452-84430경상남도(거창군)2023-07-05
3지방하천대산천경상남도 거창군 남상면경상남도 거창군 남상면 춘전리 311-3경상남도 거창군 남상면 대산리 101-211500경상남도(거창군)2023-07-05
4지방하천무촌천경상남도 거창군 남상면경상남도 거창군 남상면 무촌리 126경상남도 거창군 남상면 무촌리 17682070경상남도(거창군)2023-07-05
5지방하천전척천경상남도 거창군 남상면경상남도 거창군 남상면 전척리 928경상남도 거창군 남상면 전척리 3212830경상남도(거창군)2023-07-05
6지방하천임불천경상남도 거창군 남상면경상남도 거창군 남상면 임불리 2178경상남도 거창군 남상면 임불리 713020경상남도(거창군)2023-07-05
7지방하천가천천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 431경상남도 거창군 남하면 대야리 958-129370경상남도(거창군)2023-07-05
8지방하천우혜천경상남도 거창군 가북면경상남도 거창군 가북면 우혜리 885경상남도 거창군 가북면 우혜리 17322530경상남도(거창군)2023-07-05
9지방하천좌가천경상남도 거창군 가북면경상남도 거창군 가북면 중촌리 571경상남도 거창군 가북면 해평리 317-210110경상남도(거창군)2023-07-05
구분명칭소재지기점종점연장(m)관리주체(관리청)데이터기준일자
227소하천호암천경상남도 거창군 가북면경상남도 거창군 가북면 박암리 1052-2경상남도 거창군 가북면 박암리 486787거창군2023-07-05
228소하천윤오천경상남도 거창군 가북면경상남도 거창군 가북면 몽석리 1378경상남도 거창군 가북면 몽석리 1740-2667거창군2023-07-05
229소하천명동천경상남도 거창군 가북면경상남도 거창군 가북면 몽석리 719경상남도 거창군 가북면 몽석리 119-41603거창군2023-07-05
230소하천더무강천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 1667경상남도 거창군 가북면 용암리 1800802거창군2023-07-05
231소하천개금천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 294경상남도 거창군 가북면 용암리 428796거창군2023-07-05
232소하천상개금천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 43경상남도 거창군 가북면 용암리 4281044거창군2023-07-05
233소하천고비천경상남도 거창군 가북면경상남도 거창군 가북면 중촌리 761경상남도 거창군 가북면 중촌리 17762466거창군2023-07-05
234소하천회남천경상남도 거창군 가북면경상남도 거창군 가북면 해평리 1671경상남도 거창군 가북면 해평리 1465-11490거창군2023-07-05
235소하천연곡천경상남도 거창군 가북면경상남도 거창군 가북면 해평리 1276경상남도 거창군 가북면 해평리 854696거창군2023-07-05
236소하천정봉천경상남도 거창군 가북면경상남도 거창군 가북면 용산리 1054경상남도 거창군 가북면 용산리 3442036거창군2023-07-05