Overview

Dataset statistics

Number of variables8
Number of observations237
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.2 KiB
Average record size in memory65.6 B

Variable types

Categorical3
Text3
Numeric1
DateTime1

Dataset

Description경상남도 거창군 하천에 대한 데이터로 하천구분(지방하천, 소하천), 하천명, 소재지, 기점, 종점, 길이 항목을 제공합니다.
Author경상남도 거창군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15061749

Alerts

데이터기준일자 has constant value ""Constant
관리주체(관리청) is highly overall correlated with 연장(m) and 1 other fieldsHigh correlation
구분 is highly overall correlated with 연장(m) and 1 other fieldsHigh correlation
연장(m) is highly overall correlated with 구분 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-10 23:30:30.551430
Analysis finished2023-12-10 23:30:31.139068
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
소하천
195 
지방하천
42 

Length

Max length4
Median length3
Mean length3.1772152
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지방하천
2nd row지방하천
3rd row지방하천
4th row지방하천
5th row지방하천

Common Values

ValueCountFrequency (%)
소하천 195
82.3%
지방하천 42
 
17.7%

Length

2023-12-11T08:30:31.191653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:30:31.264779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소하천 195
82.3%
지방하천 42
 
17.7%

명칭
Text

Distinct227
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T08:30:31.514761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.1814346
Min length2

Characters and Unicode

Total characters754
Distinct characters166
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)92.8%

Sample

1st row양항천
2nd row대곡천
3rd row정장천
4th row대산천
5th row무촌천
ValueCountFrequency (%)
신기천 4
 
1.7%
당산천 3
 
1.3%
양지천 2
 
0.8%
황산천 2
 
0.8%
동호천 2
 
0.8%
산포천 2
 
0.8%
웅곡천 2
 
0.8%
월포천 1
 
0.4%
아주천 1
 
0.4%
안흥천 1
 
0.4%
Other values (217) 217
91.6%
2023-12-11T08:30:31.898531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
243
32.2%
25
 
3.3%
25
 
3.3%
15
 
2.0%
15
 
2.0%
12
 
1.6%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (156) 381
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 743
98.5%
Decimal Number 11
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
243
32.7%
25
 
3.4%
25
 
3.4%
15
 
2.0%
15
 
2.0%
12
 
1.6%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (154) 370
49.8%
Decimal Number
ValueCountFrequency (%)
2 7
63.6%
1 4
36.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 743
98.5%
Common 11
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
243
32.7%
25
 
3.4%
25
 
3.4%
15
 
2.0%
15
 
2.0%
12
 
1.6%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (154) 370
49.8%
Common
ValueCountFrequency (%)
2 7
63.6%
1 4
36.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 743
98.5%
ASCII 11
 
1.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
243
32.7%
25
 
3.4%
25
 
3.4%
15
 
2.0%
15
 
2.0%
12
 
1.6%
11
 
1.5%
10
 
1.3%
9
 
1.2%
8
 
1.1%
Other values (154) 370
49.8%
ASCII
ValueCountFrequency (%)
2 7
63.6%
1 4
36.4%

소재지
Categorical

Distinct12
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
경상남도 거창군 가조면
30 
경상남도 거창군 신원면
23 
경상남도 거창군 고제면
23 
경상남도 거창군 남하면
21 
경상남도 거창군 북상면
21 
Other values (7)
119 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도 거창군 남하면
2nd row경상남도 거창군 남하면
3rd row경상남도 거창군 거창읍
4th row경상남도 거창군 남상면
5th row경상남도 거창군 남상면

Common Values

ValueCountFrequency (%)
경상남도 거창군 가조면 30
12.7%
경상남도 거창군 신원면 23
9.7%
경상남도 거창군 고제면 23
9.7%
경상남도 거창군 남하면 21
8.9%
경상남도 거창군 북상면 21
8.9%
경상남도 거창군 거창읍 20
8.4%
경상남도 거창군 가북면 20
8.4%
경상남도 거창군 웅양면 20
8.4%
경상남도 거창군 남상면 17
7.2%
경상남도 거창군 위천면 15
6.3%
Other values (2) 27
11.4%

Length

2023-12-11T08:30:32.016196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경상남도 237
33.3%
거창군 237
33.3%
가조면 30
 
4.2%
신원면 23
 
3.2%
고제면 23
 
3.2%
남하면 21
 
3.0%
북상면 21
 
3.0%
거창읍 20
 
2.8%
가북면 20
 
2.8%
웅양면 20
 
2.8%
Other values (4) 59
 
8.3%

기점
Text

Distinct236
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T08:30:32.359712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length20.957806
Min length17

Characters and Unicode

Total characters4967
Distinct characters120
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)99.2%

Sample

1st row경상남도 거창군 남하면 둔마리 744-2
2nd row경상남도 거창군 남하면 양항리 72
3rd row경상남도 거창군 거창읍 장팔리 1462-1
4th row경상남도 거창군 남상면 춘전리 311-3
5th row경상남도 거창군 남상면 무촌리 126
ValueCountFrequency (%)
경상남도 237
19.8%
거창군 237
19.8%
가조면 30
 
2.5%
고제면 23
 
1.9%
신원면 22
 
1.8%
북상면 21
 
1.8%
남하면 21
 
1.8%
웅양면 20
 
1.7%
거창읍 20
 
1.7%
가북면 20
 
1.7%
Other values (320) 544
45.5%
2023-12-11T08:30:32.947115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
958
19.3%
290
 
5.8%
280
 
5.6%
259
 
5.2%
258
 
5.2%
251
 
5.1%
244
 
4.9%
240
 
4.8%
237
 
4.8%
217
 
4.4%
Other values (110) 1733
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3108
62.6%
Space Separator 958
 
19.3%
Decimal Number 824
 
16.6%
Dash Punctuation 77
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
290
 
9.3%
280
 
9.0%
259
 
8.3%
258
 
8.3%
251
 
8.1%
244
 
7.9%
240
 
7.7%
237
 
7.6%
217
 
7.0%
81
 
2.6%
Other values (98) 751
24.2%
Decimal Number
ValueCountFrequency (%)
1 187
22.7%
2 99
12.0%
6 83
10.1%
3 81
9.8%
8 68
 
8.3%
4 66
 
8.0%
5 63
 
7.6%
9 62
 
7.5%
7 61
 
7.4%
0 54
 
6.6%
Space Separator
ValueCountFrequency (%)
958
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3108
62.6%
Common 1859
37.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
290
 
9.3%
280
 
9.0%
259
 
8.3%
258
 
8.3%
251
 
8.1%
244
 
7.9%
240
 
7.7%
237
 
7.6%
217
 
7.0%
81
 
2.6%
Other values (98) 751
24.2%
Common
ValueCountFrequency (%)
958
51.5%
1 187
 
10.1%
2 99
 
5.3%
6 83
 
4.5%
3 81
 
4.4%
- 77
 
4.1%
8 68
 
3.7%
4 66
 
3.6%
5 63
 
3.4%
9 62
 
3.3%
Other values (2) 115
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3108
62.6%
ASCII 1859
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
958
51.5%
1 187
 
10.1%
2 99
 
5.3%
6 83
 
4.5%
3 81
 
4.4%
- 77
 
4.1%
8 68
 
3.7%
4 66
 
3.6%
5 63
 
3.4%
9 62
 
3.3%
Other values (2) 115
 
6.2%
Hangul
ValueCountFrequency (%)
290
 
9.3%
280
 
9.0%
259
 
8.3%
258
 
8.3%
251
 
8.1%
244
 
7.9%
240
 
7.7%
237
 
7.6%
217
 
7.0%
81
 
2.6%
Other values (98) 751
24.2%

종점
Text

Distinct235
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T08:30:33.326598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length21.371308
Min length18

Characters and Unicode

Total characters5065
Distinct characters121
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)98.3%

Sample

1st row경상남도 거창군 남하면 양항리 963-2
2nd row경상남도 거창군 남하면 양항리 1121
3rd row경상남도 거창군 거창읍 대평리 1452-8
4th row경상남도 거창군 남상면 대산리 101-2
5th row경상남도 거창군 남상면 무촌리 1768
ValueCountFrequency (%)
경상남도 237
20.0%
거창군 236
19.9%
가조면 30
 
2.5%
거창읍 23
 
1.9%
신원면 22
 
1.9%
고제면 22
 
1.9%
남하면 22
 
1.9%
북상면 20
 
1.7%
가북면 19
 
1.6%
웅양면 19
 
1.6%
Other values (321) 534
45.1%
2023-12-11T08:30:33.863066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
947
18.7%
289
 
5.7%
280
 
5.5%
262
 
5.2%
260
 
5.1%
252
 
5.0%
244
 
4.8%
239
 
4.7%
237
 
4.7%
1 221
 
4.4%
Other values (111) 1834
36.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3068
60.6%
Space Separator 947
 
18.7%
Decimal Number 920
 
18.2%
Dash Punctuation 130
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
289
 
9.4%
280
 
9.1%
262
 
8.5%
260
 
8.5%
252
 
8.2%
244
 
8.0%
239
 
7.8%
237
 
7.7%
214
 
7.0%
52
 
1.7%
Other values (99) 739
24.1%
Decimal Number
ValueCountFrequency (%)
1 221
24.0%
2 103
11.2%
4 96
10.4%
3 85
 
9.2%
7 81
 
8.8%
6 78
 
8.5%
5 68
 
7.4%
8 67
 
7.3%
0 61
 
6.6%
9 60
 
6.5%
Space Separator
ValueCountFrequency (%)
947
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3068
60.6%
Common 1997
39.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
289
 
9.4%
280
 
9.1%
262
 
8.5%
260
 
8.5%
252
 
8.2%
244
 
8.0%
239
 
7.8%
237
 
7.7%
214
 
7.0%
52
 
1.7%
Other values (99) 739
24.1%
Common
ValueCountFrequency (%)
947
47.4%
1 221
 
11.1%
- 130
 
6.5%
2 103
 
5.2%
4 96
 
4.8%
3 85
 
4.3%
7 81
 
4.1%
6 78
 
3.9%
5 68
 
3.4%
8 67
 
3.4%
Other values (2) 121
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3068
60.6%
ASCII 1997
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
947
47.4%
1 221
 
11.1%
- 130
 
6.5%
2 103
 
5.2%
4 96
 
4.8%
3 85
 
4.3%
7 81
 
4.1%
6 78
 
3.9%
5 68
 
3.4%
8 67
 
3.4%
Other values (2) 121
 
6.1%
Hangul
ValueCountFrequency (%)
289
 
9.4%
280
 
9.1%
262
 
8.5%
260
 
8.5%
252
 
8.2%
244
 
8.0%
239
 
7.8%
237
 
7.7%
214
 
7.0%
52
 
1.7%
Other values (99) 739
24.1%

관리주체(관리청)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
거창군
195 
경상남도(거창군)
42 

Length

Max length9
Median length3
Mean length4.0632911
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도(거창군)
2nd row경상남도(거창군)
3rd row경상남도(거창군)
4th row경상남도(거창군)
5th row경상남도(거창군)

Common Values

ValueCountFrequency (%)
거창군 195
82.3%
경상남도(거창군) 42
 
17.7%

Length

2023-12-11T08:30:33.987983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:30:34.069929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거창군 195
82.3%
경상남도(거창군 42
 
17.7%

연장(m)
Real number (ℝ)

HIGH CORRELATION 

Distinct128
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2171.5696
Minimum500
Maximum30120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T08:30:34.164694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum500
5-th percentile550
Q1800
median1150
Q32050
95-th percentile5732
Maximum30120
Range29620
Interquartile range (IQR)1250

Descriptive statistics

Standard deviation3713.7167
Coefficient of variation (CV)1.7101532
Kurtosis36.539051
Mean2171.5696
Median Absolute Deviation (MAD)470
Skewness5.6619307
Sum514662
Variance13791692
MonotonicityNot monotonic
2023-12-11T08:30:34.288150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
800 11
 
4.6%
700 10
 
4.2%
500 8
 
3.4%
900 6
 
2.5%
1400 6
 
2.5%
750 6
 
2.5%
1050 6
 
2.5%
1100 6
 
2.5%
600 6
 
2.5%
1150 5
 
2.1%
Other values (118) 167
70.5%
ValueCountFrequency (%)
500 8
3.4%
530 1
 
0.4%
541 1
 
0.4%
545 1
 
0.4%
550 5
2.1%
551 1
 
0.4%
575 1
 
0.4%
600 6
2.5%
601 1
 
0.4%
616 1
 
0.4%
ValueCountFrequency (%)
30120 1
0.4%
29370 1
0.4%
28220 1
0.4%
19000 1
0.4%
13410 1
0.4%
11500 1
0.4%
10110 1
0.4%
10000 1
0.4%
7110 1
0.4%
7000 1
0.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2021-06-30 00:00:00
Maximum2021-06-30 00:00:00
2023-12-11T08:30:34.377535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:30:34.469856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-11T08:30:30.907839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:30:34.553788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분소재지관리주체(관리청)연장(m)
구분1.0000.0001.0000.722
소재지0.0001.0000.0000.000
관리주체(관리청)1.0000.0001.0000.722
연장(m)0.7220.0000.7221.000
2023-12-11T08:30:34.641232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리주체(관리청)소재지구분
관리주체(관리청)1.0000.0000.985
소재지0.0001.0000.000
구분0.9850.0001.000
2023-12-11T08:30:34.721215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연장(m)구분소재지관리주체(관리청)
연장(m)1.0000.7740.0000.774
구분0.7741.0000.0000.985
소재지0.0000.0001.0000.000
관리주체(관리청)0.7740.9850.0001.000

Missing values

2023-12-11T08:30:31.003858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:30:31.100707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분명칭소재지기점종점관리주체(관리청)연장(m)데이터기준일자
0지방하천양항천경상남도 거창군 남하면경상남도 거창군 남하면 둔마리 744-2경상남도 거창군 남하면 양항리 963-2경상남도(거창군)49402021-06-30
1지방하천대곡천경상남도 거창군 남하면경상남도 거창군 남하면 양항리 72경상남도 거창군 남하면 양항리 1121경상남도(거창군)23302021-06-30
2지방하천정장천경상남도 거창군 거창읍경상남도 거창군 거창읍 장팔리 1462-1경상남도 거창군 거창읍 대평리 1452-8경상남도(거창군)44302021-06-30
3지방하천대산천경상남도 거창군 남상면경상남도 거창군 남상면 춘전리 311-3경상남도 거창군 남상면 대산리 101-2경상남도(거창군)115002021-06-30
4지방하천무촌천경상남도 거창군 남상면경상남도 거창군 남상면 무촌리 126경상남도 거창군 남상면 무촌리 1768경상남도(거창군)20702021-06-30
5지방하천전척천경상남도 거창군 남상면경상남도 거창군 남상면 전척리 928경상남도 거창군 남상면 전척리 321경상남도(거창군)28302021-06-30
6지방하천임불천경상남도 거창군 남상면경상남도 거창군 남상면 임불리 2178경상남도 거창군 남상면 임불리 71경상남도(거창군)30202021-06-30
7지방하천가천천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 431경상남도 거창군 남하면 대야리 958-1경상남도(거창군)293702021-06-30
8지방하천우혜천경상남도 거창군 가북면경상남도 거창군 가북면 우혜리 885경상남도 거창군 가북면 우혜리 1732경상남도(거창군)25302021-06-30
9지방하천좌가천경상남도 거창군 가북면경상남도 거창군 가북면 중촌리 571경상남도 거창군 가북면 해평리 317-2경상남도(거창군)101102021-06-30
구분명칭소재지기점종점관리주체(관리청)연장(m)데이터기준일자
227소하천윤오천경상남도 거창군 가북면경상남도 거창군 가북면 몽석리1378경상남도 거창군 가북면 몽석리 1742거창군7002021-06-30
228소하천명동천경상남도 거창군 가북면경상남도 거창군 가북면 몽석리 650경상남도 거창군 가북면 몽석리 118거창군16002021-06-30
229소하천더무강천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 1667경상남도 거창군 가북면 용암리 1800거창군7802021-06-30
230소하천개금천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 294경상남도 거창군 가북면 용암리 428거창군8002021-06-30
231소하천상개금천경상남도 거창군 가북면경상남도 거창군 가북면 용암리 산35경상남도 거창군 가북면 용암리 428거창군10802021-06-30
232소하천고비천경상남도 거창군 가북면경상남도 거창군 가북면 중촌리 761경상남도 거창군 가북면 중촌리 1776거창군25002021-06-30
233소하천회남천경상남도 거창군 가북면경상남도 거창군 가북면 해평리 1671경상남도 거창군 가북면 해평리 1465-1거창군16002021-06-30
234소하천연곡천경상남도 거창군 가북면경상남도 거창군 가북면 해평리 1276경상남도 거창군 가북면 해평리 854거창군6302021-06-30
235소하천정봉천경상남도 거창군 가북면경상남도 거창군 가북면 용산리 1054경상남도 거창군 가북면 용산리 344거창군20002021-06-30
236소하천율리천경상남도 거창군 가북면경상남도 거창군 가북면 용산리 1307-1경상남도 거창군 가북면 용산리 1394거창군8002021-06-30