Overview

Dataset statistics

Number of variables: 7
Number of observations: 391
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.9 KiB
Average record size in memory: 57.3 B
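
As a quick sanity check, the derived figures above can be recomputed from the reported counts. The sketch below is only an illustration: the variable names are my own, and because the 21.9 KiB total is itself rounded, the result only approximately matches the reported 57.3 B.

```python
# Values copied from the dataset statistics above.
n_variables = 7
n_observations = 391
missing_cells = 0
total_size_kib = 21.9  # already rounded in the report

# Total number of cells is columns times rows.
total_cells = n_variables * n_observations
# Share of missing cells, as a percentage.
missing_pct = 100 * missing_cells / total_cells
# Average bytes per record (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"cells={total_cells}, missing={missing_pct:.1f}%, "
      f"avg record={avg_record_bytes:.2f} B")
```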

Variable types

Text: 4
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

Description: Small-stream (소하천) designation data managed by Boseong-gun (보성군), covering each stream's serial number, name, start point, end point, length, and related attributes.
URL: https://www.data.go.kr/data/15034527/fileData.do

Alerts

기준일자 has constant value "2023-06-15" (Constant)
연번 has unique values (Unique)
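
The two alerts can be illustrated with a simplified check: a column is flagged Constant when it has one distinct value and Unique when every value is distinct. This is a sketch under that assumption, not ydata-profiling's actual implementation; `column_alerts` is a hypothetical helper, and the rows are a two-column excerpt of the Sample section.

```python
def column_alerts(rows):
    """Return {column: alert} for columns that are constant or fully unique."""
    alerts = {}
    n_rows = len(rows)
    for col in rows[0].keys():
        distinct = {row[col] for row in rows}
        if len(distinct) == 1:
            alerts[col] = "Constant"
        elif len(distinct) == n_rows:
            alerts[col] = "Unique"
    return alerts

# Excerpt of the first three sample rows (only two of the seven columns).
rows = [
    {"연번": "15-12-001", "기준일자": "2023-06-15"},
    {"연번": "15-12-002", "기준일자": "2023-06-15"},
    {"연번": "15-12-003", "기준일자": "2023-06-15"},
]
alerts = column_alerts(rows)
print(alerts)
```

On an excerpt this small, most columns would look either constant or unique, so the check is only meaningful on the full 391 rows.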

Reproduction

Analysis started: 2023-12-12 14:41:35.902956
Analysis finished: 2023-12-12 14:41:36.683883
Duration: 0.78 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
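
A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the exact configuration used here (that lives in config.json): the function name, report title, and file names are hypothetical, and it assumes `pandas` and `ydata-profiling` are installed.

```python
def build_report(csv_path, out_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the HTML report to out_path."""
    # Imported lazily so the function can be defined without the packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    # The title is a hypothetical label, not taken from the original report.
    profile = ProfileReport(df, title="Boseong-gun small streams")
    profile.to_file(out_path)
    return out_path

# Hypothetical usage; the file name and the cp949 encoding are assumptions:
# build_report("boseong_small_streams.csv", encoding="cp949")
```

Default settings may infer variable types differently from this report, so the output is not guaranteed to match it section for section.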

Variables

연번
Text

UNIQUE 

Distinct: 391
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Characters and Unicode

Total characters: 3519
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
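
The category counts reported for this variable can be approximated with Python's standard `unicodedata` module. The sketch below is an illustration rather than the profiler's own code; `category_counts` is a hypothetical helper, and it covers only the general category (the standard library does not expose script or block).

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters by Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            # e.g. "Nd" = Decimal Number, "Pd" = Dash Punctuation
            counts[unicodedata.category(ch)] += 1
    return counts

# The five sample values shown for this variable.
sample = ["15-12-001", "15-12-002", "15-12-003", "15-12-004", "15-12-005"]
counts = category_counts(sample)
print(dict(counts))
```

Each value has seven digits and two hyphens, which over 391 rows gives the 2737 Decimal Number and 782 Dash Punctuation characters listed in this section.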

Unique

Unique: 391
Unique (%): 100.0%

Sample

1st row: 15-12-001
2nd row: 15-12-002
3rd row: 15-12-003
4th row: 15-12-004
5th row: 15-12-005

Value | Count | Frequency (%)
15-12-001 | 1 | 0.3%
15-12-295 | 1 | 0.3%
15-12-268 | 1 | 0.3%
15-12-267 | 1 | 0.3%
15-12-266 | 1 | 0.3%
15-12-265 | 1 | 0.3%
15-12-264 | 1 | 0.3%
15-12-263 | 1 | 0.3%
15-12-262 | 1 | 0.3%
15-12-261 | 1 | 0.3%
Other values (381) | 381 | 97.4%

Most occurring characters

Value | Count | Frequency (%)
1 | 962 | 27.3%
- | 782 | 22.2%
2 | 570 | 16.2%
5 | 470 | 13.4%
0 | 177 | 5.0%
3 | 171 | 4.9%
8 | 79 | 2.2%
4 | 79 | 2.2%
6 | 79 | 2.2%
7 | 79 | 2.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2737 | 77.8%
Dash Punctuation | 782 | 22.2%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 962 | 35.1%
2 | 570 | 20.8%
5 | 470 | 17.2%
0 | 177 | 6.5%
3 | 171 | 6.2%
8 | 79 | 2.9%
4 | 79 | 2.9%
6 | 79 | 2.9%
7 | 79 | 2.9%
9 | 71 | 2.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 782 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3519 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 962 | 27.3%
- | 782 | 22.2%
2 | 570 | 16.2%
5 | 470 | 13.4%
0 | 177 | 5.0%
3 | 171 | 4.9%
8 | 79 | 2.2%
4 | 79 | 2.2%
6 | 79 | 2.2%
7 | 79 | 2.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3519 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 962 | 27.3%
- | 782 | 22.2%
2 | 570 | 16.2%
5 | 470 | 13.4%
0 | 177 | 5.0%
3 | 171 | 4.9%
8 | 79 | 2.2%
4 | 79 | 2.2%
6 | 79 | 2.2%
7 | 79 | 2.2%

수계(水系)명
Categorical

Distinct: 3
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

섬진강: 210
득량만: 101
여자만: 80

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 섬진강
2nd row: 섬진강
3rd row: 섬진강
4th row: 섬진강
5th row: 섬진강

Common Values

Value | Count | Frequency (%)
섬진강 | 210 | 53.7%
득량만 | 101 | 25.8%
여자만 | 80 | 20.5%
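
The frequency percentages above follow directly from the counts divided by the 391 observations. A minimal sketch of that calculation, using a hypothetical helper `frequency_table` and a column rebuilt from the reported counts rather than the original file:

```python
from collections import Counter

def frequency_table(values):
    """Return (value, count, percent) tuples sorted by descending count."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(v, c, round(100 * c / total, 1)) for v, c in counts.most_common()]

# Rebuild the column from the counts reported above.
values = ["섬진강"] * 210 + ["득량만"] * 101 + ["여자만"] * 80
table = frequency_table(values)
for value, count, pct in table:
    print(f"{value}\t{count}\t{pct}%")
```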

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
섬진강 | 210 | 53.7%
득량만 | 101 | 25.8%
여자만 | 80 | 20.5%

명칭
Text

Distinct: 373
Distinct (%): 95.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 5
Median length: 3
Mean length: 3.1534527
Min length: 3

Characters and Unicode

Total characters: 1233
Distinct characters: 212
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 355
Unique (%): 90.8%

Sample

1st row: 평우천
2nd row: 건동천
3rd row: 오서천
4th row: 예동천
5th row: 부곡천

Value | Count | Frequency (%)
두곡천 | 2 | 0.5%
은곡천 | 2 | 0.5%
양지천 | 2 | 0.5%
회룡천 | 2 | 0.5%
신촌천 | 2 | 0.5%
내동천 | 2 | 0.5%
동교천 | 2 | 0.5%
문양천 | 2 | 0.5%
축내천 | 2 | 0.5%
쌍가마천 | 2 | 0.5%
Other values (363) | 371 | 94.9%

Most occurring characters

ValueCountFrequency (%)
400
32.4%
43
 
3.5%
32
 
2.6%
23
 
1.9%
19
 
1.5%
17
 
1.4%
15
 
1.2%
15
 
1.2%
13
 
1.1%
13
 
1.1%
Other values (202) 643
52.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1223 | 99.2%
Decimal Number | 10 | 0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
400
32.7%
43
 
3.5%
32
 
2.6%
23
 
1.9%
19
 
1.6%
17
 
1.4%
15
 
1.2%
15
 
1.2%
13
 
1.1%
13
 
1.1%
Other values (199) 633
51.8%
Decimal Number
Value | Count | Frequency (%)
1 | 5 | 50.0%
2 | 4 | 40.0%
3 | 1 | 10.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1223 | 99.2%
Common | 10 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
400
32.7%
43
 
3.5%
32
 
2.6%
23
 
1.9%
19
 
1.6%
17
 
1.4%
15
 
1.2%
15
 
1.2%
13
 
1.1%
13
 
1.1%
Other values (199) 633
51.8%
Common
Value | Count | Frequency (%)
1 | 5 | 50.0%
2 | 4 | 40.0%
3 | 1 | 10.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1223 | 99.2%
ASCII | 10 | 0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
400
32.7%
43
 
3.5%
32
 
2.6%
23
 
1.9%
19
 
1.6%
17
 
1.4%
15
 
1.2%
15
 
1.2%
13
 
1.1%
13
 
1.1%
Other values (199) 633
51.8%
ASCII
Value | Count | Frequency (%)
1 | 5 | 50.0%
2 | 4 | 40.0%
3 | 1 | 10.0%

시점
Text

Distinct: 390
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 36
Median length: 35
Mean length: 23.378517
Min length: 20

Characters and Unicode

Total characters: 9141
Distinct characters: 123
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 389
Unique (%): 99.5%

Sample

1st row: 전라남도 보성군 보성읍 쾌상리 337번지
2nd row: 전라남도 보성군 보성읍 쾌상리 1157번지
3rd row: 전라남도 보성군 보성읍 봉산리 산55-4번지
4th row: 전라남도 보성군 보성읍 옥암리 664번지
5th row: 전라남도 보성군 보성읍 보성리 493-3번지

Value | Count | Frequency (%)
전라남도 | 403 | 20.4%
보성군 | 403 | 20.4%
벌교읍 | 80 | 4.0%
율어면 | 39 | 2.0%
조성면 | 37 | 1.9%
득량면 | 36 | 1.8%
복내면 | 32 | 1.6%
보성읍 | 30 | 1.5%
웅치면 | 27 | 1.4%
회천면 | 26 | 1.3%
Other values (494) | 866 | 43.8%

Most occurring characters

ValueCountFrequency (%)
1588
17.4%
476
 
5.2%
434
 
4.7%
421
 
4.6%
409
 
4.5%
407
 
4.5%
407
 
4.5%
403
 
4.4%
397
 
4.3%
391
 
4.3%
Other values (113) 3808
41.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6091 | 66.6%
Space Separator | 1588 | 17.4%
Decimal Number | 1309 | 14.3%
Dash Punctuation | 153 | 1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
476
 
7.8%
434
 
7.1%
421
 
6.9%
409
 
6.7%
407
 
6.7%
407
 
6.7%
403
 
6.6%
397
 
6.5%
391
 
6.4%
391
 
6.4%
Other values (101) 1955
32.1%
Decimal Number
Value | Count | Frequency (%)
1 | 294 | 22.5%
2 | 141 | 10.8%
4 | 136 | 10.4%
7 | 128 | 9.8%
3 | 124 | 9.5%
5 | 117 | 8.9%
6 | 105 | 8.0%
8 | 92 | 7.0%
9 | 89 | 6.8%
0 | 83 | 6.3%

Space Separator
Value | Count | Frequency (%)
(space) | 1588 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 153 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6091 | 66.6%
Common | 3050 | 33.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
476
 
7.8%
434
 
7.1%
421
 
6.9%
409
 
6.7%
407
 
6.7%
407
 
6.7%
403
 
6.6%
397
 
6.5%
391
 
6.4%
391
 
6.4%
Other values (101) 1955
32.1%
Common
Value | Count | Frequency (%)
(space) | 1588 | 52.1%
1 | 294 | 9.6%
- | 153 | 5.0%
2 | 141 | 4.6%
4 | 136 | 4.5%
7 | 128 | 4.2%
3 | 124 | 4.1%
5 | 117 | 3.8%
6 | 105 | 3.4%
8 | 92 | 3.0%
Other values (2) | 172 | 5.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6091 | 66.6%
ASCII | 3050 | 33.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1588 | 52.1%
1 | 294 | 9.6%
- | 153 | 5.0%
2 | 141 | 4.6%
4 | 136 | 4.5%
7 | 128 | 4.2%
3 | 124 | 4.1%
5 | 117 | 3.8%
6 | 105 | 3.4%
8 | 92 | 3.0%
Other values (2) | 172 | 5.6%
Hangul
ValueCountFrequency (%)
476
 
7.8%
434
 
7.1%
421
 
6.9%
409
 
6.7%
407
 
6.7%
407
 
6.7%
403
 
6.6%
397
 
6.5%
391
 
6.4%
391
 
6.4%
Other values (101) 1955
32.1%

종점
Text

Distinct: 385
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 36
Median length: 35
Mean length: 24.202046
Min length: 20

Characters and Unicode

Total characters: 9463
Distinct characters: 123
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 379
Unique (%): 96.9%

Sample

1st row: 전라남도 보성군 보성읍 우산리 937-4번지
2nd row: 전라남도 보성군 보성읍 쾌상리 598번지
3rd row: 전라남도 보성군 보성읍 봉산리 1457-4번지
4th row: 전라남도 보성군 보성읍 옥암리 607번지
5th row: 전라남도 보성군 보성읍 보성리 873-16번지

Value | Count | Frequency (%)
전라남도 | 404 | 20.4%
보성군 | 403 | 20.3%
벌교읍 | 80 | 4.0%
율어면 | 39 | 2.0%
조성면 | 37 | 1.9%
득량면 | 36 | 1.8%
복내면 | 32 | 1.6%
웅치면 | 27 | 1.4%
보성읍 | 27 | 1.4%
회천면 | 26 | 1.3%
Other values (500) | 870 | 43.9%

Most occurring characters

ValueCountFrequency (%)
1590
 
16.8%
476
 
5.0%
432
 
4.6%
427
 
4.5%
412
 
4.4%
408
 
4.3%
408
 
4.3%
404
 
4.3%
396
 
4.2%
391
 
4.1%
Other values (113) 4119
43.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5987 | 63.3%
Decimal Number | 1614 | 17.1%
Space Separator | 1590 | 16.8%
Dash Punctuation | 272 | 2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
476
 
8.0%
432
 
7.2%
427
 
7.1%
412
 
6.9%
408
 
6.8%
408
 
6.8%
404
 
6.7%
396
 
6.6%
391
 
6.5%
391
 
6.5%
Other values (101) 1842
30.8%
Decimal Number
Value | Count | Frequency (%)
1 | 390 | 24.2%
2 | 165 | 10.2%
4 | 150 | 9.3%
5 | 150 | 9.3%
7 | 137 | 8.5%
8 | 133 | 8.2%
3 | 129 | 8.0%
6 | 124 | 7.7%
9 | 123 | 7.6%
0 | 113 | 7.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1590 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 272 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5987 | 63.3%
Common | 3476 | 36.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
476
 
8.0%
432
 
7.2%
427
 
7.1%
412
 
6.9%
408
 
6.8%
408
 
6.8%
404
 
6.7%
396
 
6.6%
391
 
6.5%
391
 
6.5%
Other values (101) 1842
30.8%
Common
Value | Count | Frequency (%)
(space) | 1590 | 45.7%
1 | 390 | 11.2%
- | 272 | 7.8%
2 | 165 | 4.7%
4 | 150 | 4.3%
5 | 150 | 4.3%
7 | 137 | 3.9%
8 | 133 | 3.8%
3 | 129 | 3.7%
6 | 124 | 3.6%
Other values (2) | 236 | 6.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5987 | 63.3%
ASCII | 3476 | 36.7%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1590 | 45.7%
1 | 390 | 11.2%
- | 272 | 7.8%
2 | 165 | 4.7%
4 | 150 | 4.3%
5 | 150 | 4.3%
7 | 137 | 3.9%
8 | 133 | 3.8%
3 | 129 | 3.7%
6 | 124 | 3.6%
Other values (2) | 236 | 6.8%
Hangul
ValueCountFrequency (%)
476
 
8.0%
432
 
7.2%
427
 
7.1%
412
 
6.9%
408
 
6.8%
408
 
6.8%
404
 
6.7%
396
 
6.6%
391
 
6.5%
391
 
6.5%
Other values (101) 1842
30.8%

소하천의 총길이(m)
Real number (ℝ)

Distinct: 330
Distinct (%): 84.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1304.8951
Minimum: 448
Maximum: 6600
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.6 KiB

Quantile statistics

Minimum: 448
5-th percentile: 527.5
Q1: 757.5
Median: 1042
Q3: 1516
95-th percentile: 3143.5
Maximum: 6600
Range: 6152
Interquartile range (IQR): 758.5

Descriptive statistics

Standard deviation: 865.00103
Coefficient of variation (CV): 0.6628893
Kurtosis: 8.128633
Mean: 1304.8951
Median Absolute Deviation (MAD): 347
Skewness: 2.4661615
Sum: 510214
Variance: 748226.78
Monotonicity: Not monotonic
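
Several of these statistics are derived from others in the same tables: the range and IQR from the quantiles, the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. The sketch below recomputes them from the reported (rounded) values as a consistency check; the variable names are my own.

```python
# Values copied from the quantile and descriptive statistics above.
mean = 1304.8951
std = 865.00103
minimum, maximum = 448, 6600
q1, q3 = 757.5, 1516

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation

print(f"range={value_range}, IQR={iqr}, CV={cv:.7f}, variance={variance:.2f}")
```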
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
500 | 9 | 2.3%
950 | 5 | 1.3%
550 | 4 | 1.0%
600 | 3 | 0.8%
3740 | 3 | 0.8%
830 | 3 | 0.8%
755 | 3 | 0.8%
640 | 3 | 0.8%
1030 | 3 | 0.8%
700 | 3 | 0.8%
Other values (320) | 352 | 90.0%

Minimum 10 values
Value | Count | Frequency (%)
448 | 1 | 0.3%
499 | 1 | 0.3%
500 | 9 | 2.3%
505 | 1 | 0.3%
507 | 1 | 0.3%
510 | 1 | 0.3%
515 | 1 | 0.3%
520 | 2 | 0.5%
525 | 1 | 0.3%
526 | 2 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
6600 | 1 | 0.3%
5670 | 1 | 0.3%
5633 | 1 | 0.3%
4635 | 1 | 0.3%
4590 | 1 | 0.3%
4515 | 1 | 0.3%
4400 | 1 | 0.3%
3972 | 1 | 0.3%
3763 | 1 | 0.3%
3740 | 3 | 0.8%

기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB
Minimum: 2023-06-15 00:00:00
Maximum: 2023-06-15 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 수계(水系)명 | 소하천의 총길이(m)
수계(水系)명 | 1.000 | 0.144
소하천의 총길이(m) | 0.144 | 1.000

Variable | 소하천의 총길이(m) | 수계(水系)명
소하천의 총길이(m) | 1.000 | 0.064
수계(水系)명 | 0.064 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 수계(水系)명 | 명칭 | 시점 | 종점 | 소하천의 총길이(m) | 기준일자
0 | 15-12-001 | 섬진강 | 평우천 | 전라남도 보성군 보성읍 쾌상리 337번지 | 전라남도 보성군 보성읍 우산리 937-4번지 | 5633 | 2023-06-15
1 | 15-12-002 | 섬진강 | 건동천 | 전라남도 보성군 보성읍 쾌상리 1157번지 | 전라남도 보성군 보성읍 쾌상리 598번지 | 3381 | 2023-06-15
2 | 15-12-003 | 섬진강 | 오서천 | 전라남도 보성군 보성읍 봉산리 산55-4번지 | 전라남도 보성군 보성읍 봉산리 1457-4번지 | 1313 | 2023-06-15
3 | 15-12-004 | 섬진강 | 예동천 | 전라남도 보성군 보성읍 옥암리 664번지 | 전라남도 보성군 보성읍 옥암리 607번지 | 2509 | 2023-06-15
4 | 15-12-005 | 섬진강 | 부곡천 | 전라남도 보성군 보성읍 보성리 493-3번지 | 전라남도 보성군 보성읍 보성리 873-16번지 | 1732 | 2023-06-15
5 | 15-12-006 | 섬진강 | 동윤천 | 전라남도 보성군 보성읍 원봉리 산17-1번지 | 전라남도 보성군 보성읍 보성리 966번지 | 4515 | 2023-06-15
6 | 15-12-007 | 섬진강 | 구기천 | 전라남도 보성군 보성읍 주봉리 790번지 | 전라남도 보성군 보성읍 주봉리 790-4번지 | 898 | 2023-06-15
7 | 15-12-008 | 섬진강 | 청룡천 | 전라남도 보성군 보성읍 원봉리 산34번지 | 전라남도 보성군 보성읍 원봉리 603-4번지 | 956 | 2023-06-15
8 | 15-12-009 | 섬진강 | 동암천 | 전라남도 보성군 보성읍 쾌상리 1157-10번지 | 전라남도 보성군 보성읍 쾌상리 597-9번지 | 1529 | 2023-06-15
9 | 15-12-010 | 섬진강 | 가신천 | 전라남도 보성군 보성읍 대야리 794-65번지 | 전라남도 보성군 보성읍 대야리 840-30번지 | 1380 | 2023-06-15

Last rows
Index | 연번 | 수계(水系)명 | 명칭 | 시점 | 종점 | 소하천의 총길이(m) | 기준일자
381 | 15-12-382 | 섬진강 | 삼수1천 | 전라남도 보성군 웅치면 봉산리 570번지 | 전라남도 보성군 웅치면 봉산리 487-1번지 | 500 | 2023-06-15
382 | 15-12-383 | 섬진강 | 대은천 | 전라남도 보성군 웅치면 용반리 산36-1번지 | 전라남도 보성군 웅치면 용반리 141-7번지 | 1113 | 2023-06-15
383 | 15-12-384 | 섬진강 | 거재천 | 전라남도 보성군 웅치면 강산리 271-1번지 | 전라남도 보성군 웅치면 강산리 1779-12번지 | 695 | 2023-06-15
384 | 15-12-385 | 섬진강 | 서재동천 | 전라남도 보성군 웅치면 강산리 산133번지 | 전라남도 보성군 웅치면 강산리 1849-1번지 | 1212 | 2023-06-15
385 | 15-12-386 | 섬진강 | 가정천 | 전라남도 보성군 웅치면 유산리 1082-11번지 | 전라남도 보성군 웅치면 유산리 1084-1번지 | 515 | 2023-06-15
386 | 15-12-387 | 섬진강 | 월등천 | 전라남도 보성군 웅치면 유산리 산154번지 | 전라남도 보성군 웅치면 유산리 1047-1번지 | 507 | 2023-06-15
387 | 15-12-388 | 섬진강 | 초삼천 | 전라남도 보성군 웅치면 유산리 789-2번지 | 전라남도 보성군 웅치면 유산리 1127-1번지 | 705 | 2023-06-15
388 | 15-12-389 | 섬진강 | 토동천 | 전라남도 보성군 웅치면 강산리 산111번지 | 전라남도 보성군 웅치면 강산리 산100-1번지 | 700 | 2023-06-15
389 | 15-12-390 | 섬진강 | 삼매천 | 전라남도 보성군 웅치면 용반리 595번지 | 전라남도 보성군 웅치면 용반리 694-1번지 | 600 | 2023-06-15
390 | 15-12-391 | 섬진강 | 왕암천 | 전라남도 보성군 웅치면 유산리 854번지 | 전라남도 보성군 웅치면 유산리 1107-1번지 | 550 | 2023-06-15