Overview

Dataset statistics

Number of variables3
Number of observations300
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)1.0%
Total size in memory7.2 KiB
Average record size in memory24.4 B

Variable types

Categorical1
Text2

Dataset

Description강원도 하천 소개
Author강원도
URLhttps://www.data.go.kr/data/3044751/fileData.do

Alerts

Dataset has 3 (1.0%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 10:18:55.102899
Analysis finished2023-12-12 10:18:55.565215
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소재지
Categorical

Distinct18
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
평창군
62 
홍천군
32 
정선군
26 
강릉시
23 
춘천시
23 
Other values (13)
134 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정선군
2nd row횡성군
3rd row홍천군
4th row강릉시
5th row영월군

Common Values

ValueCountFrequency (%)
평창군 62
20.7%
홍천군 32
10.7%
정선군 26
8.7%
강릉시 23
 
7.7%
춘천시 23
 
7.7%
원주시 21
 
7.0%
인제군 17
 
5.7%
철원군 16
 
5.3%
양양군 15
 
5.0%
영월군 15
 
5.0%
Other values (8) 50
16.7%

Length

2023-12-12T19:18:55.671827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
평창군 62
20.7%
홍천군 32
10.7%
정선군 26
8.7%
강릉시 23
 
7.7%
춘천시 23
 
7.7%
원주시 21
 
7.0%
인제군 17
 
5.7%
철원군 16
 
5.3%
영월군 15
 
5.0%
양양군 15
 
5.0%
Other values (8) 50
16.7%
Distinct176
Distinct (%)58.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T19:18:56.131949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.0966667
Min length2

Characters and Unicode

Total characters929
Distinct characters171
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique123 ?
Unique (%)41.0%

Sample

1st row아우라지
2nd row섬강
3rd row홍천강
4th row경포생태저류지
5th row동강
ValueCountFrequency (%)
남대천 10
 
3.3%
섬강 8
 
2.7%
골지천 7
 
2.3%
소양강 7
 
2.3%
주천강 7
 
2.3%
북한강 7
 
2.3%
홍천강 6
 
2.0%
내촌천 6
 
2.0%
남한강 5
 
1.7%
동강 5
 
1.7%
Other values (166) 232
77.3%
2023-12-12T19:18:56.762113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
252
27.1%
63
 
6.8%
31
 
3.3%
24
 
2.6%
22
 
2.4%
21
 
2.3%
19
 
2.0%
18
 
1.9%
16
 
1.7%
15
 
1.6%
Other values (161) 448
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 929
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
252
27.1%
63
 
6.8%
31
 
3.3%
24
 
2.6%
22
 
2.4%
21
 
2.3%
19
 
2.0%
18
 
1.9%
16
 
1.7%
15
 
1.6%
Other values (161) 448
48.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 929
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
252
27.1%
63
 
6.8%
31
 
3.3%
24
 
2.6%
22
 
2.4%
21
 
2.3%
19
 
2.0%
18
 
1.9%
16
 
1.7%
15
 
1.6%
Other values (161) 448
48.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 929
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
252
27.1%
63
 
6.8%
31
 
3.3%
24
 
2.6%
22
 
2.4%
21
 
2.3%
19
 
2.0%
18
 
1.9%
16
 
1.7%
15
 
1.6%
Other values (161) 448
48.2%

주소
Text

Distinct262
Distinct (%)87.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T19:18:57.153573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length14
Mean length13.556667
Min length9

Characters and Unicode

Total characters4067
Distinct characters188
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique237 ?
Unique (%)79.0%

Sample

1st row강원 정선군 여량면 여량5리
2nd row강원 횡성군 횡성읍
3rd row강원 홍천군 북방면
4th row강원 강릉시 죽헌동
5th row강원 영월군 영월읍
ValueCountFrequency (%)
강원 300
25.7%
평창군 62
 
5.3%
홍천군 32
 
2.7%
정선군 26
 
2.2%
강릉시 23
 
2.0%
춘천시 23
 
2.0%
원주시 21
 
1.8%
인제군 17
 
1.5%
철원군 16
 
1.4%
양양군 15
 
1.3%
Other values (337) 631
54.1%
2023-12-12T19:18:57.681634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
866
21.3%
361
 
8.9%
336
 
8.3%
256
 
6.3%
212
 
5.2%
208
 
5.1%
104
 
2.6%
89
 
2.2%
83
 
2.0%
73
 
1.8%
Other values (178) 1479
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3174
78.0%
Space Separator 866
 
21.3%
Decimal Number 24
 
0.6%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
361
 
11.4%
336
 
10.6%
256
 
8.1%
212
 
6.7%
208
 
6.6%
104
 
3.3%
89
 
2.8%
83
 
2.6%
73
 
2.3%
63
 
2.0%
Other values (169) 1389
43.8%
Decimal Number
ValueCountFrequency (%)
6 5
20.8%
5 4
16.7%
1 4
16.7%
9 3
12.5%
3 3
12.5%
8 3
12.5%
7 2
 
8.3%
Space Separator
ValueCountFrequency (%)
866
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3174
78.0%
Common 893
 
22.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
361
 
11.4%
336
 
10.6%
256
 
8.1%
212
 
6.7%
208
 
6.6%
104
 
3.3%
89
 
2.8%
83
 
2.6%
73
 
2.3%
63
 
2.0%
Other values (169) 1389
43.8%
Common
ValueCountFrequency (%)
866
97.0%
6 5
 
0.6%
5 4
 
0.4%
1 4
 
0.4%
9 3
 
0.3%
3 3
 
0.3%
- 3
 
0.3%
8 3
 
0.3%
7 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3174
78.0%
ASCII 893
 
22.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
866
97.0%
6 5
 
0.6%
5 4
 
0.4%
1 4
 
0.4%
9 3
 
0.3%
3 3
 
0.3%
- 3
 
0.3%
8 3
 
0.3%
7 2
 
0.2%
Hangul
ValueCountFrequency (%)
361
 
11.4%
336
 
10.6%
256
 
8.1%
212
 
6.7%
208
 
6.6%
104
 
3.3%
89
 
2.8%
83
 
2.6%
73
 
2.3%
63
 
2.0%
Other values (169) 1389
43.8%

Missing values

2023-12-12T19:18:55.413315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:18:55.526508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

소재지하천명주소
0정선군아우라지강원 정선군 여량면 여량5리
1횡성군섬강강원 횡성군 횡성읍
2홍천군홍천강강원 홍천군 북방면
3강릉시경포생태저류지강원 강릉시 죽헌동
4영월군동강강원 영월군 영월읍
5춘천시소양강강원 춘천시 동면
6양양군양양남대천강원 양양군 양양읍
7철원군한탄강강원 철원군 갈말읍
8평창군오대천강원 평창군 진부면
9정선군소금강강원 정선군 화암면 몰운리
소재지하천명주소
290평창군절골천강원 평창군 봉평면 덕거리
291평창군구룡소골천강원 평창군 용평면 속사리
292삼척시무릉천강원 삼척시 신기면 서하리
293평창군봉동천강원 평창군 평창읍 노론리
294영월군주천강강원 영월군 무릉도원면 도원리
295화천군화천천강원 화천군 화천읍 신읍리
296정선군골지천강원 정선군 여량면 봉정리
297평창군보래동천강원 평창군 봉평면 덕거리
298평창군소고개천강원 평창군 평창읍 하리
299화천군북한강강원 화천군 하남면 논미리

Duplicate rows

Most frequently occurring

소재지하천명주소# duplicates
0영월군주천강강원 영월군 무릉도원면 도원리2
1철원군대교천강원 철원군 동송읍 이평리2
2춘천시약사천강원 춘천시 효자동2