Overview

Dataset statistics

Number of variables20
Number of observations3111
Missing cells1284
Missing cells (%)2.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory486.2 KiB
Average record size in memory160.0 B

Variable types

Text14
Categorical4
DateTime2

Dataset

Description한국농어촌공사에서 관리하는 농업생산기반시설의 인허가 정보
Author한국농어촌공사
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20230823000000002397

Alerts

성명_법인명 has 38 (1.2%) missing valuesMissing
전화번호 has 181 (5.8%) missing valuesMissing
주소 has 143 (4.6%) missing valuesMissing
허가_점용위치 has 126 (4.1%) missing valuesMissing
점용목적 has 97 (3.1%) missing valuesMissing
허가_점용기간 - 종료 has 568 (18.3%) missing valuesMissing
허가일자 has 94 (3.0%) missing valuesMissing

Reproduction

Analysis started2023-12-11 03:28:41.758239
Analysis finished2023-12-11 03:28:43.901877
Duration2.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2483
Distinct (%)79.8%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2023-12-11T12:28:44.081542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters31110
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2054 ?
Unique (%)66.0%

Sample

1st row41530Z0001
2nd row41530Z0001
3rd row41530Z0001
4th row41670Z0002
5th row4150040001
ValueCountFrequency (%)
44210a0021 14
 
0.5%
4471010090 6
 
0.2%
4785020024 5
 
0.2%
4679050104 5
 
0.2%
4679050108 5
 
0.2%
4725040008 5
 
0.2%
4725040010 5
 
0.2%
47840a0040 5
 
0.2%
4157030002 5
 
0.2%
4679020025 4
 
0.1%
Other values (2473) 3052
98.1%
2023-12-11T12:28:44.467813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10135
32.6%
4 4858
15.6%
1 2914
 
9.4%
2 2768
 
8.9%
7 2732
 
8.8%
5 2411
 
7.7%
3 1752
 
5.6%
8 1346
 
4.3%
6 1269
 
4.1%
9 794
 
2.6%
Other values (5) 131
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30979
99.6%
Uppercase Letter 131
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10135
32.7%
4 4858
15.7%
1 2914
 
9.4%
2 2768
 
8.9%
7 2732
 
8.8%
5 2411
 
7.8%
3 1752
 
5.7%
8 1346
 
4.3%
6 1269
 
4.1%
9 794
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
A 60
45.8%
Z 29
22.1%
C 21
 
16.0%
B 12
 
9.2%
D 9
 
6.9%

Most occurring scripts

ValueCountFrequency (%)
Common 30979
99.6%
Latin 131
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10135
32.7%
4 4858
15.7%
1 2914
 
9.4%
2 2768
 
8.9%
7 2732
 
8.8%
5 2411
 
7.8%
3 1752
 
5.7%
8 1346
 
4.3%
6 1269
 
4.1%
9 794
 
2.6%
Latin
ValueCountFrequency (%)
A 60
45.8%
Z 29
22.1%
C 21
 
16.0%
B 12
 
9.2%
D 9
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31110
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10135
32.6%
4 4858
15.6%
1 2914
 
9.4%
2 2768
 
8.9%
7 2732
 
8.8%
5 2411
 
7.7%
3 1752
 
5.6%
8 1346
 
4.3%
6 1269
 
4.1%
9 794
 
2.6%
Other values (5) 131
 
0.4%

본부
Categorical

Distinct11
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
전남
636 
충남
546 
경북
531 
충북
406 
경남
360 
Other values (6)
632 

Length

Max length6
Median length2
Mean length2.0282867
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기
2nd row경기
3rd row경기
4th row경기
5th row경기

Common Values

ValueCountFrequency (%)
전남 636
20.4%
충남 546
17.6%
경북 531
17.1%
충북 406
13.1%
경남 360
11.6%
경기 332
10.7%
전북 234
 
7.5%
강원 42
 
1.4%
천수만사업단 14
 
0.5%
금강사업단 8
 
0.3%

Length

2023-12-11T12:28:44.614498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전남 636
20.4%
충남 546
17.6%
경북 531
17.1%
충북 406
13.1%
경남 360
11.6%
경기 332
10.7%
전북 234
 
7.5%
강원 42
 
1.4%
천수만사업단 14
 
0.5%
금강사업단 8
 
0.3%

지사
Text

Distinct91
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2023-12-11T12:28:44.859573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length2
Mean length3.0761813
Min length2

Characters and Unicode

Total characters9570
Distinct characters102
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row여주.이천
2nd row여주.이천
3rd row여주.이천
4th row여주.이천
5th row여주.이천
ValueCountFrequency (%)
화순 203
 
6.5%
담양 104
 
3.3%
옥천.영동 103
 
3.3%
나주 99
 
3.2%
보은 91
 
2.9%
여주.이천 91
 
2.9%
파주 80
 
2.6%
공주 75
 
2.4%
고령 75
 
2.4%
음성 75
 
2.4%
Other values (81) 2115
68.0%
2023-12-11T12:28:45.233291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 1100
 
11.5%
763
 
8.0%
558
 
5.8%
525
 
5.5%
364
 
3.8%
271
 
2.8%
231
 
2.4%
223
 
2.3%
203
 
2.1%
198
 
2.1%
Other values (92) 5134
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8470
88.5%
Other Punctuation 1100
 
11.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
763
 
9.0%
558
 
6.6%
525
 
6.2%
364
 
4.3%
271
 
3.2%
231
 
2.7%
223
 
2.6%
203
 
2.4%
198
 
2.3%
177
 
2.1%
Other values (91) 4957
58.5%
Other Punctuation
ValueCountFrequency (%)
. 1100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8470
88.5%
Common 1100
 
11.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
763
 
9.0%
558
 
6.6%
525
 
6.2%
364
 
4.3%
271
 
3.2%
231
 
2.7%
223
 
2.6%
203
 
2.4%
198
 
2.3%
177
 
2.1%
Other values (91) 4957
58.5%
Common
ValueCountFrequency (%)
. 1100
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8470
88.5%
ASCII 1100
 
11.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 1100
100.0%
Hangul
ValueCountFrequency (%)
763
 
9.0%
558
 
6.6%
525
 
6.2%
364
 
4.3%
271
 
3.2%
231
 
2.7%
223
 
2.6%
203
 
2.4%
198
 
2.3%
177
 
2.1%
Other values (91) 4957
58.5%

시설구분
Categorical

Distinct14
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
양수장
1191 
취입보
1171 
배수장
405 
저수지
 
107
양배수장
 
89
Other values (9)
148 

Length

Max length4
Median length3
Mean length3.0527162
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row기타
2nd row기타
3rd row기타
4th row기타
5th row배수장

Common Values

ValueCountFrequency (%)
양수장 1191
38.3%
취입보 1171
37.6%
배수장 405
 
13.0%
저수지 107
 
3.4%
양배수장 89
 
2.9%
용수간선 60
 
1.9%
기타 29
 
0.9%
배수간선 21
 
0.7%
용수지선 12
 
0.4%
배수지선 9
 
0.3%
Other values (4) 17
 
0.5%

Length

2023-12-11T12:28:45.379702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
양수장 1191
38.3%
취입보 1171
37.6%
배수장 405
 
13.0%
저수지 107
 
3.4%
양배수장 89
 
2.9%
용수간선 60
 
1.9%
기타 29
 
0.9%
배수간선 21
 
0.7%
용수지선 12
 
0.4%
배수지선 9
 
0.3%
Other values (4) 17
 
0.5%
Distinct2037
Distinct (%)65.5%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2023-12-11T12:28:45.741431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length2
Mean length2.3336548
Min length1

Characters and Unicode

Total characters7260
Distinct characters391
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1436 ?
Unique (%)46.2%

Sample

1st row성호낙차공
2nd row성호낙차공
3rd row성호낙차공
4th row대당배수문
5th row상용
ValueCountFrequency (%)
17
 
0.5%
간월9-1 14
 
0.4%
대곡 13
 
0.4%
신평 9
 
0.3%
강정 9
 
0.3%
용산 8
 
0.3%
신기 8
 
0.3%
석동 7
 
0.2%
낙동 7
 
0.2%
7
 
0.2%
Other values (2030) 3019
96.8%
2023-12-11T12:28:46.515417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
216
 
3.0%
1 215
 
3.0%
184
 
2.5%
2 165
 
2.3%
164
 
2.3%
139
 
1.9%
134
 
1.8%
116
 
1.6%
113
 
1.6%
106
 
1.5%
Other values (381) 5708
78.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6679
92.0%
Decimal Number 455
 
6.3%
Open Punctuation 41
 
0.6%
Close Punctuation 41
 
0.6%
Dash Punctuation 22
 
0.3%
Space Separator 21
 
0.3%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
216
 
3.2%
184
 
2.8%
164
 
2.5%
139
 
2.1%
134
 
2.0%
116
 
1.7%
113
 
1.7%
106
 
1.6%
104
 
1.6%
103
 
1.5%
Other values (367) 5300
79.4%
Decimal Number
ValueCountFrequency (%)
1 215
47.3%
2 165
36.3%
3 28
 
6.2%
4 18
 
4.0%
9 14
 
3.1%
5 6
 
1.3%
8 3
 
0.7%
6 3
 
0.7%
7 3
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Close Punctuation
ValueCountFrequency (%)
) 41
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6679
92.0%
Common 580
 
8.0%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
216
 
3.2%
184
 
2.8%
164
 
2.5%
139
 
2.1%
134
 
2.0%
116
 
1.7%
113
 
1.7%
106
 
1.6%
104
 
1.6%
103
 
1.5%
Other values (367) 5300
79.4%
Common
ValueCountFrequency (%)
1 215
37.1%
2 165
28.4%
( 41
 
7.1%
) 41
 
7.1%
3 28
 
4.8%
- 22
 
3.8%
21
 
3.6%
4 18
 
3.1%
9 14
 
2.4%
5 6
 
1.0%
Other values (3) 9
 
1.6%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6679
92.0%
ASCII 581
 
8.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
216
 
3.2%
184
 
2.8%
164
 
2.5%
139
 
2.1%
134
 
2.0%
116
 
1.7%
113
 
1.7%
106
 
1.6%
104
 
1.6%
103
 
1.5%
Other values (367) 5300
79.4%
ASCII
ValueCountFrequency (%)
1 215
37.0%
2 165
28.4%
( 41
 
7.1%
) 41
 
7.1%
3 28
 
4.8%
- 22
 
3.8%
21
 
3.6%
4 18
 
3.1%
9 14
 
2.4%
5 6
 
1.0%
Other values (4) 10
 
1.7%

성명_법인명
Text

MISSING 

Distinct283
Distinct (%)9.2%
Missing38
Missing (%)1.2%
Memory size24.4 KiB
2023-12-11T12:28:46.819478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length11.941425
Min length4

Characters and Unicode

Total characters36696
Distinct characters133
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)2.9%

Sample

1st row농업기반공사 여주이천지사
2nd row한국농어촌공사 여주.이천지사
3rd row한국농어촌공사 여주.이천지사
4th row한국농어촌공사 여주.이천지사
5th row한국농어촌공사 여주.이천지사
ValueCountFrequency (%)
한국농어촌공사 1997
37.4%
한국농촌공사 324
 
6.1%
담양지사 104
 
1.9%
옥천영동지사 102
 
1.9%
나주지사 99
 
1.9%
한국농어촌공사화순지사 98
 
1.8%
보은지사 79
 
1.5%
음성지사 73
 
1.4%
농업기반공사 73
 
1.4%
한국농어촌공사부여지사 69
 
1.3%
Other values (219) 2328
43.5%
2023-12-11T12:28:47.241192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5863
16.0%
3008
 
8.2%
2976
 
8.1%
2890
 
7.9%
2851
 
7.8%
2850
 
7.8%
2850
 
7.8%
2367
 
6.5%
2275
 
6.2%
685
 
1.9%
Other values (123) 8081
22.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34178
93.1%
Space Separator 2275
 
6.2%
Other Punctuation 96
 
0.3%
Open Punctuation 74
 
0.2%
Close Punctuation 72
 
0.2%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5863
17.2%
3008
8.8%
2976
8.7%
2890
 
8.5%
2851
 
8.3%
2850
 
8.3%
2850
 
8.3%
2367
 
6.9%
685
 
2.0%
542
 
1.6%
Other values (116) 7296
21.3%
Other Punctuation
ValueCountFrequency (%)
, 62
64.6%
. 33
34.4%
· 1
 
1.0%
Space Separator
ValueCountFrequency (%)
2275
100.0%
Open Punctuation
ValueCountFrequency (%)
( 74
100.0%
Close Punctuation
ValueCountFrequency (%)
) 72
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34178
93.1%
Common 2518
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5863
17.2%
3008
8.8%
2976
8.7%
2890
 
8.5%
2851
 
8.3%
2850
 
8.3%
2850
 
8.3%
2367
 
6.9%
685
 
2.0%
542
 
1.6%
Other values (116) 7296
21.3%
Common
ValueCountFrequency (%)
2275
90.3%
( 74
 
2.9%
) 72
 
2.9%
, 62
 
2.5%
. 33
 
1.3%
· 1
 
< 0.1%
2 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34178
93.1%
ASCII 2517
 
6.9%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5863
17.2%
3008
8.8%
2976
8.7%
2890
 
8.5%
2851
 
8.3%
2850
 
8.3%
2850
 
8.3%
2367
 
6.9%
685
 
2.0%
542
 
1.6%
Other values (116) 7296
21.3%
ASCII
ValueCountFrequency (%)
2275
90.4%
( 74
 
2.9%
) 72
 
2.9%
, 62
 
2.5%
. 33
 
1.3%
2 1
 
< 0.1%
None
ValueCountFrequency (%)
· 1
100.0%

전화번호
Text

MISSING 

Distinct293
Distinct (%)10.0%
Missing181
Missing (%)5.8%
Memory size24.4 KiB
2023-12-11T12:28:47.535282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.836519
Min length7

Characters and Unicode

Total characters34681
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)3.6%

Sample

1st row031-884-6062
2nd row031-887-7530
3rd row031-887-7530
4th row031-887-7530
5th row031-887-7530
ValueCountFrequency (%)
061-370-1188 108
 
3.7%
043-730-2500 99
 
3.4%
061-330-9500 81
 
2.8%
043-871-7300 74
 
2.5%
540-2550 71
 
2.4%
061-372-1188 70
 
2.4%
041-850-6459 63
 
2.2%
041-669-4781 60
 
2.0%
054-339-5063 59
 
2.0%
063-270-0542 55
 
1.9%
Other values (283) 2190
74.7%
2023-12-11T12:28:47.990356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6985
20.1%
- 5699
16.4%
3 4172
12.0%
5 3742
10.8%
1 2982
8.6%
4 2379
 
6.9%
6 2097
 
6.0%
8 2027
 
5.8%
7 1806
 
5.2%
2 1646
 
4.7%
Other values (2) 1146
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 28902
83.3%
Dash Punctuation 5699
 
16.4%
Math Symbol 80
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6985
24.2%
3 4172
14.4%
5 3742
12.9%
1 2982
10.3%
4 2379
 
8.2%
6 2097
 
7.3%
8 2027
 
7.0%
7 1806
 
6.2%
2 1646
 
5.7%
9 1066
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 5699
100.0%
Math Symbol
ValueCountFrequency (%)
~ 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34681
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6985
20.1%
- 5699
16.4%
3 4172
12.0%
5 3742
10.8%
1 2982
8.6%
4 2379
 
6.9%
6 2097
 
6.0%
8 2027
 
5.8%
7 1806
 
5.2%
2 1646
 
4.7%
Other values (2) 1146
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34681
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6985
20.1%
- 5699
16.4%
3 4172
12.0%
5 3742
10.8%
1 2982
8.6%
4 2379
 
6.9%
6 2097
 
6.0%
8 2027
 
5.8%
7 1806
 
5.2%
2 1646
 
4.7%
Other values (2) 1146
 
3.3%

주소
Text

MISSING 

Distinct139
Distinct (%)4.7%
Missing143
Missing (%)4.6%
Memory size24.4 KiB
2023-12-11T12:28:48.415072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length14.525943
Min length11

Characters and Unicode

Total characters43113
Distinct characters166
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)0.9%

Sample

1st row경기도 여주군 능서면 신지리
2nd row경기도 여주군 능서면 신지리
3rd row경기도 여주시 능서면 신지리
4th row경기도 여주시 능서면 신지리
5th row경기도 여주시 능서면 신지리
ValueCountFrequency (%)
전라남도 599
 
5.5%
충청남도 516
 
4.8%
경상북도 496
 
4.6%
충청북도 387
 
3.6%
경상남도 331
 
3.1%
경기도 256
 
2.4%
전라북도 239
 
2.2%
화순군 203
 
1.9%
능주면 203
 
1.9%
관영리 193
 
1.8%
Other values (292) 7373
68.3%
2023-12-11T12:28:49.022332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7828
 
18.2%
2897
 
6.7%
1734
 
4.0%
1613
 
3.7%
1548
 
3.6%
1497
 
3.5%
1466
 
3.4%
1387
 
3.2%
1165
 
2.7%
1140
 
2.6%
Other values (156) 20838
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35255
81.8%
Space Separator 7828
 
18.2%
Decimal Number 30
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2897
 
8.2%
1734
 
4.9%
1613
 
4.6%
1548
 
4.4%
1497
 
4.2%
1466
 
4.2%
1387
 
3.9%
1165
 
3.3%
1140
 
3.2%
1054
 
3.0%
Other values (154) 19754
56.0%
Space Separator
ValueCountFrequency (%)
7828
100.0%
Decimal Number
ValueCountFrequency (%)
2 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35255
81.8%
Common 7858
 
18.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2897
 
8.2%
1734
 
4.9%
1613
 
4.6%
1548
 
4.4%
1497
 
4.2%
1466
 
4.2%
1387
 
3.9%
1165
 
3.3%
1140
 
3.2%
1054
 
3.0%
Other values (154) 19754
56.0%
Common
ValueCountFrequency (%)
7828
99.6%
2 30
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35255
81.8%
ASCII 7858
 
18.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7828
99.6%
2 30
 
0.4%
Hangul
ValueCountFrequency (%)
2897
 
8.2%
1734
 
4.9%
1613
 
4.6%
1548
 
4.4%
1497
 
4.2%
1466
 
4.2%
1387
 
3.9%
1165
 
3.3%
1140
 
3.2%
1054
 
3.0%
Other values (154) 19754
56.0%
Distinct567
Distinct (%)18.2%
Missing2
Missing (%)0.1%
Memory size24.4 KiB
2023-12-11T12:28:49.465415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.0604696
Min length1

Characters and Unicode

Total characters9515
Distinct characters227
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique226 ?
Unique (%)7.3%

Sample

1st row청미천
2nd row청미천
3rd row청미천
4th row복하천
5th row복하천
ValueCountFrequency (%)
낙동강 288
 
9.0%
금강 116
 
3.6%
남강 86
 
2.7%
영산강 81
 
2.5%
영산강수계 70
 
2.2%
지석천 68
 
2.1%
청미천 55
 
1.7%
보청천 42
 
1.3%
만경강 42
 
1.3%
미호천 39
 
1.2%
Other values (543) 2321
72.4%
2023-12-11T12:28:50.073510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2268
23.8%
1038
 
10.9%
391
 
4.1%
301
 
3.2%
273
 
2.9%
196
 
2.1%
188
 
2.0%
163
 
1.7%
139
 
1.5%
131
 
1.4%
Other values (217) 4427
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9393
98.7%
Space Separator 99
 
1.0%
Dash Punctuation 7
 
0.1%
Other Punctuation 6
 
0.1%
Decimal Number 4
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2268
24.1%
1038
 
11.1%
391
 
4.2%
301
 
3.2%
273
 
2.9%
196
 
2.1%
188
 
2.0%
163
 
1.7%
139
 
1.5%
131
 
1.4%
Other values (207) 4305
45.8%
Decimal Number
ValueCountFrequency (%)
2 1
25.0%
6 1
25.0%
0 1
25.0%
4 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 5
83.3%
· 1
 
16.7%
Space Separator
ValueCountFrequency (%)
99
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9393
98.7%
Common 122
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2268
24.1%
1038
 
11.1%
391
 
4.2%
301
 
3.2%
273
 
2.9%
196
 
2.1%
188
 
2.0%
163
 
1.7%
139
 
1.5%
131
 
1.4%
Other values (207) 4305
45.8%
Common
ValueCountFrequency (%)
99
81.1%
- 7
 
5.7%
, 5
 
4.1%
) 3
 
2.5%
( 3
 
2.5%
· 1
 
0.8%
2 1
 
0.8%
6 1
 
0.8%
0 1
 
0.8%
4 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9393
98.7%
ASCII 121
 
1.3%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2268
24.1%
1038
 
11.1%
391
 
4.2%
301
 
3.2%
273
 
2.9%
196
 
2.1%
188
 
2.0%
163
 
1.7%
139
 
1.5%
131
 
1.4%
Other values (207) 4305
45.8%
ASCII
ValueCountFrequency (%)
99
81.8%
- 7
 
5.8%
, 5
 
4.1%
) 3
 
2.5%
( 3
 
2.5%
2 1
 
0.8%
6 1
 
0.8%
0 1
 
0.8%
4 1
 
0.8%
None
ValueCountFrequency (%)
· 1
100.0%

하천등급
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
국가
1961 
지방2
718 
지방1
415 
기타
 
17

Length

Max length3
Median length2
Mean length2.3641916
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국가
2nd row국가
3rd row지방1
4th row국가
5th row국가

Common Values

ValueCountFrequency (%)
국가 1961
63.0%
지방2 718
 
23.1%
지방1 415
 
13.3%
기타 17
 
0.5%

Length

2023-12-11T12:28:50.224216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:28:50.344536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국가 1961
63.0%
지방2 718
 
23.1%
지방1 415
 
13.3%
기타 17
 
0.5%

허가_점용위치
Text

MISSING 

Distinct1732
Distinct (%)58.0%
Missing126
Missing (%)4.1%
Memory size24.4 KiB
2023-12-11T12:28:50.710222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length15.731323
Min length11

Characters and Unicode

Total characters46958
Distinct characters317
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1044 ?
Unique (%)35.0%

Sample

1st row경기도 이천시 설성면 제요리
2nd row경기도 이천시 설성면 제요리
3rd row경기도 이천시 설성면 제요리
4th row경기도 여주시 흥천면 대당리
5th row경기도 이천시 백사면 우곡리
ValueCountFrequency (%)
전라남도 602
 
5.1%
충청남도 539
 
4.6%
경상북도 498
 
4.2%
충청북도 388
 
3.3%
경상남도 328
 
2.8%
경기도 255
 
2.2%
전라북도 229
 
1.9%
화순군 199
 
1.7%
나주시 106
 
0.9%
담양군 104
 
0.9%
Other values (2135) 8558
72.5%
2023-12-11T12:28:51.298093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8821
18.8%
3025
 
6.4%
2769
 
5.9%
2269
 
4.8%
1733
 
3.7%
1698
 
3.6%
1421
 
3.0%
1268
 
2.7%
1184
 
2.5%
1176
 
2.5%
Other values (307) 21594
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38113
81.2%
Space Separator 8821
 
18.8%
Decimal Number 22
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3025
 
7.9%
2769
 
7.3%
2269
 
6.0%
1733
 
4.5%
1698
 
4.5%
1421
 
3.7%
1268
 
3.3%
1184
 
3.1%
1176
 
3.1%
963
 
2.5%
Other values (301) 20607
54.1%
Decimal Number
ValueCountFrequency (%)
1 14
63.6%
2 5
 
22.7%
3 3
 
13.6%
Space Separator
ValueCountFrequency (%)
8821
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38111
81.2%
Common 8845
 
18.8%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3025
 
7.9%
2769
 
7.3%
2269
 
6.0%
1733
 
4.5%
1698
 
4.5%
1421
 
3.7%
1268
 
3.3%
1184
 
3.1%
1176
 
3.1%
963
 
2.5%
Other values (299) 20605
54.1%
Common
ValueCountFrequency (%)
8821
99.7%
1 14
 
0.2%
2 5
 
0.1%
3 3
 
< 0.1%
( 1
 
< 0.1%
) 1
 
< 0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38111
81.2%
ASCII 8845
 
18.8%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8821
99.7%
1 14
 
0.2%
2 5
 
0.1%
3 3
 
< 0.1%
( 1
 
< 0.1%
) 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
3025
 
7.9%
2769
 
7.3%
2269
 
6.0%
1733
 
4.5%
1698
 
4.5%
1421
 
3.7%
1268
 
3.3%
1184
 
3.1%
1176
 
3.1%
963
 
2.5%
Other values (299) 20605
54.1%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct1124
Distinct (%)36.1%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2023-12-11T12:28:51.782294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length3.0703954
Min length1

Characters and Unicode

Total characters9552
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique602 ?
Unique (%)19.4%

Sample

1st row4,439
2nd row4,439
3rd row4,439
4th row1,304
5th row2,331
ValueCountFrequency (%)
0 265
 
8.5%
40 20
 
0.6%
100 20
 
0.6%
80 19
 
0.6%
60 18
 
0.6%
90 17
 
0.5%
300 17
 
0.5%
10 17
 
0.5%
30 17
 
0.5%
52 16
 
0.5%
Other values (1114) 2685
86.3%
2023-12-11T12:28:52.443024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1585
16.6%
1 1299
13.6%
2 1032
10.8%
5 848
8.9%
3 837
8.8%
4 776
8.1%
, 731
7.7%
6 675
7.1%
8 663
6.9%
7 580
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8821
92.3%
Other Punctuation 731
 
7.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1585
18.0%
1 1299
14.7%
2 1032
11.7%
5 848
9.6%
3 837
9.5%
4 776
8.8%
6 675
7.7%
8 663
7.5%
7 580
 
6.6%
9 526
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 731
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9552
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1585
16.6%
1 1299
13.6%
2 1032
10.8%
5 848
8.9%
3 837
8.8%
4 776
8.1%
, 731
7.7%
6 675
7.1%
8 663
6.9%
7 580
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9552
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1585
16.6%
1 1299
13.6%
2 1032
10.8%
5 848
8.9%
3 837
8.8%
4 776
8.1%
, 731
7.7%
6 675
7.1%
8 663
6.9%
7 580
 
6.1%
Distinct998
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
2023-12-11T12:28:52.856640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length4.075217
Min length1

Characters and Unicode

Total characters12678
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique608 ?
Unique (%)19.5%

Sample

1st row63,000
2nd row13,810
3rd row13,810
4th row0
5th row0
ValueCountFrequency (%)
0 832
26.7%
864 32
 
1.0%
8,640 26
 
0.8%
1,000 25
 
0.8%
2,000 22
 
0.7%
4,320 22
 
0.7%
1,700 20
 
0.6%
800 20
 
0.6%
1,728 20
 
0.6%
170 18
 
0.6%
Other values (988) 2074
66.7%
2023-12-11T12:28:53.475292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3931
31.0%
, 1972
15.6%
1 1162
 
9.2%
2 1039
 
8.2%
4 772
 
6.1%
8 716
 
5.6%
3 713
 
5.6%
6 703
 
5.5%
7 631
 
5.0%
5 588
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10706
84.4%
Other Punctuation 1972
 
15.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3931
36.7%
1 1162
 
10.9%
2 1039
 
9.7%
4 772
 
7.2%
8 716
 
6.7%
3 713
 
6.7%
6 703
 
6.6%
7 631
 
5.9%
5 588
 
5.5%
9 451
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 1972
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12678
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3931
31.0%
, 1972
15.6%
1 1162
 
9.2%
2 1039
 
8.2%
4 772
 
6.1%
8 716
 
5.6%
3 713
 
5.6%
6 703
 
5.5%
7 631
 
5.0%
5 588
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12678
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3931
31.0%
, 1972
15.6%
1 1162
 
9.2%
2 1039
 
8.2%
4 772
 
6.1%
8 716
 
5.6%
3 713
 
5.6%
6 703
 
5.5%
7 631
 
5.0%
5 588
 
4.6%
Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size24.4 KiB
설치
1806 
미설치
1267 
<NA>
 
38

Length

Max length4
Median length2
Mean length2.431694
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미설치
2nd row미설치
3rd row미설치
4th row미설치
5th row미설치

Common Values

ValueCountFrequency (%)
설치 1806
58.1%
미설치 1267
40.7%
<NA> 38
 
1.2%

Length

2023-12-11T12:28:53.677021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:28:53.801978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
설치 1806
58.1%
미설치 1267
40.7%
na 38
 
1.2%

점용목적
Text

MISSING 

Distinct448
Distinct (%)14.9%
Missing97
Missing (%)3.1%
Memory size24.4 KiB
2023-12-11T12:28:54.089144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length46
Mean length6.910418
Min length2

Characters and Unicode

Total characters20828
Distinct characters297
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique350 ?
Unique (%)11.6%

Sample

1st row농업용수인용
2nd row농업용수 인용
3rd row농업용수 인용
4th row배수문 설치에 따른 하천의 토지굴착 및 구조물 설치
5th row배수문 설치
ValueCountFrequency (%)
농업용수인용 1163
27.3%
농업용수 664
15.6%
설치 230
 
5.4%
배수장설치 108
 
2.5%
농업용수이용 105
 
2.5%
배수장 103
 
2.4%
농업 101
 
2.4%
97
 
2.3%
공작물설치 73
 
1.7%
농업용수공급 62
 
1.5%
Other values (571) 1550
36.4%
2023-12-11T12:28:54.650091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3645
17.5%
2915
14.0%
2216
10.6%
2215
10.6%
1245
 
6.0%
1243
 
6.0%
684
 
3.3%
600
 
2.9%
451
 
2.2%
411
 
2.0%
Other values (287) 5203
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18837
90.4%
Space Separator 1245
 
6.0%
Decimal Number 177
 
0.8%
Open Punctuation 163
 
0.8%
Close Punctuation 162
 
0.8%
Other Punctuation 132
 
0.6%
Uppercase Letter 48
 
0.2%
Lowercase Letter 33
 
0.2%
Math Symbol 28
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3645
19.4%
2915
15.5%
2216
11.8%
2215
11.8%
1243
 
6.6%
684
 
3.6%
600
 
3.2%
451
 
2.4%
411
 
2.2%
378
 
2.0%
Other values (248) 4079
21.7%
Uppercase Letter
ValueCountFrequency (%)
L 15
31.2%
D 6
 
12.5%
B 5
 
10.4%
M 5
 
10.4%
H 4
 
8.3%
X 4
 
8.3%
O 4
 
8.3%
C 2
 
4.2%
V 1
 
2.1%
T 1
 
2.1%
Decimal Number
ValueCountFrequency (%)
1 44
24.9%
0 32
18.1%
2 32
18.1%
5 20
11.3%
3 17
 
9.6%
8 11
 
6.2%
4 10
 
5.6%
6 5
 
2.8%
7 4
 
2.3%
9 2
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 81
61.4%
. 26
 
19.7%
: 8
 
6.1%
· 8
 
6.1%
* 6
 
4.5%
" 2
 
1.5%
/ 1
 
0.8%
Lowercase Letter
ValueCountFrequency (%)
m 30
90.9%
d 1
 
3.0%
c 1
 
3.0%
p 1
 
3.0%
Math Symbol
ValueCountFrequency (%)
= 20
71.4%
× 8
 
28.6%
Space Separator
ValueCountFrequency (%)
1245
100.0%
Open Punctuation
ValueCountFrequency (%)
( 163
100.0%
Close Punctuation
ValueCountFrequency (%)
) 162
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18837
90.4%
Common 1910
 
9.2%
Latin 81
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3645
19.4%
2915
15.5%
2216
11.8%
2215
11.8%
1243
 
6.6%
684
 
3.6%
600
 
3.2%
451
 
2.4%
411
 
2.2%
378
 
2.0%
Other values (248) 4079
21.7%
Common
ValueCountFrequency (%)
1245
65.2%
( 163
 
8.5%
) 162
 
8.5%
, 81
 
4.2%
1 44
 
2.3%
0 32
 
1.7%
2 32
 
1.7%
. 26
 
1.4%
= 20
 
1.0%
5 20
 
1.0%
Other values (14) 85
 
4.5%
Latin
ValueCountFrequency (%)
m 30
37.0%
L 15
18.5%
D 6
 
7.4%
B 5
 
6.2%
M 5
 
6.2%
H 4
 
4.9%
X 4
 
4.9%
O 4
 
4.9%
C 2
 
2.5%
d 1
 
1.2%
Other values (5) 5
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18837
90.4%
ASCII 1974
 
9.5%
None 16
 
0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3645
19.4%
2915
15.5%
2216
11.8%
2215
11.8%
1243
 
6.6%
684
 
3.6%
600
 
3.2%
451
 
2.4%
411
 
2.2%
378
 
2.0%
Other values (248) 4079
21.7%
ASCII
ValueCountFrequency (%)
1245
63.1%
( 163
 
8.3%
) 162
 
8.2%
, 81
 
4.1%
1 44
 
2.2%
0 32
 
1.6%
2 32
 
1.6%
m 30
 
1.5%
. 26
 
1.3%
= 20
 
1.0%
Other values (26) 139
 
7.0%
None
ValueCountFrequency (%)
· 8
50.0%
× 8
50.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Distinct1037
Distinct (%)33.6%
Missing23
Missing (%)0.7%
Memory size24.4 KiB
Minimum1945-01-01 00:00:00
Maximum2025-10-31 00:00:00
2023-12-11T12:28:54.820596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:28:54.981655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct666
Distinct (%)26.2%
Missing568
Missing (%)18.3%
Memory size24.4 KiB
2023-12-11T12:28:55.375299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.8710185
Min length2

Characters and Unicode

Total characters25102
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique419 ?
Unique (%)16.5%

Sample

1st row2012-10-30
2nd row2017-11-04
3rd row2022-11-04
4th row2021-11-08
5th row2023-09-03
ValueCountFrequency (%)
2025-04-30 118
 
4.6%
2020-06-26 93
 
3.7%
2025-12-31 79
 
3.1%
2025-05-11 72
 
2.8%
2026-10-31 72
 
2.8%
2021-09-30 65
 
2.6%
2025-10-30 61
 
2.4%
2020-12-31 60
 
2.4%
2021-02-28 54
 
2.1%
2026-09-30 48
 
1.9%
Other values (656) 1821
71.6%
2023-12-11T12:28:55.982270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 6018
24.0%
0 5501
21.9%
- 5004
19.9%
1 3115
12.4%
3 1758
 
7.0%
5 986
 
3.9%
6 862
 
3.4%
4 486
 
1.9%
9 479
 
1.9%
7 436
 
1.7%
Other values (3) 457
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20016
79.7%
Dash Punctuation 5004
 
19.9%
Other Letter 82
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 6018
30.1%
0 5501
27.5%
1 3115
15.6%
3 1758
 
8.8%
5 986
 
4.9%
6 862
 
4.3%
4 486
 
2.4%
9 479
 
2.4%
7 436
 
2.2%
8 375
 
1.9%
Other Letter
ValueCountFrequency (%)
41
50.0%
41
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 5004
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25020
99.7%
Hangul 82
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
2 6018
24.1%
0 5501
22.0%
- 5004
20.0%
1 3115
12.5%
3 1758
 
7.0%
5 986
 
3.9%
6 862
 
3.4%
4 486
 
1.9%
9 479
 
1.9%
7 436
 
1.7%
Hangul
ValueCountFrequency (%)
41
50.0%
41
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25020
99.7%
Hangul 82
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 6018
24.1%
0 5501
22.0%
- 5004
20.0%
1 3115
12.5%
3 1758
 
7.0%
5 986
 
3.9%
6 862
 
3.4%
4 486
 
1.9%
9 479
 
1.9%
7 436
 
1.7%
Hangul
ValueCountFrequency (%)
41
50.0%
41
50.0%

허가일자
Date

MISSING 

Distinct1089
Distinct (%)36.1%
Missing94
Missing (%)3.0%
Memory size24.4 KiB
Minimum1945-01-01 00:00:00
Maximum2022-12-12 00:00:00
2023-12-11T12:28:56.220036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:28:56.407173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2826
Distinct (%)90.9%
Missing1
Missing (%)< 0.1%
Memory size24.4 KiB
2023-12-11T12:28:56.706447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length11.76045
Min length1

Characters and Unicode

Total characters36575
Distinct characters161
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2714 ?
Unique (%)87.3%

Sample

1st row하천03-14호
2nd row한강 제439호
3rd row한강 제439-1호
4th row경기제1827호
5th row경기제1350-2호
ValueCountFrequency (%)
267
 
6.8%
139
 
3.6%
한강 61
 
1.6%
52
 
1.3%
건설4612-306 42
 
1.1%
제호 34
 
0.9%
제2022 28
 
0.7%
익산청 25
 
0.6%
서천 23
 
0.6%
옥천건설417-533 21
 
0.5%
Other values (2850) 3215
82.3%
2023-12-11T12:28:57.187893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6271
17.1%
2 5202
14.2%
1 4244
11.6%
- 4101
11.2%
2498
 
6.8%
2438
 
6.7%
3 1575
 
4.3%
9 1521
 
4.2%
8 1239
 
3.4%
4 1079
 
3.0%
Other values (151) 6407
17.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24077
65.8%
Other Letter 7176
 
19.6%
Dash Punctuation 4101
 
11.2%
Space Separator 799
 
2.2%
Close Punctuation 117
 
0.3%
Open Punctuation 117
 
0.3%
Other Punctuation 91
 
0.2%
Lowercase Letter 64
 
0.2%
Uppercase Letter 32
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2498
34.8%
2438
34.0%
169
 
2.4%
165
 
2.3%
157
 
2.2%
133
 
1.9%
132
 
1.8%
80
 
1.1%
79
 
1.1%
72
 
1.0%
Other values (112) 1253
17.5%
Lowercase Letter
ValueCountFrequency (%)
a 14
21.9%
n 9
14.1%
c 7
10.9%
e 7
10.9%
u 5
 
7.8%
r 5
 
7.8%
y 5
 
7.8%
p 4
 
6.2%
t 3
 
4.7%
l 2
 
3.1%
Other values (3) 3
 
4.7%
Decimal Number
ValueCountFrequency (%)
0 6271
26.0%
2 5202
21.6%
1 4244
17.6%
3 1575
 
6.5%
9 1521
 
6.3%
8 1239
 
5.1%
4 1079
 
4.5%
5 1060
 
4.4%
6 1005
 
4.2%
7 881
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
J 11
34.4%
M 8
25.0%
D 4
 
12.5%
O 3
 
9.4%
S 2
 
6.2%
A 2
 
6.2%
N 1
 
3.1%
F 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 83
91.2%
, 4
 
4.4%
/ 4
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 4101
100.0%
Space Separator
ValueCountFrequency (%)
799
100.0%
Close Punctuation
ValueCountFrequency (%)
) 117
100.0%
Open Punctuation
ValueCountFrequency (%)
( 117
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29303
80.1%
Hangul 7176
 
19.6%
Latin 96
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2498
34.8%
2438
34.0%
169
 
2.4%
165
 
2.3%
157
 
2.2%
133
 
1.9%
132
 
1.8%
80
 
1.1%
79
 
1.1%
72
 
1.0%
Other values (112) 1253
17.5%
Latin
ValueCountFrequency (%)
a 14
14.6%
J 11
11.5%
n 9
9.4%
M 8
 
8.3%
c 7
 
7.3%
e 7
 
7.3%
u 5
 
5.2%
r 5
 
5.2%
y 5
 
5.2%
D 4
 
4.2%
Other values (11) 21
21.9%
Common
ValueCountFrequency (%)
0 6271
21.4%
2 5202
17.8%
1 4244
14.5%
- 4101
14.0%
3 1575
 
5.4%
9 1521
 
5.2%
8 1239
 
4.2%
4 1079
 
3.7%
5 1060
 
3.6%
6 1005
 
3.4%
Other values (8) 2006
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29399
80.4%
Hangul 7176
 
19.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6271
21.3%
2 5202
17.7%
1 4244
14.4%
- 4101
13.9%
3 1575
 
5.4%
9 1521
 
5.2%
8 1239
 
4.2%
4 1079
 
3.7%
5 1060
 
3.6%
6 1005
 
3.4%
Other values (29) 2102
 
7.1%
Hangul
ValueCountFrequency (%)
2498
34.8%
2438
34.0%
169
 
2.4%
165
 
2.3%
157
 
2.2%
133
 
1.9%
132
 
1.8%
80
 
1.1%
79
 
1.1%
72
 
1.0%
Other values (112) 1253
17.5%
Distinct225
Distinct (%)7.3%
Missing11
Missing (%)0.4%
Memory size24.4 KiB
2023-12-11T12:28:57.472572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length7.1116129
Min length3

Characters and Unicode

Total characters22046
Distinct characters137
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)2.7%

Sample

1st row이천시장
2nd row한강홍수통제소장
3rd row한강홍수통제소장
4th row서울지방국토관리청장
5th row서울지방국토관리청장
ValueCountFrequency (%)
영산강홍수통제소장 355
 
11.2%
낙동강홍수통제소장 268
 
8.4%
영산강홍수통제소 232
 
7.3%
낙동강홍수통제소 231
 
7.3%
금강홍수통제소장 148
 
4.7%
금강홍수통제소 137
 
4.3%
한강홍수통제소장 116
 
3.6%
부산지방국토관리청장 102
 
3.2%
부산지방국토관리청 67
 
2.1%
대전지방국토관리청장 65
 
2.0%
Other values (219) 1461
45.9%
2023-12-11T12:28:57.919459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1884
 
8.5%
1590
 
7.2%
1573
 
7.1%
1556
 
7.1%
1546
 
7.0%
1543
 
7.0%
1428
 
6.5%
1070
 
4.9%
627
 
2.8%
619
 
2.8%
Other values (127) 8610
39.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21946
99.5%
Space Separator 83
 
0.4%
Decimal Number 7
 
< 0.1%
Other Punctuation 5
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1884
 
8.6%
1590
 
7.2%
1573
 
7.2%
1556
 
7.1%
1546
 
7.0%
1543
 
7.0%
1428
 
6.5%
1070
 
4.9%
627
 
2.9%
619
 
2.8%
Other values (115) 8510
38.8%
Decimal Number
ValueCountFrequency (%)
3 2
28.6%
9 1
14.3%
0 1
14.3%
7 1
14.3%
1 1
14.3%
4 1
14.3%
Other Punctuation
ValueCountFrequency (%)
/ 4
80.0%
, 1
 
20.0%
Space Separator
ValueCountFrequency (%)
83
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21946
99.5%
Common 100
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1884
 
8.6%
1590
 
7.2%
1573
 
7.2%
1556
 
7.1%
1546
 
7.0%
1543
 
7.0%
1428
 
6.5%
1070
 
4.9%
627
 
2.9%
619
 
2.8%
Other values (115) 8510
38.8%
Common
ValueCountFrequency (%)
83
83.0%
/ 4
 
4.0%
3 2
 
2.0%
) 2
 
2.0%
( 2
 
2.0%
, 1
 
1.0%
9 1
 
1.0%
0 1
 
1.0%
- 1
 
1.0%
7 1
 
1.0%
Other values (2) 2
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21946
99.5%
ASCII 100
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1884
 
8.6%
1590
 
7.2%
1573
 
7.2%
1556
 
7.1%
1546
 
7.0%
1543
 
7.0%
1428
 
6.5%
1070
 
4.9%
627
 
2.9%
619
 
2.8%
Other values (115) 8510
38.8%
ASCII
ValueCountFrequency (%)
83
83.0%
/ 4
 
4.0%
3 2
 
2.0%
) 2
 
2.0%
( 2
 
2.0%
, 1
 
1.0%
9 1
 
1.0%
0 1
 
1.0%
- 1
 
1.0%
7 1
 
1.0%
Other values (2) 2
 
2.0%

Correlations

2023-12-11T12:28:58.061686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본부지사시설구분하천등급유량계측시설
본부1.0000.9910.5070.2960.186
지사0.9911.0000.8020.8190.815
시설구분0.5070.8021.0000.3900.187
하천등급0.2960.8190.3901.0000.583
유량계측시설0.1860.8150.1870.5831.000
2023-12-11T12:28:58.193529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분유량계측시설본부하천등급
시설구분1.0000.1460.2250.230
유량계측시설0.1461.0000.1770.401
본부0.2250.1771.0000.183
하천등급0.2300.4010.1831.000
2023-12-11T12:28:58.336503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본부시설구분하천등급유량계측시설
본부1.0000.2250.1830.177
시설구분0.2251.0000.2300.146
하천등급0.1830.2301.0000.401
유량계측시설0.1770.1460.4011.000

Missing values

2023-12-11T12:28:43.325300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:28:43.548889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T12:28:43.769841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

표준코드본부지사시설구분시설명성명_법인명전화번호주소하천명하천등급허가_점용위치허가_점용면적유수사용량유량계측시설점용목적허가_점용기간 - 시작허가_점용기간 - 종료허가일자허가번호허가(권)자
041530Z0001경기여주.이천기타성호낙차공농업기반공사 여주이천지사031-884-6062경기도 여주군 능서면 신지리청미천국가경기도 이천시 설성면 제요리4,43963,000미설치농업용수인용2003-11-052012-10-302003-11-05하천03-14호이천시장
141530Z0001경기여주.이천기타성호낙차공한국농어촌공사 여주.이천지사031-887-7530경기도 여주군 능서면 신지리청미천국가경기도 이천시 설성면 제요리4,43913,810미설치농업용수 인용2012-11-052017-11-042012-10-30한강 제439호한강홍수통제소장
241530Z0001경기여주.이천기타성호낙차공한국농어촌공사 여주.이천지사031-887-7530경기도 여주시 능서면 신지리청미천지방1경기도 이천시 설성면 제요리4,43913,810미설치농업용수 인용2017-11-052022-11-042017-12-18한강 제439-1호한강홍수통제소장
341670Z0002경기여주.이천기타대당배수문한국농어촌공사 여주.이천지사031-887-7530경기도 여주시 능서면 신지리복하천국가경기도 여주시 흥천면 대당리1,3040미설치배수문 설치에 따른 하천의 토지굴착 및 구조물 설치2016-11-092021-11-082016-11-09경기제1827호서울지방국토관리청장
44150040001경기여주.이천배수장상용한국농어촌공사 여주.이천지사031-887-7530경기도 여주시 능서면 신지리복하천국가경기도 이천시 백사면 우곡리2,3310미설치배수문 설치2018-09-042023-09-032018-10-05경기제1350-2호서울지방국토관리청장
54150040001경기여주.이천배수장상용한국농어촌공사 여주.이천지사031-887-7530경기도 여주시 능서면 신지리복하천국가경기도 여주시 흥천면 하다리2,5290미설치배수장 및 배수문 설치2018-05-072023-05-062018-07-09경기제1326-4호서울지방국토관리청장
64150040002경기여주.이천배수장어석한국농어촌공사 여주이천지사031-887-7530경기도 여주시 능서면 신지리청미천국가경기도 이천시 장호원읍 어석리20,5350미설치어석지구 배수개선사업으로 인한 공작물(배수펌프장) 설치2008-10-20<NA>2011-04-28제2011-6호이천시장
74173040001경기여주.이천배수장내양여주,이천지사장031-884-6062경기도 여주군 능서면 신지리양화천지방2경기도 여주군 흥천면 율극리5,0740미설치공작물설치 및 토지점용2002-03-12<NA>2002-03-12제2002-3호여주군수
84153020002경기여주.이천양수장남천여주,이천지사031-884-6062경기도 여주군 능서면 신지리청미천국가경기도 이천시 장호원읍 오남리35010,950미설치유수사용을위한공작물설치,토지의사용2003-02-032013-02-022003-02-03한강 제104호한강홍수통제소장
94153020002경기여주.이천양수장남천한국농어촌공사 여주.이천지사031-887-7534경기도 여주군 능서면 신지리청미천국가경기도 이천시 장호원읍 오남리35010,950설치유수사용을위한공작물설치, 토지의사용2013-02-032018-02-022013-01-25한강 제104-1호국토해양부 한강홍수통제소장
표준코드본부지사시설구분시설명성명_법인명전화번호주소하천명하천등급허가_점용위치허가_점용면적유수사용량유량계측시설점용목적허가_점용기간 - 시작허가_점용기간 - 종료허가일자허가번호허가(권)자
310144210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리신대천지방2충청남도 보령시 주교면 신대리680미설치농업용수 공급2010-03-10<NA>2010-03-10제1-03-41보령시장
310244210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리금평천지방2충청남도 홍성군 홍동면 금평리800미설치농업용수 공급2010-10-01<NA>2010-10-01제2009-06호홍성군수
310344210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리음야천지방2충청남도 보령시 청소면 정전리950미설치농업용수공급2009-11-18<NA>2009-11-18제2-6-17보령시장
310444210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리마강천지방2충청남도 보령시 주포면 마강리1170미설치농업용수 공급2007-11-01<NA><NA>제2-2-8호보령시장
310544210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리담산천지방2충청남도 홍성군 광천읍 담산리520미설치농업용수 공급2008-07-22<NA>2008-07-22제2008-02-06호홍성군수
310644210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리재정천지방2충청남도 보령시 청소면 재정리1460미설치농업용수 공급2008-08-26<NA>2008-08-26제2-6-14,15보령시장
310744210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리광성천지방2충청남도 홍성군 장곡면 화계리1580미설치농업용수 공급2009-01-01<NA>2009-01-01제2009-01호홍성군수
310844210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리상송천지방2충청남도 홍성군 장곡면 상송리180미설치농업용수 공급2009-01-15<NA>2009-01-15제2008-02-17호홍성군수
310944210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리보봉천지방2충청남도 보령시 주포면 봉당리210미설치농업용수 공급2009-12-23<NA>2009-12-23제2-2-13보령시장
311044210A0021천수만사업단본부직할용수간선간월9-1한국농어촌공사 천수만사업단041-630-5900충청남도 홍성군 홍성읍 월산리오성천지방2충청남도 홍성군 장곡면 오성리410미설치농업용수 공급2010-02-17<NA>2010-02-17제2010-03호홍성군수