Overview

Dataset statistics

Number of variables10
Number of observations380
Missing cells1396
Missing cells (%)36.7%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory30.2 KiB
Average record size in memory81.3 B

Variable types

Text6
Categorical3
DateTime1

Dataset

Description2023년 7월 12일 기준, 경남 산청군 업무별 민원 신청 서식 목록에 대한 산청군 대표 누리집 링크로 민원사무명, 관련부서, 서식첨부파일로 구성되어 있습니다.
Author경상남도 산청군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15041745

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (0.3%) duplicate rowsDuplicates
첨부파일 수 is highly imbalanced (57.7%)Imbalance
첨부파일명 2 has 311 (81.8%) missing valuesMissing
첨부파일명 3 has 345 (90.8%) missing valuesMissing
첨부파일명 4 has 369 (97.1%) missing valuesMissing
첨부파일명 5 has 371 (97.6%) missing valuesMissing

Reproduction

Analysis started2023-12-11 00:26:29.819773
Analysis finished2023-12-11 00:26:31.122596
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct277
Distinct (%)72.9%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2023-12-11T09:26:31.271762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length77
Median length28
Mean length13.123684
Min length4

Characters and Unicode

Total characters4987
Distinct characters240
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique228 ?
Unique (%)60.0%

Sample

1st row사회복지서비스 및 급여 제공(변경) 신청
2nd row복지대상자(해산급여·장제급여)지원신청
3rd row장애인등록증재교부
4th row장애인등록증기재사항변경신청
5th row장애인등급조정신청
ValueCountFrequency (%)
신청 31
 
3.6%
신청서 28
 
3.3%
신고 21
 
2.4%
신고서 15
 
1.7%
14
 
1.6%
주민등록증 12
 
1.4%
주민등록 12
 
1.4%
열람 12
 
1.4%
교부 11
 
1.3%
발급 11
 
1.3%
Other values (426) 691
80.5%
2023-12-11T09:26:31.643443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
479
 
9.6%
343
 
6.9%
189
 
3.8%
177
 
3.5%
165
 
3.3%
156
 
3.1%
112
 
2.2%
) 93
 
1.9%
( 93
 
1.9%
82
 
1.6%
Other values (230) 3098
62.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4226
84.7%
Space Separator 479
 
9.6%
Close Punctuation 93
 
1.9%
Open Punctuation 93
 
1.9%
Other Punctuation 80
 
1.6%
Decimal Number 11
 
0.2%
Modifier Symbol 3
 
0.1%
Connector Punctuation 1
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
343
 
8.1%
189
 
4.5%
177
 
4.2%
165
 
3.9%
156
 
3.7%
112
 
2.7%
82
 
1.9%
78
 
1.8%
75
 
1.8%
74
 
1.8%
Other values (216) 2775
65.7%
Other Punctuation
ValueCountFrequency (%)
, 59
73.8%
. 15
 
18.8%
· 5
 
6.2%
/ 1
 
1.2%
Decimal Number
ValueCountFrequency (%)
2 5
45.5%
3 3
27.3%
6 2
 
18.2%
1 1
 
9.1%
Space Separator
ValueCountFrequency (%)
479
100.0%
Close Punctuation
ValueCountFrequency (%)
) 93
100.0%
Open Punctuation
ValueCountFrequency (%)
( 93
100.0%
Modifier Symbol
ValueCountFrequency (%)
¸ 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4226
84.7%
Common 761
 
15.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
343
 
8.1%
189
 
4.5%
177
 
4.2%
165
 
3.9%
156
 
3.7%
112
 
2.7%
82
 
1.9%
78
 
1.8%
75
 
1.8%
74
 
1.8%
Other values (216) 2775
65.7%
Common
ValueCountFrequency (%)
479
62.9%
) 93
 
12.2%
( 93
 
12.2%
, 59
 
7.8%
. 15
 
2.0%
· 5
 
0.7%
2 5
 
0.7%
¸ 3
 
0.4%
3 3
 
0.4%
6 2
 
0.3%
Other values (4) 4
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4221
84.6%
ASCII 752
 
15.1%
None 8
 
0.2%
Compat Jamo 5
 
0.1%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
479
63.7%
) 93
 
12.4%
( 93
 
12.4%
, 59
 
7.8%
. 15
 
2.0%
2 5
 
0.7%
3 3
 
0.4%
6 2
 
0.3%
1 1
 
0.1%
_ 1
 
0.1%
Hangul
ValueCountFrequency (%)
343
 
8.1%
189
 
4.5%
177
 
4.2%
165
 
3.9%
156
 
3.7%
112
 
2.7%
82
 
1.9%
78
 
1.8%
75
 
1.8%
74
 
1.8%
Other values (215) 2770
65.6%
None
ValueCountFrequency (%)
· 5
62.5%
¸ 3
37.5%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

관련부서
Categorical

Distinct24
Distinct (%)6.3%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
환경위생과
45 
민원과
44 
경제전략과
26 
생초면
25 
주민복지과
23 
Other values (19)
217 

Length

Max length5
Median length3
Mean length3.7421053
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주민복지과
2nd row주민복지과
3rd row주민복지과
4th row주민복지과
5th row주민복지과

Common Values

ValueCountFrequency (%)
환경위생과 45
 
11.8%
민원과 44
 
11.6%
경제전략과 26
 
6.8%
생초면 25
 
6.6%
주민복지과 23
 
6.1%
신등면 22
 
5.8%
신안면 21
 
5.5%
단성면 20
 
5.3%
재무과 17
 
4.5%
산청읍 16
 
4.2%
Other values (14) 121
31.8%

Length

2023-12-11T09:26:31.860319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
환경위생과 45
 
11.8%
민원과 44
 
11.6%
경제전략과 26
 
6.8%
생초면 25
 
6.6%
주민복지과 23
 
6.1%
신등면 22
 
5.8%
신안면 21
 
5.5%
단성면 20
 
5.3%
재무과 17
 
4.5%
산청읍 16
 
4.2%
Other values (14) 121
31.8%
Distinct66
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
Minimum2008-01-01 00:00:00
Maximum2019-09-24 00:00:00
2023-12-11T09:26:32.004473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:26:32.164994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

첨부파일 수
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
1
310 
2
33 
3
 
25
5
 
10
4
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 310
81.6%
2 33
 
8.7%
3 25
 
6.6%
5 10
 
2.6%
4 2
 
0.5%

Length

2023-12-11T09:26:32.321184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:26:32.447595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 310
81.6%
2 33
 
8.7%
3 25
 
6.6%
5 10
 
2.6%
4 2
 
0.5%
Distinct291
Distinct (%)76.6%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2023-12-11T09:26:32.684950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length93
Median length39
Mean length23.5
Min length6

Characters and Unicode

Total characters8930
Distinct characters262
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique246 ?
Unique (%)64.7%

Sample

1st row[별지제1호서식]사회복지서비스및급여제공(변경)신청서(서식)(0).hwp
2nd row[별지제3호서식]복지대상자 해산 장제급여 신청서('12.7. 개정)(서식)(0).hwp
3rd row장애인등록증재교부신청서(0).hwp
4th row장애인등록증기재사항변경신청서(0).hwp
5th row장애인등급조정신청서.hwp
ValueCountFrequency (%)
별지 101
 
8.8%
신청서.hwp 38
 
3.3%
또는 25
 
2.2%
열람 21
 
1.8%
17
 
1.5%
교부 16
 
1.4%
서식 16
 
1.4%
발급 15
 
1.3%
주민등록증 14
 
1.2%
영어 14
 
1.2%
Other values (486) 865
75.7%
2023-12-11T09:26:33.136633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
762
 
8.5%
529
 
5.9%
. 399
 
4.5%
p 376
 
4.2%
w 375
 
4.2%
h 375
 
4.2%
366
 
4.1%
198
 
2.2%
197
 
2.2%
187
 
2.1%
Other values (252) 5166
57.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5342
59.8%
Lowercase Letter 1137
 
12.7%
Space Separator 762
 
8.5%
Decimal Number 451
 
5.1%
Other Punctuation 430
 
4.8%
Open Punctuation 329
 
3.7%
Close Punctuation 329
 
3.7%
Connector Punctuation 84
 
0.9%
Modifier Symbol 51
 
0.6%
Uppercase Letter 10
 
0.1%
Other values (2) 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
529
 
9.9%
366
 
6.9%
198
 
3.7%
197
 
3.7%
187
 
3.5%
179
 
3.4%
133
 
2.5%
132
 
2.5%
130
 
2.4%
114
 
2.1%
Other values (217) 3177
59.5%
Decimal Number
ValueCountFrequency (%)
1 128
28.4%
0 67
14.9%
3 53
11.8%
2 52
11.5%
5 36
 
8.0%
9 28
 
6.2%
7 25
 
5.5%
4 24
 
5.3%
6 20
 
4.4%
8 18
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
p 376
33.1%
w 375
33.0%
h 375
33.0%
t 6
 
0.5%
x 3
 
0.3%
i 1
 
0.1%
z 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
D 4
40.0%
B 3
30.0%
H 1
 
10.0%
W 1
 
10.0%
P 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
. 399
92.8%
, 24
 
5.6%
· 6
 
1.4%
' 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 170
51.7%
[ 159
48.3%
Close Punctuation
ValueCountFrequency (%)
) 170
51.7%
] 159
48.3%
Space Separator
ValueCountFrequency (%)
762
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 84
100.0%
Modifier Symbol
ValueCountFrequency (%)
¸ 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5342
59.8%
Common 2441
27.3%
Latin 1147
 
12.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
529
 
9.9%
366
 
6.9%
198
 
3.7%
197
 
3.7%
187
 
3.5%
179
 
3.4%
133
 
2.5%
132
 
2.5%
130
 
2.4%
114
 
2.1%
Other values (217) 3177
59.5%
Common
ValueCountFrequency (%)
762
31.2%
. 399
16.3%
( 170
 
7.0%
) 170
 
7.0%
[ 159
 
6.5%
] 159
 
6.5%
1 128
 
5.2%
_ 84
 
3.4%
0 67
 
2.7%
3 53
 
2.2%
Other values (13) 290
 
11.9%
Latin
ValueCountFrequency (%)
p 376
32.8%
w 375
32.7%
h 375
32.7%
t 6
 
0.5%
D 4
 
0.3%
B 3
 
0.3%
x 3
 
0.3%
i 1
 
0.1%
z 1
 
0.1%
H 1
 
0.1%
Other values (2) 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5312
59.5%
ASCII 3531
39.5%
None 57
 
0.6%
Compat Jamo 30
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
762
21.6%
. 399
11.3%
p 376
10.6%
w 375
10.6%
h 375
10.6%
( 170
 
4.8%
) 170
 
4.8%
[ 159
 
4.5%
] 159
 
4.5%
1 128
 
3.6%
Other values (23) 458
13.0%
Hangul
ValueCountFrequency (%)
529
 
10.0%
366
 
6.9%
198
 
3.7%
197
 
3.7%
187
 
3.5%
179
 
3.4%
133
 
2.5%
132
 
2.5%
130
 
2.4%
114
 
2.1%
Other values (216) 3147
59.2%
None
ValueCountFrequency (%)
¸ 51
89.5%
· 6
 
10.5%
Compat Jamo
ValueCountFrequency (%)
30
100.0%

첨부파일명 2
Text

MISSING 

Distinct59
Distinct (%)85.5%
Missing311
Missing (%)81.8%
Memory size3.1 KiB
2023-12-11T09:26:33.460939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length38
Mean length26.318841
Min length7

Characters and Unicode

Total characters1816
Distinct characters173
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)75.4%

Sample

1st row[별지제1호의2서식]소득재산신고서('12.7. 개정)(서식).hwp
2nd row공유수면점사용허가변경신청서.hwp
3rd row게임제작업 등록신청서.hwp
4th row게임관련 허가.등록 또는 신고사항 변경신청서.hwp
5th row[별지제1호의2서식]소득재산신고서('12.7. 개정)(서식)(0).hwp
ValueCountFrequency (%)
별지 18
 
6.3%
베트남어 13
 
4.6%
또는 10
 
3.5%
교부 9
 
3.2%
열람 8
 
2.8%
주민등록표 7
 
2.5%
위임장.hwp 6
 
2.1%
있는 4
 
1.4%
4
 
1.4%
입증할 4
 
1.4%
Other values (147) 201
70.8%
2023-12-11T09:26:33.984231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
215
 
11.8%
86
 
4.7%
. 82
 
4.5%
p 69
 
3.8%
h 68
 
3.7%
w 68
 
3.7%
47
 
2.6%
) 43
 
2.4%
( 42
 
2.3%
34
 
1.9%
Other values (163) 1062
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1024
56.4%
Space Separator 215
 
11.8%
Lowercase Letter 207
 
11.4%
Decimal Number 125
 
6.9%
Other Punctuation 90
 
5.0%
Close Punctuation 70
 
3.9%
Open Punctuation 70
 
3.9%
Modifier Symbol 9
 
0.5%
Uppercase Letter 5
 
0.3%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
8.4%
47
 
4.6%
34
 
3.3%
32
 
3.1%
31
 
3.0%
28
 
2.7%
27
 
2.6%
25
 
2.4%
24
 
2.3%
22
 
2.1%
Other values (135) 668
65.2%
Decimal Number
ValueCountFrequency (%)
1 29
23.2%
2 26
20.8%
9 13
10.4%
0 13
10.4%
4 10
 
8.0%
8 8
 
6.4%
3 8
 
6.4%
6 7
 
5.6%
5 7
 
5.6%
7 4
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
p 69
33.3%
h 68
32.9%
w 68
32.9%
z 1
 
0.5%
i 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 82
91.1%
, 4
 
4.4%
· 2
 
2.2%
' 2
 
2.2%
Close Punctuation
ValueCountFrequency (%)
) 43
61.4%
] 27
38.6%
Open Punctuation
ValueCountFrequency (%)
( 42
60.0%
[ 28
40.0%
Uppercase Letter
ValueCountFrequency (%)
B 4
80.0%
D 1
 
20.0%
Space Separator
ValueCountFrequency (%)
215
100.0%
Modifier Symbol
ValueCountFrequency (%)
¸ 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1024
56.4%
Common 580
31.9%
Latin 212
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
8.4%
47
 
4.6%
34
 
3.3%
32
 
3.1%
31
 
3.0%
28
 
2.7%
27
 
2.6%
25
 
2.4%
24
 
2.3%
22
 
2.1%
Other values (135) 668
65.2%
Common
ValueCountFrequency (%)
215
37.1%
. 82
 
14.1%
) 43
 
7.4%
( 42
 
7.2%
1 29
 
5.0%
[ 28
 
4.8%
] 27
 
4.7%
2 26
 
4.5%
9 13
 
2.2%
0 13
 
2.2%
Other values (11) 62
 
10.7%
Latin
ValueCountFrequency (%)
p 69
32.5%
h 68
32.1%
w 68
32.1%
B 4
 
1.9%
z 1
 
0.5%
i 1
 
0.5%
D 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1018
56.1%
ASCII 781
43.0%
None 11
 
0.6%
Compat Jamo 6
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
215
27.5%
. 82
 
10.5%
p 69
 
8.8%
h 68
 
8.7%
w 68
 
8.7%
) 43
 
5.5%
( 42
 
5.4%
1 29
 
3.7%
[ 28
 
3.6%
] 27
 
3.5%
Other values (16) 110
14.1%
Hangul
ValueCountFrequency (%)
86
 
8.4%
47
 
4.6%
34
 
3.3%
32
 
3.1%
31
 
3.0%
28
 
2.8%
27
 
2.7%
25
 
2.5%
24
 
2.4%
22
 
2.2%
Other values (134) 662
65.0%
None
ValueCountFrequency (%)
¸ 9
81.8%
· 2
 
18.2%
Compat Jamo
ValueCountFrequency (%)
6
100.0%

첨부파일명 3
Text

MISSING 

Distinct32
Distinct (%)91.4%
Missing345
Missing (%)90.8%
Memory size3.1 KiB
2023-12-11T09:26:34.288033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length33
Mean length27.114286
Min length7

Characters and Unicode

Total characters949
Distinct characters136
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)82.9%

Sample

1st row[별지제1호의3서식]금융정보등(금융 신용 보험정보) 제공 동의서('12.7. 개정)(서식).hwp
2nd row게임배급업 등록신청서.hwp
3rd row[별지제1호의3서식]금융정보등(금융 신용 보험정보) 제공 동의서('12.7. 개정)(서식)(0).hwp
4th row별지 제10호의2서식[현상변경허가신청서-도지정문화재).hwp
5th row[서식_3]_보증서.hwp
ValueCountFrequency (%)
중국어 13
 
9.6%
별지 6
 
4.4%
교부 5
 
3.7%
신청서.hwp 3
 
2.2%
주민등록표 3
 
2.2%
열람 3
 
2.2%
또는 3
 
2.2%
위임장[별지제9호서식].hwp 3
 
2.2%
신청 3
 
2.2%
발급신청서.hwp 3
 
2.2%
Other values (74) 91
66.9%
2023-12-11T09:26:34.699842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
10.6%
41
 
4.3%
. 39
 
4.1%
p 35
 
3.7%
h 34
 
3.6%
w 34
 
3.6%
) 33
 
3.5%
( 32
 
3.4%
29
 
3.1%
20
 
2.1%
Other values (126) 551
58.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 529
55.7%
Lowercase Letter 105
 
11.1%
Space Separator 101
 
10.6%
Decimal Number 74
 
7.8%
Close Punctuation 45
 
4.7%
Open Punctuation 45
 
4.7%
Other Punctuation 43
 
4.5%
Connector Punctuation 2
 
0.2%
Uppercase Letter 2
 
0.2%
Modifier Symbol 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
7.8%
29
 
5.5%
20
 
3.8%
18
 
3.4%
17
 
3.2%
17
 
3.2%
15
 
2.8%
15
 
2.8%
14
 
2.6%
14
 
2.6%
Other values (99) 329
62.2%
Decimal Number
ValueCountFrequency (%)
1 20
27.0%
2 14
18.9%
3 12
16.2%
9 9
12.2%
5 5
 
6.8%
0 5
 
6.8%
8 3
 
4.1%
7 3
 
4.1%
6 2
 
2.7%
4 1
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
p 35
33.3%
h 34
32.4%
w 34
32.4%
z 1
 
1.0%
i 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 39
90.7%
' 2
 
4.7%
· 2
 
4.7%
Close Punctuation
ValueCountFrequency (%)
) 33
73.3%
] 12
 
26.7%
Open Punctuation
ValueCountFrequency (%)
( 32
71.1%
[ 13
28.9%
Space Separator
ValueCountFrequency (%)
101
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
¸ 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 529
55.7%
Common 313
33.0%
Latin 107
 
11.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
7.8%
29
 
5.5%
20
 
3.8%
18
 
3.4%
17
 
3.2%
17
 
3.2%
15
 
2.8%
15
 
2.8%
14
 
2.6%
14
 
2.6%
Other values (99) 329
62.2%
Common
ValueCountFrequency (%)
101
32.3%
. 39
 
12.5%
) 33
 
10.5%
( 32
 
10.2%
1 20
 
6.4%
2 14
 
4.5%
[ 13
 
4.2%
] 12
 
3.8%
3 12
 
3.8%
9 9
 
2.9%
Other values (11) 28
 
8.9%
Latin
ValueCountFrequency (%)
p 35
32.7%
h 34
31.8%
w 34
31.8%
B 2
 
1.9%
z 1
 
0.9%
i 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 527
55.5%
ASCII 416
43.8%
None 4
 
0.4%
Compat Jamo 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
101
24.3%
. 39
 
9.4%
p 35
 
8.4%
h 34
 
8.2%
w 34
 
8.2%
) 33
 
7.9%
( 32
 
7.7%
1 20
 
4.8%
2 14
 
3.4%
[ 13
 
3.1%
Other values (15) 61
14.7%
Hangul
ValueCountFrequency (%)
41
 
7.8%
29
 
5.5%
20
 
3.8%
18
 
3.4%
17
 
3.2%
17
 
3.2%
15
 
2.8%
15
 
2.8%
14
 
2.7%
14
 
2.7%
Other values (98) 327
62.0%
None
ValueCountFrequency (%)
· 2
50.0%
¸ 2
50.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

첨부파일명 4
Text

MISSING 

Distinct11
Distinct (%)100.0%
Missing369
Missing (%)97.1%
Memory size3.1 KiB
2023-12-11T09:26:34.933634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length30
Mean length25.818182
Min length8

Characters and Unicode

Total characters284
Distinct characters81
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)100.0%

Sample

1st row일반게임제공업 허가신청서.hwp
2nd row9890는 등·초본 교부 신청서[별지제7호서식].hwp
3rd row비산먼지발생사업 신고사업장 준수사항.hwp
4th row[별지 제1호서식] 건설업 등록신청서.hwp
5th row[별지 제16호서식] 건설업상속신고서.hwp
ValueCountFrequency (%)
교부 4
 
9.1%
별지 3
 
6.8%
열람 3
 
6.8%
또는 3
 
6.8%
제10호서식 1
 
2.3%
관련).hwp 1
 
2.3%
행정정보공동이용사전동의서.hwp 1
 
2.3%
9d98 1
 
2.3%
신청서[별지제11호서식].hwp 1
 
2.3%
일반게임제공업 1
 
2.3%
Other values (25) 25
56.8%
2023-12-11T09:26:35.306112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
11.6%
14
 
4.9%
. 11
 
3.9%
w 11
 
3.9%
h 11
 
3.9%
p 11
 
3.9%
9
 
3.2%
9
 
3.2%
8
 
2.8%
] 7
 
2.5%
Other values (71) 160
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 168
59.2%
Space Separator 33
 
11.6%
Lowercase Letter 33
 
11.6%
Decimal Number 21
 
7.4%
Other Punctuation 12
 
4.2%
Close Punctuation 8
 
2.8%
Open Punctuation 8
 
2.8%
Uppercase Letter 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
8.3%
9
 
5.4%
9
 
5.4%
8
 
4.8%
7
 
4.2%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
5
 
3.0%
Other values (54) 92
54.8%
Decimal Number
ValueCountFrequency (%)
1 7
33.3%
9 4
19.0%
0 4
19.0%
8 3
14.3%
6 2
 
9.5%
7 1
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
w 11
33.3%
h 11
33.3%
p 11
33.3%
Other Punctuation
ValueCountFrequency (%)
. 11
91.7%
· 1
 
8.3%
Close Punctuation
ValueCountFrequency (%)
] 7
87.5%
) 1
 
12.5%
Open Punctuation
ValueCountFrequency (%)
[ 7
87.5%
( 1
 
12.5%
Space Separator
ValueCountFrequency (%)
33
100.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 168
59.2%
Common 82
28.9%
Latin 34
 
12.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
8.3%
9
 
5.4%
9
 
5.4%
8
 
4.8%
7
 
4.2%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
5
 
3.0%
Other values (54) 92
54.8%
Common
ValueCountFrequency (%)
33
40.2%
. 11
 
13.4%
] 7
 
8.5%
[ 7
 
8.5%
1 7
 
8.5%
9 4
 
4.9%
0 4
 
4.9%
8 3
 
3.7%
6 2
 
2.4%
( 1
 
1.2%
Other values (3) 3
 
3.7%
Latin
ValueCountFrequency (%)
w 11
32.4%
h 11
32.4%
p 11
32.4%
D 1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 168
59.2%
ASCII 115
40.5%
None 1
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33
28.7%
. 11
 
9.6%
w 11
 
9.6%
h 11
 
9.6%
p 11
 
9.6%
] 7
 
6.1%
[ 7
 
6.1%
1 7
 
6.1%
9 4
 
3.5%
0 4
 
3.5%
Other values (6) 9
 
7.8%
Hangul
ValueCountFrequency (%)
14
 
8.3%
9
 
5.4%
9
 
5.4%
8
 
4.8%
7
 
4.2%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
5
 
3.0%
Other values (54) 92
54.8%
None
ValueCountFrequency (%)
· 1
100.0%

첨부파일명 5
Text

MISSING 

Distinct9
Distinct (%)100.0%
Missing371
Missing (%)97.6%
Memory size3.1 KiB
2023-12-11T09:26:35.510138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length31
Mean length30.888889
Min length16

Characters and Unicode

Total characters278
Distinct characters85
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)100.0%

Sample

1st row청소년게임제공업.인터넷컴퓨터게임시설제공업 등록신청서.hwp
2nd row9890는 등·초본 교부 신청서[별지제8호서식].hwp
3rd row토사 운송차량_준수사항.hwp
4th row[별지 제5호서식] 건설업등록증ㆍ건설업등록수첩의 기재사항변경신청서.hwp
5th row[별지 제16호의2서식] 건설업폐업신고서.hwp
ValueCountFrequency (%)
교부 4
 
10.3%
별지 3
 
7.7%
또는 3
 
7.7%
열람 3
 
7.7%
청소년게임제공업.인터넷컴퓨터게임시설제공업 1
 
2.6%
과태료의 1
 
2.6%
신청서.hwp 1
 
2.6%
초본 1
 
2.6%
주민등록표 1
 
2.6%
관계자의 1
 
2.6%
Other values (20) 20
51.3%
2023-12-11T09:26:35.835957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
10.8%
13
 
4.7%
. 10
 
3.6%
w 9
 
3.2%
p 9
 
3.2%
h 9
 
3.2%
9
 
3.2%
7
 
2.5%
[ 7
 
2.5%
7
 
2.5%
Other values (75) 168
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 171
61.5%
Space Separator 30
 
10.8%
Lowercase Letter 27
 
9.7%
Decimal Number 21
 
7.6%
Other Punctuation 11
 
4.0%
Open Punctuation 8
 
2.9%
Close Punctuation 8
 
2.9%
Uppercase Letter 1
 
0.4%
Connector Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
7.6%
9
 
5.3%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
Other values (55) 99
57.9%
Decimal Number
ValueCountFrequency (%)
1 6
28.6%
9 5
23.8%
8 4
19.0%
0 2
 
9.5%
7 1
 
4.8%
2 1
 
4.8%
6 1
 
4.8%
5 1
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
w 9
33.3%
p 9
33.3%
h 9
33.3%
Other Punctuation
ValueCountFrequency (%)
. 10
90.9%
· 1
 
9.1%
Open Punctuation
ValueCountFrequency (%)
[ 7
87.5%
( 1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
] 7
87.5%
) 1
 
12.5%
Space Separator
ValueCountFrequency (%)
30
100.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 171
61.5%
Common 79
28.4%
Latin 28
 
10.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
7.6%
9
 
5.3%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
Other values (55) 99
57.9%
Common
ValueCountFrequency (%)
30
38.0%
. 10
 
12.7%
[ 7
 
8.9%
] 7
 
8.9%
1 6
 
7.6%
9 5
 
6.3%
8 4
 
5.1%
0 2
 
2.5%
7 1
 
1.3%
) 1
 
1.3%
Other values (6) 6
 
7.6%
Latin
ValueCountFrequency (%)
w 9
32.1%
p 9
32.1%
h 9
32.1%
D 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 169
60.8%
ASCII 106
38.1%
Compat Jamo 2
 
0.7%
None 1
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30
28.3%
. 10
 
9.4%
w 9
 
8.5%
p 9
 
8.5%
h 9
 
8.5%
[ 7
 
6.6%
] 7
 
6.6%
1 6
 
5.7%
9 5
 
4.7%
8 4
 
3.8%
Other values (9) 10
 
9.4%
Hangul
ValueCountFrequency (%)
13
 
7.7%
9
 
5.3%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
5
 
3.0%
Other values (54) 97
57.4%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2019-12-06
380 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-12-06
2nd row2019-12-06
3rd row2019-12-06
4th row2019-12-06
5th row2019-12-06

Common Values

ValueCountFrequency (%)
2019-12-06 380
100.0%

Length

2023-12-11T09:26:35.985826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:26:36.124879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-12-06 380
100.0%

Correlations

2023-12-11T09:26:36.189960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관련부서등록일첨부파일 수첨부파일명 2첨부파일명 3첨부파일명 4첨부파일명 5
관련부서1.0000.9850.5670.9090.9631.0001.000
등록일0.9851.0000.9090.9660.8161.0001.000
첨부파일 수0.5670.9091.0000.9951.0001.000NaN
첨부파일명 20.9090.9660.9951.0001.0001.0001.000
첨부파일명 30.9630.8161.0001.0001.0001.0001.000
첨부파일명 41.0001.0001.0001.0001.0001.0001.000
첨부파일명 51.0001.000NaN1.0001.0001.0001.000
2023-12-11T09:26:36.305677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
첨부파일 수관련부서
첨부파일 수1.0000.307
관련부서0.3071.000
2023-12-11T09:26:36.378623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관련부서첨부파일 수
관련부서1.0000.307
첨부파일 수0.3071.000

Missing values

2023-12-11T09:26:30.763448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:26:30.924833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T09:26:31.047181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

민원사무명관련부서등록일첨부파일 수첨부파일명 1첨부파일명 2첨부파일명 3첨부파일명 4첨부파일명 5데이터기준일자
0사회복지서비스 및 급여 제공(변경) 신청주민복지과2008-01-013[별지제1호서식]사회복지서비스및급여제공(변경)신청서(서식)(0).hwp[별지제1호의2서식]소득재산신고서('12.7. 개정)(서식).hwp[별지제1호의3서식]금융정보등(금융 신용 보험정보) 제공 동의서('12.7. 개정)(서식).hwp<NA><NA>2019-12-06
1복지대상자(해산급여·장제급여)지원신청주민복지과2008-01-011[별지제3호서식]복지대상자 해산 장제급여 신청서('12.7. 개정)(서식)(0).hwp<NA><NA><NA><NA>2019-12-06
2장애인등록증재교부주민복지과2008-01-011장애인등록증재교부신청서(0).hwp<NA><NA><NA><NA>2019-12-06
3장애인등록증기재사항변경신청주민복지과2008-01-011장애인등록증기재사항변경신청서(0).hwp<NA><NA><NA><NA>2019-12-06
4장애인등급조정신청주민복지과2008-01-011장애인등급조정신청서.hwp<NA><NA><NA><NA>2019-12-06
5장애인등록주민복지과2008-01-011장애인등록신청서(0).hwp<NA><NA><NA><NA>2019-12-06
6장애인의료비청구주민복지과2008-01-011장애인의료비청구서(0).hwp<NA><NA><NA><NA>2019-12-06
7장애인자동차표지의발급(재발급)신청주민복지과2008-01-011장애인자동차표지 발급 신청서(0).hwp<NA><NA><NA><NA>2019-12-06
8미지급경로연금지급청구주민복지과2008-01-011미지급경로연금지급청구서(0).hwp<NA><NA><NA><NA>2019-12-06
9공유수면점사용허가안전건설과2008-01-012공유수면 신청서식.hwp공유수면점사용허가변경신청서.hwp<NA><NA><NA>2019-12-06
민원사무명관련부서등록일첨부파일 수첨부파일명 1첨부파일명 2첨부파일명 3첨부파일명 4첨부파일명 5데이터기준일자
370재산세(납세의무자, 과세대상) 변동 신고서재무과2019-07-231[별지 제64호서식] 재산세(납세의무자¸과세대상)변동 신고서.hwp<NA><NA><NA><NA>2019-12-06
371개별공시지가 의견제출서재무과2008-01-011[별지 제7호서식] 개별공시지가 의견서.hwp<NA><NA><NA><NA>2019-12-06
372표준지 공시지가 이의신청서재무과2008-01-011[별지 제5호서식] 표준지공시지가 이의신청서.hwp<NA><NA><NA><NA>2019-12-06
373개별공시지가 이의신청서재무과2008-01-011[별지 제8호서식] 개별공시지가 이의신청서.hwp<NA><NA><NA><NA>2019-12-06
374계약해제신고서재무과2019-09-241[별지 제1호의3서식] 계약해제신고서.hwp<NA><NA><NA><NA>2019-12-06
375과세전적부심사청구서재무과2019-09-241[별지 제53호서식] 과세전적부심사청구서.hwp<NA><NA><NA><NA>2019-12-06
376이의신청서재무과2019-09-241[별지 제56호서식] 이의신청서.hwp<NA><NA><NA><NA>2019-12-06
377지방세 과세표준 및 세액 등의 결정 또는 경정 청구서재무과2019-09-241[별지 제14호서식] 지방세 과세표준 및 세액 등의 결정 또는 경정 청구서.hwp<NA><NA><NA><NA>2019-12-06
378지방세 과세표준 수정신고서재무과2019-09-241[별지 제13호서식] 지방세 과세표준 수정신고서.hwp<NA><NA><NA><NA>2019-12-06
379지방세 환급청구서재무과2019-09-241[별지 제22호서식] 지방세 환급청구서¸ 지방세환급금 지급청구서.hwp<NA><NA><NA><NA>2019-12-06

Duplicate rows

Most frequently occurring

민원사무명관련부서등록일첨부파일 수첨부파일명 1첨부파일명 2첨부파일명 3첨부파일명 4첨부파일명 5데이터기준일자# duplicates
0출입국사실증명서 등 발급신청서(영어, 베트남어, 중국어)생초면2019-02-25323 (영어) 출입국사실증명서 등 발급신청서.hwp23 (베트남어) 출입국 사실증명 발급신청서.hwp23 (중국어) 출입국사실증명서 등 발급신청서.hwp<NA><NA>2019-12-062