Overview

Dataset statistics

Number of variables5
Number of observations3594
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory140.5 KiB
Average record size in memory40.0 B

Variable types

DateTime1
Text4

Dataset

Description청주시 소통팔달시스템 전자문서 유통내역입니다. 제공자료(발송일자, 발송지역, 발송팀, 발송자, 문서제목 등)
URLhttps://www.data.go.kr/data/15063100/fileData.do

Alerts

Dataset has 2 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 11:59:14.399662
Analysis finished2023-12-12 11:59:15.560827
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3265
Distinct (%)90.8%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
Minimum2016-12-09 11:00:00
Maximum2023-04-13 11:39:00
2023-12-12T20:59:15.650345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:15.883143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct542
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
2023-12-12T20:59:16.296223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length6.6001669
Min length1

Characters and Unicode

Total characters23721
Distinct characters249
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique349 ?
Unique (%)9.7%

Sample

1st row(주)세이정보기술외64
2nd row(주)세이정보기술외64
3rd row(주)세이정보기술외39
4th row(주)세이정보기술외59
5th row(주)세이정보기술외64
ValueCountFrequency (%)
오산1리외42 317
 
8.7%
관정1리외21 222
 
6.1%
1통외39 167
 
4.6%
호계리외45 161
 
4.4%
상삼리외30 126
 
3.5%
구성1리외53 124
 
3.4%
미원1리외46 119
 
3.3%
호계리외46 116
 
3.2%
청용1리외30 116
 
3.2%
신기리외50 111
 
3.1%
Other values (544) 2056
56.6%
2023-12-12T20:59:16.835187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3228
 
13.6%
1 3091
 
13.0%
2049
 
8.6%
2 1225
 
5.2%
3 1215
 
5.1%
4 1146
 
4.8%
970
 
4.1%
5 772
 
3.3%
505
 
2.1%
6 500
 
2.1%
Other values (239) 9020
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14499
61.1%
Decimal Number 9106
38.4%
Space Separator 60
 
0.3%
Close Punctuation 15
 
0.1%
Open Punctuation 15
 
0.1%
Dash Punctuation 11
 
< 0.1%
Uppercase Letter 10
 
< 0.1%
Connector Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3228
22.3%
2049
 
14.1%
970
 
6.7%
505
 
3.5%
357
 
2.5%
305
 
2.1%
293
 
2.0%
285
 
2.0%
277
 
1.9%
277
 
1.9%
Other values (220) 5953
41.1%
Decimal Number
ValueCountFrequency (%)
1 3091
33.9%
2 1225
 
13.5%
3 1215
 
13.3%
4 1146
 
12.6%
5 772
 
8.5%
6 500
 
5.5%
0 476
 
5.2%
9 353
 
3.9%
8 226
 
2.5%
7 102
 
1.1%
Uppercase Letter
ValueCountFrequency (%)
L 5
50.0%
H 5
50.0%
Space Separator
ValueCountFrequency (%)
60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14499
61.1%
Common 9210
38.8%
Latin 12
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3228
22.3%
2049
 
14.1%
970
 
6.7%
505
 
3.5%
357
 
2.5%
305
 
2.1%
293
 
2.0%
285
 
2.0%
277
 
1.9%
277
 
1.9%
Other values (220) 5953
41.1%
Common
ValueCountFrequency (%)
1 3091
33.6%
2 1225
 
13.3%
3 1215
 
13.2%
4 1146
 
12.4%
5 772
 
8.4%
6 500
 
5.4%
0 476
 
5.2%
9 353
 
3.8%
8 226
 
2.5%
7 102
 
1.1%
Other values (6) 104
 
1.1%
Latin
ValueCountFrequency (%)
L 5
41.7%
H 5
41.7%
e 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14499
61.1%
ASCII 9222
38.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3228
22.3%
2049
 
14.1%
970
 
6.7%
505
 
3.5%
357
 
2.5%
305
 
2.1%
293
 
2.0%
285
 
2.0%
277
 
1.9%
277
 
1.9%
Other values (220) 5953
41.1%
ASCII
ValueCountFrequency (%)
1 3091
33.5%
2 1225
 
13.3%
3 1215
 
13.2%
4 1146
 
12.4%
5 772
 
8.4%
6 500
 
5.4%
0 476
 
5.2%
9 353
 
3.8%
8 226
 
2.5%
7 102
 
1.1%
Other values (9) 116
 
1.3%
Distinct104
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
2023-12-12T20:59:17.161490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length7
Mean length8.3583751
Min length3

Characters and Unicode

Total characters30040
Distinct characters88
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)0.4%

Sample

1st row율량사천동 행정민원팀
2nd row율량사천동 행정민원팀
3rd row남이면 행정팀
4th row용암1동 행정민원팀
5th row율량사천동 행정민원팀
ValueCountFrequency (%)
행정민원팀 1344
18.7%
산업팀 1199
16.7%
행정팀 730
 
10.2%
옥산면 364
 
5.1%
오송읍 328
 
4.6%
낭성면 244
 
3.4%
내수읍 207
 
2.9%
미원면 170
 
2.4%
남이면 165
 
2.3%
주민복지팀 161
 
2.2%
Other values (49) 2274
31.6%
2023-12-12T20:59:17.612861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3592
 
12.0%
3592
 
12.0%
2165
 
7.2%
2074
 
6.9%
1669
 
5.6%
1549
 
5.2%
1532
 
5.1%
1518
 
5.1%
1448
 
4.8%
1199
 
4.0%
Other values (78) 9702
32.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25704
85.6%
Space Separator 3592
 
12.0%
Decimal Number 744
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3592
14.0%
2165
 
8.4%
2074
 
8.1%
1669
 
6.5%
1549
 
6.0%
1532
 
6.0%
1518
 
5.9%
1448
 
5.6%
1199
 
4.7%
587
 
2.3%
Other values (75) 8371
32.6%
Decimal Number
ValueCountFrequency (%)
2 444
59.7%
1 300
40.3%
Space Separator
ValueCountFrequency (%)
3592
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25704
85.6%
Common 4336
 
14.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3592
14.0%
2165
 
8.4%
2074
 
8.1%
1669
 
6.5%
1549
 
6.0%
1532
 
6.0%
1518
 
5.9%
1448
 
5.6%
1199
 
4.7%
587
 
2.3%
Other values (75) 8371
32.6%
Common
ValueCountFrequency (%)
3592
82.8%
2 444
 
10.2%
1 300
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25704
85.6%
ASCII 4336
 
14.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3592
82.8%
2 444
 
10.2%
1 300
 
6.9%
Hangul
ValueCountFrequency (%)
3592
14.0%
2165
 
8.4%
2074
 
8.1%
1669
 
6.5%
1549
 
6.0%
1532
 
6.0%
1518
 
5.9%
1448
 
5.6%
1199
 
4.7%
587
 
2.3%
Other values (75) 8371
32.6%
Distinct316
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
2023-12-12T20:59:18.022045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9755147
Min length2

Characters and Unicode

Total characters10694
Distinct characters136
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)1.9%

Sample

1st row이0수
2nd row이0수
3rd row오0술
4th row김0진
5th row이0수
ValueCountFrequency (%)
최0범 116
 
3.2%
채0욱 101
 
2.8%
구0 87
 
2.4%
주0혁 81
 
2.3%
지0란 80
 
2.2%
정0영 73
 
2.0%
고0옥 66
 
1.8%
신0음 63
 
1.8%
강0지 62
 
1.7%
이0희 54
 
1.5%
Other values (306) 2811
78.2%
2023-12-12T20:59:18.585682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3594
33.6%
615
 
5.8%
445
 
4.2%
286
 
2.7%
241
 
2.3%
238
 
2.2%
185
 
1.7%
183
 
1.7%
177
 
1.7%
171
 
1.6%
Other values (126) 4559
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7100
66.4%
Decimal Number 3594
33.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
615
 
8.7%
445
 
6.3%
286
 
4.0%
241
 
3.4%
238
 
3.4%
185
 
2.6%
183
 
2.6%
177
 
2.5%
171
 
2.4%
151
 
2.1%
Other values (125) 4408
62.1%
Decimal Number
ValueCountFrequency (%)
0 3594
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7100
66.4%
Common 3594
33.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
615
 
8.7%
445
 
6.3%
286
 
4.0%
241
 
3.4%
238
 
3.4%
185
 
2.6%
183
 
2.6%
177
 
2.5%
171
 
2.4%
151
 
2.1%
Other values (125) 4408
62.1%
Common
ValueCountFrequency (%)
0 3594
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7100
66.4%
ASCII 3594
33.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3594
100.0%
Hangul
ValueCountFrequency (%)
615
 
8.7%
445
 
6.3%
286
 
4.0%
241
 
3.4%
238
 
3.4%
185
 
2.6%
183
 
2.6%
177
 
2.5%
171
 
2.4%
151
 
2.1%
Other values (125) 4408
62.1%
Distinct3058
Distinct (%)85.1%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
2023-12-12T20:59:18.984164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length45
Mean length27.152476
Min length7

Characters and Unicode

Total characters97586
Distinct characters664
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2658 ?
Unique (%)74.0%

Sample

1st row『FIFA U-20 월드컵코리아 2017』 천안경기 홍보 요청
2nd row하수도 악취발생 대상지 조사 제출
3rd row12월 이장회의 알림
4th row『2017년도 스포츠강좌 이용권』신청 안내 협조요청
5th row【시정공유】겨울철 내 집 · 내 점포 앞 눈치우기 운동 홍보 협조
ValueCountFrequency (%)
알림 1363
 
6.0%
홍보 999
 
4.4%
요청 798
 
3.5%
협조 763
 
3.4%
2018년 734
 
3.3%
503
 
2.2%
신청 486
 
2.2%
지원사업 412
 
1.8%
안내 377
 
1.7%
2019년 339
 
1.5%
Other values (4068) 15779
70.0%
2023-12-12T20:59:19.621884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18968
 
19.4%
1 2503
 
2.6%
2 2271
 
2.3%
0 2192
 
2.2%
1996
 
2.0%
1951
 
2.0%
1947
 
2.0%
1641
 
1.7%
1514
 
1.6%
1497
 
1.5%
Other values (654) 61106
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65913
67.5%
Space Separator 18968
 
19.4%
Decimal Number 9808
 
10.1%
Open Punctuation 914
 
0.9%
Close Punctuation 914
 
0.9%
Other Punctuation 589
 
0.6%
Uppercase Letter 280
 
0.3%
Lowercase Letter 56
 
0.1%
Initial Punctuation 46
 
< 0.1%
Final Punctuation 38
 
< 0.1%
Other values (4) 60
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1996
 
3.0%
1951
 
3.0%
1947
 
3.0%
1641
 
2.5%
1514
 
2.3%
1497
 
2.3%
1427
 
2.2%
1318
 
2.0%
1257
 
1.9%
1203
 
1.8%
Other values (578) 50162
76.1%
Uppercase Letter
ValueCountFrequency (%)
A 42
15.0%
S 30
10.7%
T 26
9.3%
I 24
 
8.6%
P 24
 
8.6%
F 21
 
7.5%
C 20
 
7.1%
L 13
 
4.6%
G 10
 
3.6%
O 10
 
3.6%
Other values (12) 60
21.4%
Lowercase Letter
ValueCountFrequency (%)
e 12
21.4%
l 9
16.1%
a 9
16.1%
p 5
8.9%
h 4
 
7.1%
g 4
 
7.1%
n 4
 
7.1%
c 4
 
7.1%
t 2
 
3.6%
y 2
 
3.6%
Decimal Number
ValueCountFrequency (%)
1 2503
25.5%
2 2271
23.2%
0 2192
22.3%
8 1154
11.8%
9 629
 
6.4%
7 556
 
5.7%
3 214
 
2.2%
6 132
 
1.3%
4 84
 
0.9%
5 73
 
0.7%
Other Punctuation
ValueCountFrequency (%)
· 209
35.5%
, 169
28.7%
' 99
16.8%
. 93
15.8%
: 5
 
0.8%
3
 
0.5%
/ 3
 
0.5%
! 3
 
0.5%
% 3
 
0.5%
& 2
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 515
56.3%
253
27.7%
126
 
13.8%
[ 14
 
1.5%
6
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 515
56.3%
253
27.7%
126
 
13.8%
] 14
 
1.5%
6
 
0.7%
Math Symbol
ValueCountFrequency (%)
~ 27
73.0%
5
 
13.5%
> 2
 
5.4%
< 2
 
5.4%
+ 1
 
2.7%
Initial Punctuation
ValueCountFrequency (%)
27
58.7%
19
41.3%
Final Punctuation
ValueCountFrequency (%)
20
52.6%
18
47.4%
Space Separator
ValueCountFrequency (%)
18968
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65845
67.5%
Common 31336
32.1%
Latin 337
 
0.3%
Han 68
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1996
 
3.0%
1951
 
3.0%
1947
 
3.0%
1641
 
2.5%
1514
 
2.3%
1497
 
2.3%
1427
 
2.2%
1318
 
2.0%
1257
 
1.9%
1203
 
1.8%
Other values (570) 50094
76.1%
Common
ValueCountFrequency (%)
18968
60.5%
1 2503
 
8.0%
2 2271
 
7.2%
0 2192
 
7.0%
8 1154
 
3.7%
9 629
 
2.0%
7 556
 
1.8%
( 515
 
1.6%
) 515
 
1.6%
253
 
0.8%
Other values (32) 1780
 
5.7%
Latin
ValueCountFrequency (%)
A 42
 
12.5%
S 30
 
8.9%
T 26
 
7.7%
I 24
 
7.1%
P 24
 
7.1%
F 21
 
6.2%
C 20
 
5.9%
L 13
 
3.9%
e 12
 
3.6%
G 10
 
3.0%
Other values (24) 115
34.1%
Han
ValueCountFrequency (%)
29
42.6%
28
41.2%
4
 
5.9%
2
 
2.9%
2
 
2.9%
1
 
1.5%
1
 
1.5%
1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65839
67.5%
ASCII 30601
31.4%
None 982
 
1.0%
Punctuation 84
 
0.1%
CJK 68
 
0.1%
Compat Jamo 6
 
< 0.1%
Arrows 5
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18968
62.0%
1 2503
 
8.2%
2 2271
 
7.4%
0 2192
 
7.2%
8 1154
 
3.8%
9 629
 
2.1%
7 556
 
1.8%
( 515
 
1.7%
) 515
 
1.7%
3 214
 
0.7%
Other values (52) 1084
 
3.5%
Hangul
ValueCountFrequency (%)
1996
 
3.0%
1951
 
3.0%
1947
 
3.0%
1641
 
2.5%
1514
 
2.3%
1497
 
2.3%
1427
 
2.2%
1318
 
2.0%
1257
 
1.9%
1203
 
1.8%
Other values (569) 50088
76.1%
None
ValueCountFrequency (%)
253
25.8%
253
25.8%
· 209
21.3%
126
12.8%
126
12.8%
6
 
0.6%
6
 
0.6%
3
 
0.3%
CJK
ValueCountFrequency (%)
29
42.6%
28
41.2%
4
 
5.9%
2
 
2.9%
2
 
2.9%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Punctuation
ValueCountFrequency (%)
27
32.1%
20
23.8%
19
22.6%
18
21.4%
Compat Jamo
ValueCountFrequency (%)
6
100.0%
Arrows
ValueCountFrequency (%)
5
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

Missing values

2023-12-12T20:59:15.348558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:59:15.503684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발송일자발송지역발송팀발송자문서제목
02016-12-09 11:00:00(주)세이정보기술외64율량사천동 행정민원팀이0수『FIFA U-20 월드컵코리아 2017』 천안경기 홍보 요청
12016-12-09 11:00:00(주)세이정보기술외64율량사천동 행정민원팀이0수하수도 악취발생 대상지 조사 제출
22016-12-12 11:00:00(주)세이정보기술외39남이면 행정팀오0술12월 이장회의 알림
32016-12-14 11:00:00(주)세이정보기술외59용암1동 행정민원팀김0진『2017년도 스포츠강좌 이용권』신청 안내 협조요청
42016-12-19 11:00:00(주)세이정보기술외64율량사천동 행정민원팀이0수【시정공유】겨울철 내 집 · 내 점포 앞 눈치우기 운동 홍보 협조
52016-12-23 11:00:00(주)세이정보기술외64율량사천동 행정민원팀이0수【시정공유】청주 실외 스케이트·썰매장 이용 홍보 요청
62016-12-26 10:08:00(주)세이정보기술외50오송읍 행정팀이0진2017년 1월 이장회의 알림
72017-01-03 11:00:00(주)세이정보기술강내면 행정팀김0수2016년 2분기 경로당 운영비 보조금 신청
82017-01-11 09:36:001통(7)외63율량사천동 행정민원팀이0수【시정공유】시 홈페이지 안전신문고 코너 활용 안내
92017-01-11 09:37:001통(7)외63율량사천동 행정민원팀이0수【시정공유】안전신고 체험수기 공모 참여 및 홍보 협조
발송일자발송지역발송팀발송자문서제목
35842023-01-17 02:52:00정보통신과_시험정보통신과 정보개발팀김0우충청북도 조직개편에 따른 조직정보(LDAP) 변경 및 전자문서유통 중단 안내
35852023-01-17 02:57:000정보통신과 정보개발팀김0우온-나라 문서시스템 데이터 이관에 따른 서비스 일시중단 안내
35862023-01-17 03:15:00정보통신과_시험정보통신과 정보개발팀김0우2023년 상반기 조직개편에 따른 전자문서 유통 중단 안내
35872023-02-06 01:02:000현도면 행정민원팀정0완마스크 착용 의무화 방역지침 변경에 따른 홍보 협조 요청
35882023-02-06 01:49:00상삼리외30현도면 행정민원팀정0완마스크 착용 의무화 방역지침 변경에 따른 홍보 협조 요청
35892023-03-21 10:09:00오송읍장오송읍 행정팀김0림제104주년 3·1절「나라사랑 태극기 달기 운동」추진계획 알림
35902023-03-21 10:18:00오송읍장오송읍 행정팀김0림제104주년 3·1절「나라사랑 태극기 달기 운동」추진계획 알림
35912023-04-05 03:26:00만수4리오송읍 행정팀김0림청주시 도로명주소 부여등 고시
35922023-04-06 04:29:000오송읍 행정팀김0림밀양 봄나물 4종(땅두릅,참두릅,엄나무순,가죽나물) 홍보·판매 안내
35932023-04-13 11:39:00오송읍장외2오송읍 행정팀김0림2023년 스포츠강좌이용권 신청자 추가접수 홍보 협조 요청

Duplicate rows

Most frequently occurring

발송일자발송지역발송팀발송자문서제목# duplicates
02017-12-29 11:27:00관정1리외21낭성면 산업팀채0욱2018년 시설원예 에너지이용 효율화(절감시설, 목재펠릿) 지원사업 신청 홍보2
12018-09-20 09:12:001통외35운천신봉동 행정민원팀김0림'2018 생명문화도시 시민실천 콘테스트' 공모 알림 및 참여 요청2