Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells249
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Text4
Numeric1

Dataset

Description충남도청 행정자료실, 인재개발원 자료실, 농업기술원 자료실, 충청남도의회 의정자료실의 소장자료에 대한 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=402&beforeMenuCd=DOM_000000201001001000&publicdatapk=3071594

Alerts

발행자 has 107 (1.1%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:58:13.527894
Analysis finished2024-01-09 20:58:15.427298
Duration1.9 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9997
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T05:58:15.586131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9994 ?
Unique (%)99.9%

Sample

1st rowAM0000006998
2nd rowDM0000005651
3rd rowEM0000000146
4th rowAM0000020251
5th rowDM0000005026
ValueCountFrequency (%)
em0000006703 2
 
< 0.1%
am0000014372 2
 
< 0.1%
am0000014487 2
 
< 0.1%
em0000010924 1
 
< 0.1%
em0000020767 1
 
< 0.1%
em0000000345 1
 
< 0.1%
em0000022706 1
 
< 0.1%
dm0000010529 1
 
< 0.1%
am0000006998 1
 
< 0.1%
em0000018373 1
 
< 0.1%
Other values (9987) 9987
99.9%
2024-01-10T05:58:15.925274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 58681
48.9%
M 10000
 
8.3%
1 7916
 
6.6%
2 6362
 
5.3%
E 5289
 
4.4%
4 4146
 
3.5%
5 4064
 
3.4%
6 4057
 
3.4%
3 3982
 
3.3%
9 3718
 
3.1%
Other values (8) 11785
 
9.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99999
83.3%
Uppercase Letter 20000
 
16.7%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 58681
58.7%
1 7916
 
7.9%
2 6362
 
6.4%
4 4146
 
4.1%
5 4064
 
4.1%
6 4057
 
4.1%
3 3982
 
4.0%
9 3718
 
3.7%
7 3551
 
3.6%
8 3522
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
M 10000
50.0%
E 5289
26.4%
A 2021
 
10.1%
D 1274
 
6.4%
S 1250
 
6.2%
C 133
 
0.7%
V 33
 
0.2%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 58681
58.7%
1 7916
 
7.9%
2 6362
 
6.4%
4 4146
 
4.1%
5 4064
 
4.1%
6 4057
 
4.1%
3 3982
 
4.0%
9 3718
 
3.7%
7 3551
 
3.6%
8 3522
 
3.5%
Latin
ValueCountFrequency (%)
M 10000
50.0%
E 5289
26.4%
A 2021
 
10.1%
D 1274
 
6.4%
S 1250
 
6.2%
C 133
 
0.7%
V 33
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 58681
48.9%
M 10000
 
8.3%
1 7916
 
6.6%
2 6362
 
5.3%
E 5289
 
4.4%
4 4146
 
3.5%
5 4064
 
3.4%
6 4057
 
3.4%
3 3982
 
3.3%
9 3718
 
3.1%
Other values (8) 11785
 
9.8%

서명
Text

Distinct9674
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T05:58:16.244869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length168
Median length78
Mean length18.3462
Min length1

Characters and Unicode

Total characters183462
Distinct characters1615
Distinct categories15 ?
Distinct scripts6 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9417 ?
Unique (%)94.2%

Sample

1st row1996년도 농사시험연구사업설계서
2nd row경상분지에서의 지진연구(3)
3rd row민화
4th row지하수관리계획 보고서
5th row농정 전환에 따른 정책대상 범위의 조정
ValueCountFrequency (%)
2854
 
7.0%
연구 428
 
1.1%
1 307
 
0.8%
위한 307
 
0.8%
2 282
 
0.7%
272
 
0.7%
관한 215
 
0.5%
장편소설 148
 
0.4%
충청남도 136
 
0.3%
3 125
 
0.3%
Other values (19790) 35449
87.5%
2024-01-10T05:58:16.736438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31310
 
17.1%
3311
 
1.8%
. 3290
 
1.8%
0 3048
 
1.7%
) 2887
 
1.6%
( 2884
 
1.6%
2 2743
 
1.5%
1 2614
 
1.4%
: 2348
 
1.3%
2114
 
1.2%
Other values (1605) 126913
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120581
65.7%
Space Separator 31310
 
17.1%
Decimal Number 13269
 
7.2%
Other Punctuation 7018
 
3.8%
Lowercase Letter 3490
 
1.9%
Close Punctuation 2901
 
1.6%
Open Punctuation 2898
 
1.6%
Uppercase Letter 1478
 
0.8%
Math Symbol 241
 
0.1%
Dash Punctuation 192
 
0.1%
Other values (5) 84
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3311
 
2.7%
2114
 
1.8%
2093
 
1.7%
2010
 
1.7%
1787
 
1.5%
1759
 
1.5%
1645
 
1.4%
1586
 
1.3%
1562
 
1.3%
1423
 
1.2%
Other values (1495) 101291
84.0%
Lowercase Letter
ValueCountFrequency (%)
e 446
12.8%
o 334
 
9.6%
i 281
 
8.1%
a 281
 
8.1%
t 268
 
7.7%
n 267
 
7.7%
r 215
 
6.2%
s 214
 
6.1%
l 147
 
4.2%
h 130
 
3.7%
Other values (16) 907
26.0%
Uppercase Letter
ValueCountFrequency (%)
S 139
 
9.4%
O 123
 
8.3%
T 111
 
7.5%
C 105
 
7.1%
I 100
 
6.8%
E 99
 
6.7%
D 86
 
5.8%
A 82
 
5.5%
P 67
 
4.5%
B 66
 
4.5%
Other values (16) 500
33.8%
Other Punctuation
ValueCountFrequency (%)
. 3290
46.9%
: 2348
33.5%
, 813
 
11.6%
· 176
 
2.5%
' 114
 
1.6%
! 112
 
1.6%
/ 100
 
1.4%
; 27
 
0.4%
& 16
 
0.2%
% 15
 
0.2%
Other values (4) 7
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 3048
23.0%
2 2743
20.7%
1 2614
19.7%
9 1477
11.1%
3 853
 
6.4%
8 591
 
4.5%
4 557
 
4.2%
7 480
 
3.6%
5 476
 
3.6%
6 430
 
3.2%
Other Number
ValueCountFrequency (%)
10
23.8%
10
23.8%
8
19.0%
6
14.3%
5
11.9%
2
 
4.8%
1
 
2.4%
Math Symbol
ValueCountFrequency (%)
= 130
53.9%
~ 82
34.0%
> 11
 
4.6%
< 11
 
4.6%
+ 5
 
2.1%
2
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 2887
99.5%
] 7
 
0.2%
5
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2884
99.5%
[ 7
 
0.2%
5
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
16
50.0%
8
25.0%
6
 
18.8%
2
 
6.2%
Other Symbol
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
31310
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 192
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 119664
65.2%
Common 57881
31.5%
Latin 5000
 
2.7%
Han 907
 
0.5%
Hiragana 7
 
< 0.1%
Katakana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3311
 
2.8%
2114
 
1.8%
2093
 
1.7%
2010
 
1.7%
1787
 
1.5%
1759
 
1.5%
1645
 
1.4%
1586
 
1.3%
1562
 
1.3%
1423
 
1.2%
Other values (1116) 100374
83.9%
Han
ValueCountFrequency (%)
22
 
2.4%
20
 
2.2%
19
 
2.1%
17
 
1.9%
15
 
1.7%
14
 
1.5%
14
 
1.5%
14
 
1.5%
12
 
1.3%
12
 
1.3%
Other values (363) 748
82.5%
Latin
ValueCountFrequency (%)
e 446
 
8.9%
o 334
 
6.7%
i 281
 
5.6%
a 281
 
5.6%
t 268
 
5.4%
n 267
 
5.3%
r 215
 
4.3%
s 214
 
4.3%
l 147
 
2.9%
S 139
 
2.8%
Other values (46) 2408
48.2%
Common
ValueCountFrequency (%)
31310
54.1%
. 3290
 
5.7%
0 3048
 
5.3%
) 2887
 
5.0%
( 2884
 
5.0%
2 2743
 
4.7%
1 2614
 
4.5%
: 2348
 
4.1%
9 1477
 
2.6%
3 853
 
1.5%
Other values (44) 4427
 
7.6%
Hiragana
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Katakana
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 119657
65.2%
ASCII 62603
34.1%
CJK 881
 
0.5%
None 192
 
0.1%
Enclosed Alphanum 42
 
< 0.1%
Number Forms 32
 
< 0.1%
CJK Compat Ideographs 26
 
< 0.1%
Hiragana 7
 
< 0.1%
Compat Jamo 7
 
< 0.1%
Letterlike Symbols 4
 
< 0.1%
Other values (5) 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31310
50.0%
. 3290
 
5.3%
0 3048
 
4.9%
) 2887
 
4.6%
( 2884
 
4.6%
2 2743
 
4.4%
1 2614
 
4.2%
: 2348
 
3.8%
9 1477
 
2.4%
3 853
 
1.4%
Other values (75) 9149
 
14.6%
Hangul
ValueCountFrequency (%)
3311
 
2.8%
2114
 
1.8%
2093
 
1.7%
2010
 
1.7%
1787
 
1.5%
1759
 
1.5%
1645
 
1.4%
1586
 
1.3%
1562
 
1.3%
1423
 
1.2%
Other values (1112) 100367
83.9%
None
ValueCountFrequency (%)
· 176
91.7%
5
 
2.6%
5
 
2.6%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
CJK
ValueCountFrequency (%)
22
 
2.5%
20
 
2.3%
19
 
2.2%
17
 
1.9%
15
 
1.7%
14
 
1.6%
14
 
1.6%
14
 
1.6%
12
 
1.4%
12
 
1.4%
Other values (345) 722
82.0%
Number Forms
ValueCountFrequency (%)
16
50.0%
8
25.0%
6
 
18.8%
2
 
6.2%
Enclosed Alphanum
ValueCountFrequency (%)
10
23.8%
10
23.8%
8
19.0%
6
14.3%
5
11.9%
2
 
4.8%
1
 
2.4%
CJK Compat Ideographs
ValueCountFrequency (%)
7
26.9%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (8) 8
30.8%
Letterlike Symbols
ValueCountFrequency (%)
4
100.0%
Hiragana
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Punctuation
ValueCountFrequency (%)
3
100.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
2
28.6%
2
28.6%
2
28.6%
1
14.3%
Katakana
ValueCountFrequency (%)
2
66.7%
1
33.3%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct5804
Distinct (%)58.5%
Missing81
Missing (%)0.8%
Memory size156.2 KiB
2024-01-10T05:58:17.075404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length246
Median length166
Mean length9.5125517
Min length1

Characters and Unicode

Total characters94355
Distinct characters1122
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4918 ?
Unique (%)49.6%

Sample

1st row충청남도 편
2nd row한국자원연구원 편
3rd row김영학 글,사진
4th row충청남도 수질관리과
5th row한국농촌경제연구원 편
ValueCountFrequency (%)
1609
 
6.9%
1532
 
6.6%
지음 1446
 
6.2%
충청남도 968
 
4.2%
옮김 963
 
4.2%
734
 
3.2%
그림 211
 
0.9%
189
 
0.8%
엮음 146
 
0.6%
통계청 140
 
0.6%
Other values (8020) 15266
65.8%
2024-01-10T05:58:17.483808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13676
 
14.5%
. 2582
 
2.7%
2287
 
2.4%
2192
 
2.3%
2129
 
2.3%
1833
 
1.9%
1712
 
1.8%
1639
 
1.7%
1637
 
1.7%
1564
 
1.7%
Other values (1112) 63104
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 72256
76.6%
Space Separator 13676
 
14.5%
Other Punctuation 5932
 
6.3%
Lowercase Letter 811
 
0.9%
Uppercase Letter 577
 
0.6%
Open Punctuation 465
 
0.5%
Close Punctuation 464
 
0.5%
Decimal Number 137
 
0.1%
Math Symbol 26
 
< 0.1%
Dash Punctuation 9
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2287
 
3.2%
2192
 
3.0%
2129
 
2.9%
1833
 
2.5%
1712
 
2.4%
1639
 
2.3%
1637
 
2.3%
1564
 
2.2%
1526
 
2.1%
1512
 
2.1%
Other values (1032) 54225
75.0%
Lowercase Letter
ValueCountFrequency (%)
a 93
11.5%
o 88
10.9%
i 84
 
10.4%
e 63
 
7.8%
h 54
 
6.7%
s 50
 
6.2%
n 47
 
5.8%
t 44
 
5.4%
r 37
 
4.6%
m 34
 
4.2%
Other values (15) 217
26.8%
Uppercase Letter
ValueCountFrequency (%)
S 69
12.0%
B 56
 
9.7%
A 51
 
8.8%
K 45
 
7.8%
T 37
 
6.4%
O 34
 
5.9%
J 33
 
5.7%
M 33
 
5.7%
C 31
 
5.4%
R 25
 
4.3%
Other values (14) 163
28.2%
Decimal Number
ValueCountFrequency (%)
1 33
24.1%
2 30
21.9%
0 25
18.2%
5 13
 
9.5%
8 13
 
9.5%
3 7
 
5.1%
6 7
 
5.1%
4 6
 
4.4%
7 2
 
1.5%
9 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 2582
43.5%
; 1556
26.2%
, 1457
24.6%
: 230
 
3.9%
· 90
 
1.5%
/ 10
 
0.2%
& 4
 
0.1%
' 2
 
< 0.1%
@ 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 446
95.9%
( 18
 
3.9%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
] 445
95.9%
) 18
 
3.9%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 13
50.0%
< 13
50.0%
Space Separator
ValueCountFrequency (%)
13676
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71593
75.9%
Common 20710
 
21.9%
Latin 1388
 
1.5%
Han 663
 
0.7%
Cyrillic 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2287
 
3.2%
2192
 
3.1%
2129
 
3.0%
1833
 
2.6%
1712
 
2.4%
1639
 
2.3%
1637
 
2.3%
1564
 
2.2%
1526
 
2.1%
1512
 
2.1%
Other values (771) 53562
74.8%
Han
ValueCountFrequency (%)
77
 
11.6%
23
 
3.5%
21
 
3.2%
19
 
2.9%
18
 
2.7%
14
 
2.1%
13
 
2.0%
13
 
2.0%
9
 
1.4%
9
 
1.4%
Other values (251) 447
67.4%
Latin
ValueCountFrequency (%)
a 93
 
6.7%
o 88
 
6.3%
i 84
 
6.1%
S 69
 
5.0%
e 63
 
4.5%
B 56
 
4.0%
h 54
 
3.9%
A 51
 
3.7%
s 50
 
3.6%
n 47
 
3.4%
Other values (39) 733
52.8%
Common
ValueCountFrequency (%)
13676
66.0%
. 2582
 
12.5%
; 1556
 
7.5%
, 1457
 
7.0%
[ 446
 
2.2%
] 445
 
2.1%
: 230
 
1.1%
· 90
 
0.4%
1 33
 
0.2%
2 30
 
0.1%
Other values (20) 165
 
0.8%
Cyrillic
ValueCountFrequency (%)
П 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71592
75.9%
ASCII 22004
 
23.3%
CJK 633
 
0.7%
None 92
 
0.1%
CJK Compat Ideographs 30
 
< 0.1%
Number Forms 1
 
< 0.1%
Cyrillic 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13676
62.2%
. 2582
 
11.7%
; 1556
 
7.1%
, 1457
 
6.6%
[ 446
 
2.0%
] 445
 
2.0%
: 230
 
1.0%
a 93
 
0.4%
o 88
 
0.4%
i 84
 
0.4%
Other values (64) 1347
 
6.1%
Hangul
ValueCountFrequency (%)
2287
 
3.2%
2192
 
3.1%
2129
 
3.0%
1833
 
2.6%
1712
 
2.4%
1639
 
2.3%
1637
 
2.3%
1564
 
2.2%
1526
 
2.1%
1512
 
2.1%
Other values (770) 53561
74.8%
None
ValueCountFrequency (%)
· 90
97.8%
1
 
1.1%
1
 
1.1%
CJK
ValueCountFrequency (%)
77
 
12.2%
23
 
3.6%
21
 
3.3%
19
 
3.0%
14
 
2.2%
13
 
2.1%
13
 
2.1%
9
 
1.4%
9
 
1.4%
8
 
1.3%
Other values (241) 427
67.5%
CJK Compat Ideographs
ValueCountFrequency (%)
18
60.0%
3
 
10.0%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Cyrillic
ValueCountFrequency (%)
П 1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발행자
Text

MISSING 

Distinct2701
Distinct (%)27.3%
Missing107
Missing (%)1.1%
Memory size156.2 KiB
2024-01-10T05:58:17.699197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length32
Mean length5.5417972
Min length1

Characters and Unicode

Total characters54825
Distinct characters807
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1623 ?
Unique (%)16.4%

Sample

1st row농촌진흥원
2nd row한국자원연구원
3rd row대원사
4th row충청남도
5th row한국농촌경제연구원
ValueCountFrequency (%)
충청남도 1042
 
9.8%
한국지방행정연구원 151
 
1.4%
통계청 140
 
1.3%
경기개발연구원 135
 
1.3%
문학동네 123
 
1.2%
교통개발연구원 116
 
1.1%
한국행정연구원 116
 
1.1%
한국보건사회연구원 106
 
1.0%
국토개발연구원 91
 
0.9%
위즈덤하우스 91
 
0.9%
Other values (2812) 8495
80.1%
2024-01-10T05:58:18.085410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2187
 
4.0%
1709
 
3.1%
1596
 
2.9%
1585
 
2.9%
1562
 
2.8%
1545
 
2.8%
1429
 
2.6%
1428
 
2.6%
1403
 
2.6%
1312
 
2.4%
Other values (797) 39069
71.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52067
95.0%
Lowercase Letter 846
 
1.5%
Space Separator 715
 
1.3%
Uppercase Letter 483
 
0.9%
Other Punctuation 315
 
0.6%
Decimal Number 215
 
0.4%
Close Punctuation 90
 
0.2%
Open Punctuation 90
 
0.2%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2187
 
4.2%
1709
 
3.3%
1596
 
3.1%
1585
 
3.0%
1562
 
3.0%
1545
 
3.0%
1429
 
2.7%
1428
 
2.7%
1403
 
2.7%
1312
 
2.5%
Other values (725) 36311
69.7%
Lowercase Letter
ValueCountFrequency (%)
o 157
18.6%
s 72
 
8.5%
a 71
 
8.4%
e 62
 
7.3%
n 61
 
7.2%
i 59
 
7.0%
k 47
 
5.6%
r 41
 
4.8%
m 41
 
4.8%
u 29
 
3.4%
Other values (15) 206
24.3%
Uppercase Letter
ValueCountFrequency (%)
B 101
20.9%
K 59
12.2%
S 51
10.6%
M 46
9.5%
H 33
 
6.8%
R 30
 
6.2%
P 19
 
3.9%
T 19
 
3.9%
C 17
 
3.5%
O 15
 
3.1%
Other values (14) 93
19.3%
Decimal Number
ValueCountFrequency (%)
2 84
39.1%
1 83
38.6%
0 19
 
8.8%
5 10
 
4.7%
8 9
 
4.2%
4 3
 
1.4%
9 3
 
1.4%
3 2
 
0.9%
6 2
 
0.9%
Other Punctuation
ValueCountFrequency (%)
: 191
60.6%
. 61
 
19.4%
& 37
 
11.7%
, 15
 
4.8%
· 7
 
2.2%
@ 2
 
0.6%
# 1
 
0.3%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 86
95.6%
] 4
 
4.4%
Open Punctuation
ValueCountFrequency (%)
( 86
95.6%
[ 4
 
4.4%
Space Separator
ValueCountFrequency (%)
715
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51583
94.1%
Common 1429
 
2.6%
Latin 1329
 
2.4%
Han 484
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2187
 
4.2%
1709
 
3.3%
1596
 
3.1%
1585
 
3.1%
1562
 
3.0%
1545
 
3.0%
1429
 
2.8%
1428
 
2.8%
1403
 
2.7%
1312
 
2.5%
Other values (606) 35827
69.5%
Han
ValueCountFrequency (%)
72
 
14.9%
28
 
5.8%
20
 
4.1%
19
 
3.9%
18
 
3.7%
16
 
3.3%
15
 
3.1%
15
 
3.1%
15
 
3.1%
14
 
2.9%
Other values (109) 252
52.1%
Latin
ValueCountFrequency (%)
o 157
 
11.8%
B 101
 
7.6%
s 72
 
5.4%
a 71
 
5.3%
e 62
 
4.7%
n 61
 
4.6%
i 59
 
4.4%
K 59
 
4.4%
S 51
 
3.8%
k 47
 
3.5%
Other values (39) 589
44.3%
Common
ValueCountFrequency (%)
715
50.0%
: 191
 
13.4%
) 86
 
6.0%
( 86
 
6.0%
2 84
 
5.9%
1 83
 
5.8%
. 61
 
4.3%
& 37
 
2.6%
0 19
 
1.3%
, 15
 
1.0%
Other values (13) 52
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51576
94.1%
ASCII 2750
 
5.0%
CJK 483
 
0.9%
None 8
 
< 0.1%
Compat Jamo 7
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2187
 
4.2%
1709
 
3.3%
1596
 
3.1%
1585
 
3.1%
1562
 
3.0%
1545
 
3.0%
1429
 
2.8%
1428
 
2.8%
1403
 
2.7%
1312
 
2.5%
Other values (603) 35820
69.5%
ASCII
ValueCountFrequency (%)
715
26.0%
: 191
 
6.9%
o 157
 
5.7%
B 101
 
3.7%
) 86
 
3.1%
( 86
 
3.1%
2 84
 
3.1%
1 83
 
3.0%
s 72
 
2.6%
a 71
 
2.6%
Other values (60) 1104
40.1%
CJK
ValueCountFrequency (%)
72
 
14.9%
28
 
5.8%
20
 
4.1%
19
 
3.9%
18
 
3.7%
16
 
3.3%
15
 
3.1%
15
 
3.1%
15
 
3.1%
14
 
2.9%
Other values (108) 251
52.0%
None
ValueCountFrequency (%)
· 7
87.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

발행년
Real number (ℝ)

Distinct70
Distinct (%)0.7%
Missing61
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean2005.2109
Minimum1928
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T05:58:18.223412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1928
5-th percentile1991
Q12000
median2006
Q32012
95-th percentile2020
Maximum2022
Range94
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.9836392
Coefficient of variation (CV)0.0044801469
Kurtosis4.0813911
Mean2005.2109
Median Absolute Deviation (MAD)6
Skewness-0.86967922
Sum19929791
Variance80.705774
MonotonicityNot monotonic
2024-01-10T05:58:18.386446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2007 618
 
6.2%
2006 593
 
5.9%
2003 512
 
5.1%
2001 505
 
5.1%
2002 495
 
5.0%
2000 398
 
4.0%
1999 381
 
3.8%
2008 378
 
3.8%
1998 378
 
3.8%
2013 361
 
3.6%
Other values (60) 5320
53.2%
ValueCountFrequency (%)
1928 1
< 0.1%
1929 1
< 0.1%
1931 1
< 0.1%
1932 1
< 0.1%
1938 2
< 0.1%
1939 1
< 0.1%
1947 2
< 0.1%
1948 1
< 0.1%
1949 1
< 0.1%
1950 1
< 0.1%
ValueCountFrequency (%)
2022 83
 
0.8%
2021 210
2.1%
2020 231
2.3%
2019 172
1.7%
2018 198
2.0%
2017 166
1.7%
2016 221
2.2%
2015 239
2.4%
2014 327
3.3%
2013 361
3.6%

Interactions

2024-01-10T05:58:15.075782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-01-10T05:58:15.181008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:58:15.269285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T05:58:15.365273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

등록번호서명저작자발행자발행년
1542AM00000069981996년도 농사시험연구사업설계서충청남도 편농촌진흥원1996
10227DM0000005651경상분지에서의 지진연구(3)한국자원연구원 편한국자원연구원1997
14117EM0000000146민화김영학 글,사진대원사1993
6314AM0000020251지하수관리계획 보고서충청남도 수질관리과충청남도2013
9616DM0000005026농정 전환에 따른 정책대상 범위의 조정한국농촌경제연구원 편한국농촌경제연구원1994
5763AM0000019631충남도청(내포) 신도시 신·재생에너지 도입방안 연구한국에너지기술연구원충청남도2010
14873EM0000003099전자정부론정충식 지음녹두1997
22587EM0000014375담금질 : 안희정의 새로운 시작안희정나남2008
7300AM0000021287가전천 하천기본계획 : 부록충청남도충청남도2017
18148EM0000009659오다노부나가. 2 : 효웅의 죽음야마오카 소하치 지음 ; 이길진 옮김솔출판사2002
등록번호서명저작자발행자발행년
11253DM0000008651인천시와 기타규슈시간 중소기업 교류증진방안 : 기술교류분야를 중심으로인천발전연구원 편.인천발전연구원2000
22715EM0000014504드라마 맛있게 읽기정수연북인2008
12064DM0000009533지식기반사회의 교통정책방향 : 텔레쇼핑이 화물운송변화에 미치는 영향및 교통정책방향.홍갑선.교통개발연구원2003
28675EM0000020600공주 부여 여행레시피 : 공주·부여 여행이 더욱 즐거워지는 완벽 가이드강희은 지음즐거운상상2014
28661EM0000020586이방인을 보았다구경미북멘토2014
19259EM0000010924도시 주거단지계획양동석기술당1998
29946EM0000021889고환율의 음모 : 서민지갑을 강탈한 검은 손의 실체송기균 지음21세기북스2012
7622AM0000021614천안 축산자원개발부 부지활용 기본구상 수립 연구, 최종보고서충청남도충청남도2019
4025AM0000015436황화천 하천정비기본계획 보고서. .충청남도 편충청남도2005
17180EM0000006913그것이있어야할자리<NA>도솔2001