Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 767
Missing cells (%): 0.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 888.7 KiB
Average record size in memory: 91.0 B
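
The summary figures above can be cross-checked with a little arithmetic. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total size divided by the row count. The per-column missing counts are the ones reported in the variable sections of this report; attributing the counts 99 and 77 to 출판사 and 출판일 is an inference from the column order.

```python
# Figures copied from this report. The formulas are assumptions that
# happen to reproduce the reported summary values.
n_variables = 10
n_observations = 10_000
missing_per_column = {"서지번호": 17, "저자": 574, "출판사": 99, "출판일": 77}

missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / (n_variables * n_observations)

total_size_kib = 888.7
avg_record_bytes = total_size_kib * 1024 / n_observations

print(missing_cells)                 # total number of missing cells
print(f"{missing_pct:.1f}%")         # share of all cells that are missing
print(f"{avg_record_bytes:.1f} B")   # average bytes per row
```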

Variable types

Numeric: 2
Text: 5
Categorical: 3

Dataset

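Description: 자료번호 (item number), 청구기호 (call number), 서지번호 (bibliographic number), 서명 (title), 저자 (author), 출판사 (publisher), 출판일 (publication date), 배가위치코드 (shelving location code), 배가위치명 (shelving location name), 언어명 (language name)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-2251/S/1/datasetView.do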

Alerts

[High correlation] 배가위치코드 is highly overall correlated with 자료번호 and 3 other fields
[High correlation] 언어명 is highly overall correlated with 배가위치코드 and 1 other fields
[High correlation] 배가위치명 is highly overall correlated with 자료번호 and 3 other fields
[High correlation] 자료번호 is highly overall correlated with 서지번호 and 2 other fields
[High correlation] 서지번호 is highly overall correlated with 자료번호 and 2 other fields
[Imbalance] 배가위치코드 is highly imbalanced (89.7%)
[Imbalance] 배가위치명 is highly imbalanced (89.7%)
[Imbalance] 언어명 is highly imbalanced (72.8%)
[Missing] 저자 has 574 (5.7%) missing values
[Unique] 자료번호 has unique values
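
The imbalance percentages in the alerts are consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the category frequencies and k is the number of categories. Whether this is exactly the formula the profiler uses is an assumption. The sketch below only shows that it reproduces the reported 89.7% and 72.8% from the category counts listed in the categorical sections of this report, with `<NA>` counted as a category.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of category counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch: one category, no score
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts copied from the report: 배가위치코드 and 언어명.
location_counts = [9865, 135]
language_counts = [8555, 1257, 120, 41, 24, 3]

location_score = imbalance_score(location_counts)
language_score = imbalance_score(language_counts)
print(f"{location_score:.1%}")
print(f"{language_score:.1%}")
```

A score near 100% means almost all rows fall into one category; a score of 0% means the categories are equally frequent.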

Reproduction

Analysis started: 2023-12-11 06:07:40.848836
Analysis finished: 2023-12-11 06:07:45.143955
Duration: 4.3 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
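
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; the timestamps are copied from the lines above.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-11 06:07:40.848836")
finished = datetime.fromisoformat("2023-12-11 06:07:45.143955")

duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```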

Variables

자료번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38591.378
Minimum: 21151
Maximum: 55249
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 21151
5-th percentile: 22905.95
Q1: 30206.75
Median: 38628.5
Q3: 47161.75
95-th percentile: 53697.25
Maximum: 55249
Range: 34098
Interquartile range (IQR): 16955

Descriptive statistics

Standard deviation: 9829.1902
Coefficient of variation (CV): 0.25469912
Kurtosis: -1.1904679
Mean: 38591.378
Median Absolute Deviation (MAD): 8467.5
Skewness: -0.03229132
Sum: 3.8591378 × 10^8
Variance: 96612980
Monotonicity: Not monotonic
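
Several of the 자료번호 statistics are simple functions of the others, which makes them easy to sanity-check. The sketch below recomputes the range, IQR, coefficient of variation, variance, and sum from the reported minimum, maximum, quartiles, mean, and standard deviation. The inputs are rounded values copied from this report, so the results agree only up to rounding.

```python
# Values copied from the 자료번호 statistics in this report.
n = 10_000
mean = 38591.378
std = 9829.1902
minimum, maximum = 21151, 55249
q1, q3 = 30206.75, 47161.75

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum (no values are missing in this column)

print(value_range, iqr)
print(f"{cv:.8f}")
print(f"{variance:.0f}")
print(f"{total:.7e}")
```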
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
48924 | 1 | < 0.1%
28725 | 1 | < 0.1%
38576 | 1 | < 0.1%
36890 | 1 | < 0.1%
35325 | 1 | < 0.1%
39103 | 1 | < 0.1%
53297 | 1 | < 0.1%
27789 | 1 | < 0.1%
41721 | 1 | < 0.1%
41989 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Smallest values

Value | Count | Frequency (%)
21151 | 1 | < 0.1%
21159 | 1 | < 0.1%
21162 | 1 | < 0.1%
21163 | 1 | < 0.1%
21168 | 1 | < 0.1%
21170 | 1 | < 0.1%
21171 | 1 | < 0.1%
21172 | 1 | < 0.1%
21177 | 1 | < 0.1%
21178 | 1 | < 0.1%

Largest values

Value | Count | Frequency (%)
55249 | 1 | < 0.1%
55247 | 1 | < 0.1%
55245 | 1 | < 0.1%
55243 | 1 | < 0.1%
55236 | 1 | < 0.1%
55234 | 1 | < 0.1%
55230 | 1 | < 0.1%
55219 | 1 | < 0.1%
55218 | 1 | < 0.1%
55216 | 1 | < 0.1%

청구기호
Text

Distinct: 9942
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 35
Median length: 31
Mean length: 19.4027
Min length: 5

Characters and Unicode

Total characters: 194027
Distinct characters: 486
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
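
To illustrate how such per-category counts can be derived, the sketch below tallies the Unicode general category of each character in one call number taken from the sample rows of this report. It uses Python's standard `unicodedata` module, which reports two-letter category codes (`Nd` = Decimal Number, `Lo` = Other Letter, `Po` = Other Punctuation, `Zs` = Space Separator). It is not the profiler's own implementation, and it does not cover scripts or blocks, which `unicodedata` does not expose.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Nd', 'Lo')."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "340.9 하195ㅈ"  # a call number from the sample rows
counts = category_counts(sample)

for code, count in counts.most_common():
    print(code, count)
```

Summing such counters over every value of a column gives a table of the same kind as the "Most occurring categories" listing.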

Unique

Unique: 9926
Unique (%): 99.3%

Sample

1st row: 340.9 하195ㅈ
2nd row: 818.08 한257한 v.8-4
3rd row: 810.81 정428여 v.17
4th row: P 911.005 서272 v.35
5th row: 911.0091 한257고 v.23
ValueCountFrequency (%)
p 2078
 
6.7%
r 624
 
2.0%
c.2 556
 
1.8%
071.1 373
 
1.2%
911.05 364
 
1.2%
911 313
 
1.0%
v.1 312
 
1.0%
v.2 296
 
0.9%
서272서 292
 
0.9%
911.6 249
 
0.8%
Other values (7300) 25704
82.5%

Most occurring characters

ValueCountFrequency (%)
49473
25.5%
1 20373
10.5%
. 14577
 
7.5%
9 12900
 
6.6%
2 12152
 
6.3%
0 10818
 
5.6%
3 8327
 
4.3%
5 8038
 
4.1%
6 7400
 
3.8%
8 7177
 
3.7%
Other values (476) 42792
22.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 97870
50.4%
Space Separator 49473
25.5%
Other Letter 20979
 
10.8%
Other Punctuation 14581
 
7.5%
Lowercase Letter 4889
 
2.5%
Uppercase Letter 4451
 
2.3%
Dash Punctuation 721
 
0.4%
Open Punctuation 488
 
0.3%
Close Punctuation 463
 
0.2%
Math Symbol 111
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1938
 
9.2%
1419
 
6.8%
813
 
3.9%
717
 
3.4%
666
 
3.2%
638
 
3.0%
604
 
2.9%
514
 
2.5%
467
 
2.2%
465
 
2.2%
Other values (419) 12738
60.7%
Lowercase Letter
ValueCountFrequency (%)
v 4782
97.8%
c 41
 
0.8%
n 13
 
0.3%
s 12
 
0.2%
e 11
 
0.2%
d 4
 
0.1%
k 4
 
0.1%
t 4
 
0.1%
r 3
 
0.1%
h 3
 
0.1%
Other values (9) 12
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
P 2275
51.1%
C 723
 
16.2%
R 626
 
14.1%
V 416
 
9.3%
A 240
 
5.4%
D 140
 
3.1%
S 13
 
0.3%
L 4
 
0.1%
N 3
 
0.1%
H 2
 
< 0.1%
Other values (8) 9
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 20373
20.8%
9 12900
13.2%
2 12152
12.4%
0 10818
11.1%
3 8327
8.5%
5 8038
 
8.2%
6 7400
 
7.6%
8 7177
 
7.3%
7 5816
 
5.9%
4 4869
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 14577
> 99.9%
? 4
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 483
99.0%
[ 5
 
1.0%
Close Punctuation
ValueCountFrequency (%)
) 458
98.9%
] 5
 
1.1%
Space Separator
ValueCountFrequency (%)
49473
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 721
100.0%
Math Symbol
ValueCountFrequency (%)
~ 111
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 163707
84.4%
Hangul 20822
 
10.7%
Latin 9341
 
4.8%
Han 157
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1938
 
9.3%
1419
 
6.8%
813
 
3.9%
717
 
3.4%
666
 
3.2%
638
 
3.1%
604
 
2.9%
514
 
2.5%
467
 
2.2%
465
 
2.2%
Other values (409) 12581
60.4%
Latin
ValueCountFrequency (%)
v 4782
51.2%
P 2275
24.4%
C 723
 
7.7%
R 626
 
6.7%
V 416
 
4.5%
A 240
 
2.6%
D 140
 
1.5%
c 41
 
0.4%
S 13
 
0.1%
n 13
 
0.1%
Other values (28) 72
 
0.8%
Common
ValueCountFrequency (%)
49473
30.2%
1 20373
12.4%
. 14577
 
8.9%
9 12900
 
7.9%
2 12152
 
7.4%
0 10818
 
6.6%
3 8327
 
5.1%
5 8038
 
4.9%
6 7400
 
4.5%
8 7177
 
4.4%
Other values (9) 12472
 
7.6%
Han
ValueCountFrequency (%)
71
45.2%
56
35.7%
23
 
14.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 173047
89.2%
Hangul 17093
 
8.8%
Compat Jamo 3729
 
1.9%
CJK 157
 
0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
49473
28.6%
1 20373
11.8%
. 14577
 
8.4%
9 12900
 
7.5%
2 12152
 
7.0%
0 10818
 
6.3%
3 8327
 
4.8%
5 8038
 
4.6%
6 7400
 
4.3%
8 7177
 
4.1%
Other values (46) 21812
12.6%
Hangul
ValueCountFrequency (%)
1938
 
11.3%
1419
 
8.3%
813
 
4.8%
666
 
3.9%
604
 
3.5%
514
 
3.0%
467
 
2.7%
455
 
2.7%
450
 
2.6%
413
 
2.4%
Other values (390) 9354
54.7%
Compat Jamo
ValueCountFrequency (%)
717
19.2%
638
17.1%
465
12.5%
455
12.2%
450
12.1%
256
 
6.9%
252
 
6.8%
193
 
5.2%
133
 
3.6%
67
 
1.8%
Other values (9) 103
 
2.8%
CJK
ValueCountFrequency (%)
71
45.2%
56
35.7%
23
 
14.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
Number Forms
ValueCountFrequency (%)
1
100.0%

서지번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9681
Distinct (%): 97.0%
Missing: 17
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 36316.34
Minimum: 19928
Maximum: 52159
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 19928
5-th percentile: 21603
Q1: 28502
Median: 36626
Q3: 44100
95-th percentile: 50610.4
Maximum: 52159
Range: 32231
Interquartile range (IQR): 15598

Descriptive statistics

Standard deviation: 9202.6033
Coefficient of variation (CV): 0.25340117
Kurtosis: -1.1570382
Mean: 36316.34
Median Absolute Deviation (MAD): 7804
Skewness: -0.03964684
Sum: 3.6254603 × 10^8
Variance: 84687908
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
39305 | 13 | 0.1%
40354 | 13 | 0.1%
39219 | 11 | 0.1%
39905 | 10 | 0.1%
39221 | 9 | 0.1%
39197 | 8 | 0.1%
40305 | 8 | 0.1%
40292 | 7 | 0.1%
39352 | 7 | 0.1%
40284 | 7 | 0.1%
Other values (9671) | 9890 | 98.9%
(Missing) | 17 | 0.2%

Smallest values

Value | Count | Frequency (%)
19928 | 1 | < 0.1%
19936 | 1 | < 0.1%
19939 | 1 | < 0.1%
19940 | 1 | < 0.1%
19945 | 1 | < 0.1%
19947 | 1 | < 0.1%
19948 | 1 | < 0.1%
19949 | 1 | < 0.1%
19954 | 1 | < 0.1%
19955 | 1 | < 0.1%

Largest values

Value | Count | Frequency (%)
52159 | 1 | < 0.1%
52157 | 1 | < 0.1%
52155 | 1 | < 0.1%
52153 | 1 | < 0.1%
52146 | 1 | < 0.1%
52144 | 1 | < 0.1%
52140 | 1 | < 0.1%
52129 | 1 | < 0.1%
52128 | 1 | < 0.1%
52126 | 1 | < 0.1%

서명
Text

Distinct: 9582
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 100
Median length: 83
Mean length: 21.9857
Min length: 1

Characters and Unicode

Total characters: 219857
Distinct characters: 2472
Distinct categories: 15
Distinct scripts: 7
Distinct blocks: 12
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9321
Unique (%): 93.2%

Sample

1st row: 죽은 자의 정치학 : 프랑스?미국?한국 국립묘지의 탄생과 진화
2nd row: 韓國口碑文學大系 8-4:慶尙南道 晋州市 晋陽郡篇(2)
3rd row: 與猶堂全集 17:政法集 第23~29卷
4th row: 韓國史論 35
5th row: 古文書集成 23:居昌 草溪鄭氏篇
ValueCountFrequency (%)
1393
 
3.4%
2 316
 
0.8%
1 296
 
0.7%
of 261
 
0.6%
서울 214
 
0.5%
3 197
 
0.5%
the 195
 
0.5%
서울의 157
 
0.4%
국역 151
 
0.4%
4 128
 
0.3%
Other values (17742) 37740
91.9%

Most occurring characters

ValueCountFrequency (%)
32918
 
15.0%
1 7561
 
3.4%
0 4331
 
2.0%
2 4129
 
1.9%
9 3856
 
1.8%
( 3392
 
1.5%
) 3372
 
1.5%
2703
 
1.2%
2366
 
1.1%
3 2324
 
1.1%
Other values (2462) 152905
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 118303
53.8%
Space Separator 32918
 
15.0%
Decimal Number 31095
 
14.1%
Lowercase Letter 15255
 
6.9%
Other Punctuation 7866
 
3.6%
Open Punctuation 3750
 
1.7%
Close Punctuation 3726
 
1.7%
Uppercase Letter 2397
 
1.1%
Modifier Symbol 1894
 
0.9%
Math Symbol 1420
 
0.6%
Other values (5) 1233
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2703
 
2.3%
2366
 
2.0%
2286
 
1.9%
1758
 
1.5%
1412
 
1.2%
1401
 
1.2%
1359
 
1.1%
1261
 
1.1%
1246
 
1.1%
1193
 
1.0%
Other values (2327) 101318
85.6%
Uppercase Letter
ValueCountFrequency (%)
S 285
 
11.9%
K 177
 
7.4%
T 167
 
7.0%
A 143
 
6.0%
C 123
 
5.1%
E 122
 
5.1%
I 121
 
5.0%
N 114
 
4.8%
H 112
 
4.7%
J 108
 
4.5%
Other values (35) 925
38.6%
Lowercase Letter
ValueCountFrequency (%)
e 1736
11.4%
o 1623
10.6%
a 1329
 
8.7%
n 1301
 
8.5%
i 1158
 
7.6%
r 1094
 
7.2%
t 1066
 
7.0%
s 910
 
6.0%
l 740
 
4.9%
u 647
 
4.2%
Other values (29) 3651
23.9%
Decimal Number
ValueCountFrequency (%)
1 7561
24.3%
0 4331
13.9%
2 4129
13.3%
9 3856
12.4%
3 2324
 
7.5%
4 1955
 
6.3%
8 1871
 
6.0%
6 1717
 
5.5%
7 1678
 
5.4%
5 1673
 
5.4%
Other Punctuation
ValueCountFrequency (%)
/ 2310
29.4%
: 1745
22.2%
; 1728
22.0%
. 1441
18.3%
? 416
 
5.3%
' 118
 
1.5%
& 45
 
0.6%
# 34
 
0.4%
! 27
 
0.3%
2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
42
38.2%
33
30.0%
16
 
14.5%
8
 
7.3%
6
 
5.5%
2
 
1.8%
1
 
0.9%
1
 
0.9%
1
 
0.9%
Math Symbol
ValueCountFrequency (%)
~ 901
63.5%
= 455
32.0%
> 30
 
2.1%
< 29
 
2.0%
+ 3
 
0.2%
2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 3392
90.5%
[ 333
 
8.9%
18
 
0.5%
6
 
0.2%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 3372
90.5%
] 327
 
8.8%
20
 
0.5%
6
 
0.2%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
32918
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 1894
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1115
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 83792
38.1%
Hangul 82552
37.5%
Han 35529
16.2%
Latin 17653
 
8.0%
Hiragana 134
 
0.1%
Cyrillic 109
 
< 0.1%
Katakana 88
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
932
 
2.6%
912
 
2.6%
874
 
2.5%
838
 
2.4%
779
 
2.2%
709
 
2.0%
676
 
1.9%
645
 
1.8%
552
 
1.6%
507
 
1.4%
Other values (1415) 28105
79.1%
Hangul
ValueCountFrequency (%)
2703
 
3.3%
2366
 
2.9%
2286
 
2.8%
1758
 
2.1%
1412
 
1.7%
1401
 
1.7%
1359
 
1.6%
1261
 
1.5%
1246
 
1.5%
1193
 
1.4%
Other values (844) 65567
79.4%
Latin
ValueCountFrequency (%)
e 1736
 
9.8%
o 1623
 
9.2%
a 1329
 
7.5%
n 1301
 
7.4%
i 1158
 
6.6%
r 1094
 
6.2%
t 1066
 
6.0%
s 910
 
5.2%
l 740
 
4.2%
u 647
 
3.7%
Other values (51) 6049
34.3%
Common
ValueCountFrequency (%)
32918
39.3%
1 7561
 
9.0%
0 4331
 
5.2%
2 4129
 
4.9%
9 3856
 
4.6%
( 3392
 
4.0%
) 3372
 
4.0%
3 2324
 
2.8%
/ 2310
 
2.8%
4 1955
 
2.3%
Other values (32) 17644
21.1%
Katakana
ValueCountFrequency (%)
14
 
15.9%
9
 
10.2%
7
 
8.0%
5
 
5.7%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
2
 
2.3%
Other values (27) 36
40.9%
Cyrillic
ValueCountFrequency (%)
Р 15
13.8%
А 11
 
10.1%
О 9
 
8.3%
Г 7
 
6.4%
Ф 7
 
6.4%
И 6
 
5.5%
С 5
 
4.6%
К 5
 
4.6%
и 4
 
3.7%
с 4
 
3.7%
Other values (22) 36
33.0%
Hiragana
ValueCountFrequency (%)
69
51.5%
36
26.9%
4
 
3.0%
4
 
3.0%
3
 
2.2%
2
 
1.5%
2
 
1.5%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Other values (11) 11
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 101273
46.1%
Hangul 82431
37.5%
CJK 34865
 
15.9%
CJK Compat Ideographs 664
 
0.3%
Hiragana 134
 
0.1%
Compat Jamo 121
 
0.1%
Number Forms 110
 
0.1%
Cyrillic 109
 
< 0.1%
Katakana 88
 
< 0.1%
None 56
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32918
32.5%
1 7561
 
7.5%
0 4331
 
4.3%
2 4129
 
4.1%
9 3856
 
3.8%
( 3392
 
3.3%
) 3372
 
3.3%
3 2324
 
2.3%
/ 2310
 
2.3%
4 1955
 
1.9%
Other values (74) 35125
34.7%
Hangul
ValueCountFrequency (%)
2703
 
3.3%
2366
 
2.9%
2286
 
2.8%
1758
 
2.1%
1412
 
1.7%
1401
 
1.7%
1359
 
1.6%
1261
 
1.5%
1246
 
1.5%
1193
 
1.4%
Other values (829) 65446
79.4%
CJK
ValueCountFrequency (%)
932
 
2.7%
912
 
2.6%
874
 
2.5%
838
 
2.4%
779
 
2.2%
709
 
2.0%
676
 
1.9%
645
 
1.8%
552
 
1.6%
507
 
1.5%
Other values (1342) 27441
78.7%
CJK Compat Ideographs
ValueCountFrequency (%)
150
22.6%
105
15.8%
81
12.2%
31
 
4.7%
30
 
4.5%
28
 
4.2%
19
 
2.9%
12
 
1.8%
9
 
1.4%
9
 
1.4%
Other values (63) 190
28.6%
Hiragana
ValueCountFrequency (%)
69
51.5%
36
26.9%
4
 
3.0%
4
 
3.0%
3
 
2.2%
2
 
1.5%
2
 
1.5%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Other values (11) 11
 
8.2%
Compat Jamo
ValueCountFrequency (%)
52
43.0%
12
 
9.9%
12
 
9.9%
9
 
7.4%
7
 
5.8%
5
 
4.1%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
Other values (5) 6
 
5.0%
Number Forms
ValueCountFrequency (%)
42
38.2%
33
30.0%
16
 
14.5%
8
 
7.3%
6
 
5.5%
2
 
1.8%
1
 
0.9%
1
 
0.9%
1
 
0.9%
None
ValueCountFrequency (%)
20
35.7%
18
32.1%
6
 
10.7%
6
 
10.7%
2
 
3.6%
2
 
3.6%
1
 
1.8%
1
 
1.8%
Cyrillic
ValueCountFrequency (%)
Р 15
13.8%
А 11
 
10.1%
О 9
 
8.3%
Г 7
 
6.4%
Ф 7
 
6.4%
И 6
 
5.5%
С 5
 
4.6%
К 5
 
4.6%
и 4
 
3.7%
с 4
 
3.7%
Other values (22) 36
33.0%
Katakana
ValueCountFrequency (%)
14
 
15.9%
9
 
10.2%
7
 
8.0%
5
 
5.7%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
2
 
2.3%
Other values (27) 36
40.9%
Geometric Shapes
ValueCountFrequency (%)
4
100.0%
Punctuation
ValueCountFrequency (%)
2
100.0%

저자
Text

MISSING 

Distinct: 4355
Distinct (%): 46.2%
Missing: 574
Missing (%): 5.7%
Memory size: 156.2 KiB

Length

Max length: 99
Median length: 66
Mean length: 11.488861
Min length: 2

Characters and Unicode

Total characters: 108294
Distinct characters: 1568
Distinct categories: 11
Distinct scripts: 6
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3434
Unique (%): 36.4%

Sample

1st row: 하상복 지음
2nd row: 韓國精神文化硏究院
3rd row: 정약용(丁若鏞)
4th row: 서울大學校 國史學科
5th row: 韓國精神文化硏究院
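
For a column with missing values such as 저자, the distinct, unique, and mean-length figures above appear to be computed over the non-missing rows only. That reading is an assumption, but the sketch below shows that it reproduces the reported 46.2%, 36.4%, and 11.488861 from the counts in this section. Dividing by all 10000 rows would give different numbers.

```python
# Counts copied from the 저자 section of this report.
n_rows = 10_000
missing = 574
distinct = 4355
unique = 3434
total_chars = 108_294

non_missing = n_rows - missing              # rows that actually hold a value
distinct_pct = 100 * distinct / non_missing
unique_pct = 100 * unique / non_missing
mean_length = total_chars / non_missing

print(non_missing)
print(f"{distinct_pct:.1f}%")
print(f"{unique_pct:.1f}%")
print(f"{mean_length:.6f}")
```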
ValueCountFrequency (%)
1310
 
6.0%
지음 1027
 
4.7%
서울특별시 692
 
3.2%
594
 
2.7%
447
 
2.1%
옮김 355
 
1.6%
서울特別市 306
 
1.4%
민족문화추진회 302
 
1.4%
229
 
1.1%
國史編纂委員會 227
 
1.0%
Other values (6219) 16242
74.7%

Most occurring characters

ValueCountFrequency (%)
13371
 
12.3%
; 2369
 
2.2%
2069
 
1.9%
2001
 
1.8%
1887
 
1.7%
1740
 
1.6%
1616
 
1.5%
1416
 
1.3%
1323
 
1.2%
1309
 
1.2%
Other values (1558) 79193
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85734
79.2%
Space Separator 13371
 
12.3%
Other Punctuation 2844
 
2.6%
Lowercase Letter 1820
 
1.7%
Open Punctuation 1562
 
1.4%
Close Punctuation 1562
 
1.4%
Uppercase Letter 1114
 
1.0%
Decimal Number 242
 
0.2%
Dash Punctuation 43
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2069
 
2.4%
2001
 
2.3%
1887
 
2.2%
1740
 
2.0%
1616
 
1.9%
1416
 
1.7%
1323
 
1.5%
1309
 
1.5%
1239
 
1.4%
1214
 
1.4%
Other values (1482) 69920
81.6%
Uppercase Letter
ValueCountFrequency (%)
S 199
17.9%
B 165
14.8%
K 163
14.6%
A 68
 
6.1%
C 57
 
5.1%
T 51
 
4.6%
R 48
 
4.3%
M 47
 
4.2%
E 46
 
4.1%
O 41
 
3.7%
Other values (15) 229
20.6%
Lowercase Letter
ValueCountFrequency (%)
e 259
14.2%
o 175
9.6%
n 172
9.5%
i 163
9.0%
a 158
 
8.7%
t 144
 
7.9%
r 128
 
7.0%
d 75
 
4.1%
u 72
 
4.0%
s 72
 
4.0%
Other values (14) 402
22.1%
Decimal Number
ValueCountFrequency (%)
0 63
26.0%
1 34
14.0%
2 30
12.4%
5 25
 
10.3%
3 20
 
8.3%
6 17
 
7.0%
8 17
 
7.0%
7 16
 
6.6%
4 13
 
5.4%
9 7
 
2.9%
Other Punctuation
ValueCountFrequency (%)
; 2369
83.3%
. 211
 
7.4%
: 140
 
4.9%
? 83
 
2.9%
& 17
 
0.6%
# 16
 
0.6%
' 6
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 1223
78.3%
( 339
 
21.7%
Close Punctuation
ValueCountFrequency (%)
] 1222
78.2%
) 340
 
21.8%
Space Separator
ValueCountFrequency (%)
13371
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%
Control
ValueCountFrequency (%)
 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 61729
57.0%
Han 23913
 
22.1%
Common 19625
 
18.1%
Latin 2934
 
2.7%
Katakana 75
 
0.1%
Hiragana 18
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
1157
 
4.8%
1035
 
4.3%
960
 
4.0%
959
 
4.0%
760
 
3.2%
759
 
3.2%
705
 
2.9%
701
 
2.9%
638
 
2.7%
579
 
2.4%
Other values (884) 15660
65.5%
Hangul
ValueCountFrequency (%)
2069
 
3.4%
2001
 
3.2%
1887
 
3.1%
1740
 
2.8%
1616
 
2.6%
1416
 
2.3%
1323
 
2.1%
1309
 
2.1%
1239
 
2.0%
1214
 
2.0%
Other values (573) 45915
74.4%
Latin
ValueCountFrequency (%)
e 259
 
8.8%
S 199
 
6.8%
o 175
 
6.0%
n 172
 
5.9%
B 165
 
5.6%
K 163
 
5.6%
i 163
 
5.6%
a 158
 
5.4%
t 144
 
4.9%
r 128
 
4.4%
Other values (39) 1208
41.2%
Common
ValueCountFrequency (%)
13371
68.1%
; 2369
 
12.1%
[ 1223
 
6.2%
] 1222
 
6.2%
) 340
 
1.7%
( 339
 
1.7%
. 211
 
1.1%
: 140
 
0.7%
? 83
 
0.4%
0 63
 
0.3%
Other values (16) 264
 
1.3%
Katakana
ValueCountFrequency (%)
20
26.7%
20
26.7%
20
26.7%
4
 
5.3%
3
 
4.0%
2
 
2.7%
2
 
2.7%
1
 
1.3%
1
 
1.3%
1
 
1.3%
Hiragana
ValueCountFrequency (%)
4
22.2%
4
22.2%
4
22.2%
3
16.7%
3
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 61701
57.0%
CJK 23655
 
21.8%
ASCII 22557
 
20.8%
CJK Compat Ideographs 258
 
0.2%
Katakana 75
 
0.1%
Compat Jamo 27
 
< 0.1%
Hiragana 18
 
< 0.1%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13371
59.3%
; 2369
 
10.5%
[ 1223
 
5.4%
] 1222
 
5.4%
) 340
 
1.5%
( 339
 
1.5%
e 259
 
1.1%
. 211
 
0.9%
S 199
 
0.9%
o 175
 
0.8%
Other values (63) 2849
 
12.6%
Hangul
ValueCountFrequency (%)
2069
 
3.4%
2001
 
3.2%
1887
 
3.1%
1740
 
2.8%
1616
 
2.6%
1416
 
2.3%
1323
 
2.1%
1309
 
2.1%
1239
 
2.0%
1214
 
2.0%
Other values (571) 45887
74.4%
CJK
ValueCountFrequency (%)
1157
 
4.9%
1035
 
4.4%
960
 
4.1%
959
 
4.1%
760
 
3.2%
759
 
3.2%
705
 
3.0%
701
 
3.0%
638
 
2.7%
579
 
2.4%
Other values (845) 15402
65.1%
CJK Compat Ideographs
ValueCountFrequency (%)
100
38.8%
23
 
8.9%
13
 
5.0%
13
 
5.0%
11
 
4.3%
8
 
3.1%
8
 
3.1%
7
 
2.7%
6
 
2.3%
6
 
2.3%
Other values (29) 63
24.4%
Compat Jamo
ValueCountFrequency (%)
27
100.0%
Katakana
ValueCountFrequency (%)
20
26.7%
20
26.7%
20
26.7%
4
 
5.3%
3
 
4.0%
2
 
2.7%
2
 
2.7%
1
 
1.3%
1
 
1.3%
1
 
1.3%
Hiragana
ValueCountFrequency (%)
4
22.2%
4
22.2%
4
22.2%
3
16.7%
3
16.7%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

출판사
Text

Distinct: 2128
Distinct (%): 21.5%
Missing: 99
Missing (%): 1.0%
Memory size: 156.2 KiB

Length

Max length: 66
Median length: 47
Mean length: 6.8715281
Min length: 1

Characters and Unicode

Total characters: 68035
Distinct characters: 1003
Distinct categories: 10
Distinct scripts: 7
Distinct blocks: 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1259
Unique (%): 12.7%

Sample

1st row: 모티브북
2nd row: 한국정신문화연구원
3rd row: 아름출판사
4th row: 서울大學校 國史學科
5th row: 한국정신문화연구원
ValueCountFrequency (%)
서울특별시 1075
 
9.3%
민족문화추진회 503
 
4.3%
국사편찬위원회 350
 
3.0%
세종대왕기념사업회 262
 
2.3%
경인문화사 173
 
1.5%
서울역사편찬원 122
 
1.0%
한국고전번역원 114
 
1.0%
홍보담당관 106
 
0.9%
서울특별시사편찬위원회 99
 
0.9%
민속원 95
 
0.8%
Other values (2239) 8721
75.1%

Most occurring characters

ValueCountFrequency (%)
3253
 
4.8%
2171
 
3.2%
2011
 
3.0%
1989
 
2.9%
1838
 
2.7%
1762
 
2.6%
1723
 
2.5%
1652
 
2.4%
1609
 
2.4%
1475
 
2.2%
Other values (993) 48552
71.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63885
93.9%
Space Separator 1723
 
2.5%
Lowercase Letter 1121
 
1.6%
Uppercase Letter 733
 
1.1%
Other Punctuation 263
 
0.4%
Decimal Number 131
 
0.2%
Open Punctuation 86
 
0.1%
Close Punctuation 85
 
0.1%
Dash Punctuation 6
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3253
 
5.1%
2171
 
3.4%
2011
 
3.1%
1989
 
3.1%
1838
 
2.9%
1762
 
2.8%
1652
 
2.6%
1609
 
2.5%
1475
 
2.3%
1233
 
1.9%
Other values (925) 44892
70.3%
Uppercase Letter
ValueCountFrequency (%)
S 175
23.9%
B 173
23.6%
K 152
20.7%
M 56
 
7.6%
C 26
 
3.5%
T 21
 
2.9%
G 15
 
2.0%
E 13
 
1.8%
U 12
 
1.6%
V 11
 
1.5%
Other values (13) 79
10.8%
Lowercase Letter
ValueCountFrequency (%)
e 140
12.5%
o 136
12.1%
i 121
10.8%
a 99
8.8%
t 95
8.5%
n 87
7.8%
r 77
 
6.9%
s 69
 
6.2%
d 48
 
4.3%
u 44
 
3.9%
Other values (12) 205
18.3%
Decimal Number
ValueCountFrequency (%)
1 29
22.1%
0 27
20.6%
5 21
16.0%
8 18
13.7%
2 16
12.2%
6 8
 
6.1%
3 3
 
2.3%
9 3
 
2.3%
4 3
 
2.3%
7 3
 
2.3%
Other Punctuation
ValueCountFrequency (%)
. 87
33.1%
; 78
29.7%
: 44
16.7%
? 41
15.6%
& 9
 
3.4%
# 4
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 47
54.7%
[ 39
45.3%
Close Punctuation
ValueCountFrequency (%)
) 46
54.1%
] 39
45.9%
Space Separator
ValueCountFrequency (%)
1723
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53395
78.5%
Han 10427
 
15.3%
Common 2294
 
3.4%
Latin 1853
 
2.7%
Katakana 40
 
0.1%
Hiragana 25
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3253
 
6.1%
2171
 
4.1%
2011
 
3.8%
1989
 
3.7%
1838
 
3.4%
1762
 
3.3%
1652
 
3.1%
1609
 
3.0%
1475
 
2.8%
1233
 
2.3%
Other values (451) 34402
64.4%
Han
ValueCountFrequency (%)
627
 
6.0%
492
 
4.7%
471
 
4.5%
461
 
4.4%
435
 
4.2%
389
 
3.7%
366
 
3.5%
365
 
3.5%
336
 
3.2%
319
 
3.1%
Other values (442) 6166
59.1%
Latin
ValueCountFrequency (%)
S 175
 
9.4%
B 173
 
9.3%
K 152
 
8.2%
e 140
 
7.6%
o 136
 
7.3%
i 121
 
6.5%
a 99
 
5.3%
t 95
 
5.1%
n 87
 
4.7%
r 77
 
4.2%
Other values (34) 598
32.3%
Common
ValueCountFrequency (%)
1723
75.1%
. 87
 
3.8%
; 78
 
3.4%
( 47
 
2.0%
) 46
 
2.0%
: 44
 
1.9%
? 41
 
1.8%
] 39
 
1.7%
[ 39
 
1.7%
1 29
 
1.3%
Other values (12) 121
 
5.3%
Katakana
ValueCountFrequency (%)
6
15.0%
5
12.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (9) 11
27.5%
Hiragana
ValueCountFrequency (%)
8
32.0%
8
32.0%
8
32.0%
1
 
4.0%
Cyrillic
ValueCountFrequency (%)
Щ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53377
78.5%
CJK 10343
 
15.2%
ASCII 4147
 
6.1%
CJK Compat Ideographs 84
 
0.1%
Katakana 40
 
0.1%
Hiragana 25
 
< 0.1%
Compat Jamo 16
 
< 0.1%
None 2
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3253
 
6.1%
2171
 
4.1%
2011
 
3.8%
1989
 
3.7%
1838
 
3.4%
1762
 
3.3%
1652
 
3.1%
1609
 
3.0%
1475
 
2.8%
1233
 
2.3%
Other values (449) 34384
64.4%
ASCII
ValueCountFrequency (%)
1723
41.5%
S 175
 
4.2%
B 173
 
4.2%
K 152
 
3.7%
e 140
 
3.4%
o 136
 
3.3%
i 121
 
2.9%
a 99
 
2.4%
t 95
 
2.3%
. 87
 
2.1%
Other values (56) 1246
30.0%
CJK
ValueCountFrequency (%)
627
 
6.1%
492
 
4.8%
471
 
4.6%
461
 
4.5%
435
 
4.2%
389
 
3.8%
366
 
3.5%
365
 
3.5%
336
 
3.2%
319
 
3.1%
Other values (426) 6082
58.8%
CJK Compat Ideographs
ValueCountFrequency (%)
35
41.7%
11
 
13.1%
10
 
11.9%
5
 
6.0%
5
 
6.0%
4
 
4.8%
3
 
3.6%
2
 
2.4%
2
 
2.4%
1
 
1.2%
Other values (6) 6
 
7.1%
Compat Jamo
ValueCountFrequency (%)
16
100.0%
Hiragana
ValueCountFrequency (%)
8
32.0%
8
32.0%
8
32.0%
1
 
4.0%
Katakana
ValueCountFrequency (%)
6
15.0%
5
12.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (9) 11
27.5%
None
ValueCountFrequency (%)
2
100.0%
Cyrillic
ValueCountFrequency (%)
Щ 1
100.0%

출판일
Text

Distinct: 493
Distinct (%): 5.0%
Missing: 77
Missing (%): 0.8%
Memory size: 156.2 KiB

Length

Max length: 19
Median length: 4
Mean length: 4.551043
Min length: 1

Characters and Unicode

Total characters: 45160
Distinct characters: 173
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 303
Unique (%): 3.1%

Sample

1st row: 2014
2nd row: 1981
3rd row: 1995
4th row: 1995
5th row: 1995
ValueCountFrequency (%)
2010 299
 
2.8%
1995 288
 
2.7%
2017 279
 
2.6%
2016 271
 
2.6%
2009 270
 
2.5%
1994 270
 
2.5%
2005 261
 
2.5%
1993 261
 
2.5%
2007 254
 
2.4%
2006 254
 
2.4%
Other values (412) 7908
74.5%

Most occurring characters

ValueCountFrequency (%)
0 9009
19.9%
9 8519
18.9%
1 8375
18.5%
2 6133
13.6%
8 2390
 
5.3%
7 1807
 
4.0%
6 1342
 
3.0%
5 1158
 
2.6%
3 1063
 
2.4%
4 1059
 
2.3%
Other values (163) 4305
9.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40855
90.5%
Other Letter 2058
 
4.6%
Space Separator 823
 
1.8%
Other Punctuation 472
 
1.0%
Dash Punctuation 393
 
0.9%
Open Punctuation 160
 
0.4%
Close Punctuation 143
 
0.3%
Uppercase Letter 125
 
0.3%
Lowercase Letter 81
 
0.2%
Math Symbol 50
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
140
 
6.8%
138
 
6.7%
117
 
5.7%
111
 
5.4%
110
 
5.3%
106
 
5.2%
97
 
4.7%
97
 
4.7%
73
 
3.5%
65
 
3.2%
Other values (131) 1004
48.8%
Decimal Number
ValueCountFrequency (%)
0 9009
22.1%
9 8519
20.9%
1 8375
20.5%
2 6133
15.0%
8 2390
 
5.8%
7 1807
 
4.4%
6 1342
 
3.3%
5 1158
 
2.8%
3 1063
 
2.6%
4 1059
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
r 16
19.8%
e 16
19.8%
a 16
19.8%
s 16
19.8%
t 16
19.8%
m 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 237
50.2%
: 139
29.4%
; 44
 
9.3%
? 27
 
5.7%
/ 25
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
D 65
52.0%
C 25
 
20.0%
V 20
 
16.0%
M 15
 
12.0%
Close Punctuation
ValueCountFrequency (%)
] 118
82.5%
) 25
 
17.5%
Open Punctuation
ValueCountFrequency (%)
[ 118
73.8%
( 42
 
26.2%
Space Separator
ValueCountFrequency (%)
823
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 393
100.0%
Math Symbol
ValueCountFrequency (%)
~ 50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 42896
95.0%
Hangul 1454
 
3.2%
Han 604
 
1.3%
Latin 206
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
140
 
9.6%
138
 
9.5%
117
 
8.0%
111
 
7.6%
110
 
7.6%
97
 
6.7%
97
 
6.7%
65
 
4.5%
58
 
4.0%
40
 
2.8%
Other values (82) 481
33.1%
Han
ValueCountFrequency (%)
106
17.5%
73
12.1%
64
10.6%
64
10.6%
42
 
7.0%
42
 
7.0%
36
 
6.0%
36
 
6.0%
26
 
4.3%
26
 
4.3%
Other values (39) 89
14.7%
Common
ValueCountFrequency (%)
0 9009
21.0%
9 8519
19.9%
1 8375
19.5%
2 6133
14.3%
8 2390
 
5.6%
7 1807
 
4.2%
6 1342
 
3.1%
5 1158
 
2.7%
3 1063
 
2.5%
4 1059
 
2.5%
Other values (12) 2041
 
4.8%
Latin
ValueCountFrequency (%)
D 65
31.6%
C 25
 
12.1%
V 20
 
9.7%
r 16
 
7.8%
e 16
 
7.8%
a 16
 
7.8%
s 16
 
7.8%
t 16
 
7.8%
M 15
 
7.3%
m 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43102
95.4%
Hangul 1454
 
3.2%
CJK 603
 
1.3%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9009
20.9%
9 8519
19.8%
1 8375
19.4%
2 6133
14.2%
8 2390
 
5.5%
7 1807
 
4.2%
6 1342
 
3.1%
5 1158
 
2.7%
3 1063
 
2.5%
4 1059
 
2.5%
Other values (22) 2247
 
5.2%
Hangul
ValueCountFrequency (%)
140
 
9.6%
138
 
9.5%
117
 
8.0%
111
 
7.6%
110
 
7.6%
97
 
6.7%
97
 
6.7%
65
 
4.5%
58
 
4.0%
40
 
2.8%
Other values (82) 481
33.1%
CJK
ValueCountFrequency (%)
106
17.6%
73
12.1%
64
10.6%
64
10.6%
42
 
7.0%
42
 
7.0%
36
 
6.0%
36
 
6.0%
26
 
4.3%
26
 
4.3%
Other values (38) 88
14.6%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

배가위치코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
1 | 9865
<NA> | 135

Length

Max length: 4
Median length: 1
Mean length: 1.0405
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 9865 | 98.7%
<NA> | 135 | 1.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 9865 | 98.7%
na | 135 | 1.4%

배가위치명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
서울역사자료실 | 9865
<NA> | 135

Length

Max length: 7
Median length: 7
Mean length: 6.9595
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울역사자료실
2nd row: 서울역사자료실
3rd row: 서울역사자료실
4th row: 서울역사자료실
5th row: 서울역사자료실

Common Values

Value | Count | Frequency (%)
서울역사자료실 | 9865 | 98.7%
<NA> | 135 | 1.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
서울역사자료실 | 9865 | 98.7%
na | 135 | 1.4%

언어명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
한국어 | 8555
<NA> | 1257
일본어 | 120
영어 | 41
중국어 | 24

Length

Max length: 4
Median length: 3
Mean length: 3.1219
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한국어
2nd row: 한국어
3rd row: 한국어
4th row: 한국어
5th row: 한국어

Common Values

Value | Count | Frequency (%)
한국어 | 8555 | 85.5%
<NA> | 1257 | 12.6%
일본어 | 120 | 1.2%
영어 | 41 | 0.4%
중국어 | 24 | 0.2%
러시아어 | 3 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
한국어 | 8555 | 85.5%
na | 1257 | 12.6%
일본어 | 120 | 1.2%
영어 | 41 | 0.4%
중국어 | 24 | 0.2%
러시아어 | 3 | < 0.1%

Interactions

[Four interaction plots; images not preserved in this text export.]

Correlations

(variable) | 자료번호 | 서지번호 | 언어명
자료번호 | 1.000 | 0.994 | 0.169
서지번호 | 0.994 | 1.000 | 0.165
언어명 | 0.169 | 0.165 | 1.000

(variable) | 배가위치코드 | 언어명 | 배가위치명
배가위치코드 | 1.000 | 1.000 | 1.000
언어명 | 1.000 | 1.000 | 1.000
배가위치명 | 1.000 | 1.000 | 1.000

(variable) | 자료번호 | 서지번호 | 배가위치코드 | 배가위치명 | 언어명
자료번호 | 1.000 | 0.977 | 1.000 | 1.000 | 0.098
서지번호 | 0.977 | 1.000 | 1.000 | 1.000 | 0.096
배가위치코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
배가위치명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
언어명 | 0.098 | 0.096 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
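
One common way to compute a nullity correlation is the Pearson correlation between the "is present" indicators of two columns. The sketch below implements that with the standard library only. The function name and the toy columns are hypothetical and are not taken from the dataset or from the profiler's code.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a column that is never (or always) missing
    return cov / math.sqrt(var_a * var_b)

# Hypothetical toy columns; None marks a missing value.
author = ["A", None, "C", None, "E", "F"]
publisher = ["P", None, "R", None, "T", "U"]
year = [None, 1995, None, 2003, None, None]

same = nullity_correlation(author, publisher)   # missing in the same rows
opposite = nullity_correlation(author, year)    # missing in opposite rows
print(same, opposite)
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one is present exactly when the other is absent, and a value near 0 means their missingness is unrelated.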

Sample

(index) | 자료번호 | 청구기호 | 서지번호 | 서명 | 저자 | 출판사 | 출판일 | 배가위치코드 | 배가위치명 | 언어명
28739 | 48924 | 340.9 하195ㅈ | 45868 | 죽은 자의 정치학 : 프랑스?미국?한국 국립묘지의 탄생과 진화 | 하상복 지음 | 모티브북 | 2014 | 1 | 서울역사자료실 | 한국어
4566 | 30717 | 818.08 한257한 v.8-4 | 29494 | 韓國口碑文學大系 8-4:慶尙南道 晋州市 晋陽郡篇(2) | 韓國精神文化硏究院 | 한국정신문화연구원 | 1981 | 1 | 서울역사자료실 | 한국어
5840 | 30373 | 810.81 정428여 v.17 | 29150 | 與猶堂全集 17:政法集 第23~29卷 | 정약용(丁若鏞) | 아름출판사 | 1995 | 1 | 서울역사자료실 | 한국어
17609 | 37354 | P 911.005 서272 v.35 | 36131 | 韓國史論 35 | 서울大學校 國史學科 | 서울大學校 國史學科 | 1995 | 1 | 서울역사자료실 | 한국어
9528 | 33917 | 911.0091 한257고 v.23 | 32694 | 古文書集成 23:居昌 草溪鄭氏篇 | 韓國精神文化硏究院 | 한국정신문화연구원 | 1995 | 1 | 서울역사자료실 | 한국어
15213 | 39638 | 911.6 홍635ㅇ v.321 | 38415 | 안녕하세요 서울입니다 v.321 | 서울특별시 홍보담당관 | 서울특별시 홍보담당관 | 2002.: 유성천연색 - | 1 | 서울역사자료실 | 한국어
12154 | 31092 | 911.06302 국513 v.28 | 29869 | 韓民族獨立運動史資料集 28 ;의열투쟁 1 | 국사편찬위원회 | 국사편찬위원회 | 1996 | 1 | 서울역사자료실 | 한국어
28210 | 51753 | 070.434 신682ㅌ | 48654 | 특종 1987 :박종철과 한국 민주화 | 신성호 지음 | 중앙books :중앙일보플러스 | 2017 | 1 | 서울역사자료실 | <NA>
18405 | 45001 | 911.78 대51ㅁ C.2 | 42009 | 沔川邑城 精密地表調査報告書 | 大田産業大學校 鄕土文化硏究所 편 | 唐津郡 | 1999 | 1 | 서울역사자료실 | 한국어
14884 | 39040 | AV(CD) 359.05 서272회 v.6 C.3 | 37817 | 서울특별시의회 회의록[컴퓨터파일] 제6대(2002.7.1~2004.11.19) | 서울特別市議會 | 서울特別市議會 | 2003 | 1 | 서울역사자료실 | 한국어
자료번호청구기호서지번호서명저자출판사출판일배가위치코드배가위치명언어명
1787343840679.1 박67ㄱ40854고려사 악지의 당악연구: 高麗史』樂志의 唐樂硏究 = (A)comparative study of Sa-ak in Goryeosa Akji and박은옥민속원2006<NA><NA>한국어
956734526P 906 역337 v.12133303歷史學報 第121輯<NA>歷史學會19891서울역사자료실한국어
95422201326.3352 한239서 v.120978서울市內버스 路線別 交通量 調査서울特別市 ;韓國科學技術硏究所서울특별시19751서울역사자료실한국어
725426201911.05 사174 v.11624978이조실록 116:중종공희대왕실록 (5년4월-5년12월)(평양)사회과학원 민족고전연구소 번역 사회과학원 민족고전연구소여강출판사19931서울역사자료실한국어
1911744516600.15 서66ㅅ41489서울시 문화지표 설정 및 측정 연구= (A)study on the development and measurement of cultural indicator서울시정개발연구원 [편]장영희 이기현 신경희 전기택서울시정연구원19961서울역사자료실한국어
512630203259.3 대51삼 c.228980삼일신고(譯解三一神誥)대종교 총본사대종교 총본사19491서울역사자료실한국어
3139754880911.1 안478ㅂ51790북한 민중사안문석 지음일조각20201서울역사자료실<NA>
2985653318334.4 서789ㄲ 250224끌려가다 버려지다 우리 앞에 서다 ;사진과 자료로 보는 일본군 '위안부' 피해 여성 이야기 ^2서울대 인권센터 정진성 연구팀 지음푸른역사20181서울역사자료실<NA>
2815052446600.15 강256문 201449350(2014년도) 문화재수리보고서 :도지정문화재江原道 [편]강원도20171서울역사자료실<NA>
2326448778시사 911.6 김721조45721조선왕릉의 멋 헌인릉김웅호시사편찬위원회<NA>1서울역사자료실한국어