Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells1407
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory712.9 KiB
Average record size in memory73.0 B

Variable types

Numeric1
Categorical2
Text5

Dataset

Description2014년도 광진정보도서관 신규도서 목록
Author광진구시설관리공단
URLhttps://www.data.go.kr/data/15044587/fileData.do

Alerts

관리구분 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
자료실명 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 관리구분 and 1 other fieldsHigh correlation
청구기호 has 347 (3.5%) missing valuesMissing
저작자 has 620 (6.2%) missing valuesMissing
발행년 has 440 (4.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:57:15.179399
Analysis finished2023-12-12 02:57:18.198356
Duration3.02 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6543.5812
Minimum3
Maximum13061
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T11:57:18.296178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile657.95
Q13286.5
median6561.5
Q39798.25
95-th percentile12401.1
Maximum13061
Range13058
Interquartile range (IQR)6511.75

Descriptive statistics

Standard deviation3765.271
Coefficient of variation (CV)0.57541443
Kurtosis-1.1965351
Mean6543.5812
Median Absolute Deviation (MAD)3255.5
Skewness-0.0074432135
Sum65435812
Variance14177266
MonotonicityNot monotonic
2023-12-12T11:57:18.514849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3952 1
 
< 0.1%
3930 1
 
< 0.1%
11995 1
 
< 0.1%
5894 1
 
< 0.1%
1926 1
 
< 0.1%
4201 1
 
< 0.1%
4182 1
 
< 0.1%
6555 1
 
< 0.1%
2717 1
 
< 0.1%
1356 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
13061 1
< 0.1%
13060 1
< 0.1%
13059 1
< 0.1%
13058 1
< 0.1%
13057 1
< 0.1%
13055 1
< 0.1%
13054 1
< 0.1%
13051 1
< 0.1%
13050 1
< 0.1%
13049 1
< 0.1%

관리구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
5680 
아동
3661 
서양-아동
599 
서양-일반
 
60

Length

Max length5
Median length2
Mean length2.1977
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row아동
2nd row일반
3rd row일반
4th row일반
5th row아동

Common Values

ValueCountFrequency (%)
일반 5680
56.8%
아동 3661
36.6%
서양-아동 599
 
6.0%
서양-일반 60
 
0.6%

Length

2023-12-12T11:57:18.667487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:57:18.834739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 5680
56.8%
아동 3661
36.6%
서양-아동 599
 
6.0%
서양-일반 60
 
0.6%

청구기호
Text

MISSING 

Distinct9628
Distinct (%)99.7%
Missing347
Missing (%)3.5%
Memory size156.2 KiB
2023-12-12T11:57:19.286283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length12.865016
Min length8

Characters and Unicode

Total characters124186
Distinct characters232
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9603 ?
Unique (%)99.5%

Sample

1st row813.8 ㄱ943ㅊ
2nd row223.51 ㄱ216ㄴ v.1
3rd row814.6 ㅇ663ㄸ
4th row334.225 ㅎ521ㅅ
5th row233 ㅇ631ㄴ
ValueCountFrequency (%)
808.9 889
 
3.9%
843 675
 
2.9%
v.2 363
 
1.6%
v.1 356
 
1.6%
813.6 316
 
1.4%
f86g 303
 
1.3%
813.8 299
 
1.3%
v.3 249
 
1.1%
408 206
 
0.9%
c.2 204
 
0.9%
Other values (7803) 19047
83.1%
2023-12-12T11:57:20.041932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15987
12.9%
8 12097
 
9.7%
. 10590
 
8.5%
1 9325
 
7.5%
3 8652
 
7.0%
2 7481
 
6.0%
4 7367
 
5.9%
9 7358
 
5.9%
6 6256
 
5.0%
5 5535
 
4.5%
Other values (222) 33538
27.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 73646
59.3%
Other Letter 18137
 
14.6%
Space Separator 15987
 
12.9%
Other Punctuation 10590
 
8.5%
Lowercase Letter 4745
 
3.8%
Uppercase Letter 649
 
0.5%
Dash Punctuation 432
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3887
21.4%
2305
12.7%
2289
12.6%
1243
 
6.9%
1216
 
6.7%
1212
 
6.7%
987
 
5.4%
833
 
4.6%
683
 
3.8%
665
 
3.7%
Other values (161) 2817
15.5%
Lowercase Letter
ValueCountFrequency (%)
v 3730
78.6%
c 340
 
7.2%
g 322
 
6.8%
m 33
 
0.7%
s 32
 
0.7%
i 27
 
0.6%
b 27
 
0.6%
a 26
 
0.5%
r 23
 
0.5%
p 21
 
0.4%
Other values (15) 164
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
F 315
48.5%
B 40
 
6.2%
G 32
 
4.9%
Y 27
 
4.2%
R 24
 
3.7%
S 22
 
3.4%
L 20
 
3.1%
C 20
 
3.1%
M 20
 
3.1%
H 19
 
2.9%
Other values (13) 110
 
16.9%
Decimal Number
ValueCountFrequency (%)
8 12097
16.4%
1 9325
12.7%
3 8652
11.7%
2 7481
10.2%
4 7367
10.0%
9 7358
10.0%
6 6256
8.5%
5 5535
7.5%
7 4857
6.6%
0 4718
 
6.4%
Space Separator
ValueCountFrequency (%)
15987
100.0%
Other Punctuation
ValueCountFrequency (%)
. 10590
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 432
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100655
81.1%
Hangul 18137
 
14.6%
Latin 5394
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3887
21.4%
2305
12.7%
2289
12.6%
1243
 
6.9%
1216
 
6.7%
1212
 
6.7%
987
 
5.4%
833
 
4.6%
683
 
3.8%
665
 
3.7%
Other values (161) 2817
15.5%
Latin
ValueCountFrequency (%)
v 3730
69.2%
c 340
 
6.3%
g 322
 
6.0%
F 315
 
5.8%
B 40
 
0.7%
m 33
 
0.6%
G 32
 
0.6%
s 32
 
0.6%
Y 27
 
0.5%
i 27
 
0.5%
Other values (38) 496
 
9.2%
Common
ValueCountFrequency (%)
15987
15.9%
8 12097
12.0%
. 10590
10.5%
1 9325
9.3%
3 8652
8.6%
2 7481
7.4%
4 7367
7.3%
9 7358
7.3%
6 6256
 
6.2%
5 5535
 
5.5%
Other values (3) 10007
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 106049
85.4%
Compat Jamo 17099
 
13.8%
Hangul 1038
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15987
15.1%
8 12097
11.4%
. 10590
10.0%
1 9325
8.8%
3 8652
8.2%
2 7481
7.1%
4 7367
6.9%
9 7358
6.9%
6 6256
 
5.9%
5 5535
 
5.2%
Other values (51) 15401
14.5%
Compat Jamo
ValueCountFrequency (%)
3887
22.7%
2305
13.5%
2289
13.4%
1243
 
7.3%
1216
 
7.1%
1212
 
7.1%
987
 
5.8%
833
 
4.9%
683
 
4.0%
665
 
3.9%
Other values (9) 1779
10.4%
Hangul
ValueCountFrequency (%)
275
26.5%
76
 
7.3%
38
 
3.7%
32
 
3.1%
32
 
3.1%
25
 
2.4%
24
 
2.3%
19
 
1.8%
19
 
1.8%
17
 
1.6%
Other values (142) 481
46.3%

자료실명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
광진정보도서관 종합자료실
5740 
광진정보도서관 어린이자료실
4260 

Length

Max length14
Median length13
Mean length13.426
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광진정보도서관 어린이자료실
2nd row광진정보도서관 종합자료실
3rd row광진정보도서관 종합자료실
4th row광진정보도서관 종합자료실
5th row광진정보도서관 어린이자료실

Common Values

ValueCountFrequency (%)
광진정보도서관 종합자료실 5740
57.4%
광진정보도서관 어린이자료실 4260
42.6%

Length

2023-12-12T11:57:20.237415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:57:20.369706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광진정보도서관 10000
50.0%
종합자료실 5740
28.7%
어린이자료실 4260
21.3%

서명
Text

Distinct9550
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T11:57:20.857241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length104
Mean length34.8884
Min length2

Characters and Unicode

Total characters348884
Distinct characters1588
Distinct categories15 ?
Distinct scripts6 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9311 ?
Unique (%)93.1%

Sample

1st row책 읽기 달인 /김진완 글 ;허구 그림
2nd row니까야 강독 /각묵스님 옮김·엮음 ;대림스님 옮김 .1-2
3rd row(손자 바보 이계진의) 똥꼬 할아버지와 장미꽃 손자 /이계진 지음 ;이두용 ;이경은 [공] 사진
4th row사춘기 내 몸 사용설명서 /안트예 헬름스 글 ;얀 폰 홀레벤 사진 ;박종대 옮김
5th row노아가 동물을 태워요 /윤아해 지음 ;이갑규 그림
ValueCountFrequency (%)
지음 4699
 
5.7%
옮김 2592
 
3.2%
그림 1992
 
2.4%
1911
 
2.3%
by 686
 
0.8%
글·그림 369
 
0.4%
이야기 329
 
0.4%
320
 
0.4%
the 258
 
0.3%
235
 
0.3%
Other values (30297) 68769
83.7%
2023-12-12T11:57:21.678049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
72563
 
20.8%
/ 8520
 
2.4%
7209
 
2.1%
; 7005
 
2.0%
6147
 
1.8%
5638
 
1.6%
4962
 
1.4%
e 3634
 
1.0%
3470
 
1.0%
3417
 
1.0%
Other values (1578) 226319
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 200156
57.4%
Space Separator 72563
 
20.8%
Lowercase Letter 32605
 
9.3%
Other Punctuation 23865
 
6.8%
Uppercase Letter 6759
 
1.9%
Decimal Number 5938
 
1.7%
Open Punctuation 2729
 
0.8%
Close Punctuation 2728
 
0.8%
Dash Punctuation 908
 
0.3%
Math Symbol 618
 
0.2%
Other values (5) 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7209
 
3.6%
6147
 
3.1%
5638
 
2.8%
4962
 
2.5%
3470
 
1.7%
3417
 
1.7%
2995
 
1.5%
2842
 
1.4%
2827
 
1.4%
2677
 
1.3%
Other values (1470) 157972
78.9%
Lowercase Letter
ValueCountFrequency (%)
e 3634
11.1%
a 2950
 
9.0%
t 2517
 
7.7%
r 2446
 
7.5%
i 2433
 
7.5%
o 2402
 
7.4%
n 2392
 
7.3%
s 1927
 
5.9%
l 1774
 
5.4%
y 1523
 
4.7%
Other values (17) 8607
26.4%
Uppercase Letter
ValueCountFrequency (%)
S 585
 
8.7%
D 475
 
7.0%
A 465
 
6.9%
E 408
 
6.0%
R 405
 
6.0%
T 401
 
5.9%
B 401
 
5.9%
M 361
 
5.3%
C 348
 
5.1%
H 295
 
4.4%
Other values (16) 2615
38.7%
Other Punctuation
ValueCountFrequency (%)
/ 8520
35.7%
; 7005
29.4%
: 3094
 
13.0%
. 2246
 
9.4%
, 1031
 
4.3%
· 668
 
2.8%
! 533
 
2.2%
? 444
 
1.9%
' 159
 
0.7%
& 70
 
0.3%
Other values (10) 95
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 1791
30.2%
2 911
15.3%
0 884
14.9%
3 562
 
9.5%
4 417
 
7.0%
5 403
 
6.8%
6 281
 
4.7%
8 238
 
4.0%
7 227
 
3.8%
9 224
 
3.8%
Math Symbol
ValueCountFrequency (%)
= 494
79.9%
+ 53
 
8.6%
~ 25
 
4.0%
< 21
 
3.4%
> 21
 
3.4%
2
 
0.3%
| 1
 
0.2%
× 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 1677
61.5%
( 1036
38.0%
12
 
0.4%
4
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 1676
61.4%
) 1036
38.0%
12
 
0.4%
4
 
0.1%
Letter Number
ValueCountFrequency (%)
6
75.0%
2
 
25.0%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
72563
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 908
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 199923
57.3%
Common 109356
31.3%
Latin 39371
 
11.3%
Han 228
 
0.1%
Katakana 5
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7209
 
3.6%
6147
 
3.1%
5638
 
2.8%
4962
 
2.5%
3470
 
1.7%
3417
 
1.7%
2995
 
1.5%
2842
 
1.4%
2827
 
1.4%
2677
 
1.3%
Other values (1348) 157739
78.9%
Han
ValueCountFrequency (%)
13
 
5.7%
10
 
4.4%
10
 
4.4%
9
 
3.9%
9
 
3.9%
8
 
3.5%
8
 
3.5%
7
 
3.1%
6
 
2.6%
4
 
1.8%
Other values (107) 144
63.2%
Latin
ValueCountFrequency (%)
e 3634
 
9.2%
a 2950
 
7.5%
t 2517
 
6.4%
r 2446
 
6.2%
i 2433
 
6.2%
o 2402
 
6.1%
n 2392
 
6.1%
s 1927
 
4.9%
l 1774
 
4.5%
y 1523
 
3.9%
Other values (44) 15373
39.0%
Common
ValueCountFrequency (%)
72563
66.4%
/ 8520
 
7.8%
; 7005
 
6.4%
: 3094
 
2.8%
. 2246
 
2.1%
1 1791
 
1.6%
[ 1677
 
1.5%
] 1676
 
1.5%
( 1036
 
0.9%
) 1036
 
0.9%
Other values (43) 8712
 
8.0%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 199905
57.3%
ASCII 147995
42.4%
None 709
 
0.2%
CJK 223
 
0.1%
Compat Jamo 18
 
< 0.1%
Punctuation 13
 
< 0.1%
Number Forms 8
 
< 0.1%
CJK Compat Ideographs 5
 
< 0.1%
Katakana 5
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
72563
49.0%
/ 8520
 
5.8%
; 7005
 
4.7%
e 3634
 
2.5%
: 3094
 
2.1%
a 2950
 
2.0%
t 2517
 
1.7%
r 2446
 
1.7%
i 2433
 
1.6%
o 2402
 
1.6%
Other values (79) 40431
27.3%
Hangul
ValueCountFrequency (%)
7209
 
3.6%
6147
 
3.1%
5638
 
2.8%
4962
 
2.5%
3470
 
1.7%
3417
 
1.7%
2995
 
1.5%
2842
 
1.4%
2827
 
1.4%
2677
 
1.3%
Other values (1343) 157721
78.9%
None
ValueCountFrequency (%)
· 668
94.2%
12
 
1.7%
12
 
1.7%
4
 
0.6%
4
 
0.6%
2
 
0.3%
2
 
0.3%
1
 
0.1%
1
 
0.1%
× 1
 
0.1%
Other values (2) 2
 
0.3%
CJK
ValueCountFrequency (%)
13
 
5.8%
10
 
4.5%
10
 
4.5%
9
 
4.0%
9
 
4.0%
8
 
3.6%
8
 
3.6%
7
 
3.1%
6
 
2.7%
4
 
1.8%
Other values (103) 139
62.3%
Punctuation
ValueCountFrequency (%)
10
76.9%
2
 
15.4%
1
 
7.7%
Compat Jamo
ValueCountFrequency (%)
9
50.0%
6
33.3%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Number Forms
ValueCountFrequency (%)
6
75.0%
2
 
25.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

저작자
Text

MISSING 

Distinct6438
Distinct (%)68.6%
Missing620
Missing (%)6.2%
Memory size156.2 KiB
2023-12-12T11:57:22.284374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length3
Mean length5.5683369
Min length1

Characters and Unicode

Total characters52231
Distinct characters836
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5197 ?
Unique (%)55.4%

Sample

1st row김진완
2nd row각묵
3rd row이계진
4th row헬름스, 안트예
5th row윤아해
ValueCountFrequency (%)
편집부 165
 
1.2%
이상교 87
 
0.6%
톨스토이 65
 
0.5%
기획팀 59
 
0.4%
melissa 56
 
0.4%
lagonegro 55
 
0.4%
etc 54
 
0.4%
한국헤밍웨이 51
 
0.4%
데이비드 46
 
0.3%
43
 
0.3%
Other values (7642) 13008
95.0%
2023-12-12T11:57:23.003568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4376
 
8.4%
, 2793
 
5.3%
1689
 
3.2%
1063
 
2.0%
1030
 
2.0%
e 777
 
1.5%
752
 
1.4%
a 729
 
1.4%
n 706
 
1.4%
o 613
 
1.2%
Other values (826) 37703
72.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35433
67.8%
Lowercase Letter 6575
 
12.6%
Space Separator 4376
 
8.4%
Other Punctuation 3174
 
6.1%
Uppercase Letter 2600
 
5.0%
Dash Punctuation 41
 
0.1%
Decimal Number 14
 
< 0.1%
Open Punctuation 6
 
< 0.1%
Close Punctuation 6
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1689
 
4.8%
1063
 
3.0%
1030
 
2.9%
752
 
2.1%
561
 
1.6%
415
 
1.2%
399
 
1.1%
396
 
1.1%
389
 
1.1%
380
 
1.1%
Other values (750) 28359
80.0%
Lowercase Letter
ValueCountFrequency (%)
e 777
11.8%
a 729
11.1%
n 706
10.7%
o 613
9.3%
r 550
8.4%
i 524
 
8.0%
s 419
 
6.4%
t 379
 
5.8%
l 355
 
5.4%
h 204
 
3.1%
Other values (17) 1319
20.1%
Uppercase Letter
ValueCountFrequency (%)
A 206
 
7.9%
L 203
 
7.8%
M 197
 
7.6%
R 195
 
7.5%
S 185
 
7.1%
E 155
 
6.0%
B 128
 
4.9%
N 126
 
4.8%
T 124
 
4.8%
D 123
 
4.7%
Other values (16) 958
36.8%
Decimal Number
ValueCountFrequency (%)
1 4
28.6%
2 3
21.4%
3 2
14.3%
0 1
 
7.1%
6 1
 
7.1%
9 1
 
7.1%
4 1
 
7.1%
8 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
, 2793
88.0%
. 338
 
10.6%
/ 24
 
0.8%
? 13
 
0.4%
' 4
 
0.1%
& 1
 
< 0.1%
· 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 4
66.7%
[ 2
33.3%
Close Punctuation
ValueCountFrequency (%)
) 4
66.7%
] 2
33.3%
Math Symbol
ValueCountFrequency (%)
> 3
50.0%
< 3
50.0%
Space Separator
ValueCountFrequency (%)
4376
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35428
67.8%
Latin 9175
 
17.6%
Common 7623
 
14.6%
Katakana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1689
 
4.8%
1063
 
3.0%
1030
 
2.9%
752
 
2.1%
561
 
1.6%
415
 
1.2%
399
 
1.1%
396
 
1.1%
389
 
1.1%
380
 
1.1%
Other values (745) 28354
80.0%
Latin
ValueCountFrequency (%)
e 777
 
8.5%
a 729
 
7.9%
n 706
 
7.7%
o 613
 
6.7%
r 550
 
6.0%
i 524
 
5.7%
s 419
 
4.6%
t 379
 
4.1%
l 355
 
3.9%
A 206
 
2.2%
Other values (43) 3917
42.7%
Common
ValueCountFrequency (%)
4376
57.4%
, 2793
36.6%
. 338
 
4.4%
- 41
 
0.5%
/ 24
 
0.3%
? 13
 
0.2%
( 4
 
0.1%
) 4
 
0.1%
1 4
 
0.1%
' 4
 
0.1%
Other values (13) 22
 
0.3%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35428
67.8%
ASCII 16796
32.2%
Katakana 5
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4376
26.1%
, 2793
16.6%
e 777
 
4.6%
a 729
 
4.3%
n 706
 
4.2%
o 613
 
3.6%
r 550
 
3.3%
i 524
 
3.1%
s 419
 
2.5%
t 379
 
2.3%
Other values (64) 4930
29.4%
Hangul
ValueCountFrequency (%)
1689
 
4.8%
1063
 
3.0%
1030
 
2.9%
752
 
2.1%
561
 
1.6%
415
 
1.2%
399
 
1.1%
396
 
1.1%
389
 
1.1%
380
 
1.1%
Other values (745) 28354
80.0%
None
ValueCountFrequency (%)
ø 1
50.0%
· 1
50.0%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Distinct1958
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T11:57:23.417093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length33
Mean length5.137
Min length1

Characters and Unicode

Total characters51370
Distinct characters677
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique908 ?
Unique (%)9.1%

Sample

1st row봄봄출판사
2nd row초기불전연구원
3rd row북새통
4th row조선에듀케이션
5th row생명의말씀사
ValueCountFrequency (%)
문학동네 196
 
1.8%
웅진씽크빅 195
 
1.8%
알에이치코리아 170
 
1.6%
이수미디어 143
 
1.3%
위즈덤하우스 128
 
1.2%
스마일북스 122
 
1.1%
톨스토이 104
 
1.0%
아람 101
 
0.9%
김영사 100
 
0.9%
민음사 97
 
0.9%
Other values (1903) 9522
87.5%
2023-12-12T11:57:23.990283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2045
 
4.0%
1445
 
2.8%
1419
 
2.8%
1333
 
2.6%
1192
 
2.3%
872
 
1.7%
e 767
 
1.5%
s 758
 
1.5%
o 718
 
1.4%
707
 
1.4%
Other values (667) 40114
78.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40177
78.2%
Lowercase Letter 7285
 
14.2%
Uppercase Letter 2340
 
4.6%
Space Separator 1192
 
2.3%
Other Punctuation 191
 
0.4%
Decimal Number 136
 
0.3%
Close Punctuation 21
 
< 0.1%
Open Punctuation 21
 
< 0.1%
Math Symbol 4
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2045
 
5.1%
1445
 
3.6%
1419
 
3.5%
1333
 
3.3%
872
 
2.2%
707
 
1.8%
693
 
1.7%
627
 
1.6%
624
 
1.6%
624
 
1.6%
Other values (592) 29788
74.1%
Lowercase Letter
ValueCountFrequency (%)
e 767
10.5%
s 758
10.4%
o 718
 
9.9%
r 688
 
9.4%
n 627
 
8.6%
i 506
 
6.9%
a 469
 
6.4%
l 318
 
4.4%
u 301
 
4.1%
p 298
 
4.1%
Other values (15) 1835
25.2%
Uppercase Letter
ValueCountFrequency (%)
S 238
 
10.2%
P 188
 
8.0%
M 188
 
8.0%
A 157
 
6.7%
H 146
 
6.2%
B 139
 
5.9%
L 136
 
5.8%
C 132
 
5.6%
D 123
 
5.3%
O 121
 
5.2%
Other values (15) 772
33.0%
Other Punctuation
ValueCountFrequency (%)
& 68
35.6%
' 37
19.4%
, 33
17.3%
. 25
 
13.1%
# 11
 
5.8%
· 7
 
3.7%
: 7
 
3.7%
/ 1
 
0.5%
? 1
 
0.5%
1
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 66
48.5%
2 65
47.8%
8 2
 
1.5%
5 2
 
1.5%
4 1
 
0.7%
Math Symbol
ValueCountFrequency (%)
= 2
50.0%
+ 1
25.0%
1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 20
95.2%
] 1
 
4.8%
Open Punctuation
ValueCountFrequency (%)
( 20
95.2%
[ 1
 
4.8%
Space Separator
ValueCountFrequency (%)
1192
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40163
78.2%
Latin 9625
 
18.7%
Common 1568
 
3.1%
Han 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2045
 
5.1%
1445
 
3.6%
1419
 
3.5%
1333
 
3.3%
872
 
2.2%
707
 
1.8%
693
 
1.7%
627
 
1.6%
624
 
1.6%
624
 
1.6%
Other values (578) 29774
74.1%
Latin
ValueCountFrequency (%)
e 767
 
8.0%
s 758
 
7.9%
o 718
 
7.5%
r 688
 
7.1%
n 627
 
6.5%
i 506
 
5.3%
a 469
 
4.9%
l 318
 
3.3%
u 301
 
3.1%
p 298
 
3.1%
Other values (40) 4175
43.4%
Common
ValueCountFrequency (%)
1192
76.0%
& 68
 
4.3%
1 66
 
4.2%
2 65
 
4.1%
' 37
 
2.4%
, 33
 
2.1%
. 25
 
1.6%
) 20
 
1.3%
( 20
 
1.3%
# 11
 
0.7%
Other values (15) 31
 
2.0%
Han
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40163
78.2%
ASCII 11183
 
21.8%
CJK 14
 
< 0.1%
None 9
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2045
 
5.1%
1445
 
3.6%
1419
 
3.5%
1333
 
3.3%
872
 
2.2%
707
 
1.8%
693
 
1.7%
627
 
1.6%
624
 
1.6%
624
 
1.6%
Other values (578) 29774
74.1%
ASCII
ValueCountFrequency (%)
1192
 
10.7%
e 767
 
6.9%
s 758
 
6.8%
o 718
 
6.4%
r 688
 
6.2%
n 627
 
5.6%
i 506
 
4.5%
a 469
 
4.2%
l 318
 
2.8%
u 301
 
2.7%
Other values (61) 4839
43.3%
None
ValueCountFrequency (%)
· 7
77.8%
1
 
11.1%
1
 
11.1%
CJK
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Punctuation
ValueCountFrequency (%)
1
100.0%

발행년
Text

MISSING 

Distinct67
Distinct (%)0.7%
Missing440
Missing (%)4.4%
Memory size156.2 KiB
2023-12-12T11:57:24.233198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length4
Mean length4.182113
Min length4

Characters and Unicode

Total characters39981
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)0.2%

Sample

1st row2014
2nd row2013
3rd row2014
4th row2014
5th row2013
ValueCountFrequency (%)
2014 6512
68.1%
2013 1098
 
11.5%
2012 401
 
4.2%
2011 282
 
2.9%
2010 281
 
2.9%
2008 218
 
2.3%
2009 135
 
1.4%
2007 84
 
0.9%
2006 75
 
0.8%
2013-2014 70
 
0.7%
Other values (52) 404
 
4.2%
2023-12-12T11:57:24.623563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10901
27.3%
2 10265
25.7%
1 9329
23.3%
4 6701
16.8%
3 1248
 
3.1%
9 270
 
0.7%
- 265
 
0.7%
8 262
 
0.7%
[ 223
 
0.6%
] 223
 
0.6%
Other values (3) 294
 
0.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39270
98.2%
Dash Punctuation 265
 
0.7%
Open Punctuation 223
 
0.6%
Close Punctuation 223
 
0.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10901
27.8%
2 10265
26.1%
1 9329
23.8%
4 6701
17.1%
3 1248
 
3.2%
9 270
 
0.7%
8 262
 
0.7%
6 114
 
0.3%
7 102
 
0.3%
5 78
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 265
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 223
100.0%
Close Punctuation
ValueCountFrequency (%)
] 223
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 39981
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10901
27.3%
2 10265
25.7%
1 9329
23.3%
4 6701
16.8%
3 1248
 
3.1%
9 270
 
0.7%
- 265
 
0.7%
8 262
 
0.7%
[ 223
 
0.6%
] 223
 
0.6%
Other values (3) 294
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39981
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10901
27.3%
2 10265
25.7%
1 9329
23.3%
4 6701
16.8%
3 1248
 
3.1%
9 270
 
0.7%
- 265
 
0.7%
8 262
 
0.7%
[ 223
 
0.6%
] 223
 
0.6%
Other values (3) 294
 
0.7%

Interactions

2023-12-12T11:57:17.627777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:57:24.743334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관리구분자료실명발행년
연번1.0000.8550.9960.580
관리구분0.8551.0001.0000.667
자료실명0.9961.0001.0000.505
발행년0.5800.6670.5051.000
2023-12-12T11:57:24.862974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리구분자료실명
관리구분1.0001.000
자료실명1.0001.000
2023-12-12T11:57:24.991708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번관리구분자료실명
연번1.0000.7050.941
관리구분0.7051.0001.000
자료실명0.9411.0001.000

Missing values

2023-12-12T11:57:17.778443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:57:17.959273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T11:57:18.115813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번관리구분청구기호자료실명서명저작자발행자발행년
39513952아동813.8 ㄱ943ㅊ광진정보도서관 어린이자료실책 읽기 달인 /김진완 글 ;허구 그림김진완봄봄출판사2014
65636564일반223.51 ㄱ216ㄴ v.1광진정보도서관 종합자료실니까야 강독 /각묵스님 옮김·엮음 ;대림스님 옮김 .1-2각묵초기불전연구원2013
1139311394일반814.6 ㅇ663ㄸ광진정보도서관 종합자료실(손자 바보 이계진의) 똥꼬 할아버지와 장미꽃 손자 /이계진 지음 ;이두용 ;이경은 [공] 사진이계진북새통2014
78497850일반334.225 ㅎ521ㅅ광진정보도서관 종합자료실사춘기 내 몸 사용설명서 /안트예 헬름스 글 ;얀 폰 홀레벤 사진 ;박종대 옮김헬름스, 안트예조선에듀케이션2014
10811082아동233 ㅇ631ㄴ광진정보도서관 어린이자료실노아가 동물을 태워요 /윤아해 지음 ;이갑규 그림윤아해생명의말씀사2013
15421543아동408 ㄷ49ㅇ v.14광진정보도서관 어린이자료실커다랗고 커다란 고래 /김선숙 글 ;유진희 ;오헌균 ;신상우 [공]그림김선숙웅진씽크빅2006
50725073아동990ㄱ81ㅊ v.31광진정보도서관 어린이자료실꿈담인물그림책 31<NA>차일드아카데미<NA>
1225512256일반859.7 ㅅ626ㅇ광진정보도서관 종합자료실악명 높은 연인/알렉산데르 쇠데르베리 지음 ;이원열 옮김쇠데르베리, 알렉산데르더난콘텐츠그룹2014
65976598일반224.82 ㅈ982ㄱ광진정보도서관 종합자료실간절히 원하면 이루어진다 /진옹월성 엮음진옹월성아침단청2014
1149711498일반816.6 ㅎ419ㅁ광진정보도서관 종합자료실먹다, 사랑하다, 떠나다 :노마드 소설가 함정임의 세계 식도락 기행 /함정임 글·사진함정임푸르메2014
연번관리구분청구기호자료실명서명저작자발행자발행년
1239812399일반892.1 ㅅ724ㅊ광진정보도서관 종합자료실춤추라, 아무도 바라보지 않는 것처럼 :닫힌 마음을 열 때 약속처럼 찾아오는 삶의 기적들 /아가피 스타시노풀로스 지음 ;이지연 옮김스타시노풀로스, 아가피티즈맵출판사2014
43074308아동823.6 ㅈ626홍-ㅂ광진정보도서관 어린이자료실홍루몽 :슬프고도 아름다운 사랑 이야기 /조설근 원작 ;진보 지음 ;[자오청웨이 그림] ;전수정 옮김조설근보림출판사2014
96669667일반657.1 ㅎ465식ㄱ v.2-3광진정보도서관 종합자료실식객 Ⅱ =食客 /허영만 지음.1-3허영만시루2014
1001710018일반747.2 ㄷ88ㅇ v.1-5광진정보도서관 종합자료실adventures of Tom Sawyer =톰 소여의 모험 /by Mark Twain ;retold by David ThayneTwain, Mark알에이치코리아2013
1197311974일반843 ㅂ246ㅅ3 v.1광진정보도서관 종합자료실사금파리 한 조각 /린다 수 박 글 ;이상희 옮김 ;김세현 그림.1박, 린다 수서울문화사2002
67886789일반308 ㅂ146ㅎ광진정보도서관 종합자료실희망, 살아 있는 자의 의무 :지그문트 바우만 인터뷰 /[지그문트 바우만 인터뷰] ;인디고 연구소 기획바우만, 지그문트궁리출판2014
1112311124일반813.6 ㅈ278ㅁㄹ v.2-6 c.3광진정보도서관 종합자료실룬의 아이들 /전민희 지음.2-6 :데모닉전민희제우미디어2006
1090710908일반813.6 ㄱ966ㅂ v.1광진정보도서관 종합자료실불멸의 이순신 :김탁환 장편소설 /김탁환 지음.1-8김탁환민음사2014
1045610457일반808.9 ㅅ116사 v.339광진정보도서관 종합자료실해리 포터와 비밀의 방 1 (반양장) - 신개정판J. K. 롤링문학수첩2014
32433244아동808.9 ㅊ788ㅋ v.186광진정보도서관 어린이자료실기억을 지워주는 문방구조규미살림어린이2014