Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells108
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description대구광역시 중구의 작은도서관 보유 도서목록(책제목, 저자명, 출판사 등)을 제공하고 있습니다.
Author대구광역시 중구
URLhttps://www.data.go.kr/data/15054150/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:03:14.724259
Analysis finished2023-12-12 10:03:17.401618
Duration2.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50251.94
Minimum20
Maximum99983
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T19:03:17.488090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile4815.95
Q125126.75
median50477.5
Q375588.75
95-th percentile95087.4
Maximum99983
Range99963
Interquartile range (IQR)50462

Descriptive statistics

Standard deviation28986.571
Coefficient of variation (CV)0.57682491
Kurtosis-1.2086044
Mean50251.94
Median Absolute Deviation (MAD)25226.5
Skewness-0.014129618
Sum5.025194 × 108
Variance8.4022129 × 108
MonotonicityNot monotonic
2023-12-12T19:03:17.691692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
92143 1
 
< 0.1%
77912 1
 
< 0.1%
47953 1
 
< 0.1%
20495 1
 
< 0.1%
59195 1
 
< 0.1%
64714 1
 
< 0.1%
69668 1
 
< 0.1%
20298 1
 
< 0.1%
98335 1
 
< 0.1%
94677 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
20 1
< 0.1%
21 1
< 0.1%
43 1
< 0.1%
46 1
< 0.1%
54 1
< 0.1%
77 1
< 0.1%
89 1
< 0.1%
97 1
< 0.1%
110 1
< 0.1%
126 1
< 0.1%
ValueCountFrequency (%)
99983 1
< 0.1%
99980 1
< 0.1%
99978 1
< 0.1%
99974 1
< 0.1%
99960 1
< 0.1%
99942 1
< 0.1%
99932 1
< 0.1%
99929 1
< 0.1%
99927 1
< 0.1%
99919 1
< 0.1%
Distinct9630
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:03:18.105596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length130
Median length92
Mean length18.1268
Min length1

Characters and Unicode

Total characters181268
Distinct characters1490
Distinct categories16 ?
Distinct scripts6 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9294 ?
Unique (%)92.9%

Sample

1st row(소리치지 않고 화내지 않고) 초등학생 공부시키기/
2nd row이상 소설 전집
3rd row킹핀
4th rowOne fish two fish red fish blue fish(One fish two fish red fish blue fish)
5th row26년 1
ValueCountFrequency (%)
1220
 
3.1%
the 678
 
1.7%
and 300
 
0.8%
of 258
 
0.7%
이야기 204
 
0.5%
a 204
 
0.5%
2 149
 
0.4%
1 148
 
0.4%
위한 133
 
0.3%
125
 
0.3%
Other values (17739) 35935
91.3%
2023-12-12T19:03:18.599980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29410
 
16.2%
e 5343
 
2.9%
a 3626
 
2.0%
o 3511
 
1.9%
t 3264
 
1.8%
r 3038
 
1.7%
i 2957
 
1.6%
n 2869
 
1.6%
s 2864
 
1.6%
) 2602
 
1.4%
Other values (1480) 121784
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84919
46.8%
Lowercase Letter 42982
23.7%
Space Separator 29410
 
16.2%
Other Punctuation 7430
 
4.1%
Decimal Number 5636
 
3.1%
Uppercase Letter 4675
 
2.6%
Close Punctuation 2818
 
1.6%
Open Punctuation 2813
 
1.6%
Dash Punctuation 248
 
0.1%
Math Symbol 215
 
0.1%
Other values (6) 122
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2402
 
2.8%
2279
 
2.7%
1578
 
1.9%
1412
 
1.7%
1376
 
1.6%
1150
 
1.4%
1138
 
1.3%
1122
 
1.3%
1091
 
1.3%
1073
 
1.3%
Other values (1364) 70298
82.8%
Lowercase Letter
ValueCountFrequency (%)
e 5343
12.4%
a 3626
 
8.4%
o 3511
 
8.2%
t 3264
 
7.6%
r 3038
 
7.1%
i 2957
 
6.9%
n 2869
 
6.7%
s 2864
 
6.7%
h 2357
 
5.5%
l 1847
 
4.3%
Other values (17) 11306
26.3%
Uppercase Letter
ValueCountFrequency (%)
T 650
13.9%
W 378
 
8.1%
S 373
 
8.0%
A 355
 
7.6%
M 299
 
6.4%
C 243
 
5.2%
P 223
 
4.8%
B 220
 
4.7%
I 218
 
4.7%
H 207
 
4.4%
Other values (16) 1509
32.3%
Other Punctuation
ValueCountFrequency (%)
/ 2427
32.7%
: 1528
20.6%
. 1377
18.5%
, 910
 
12.2%
? 381
 
5.1%
! 316
 
4.3%
' 154
 
2.1%
· 96
 
1.3%
; 81
 
1.1%
& 43
 
0.6%
Other values (12) 117
 
1.6%
Decimal Number
ValueCountFrequency (%)
1 1281
22.7%
2 1096
19.4%
0 1093
19.4%
3 461
 
8.2%
5 384
 
6.8%
9 351
 
6.2%
4 317
 
5.6%
7 220
 
3.9%
6 220
 
3.9%
8 213
 
3.8%
Math Symbol
ValueCountFrequency (%)
= 160
74.4%
~ 33
 
15.3%
+ 11
 
5.1%
< 3
 
1.4%
> 3
 
1.4%
2
 
0.9%
÷ 1
 
0.5%
1
 
0.5%
1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 2602
92.3%
] 211
 
7.5%
2
 
0.1%
2
 
0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2598
92.4%
[ 210
 
7.5%
2
 
0.1%
2
 
0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
9
50.0%
7
38.9%
1
 
5.6%
1
 
5.6%
Modifier Symbol
ValueCountFrequency (%)
` 93
95.9%
´ 4
 
4.1%
Space Separator
ValueCountFrequency (%)
29410
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 248
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Modifier Letter
ValueCountFrequency (%)
ː 1
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 84543
46.6%
Common 48674
26.9%
Latin 47674
26.3%
Han 371
 
0.2%
Katakana 5
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2402
 
2.8%
2279
 
2.7%
1578
 
1.9%
1412
 
1.7%
1376
 
1.6%
1150
 
1.4%
1138
 
1.3%
1122
 
1.3%
1091
 
1.3%
1073
 
1.3%
Other values (1183) 69922
82.7%
Han
ValueCountFrequency (%)
13
 
3.5%
13
 
3.5%
12
 
3.2%
12
 
3.2%
8
 
2.2%
7
 
1.9%
7
 
1.9%
6
 
1.6%
6
 
1.6%
6
 
1.6%
Other values (166) 281
75.7%
Common
ValueCountFrequency (%)
29410
60.4%
) 2602
 
5.3%
( 2598
 
5.3%
/ 2427
 
5.0%
: 1528
 
3.1%
. 1377
 
2.8%
1 1281
 
2.6%
2 1096
 
2.3%
0 1093
 
2.2%
, 910
 
1.9%
Other values (49) 4352
 
8.9%
Latin
ValueCountFrequency (%)
e 5343
 
11.2%
a 3626
 
7.6%
o 3511
 
7.4%
t 3264
 
6.8%
r 3038
 
6.4%
i 2957
 
6.2%
n 2869
 
6.0%
s 2864
 
6.0%
h 2357
 
4.9%
l 1847
 
3.9%
Other values (46) 15998
33.6%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 96101
53.0%
Hangul 84538
46.6%
CJK 357
 
0.2%
None 220
 
0.1%
Number Forms 18
 
< 0.1%
CJK Compat Ideographs 14
 
< 0.1%
Punctuation 6
 
< 0.1%
Katakana 5
 
< 0.1%
Compat Jamo 5
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29410
30.6%
e 5343
 
5.6%
a 3626
 
3.8%
o 3511
 
3.7%
t 3264
 
3.4%
r 3038
 
3.2%
i 2957
 
3.1%
n 2869
 
3.0%
s 2864
 
3.0%
) 2602
 
2.7%
Other values (76) 36617
38.1%
Hangul
ValueCountFrequency (%)
2402
 
2.8%
2279
 
2.7%
1578
 
1.9%
1412
 
1.7%
1376
 
1.6%
1150
 
1.4%
1138
 
1.3%
1122
 
1.3%
1091
 
1.3%
1073
 
1.3%
Other values (1178) 69917
82.7%
None
ValueCountFrequency (%)
· 96
43.6%
43
19.5%
32
 
14.5%
12
 
5.5%
6
 
2.7%
´ 4
 
1.8%
4
 
1.8%
3
 
1.4%
3
 
1.4%
2
 
0.9%
Other values (11) 15
 
6.8%
CJK
ValueCountFrequency (%)
13
 
3.6%
13
 
3.6%
12
 
3.4%
12
 
3.4%
8
 
2.2%
7
 
2.0%
7
 
2.0%
6
 
1.7%
6
 
1.7%
6
 
1.7%
Other values (158) 267
74.8%
Number Forms
ValueCountFrequency (%)
9
50.0%
7
38.9%
1
 
5.6%
1
 
5.6%
CJK Compat Ideographs
ValueCountFrequency (%)
4
28.6%
3
21.4%
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Punctuation
ValueCountFrequency (%)
3
50.0%
3
50.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Compat Jamo
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Modifier Letters
ValueCountFrequency (%)
ː 1
100.0%
Distinct7622
Distinct (%)76.4%
Missing23
Missing (%)0.2%
Memory size156.2 KiB
2023-12-12T19:03:18.954024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length151
Median length123
Mean length15.696301
Min length2

Characters and Unicode

Total characters156602
Distinct characters939
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6403 ?
Unique (%)64.2%

Sample

1st row고봉익;
2nd row이상 지음;권영민 책임 편집
3rd row전옥표 지음
4th rowDr. Seuss
5th row강풀 글·그림
ValueCountFrequency (%)
지음 3133
 
8.6%
by 2677
 
7.3%
2644
 
7.2%
옮김 1443
 
3.9%
그림 1012
 
2.8%
843
 
2.3%
illustrated 682
 
1.9%
547
 
1.5%
written 232
 
0.6%
엮음 194
 
0.5%
Other values (10943) 23206
63.4%
2023-12-12T19:03:19.455036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26884
 
17.2%
e 5414
 
3.5%
a 4830
 
3.1%
; 4713
 
3.0%
r 4295
 
2.7%
4074
 
2.6%
t 4017
 
2.6%
i 3998
 
2.6%
l 3931
 
2.5%
3871
 
2.5%
Other values (929) 90575
57.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62995
40.2%
Lowercase Letter 51957
33.2%
Space Separator 26884
17.2%
Uppercase Letter 7416
 
4.7%
Other Punctuation 6379
 
4.1%
Open Punctuation 383
 
0.2%
Close Punctuation 383
 
0.2%
Decimal Number 98
 
0.1%
Dash Punctuation 85
 
0.1%
Math Symbol 14
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4074
 
6.5%
3871
 
6.1%
2866
 
4.5%
1996
 
3.2%
1589
 
2.5%
1539
 
2.4%
1528
 
2.4%
1488
 
2.4%
1007
 
1.6%
846
 
1.3%
Other values (843) 42191
67.0%
Lowercase Letter
ValueCountFrequency (%)
e 5414
10.4%
a 4830
 
9.3%
r 4295
 
8.3%
t 4017
 
7.7%
i 3998
 
7.7%
l 3931
 
7.6%
y 3764
 
7.2%
n 3653
 
7.0%
b 3069
 
5.9%
o 2662
 
5.1%
Other values (16) 12324
23.7%
Uppercase Letter
ValueCountFrequency (%)
M 766
 
10.3%
S 664
 
9.0%
B 534
 
7.2%
J 503
 
6.8%
A 491
 
6.6%
R 475
 
6.4%
D 457
 
6.2%
C 451
 
6.1%
L 401
 
5.4%
H 375
 
5.1%
Other values (16) 2299
31.0%
Other Punctuation
ValueCountFrequency (%)
; 4713
73.9%
. 854
 
13.4%
, 376
 
5.9%
· 241
 
3.8%
: 149
 
2.3%
/ 13
 
0.2%
& 13
 
0.2%
' 12
 
0.2%
3
 
< 0.1%
1
 
< 0.1%
Other values (4) 4
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 21
21.4%
8 17
17.3%
5 16
16.3%
0 12
12.2%
3 12
12.2%
2 7
 
7.1%
6 7
 
7.1%
9 4
 
4.1%
4 2
 
2.0%
Open Punctuation
ValueCountFrequency (%)
[ 366
95.6%
( 17
 
4.4%
Close Punctuation
ValueCountFrequency (%)
] 366
95.6%
) 17
 
4.4%
Math Symbol
ValueCountFrequency (%)
> 7
50.0%
< 7
50.0%
Space Separator
ValueCountFrequency (%)
26884
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 85
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 6
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62916
40.2%
Latin 59374
37.9%
Common 34233
21.9%
Han 79
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4074
 
6.5%
3871
 
6.2%
2866
 
4.6%
1996
 
3.2%
1589
 
2.5%
1539
 
2.4%
1528
 
2.4%
1488
 
2.4%
1007
 
1.6%
846
 
1.3%
Other values (789) 42112
66.9%
Han
ValueCountFrequency (%)
5
 
6.3%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (44) 54
68.4%
Latin
ValueCountFrequency (%)
e 5414
 
9.1%
a 4830
 
8.1%
r 4295
 
7.2%
t 4017
 
6.8%
i 3998
 
6.7%
l 3931
 
6.6%
y 3764
 
6.3%
n 3653
 
6.2%
b 3069
 
5.2%
o 2662
 
4.5%
Other values (43) 19741
33.2%
Common
ValueCountFrequency (%)
26884
78.5%
; 4713
 
13.8%
. 854
 
2.5%
, 376
 
1.1%
[ 366
 
1.1%
] 366
 
1.1%
· 241
 
0.7%
: 149
 
0.4%
- 85
 
0.2%
1 21
 
0.1%
Other values (23) 178
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 93357
59.6%
Hangul 62916
40.2%
None 247
 
0.2%
CJK 72
 
< 0.1%
CJK Compat Ideographs 7
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
Number Forms 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
26884
28.8%
e 5414
 
5.8%
a 4830
 
5.2%
; 4713
 
5.0%
r 4295
 
4.6%
t 4017
 
4.3%
i 3998
 
4.3%
l 3931
 
4.2%
y 3764
 
4.0%
n 3653
 
3.9%
Other values (68) 27858
29.8%
Hangul
ValueCountFrequency (%)
4074
 
6.5%
3871
 
6.2%
2866
 
4.6%
1996
 
3.2%
1589
 
2.5%
1539
 
2.4%
1528
 
2.4%
1488
 
2.4%
1007
 
1.6%
846
 
1.3%
Other values (789) 42112
66.9%
None
ValueCountFrequency (%)
· 241
97.6%
3
 
1.2%
1
 
0.4%
1
 
0.4%
1
 
0.4%
CJK
ValueCountFrequency (%)
5
 
6.9%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (40) 48
66.7%
CJK Compat Ideographs
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct3782
Distinct (%)37.9%
Missing19
Missing (%)0.2%
Memory size156.2 KiB
2023-12-12T19:03:19.697772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length35
Mean length6.8758641
Min length1

Characters and Unicode

Total characters68628
Distinct characters758
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2398 ?
Unique (%)24.0%

Sample

1st row명진출판,
2nd row민음사
3rd row위즈덤하우스
4th rowRandom House
5th row재미주의
ValueCountFrequency (%)
books 262
 
2.1%
press 257
 
2.0%
scholastic 167
 
1.3%
대구광역시 144
 
1.1%
예림당 139
 
1.1%
120
 
1.0%
문학동네 117
 
0.9%
oxford 109
 
0.9%
house 101
 
0.8%
창비 92
 
0.7%
Other values (3023) 11058
88.0%
2023-12-12T19:03:20.400449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2589
 
3.8%
, 2427
 
3.5%
o 2384
 
3.5%
r 2171
 
3.2%
e 2168
 
3.2%
s 2084
 
3.0%
1864
 
2.7%
i 1727
 
2.5%
n 1687
 
2.5%
a 1623
 
2.4%
Other values (748) 47904
69.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35245
51.4%
Lowercase Letter 22508
32.8%
Uppercase Letter 4423
 
6.4%
Other Punctuation 2786
 
4.1%
Space Separator 2589
 
3.8%
Open Punctuation 387
 
0.6%
Close Punctuation 386
 
0.6%
Decimal Number 267
 
0.4%
Dash Punctuation 30
 
< 0.1%
Modifier Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1864
 
5.3%
901
 
2.6%
873
 
2.5%
782
 
2.2%
781
 
2.2%
711
 
2.0%
641
 
1.8%
607
 
1.7%
568
 
1.6%
559
 
1.6%
Other values (667) 26958
76.5%
Lowercase Letter
ValueCountFrequency (%)
o 2384
10.6%
r 2171
 
9.6%
e 2168
 
9.6%
s 2084
 
9.3%
i 1727
 
7.7%
n 1687
 
7.5%
a 1623
 
7.2%
l 1108
 
4.9%
t 1045
 
4.6%
d 811
 
3.6%
Other values (16) 5700
25.3%
Uppercase Letter
ValueCountFrequency (%)
P 583
13.2%
C 454
 
10.3%
H 438
 
9.9%
B 412
 
9.3%
S 350
 
7.9%
R 254
 
5.7%
O 205
 
4.6%
M 204
 
4.6%
T 183
 
4.1%
G 167
 
3.8%
Other values (16) 1173
26.5%
Other Punctuation
ValueCountFrequency (%)
, 2427
87.1%
& 116
 
4.2%
. 101
 
3.6%
' 61
 
2.2%
28
 
1.0%
: 15
 
0.5%
/ 12
 
0.4%
; 11
 
0.4%
· 10
 
0.4%
? 3
 
0.1%
Other values (2) 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 89
33.3%
2 76
28.5%
0 39
14.6%
8 17
 
6.4%
7 16
 
6.0%
5 14
 
5.2%
3 7
 
2.6%
4 5
 
1.9%
9 3
 
1.1%
6 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 369
95.3%
[ 18
 
4.7%
Close Punctuation
ValueCountFrequency (%)
) 368
95.3%
] 18
 
4.7%
Space Separator
ValueCountFrequency (%)
2589
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35065
51.1%
Latin 26931
39.2%
Common 6452
 
9.4%
Han 180
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1864
 
5.3%
901
 
2.6%
873
 
2.5%
782
 
2.2%
781
 
2.2%
711
 
2.0%
641
 
1.8%
607
 
1.7%
568
 
1.6%
559
 
1.6%
Other values (587) 26778
76.4%
Han
ValueCountFrequency (%)
28
 
15.6%
17
 
9.4%
13
 
7.2%
13
 
7.2%
5
 
2.8%
4
 
2.2%
4
 
2.2%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (70) 87
48.3%
Latin
ValueCountFrequency (%)
o 2384
 
8.9%
r 2171
 
8.1%
e 2168
 
8.1%
s 2084
 
7.7%
i 1727
 
6.4%
n 1687
 
6.3%
a 1623
 
6.0%
l 1108
 
4.1%
t 1045
 
3.9%
d 811
 
3.0%
Other values (42) 10123
37.6%
Common
ValueCountFrequency (%)
2589
40.1%
, 2427
37.6%
( 369
 
5.7%
) 368
 
5.7%
& 116
 
1.8%
. 101
 
1.6%
1 89
 
1.4%
2 76
 
1.2%
' 61
 
0.9%
0 39
 
0.6%
Other values (19) 217
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35065
51.1%
ASCII 33345
48.6%
CJK 180
 
0.3%
None 38
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2589
 
7.8%
, 2427
 
7.3%
o 2384
 
7.1%
r 2171
 
6.5%
e 2168
 
6.5%
s 2084
 
6.2%
i 1727
 
5.2%
n 1687
 
5.1%
a 1623
 
4.9%
l 1108
 
3.3%
Other values (69) 13377
40.1%
Hangul
ValueCountFrequency (%)
1864
 
5.3%
901
 
2.6%
873
 
2.5%
782
 
2.2%
781
 
2.2%
711
 
2.0%
641
 
1.8%
607
 
1.7%
568
 
1.6%
559
 
1.6%
Other values (587) 26778
76.4%
CJK
ValueCountFrequency (%)
28
 
15.6%
17
 
9.4%
13
 
7.2%
13
 
7.2%
5
 
2.8%
4
 
2.2%
4
 
2.2%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (70) 87
48.3%
None
ValueCountFrequency (%)
28
73.7%
· 10
 
26.3%

발행년
Real number (ℝ)

Distinct53
Distinct (%)0.5%
Missing66
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean2007.2099
Minimum1900
Maximum2102
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T19:03:20.574545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1900
5-th percentile1992
Q12002
median2010
Q32014
95-th percentile2017
Maximum2102
Range202
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.5435818
Coefficient of variation (CV)0.0042564467
Kurtosis8.714977
Mean2007.2099
Median Absolute Deviation (MAD)4
Skewness-1.2730485
Sum19939623
Variance72.992791
MonotonicityNot monotonic
2023-12-12T19:03:20.711325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2014 911
 
9.1%
2010 673
 
6.7%
2011 661
 
6.6%
2012 616
 
6.2%
2013 579
 
5.8%
2016 579
 
5.8%
2009 462
 
4.6%
2008 435
 
4.3%
1992 414
 
4.1%
2017 403
 
4.0%
Other values (43) 4201
42.0%
ValueCountFrequency (%)
1900 3
< 0.1%
1965 1
 
< 0.1%
1968 1
 
< 0.1%
1970 4
< 0.1%
1971 2
< 0.1%
1972 1
 
< 0.1%
1973 2
< 0.1%
1974 1
 
< 0.1%
1976 3
< 0.1%
1977 3
< 0.1%
ValueCountFrequency (%)
2102 1
 
< 0.1%
2019 24
 
0.2%
2018 333
 
3.3%
2017 403
4.0%
2016 579
5.8%
2015 353
 
3.5%
2014 911
9.1%
2013 579
5.8%
2012 616
6.2%
2011 661
6.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2020-10-13
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-10-13
2nd row2020-10-13
3rd row2020-10-13
4th row2020-10-13
5th row2020-10-13

Common Values

ValueCountFrequency (%)
2020-10-13 10000
100.0%

Length

2023-12-12T19:03:20.863515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:03:20.970960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-10-13 10000
100.0%

Interactions

2023-12-12T19:03:16.767514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:03:16.536129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:03:16.886461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:03:16.651096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:03:21.026866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번발행년
연번1.0000.527
발행년0.5271.000
2023-12-12T19:03:21.102060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번발행년
연번1.0000.268
발행년0.2681.000

Missing values

2023-12-12T19:03:17.041078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:03:17.172714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:03:17.298309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번도서명저작자출판사발행년데이터기준일자
9214292143(소리치지 않고 화내지 않고) 초등학생 공부시키기/고봉익;명진출판,20102020-10-13
9834198342이상 소설 전집이상 지음;권영민 책임 편집민음사20182020-10-13
1366513666킹핀전옥표 지음위즈덤하우스20102020-10-13
4343143432One fish two fish red fish blue fish(One fish two fish red fish blue fish)Dr. SeussRandom House20022020-10-13
132141321526년 1강풀 글·그림재미주의20122020-10-13
7385373854마지막 강의/랜디 포시살림,20102020-10-13
9401794018커넥팅:창조하고-연결하고-소통하라/데이비드 건틀릿 지음;삼천리,20112020-10-13
1618416185사할린의 여름 하늘은 낮다인문사회연구소경상북도20112020-10-13
9877398774아리랑.8조정래 저해냄20172020-10-13
52635264자신있게 살아라앤드류 매튜스 지음; 홍은주 옮김(도서출판)고도19992020-10-13
연번도서명저작자출판사발행년데이터기준일자
4101141012열두 달 김치 이야기김진완토마토하우스20162020-10-13
22732274동양사대기서전집.9[東洋四大奇書全集:수호지(3)]시내암 지음삼성문화사19852020-10-13
2553725538백화점 그리고 사물.세계;사람조경란문학동네20112020-10-13
8705487055밀실살인게임 : 마니악스우타노 쇼고 지음 ; 김은모 옮김한스미디어20122020-10-13
9312493125요구르트 표현/한국피카소편집부 편한국피카소,19972020-10-13
2254822549절대로 실수하지 않는 아이게리 루빈스타인 글;마크 펫 그림,노경실 옮김두레아이들20152020-10-13
3386333864(내 이름은) 빨강 = My name is red : 오르한 파묵 장편소설. 2오르한 파묵 지음 ; 이난아 옮김민음사20062020-10-13
3340333404흔들흔들 내 앞니 절대 안 빼로렌 차일드 지음 ; 김난령 옮김국민서관20132020-10-13
7101271013지금 당장 변해야 산다/원포드E.더치 홀랜드 지음경성라인,20042020-10-13
8427284273백만 광년의 고독 속에서 한 줄의 시를 읽다 : 류시화의 하이쿠 읽기류시화 지음연금술사20142020-10-13