Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells442
Missing cells (%)0.6%
Duplicate rows29
Duplicate rows (%)0.3%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Text5
Numeric2
Categorical1

Dataset

Description파주시 관내 도서관에서 소장중인 도서들에 대한 데이터로서, 도서제목, 구입연도, 저자, 출판사, 페이지수, 가격, 출판연도, 도서관명 등의 정보를 제공합니다
URLhttps://www.data.go.kr/data/15113594/fileData.do

Alerts

Dataset has 29 (0.3%) duplicate rowsDuplicates
구입연도 is highly overall correlated with 도서관명High correlation
도서관명 is highly overall correlated with 구입연도High correlation
페이지수 has 442 (4.4%) missing valuesMissing
가격 is highly skewed (γ1 = 26.95724388)Skewed

Reproduction

Analysis started2023-12-12 03:25:55.125725
Analysis finished2023-12-12 03:25:58.523096
Duration3.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9853
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:25:58.849346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length151
Median length85
Mean length18.2723
Min length1

Characters and Unicode

Total characters182723
Distinct characters1700
Distinct categories16 ?
Distinct scripts6 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9720 ?
Unique (%)97.2%

Sample

1st row가집에 담아낸 노래와 사람들
2nd row코는 냄새만 맡을까?
3rd row장자 : 자연 속에서 찾은 자유의 세계
4th row(홍정상인 호설암의) 인간경영
5th row깜둥바가지 아줌마
ValueCountFrequency (%)
3358
 
7.0%
이야기 410
 
0.9%
장편소설 338
 
0.7%
2 251
 
0.5%
1 240
 
0.5%
위한 207
 
0.4%
200
 
0.4%
우리 172
 
0.4%
나는 133
 
0.3%
the 130
 
0.3%
Other values (21017) 42380
88.6%
2023-12-12T12:25:59.627701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38674
 
21.2%
3558
 
1.9%
3271
 
1.8%
: 3063
 
1.7%
2235
 
1.2%
1897
 
1.0%
1766
 
1.0%
1701
 
0.9%
1646
 
0.9%
1609
 
0.9%
Other values (1690) 123303
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 116770
63.9%
Space Separator 38674
 
21.2%
Lowercase Letter 11797
 
6.5%
Other Punctuation 6915
 
3.8%
Decimal Number 3189
 
1.7%
Uppercase Letter 1873
 
1.0%
Open Punctuation 1434
 
0.8%
Close Punctuation 1434
 
0.8%
Math Symbol 504
 
0.3%
Dash Punctuation 92
 
0.1%
Other values (6) 41
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3558
 
3.0%
3271
 
2.8%
2235
 
1.9%
1897
 
1.6%
1766
 
1.5%
1701
 
1.5%
1646
 
1.4%
1609
 
1.4%
1532
 
1.3%
1481
 
1.3%
Other values (1574) 96074
82.3%
Lowercase Letter
ValueCountFrequency (%)
e 1368
11.6%
o 1070
 
9.1%
a 980
 
8.3%
i 950
 
8.1%
t 841
 
7.1%
r 838
 
7.1%
n 823
 
7.0%
s 799
 
6.8%
l 525
 
4.5%
h 501
 
4.2%
Other values (17) 3102
26.3%
Uppercase Letter
ValueCountFrequency (%)
T 192
 
10.3%
S 161
 
8.6%
C 144
 
7.7%
A 127
 
6.8%
M 100
 
5.3%
E 97
 
5.2%
I 92
 
4.9%
D 91
 
4.9%
B 86
 
4.6%
P 81
 
4.3%
Other values (17) 702
37.5%
Other Punctuation
ValueCountFrequency (%)
: 3063
44.3%
, 1363
19.7%
. 1149
 
16.6%
! 457
 
6.6%
? 432
 
6.2%
· 194
 
2.8%
' 107
 
1.5%
; 41
 
0.6%
26
 
0.4%
/ 22
 
0.3%
Other values (10) 61
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 842
26.4%
2 574
18.0%
0 556
17.4%
3 300
 
9.4%
5 204
 
6.4%
4 181
 
5.7%
9 152
 
4.8%
8 129
 
4.0%
6 128
 
4.0%
7 123
 
3.9%
Math Symbol
ValueCountFrequency (%)
= 428
84.9%
~ 45
 
8.9%
< 8
 
1.6%
> 8
 
1.6%
+ 8
 
1.6%
× 2
 
0.4%
2
 
0.4%
2
 
0.4%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1411
98.4%
[ 16
 
1.1%
3
 
0.2%
2
 
0.1%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1411
98.4%
] 16
 
1.1%
3
 
0.2%
2
 
0.1%
2
 
0.1%
Letter Number
ValueCountFrequency (%)
8
50.0%
4
25.0%
2
 
12.5%
1
 
6.2%
1
 
6.2%
Other Symbol
ValueCountFrequency (%)
8
88.9%
1
 
11.1%
Space Separator
ValueCountFrequency (%)
38674
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 7
100.0%
Final Punctuation
ValueCountFrequency (%)
5
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 116212
63.6%
Common 52267
28.6%
Latin 13685
 
7.5%
Han 556
 
0.3%
Hiragana 2
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3558
 
3.1%
3271
 
2.8%
2235
 
1.9%
1897
 
1.6%
1766
 
1.5%
1701
 
1.5%
1646
 
1.4%
1609
 
1.4%
1532
 
1.3%
1481
 
1.3%
Other values (1283) 95516
82.2%
Han
ValueCountFrequency (%)
12
 
2.2%
11
 
2.0%
10
 
1.8%
10
 
1.8%
10
 
1.8%
9
 
1.6%
9
 
1.6%
8
 
1.4%
7
 
1.3%
7
 
1.3%
Other values (279) 463
83.3%
Latin
ValueCountFrequency (%)
e 1368
 
10.0%
o 1070
 
7.8%
a 980
 
7.2%
i 950
 
6.9%
t 841
 
6.1%
r 838
 
6.1%
n 823
 
6.0%
s 799
 
5.8%
l 525
 
3.8%
h 501
 
3.7%
Other values (48) 4990
36.5%
Common
ValueCountFrequency (%)
38674
74.0%
: 3063
 
5.9%
( 1411
 
2.7%
) 1411
 
2.7%
, 1363
 
2.6%
. 1149
 
2.2%
1 842
 
1.6%
2 574
 
1.1%
0 556
 
1.1%
! 457
 
0.9%
Other values (47) 2767
 
5.3%
Hiragana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Greek
ValueCountFrequency (%)
χ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 116192
63.6%
ASCII 65646
35.9%
CJK 538
 
0.3%
None 262
 
0.1%
Compat Jamo 20
 
< 0.1%
Punctuation 19
 
< 0.1%
CJK Compat Ideographs 18
 
< 0.1%
Number Forms 16
 
< 0.1%
Enclosed Alphanum 8
 
< 0.1%
Hiragana 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38674
58.9%
: 3063
 
4.7%
( 1411
 
2.1%
) 1411
 
2.1%
e 1368
 
2.1%
, 1363
 
2.1%
. 1149
 
1.8%
o 1070
 
1.6%
a 980
 
1.5%
i 950
 
1.4%
Other values (77) 14207
 
21.6%
Hangul
ValueCountFrequency (%)
3558
 
3.1%
3271
 
2.8%
2235
 
1.9%
1897
 
1.6%
1766
 
1.5%
1701
 
1.5%
1646
 
1.4%
1609
 
1.4%
1532
 
1.3%
1481
 
1.3%
Other values (1274) 95496
82.2%
None
ValueCountFrequency (%)
· 194
74.0%
26
 
9.9%
8
 
3.1%
5
 
1.9%
5
 
1.9%
3
 
1.1%
3
 
1.1%
× 2
 
0.8%
2
 
0.8%
2
 
0.8%
Other values (8) 12
 
4.6%
CJK
ValueCountFrequency (%)
12
 
2.2%
11
 
2.0%
10
 
1.9%
10
 
1.9%
10
 
1.9%
9
 
1.7%
9
 
1.7%
8
 
1.5%
7
 
1.3%
7
 
1.3%
Other values (267) 445
82.7%
Punctuation
ValueCountFrequency (%)
11
57.9%
5
26.3%
3
 
15.8%
Compat Jamo
ValueCountFrequency (%)
8
40.0%
3
 
15.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Number Forms
ValueCountFrequency (%)
8
50.0%
4
25.0%
2
 
12.5%
1
 
6.2%
1
 
6.2%
Enclosed Alphanum
ValueCountFrequency (%)
8
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
16.7%
3
16.7%
2
11.1%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (2) 2
11.1%
Math Operators
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
50.0%
1
50.0%

구입연도
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2014.1299
Minimum2011
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:25:59.799270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2011
5-th percentile2011
Q12011
median2014
Q32016
95-th percentile2021
Maximum2023
Range12
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.4955183
Coefficient of variation (CV)0.0017354979
Kurtosis-0.43995499
Mean2014.1299
Median Absolute Deviation (MAD)3
Skewness0.84804994
Sum20141299
Variance12.218648
MonotonicityNot monotonic
2023-12-12T12:25:59.942367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2011 4438
44.4%
2014 2200
22.0%
2015 504
 
5.0%
2017 468
 
4.7%
2016 434
 
4.3%
2019 430
 
4.3%
2021 382
 
3.8%
2018 371
 
3.7%
2020 368
 
3.7%
2022 324
 
3.2%
ValueCountFrequency (%)
2011 4438
44.4%
2014 2200
22.0%
2015 504
 
5.0%
2016 434
 
4.3%
2017 468
 
4.7%
2018 371
 
3.7%
2019 430
 
4.3%
2020 368
 
3.7%
2021 382
 
3.8%
2022 324
 
3.2%
ValueCountFrequency (%)
2023 81
 
0.8%
2022 324
 
3.2%
2021 382
 
3.8%
2020 368
 
3.7%
2019 430
 
4.3%
2018 371
 
3.7%
2017 468
 
4.7%
2016 434
 
4.3%
2015 504
 
5.0%
2014 2200
22.0%

저자
Text

Distinct9235
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:26:00.464668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length236
Median length103
Mean length17.4092
Min length2

Characters and Unicode

Total characters174092
Distinct characters1158
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8681 ?
Unique (%)86.8%

Sample

1st row조순자 저
2nd row백명식 글·그림
3rd row장자 지음 ; 조수형 풀어씀
4th row호설암 원전 ; 구양일비 해석 ; 이선영 옮김
5th row권정생 지음
ValueCountFrequency (%)
8252
 
15.6%
지음 5458
 
10.3%
옮김 3991
 
7.5%
그림 2882
 
5.4%
2277
 
4.3%
글·그림 641
 
1.2%
340
 
0.6%
by 272
 
0.5%
236
 
0.4%
엮음 229
 
0.4%
Other values (14328) 28359
53.6%
2023-12-12T12:26:01.270377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44347
25.5%
; 8243
 
4.7%
6493
 
3.7%
6404
 
3.7%
5923
 
3.4%
4067
 
2.3%
3862
 
2.2%
3731
 
2.1%
3542
 
2.0%
3177
 
1.8%
Other values (1148) 84303
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108448
62.3%
Space Separator 44347
25.5%
Other Punctuation 10650
 
6.1%
Lowercase Letter 7254
 
4.2%
Uppercase Letter 1700
 
1.0%
Open Punctuation 785
 
0.5%
Close Punctuation 785
 
0.5%
Dash Punctuation 59
 
< 0.1%
Decimal Number 38
 
< 0.1%
Math Symbol 20
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6493
 
6.0%
6404
 
5.9%
5923
 
5.5%
4067
 
3.8%
3862
 
3.6%
3731
 
3.4%
3542
 
3.3%
3177
 
2.9%
1872
 
1.7%
1582
 
1.5%
Other values (1067) 67795
62.5%
Lowercase Letter
ValueCountFrequency (%)
e 797
11.0%
a 718
9.9%
i 574
 
7.9%
n 573
 
7.9%
r 566
 
7.8%
t 541
 
7.5%
l 536
 
7.4%
y 404
 
5.6%
s 388
 
5.3%
o 377
 
5.2%
Other values (16) 1780
24.5%
Uppercase Letter
ValueCountFrequency (%)
M 146
 
8.6%
S 139
 
8.2%
J 124
 
7.3%
B 123
 
7.2%
A 116
 
6.8%
D 98
 
5.8%
C 92
 
5.4%
H 91
 
5.4%
L 88
 
5.2%
K 86
 
5.1%
Other values (15) 597
35.1%
Other Punctuation
ValueCountFrequency (%)
; 8243
77.4%
, 1010
 
9.5%
· 773
 
7.3%
. 571
 
5.4%
: 20
 
0.2%
/ 17
 
0.2%
& 5
 
< 0.1%
' 4
 
< 0.1%
? 4
 
< 0.1%
2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 10
26.3%
1 9
23.7%
8 4
 
10.5%
9 4
 
10.5%
0 3
 
7.9%
7 3
 
7.9%
3 3
 
7.9%
5 1
 
2.6%
6 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
[ 779
99.2%
( 6
 
0.8%
Close Punctuation
ValueCountFrequency (%)
] 779
99.2%
) 6
 
0.8%
Math Symbol
ValueCountFrequency (%)
< 10
50.0%
> 10
50.0%
Space Separator
ValueCountFrequency (%)
44347
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 108212
62.2%
Common 56689
32.6%
Latin 8955
 
5.1%
Han 236
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6493
 
6.0%
6404
 
5.9%
5923
 
5.5%
4067
 
3.8%
3862
 
3.6%
3731
 
3.4%
3542
 
3.3%
3177
 
2.9%
1872
 
1.7%
1582
 
1.5%
Other values (929) 67559
62.4%
Han
ValueCountFrequency (%)
50
 
21.2%
10
 
4.2%
5
 
2.1%
4
 
1.7%
4
 
1.7%
3
 
1.3%
3
 
1.3%
3
 
1.3%
3
 
1.3%
2
 
0.8%
Other values (128) 149
63.1%
Latin
ValueCountFrequency (%)
e 797
 
8.9%
a 718
 
8.0%
i 574
 
6.4%
n 573
 
6.4%
r 566
 
6.3%
t 541
 
6.0%
l 536
 
6.0%
y 404
 
4.5%
s 388
 
4.3%
o 377
 
4.2%
Other values (42) 3481
38.9%
Common
ValueCountFrequency (%)
44347
78.2%
; 8243
 
14.5%
, 1010
 
1.8%
[ 779
 
1.4%
] 779
 
1.4%
· 773
 
1.4%
. 571
 
1.0%
- 59
 
0.1%
: 20
 
< 0.1%
/ 17
 
< 0.1%
Other values (19) 91
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 108209
62.2%
ASCII 64861
37.3%
None 777
 
0.4%
CJK 228
 
0.1%
CJK Compat Ideographs 8
 
< 0.1%
Enclosed Alphanum 5
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
44347
68.4%
; 8243
 
12.7%
, 1010
 
1.6%
e 797
 
1.2%
[ 779
 
1.2%
] 779
 
1.2%
a 718
 
1.1%
i 574
 
0.9%
n 573
 
0.9%
. 571
 
0.9%
Other values (65) 6470
 
10.0%
Hangul
ValueCountFrequency (%)
6493
 
6.0%
6404
 
5.9%
5923
 
5.5%
4067
 
3.8%
3862
 
3.6%
3731
 
3.4%
3542
 
3.3%
3177
 
2.9%
1872
 
1.7%
1582
 
1.5%
Other values (927) 67556
62.4%
None
ValueCountFrequency (%)
· 773
99.5%
2
 
0.3%
1
 
0.1%
ß 1
 
0.1%
CJK
ValueCountFrequency (%)
50
 
21.9%
10
 
4.4%
5
 
2.2%
4
 
1.8%
4
 
1.8%
3
 
1.3%
3
 
1.3%
3
 
1.3%
2
 
0.9%
2
 
0.9%
Other values (122) 142
62.3%
Enclosed Alphanum
ValueCountFrequency (%)
5
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
37.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct2445
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:26:01.681943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length38
Mean length4.4197
Min length1

Characters and Unicode

Total characters44197
Distinct characters754
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1313 ?
Unique (%)13.1%

Sample

1st row보고사
2nd row내인생의책
3rd row풀빛
4th row태웅출판사
5th row우리교육
ValueCountFrequency (%)
비룡소 282
 
2.7%
문학동네 227
 
2.2%
창비 177
 
1.7%
민음사 139
 
1.3%
사계절 133
 
1.3%
시공주니어 126
 
1.2%
주니어김영사 121
 
1.2%
웅진씽크빅 109
 
1.0%
시공사 109
 
1.0%
웅진주니어 106
 
1.0%
Other values (2483) 8939
85.4%
2023-12-12T12:26:02.340359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1878
 
4.2%
1130
 
2.6%
991
 
2.2%
960
 
2.2%
817
 
1.8%
779
 
1.8%
740
 
1.7%
623
 
1.4%
606
 
1.4%
586
 
1.3%
Other values (744) 35087
79.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38369
86.8%
Lowercase Letter 3828
 
8.7%
Uppercase Letter 1145
 
2.6%
Space Separator 468
 
1.1%
Other Punctuation 144
 
0.3%
Decimal Number 130
 
0.3%
Close Punctuation 53
 
0.1%
Open Punctuation 53
 
0.1%
Dash Punctuation 6
 
< 0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1878
 
4.9%
1130
 
2.9%
991
 
2.6%
960
 
2.5%
817
 
2.1%
779
 
2.0%
740
 
1.9%
623
 
1.6%
606
 
1.6%
586
 
1.5%
Other values (667) 29259
76.3%
Lowercase Letter
ValueCountFrequency (%)
o 573
15.0%
s 352
 
9.2%
i 315
 
8.2%
r 306
 
8.0%
n 295
 
7.7%
e 289
 
7.5%
a 227
 
5.9%
l 176
 
4.6%
k 174
 
4.5%
t 173
 
4.5%
Other values (16) 948
24.8%
Uppercase Letter
ValueCountFrequency (%)
B 174
15.2%
H 93
 
8.1%
M 85
 
7.4%
P 81
 
7.1%
R 65
 
5.7%
K 63
 
5.5%
O 59
 
5.2%
S 58
 
5.1%
D 55
 
4.8%
W 54
 
4.7%
Other values (16) 358
31.3%
Other Punctuation
ValueCountFrequency (%)
42
29.2%
& 31
21.5%
· 23
16.0%
' 17
11.8%
. 11
 
7.6%
, 7
 
4.9%
@ 3
 
2.1%
/ 2
 
1.4%
: 2
 
1.4%
2
 
1.4%
Other values (4) 4
 
2.8%
Decimal Number
ValueCountFrequency (%)
2 62
47.7%
1 57
43.8%
6 4
 
3.1%
0 3
 
2.3%
4 2
 
1.5%
3 2
 
1.5%
Space Separator
ValueCountFrequency (%)
468
100.0%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38173
86.4%
Latin 4973
 
11.3%
Common 855
 
1.9%
Han 196
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1878
 
4.9%
1130
 
3.0%
991
 
2.6%
960
 
2.5%
817
 
2.1%
779
 
2.0%
740
 
1.9%
623
 
1.6%
606
 
1.6%
586
 
1.5%
Other values (604) 29063
76.1%
Han
ValueCountFrequency (%)
28
 
14.3%
21
 
10.7%
12
 
6.1%
11
 
5.6%
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
Other values (53) 88
44.9%
Latin
ValueCountFrequency (%)
o 573
 
11.5%
s 352
 
7.1%
i 315
 
6.3%
r 306
 
6.2%
n 295
 
5.9%
e 289
 
5.8%
a 227
 
4.6%
l 176
 
3.5%
k 174
 
3.5%
B 174
 
3.5%
Other values (42) 2092
42.1%
Common
ValueCountFrequency (%)
468
54.7%
2 62
 
7.3%
1 57
 
6.7%
) 53
 
6.2%
( 53
 
6.2%
42
 
4.9%
& 31
 
3.6%
· 23
 
2.7%
' 17
 
2.0%
. 11
 
1.3%
Other values (15) 38
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38173
86.4%
ASCII 5759
 
13.0%
CJK 196
 
0.4%
None 68
 
0.2%
Punctuation 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1878
 
4.9%
1130
 
3.0%
991
 
2.6%
960
 
2.5%
817
 
2.1%
779
 
2.0%
740
 
1.9%
623
 
1.6%
606
 
1.6%
586
 
1.5%
Other values (604) 29063
76.1%
ASCII
ValueCountFrequency (%)
o 573
 
9.9%
468
 
8.1%
s 352
 
6.1%
i 315
 
5.5%
r 306
 
5.3%
n 295
 
5.1%
e 289
 
5.0%
a 227
 
3.9%
l 176
 
3.1%
k 174
 
3.0%
Other values (62) 2584
44.9%
None
ValueCountFrequency (%)
42
61.8%
· 23
33.8%
2
 
2.9%
1
 
1.5%
CJK
ValueCountFrequency (%)
28
 
14.3%
21
 
10.7%
12
 
6.1%
11
 
5.6%
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
Other values (53) 88
44.9%
Punctuation
ValueCountFrequency (%)
1
100.0%

페이지수
Text

MISSING 

Distinct884
Distinct (%)9.2%
Missing442
Missing (%)4.4%
Memory size156.2 KiB
2023-12-12T12:26:02.907260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length2.8627328
Min length1

Characters and Unicode

Total characters27362
Distinct characters23
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique285 ?
Unique (%)3.0%

Sample

1st row203
2nd row33
3rd row195
4th row326
5th row191
ValueCountFrequency (%)
32 213
 
2.2%
33 134
 
1.4%
31 126
 
1.3%
40 90
 
0.9%
24 84
 
0.9%
25 72
 
0.7%
30 72
 
0.7%
191 68
 
0.7%
223 67
 
0.7%
175 61
 
0.6%
Other values (804) 8685
89.8%
2023-12-12T12:26:03.678949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 4701
17.2%
3 4287
15.7%
1 3938
14.4%
4 2550
9.3%
5 2214
8.1%
7 1946
7.1%
9 1904
7.0%
6 1756
 
6.4%
0 1669
 
6.1%
8 1601
 
5.9%
Other values (13) 796
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26566
97.1%
Lowercase Letter 386
 
1.4%
Space Separator 223
 
0.8%
Other Punctuation 163
 
0.6%
Other Letter 16
 
0.1%
Dash Punctuation 6
 
< 0.1%
Letter Number 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 4701
17.7%
3 4287
16.1%
1 3938
14.8%
4 2550
9.6%
5 2214
8.3%
7 1946
7.3%
9 1904
7.2%
6 1756
 
6.6%
0 1669
 
6.3%
8 1601
 
6.0%
Lowercase Letter
ValueCountFrequency (%)
i 202
52.3%
x 115
29.8%
v 68
 
17.6%
l 1
 
0.3%
Other Letter
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
4
25.0%
Space Separator
ValueCountFrequency (%)
223
100.0%
Other Punctuation
ValueCountFrequency (%)
, 163
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Uppercase Letter
ValueCountFrequency (%)
V 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26958
98.5%
Latin 388
 
1.4%
Hangul 16
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
2 4701
17.4%
3 4287
15.9%
1 3938
14.6%
4 2550
9.5%
5 2214
8.2%
7 1946
7.2%
9 1904
7.1%
6 1756
 
6.5%
0 1669
 
6.2%
8 1601
 
5.9%
Other values (3) 392
 
1.5%
Latin
ValueCountFrequency (%)
i 202
52.1%
x 115
29.6%
v 68
 
17.5%
l 1
 
0.3%
1
 
0.3%
V 1
 
0.3%
Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
4
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27345
99.9%
Hangul 16
 
0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 4701
17.2%
3 4287
15.7%
1 3938
14.4%
4 2550
9.3%
5 2214
8.1%
7 1946
7.1%
9 1904
7.0%
6 1756
 
6.4%
0 1669
 
6.1%
8 1601
 
5.9%
Other values (8) 779
 
2.8%
Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
4
25.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

가격
Real number (ℝ)

SKEWED 

Distinct255
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13538.713
Minimum0
Maximum660000
Zeros19
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:26:03.877260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6800
Q19000
median12000
Q315000
95-th percentile25000
Maximum660000
Range660000
Interquartile range (IQR)6000

Descriptive statistics

Standard deviation21410.187
Coefficient of variation (CV)1.5814049
Kurtosis797.55492
Mean13538.713
Median Absolute Deviation (MAD)3000
Skewness26.957244
Sum1.3538713 × 108
Variance4.5839609 × 108
MonotonicityNot monotonic
2023-12-12T12:26:04.067838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12000.0 923
 
9.2%
15000.0 673
 
6.7%
10000.0 638
 
6.4%
13000.0 638
 
6.4%
11000.0 522
 
5.2%
9000.0 511
 
5.1%
8000.0 456
 
4.6%
8500.0 430
 
4.3%
9500.0 370
 
3.7%
14000.0 318
 
3.2%
Other values (245) 4521
45.2%
ValueCountFrequency (%)
0.0 19
0.2%
5.0 4
 
< 0.1%
15.95 1
 
< 0.1%
19.0 1
 
< 0.1%
19.93 1
 
< 0.1%
19.99 2
 
< 0.1%
700.0 1
 
< 0.1%
800.0 1
 
< 0.1%
900.0 2
 
< 0.1%
1000.0 1
 
< 0.1%
ValueCountFrequency (%)
660000.0 9
0.1%
580000.0 1
 
< 0.1%
275000.0 1
 
< 0.1%
195000.0 1
 
< 0.1%
100000.0 2
 
< 0.1%
90000.0 1
 
< 0.1%
86000.0 1
 
< 0.1%
85200.0 1
 
< 0.1%
85000.0 1
 
< 0.1%
80000.0 3
 
< 0.1%
Distinct51
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:26:04.292594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length4
Mean length4.0018
Min length4

Characters and Unicode

Total characters40018
Distinct characters16
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)0.2%

Sample

1st row2006
2nd row2013
3rd row2013
4th row2005
5th row2013
ValueCountFrequency (%)
2007 2296
23.0%
2006 959
 
9.6%
2013 567
 
5.7%
2005 557
 
5.6%
2008 497
 
5.0%
2018 403
 
4.0%
2017 394
 
3.9%
2019 379
 
3.8%
2011 370
 
3.7%
2012 363
 
3.6%
Other values (38) 3215
32.1%
2023-12-12T12:26:04.676271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 15669
39.2%
2 11691
29.2%
1 4650
 
11.6%
7 2717
 
6.8%
6 1336
 
3.3%
8 935
 
2.3%
5 865
 
2.2%
3 778
 
1.9%
9 772
 
1.9%
4 595
 
1.5%
Other values (6) 10
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40008
> 99.9%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%
Lowercase Letter 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 15669
39.2%
2 11691
29.2%
1 4650
 
11.6%
7 2717
 
6.8%
6 1336
 
3.3%
8 935
 
2.3%
5 865
 
2.2%
3 778
 
1.9%
9 772
 
1.9%
4 595
 
1.5%
Open Punctuation
ValueCountFrequency (%)
[ 2
50.0%
( 2
50.0%
Close Punctuation
ValueCountFrequency (%)
] 2
50.0%
) 2
50.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40017
> 99.9%
Latin 1
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 15669
39.2%
2 11691
29.2%
1 4650
 
11.6%
7 2717
 
6.8%
6 1336
 
3.3%
8 935
 
2.3%
5 865
 
2.2%
3 778
 
1.9%
9 772
 
1.9%
4 595
 
1.5%
Other values (5) 9
 
< 0.1%
Latin
ValueCountFrequency (%)
c 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40018
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 15669
39.2%
2 11691
29.2%
1 4650
 
11.6%
7 2717
 
6.8%
6 1336
 
3.3%
8 935
 
2.3%
5 865
 
2.2%
3 778
 
1.9%
9 772
 
1.9%
4 595
 
1.5%
Other values (6) 10
 
< 0.1%

도서관명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
파주가람도서관
5560 
파주교하도서관
4440 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row파주교하도서관
2nd row파주가람도서관
3rd row파주가람도서관
4th row파주교하도서관
5th row파주가람도서관

Common Values

ValueCountFrequency (%)
파주가람도서관 5560
55.6%
파주교하도서관 4440
44.4%

Length

2023-12-12T12:26:04.846221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:26:04.972872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
파주가람도서관 5560
55.6%
파주교하도서관 4440
44.4%

Interactions

2023-12-12T12:25:57.977015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:25:57.685514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:25:58.100971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:25:57.806777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:26:05.057211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구입연도가격출판연도도서관명
구입연도1.0000.0000.9600.073
가격0.0001.0000.0000.043
출판연도0.9600.0001.0000.963
도서관명0.0730.0430.9631.000
2023-12-12T12:26:05.173277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구입연도가격도서관명
구입연도1.0000.3770.999
가격0.3771.0000.031
도서관명0.9990.0311.000

Missing values

2023-12-12T12:25:58.276864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:25:58.443462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서제목구입연도저자출판사페이지수가격출판연도도서관명
69995가집에 담아낸 노래와 사람들2011조순자 저보고사20310000.02006파주교하도서관
6255코는 냄새만 맡을까?2014백명식 글·그림내인생의책3312000.02013파주가람도서관
30002장자 : 자연 속에서 찾은 자유의 세계2016장자 지음 ; 조수형 풀어씀풀빛19510000.02013파주가람도서관
80896(홍정상인 호설암의) 인간경영2011호설암 원전 ; 구양일비 해석 ; 이선영 옮김태웅출판사32611000.02005파주교하도서관
9258깜둥바가지 아줌마2014권정생 지음우리교육1917000.02013파주가람도서관
36249지도로 볼 수 없는 우리 땅을 알려 줄게2017홍민정 글 ; 안녕달 그림 ; 진종헌 감수해와나무11911000.02017파주가람도서관
76731아인슈타인2011김혜경 글 ; 이정아 그림파란자전거1426500.02003파주교하도서관
3245나라서 참 다행이다 : 바닥에 떨어진 자존감을 구할 심리학 행동 법칙2014크리스토프 앙드레 지음 ; 이세진 옮김북폴리오39112000.02010파주가람도서관
33378딴짓의 재발견, 첫번째 이야기 : 우리가 꼭 알아야 할 과학자들의 우연하고 기발한 발견들2017니콜라 비트코프스키 지음 ; 양진성 옮김애플북스25514000.02016파주가람도서관
81946몸은 나보다 먼저 말한다2011피터 콜릿 지음 ; 박태선 옮김청림출판44719500.02007파주교하도서관
도서제목구입연도저자출판사페이지수가격출판연도도서관명
27538난세의 인문학 : 제자백가 12인의 지략으로 맞서다2015신동준 지음이담Books38618000.02015파주가람도서관
90007부자가 되려면 채권에 미쳐라2011심영철 지음한국경제신문 한경BP19311000.02007파주교하도서관
23426샘과 데이브가 땅을 팠어요2015맥 바넷 글 ; 존 클라센 그림 ; 서남희 옮김시공주니어3111000.02014파주가람도서관
4672자기계발의 덫2014미키 맥기 지음 ; 김상화 옮김모요사39517000.02011파주가람도서관
86747돼지꿈 : 황석영 소설집2011황석영 지음민음사4169000.02006파주교하도서관
93422매니페스토와 한국정치개혁2011이현출 지음건국대학교출판부26610000.02006파주교하도서관
47447(손미나의) 나의 첫 외국어 수업2021손미나 지음토네이도28816800.02021파주가람도서관
80629최고의 브랜드 네임은 어떻게 만들어 지는가2011스티브 리브킨 ; 프레이저 서더랜드 [같이] 지음 ; 토탈브랜딩코리아 옮김김앤김북스48018000.02006파주교하도서관
61021(법구경)인연담2011정태혁 엮고·옮김정신세계사44512800.02007파주교하도서관
66093살아 있는 현재에 행동하라2011윤문원 지음책만드는집2559500.02007파주교하도서관

Duplicate rows

Most frequently occurring

도서제목구입연도저자출판사페이지수가격출판연도도서관명# duplicates
0(마르크스가 꿈꾼)더 나은 세상 이야기2011자비네 카르본 ; 바르바라 뤼커 [공]글 ; 마렌 바르버 그림 ; 김라합 옮김웅진주니어459000.02007파주교하도서관2
1(부뚜막 고양이의 오물딱 조물딱)환경 공책. [1]2011곽임정난 글·그림살림어린이201900.02007파주교하도서관2
2(전쟁의 시대)청동기 고인돌 마을2011최향미 글 ; 김이랑 그림한솔수북616800.02007파주교하도서관2
3(조선의 글씨를 천하에 세운)김정희2011조정육 지음아이세움2289500.02007파주교하도서관2
4(지식을 위한)철학통조림. 3:, 담백한 맛2011김용규 글 ; 김동연 그림주니어김영사29111000.02007파주교하도서관2
5거짓말이 아니야2011카트린 돌토 ; 콜린 포르푸아레 [공]글 ; 조엘 부셰 그림 ; 이세진 옮김비룡소236000.02007파주교하도서관2
6고양이 학교. 2부-1권 : 태양신검의 수호자2011김진경 글 ; 김재홍 그림문학동네1398500.02007파주교하도서관2
7나와 악기 박물관2011안드레아 호이어 글·그림 ; 유혜자 옮김미래M&B248000.02008파주교하도서관2
8난 엄마 품이 제일 좋아2011유영진 글 ; 박소영 그림씽크하우스1158500.02007파주교하도서관2
9네가 무당벌레니?2011주디 앨런 글 ; 튜더 험프리스 그림 ; 이성실 옮김다섯수레317000.02007파주교하도서관2