Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells43
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description부산광역시 남구 도서관 신규도서 구매에 대한 책명, 저자, 발행자, 발행년도, 가격에 대한 상세한 자료를 제공합니다.
URLhttps://www.data.go.kr/data/15060954/fileData.do

Alerts

발행년 is highly imbalanced (65.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:57:46.683581
Analysis finished2023-12-12 20:57:49.243171
Duration2.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7155.2812
Minimum2
Maximum14301
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T05:57:49.343351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile703.9
Q13605.75
median7172.5
Q310695.5
95-th percentile13569.1
Maximum14301
Range14299
Interquartile range (IQR)7089.75

Descriptive statistics

Standard deviation4120.872
Coefficient of variation (CV)0.57592034
Kurtosis-1.1943541
Mean7155.2812
Median Absolute Deviation (MAD)3547.5
Skewness-0.0033163736
Sum71552812
Variance16981586
MonotonicityNot monotonic
2023-12-13T05:57:49.525231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9190 1
 
< 0.1%
4022 1
 
< 0.1%
12524 1
 
< 0.1%
6157 1
 
< 0.1%
8876 1
 
< 0.1%
2731 1
 
< 0.1%
570 1
 
< 0.1%
13228 1
 
< 0.1%
9781 1
 
< 0.1%
2586 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
3 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
ValueCountFrequency (%)
14301 1
< 0.1%
14299 1
< 0.1%
14297 1
< 0.1%
14296 1
< 0.1%
14295 1
< 0.1%
14294 1
< 0.1%
14293 1
< 0.1%
14292 1
< 0.1%
14291 1
< 0.1%
14290 1
< 0.1%

서명
Text

Distinct9434
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T05:57:49.867910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length150
Median length88
Mean length23.646
Min length1

Characters and Unicode

Total characters236460
Distinct characters1798
Distinct categories15 ?
Distinct scripts7 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8905 ?
Unique (%)89.0%

Sample

1st row아티스트 웨이, 마음의 소리를 듣는 시간
2nd row어느 할머니 이야기
3rd row공부란 무엇인가
4th row거인의 탄생 : 이원호의 성장, 개척, 기업소설. 4, 거인의 탄생
5th row우주시대에 오신 것을 환영합니다 : 우주가 산업이 되는 뉴 스페이스 시대 가이드
ValueCountFrequency (%)
4744
 
7.8%
이야기 435
 
0.7%
장편소설 372
 
0.6%
위한 361
 
0.6%
대활자본 257
 
0.4%
더책 254
 
0.4%
1 219
 
0.4%
2 198
 
0.3%
the 190
 
0.3%
187
 
0.3%
Other values (23244) 53984
88.2%
2023-12-13T05:57:50.423961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51206
 
21.7%
: 4670
 
2.0%
4392
 
1.9%
3997
 
1.7%
3408
 
1.4%
2346
 
1.0%
, 2232
 
0.9%
2129
 
0.9%
2039
 
0.9%
2014
 
0.9%
Other values (1788) 158027
66.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 149426
63.2%
Space Separator 51206
 
21.7%
Lowercase Letter 14928
 
6.3%
Other Punctuation 10347
 
4.4%
Decimal Number 3686
 
1.6%
Uppercase Letter 2467
 
1.0%
Close Punctuation 1890
 
0.8%
Open Punctuation 1889
 
0.8%
Math Symbol 443
 
0.2%
Dash Punctuation 147
 
0.1%
Other values (5) 31
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4392
 
2.9%
3997
 
2.7%
3408
 
2.3%
2346
 
1.6%
2129
 
1.4%
2039
 
1.4%
2014
 
1.3%
2000
 
1.3%
1893
 
1.3%
1846
 
1.2%
Other values (1632) 123362
82.6%
Lowercase Letter
ValueCountFrequency (%)
e 1781
11.9%
a 1265
 
8.5%
o 1251
 
8.4%
i 1093
 
7.3%
n 1079
 
7.2%
t 1067
 
7.1%
r 1020
 
6.8%
s 858
 
5.7%
h 661
 
4.4%
l 596
 
4.0%
Other values (46) 4257
28.5%
Uppercase Letter
ValueCountFrequency (%)
T 254
 
10.3%
S 246
 
10.0%
A 174
 
7.1%
M 162
 
6.6%
I 148
 
6.0%
C 148
 
6.0%
B 140
 
5.7%
F 113
 
4.6%
D 103
 
4.2%
P 102
 
4.1%
Other values (24) 877
35.5%
Other Punctuation
ValueCountFrequency (%)
: 4670
45.1%
, 2232
21.6%
? 1308
 
12.6%
. 996
 
9.6%
! 581
 
5.6%
· 239
 
2.3%
' 177
 
1.7%
40
 
0.4%
& 19
 
0.2%
; 16
 
0.2%
Other values (14) 69
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 959
26.0%
0 702
19.0%
2 608
16.5%
3 361
 
9.8%
5 271
 
7.4%
4 230
 
6.2%
6 153
 
4.2%
9 153
 
4.2%
7 126
 
3.4%
8 123
 
3.3%
Math Symbol
ValueCountFrequency (%)
= 343
77.4%
~ 63
 
14.2%
× 13
 
2.9%
+ 11
 
2.5%
| 4
 
0.9%
< 4
 
0.9%
> 4
 
0.9%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1080
57.1%
] 799
42.3%
6
 
0.3%
2
 
0.1%
2
 
0.1%
} 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1080
57.2%
[ 799
42.3%
6
 
0.3%
2
 
0.1%
2
 
0.1%
Letter Number
ValueCountFrequency (%)
5
50.0%
3
30.0%
2
 
20.0%
Other Symbol
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Final Punctuation
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Initial Punctuation
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
51206
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 147
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 148575
62.8%
Common 69629
29.4%
Latin 17033
 
7.2%
Han 445
 
0.2%
Cyrillic 372
 
0.2%
Hiragana 331
 
0.1%
Katakana 75
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4392
 
3.0%
3997
 
2.7%
3408
 
2.3%
2346
 
1.6%
2129
 
1.4%
2039
 
1.4%
2014
 
1.4%
2000
 
1.3%
1893
 
1.3%
1846
 
1.2%
Other values (1285) 122511
82.5%
Han
ValueCountFrequency (%)
20
 
4.5%
12
 
2.7%
8
 
1.8%
7
 
1.6%
7
 
1.6%
7
 
1.6%
6
 
1.3%
6
 
1.3%
6
 
1.3%
5
 
1.1%
Other values (238) 361
81.1%
Common
ValueCountFrequency (%)
51206
73.5%
: 4670
 
6.7%
, 2232
 
3.2%
? 1308
 
1.9%
) 1080
 
1.6%
( 1080
 
1.6%
. 996
 
1.4%
1 959
 
1.4%
[ 799
 
1.1%
] 799
 
1.1%
Other values (53) 4500
 
6.5%
Hiragana
ValueCountFrequency (%)
30
 
9.1%
19
 
5.7%
18
 
5.4%
15
 
4.5%
15
 
4.5%
11
 
3.3%
10
 
3.0%
10
 
3.0%
9
 
2.7%
8
 
2.4%
Other values (50) 186
56.2%
Latin
ValueCountFrequency (%)
e 1781
 
10.5%
a 1265
 
7.4%
o 1251
 
7.3%
i 1093
 
6.4%
n 1079
 
6.3%
t 1067
 
6.3%
r 1020
 
6.0%
s 858
 
5.0%
h 661
 
3.9%
l 596
 
3.5%
Other values (46) 6362
37.4%
Katakana
ValueCountFrequency (%)
6
 
8.0%
5
 
6.7%
4
 
5.3%
4
 
5.3%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
Other values (29) 38
50.7%
Cyrillic
ValueCountFrequency (%)
о 44
 
11.8%
е 29
 
7.8%
р 26
 
7.0%
и 26
 
7.0%
а 22
 
5.9%
к 21
 
5.6%
н 20
 
5.4%
с 20
 
5.4%
т 18
 
4.8%
д 17
 
4.6%
Other values (27) 129
34.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 148572
62.8%
ASCII 86265
36.5%
CJK 444
 
0.2%
Cyrillic 372
 
0.2%
None 370
 
0.2%
Hiragana 331
 
0.1%
Katakana 75
 
< 0.1%
Punctuation 12
 
< 0.1%
Number Forms 10
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Other values (5) 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51206
59.4%
: 4670
 
5.4%
, 2232
 
2.6%
e 1781
 
2.1%
? 1308
 
1.5%
a 1265
 
1.5%
o 1251
 
1.5%
i 1093
 
1.3%
) 1080
 
1.3%
( 1080
 
1.3%
Other values (79) 19299
 
22.4%
Hangul
ValueCountFrequency (%)
4392
 
3.0%
3997
 
2.7%
3408
 
2.3%
2346
 
1.6%
2129
 
1.4%
2039
 
1.4%
2014
 
1.4%
2000
 
1.3%
1893
 
1.3%
1846
 
1.2%
Other values (1282) 122508
82.5%
None
ValueCountFrequency (%)
· 239
64.6%
40
 
10.8%
đ 20
 
5.4%
× 13
 
3.5%
12
 
3.2%
7
 
1.9%
7
 
1.9%
6
 
1.6%
6
 
1.6%
5
 
1.4%
Other values (8) 15
 
4.1%
Cyrillic
ValueCountFrequency (%)
о 44
 
11.8%
е 29
 
7.8%
р 26
 
7.0%
и 26
 
7.0%
а 22
 
5.9%
к 21
 
5.6%
н 20
 
5.4%
с 20
 
5.4%
т 18
 
4.8%
д 17
 
4.6%
Other values (27) 129
34.7%
Hiragana
ValueCountFrequency (%)
30
 
9.1%
19
 
5.7%
18
 
5.4%
15
 
4.5%
15
 
4.5%
11
 
3.3%
10
 
3.0%
10
 
3.0%
9
 
2.7%
8
 
2.4%
Other values (50) 186
56.2%
CJK
ValueCountFrequency (%)
20
 
4.5%
12
 
2.7%
8
 
1.8%
7
 
1.6%
7
 
1.6%
7
 
1.6%
6
 
1.4%
6
 
1.4%
6
 
1.4%
5
 
1.1%
Other values (237) 360
81.1%
Katakana
ValueCountFrequency (%)
6
 
8.0%
5
 
6.7%
4
 
5.3%
4
 
5.3%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
Other values (29) 38
50.7%
Number Forms
ValueCountFrequency (%)
5
50.0%
3
30.0%
2
 
20.0%
Punctuation
ValueCountFrequency (%)
4
33.3%
4
33.3%
2
16.7%
1
 
8.3%
1
 
8.3%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Math Operators
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct8325
Distinct (%)83.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T05:57:50.838680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length183
Median length122
Mean length15.5763
Min length3

Characters and Unicode

Total characters155763
Distinct characters1290
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7232 ?
Unique (%)72.3%

Sample

1st row줄리아 카메론 지음 ; 이상원 옮김
2nd row조앤 슈워츠 글 ; 나히드 카제미 그림 ; 신형건 옮김
3rd row한근태 지음
4th row이원호 지음
5th row켈리 제라디 지음 ; 이지민 옮김
ValueCountFrequency (%)
6522
 
14.5%
지음 5704
 
12.7%
옮김 2695
 
6.0%
그림 2599
 
5.8%
2166
 
4.8%
글·그림 535
 
1.2%
by 421
 
0.9%
공]지음 327
 
0.7%
241
 
0.5%
illustrated 128
 
0.3%
Other values (12404) 23723
52.6%
2023-12-13T05:57:51.610580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35085
22.5%
7126
 
4.6%
; 6409
 
4.1%
6193
 
4.0%
5238
 
3.4%
3392
 
2.2%
3353
 
2.2%
3092
 
2.0%
2953
 
1.9%
2744
 
1.8%
Other values (1280) 80178
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95410
61.3%
Space Separator 35085
 
22.5%
Lowercase Letter 11292
 
7.2%
Other Punctuation 10037
 
6.4%
Uppercase Letter 2089
 
1.3%
Open Punctuation 879
 
0.6%
Close Punctuation 878
 
0.6%
Dash Punctuation 47
 
< 0.1%
Decimal Number 37
 
< 0.1%
Math Symbol 4
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7126
 
7.5%
6193
 
6.5%
5238
 
5.5%
3392
 
3.6%
3353
 
3.5%
3092
 
3.2%
2953
 
3.1%
2744
 
2.9%
1662
 
1.7%
1333
 
1.4%
Other values (1192) 58324
61.1%
Lowercase Letter
ValueCountFrequency (%)
a 1202
10.6%
e 1073
 
9.5%
i 938
 
8.3%
r 898
 
8.0%
n 838
 
7.4%
t 805
 
7.1%
l 786
 
7.0%
y 665
 
5.9%
s 601
 
5.3%
o 590
 
5.2%
Other values (19) 2896
25.6%
Uppercase Letter
ValueCountFrequency (%)
S 188
 
9.0%
B 164
 
7.9%
M 162
 
7.8%
A 129
 
6.2%
C 129
 
6.2%
T 128
 
6.1%
J 125
 
6.0%
D 114
 
5.5%
L 107
 
5.1%
K 98
 
4.7%
Other values (16) 745
35.7%
Decimal Number
ValueCountFrequency (%)
1 12
32.4%
2 7
18.9%
3 5
13.5%
0 4
 
10.8%
6 3
 
8.1%
9 2
 
5.4%
8 1
 
2.7%
4 1
 
2.7%
5 1
 
2.7%
7 1
 
2.7%
Other Punctuation
ValueCountFrequency (%)
; 6409
63.9%
, 1762
 
17.6%
? 768
 
7.7%
· 677
 
6.7%
. 218
 
2.2%
: 176
 
1.8%
/ 20
 
0.2%
' 5
 
< 0.1%
& 2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 832
94.7%
( 43
 
4.9%
4
 
0.5%
Close Punctuation
ValueCountFrequency (%)
] 831
94.6%
) 43
 
4.9%
4
 
0.5%
Math Symbol
ValueCountFrequency (%)
2
50.0%
2
50.0%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
35085
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 94624
60.7%
Common 46972
30.2%
Latin 13381
 
8.6%
Han 472
 
0.3%
Katakana 172
 
0.1%
Hiragana 142
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7126
 
7.5%
6193
 
6.5%
5238
 
5.5%
3392
 
3.6%
3353
 
3.5%
3092
 
3.3%
2953
 
3.1%
2744
 
2.9%
1662
 
1.8%
1333
 
1.4%
Other values (896) 57538
60.8%
Han
ValueCountFrequency (%)
35
 
7.4%
24
 
5.1%
15
 
3.2%
15
 
3.2%
11
 
2.3%
10
 
2.1%
9
 
1.9%
8
 
1.7%
8
 
1.7%
8
 
1.7%
Other values (186) 329
69.7%
Katakana
ValueCountFrequency (%)
22
 
12.8%
9
 
5.2%
8
 
4.7%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
Other values (47) 91
52.9%
Latin
ValueCountFrequency (%)
a 1202
 
9.0%
e 1073
 
8.0%
i 938
 
7.0%
r 898
 
6.7%
n 838
 
6.3%
t 805
 
6.0%
l 786
 
5.9%
y 665
 
5.0%
s 601
 
4.5%
o 590
 
4.4%
Other values (45) 4985
37.3%
Hiragana
ValueCountFrequency (%)
9
 
6.3%
8
 
5.6%
7
 
4.9%
7
 
4.9%
7
 
4.9%
7
 
4.9%
6
 
4.2%
6
 
4.2%
6
 
4.2%
5
 
3.5%
Other values (33) 74
52.1%
Common
ValueCountFrequency (%)
35085
74.7%
; 6409
 
13.6%
, 1762
 
3.8%
[ 832
 
1.8%
] 831
 
1.8%
? 768
 
1.6%
· 677
 
1.4%
. 218
 
0.5%
: 176
 
0.4%
- 47
 
0.1%
Other values (23) 167
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 94623
60.7%
ASCII 59650
38.3%
None 698
 
0.4%
CJK 472
 
0.3%
Katakana 172
 
0.1%
Hiragana 142
 
0.1%
Enclosed Alphanum 2
 
< 0.1%
Punctuation 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Box Drawing 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35085
58.8%
; 6409
 
10.7%
, 1762
 
3.0%
a 1202
 
2.0%
e 1073
 
1.8%
i 938
 
1.6%
r 898
 
1.5%
n 838
 
1.4%
[ 832
 
1.4%
] 831
 
1.4%
Other values (66) 9782
 
16.4%
Hangul
ValueCountFrequency (%)
7126
 
7.5%
6193
 
6.5%
5238
 
5.5%
3392
 
3.6%
3353
 
3.5%
3092
 
3.3%
2953
 
3.1%
2744
 
2.9%
1662
 
1.8%
1333
 
1.4%
Other values (895) 57537
60.8%
None
ValueCountFrequency (%)
· 677
97.0%
đ 7
 
1.0%
4
 
0.6%
4
 
0.6%
2
 
0.3%
2
 
0.3%
ø 1
 
0.1%
æ 1
 
0.1%
CJK
ValueCountFrequency (%)
35
 
7.4%
24
 
5.1%
15
 
3.2%
15
 
3.2%
11
 
2.3%
10
 
2.1%
9
 
1.9%
8
 
1.7%
8
 
1.7%
8
 
1.7%
Other values (186) 329
69.7%
Katakana
ValueCountFrequency (%)
22
 
12.8%
9
 
5.2%
8
 
4.7%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
Other values (47) 91
52.9%
Hiragana
ValueCountFrequency (%)
9
 
6.3%
8
 
5.6%
7
 
4.9%
7
 
4.9%
7
 
4.9%
7
 
4.9%
6
 
4.2%
6
 
4.2%
6
 
4.2%
5
 
3.5%
Other values (33) 74
52.1%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct2779
Distinct (%)27.8%
Missing3
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T05:57:51.908599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length40
Mean length5.4304291
Min length1

Characters and Unicode

Total characters54288
Distinct characters875
Distinct categories9 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1378 ?
Unique (%)13.8%

Sample

1st row비즈니스북스
2nd row보물창고
3rd row샘터
4th row한결미디어
5th row혜윰터
ValueCountFrequency (%)
books 143
 
1.3%
문학동네 139
 
1.3%
창비 138
 
1.3%
웅진북클럽:웅진씽크빅 106
 
1.0%
위즈덤하우스 105
 
1.0%
김영사 79
 
0.7%
그레이트북스 73
 
0.7%
민음사 71
 
0.7%
비룡소 68
 
0.6%
천개의바람 58
 
0.5%
Other values (2861) 9929
91.0%
2023-12-13T05:57:52.357320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1797
 
3.3%
1594
 
2.9%
1312
 
2.4%
1083
 
2.0%
: 993
 
1.8%
o 986
 
1.8%
920
 
1.7%
852
 
1.6%
749
 
1.4%
682
 
1.3%
Other values (865) 43320
79.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42594
78.5%
Lowercase Letter 6812
 
12.5%
Uppercase Letter 2074
 
3.8%
Other Punctuation 1489
 
2.7%
Space Separator 920
 
1.7%
Decimal Number 158
 
0.3%
Close Punctuation 117
 
0.2%
Open Punctuation 117
 
0.2%
Dash Punctuation 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1797
 
4.2%
1594
 
3.7%
1312
 
3.1%
1083
 
2.5%
852
 
2.0%
749
 
1.8%
682
 
1.6%
629
 
1.5%
597
 
1.4%
498
 
1.2%
Other values (783) 32801
77.0%
Lowercase Letter
ValueCountFrequency (%)
o 986
14.5%
r 648
9.5%
s 645
9.5%
e 554
 
8.1%
a 512
 
7.5%
i 509
 
7.5%
n 497
 
7.3%
l 322
 
4.7%
k 317
 
4.7%
t 278
 
4.1%
Other values (17) 1544
22.7%
Uppercase Letter
ValueCountFrequency (%)
B 416
20.1%
S 198
 
9.5%
H 127
 
6.1%
P 124
 
6.0%
K 111
 
5.4%
R 108
 
5.2%
C 108
 
5.2%
E 100
 
4.8%
O 83
 
4.0%
L 70
 
3.4%
Other values (16) 629
30.3%
Other Punctuation
ValueCountFrequency (%)
: 993
66.7%
? 325
 
21.8%
' 42
 
2.8%
. 37
 
2.5%
& 34
 
2.3%
, 25
 
1.7%
# 17
 
1.1%
10
 
0.7%
/ 2
 
0.1%
1
 
0.1%
Other values (3) 3
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 55
34.8%
2 52
32.9%
4 13
 
8.2%
8 10
 
6.3%
3 7
 
4.4%
9 7
 
4.4%
6 7
 
4.4%
5 4
 
2.5%
0 2
 
1.3%
7 1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 115
98.3%
] 2
 
1.7%
Open Punctuation
ValueCountFrequency (%)
( 115
98.3%
[ 2
 
1.7%
Space Separator
ValueCountFrequency (%)
920
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42117
77.6%
Latin 8886
 
16.4%
Common 2808
 
5.2%
Han 407
 
0.7%
Katakana 53
 
0.1%
Hiragana 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1797
 
4.3%
1594
 
3.8%
1312
 
3.1%
1083
 
2.6%
852
 
2.0%
749
 
1.8%
682
 
1.6%
629
 
1.5%
597
 
1.4%
498
 
1.2%
Other values (646) 32324
76.7%
Han
ValueCountFrequency (%)
48
 
11.8%
48
 
11.8%
47
 
11.5%
13
 
3.2%
13
 
3.2%
10
 
2.5%
10
 
2.5%
9
 
2.2%
9
 
2.2%
8
 
2.0%
Other values (94) 192
47.2%
Latin
ValueCountFrequency (%)
o 986
 
11.1%
r 648
 
7.3%
s 645
 
7.3%
e 554
 
6.2%
a 512
 
5.8%
i 509
 
5.7%
n 497
 
5.6%
B 416
 
4.7%
l 322
 
3.6%
k 317
 
3.6%
Other values (43) 3480
39.2%
Common
ValueCountFrequency (%)
: 993
35.4%
920
32.8%
? 325
 
11.6%
) 115
 
4.1%
( 115
 
4.1%
1 55
 
2.0%
2 52
 
1.9%
' 42
 
1.5%
. 37
 
1.3%
& 34
 
1.2%
Other values (19) 120
 
4.3%
Katakana
ValueCountFrequency (%)
6
 
11.3%
5
 
9.4%
4
 
7.5%
4
 
7.5%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
Other values (13) 17
32.1%
Hiragana
ValueCountFrequency (%)
4
23.5%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42109
77.6%
ASCII 11676
 
21.5%
CJK 407
 
0.7%
Katakana 53
 
0.1%
None 18
 
< 0.1%
Hiragana 17
 
< 0.1%
Compat Jamo 8
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1797
 
4.3%
1594
 
3.8%
1312
 
3.1%
1083
 
2.6%
852
 
2.0%
749
 
1.8%
682
 
1.6%
629
 
1.5%
597
 
1.4%
498
 
1.2%
Other values (639) 32316
76.7%
ASCII
ValueCountFrequency (%)
: 993
 
8.5%
o 986
 
8.4%
920
 
7.9%
r 648
 
5.5%
s 645
 
5.5%
e 554
 
4.7%
a 512
 
4.4%
i 509
 
4.4%
n 497
 
4.3%
B 416
 
3.6%
Other values (67) 4996
42.8%
CJK
ValueCountFrequency (%)
48
 
11.8%
48
 
11.8%
47
 
11.5%
13
 
3.2%
13
 
3.2%
10
 
2.5%
10
 
2.5%
9
 
2.2%
9
 
2.2%
8
 
2.0%
Other values (94) 192
47.2%
None
ValueCountFrequency (%)
10
55.6%
đ 5
27.8%
1
 
5.6%
· 1
 
5.6%
1
 
5.6%
Katakana
ValueCountFrequency (%)
6
 
11.3%
5
 
9.4%
4
 
7.5%
4
 
7.5%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
Other values (13) 17
32.1%
Hiragana
ValueCountFrequency (%)
4
23.5%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Compat Jamo
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

발행년
Categorical

IMBALANCE 

Distinct29
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
6259 
2021
2490 
2020
 
388
2019
 
243
2018
 
152
Other values (24)
 
468

Length

Max length10
Median length4
Mean length4.0135
Min length4

Unique

Unique8 ?
Unique (%)0.1%

Sample

1st row2022
2nd row2022
3rd row2021
4th row2018
5th row2022

Common Values

ValueCountFrequency (%)
2022 6259
62.6%
2021 2490
 
24.9%
2020 388
 
3.9%
2019 243
 
2.4%
2018 152
 
1.5%
2017 96
 
1.0%
2016 69
 
0.7%
[2020] 63
 
0.6%
2014 50
 
0.5%
2011 38
 
0.4%
Other values (19) 152
 
1.5%

Length

2023-12-13T05:57:52.513728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022 6259
62.6%
2021 2490
 
24.9%
2020 451
 
4.5%
2019 244
 
2.4%
2018 152
 
1.5%
2017 96
 
1.0%
2016 69
 
0.7%
2014 50
 
0.5%
2011 38
 
0.4%
2003 38
 
0.4%
Other values (17) 113
 
1.1%

가격
Real number (ℝ)

Distinct215
Distinct (%)2.2%
Missing40
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean16533.955
Minimum3500
Maximum60000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T05:57:52.629954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3500
5-th percentile10000
Q113000
median15000
Q318000
95-th percentile28800
Maximum60000
Range56500
Interquartile range (IQR)5000

Descriptive statistics

Standard deviation5883.5289
Coefficient of variation (CV)0.35584523
Kurtosis4.9528325
Mean16533.955
Median Absolute Deviation (MAD)2000
Skewness1.8220761
Sum1.6467819 × 108
Variance34615912
MonotonicityNot monotonic
2023-12-13T05:57:52.773043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 1055
 
10.5%
13000 997
 
10.0%
12000 866
 
8.7%
14000 720
 
7.2%
16000 657
 
6.6%
18000 515
 
5.1%
17000 360
 
3.6%
10000 236
 
2.4%
22000 227
 
2.3%
20000 214
 
2.1%
Other values (205) 4113
41.1%
ValueCountFrequency (%)
3500 3
 
< 0.1%
3820 27
 
0.3%
4020 1
 
< 0.1%
6000 68
0.7%
6500 8
 
0.1%
7000 38
0.4%
7350 33
0.3%
7500 12
 
0.1%
7600 1
 
< 0.1%
7700 3
 
< 0.1%
ValueCountFrequency (%)
60000 1
 
< 0.1%
59000 1
 
< 0.1%
58000 1
 
< 0.1%
55000 1
 
< 0.1%
50000 2
 
< 0.1%
49500 1
 
< 0.1%
49000 3
< 0.1%
48000 6
0.1%
47000 4
< 0.1%
46000 1
 
< 0.1%

Interactions

2023-12-13T05:57:48.716876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:57:48.514112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:57:48.804444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:57:48.606723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:57:52.872066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번발행년가격
연번1.0000.2680.230
발행년0.2681.0000.571
가격0.2300.5711.000
2023-12-13T05:57:52.969360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번가격발행년
연번1.000-0.0690.098
가격-0.0691.0000.240
발행년0.0980.2401.000

Missing values

2023-12-13T05:57:48.946310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:57:49.060162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:57:49.173845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번서명저작자발행자발행년가격
91899190아티스트 웨이, 마음의 소리를 듣는 시간줄리아 카메론 지음 ; 이상원 옮김비즈니스북스202216000
94299430어느 할머니 이야기조앤 슈워츠 글 ; 나히드 카제미 그림 ; 신형건 옮김보물창고202215000
30163017공부란 무엇인가한근태 지음샘터202112000
26832684거인의 탄생 : 이원호의 성장, 개척, 기업소설. 4, 거인의 탄생이원호 지음한결미디어201815000
1085610857우주시대에 오신 것을 환영합니다 : 우주가 산업이 되는 뉴 스페이스 시대 가이드켈리 제라디 지음 ; 이지민 옮김혜윰터202217000
154815492030 반도체 지정학 : 21세기 지정학 리스크 속 어떻게 반도체 초강국이 될 것인가오타 야스히코 지음 ; 임재덕 옮김성안당202218000
1039110392왜 이런 이름이 생겼을까? : 우리가 몰랐던 동물 이름의 유래. 동물 2박영산 글 ; 이형진 그림기린미디어202111000
51055106당신이 원하던 잡학사전김주은 엮음지브레인202213000
80978098세계의 빵 도감오모리 히로코 글·그림 ; 고향옥 옮김 ; 이노우에 요시후미길벗스쿨202011500
55995600때려치우기의 기술 : 행복하고 가벼운 삶을 위해 똑똑하게 손절합니다사와 마도카 지음 ; 이효진 옮김한빛비즈202216000
연번서명저작자발행자발행년가격
1383713838한라산에 기대어이영균 시 ; 이도헌 사진202215000
90449045我不?打?!(?)鞠志承 文/? ; 禹明延 ?新世?出版社202020400
1167911680자전거를 타면 앞으로 간다 : 정지된 일상을 깨우고, 앞으로 나아가는 법강민영 지음 ; 최연주 일러스트휴머니스트출판그룹202216000
33743375그랜드 캉티뉴쓰 호텔 : 리보칭 장편소설리보칭 지음 ; 허유영 옮김비채202215800
78(0세부터 6세까지)우리집 소아과은성훈,양세령 [공]지음포르체202222000
44984499내가 틀릴 수도 있습니다비욘 나티코 린데블라드 지음 ; 박미경 옮김다산초당202216000
141142(The Usborne)엄청나게 큰 곤충 백과 : 플랩북에밀리 본 지음 ; 파비아노 피오린 그림 ; 스티브 라이트 디자인 ; 채도영 옮김어스본코리아201713000
1192011921점프 점프[더책] [더책]정인석 지음고래뱃속202128000
1328113282통계의 아름다움 : 인공지능 시대에 필요한 과학적 사고리찌엔,하이언 [공]지음 ; 김슬기 옮김제이펍202019800
1223912240즐거운 다문화도서관 : 언어와 문화의 경계를 허무는 도서관 공동체정은주 지음학교도서관저널202018000