Overview

Dataset statistics

Number of variables9
Number of observations9276
Missing cells0
Missing cells (%)0.0%
Duplicate rows729
Duplicate rows (%)7.9%
Total size in memory670.5 KiB
Average record size in memory74.0 B

Variable types

Categorical2
Text4
Numeric1
DateTime2

Dataset

Description경상남도 김해시 통합도서관에서 보유잔 전자책에 대한 자료로 전자책타입, 서명, 라이센스수, 저자, 출판사 등에 대한 데이터로 구성되어 있습니다.
Author경상남도 김해시
URLhttps://www.data.go.kr/data/15106419/fileData.do

Alerts

Dataset has 729 (7.9%) duplicate rowsDuplicates
전자책타입 is highly imbalanced (98.2%)Imbalance
라이센스수 is highly skewed (γ1 = 33.90541514)Skewed

Reproduction

Analysis started2023-12-12 01:48:36.302436
Analysis finished2023-12-12 01:48:38.926796
Duration2.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

전자책타입
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
EBOOK
9260 
AUDIO
 
16

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEBOOK
2nd rowEBOOK
3rd rowEBOOK
4th rowEBOOK
5th rowEBOOK

Common Values

ValueCountFrequency (%)
EBOOK 9260
99.8%
AUDIO 16
 
0.2%

Length

2023-12-12T10:48:39.020591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:48:39.143673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ebook 9260
99.8%
audio 16
 
0.2%

서명
Text

Distinct8389
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
2023-12-12T10:48:39.451529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length58
Mean length13.125809
Min length1

Characters and Unicode

Total characters121755
Distinct characters1266
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7601 ?
Unique (%)81.9%

Sample

1st row애프터 피케티
2nd row예정된 전쟁
3rd row럭키 타로북
4th row하루를 살아도 후회없이 살고 싶다
5th row무엇이 되지 않더라도
ValueCountFrequency (%)
476
 
1.4%
나는 219
 
0.7%
2 187
 
0.6%
1 178
 
0.5%
172
 
0.5%
152
 
0.5%
이야기 137
 
0.4%
위한 126
 
0.4%
109
 
0.3%
94
 
0.3%
Other values (13304) 31806
94.5%
2023-12-12T10:48:40.103600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24460
 
20.1%
2487
 
2.0%
2442
 
2.0%
2383
 
2.0%
1753
 
1.4%
1318
 
1.1%
1301
 
1.1%
1249
 
1.0%
1189
 
1.0%
1179
 
1.0%
Other values (1256) 81994
67.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87838
72.1%
Space Separator 24460
 
20.1%
Decimal Number 2938
 
2.4%
Lowercase Letter 2390
 
2.0%
Other Punctuation 1821
 
1.5%
Uppercase Letter 1073
 
0.9%
Open Punctuation 394
 
0.3%
Close Punctuation 394
 
0.3%
Dash Punctuation 393
 
0.3%
Math Symbol 26
 
< 0.1%
Other values (4) 28
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2487
 
2.8%
2442
 
2.8%
2383
 
2.7%
1753
 
2.0%
1318
 
1.5%
1301
 
1.5%
1249
 
1.4%
1189
 
1.4%
1179
 
1.3%
1170
 
1.3%
Other values (1166) 71367
81.2%
Lowercase Letter
ValueCountFrequency (%)
e 300
12.6%
t 212
 
8.9%
o 198
 
8.3%
n 185
 
7.7%
a 176
 
7.4%
i 173
 
7.2%
s 156
 
6.5%
r 155
 
6.5%
l 113
 
4.7%
h 92
 
3.8%
Other values (16) 630
26.4%
Uppercase Letter
ValueCountFrequency (%)
S 117
 
10.9%
T 84
 
7.8%
E 79
 
7.4%
O 78
 
7.3%
C 73
 
6.8%
A 66
 
6.2%
B 50
 
4.7%
I 48
 
4.5%
F 47
 
4.4%
P 43
 
4.0%
Other values (16) 388
36.2%
Other Punctuation
ValueCountFrequency (%)
, 767
42.1%
. 286
 
15.7%
: 246
 
13.5%
? 196
 
10.8%
! 123
 
6.8%
& 66
 
3.6%
; 55
 
3.0%
# 30
 
1.6%
% 28
 
1.5%
· 11
 
0.6%
Other values (3) 13
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 765
26.0%
0 603
20.5%
2 582
19.8%
3 253
 
8.6%
5 188
 
6.4%
4 139
 
4.7%
7 137
 
4.7%
6 99
 
3.4%
9 93
 
3.2%
8 79
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 369
93.7%
[ 24
 
6.1%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 369
93.7%
] 24
 
6.1%
1
 
0.3%
Math Symbol
ValueCountFrequency (%)
~ 17
65.4%
+ 8
30.8%
× 1
 
3.8%
Space Separator
ValueCountFrequency (%)
24460
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 393
100.0%
Final Punctuation
ValueCountFrequency (%)
13
100.0%
Initial Punctuation
ValueCountFrequency (%)
11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87809
72.1%
Common 30454
 
25.0%
Latin 3463
 
2.8%
Han 29
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2487
 
2.8%
2442
 
2.8%
2383
 
2.7%
1753
 
2.0%
1318
 
1.5%
1301
 
1.5%
1249
 
1.4%
1189
 
1.4%
1179
 
1.3%
1170
 
1.3%
Other values (1149) 71338
81.2%
Latin
ValueCountFrequency (%)
e 300
 
8.7%
t 212
 
6.1%
o 198
 
5.7%
n 185
 
5.3%
a 176
 
5.1%
i 173
 
5.0%
s 156
 
4.5%
r 155
 
4.5%
S 117
 
3.4%
l 113
 
3.3%
Other values (42) 1678
48.5%
Common
ValueCountFrequency (%)
24460
80.3%
, 767
 
2.5%
1 765
 
2.5%
0 603
 
2.0%
2 582
 
1.9%
- 393
 
1.3%
( 369
 
1.2%
) 369
 
1.2%
. 286
 
0.9%
3 253
 
0.8%
Other values (28) 1607
 
5.3%
Han
ValueCountFrequency (%)
5
17.2%
4
13.8%
3
10.3%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (7) 7
24.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87795
72.1%
ASCII 33877
 
27.8%
CJK 28
 
< 0.1%
Punctuation 25
 
< 0.1%
Compat Jamo 14
 
< 0.1%
None 14
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24460
72.2%
, 767
 
2.3%
1 765
 
2.3%
0 603
 
1.8%
2 582
 
1.7%
- 393
 
1.2%
( 369
 
1.1%
) 369
 
1.1%
e 300
 
0.9%
. 286
 
0.8%
Other values (72) 4983
 
14.7%
Hangul
ValueCountFrequency (%)
2487
 
2.8%
2442
 
2.8%
2383
 
2.7%
1753
 
2.0%
1318
 
1.5%
1301
 
1.5%
1249
 
1.4%
1189
 
1.4%
1179
 
1.3%
1170
 
1.3%
Other values (1148) 71324
81.2%
Compat Jamo
ValueCountFrequency (%)
14
100.0%
Punctuation
ValueCountFrequency (%)
13
52.0%
11
44.0%
1
 
4.0%
None
ValueCountFrequency (%)
· 11
78.6%
× 1
 
7.1%
1
 
7.1%
1
 
7.1%
CJK
ValueCountFrequency (%)
5
17.9%
4
14.3%
3
10.7%
2
 
7.1%
2
 
7.1%
2
 
7.1%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Other values (6) 6
21.4%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct8450
Distinct (%)91.1%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
2023-12-12T10:48:40.462978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length10.319103
Min length9

Characters and Unicode

Total characters95720
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7721 ?
Unique (%)83.2%

Sample

1st row180107785
2nd row180108367
3rd row180108372
4th row180108373
5th row180200045
ValueCountFrequency (%)
090600101 67
 
0.7%
090600100 22
 
0.2%
110500091 4
 
< 0.1%
140801251 3
 
< 0.1%
140100019 3
 
< 0.1%
131201661 3
 
< 0.1%
140801250 3
 
< 0.1%
140101264 3
 
< 0.1%
140101278 3
 
< 0.1%
140114458 3
 
< 0.1%
Other values (8440) 9162
98.8%
2023-12-12T10:48:41.062484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 21909
22.9%
1 16725
17.5%
8 9966
10.4%
4 9267
9.7%
2 7482
 
7.8%
9 6810
 
7.1%
6 6382
 
6.7%
3 6084
 
6.4%
5 5928
 
6.2%
7 5038
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 95591
99.9%
Uppercase Letter 129
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 21909
22.9%
1 16725
17.5%
8 9966
10.4%
4 9267
9.7%
2 7482
 
7.8%
9 6810
 
7.1%
6 6382
 
6.7%
3 6084
 
6.4%
5 5928
 
6.2%
7 5038
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
D 129
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 95591
99.9%
Latin 129
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 21909
22.9%
1 16725
17.5%
8 9966
10.4%
4 9267
9.7%
2 7482
 
7.8%
9 6810
 
7.1%
6 6382
 
6.7%
3 6084
 
6.4%
5 5928
 
6.2%
7 5038
 
5.3%
Latin
ValueCountFrequency (%)
D 129
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 95720
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 21909
22.9%
1 16725
17.5%
8 9966
10.4%
4 9267
9.7%
2 7482
 
7.8%
9 6810
 
7.1%
6 6382
 
6.7%
3 6084
 
6.4%
5 5928
 
6.2%
7 5038
 
5.3%

라이센스수
Real number (ℝ)

SKEWED 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3031479
Minimum1
Maximum1000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size81.7 KiB
2023-12-12T10:48:41.277324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q12
median2
Q33
95-th percentile5
Maximum1000
Range999
Interquartile range (IQR)1

Descriptive statistics

Standard deviation29.302
Coefficient of variation (CV)8.8709317
Kurtosis1150.0657
Mean3.3031479
Median Absolute Deviation (MAD)0
Skewness33.905415
Sum30640
Variance858.60717
MonotonicityNot monotonic
2023-12-12T10:48:41.438544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2 6051
65.2%
3 2596
28.0%
5 495
 
5.3%
1 123
 
1.3%
1000 4
 
< 0.1%
999 4
 
< 0.1%
100 1
 
< 0.1%
4 1
 
< 0.1%
52 1
 
< 0.1%
ValueCountFrequency (%)
1 123
 
1.3%
2 6051
65.2%
3 2596
28.0%
4 1
 
< 0.1%
5 495
 
5.3%
52 1
 
< 0.1%
100 1
 
< 0.1%
999 4
 
< 0.1%
1000 4
 
< 0.1%
ValueCountFrequency (%)
1000 4
 
< 0.1%
999 4
 
< 0.1%
100 1
 
< 0.1%
52 1
 
< 0.1%
5 495
 
5.3%
4 1
 
< 0.1%
3 2596
28.0%
2 6051
65.2%
1 123
 
1.3%

공급사번호
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
9
6121 
2
3059 
5
 
96

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row9
2nd row9
3rd row9
4th row9
5th row9

Common Values

ValueCountFrequency (%)
9 6121
66.0%
2 3059
33.0%
5 96
 
1.0%

Length

2023-12-12T10:48:41.596162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:48:41.736056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9 6121
66.0%
2 3059
33.0%
5 96
 
1.0%

저자
Text

Distinct6109
Distinct (%)65.9%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
2023-12-12T10:48:42.189448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length3
Mean length5.4862009
Min length1

Characters and Unicode

Total characters50890
Distinct characters874
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4461 ?
Unique (%)48.1%

Sample

1st row토마 피케티 외 25인
2nd row그레이엄 앨리슨
3rd row레이철 폴락
4th row정태섭
5th row김동영
ValueCountFrequency (%)
지은이 82
 
0.6%
김해정 67
 
0.5%
66
 
0.5%
42
 
0.3%
제작팀 40
 
0.3%
제임스 39
 
0.3%
로버트 36
 
0.2%
데이비드 35
 
0.2%
그림 35
 
0.2%
옮긴이 31
 
0.2%
Other values (7690) 14079
96.7%
2023-12-12T10:48:42.863907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5276
 
10.4%
1877
 
3.7%
1456
 
2.9%
, 1078
 
2.1%
1008
 
2.0%
844
 
1.7%
781
 
1.5%
596
 
1.2%
541
 
1.1%
533
 
1.0%
Other values (864) 36900
72.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41090
80.7%
Space Separator 5276
 
10.4%
Lowercase Letter 1736
 
3.4%
Other Punctuation 1365
 
2.7%
Uppercase Letter 884
 
1.7%
Close Punctuation 237
 
0.5%
Open Punctuation 237
 
0.5%
Decimal Number 57
 
0.1%
Dash Punctuation 7
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1877
 
4.6%
1456
 
3.5%
1008
 
2.5%
844
 
2.1%
781
 
1.9%
596
 
1.5%
541
 
1.3%
533
 
1.3%
498
 
1.2%
429
 
1.0%
Other values (790) 32527
79.2%
Uppercase Letter
ValueCountFrequency (%)
M 92
 
10.4%
S 85
 
9.6%
B 79
 
8.9%
K 78
 
8.8%
J 60
 
6.8%
A 56
 
6.3%
C 56
 
6.3%
T 46
 
5.2%
L 45
 
5.1%
E 38
 
4.3%
Other values (16) 249
28.2%
Lowercase Letter
ValueCountFrequency (%)
e 207
11.9%
a 204
11.8%
n 176
10.1%
r 168
9.7%
t 152
8.8%
i 144
 
8.3%
o 117
 
6.7%
l 83
 
4.8%
u 67
 
3.9%
c 54
 
3.1%
Other values (14) 364
21.0%
Decimal Number
ValueCountFrequency (%)
5 14
24.6%
0 10
17.5%
1 10
17.5%
2 7
12.3%
9 6
10.5%
3 3
 
5.3%
8 3
 
5.3%
4 3
 
5.3%
6 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 1078
79.0%
. 221
 
16.2%
& 30
 
2.2%
; 28
 
2.1%
? 4
 
0.3%
/ 2
 
0.1%
: 1
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 225
94.9%
12
 
5.1%
Open Punctuation
ValueCountFrequency (%)
( 225
94.9%
12
 
5.1%
Space Separator
ValueCountFrequency (%)
5276
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41087
80.7%
Common 7180
 
14.1%
Latin 2620
 
5.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1877
 
4.6%
1456
 
3.5%
1008
 
2.5%
844
 
2.1%
781
 
1.9%
596
 
1.5%
541
 
1.3%
533
 
1.3%
498
 
1.2%
429
 
1.0%
Other values (787) 32524
79.2%
Latin
ValueCountFrequency (%)
e 207
 
7.9%
a 204
 
7.8%
n 176
 
6.7%
r 168
 
6.4%
t 152
 
5.8%
i 144
 
5.5%
o 117
 
4.5%
M 92
 
3.5%
S 85
 
3.2%
l 83
 
3.2%
Other values (40) 1192
45.5%
Common
ValueCountFrequency (%)
5276
73.5%
, 1078
 
15.0%
) 225
 
3.1%
( 225
 
3.1%
. 221
 
3.1%
& 30
 
0.4%
; 28
 
0.4%
5 14
 
0.2%
12
 
0.2%
12
 
0.2%
Other values (14) 59
 
0.8%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41087
80.7%
ASCII 9775
 
19.2%
None 25
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5276
54.0%
, 1078
 
11.0%
) 225
 
2.3%
( 225
 
2.3%
. 221
 
2.3%
e 207
 
2.1%
a 204
 
2.1%
n 176
 
1.8%
r 168
 
1.7%
t 152
 
1.6%
Other values (61) 1843
 
18.9%
Hangul
ValueCountFrequency (%)
1877
 
4.6%
1456
 
3.5%
1008
 
2.5%
844
 
2.1%
781
 
1.9%
596
 
1.5%
541
 
1.3%
533
 
1.3%
498
 
1.2%
429
 
1.0%
Other values (787) 32524
79.2%
None
ValueCountFrequency (%)
12
48.0%
12
48.0%
1
 
4.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct1141
Distinct (%)12.3%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
2023-12-12T10:48:43.219554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length4.678633
Min length1

Characters and Unicode

Total characters43399
Distinct characters595
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique398 ?
Unique (%)4.3%

Sample

1st row율리시즈
2nd row세종서적
3rd row재미주의
4th row걷는나무
5th rowarte(아르테)
ValueCountFrequency (%)
위즈덤하우스 314
 
3.3%
21세기북스 226
 
2.4%
도서출판 188
 
2.0%
문학동네 155
 
1.6%
웅진지식하우스 129
 
1.4%
rourke 123
 
1.3%
원앤원북스 117
 
1.2%
다산책방 102
 
1.1%
다산북스 101
 
1.1%
rhk 99
 
1.0%
Other values (1126) 7971
83.7%
2023-12-12T10:48:43.723996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2839
 
6.5%
1871
 
4.3%
944
 
2.2%
640
 
1.5%
604
 
1.4%
602
 
1.4%
597
 
1.4%
586
 
1.4%
582
 
1.3%
571
 
1.3%
Other values (585) 33563
77.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38708
89.2%
Lowercase Letter 1905
 
4.4%
Uppercase Letter 1136
 
2.6%
Decimal Number 599
 
1.4%
Open Punctuation 312
 
0.7%
Close Punctuation 312
 
0.7%
Space Separator 249
 
0.6%
Other Punctuation 169
 
0.4%
Other Symbol 6
 
< 0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2839
 
7.3%
1871
 
4.8%
944
 
2.4%
640
 
1.7%
604
 
1.6%
602
 
1.6%
597
 
1.5%
586
 
1.5%
582
 
1.5%
571
 
1.5%
Other values (521) 28872
74.6%
Uppercase Letter
ValueCountFrequency (%)
R 236
20.8%
B 190
16.7%
K 158
13.9%
H 119
10.5%
I 72
 
6.3%
O 71
 
6.2%
P 56
 
4.9%
M 43
 
3.8%
S 41
 
3.6%
D 31
 
2.7%
Other values (15) 119
10.5%
Lowercase Letter
ValueCountFrequency (%)
e 337
17.7%
o 295
15.5%
r 265
13.9%
k 193
10.1%
u 136
7.1%
t 107
 
5.6%
a 106
 
5.6%
s 97
 
5.1%
n 67
 
3.5%
i 59
 
3.1%
Other values (12) 243
12.8%
Decimal Number
ValueCountFrequency (%)
2 251
41.9%
1 239
39.9%
0 40
 
6.7%
3 38
 
6.3%
4 14
 
2.3%
6 10
 
1.7%
5 7
 
1.2%
Other Punctuation
ValueCountFrequency (%)
# 70
41.4%
. 47
27.8%
: 37
21.9%
& 14
 
8.3%
? 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 312
100.0%
Close Punctuation
ValueCountFrequency (%)
) 312
100.0%
Space Separator
ValueCountFrequency (%)
249
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38714
89.2%
Latin 3041
 
7.0%
Common 1644
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2839
 
7.3%
1871
 
4.8%
944
 
2.4%
640
 
1.7%
604
 
1.6%
602
 
1.6%
597
 
1.5%
586
 
1.5%
582
 
1.5%
571
 
1.5%
Other values (522) 28878
74.6%
Latin
ValueCountFrequency (%)
e 337
 
11.1%
o 295
 
9.7%
r 265
 
8.7%
R 236
 
7.8%
k 193
 
6.3%
B 190
 
6.2%
K 158
 
5.2%
u 136
 
4.5%
H 119
 
3.9%
t 107
 
3.5%
Other values (37) 1005
33.0%
Common
ValueCountFrequency (%)
( 312
19.0%
) 312
19.0%
2 251
15.3%
249
15.1%
1 239
14.5%
# 70
 
4.3%
. 47
 
2.9%
0 40
 
2.4%
3 38
 
2.3%
: 37
 
2.3%
Other values (6) 49
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38708
89.2%
ASCII 4685
 
10.8%
None 6
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2839
 
7.3%
1871
 
4.8%
944
 
2.4%
640
 
1.7%
604
 
1.6%
602
 
1.6%
597
 
1.5%
586
 
1.5%
582
 
1.5%
571
 
1.5%
Other values (521) 28872
74.6%
ASCII
ValueCountFrequency (%)
e 337
 
7.2%
( 312
 
6.7%
) 312
 
6.7%
o 295
 
6.3%
r 265
 
5.7%
2 251
 
5.4%
249
 
5.3%
1 239
 
5.1%
R 236
 
5.0%
k 193
 
4.1%
Other values (53) 1996
42.6%
None
ValueCountFrequency (%)
6
100.0%
Distinct2653
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
Minimum2002-09-05 00:00:00
Maximum2022-05-30 00:00:00
2023-12-12T10:48:43.932855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:48:44.124812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct51
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size72.6 KiB
Minimum2009-12-02 00:00:00
Maximum2022-07-05 00:00:00
2023-12-12T10:48:44.338896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:48:44.515176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T10:48:38.458737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:48:44.678341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전자책타입라이센스수공급사번호반입일
전자책타입1.0000.4660.0160.566
라이센스수0.4661.0000.0090.607
공급사번호0.0160.0091.0001.000
반입일0.5660.6071.0001.000
2023-12-12T10:48:44.832799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공급사번호전자책타입
공급사번호1.0000.026
전자책타입0.0261.000
2023-12-12T10:48:44.932798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
라이센스수전자책타입공급사번호
라이센스수1.0000.3080.015
전자책타입0.3081.0000.026
공급사번호0.0150.0261.000

Missing values

2023-12-12T10:48:38.654318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:48:38.840699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

전자책타입서명상품코드라이센스수공급사번호저자출판사출판일반입일
0EBOOK애프터 피케티18010778529토마 피케티 외 25인율리시즈2018-01-262018-03-09
1EBOOK예정된 전쟁18010836729그레이엄 앨리슨세종서적2018-01-292018-03-09
2EBOOK럭키 타로북18010837229레이철 폴락재미주의2018-01-302018-03-09
3EBOOK하루를 살아도 후회없이 살고 싶다18010837329정태섭걷는나무2018-01-302018-03-09
4EBOOK무엇이 되지 않더라도18020004529김동영arte(아르테)2018-02-012018-03-09
5EBOOK사랑에 대한 작은 책18020034329울프 스타르크책빛2018-02-012018-03-09
6EBOOK탈무드: 피르케이 아보트18020044329여후다 하나시(편집)투나미스 출판사2018-02-032018-03-09
7EBOOK무라카미 하루키를 음악으로 읽다18020073929구리하라 유이치로 외영인미디어2018-02-022018-03-09
8EBOOK빅데이터 부동산 투자18020122829김기원다산북스2018-02-062019-09-24
9EBOOK미래를 읽는 기술18020375029이동우비즈니스북스2018-02-202018-03-09
전자책타입서명상품코드라이센스수공급사번호저자출판사출판일반입일
9266EBOOK투자를 잘한다는 것22030154129배진한이레미디어2022-03-082022-04-29
9267EBOOK나는 당신도 재개발 투자로 돈을 벌면 좋겠습니다22020306229남무98원앤원북스2022-02-182022-04-29
9268EBOOK돈의 공식22040056829윌리엄 그린RHK2022-04-042022-04-29
9269EBOOK엑설런스22030047529도리스 메르틴다산초당2022-03-032022-04-29
9270EBOOK빅데이터 사용설명서22030115129김진호메이트북스2022-03-072022-04-29
9271EBOOK일잘러는 노션으로 일합니다22020306129김대중원앤원북스2022-02-182022-04-29
9272EBOOK투자의 미래22030154029제러미 시겔이레미디어2022-03-082022-04-29
9273EBOOK나를 알고 싶을 때 뇌과학을 공부합니다22030254029질 볼트 테일러윌북2022-03-152022-04-29
9274EBOOK부의 해답22030434829존 아사라프, 머레이 스미스RHK2022-03-222022-04-29
9275EBOOK통증 때려잡는 스트레칭22030443829최재석주식회사 센시오2022-03-232022-04-29

Duplicate rows

Most frequently occurring

전자책타입서명상품코드라이센스수공급사번호저자출판사출판일반입일# duplicates
475EBOOK연상 일본어 단어장09060010139김해정다락원2009-08-312010-04-3067
409EBOOK실용 일본어 회화 Step 209060010039요시모토 하지메다락원2009-08-312010-04-3022
255EBOOK미네르바의 경제전쟁11050009139박대성미르북스2011-05-202011-11-284
22EBOOKHOLA, 스무살14011405339이지현좋은땅2014-04-032014-10-133
40EBOOK거상 김만덕, 꽃으로 피기보다 새가 되어 날아가리10060048029정창권푸른숲2010-07-082016-05-123
96EBOOK김구 청문회 1 - 독립운동가 김구의 정직한 이력서14080125039김상구매직하우스2014-08-162014-12-113
97EBOOK김구 청문회 2 - 김구는 통일의 화신인가?14080125139김상구매직하우스2014-08-162014-12-113
215EBOOK마지막 외침14010061139무심천좋은땅2014-01-312014-04-303
228EBOOK매직트리 마법의 체스 (상)14010126439안제이말레슈카책빛2014-02-192014-04-303
432EBOOK안녕?! 오케스트라13120166139이보영이담Books2014-01-252014-04-303