Overview

Dataset statistics

Number of variables: 10
Number of observations: 9172
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 725.6 KiB
Average record size in memory: 81.0 B

Variable types

Numeric: 1
Categorical: 4
Text: 4
DateTime: 1

Dataset

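Description: Information on the e-book holdings of the Geumcheon-gu district libraries in Seoul. It provides the library name, library classification code, library homepage URL, category, book title, author name, publisher, and format.
Author: Geumcheon-gu, Seoul Metropolitan City (서울특별시 금천구)
URL: https://www.data.go.kr/data/15112687/fileData.do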

Alerts

도서관명 has constant value "금천구립도서관" (Constant)
도서관홈페이지(URL) has constant value "https://elib.geumcheonlib.seoul.kr/FxLibrary/index/" (Constant)
데이터기준일자 has constant value "2024-04-08" (Constant)
형식(전자책 또는 오디오북) is highly overall correlated with 카테고리(대분류) (High correlation)
카테고리(대분류) is highly overall correlated with 형식(전자책 또는 오디오북) (High correlation)
형식(전자책 또는 오디오북) is highly imbalanced (94.7%) (Imbalance)
연번 has unique values (Unique)
도서관구분코드 has unique values (Unique)

Reproduction

Analysis started: 2024-04-21 02:15:38.528345
Analysis finished: 2024-04-21 02:15:41.937479
Duration: 3.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 9172
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4586.5
Minimum: 1
Maximum: 9172
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 80.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 459.55
Q1: 2293.75
Median: 4586.5
Q3: 6879.25
95-th percentile: 8713.45
Maximum: 9172
Range: 9171
Interquartile range (IQR): 4585.5
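
The quantile figures above are consistent with linear interpolation over the sorted values 1 through 9172. The sketch below illustrates that calculation; it is not the profiler's own code. The helper name `quantile_linear` is hypothetical, and the assumption that 연번 is exactly the sequence 1..9172 is inferred from the minimum, maximum, and distinct count reported for the column.

```python
import math


def quantile_linear(sorted_values, q):
    """Return the q-quantile of pre-sorted data using linear interpolation."""
    pos = q * (len(sorted_values) - 1)  # fractional index into the sorted data
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# Assumption: the column holds the consecutive integers 1..9172.
serial = list(range(1, 9173))

p05 = quantile_linear(serial, 0.05)
q1 = quantile_linear(serial, 0.25)
median = quantile_linear(serial, 0.50)
q3 = quantile_linear(serial, 0.75)
p95 = quantile_linear(serial, 0.95)
iqr = q3 - q1

print(p05, q1, median, q3, p95, iqr)
```

Under that assumption, each quantile is simply 1 + q × 9171, which is why the interquartile range comes out to exactly half of the range.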

Descriptive statistics

Standard deviation: 2647.8727
Coefficient of variation (CV): 0.5773188
Kurtosis: -1.2
Mean: 4586.5
Median Absolute Deviation (MAD): 2293
Skewness: 0
Sum: 42067378
Variance: 7011229.7
Monotonicity: Strictly increasing
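
The descriptive statistics above can be cross-checked by hand if 연번 is assumed to be the consecutive integers 1..9172 (an inference from the reported minimum, maximum, and distinct count, not something the report states outright). The sketch below recomputes a few of them with the standard library. Note that the reported variance matches the sample variance (dividing by n - 1), not the population variance.

```python
import math
import statistics

# Assumption: the column holds the consecutive integers 1..9172.
serial = list(range(1, 9173))
n = len(serial)

total = sum(serial)
mean = total / n
# Sample variance (divides by n - 1), which matches the reported figure.
variance = sum((x - mean) ** 2 for x in serial) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation
# Median absolute deviation around the median.
med = statistics.median(serial)
mad = statistics.median(abs(x - med) for x in serial)

print(total, mean, round(variance, 1), round(std, 4), round(cv, 7), mad)
```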
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | < 0.1%
6120 | 1 | < 0.1%
6114 | 1 | < 0.1%
6115 | 1 | < 0.1%
6116 | 1 | < 0.1%
6117 | 1 | < 0.1%
6118 | 1 | < 0.1%
6119 | 1 | < 0.1%
6121 | 1 | < 0.1%
6112 | 1 | < 0.1%
Other values (9162) | 9162 | 99.9%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Value | Count | Frequency (%)
9172 | 1 | < 0.1%
9171 | 1 | < 0.1%
9170 | 1 | < 0.1%
9169 | 1 | < 0.1%
9168 | 1 | < 0.1%
9167 | 1 | < 0.1%
9166 | 1 | < 0.1%
9165 | 1 | < 0.1%
9164 | 1 | < 0.1%
9163 | 1 | < 0.1%

도서관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

금천구립도서관 | 9172

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금천구립도서관
2nd row금천구립도서관
3rd row금천구립도서관
4th row금천구립도서관
5th row금천구립도서관

Common Values

Value | Count | Frequency (%)
금천구립도서관 | 9172 | 100.0%

Length

Histogram of lengths of the category

도서관구분코드
Text

UNIQUE 

Distinct: 9172
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

Length

Max length82
Median length82
Mean length82
Min length82

Characters and Unicode

Total characters752104
Distinct characters27
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9172 ?
Unique (%)100.0%

Sample

1st row금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111434
2nd row금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111435
3rd row금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111436
4th row금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111437
5th row금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111438
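
Each value in this column packs four library name and code pairs into one string, joined by " + " with " : " between name and code. The sketch below shows one way to split such a value into a mapping. The helper name `parse_library_codes` is hypothetical, and the delimiter handling assumes every row follows the layout of the five sample rows shown above.

```python
def parse_library_codes(field):
    """Split a 'name : code + name : code' string into a {name: code} dict."""
    codes = {}
    for part in field.split(" + "):
        name, code = part.split(" : ")
        codes[name.strip()] = int(code)
    return codes


# First sample row of the column, copied from the report.
first_row = (
    "금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + "
    "금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111434"
)

parsed = parse_library_codes(first_row)
for name, code in parsed.items():
    print(name, code)
```

In the sample rows, only the final code changes from row to row (111434, 111435, and so on) while the first three stay fixed, which would explain why every value in the column is distinct.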
ValueCountFrequency (%)
64204
46.7%
금천구립독산도서관 9172
 
6.7%
111040 9172
 
6.7%
금천구립가산도서관 9172
 
6.7%
111077 9172
 
6.7%
금천구립금나래도서관 9172
 
6.7%
111113 9172
 
6.7%
금천구립시흥도서관 9172
 
6.7%
117550 1
 
< 0.1%
117577 1
 
< 0.1%
Other values (9170) 9170
 
6.7%

Most occurring characters

ValueCountFrequency (%)
128408
17.1%
1 121923
16.2%
45860
 
6.1%
36688
 
4.9%
36688
 
4.9%
36688
 
4.9%
36688
 
4.9%
36688
 
4.9%
: 36688
 
4.9%
36688
 
4.9%
Other values (17) 199097
26.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 339364
45.1%
Decimal Number 220128
29.3%
Space Separator 128408
 
17.1%
Other Punctuation 36688
 
4.9%
Math Symbol 27516
 
3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45860
13.5%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
18344
 
5.4%
9172
 
2.7%
9172
 
2.7%
Other values (4) 36688
10.8%
Decimal Number
ValueCountFrequency (%)
1 121923
55.4%
0 30855
 
14.0%
7 22081
 
10.0%
4 12976
 
5.9%
3 12905
 
5.9%
2 4333
 
2.0%
5 3838
 
1.7%
6 3743
 
1.7%
8 3737
 
1.7%
9 3737
 
1.7%
Space Separator
ValueCountFrequency (%)
128408
100.0%
Other Punctuation
ValueCountFrequency (%)
: 36688
100.0%
Math Symbol
ValueCountFrequency (%)
+ 27516
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 412740
54.9%
Hangul 339364
45.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45860
13.5%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
18344
 
5.4%
9172
 
2.7%
9172
 
2.7%
Other values (4) 36688
10.8%
Common
ValueCountFrequency (%)
128408
31.1%
1 121923
29.5%
: 36688
 
8.9%
0 30855
 
7.5%
+ 27516
 
6.7%
7 22081
 
5.3%
4 12976
 
3.1%
3 12905
 
3.1%
2 4333
 
1.0%
5 3838
 
0.9%
Other values (3) 11217
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 412740
54.9%
Hangul 339364
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
128408
31.1%
1 121923
29.5%
: 36688
 
8.9%
0 30855
 
7.5%
+ 27516
 
6.7%
7 22081
 
5.3%
4 12976
 
3.1%
3 12905
 
3.1%
2 4333
 
1.0%
5 3838
 
0.9%
Other values (3) 11217
 
2.7%
Hangul
ValueCountFrequency (%)
45860
13.5%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
36688
10.8%
18344
 
5.4%
9172
 
2.7%
9172
 
2.7%
Other values (4) 36688
10.8%

도서관홈페이지(URL)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 9172

Length

Max length51
Median length51
Mean length51
Min length51

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttps://elib.geumcheonlib.seoul.kr/FxLibrary/index/
2nd rowhttps://elib.geumcheonlib.seoul.kr/FxLibrary/index/
3rd rowhttps://elib.geumcheonlib.seoul.kr/FxLibrary/index/
4th rowhttps://elib.geumcheonlib.seoul.kr/FxLibrary/index/
5th rowhttps://elib.geumcheonlib.seoul.kr/FxLibrary/index/

Common Values

Value | Count | Frequency (%)
https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 9172 | 100.0%

Length

Histogram of lengths of the category

카테고리(대분류)
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

문학 | 4439
경제-비즈니스 | 1291
에세이-산문 | 833
인문 | 655
가정-생활 | 345
Other values (13) | 1609

Length

Max length7
Median length2
Mean length3.4584605
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row문학
2nd row문학
3rd row문학
4th row문학
5th row문학

Common Values

Value | Count | Frequency (%)
문학 | 4439 | 48.4%
경제-비즈니스 | 1291 | 14.1%
에세이-산문 | 833 | 9.1%
인문 | 655 | 7.1%
가정-생활 | 345 | 3.8%
사회 | 310 | 3.4%
역사 | 251 | 2.7%
자연-과학 | 211 | 2.3%
어린이 | 208 | 2.3%
문화-예술 | 169 | 1.8%
Other values (8) | 460 | 5.0%

Length

Histogram of lengths of the category

도서명
Text

Distinct: 9067
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

Length

Max length100
Median length75
Mean length12.871239
Min length1

Characters and Unicode

Total characters118055
Distinct characters2016
Distinct categories17 ?
Distinct scripts5 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8966 ?
Unique (%)97.8%

Sample

1st row12월 12일
2nd row17원 50전
3rd row1932년의 문단 전망
4th row20년 후
5th row3월 창작평
ValueCountFrequency (%)
the 380
 
1.3%
344
 
1.1%
of 198
 
0.7%
전집〉 188
 
0.6%
나는 144
 
0.5%
1 138
 
0.5%
2 136
 
0.5%
〈세계의 110
 
0.4%
이야기 102
 
0.3%
작품집〉 98
 
0.3%
Other values (13885) 28318
93.9%

Most occurring characters

ValueCountFrequency (%)
21001
 
17.8%
2142
 
1.8%
1922
 
1.6%
e 1666
 
1.4%
1579
 
1.3%
( 1196
 
1.0%
) 1193
 
1.0%
1179
 
1.0%
1165
 
1.0%
1008
 
0.9%
Other values (2006) 84004
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75401
63.9%
Space Separator 21001
 
17.8%
Lowercase Letter 10436
 
8.8%
Uppercase Letter 2792
 
2.4%
Decimal Number 2436
 
2.1%
Open Punctuation 2058
 
1.7%
Close Punctuation 2055
 
1.7%
Other Punctuation 1029
 
0.9%
Connector Punctuation 480
 
0.4%
Dash Punctuation 332
 
0.3%
Other values (7) 35
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2142
 
2.8%
1922
 
2.5%
1579
 
2.1%
1179
 
1.6%
1165
 
1.5%
1008
 
1.3%
1006
 
1.3%
936
 
1.2%
882
 
1.2%
872
 
1.2%
Other values (1895) 62710
83.2%
Uppercase Letter
ValueCountFrequency (%)
T 501
17.9%
A 229
 
8.2%
O 192
 
6.9%
S 184
 
6.6%
I 178
 
6.4%
P 175
 
6.3%
M 155
 
5.6%
C 153
 
5.5%
B 114
 
4.1%
E 106
 
3.8%
Other values (18) 805
28.8%
Lowercase Letter
ValueCountFrequency (%)
e 1666
16.0%
o 863
 
8.3%
a 799
 
7.7%
r 794
 
7.6%
n 774
 
7.4%
s 702
 
6.7%
i 687
 
6.6%
t 659
 
6.3%
h 635
 
6.1%
l 515
 
4.9%
Other values (16) 2342
22.4%
Other Punctuation
ValueCountFrequency (%)
, 569
55.3%
? 139
 
13.5%
: 100
 
9.7%
! 71
 
6.9%
' 39
 
3.8%
. 35
 
3.4%
% 21
 
2.0%
· 21
 
2.0%
; 13
 
1.3%
& 9
 
0.9%
Other values (4) 12
 
1.2%
Decimal Number
ValueCountFrequency (%)
1 649
26.6%
2 473
19.4%
0 428
17.6%
3 254
 
10.4%
5 191
 
7.8%
4 124
 
5.1%
9 96
 
3.9%
6 88
 
3.6%
7 77
 
3.2%
8 56
 
2.3%
Open Punctuation
ValueCountFrequency (%)
( 1196
58.1%
729
35.4%
[ 127
 
6.2%
4
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 1193
58.1%
729
35.5%
] 127
 
6.2%
4
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Math Symbol
ValueCountFrequency (%)
~ 8
80.0%
× 1
 
10.0%
+ 1
 
10.0%
Other Symbol
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Final Punctuation
ValueCountFrequency (%)
5
71.4%
2
 
28.6%
Initial Punctuation
ValueCountFrequency (%)
3
60.0%
2
40.0%
Space Separator
ValueCountFrequency (%)
21001
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 480
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 332
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73595
62.3%
Common 29420
 
24.9%
Latin 13232
 
11.2%
Han 1806
 
1.5%
Cyrillic 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2142
 
2.9%
1922
 
2.6%
1579
 
2.1%
1179
 
1.6%
1165
 
1.6%
1008
 
1.4%
1006
 
1.4%
936
 
1.3%
882
 
1.2%
872
 
1.2%
Other values (1142) 60904
82.8%
Han
ValueCountFrequency (%)
36
 
2.0%
29
 
1.6%
20
 
1.1%
18
 
1.0%
17
 
0.9%
16
 
0.9%
15
 
0.8%
15
 
0.8%
15
 
0.8%
14
 
0.8%
Other values (743) 1611
89.2%
Latin
ValueCountFrequency (%)
e 1666
 
12.6%
o 863
 
6.5%
a 799
 
6.0%
r 794
 
6.0%
n 774
 
5.8%
s 702
 
5.3%
i 687
 
5.2%
t 659
 
5.0%
h 635
 
4.8%
l 515
 
3.9%
Other values (48) 5138
38.8%
Common
ValueCountFrequency (%)
21001
71.4%
( 1196
 
4.1%
) 1193
 
4.1%
729
 
2.5%
729
 
2.5%
1 649
 
2.2%
, 569
 
1.9%
_ 480
 
1.6%
2 473
 
1.6%
0 428
 
1.5%
Other values (41) 1973
 
6.7%
Cyrillic
ValueCountFrequency (%)
Г 1
50.0%
И 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73580
62.3%
ASCII 41132
34.8%
CJK 1749
 
1.5%
None 1494
 
1.3%
CJK Compat Ideographs 57
 
< 0.1%
Compat Jamo 15
 
< 0.1%
Punctuation 15
 
< 0.1%
Number Forms 6
 
< 0.1%
Letterlike Symbols 2
 
< 0.1%
Box Drawing 2
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21001
51.1%
e 1666
 
4.1%
( 1196
 
2.9%
) 1193
 
2.9%
o 863
 
2.1%
a 799
 
1.9%
r 794
 
1.9%
n 774
 
1.9%
s 702
 
1.7%
i 687
 
1.7%
Other values (73) 11457
27.9%
Hangul
ValueCountFrequency (%)
2142
 
2.9%
1922
 
2.6%
1579
 
2.1%
1179
 
1.6%
1165
 
1.6%
1008
 
1.4%
1006
 
1.4%
936
 
1.3%
882
 
1.2%
872
 
1.2%
Other values (1137) 60889
82.8%
None
ValueCountFrequency (%)
729
48.8%
729
48.8%
· 21
 
1.4%
4
 
0.3%
4
 
0.3%
× 1
 
0.1%
­ 1
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Other values (2) 2
 
0.1%
CJK
ValueCountFrequency (%)
36
 
2.1%
29
 
1.7%
20
 
1.1%
18
 
1.0%
17
 
1.0%
16
 
0.9%
15
 
0.9%
15
 
0.9%
15
 
0.9%
14
 
0.8%
Other values (711) 1554
88.9%
Compat Jamo
ValueCountFrequency (%)
8
53.3%
3
 
20.0%
2
 
13.3%
1
 
6.7%
1
 
6.7%
CJK Compat Ideographs
ValueCountFrequency (%)
7
 
12.3%
4
 
7.0%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
Other values (22) 25
43.9%
Punctuation
ValueCountFrequency (%)
5
33.3%
3
20.0%
3
20.0%
2
 
13.3%
2
 
13.3%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Box Drawing
ValueCountFrequency (%)
2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Cyrillic
ValueCountFrequency (%)
Г 1
50.0%
И 1
50.0%
Number Forms
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

저자명
Text

Distinct: 4495
Distinct (%): 49.0%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

Length

Max length97
Median length3
Mean length5.7033362
Min length2

Characters and Unicode

Total characters52311
Distinct characters836
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3752 ?
Unique (%)40.9%

Sample

1st row이상
2nd row나도향
3rd row심훈
4th row오 헨리
5th row채만식
ValueCountFrequency (%)
방정환 242
 
1.7%
이효석 236
 
1.7%
김동인 163
 
1.2%
김소월 154
 
1.1%
박인환 146
 
1.0%
장정심 137
 
1.0%
정지용 133
 
1.0%
채만식 116
 
0.8%
오장환 114
 
0.8%
권구현 108
 
0.8%
Other values (6235) 12432
88.9%

Most occurring characters

ValueCountFrequency (%)
4812
 
9.2%
1759
 
3.4%
1333
 
2.5%
, 1190
 
2.3%
1145
 
2.2%
e 1017
 
1.9%
a 796
 
1.5%
o 714
 
1.4%
672
 
1.3%
613
 
1.2%
Other values (826) 38260
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35882
68.6%
Lowercase Letter 7515
 
14.4%
Space Separator 4812
 
9.2%
Uppercase Letter 2598
 
5.0%
Other Punctuation 1354
 
2.6%
Open Punctuation 50
 
0.1%
Close Punctuation 50
 
0.1%
Decimal Number 40
 
0.1%
Dash Punctuation 8
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1759
 
4.9%
1333
 
3.7%
1145
 
3.2%
672
 
1.9%
613
 
1.7%
536
 
1.5%
528
 
1.5%
489
 
1.4%
454
 
1.3%
424
 
1.2%
Other values (750) 27929
77.8%
Uppercase Letter
ValueCountFrequency (%)
A 334
12.9%
D 191
 
7.4%
E 190
 
7.3%
H 190
 
7.3%
L 186
 
7.2%
C 172
 
6.6%
B 163
 
6.3%
P 143
 
5.5%
W 131
 
5.0%
S 122
 
4.7%
Other values (15) 776
29.9%
Lowercase Letter
ValueCountFrequency (%)
e 1017
13.5%
a 796
10.6%
o 714
9.5%
l 585
 
7.8%
n 581
 
7.7%
i 536
 
7.1%
r 506
 
6.7%
h 431
 
5.7%
s 393
 
5.2%
t 284
 
3.8%
Other values (14) 1672
22.2%
Decimal Number
ValueCountFrequency (%)
7 9
22.5%
5 7
17.5%
1 5
12.5%
6 5
12.5%
8 4
10.0%
2 4
10.0%
9 3
 
7.5%
3 2
 
5.0%
0 1
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 1190
87.9%
. 152
 
11.2%
? 5
 
0.4%
; 3
 
0.2%
' 1
 
0.1%
· 1
 
0.1%
/ 1
 
0.1%
& 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 36
72.0%
13
 
26.0%
1
 
2.0%
Close Punctuation
ValueCountFrequency (%)
) 36
72.0%
13
 
26.0%
1
 
2.0%
Space Separator
ValueCountFrequency (%)
4812
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35879
68.6%
Latin 10113
 
19.3%
Common 6316
 
12.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1759
 
4.9%
1333
 
3.7%
1145
 
3.2%
672
 
1.9%
613
 
1.7%
536
 
1.5%
528
 
1.5%
489
 
1.4%
454
 
1.3%
424
 
1.2%
Other values (747) 27926
77.8%
Latin
ValueCountFrequency (%)
e 1017
 
10.1%
a 796
 
7.9%
o 714
 
7.1%
l 585
 
5.8%
n 581
 
5.7%
i 536
 
5.3%
r 506
 
5.0%
h 431
 
4.3%
s 393
 
3.9%
A 334
 
3.3%
Other values (39) 4220
41.7%
Common
ValueCountFrequency (%)
4812
76.2%
, 1190
 
18.8%
. 152
 
2.4%
( 36
 
0.6%
) 36
 
0.6%
13
 
0.2%
13
 
0.2%
7 9
 
0.1%
- 8
 
0.1%
5 7
 
0.1%
Other values (17) 40
 
0.6%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
姿 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35879
68.6%
ASCII 16398
31.3%
None 29
 
0.1%
CJK 3
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4812
29.3%
, 1190
 
7.3%
e 1017
 
6.2%
a 796
 
4.9%
o 714
 
4.4%
l 585
 
3.6%
n 581
 
3.5%
i 536
 
3.3%
r 506
 
3.1%
h 431
 
2.6%
Other values (59) 5230
31.9%
Hangul
ValueCountFrequency (%)
1759
 
4.9%
1333
 
3.7%
1145
 
3.2%
672
 
1.9%
613
 
1.7%
536
 
1.5%
528
 
1.5%
489
 
1.4%
454
 
1.3%
424
 
1.2%
Other values (747) 27926
77.8%
None
ValueCountFrequency (%)
13
44.8%
13
44.8%
1
 
3.4%
1
 
3.4%
· 1
 
3.4%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
姿 1
33.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

출판사
Text

Distinct: 729
Distinct (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

Length

Max length25
Median length16
Mean length5.8795246
Min length1

Characters and Unicode

Total characters53927
Distinct characters505
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique257 ?
Unique (%)2.8%

Sample

1st row지혜의숲
2nd row광보사
3rd row광보사
4th row아이브러리
5th row지혜의숲
ValueCountFrequency (%)
북큐브네트웍스 2375
 
23.6%
21세기북스 178
 
1.8%
문학동네 166
 
1.6%
위즈덤하우스 163
 
1.6%
rhk 141
 
1.4%
지혜의숲 134
 
1.3%
성현사 124
 
1.2%
도서출판 123
 
1.2%
house 121
 
1.2%
brass 121
 
1.2%
Other values (737) 6438
63.8%

Most occurring characters

ValueCountFrequency (%)
4009
 
7.4%
3570
 
6.6%
2591
 
4.8%
2500
 
4.6%
2472
 
4.6%
2384
 
4.4%
2375
 
4.4%
s 1050
 
1.9%
912
 
1.7%
e 825
 
1.5%
Other values (495) 31239
57.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42626
79.0%
Lowercase Letter 7103
 
13.2%
Uppercase Letter 2068
 
3.8%
Space Separator 912
 
1.7%
Decimal Number 437
 
0.8%
Open Punctuation 273
 
0.5%
Close Punctuation 273
 
0.5%
Other Punctuation 142
 
0.3%
Final Punctuation 92
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4009
 
9.4%
3570
 
8.4%
2591
 
6.1%
2500
 
5.9%
2472
 
5.8%
2384
 
5.6%
2375
 
5.6%
612
 
1.4%
575
 
1.3%
476
 
1.1%
Other values (429) 21062
49.4%
Uppercase Letter
ValueCountFrequency (%)
B 349
16.9%
P 246
11.9%
K 176
8.5%
S 154
7.4%
C 152
7.4%
O 149
7.2%
H 146
7.1%
R 146
7.1%
G 128
 
6.2%
L 116
 
5.6%
Other values (13) 306
14.8%
Lowercase Letter
ValueCountFrequency (%)
s 1050
14.8%
e 825
11.6%
a 723
10.2%
o 586
8.3%
r 565
8.0%
l 557
7.8%
i 394
 
5.5%
u 356
 
5.0%
h 342
 
4.8%
n 329
 
4.6%
Other values (12) 1376
19.4%
Decimal Number
ValueCountFrequency (%)
1 185
42.3%
2 184
42.1%
3 20
 
4.6%
0 19
 
4.3%
4 11
 
2.5%
9 6
 
1.4%
6 4
 
0.9%
5 4
 
0.9%
8 4
 
0.9%
Other Punctuation
ValueCountFrequency (%)
& 41
28.9%
; 41
28.9%
. 23
16.2%
# 21
14.8%
: 13
 
9.2%
, 2
 
1.4%
? 1
 
0.7%
Space Separator
ValueCountFrequency (%)
912
100.0%
Open Punctuation
ValueCountFrequency (%)
( 273
100.0%
Close Punctuation
ValueCountFrequency (%)
) 273
100.0%
Final Punctuation
ValueCountFrequency (%)
92
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42626
79.0%
Latin 9171
 
17.0%
Common 2130
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4009
 
9.4%
3570
 
8.4%
2591
 
6.1%
2500
 
5.9%
2472
 
5.8%
2384
 
5.6%
2375
 
5.6%
612
 
1.4%
575
 
1.3%
476
 
1.1%
Other values (429) 21062
49.4%
Latin
ValueCountFrequency (%)
s 1050
 
11.4%
e 825
 
9.0%
a 723
 
7.9%
o 586
 
6.4%
r 565
 
6.2%
l 557
 
6.1%
i 394
 
4.3%
u 356
 
3.9%
B 349
 
3.8%
h 342
 
3.7%
Other values (35) 3424
37.3%
Common
ValueCountFrequency (%)
912
42.8%
( 273
 
12.8%
) 273
 
12.8%
1 185
 
8.7%
2 184
 
8.6%
92
 
4.3%
& 41
 
1.9%
; 41
 
1.9%
. 23
 
1.1%
# 21
 
1.0%
Other values (11) 85
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42624
79.0%
ASCII 11209
 
20.8%
Punctuation 92
 
0.2%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4009
 
9.4%
3570
 
8.4%
2591
 
6.1%
2500
 
5.9%
2472
 
5.8%
2384
 
5.6%
2375
 
5.6%
612
 
1.4%
575
 
1.3%
476
 
1.1%
Other values (428) 21060
49.4%
ASCII
ValueCountFrequency (%)
s 1050
 
9.4%
912
 
8.1%
e 825
 
7.4%
a 723
 
6.5%
o 586
 
5.2%
r 565
 
5.0%
l 557
 
5.0%
i 394
 
3.5%
u 356
 
3.2%
B 349
 
3.1%
Other values (55) 4892
43.6%
Punctuation
ValueCountFrequency (%)
92
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

형식(전자책 또는 오디오북)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB

전자책 | 9117
오디오북 | 55

Length

Max length4
Median length3
Mean length3.0059965
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전자책
2nd row전자책
3rd row전자책
4th row전자책
5th row전자책

Common Values

Value | Count | Frequency (%)
전자책 | 9117 | 99.4%
오디오북 | 55 | 0.6%
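
The 94.7% imbalance figure in the alerts can be reproduced from these two counts as one minus the Shannon entropy of the class shares, normalized by the logarithm of the number of classes. The sketch below shows that formula, which matches the reported value; it is not a quotation of the profiler's implementation, and the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """One minus the Shannon entropy of the class shares, normalized by log2(k)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no imbalance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts taken from the Common Values table: 전자책 9117, 오디오북 55.
score = imbalance_score([9117, 55])
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes, and a score approaching 1 means nearly all rows fall into a single class, as is the case here.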

Length

Histogram of lengths of the category

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.8 KiB
Minimum: 2024-04-08 00:00:00
Maximum: 2024-04-08 00:00:00
Histogram with fixed size bins (bins=1)

Correlations

Variable | 연번 | 카테고리(대분류) | 형식(전자책 또는 오디오북)
연번 | 1.000 | 0.575 | 0.301
카테고리(대분류) | 0.575 | 1.000 | 1.000
형식(전자책 또는 오디오북) | 0.301 | 1.000 | 1.000

Variable | 형식(전자책 또는 오디오북) | 카테고리(대분류)
형식(전자책 또는 오디오북) | 1.000 | 0.999
카테고리(대분류) | 0.999 | 1.000

Variable | 연번 | 카테고리(대분류) | 형식(전자책 또는 오디오북)
연번 | 1.000 | 0.262 | 0.231
카테고리(대분류) | 0.262 | 1.000 | 0.999
형식(전자책 또는 오디오북) | 0.231 | 0.999 | 1.000
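
The near-perfect association between 카테고리(대분류) and 형식(전자책 또는 오디오북) is what one would expect if a single category maps entirely onto one format. The report does not state which coefficient each matrix uses, so the sketch below uses the plain (uncorrected) Cramér's V, a common association measure for two categorical variables, purely as an illustration. The function name `cramers_v` and the contingency table `toy_table` are hypothetical; the cell counts are assumptions, not the dataset's actual cross-tabulation.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical table: rows are categories, columns are (전자책, 오디오북).
# The last row stands for a category that contains only audiobooks.
toy_table = [
    [4439, 0],
    [1291, 0],
    [0, 55],
]
v = cramers_v(toy_table)
print(round(v, 3))
```

When the format is fully determined by the category, as in this toy table, the statistic reaches its maximum. A bias-corrected variant would give a slightly smaller value on the same table.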

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

Index | 연번 | 도서관명 | 도서관구분코드 | 도서관홈페이지(URL) | 카테고리(대분류) | 도서명 | 저자명 | 출판사 | 형식(전자책 또는 오디오북) | 데이터기준일자
0 | 1 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111434 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 12월 12일 | 이상 | 지혜의숲 | 전자책 | 2024-04-08
1 | 2 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111435 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 17원 50전 | 나도향 | 광보사 | 전자책 | 2024-04-08
2 | 3 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111436 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 1932년의 문단 전망 | 심훈 | 광보사 | 전자책 | 2024-04-08
3 | 4 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111437 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 20년 후 | 오 헨리 | 아이브러리 | 전자책 | 2024-04-08
4 | 5 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111438 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 3월 창작평 | 채만식 | 지혜의숲 | 전자책 | 2024-04-08
5 | 6 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111439 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 5원 75전 | 최서해 | 광보사 | 전자책 | 2024-04-08
6 | 7 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111440 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 가거라 벗이여 | 오장환 | 동도서기 | 전자책 | 2024-04-08
7 | 8 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111441 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 가구의 추위 | 이상 | 지혜의숲 | 전자책 | 2024-04-08
8 | 9 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111442 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 가난한 아내 | 최서해 | 광보사 | 전자책 | 2024-04-08
9 | 10 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 111443 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 문학 | 가는 길 | 김소월 | 동도서기 | 전자책 | 2024-04-08

Index | 연번 | 도서관명 | 도서관구분코드 | 도서관홈페이지(URL) | 카테고리(대분류) | 도서명 | 저자명 | 출판사 | 형식(전자책 또는 오디오북) | 데이터기준일자
9162 | 9163 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120596 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 날개 | 이상 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9163 | 9164 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120597 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 메밀꽃 필 무렵 | 이효석 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9164 | 9165 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120598 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 만무방 | 김유정 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9165 | 9166 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120599 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 산 | 이효석 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9166 | 9167 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120600 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 밤길 | 이태준 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9167 | 9168 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120601 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 빈처 | 현진건 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9168 | 9169 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120602 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - B사감과 러브레터 | 현진건 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9169 | 9170 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120603 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 꿈 | 나도향 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9170 | 9171 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120604 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 두포전 | 김유정 | 북큐브네트웍스 | 오디오북 | 2024-04-08
9171 | 9172 | 금천구립도서관 | 금천구립독산도서관 : 111040 + 금천구립가산도서관 : 111077 + 금천구립금나래도서관 : 111113 + 금천구립시흥도서관 : 120605 | https://elib.geumcheonlib.seoul.kr/FxLibrary/index/ | 오디오북 | [오디오북] 한국대표중단편문학 - 배따라기 | 김동인 | 북큐브네트웍스 | 오디오북 | 2024-04-08