Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 5
Missing cells (%): < 0.1%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 712.9 KiB
Average record size in memory: 73.0 B
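
The percentages and the average record size follow from the raw counts. The sketch below reproduces that arithmetic under two assumptions: that "Missing cells (%)" is taken over all variables × observations cells, and that any non-zero value below 0.1% is displayed as "< 0.1%". The helper `fmt_pct` is hypothetical and is not part of ydata-profiling.

```python
# Counts copied from the Dataset statistics in this report.
n_variables = 8
n_observations = 10_000
missing_cells = 5
duplicate_rows = 1
total_size_kib = 712.9

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations


def fmt_pct(pct: float) -> str:
    """Hypothetical formatter mimicking the report's '< 0.1%' display."""
    if 0 < pct < 0.1:
        return "< 0.1%"
    return f"{pct:.1f}%"


print(total_cells, fmt_pct(missing_pct), fmt_pct(duplicate_pct))
print(f"{avg_record_bytes:.1f} B per record")
```

The five missing cells also match the per-variable counts reported further down (2 in 저자, 2 in 출판사, 1 in 대출이용자 출생연도).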

Variable types

Categorical: 2
Text: 4
Numeric: 1
DateTime: 1

Dataset

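Description: Information from the Anyang Municipal Library (안양시립도서관) in Anyang-si, Gyeonggi-do: city/county name (시군명), library name (도서관명), book title (도서명), author (저자), publisher (출판사), book classification number (도서분류번호), borrower birth year (대출이용자 출생연도), and loan date (대출일).
Author: Anyang-si, Gyeonggi-do (경기도 안양시)
URL: https://www.data.go.kr/data/15078121/fileData.do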

Alerts

시군명 has constant value "안양시" (Constant)
도서관명 has constant value "안양시석수도서관" (Constant)
Dataset has 1 (< 0.1%) duplicate row (Duplicates)
대출이용자 출생연도 is highly skewed (γ1 = 40.28394131) (Skewed)

Reproduction

Analysis started: 2024-03-14 20:03:00.807069
Analysis finished: 2024-03-14 20:03:05.333600
Duration: 4.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
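
The reported duration can be checked against the two timestamps. This is a minimal sketch using only the standard library; it assumes both timestamps are in the same time zone.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-14 20:03:00.807069")
finished = datetime.fromisoformat("2024-03-14 20:03:05.333600")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```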

Variables

시군명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
안양시: 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 안양시
2nd row: 안양시
3rd row: 안양시
4th row: 안양시
5th row: 안양시

Common Values

Value | Count | Frequency (%)
안양시 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
안양시 | 10000 | 100.0%

도서관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
안양시석수도서관: 10000

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 안양시석수도서관
2nd row: 안양시석수도서관
3rd row: 안양시석수도서관
4th row: 안양시석수도서관
5th row: 안양시석수도서관

Common Values

Value | Count | Frequency (%)
안양시석수도서관 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
안양시석수도서관 | 10000 | 100.0%

도서명
Text

Distinct: 8654
Distinct (%): 86.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 94
Median length: 68
Mean length: 18.2236
Min length: 1

Characters and Unicode

Total characters: 182236
Distinct characters: 1556
Distinct categories: 12
Distinct scripts: 7
Distinct blocks: 14
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
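
As an illustration of those character properties, the sketch below counts Unicode general categories for one sample title using Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, not part of ydata-profiling, and the standard library does not expose the script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Readable names for a few general-category codes (illustrative subset).
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}


def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Third sample row of the 도서명 variable.
title = "어떤 물질의 사랑: 천선란 소설집"
counts = category_counts(title)
print(counts)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the Korean text variables.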

Unique

Unique: 7576
Unique (%): 75.8%

Sample

1st row: 불안하다고 불안해하면 더 불안해지니까
2nd row: 격몽요결
3rd row: 어떤 물질의 사랑: 천선란 소설집
4th row: 휴먼카인드
5th row: 영차 영차 부지런한 개미

Value | Count | Frequency (%)
(value not recovered) | 1194 | 2.7%
1 | 489 | 1.1%
2 | 418 | 0.9%
장편소설 | 362 | 0.8%
the | 268 | 0.6%
이야기 | 210 | 0.5%
3 | 202 | 0.5%
(value not recovered) | 159 | 0.4%
코믹 | 130 | 0.3%
4 | 126 | 0.3%
Other values (16476) | 40663 | 92.0%

Most occurring characters

Value | Count | Frequency (%)
(space) | 38847 | 21.3%
(Hangul character not recovered) | 3106 | 1.7%
(Hangul character not recovered) | 2754 | 1.5%
e | 2639 | 1.4%
: | 2235 | 1.2%
. | 2195 | 1.2%
a | 1984 | 1.1%
, | 1962 | 1.1%
o | 1911 | 1.0%
(Hangul character not recovered) | 1884 | 1.0%
Other values (1546) | 122719 | 67.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 101732 | 55.8%
Space Separator | 38847 | 21.3%
Lowercase Letter | 22314 | 12.2%
Other Punctuation | 8056 | 4.4%
Decimal Number | 4249 | 2.3%
Uppercase Letter | 3166 | 1.7%
Close Punctuation | 1664 | 0.9%
Open Punctuation | 1663 | 0.9%
Math Symbol | 330 | 0.2%
Dash Punctuation | 180 | 0.1%
Other values (2) | 35 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3106
 
3.1%
2754
 
2.7%
1884
 
1.9%
1536
 
1.5%
1468
 
1.4%
1446
 
1.4%
1340
 
1.3%
1248
 
1.2%
1208
 
1.2%
1202
 
1.2%
Other values (1437) 84540
83.1%
Lowercase Letter
ValueCountFrequency (%)
e 2639
11.8%
a 1984
 
8.9%
o 1911
 
8.6%
t 1669
 
7.5%
i 1571
 
7.0%
r 1479
 
6.6%
s 1437
 
6.4%
n 1420
 
6.4%
h 1067
 
4.8%
l 1029
 
4.6%
Other values (17) 6108
27.4%
Uppercase Letter
ValueCountFrequency (%)
T 332
 
10.5%
S 293
 
9.3%
W 214
 
6.8%
D 194
 
6.1%
B 194
 
6.1%
M 185
 
5.8%
G 184
 
5.8%
C 163
 
5.1%
A 161
 
5.1%
P 155
 
4.9%
Other values (17) 1091
34.5%
Other Punctuation
ValueCountFrequency (%)
: 2235
27.7%
. 2195
27.2%
, 1962
24.4%
! 859
 
10.7%
? 402
 
5.0%
' 157
 
1.9%
· 114
 
1.4%
& 55
 
0.7%
/ 28
 
0.3%
% 15
 
0.2%
Other values (9) 34
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 1214
28.6%
2 873
20.5%
0 489
11.5%
3 458
 
10.8%
4 305
 
7.2%
5 272
 
6.4%
6 171
 
4.0%
7 163
 
3.8%
9 161
 
3.8%
8 143
 
3.4%
Math Symbol
ValueCountFrequency (%)
= 254
77.0%
~ 37
 
11.2%
+ 32
 
9.7%
2
 
0.6%
× 2
 
0.6%
> 1
 
0.3%
< 1
 
0.3%
| 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 1603
96.3%
] 59
 
3.5%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1602
96.3%
[ 59
 
3.5%
1
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
17
81.0%
2
 
9.5%
1
 
4.8%
1
 
4.8%
Other Symbol
ValueCountFrequency (%)
11
78.6%
2
 
14.3%
1
 
7.1%
Dash Punctuation
ValueCountFrequency (%)
- 179
99.4%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
38847
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 101265 | 55.6%
Common | 55003 | 30.2%
Latin | 25499 | 14.0%
Han | 412 | 0.2%
Hiragana | 38 | < 0.1%
Katakana | 17 | < 0.1%
Greek | 2 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3106
 
3.1%
2754
 
2.7%
1884
 
1.9%
1536
 
1.5%
1468
 
1.4%
1446
 
1.4%
1340
 
1.3%
1248
 
1.2%
1208
 
1.2%
1202
 
1.2%
Other values (1211) 84073
83.0%
Han
ValueCountFrequency (%)
14
 
3.4%
14
 
3.4%
13
 
3.2%
11
 
2.7%
10
 
2.4%
10
 
2.4%
10
 
2.4%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (177) 306
74.3%
Latin
ValueCountFrequency (%)
e 2639
 
10.3%
a 1984
 
7.8%
o 1911
 
7.5%
t 1669
 
6.5%
i 1571
 
6.2%
r 1479
 
5.8%
s 1437
 
5.6%
n 1420
 
5.6%
h 1067
 
4.2%
l 1029
 
4.0%
Other values (46) 9293
36.4%
Common
ValueCountFrequency (%)
38847
70.6%
: 2235
 
4.1%
. 2195
 
4.0%
, 1962
 
3.6%
) 1603
 
2.9%
( 1602
 
2.9%
1 1214
 
2.2%
2 873
 
1.6%
! 859
 
1.6%
0 489
 
0.9%
Other values (41) 3124
 
5.7%
Hiragana
ValueCountFrequency (%)
5
 
13.2%
3
 
7.9%
3
 
7.9%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
1
 
2.6%
Other values (14) 14
36.8%
Katakana
ValueCountFrequency (%)
2
 
11.8%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (5) 5
29.4%
Greek
ValueCountFrequency (%)
1
50.0%
π 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 101250 | 55.6%
ASCII | 80320 | 44.1%
CJK | 403 | 0.2%
None | 144 | 0.1%
Hiragana | 38 | < 0.1%
Number Forms | 21 | < 0.1%
Katakana | 17 | < 0.1%
Compat Jamo | 15 | < 0.1%
Misc Symbols | 11 | < 0.1%
CJK Compat Ideographs | 9 | < 0.1%
Other values (4) | 8 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38847
48.4%
e 2639
 
3.3%
: 2235
 
2.8%
. 2195
 
2.7%
a 1984
 
2.5%
, 1962
 
2.4%
o 1911
 
2.4%
t 1669
 
2.1%
) 1603
 
2.0%
( 1602
 
2.0%
Other values (75) 23673
29.5%
Hangul
ValueCountFrequency (%)
3106
 
3.1%
2754
 
2.7%
1884
 
1.9%
1536
 
1.5%
1468
 
1.4%
1446
 
1.4%
1340
 
1.3%
1248
 
1.2%
1208
 
1.2%
1202
 
1.2%
Other values (1201) 84058
83.0%
None
ValueCountFrequency (%)
· 114
79.2%
9
 
6.2%
5
 
3.5%
3
 
2.1%
2
 
1.4%
× 2
 
1.4%
2
 
1.4%
1
 
0.7%
1
 
0.7%
π 1
 
0.7%
Other values (4) 4
 
2.8%
Number Forms
ValueCountFrequency (%)
17
81.0%
2
 
9.5%
1
 
4.8%
1
 
4.8%
CJK
ValueCountFrequency (%)
14
 
3.5%
14
 
3.5%
13
 
3.2%
11
 
2.7%
10
 
2.5%
10
 
2.5%
10
 
2.5%
8
 
2.0%
8
 
2.0%
8
 
2.0%
Other values (170) 297
73.7%
Misc Symbols
ValueCountFrequency (%)
11
100.0%
Hiragana
ValueCountFrequency (%)
5
 
13.2%
3
 
7.9%
3
 
7.9%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
1
 
2.6%
Other values (14) 14
36.8%
Punctuation
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Compat Jamo
ValueCountFrequency (%)
3
20.0%
2
13.3%
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
CJK Compat Ideographs
ValueCountFrequency (%)
2
22.2%
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
Katakana
ValueCountFrequency (%)
2
 
11.8%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (5) 5
29.4%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct: 7202
Distinct (%): 72.0%
Missing: 2
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 149
Median length: 113
Mean length: 17.745749
Min length: 2

Characters and Unicode

Total characters: 177422
Distinct characters: 1004
Distinct categories: 10
Distinct scripts: 6
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5900
Unique (%): 59.0%

Sample

1st row: 나카시마 미스즈 지음;, 김지희 옮김
2nd row: 이이 지음; 이민수 옮김
3rd row: 천선란 지음
4th row: 뤼트허르 브레흐만 지음; 조현욱 옮김
5th row: 양미진 글;, 서정화 사진;, 류동필 그림

Value | Count | Frequency (%)
지음 | 4358 | 10.0%
그림 | 3322 | 7.6%
(value not recovered) | 2823 | 6.5%
옮김 | 2680 | 6.2%
by | 1496 | 3.4%
글·그림 | 643 | 1.5%
illustrated | 481 | 1.1%
(value not recovered) | 188 | 0.4%
공]지음 | 188 | 0.4%
원작 | 173 | 0.4%
Other values (10888) | 27206 | 62.5%
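
The sample rows suggest that contributors in 저자 are separated by semicolons, sometimes followed by a stray comma or space. The sketch below splits such a string into individual contributor statements. The delimiter convention is an assumption inferred from the five sample rows, and `split_contributors` is a hypothetical helper, not something the dataset or ydata-profiling provides.

```python
def split_contributors(raw: str) -> list[str]:
    """Split a 저자 string on ';' and trim stray commas and spaces.

    Assumes ';' separates contributors, as in the sample rows.
    """
    parts = [part.strip(" ,") for part in raw.split(";")]
    return [part for part in parts if part]


examples = [
    "나카시마 미스즈 지음;, 김지희 옮김",
    "이이 지음; 이민수 옮김",
    "양미진 글;, 서정화 사진;, 류동필 그림",
]
for raw in examples:
    print(split_contributors(raw))
```

Commas inside a single statement are left untouched, because the samples do not make clear whether they separate people or roles.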

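Most occurring characters

Value | Count | Frequency (%)
(space) | 33567 | 18.9%
; | 7539 | 4.2%
, | 5861 | 3.3%
(Hangul character not recovered) | 5391 | 3.0%
(Hangul character not recovered) | 5075 | 2.9%
(Hangul character not recovered) | 4759 | 2.7%
(Hangul character not recovered) | 4313 | 2.4%
(Hangul character not recovered) | 4231 | 2.4%
(Hangul character not recovered) | 3725 | 2.1%
e | 3147 | 1.8%
Other values (994) | 99814 | 56.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 91896 | 51.8%
Space Separator | 33567 | 18.9%
Lowercase Letter | 30563 | 17.2%
Other Punctuation | 15035 | 8.5%
Uppercase Letter | 4482 | 2.5%
Close Punctuation | 898 | 0.5%
Open Punctuation | 898 | 0.5%
Dash Punctuation | 47 | < 0.1%
Decimal Number | 23 | < 0.1%
Math Symbol | 13 | < 0.1%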

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5391
 
5.9%
5075
 
5.5%
4759
 
5.2%
4313
 
4.7%
4231
 
4.6%
3725
 
4.1%
3089
 
3.4%
2772
 
3.0%
1685
 
1.8%
1473
 
1.6%
Other values (908) 55383
60.3%
Lowercase Letter
ValueCountFrequency (%)
e 3147
10.3%
a 2795
 
9.1%
t 2572
 
8.4%
r 2564
 
8.4%
l 2368
 
7.7%
i 2194
 
7.2%
y 2133
 
7.0%
n 1963
 
6.4%
b 1786
 
5.8%
o 1645
 
5.4%
Other values (16) 7396
24.2%
Uppercase Letter
ValueCountFrequency (%)
M 415
 
9.3%
B 381
 
8.5%
R 364
 
8.1%
S 320
 
7.1%
J 318
 
7.1%
A 309
 
6.9%
H 267
 
6.0%
C 252
 
5.6%
D 249
 
5.6%
L 244
 
5.4%
Other values (16) 1363
30.4%
Other Punctuation
ValueCountFrequency (%)
; 7539
50.1%
, 5861
39.0%
· 768
 
5.1%
. 567
 
3.8%
: 250
 
1.7%
' 24
 
0.2%
& 9
 
0.1%
/ 8
 
0.1%
" 4
 
< 0.1%
! 2
 
< 0.1%
Other values (3) 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 7
30.4%
2 4
17.4%
1 4
17.4%
5 2
 
8.7%
9 2
 
8.7%
3 2
 
8.7%
4 1
 
4.3%
7 1
 
4.3%
Close Punctuation
ValueCountFrequency (%)
] 830
92.4%
) 65
 
7.2%
2
 
0.2%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
[ 830
92.4%
( 65
 
7.2%
2
 
0.2%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
< 6
46.2%
> 6
46.2%
| 1
 
7.7%
Space Separator
ValueCountFrequency (%)
33567
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 91802 | 51.7%
Common | 50481 | 28.5%
Latin | 35045 | 19.8%
Han | 43 | < 0.1%
Katakana | 42 | < 0.1%
Hiragana | 9 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5391
 
5.9%
5075
 
5.5%
4759
 
5.2%
4313
 
4.7%
4231
 
4.6%
3725
 
4.1%
3089
 
3.4%
2772
 
3.0%
1685
 
1.8%
1473
 
1.6%
Other values (846) 55289
60.2%
Latin
ValueCountFrequency (%)
e 3147
 
9.0%
a 2795
 
8.0%
t 2572
 
7.3%
r 2564
 
7.3%
l 2368
 
6.8%
i 2194
 
6.3%
y 2133
 
6.1%
n 1963
 
5.6%
b 1786
 
5.1%
o 1645
 
4.7%
Other values (42) 11878
33.9%
Common
ValueCountFrequency (%)
33567
66.5%
; 7539
 
14.9%
, 5861
 
11.6%
] 830
 
1.6%
[ 830
 
1.6%
· 768
 
1.5%
. 567
 
1.1%
: 250
 
0.5%
( 65
 
0.1%
) 65
 
0.1%
Other values (24) 139
 
0.3%
Han
ValueCountFrequency (%)
9
20.9%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (21) 21
48.8%
Katakana
ValueCountFrequency (%)
7
16.7%
6
14.3%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
Other values (12) 12
28.6%
Hiragana
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 91797 | 51.7%
ASCII | 84751 | 47.8%
None | 775 | 0.4%
Katakana | 42 | < 0.1%
CJK | 41 | < 0.1%
Hiragana | 9 | < 0.1%
Compat Jamo | 5 | < 0.1%
CJK Compat Ideographs | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33567
39.6%
; 7539
 
8.9%
, 5861
 
6.9%
e 3147
 
3.7%
a 2795
 
3.3%
t 2572
 
3.0%
r 2564
 
3.0%
l 2368
 
2.8%
i 2194
 
2.6%
y 2133
 
2.5%
Other values (70) 20011
23.6%
Hangul
ValueCountFrequency (%)
5391
 
5.9%
5075
 
5.5%
4759
 
5.2%
4313
 
4.7%
4231
 
4.6%
3725
 
4.1%
3089
 
3.4%
2772
 
3.0%
1685
 
1.8%
1473
 
1.6%
Other values (844) 55284
60.2%
None
ValueCountFrequency (%)
· 768
99.1%
2
 
0.3%
2
 
0.3%
1
 
0.1%
1
 
0.1%
1
 
0.1%
CJK
ValueCountFrequency (%)
9
22.0%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (20) 20
48.8%
Katakana
ValueCountFrequency (%)
7
16.7%
6
14.3%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
Other values (12) 12
28.6%
Compat Jamo
ValueCountFrequency (%)
3
60.0%
2
40.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
100.0%
Hiragana
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

출판사
Text

Distinct: 2992
Distinct (%): 29.9%
Missing: 2
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 46
Median length: 41
Mean length: 5.8784757
Min length: 1

Characters and Unicode

Total characters: 58773
Distinct characters: 739
Distinct categories: 11
Distinct scripts: 5
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1706
Unique (%): 17.1%

Sample

1st row: 부키,
2nd row: 을유문화사
3rd row: 아작
4th row: 인플루엔셜
5th row: 한국톨스토이,

Value | Count | Frequency (%)
비룡소 | 246 | 2.2%
아이세움 | 220 | 2.0%
books | 203 | 1.8%
서울문화사 | 178 | 1.6%
문학동네 | 175 | 1.6%
아울북 | 171 | 1.5%
창비 | 167 | 1.5%
위즈덤하우스 | 145 | 1.3%
scholastic | 133 | 1.2%
책읽는곰 | 117 | 1.0%
Other values (2292) | 9400 | 84.3%

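Most occurring characters

Value | Count | Frequency (%)
, | 4998 | 8.5%
o | 1654 | 2.8%
(Hangul character not recovered) | 1573 | 2.7%
(Hangul character not recovered) | 1359 | 2.3%
(Hangul character not recovered) | 1212 | 2.1%
r | 1189 | 2.0%
(space) | 1160 | 2.0%
(Hangul character not recovered) | 1128 | 1.9%
e | 1067 | 1.8%
(Hangul character not recovered) | 1047 | 1.8%
Other values (729) | 42386 | 72.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 36011 | 61.3%
Lowercase Letter | 12623 | 21.5%
Other Punctuation | 5531 | 9.4%
Uppercase Letter | 2821 | 4.8%
Space Separator | 1160 | 2.0%
Open Punctuation | 235 | 0.4%
Close Punctuation | 235 | 0.4%
Decimal Number | 130 | 0.2%
Dash Punctuation | 21 | < 0.1%
Math Symbol | 5 | < 0.1%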

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1573
 
4.4%
1359
 
3.8%
1212
 
3.4%
1128
 
3.1%
1047
 
2.9%
942
 
2.6%
683
 
1.9%
663
 
1.8%
494
 
1.4%
478
 
1.3%
Other values (648) 26432
73.4%
Uppercase Letter
ValueCountFrequency (%)
H 420
14.9%
B 361
12.8%
S 284
10.1%
P 216
 
7.7%
R 202
 
7.2%
C 197
 
7.0%
M 127
 
4.5%
K 125
 
4.4%
O 109
 
3.9%
A 106
 
3.8%
Other values (17) 674
23.9%
Lowercase Letter
ValueCountFrequency (%)
o 1654
13.1%
r 1189
 
9.4%
e 1067
 
8.5%
s 1025
 
8.1%
a 949
 
7.5%
i 869
 
6.9%
n 852
 
6.7%
l 661
 
5.2%
t 514
 
4.1%
c 469
 
3.7%
Other values (16) 3374
26.7%
Other Punctuation
ValueCountFrequency (%)
, 4998
90.4%
: 382
 
6.9%
& 59
 
1.1%
. 42
 
0.8%
' 20
 
0.4%
11
 
0.2%
· 8
 
0.1%
; 5
 
0.1%
! 3
 
0.1%
# 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 66
50.8%
1 47
36.2%
0 4
 
3.1%
4 4
 
3.1%
3 3
 
2.3%
6 3
 
2.3%
7 1
 
0.8%
5 1
 
0.8%
8 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 215
91.5%
[ 20
 
8.5%
Close Punctuation
ValueCountFrequency (%)
) 215
91.5%
] 20
 
8.5%
Space Separator
ValueCountFrequency (%)
1160
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 35950 | 61.2%
Latin | 15444 | 26.3%
Common | 7318 | 12.5%
Han | 47 | 0.1%
Katakana | 14 | < 0.1%
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1573
 
4.4%
1359
 
3.8%
1212
 
3.4%
1128
 
3.1%
1047
 
2.9%
942
 
2.6%
683
 
1.9%
663
 
1.8%
494
 
1.4%
478
 
1.3%
Other values (602) 26371
73.4%
Latin
ValueCountFrequency (%)
o 1654
 
10.7%
r 1189
 
7.7%
e 1067
 
6.9%
s 1025
 
6.6%
a 949
 
6.1%
i 869
 
5.6%
n 852
 
5.5%
l 661
 
4.3%
t 514
 
3.3%
c 469
 
3.0%
Other values (43) 6195
40.1%
Han
ValueCountFrequency (%)
5
 
10.6%
4
 
8.5%
3
 
6.4%
3
 
6.4%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.1%
1
 
2.1%
Other values (22) 22
46.8%
Common
ValueCountFrequency (%)
, 4998
68.3%
1160
 
15.9%
: 382
 
5.2%
( 215
 
2.9%
) 215
 
2.9%
2 66
 
0.9%
& 59
 
0.8%
1 47
 
0.6%
. 42
 
0.6%
- 21
 
0.3%
Other values (18) 113
 
1.5%
Katakana
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 35949 | 61.2%
ASCII | 22739 | 38.7%
CJK | 46 | 0.1%
None | 22 | < 0.1%
Katakana | 14 | < 0.1%
CJK Compat Ideographs | 1 | < 0.1%
Compat Jamo | 1 | < 0.1%
Punctuation | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 4998
22.0%
o 1654
 
7.3%
r 1189
 
5.2%
1160
 
5.1%
e 1067
 
4.7%
s 1025
 
4.5%
a 949
 
4.2%
i 869
 
3.8%
n 852
 
3.7%
l 661
 
2.9%
Other values (67) 8315
36.6%
Hangul
ValueCountFrequency (%)
1573
 
4.4%
1359
 
3.8%
1212
 
3.4%
1128
 
3.1%
1047
 
2.9%
942
 
2.6%
683
 
1.9%
663
 
1.8%
494
 
1.4%
478
 
1.3%
Other values (601) 26370
73.4%
None
ValueCountFrequency (%)
11
50.0%
· 8
36.4%
3
 
13.6%
CJK
ValueCountFrequency (%)
5
 
10.9%
4
 
8.7%
3
 
6.5%
3
 
6.5%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.2%
1
 
2.2%
Other values (21) 21
45.7%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Punctuation
ValueCountFrequency (%)
1
100.0%

도서분류번호
Text

Distinct: 8883
Distinct (%): 88.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 120000
Distinct characters: 19
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7913
Unique (%): 79.1%

Sample

1st row: EM0000148631
2nd row: EM0000171382
3rd row: EM0000158188
4th row: EM0000163466
5th row: OJ0000041358

Value | Count | Frequency (%)
oj0000056783 | 5 | < 0.1%
oj0000075544 | 5 | < 0.1%
em0000141277 | 4 | < 0.1%
em0000166026 | 4 | < 0.1%
em0000166134 | 4 | < 0.1%
oj0000081194 | 4 | < 0.1%
em0000166038 | 4 | < 0.1%
oj0000062623 | 4 | < 0.1%
oj0000054869 | 4 | < 0.1%
oj0000075907 | 4 | < 0.1%
Other values (8873) | 9958 | 99.6%

Most occurring characters

Value | Count | Frequency (%)
0 | 50875 | 42.4%
1 | 8083 | 6.7%
6 | 6533 | 5.4%
7 | 6511 | 5.4%
5 | 5806 | 4.8%
O | 5772 | 4.8%
J | 5772 | 4.8%
4 | 4895 | 4.1%
8 | 4437 | 3.7%
9 | 4365 | 3.6%
Other values (9) | 16951 | 14.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 100000 | 83.3%
Uppercase Letter | 20000 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 50875 | 50.9%
1 | 8083 | 8.1%
6 | 6533 | 6.5%
7 | 6511 | 6.5%
5 | 5806 | 5.8%
4 | 4895 | 4.9%
8 | 4437 | 4.4%
9 | 4365 | 4.4%
3 | 4285 | 4.3%
2 | 4210 | 4.2%
Uppercase Letter
Value | Count | Frequency (%)
O | 5772 | 28.9%
J | 5772 | 28.9%
M | 4223 | 21.1%
E | 4092 | 20.5%
N | 94 | 0.5%
A | 34 | 0.2%
L | 7 | < 0.1%
S | 5 | < 0.1%
P | 1 | < 0.1%

Most occurring scripts

Value | Count | Frequency (%)
Common | 100000 | 83.3%
Latin | 20000 | 16.7%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 50875 | 50.9%
1 | 8083 | 8.1%
6 | 6533 | 6.5%
7 | 6511 | 6.5%
5 | 5806 | 5.8%
4 | 4895 | 4.9%
8 | 4437 | 4.4%
9 | 4365 | 4.4%
3 | 4285 | 4.3%
2 | 4210 | 4.2%
Latin
Value | Count | Frequency (%)
O | 5772 | 28.9%
J | 5772 | 28.9%
M | 4223 | 21.1%
E | 4092 | 20.5%
N | 94 | 0.5%
A | 34 | 0.2%
L | 7 | < 0.1%
S | 5 | < 0.1%
P | 1 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 120000 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 50875 | 42.4%
1 | 8083 | 6.7%
6 | 6533 | 5.4%
7 | 6511 | 5.4%
5 | 5806 | 4.8%
O | 5772 | 4.8%
J | 5772 | 4.8%
4 | 4895 | 4.1%
8 | 4437 | 3.7%
9 | 4365 | 3.6%
Other values (9) | 16951 | 14.1%

대출이용자 출생연도
Real number (ℝ)

SKEWED 

Distinct: 87
Distinct (%): 0.9%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 1992.9782
Minimum: 1920
Maximum: 9999
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1920
5-th percentile: 1961
Q1: 1977
median: 1983
Q3: 2009
95-th percentile: 2016
Maximum: 9999
Range: 8079
Interquartile range (IQR): 32

Descriptive statistics

Standard deviation: 197.00604
Coefficient of variation (CV): 0.098850073
Kurtosis: 1634.7494
Mean: 1992.9782
Median Absolute Deviation (MAD): 9
Skewness: 40.283941
Sum: 19927789
Variance: 38811.38
Monotonicity: Not monotonic
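
Several of these figures are derived from others and can be cross-checked. The sketch below recomputes the range, the IQR, the coefficient of variation, and the variance from the reported minimum, maximum, quartiles, mean, and standard deviation. It uses the standard definitions of these quantities and only the rounded values shown in this report, so small rounding differences are expected.

```python
# Values copied from the quantile and descriptive statistics above.
minimum, maximum = 1920, 9999
q1, q3 = 1977, 2009
mean = 1992.9782
std_dev = 197.00604

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std_dev / mean               # Coefficient of variation
variance = std_dev ** 2           # Variance

print(value_range, iqr, round(cv, 9), round(variance, 2))
```

The range of 8079 is driven almost entirely by the maximum of 9999, which is not a plausible birth year.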
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
1982 | 484 | 4.8%
1980 | 444 | 4.4%
1983 | 437 | 4.4%
1981 | 437 | 4.4%
1984 | 408 | 4.1%
2015 | 393 | 3.9%
1979 | 382 | 3.8%
2012 | 373 | 3.7%
1978 | 372 | 3.7%
2014 | 353 | 3.5%
Other values (77) | 5916 | 59.2%

Minimum 10 values

Value | Count | Frequency (%)
1920 | 1 | < 0.1%
1933 | 2 | < 0.1%
1934 | 1 | < 0.1%
1939 | 4 | < 0.1%
1940 | 1 | < 0.1%
1941 | 2 | < 0.1%
1942 | 10 | 0.1%
1943 | 6 | 0.1%
1944 | 2 | < 0.1%
1946 | 17 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
9999 | 6 | 0.1%
2022 | 17 | 0.2%
2021 | 2 | < 0.1%
2020 | 53 | 0.5%
2019 | 95 | 0.9%
2018 | 83 | 0.8%
2017 | 172 | 1.7%
2016 | 302 | 3.0%
2015 | 393 | 3.9%
2014 | 353 | 3.5%
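
The six rows with a birth year of 9999 look like a placeholder rather than real data, and they are the likely cause of the extreme skewness and kurtosis. The sketch below estimates the mean with those rows excluded, using only the reported sum and counts. Treating 9999 as a sentinel for "unknown" is an assumption, since the source does not document it, and `clean_birth_year` is a hypothetical helper.

```python
from typing import Optional

SENTINEL_YEAR = 9999  # assumed placeholder for an unknown birth year


def clean_birth_year(value: Optional[int]) -> Optional[int]:
    """Map the assumed sentinel to None; pass other values through."""
    return None if value == SENTINEL_YEAR else value


# Figures copied from this report.
total_sum = 19_927_789        # Sum
non_missing = 10_000 - 1      # observations minus 1 missing value
sentinel_count = 6            # rows with value 9999

reported_mean = total_sum / non_missing
clean_sum = total_sum - SENTINEL_YEAR * sentinel_count
clean_count = non_missing - sentinel_count
clean_mean = clean_sum / clean_count

print(round(reported_mean, 4), round(clean_mean, 2))
```

Under that assumption the mean drops from about 1992.98 to about 1988.17, much closer to the median of 1983.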

대출일
DateTime

Distinct: 126
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2023-01-02 00:00:00
Maximum: 2023-05-16 00:00:00
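
The loan dates in the Sample section are written as `YYYY/MM/DD`. The sketch below parses that format and compares the calendar span between the reported minimum and maximum with the number of distinct dates. The `parse_loan_date` helper is hypothetical, and the format string is inferred from the sample rows.

```python
from datetime import date, datetime


def parse_loan_date(text: str) -> date:
    """Parse a 대출일 value such as '2023/01/03' (format assumed from samples)."""
    return datetime.strptime(text, "%Y/%m/%d").date()


first_loan = date(2023, 1, 2)    # reported Minimum
last_loan = date(2023, 5, 16)    # reported Maximum
distinct_dates = 126             # reported Distinct

calendar_days = (last_loan - first_loan).days + 1  # inclusive span
days_without_loans = calendar_days - distinct_dates

print(parse_loan_date("2023/01/03"), calendar_days, days_without_loans)
```

The difference between the calendar span and the distinct-date count is the number of days in the range with no loan in this 10000-row sample.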
Histogram with fixed size bins (bins=50)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

(index) | 시군명 | 도서관명 | 도서명 | 저자 | 출판사 | 도서분류번호 | 대출이용자 출생연도 | 대출일
261 | 안양시 | 안양시석수도서관 | 불안하다고 불안해하면 더 불안해지니까 | 나카시마 미스즈 지음;, 김지희 옮김 | 부키, | EM0000148631 | 1982 | 2023/01/03
52042 | 안양시 | 안양시석수도서관 | 격몽요결 | 이이 지음; 이민수 옮김 | 을유문화사 | EM0000171382 | 1984 | 2023/04/07
16617 | 안양시 | 안양시석수도서관 | 어떤 물질의 사랑: 천선란 소설집 | 천선란 지음 | 아작 | EM0000158188 | 1987 | 2023/02/01
60218 | 안양시 | 안양시석수도서관 | 휴먼카인드 | 뤼트허르 브레흐만 지음; 조현욱 옮김 | 인플루엔셜 | EM0000163466 | 1964 | 2023/04/23
48055 | 안양시 | 안양시석수도서관 | 영차 영차 부지런한 개미 | 양미진 글;, 서정화 사진;, 류동필 그림 | 한국톨스토이, | OJ0000041358 | 1946 | 2023/03/30
57496 | 안양시 | 안양시석수도서관 | 우리는 왜 죽음을 두려워할 필요 없는가 | 정현채 지음 | 비아북, | EM0000143950 | 1973 | 2023/04/18
24317 | 안양시 | 안양시석수도서관 | (설민석의)만만 한국사: 재미 만점★효과 만점★한국사 만화. 1, 선사 시대부터 삼국 시대까지 | 설민석, 신지희 [공]글; 김덕영 그림 | 아이세움:미래엔 | OJ0000073321 | 2014 | 2023/02/14
8458 | 안양시 | 안양시석수도서관 | 에릭 로메르: 은밀한 개인주의자 | 앙투안 드 베크, 노엘 에르프 [공]지음; 임세은 옮김 | 을유문화사 | EM0000164153 | 1971 | 2023/01/17
30443 | 안양시 | 안양시석수도서관 | 우리 반 채무 관계 | 김선정 글; 우지현 그림 | 위즈덤하우스 | OJ0000078068 | 1981 | 2023/02/24
36756 | 안양시 | 안양시석수도서관 | Put it back | by Roderick Hunt,, illustrated by Alex Brychta | Oxford | OJ0000037065 | 2015 | 2023/03/09

(index) | 시군명 | 도서관명 | 도서명 | 저자 | 출판사 | 도서분류번호 | 대출이용자 출생연도 | 대출일
1320 | 안양시 | 안양시석수도서관 | Stars | by Steve Tomecek;, illustrated by Sachiko Yoshikawa | National Geographic Society, | OJ0000051640 | 2014 | 2023/01/04
60569 | 안양시 | 안양시석수도서관 | 그리스 로마 신화. 24, 헤라클래스의 마지막 원정 | 박시연 글; 최우빈 그림 | 아울북 | OJ0000078521 | 1982 | 2023/04/23
1877 | 안양시 | 안양시석수도서관 | 게으름뱅이 잭 | 조지프 제이콥스 원작;, 고영이 글;, 구윤미 그림 | 아람, | OJ0000044954 | 2019 | 2023/01/05
35006 | 안양시 | 안양시석수도서관 | (똑똑하게)당근 쓰는 토끼 이야기 | 신더스 매클라우드 글·그림;, 공경희 옮김 | 웅진주니어 | OJ0000071940 | 1979 | 2023/03/05
25481 | 안양시 | 안양시석수도서관 | 사주팔자. 1: 서자영 장편소설 | 서자영 지음 | 고즈넉이엔티 | EM0000166586 | 1999 | 2023/02/16
18474 | 안양시 | 안양시석수도서관 | 엄마가 유령이 되었어! | 노부미 글, 그림 ;, 이기웅 옮김 | 길벗어린이, | OJ0000055813 | 2018 | 2023/02/03
35726 | 안양시 | 안양시석수도서관 | 경주 최씨 부자 이야기 | 조은정 글;, 여기 그림 | 여원미디어, | OJ0000066637 | 1981 | 2023/03/07
16546 | 안양시 | 안양시석수도서관 | 엄마의 화코칭 | 김지혜 지음 | 카시오페아, | EM0000146399 | 1982 | 2023/02/01
42923 | 안양시 | 안양시석수도서관 | Go away, big green monster! | by Ed Emberley | JYbooks | OJ0000073743 | 2011 | 2023/03/21
27255 | 안양시 | 안양시석수도서관 | 일론머스크: 리얼 아이언맨 | 소니아 앤더스 감독 | 알스컴퍼니 | NM0000006497 | 1958 | 2023/02/19

Duplicate rows

Most frequently occurring

(index) | 시군명 | 도서관명 | 도서명 | 저자 | 출판사 | 도서분류번호 | 대출이용자 출생연도 | 대출일 | # duplicates
0 | 안양시 | 안양시석수도서관 | 불편한 편의점: 김호연 장편소설 | 김호연 지음 | 나무옆의자 | EM0000166038 | 1999 | 2023/02/25 | 2
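
A row counts as a duplicate when all eight fields match another row exactly. The sketch below shows one way to find such rows with the standard library, representing each row as a tuple. The `find_duplicates` helper is hypothetical, and the three-row input is a toy example built from rows shown in this report, not the full dataset.

```python
from collections import Counter


def find_duplicates(rows: list[tuple]) -> dict[tuple, int]:
    """Return each fully identical row that occurs more than once, with its count."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


duplicated = ("안양시", "안양시석수도서관", "불편한 편의점: 김호연 장편소설",
              "김호연 지음", "나무옆의자", "EM0000166038", 1999, "2023/02/25")
other = ("안양시", "안양시석수도서관", "휴먼카인드",
         "뤼트허르 브레흐만 지음; 조현욱 옮김", "인플루엔셜",
         "EM0000163466", 1964, "2023/04/23")

result = find_duplicates([duplicated, other, duplicated])
for row, n in result.items():
    print(n, row[2])
```

A "# duplicates" value of 2 means the row appears twice, which corresponds to the single duplicate row reported in the overview.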