Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 843 |
Missing cells (%) | 1.4% |
Duplicate rows | 56 |
Duplicate rows (%) | 0.6% |
Total size in memory | 556.6 KiB |
Average record size in memory | 57.0 B |
Variable types
Text | 2 |
---|---|
Categorical | 1 |
Numeric | 1 |
DateTime | 2 |
Dataset
Description | 도서관 이용자별 대출 데이터현황(2020년 기준) - 책제목,성별, 생년, 대출일자, 반납일자 등 |
---|---|
Author | 서울특별시 동작구 |
URL | https://www.data.go.kr/data/15065639/fileData.do |
Dataset has 56 (0.6%) duplicate rows | Duplicates |
반납일 has 833 (8.3%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 09:09:51.669883 |
---|---|
Analysis finished | 2023-12-12 09:09:53.429064 |
Duration | 1.76 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
서명
Text
Distinct | 8761 |
---|---|
Distinct (%) | 87.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 138 |
---|---|
Median length | 78 |
Mean length | 17.6853 |
Min length | 1 |
Characters and Unicode
Total characters | 176853 |
---|---|
Distinct characters | 1409 |
Distinct categories | 17 ? |
Distinct scripts | 4 ? |
Distinct blocks | 9 ? |
Unique
Unique | 7789 ? |
---|---|
Unique (%) | 77.9% |
Sample
1st row | 날마다 그림 |
---|---|
2nd row | 악몽을 파는 가게 : 스티븐 킹 단편집. 2 |
3rd row | Gulliver's travels |
4th row | 나는 어린이입니다:철학동화 |
5th row | 치카치카 군단과 충치왕국 |
Value | Count | Frequency (%) |
2008 | 4.6% | |
the | 417 | 1.0% |
장편소설 | 318 | 0.7% |
이야기 | 300 | 0.7% |
1 | 216 | 0.5% |
and | 179 | 0.4% |
2 | 178 | 0.4% |
내 | 158 | 0.4% |
우리 | 140 | 0.3% |
a | 121 | 0.3% |
Other values (16984) | 39779 |
Most occurring characters
Value | Count | Frequency (%) |
34008 | 19.2% | |
e | 3286 | 1.9% |
이 | 3197 | 1.8% |
: | 2684 | 1.5% |
의 | 2550 | 1.4% |
a | 2224 | 1.3% |
o | 2048 | 1.2% |
는 | 1984 | 1.1% |
t | 1958 | 1.1% |
n | 1836 | 1.0% |
Other values (1399) | 121078 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 100728 | |
Space Separator | 34008 | 19.2% |
Lowercase Letter | 25391 | 14.4% |
Other Punctuation | 7077 | 4.0% |
Uppercase Letter | 3580 | 2.0% |
Decimal Number | 2665 | 1.5% |
Close Punctuation | 1472 | 0.8% |
Open Punctuation | 1472 | 0.8% |
Math Symbol | 322 | 0.2% |
Dash Punctuation | 103 | 0.1% |
Other values (7) | 35 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 3197 | 3.2% |
의 | 2550 | 2.5% |
는 | 1984 | 2.0% |
기 | 1579 | 1.6% |
리 | 1483 | 1.5% |
아 | 1448 | 1.4% |
가 | 1415 | 1.4% |
사 | 1405 | 1.4% |
지 | 1258 | 1.2% |
한 | 1201 | 1.2% |
Other values (1289) | 83208 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 3286 | |
a | 2224 | 8.8% |
o | 2048 | 8.1% |
t | 1958 | 7.7% |
n | 1836 | 7.2% |
r | 1744 | 6.9% |
s | 1719 | 6.8% |
i | 1693 | 6.7% |
h | 1346 | 5.3% |
l | 1061 | 4.2% |
Other values (16) | 6476 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 453 | |
S | 318 | 8.9% |
B | 272 | 7.6% |
M | 256 | 7.2% |
W | 206 | 5.8% |
D | 205 | 5.7% |
A | 203 | 5.7% |
C | 198 | 5.5% |
P | 174 | 4.9% |
H | 163 | 4.6% |
Other values (16) | 1132 |
Other Punctuation
Value | Count | Frequency (%) |
: | 2684 | |
, | 1531 | |
. | 1031 | 14.6% |
! | 855 | 12.1% |
? | 535 | 7.6% |
' | 250 | 3.5% |
· | 113 | 1.6% |
& | 24 | 0.3% |
% | 10 | 0.1% |
; | 10 | 0.1% |
Other values (8) | 34 | 0.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 748 | |
2 | 468 | |
0 | 455 | |
3 | 279 | 10.5% |
4 | 175 | 6.6% |
5 | 163 | 6.1% |
7 | 98 | 3.7% |
6 | 96 | 3.6% |
9 | 95 | 3.6% |
8 | 88 | 3.3% |
Math Symbol
Value | Count | Frequency (%) |
= | 281 | |
~ | 25 | 7.8% |
+ | 5 | 1.6% |
> | 3 | 0.9% |
< | 3 | 0.9% |
| | 3 | 0.9% |
× | 2 | 0.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1416 | |
] | 51 | 3.5% |
』 | 2 | 0.1% |
」 | 2 | 0.1% |
》 | 1 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1416 | |
[ | 51 | 3.5% |
『 | 2 | 0.1% |
「 | 2 | 0.1% |
《 | 1 | 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 7 | |
Ⅰ | 3 | |
Ⅲ | 2 | 15.4% |
Ⅳ | 1 | 7.7% |
Currency Symbol
Value | Count | Frequency (%) |
¤ | 1 | |
¥ | 1 |
Space Separator
Value | Count | Frequency (%) |
34008 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 103 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 7 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 5 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 4 |
Other Symbol
Value | Count | Frequency (%) |
★ | 2 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 100538 | |
Common | 47141 | |
Latin | 28984 | 16.4% |
Han | 190 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 3197 | 3.2% |
의 | 2550 | 2.5% |
는 | 1984 | 2.0% |
기 | 1579 | 1.6% |
리 | 1483 | 1.5% |
아 | 1448 | 1.4% |
가 | 1415 | 1.4% |
사 | 1405 | 1.4% |
지 | 1258 | 1.3% |
한 | 1201 | 1.2% |
Other values (1225) | 83018 |
Han
Value | Count | Frequency (%) |
國 | 12 | 6.3% |
三 | 11 | 5.8% |
大 | 11 | 5.8% |
志 | 11 | 5.8% |
趙 | 10 | 5.3% |
來 | 10 | 5.3% |
河 | 10 | 5.3% |
小 | 10 | 5.3% |
廷 | 10 | 5.3% |
說 | 10 | 5.3% |
Other values (54) | 85 |
Latin
Value | Count | Frequency (%) |
e | 3286 | 11.3% |
a | 2224 | 7.7% |
o | 2048 | 7.1% |
t | 1958 | 6.8% |
n | 1836 | 6.3% |
r | 1744 | 6.0% |
s | 1719 | 5.9% |
i | 1693 | 5.8% |
h | 1346 | 4.6% |
l | 1061 | 3.7% |
Other values (46) | 10069 |
Common
Value | Count | Frequency (%) |
34008 | ||
: | 2684 | 5.7% |
, | 1531 | 3.2% |
) | 1416 | 3.0% |
( | 1416 | 3.0% |
. | 1031 | 2.2% |
! | 855 | 1.8% |
1 | 748 | 1.6% |
? | 535 | 1.1% |
2 | 468 | 1.0% |
Other values (44) | 2449 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 100523 | |
ASCII | 75950 | |
CJK | 189 | 0.1% |
None | 145 | 0.1% |
Punctuation | 15 | < 0.1% |
Compat Jamo | 15 | < 0.1% |
Number Forms | 13 | < 0.1% |
Misc Symbols | 2 | < 0.1% |
CJK Compat Ideographs | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
34008 | ||
e | 3286 | 4.3% |
: | 2684 | 3.5% |
a | 2224 | 2.9% |
o | 2048 | 2.7% |
t | 1958 | 2.6% |
n | 1836 | 2.4% |
r | 1744 | 2.3% |
s | 1719 | 2.3% |
i | 1693 | 2.2% |
Other values (77) | 22750 |
Hangul
Value | Count | Frequency (%) |
이 | 3197 | 3.2% |
의 | 2550 | 2.5% |
는 | 1984 | 2.0% |
기 | 1579 | 1.6% |
리 | 1483 | 1.5% |
아 | 1448 | 1.4% |
가 | 1415 | 1.4% |
사 | 1405 | 1.4% |
지 | 1258 | 1.3% |
한 | 1201 | 1.2% |
Other values (1222) | 83003 |
None
Value | Count | Frequency (%) |
· | 113 | |
' | 9 | 6.2% |
% | 4 | 2.8% |
』 | 2 | 1.4% |
『 | 2 | 1.4% |
& | 2 | 1.4% |
「 | 2 | 1.4% |
」 | 2 | 1.4% |
? | 2 | 1.4% |
× | 2 | 1.4% |
Other values (5) | 5 | 3.4% |
CJK
Value | Count | Frequency (%) |
國 | 12 | 6.3% |
三 | 11 | 5.8% |
大 | 11 | 5.8% |
志 | 11 | 5.8% |
趙 | 10 | 5.3% |
來 | 10 | 5.3% |
河 | 10 | 5.3% |
小 | 10 | 5.3% |
廷 | 10 | 5.3% |
說 | 10 | 5.3% |
Other values (53) | 84 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 7 | |
Ⅰ | 3 | |
Ⅲ | 2 | 15.4% |
Ⅳ | 1 | 7.7% |
Punctuation
Value | Count | Frequency (%) |
… | 6 | |
’ | 5 | |
‘ | 4 |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 5 | |
ㄴ | 5 | |
ㄷ | 5 |
Misc Symbols
Value | Count | Frequency (%) |
★ | 2 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
列 | 1 |
성별
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
여 | |
---|---|
남 | |
<NA> |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.2985 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남 |
---|---|
2nd row | 남 |
3rd row | <NA> |
4th row | 남 |
5th row | 남 |
Common Values
Value | Count | Frequency (%) |
여 | 6003 | |
남 | 3002 | |
<NA> | 995 | 10.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
여 | 6003 | |
남 | 3002 | |
na | 995 | 10.0% |
생년
Real number (ℝ)
Distinct | 81 |
---|---|
Distinct (%) | 0.8% |
Missing | 10 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1987.0931 |
Minimum | 1938 |
---|---|
Maximum | 2019 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1938 |
---|---|
5-th percentile | 1964 |
Q1 | 1976 |
median | 1981 |
Q3 | 2007 |
95-th percentile | 2013 |
Maximum | 2019 |
Range | 81 |
Interquartile range (IQR) | 31 |
Descriptive statistics
Standard deviation | 16.863545 |
---|---|
Coefficient of variation (CV) | 0.0084865401 |
Kurtosis | -0.83210111 |
Mean | 1987.0931 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.28709982 |
Sum | 19851060 |
Variance | 284.37916 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1979 | 595 | 5.9% |
1978 | 531 | 5.3% |
1982 | 498 | 5.0% |
1980 | 496 | 5.0% |
2012 | 459 | 4.6% |
1977 | 452 | 4.5% |
1981 | 449 | 4.5% |
1976 | 446 | 4.5% |
2011 | 394 | 3.9% |
1975 | 376 | 3.8% |
Other values (71) | 5294 |
Value | Count | Frequency (%) |
1938 | 5 | 0.1% |
1939 | 2 | < 0.1% |
1940 | 5 | 0.1% |
1942 | 6 | |
1943 | 14 | |
1944 | 4 | < 0.1% |
1945 | 6 | |
1946 | 11 | |
1947 | 7 | |
1948 | 5 | 0.1% |
Value | Count | Frequency (%) |
2019 | 21 | 0.2% |
2018 | 15 | 0.1% |
2017 | 40 | 0.4% |
2016 | 55 | 0.5% |
2015 | 113 | 1.1% |
2014 | 224 | |
2013 | 289 | |
2012 | 459 | |
2011 | 394 | |
2010 | 365 |
분류번호
Text
Distinct | 946 |
---|---|
Distinct (%) | 9.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
813.8 | 1203 | 12.0% |
843.6 | 919 | 9.2% |
843 | 456 | 4.6% |
813.7 | 367 | 3.7% |
747 | 338 | 3.4% |
408 | 284 | 2.8% |
808.91 | 266 | 2.7% |
833.8 | 261 | 2.6% |
843.5 | 229 | 2.3% |
808.9 | 224 | 2.2% |
Other values (935) | 5453 |
Most occurring characters
Value | Count | Frequency (%) |
8 | 9889 | |
3 | 6753 | |
. | 6678 | |
1 | 5342 | |
4 | 3860 | 8.5% |
0 | 2665 | 5.8% |
9 | 2473 | 5.4% |
7 | 2376 | 5.2% |
5 | 2052 | 4.5% |
6 | 1909 | 4.2% |
Other values (7) | 1591 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 38903 | |
Other Punctuation | 6679 | 14.7% |
Other Letter | 4 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
8 | 9889 | |
3 | 6753 | |
1 | 5342 | |
4 | 3860 | 9.9% |
0 | 2665 | 6.9% |
9 | 2473 | 6.4% |
7 | 2376 | 6.1% |
5 | 2052 | 5.3% |
6 | 1909 | 4.9% |
2 | 1584 | 4.1% |
Other Letter
Value | Count | Frequency (%) |
ㄱ | 2 | |
이 | 1 | |
한 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 6678 | |
/ | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 45583 | |
Hangul | 4 | < 0.1% |
Latin | 1 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
8 | 9889 | |
3 | 6753 | |
. | 6678 | |
1 | 5342 | |
4 | 3860 | 8.5% |
0 | 2665 | 5.8% |
9 | 2473 | 5.4% |
7 | 2376 | 5.2% |
5 | 2052 | 4.5% |
6 | 1909 | 4.2% |
Other values (3) | 1586 | 3.5% |
Hangul
Value | Count | Frequency (%) |
ㄱ | 2 | |
이 | 1 | |
한 | 1 |
Latin
Value | Count | Frequency (%) |
b | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 45584 | |
Compat Jamo | 2 | < 0.1% |
Hangul | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
8 | 9889 | |
3 | 6753 | |
. | 6678 | |
1 | 5342 | |
4 | 3860 | 8.5% |
0 | 2665 | 5.8% |
9 | 2473 | 5.4% |
7 | 2376 | 5.2% |
5 | 2052 | 4.5% |
6 | 1909 | 4.2% |
Other values (4) | 1587 | 3.5% |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 2 |
Hangul
Value | Count | Frequency (%) |
이 | 1 | |
한 | 1 |
대출일
Date
Distinct | 88 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2020-01-02 00:00:00 |
---|---|
Maximum | 2020-03-31 00:00:00 |
반납일
Date
MISSING
 
Distinct | 99 |
---|---|
Distinct (%) | 1.1% |
Missing | 833 |
Missing (%) | 8.3% |
Memory size | 156.2 KiB |
Minimum | 2020-01-02 00:00:00 |
---|---|
Maximum | 2020-04-09 00:00:00 |
성별 | 생년 | 대출일 | 반납일 | |
---|---|---|---|---|
성별 | 1.000 | 0.251 | 0.114 | 0.084 |
생년 | 0.251 | 1.000 | 0.177 | 0.237 |
대출일 | 0.114 | 0.177 | 1.000 | 0.900 |
반납일 | 0.084 | 0.237 | 0.900 | 1.000 |
생년 | 성별 | |
---|---|---|
생년 | 1.000 | 0.191 |
성별 | 0.191 | 1.000 |
서명 | 성별 | 생년 | 분류번호 | 대출일 | 반납일 | |
---|---|---|---|---|---|---|
83023 | 날마다 그림 | 남 | 1970 | 652.52 | 2020-01-29 | 2020-02-14 |
45957 | 악몽을 파는 가게 : 스티븐 킹 단편집. 2 | 남 | 1981 | 808 | 2020-03-31 | <NA> |
92479 | Gulliver's travels | <NA> | 2009 | 747 | 2020-01-30 | 2020-02-13 |
13652 | 나는 어린이입니다:철학동화 | 남 | 2008 | 863 | 2020-01-11 | 2020-01-21 |
20327 | 치카치카 군단과 충치왕국 | 남 | 2013 | 813.8 | 2020-01-03 | 2020-01-16 |
66441 | Homework! | 남 | 2011 | 843.6 | 2020-02-20 | 2020-03-31 |
64114 | 동전 하나로도 행복했던 구멍가게의 날들 | 여 | 1995 | 818 | 2020-03-03 | 2020-03-04 |
40595 | 백년 목 | 여 | 1966 | 514.321 | 2020-01-16 | 2020-01-29 |
36138 | 쿠키 : 한 입의 인생 수업 | 남 | 1976 | 843 | 2020-01-02 | 2020-01-16 |
7291 | Dog man. [2], Unleashed | 여 | 1981 | 843.6 | 2020-01-29 | 2020-02-06 |
서명 | 성별 | 생년 | 분류번호 | 대출일 | 반납일 | |
---|---|---|---|---|---|---|
105 | UFO를 따라간 외계인 | <NA> | 2009 | 813.8 | 2020-02-09 | 2020-03-03 |
36028 | 초록은 어디에 있을까? | 남 | 2015 | 650.8 | 2020-02-04 | 2020-02-13 |
63613 | 나의 뇌는 특별하다 : 템플 그랜딘의 자폐성 뇌 이야기 | 여 | 1978 | 513.896 | 2020-02-14 | 2020-02-20 |
21821 | 성性 정치학 | 여 | 1971 | 337 | 2020-03-07 | 2020-03-14 |
2383 | Little Beauty | 여 | 2011 | 375.1 | 2020-01-29 | 2020-02-14 |
26681 | 엉덩이탐정 : 뿡뿡 무지개 다이아몬드를 찾아라! | 여 | 1980 | 833.8 | 2020-01-17 | 2020-02-04 |
10782 | 국가대표 물고기 금붕이 | 남 | 1974 | 813.8 | 2020-02-02 | 2020-02-16 |
86401 | (에곤 실레)백 년간의 잠:임순만 장편소설 | 여 | 1997 | 813.7 | 2020-01-16 | 2020-02-04 |
53165 | 이솝 이야기 | 여 | 1973 | 808.9 | 2020-02-16 | 2020-02-23 |
44764 | 오 마이 갓 어쩌다 사춘기 | 남 | 2008 | 813.8 | 2020-01-14 | 2020-01-21 |
Most frequently occurring
서명 | 성별 | 생년 | 분류번호 | 대출일 | 반납일 | # duplicates | |
---|---|---|---|---|---|---|---|
9 | (The)Stars | 남 | 2012 | 747 | 2020-01-30 | 2020-02-02 | 3 |
22 | Harry Potter and the Order of the Phoenix | 남 | 2008 | 843.5 | 2020-01-17 | 2020-01-17 | 3 |
41 | 개구쟁이 아치 | 남 | 2009 | 375.108 | 2020-01-14 | 2020-01-31 | 3 |
0 | (Disney Princess)magical tales | 여 | 1975 | 843.6 | 2020-02-18 | <NA> | 2 |
1 | (Disney) Aladdin | 여 | 1975 | 843.6 | 2020-01-14 | 2020-01-19 | 2 |
2 | (The) lion, the witch and the wardrobe | 남 | 1971 | 843.5 | 2020-01-21 | 2020-02-01 | 2 |
3 | (The)Day of the bad haircut | 남 | 1973 | 747 | 2020-01-14 | 2020-01-21 | 2 |
4 | (The)Fault in our stars | 여 | 1973 | 843.6 | 2020-01-12 | 2020-02-02 | 2 |
5 | (The)Huggles' hug | 여 | 1984 | 747 | 2020-01-04 | 2020-01-08 | 2 |
6 | (The)Pizza Monster | 여 | 2011 | 843.5 | 2020-01-21 | 2020-02-11 | 2 |