Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory52.3 B

Variable types

Numeric3
Text1
Categorical2

Dataset

Description원신흥도서관의 1년간 대출실적을 기준으로 선정한 베스트 도서대출 100선(순위, 도서명, 저자, 출판사, 출판년, 대출횟수)
Author대전광역시 유성구
URLhttps://www.data.go.kr/data/15053374/fileData.do

Alerts

순위 is highly overall correlated with 대출건수High correlation
대출건수 is highly overall correlated with 순위High correlation
저자 is highly overall correlated with 발행처High correlation
발행처 is highly overall correlated with 저자High correlation
순위 has unique valuesUnique
서명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:23:47.286492
Analysis finished2023-12-12 12:23:48.937895
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순위
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T21:23:49.048265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-12T21:23:49.280443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

서명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-12T21:23:49.526334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length41
Mean length30.99
Min length8

Characters and Unicode

Total characters3099
Distinct characters321
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 3 , 도시 편
2nd row그리스 로마 신화 : 만화로 읽는 초등 인문학 . 13 , 도도한 여신, 아르테미스의 원칙
3rd row그리스 로마 신화 . 12 , 에로스와 프시케의 진정한 사랑
4th row쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 27 , 로봇의 심장편
5th row그리스 로마 신화 . 21 , 신이 선택한 인간, 헤라클레스의 탄생
ValueCountFrequency (%)
193
 
21.3%
go 21
 
2.3%
쿠키런 20
 
2.2%
로마 15
 
1.7%
신화 15
 
1.7%
그리스 15
 
1.7%
학습만화 14
 
1.5%
과학 14
 
1.5%
대작전 12
 
1.3%
서바이벌 12
 
1.3%
Other values (309) 577
63.5%
2023-12-12T21:23:49.963030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
808
26.1%
. 89
 
2.9%
, 76
 
2.5%
54
 
1.7%
54
 
1.7%
42
 
1.4%
40
 
1.3%
: 39
 
1.3%
37
 
1.2%
36
 
1.2%
Other values (311) 1824
58.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1701
54.9%
Space Separator 808
26.1%
Other Punctuation 222
 
7.2%
Decimal Number 147
 
4.7%
Lowercase Letter 118
 
3.8%
Uppercase Letter 40
 
1.3%
Open Punctuation 29
 
0.9%
Close Punctuation 29
 
0.9%
Math Symbol 4
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
3.2%
54
 
3.2%
42
 
2.5%
40
 
2.4%
37
 
2.2%
36
 
2.1%
35
 
2.1%
32
 
1.9%
31
 
1.8%
29
 
1.7%
Other values (260) 1311
77.1%
Lowercase Letter
ValueCountFrequency (%)
o 25
21.2%
r 15
12.7%
a 15
12.7%
e 9
 
7.6%
s 8
 
6.8%
i 8
 
6.8%
p 6
 
5.1%
h 4
 
3.4%
v 4
 
3.4%
n 4
 
3.4%
Other values (8) 20
16.9%
Uppercase Letter
ValueCountFrequency (%)
G 21
52.5%
S 3
 
7.5%
B 3
 
7.5%
V 3
 
7.5%
T 3
 
7.5%
K 2
 
5.0%
A 1
 
2.5%
C 1
 
2.5%
W 1
 
2.5%
O 1
 
2.5%
Decimal Number
ValueCountFrequency (%)
1 34
23.1%
2 29
19.7%
4 15
10.2%
3 15
10.2%
5 13
 
8.8%
7 11
 
7.5%
9 9
 
6.1%
6 8
 
5.4%
8 8
 
5.4%
0 5
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 89
40.1%
, 76
34.2%
: 39
17.6%
! 17
 
7.7%
? 1
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 26
89.7%
[ 3
 
10.3%
Close Punctuation
ValueCountFrequency (%)
) 26
89.7%
] 3
 
10.3%
Space Separator
ValueCountFrequency (%)
808
100.0%
Math Symbol
ValueCountFrequency (%)
= 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1699
54.8%
Common 1240
40.0%
Latin 158
 
5.1%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
3.2%
54
 
3.2%
42
 
2.5%
40
 
2.4%
37
 
2.2%
36
 
2.1%
35
 
2.1%
32
 
1.9%
31
 
1.8%
29
 
1.7%
Other values (258) 1309
77.0%
Latin
ValueCountFrequency (%)
o 25
15.8%
G 21
13.3%
r 15
 
9.5%
a 15
 
9.5%
e 9
 
5.7%
s 8
 
5.1%
i 8
 
5.1%
p 6
 
3.8%
h 4
 
2.5%
v 4
 
2.5%
Other values (19) 43
27.2%
Common
ValueCountFrequency (%)
808
65.2%
. 89
 
7.2%
, 76
 
6.1%
: 39
 
3.1%
1 34
 
2.7%
2 29
 
2.3%
( 26
 
2.1%
) 26
 
2.1%
! 17
 
1.4%
4 15
 
1.2%
Other values (12) 81
 
6.5%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1699
54.8%
ASCII 1398
45.1%
CJK 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
808
57.8%
. 89
 
6.4%
, 76
 
5.4%
: 39
 
2.8%
1 34
 
2.4%
2 29
 
2.1%
( 26
 
1.9%
) 26
 
1.9%
o 25
 
1.8%
G 21
 
1.5%
Other values (41) 225
 
16.1%
Hangul
ValueCountFrequency (%)
54
 
3.2%
54
 
3.2%
42
 
2.5%
40
 
2.4%
37
 
2.2%
36
 
2.1%
35
 
2.1%
32
 
1.9%
31
 
1.8%
29
 
1.7%
Other values (258) 1309
77.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

저자
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
박시연
15 
김강현
12 
김미영
10 
송도수
임우영
 
5
Other values (26)
50 

Length

Max length8
Median length3
Mean length3.54
Min length2

Unique

Unique14 ?
Unique (%)14.0%

Sample

1st row김강현
2nd row박시연
3rd row박시연
4th row김강현
5th row박시연

Common Values

ValueCountFrequency (%)
박시연 15
15.0%
김강현 12
12.0%
김미영 10
 
10.0%
송도수 8
 
8.0%
임우영 5
 
5.0%
홍종현 5
 
5.0%
신태훈 4
 
4.0%
팝콘스토리 4
 
4.0%
김현수 4
 
4.0%
흔한남매 4
 
4.0%
Other values (21) 29
29.0%

Length

2023-12-12T21:23:50.125823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
박시연 15
13.9%
김강현 12
 
11.1%
김미영 10
 
9.3%
송도수 8
 
7.4%
임우영 5
 
4.6%
홍종현 5
 
4.6%
co 4
 
3.7%
신태훈 4
 
3.7%
팝콘스토리 4
 
3.7%
김현수 4
 
3.7%
Other values (23) 37
34.3%

발행처
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
아울북
34 
서울문화사
31 
아이세움
15 
위즈덤하우스
MiraeN·아이세움
Other values (7)
11 

Length

Max length11
Median length8
Mean length4.41
Min length3

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st row서울문화사
2nd row아울북
3rd row아울북
4th row서울문화사
5th row아울북

Common Values

ValueCountFrequency (%)
아울북 34
34.0%
서울문화사 31
31.0%
아이세움 15
15.0%
위즈덤하우스 5
 
5.0%
MiraeN·아이세움 4
 
4.0%
미래엔 3
 
3.0%
한솔수북 2
 
2.0%
미래엔 아이세움 2
 
2.0%
글송이 1
 
1.0%
아이휴먼 1
 
1.0%
Other values (2) 2
 
2.0%

Length

2023-12-12T21:23:50.319056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
아울북 34
33.3%
서울문화사 31
30.4%
아이세움 17
16.7%
위즈덤하우스 5
 
4.9%
미래엔 5
 
4.9%
miraen·아이세움 4
 
3.9%
한솔수북 2
 
2.0%
글송이 1
 
1.0%
아이휴먼 1
 
1.0%
다산어린이 1
 
1.0%

발행년
Real number (ℝ)

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.79
Minimum2013
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T21:23:50.458069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013
5-th percentile2015.95
Q12018
median2019
Q32020
95-th percentile2021
Maximum2021
Range8
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.6954782
Coefficient of variation (CV)0.00083984874
Kurtosis1.5847849
Mean2018.79
Median Absolute Deviation (MAD)1
Skewness-1.046348
Sum201879
Variance2.8746465
MonotonicityNot monotonic
2023-12-12T21:23:50.607525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
2019 27
27.0%
2020 21
21.0%
2018 20
20.0%
2021 15
15.0%
2017 8
 
8.0%
2016 4
 
4.0%
2015 3
 
3.0%
2013 2
 
2.0%
ValueCountFrequency (%)
2013 2
 
2.0%
2015 3
 
3.0%
2016 4
 
4.0%
2017 8
 
8.0%
2018 20
20.0%
2019 27
27.0%
2020 21
21.0%
2021 15
15.0%
ValueCountFrequency (%)
2021 15
15.0%
2020 21
21.0%
2019 27
27.0%
2018 20
20.0%
2017 8
 
8.0%
2016 4
 
4.0%
2015 3
 
3.0%
2013 2
 
2.0%

대출건수
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.21
Minimum28
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T21:23:50.775251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum28
5-th percentile28
Q129
median30
Q331
95-th percentile33
Maximum45
Range17
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.3063154
Coefficient of variation (CV)0.076342782
Kurtosis16.085877
Mean30.21
Median Absolute Deviation (MAD)1
Skewness2.9796195
Sum3021
Variance5.3190909
MonotonicityDecreasing
2023-12-12T21:23:50.911946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
29 28
28.0%
30 20
20.0%
28 19
19.0%
32 11
 
11.0%
33 9
 
9.0%
31 9
 
9.0%
34 2
 
2.0%
45 1
 
1.0%
36 1
 
1.0%
ValueCountFrequency (%)
28 19
19.0%
29 28
28.0%
30 20
20.0%
31 9
 
9.0%
32 11
 
11.0%
33 9
 
9.0%
34 2
 
2.0%
36 1
 
1.0%
45 1
 
1.0%
ValueCountFrequency (%)
45 1
 
1.0%
36 1
 
1.0%
34 2
 
2.0%
33 9
 
9.0%
32 11
 
11.0%
31 9
 
9.0%
30 20
20.0%
29 28
28.0%
28 19
19.0%

Interactions

2023-12-12T21:23:48.235317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:47.621945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:47.893427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:48.382824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:47.702292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:47.989578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:48.521876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:47.795715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:48.100672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:23:51.027254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순위서명저자발행처발행년대출건수
순위1.0001.0000.0000.0000.3120.792
서명1.0001.0001.0001.0001.0001.000
저자0.0001.0001.0000.9910.6750.000
발행처0.0001.0000.9911.0000.3280.000
발행년0.3121.0000.6750.3281.0000.110
대출건수0.7921.0000.0000.0000.1101.000
2023-12-12T21:23:51.160873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
저자발행처
저자1.0000.817
발행처0.8171.000
2023-12-12T21:23:51.291436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순위발행년대출건수저자발행처
순위1.0000.012-0.9800.0000.000
발행년0.0121.0000.0140.4290.145
대출건수-0.9800.0141.0000.0000.000
저자0.0000.4290.0001.0000.817
발행처0.0000.1450.0000.8171.000

Missing values

2023-12-12T21:23:48.705518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:23:48.871652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순위서명저자발행처발행년대출건수
01쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 3 , 도시 편김강현서울문화사201745
12그리스 로마 신화 : 만화로 읽는 초등 인문학 . 13 , 도도한 여신, 아르테미스의 원칙박시연아울북201936
23그리스 로마 신화 . 12 , 에로스와 프시케의 진정한 사랑박시연아울북201934
34쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 27 , 로봇의 심장편김강현서울문화사201934
45그리스 로마 신화 . 21 , 신이 선택한 인간, 헤라클레스의 탄생박시연아울북202133
56쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 1 , 정글편김강현서울문화사201833
67브레드 이발소 . 3 , 베이커리타운의 악동들 = Bread Barbershop몬스터스튜디오한솔수북202133
78쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 26 , 사이보그의 역습편김강현서울문화사201933
89문방구 TV . 8 , 유머 대탐구문방구 TV서울문화사202133
910레이튼 미스터리 탐정사무소 : 카트리에일의 수수께끼 파일 . 2 , 행운의 사나이안치현미래엔201933
순위서명저자발행처발행년대출건수
9091(코믹 메이플스토리) 수학도둑 : [창의편] . 53송도수서울문화사201628
9192쿠키런 서바이벌 대작전 . 28 , 최후의 생존자 편김강현서울문화사201928
9293Go Go 카카오 프렌즈 . 5 , 중국(China)김미영아울북201828
9394(손오공의 한자 대탐험) 마법천자문 . 33 , 향해라! 향할 향올댓스토리아울북201728
9495(코믹 메이플스토리)수학도둑 . 62송도수서울문화사201828
9596쿠키런 탈출게임 과학 상식 : 게임 속 배경이 그대로! 탈출 과학 상식 25임우영서울문화사201828
9697Go Go 카카오 프렌즈 . 13 , 호주(Australia)김미영아울북202028
9798그리스 로마 신화 : 만화로 읽는 초등 인문학 . 10 , 영웅의 전설, 카드모스의 대가박시연아울북201928
9899(Who? special) 유재석김성재다산어린이201928
99100구해줘 카카오프렌즈 . 2 , 과학박영희메가스터디201928