Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)1.0%
Total size in memory5.1 KiB
Average record size in memory52.3 B

Variable types

Numeric3
Text1
Categorical2

Dataset

Description대전광역시 유성구 진잠도서관의 1년간 대출실적을 기준으로 선정한 베스트 도서대출 100선에 대한 데이터로 순위, 도서명, 저자, 출판사, 출판년, 대출횟수 등의 항목을 제공합니다.
Author대전광역시 유성구
URLhttps://www.data.go.kr/data/15053375/fileData.do

Alerts

Dataset has 1 (1.0%) duplicate rowsDuplicates
순위 is highly overall correlated with 대출횟수High correlation
출판년 is highly overall correlated with 저자 and 1 other fieldsHigh correlation
대출횟수 is highly overall correlated with 순위High correlation
저자 is highly overall correlated with 출판년 and 1 other fieldsHigh correlation
출판사 is highly overall correlated with 출판년 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 06:25:34.163090
Analysis finished2023-12-12 06:25:35.214529
Duration1.05 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순위
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.02
Minimum1
Maximum89
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T15:25:35.260832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.85
Q124
median48
Q374
95-th percentile89
Maximum89
Range88
Interquartile range (IQR)50

Descriptive statistics

Standard deviation27.166148
Coefficient of variation (CV)0.60342399
Kurtosis-1.1517335
Mean45.02
Median Absolute Deviation (MAD)25
Skewness0.11044498
Sum4502
Variance737.9996
MonotonicityIncreasing
2023-12-12T15:25:35.356803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
55 19
19.0%
74 15
15.0%
34 14
14.0%
89 12
12.0%
24 10
10.0%
11 7
 
7.0%
48 7
 
7.0%
18 6
 
6.0%
6 5
 
5.0%
3 3
 
3.0%
Other values (2) 2
 
2.0%
ValueCountFrequency (%)
1 1
 
1.0%
2 1
 
1.0%
3 3
 
3.0%
6 5
 
5.0%
11 7
 
7.0%
18 6
 
6.0%
24 10
10.0%
34 14
14.0%
48 7
 
7.0%
55 19
19.0%
ValueCountFrequency (%)
89 12
12.0%
74 15
15.0%
55 19
19.0%
48 7
 
7.0%
34 14
14.0%
24 10
10.0%
18 6
 
6.0%
11 7
 
7.0%
6 5
 
5.0%
3 3
 
3.0%

서명
Text

Distinct95
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-12T15:25:35.626106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length42
Mean length26.12
Min length8

Characters and Unicode

Total characters2612
Distinct characters306
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)90.0%

Sample

1st row흔한남매 . 5
2nd row흔한남매 . 6
3rd row(빈대 가족의) 생활의 달인
4th row구해줘 카카오프렌즈 한국사 . 1
5th row(흔한남매) 안 흔한 일기 . 2
ValueCountFrequency (%)
144
 
19.4%
go 18
 
2.4%
1 17
 
2.3%
학습만화 17
 
2.3%
빈대 14
 
1.9%
쿠키런 13
 
1.8%
가족의 13
 
1.8%
흔한남매 11
 
1.5%
과학 11
 
1.5%
문화 10
 
1.3%
Other values (244) 473
63.8%
2023-12-12T15:25:36.114576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
642
24.6%
. 73
 
2.8%
) 64
 
2.5%
( 64
 
2.5%
, 44
 
1.7%
40
 
1.5%
39
 
1.5%
38
 
1.5%
34
 
1.3%
32
 
1.2%
Other values (296) 1542
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1423
54.5%
Space Separator 642
24.6%
Other Punctuation 167
 
6.4%
Lowercase Letter 128
 
4.9%
Decimal Number 94
 
3.6%
Close Punctuation 65
 
2.5%
Open Punctuation 65
 
2.5%
Uppercase Letter 24
 
0.9%
Dash Punctuation 2
 
0.1%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
2.8%
39
 
2.7%
38
 
2.7%
34
 
2.4%
32
 
2.2%
28
 
2.0%
24
 
1.7%
24
 
1.7%
23
 
1.6%
22
 
1.5%
Other values (242) 1119
78.6%
Lowercase Letter
ValueCountFrequency (%)
o 22
17.2%
a 12
9.4%
i 12
9.4%
g 11
 
8.6%
n 11
 
8.6%
r 8
 
6.2%
e 8
 
6.2%
c 7
 
5.5%
p 5
 
3.9%
v 4
 
3.1%
Other values (11) 28
21.9%
Decimal Number
ValueCountFrequency (%)
1 26
27.7%
2 18
19.1%
4 11
11.7%
6 8
 
8.5%
3 8
 
8.5%
7 7
 
7.4%
5 6
 
6.4%
8 4
 
4.3%
9 4
 
4.3%
0 2
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
G 10
41.7%
S 4
 
16.7%
U 2
 
8.3%
F 2
 
8.3%
I 1
 
4.2%
K 1
 
4.2%
W 1
 
4.2%
J 1
 
4.2%
A 1
 
4.2%
X 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 73
43.7%
, 44
26.3%
: 31
18.6%
! 14
 
8.4%
? 4
 
2.4%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 64
98.5%
] 1
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 64
98.5%
[ 1
 
1.5%
Space Separator
ValueCountFrequency (%)
642
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
= 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1422
54.4%
Common 1037
39.7%
Latin 152
 
5.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
2.8%
39
 
2.7%
38
 
2.7%
34
 
2.4%
32
 
2.3%
28
 
2.0%
24
 
1.7%
24
 
1.7%
23
 
1.6%
22
 
1.5%
Other values (241) 1118
78.6%
Latin
ValueCountFrequency (%)
o 22
14.5%
a 12
 
7.9%
i 12
 
7.9%
g 11
 
7.2%
n 11
 
7.2%
G 10
 
6.6%
r 8
 
5.3%
e 8
 
5.3%
c 7
 
4.6%
p 5
 
3.3%
Other values (21) 46
30.3%
Common
ValueCountFrequency (%)
642
61.9%
. 73
 
7.0%
) 64
 
6.2%
( 64
 
6.2%
, 44
 
4.2%
: 31
 
3.0%
1 26
 
2.5%
2 18
 
1.7%
! 14
 
1.4%
4 11
 
1.1%
Other values (13) 50
 
4.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1422
54.4%
ASCII 1188
45.5%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
642
54.0%
. 73
 
6.1%
) 64
 
5.4%
( 64
 
5.4%
, 44
 
3.7%
: 31
 
2.6%
1 26
 
2.2%
o 22
 
1.9%
2 18
 
1.5%
! 14
 
1.2%
Other values (43) 190
 
16.0%
Hangul
ValueCountFrequency (%)
40
 
2.8%
39
 
2.7%
38
 
2.7%
34
 
2.4%
32
 
2.3%
28
 
2.0%
24
 
1.7%
24
 
1.7%
23
 
1.6%
22
 
1.5%
Other values (241) 1118
78.6%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

저자
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)36.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
흔한남매
11 
송도수
10 
김미영
신태훈
김강현
Other values (31)
54 

Length

Max length8
Median length3
Mean length3.26
Min length2

Unique

Unique19 ?
Unique (%)19.0%

Sample

1st row흔한남매
2nd row흔한남매
3rd row이봉기
4th row최태성
5th row흔한남매

Common Values

ValueCountFrequency (%)
흔한남매 11
 
11.0%
송도수 10
 
10.0%
김미영 9
 
9.0%
신태훈 8
 
8.0%
김강현 8
 
8.0%
한현동 8
 
8.0%
설민석 4
 
4.0%
임창호 4
 
4.0%
류수형 3
 
3.0%
조주희 2
 
2.0%
Other values (26) 33
33.0%

Length

2023-12-12T15:25:36.296680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
흔한남매 11
 
10.6%
송도수 10
 
9.6%
김미영 9
 
8.7%
신태훈 8
 
7.7%
김강현 8
 
7.7%
한현동 8
 
7.7%
설민석 4
 
3.8%
임창호 4
 
3.8%
류수형 3
 
2.9%
이봉기 2
 
1.9%
Other values (29) 37
35.6%

출판사
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)23.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울문화사
22 
재미북스
14 
아울북
11 
위즈덤하우스
10 
미래엔
Other values (18)
34 

Length

Max length12
Median length9
Mean length5.41
Min length2

Unique

Unique10 ?
Unique (%)10.0%

Sample

1st row미래엔
2nd row미래엔
3rd row재미북스
4th row대원키즈
5th row미래엔

Common Values

ValueCountFrequency (%)
서울문화사 22
22.0%
재미북스 14
14.0%
아울북 11
11.0%
위즈덤하우스 10
10.0%
미래엔 9
9.0%
Mirae N 아이세움 9
9.0%
주니어김영사 3
 
3.0%
길벗스쿨 2
 
2.0%
Mirea N 아이세움 2
 
2.0%
미래엔 아이세움 2
 
2.0%
Other values (13) 16
16.0%

Length

2023-12-12T15:25:36.422998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울문화사 22
17.6%
재미북스 14
11.2%
아이세움 13
10.4%
아울북 11
8.8%
미래엔 11
8.8%
n 11
8.8%
위즈덤하우스 10
8.0%
mirae 9
7.2%
주니어김영사 3
 
2.4%
miraen·아이세움 2
 
1.6%
Other values (15) 19
15.2%

출판년
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.53
Minimum2010
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T15:25:36.570386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2016
Q12018
median2019
Q32019
95-th percentile2020
Maximum2020
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.4103155
Coefficient of variation (CV)0.00069868445
Kurtosis12.844539
Mean2018.53
Median Absolute Deviation (MAD)1
Skewness-2.6859143
Sum201853
Variance1.9889899
MonotonicityNot monotonic
2023-12-12T15:25:36.739979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2019 45
45.0%
2018 20
20.0%
2020 19
19.0%
2017 10
 
10.0%
2016 3
 
3.0%
2015 2
 
2.0%
2010 1
 
1.0%
ValueCountFrequency (%)
2010 1
 
1.0%
2015 2
 
2.0%
2016 3
 
3.0%
2017 10
 
10.0%
2018 20
20.0%
2019 45
45.0%
2020 19
19.0%
ValueCountFrequency (%)
2020 19
19.0%
2019 45
45.0%
2018 20
20.0%
2017 10
 
10.0%
2016 3
 
3.0%
2015 2
 
2.0%
2010 1
 
1.0%

대출횟수
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.7
Minimum15
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T15:25:36.878661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile15
Q116
median16
Q317
95-th percentile20
Maximum24
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.6727163
Coefficient of variation (CV)0.10016265
Kurtosis2.8340484
Mean16.7
Median Absolute Deviation (MAD)1
Skewness1.4535026
Sum1670
Variance2.7979798
MonotonicityDecreasing
2023-12-12T15:25:37.019960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
16 34
34.0%
15 24
24.0%
17 18
18.0%
18 9
 
9.0%
19 7
 
7.0%
20 6
 
6.0%
24 1
 
1.0%
21 1
 
1.0%
ValueCountFrequency (%)
15 24
24.0%
16 34
34.0%
17 18
18.0%
18 9
 
9.0%
19 7
 
7.0%
20 6
 
6.0%
21 1
 
1.0%
24 1
 
1.0%
ValueCountFrequency (%)
24 1
 
1.0%
21 1
 
1.0%
20 6
 
6.0%
19 7
 
7.0%
18 9
 
9.0%
17 18
18.0%
16 34
34.0%
15 24
24.0%

Interactions

2023-12-12T15:25:34.851398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.468821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.651727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.912106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.533320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.715883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.985423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.595882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:25:34.780560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:25:37.111697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순위서명저자출판사출판년대출횟수
순위1.0000.8770.4590.5920.3490.949
서명0.8771.0001.0001.0000.9940.000
저자0.4591.0001.0000.9940.8490.000
출판사0.5921.0000.9941.0000.7200.000
출판년0.3490.9940.8490.7201.0000.187
대출횟수0.9490.0000.0000.0000.1871.000
2023-12-12T15:25:37.215462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판사저자
출판사1.0000.810
저자0.8101.000
2023-12-12T15:25:37.313865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순위출판년대출횟수저자출판사
순위1.000-0.264-0.9620.1490.256
출판년-0.2641.0000.4000.5450.509
대출횟수-0.9620.4001.0000.0000.000
저자0.1490.5450.0001.0000.810
출판사0.2560.5090.0000.8101.000

Missing values

2023-12-12T15:25:35.075435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:25:35.178188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순위서명저자출판사출판년대출횟수
01흔한남매 . 5흔한남매미래엔202024
12흔한남매 . 6흔한남매미래엔202021
23(빈대 가족의) 생활의 달인이봉기재미북스201920
33구해줘 카카오프렌즈 한국사 . 1최태성대원키즈201920
43(흔한남매) 안 흔한 일기 . 2흔한남매미래엔202020
56(안녕 자두야)명작동화 : 효녀 자두& 엄지공주이빈학산문화사202020
66흔한남매 . 6흔한남매미래엔202020
76(흔한남매의) 흔한 호기심 . 1흔한남매미래엔202020
86놓지 마 과학! . 2 , 정신이 탁구에 정신 놓다신태훈위즈덤하우스201919
96쿠키런 서바이벌 대작전 : 안전상식 학습만화 . 1 , 정글편김강현서울문화사201819
순위서명저자출판사출판년대출횟수
9089(Go go) 카카오 프렌즈 : 세계 역사 문화 체험 학습만화 . 7 , 독일(Germany)김미영아울북201915
9189(Go go) 카카오 프렌즈 : 세계 역사 문화 체험 학습만화 . 8 , 인도(India)김미영아울북201915
9289화재에서 살아남기한현동Mirae N 아이세움201615
9389물 부족에서 살아남기한현동Mirae N 아이세움201715
9489인공지능 세계에서 살아남기 . 2한현동Mirae N 아이세움201715
9589(코믹 메이플스토리) 수학도둑 : 종합편 . 69송도수서울문화사201915
9689흔한남매 . 2백난도Mirea N 아이세움201915
9789아드님, 참으시어요강민경좋은책어린이201915
9889(빈대 가족의) 덜렁이는 미운 우리 새끼류수형재미북스201915
9989(Go go) 카카오 프렌즈 : 세계 역사 문화 체험 학습만화 . 1 , 프랑스(France)김미영아울북201815

Duplicate rows

Most frequently occurring

순위서명저자출판사출판년대출횟수# duplicates
018(흔한남매) 안 흔한 일기 . 1흔한남매미래엔2020182