Overview

Dataset statistics

Number of variables5
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory45.3 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description김해시 통합도서관 올해의 책 선정도서 (선정년도, 구분, 도서명, 작가, 출판사 등)에 대한 데이터 항목을 제공합니다.
Author경상남도 김해시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15065466

Alerts

도서명 has unique valuesUnique
작가 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:17:13.010028
Analysis finished2023-12-11 00:17:13.583606
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Real number (ℝ)

Distinct15
Distinct (%)48.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2014.6452
Minimum2007
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-11T09:17:13.641636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2007
5-th percentile2008
Q12011
median2015
Q32018.5
95-th percentile2021
Maximum2021
Range14
Interquartile range (IQR)7.5

Descriptive statistics

Standard deviation4.4086913
Coefficient of variation (CV)0.0021883215
Kurtosis-1.2614457
Mean2014.6452
Median Absolute Deviation (MAD)4
Skewness-0.097677526
Sum62454
Variance19.436559
MonotonicityNot monotonic
2023-12-11T09:17:13.772679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
2020 3
 
9.7%
2021 3
 
9.7%
2008 2
 
6.5%
2009 2
 
6.5%
2010 2
 
6.5%
2011 2
 
6.5%
2012 2
 
6.5%
2013 2
 
6.5%
2014 2
 
6.5%
2015 2
 
6.5%
Other values (5) 9
29.0%
ValueCountFrequency (%)
2007 1
3.2%
2008 2
6.5%
2009 2
6.5%
2010 2
6.5%
2011 2
6.5%
2012 2
6.5%
2013 2
6.5%
2014 2
6.5%
2015 2
6.5%
2016 2
6.5%
ValueCountFrequency (%)
2021 3
9.7%
2020 3
9.7%
2019 2
6.5%
2018 2
6.5%
2017 2
6.5%
2016 2
6.5%
2015 2
6.5%
2014 2
6.5%
2013 2
6.5%
2012 2
6.5%

구분
Categorical

Distinct3
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Memory size380.0 B
대표도서
15 
어린이도서
14 
시민작가도서

Length

Max length6
Median length5
Mean length4.5806452
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row어린이도서
2nd row어린이도서
3rd row어린이도서
4th row어린이도서
5th row어린이도서

Common Values

ValueCountFrequency (%)
대표도서 15
48.4%
어린이도서 14
45.2%
시민작가도서 2
 
6.5%

Length

2023-12-11T09:17:13.938653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:17:14.080380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대표도서 15
48.4%
어린이도서 14
45.2%
시민작가도서 2
 
6.5%

도서명
Text

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-11T09:17:14.343976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length9.3225806
Min length3

Characters and Unicode

Total characters289
Distinct characters143
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st row그대를 사랑합니다
2nd row멀쩡한 이유정
3rd row도서관벌레와 도서관 벌레
4th row얼음소년
5th row코끼리 아줌마의 햇살도서관
ValueCountFrequency (%)
2
 
2.4%
부탁해 2
 
2.4%
카메라 2
 
2.4%
미워해 1
 
1.2%
된다 1
 
1.2%
어른이 1
 
1.2%
흔들려야 1
 
1.2%
천번을 1
 
1.2%
인생 1
 
1.2%
두근두근 1
 
1.2%
Other values (71) 71
84.5%
2023-12-11T09:17:14.718322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53
 
18.3%
8
 
2.8%
6
 
2.1%
5
 
1.7%
5
 
1.7%
5
 
1.7%
4
 
1.4%
4
 
1.4%
4
 
1.4%
4
 
1.4%
Other values (133) 191
66.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 234
81.0%
Space Separator 53
 
18.3%
Decimal Number 1
 
0.3%
Other Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
3.4%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (130) 185
79.1%
Space Separator
ValueCountFrequency (%)
53
100.0%
Decimal Number
ValueCountFrequency (%)
4 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 234
81.0%
Common 55
 
19.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
3.4%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (130) 185
79.1%
Common
ValueCountFrequency (%)
53
96.4%
4 1
 
1.8%
, 1
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 234
81.0%
ASCII 55
 
19.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53
96.4%
4 1
 
1.8%
, 1
 
1.8%
Hangul
ValueCountFrequency (%)
8
 
3.4%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (130) 185
79.1%

작가
Text

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-11T09:17:14.923911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.4516129
Min length2

Characters and Unicode

Total characters107
Distinct characters69
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st row강풀
2nd row유은실
3rd row김미애
4th row조원희
5th row김혜연
ValueCountFrequency (%)
강풀 1
 
2.9%
김난도 1
 
2.9%
최인호 1
 
2.9%
김려령 1
 
2.9%
신경숙 1
 
2.9%
한비야 1
 
2.9%
박경화 1
 
2.9%
김애란 1
 
2.9%
이동원 1
 
2.9%
유행두 1
 
2.9%
Other values (25) 25
71.4%
2023-12-11T09:17:15.229634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
10.3%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
Other values (59) 69
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 101
94.4%
Space Separator 4
 
3.7%
Other Punctuation 2
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
10.9%
4
 
4.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (57) 65
64.4%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 101
94.4%
Common 6
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
10.9%
4
 
4.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (57) 65
64.4%
Common
ValueCountFrequency (%)
4
66.7%
, 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 101
94.4%
ASCII 6
 
5.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
10.9%
4
 
4.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (57) 65
64.4%
ASCII
ValueCountFrequency (%)
4
66.7%
, 2
33.3%
Distinct21
Distinct (%)67.7%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-11T09:17:15.417124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length3.1290323
Min length2

Characters and Unicode

Total characters97
Distinct characters55
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)51.6%

Sample

1st row문학세계사
2nd row푸른숲
3rd row푸른정원
4th row느림보
5th row비룡소
ValueCountFrequency (%)
창비 5
16.1%
비룡소 4
 
12.9%
해냄 2
 
6.5%
문학동네 2
 
6.5%
푸른숲 2
 
6.5%
키다리 1
 
3.2%
문학세계사 1
 
3.2%
북센스 1
 
3.2%
와이즈베리 1
 
3.2%
위즈덤하우스 1
 
3.2%
Other values (11) 11
35.5%
2023-12-11T09:17:15.694263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
9.3%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (45) 55
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 97
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
9.3%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (45) 55
56.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 97
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
9.3%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (45) 55
56.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 97
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
9.3%
5
 
5.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (45) 55
56.7%

Interactions

2023-12-11T09:17:13.322713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:17:15.776955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도구분도서명작가출판사
년도1.0000.0001.0001.0000.000
구분0.0001.0001.0001.0000.985
도서명1.0001.0001.0001.0001.000
작가1.0001.0001.0001.0001.000
출판사0.0000.9851.0001.0001.000
2023-12-11T09:17:15.897591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도구분
년도1.0000.000
구분0.0001.000

Missing values

2023-12-11T09:17:13.466937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:17:13.551121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도구분도서명작가출판사
02008어린이도서그대를 사랑합니다강풀문학세계사
12009어린이도서멀쩡한 이유정유은실푸른숲
22010어린이도서도서관벌레와 도서관 벌레김미애푸른정원
32011어린이도서얼음소년조원희느림보
42012어린이도서코끼리 아줌마의 햇살도서관김혜연비룡소
52013어린이도서시간가게이나영문학동네
62014어린이도서거짓말 같은 이야기강경수시공주니어
72015어린이도서어느 날 구두에게 생긴 일황선미비룡소
82016어린이도서여름이 반짝김수빈문학동네
92017어린이도서동생을 데리고 미술관에 갔어요박현경해와나무
년도구분도서명작가출판사
212012대표도서두근두근 내 인생김애란창비
222013대표도서천번을 흔들려야 어른이 된다김난도오우아
232014대표도서조금 다른 지구마을 여행이동원예담
242015대표도서투명인간성석제창비
252016대표도서카메라, 편견을 부탁해강윤중서해문집
262017대표도서한 스푼의 시간구병모위즈덤하우스
272018대표도서대리사회김민섭와이즈베리
282019대표도서당신이 옳다정혜신해냄
292020대표도서우리가 빛의 속도로 갈 수 없다면김초엽허블
302021대표도서우리의 불행은 당연하지 않습니다김누리해냄