Overview

Dataset statistics

Number of variables: 5
Number of observations: 34
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 44.9 B

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

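Description: Provides data items on the books selected as Book of the Year by the Gimhae City Integrated Library (김해시 통합도서관): selection year (선정년도), category (구분), title (도서명), author (작가), publisher (출판사), etc.
Author: Gimhae City, Gyeongsangnam-do (경상남도 김해시)
URL: https://www.data.go.kr/data/15065466/fileData.do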

Alerts

도서명 has unique values (Unique)
작가 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 22:03:16.217357
Analysis finished: 2023-12-12 22:03:16.705060
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

년도
Real number (ℝ)

Distinct: 16
Distinct (%): 47.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2015.2941
Minimum: 2007
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 2007
5-th percentile: 2008
Q1: 2011.25
Median: 2015.5
Q3: 2019.75
95-th percentile: 2022
Maximum: 2022
Range: 15
Interquartile range (IQR): 8.5
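
The quartiles listed here can be cross-checked from the 년도 value counts. The following is a minimal sketch, assuming the report interpolates linearly between neighbouring order statistics (the `inclusive` method of Python's `statistics.quantiles`). The `year_counts` dictionary is reconstructed from the frequency tables in this report and is not part of the original output.

```python
from statistics import quantiles

# Year -> number of rows, reconstructed from the 년도 value counts in this report
# (2007 once, 2008-2019 twice each, 2020-2022 three times each).
year_counts = {
    2007: 1,
    **{year: 2 for year in range(2008, 2020)},
    **{year: 3 for year in range(2020, 2023)},
}

# Expand the counts back into the sorted column of observations.
years = sorted(year for year, n in year_counts.items() for _ in range(n))

# "inclusive" treats the data as the whole population and interpolates linearly.
q1, median, q3 = quantiles(years, n=4, method="inclusive")
iqr = q3 - q1
value_range = max(years) - min(years)

print(len(years), q1, median, q3, iqr, value_range)
# 34 2011.25 2015.5 2019.75 8.5 15
```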

Descriptive statistics

Standard deviation: 4.706729
Coefficient of variation (CV): 0.0023355047
Kurtosis: -1.2712363
Mean: 2015.2941
Median Absolute Deviation (MAD): 4.5
Skewness: -0.14633886
Sum: 68520
Variance: 22.153298
Monotonicity: Increasing
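
Several of these descriptive statistics can be recomputed from the 년도 value counts. This is a minimal sketch, assuming the variance and standard deviation are the sample versions (dividing by n - 1), which is the convention that matches the reported figures. The `year_counts` dictionary is reconstructed from the frequency tables in this report and is not part of the original output.

```python
from statistics import mean, median, stdev, variance

# Year -> number of rows, reconstructed from the 년도 value counts in this report.
year_counts = {
    2007: 1,
    **{year: 2 for year in range(2008, 2020)},
    **{year: 3 for year in range(2020, 2023)},
}
years = sorted(year for year, n in year_counts.items() for _ in range(n))

total = sum(years)
avg = mean(years)
var = variance(years)   # sample variance, n - 1 in the denominator
std = stdev(years)      # square root of the sample variance
cv = std / avg          # coefficient of variation
med = median(years)
mad = median(abs(year - med) for year in years)  # median absolute deviation

print(total, round(avg, 4), round(var, 6), round(std, 6), mad)
# 68520 2015.2941 22.153298 4.706729 4.5
```

The very small coefficient of variation follows directly from the scale of the data: a spread of a few years is tiny relative to a mean near 2015.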
Histogram with fixed size bins (bins=16)

Most frequent values
Value | Count | Frequency (%)
2022 | 3 | 8.8%
2020 | 3 | 8.8%
2021 | 3 | 8.8%
2015 | 2 | 5.9%
2009 | 2 | 5.9%
2010 | 2 | 5.9%
2011 | 2 | 5.9%
2012 | 2 | 5.9%
2013 | 2 | 5.9%
2014 | 2 | 5.9%
Other values (6) | 11 | 32.4%

Smallest 10 values
Value | Count | Frequency (%)
2007 | 1 | 2.9%
2008 | 2 | 5.9%
2009 | 2 | 5.9%
2010 | 2 | 5.9%
2011 | 2 | 5.9%
2012 | 2 | 5.9%
2013 | 2 | 5.9%
2014 | 2 | 5.9%
2015 | 2 | 5.9%
2016 | 2 | 5.9%

Largest 10 values
Value | Count | Frequency (%)
2022 | 3 | 8.8%
2021 | 3 | 8.8%
2020 | 3 | 8.8%
2019 | 2 | 5.9%
2018 | 2 | 5.9%
2017 | 2 | 5.9%
2016 | 2 | 5.9%
2015 | 2 | 5.9%
2014 | 2 | 5.9%
2013 | 2 | 5.9%

구분
Categorical

Distinct: 3
Distinct (%): 8.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
대표도서 (representative book): 16
어린이도서 (children's book): 15
시민작가도서 (citizen-author book): 3

Length

Max length: 6
Median length: 5
Mean length: 4.6176471
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대표도서
2nd row: 대표도서
3rd row: 어린이도서
4th row: 대표도서
5th row: 어린이도서

Common Values

Value | Count | Frequency (%)
대표도서 | 16 | 47.1%
어린이도서 | 15 | 44.1%
시민작가도서 | 3 | 8.8%
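
The frequency percentages and the length statistics for 구분 follow directly from the three category counts. A minimal sketch, assuming length is measured in Unicode code points (which is what Python's `len` returns for these precomposed Hangul strings); the variable names are illustrative only.

```python
# Category -> number of rows, taken from the Common Values table for 구분.
counts = {"대표도서": 16, "어린이도서": 15, "시민작가도서": 3}
total = sum(counts.values())

# Share of each category, rounded to one decimal place as in the report.
freq_pct = {name: round(100 * n / total, 1) for name, n in counts.items()}

# One length per row: each category label repeated by its count.
lengths = sorted(len(name) for name, n in counts.items() for _ in range(n))
mean_length = sum(lengths) / total

print(freq_pct)
print(min(lengths), max(lengths), round(mean_length, 7))
# 4 6 4.6176471
```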

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the category frequencies listed under Common Values (대표도서 16, 어린이도서 15, 시민작가도서 3).

도서명
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 18
Median length: 14
Mean length: 9.1764706
Min length: 3

Characters and Unicode

Total characters: 312
Distinct characters: 149
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
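
As an illustration of how such properties can be read off, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample titles shown in this report, so the totals differ from the full-column figures; the `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced for this example, not part of ydata-profiling.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the long names used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(texts):
    """Count characters per Unicode general category across all texts."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample rows listed for 도서명 (not the full 34-row column).
sample_titles = ["제4의제국", "완득이", "그대를 사랑합니다", "엄마를 부탁해", "멀쩡한 이유정"]

for name, n in category_counts(sample_titles).most_common():
    print(name, n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the text columns in this dataset.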

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 제4의제국
2nd row: 완득이
3rd row: 그대를 사랑합니다
4th row: 엄마를 부탁해
5th row: 멀쩡한 이유정

Most frequent words
Value | Count | Frequency (%)
카메라 | 2 | 2.2%
(value not preserved) | 2 | 2.2%
부탁해 | 2 | 2.2%
품은 | 1 | 1.1%
당연하지 | 1 | 1.1%
불행은 | 1 | 1.1%
우리의 | 1 | 1.1%
기계 | 1 | 1.1%
뽑기 | 1 | 1.1%
없는 | 1 | 1.1%
Other values (78) | 78 | 85.7%

Most occurring characters

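Value | Count | Frequency (%)
(space) | 57 | 18.3%
(character not preserved) | 8 | 2.6%
(character not preserved) | 7 | 2.2%
(character not preserved) | 6 | 1.9%
(character not preserved) | 5 | 1.6%
(character not preserved) | 5 | 1.6%
(character not preserved) | 4 | 1.3%
(character not preserved) | 4 | 1.3%
(character not preserved) | 4 | 1.3%
(character not preserved) | 4 | 1.3%
Other values (139) | 208 | 66.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 253 | 81.1%
Space Separator | 57 | 18.3%
Other Punctuation | 1 | 0.3%
Decimal Number | 1 | 0.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(character not preserved) | 8 | 3.2%
(character not preserved) | 7 | 2.8%
(character not preserved) | 6 | 2.4%
(character not preserved) | 5 | 2.0%
(character not preserved) | 5 | 2.0%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
Other values (136) | 202 | 79.8%
Space Separator
Value | Count | Frequency (%)
(space) | 57 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%
Decimal Number
Value | Count | Frequency (%)
4 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 253 | 81.1%
Common | 59 | 18.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(character not preserved) | 8 | 3.2%
(character not preserved) | 7 | 2.8%
(character not preserved) | 6 | 2.4%
(character not preserved) | 5 | 2.0%
(character not preserved) | 5 | 2.0%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
Other values (136) | 202 | 79.8%
Common
Value | Count | Frequency (%)
(space) | 57 | 96.6%
, | 1 | 1.7%
4 | 1 | 1.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 253 | 81.1%
ASCII | 59 | 18.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 57 | 96.6%
, | 1 | 1.7%
4 | 1 | 1.7%
Hangul
Value | Count | Frequency (%)
(character not preserved) | 8 | 3.2%
(character not preserved) | 7 | 2.8%
(character not preserved) | 6 | 2.4%
(character not preserved) | 5 | 2.0%
(character not preserved) | 5 | 2.0%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
(character not preserved) | 4 | 1.6%
Other values (136) | 202 | 79.8%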

작가
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 13
Median length: 3
Mean length: 3.4117647
Min length: 2

Characters and Unicode

Total characters: 116
Distinct characters: 71
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 최인호
2nd row: 김려령
3rd row: 강풀
4th row: 신경숙
5th row: 유은실

Most frequent words
Value | Count | Frequency (%)
최인호 | 1 | 2.6%
김려령 | 1 | 2.6%
금동건 | 1 | 2.6%
곽유진 | 1 | 2.6%
(value not preserved) | 1 | 2.6%
차상미 | 1 | 2.6%
그림 | 1 | 2.6%
김누리 | 1 | 2.6%
유행두 | 1 | 2.6%
김정선 | 1 | 2.6%
Other values (28) | 28 | 73.7%

Most occurring characters

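Value | Count | Frequency (%)
(character not preserved) | 11 | 9.5%
(character not preserved) | 4 | 3.4%
(character not preserved) | 4 | 3.4%
(character not preserved) | 4 | 3.4%
(character not preserved) | 3 | 2.6%
(character not preserved) | 3 | 2.6%
(character not preserved) | 3 | 2.6%
(character not preserved) | 3 | 2.6%
(character not preserved) | 3 | 2.6%
(character not preserved) | 3 | 2.6%
Other values (61) | 75 | 64.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 110 | 94.8%
Space Separator | 4 | 3.4%
Other Punctuation | 2 | 1.7%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(character not preserved) | 11 | 10.0%
(character not preserved) | 4 | 3.6%
(character not preserved) | 4 | 3.6%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 2 | 1.8%
Other values (59) | 71 | 64.5%
Space Separator
Value | Count | Frequency (%)
(space) | 4 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 110 | 94.8%
Common | 6 | 5.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(character not preserved) | 11 | 10.0%
(character not preserved) | 4 | 3.6%
(character not preserved) | 4 | 3.6%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 2 | 1.8%
Other values (59) | 71 | 64.5%
Common
Value | Count | Frequency (%)
(space) | 4 | 66.7%
, | 2 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 110 | 94.8%
ASCII | 6 | 5.2%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(character not preserved) | 11 | 10.0%
(character not preserved) | 4 | 3.6%
(character not preserved) | 4 | 3.6%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 3 | 2.7%
(character not preserved) | 2 | 1.8%
Other values (59) | 71 | 64.5%
ASCII
Value | Count | Frequency (%)
(space) | 4 | 66.7%
, | 2 | 33.3%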

출판사
Text

Distinct: 22
Distinct (%): 64.7%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 6
Median length: 5
Mean length: 3.1764706
Min length: 2

Characters and Unicode

Total characters: 108
Distinct characters: 60
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 17
Unique (%): 50.0%

Sample

1st row: 여백
2nd row: 창비
3rd row: 문학세계사
4th row: 창비
5th row: 푸른숲

Most frequent words
Value | Count | Frequency (%)
창비 | 6 | 17.6%
비룡소 | 4 | 11.8%
문학동네 | 3 | 8.8%
푸른숲 | 2 | 5.9%
해냄 | 2 | 5.9%
와이즈베리 | 1 | 2.9%
여백 | 1 | 2.9%
위즈덤하우스 | 1 | 2.9%
교음사 | 1 | 2.9%
키다리 | 1 | 2.9%
Other values (12) | 12 | 35.3%

Most occurring characters

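Value | Count | Frequency (%)
(character not preserved) | 10 | 9.3%
(character not preserved) | 6 | 5.6%
(character not preserved) | 5 | 4.6%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
Other values (50) | 62 | 57.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 108 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(character not preserved) | 10 | 9.3%
(character not preserved) | 6 | 5.6%
(character not preserved) | 5 | 4.6%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
Other values (50) | 62 | 57.4%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 108 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(character not preserved) | 10 | 9.3%
(character not preserved) | 6 | 5.6%
(character not preserved) | 5 | 4.6%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
Other values (50) | 62 | 57.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 108 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(character not preserved) | 10 | 9.3%
(character not preserved) | 6 | 5.6%
(character not preserved) | 5 | 4.6%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 4 | 3.7%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
(character not preserved) | 3 | 2.8%
Other values (50) | 62 | 57.4%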

Interactions


Correlations

Variable | 년도 | 구분 | 도서명 | 작가 | 출판사
년도 | 1.000 | 0.000 | 1.000 | 1.000 | 0.822
구분 | 0.000 | 1.000 | 1.000 | 1.000 | 0.954
도서명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
작가 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
출판사 | 0.822 | 0.954 | 1.000 | 1.000 | 1.000

Variable | 년도 | 구분
년도 | 1.000 | 0.000
구분 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

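First rows
Row | 년도 | 구분 | 도서명 | 작가 | 출판사
0 | 2007 | 대표도서 | 제4의제국 | 최인호 | 여백
1 | 2008 | 대표도서 | 완득이 | 김려령 | 창비
2 | 2008 | 어린이도서 | 그대를 사랑합니다 | 강풀 | 문학세계사
3 | 2009 | 대표도서 | 엄마를 부탁해 | 신경숙 | 창비
4 | 2009 | 어린이도서 | 멀쩡한 이유정 | 유은실 | 푸른숲
5 | 2010 | 대표도서 | 그건 사랑이었네 | 한비야 | 푸른숲
6 | 2010 | 어린이도서 | 도서관벌레와 도서관 벌레 | 김미애 | 푸른정원
7 | 2011 | 대표도서 | 고릴라는 핸드폰을 미워해 | 박경화 | 북센스
8 | 2011 | 어린이도서 | 얼음소년 | 조원희 | 느림보
9 | 2012 | 대표도서 | 두근두근 내 인생 | 김애란 | 창비

Last rows
Row | 년도 | 구분 | 도서명 | 작가 | 출판사
24 | 2019 | 어린이도서 | 숲으로 간 사람들 | 안지혜, 김하나 | 창비
25 | 2020 | 대표도서 | 우리가 빛의 속도로 갈 수 없다면 | 김초엽 | 허블
26 | 2020 | 어린이도서 | 숨바꼭질 | 김정선 | 사계절
27 | 2020 | 시민작가도서 | 독립군이 된 류타 | 유행두 | 키다리
28 | 2021 | 대표도서 | 우리의 불행은 당연하지 않습니다 | 김누리 | 해냄
29 | 2021 | 어린이도서 | 꽝 없는 뽑기 기계 | 곽유진 글, 차상미 그림 | 비룡소
30 | 2021 | 시민작가도서 | 시를 품은 내 가슴 | 금동건 | 교음사
31 | 2022 | 대표도서 | 알로하 나의 엄마들 | 이금이 | 창비
32 | 2022 | 어린이도서 | 동희의 오늘 | 임은하 | 문학동네
33 | 2022 | 시민작가도서 | 신기한 물꼭지 | 어영수 | 웃는돌고래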
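
The alerts and the monotonicity flag reported earlier can be spot-checked against the sample rows. This is a minimal sketch over only the 20 rows visible in this sample (rows 0 to 9 and 24 to 33), not the full 34-row dataset. The split of each flattened row into fields is inferred from the sample values listed per variable, so treat the field boundaries as an assumption.

```python
from collections import Counter

# (년도, 구분, 도서명, 작가, 출판사) for the 20 rows visible in the sample.
rows = [
    (2007, "대표도서", "제4의제국", "최인호", "여백"),
    (2008, "대표도서", "완득이", "김려령", "창비"),
    (2008, "어린이도서", "그대를 사랑합니다", "강풀", "문학세계사"),
    (2009, "대표도서", "엄마를 부탁해", "신경숙", "창비"),
    (2009, "어린이도서", "멀쩡한 이유정", "유은실", "푸른숲"),
    (2010, "대표도서", "그건 사랑이었네", "한비야", "푸른숲"),
    (2010, "어린이도서", "도서관벌레와 도서관 벌레", "김미애", "푸른정원"),
    (2011, "대표도서", "고릴라는 핸드폰을 미워해", "박경화", "북센스"),
    (2011, "어린이도서", "얼음소년", "조원희", "느림보"),
    (2012, "대표도서", "두근두근 내 인생", "김애란", "창비"),
    (2019, "어린이도서", "숲으로 간 사람들", "안지혜, 김하나", "창비"),
    (2020, "대표도서", "우리가 빛의 속도로 갈 수 없다면", "김초엽", "허블"),
    (2020, "어린이도서", "숨바꼭질", "김정선", "사계절"),
    (2020, "시민작가도서", "독립군이 된 류타", "유행두", "키다리"),
    (2021, "대표도서", "우리의 불행은 당연하지 않습니다", "김누리", "해냄"),
    (2021, "어린이도서", "꽝 없는 뽑기 기계", "곽유진 글, 차상미 그림", "비룡소"),
    (2021, "시민작가도서", "시를 품은 내 가슴", "금동건", "교음사"),
    (2022, "대표도서", "알로하 나의 엄마들", "이금이", "창비"),
    (2022, "어린이도서", "동희의 오늘", "임은하", "문학동네"),
    (2022, "시민작가도서", "신기한 물꼭지", "어영수", "웃는돌고래"),
]

years = [row[0] for row in rows]

# 년도 never decreases from one row to the next.
years_non_decreasing = all(a <= b for a, b in zip(years, years[1:]))

# 도서명 and 작가 contain no repeated values among the visible rows.
titles_unique = len({row[2] for row in rows}) == len(rows)
authors_unique = len({row[3] for row in rows}) == len(rows)

# Category counts among the visible rows only.
category_counts = Counter(row[1] for row in rows)

print(years_non_decreasing, titles_unique, authors_unique)
print(category_counts)
```

Within these rows, 시민작가도서 first appears in 2020, which is consistent with its count of 3 across the years 2020 to 2022.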