Overview

Dataset statistics

Number of variables6
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory54.0 B

Variable types

Numeric2
Categorical1
Text3

Dataset

Description인천광역시 미추홀구 미추홀구의 책 선정도서에 대한 데이터로 선정연도, 도서명, 저자, 출판사 등의 데이터를 제공하고 있습니다.
Author인천광역시 미추홀구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15117502&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 선정연도High correlation
선정연도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
도서명 has unique valuesUnique

Reproduction

Analysis started2024-03-18 03:34:53.325526
Analysis finished2024-03-18 03:34:55.559723
Duration2.23 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2024-03-18T12:34:55.619436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.6
Q19
median17
Q325
95-th percentile31.4
Maximum33
Range32
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.6695398
Coefficient of variation (CV)0.56879646
Kurtosis-1.2
Mean17
Median Absolute Deviation (MAD)8
Skewness0
Sum561
Variance93.5
MonotonicityStrictly increasing
2024-03-18T12:34:55.726640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1
 
3.0%
26 1
 
3.0%
20 1
 
3.0%
21 1
 
3.0%
22 1
 
3.0%
23 1
 
3.0%
24 1
 
3.0%
25 1
 
3.0%
27 1
 
3.0%
2 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1 1
3.0%
2 1
3.0%
3 1
3.0%
4 1
3.0%
5 1
3.0%
6 1
3.0%
7 1
3.0%
8 1
3.0%
9 1
3.0%
10 1
3.0%
ValueCountFrequency (%)
33 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
29 1
3.0%
28 1
3.0%
27 1
3.0%
26 1
3.0%
25 1
3.0%
24 1
3.0%

선정연도
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018
Minimum2013
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2024-03-18T12:34:55.818382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013
5-th percentile2013
Q12015
median2018
Q32021
95-th percentile2023
Maximum2023
Range10
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.2113081
Coefficient of variation (CV)0.0015913321
Kurtosis-1.2208172
Mean2018
Median Absolute Deviation (MAD)3
Skewness0
Sum66594
Variance10.3125
MonotonicityIncreasing
2024-03-18T12:34:55.905301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2013 3
9.1%
2014 3
9.1%
2015 3
9.1%
2016 3
9.1%
2017 3
9.1%
2018 3
9.1%
2019 3
9.1%
2020 3
9.1%
2021 3
9.1%
2022 3
9.1%
ValueCountFrequency (%)
2013 3
9.1%
2014 3
9.1%
2015 3
9.1%
2016 3
9.1%
2017 3
9.1%
2018 3
9.1%
2019 3
9.1%
2020 3
9.1%
2021 3
9.1%
2022 3
9.1%
ValueCountFrequency (%)
2023 3
9.1%
2022 3
9.1%
2021 3
9.1%
2020 3
9.1%
2019 3
9.1%
2018 3
9.1%
2017 3
9.1%
2016 3
9.1%
2015 3
9.1%
2014 3
9.1%

구분
Categorical

Distinct3
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
아동
11 
청소년
11 
성인
11 

Length

Max length3
Median length2
Mean length2.3333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row아동
2nd row청소년
3rd row성인
4th row아동
5th row청소년

Common Values

ValueCountFrequency (%)
아동 11
33.3%
청소년 11
33.3%
성인 11
33.3%

Length

2024-03-18T12:34:56.002812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T12:34:56.084782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아동 11
33.3%
청소년 11
33.3%
성인 11
33.3%

도서명
Text

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2024-03-18T12:34:56.285565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length8.9090909
Min length3

Characters and Unicode

Total characters294
Distinct characters151
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)100.0%

Sample

1st row기호 3번 안석뽕
2nd row내 청춘, 시속 370km
3rd row두근두근 내 인생
4th row시간 가게
5th row어쨌든 밸런타인
ValueCountFrequency (%)
가게 2
 
2.4%
2
 
2.4%
우린 1
 
1.2%
페인트 1
 
1.2%
산다 1
 
1.2%
혹등고래가 1
 
1.2%
동네에 1
 
1.2%
우리 1
 
1.2%
했다 1
 
1.2%
살기로 1
 
1.2%
Other values (70) 70
85.4%
2024-03-18T12:34:56.602875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
16.7%
7
 
2.4%
6
 
2.0%
6
 
2.0%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (141) 196
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 229
77.9%
Space Separator 49
 
16.7%
Decimal Number 6
 
2.0%
Uppercase Letter 5
 
1.7%
Other Punctuation 3
 
1.0%
Lowercase Letter 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
3.1%
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.7%
Other values (128) 176
76.9%
Decimal Number
ValueCountFrequency (%)
3 2
33.3%
8 1
16.7%
2 1
16.7%
7 1
16.7%
0 1
16.7%
Uppercase Letter
ValueCountFrequency (%)
I 2
40.0%
A 2
40.0%
Q 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
. 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
m 1
50.0%
Space Separator
ValueCountFrequency (%)
49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 229
77.9%
Common 58
 
19.7%
Latin 7
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
3.1%
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.7%
Other values (128) 176
76.9%
Common
ValueCountFrequency (%)
49
84.5%
3 2
 
3.4%
, 2
 
3.4%
. 1
 
1.7%
8 1
 
1.7%
2 1
 
1.7%
7 1
 
1.7%
0 1
 
1.7%
Latin
ValueCountFrequency (%)
I 2
28.6%
A 2
28.6%
Q 1
14.3%
k 1
14.3%
m 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 229
77.9%
ASCII 65
 
22.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
49
75.4%
I 2
 
3.1%
A 2
 
3.1%
3 2
 
3.1%
, 2
 
3.1%
. 1
 
1.5%
Q 1
 
1.5%
8 1
 
1.5%
2 1
 
1.5%
7 1
 
1.5%
Other values (3) 3
 
4.6%
Hangul
ValueCountFrequency (%)
7
 
3.1%
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.7%
Other values (128) 176
76.9%

저자
Text

Distinct31
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size396.0 B
2024-03-18T12:34:56.764409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.969697
Min length2

Characters and Unicode

Total characters98
Distinct characters54
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)87.9%

Sample

1st row진형민
2nd row이송현
3rd row김애란
4th row이나영
5th row강윤화
ValueCountFrequency (%)
최영희 2
 
6.1%
박영란 2
 
6.1%
안수자 1
 
3.0%
진형민 1
 
3.0%
이경화 1
 
3.0%
나혜림 1
 
3.0%
정유소영 1
 
3.0%
황보름 1
 
3.0%
유은실 1
 
3.0%
노수미 1
 
3.0%
Other values (21) 21
63.6%
2024-03-18T12:34:57.047100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
6
 
6.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
2
 
2.0%
Other values (44) 57
58.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 98
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
6
 
6.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
2
 
2.0%
Other values (44) 57
58.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 98
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
6
 
6.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
2
 
2.0%
Other values (44) 57
58.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 98
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
6
 
6.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
2
 
2.0%
2
 
2.0%
Other values (44) 57
58.2%
Distinct17
Distinct (%)51.5%
Missing0
Missing (%)0.0%
Memory size396.0 B
2024-03-18T12:34:57.209986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length6
Mean length3.5454545
Min length2

Characters and Unicode

Total characters117
Distinct characters61
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)39.4%

Sample

1st row창비
2nd row사계절
3rd row창비
4th row문학동네
5th row창비
ValueCountFrequency (%)
창비 11
33.3%
문학동네 5
15.2%
휴먼어린이 2
 
6.1%
비룡소 2
 
6.1%
파랑새 1
 
3.0%
중앙출판사 1
 
3.0%
클레이하우스 1
 
3.0%
마루비 1
 
3.0%
인플루엔셜 1
 
3.0%
잇츠북 1
 
3.0%
Other values (7) 7
21.2%
2024-03-18T12:34:57.523750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
 
12.0%
11
 
9.4%
5
 
4.3%
5
 
4.3%
5
 
4.3%
5
 
4.3%
4
 
3.4%
3
 
2.6%
3
 
2.6%
2
 
1.7%
Other values (51) 60
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
12.0%
11
 
9.4%
5
 
4.3%
5
 
4.3%
5
 
4.3%
5
 
4.3%
4
 
3.4%
3
 
2.6%
3
 
2.6%
2
 
1.7%
Other values (51) 60
51.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 117
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
12.0%
11
 
9.4%
5
 
4.3%
5
 
4.3%
5
 
4.3%
5
 
4.3%
4
 
3.4%
3
 
2.6%
3
 
2.6%
2
 
1.7%
Other values (51) 60
51.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 117
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
12.0%
11
 
9.4%
5
 
4.3%
5
 
4.3%
5
 
4.3%
5
 
4.3%
4
 
3.4%
3
 
2.6%
3
 
2.6%
2
 
1.7%
Other values (51) 60
51.3%

Interactions

2024-03-18T12:34:55.304363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:34:55.062373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:34:55.367262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T12:34:55.225239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T12:34:57.602205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번선정연도구분도서명저자출판사
연번1.0000.9310.0001.0000.7980.357
선정연도0.9311.0000.0001.0000.8490.238
구분0.0000.0001.0001.0000.8780.544
도서명1.0001.0001.0001.0001.0001.000
저자0.7980.8490.8781.0001.0000.921
출판사0.3570.2380.5441.0000.9211.000
2024-03-18T12:34:57.683216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번선정연도구분
연번1.0000.9960.000
선정연도0.9961.0000.000
구분0.0000.0001.000

Missing values

2024-03-18T12:34:55.445033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T12:34:55.524351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번선정연도구분도서명저자출판사
012013아동기호 3번 안석뽕진형민창비
122013청소년내 청춘, 시속 370km이송현사계절
232013성인두근두근 내 인생김애란창비
342014아동시간 가게이나영문학동네
452014청소년어쨌든 밸런타인강윤화창비
562014성인뒤늦게 발동걸린 인생들의 이야기김덕영다큐스토리
672015아동악당의 무게이현휴먼어린이
782015청소년첫키스는 엘프와최영희푸른책들
892015성인투명인간성석제창비
9102016아동돌 씹어 먹는 아이송미경문학동네
연번선정연도구분도서명저자출판사
23242020성인선량한 차별주의자김지혜창비
24252021아동조용한 마을의 공유경제 소동안선모파랑새
25262021청소년게스트하우스 Q박영란창비
26272021성인우리가 인생이라 부르는 것들정재찬인플루엔셜
27282022아동AI디케노수미마루비
28292022청소년순례주택유은실비룡소
29302022성인어서오세요, 휴남동서점입니다.황보름클레이하우스
30312023아동아무네 가게정유소영고래가숨쉬는도서관
31322023청소년클로버나혜림창비
32332023성인이토록 평범한 미래김연수문학동네