Overview

Dataset statistics

Number of variables7
Number of observations158
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.1 KiB
Average record size in memory58.8 B

Variable types

Numeric2
Categorical2
Text3

Dataset

Description인천광역시 남동구 도서관의 추천도서에 대한 데이터로 연번, 도서관구분, 책제목, 저자, 출판사, 책위치, 쪽수번호 항목을 제공합니다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15040801&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 도서관구분High correlation
도서관구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 16:16:34.989056
Analysis finished2024-01-28 16:16:35.870059
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct158
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean79.5
Minimum1
Maximum158
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-01-29T01:16:35.924389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.85
Q140.25
median79.5
Q3118.75
95-th percentile150.15
Maximum158
Range157
Interquartile range (IQR)78.5

Descriptive statistics

Standard deviation45.754781
Coefficient of variation (CV)0.57553184
Kurtosis-1.2
Mean79.5
Median Absolute Deviation (MAD)39.5
Skewness0
Sum12561
Variance2093.5
MonotonicityStrictly increasing
2024-01-29T01:16:36.032603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
110 1
 
0.6%
103 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
111 1
 
0.6%
Other values (148) 148
93.7%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%
154 1
0.6%
153 1
0.6%
152 1
0.6%
151 1
0.6%
150 1
0.6%
149 1
0.6%

도서관구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
남동논현도서관
52 
서창도서관
47 
간석3동 어린이도서관
27 
소래도서관
22 
만수2동어린이도서관
10 

Length

Max length11
Median length10
Mean length7
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남동논현도서관
2nd row남동논현도서관
3rd row남동논현도서관
4th row남동논현도서관
5th row남동논현도서관

Common Values

ValueCountFrequency (%)
남동논현도서관 52
32.9%
서창도서관 47
29.7%
간석3동 어린이도서관 27
17.1%
소래도서관 22
13.9%
만수2동어린이도서관 10
 
6.3%

Length

2024-01-29T01:16:36.142453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T01:16:36.253523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남동논현도서관 52
28.1%
서창도서관 47
25.4%
간석3동 27
14.6%
어린이도서관 27
14.6%
소래도서관 22
11.9%
만수2동어린이도서관 10
 
5.4%
Distinct156
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-29T01:16:36.547194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length23
Mean length11.911392
Min length3

Characters and Unicode

Total characters1882
Distinct characters381
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique154 ?
Unique (%)97.5%

Sample

1st row출근길엔 니체, 퇴근길엔 장자
2nd row어른의 중력:생의 1/4 승강장에 도착한 어린 어른을 위한 심리학
3rd row너와 나의 야자 시간
4th row마법의 사탕 한 알
5th row아시아엔 다 있다!
ValueCountFrequency (%)
너에게 6
 
1.1%
나는 4
 
0.7%
고양이 4
 
0.7%
읽는 4
 
0.7%
이상한 4
 
0.7%
4
 
0.7%
3
 
0.6%
위한 3
 
0.6%
1 3
 
0.6%
3
 
0.6%
Other values (459) 497
92.9%
2024-01-29T01:16:36.952496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
377
 
20.0%
43
 
2.3%
43
 
2.3%
42
 
2.2%
33
 
1.8%
27
 
1.4%
26
 
1.4%
25
 
1.3%
22
 
1.2%
22
 
1.2%
Other values (371) 1222
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1423
75.6%
Space Separator 377
 
20.0%
Other Punctuation 34
 
1.8%
Decimal Number 33
 
1.8%
Open Punctuation 6
 
0.3%
Close Punctuation 6
 
0.3%
Uppercase Letter 2
 
0.1%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
3.0%
43
 
3.0%
42
 
3.0%
33
 
2.3%
27
 
1.9%
26
 
1.8%
25
 
1.8%
22
 
1.5%
22
 
1.5%
19
 
1.3%
Other values (353) 1121
78.8%
Decimal Number
ValueCountFrequency (%)
1 15
45.5%
0 8
24.2%
2 4
 
12.1%
4 3
 
9.1%
3 2
 
6.1%
6 1
 
3.0%
Other Punctuation
ValueCountFrequency (%)
: 13
38.2%
, 9
26.5%
! 4
 
11.8%
. 3
 
8.8%
? 3
 
8.8%
/ 2
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
I 1
50.0%
Space Separator
ValueCountFrequency (%)
377
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1423
75.6%
Common 457
 
24.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
3.0%
43
 
3.0%
42
 
3.0%
33
 
2.3%
27
 
1.9%
26
 
1.8%
25
 
1.8%
22
 
1.5%
22
 
1.5%
19
 
1.3%
Other values (353) 1121
78.8%
Common
ValueCountFrequency (%)
377
82.5%
1 15
 
3.3%
: 13
 
2.8%
, 9
 
2.0%
0 8
 
1.8%
( 6
 
1.3%
) 6
 
1.3%
2 4
 
0.9%
! 4
 
0.9%
. 3
 
0.7%
Other values (6) 12
 
2.6%
Latin
ValueCountFrequency (%)
A 1
50.0%
I 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1423
75.6%
ASCII 458
 
24.3%
Letterlike Symbols 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
377
82.3%
1 15
 
3.3%
: 13
 
2.8%
, 9
 
2.0%
0 8
 
1.7%
( 6
 
1.3%
) 6
 
1.3%
2 4
 
0.9%
! 4
 
0.9%
. 3
 
0.7%
Other values (7) 13
 
2.8%
Hangul
ValueCountFrequency (%)
43
 
3.0%
43
 
3.0%
42
 
3.0%
33
 
2.3%
27
 
1.9%
26
 
1.8%
25
 
1.8%
22
 
1.5%
22
 
1.5%
19
 
1.3%
Other values (353) 1121
78.8%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct154
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-29T01:16:37.163172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length20
Mean length6.1898734
Min length2

Characters and Unicode

Total characters978
Distinct characters237
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)94.9%

Sample

1st row필로소피 미디엄
2nd row사티아 보일 바이오크
3rd row김달님 외
4th row코비 야마다
5th row조지욱 글;국형원 그림
ValueCountFrequency (%)
지음 43
 
14.2%
그림 9
 
3.0%
7
 
2.3%
6
 
2.0%
존스 2
 
0.7%
신현경 2
 
0.7%
매트 2
 
0.7%
졸러 2
 
0.7%
세이츠 2
 
0.7%
이꽃님 2
 
0.7%
Other values (217) 225
74.5%
2024-01-29T01:16:37.490276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
 
15.2%
53
 
5.4%
44
 
4.5%
36
 
3.7%
26
 
2.7%
15
 
1.5%
14
 
1.4%
13
 
1.3%
13
 
1.3%
12
 
1.2%
Other values (227) 603
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 811
82.9%
Space Separator 149
 
15.2%
Other Punctuation 15
 
1.5%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
6.5%
44
 
5.4%
36
 
4.4%
26
 
3.2%
15
 
1.8%
14
 
1.7%
13
 
1.6%
13
 
1.6%
12
 
1.5%
10
 
1.2%
Other values (221) 575
70.9%
Other Punctuation
ValueCountFrequency (%)
; 10
66.7%
, 5
33.3%
Space Separator
ValueCountFrequency (%)
149
100.0%
Close Punctuation
ValueCountFrequency (%)
] 1
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 811
82.9%
Common 167
 
17.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
6.5%
44
 
5.4%
36
 
4.4%
26
 
3.2%
15
 
1.8%
14
 
1.7%
13
 
1.6%
13
 
1.6%
12
 
1.5%
10
 
1.2%
Other values (221) 575
70.9%
Common
ValueCountFrequency (%)
149
89.2%
; 10
 
6.0%
, 5
 
3.0%
] 1
 
0.6%
[ 1
 
0.6%
- 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 811
82.9%
ASCII 167
 
17.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
149
89.2%
; 10
 
6.0%
, 5
 
3.0%
] 1
 
0.6%
[ 1
 
0.6%
- 1
 
0.6%
Hangul
ValueCountFrequency (%)
53
 
6.5%
44
 
5.4%
36
 
4.4%
26
 
3.2%
15
 
1.8%
14
 
1.7%
13
 
1.6%
13
 
1.6%
12
 
1.5%
10
 
1.2%
Other values (221) 575
70.9%
Distinct108
Distinct (%)68.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-29T01:16:37.747345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length4.0189873
Min length1

Characters and Unicode

Total characters635
Distinct characters193
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)52.5%

Sample

1st row한국경제신문
2nd row윌북
3rd row책폴
4th row상상의힘
5th row사계절
ValueCountFrequency (%)
문학동네 10
 
6.3%
창비 6
 
3.8%
책읽는곰 3
 
1.9%
동아시아 3
 
1.9%
마음산책 3
 
1.9%
우리학교 3
 
1.9%
웅진주니어 3
 
1.9%
포레스트북스 3
 
1.9%
21세기북스 3
 
1.9%
윌북 3
 
1.9%
Other values (99) 119
74.8%
2024-01-29T01:16:38.120229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
 
4.6%
24
 
3.8%
17
 
2.7%
15
 
2.4%
15
 
2.4%
15
 
2.4%
14
 
2.2%
14
 
2.2%
14
 
2.2%
14
 
2.2%
Other values (183) 464
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 606
95.4%
Uppercase Letter 17
 
2.7%
Decimal Number 7
 
1.1%
Lowercase Letter 3
 
0.5%
Space Separator 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
4.8%
24
 
4.0%
17
 
2.8%
15
 
2.5%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
14
 
2.3%
14
 
2.3%
Other values (169) 435
71.8%
Uppercase Letter
ValueCountFrequency (%)
O 4
23.5%
K 3
17.6%
S 3
17.6%
B 3
17.6%
A 1
 
5.9%
E 1
 
5.9%
H 1
 
5.9%
R 1
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
l 1
33.3%
m 1
33.3%
a 1
33.3%
Decimal Number
ValueCountFrequency (%)
2 4
57.1%
1 3
42.9%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 606
95.4%
Latin 20
 
3.1%
Common 9
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
4.8%
24
 
4.0%
17
 
2.8%
15
 
2.5%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
14
 
2.3%
14
 
2.3%
Other values (169) 435
71.8%
Latin
ValueCountFrequency (%)
O 4
20.0%
K 3
15.0%
S 3
15.0%
B 3
15.0%
A 1
 
5.0%
l 1
 
5.0%
m 1
 
5.0%
a 1
 
5.0%
E 1
 
5.0%
H 1
 
5.0%
Common
ValueCountFrequency (%)
2 4
44.4%
1 3
33.3%
2
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 606
95.4%
ASCII 29
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
 
4.8%
24
 
4.0%
17
 
2.8%
15
 
2.5%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
14
 
2.3%
14
 
2.3%
Other values (169) 435
71.8%
ASCII
ValueCountFrequency (%)
2 4
13.8%
O 4
13.8%
K 3
10.3%
S 3
10.3%
B 3
10.3%
1 3
10.3%
2
6.9%
A 1
 
3.4%
l 1
 
3.4%
m 1
 
3.4%
Other values (4) 4
13.8%

책위치
Categorical

Distinct5
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
종합자료실
79 
어린이자료실
51 
일반자료실
17 
유아실
10 
유아자료실
 
1

Length

Max length6
Median length5
Mean length5.1962025
Min length3

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row종합자료실
2nd row종합자료실
3rd row종합자료실
4th row어린이자료실
5th row어린이자료실

Common Values

ValueCountFrequency (%)
종합자료실 79
50.0%
어린이자료실 51
32.3%
일반자료실 17
 
10.8%
유아실 10
 
6.3%
유아자료실 1
 
0.6%

Length

2024-01-29T01:16:38.240560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T01:16:38.333021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종합자료실 79
50.0%
어린이자료실 51
32.3%
일반자료실 17
 
10.8%
유아실 10
 
6.3%
유아자료실 1
 
0.6%

쪽수번호
Real number (ℝ)

Distinct102
Distinct (%)64.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean223.84177
Minimum24
Maximum1120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-01-29T01:16:38.428887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum24
5-th percentile32
Q185
median230
Q3306.25
95-th percentile481.2
Maximum1120
Range1096
Interquartile range (IQR)221.25

Descriptive statistics

Standard deviation159.71912
Coefficient of variation (CV)0.71353582
Kurtosis5.3963839
Mean223.84177
Median Absolute Deviation (MAD)110
Skewness1.402878
Sum35367
Variance25510.198
MonotonicityNot monotonic
2024-01-29T01:16:38.754234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
32 9
 
5.7%
40 7
 
4.4%
292 5
 
3.2%
248 4
 
2.5%
36 4
 
2.5%
96 3
 
1.9%
44 3
 
1.9%
199 3
 
1.9%
268 3
 
1.9%
488 3
 
1.9%
Other values (92) 114
72.2%
ValueCountFrequency (%)
24 1
 
0.6%
28 1
 
0.6%
29 1
 
0.6%
31 2
 
1.3%
32 9
5.7%
36 4
2.5%
40 7
4.4%
42 1
 
0.6%
44 3
 
1.9%
48 3
 
1.9%
ValueCountFrequency (%)
1120 1
 
0.6%
696 1
 
0.6%
631 1
 
0.6%
570 1
 
0.6%
492 1
 
0.6%
488 3
1.9%
480 1
 
0.6%
468 1
 
0.6%
467 1
 
0.6%
463 1
 
0.6%

Interactions

2024-01-29T01:16:35.575601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-29T01:16:35.441024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-29T01:16:35.649400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-29T01:16:35.512146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-29T01:16:38.822314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번도서관구분책위치쪽수번호
연번1.0000.9870.7370.000
도서관구분0.9871.0000.8520.270
책위치0.7370.8521.0000.585
쪽수번호0.0000.2700.5851.000
2024-01-29T01:16:38.890722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관구분책위치
도서관구분1.0000.489
책위치0.4891.000
2024-01-29T01:16:38.964021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번쪽수번호도서관구분책위치
연번1.0000.0620.8260.389
쪽수번호0.0621.0000.1660.405
도서관구분0.8260.1661.0000.489
책위치0.3890.4050.4891.000

Missing values

2024-01-29T01:16:35.746835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T01:16:35.836728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번도서관구분책제목저자출판사책위치쪽수번호
01남동논현도서관출근길엔 니체, 퇴근길엔 장자필로소피 미디엄한국경제신문종합자료실259
12남동논현도서관어른의 중력:생의 1/4 승강장에 도착한 어린 어른을 위한 심리학사티아 보일 바이오크윌북종합자료실248
23남동논현도서관너와 나의 야자 시간김달님 외책폴종합자료실216
34남동논현도서관마법의 사탕 한 알코비 야마다상상의힘어린이자료실44
45남동논현도서관아시아엔 다 있다!조지욱 글;국형원 그림사계절어린이자료실1120
56남동논현도서관안녕, 겨울케나드 박국민서관어린이자료실32
67남동논현도서관네 칸 신화집로익 곰책빛어린이자료실88
78남동논현도서관나는 미니멀 유목민입니다박건우길벗어린이종합자료실228
89남동논현도서관백만장자를 위한 공짜 음식. 1이민진인플루엔셜종합자료실488
910남동논현도서관어쩌다 만난 수학고정욱책담종합자료실180
연번도서관구분책제목저자출판사책위치쪽수번호
148149만수2동어린이도서관우리집 고양이 이야기이토 미쿠그레이트BOOKS어린이자료실128
149150만수2동어린이도서관처음 우주에 간 고양이, 피자를 맛보다맥 바넷나무의말어린이자료실328
150151만수2동어린이도서관조선미의 현실 육아 상담소조선미북하우스일반자료실272
151152만수2동어린이도서관기타 등등동아리를 신청합니다류재항시공주니어어린이자료실102
152153만수2동어린이도서관모네의 고양이릴리 머레이아르카디아유아자료실40
153154간석3동 어린이도서관고구마구마사이다반달유아실40
154155간석3동 어린이도서관대장 토끼는 포기하지 않아큐라이스토토북유아실31
155156간석3동 어린이도서관꽃할머니권윤덕사계절유아실40
156157간석3동 어린이도서관30개 도시로 읽는 세계사조 지무쇼다산초당일반자료실357
157158간석3동 어린이도서관베어타운프레드릭 배크만다산책방일반자료실570