Overview

Dataset statistics

Number of variables4
Number of observations62
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory35.1 B

Variable types

Numeric1
Categorical2
Text1

Dataset

Description인천광역시 미추홀구 구립도서관에서 정기적으로 발행하는 도서에 대한 데이터로 도서관명, 도서명, 출판사 등의 정보를 제공합니다.
Author인천광역시 미추홀구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15117534&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 도서관명High correlation
도서관명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-18 03:35:47.950063
Analysis finished2024-03-18 03:35:49.679693
Duration1.73 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct62
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.5
Minimum1
Maximum62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2024-03-18T12:35:49.744818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.05
Q116.25
median31.5
Q346.75
95-th percentile58.95
Maximum62
Range61
Interquartile range (IQR)30.5

Descriptive statistics

Standard deviation18.041619
Coefficient of variation (CV)0.5727498
Kurtosis-1.2
Mean31.5
Median Absolute Deviation (MAD)15.5
Skewness0
Sum1953
Variance325.5
MonotonicityStrictly increasing
2024-03-18T12:35:49.856472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.6%
48 1
 
1.6%
35 1
 
1.6%
36 1
 
1.6%
37 1
 
1.6%
38 1
 
1.6%
39 1
 
1.6%
40 1
 
1.6%
41 1
 
1.6%
42 1
 
1.6%
Other values (52) 52
83.9%
ValueCountFrequency (%)
1 1
1.6%
2 1
1.6%
3 1
1.6%
4 1
1.6%
5 1
1.6%
6 1
1.6%
7 1
1.6%
8 1
1.6%
9 1
1.6%
10 1
1.6%
ValueCountFrequency (%)
62 1
1.6%
61 1
1.6%
60 1
1.6%
59 1
1.6%
58 1
1.6%
57 1
1.6%
56 1
1.6%
55 1
1.6%
54 1
1.6%
53 1
1.6%

도서관명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)17.7%
Missing0
Missing (%)0.0%
Memory size628.0 B
학나래
10 
이랑
용비
쑥골
한우리
Other values (6)
27 

Length

Max length3
Median length3
Mean length2.5967742
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학나래
2nd row학나래
3rd row학나래
4th row학나래
5th row학나래

Common Values

ValueCountFrequency (%)
학나래 10
16.1%
이랑 7
11.3%
용비 7
11.3%
쑥골 6
9.7%
한우리 5
8.1%
관교 5
8.1%
석바위 5
8.1%
소금꽃 5
8.1%
제물포 4
 
6.5%
독정골 4
 
6.5%

Length

2024-03-18T12:35:49.962163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
학나래 10
16.1%
이랑 7
11.3%
용비 7
11.3%
쑥골 6
9.7%
한우리 5
8.1%
관교 5
8.1%
석바위 5
8.1%
소금꽃 5
8.1%
제물포 4
 
6.5%
독정골 4
 
6.5%
Distinct32
Distinct (%)51.6%
Missing0
Missing (%)0.0%
Memory size628.0 B
2024-03-18T12:35:50.146761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length5.3709677
Min length2

Characters and Unicode

Total characters333
Distinct characters104
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)32.3%

Sample

1st row뉴턴
2nd row고교독서평설
3rd row탑기어
4th row전원생활
5th row좋은생각
ValueCountFrequency (%)
좋은생각 10
 
12.5%
수학동아 8
 
10.0%
과학소년 7
 
8.8%
시사원정대 5
 
6.2%
어린이 5
 
6.2%
위즈키즈 2
 
2.5%
과학동아 2
 
2.5%
뉴턴 2
 
2.5%
어린이과학동아 2
 
2.5%
싱글즈 2
 
2.5%
Other values (30) 35
43.8%
2024-03-18T12:35:50.432092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
5.7%
18
 
5.4%
14
 
4.2%
13
 
3.9%
12
 
3.6%
11
 
3.3%
10
 
3.0%
10
 
3.0%
10
 
3.0%
10
 
3.0%
Other values (94) 206
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 302
90.7%
Space Separator 18
 
5.4%
Lowercase Letter 7
 
2.1%
Uppercase Letter 6
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
6.3%
14
 
4.6%
13
 
4.3%
12
 
4.0%
11
 
3.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
Other values (81) 184
60.9%
Lowercase Letter
ValueCountFrequency (%)
e 2
28.6%
h 1
14.3%
t 1
14.3%
b 1
14.3%
r 1
14.3%
a 1
14.3%
Uppercase Letter
ValueCountFrequency (%)
B 1
16.7%
C 1
16.7%
A 1
16.7%
H 1
16.7%
E 1
16.7%
G 1
16.7%
Space Separator
ValueCountFrequency (%)
18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 302
90.7%
Common 18
 
5.4%
Latin 13
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
6.3%
14
 
4.6%
13
 
4.3%
12
 
4.0%
11
 
3.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
Other values (81) 184
60.9%
Latin
ValueCountFrequency (%)
e 2
15.4%
B 1
7.7%
C 1
7.7%
A 1
7.7%
H 1
7.7%
E 1
7.7%
G 1
7.7%
h 1
7.7%
t 1
7.7%
b 1
7.7%
Other values (2) 2
15.4%
Common
ValueCountFrequency (%)
18
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 302
90.7%
ASCII 31
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
6.3%
14
 
4.6%
13
 
4.3%
12
 
4.0%
11
 
3.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
9
 
3.0%
Other values (81) 184
60.9%
ASCII
ValueCountFrequency (%)
18
58.1%
e 2
 
6.5%
B 1
 
3.2%
C 1
 
3.2%
A 1
 
3.2%
H 1
 
3.2%
E 1
 
3.2%
G 1
 
3.2%
h 1
 
3.2%
t 1
 
3.2%
Other values (3) 3
 
9.7%

출판사
Categorical

Distinct26
Distinct (%)41.9%
Missing0
Missing (%)0.0%
Memory size628.0 B
동아사이언스
12 
좋은생각
10 
동아이지에듀
교원문고
교원
Other values (21)
27 

Length

Max length8
Median length6
Mean length4.6774194
Min length1

Unique

Unique16 ?
Unique (%)25.8%

Sample

1st row아이뉴턴
2nd row지학사
3rd row프린피아
4th row농민신문사
5th row좋은생각

Common Values

ValueCountFrequency (%)
동아사이언스 12
19.4%
좋은생각 10
16.1%
동아이지에듀 5
 
8.1%
교원문고 4
 
6.5%
교원 4
 
6.5%
지학사 3
 
4.8%
천재교육 2
 
3.2%
아이뉴턴 2
 
3.2%
더북컴퍼니 2
 
3.2%
샘터사 2
 
3.2%
Other values (16) 16
25.8%

Length

2024-03-18T12:35:50.560324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
동아사이언스 12
19.4%
좋은생각 10
16.1%
동아이지에듀 5
 
8.1%
교원문고 4
 
6.5%
교원 4
 
6.5%
지학사 3
 
4.8%
샘터사 2
 
3.2%
더북컴퍼니 2
 
3.2%
아이뉴턴 2
 
3.2%
천재교육 2
 
3.2%
Other values (16) 16
25.8%

Interactions

2024-03-18T12:35:49.446773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T12:35:50.640404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번도서관명도서명출판사
연번1.0000.9570.0000.000
도서관명0.9571.0000.0000.000
도서명0.0000.0001.0000.993
출판사0.0000.0000.9931.000
2024-03-18T12:35:50.721780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관명출판사
도서관명1.0000.000
출판사0.0001.000
2024-03-18T12:35:50.793133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번도서관명출판사
연번1.0000.8190.000
도서관명0.8191.0000.000
출판사0.0000.0001.000

Missing values

2024-03-18T12:35:49.585388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T12:35:49.648261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번도서관명도서명출판사
01학나래뉴턴아이뉴턴
12학나래고교독서평설지학사
23학나래탑기어프린피아
34학나래전원생활농민신문사
45학나래좋은생각좋은생각
56학나래수학동아동아사이언스
67학나래과학소년교원
78학나래위즈키즈천재교육
89학나래우등생논술한국방송
910학나래내셔널지오그래픽 리틀키즈유피에이
연번도서관명도서명출판사
5253이랑시사원정대동아이지에듀
5354이랑프리스쿨아에이오우디자인
5455이랑월간 그림책㈜행복한아침독서
5556용비좋은생각좋은생각
5657용비뉴턴아이뉴턴
5758용비책 CHAEG
5859용비과학소년교원
5960용비수학동아동아사이언스
6061용비초등독서평설지학사
6162용비어린이과학동아동아사이언스