Overview

Dataset statistics

Number of variables: 6
Number of observations: 52
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 KiB
Average record size in memory: 51.5 B

Variable types

Numeric: 1
Categorical: 3
Text: 2

Dataset

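Description: This dataset covers the periodicals list of 남동현도서관 in Namdong-gu, Incheon Metropolitan City. For each periodical it gives the serial number (연번), subject (주제), field (분야), publication frequency (간기), periodical title (간행물명), and publisher (출판사).
URL: https://www.data.go.kr/data/15103947/fileData.do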

Alerts

연번 is highly overall correlated with 주제 and 1 other field (High correlation)
주제 is highly overall correlated with 연번 and 1 other field (High correlation)
분야 is highly overall correlated with 연번 and 1 other field (High correlation)
연번 has unique values (Unique)
간행물명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 12:24:17.138250
Analysis finished: 2023-12-12 12:24:17.932026
Duration: 0.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 52
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26.5
Minimum: 1
Maximum: 52
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 600.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.55
Q1: 13.75
Median: 26.5
Q3: 39.25
95-th percentile: 49.45
Maximum: 52
Range: 51
Interquartile range (IQR): 25.5

Descriptive statistics

Standard deviation: 15.154757
Coefficient of variation (CV): 0.57187763
Kurtosis: -1.2
Mean: 26.5
Median Absolute Deviation (MAD): 13
Skewness: 0
Sum: 1378
Variance: 229.66667
Monotonicity: Strictly increasing

[Figure: Histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
1 | 1 | 1.9%
28 | 1 | 1.9%
30 | 1 | 1.9%
31 | 1 | 1.9%
32 | 1 | 1.9%
33 | 1 | 1.9%
34 | 1 | 1.9%
35 | 1 | 1.9%
36 | 1 | 1.9%
37 | 1 | 1.9%
Other values (42) | 42 | 80.8%

Value | Count | Frequency (%)
1 | 1 | 1.9%
2 | 1 | 1.9%
3 | 1 | 1.9%
4 | 1 | 1.9%
5 | 1 | 1.9%
6 | 1 | 1.9%
7 | 1 | 1.9%
8 | 1 | 1.9%
9 | 1 | 1.9%
10 | 1 | 1.9%

Value | Count | Frequency (%)
52 | 1 | 1.9%
51 | 1 | 1.9%
50 | 1 | 1.9%
49 | 1 | 1.9%
48 | 1 | 1.9%
47 | 1 | 1.9%
46 | 1 | 1.9%
45 | 1 | 1.9%
44 | 1 | 1.9%
43 | 1 | 1.9%
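
The quantile and descriptive statistics reported for 연번 can be cross-checked by hand. The sketch below assumes that 연번 is simply the serial number 1 to 52 (which the minimum, maximum, distinct count, and strictly increasing monotonicity suggest) and that percentiles use linear interpolation between neighbouring ranks. The `percentile` helper is a hypothetical name introduced for illustration, not part of ydata-profiling.

```python
import math
import statistics

# Assumption: 연번 holds the integers 1..52, one per row.
values = list(range(1, 53))


def percentile(sorted_values, q):
    """Return the q-th quantile (0 <= q <= 1) using linear interpolation."""
    position = (len(sorted_values) - 1) * q
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

p05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
iqr = q3 - q1

print(mean, median, round(variance, 5), round(std_dev, 6), round(cv, 8), mad)
print(round(p05, 2), q1, q3, round(p95, 2), iqr, sum(values))
```

Under these assumptions the results agree with the figures listed for 연번 (for example, a sample variance of about 229.67 and an interquartile range of 25.5).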

주제
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 19.2%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B
Most frequent categories: 기술과학, 사회과학, 어린이, 총류, 자연과학; Other values (5): 13

Length

Max length: 4
Median length: 3
Mean length: 3.0769231
Min length: 2

Unique

Unique: 1
Unique (%): 1.9%

Sample

1st row: 총류
2nd row: 총류
3rd row: 총류
4th row: 총류
5th row: 총류

Common Values

Value | Count | Frequency (%)
기술과학 | 9 | 17.3%
사회과학 | 8 | 15.4%
어린이 | 8 | 15.4%
총류 | 7 | 13.5%
자연과학 | 7 | 13.5%
예술 | 4 | 7.7%
문학 | 3 | 5.8%
역사 | 3 | 5.8%
철학 | 2 | 3.8%
언어 | 1 | 1.9%
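
The percentages in the table are each category's count divided by the 52 observations. A minimal sketch of that arithmetic, using the counts copied from the 주제 table (the variable names are illustrative only):

```python
# Counts copied from the Common Values table for 주제.
counts = {
    "기술과학": 9, "사회과학": 8, "어린이": 8, "총류": 7, "자연과학": 7,
    "예술": 4, "문학": 3, "역사": 3, "철학": 2, "언어": 1,
}

total = sum(counts.values())  # should equal the number of observations

# Share of each category, as a percentage rounded to one decimal place.
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

# "Distinct" is the number of different categories; "Unique" counts the
# categories that occur exactly once.
distinct = len(counts)
unique = sum(1 for count in counts.values() if count == 1)

for value, count in counts.items():
    print(f"{value}: {count} ({shares[value]}%)")
print(f"distinct={distinct} ({round(100 * distinct / total, 1)}%), unique={unique}")
```

The same calculation accounts for the Distinct (%) and Unique (%) rows: 10 distinct categories out of 52 rows is 19.2%, and one category occurring once is 1.9%.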

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

분야
Categorical

HIGH CORRELATION 

Distinct: 16
Distinct (%): 30.8%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B
Most frequent categories: 라이프 (11), 아동, 논술, 시사, 과학; Other values (11): 15

Length

Max length: 3
Median length: 2
Mean length: 2.2307692
Min length: 2

Unique

Unique: 8
Unique (%): 15.4%

Sample

1st row: 논술
2nd row: 교양
3rd row: 논술
4th row: 논술
5th row: 논술

Common Values

Value | Count | Frequency (%)
라이프 | 11 | 21.2%
아동 | 8 | 15.4%
논술 | 6 | 11.5%
시사 | 6 | 11.5%
과학 | 6 | 11.5%
문학 | 3 | 5.8%
예술 | 2 | 3.8%
여행 | 2 | 3.8%
교양 | 1 | 1.9%
철학 | 1 | 1.9%
Other values (6) | 6 | 11.5%

Length

[Figure: Histogram of lengths of the category]

간기
Categorical

Distinct: 5
Distinct (%): 9.6%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B
Categories: 월간 (33), 계간, 주간, 격주간 (3), 격월간 (2)

Length

Max length: 3
Median length: 2
Mean length: 2.0961538
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 월간
2nd row: 계간
3rd row: 월간
4th row: 월간
5th row: 월간

Common Values

Value | Count | Frequency (%)
월간 | 33 | 63.5%
계간 | 8 | 15.4%
주간 | 6 | 11.5%
격주간 | 3 | 5.8%
격월간 | 2 | 3.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

간행물명
Text

UNIQUE 

Distinct: 52
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 24
Median length: 16
Mean length: 8.5961538
Min length: 2

Characters and Unicode

Total characters: 447
Distinct characters: 166
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
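
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to count the general categories in one title taken from this column's sample rows. The `CATEGORY_NAMES` mapping is an assumption added here to translate the two-letter Unicode codes into the labels the report uses; it is not taken from ydata-profiling.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general categories mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
}

title = "인디고잉 INDIGO+ing"  # 2nd sample row of 간행물명

# Count how many characters of the title fall into each general category.
categories = Counter(unicodedata.category(ch) for ch in title)

for code, count in categories.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)}: {count}")

# Hangul syllables are named "HANGUL SYLLABLE ..." in the Unicode database,
# which gives a rough way to count Hangul characters.
hangul_count = sum(1 for ch in title if unicodedata.name(ch, "").startswith("HANGUL"))
print("Hangul characters:", hangul_count)
```

Applying the same counting to every value in a text column and summing the results would yield category totals of the kind shown under "Most occurring categories".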

Unique

Unique: 52
Unique (%): 100.0%

Sample

1st row: 월간 유레카
2nd row: 인디고잉 INDIGO+ing
3rd row: 독서평설(고등)
4th row: 독서평설(중등)
5th row: 독서평설(초등)

Value | Count | Frequency (%)
월간 | 2 | 2.1%
한국판 | 2 | 2.1%
어린이 | 2 | 2.1%
코리아 | 2 | 2.1%
수학동아 | 2 | 2.1%
talk | 2 | 2.1%
우등생 | 2 | 2.1%
과학동아 | 2 | 2.1%
일레븐 | 1 | 1.0%
베스트 | 1 | 1.0%
Other values (79) | 79 | 81.4%

Most occurring characters

ValueCountFrequency (%)
49
 
11.0%
e 14
 
3.1%
9
 
2.0%
8
 
1.8%
a 8
 
1.8%
r 8
 
1.8%
) 8
 
1.8%
( 8
 
1.8%
7
 
1.6%
7
 
1.6%
Other values (156) 321
71.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 255 | 57.0%
Lowercase Letter | 73 | 16.3%
Space Separator | 49 | 11.0%
Uppercase Letter | 49 | 11.0%
Close Punctuation | 8 | 1.8%
Open Punctuation | 8 | 1.8%
Decimal Number | 4 | 0.9%
Math Symbol | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
Other values (116) 192
75.3%
Uppercase Letter
ValueCountFrequency (%)
T 6
12.2%
I 6
12.2%
E 5
10.2%
M 4
 
8.2%
N 4
 
8.2%
K 3
 
6.1%
D 3
 
6.1%
G 3
 
6.1%
S 2
 
4.1%
C 2
 
4.1%
Other values (8) 11
22.4%
Lowercase Letter
ValueCountFrequency (%)
e 14
19.2%
a 8
11.0%
r 8
11.0%
o 7
9.6%
i 7
9.6%
l 5
 
6.8%
t 5
 
6.8%
h 3
 
4.1%
n 3
 
4.1%
v 2
 
2.7%
Other values (6) 11
15.1%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 2
50.0%
Space Separator
ValueCountFrequency (%)
49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 255 | 57.0%
Latin | 122 | 27.3%
Common | 70 | 15.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
Other values (116) 192
75.3%
Latin
ValueCountFrequency (%)
e 14
 
11.5%
a 8
 
6.6%
r 8
 
6.6%
o 7
 
5.7%
i 7
 
5.7%
T 6
 
4.9%
I 6
 
4.9%
l 5
 
4.1%
t 5
 
4.1%
E 5
 
4.1%
Other values (24) 51
41.8%
Common
ValueCountFrequency (%)
49
70.0%
) 8
 
11.4%
( 8
 
11.4%
2 2
 
2.9%
1 2
 
2.9%
+ 1
 
1.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 255 | 57.0%
ASCII | 192 | 43.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
49
25.5%
e 14
 
7.3%
a 8
 
4.2%
r 8
 
4.2%
) 8
 
4.2%
( 8
 
4.2%
o 7
 
3.6%
i 7
 
3.6%
T 6
 
3.1%
I 6
 
3.1%
Other values (30) 71
37.0%
Hangul
ValueCountFrequency (%)
9
 
3.5%
8
 
3.1%
7
 
2.7%
7
 
2.7%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
Other values (116) 192
75.3%

출판사
Text

Distinct: 39
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 15
Median length: 12
Mean length: 5.6538462
Min length: 2

Characters and Unicode

Total characters: 294
Distinct characters: 137
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 57.7%

Sample

1st row: 유레카엠앤비
2nd row: 인디고서원
3rd row: 지학사
4th row: 지학사
5th row: 지학사

Value | Count | Frequency (%)
동아사이언스 | 4 | 7.3%
한겨레신문사 | 3 | 5.5%
지학사 | 3 | 5.5%
천재교육 | 2 | 3.6%
디자인하우스 | 2 | 3.6%
교원 | 2 | 3.6%
동아일보사 | 2 | 3.6%
바다출판사 | 2 | 3.6%
두산매거진 | 2 | 3.6%
나비클럽 | 1 | 1.8%
Other values (32) | 32 | 58.2%

Most occurring characters

ValueCountFrequency (%)
23
 
7.8%
16
 
5.4%
9
 
3.1%
8
 
2.7%
7
 
2.4%
6
 
2.0%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (127) 205
69.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 261 | 88.8%
Lowercase Letter | 14 | 4.8%
Uppercase Letter | 13 | 4.4%
Space Separator | 4 | 1.4%
Close Punctuation | 1 | 0.3%
Open Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
8.8%
16
 
6.1%
9
 
3.4%
8
 
3.1%
7
 
2.7%
6
 
2.3%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
Other values (106) 172
65.9%
Lowercase Letter
ValueCountFrequency (%)
a 4
28.6%
i 2
14.3%
n 1
 
7.1%
p 1
 
7.1%
m 1
 
7.1%
o 1
 
7.1%
d 1
 
7.1%
e 1
 
7.1%
s 1
 
7.1%
y 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
M 3
23.1%
I 2
15.4%
E 2
15.4%
B 2
15.4%
Y 1
 
7.7%
C 1
 
7.7%
T 1
 
7.7%
N 1
 
7.7%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 261 | 88.8%
Latin | 27 | 9.2%
Common | 6 | 2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
8.8%
16
 
6.1%
9
 
3.4%
8
 
3.1%
7
 
2.7%
6
 
2.3%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
Other values (106) 172
65.9%
Latin
ValueCountFrequency (%)
a 4
14.8%
M 3
 
11.1%
I 2
 
7.4%
i 2
 
7.4%
E 2
 
7.4%
B 2
 
7.4%
Y 1
 
3.7%
n 1
 
3.7%
p 1
 
3.7%
m 1
 
3.7%
Other values (8) 8
29.6%
Common
ValueCountFrequency (%)
4
66.7%
) 1
 
16.7%
( 1
 
16.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 261 | 88.8%
ASCII | 33 | 11.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
8.8%
16
 
6.1%
9
 
3.4%
8
 
3.1%
7
 
2.7%
6
 
2.3%
5
 
1.9%
5
 
1.9%
5
 
1.9%
5
 
1.9%
Other values (106) 172
65.9%
ASCII
ValueCountFrequency (%)
4
 
12.1%
a 4
 
12.1%
M 3
 
9.1%
I 2
 
6.1%
i 2
 
6.1%
E 2
 
6.1%
B 2
 
6.1%
Y 1
 
3.0%
n 1
 
3.0%
p 1
 
3.0%
Other values (11) 11
33.3%

Interactions

[Figure: interactions plot]

Correlations

Variable | 연번 | 주제 | 분야 | 간기 | 간행물명 | 출판사
연번 | 1.000 | 0.971 | 0.915 | 0.837 | 1.000 | 0.845
주제 | 0.971 | 1.000 | 0.986 | 0.727 | 1.000 | 0.919
분야 | 0.915 | 0.986 | 1.000 | 0.609 | 1.000 | 0.862
간기 | 0.837 | 0.727 | 0.609 | 1.000 | 1.000 | 0.895
간행물명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
출판사 | 0.845 | 0.919 | 0.862 | 0.895 | 1.000 | 1.000

Variable | 간기 | 주제 | 분야
간기 | 1.000 | 0.361 | 0.312
주제 | 0.361 | 1.000 | 0.861
분야 | 0.312 | 0.861 | 1.000

Variable | 연번 | 주제 | 분야 | 간기
연번 | 1.000 | 0.701 | 0.630 | 0.469
주제 | 0.701 | 1.000 | 0.861 | 0.361
분야 | 0.630 | 0.861 | 1.000 | 0.312
간기 | 0.469 | 0.361 | 0.312 | 1.000
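
The report text does not say which association measure produced each matrix. One common choice for a pair of categorical variables is Cramér's V, and the sketch below shows the plain (uncorrected) form of it as an illustration only. The function name `cramers_v` is hypothetical, the input is just the first ten 주제/분야 pairs from the Sample section, and the result is not expected to match the 52-row figures above, particularly if the tool applies a bias correction.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long lists of categories."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed counts per (x, y) cell
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for row, row_count in row_totals.items():
        for col, col_count in col_totals.items():
            expected = row_count * col_count / n
            observed = joint.get((row, col), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association information
    return math.sqrt(chi2 / (n * k))


# First ten rows of the Sample section (주제, 분야).
subject = ["총류"] * 7 + ["철학", "철학", "사회과학"]
field = ["논술", "교양", "논술", "논술", "논술", "논술", "논술", "라이프", "철학", "경제"]

print(round(cramers_v(subject, field), 3))
```

A value near 1 means one variable almost determines the other, and a value near 0 means the two are close to independent, which is how the entries in the matrices above can be read.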

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

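First rows

Index | 연번 | 주제 | 분야 | 간기 | 간행물명 | 출판사
0 | 1 | 총류 | 논술 | 월간 | 월간 유레카 | 유레카엠앤비
1 | 2 | 총류 | 교양 | 계간 | 인디고잉 INDIGO+ing | 인디고서원
2 | 3 | 총류 | 논술 | 월간 | 독서평설(고등) | 지학사
3 | 4 | 총류 | 논술 | 월간 | 독서평설(중등) | 지학사
4 | 5 | 총류 | 논술 | 월간 | 독서평설(초등) | 지학사
5 | 6 | 총류 | 논술 | 월간 | 톡톡 Talk Talk | 삼십육점오커뮤니케이션즈
6 | 7 | 총류 | 논술 | 월간 | 행복한 논술(중학생용) | 이태종NIE논술연구소
7 | 8 | 철학 | 라이프 | 계간 | 브리드 Breathe | 틔움출판
8 | 9 | 철학 | 철학 | 계간 | 뉴필로소퍼 NewPhilosopher | 바다출판사
9 | 10 | 사회과학 | 경제 | 월간 | 이코노미 인사이트 | 한겨레신문사

Last rows

Index | 연번 | 주제 | 분야 | 간기 | 간행물명 | 출판사
42 | 43 | 역사 | 여행 | 월간 | 트래비 Travie | 여행신문
43 | 44 | 역사 | 지리 | 월간 | 내셔널 지오그래픽(한국어판) | YBM
44 | 45 | 어린이 | 아동 | 월간 | 개똥이네 놀이터 | 보리
45 | 46 | 어린이 | 아동 | 월간 | 고래가 그랬어 | 고래가그랬어
46 | 47 | 어린이 | 아동 | 월간 | 시사원정대 | 동아이지에듀
47 | 48 | 어린이 | 아동 | 격주간 | 어린이 과학동아 | 동아사이언스
48 | 49 | 어린이 | 아동 | 격주간 | 어린이 수학동아 | 동아사이언스
49 | 50 | 어린이 | 아동 | 월간 | 우등생 과학 | 천재교육
50 | 51 | 어린이 | 아동 | 월간 | 우등생 논술 | 천재교육
51 | 52 | 어린이 | 아동 | 월간 | 위즈키즈 | 교원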