Overview

Dataset statistics

Number of variables6
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory54.5 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description해당 데이터는 인천광역시 남동구의 간석3동 어린이도서관 정기간행물 목록에 관련된 자료로서, 인천광영시 남동구 간석3동 어린이도서관 정기간행물 목록의 연번, 주제, 분야, 간기, 간행물명, 출판사의 정보를 확인할 수 있다.
URLhttps://www.data.go.kr/data/15103950/fileData.do

Alerts

연번 is highly overall correlated with 주제High correlation
주제 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
간행물명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:17:32.510675
Analysis finished2023-12-12 21:17:33.085371
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.5
Minimum1
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-13T06:17:33.171318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.15
Q16.75
median12.5
Q318.25
95-th percentile22.85
Maximum24
Range23
Interquartile range (IQR)11.5

Descriptive statistics

Standard deviation7.0710678
Coefficient of variation (CV)0.56568542
Kurtosis-1.2
Mean12.5
Median Absolute Deviation (MAD)6
Skewness0
Sum300
Variance50
MonotonicityStrictly increasing
2023-12-13T06:17:33.304234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1 1
 
4.2%
14 1
 
4.2%
24 1
 
4.2%
23 1
 
4.2%
22 1
 
4.2%
21 1
 
4.2%
20 1
 
4.2%
19 1
 
4.2%
18 1
 
4.2%
17 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
1 1
4.2%
2 1
4.2%
3 1
4.2%
4 1
4.2%
5 1
4.2%
6 1
4.2%
7 1
4.2%
8 1
4.2%
9 1
4.2%
10 1
4.2%
ValueCountFrequency (%)
24 1
4.2%
23 1
4.2%
22 1
4.2%
21 1
4.2%
20 1
4.2%
19 1
4.2%
18 1
4.2%
17 1
4.2%
16 1
4.2%
15 1
4.2%

주제
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size324.0 B
어린이
12 
가정
인문
예술
경제
 
1
Other values (3)

Length

Max length3
Median length2.5
Mean length2.5
Min length2

Unique

Unique4 ?
Unique (%)16.7%

Sample

1st row가정
2nd row가정
3rd row가정
4th row경제
5th row여행

Common Values

ValueCountFrequency (%)
어린이 12
50.0%
가정 3
 
12.5%
인문 3
 
12.5%
예술 2
 
8.3%
경제 1
 
4.2%
여행 1
 
4.2%
의학 1
 
4.2%
정치 1
 
4.2%

Length

2023-12-13T06:17:33.437650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:17:33.573087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
어린이 12
50.0%
가정 3
 
12.5%
인문 3
 
12.5%
예술 2
 
8.3%
경제 1
 
4.2%
여행 1
 
4.2%
의학 1
 
4.2%
정치 1
 
4.2%

분야
Text

Distinct15
Distinct (%)62.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
2023-12-13T06:17:33.733959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length2.2083333
Min length2

Characters and Unicode

Total characters53
Distinct characters29
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)37.5%

Sample

1st row요리
2nd row육아
3rd row인테리어
4th row경제
5th row여행
ValueCountFrequency (%)
어린이 3
12.5%
과학 3
12.5%
논술 3
12.5%
교양 2
 
8.3%
시사 2
 
8.3%
수학 2
 
8.3%
요리 1
 
4.2%
육아 1
 
4.2%
인테리어 1
 
4.2%
경제 1
 
4.2%
Other values (5) 5
20.8%
2023-12-13T06:17:34.037963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
 
9.4%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
Other values (19) 22
41.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
9.4%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
Other values (19) 22
41.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
9.4%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
Other values (19) 22
41.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
 
9.4%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
Other values (19) 22
41.5%

간기
Categorical

Distinct5
Distinct (%)20.8%
Missing0
Missing (%)0.0%
Memory size324.0 B
월간
16 
격월간
주간
반월간
계간
 
1

Length

Max length3
Median length2
Mean length2.2083333
Min length2

Unique

Unique1 ?
Unique (%)4.2%

Sample

1st row격월간
2nd row격월간
3rd row월간
4th row주간
5th row월간

Common Values

ValueCountFrequency (%)
월간 16
66.7%
격월간 3
 
12.5%
주간 2
 
8.3%
반월간 2
 
8.3%
계간 1
 
4.2%

Length

2023-12-13T06:17:34.169403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:17:34.267709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월간 16
66.7%
격월간 3
 
12.5%
주간 2
 
8.3%
반월간 2
 
8.3%
계간 1
 
4.2%

간행물명
Text

UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size324.0 B
2023-12-13T06:17:34.457077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9.5
Mean length6.2916667
Min length2

Characters and Unicode

Total characters151
Distinct characters89
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row매거진 F
2nd row위매거진
3rd row행복이 가득한 집
4th row매경 Economy
5th row여행스케치
ValueCountFrequency (%)
매거진 2
 
5.0%
독서평설 2
 
5.0%
수학동아 2
 
5.0%
어린이 2
 
5.0%
nie 1
 
2.5%
놀이터 1
 
2.5%
고래가 1
 
2.5%
그랬어 1
 
2.5%
과학소년 1
 
2.5%
신나는 1
 
2.5%
Other values (26) 26
65.0%
2023-12-13T06:17:34.775576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
10.6%
7
 
4.6%
5
 
3.3%
4
 
2.6%
4
 
2.6%
4
 
2.6%
3
 
2.0%
3
 
2.0%
3
 
2.0%
3
 
2.0%
Other values (79) 99
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122
80.8%
Space Separator 16
 
10.6%
Uppercase Letter 7
 
4.6%
Lowercase Letter 6
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
5.7%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (67) 83
68.0%
Uppercase Letter
ValueCountFrequency (%)
E 2
28.6%
I 1
14.3%
N 1
14.3%
F 1
14.3%
C 1
14.3%
A 1
14.3%
Lowercase Letter
ValueCountFrequency (%)
o 2
33.3%
y 1
16.7%
m 1
16.7%
n 1
16.7%
c 1
16.7%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122
80.8%
Common 16
 
10.6%
Latin 13
 
8.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
5.7%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (67) 83
68.0%
Latin
ValueCountFrequency (%)
o 2
15.4%
E 2
15.4%
I 1
7.7%
N 1
7.7%
y 1
7.7%
m 1
7.7%
n 1
7.7%
c 1
7.7%
F 1
7.7%
C 1
7.7%
Common
ValueCountFrequency (%)
16
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 122
80.8%
ASCII 29
 
19.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16
55.2%
o 2
 
6.9%
E 2
 
6.9%
I 1
 
3.4%
N 1
 
3.4%
y 1
 
3.4%
m 1
 
3.4%
n 1
 
3.4%
c 1
 
3.4%
F 1
 
3.4%
Other values (2) 2
 
6.9%
Hangul
ValueCountFrequency (%)
7
 
5.7%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (67) 83
68.0%
Distinct19
Distinct (%)79.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
2023-12-13T06:17:34.938321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length7.5
Mean length5.4166667
Min length2

Characters and Unicode

Total characters130
Distinct characters74
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)62.5%

Sample

1st rowB Media Company
2nd row어라운드
3rd row디자인하우스
4th row매일경제신문사
5th row하이미디어
ValueCountFrequency (%)
동아사이언스 3
 
11.5%
지학사 2
 
7.7%
농민신문사 2
 
7.7%
교원문고 2
 
7.7%
좋은생각 1
 
3.8%
media 1
 
3.8%
b 1
 
3.8%
하이미디어 1
 
3.8%
cabooks 1
 
3.8%
허스트중앙 1
 
3.8%
Other values (11) 11
42.3%
2023-12-13T06:17:35.211185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
8.5%
6
 
4.6%
6
 
4.6%
5
 
3.8%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (64) 82
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108
83.1%
Uppercase Letter 10
 
7.7%
Lowercase Letter 10
 
7.7%
Space Separator 2
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
6
 
5.6%
5
 
4.6%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (47) 60
55.6%
Lowercase Letter
ValueCountFrequency (%)
a 2
20.0%
i 1
10.0%
d 1
10.0%
n 1
10.0%
p 1
10.0%
m 1
10.0%
o 1
10.0%
e 1
10.0%
y 1
10.0%
Uppercase Letter
ValueCountFrequency (%)
C 2
20.0%
B 2
20.0%
O 2
20.0%
K 1
10.0%
S 1
10.0%
M 1
10.0%
A 1
10.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 108
83.1%
Latin 20
 
15.4%
Common 2
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
6
 
5.6%
5
 
4.6%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (47) 60
55.6%
Latin
ValueCountFrequency (%)
C 2
 
10.0%
B 2
 
10.0%
O 2
 
10.0%
a 2
 
10.0%
i 1
 
5.0%
d 1
 
5.0%
n 1
 
5.0%
p 1
 
5.0%
m 1
 
5.0%
K 1
 
5.0%
Other values (6) 6
30.0%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 108
83.1%
ASCII 22
 
16.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
10.2%
6
 
5.6%
6
 
5.6%
5
 
4.6%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (47) 60
55.6%
ASCII
ValueCountFrequency (%)
2
 
9.1%
C 2
 
9.1%
B 2
 
9.1%
O 2
 
9.1%
a 2
 
9.1%
i 1
 
4.5%
d 1
 
4.5%
n 1
 
4.5%
p 1
 
4.5%
m 1
 
4.5%
Other values (7) 7
31.8%

Interactions

2023-12-13T06:17:32.801808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:17:35.294909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주제분야간기간행물명출판사
연번1.0000.8600.7210.8781.0000.910
주제0.8601.0000.9770.6811.0000.985
분야0.7210.9771.0000.9271.0000.979
간기0.8780.6810.9271.0001.0000.933
간행물명1.0001.0001.0001.0001.0001.000
출판사0.9100.9850.9790.9331.0001.000
2023-12-13T06:17:35.394776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주제간기
주제1.0000.452
간기0.4521.000
2023-12-13T06:17:35.461531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주제간기
연번1.0000.5920.461
주제0.5921.0000.452
간기0.4610.4521.000

Missing values

2023-12-13T06:17:32.923996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:17:33.026526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번주제분야간기간행물명출판사
01가정요리격월간매거진 FB Media Company
12가정육아격월간위매거진어라운드
23가정인테리어월간행복이 가득한 집디자인하우스
34경제경제주간매경 Economy매일경제신문사
45여행여행월간여행스케치하이미디어
56예술문화격월간디자인 매거진 CACABOOKS
67예술패션월간엘르허스트중앙
78의학건강월간건강 다이제스트건강다이제스트사
89인문사회계간뉴필로소퍼바다출판사
910인문교양월간전원생활농민신문사
연번주제분야간기간행물명출판사
1415어린이과학월간과학소년교원문고
1516어린이수학월간수학동아동아사이언스
1617어린이시사월간신나는 NIE 시사원정대동아이지에듀
1718어린이과학반월간어린이 과학동아동아사이언스
1819어린이수학반월간어린이 수학동아동아사이언스
1920어린이어린이월간어린이동산농민신문사
2021어린이논술월간우등생논술천재교육
2122어린이과학월간위즈키즈교원문고
2223어린이논술월간중학 독서평설지학사
2324어린이논술월간초등 독서평설지학사