Overview

Dataset statistics

Number of variables6
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory51.7 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description해당 데이터는 인천광역시 남동구의 소래도서관 정기간행물 목록에 관련된 자료로서, 인천광영시 남동구 소래도서관 정기간행물 목록의 연번, 주제, 분야, 간기, 간행물명, 출판사의 정보를 확인할 수 있다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15103948&srcSe=7661IVAWM27C61E190

Alerts

주제 is highly overall correlated with 분야High correlation
분야 is highly overall correlated with 주제High correlation
간기 is highly imbalanced (51.9%)Imbalance
연번 has unique valuesUnique
간행물명 has unique valuesUnique

Reproduction

Analysis started2024-01-28 11:29:49.152583
Analysis finished2024-01-28 11:29:49.665567
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2024-01-28T20:29:49.731766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2024-01-28T20:29:49.846233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

주제
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size524.0 B
기술과학
14 
어린이
10 
사회과학
예술
총류
Other values (2)

Length

Max length4
Median length4
Mean length3.2244898
Min length2

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row언어
2nd row예술
3rd row어린이
4th row기술과학
5th row어린이

Common Values

ValueCountFrequency (%)
기술과학 14
28.6%
어린이 10
20.4%
사회과학 8
16.3%
예술 7
14.3%
총류 6
12.2%
자연과학 3
 
6.1%
언어 1
 
2.0%

Length

2024-01-28T20:29:49.950548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T20:29:50.038124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기술과학 14
28.6%
어린이 10
20.4%
사회과학 8
16.3%
예술 7
14.3%
총류 6
12.2%
자연과학 3
 
6.1%
언어 1
 
2.0%

분야
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)38.8%
Missing0
Missing (%)0.0%
Memory size524.0 B
어린이
10 
시사
취미
경제
스포츠
 
2
Other values (14)
20 

Length

Max length4
Median length2
Mean length2.3469388
Min length2

Unique

Unique8 ?
Unique (%)16.3%

Sample

1st row어학
2nd row취미
3rd row어린이
4th row건강
5th row어린이

Common Values

ValueCountFrequency (%)
어린이 10
20.4%
시사 7
14.3%
취미 6
12.2%
경제 4
 
8.2%
스포츠 2
 
4.1%
여성 2
 
4.1%
독서 2
 
4.1%
패션 2
 
4.1%
여행 2
 
4.1%
과학 2
 
4.1%
Other values (9) 10
20.4%

Length

2024-01-28T20:29:50.142028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이 10
20.4%
시사 7
14.3%
취미 6
12.2%
경제 4
 
8.2%
패션 2
 
4.1%
인테리어 2
 
4.1%
여행 2
 
4.1%
과학 2
 
4.1%
독서 2
 
4.1%
여성 2
 
4.1%
Other values (9) 10
20.4%

간기
Categorical

IMBALANCE 

Distinct4
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
월간
38 
주간
격월간
 
1
격주간
 
1

Length

Max length3
Median length2
Mean length2.0408163
Min length2

Unique

Unique2 ?
Unique (%)4.1%

Sample

1st row월간
2nd row월간
3rd row월간
4th row월간
5th row월간

Common Values

ValueCountFrequency (%)
월간 38
77.6%
주간 9
 
18.4%
격월간 1
 
2.0%
격주간 1
 
2.0%

Length

2024-01-28T20:29:50.239176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T20:29:50.310695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월간 38
77.6%
주간 9
 
18.4%
격월간 1
 
2.0%
격주간 1
 
2.0%

간행물명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-01-28T20:29:50.489778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length13
Mean length6.0408163
Min length2

Characters and Unicode

Total characters296
Distinct characters149
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st rowEBS FM Radio 입이 트이는 영어
2nd rowVDCM
3rd row개똥이네 놀이터
4th row건강 다이제스트
5th row고래가 그랬어
ValueCountFrequency (%)
주간 3
 
4.2%
코리아 2
 
2.8%
과학동아 2
 
2.8%
ebs 1
 
1.4%
위즈키즈 1
 
1.4%
우등생논술 1
 
1.4%
우등생과학 1
 
1.4%
매거진 1
 
1.4%
올리브 1
 
1.4%
연합이매진 1
 
1.4%
Other values (58) 58
80.6%
2024-01-28T20:29:50.798676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
7.8%
11
 
3.7%
10
 
3.4%
7
 
2.4%
6
 
2.0%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (139) 214
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 236
79.7%
Space Separator 23
 
7.8%
Uppercase Letter 16
 
5.4%
Lowercase Letter 10
 
3.4%
Decimal Number 4
 
1.4%
Open Punctuation 3
 
1.0%
Close Punctuation 3
 
1.0%
Other Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
4.7%
10
 
4.2%
7
 
3.0%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (114) 173
73.3%
Uppercase Letter
ValueCountFrequency (%)
E 3
18.8%
I 2
12.5%
N 2
12.5%
M 2
12.5%
C 1
 
6.2%
D 1
 
6.2%
V 1
 
6.2%
R 1
 
6.2%
F 1
 
6.2%
S 1
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
o 3
30.0%
n 1
 
10.0%
i 1
 
10.0%
d 1
 
10.0%
a 1
 
10.0%
m 1
 
10.0%
y 1
 
10.0%
c 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 2
50.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 236
79.7%
Common 34
 
11.5%
Latin 26
 
8.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
4.7%
10
 
4.2%
7
 
3.0%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (114) 173
73.3%
Latin
ValueCountFrequency (%)
E 3
 
11.5%
o 3
 
11.5%
I 2
 
7.7%
N 2
 
7.7%
M 2
 
7.7%
n 1
 
3.8%
C 1
 
3.8%
D 1
 
3.8%
V 1
 
3.8%
i 1
 
3.8%
Other values (9) 9
34.6%
Common
ValueCountFrequency (%)
23
67.6%
( 3
 
8.8%
) 3
 
8.8%
2 2
 
5.9%
1 2
 
5.9%
& 1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 236
79.7%
ASCII 60
 
20.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23
38.3%
( 3
 
5.0%
) 3
 
5.0%
E 3
 
5.0%
o 3
 
5.0%
I 2
 
3.3%
N 2
 
3.3%
2 2
 
3.3%
1 2
 
3.3%
M 2
 
3.3%
Other values (15) 15
25.0%
Hangul
ValueCountFrequency (%)
11
 
4.7%
10
 
4.2%
7
 
3.0%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (114) 173
73.3%
Distinct40
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Memory size524.0 B
2024-01-28T20:29:50.976889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length5.0408163
Min length2

Characters and Unicode

Total characters247
Distinct characters110
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)69.4%

Sample

1st row동아출판
2nd row미디어브리지
3rd row보리
4th row건강다이제스트사
5th row고래가그랬어
ValueCountFrequency (%)
한겨레신문사 3
 
6.1%
지학사 3
 
6.1%
동아사이언스 3
 
6.1%
동아일보사 2
 
4.1%
천재교육 2
 
4.1%
서울문화사 2
 
4.1%
경향신문사 1
 
2.0%
허스트중앙 1
 
2.0%
연합뉴스 1
 
2.0%
안그라픽스 1
 
2.0%
Other values (30) 30
61.2%
2024-01-28T20:29:51.267646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
7.7%
11
 
4.5%
10
 
4.0%
10
 
4.0%
9
 
3.6%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
Other values (100) 156
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 241
97.6%
Uppercase Letter 5
 
2.0%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
7.9%
11
 
4.6%
10
 
4.1%
10
 
4.1%
9
 
3.7%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
Other values (94) 150
62.2%
Uppercase Letter
ValueCountFrequency (%)
G 1
20.0%
O 1
20.0%
M 1
20.0%
K 1
20.0%
C 1
20.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 241
97.6%
Latin 5
 
2.0%
Common 1
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
7.9%
11
 
4.6%
10
 
4.1%
10
 
4.1%
9
 
3.7%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
Other values (94) 150
62.2%
Latin
ValueCountFrequency (%)
G 1
20.0%
O 1
20.0%
M 1
20.0%
K 1
20.0%
C 1
20.0%
Common
ValueCountFrequency (%)
& 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 241
97.6%
ASCII 6
 
2.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
7.9%
11
 
4.6%
10
 
4.1%
10
 
4.1%
9
 
3.7%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
Other values (94) 150
62.2%
ASCII
ValueCountFrequency (%)
& 1
16.7%
G 1
16.7%
O 1
16.7%
M 1
16.7%
K 1
16.7%
C 1
16.7%

Interactions

2024-01-28T20:29:49.471985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T20:29:51.343275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주제분야간기간행물명출판사
연번1.0000.5130.6430.5171.0000.913
주제0.5131.0000.9610.4281.0000.891
분야0.6430.9611.0000.0001.0000.932
간기0.5170.4280.0001.0001.0000.000
간행물명1.0001.0001.0001.0001.0001.000
출판사0.9130.8910.9320.0001.0001.000
2024-01-28T20:29:51.418843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야주제간기
분야1.0000.7210.000
주제0.7211.0000.293
간기0.0000.2931.000
2024-01-28T20:29:51.482073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주제분야간기
연번1.0000.2660.2320.330
주제0.2661.0000.7210.293
분야0.2320.7211.0000.000
간기0.3300.2930.0001.000

Missing values

2024-01-28T20:29:49.552094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T20:29:49.634163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번주제분야간기간행물명출판사
01언어어학월간EBS FM Radio 입이 트이는 영어동아출판
12예술취미월간VDCM미디어브리지
23어린이어린이월간개똥이네 놀이터보리
34기술과학건강월간건강 다이제스트건강다이제스트사
45어린이어린이월간고래가 그랬어고래가그랬어
56자연과학과학월간과학동아동아사이언스
67어린이어린이월간과학소년교원문고
78자연과학여행월간내셔널지오그래픽 트래블러에이지커뮤니케이션즈
89자연과학과학월간뉴턴아이뉴턴
910예술뮤지컬월간더 뮤지컬클립서비스
연번주제분야간기간행물명출판사
3940사회과학시사주간주간 경향경향신문사
4041사회과학시사주간주간 동아동아일보사
4142사회과학시사주간주간 조선조선일보사
4243기술과학남성월간지큐 코리아두산매거진
4344총류시사월간최근 이슈&상식시대고시기획
4445기술과학취미월간탑기어프린피아
4546예술여행월간트래비한국여행신문
4647사회과학시사주간한겨레21한겨레신문사
4748사회과학경제주간한경비즈니스한국경제신문
4849기술과학인테리어월간행복이가득한집디자인하우스