Overview

Dataset statistics

Number of variables6
Number of observations51
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory51.6 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description해당 데이터는 인천광역시 남동구의 서창도서관 정기간행물 목록에 관련된 자료로서, 인천광영시 남동구 서창도서관 정기간행물 목록의 연번, 주제, 분야, 간기, 간행물명, 출판사의 정보를 확인할 수 있다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15103949&srcSe=7661IVAWM27C61E190

Alerts

주제 is highly overall correlated with 분야High correlation
분야 is highly overall correlated with 주제High correlation
연번 has unique valuesUnique
간행물명 has unique valuesUnique

Reproduction

Analysis started2024-03-18 05:16:22.125300
Analysis finished2024-03-18 05:16:22.666087
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2024-03-18T14:16:22.743097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.5
Q113.5
median26
Q338.5
95-th percentile48.5
Maximum51
Range50
Interquartile range (IQR)25

Descriptive statistics

Standard deviation14.866069
Coefficient of variation (CV)0.57177187
Kurtosis-1.2
Mean26
Median Absolute Deviation (MAD)13
Skewness0
Sum1326
Variance221
MonotonicityStrictly increasing
2024-03-18T14:16:22.876969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
2.0%
2 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
51 1
2.0%
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%

주제
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Memory size540.0 B
예술
11 
어린이
10 
기술과학
10 
사회과학
총류
Other values (4)

Length

Max length4
Median length3
Mean length3.0196078
Min length2

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row언어
2nd row사회과학
3rd row어린이
4th row사회과학
5th row총류

Common Values

ValueCountFrequency (%)
예술 11
21.6%
어린이 10
19.6%
기술과학 10
19.6%
사회과학 9
17.6%
총류 3
 
5.9%
문학 3
 
5.9%
자연과학 2
 
3.9%
종교 2
 
3.9%
언어 1
 
2.0%

Length

2024-03-18T14:16:23.027947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:16:23.158795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
예술 11
21.6%
어린이 10
19.6%
기술과학 10
19.6%
사회과학 9
17.6%
총류 3
 
5.9%
문학 3
 
5.9%
자연과학 2
 
3.9%
종교 2
 
3.9%
언어 1
 
2.0%

분야
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size540.0 B
어린이
10 
취미
시사
교양
독서
Other values (12)
20 

Length

Max length4
Median length2
Mean length2.3529412
Min length2

Unique

Unique7 ?
Unique (%)13.7%

Sample

1st row어학
2nd row경제
3rd row어린이
4th row교양
5th row독서

Common Values

ValueCountFrequency (%)
어린이 10
19.6%
취미 8
15.7%
시사 5
9.8%
교양 5
9.8%
독서 3
 
5.9%
인테리어 3
 
5.9%
경제 3
 
5.9%
여성 3
 
5.9%
패션 2
 
3.9%
과학 2
 
3.9%
Other values (7) 7
13.7%

Length

2024-03-18T14:16:23.287640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이 10
19.6%
취미 8
15.7%
시사 5
9.8%
교양 5
9.8%
독서 3
 
5.9%
인테리어 3
 
5.9%
경제 3
 
5.9%
여성 3
 
5.9%
과학 2
 
3.9%
패션 2
 
3.9%
Other values (7) 7
13.7%

간기
Categorical

Distinct5
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size540.0 B
월간
35 
계간
주간
격월간
격주간
 
1

Length

Max length3
Median length2
Mean length2.0980392
Min length2

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row월간
2nd row격월간
3rd row월간
4th row월간
5th row월간

Common Values

ValueCountFrequency (%)
월간 35
68.6%
계간 6
 
11.8%
주간 5
 
9.8%
격월간 4
 
7.8%
격주간 1
 
2.0%

Length

2024-03-18T14:16:23.419157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T14:16:23.519345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월간 35
68.6%
계간 6
 
11.8%
주간 5
 
9.8%
격월간 4
 
7.8%
격주간 1
 
2.0%

간행물명
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2024-03-18T14:16:23.725975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length13
Mean length6.7058824
Min length2

Characters and Unicode

Total characters342
Distinct characters170
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st rowEBS FM Radio 입이 트이는 영어
2nd rowMIT 테크놀로지 리뷰 코리아
3rd row개똥이네 놀이터
4th row건강다이제스트
5th row고교 독서평설
ValueCountFrequency (%)
독서평설 3
 
3.8%
과학동아 2
 
2.5%
ebs 1
 
1.3%
씨네21 1
 
1.3%
위즈키즈 1
 
1.3%
월간조선 1
 
1.3%
우먼센스 1
 
1.3%
우등생논술 1
 
1.3%
우등생과학 1
 
1.3%
엘르 1
 
1.3%
Other values (66) 66
83.5%
2024-03-18T14:16:24.099247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
 
8.2%
9
 
2.6%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.8%
6
 
1.8%
e 5
 
1.5%
5
 
1.5%
I 5
 
1.5%
Other values (160) 255
74.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 245
71.6%
Uppercase Letter 31
 
9.1%
Space Separator 28
 
8.2%
Lowercase Letter 26
 
7.6%
Decimal Number 4
 
1.2%
Open Punctuation 3
 
0.9%
Close Punctuation 3
 
0.9%
Other Punctuation 2
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
3.7%
8
 
3.3%
8
 
3.3%
7
 
2.9%
6
 
2.4%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (122) 184
75.1%
Uppercase Letter
ValueCountFrequency (%)
I 5
16.1%
E 4
12.9%
N 4
12.9%
B 2
 
6.5%
M 2
 
6.5%
F 2
 
6.5%
S 2
 
6.5%
G 2
 
6.5%
Q 1
 
3.2%
A 1
 
3.2%
Other values (6) 6
19.4%
Lowercase Letter
ValueCountFrequency (%)
e 5
19.2%
n 3
11.5%
a 3
11.5%
o 3
11.5%
t 2
 
7.7%
g 1
 
3.8%
k 1
 
3.8%
r 1
 
3.8%
h 1
 
3.8%
w 1
 
3.8%
Other values (5) 5
19.2%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 2
50.0%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
, 1
50.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 245
71.6%
Latin 57
 
16.7%
Common 40
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
3.7%
8
 
3.3%
8
 
3.3%
7
 
2.9%
6
 
2.4%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (122) 184
75.1%
Latin
ValueCountFrequency (%)
e 5
 
8.8%
I 5
 
8.8%
E 4
 
7.0%
N 4
 
7.0%
n 3
 
5.3%
a 3
 
5.3%
o 3
 
5.3%
B 2
 
3.5%
M 2
 
3.5%
F 2
 
3.5%
Other values (21) 24
42.1%
Common
ValueCountFrequency (%)
28
70.0%
( 3
 
7.5%
) 3
 
7.5%
2 2
 
5.0%
1 2
 
5.0%
& 1
 
2.5%
, 1
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 245
71.6%
ASCII 97
 
28.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28
28.9%
e 5
 
5.2%
I 5
 
5.2%
E 4
 
4.1%
N 4
 
4.1%
n 3
 
3.1%
( 3
 
3.1%
) 3
 
3.1%
a 3
 
3.1%
o 3
 
3.1%
Other values (28) 36
37.1%
Hangul
ValueCountFrequency (%)
9
 
3.7%
8
 
3.3%
8
 
3.3%
7
 
2.9%
6
 
2.4%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (122) 184
75.1%
Distinct44
Distinct (%)86.3%
Missing0
Missing (%)0.0%
Memory size540.0 B
2024-03-18T14:16:24.368286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length4.9411765
Min length2

Characters and Unicode

Total characters252
Distinct characters121
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)76.5%

Sample

1st row동아출판
2nd row디엠케이글로벌
3rd row보리
4th row다이제스트사
5th row지학사
ValueCountFrequency (%)
지학사 3
 
5.8%
동아사이언스 3
 
5.8%
디자인하우스 2
 
3.8%
서울문화사 2
 
3.8%
천재교육 2
 
3.8%
시대고시기획 1
 
1.9%
주택문화사 1
 
1.9%
아노락코리아 1
 
1.9%
에이엠아트 1
 
1.9%
어라운드 1
 
1.9%
Other values (35) 35
67.3%
2024-03-18T14:16:24.706161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
6.7%
12
 
4.8%
10
 
4.0%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
4
 
1.6%
4
 
1.6%
Other values (111) 173
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 240
95.2%
Uppercase Letter 5
 
2.0%
Decimal Number 4
 
1.6%
Open Punctuation 1
 
0.4%
Close Punctuation 1
 
0.4%
Space Separator 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
7.1%
12
 
5.0%
10
 
4.2%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (100) 161
67.1%
Uppercase Letter
ValueCountFrequency (%)
N 1
20.0%
K 1
20.0%
C 1
20.0%
M 1
20.0%
I 1
20.0%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 1
25.0%
4 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 240
95.2%
Common 7
 
2.8%
Latin 5
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
7.1%
12
 
5.0%
10
 
4.2%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (100) 161
67.1%
Common
ValueCountFrequency (%)
2 2
28.6%
( 1
14.3%
) 1
14.3%
1 1
14.3%
4 1
14.3%
1
14.3%
Latin
ValueCountFrequency (%)
N 1
20.0%
K 1
20.0%
C 1
20.0%
M 1
20.0%
I 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 240
95.2%
ASCII 12
 
4.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
7.1%
12
 
5.0%
10
 
4.2%
8
 
3.3%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (100) 161
67.1%
ASCII
ValueCountFrequency (%)
2 2
16.7%
( 1
8.3%
) 1
8.3%
1 1
8.3%
N 1
8.3%
K 1
8.3%
C 1
8.3%
M 1
8.3%
4 1
8.3%
1
8.3%

Interactions

2024-03-18T14:16:22.435582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T14:16:25.053461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주제분야간기간행물명출판사
연번1.0000.3640.3630.1451.0000.798
주제0.3641.0000.9790.4291.0000.884
분야0.3630.9791.0000.0001.0000.873
간기0.1450.4290.0001.0001.0000.000
간행물명1.0001.0001.0001.0001.0001.000
출판사0.7980.8840.8730.0001.0001.000
2024-03-18T14:16:25.155048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야주제간기
분야1.0000.8050.000
주제0.8051.0000.247
간기0.0000.2471.000
2024-03-18T14:16:25.232689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주제분야간기
연번1.0000.1500.1680.079
주제0.1501.0000.8050.247
분야0.1680.8051.0000.000
간기0.0790.2470.0001.000

Missing values

2024-03-18T14:16:22.529562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T14:16:22.622041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번주제분야간기간행물명출판사
01언어어학월간EBS FM Radio 입이 트이는 영어동아출판
12사회과학경제격월간MIT 테크놀로지 리뷰 코리아디엠케이글로벌
23어린이어린이월간개똥이네 놀이터보리
34사회과학교양월간건강다이제스트다이제스트사
45총류독서월간고교 독서평설지학사
56어린이어린이월간고래가 그랬어야간비행
67자연과학과학월간과학동아동아사이언스
78어린이어린이월간과학소년교원문고
89기술과학인테리어월간까사리빙까사리빙편집부
910자연과학과학월간뉴턴 Newton한국뉴턴
연번주제분야간기간행물명출판사
4142기술과학패션월간지큐 GQ두산매거진
4243문학문예계간창작과 비평창비
4344총류독서월간책 Chaeg(주)책
4445어린이어린이월간초등 독서평설지학사
4546기술과학요리월간커피(월간)아이비라인
4647예술취미계간프리즘오브프리즘오브프레스
4748예술취미격월간필로 FILO필로편집부
4849사회과학시사주간한겨레21한겨레신문사
4950사회과학경제주간한경비즈니스한국경제신문
5051기술과학인테리어월간행복이 가득한 집디자인하우스