Overview

Dataset statistics

Number of variables6
Number of observations178
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.6 KiB
Average record size in memory49.7 B

Variable types

Text4
Numeric1
DateTime1

Dataset

Description충청남도 도정신문에 서평이 게시된 도서에 대한 데이터로, 도서명, 저자,출판사, 수록된 도정신문 회차 등의 내용을 담고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=301&beforeMenuCd=DOM_000000201001001000&publicdatapk=15095052

Alerts

도정신문 발행일 has unique valuesUnique
도정신문 호수 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:57:54.556770
Analysis finished2024-01-09 20:57:55.179331
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct177
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T05:57:55.446151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length18
Mean length11.410112
Min length1

Characters and Unicode

Total characters2031
Distinct characters411
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique176 ?
Unique (%)98.9%

Sample

1st row8체질 이야기
2nd row말의 품격
3rd row운다고 달라지는 일은 아무것도 없겠지만
4th row우리는 차별에 찬성합니다
5th row당신은 개를 키우면 안 된다
ValueCountFrequency (%)
5
 
0.9%
위한 4
 
0.7%
없다 4
 
0.7%
3
 
0.5%
모든 3
 
0.5%
사회 3
 
0.5%
나는 3
 
0.5%
기술 3
 
0.5%
않는다 3
 
0.5%
다시 3
 
0.5%
Other values (504) 546
94.1%
2024-01-10T05:57:55.883987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
402
 
19.8%
52
 
2.6%
49
 
2.4%
48
 
2.4%
42
 
2.1%
36
 
1.8%
29
 
1.4%
24
 
1.2%
23
 
1.1%
21
 
1.0%
Other values (401) 1305
64.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1571
77.4%
Space Separator 402
 
19.8%
Decimal Number 29
 
1.4%
Other Punctuation 13
 
0.6%
Uppercase Letter 11
 
0.5%
Lowercase Letter 2
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
3.3%
49
 
3.1%
48
 
3.1%
42
 
2.7%
36
 
2.3%
29
 
1.8%
24
 
1.5%
23
 
1.5%
21
 
1.3%
21
 
1.3%
Other values (374) 1226
78.0%
Uppercase Letter
ValueCountFrequency (%)
I 2
18.2%
Q 1
9.1%
A 1
9.1%
Z 1
9.1%
L 1
9.1%
O 1
9.1%
V 1
9.1%
E 1
9.1%
F 1
9.1%
B 1
9.1%
Decimal Number
ValueCountFrequency (%)
0 8
27.6%
1 7
24.1%
2 5
17.2%
9 4
13.8%
8 2
 
6.9%
5 2
 
6.9%
3 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 9
69.2%
: 2
 
15.4%
. 1
 
7.7%
? 1
 
7.7%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
v 1
50.0%
Space Separator
ValueCountFrequency (%)
402
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1571
77.4%
Common 447
 
22.0%
Latin 13
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
3.3%
49
 
3.1%
48
 
3.1%
42
 
2.7%
36
 
2.3%
29
 
1.8%
24
 
1.5%
23
 
1.5%
21
 
1.3%
21
 
1.3%
Other values (374) 1226
78.0%
Common
ValueCountFrequency (%)
402
89.9%
, 9
 
2.0%
0 8
 
1.8%
1 7
 
1.6%
2 5
 
1.1%
9 4
 
0.9%
8 2
 
0.4%
: 2
 
0.4%
5 2
 
0.4%
( 1
 
0.2%
Other values (5) 5
 
1.1%
Latin
ValueCountFrequency (%)
I 2
15.4%
s 1
7.7%
Q 1
7.7%
v 1
7.7%
A 1
7.7%
Z 1
7.7%
L 1
7.7%
O 1
7.7%
V 1
7.7%
E 1
7.7%
Other values (2) 2
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1571
77.4%
ASCII 460
 
22.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
402
87.4%
, 9
 
2.0%
0 8
 
1.7%
1 7
 
1.5%
2 5
 
1.1%
9 4
 
0.9%
8 2
 
0.4%
I 2
 
0.4%
: 2
 
0.4%
5 2
 
0.4%
Other values (17) 17
 
3.7%
Hangul
ValueCountFrequency (%)
52
 
3.3%
49
 
3.1%
48
 
3.1%
42
 
2.7%
36
 
2.3%
29
 
1.8%
24
 
1.5%
23
 
1.5%
21
 
1.3%
21
 
1.3%
Other values (374) 1226
78.0%
Distinct173
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T05:57:56.209047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length3
Mean length4.5786517
Min length2

Characters and Unicode

Total characters815
Distinct characters257
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique168 ?
Unique (%)94.4%

Sample

1st row주석원
2nd row이기주
3rd row박준
4th row오찬호
5th row강형욱
ValueCountFrequency (%)
7
 
2.7%
김초엽 3
 
1.2%
3
 
1.2%
피터 2
 
0.8%
정세랑 2
 
0.8%
김수현 2
 
0.8%
오찬호 2
 
0.8%
브라이언 2
 
0.8%
최원형 2
 
0.8%
b 2
 
0.8%
Other values (230) 231
89.5%
2024-01-10T05:57:56.655166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
80
 
9.8%
33
 
4.0%
31
 
3.8%
14
 
1.7%
12
 
1.5%
11
 
1.3%
10
 
1.2%
10
 
1.2%
9
 
1.1%
9
 
1.1%
Other values (247) 596
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 709
87.0%
Space Separator 80
 
9.8%
Other Punctuation 11
 
1.3%
Uppercase Letter 6
 
0.7%
Decimal Number 4
 
0.5%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
4.7%
31
 
4.4%
14
 
2.0%
12
 
1.7%
11
 
1.6%
10
 
1.4%
10
 
1.4%
9
 
1.3%
9
 
1.3%
8
 
1.1%
Other values (233) 562
79.3%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
A 2
33.3%
J 1
16.7%
M 1
16.7%
Decimal Number
ValueCountFrequency (%)
6 1
25.0%
8 1
25.0%
4 1
25.0%
1 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 7
63.6%
, 4
36.4%
Space Separator
ValueCountFrequency (%)
80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
w 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 709
87.0%
Common 99
 
12.1%
Latin 7
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
4.7%
31
 
4.4%
14
 
2.0%
12
 
1.7%
11
 
1.6%
10
 
1.4%
10
 
1.4%
9
 
1.3%
9
 
1.3%
8
 
1.1%
Other values (233) 562
79.3%
Common
ValueCountFrequency (%)
80
80.8%
. 7
 
7.1%
, 4
 
4.0%
( 2
 
2.0%
) 2
 
2.0%
6 1
 
1.0%
8 1
 
1.0%
4 1
 
1.0%
1 1
 
1.0%
Latin
ValueCountFrequency (%)
B 2
28.6%
A 2
28.6%
J 1
14.3%
M 1
14.3%
w 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 709
87.0%
ASCII 106
 
13.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
80
75.5%
. 7
 
6.6%
, 4
 
3.8%
B 2
 
1.9%
( 2
 
1.9%
A 2
 
1.9%
) 2
 
1.9%
J 1
 
0.9%
M 1
 
0.9%
6 1
 
0.9%
Other values (4) 4
 
3.8%
Hangul
ValueCountFrequency (%)
33
 
4.7%
31
 
4.4%
14
 
2.0%
12
 
1.7%
11
 
1.6%
10
 
1.4%
10
 
1.4%
9
 
1.3%
9
 
1.3%
8
 
1.1%
Other values (233) 562
79.3%
Distinct135
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T05:57:56.924726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.9719101
Min length1

Characters and Unicode

Total characters707
Distinct characters209
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)61.2%

Sample

1st row씨앗을뿌리는사람
2nd row황소북스
3rd row난다
4th row개마고원
5th row동아일보사
ValueCountFrequency (%)
창비 6
 
3.3%
문학동네 5
 
2.8%
위즈덤하우스 5
 
2.8%
부키 5
 
2.8%
한빛비즈 3
 
1.7%
쌤앤파커스 3
 
1.7%
어크로스 3
 
1.7%
사이언스북스 3
 
1.7%
아작 2
 
1.1%
한겨레출판사 2
 
1.1%
Other values (127) 143
79.4%
2024-01-10T05:57:57.310106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
6.9%
28
 
4.0%
27
 
3.8%
17
 
2.4%
14
 
2.0%
14
 
2.0%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
Other values (199) 516
73.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 702
99.3%
Space Separator 2
 
0.3%
Decimal Number 2
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
7.0%
28
 
4.0%
27
 
3.8%
17
 
2.4%
14
 
2.0%
14
 
2.0%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
Other values (195) 511
72.8%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 702
99.3%
Common 5
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
7.0%
28
 
4.0%
27
 
3.8%
17
 
2.4%
14
 
2.0%
14
 
2.0%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
Other values (195) 511
72.8%
Common
ValueCountFrequency (%)
2
40.0%
. 1
20.0%
1 1
20.0%
2 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 702
99.3%
ASCII 5
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
7.0%
28
 
4.0%
27
 
3.8%
17
 
2.4%
14
 
2.0%
14
 
2.0%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
Other values (195) 511
72.8%
ASCII
ValueCountFrequency (%)
2
40.0%
. 1
20.0%
1 1
20.0%
2 1
20.0%

발행연도
Real number (ℝ)

Distinct18
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.3539
Minimum1996
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-01-10T05:57:57.427866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1996
5-th percentile2011
Q12017
median2019
Q32021
95-th percentile2022
Maximum2023
Range27
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.6950893
Coefficient of variation (CV)0.001830744
Kurtosis8.313596
Mean2018.3539
Median Absolute Deviation (MAD)2
Skewness-2.2298806
Sum359267
Variance13.653685
MonotonicityNot monotonic
2024-01-10T05:57:57.533862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
2021 29
16.3%
2019 25
14.0%
2020 25
14.0%
2018 21
11.8%
2022 20
11.2%
2017 18
10.1%
2016 12
6.7%
2011 6
 
3.4%
2023 5
 
2.8%
2014 4
 
2.2%
Other values (8) 13
7.3%
ValueCountFrequency (%)
1996 1
 
0.6%
2005 1
 
0.6%
2006 1
 
0.6%
2007 1
 
0.6%
2010 2
 
1.1%
2011 6
3.4%
2012 1
 
0.6%
2013 3
1.7%
2014 4
2.2%
2015 3
1.7%
ValueCountFrequency (%)
2023 5
 
2.8%
2022 20
11.2%
2021 29
16.3%
2020 25
14.0%
2019 25
14.0%
2018 21
11.8%
2017 18
10.1%
2016 12
6.7%
2015 3
 
1.7%
2014 4
 
2.2%
Distinct178
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
Minimum2018-01-15 00:00:00
Maximum2023-08-05 00:00:00
2024-01-10T05:57:57.637355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:57:57.748998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

도정신문 호수
Text

UNIQUE 

Distinct178
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T05:57:58.042537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters712
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique178 ?
Unique (%)100.0%

Sample

1st row800호
2nd row801호
3rd row802호
4th row803호
5th row804호
ValueCountFrequency (%)
800호 1
 
0.6%
934호 1
 
0.6%
914호 1
 
0.6%
923호 1
 
0.6%
915호 1
 
0.6%
916호 1
 
0.6%
917호 1
 
0.6%
918호 1
 
0.6%
919호 1
 
0.6%
920호 1
 
0.6%
Other values (168) 168
94.4%
2024-01-10T05:57:58.489679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
178
25.0%
8 127
17.8%
9 107
15.0%
5 38
 
5.3%
3 38
 
5.3%
6 38
 
5.3%
7 38
 
5.3%
0 37
 
5.2%
4 37
 
5.2%
2 37
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 534
75.0%
Other Letter 178
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 127
23.8%
9 107
20.0%
5 38
 
7.1%
3 38
 
7.1%
6 38
 
7.1%
7 38
 
7.1%
0 37
 
6.9%
4 37
 
6.9%
2 37
 
6.9%
1 37
 
6.9%
Other Letter
ValueCountFrequency (%)
178
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 534
75.0%
Hangul 178
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 127
23.8%
9 107
20.0%
5 38
 
7.1%
3 38
 
7.1%
6 38
 
7.1%
7 38
 
7.1%
0 37
 
6.9%
4 37
 
6.9%
2 37
 
6.9%
1 37
 
6.9%
Hangul
ValueCountFrequency (%)
178
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 534
75.0%
Hangul 178
 
25.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
178
100.0%
ASCII
ValueCountFrequency (%)
8 127
23.8%
9 107
20.0%
5 38
 
7.1%
3 38
 
7.1%
6 38
 
7.1%
7 38
 
7.1%
0 37
 
6.9%
4 37
 
6.9%
2 37
 
6.9%
1 37
 
6.9%

Interactions

2024-01-10T05:57:54.945773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-01-10T05:57:55.041864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:57:55.136274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서명저자명출판사발행연도도정신문 발행일도정신문 호수
08체질 이야기주석원씨앗을뿌리는사람20072018-01-15800호
1말의 품격이기주황소북스20172018-01-25801호
2운다고 달라지는 일은 아무것도 없겠지만박준난다20172018-02-05802호
3우리는 차별에 찬성합니다오찬호개마고원20132018-02-25803호
4당신은 개를 키우면 안 된다강형욱동아일보사20142018-03-05804호
5아무것도 아닌 지금은 없다김동혁쌤앤파커스20172018-03-15805호
6이 모든 극적인 순간들윤대녕푸르메20102018-03-25806호
7LOVE, 사랑에 대해 알아야 할 모든 것A. M. 파인스다산초당20052018-04-05807호
8손빈병법손빈(이병호 옮김)홍익출한사19962018-04-15808호
9신경 끄기의 기술마크 맨슨갤리온20172018-05-05810호
도서명저자명출판사발행연도도정신문 발행일도정신문 호수
168여성을 모욕하는 걸작들한승혜 등 8인문예출판사20232023-05-05971호
169아Q정전루쉰문학동네20112023-05-15972호
170누가 알려주지 않아도 난유지향산지니20222023-05-25973호
171다윈 지능최재천사이언스북스20232023-06-05974호
172그림으로 풀어 쓴 황제내경지토 편집부김영사20132023-06-15975호
173영화를 빨리 감기로 보는 사람들현대지성이나다 도요시20222023-06-25976호
174하얼빈김훈문학동네20222023-07-05977호
175이제 나가서 사람 좀 만나려고요제시카 팬부키20222023-07-15978호
176왜 아가리로만 할까박정한 외들녘20212023-07-25979호
177다 똑같이 살 순 없잖아김가지(김예지)다크호스20232023-08-05980호