Overview

Dataset statistics

Number of variables7
Number of observations542
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory29.8 KiB
Average record size in memory56.2 B

Variable types

Categorical2
Text4
DateTime1

Dataset

Description광주광역시 광산구 그림책 포털에 등록된 그림책 목록 정보
Author광주광역시 광산구
URLhttps://www.data.go.kr/data/15062714/fileData.do

Alerts

기준일자 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 02:03:23.276483
Analysis finished2023-12-12 02:03:24.624191
Duration1.35 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct8
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
가족
114 
인권
96 
생명
89 
평화(적극적)
60 
민주주의
60 
Other values (3)
123 

Length

Max length10
Median length2
Mean length3.9188192
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row생명
2nd row생명
3rd row생명
4th row생명
5th row생명

Common Values

ValueCountFrequency (%)
가족 114
21.0%
인권 96
17.7%
생명 89
16.4%
평화(적극적) 60
11.1%
민주주의 60
11.1%
20년 이상 그림책 60
11.1%
노동 35
 
6.5%
평화(소극적) 28
 
5.2%

Length

2023-12-12T11:03:24.746128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:03:24.934335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가족 114
17.2%
인권 96
14.5%
생명 89
13.4%
평화(적극적 60
9.1%
민주주의 60
9.1%
20년 60
9.1%
이상 60
9.1%
그림책 60
9.1%
노동 35
 
5.3%
평화(소극적 28
 
4.2%

제목
Text

Distinct504
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T11:03:25.427590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length23.5
Mean length8.6273063
Min length1

Characters and Unicode

Total characters4676
Distinct characters569
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique468 ?
Unique (%)86.3%

Sample

1st row우리 할머니가 자꾸만 작아져요
2nd row셋째 날
3rd row나도 까사모예요
4th row루나와 나
5th row엄마고향은 어디야?
ValueCountFrequency (%)
이야기 17
 
1.2%
아빠 17
 
1.2%
우리 14
 
1.0%
엄마 12
 
0.9%
내가 11
 
0.8%
10
 
0.7%
9
 
0.6%
9
 
0.6%
8
 
0.6%
우리는 8
 
0.6%
Other values (978) 1282
91.8%
2023-12-12T11:03:26.206451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
855
 
18.3%
145
 
3.1%
100
 
2.1%
83
 
1.8%
82
 
1.8%
77
 
1.6%
76
 
1.6%
76
 
1.6%
59
 
1.3%
56
 
1.2%
Other values (559) 3067
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3703
79.2%
Space Separator 855
 
18.3%
Other Punctuation 89
 
1.9%
Decimal Number 17
 
0.4%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Math Symbol 2
 
< 0.1%
Connector Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
145
 
3.9%
100
 
2.7%
83
 
2.2%
82
 
2.2%
77
 
2.1%
76
 
2.1%
76
 
2.1%
59
 
1.6%
56
 
1.5%
56
 
1.5%
Other values (541) 2893
78.1%
Other Punctuation
ValueCountFrequency (%)
, 30
33.7%
? 28
31.5%
! 26
29.2%
. 3
 
3.4%
/ 1
 
1.1%
: 1
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 8
47.1%
8 2
 
11.8%
5 2
 
11.8%
0 2
 
11.8%
6 2
 
11.8%
2 1
 
5.9%
Space Separator
ValueCountFrequency (%)
855
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3703
79.2%
Common 972
 
20.8%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
145
 
3.9%
100
 
2.7%
83
 
2.2%
82
 
2.2%
77
 
2.1%
76
 
2.1%
76
 
2.1%
59
 
1.6%
56
 
1.5%
56
 
1.5%
Other values (541) 2893
78.1%
Common
ValueCountFrequency (%)
855
88.0%
, 30
 
3.1%
? 28
 
2.9%
! 26
 
2.7%
1 8
 
0.8%
( 4
 
0.4%
) 4
 
0.4%
. 3
 
0.3%
~ 2
 
0.2%
8 2
 
0.2%
Other values (7) 10
 
1.0%
Latin
ValueCountFrequency (%)
X 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3700
79.1%
ASCII 973
 
20.8%
Compat Jamo 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
855
87.9%
, 30
 
3.1%
? 28
 
2.9%
! 26
 
2.7%
1 8
 
0.8%
( 4
 
0.4%
) 4
 
0.4%
. 3
 
0.3%
~ 2
 
0.2%
8 2
 
0.2%
Other values (8) 11
 
1.1%
Hangul
ValueCountFrequency (%)
145
 
3.9%
100
 
2.7%
83
 
2.2%
82
 
2.2%
77
 
2.1%
76
 
2.1%
76
 
2.1%
59
 
1.6%
56
 
1.5%
56
 
1.5%
Other values (538) 2890
78.1%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

그림
Text

Distinct411
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T11:03:26.741396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length5.304428
Min length2

Characters and Unicode

Total characters2875
Distinct characters362
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique330 ?
Unique (%)60.9%

Sample

1st row메르다드 차에리
2nd row성영란
3rd row장경혜
4th row제니 수 코스테키-쇼
5th row이진경
ValueCountFrequency (%)
이세 8
 
0.9%
히데코 8
 
0.9%
한병호 7
 
0.8%
하나네 6
 
0.7%
카이 6
 
0.7%
스미스 6
 
0.7%
토미 6
 
0.7%
이억배 5
 
0.6%
정승각 5
 
0.6%
조원희 5
 
0.6%
Other values (618) 795
92.8%
2023-12-12T11:03:27.447460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
315
 
11.0%
112
 
3.9%
86
 
3.0%
77
 
2.7%
51
 
1.8%
49
 
1.7%
41
 
1.4%
40
 
1.4%
38
 
1.3%
38
 
1.3%
Other values (352) 2028
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2521
87.7%
Space Separator 315
 
11.0%
Other Punctuation 21
 
0.7%
Uppercase Letter 10
 
0.3%
Dash Punctuation 8
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
112
 
4.4%
86
 
3.4%
77
 
3.1%
51
 
2.0%
49
 
1.9%
41
 
1.6%
40
 
1.6%
38
 
1.5%
38
 
1.5%
37
 
1.5%
Other values (340) 1952
77.4%
Uppercase Letter
ValueCountFrequency (%)
K 4
40.0%
R 2
20.0%
B 1
 
10.0%
H 1
 
10.0%
J 1
 
10.0%
C 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 10
47.6%
. 9
42.9%
· 1
 
4.8%
/ 1
 
4.8%
Space Separator
ValueCountFrequency (%)
315
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2521
87.7%
Common 344
 
12.0%
Latin 10
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
112
 
4.4%
86
 
3.4%
77
 
3.1%
51
 
2.0%
49
 
1.9%
41
 
1.6%
40
 
1.6%
38
 
1.5%
38
 
1.5%
37
 
1.5%
Other values (340) 1952
77.4%
Common
ValueCountFrequency (%)
315
91.6%
, 10
 
2.9%
. 9
 
2.6%
- 8
 
2.3%
· 1
 
0.3%
/ 1
 
0.3%
Latin
ValueCountFrequency (%)
K 4
40.0%
R 2
20.0%
B 1
 
10.0%
H 1
 
10.0%
J 1
 
10.0%
C 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2521
87.7%
ASCII 353
 
12.3%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
315
89.2%
, 10
 
2.8%
. 9
 
2.5%
- 8
 
2.3%
K 4
 
1.1%
R 2
 
0.6%
B 1
 
0.3%
/ 1
 
0.3%
H 1
 
0.3%
J 1
 
0.3%
Hangul
ValueCountFrequency (%)
112
 
4.4%
86
 
3.4%
77
 
3.1%
51
 
2.0%
49
 
1.9%
41
 
1.6%
40
 
1.6%
38
 
1.5%
38
 
1.5%
37
 
1.5%
Other values (340) 1952
77.4%
None
ValueCountFrequency (%)
· 1
100.0%


Text

Distinct431
Distinct (%)79.5%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T11:03:27.932938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length5.49631
Min length2

Characters and Unicode

Total characters2979
Distinct characters371
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique357 ?
Unique (%)65.9%

Sample

1st row잉카 팝스트
2nd row성영란
3rd row신옥희
4th row제니 수 코스테키-쇼
5th row노정임
ValueCountFrequency (%)
권정생 9
 
1.0%
루이스 7
 
0.8%
토미 7
 
0.8%
스필스베리 6
 
0.7%
엘리자베스 5
 
0.6%
데이비드 4
 
0.5%
버지니아 4
 
0.5%
로이 4
 
0.5%
소피 4
 
0.5%
강경수 4
 
0.5%
Other values (653) 833
93.9%
2023-12-12T11:03:28.564179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
347
 
11.6%
105
 
3.5%
95
 
3.2%
71
 
2.4%
53
 
1.8%
47
 
1.6%
46
 
1.5%
41
 
1.4%
39
 
1.3%
37
 
1.2%
Other values (361) 2098
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2597
87.2%
Space Separator 347
 
11.6%
Other Punctuation 20
 
0.7%
Uppercase Letter 11
 
0.4%
Dash Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
4.0%
95
 
3.7%
71
 
2.7%
53
 
2.0%
47
 
1.8%
46
 
1.8%
41
 
1.6%
39
 
1.5%
37
 
1.4%
37
 
1.4%
Other values (349) 2026
78.0%
Uppercase Letter
ValueCountFrequency (%)
K 4
36.4%
C 2
18.2%
B 1
 
9.1%
P 1
 
9.1%
A 1
 
9.1%
H 1
 
9.1%
J 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
, 11
55.0%
. 8
40.0%
· 1
 
5.0%
Space Separator
ValueCountFrequency (%)
347
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2597
87.2%
Common 371
 
12.5%
Latin 11
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
4.0%
95
 
3.7%
71
 
2.7%
53
 
2.0%
47
 
1.8%
46
 
1.8%
41
 
1.6%
39
 
1.5%
37
 
1.4%
37
 
1.4%
Other values (349) 2026
78.0%
Latin
ValueCountFrequency (%)
K 4
36.4%
C 2
18.2%
B 1
 
9.1%
P 1
 
9.1%
A 1
 
9.1%
H 1
 
9.1%
J 1
 
9.1%
Common
ValueCountFrequency (%)
347
93.5%
, 11
 
3.0%
. 8
 
2.2%
- 4
 
1.1%
· 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2597
87.2%
ASCII 381
 
12.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
347
91.1%
, 11
 
2.9%
. 8
 
2.1%
- 4
 
1.0%
K 4
 
1.0%
C 2
 
0.5%
B 1
 
0.3%
P 1
 
0.3%
A 1
 
0.3%
H 1
 
0.3%
Hangul
ValueCountFrequency (%)
105
 
4.0%
95
 
3.7%
71
 
2.7%
53
 
2.0%
47
 
1.8%
46
 
1.8%
41
 
1.6%
39
 
1.5%
37
 
1.4%
37
 
1.4%
Other values (349) 2026
78.0%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct155
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T11:03:28.899119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length3.8690037
Min length2

Characters and Unicode

Total characters2097
Distinct characters218
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)13.7%

Sample

1st row씨드북
2nd row반달
3rd row웅진주니어
4th row청어람아이
5th row웃는돌고래
ValueCountFrequency (%)
보림 41
 
7.2%
비룡소 40
 
7.1%
웅진주니어 30
 
5.3%
사계절 28
 
4.9%
길벗어린이 17
 
3.0%
시공주니어 16
 
2.8%
창비 14
 
2.5%
씨드북 11
 
1.9%
풀빛 11
 
1.9%
이야기꽃 11
 
1.9%
Other values (157) 347
61.3%
2023-12-12T11:03:29.360016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
 
5.2%
77
 
3.7%
67
 
3.2%
63
 
3.0%
63
 
3.0%
58
 
2.8%
57
 
2.7%
52
 
2.5%
50
 
2.4%
46
 
2.2%
Other values (208) 1456
69.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2055
98.0%
Space Separator 25
 
1.2%
Close Punctuation 5
 
0.2%
Open Punctuation 5
 
0.2%
Lowercase Letter 5
 
0.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
5.3%
77
 
3.7%
67
 
3.3%
63
 
3.1%
63
 
3.1%
58
 
2.8%
57
 
2.8%
52
 
2.5%
50
 
2.4%
46
 
2.2%
Other values (198) 1414
68.8%
Lowercase Letter
ValueCountFrequency (%)
i 1
20.0%
z 1
20.0%
d 1
20.0%
o 1
20.0%
m 1
20.0%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
I 1
50.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2055
98.0%
Common 35
 
1.7%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
5.3%
77
 
3.7%
67
 
3.3%
63
 
3.1%
63
 
3.1%
58
 
2.8%
57
 
2.8%
52
 
2.5%
50
 
2.4%
46
 
2.2%
Other values (198) 1414
68.8%
Latin
ValueCountFrequency (%)
K 1
14.3%
I 1
14.3%
i 1
14.3%
z 1
14.3%
d 1
14.3%
o 1
14.3%
m 1
14.3%
Common
ValueCountFrequency (%)
25
71.4%
) 5
 
14.3%
( 5
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2055
98.0%
ASCII 42
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
108
 
5.3%
77
 
3.7%
67
 
3.3%
63
 
3.1%
63
 
3.1%
58
 
2.8%
57
 
2.8%
52
 
2.5%
50
 
2.4%
46
 
2.2%
Other values (198) 1414
68.8%
ASCII
ValueCountFrequency (%)
25
59.5%
) 5
 
11.9%
( 5
 
11.9%
K 1
 
2.4%
I 1
 
2.4%
i 1
 
2.4%
z 1
 
2.4%
d 1
 
2.4%
o 1
 
2.4%
m 1
 
2.4%

출판연도
Categorical

Distinct29
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2018
152 
2017
105 
2016
35 
2015
32 
2014
26 
Other values (24)
192 

Length

Max length9
Median length4
Mean length4.0092251
Min length4

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row2017
2nd row2018
3rd row2011
4th row2017
5th row2018

Common Values

ValueCountFrequency (%)
2018 152
28.0%
2017 105
19.4%
2016 35
 
6.5%
2015 32
 
5.9%
2014 26
 
4.8%
1997 22
 
4.1%
1998 21
 
3.9%
2012 19
 
3.5%
2011 16
 
3.0%
2010 16
 
3.0%
Other values (19) 98
18.1%

Length

2023-12-12T11:03:29.510044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2018 152
28.0%
2017 105
19.4%
2016 35
 
6.5%
2015 32
 
5.9%
2014 26
 
4.8%
1997 22
 
4.1%
1998 21
 
3.9%
2012 19
 
3.5%
2011 16
 
3.0%
2010 16
 
3.0%
Other values (19) 98
18.1%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
Minimum2020-07-28 00:00:00
Maximum2020-07-28 00:00:00
2023-12-12T11:03:29.639326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:03:29.760600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T11:03:29.831322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분출판연도
구분1.0000.683
출판연도0.6831.000
2023-12-12T11:03:29.934252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분출판연도
구분1.0000.343
출판연도0.3431.000
2023-12-12T11:03:30.035310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분출판연도
구분1.0000.343
출판연도0.3431.000

Missing values

2023-12-12T11:03:24.362406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:03:24.544914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분제목그림출판사출판연도기준일자
0생명우리 할머니가 자꾸만 작아져요메르다드 차에리잉카 팝스트씨드북20172020-07-28
1생명셋째 날성영란성영란반달20182020-07-28
2생명나도 까사모예요장경혜신옥희웅진주니어20112020-07-28
3생명루나와 나제니 수 코스테키-쇼제니 수 코스테키-쇼청어람아이20172020-07-28
4생명엄마고향은 어디야?이진경노정임웃는돌고래20182020-07-28
5생명콰앙!조원희조원희시공주니어20182020-07-28
6생명별이 되기 전 머무는 집김휘리함연연나한기획20172020-07-28
7생명담장을 허물다김슬기공광규바우솔20172020-07-28
8생명알레나의 채소밭소피 비시에르소피 비시에르단추20172020-07-28
9생명유리병 속의 물은 이제 어디로 갈까?그레이엄 베이커 스미스그레이엄 베이커 스미스노란상상20182020-07-28
구분제목그림출판사출판연도기준일자
53220년 이상 그림책마고 할미조선경정 근보림19952020-07-28
53320년 이상 그림책갯벌이 좋아요유애로유애로보림19952020-07-28
53420년 이상 그림책연아연아 올라라김세온김명자보림19952020-07-28
53520년 이상 그림책만희네 집권윤덕권윤덕길벗어린이19952020-07-28
53620년 이상 그림책까막나라에서 온 삽사리정승각정승각초방책방19942020-07-28
53720년 이상 그림책눈사람이 된 풍선류재수류재수보림19942020-07-28
53820년 이상 그림책춤추는 호랑이이우경이우경국민서관19922020-07-28
53920년 이상 그림책사막의 공룡강우현타지마 신지한림출판사19922020-07-28
54020년 이상 그림책봄을 찾아 준 아기 원숭이강우현타지마 신지한림출판사19922020-07-28
54120년 이상 그림책백두산 이야기류재수류재수통나무19882020-07-28

Duplicate rows

Most frequently occurring

구분제목그림출판사출판연도기준일자# duplicates
0가족엄마의 초상화유지연유지연이야기꽃20142020-07-282