Overview

Dataset statistics

Number of variables: 6
Number of observations: 151
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.5 KiB
Average record size in memory: 50.9 B

Variable types

Numeric: 1
Text: 3
Categorical: 2

Dataset

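Description: Data on new arrivals at Jeonggwan Children's Library in Gijang-gun, Busan Metropolitan City: title (서명), author (저작자), publisher (발행자), publication year (발행년), and reading room name (자료실명).
Author: Gijang-gun, Busan Metropolitan City
URL: https://www.data.go.kr/data/15060476/fileData.do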

Alerts

순번 is highly overall correlated with 자료실명 [High correlation]
자료실명 is highly overall correlated with 순번 [High correlation]
발행년 is highly imbalanced (77.9%) [Imbalance]
순번 has unique values [Unique]
서명 has unique values [Unique]

Reproduction

Analysis started: 2024-03-16 06:45:17.025507
Analysis finished: 2024-03-16 06:45:19.289536
Duration: 2.26 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
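
The reported duration can be cross-checked from the two timestamps. The following is a minimal sketch using only the Python standard library; the format string is an assumption based on how the timestamps are printed in this report.

```python
from datetime import datetime

# Assumed timestamp layout, matching the values printed in this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-16 06:45:17.025507", FMT)
finished = datetime.strptime("2024-03-16 06:45:19.289536", FMT)

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```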

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 151
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 76
Minimum: 1
Maximum: 151
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 8.5
Q1: 38.5
Median: 76
Q3: 113.5
95-th percentile: 143.5
Maximum: 151
Range: 150
Interquartile range (IQR): 75

Descriptive statistics

Standard deviation: 43.734045
Coefficient of variation (CV): 0.57544796
Kurtosis: -1.2
Mean: 76
Median Absolute Deviation (MAD): 38
Skewness: 0
Sum: 11476
Variance: 1912.6667
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
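
The statistics reported for 순번 are consistent with a plain row counter running from 1 to 151. The sketch below recomputes them with the Python standard library under that assumption (the raw column itself is not reproduced in this report). It uses the sample variance (n - 1 denominator) and linearly interpolated quantiles, which is an assumption about how the report computes them.

```python
import statistics

# Assumption: 순번 is the strictly increasing sequence 1..151.
values = list(range(1, 152))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Median absolute deviation around the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Quantiles with linear interpolation over the closed range [min, max].
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]
iqr = q3 - q1

# Each value is smaller than its successor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, round(variance, 4), round(std, 6), round(cv, 8))
print(p5, q1, q2, q3, p95, iqr, mad, strictly_increasing)
```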
Value | Count | Frequency (%)
1 | 1 | 0.7%
105 | 1 | 0.7%
98 | 1 | 0.7%
99 | 1 | 0.7%
100 | 1 | 0.7%
101 | 1 | 0.7%
102 | 1 | 0.7%
103 | 1 | 0.7%
104 | 1 | 0.7%
106 | 1 | 0.7%
Other values (141) | 141 | 93.4%

Value | Count | Frequency (%)
1 | 1 | 0.7%
2 | 1 | 0.7%
3 | 1 | 0.7%
4 | 1 | 0.7%
5 | 1 | 0.7%
6 | 1 | 0.7%
7 | 1 | 0.7%
8 | 1 | 0.7%
9 | 1 | 0.7%
10 | 1 | 0.7%

Value | Count | Frequency (%)
151 | 1 | 0.7%
150 | 1 | 0.7%
149 | 1 | 0.7%
148 | 1 | 0.7%
147 | 1 | 0.7%
146 | 1 | 0.7%
145 | 1 | 0.7%
144 | 1 | 0.7%
143 | 1 | 0.7%
142 | 1 | 0.7%

서명
Text

UNIQUE 

Distinct: 151
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 55
Median length: 36
Mean length: 18.10596
Min length: 3

Characters and Unicode

Total characters: 2734
Distinct characters: 491
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 151
Unique (%): 100.0%

Sample

1st row: 100명의 산타클로스
2nd row: 가끔은 혼자가 좋아
3rd row: 개념연결만화 수학교과서 초등5학년
4th row: 개념연결만화 수학교과서 초등6학년
5th row: 갯벌 댄스 경연대회
ValueCountFrequency (%)
10
 
1.4%
2 9
 
1.2%
그림책 7
 
0.9%
산타 5
 
0.7%
4 5
 
0.7%
크리스마스 4
 
0.5%
3 4
 
0.5%
1 4
 
0.5%
초등 4
 
0.5%
베이커리 4
 
0.5%
Other values (593) 682
92.4%
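
A word-frequency table of this kind can be approximated by splitting each title into tokens and counting them. The sketch below uses only the five sample titles listed for this variable and splits on whitespace. That tokenization is an assumption: the table above lists bare digits and the word 초등 as separate tokens, so the report evidently splits more finely than this.

```python
from collections import Counter

# The five sample titles shown for 서명; the full column is not reproduced here.
titles = [
    "100명의 산타클로스",
    "가끔은 혼자가 좋아",
    "개념연결만화 수학교과서 초등5학년",
    "개념연결만화 수학교과서 초등6학년",
    "갯벌 댄스 경연대회",
]

# Assumption: words are whitespace-separated tokens.
word_counts = Counter(word for title in titles for word in title.split())
total_words = sum(word_counts.values())

# Print the most frequent tokens with their share of all tokens.
for word, count in word_counts.most_common(3):
    print(f"{word} | {count} | {count / total_words:.1%}")
```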

Most occurring characters

ValueCountFrequency (%)
643
 
23.5%
51
 
1.9%
44
 
1.6%
. 41
 
1.5%
34
 
1.2%
, 33
 
1.2%
32
 
1.2%
30
 
1.1%
29
 
1.1%
27
 
1.0%
Other values (481) 1770
64.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1918 | 70.2%
Space Separator | 643 | 23.5%
Decimal Number | 85 | 3.1%
Other Punctuation | 79 | 2.9%
Lowercase Letter | 4 | 0.1%
Math Symbol | 2 | 0.1%
Uppercase Letter | 1 | < 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%
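
The category names in this table are Unicode general categories (Other Letter is `Lo`, Space Separator is `Zs`, Decimal Number is `Nd`, and so on). As a rough illustration, the sketch below counts general categories with Python's `unicodedata` module. The helper name `category_counts` is hypothetical, and only the first sample title is used as input.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count Unicode general category codes (e.g. 'Lo', 'Zs', 'Nd') over all characters."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


# First sample title of 서명: three digits, one space, seven Hangul syllables.
counts = category_counts(["100명의 산타클로스"])
for code, n in counts.most_common():
    print(code, n)
```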

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
2.7%
44
 
2.3%
34
 
1.8%
32
 
1.7%
30
 
1.6%
29
 
1.5%
27
 
1.4%
25
 
1.3%
25
 
1.3%
24
 
1.3%
Other values (458) 1597
83.3%
Decimal Number
ValueCountFrequency (%)
2 16
18.8%
1 16
18.8%
0 12
14.1%
4 9
10.6%
5 7
8.2%
3 6
 
7.1%
7 6
 
7.1%
9 5
 
5.9%
6 5
 
5.9%
8 3
 
3.5%
Other Punctuation
ValueCountFrequency (%)
. 41
51.9%
, 33
41.8%
' 2
 
2.5%
/ 1
 
1.3%
& 1
 
1.3%
* 1
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
s 2
50.0%
v 2
50.0%
Space Separator
ValueCountFrequency (%)
643
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
O 1
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
] 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1918 | 70.2%
Common | 811 | 29.7%
Latin | 5 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
2.7%
44
 
2.3%
34
 
1.8%
32
 
1.7%
30
 
1.6%
29
 
1.5%
27
 
1.4%
25
 
1.3%
25
 
1.3%
24
 
1.3%
Other values (458) 1597
83.3%
Common
ValueCountFrequency (%)
643
79.3%
. 41
 
5.1%
, 33
 
4.1%
2 16
 
2.0%
1 16
 
2.0%
0 12
 
1.5%
4 9
 
1.1%
5 7
 
0.9%
3 6
 
0.7%
7 6
 
0.7%
Other values (10) 22
 
2.7%
Latin
ValueCountFrequency (%)
s 2
40.0%
v 2
40.0%
O 1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1915 | 70.0%
ASCII | 816 | 29.8%
Compat Jamo | 3 | 0.1%
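
Block membership depends only on a character's code point. The sketch below is a hypothetical lookup covering just the three blocks named in this table. The labels copy the report's wording, the ranges are the standard Unicode block boundaries, and returning "None" for anything else is an assumption about how unlisted blocks are labelled.

```python
from collections import Counter

# (label, first code point, last code point); labels mirror this report's wording.
BLOCKS = [
    ("ASCII", 0x0000, 0x007F),        # Basic Latin
    ("Compat Jamo", 0x3130, 0x318F),  # Hangul Compatibility Jamo
    ("Hangul", 0xAC00, 0xD7AF),       # Hangul Syllables
]


def block_of(ch):
    """Return the block label for one character, or 'None' if it is outside the listed ranges."""
    cp = ord(ch)
    for name, first, last in BLOCKS:
        if first <= cp <= last:
            return name
    return "None"


# Block counts for the first sample title of 서명.
block_counts = Counter(block_of(ch) for ch in "100명의 산타클로스")
print(block_counts)
```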

Most frequent character per block

ASCII
ValueCountFrequency (%)
643
78.8%
. 41
 
5.0%
, 33
 
4.0%
2 16
 
2.0%
1 16
 
2.0%
0 12
 
1.5%
4 9
 
1.1%
5 7
 
0.9%
3 6
 
0.7%
7 6
 
0.7%
Other values (13) 27
 
3.3%
Hangul
ValueCountFrequency (%)
51
 
2.7%
44
 
2.3%
34
 
1.8%
32
 
1.7%
30
 
1.6%
29
 
1.5%
27
 
1.4%
25
 
1.3%
25
 
1.3%
24
 
1.3%
Other values (455) 1594
83.2%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

저작자
Text

Distinct: 138
Distinct (%): 91.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 41
Median length: 31
Mean length: 16.490066
Min length: 5

Characters and Unicode

Total characters: 2490
Distinct characters: 278
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 129
Unique (%): 85.4%

Sample

1st row: 다니구치 도모노리 글·그림 황세정 옮김
2nd row: 에이미 헤스트 글 필립 스테드 그림 김선희 옮김
3rd row: 전국수학교사모임 초등수학사전팀 원작 최수일,유대현[공]글 김석 그림
4th row: 전국수학교사모임 초등수학사전팀 원작 최수일,최미라 [공]글 김석 그림
5th row: 박상희 글 송민선 그림
ValueCountFrequency (%)
그림 100
 
14.6%
82
 
12.0%
옮김 40
 
5.9%
지음 38
 
5.6%
글·그림 16
 
2.3%
원작 11
 
1.6%
공]글 7
 
1.0%
윤영 4
 
0.6%
안영은 4
 
0.6%
쏘울크리에이티브 4
 
0.6%
Other values (330) 377
55.2%

Most occurring characters

ValueCountFrequency (%)
686
27.6%
122
 
4.9%
119
 
4.8%
106
 
4.3%
84
 
3.4%
53
 
2.1%
49
 
2.0%
42
 
1.7%
40
 
1.6%
29
 
1.2%
Other values (268) 1160
46.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1742 | 70.0%
Space Separator | 686 | 27.6%
Other Punctuation | 32 | 1.3%
Close Punctuation | 13 | 0.5%
Open Punctuation | 13 | 0.5%
Uppercase Letter | 4 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
7.0%
119
 
6.8%
106
 
6.1%
84
 
4.8%
53
 
3.0%
49
 
2.8%
42
 
2.4%
40
 
2.3%
29
 
1.7%
28
 
1.6%
Other values (260) 1070
61.4%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
E 1
25.0%
H 1
25.0%
Other Punctuation
ValueCountFrequency (%)
· 17
53.1%
, 15
46.9%
Space Separator
ValueCountFrequency (%)
686
100.0%
Close Punctuation
ValueCountFrequency (%)
] 13
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 13
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1742 | 70.0%
Common | 744 | 29.9%
Latin | 4 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
7.0%
119
 
6.8%
106
 
6.1%
84
 
4.8%
53
 
3.0%
49
 
2.8%
42
 
2.4%
40
 
2.3%
29
 
1.7%
28
 
1.6%
Other values (260) 1070
61.4%
Common
ValueCountFrequency (%)
686
92.2%
· 17
 
2.3%
, 15
 
2.0%
] 13
 
1.7%
[ 13
 
1.7%
Latin
ValueCountFrequency (%)
A 2
50.0%
E 1
25.0%
H 1
25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1742 | 70.0%
ASCII | 731 | 29.4%
None | 17 | 0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
686
93.8%
, 15
 
2.1%
] 13
 
1.8%
[ 13
 
1.8%
A 2
 
0.3%
E 1
 
0.1%
H 1
 
0.1%
Hangul
ValueCountFrequency (%)
122
 
7.0%
119
 
6.8%
106
 
6.1%
84
 
4.8%
53
 
3.0%
49
 
2.8%
42
 
2.4%
40
 
2.3%
29
 
1.7%
28
 
1.6%
Other values (260) 1070
61.4%
None
ValueCountFrequency (%)
· 17
100.0%

발행자
Text

Distinct: 91
Distinct (%): 60.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 11
Median length: 7
Mean length: 4.1854305
Min length: 2

Characters and Unicode

Total characters: 632
Distinct characters: 169
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
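
The mean length reported for each Text variable is its total character count divided by the 151 observations. The sketch below checks that arithmetic, assuming the three Text variables are 서명, 저작자 and 발행자 in the order they appear in the sample table.

```python
N_ROWS = 151  # "Number of observations" from the overview

# Total characters as reported for each Text variable (variable order is an assumption).
total_characters = {"서명": 2734, "저작자": 2490, "발행자": 632}

# Mean length = total characters / number of rows.
mean_length = {name: total / N_ROWS for name, total in total_characters.items()}

for name, value in mean_length.items():
    print(f"{name}: {value:.7f}")
```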

Unique

Unique: 64
Unique (%): 42.4%

Sample

1st row: 주니어김영사
2nd row: 한빛에듀
3rd row: 비아에듀
4th row: 비아에듀
5th row: 어린이가문비
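
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of the difference, using only the five sample rows above rather than the full column:

```python
from collections import Counter

# The five sample rows shown above; 비아에듀 appears twice.
publishers = ["주니어김영사", "한빛에듀", "비아에듀", "비아에듀", "어린이가문비"]

counts = Counter(publishers)
distinct = len(counts)                            # number of different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(f"distinct={distinct}, unique={unique}")
```

On the full column the same logic would yield the 91 distinct and 64 unique values reported here, provided the report uses these definitions.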
Value | Count | Frequency (%)
주니어김영사 | 9 | 5.9%
다산어린이 | 7 | 4.6%
창비 | 6 | 3.9%
한솔수북 | 5 | 3.3%
비룡소 | 5 | 3.3%
서울문화사 | 4 | 2.6%
웅진주니어 | 4 | 2.6%
위즈덤하우스 | 4 | 2.6%
아이세움 | 4 | 2.6%
아울북 | 3 | 2.0%
Other values (82) | 101 | 66.4%

Most occurring characters

ValueCountFrequency (%)
33
 
5.2%
27
 
4.3%
24
 
3.8%
21
 
3.3%
20
 
3.2%
19
 
3.0%
19
 
3.0%
17
 
2.7%
15
 
2.4%
14
 
2.2%
Other values (159) 423
66.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 606 | 95.9%
Lowercase Letter | 13 | 2.1%
Uppercase Letter | 12 | 1.9%
Space Separator | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
5.4%
27
 
4.5%
24
 
4.0%
21
 
3.5%
20
 
3.3%
19
 
3.1%
19
 
3.1%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (143) 397
65.5%
Lowercase Letter
ValueCountFrequency (%)
o 3
23.1%
i 2
15.4%
k 2
15.4%
u 1
 
7.7%
r 1
 
7.7%
n 1
 
7.7%
j 1
 
7.7%
a 1
 
7.7%
s 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
R 3
25.0%
K 3
25.0%
H 3
25.0%
F 1
 
8.3%
B 1
 
8.3%
N 1
 
8.3%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 606 | 95.9%
Latin | 25 | 4.0%
Common | 1 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
5.4%
27
 
4.5%
24
 
4.0%
21
 
3.5%
20
 
3.3%
19
 
3.1%
19
 
3.1%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (143) 397
65.5%
Latin
ValueCountFrequency (%)
R 3
12.0%
K 3
12.0%
H 3
12.0%
o 3
12.0%
i 2
 
8.0%
k 2
 
8.0%
u 1
 
4.0%
r 1
 
4.0%
n 1
 
4.0%
j 1
 
4.0%
Other values (5) 5
20.0%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 606 | 95.9%
ASCII | 26 | 4.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33
 
5.4%
27
 
4.5%
24
 
4.0%
21
 
3.5%
20
 
3.3%
19
 
3.1%
19
 
3.1%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (143) 397
65.5%
ASCII
ValueCountFrequency (%)
R 3
11.5%
K 3
11.5%
H 3
11.5%
o 3
11.5%
i 2
 
7.7%
k 2
 
7.7%
u 1
 
3.8%
r 1
 
3.8%
n 1
 
3.8%
j 1
 
3.8%
Other values (6) 6
23.1%

발행년
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
2023 | 141
2022 | 6
2019 | 2
2021 | 2

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023
2nd row: 2023
3rd row: 2019
4th row: 2019
5th row: 2023

Common Values

Value | Count | Frequency (%)
2023 | 141 | 93.4%
2022 | 6 | 4.0%
2019 | 2 | 1.3%
2021 | 2 | 1.3%
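
The 77.9% imbalance figure in the alerts can be reproduced from these four counts if the score is taken to be one minus the Shannon entropy of the class distribution, normalised by the maximum entropy for that number of classes. The sketch below implements that formula. Treating it as the report's exact method is an assumption, and the function name `imbalance_score` is illustrative.

```python
import math

# Value counts for 발행년 as listed under Common Values.
year_counts = {"2023": 141, "2022": 6, "2019": 2, "2021": 2}


def imbalance_score(class_counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(list(year_counts.values()))
print(f"{score:.1%}")
```

Because 141 of the 151 rows share the value 2023, the entropy is far below its maximum of 2 bits for four classes, which is what pushes the score high.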

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023 | 141 | 93.4%
2022 | 6 | 4.0%
2019 | 2 | 1.3%
2021 | 2 | 1.3%

자료실명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
정관어린이도서관 아동자료실 | 103
정관어린이도서관 유아자료실 | 48

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 정관어린이도서관 유아자료실
2nd row: 정관어린이도서관 유아자료실
3rd row: 정관어린이도서관 아동자료실
4th row: 정관어린이도서관 아동자료실
5th row: 정관어린이도서관 아동자료실

Common Values

Value | Count | Frequency (%)
정관어린이도서관 아동자료실 | 103 | 68.2%
정관어린이도서관 유아자료실 | 48 | 31.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
정관어린이도서관 | 151 | 50.0%
아동자료실 | 103 | 34.1%
유아자료실 | 48 | 15.9%

Interactions


Correlations

Variable | 순번 | 발행자 | 발행년 | 자료실명
순번 | 1.000 | 0.775 | 0.219 | 0.938
발행자 | 0.775 | 1.000 | 0.749 | 0.736
발행년 | 0.219 | 0.749 | 1.000 | 0.000
자료실명 | 0.938 | 0.736 | 0.000 | 1.000

Variable | 자료실명 | 발행년
자료실명 | 1.000 | 0.000
발행년 | 0.000 | 1.000

Variable | 순번 | 발행년 | 자료실명
순번 | 1.000 | 0.169 | 0.776
발행년 | 0.169 | 1.000 | 0.000
자료실명 | 0.776 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 순번 | 서명 | 저작자 | 발행자 | 발행년 | 자료실명
0 | 1 | 100명의 산타클로스 | 다니구치 도모노리 글·그림 황세정 옮김 | 주니어김영사 | 2023 | 정관어린이도서관 유아자료실
1 | 2 | 가끔은 혼자가 좋아 | 에이미 헤스트 글 필립 스테드 그림 김선희 옮김 | 한빛에듀 | 2023 | 정관어린이도서관 유아자료실
2 | 3 | 개념연결만화 수학교과서 초등5학년 | 전국수학교사모임 초등수학사전팀 원작 최수일,유대현[공]글 김석 그림 | 비아에듀 | 2019 | 정관어린이도서관 아동자료실
3 | 4 | 개념연결만화 수학교과서 초등6학년 | 전국수학교사모임 초등수학사전팀 원작 최수일,최미라 [공]글 김석 그림 | 비아에듀 | 2019 | 정관어린이도서관 아동자료실
4 | 5 | 갯벌 댄스 경연대회 | 박상희 글 송민선 그림 | 어린이가문비 | 2023 | 정관어린이도서관 아동자료실
5 | 6 | 기네스 세계 기록 2024 | 기네스 세계기록 지음 김미선 옮김 | 비룡소 | 2023 | 정관어린이도서관 아동자료실
6 | 7 | 무너진 자세를 바로 세우는기적의 자세요정 | 자세요정 지음 | 다산라이프 | 2023 | 정관어린이도서관 아동자료실
7 | 8 | 긱블 제작소. 01, 녹로와 치킨 발사기 | 긱블 원작·감수,박송이 글 팀키즈 그림 | 아이세움 | 2023 | 정관어린이도서관 아동자료실
8 | 9 | 김영하의세계문학 원정대. 2, 로미오와 줄리엣 / 오만과 편견 | 김영하 기획 및 해설 박성일 그림 김난영 스토리 | 주니어김영사 | 2023 | 정관어린이도서관 아동자료실
9 | 10 | 꼬물꼬물 탐험대 무시무시 캠핑 | 마이크 라워리 글·그림 김영선 옮김 | 다산어린이 | 2023 | 정관어린이도서관 아동자료실

Index | 순번 | 서명 | 저작자 | 발행자 | 발행년 | 자료실명
141 | 142 | 칙칙팥팥 | 콩양신쨔오 글 구미 그림 남은숙 옮김 | 키위북스 | 2023 | 정관어린이도서관 유아자료실
142 | 143 | 코끼리는 왜 그랬을까 | 이셀 글 그림 | 글로연 | 2023 | 정관어린이도서관 유아자료실
143 | 144 | 콧물 나라 | 한지원 지음 | 한림 | 2023 | 정관어린이도서관 유아자료실
144 | 145 | 크리스마스트리 장식은 나한테 맡겨 줄래 | 로스 콜린스 글·그림 신인수 옮김 | 사파리 | 2023 | 정관어린이도서관 유아자료실
145 | 146 | 크림별 선인장 효뚠 그림책 | 효뚠 지음 | 달리 | 2023 | 정관어린이도서관 유아자료실
146 | 147 | 파란 대문을 열면 | 허은미 글 한지선 그림 | 문학동네 | 2023 | 정관어린이도서관 유아자료실
147 | 148 | 페브 농장 | 이민주 글 안승하 그림 | 창비 | 2023 | 정관어린이도서관 유아자료실
148 | 149 | 펭귄의 모험 | 김태린 지음 | 뜨인돌어린이 | 2023 | 정관어린이도서관 유아자료실
149 | 150 | 행복한 그곳 | 브리타 테큰트럽 지음 김하늬 옮김 | 봄봄 | 2023 | 정관어린이도서관 유아자료실
150 | 151 | 엄마 아빠도 1학년 초등 멘토 이은경쌤의 다정한 초등 입학 안내서 | 이은경 지음 | 상상아카데미 | 2023 | 정관어린이도서관 아동자료실