Overview

Dataset statistics

Number of variables6
Number of observations137
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory51.0 B

Variable types

Numeric2
Text1
Categorical3

Dataset

Description부산광역시 기장군_고촌어울림도서관 신착자료 현황(서명, 저작자, 발행자, 발행년, 자료실명)에 대한 데이터입니다
Author부산광역시 기장군
URLhttps://www.data.go.kr/data/15063959/fileData.do

Alerts

발행자 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
자료실명 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
저작자 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
순번 is highly overall correlated with 발행년 and 3 other fieldsHigh correlation
발행년 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
자료실명 is highly imbalanced (77.4%)Imbalance
순번 has unique valuesUnique
서명 has unique valuesUnique

Reproduction

Analysis started2024-03-16 06:45:18.088226
Analysis finished2024-03-16 06:45:21.213143
Duration3.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct137
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69
Minimum1
Maximum137
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-03-16T06:45:21.476733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.8
Q135
median69
Q3103
95-th percentile130.2
Maximum137
Range136
Interquartile range (IQR)68

Descriptive statistics

Standard deviation39.692569
Coefficient of variation (CV)0.57525462
Kurtosis-1.2
Mean69
Median Absolute Deviation (MAD)34
Skewness0
Sum9453
Variance1575.5
MonotonicityStrictly increasing
2024-03-16T06:45:22.270832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
89 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
96 1
 
0.7%
104 1
 
0.7%
Other values (127) 127
92.7%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
137 1
0.7%
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%
128 1
0.7%

서명
Text

UNIQUE 

Distinct137
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-03-16T06:45:23.281633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length46
Mean length38.941606
Min length13

Characters and Unicode

Total characters5335
Distinct characters410
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique137 ?
Unique (%)100.0%

Sample

1st row국어 1등급 고득점의 비밀 : 현직 국어 교사가 알려 주는 상위 1% 초중고 국어 공부 로드맵
2nd row꿰맨 눈의 마을 : 조예은 소설
3rd row어린이의 여행법 : 불편하고 아름다운 것들을 사랑하는 마음에 관하여 : 이지나 에세이
4th row외로움의 습격 : 모두, 홀로 남겨질 것이다
5th row챗GPT 입문 가이드 : 당신의 일상을 바꾸는 인공지능
ValueCountFrequency (%)
ar도서 130
 
11.7%
한자 60
 
5.4%
손오공의 60
 
5.4%
대탐험 60
 
5.4%
마법천자문 60
 
5.4%
ar 49
 
4.4%
toto 41
 
3.7%
the 20
 
1.8%
14
 
1.3%
마음 7
 
0.6%
Other values (496) 611
54.9%
2024-03-16T06:45:25.346145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
975
 
18.3%
A 180
 
3.4%
R 179
 
3.4%
137
 
2.6%
133
 
2.5%
[ 130
 
2.4%
] 130
 
2.4%
126
 
2.4%
) 120
 
2.2%
( 120
 
2.2%
Other values (400) 3105
58.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2018
37.8%
Space Separator 975
18.3%
Lowercase Letter 742
 
13.9%
Uppercase Letter 582
 
10.9%
Other Punctuation 317
 
5.9%
Open Punctuation 250
 
4.7%
Close Punctuation 250
 
4.7%
Decimal Number 200
 
3.7%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
6.8%
133
 
6.6%
126
 
6.2%
75
 
3.7%
74
 
3.7%
68
 
3.4%
65
 
3.2%
63
 
3.1%
63
 
3.1%
62
 
3.1%
Other values (336) 1152
57.1%
Lowercase Letter
ValueCountFrequency (%)
e 95
12.8%
o 64
 
8.6%
a 59
 
8.0%
t 56
 
7.5%
i 53
 
7.1%
n 53
 
7.1%
r 43
 
5.8%
l 40
 
5.4%
h 37
 
5.0%
s 32
 
4.3%
Other values (13) 210
28.3%
Uppercase Letter
ValueCountFrequency (%)
A 180
30.9%
R 179
30.8%
T 96
16.5%
O 81
13.9%
S 9
 
1.5%
L 6
 
1.0%
H 5
 
0.9%
D 5
 
0.9%
G 3
 
0.5%
P 3
 
0.5%
Other values (8) 15
 
2.6%
Decimal Number
ValueCountFrequency (%)
3 37
18.5%
1 36
18.0%
2 33
16.5%
4 23
11.5%
5 20
10.0%
6 11
 
5.5%
7 10
 
5.0%
0 10
 
5.0%
9 10
 
5.0%
8 10
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 114
36.0%
. 113
35.6%
! 70
22.1%
: 14
 
4.4%
' 4
 
1.3%
& 1
 
0.3%
% 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
[ 130
52.0%
( 120
48.0%
Close Punctuation
ValueCountFrequency (%)
] 130
52.0%
) 120
48.0%
Space Separator
ValueCountFrequency (%)
975
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1993
37.4%
Hangul 1951
36.6%
Latin 1324
24.8%
Han 67
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
7.0%
133
 
6.8%
126
 
6.5%
75
 
3.8%
74
 
3.8%
68
 
3.5%
65
 
3.3%
63
 
3.2%
63
 
3.2%
62
 
3.2%
Other values (273) 1085
55.6%
Han
ValueCountFrequency (%)
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Other values (53) 53
79.1%
Latin
ValueCountFrequency (%)
A 180
13.6%
R 179
13.5%
T 96
 
7.3%
e 95
 
7.2%
O 81
 
6.1%
o 64
 
4.8%
a 59
 
4.5%
t 56
 
4.2%
i 53
 
4.0%
n 53
 
4.0%
Other values (31) 408
30.8%
Common
ValueCountFrequency (%)
975
48.9%
[ 130
 
6.5%
] 130
 
6.5%
) 120
 
6.0%
( 120
 
6.0%
, 114
 
5.7%
. 113
 
5.7%
! 70
 
3.5%
3 37
 
1.9%
1 36
 
1.8%
Other values (13) 148
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3317
62.2%
Hangul 1951
36.6%
CJK 67
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
975
29.4%
A 180
 
5.4%
R 179
 
5.4%
[ 130
 
3.9%
] 130
 
3.9%
) 120
 
3.6%
( 120
 
3.6%
, 114
 
3.4%
. 113
 
3.4%
T 96
 
2.9%
Other values (54) 1160
35.0%
Hangul
ValueCountFrequency (%)
137
 
7.0%
133
 
6.8%
126
 
6.5%
75
 
3.8%
74
 
3.8%
68
 
3.5%
65
 
3.3%
63
 
3.2%
63
 
3.2%
62
 
3.2%
Other values (273) 1085
55.6%
CJK
ValueCountFrequency (%)
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Other values (53) 53
79.1%

저작자
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)19.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
by Victoria Productions
40 
올댓스토리 글 ; 홍거북 그림
20 
시리얼 지음
20 
김현수 글 ; 홍거북 그림
유대영 글 ; 정수영 그림
Other values (22)
44 

Length

Max length30
Median length23
Mean length16.642336
Min length6

Unique

Unique14 ?
Unique (%)10.2%

Sample

1st row김지영 지음
2nd row조예은 지음
3rd row이지나 지음
4th row김만권 지음
5th row안상진 지음

Common Values

ValueCountFrequency (%)
by Victoria Productions 40
29.2%
올댓스토리 글 ; 홍거북 그림 20
14.6%
시리얼 지음 20
14.6%
김현수 글 ; 홍거북 그림 8
 
5.8%
유대영 글 ; 정수영 그림 5
 
3.6%
유대영 글 ; 홍거북 그림 5
 
3.6%
심선영 지음 ; 엄유진 ,강보선 그림 5
 
3.6%
이지헌 지음 ; 강보선 그림 5
 
3.6%
by Victoria H. Farago 4
 
2.9%
by Victoria Han Farago 4
 
2.9%
Other values (17) 21
15.3%

Length

2024-03-16T06:45:25.990514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
62
11.6%
그림 61
11.4%
by 48
9.0%
victoria 48
9.0%
지음 44
 
8.3%
42
 
7.9%
productions 40
 
7.5%
홍거북 33
 
6.2%
올댓스토리 20
 
3.8%
시리얼 20
 
3.8%
Other values (35) 115
21.6%

발행자
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)12.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
아울북
59 
Victoria Productions
40 
아이해빗북
15 
주렁주렁스튜디오
 
4
Victoria Productions Inc
 
4
Other values (12)
15 

Length

Max length29
Median length24
Mean length9.7445255
Min length2

Unique

Unique10 ?
Unique (%)7.3%

Sample

1st row카시오페아
2nd row자음과모음
3rd row라이프앤페이지
4th row혜다
5th row미문사

Common Values

ValueCountFrequency (%)
아울북 59
43.1%
Victoria Productions 40
29.2%
아이해빗북 15
 
10.9%
주렁주렁스튜디오 4
 
2.9%
Victoria Productions Inc 4
 
2.9%
Victoria Productions INC 3
 
2.2%
바이블캐슬 2
 
1.5%
실천문학사 1
 
0.7%
라이프앤페이지 1
 
0.7%
혜다 1
 
0.7%
Other values (7) 7
 
5.1%

Length

2024-03-16T06:45:26.515726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
아울북 59
30.7%
productions 47
24.5%
victoria 47
24.5%
아이해빗북 15
 
7.8%
inc 7
 
3.6%
주렁주렁스튜디오 4
 
2.1%
바이블캐슬 2
 
1.0%
블루래빗 1
 
0.5%
프로덕션 1
 
0.5%
빅토리아 1
 
0.5%
Other values (8) 8
 
4.2%

발행년
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2020.1387
Minimum2013
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-03-16T06:45:26.846259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2013
5-th percentile2015.8
Q12018
median2021
Q32022
95-th percentile2023
Maximum2023
Range10
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.4739302
Coefficient of variation (CV)0.0012246338
Kurtosis-0.69578561
Mean2020.1387
Median Absolute Deviation (MAD)2
Skewness-0.55179614
Sum276759
Variance6.1203306
MonotonicityNot monotonic
2024-03-16T06:45:27.304828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2022 42
30.7%
2018 41
29.9%
2023 24
17.5%
2020 7
 
5.1%
2021 6
 
4.4%
2015 5
 
3.6%
2019 5
 
3.6%
2016 3
 
2.2%
2017 2
 
1.5%
2013 1
 
0.7%
ValueCountFrequency (%)
2013 1
 
0.7%
2014 1
 
0.7%
2015 5
 
3.6%
2016 3
 
2.2%
2017 2
 
1.5%
2018 41
29.9%
2019 5
 
3.6%
2020 7
 
5.1%
2021 6
 
4.4%
2022 42
30.7%
ValueCountFrequency (%)
2023 24
17.5%
2022 42
30.7%
2021 6
 
4.4%
2020 7
 
5.1%
2019 5
 
3.6%
2018 41
29.9%
2017 2
 
1.5%
2016 3
 
2.2%
2015 5
 
3.6%
2014 1
 
0.7%

자료실명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
고촌어린이자료실
132 
고촌종합자료실
 
5

Length

Max length8
Median length8
Mean length7.9635036
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고촌종합자료실
2nd row고촌종합자료실
3rd row고촌종합자료실
4th row고촌종합자료실
5th row고촌종합자료실

Common Values

ValueCountFrequency (%)
고촌어린이자료실 132
96.4%
고촌종합자료실 5
 
3.6%

Length

2024-03-16T06:45:27.778575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T06:45:28.161720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고촌어린이자료실 132
96.4%
고촌종합자료실 5
 
3.6%

Interactions

2024-03-16T06:45:20.192269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T06:45:19.366682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T06:45:20.408171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T06:45:19.802922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T06:45:28.455076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번저작자발행자발행년자료실명
순번1.0000.9340.8460.8880.686
저작자0.9341.0000.9930.9471.000
발행자0.8460.9931.0000.9491.000
발행년0.8880.9470.9491.0000.450
자료실명0.6861.0001.0000.4501.000
2024-03-16T06:45:28.834442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발행자자료실명저작자
발행자1.0000.9430.882
자료실명0.9431.0000.903
저작자0.8820.9031.000
2024-03-16T06:45:29.144509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번발행년저작자발행자자료실명
순번1.000-0.6790.6580.5230.518
발행년-0.6791.0000.7190.7580.336
저작자0.6580.7191.0000.8820.903
발행자0.5230.7580.8821.0000.943
자료실명0.5180.3360.9030.9431.000

Missing values

2024-03-16T06:45:20.704734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T06:45:21.059229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번서명저작자발행자발행년자료실명
01국어 1등급 고득점의 비밀 : 현직 국어 교사가 알려 주는 상위 1% 초중고 국어 공부 로드맵김지영 지음카시오페아2023고촌종합자료실
12꿰맨 눈의 마을 : 조예은 소설조예은 지음자음과모음2023고촌종합자료실
23어린이의 여행법 : 불편하고 아름다운 것들을 사랑하는 마음에 관하여 : 이지나 에세이이지나 지음라이프앤페이지2023고촌종합자료실
34외로움의 습격 : 모두, 홀로 남겨질 것이다김만권 지음혜다2023고촌종합자료실
45챗GPT 입문 가이드 : 당신의 일상을 바꾸는 인공지능안상진 지음미문사2023고촌종합자료실
56요나 이야기 [AR도서] : 증강현실 그림책김난예 ,이희만 글 ; 장소정 그림바이블캐슬2021고촌어린이자료실
67반구대 AR [AR도서] : 세계 최고 바위그림구광렬 지음실천문학사2017고촌어린이자료실
78아름답고 신기한 새 [AR도서] : 보고, 듣고, 움직이고, 배우자!김재환 지음블루래빗2015고촌어린이자료실
89움직이는 태양계 [AR도서]미국 자연사 박물관 지음 ; 김아림 옮김아이위즈2013고촌어린이자료실
910노아의 방주 [AR도서] : AR 증강현실 동화책peter Lee 글 ; 장소정 그림바이블캐슬2020고촌어린이자료실
순번서명저작자발행자발행년자료실명
127128AR TOTO [AR도서]. 31, Spring is hereby Victoria ProductionsVictoria Productions2018고촌어린이자료실
128129AR TOTO [AR도서]. 32, Cubby the lion cubby Victoria ProductionsVictoria Productions2018고촌어린이자료실
129130AR TOTO [AR도서]. 33, The handy cart beetleby Victoria ProductionsVictoria Productions2018고촌어린이자료실
130131AR TOTO [AR도서]. 34, The adventure of a curious clownfishby Victoria ProductionsVictoria Productions2018고촌어린이자료실
131132AR TOTO [AR도서]. 35, The town busby Victoria ProductionsVictoria Productions2018고촌어린이자료실
132133AR TOTO [AR도서]. 36, Cooking with blongo!by Victoria ProductionsVictoria Productions2018고촌어린이자료실
133134AR TOTO [AR도서]. 37, Monster streetby Victoria ProductionsVictoria Productions2018고촌어린이자료실
134135AR TOTO [AR도서]. 38, The mail mouseby Victoria ProductionsVictoria Productions2018고촌어린이자료실
135136AR TOTO [AR도서]. 39, Helping santaby Victoria ProductionsVictoria Productions2018고촌어린이자료실
136137AR TOTO [AR도서]. 40, Julia goes to schoolby Victoria ProductionsVictoria Productions2018고촌어린이자료실