Overview

Dataset statistics

Number of variables6
Number of observations118
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory51.1 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description부산광역시기장군_기장도서관신착자료현황_20201201
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15060475

Alerts

발행년 is highly imbalanced (54.0%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:44:31.005508
Analysis finished2023-12-10 16:44:31.788106
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct118
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59.5
Minimum1
Maximum118
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-11T01:44:31.855043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.85
Q130.25
median59.5
Q388.75
95-th percentile112.15
Maximum118
Range117
Interquartile range (IQR)58.5

Descriptive statistics

Standard deviation34.207699
Coefficient of variation (CV)0.57491931
Kurtosis-1.2
Mean59.5
Median Absolute Deviation (MAD)29.5
Skewness0
Sum7021
Variance1170.1667
MonotonicityStrictly increasing
2023-12-11T01:44:31.982824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
76 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
86 1
 
0.8%
85 1
 
0.8%
84 1
 
0.8%
83 1
 
0.8%
82 1
 
0.8%
81 1
 
0.8%
Other values (108) 108
91.5%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
118 1
0.8%
117 1
0.8%
116 1
0.8%
115 1
0.8%
114 1
0.8%
113 1
0.8%
112 1
0.8%
111 1
0.8%
110 1
0.8%
109 1
0.8%

서명
Text

Distinct117
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T01:44:32.210062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length124
Median length45
Mean length32.813559
Min length5

Characters and Unicode

Total characters3872
Distinct characters505
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique116 ?
Unique (%)98.3%

Sample

1st row(회사에서 맨날 쓰는) 비즈니스 영어패턴 500 플러스
2nd row트렌드 코리아 2020=Trend Korea 2020 : 서울대 소비트렌드 분석센터의 2020 전망
3rd row나는 왜 호오포노포노가 안 되는 걸까?
4th row폴리매스 : 한계를 거부하는 다재다능함의 힘
5th row지출증빙, 부가가치세, 종합소득세, 원천징수, 급여세금 실무설명서
ValueCountFrequency (%)
83
 
8.3%
9
 
0.9%
위한 8
 
0.8%
나는 6
 
0.6%
나를 5
 
0.5%
영어 5
 
0.5%
2 5
 
0.5%
철학 4
 
0.4%
어떻게 4
 
0.4%
etf 4
 
0.4%
Other values (766) 873
86.8%
2023-12-11T01:44:32.630338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
888
 
22.9%
: 85
 
2.2%
78
 
2.0%
72
 
1.9%
66
 
1.7%
44
 
1.1%
39
 
1.0%
38
 
1.0%
36
 
0.9%
34
 
0.9%
Other values (495) 2492
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2432
62.8%
Space Separator 888
 
22.9%
Lowercase Letter 219
 
5.7%
Other Punctuation 158
 
4.1%
Decimal Number 90
 
2.3%
Uppercase Letter 35
 
0.9%
Close Punctuation 21
 
0.5%
Open Punctuation 21
 
0.5%
Math Symbol 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
3.2%
72
 
3.0%
66
 
2.7%
44
 
1.8%
39
 
1.6%
38
 
1.6%
36
 
1.5%
34
 
1.4%
33
 
1.4%
30
 
1.2%
Other values (438) 1962
80.7%
Lowercase Letter
ValueCountFrequency (%)
e 24
11.0%
t 22
10.0%
o 21
9.6%
i 20
 
9.1%
n 18
 
8.2%
r 15
 
6.8%
a 15
 
6.8%
l 12
 
5.5%
s 11
 
5.0%
c 9
 
4.1%
Other values (13) 52
23.7%
Uppercase Letter
ValueCountFrequency (%)
T 8
22.9%
E 6
17.1%
N 4
11.4%
F 4
11.4%
S 3
 
8.6%
K 2
 
5.7%
O 2
 
5.7%
C 2
 
5.7%
G 1
 
2.9%
R 1
 
2.9%
Other values (2) 2
 
5.7%
Decimal Number
ValueCountFrequency (%)
0 25
27.8%
1 18
20.0%
2 16
17.8%
5 8
 
8.9%
3 7
 
7.8%
4 7
 
7.8%
8 4
 
4.4%
7 3
 
3.3%
9 1
 
1.1%
6 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
: 85
53.8%
, 25
 
15.8%
. 19
 
12.0%
! 10
 
6.3%
' 8
 
5.1%
? 8
 
5.1%
" 2
 
1.3%
· 1
 
0.6%
Space Separator
ValueCountFrequency (%)
888
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Math Symbol
ValueCountFrequency (%)
= 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2430
62.8%
Common 1186
30.6%
Latin 254
 
6.6%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
78
 
3.2%
72
 
3.0%
66
 
2.7%
44
 
1.8%
39
 
1.6%
38
 
1.6%
36
 
1.5%
34
 
1.4%
33
 
1.4%
30
 
1.2%
Other values (436) 1960
80.7%
Latin
ValueCountFrequency (%)
e 24
 
9.4%
t 22
 
8.7%
o 21
 
8.3%
i 20
 
7.9%
n 18
 
7.1%
r 15
 
5.9%
a 15
 
5.9%
l 12
 
4.7%
s 11
 
4.3%
c 9
 
3.5%
Other values (25) 87
34.3%
Common
ValueCountFrequency (%)
888
74.9%
: 85
 
7.2%
0 25
 
2.1%
, 25
 
2.1%
) 21
 
1.8%
( 21
 
1.8%
. 19
 
1.6%
1 18
 
1.5%
2 16
 
1.3%
! 10
 
0.8%
Other values (12) 58
 
4.9%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2430
62.8%
ASCII 1439
37.2%
CJK 2
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
888
61.7%
: 85
 
5.9%
0 25
 
1.7%
, 25
 
1.7%
e 24
 
1.7%
t 22
 
1.5%
o 21
 
1.5%
) 21
 
1.5%
( 21
 
1.5%
i 20
 
1.4%
Other values (46) 287
 
19.9%
Hangul
ValueCountFrequency (%)
78
 
3.2%
72
 
3.0%
66
 
2.7%
44
 
1.8%
39
 
1.6%
38
 
1.6%
36
 
1.5%
34
 
1.4%
33
 
1.4%
30
 
1.2%
Other values (436) 1960
80.7%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct110
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T01:44:32.908243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length32
Mean length11.940678
Min length5

Characters and Unicode

Total characters1409
Distinct characters235
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)87.3%

Sample

1st row케빈 경 지음
2nd row김난도 [외] 지음
3rd row이영현 지음
4th row와카스 아메드 지음 ; 이주만 옮김
5th row손원준 지음
ValueCountFrequency (%)
지음 85
 
19.4%
53
 
12.1%
그림 26
 
5.9%
옮김 20
 
4.6%
18
 
4.1%
공]지음 5
 
1.1%
글·그림 4
 
0.9%
이지성 3
 
0.7%
3
 
0.7%
김남중 2
 
0.5%
Other values (203) 219
50.0%
2023-12-11T01:44:33.418882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
320
22.7%
101
 
7.2%
90
 
6.4%
56
 
4.0%
; 53
 
3.8%
32
 
2.3%
31
 
2.2%
30
 
2.1%
23
 
1.6%
23
 
1.6%
Other values (225) 650
46.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 995
70.6%
Space Separator 320
 
22.7%
Other Punctuation 69
 
4.9%
Open Punctuation 12
 
0.9%
Close Punctuation 12
 
0.9%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
10.2%
90
 
9.0%
56
 
5.6%
32
 
3.2%
31
 
3.1%
30
 
3.0%
23
 
2.3%
23
 
2.3%
22
 
2.2%
15
 
1.5%
Other values (215) 572
57.5%
Other Punctuation
ValueCountFrequency (%)
; 53
76.8%
, 11
 
15.9%
· 4
 
5.8%
. 1
 
1.4%
Open Punctuation
ValueCountFrequency (%)
[ 10
83.3%
( 2
 
16.7%
Close Punctuation
ValueCountFrequency (%)
] 10
83.3%
) 2
 
16.7%
Space Separator
ValueCountFrequency (%)
320
100.0%
Uppercase Letter
ValueCountFrequency (%)
T 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 995
70.6%
Common 413
29.3%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
10.2%
90
 
9.0%
56
 
5.6%
32
 
3.2%
31
 
3.1%
30
 
3.0%
23
 
2.3%
23
 
2.3%
22
 
2.2%
15
 
1.5%
Other values (215) 572
57.5%
Common
ValueCountFrequency (%)
320
77.5%
; 53
 
12.8%
, 11
 
2.7%
[ 10
 
2.4%
] 10
 
2.4%
· 4
 
1.0%
( 2
 
0.5%
) 2
 
0.5%
. 1
 
0.2%
Latin
ValueCountFrequency (%)
T 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 995
70.6%
ASCII 410
29.1%
None 4
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
320
78.0%
; 53
 
12.9%
, 11
 
2.7%
[ 10
 
2.4%
] 10
 
2.4%
( 2
 
0.5%
) 2
 
0.5%
. 1
 
0.2%
T 1
 
0.2%
Hangul
ValueCountFrequency (%)
101
 
10.2%
90
 
9.0%
56
 
5.6%
32
 
3.2%
31
 
3.1%
30
 
3.0%
23
 
2.3%
23
 
2.3%
22
 
2.2%
15
 
1.5%
Other values (215) 572
57.5%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct98
Distinct (%)83.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T01:44:33.798615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length5.2372881
Min length2

Characters and Unicode

Total characters618
Distinct characters204
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)70.3%

Sample

1st row넥서스
2nd row미래의창
3rd row렛츠북
4th row안드로메디안
5th row지식만들기
ValueCountFrequency (%)
비에이블 3
 
2.4%
유노북스 3
 
2.4%
차이정원 3
 
2.4%
창비 3
 
2.4%
사람in 3
 
2.4%
비룡소 2
 
1.6%
좋은연필 2
 
1.6%
미래엔:아이세움 2
 
1.6%
한국경제신문:한경bp 2
 
1.6%
42미디어콘텐츠 2
 
1.6%
Other values (93) 98
79.7%
2023-12-11T01:44:34.359800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
5.2%
20
 
3.2%
20
 
3.2%
17
 
2.8%
15
 
2.4%
: 15
 
2.4%
14
 
2.3%
14
 
2.3%
14
 
2.3%
10
 
1.6%
Other values (194) 447
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 538
87.1%
Lowercase Letter 40
 
6.5%
Other Punctuation 16
 
2.6%
Uppercase Letter 12
 
1.9%
Decimal Number 7
 
1.1%
Space Separator 5
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.9%
20
 
3.7%
20
 
3.7%
17
 
3.2%
15
 
2.8%
14
 
2.6%
14
 
2.6%
14
 
2.6%
10
 
1.9%
8
 
1.5%
Other values (163) 374
69.5%
Lowercase Letter
ValueCountFrequency (%)
s 6
15.0%
i 5
12.5%
n 5
12.5%
e 3
7.5%
r 3
7.5%
t 3
7.5%
a 3
7.5%
o 3
7.5%
p 1
 
2.5%
g 1
 
2.5%
Other values (7) 7
17.5%
Uppercase Letter
ValueCountFrequency (%)
B 4
33.3%
P 2
16.7%
G 1
 
8.3%
K 1
 
8.3%
M 1
 
8.3%
R 1
 
8.3%
S 1
 
8.3%
D 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 4
57.1%
4 2
28.6%
1 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
: 15
93.8%
& 1
 
6.2%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 538
87.1%
Latin 52
 
8.4%
Common 28
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.9%
20
 
3.7%
20
 
3.7%
17
 
3.2%
15
 
2.8%
14
 
2.6%
14
 
2.6%
14
 
2.6%
10
 
1.9%
8
 
1.5%
Other values (163) 374
69.5%
Latin
ValueCountFrequency (%)
s 6
 
11.5%
i 5
 
9.6%
n 5
 
9.6%
B 4
 
7.7%
e 3
 
5.8%
r 3
 
5.8%
t 3
 
5.8%
a 3
 
5.8%
o 3
 
5.8%
P 2
 
3.8%
Other values (15) 15
28.8%
Common
ValueCountFrequency (%)
: 15
53.6%
5
 
17.9%
2 4
 
14.3%
4 2
 
7.1%
& 1
 
3.6%
1 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 538
87.1%
ASCII 80
 
12.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
5.9%
20
 
3.7%
20
 
3.7%
17
 
3.2%
15
 
2.8%
14
 
2.6%
14
 
2.6%
14
 
2.6%
10
 
1.9%
8
 
1.5%
Other values (163) 374
69.5%
ASCII
ValueCountFrequency (%)
: 15
18.8%
s 6
 
7.5%
i 5
 
6.2%
5
 
6.2%
n 5
 
6.2%
2 4
 
5.0%
B 4
 
5.0%
e 3
 
3.8%
r 3
 
3.8%
t 3
 
3.8%
Other values (21) 27
33.8%

발행년
Categorical

IMBALANCE 

Distinct5
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2020
92 
2019
17 
2017
 
5
2018
 
3
2016
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row2017
2nd row2019
3rd row2019
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 92
78.0%
2019 17
 
14.4%
2017 5
 
4.2%
2018 3
 
2.5%
2016 1
 
0.8%

Length

2023-12-11T01:44:34.524986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:44:34.646304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 92
78.0%
2019 17
 
14.4%
2017 5
 
4.2%
2018 3
 
2.5%
2016 1
 
0.8%

자료실명
Categorical

Distinct3
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
[기장]종합자료실
92 
[기장]아동자료실
19 
[기장]가족독서방
 
7

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row[기장]종합자료실
2nd row[기장]종합자료실
3rd row[기장]종합자료실
4th row[기장]종합자료실
5th row[기장]종합자료실

Common Values

ValueCountFrequency (%)
[기장]종합자료실 92
78.0%
[기장]아동자료실 19
 
16.1%
[기장]가족독서방 7
 
5.9%

Length

2023-12-11T01:44:34.864938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:44:34.976027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기장]종합자료실 92
78.0%
기장]아동자료실 19
 
16.1%
기장]가족독서방 7
 
5.9%

Interactions

2023-12-11T01:44:31.516338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:44:35.043103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번발행자발행년자료실명
순번1.0000.9530.0000.661
발행자0.9531.0000.3990.994
발행년0.0000.3991.0000.000
자료실명0.6610.9940.0001.000
2023-12-11T01:44:35.133079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발행년자료실명
발행년1.0000.000
자료실명0.0001.000
2023-12-11T01:44:35.237739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번발행년자료실명
순번1.0000.0000.494
발행년0.0001.0000.000
자료실명0.4940.0001.000

Missing values

2023-12-11T01:44:31.642853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:44:31.749648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번서명저작자발행자발행년자료실명
01(회사에서 맨날 쓰는) 비즈니스 영어패턴 500 플러스케빈 경 지음넥서스2017[기장]종합자료실
12트렌드 코리아 2020=Trend Korea 2020 : 서울대 소비트렌드 분석센터의 2020 전망김난도 [외] 지음미래의창2019[기장]종합자료실
23나는 왜 호오포노포노가 안 되는 걸까?이영현 지음렛츠북2019[기장]종합자료실
34폴리매스 : 한계를 거부하는 다재다능함의 힘와카스 아메드 지음 ; 이주만 옮김안드로메디안2020[기장]종합자료실
45지출증빙, 부가가치세, 종합소득세, 원천징수, 급여세금 실무설명서손원준 지음지식만들기2020[기장]종합자료실
56돈이 된다! 스마트스토어 : 학원강사 효정씨는 어떻게 1달 만에 월7천만원 매출을 올렸을까?엑스브레인 지음진서원2020[기장]종합자료실
67(누구나 하루 30분 투자로 월 100만 원 더 버는) 블로그 부업 : 제휴 마케팅 동행김상은 지음나비의활주로2020[기장]종합자료실
78절세 상식사전=Common sense dictionary of reducing tax legally유종오 지음길벗2020[기장]종합자료실
89푸념도 습관이다 : 왜 입만 열면 불만과 핑계를 늘어놓을까?우에니시 아키라 지음 ; 송소정 옮김유노북스2019[기장]종합자료실
910유튜브 지금 시작하시나요? : 시한책방 이시한과 함께하는 유튜브 첫걸음이시한 지음미래의창2020[기장]종합자료실
순번서명저작자발행자발행년자료실명
108109(처음 읽는) 그리스 로마 신화. 2, 신들의 사랑과 질투최설희 글 ; 정수영 구성 ; 한현동 그림미래엔:아이세움2020[기장]아동자료실
109110용선생이 간다 : 세계 문화 여행. 4, 미국사회평론 역사연구소 글 ; 강신영 그림 ; 이우일 캐릭터사회평론2020[기장]아동자료실
1101111분 과학=1minute science : 세상에서 가장 놀라운 꿀잼 과학 이야기이재범 지음 ; 최준석 그림위즈덤하우스2020[기장]종합자료실
111112카카오프렌즈 과학일기. 2, 식물서지원 글학산문화사2020[기장]아동자료실
112113(코믹 메이플스토리) 수학도둑. 78, 종합편송도수 글 ; 서정 엔터테인먼트 그림서울문화사2020[기장]아동자료실
113114(손오공의 한자 대탐험) 마법천자문. 49, 마주 보는 그림자! 그림자 영유대영 글 ; 홍거북 그림아울북2020[기장]아동자료실
114115달님과 소년입 스팡 올센 지음 ; 정영은 옮김진선아이2020[기장]가족독서방
115116상자 세상윤여림 글 ; 이명하 그림천개의바람2020[기장]가족독서방
116117(흔한남매) 안 흔한 일기. 3흔한남매 원작 ; 강효미 글 ; 조병주 그림미래엔:아이세움2020[기장]아동자료실
117118설민석의 한국사 대모험. 15, 신라 편.,천년의 보물을 지켜라!.신라 편.,천년의 보물을 지켜라!설민석,스토리박스 [공]글 ; 정현희 그림아이휴먼2020[기장]아동자료실