Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory42.3 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description경기도_고양시_메인베스트컨텐츠(컨텐츠한글명, 글저자명,출판사명,대출권수,예약권수) - 고양시 시립도서관 19개관 및 작은도서관 메인베스트컨텐츠 정보 작성
URLhttps://www.data.go.kr/data/15086056/fileData.do

Alerts

바코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:51:03.020490
Analysis finished2023-12-12 23:51:03.896625
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

바코드
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T08:51:04.044149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length13
Mean length12.79
Min length7

Characters and Unicode

Total characters1279
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row4801161571189
2nd row4801161903539
3rd row4808954686921
4th row4808954699808
5th row4808954687041
ValueCountFrequency (%)
4801161571189 1
 
1.0%
4808901110806 1
 
1.0%
4808954623360 1
 
1.0%
77223021 1
 
1.0%
4808982816635 1
 
1.0%
4801192625271 1
 
1.0%
14973058 1
 
1.0%
4801188862376 1
 
1.0%
4808954646123 1
 
1.0%
4808954638890 1
 
1.0%
Other values (90) 90
90.0%
2023-12-13T08:51:04.412179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 241
18.8%
4 202
15.8%
0 165
12.9%
1 133
10.4%
9 125
9.8%
5 116
9.1%
6 106
8.3%
7 68
 
5.3%
2 63
 
4.9%
3 58
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1277
99.8%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 241
18.9%
4 202
15.8%
0 165
12.9%
1 133
10.4%
9 125
9.8%
5 116
9.1%
6 106
8.3%
7 68
 
5.3%
2 63
 
4.9%
3 58
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
D 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1277
99.8%
Latin 2
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
8 241
18.9%
4 202
15.8%
0 165
12.9%
1 133
10.4%
9 125
9.8%
5 116
9.1%
6 106
8.3%
7 68
 
5.3%
2 63
 
4.9%
3 58
 
4.5%
Latin
ValueCountFrequency (%)
D 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1279
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 241
18.8%
4 202
15.8%
0 165
12.9%
1 133
10.4%
9 125
9.8%
5 116
9.1%
6 106
8.3%
7 68
 
5.3%
2 63
 
4.9%
3 58
 
4.5%
Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T08:51:04.742022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length18
Mean length9.1
Min length2

Characters and Unicode

Total characters910
Distinct characters286
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row불편한 편의점
2nd row연애의 행방
3rd row저만치 혼자서
4th row믿음에 대하여
5th row새의 선물
ValueCountFrequency (%)
1q84 3
 
1.1%
1 3
 
1.1%
2 3
 
1.1%
싶어 2
 
0.8%
파친코 2
 
0.8%
우리가 2
 
0.8%
인생의 2
 
0.8%
않는다 2
 
0.8%
지구에서 2
 
0.8%
사람 2
 
0.8%
Other values (234) 240
91.3%
2023-12-13T08:51:05.207430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164
 
18.0%
30
 
3.3%
16
 
1.8%
15
 
1.6%
1 15
 
1.6%
14
 
1.5%
13
 
1.4%
12
 
1.3%
11
 
1.2%
11
 
1.2%
Other values (276) 609
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 666
73.2%
Space Separator 164
 
18.0%
Decimal Number 37
 
4.1%
Other Punctuation 12
 
1.3%
Uppercase Letter 12
 
1.3%
Lowercase Letter 12
 
1.3%
Close Punctuation 3
 
0.3%
Open Punctuation 3
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
4.5%
16
 
2.4%
15
 
2.3%
14
 
2.1%
13
 
2.0%
12
 
1.8%
11
 
1.7%
11
 
1.7%
10
 
1.5%
10
 
1.5%
Other values (245) 524
78.7%
Lowercase Letter
ValueCountFrequency (%)
e 2
16.7%
s 2
16.7%
i 2
16.7%
d 1
8.3%
n 1
8.3%
h 1
8.3%
r 1
8.3%
o 1
8.3%
y 1
8.3%
Uppercase Letter
ValueCountFrequency (%)
Q 3
25.0%
B 2
16.7%
I 2
16.7%
T 1
 
8.3%
F 1
 
8.3%
C 1
 
8.3%
A 1
 
8.3%
H 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 15
40.5%
2 10
27.0%
4 4
 
10.8%
3 3
 
8.1%
8 3
 
8.1%
0 2
 
5.4%
Other Punctuation
ValueCountFrequency (%)
. 6
50.0%
: 4
33.3%
, 1
 
8.3%
? 1
 
8.3%
Space Separator
ValueCountFrequency (%)
164
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 666
73.2%
Common 220
 
24.2%
Latin 24
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
4.5%
16
 
2.4%
15
 
2.3%
14
 
2.1%
13
 
2.0%
12
 
1.8%
11
 
1.7%
11
 
1.7%
10
 
1.5%
10
 
1.5%
Other values (245) 524
78.7%
Latin
ValueCountFrequency (%)
Q 3
12.5%
B 2
 
8.3%
e 2
 
8.3%
s 2
 
8.3%
i 2
 
8.3%
I 2
 
8.3%
T 1
 
4.2%
d 1
 
4.2%
n 1
 
4.2%
h 1
 
4.2%
Other values (7) 7
29.2%
Common
ValueCountFrequency (%)
164
74.5%
1 15
 
6.8%
2 10
 
4.5%
. 6
 
2.7%
: 4
 
1.8%
4 4
 
1.8%
3 3
 
1.4%
) 3
 
1.4%
( 3
 
1.4%
8 3
 
1.4%
Other values (4) 5
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 666
73.2%
ASCII 244
 
26.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164
67.2%
1 15
 
6.1%
2 10
 
4.1%
. 6
 
2.5%
: 4
 
1.6%
4 4
 
1.6%
3 3
 
1.2%
) 3
 
1.2%
( 3
 
1.2%
8 3
 
1.2%
Other values (21) 29
 
11.9%
Hangul
ValueCountFrequency (%)
30
 
4.5%
16
 
2.4%
15
 
2.3%
14
 
2.1%
13
 
2.0%
12
 
1.8%
11
 
1.7%
11
 
1.7%
10
 
1.5%
10
 
1.5%
Other values (245) 524
78.7%
Distinct79
Distinct (%)79.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T08:51:05.528509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length3
Mean length4.37
Min length2

Characters and Unicode

Total characters437
Distinct characters163
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)68.0%

Sample

1st row김호연
2nd row히가시노 게이고
3rd row김훈
4th row박상영
5th row은희경
ValueCountFrequency (%)
김영하 6
 
4.4%
하루키 5
 
3.6%
무라카미 5
 
3.6%
김훈 4
 
2.9%
정세랑 3
 
2.2%
3
 
2.2%
최은영 3
 
2.2%
한강 2
 
1.5%
파울로 2
 
1.5%
코엘료 2
 
1.5%
Other values (98) 102
74.5%
2023-12-13T08:51:05.941381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37
 
8.5%
18
 
4.1%
17
 
3.9%
15
 
3.4%
12
 
2.7%
9
 
2.1%
8
 
1.8%
7
 
1.6%
7
 
1.6%
7
 
1.6%
Other values (153) 300
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 383
87.6%
Space Separator 37
 
8.5%
Math Symbol 12
 
2.7%
Other Punctuation 4
 
0.9%
Uppercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
4.7%
17
 
4.4%
15
 
3.9%
12
 
3.1%
9
 
2.3%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
6
 
1.6%
Other values (146) 277
72.3%
Other Punctuation
ValueCountFrequency (%)
/ 2
50.0%
, 1
25.0%
. 1
25.0%
Math Symbol
ValueCountFrequency (%)
> 6
50.0%
< 6
50.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 383
87.6%
Common 53
 
12.1%
Latin 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
4.7%
17
 
4.4%
15
 
3.9%
12
 
3.1%
9
 
2.3%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
6
 
1.6%
Other values (146) 277
72.3%
Common
ValueCountFrequency (%)
37
69.8%
> 6
 
11.3%
< 6
 
11.3%
/ 2
 
3.8%
, 1
 
1.9%
. 1
 
1.9%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 383
87.6%
ASCII 54
 
12.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37
68.5%
> 6
 
11.1%
< 6
 
11.1%
/ 2
 
3.7%
, 1
 
1.9%
. 1
 
1.9%
B 1
 
1.9%
Hangul
ValueCountFrequency (%)
18
 
4.7%
17
 
4.4%
15
 
3.9%
12
 
3.1%
9
 
2.3%
8
 
2.1%
7
 
1.8%
7
 
1.8%
7
 
1.8%
6
 
1.6%
Other values (146) 277
72.3%

출판사명
Categorical

Distinct35
Distinct (%)35.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
문학동네
52 
인플루엔셜
 
4
난다
 
4
복복서가
 
3
한빛비즈
 
3
Other values (30)
34 

Length

Max length7
Median length4
Mean length4.09
Min length1

Unique

Unique26 ?
Unique (%)26.0%

Sample

1st row나무옆의자
2nd row소미미디어
3rd row문학동네
4th row문학동네
5th row문학동네

Common Values

ValueCountFrequency (%)
문학동네 52
52.0%
인플루엔셜 4
 
4.0%
난다 4
 
4.0%
복복서가 3
 
3.0%
한빛비즈 3
 
3.0%
이봄 2
 
2.0%
엘릭시르 2
 
2.0%
마로니에북스 2
 
2.0%
웅진지식하우스 2
 
2.0%
메이븐 1
 
1.0%
Other values (25) 25
25.0%

Length

2023-12-13T08:51:06.076843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
문학동네 52
52.0%
난다 4
 
4.0%
인플루엔셜 4
 
4.0%
복복서가 3
 
3.0%
한빛비즈 3
 
3.0%
이봄 2
 
2.0%
엘릭시르 2
 
2.0%
마로니에북스 2
 
2.0%
웅진지식하우스 2
 
2.0%
나무옆의자 1
 
1.0%
Other values (25) 25
25.0%

대출권수
Real number (ℝ)

Distinct24
Distinct (%)24.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.31
Minimum16
Maximum64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T08:51:06.198196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile17
Q124.75
median29.5
Q332
95-th percentile36.05
Maximum64
Range48
Interquartile range (IQR)7.25

Descriptive statistics

Standard deviation6.8544684
Coefficient of variation (CV)0.24212181
Kurtosis6.1642391
Mean28.31
Median Absolute Deviation (MAD)3.5
Skewness0.98390401
Sum2831
Variance46.983737
MonotonicityDecreasing
2023-12-13T08:51:06.313459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
32 10
 
10.0%
31 9
 
9.0%
33 9
 
9.0%
29 8
 
8.0%
30 8
 
8.0%
26 7
 
7.0%
17 6
 
6.0%
19 5
 
5.0%
27 5
 
5.0%
34 4
 
4.0%
Other values (14) 29
29.0%
ValueCountFrequency (%)
16 2
 
2.0%
17 6
6.0%
18 2
 
2.0%
19 5
5.0%
20 2
 
2.0%
21 4
4.0%
22 1
 
1.0%
23 1
 
1.0%
24 2
 
2.0%
25 1
 
1.0%
ValueCountFrequency (%)
64 1
 
1.0%
38 2
 
2.0%
37 2
 
2.0%
36 2
 
2.0%
35 3
 
3.0%
34 4
 
4.0%
33 9
9.0%
32 10
10.0%
31 9
9.0%
30 8
8.0%

Interactions

2023-12-13T08:51:03.678436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:51:06.398472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
바코드컨텐츠한글명글저자명출판사명대출권수
바코드1.0001.0001.0001.0001.000
컨텐츠한글명1.0001.0000.9971.0000.241
글저자명1.0000.9971.0000.9990.760
출판사명1.0001.0000.9991.0000.852
대출권수1.0000.2410.7600.8521.000
2023-12-13T08:51:06.487945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대출권수출판사명
대출권수1.0000.478
출판사명0.4781.000

Missing values

2023-12-13T08:51:03.770680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:51:03.862212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

바코드컨텐츠한글명글저자명출판사명대출권수
04801161571189불편한 편의점김호연나무옆의자64
14801161903539연애의 행방히가시노 게이고소미미디어38
24808954686921저만치 혼자서김훈문학동네38
34808954699808믿음에 대하여박상영문학동네37
44808954687041새의 선물은희경문학동네37
54801158161550그리고 행복하다는 소식을 들었습니다이병률36
64801188862291지구에서 한아뿐정세랑난다36
74808954671989귤의 맛조남주문학동네35
84801165343720달러구트 꿈 백화점. 2이미예팩토리나인35
94801189318414한국형 가치투자최준철이콘35
바코드컨텐츠한글명글저자명출판사명대출권수
904808954653480단 하나의 문장구병모문학동네18
914801188754763월가아재의 제2라운드 투자 수업최한철에프엔미디어18
924808954637756개인주의자 선언문유석문학동네17
934801192988031러시아 지정학 아틀라스델핀 파팽서해문집17
944801192229189비욘드 더 크라이시스 Beyond The Crisis안근모어바웃어북17
954808954688109쇳밥일지천현우문학동네17
966770372시간을 파는 상점<김선영> 저자음과모음17
974808960532410토지. 1(1부 1권)박경리마로니에북스17
9848089546086571Q84. 2무라카미 하루키문학동네16
99480D22113951021세기 리빙붓다와의 대화사무엘 소샘소북스16