Overview

Dataset statistics

Number of variables: 5
Number of observations: 226
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.2 KiB
Average record size in memory: 41.6 B

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

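Description: Data on the status of small libraries (작은도서관) in Hwaseong-si, Gyeonggi-do. It includes the serial number (연번), type (구분: public 공립 or private 사립), library name (도서관명), location (소재지), and floor area (면적).
Author: Hwaseong-si, Gyeonggi-do (경기도 화성시)
URL: https://www.data.go.kr/data/15119894/fileData.do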

Alerts

High correlation: 연번 is highly overall correlated with 구분
High correlation: 구분 is highly overall correlated with 연번
Imbalance: 구분 is highly imbalanced (75.9%)
Unique: 연번 has unique values
Unique: 도서관명 has unique values
Unique: 소재지 has unique values
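
The 75.9% imbalance figure for 구분 can be reproduced from the class counts reported further down (사립 217, 공립 9). The sketch below assumes the score is one minus the Shannon entropy of the class shares divided by the maximum entropy for that number of classes. The function name `imbalance_score` is illustrative and is not taken from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for perfectly balanced classes, 1 for a single dominant class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy of the class shares, in bits
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

score = imbalance_score([217, 9])  # counts of 사립 and 공립 from the report
print(f"{score:.1%}")
```

Under this assumption the result rounds to the 75.9% shown in the alert.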

Reproduction

Analysis started: 2023-12-12 21:23:27.344979
Analysis finished: 2023-12-12 21:23:27.859330
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
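
As a quick consistency check, the reported duration follows from the two timestamps above. This is a minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed layout of the timestamps in the report
started = datetime.strptime("2023-12-12 21:23:27.344979", FMT)
finished = datetime.strptime("2023-12-12 21:23:27.859330", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```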

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 226
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 113.5
Minimum: 1
Maximum: 226
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 12.25
Q1: 57.25
Median: 113.5
Q3: 169.75
95-th percentile: 214.75
Maximum: 226
Range: 225
Interquartile range (IQR): 112.5
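
Because 연번 is a serial number running from 1 to 226, the quantiles above can be checked by hand. The sketch below assumes linear interpolation between the closest ranks, which the standard library provides as the `inclusive` method. Whether the profiling tool uses exactly this rule is an assumption, but the numbers agree.

```python
from statistics import quantiles

values = list(range(1, 227))  # 연번 takes the values 1..226 per the report

q1, median, q3 = quantiles(values, n=4, method="inclusive")
twentieths = quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]  # 5th and 95th percentiles
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```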

Descriptive statistics

Standard deviation: 65.384759
Coefficient of variation (CV): 0.57607717
Kurtosis: -1.2
Mean: 113.5
Median Absolute Deviation (MAD): 56.5
Skewness: 0
Sum: 25651
Variance: 4275.1667
Monotonicity: Strictly increasing
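
The descriptive statistics are what one would expect for the integers 1 through 226. The following sketch recomputes them with the standard library, assuming the variance and standard deviation use the sample (n - 1) denominator; the variable names are illustrative.

```python
import statistics

values = list(range(1, 227))  # 연번 takes the values 1..226 per the report

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, round(variance, 4), round(std, 6), round(cv, 8), mad, strictly_increasing)
```

For a run of consecutive integers 1..n the sample variance reduces to n(n + 1)/12, which gives 226 × 227 / 12 ≈ 4275.1667 here.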
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.4%
171 | 1 | 0.4%
145 | 1 | 0.4%
146 | 1 | 0.4%
147 | 1 | 0.4%
148 | 1 | 0.4%
149 | 1 | 0.4%
150 | 1 | 0.4%
151 | 1 | 0.4%
152 | 1 | 0.4%
Other values (216) | 216 | 95.6%

Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Value | Count | Frequency (%)
226 | 1 | 0.4%
225 | 1 | 0.4%
224 | 1 | 0.4%
223 | 1 | 0.4%
222 | 1 | 0.4%
221 | 1 | 0.4%
220 | 1 | 0.4%
219 | 1 | 0.4%
218 | 1 | 0.4%
217 | 1 | 0.4%

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
사립: 217
공립: 9

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공립
2nd row: 공립
3rd row: 공립
4th row: 공립
5th row: 공립

Common Values

Value | Count | Frequency (%)
사립 | 217 | 96.0%
공립 | 9 | 4.0%
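
The frequency column is simply each count divided by the 226 observations. A minimal sketch, rebuilding the column from the reported counts rather than from the original file:

```python
from collections import Counter

# Rebuild the 구분 column from the counts in the report (not from the source file)
labels = ["사립"] * 217 + ["공립"] * 9
counts = Counter(labels)
n = sum(counts.values())

shares = {value: f"{count / n:.1%}" for value, count in counts.most_common()}
print(shares)
```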

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사립 | 217 | 96.0%
공립 | 9 | 4.0%

도서관명
Text

UNIQUE 

Distinct: 226
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 25
Median length: 19
Mean length: 10.628319
Min length: 7

Characters and Unicode

Total characters: 2402
Distinct characters: 268
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
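
To illustrate what a Unicode category count looks like, the sketch below tallies the general category of each character in one library name taken from the sample rows. It uses Python's `unicodedata` module. The mapping from two-letter codes to the labels used in this report (Lo = Other Letter, Zs = Space Separator, Ll = Lowercase Letter) comes from the Unicode Standard, while the helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for ch in text)

# Library name from the sample rows: one Latin letter, Hangul syllables, one space
counts = category_counts("e편한세상동탄 작은도서관")
print(dict(counts))
```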

Unique

Unique: 226
Unique (%): 100.0%

Sample

1st row: 기아행복마루 작은도서관
2nd row: 샘내 작은도서관
3rd row: 비봉 작은도서관
4th row: 양감면 작은도서관
5th row: 마도 작은도서관
Value | Count | Frequency (%)
작은도서관 | 222 | 45.4%
책마을 | 4 | 0.8%
동탄 | 3 | 0.6%
꿈꾸는 | 3 | 0.6%
푸른 | 2 | 0.4%
행복한 | 2 | 0.4%
꿈마루 | 2 | 0.4%
신일 | 2 | 0.4%
영어 | 2 | 0.4%
반올림 | 2 | 0.4%
Other values (242) | 245 | 50.1%
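
The table above reads as a word frequency count over the library names. The sketch below assumes plain whitespace tokenization, which is an assumption about how the profiler splits text, and applies it to the five sample rows only, so its counts are far smaller than the full-column figures.

```python
from collections import Counter

# The five sample rows of 도서관명 shown in the report
names = [
    "기아행복마루 작은도서관",
    "샘내 작은도서관",
    "비봉 작은도서관",
    "양감면 작은도서관",
    "마도 작은도서관",
]

tokens = [token for name in names for token in name.split()]
word_counts = Counter(tokens)
top_word, top_count = word_counts.most_common(1)[0]
print(top_word, top_count, f"{top_count / len(tokens):.1%}")
```

In the full column the same word, 작은도서관, accounts for 222 of the 489 tokens listed in the table (45.4%).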

Most occurring characters

ValueCountFrequency (%)
264
 
11.0%
234
 
9.7%
231
 
9.6%
230
 
9.6%
228
 
9.5%
226
 
9.4%
26
 
1.1%
24
 
1.0%
22
 
0.9%
22
 
0.9%
Other values (258) 895
37.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2099 | 87.4%
Space Separator | 264 | 11.0%
Uppercase Letter | 19 | 0.8%
Decimal Number | 15 | 0.6%
Lowercase Letter | 3 | 0.1%
Other Punctuation | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
234
 
11.1%
231
 
11.0%
230
 
11.0%
228
 
10.9%
226
 
10.8%
26
 
1.2%
24
 
1.1%
22
 
1.0%
22
 
1.0%
22
 
1.0%
Other values (239) 834
39.7%
Uppercase Letter
ValueCountFrequency (%)
L 5
26.3%
H 4
21.1%
X 2
 
10.5%
P 2
 
10.5%
A 2
 
10.5%
K 1
 
5.3%
I 1
 
5.3%
R 1
 
5.3%
U 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
2 8
53.3%
3 3
 
20.0%
1 2
 
13.3%
8 1
 
6.7%
9 1
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
k 1
33.3%
s 1
33.3%
Space Separator
ValueCountFrequency (%)
264
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2099 | 87.4%
Common | 281 | 11.7%
Latin | 22 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
234
 
11.1%
231
 
11.0%
230
 
11.0%
228
 
10.9%
226
 
10.8%
26
 
1.2%
24
 
1.1%
22
 
1.0%
22
 
1.0%
22
 
1.0%
Other values (239) 834
39.7%
Latin
ValueCountFrequency (%)
L 5
22.7%
H 4
18.2%
X 2
 
9.1%
P 2
 
9.1%
A 2
 
9.1%
K 1
 
4.5%
e 1
 
4.5%
k 1
 
4.5%
s 1
 
4.5%
I 1
 
4.5%
Other values (2) 2
 
9.1%
Common
ValueCountFrequency (%)
264
94.0%
2 8
 
2.8%
3 3
 
1.1%
, 2
 
0.7%
1 2
 
0.7%
8 1
 
0.4%
9 1
 
0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2099 | 87.4%
ASCII | 303 | 12.6%
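
A Unicode block is a contiguous range of code points, so a block tally can be approximated with range checks. The sketch below assumes that the report's "ASCII" label corresponds to U+0000 to U+007F and that "Hangul" corresponds to the Hangul Syllables block, U+AC00 to U+D7AF. The helper `block_of` is hypothetical and covers only these two ranges.

```python
from collections import Counter

def block_of(ch):
    """Classify a character into one of the two blocks seen in this report (assumed ranges)."""
    cp = ord(ch)
    if cp <= 0x7F:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7AF:  # Hangul Syllables block
        return "Hangul"
    return "Other"

# Library name from the sample rows
blocks = Counter(block_of(ch) for ch in "e편한세상동탄 작은도서관")
print(dict(blocks))
```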

Most frequent character per block

ASCII
ValueCountFrequency (%)
264
87.1%
2 8
 
2.6%
L 5
 
1.7%
H 4
 
1.3%
3 3
 
1.0%
, 2
 
0.7%
1 2
 
0.7%
X 2
 
0.7%
P 2
 
0.7%
A 2
 
0.7%
Other values (9) 9
 
3.0%
Hangul
ValueCountFrequency (%)
234
 
11.1%
231
 
11.0%
230
 
11.0%
228
 
10.9%
226
 
10.8%
26
 
1.2%
24
 
1.1%
22
 
1.0%
22
 
1.0%
22
 
1.0%
Other values (239) 834
39.7%

소재지
Text

UNIQUE 

Distinct: 226
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 73
Median length: 46
Mean length: 33.323009
Min length: 16

Characters and Unicode

Total characters: 7531
Distinct characters: 315
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
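
The mean length reported for each text variable equals its total character count divided by the 226 observations. A short check using the totals stated for 도서관명 and 소재지:

```python
n_rows = 226  # number of observations in the report

# Total characters per text variable, as reported
total_chars = {"도서관명": 2402, "소재지": 7531}

mean_length = {name: round(chars / n_rows, 6) for name, chars in total_chars.items()}
print(mean_length)
```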

Unique

Unique: 226
Unique (%): 100.0%

Sample

1st row: 경기도 화성시 우정읍 기아자동차로 559
2nd row: 경기도 화성시 매송고색로375번길 11 마을회관 2층
3rd row: 경기도 화성시 비봉면 비봉로 71번길 1 비봉면사무소 1층
4th row: 경기도 화성시 양감면 초록로 7
5th row: 경기도 화성시 마도면 마도북로 387(마도 복합문화센터 2층)
Value | Count | Frequency (%)
화성시 | 228 | 15.8%
경기도 | 226 | 15.6%
아파트 | 61 | 4.2%
봉담읍 | 28 | 1.9%
관리동 | 15 | 1.0%
향남읍 | 15 | 1.0%
동탄반석로 | 12 | 0.8%
남양읍 | 11 | 0.8%
2층 | 10 | 0.7%
1층 | 10 | 0.7%
Other values (639) | 829 | 57.4%

Most occurring characters

ValueCountFrequency (%)
1231
 
16.3%
269
 
3.6%
256
 
3.4%
251
 
3.3%
250
 
3.3%
244
 
3.2%
240
 
3.2%
1 240
 
3.2%
235
 
3.1%
205
 
2.7%
Other values (305) 4110
54.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4855 | 64.5%
Space Separator | 1231 | 16.3%
Decimal Number | 1048 | 13.9%
Other Punctuation | 182 | 2.4%
Open Punctuation | 55 | 0.7%
Close Punctuation | 55 | 0.7%
Dash Punctuation | 53 | 0.7%
Uppercase Letter | 45 | 0.6%
Lowercase Letter | 7 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
269
 
5.5%
256
 
5.3%
251
 
5.2%
250
 
5.1%
244
 
5.0%
240
 
4.9%
235
 
4.8%
205
 
4.2%
167
 
3.4%
130
 
2.7%
Other values (274) 2608
53.7%
Decimal Number
ValueCountFrequency (%)
1 240
22.9%
2 195
18.6%
3 119
11.4%
0 100
9.5%
5 84
 
8.0%
4 79
 
7.5%
6 70
 
6.7%
9 57
 
5.4%
7 54
 
5.2%
8 50
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
L 16
35.6%
H 15
33.3%
B 3
 
6.7%
A 3
 
6.7%
X 2
 
4.4%
S 2
 
4.4%
K 1
 
2.2%
C 1
 
2.2%
P 1
 
2.2%
U 1
 
2.2%
Lowercase Letter
ValueCountFrequency (%)
k 2
28.6%
c 2
28.6%
s 1
14.3%
a 1
14.3%
e 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 176
96.7%
. 6
 
3.3%
Space Separator
ValueCountFrequency (%)
1231
100.0%
Open Punctuation
ValueCountFrequency (%)
( 55
100.0%
Close Punctuation
ValueCountFrequency (%)
) 55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4855 | 64.5%
Common | 2624 | 34.8%
Latin | 52 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
269
 
5.5%
256
 
5.3%
251
 
5.2%
250
 
5.1%
244
 
5.0%
240
 
4.9%
235
 
4.8%
205
 
4.2%
167
 
3.4%
130
 
2.7%
Other values (274) 2608
53.7%
Common
ValueCountFrequency (%)
1231
46.9%
1 240
 
9.1%
2 195
 
7.4%
, 176
 
6.7%
3 119
 
4.5%
0 100
 
3.8%
5 84
 
3.2%
4 79
 
3.0%
6 70
 
2.7%
9 57
 
2.2%
Other values (6) 273
 
10.4%
Latin
ValueCountFrequency (%)
L 16
30.8%
H 15
28.8%
B 3
 
5.8%
A 3
 
5.8%
X 2
 
3.8%
k 2
 
3.8%
c 2
 
3.8%
S 2
 
3.8%
s 1
 
1.9%
a 1
 
1.9%
Other values (5) 5
 
9.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4855 | 64.5%
ASCII | 2676 | 35.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1231
46.0%
1 240
 
9.0%
2 195
 
7.3%
, 176
 
6.6%
3 119
 
4.4%
0 100
 
3.7%
5 84
 
3.1%
4 79
 
3.0%
6 70
 
2.6%
9 57
 
2.1%
Other values (21) 325
 
12.1%
Hangul
ValueCountFrequency (%)
269
 
5.5%
256
 
5.3%
251
 
5.2%
250
 
5.1%
244
 
5.0%
240
 
4.9%
235
 
4.8%
205
 
4.2%
167
 
3.4%
130
 
2.7%
Other values (274) 2608
53.7%

면적(제곱미터)
Text

Distinct: 200
Distinct (%): 88.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 16
Median length: 8
Mean length: 4.0486726
Min length: 2

Characters and Unicode

Total characters: 915
Distinct characters: 19
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 178
Unique (%): 78.8%

Sample

1st row: 162
2nd row: 211.76
3rd row: 122.4
4th row: 75
5th row: 131
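
This variable holds areas but is typed as Text. The character statistics show that a few values contain Hangul letters and parentheses, which is a plausible reason why it was not inferred as numeric. The sketch below shows one way to separate parseable values from the rest. The helper `parse_area` and the non-numeric example string are hypothetical; the actual offending values do not appear in this report.

```python
def parse_area(raw):
    """Return the area as a float, or None when the text is not a plain number."""
    try:
        return float(raw.strip())
    except ValueError:
        return None

samples = ["162", "211.76", "122.4", "75", "131"]  # sample rows from the report
hypothetical_bad = "85.2(공용)"                     # invented value, for illustration only

parsed = [parse_area(s) for s in samples]
unparsed = parse_area(hypothetical_bad)
print(parsed, unparsed)
```

Values that return None would need manual cleaning before the column can be treated as numeric.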
Value | Count | Frequency (%)
36 | 4 | 1.8%
42 | 3 | 1.3%
70 | 3 | 1.3%
85.2 | 2 | 0.9%
162 | 2 | 0.9%
64 | 2 | 0.9%
123 | 2 | 0.9%
61 | 2 | 0.9%
120 | 2 | 0.9%
81 | 2 | 0.9%
Other values (190) | 202 | 89.4%

Most occurring characters

ValueCountFrequency (%)
1 156
17.0%
. 119
13.0%
2 96
10.5%
4 83
9.1%
5 78
8.5%
7 73
8.0%
8 65
7.1%
6 65
7.1%
9 60
 
6.6%
3 59
 
6.4%
Other values (9) 61
 
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number | 786 | 85.9%
Other Punctuation | 119 | 13.0%
Other Letter | 6 | 0.7%
Open Punctuation | 2 | 0.2%
Close Punctuation | 2 | 0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 156
19.8%
2 96
12.2%
4 83
10.6%
5 78
9.9%
7 73
9.3%
8 65
8.3%
6 65
8.3%
9 60
 
7.6%
3 59
 
7.5%
0 51
 
6.5%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Other Punctuation
ValueCountFrequency (%)
. 119
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common | 909 | 99.3%
Hangul | 6 | 0.7%

Most frequent character per script

Common
ValueCountFrequency (%)
1 156
17.2%
. 119
13.1%
2 96
10.6%
4 83
9.1%
5 78
8.6%
7 73
8.0%
8 65
7.2%
6 65
7.2%
9 60
 
6.6%
3 59
 
6.5%
Other values (3) 55
 
6.1%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII | 909 | 99.3%
Hangul | 6 | 0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 156
17.2%
. 119
13.1%
2 96
10.6%
4 83
9.1%
5 78
8.6%
7 73
8.0%
8 65
7.2%
6 65
7.2%
9 60
 
6.6%
3 59
 
6.5%
Other values (3) 55
 
6.1%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Interactions


Correlations

Variable | 연번 | 구분
연번 | 1.000 | 0.743
구분 | 0.743 | 1.000

Variable | 연번 | 구분
연번 | 1.000 | 0.572
구분 | 0.572 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 구분 | 도서관명 | 소재지 | 면적(제곱미터)
0 | 1 | 공립 | 기아행복마루 작은도서관 | 경기도 화성시 우정읍 기아자동차로 559 | 162
1 | 2 | 공립 | 샘내 작은도서관 | 경기도 화성시 매송고색로375번길 11 마을회관 2층 | 211.76
2 | 3 | 공립 | 비봉 작은도서관 | 경기도 화성시 비봉면 비봉로 71번길 1 비봉면사무소 1층 | 122.4
3 | 4 | 공립 | 양감면 작은도서관 | 경기도 화성시 양감면 초록로 7 | 75
4 | 5 | 공립 | 마도 작은도서관 | 경기도 화성시 마도면 마도북로 387(마도 복합문화센터 2층) | 131
5 | 6 | 공립 | 팔탄 작은도서관 | 경기도 화성시 팔탄면 구장길 14(팔탄행정복지센터 2층) | 88
6 | 7 | 공립 | 커피앤북 작은도서관 | 경기도 화성시 팔탄면 봉담읍 상리3길 38 (커피복합문화센터 1층) | 92
7 | 8 | 공립 | 늘봄이음터 작은도서관 | 경기도 화성시 동탄대로24길 49 늘봄이음터 3층 | 138
8 | 9 | 공립 | 호연이음터 작은도서관 | 경기도 화성시 동탄대로3길 17-9 호연이음터 3층 | 140
9 | 10 | 사립 | e편한세상동탄 작은도서관 | 경기도 화성시 동탄순환대로20길 31, e편한세상동탄 아파트 | 65.065

Index | 연번 | 구분 | 도서관명 | 소재지 | 면적(제곱미터)
216 | 217 | 사립 | 향아름 작은도서관 | 경기도 화성시 향남읍 칼바위길 22-39, 향남아름다운교회 내 | 81.33
217 | 218 | 사립 | 허브와나무 작은도서관 | 경기도 화성시 동탄오산로 86-10, 동탄리코빌 1동 207호 (오산동) | 63.84
218 | 219 | 사립 | 호수나래 작은도서관 | 경기도 화성시 동탄대로6길 50, 더레이크시티부영5단지 아파트 | 173.25
219 | 220 | 사립 | 호수나무 작은도서관 | 경기도 화성시 동탄대로8길 65, 1층(더레이크시티부영2단지 아파트) | 125.55
220 | 221 | 사립 | 호수품 작은도서관 | 경기도 화성시 동탄대로6길 84, 더레이크시티부영6단지 아파트 | 187
221 | 222 | 사립 | 화성 행복한 영어 작은도서관 | 경기도 화성시 반정로204번길 53, 베들레헴 교회 교육관 2층(반정동) | 146.85
222 | 223 | 사립 | 화성동부청소년 작은도서관 | 경기도 화성시 영통로8번길 10-12 향유내음가득교회 | 175
223 | 224 | 사립 | 휴먼동화1 작은도서관 | 경기도 화성시 봉담읍 동화길 81-13, 휴먼시아동화마을 아파트 | 48.75
224 | 225 | 사립 | 휴먼빌 꿈꾸는 작은도서관 | 경기도 화성시 봉담읍 와우로 51, 봉담휴먼빌 아파트 | 74.82
225 | 226 | 사립 | 희망여기 작은도서관 | 경기도 화성시 봉담읍 오궁길 8, 수기평안교회 1층 | 77