Overview

Dataset statistics

Number of variables5
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory48.3 B

Variable types

Text2
Numeric3

Dataset

Description대전광역시의 공공도서관 현황에 대한 데이터로 대전시 소재 공공도서관의 명칭, 소재지, 건물연면적, 좌석수, 소장장서수 등의 정보를 제공합니다.
Author대전광역시
URLhttps://www.data.go.kr/data/15086348/fileData.do

Alerts

건물 연면적(제곱미터) is highly overall correlated with 좌석수(석) and 1 other fieldsHigh correlation
좌석수(석) is highly overall correlated with 건물 연면적(제곱미터) and 1 other fieldsHigh correlation
소장 장서수(권) is highly overall correlated with 건물 연면적(제곱미터) and 1 other fieldsHigh correlation
명 칭 has unique valuesUnique
소 재 지 has unique valuesUnique
건물 연면적(제곱미터) has unique valuesUnique
좌석수(석) has unique valuesUnique
소장 장서수(권) has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:12:51.261047
Analysis finished2023-12-12 12:12:52.924127
Duration1.66 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

명 칭
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T21:12:53.079663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length10
Mean length6.76
Min length5

Characters and Unicode

Total characters169
Distinct characters59
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row대전광역시한밭도서관
2nd row대전학생교육문화원
3rd row대전학생교육문화원 부설 산성도서관
4th row대전동구가오도서관
5th row대전동구용운도서관
ValueCountFrequency (%)
대전학생교육문화원 2
 
7.4%
대전광역시한밭도서관 1
 
3.7%
신탄진도서관 1
 
3.7%
송촌도서관 1
 
3.7%
진잠도서관 1
 
3.7%
유성도서관 1
 
3.7%
아가랑도서관 1
 
3.7%
원신흥도서관 1
 
3.7%
노은도서관 1
 
3.7%
구즉도서관 1
 
3.7%
Other values (16) 16
59.3%
2023-12-12T21:12:53.490895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
14.8%
25
14.8%
25
14.8%
7
 
4.1%
7
 
4.1%
6
 
3.6%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (49) 61
36.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 167
98.8%
Space Separator 2
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
15.0%
25
15.0%
25
15.0%
7
 
4.2%
7
 
4.2%
6
 
3.6%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (48) 59
35.3%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 167
98.8%
Common 2
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
15.0%
25
15.0%
25
15.0%
7
 
4.2%
7
 
4.2%
6
 
3.6%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (48) 59
35.3%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 167
98.8%
ASCII 2
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
15.0%
25
15.0%
25
15.0%
7
 
4.2%
7
 
4.2%
6
 
3.6%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (48) 59
35.3%
ASCII
ValueCountFrequency (%)
2
100.0%

소 재 지
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T21:12:53.729018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length22
Mean length17.56
Min length12

Characters and Unicode

Total characters439
Distinct characters80
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row대전 중구 서문로10(문화동)
2nd row대전 중구 동서대로 1360
3rd row대전 중구 산서로62번길 53
4th row대전 동구 동구청로 147(동구청)
5th row대전 동구 새울로68번길 23-23
ValueCountFrequency (%)
대전 24
25.0%
유성구 8
 
8.3%
동구 6
 
6.2%
서구 5
 
5.2%
중구 3
 
3.1%
대덕구 3
 
3.1%
20 2
 
2.1%
59 2
 
2.1%
반석로 1
 
1.0%
테크노4로131 1
 
1.0%
Other values (41) 41
42.7%
2023-12-12T21:12:54.180044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
72
 
16.4%
33
 
7.5%
27
 
6.2%
25
 
5.7%
24
 
5.5%
17
 
3.9%
6 15
 
3.4%
1 14
 
3.2%
3 13
 
3.0%
2 12
 
2.7%
Other values (70) 187
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 257
58.5%
Decimal Number 94
 
21.4%
Space Separator 72
 
16.4%
Close Punctuation 6
 
1.4%
Open Punctuation 6
 
1.4%
Dash Punctuation 4
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
12.8%
27
 
10.5%
25
 
9.7%
24
 
9.3%
17
 
6.6%
12
 
4.7%
10
 
3.9%
9
 
3.5%
9
 
3.5%
8
 
3.1%
Other values (56) 83
32.3%
Decimal Number
ValueCountFrequency (%)
6 15
16.0%
1 14
14.9%
3 13
13.8%
2 12
12.8%
5 8
8.5%
4 8
8.5%
7 8
8.5%
0 7
7.4%
8 5
 
5.3%
9 4
 
4.3%
Space Separator
ValueCountFrequency (%)
72
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 257
58.5%
Common 182
41.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
12.8%
27
 
10.5%
25
 
9.7%
24
 
9.3%
17
 
6.6%
12
 
4.7%
10
 
3.9%
9
 
3.5%
9
 
3.5%
8
 
3.1%
Other values (56) 83
32.3%
Common
ValueCountFrequency (%)
72
39.6%
6 15
 
8.2%
1 14
 
7.7%
3 13
 
7.1%
2 12
 
6.6%
5 8
 
4.4%
4 8
 
4.4%
7 8
 
4.4%
0 7
 
3.8%
) 6
 
3.3%
Other values (4) 19
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 257
58.5%
ASCII 182
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
72
39.6%
6 15
 
8.2%
1 14
 
7.7%
3 13
 
7.1%
2 12
 
6.6%
5 8
 
4.4%
4 8
 
4.4%
7 8
 
4.4%
0 7
 
3.8%
) 6
 
3.3%
Other values (4) 19
 
10.4%
Hangul
ValueCountFrequency (%)
33
 
12.8%
27
 
10.5%
25
 
9.7%
24
 
9.3%
17
 
6.6%
12
 
4.7%
10
 
3.9%
9
 
3.5%
9
 
3.5%
8
 
3.1%
Other values (56) 83
32.3%

건물 연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3083.84
Minimum468
Maximum22485
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T21:12:54.351630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum468
5-th percentile520.8
Q11010
median2318
Q33355
95-th percentile4970.2
Maximum22485
Range22017
Interquartile range (IQR)2345

Descriptive statistics

Standard deviation4243.3224
Coefficient of variation (CV)1.3759865
Kurtosis20.016744
Mean3083.84
Median Absolute Deviation (MAD)1266
Skewness4.2778339
Sum77096
Variance18005785
MonotonicityNot monotonic
2023-12-12T21:12:54.518100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
22485 1
 
4.0%
4990 1
 
4.0%
1944 1
 
4.0%
4891 1
 
4.0%
1983 1
 
4.0%
2590 1
 
4.0%
2465 1
 
4.0%
995 1
 
4.0%
3887 1
 
4.0%
3355 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
468 1
4.0%
511 1
4.0%
560 1
4.0%
916 1
4.0%
932 1
4.0%
995 1
4.0%
1010 1
4.0%
1496 1
4.0%
1937 1
4.0%
1944 1
4.0%
ValueCountFrequency (%)
22485 1
4.0%
4990 1
4.0%
4891 1
4.0%
3887 1
4.0%
3877 1
4.0%
3584 1
4.0%
3355 1
4.0%
2800 1
4.0%
2706 1
4.0%
2590 1
4.0%

좌석수(석)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean354.12
Minimum71
Maximum2481
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T21:12:54.668684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum71
5-th percentile74.4
Q1150
median277
Q3372
95-th percentile564.6
Maximum2481
Range2410
Interquartile range (IQR)222

Descriptive statistics

Standard deviation467.61392
Coefficient of variation (CV)1.3204956
Kurtosis19.537667
Mean354.12
Median Absolute Deviation (MAD)127
Skewness4.2037003
Sum8853
Variance218662.78
MonotonicityNot monotonic
2023-12-12T21:12:54.798022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
2481 1
 
4.0%
434 1
 
4.0%
334 1
 
4.0%
438 1
 
4.0%
220 1
 
4.0%
487 1
 
4.0%
118 1
 
4.0%
81 1
 
4.0%
165 1
 
4.0%
355 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
71 1
4.0%
73 1
4.0%
80 1
4.0%
81 1
4.0%
104 1
4.0%
118 1
4.0%
150 1
4.0%
165 1
4.0%
166 1
4.0%
181 1
4.0%
ValueCountFrequency (%)
2481 1
4.0%
582 1
4.0%
495 1
4.0%
487 1
4.0%
438 1
4.0%
434 1
4.0%
372 1
4.0%
355 1
4.0%
340 1
4.0%
334 1
4.0%

소장 장서수(권)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean123850.4
Minimum8029
Maximum888129
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T21:12:54.939777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8029
5-th percentile38384.4
Q154922
median83582
Q3130749
95-th percentile198470
Maximum888129
Range880100
Interquartile range (IQR)75827

Descriptive statistics

Standard deviation166142.4
Coefficient of variation (CV)1.3414765
Kurtosis20.588845
Mean123850.4
Median Absolute Deviation (MAD)34343
Skewness4.36319
Sum3096260
Variance2.7603297 × 1010
MonotonicityNot monotonic
2023-12-12T21:12:55.074615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
888129 1
 
4.0%
125007 1
 
4.0%
152474 1
 
4.0%
157077 1
 
4.0%
88363 1
 
4.0%
82418 1
 
4.0%
130749 1
 
4.0%
8029 1
 
4.0%
42690 1
 
4.0%
83582 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
8029 1
4.0%
37308 1
4.0%
42690 1
4.0%
44495 1
4.0%
51258 1
4.0%
51811 1
4.0%
54922 1
4.0%
62129 1
4.0%
63728 1
4.0%
64096 1
4.0%
ValueCountFrequency (%)
888129 1
4.0%
208764 1
4.0%
157294 1
4.0%
157077 1
4.0%
152474 1
4.0%
134511 1
4.0%
130749 1
4.0%
125007 1
4.0%
117925 1
4.0%
116918 1
4.0%

Interactions

2023-12-12T21:12:52.440587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:51.465587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:51.757698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:52.530109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:51.550947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:51.862432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:52.632804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:51.652748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:51.997760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:12:55.163571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
명 칭소 재 지건물 연면적(제곱미터)좌석수(석)소장 장서수(권)
명 칭1.0001.0001.0001.0001.000
소 재 지1.0001.0001.0001.0001.000
건물 연면적(제곱미터)1.0001.0001.0000.9030.897
좌석수(석)1.0001.0000.9031.0000.889
소장 장서수(권)1.0001.0000.8970.8891.000
2023-12-12T21:12:55.269938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건물 연면적(제곱미터)좌석수(석)소장 장서수(권)
건물 연면적(제곱미터)1.0000.6750.583
좌석수(석)0.6751.0000.744
소장 장서수(권)0.5830.7441.000

Missing values

2023-12-12T21:12:52.747754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:12:52.872422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

명 칭소 재 지건물 연면적(제곱미터)좌석수(석)소장 장서수(권)
0대전광역시한밭도서관대전 중구 서문로10(문화동)224852481888129
1대전학생교육문화원대전 중구 동서대로 13604990434125007
2대전학생교육문화원 부설 산성도서관대전 중구 산서로62번길 53231818171344
3대전동구가오도서관대전 동구 동구청로 147(동구청)2800196116918
4대전동구용운도서관대전 동구 새울로68번길 23-231937330117925
5대전동구판암도서관대전 동구 옥천로180번길 204687337308
6무지개도서관대전 동구 동부로 56-225117163728
7자양도서관대전 동구 백룡로 16(자양동행정복지센터)101010451258
8홍도도서관대전 동구 동산초교로34번길 56(홍도동행정복지센터)9328051811
9가수원도서관대전 서구 가수원로91-113877582157294
명 칭소 재 지건물 연면적(제곱미터)좌석수(석)소장 장서수(권)
15구암도서관대전 유성구 유성대로626번길5791637254922
16구즉도서관대전 유성구 와룡로37번길 201496277101239
17노은도서관대전 유성구 노은동로234번길34335535583582
18원신흥도서관대전 유성구 원신흥남로 59388716542690
19아가랑도서관대전광역시 유성구 반석로 78 (반석동)995818029
20유성도서관대전 유성구 대덕대로 507-382465118130749
21진잠도서관대전 유성구 진잠로길 160번길19259048782418
22송촌도서관대전 대덕구 송촌로 59198322088363
23신탄진도서관대전 대덕구 신탄진동로 644891438157077
24안산도서관대전 대덕구 계족로 663번길 251944334152474