Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells4
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical3
DateTime1

Dataset

Description광주광역시 광산구 내 구립도서관 이용자 현황 정보에 관한 데이터로 성별, 이용일자, 출생연도, 데이터기준일자 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15031936/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 도서관명High correlation
도서관명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:39:41.943019
Analysis finished2023-12-12 13:39:42.761121
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8687.0238
Minimum1
Maximum17231
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:39:42.825032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile877.95
Q14391.75
median8709
Q312953.5
95-th percentile16389.05
Maximum17231
Range17230
Interquartile range (IQR)8561.75

Descriptive statistics

Standard deviation4972.9214
Coefficient of variation (CV)0.57245398
Kurtosis-1.1963448
Mean8687.0238
Median Absolute Deviation (MAD)4276.5
Skewness-0.016813702
Sum86870238
Variance24729947
MonotonicityNot monotonic
2023-12-12T22:39:42.962499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11765 1
 
< 0.1%
7009 1
 
< 0.1%
11628 1
 
< 0.1%
7756 1
 
< 0.1%
15306 1
 
< 0.1%
857 1
 
< 0.1%
7050 1
 
< 0.1%
12798 1
 
< 0.1%
11360 1
 
< 0.1%
13377 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
6 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
ValueCountFrequency (%)
17231 1
< 0.1%
17229 1
< 0.1%
17228 1
< 0.1%
17226 1
< 0.1%
17225 1
< 0.1%
17224 1
< 0.1%
17223 1
< 0.1%
17221 1
< 0.1%
17220 1
< 0.1%
17215 1
< 0.1%

도서관명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
첨단도서관
5404 
장덕도서관
2331 
이야기꽃도서관
1203 
운남어린이도서관
1062 

Length

Max length8
Median length5
Mean length5.5592
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row첨단도서관
2nd row첨단도서관
3rd row이야기꽃도서관
4th row이야기꽃도서관
5th row장덕도서관

Common Values

ValueCountFrequency (%)
첨단도서관 5404
54.0%
장덕도서관 2331
23.3%
이야기꽃도서관 1203
 
12.0%
운남어린이도서관 1062
 
10.6%

Length

2023-12-12T22:39:43.092893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:39:43.194375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
첨단도서관 5404
54.0%
장덕도서관 2331
23.3%
이야기꽃도서관 1203
 
12.0%
운남어린이도서관 1062
 
10.6%

성별
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6521 
3397 
<NA>
 
82

Length

Max length4
Median length1
Mean length1.0246
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
6521
65.2%
3397
34.0%
<NA> 82
 
0.8%

Length

2023-12-12T22:39:43.306817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:39:43.702131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6521
65.2%
3397
34.0%
na 82
 
0.8%

출생연도
Real number (ℝ)

Distinct84
Distinct (%)0.8%
Missing4
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1989.5178
Minimum1901
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:39:43.802659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1901
5-th percentile1968
Q11978
median1985
Q32004
95-th percentile2014
Maximum2021
Range120
Interquartile range (IQR)26

Descriptive statistics

Standard deviation15.387956
Coefficient of variation (CV)0.0077345151
Kurtosis-0.69686758
Mean1989.5178
Median Absolute Deviation (MAD)10
Skewness0.13597359
Sum19887220
Variance236.78918
MonotonicityNot monotonic
2023-12-12T22:39:43.946131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1980 421
 
4.2%
1981 415
 
4.2%
1983 411
 
4.1%
1982 394
 
3.9%
1979 364
 
3.6%
1978 310
 
3.1%
1976 296
 
3.0%
1984 281
 
2.8%
1985 281
 
2.8%
1977 269
 
2.7%
Other values (74) 6554
65.5%
ValueCountFrequency (%)
1901 1
 
< 0.1%
1933 2
 
< 0.1%
1938 2
 
< 0.1%
1940 3
< 0.1%
1941 2
 
< 0.1%
1942 1
 
< 0.1%
1944 3
< 0.1%
1945 5
0.1%
1946 5
0.1%
1947 2
 
< 0.1%
ValueCountFrequency (%)
2021 8
 
0.1%
2020 2
 
< 0.1%
2019 16
 
0.2%
2018 76
 
0.8%
2017 89
 
0.9%
2016 96
 
1.0%
2015 169
1.7%
2014 180
1.8%
2013 240
2.4%
2012 249
2.5%
Distinct355
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-02 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T22:39:44.071852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:39:44.198280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-12-31
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-31
2nd row2022-12-31
3rd row2022-12-31
4th row2022-12-31
5th row2022-12-31

Common Values

ValueCountFrequency (%)
2022-12-31 10000
100.0%

Length

2023-12-12T22:39:44.330551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:39:44.424680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-31 10000
100.0%

Interactions

2023-12-12T22:39:42.401110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:39:42.225676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:39:42.485937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:39:42.310237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:39:44.478819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번도서관명성별출생연도
연번1.0000.9210.0150.092
도서관명0.9211.0000.0150.120
성별0.0150.0151.0000.228
출생연도0.0920.1200.2281.000
2023-12-12T22:39:44.563315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별도서관명
성별1.0000.010
도서관명0.0101.000
2023-12-12T22:39:44.642989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번출생연도도서관명성별
연번1.000-0.0100.8200.012
출생연도-0.0101.0000.0540.171
도서관명0.8200.0541.0000.010
성별0.0120.1710.0101.000

Missing values

2023-12-12T22:39:42.600266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:39:42.713179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번도서관명성별출생연도이용일자데이터기준일자
1176411765첨단도서관19792022-07-172022-12-31
1208412085첨단도서관20082022-07-272022-12-31
69946995이야기꽃도서관19902022-09-032022-12-31
75757576이야기꽃도서관19762022-10-122022-12-31
39143915장덕도서관19802022-12-212022-12-31
34383439장덕도서관19762022-11-252022-12-31
51935194운남어린이도서관20062022-08-272022-12-31
57985799운남어린이도서관20162022-12-032022-12-31
1295212953첨단도서관19562022-08-192022-12-31
87998800첨단도서관19802022-02-112022-12-31
연번도서관명성별출생연도이용일자데이터기준일자
1399613997첨단도서관19802022-09-232022-12-31
1580315804첨단도서관19762022-11-252022-12-31
1376413765첨단도서관19882022-09-152022-12-31
45464547운남어린이도서관20162022-04-302022-12-31
75557556이야기꽃도서관19832022-08-132022-12-31
1507815079첨단도서관19782022-11-032022-12-31
1143711438첨단도서관20112022-07-032022-12-31
82808281첨단도서관19892022-01-172022-12-31
16781679장덕도서관19802022-07-172022-12-31
33733374장덕도서관19802022-11-202022-12-31