Overview

Dataset statistics

Number of variables8
Number of observations1219
Missing cells0
Missing cells (%)0.0%
Duplicate rows157
Duplicate rows (%)12.9%
Total size in memory76.3 KiB
Average record size in memory64.1 B

Variable types

Categorical6
DateTime2

Dataset

Description보령시립도서관(중앙도서관, 죽정도서관) 자원봉사 신청에 대한 데이터로 신청 도서관, 신청일, 신청시간 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15039886/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 157 (12.9%) duplicate rowsDuplicates
도서관구분 is highly overall correlated with 신청시간1 and 3 other fieldsHigh correlation
신청시간4 is highly overall correlated with 도서관구분 and 2 other fieldsHigh correlation
신청시간1 is highly overall correlated with 도서관구분 and 3 other fieldsHigh correlation
신청시간2 is highly overall correlated with 도서관구분 and 2 other fieldsHigh correlation
신청시간3 is highly overall correlated with 도서관구분 and 3 other fieldsHigh correlation
신청시간5 is highly imbalanced (91.6%)Imbalance

Reproduction

Analysis started2023-12-12 15:08:33.390987
Analysis finished2023-12-12 15:08:33.882739
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도서관구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
중앙도서관
749 
죽정도서관
470 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중앙도서관
2nd row중앙도서관
3rd row죽정도서관
4th row죽정도서관
5th row죽정도서관

Common Values

ValueCountFrequency (%)
중앙도서관 749
61.4%
죽정도서관 470
38.6%

Length

2023-12-13T00:08:33.942676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:08:34.030423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중앙도서관 749
61.4%
죽정도서관 470
38.6%
Distinct566
Distinct (%)46.4%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
Minimum2019-01-02 00:00:00
Maximum2023-05-21 00:00:00
2023-12-13T00:08:34.125753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:08:34.291888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

신청시간1
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
<NA>
646 
13:00~14:00
573 

Length

Max length11
Median length4
Mean length7.290402
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row13:00~14:00
4th row13:00~14:00
5th row13:00~14:00

Common Values

ValueCountFrequency (%)
<NA> 646
53.0%
13:00~14:00 573
47.0%

Length

2023-12-13T00:08:34.421527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:08:34.534639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 646
53.0%
13:00~14:00 573
47.0%

신청시간2
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
14:00~15:00
690 
<NA>
529 

Length

Max length11
Median length11
Mean length7.9622642
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row14:00~15:00
4th row14:00~15:00
5th row<NA>

Common Values

ValueCountFrequency (%)
14:00~15:00 690
56.6%
<NA> 529
43.4%

Length

2023-12-13T00:08:34.636982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:08:34.767226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
14:00~15:00 690
56.6%
na 529
43.4%

신청시간3
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
<NA>
787 
15:00~16:00
432 

Length

Max length11
Median length4
Mean length6.4807219
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15:00~16:00
2nd row15:00~16:00
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 787
64.6%
15:00~16:00 432
35.4%

Length

2023-12-13T00:08:34.872601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:08:34.998626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 787
64.6%
15:00~16:00 432
35.4%

신청시간4
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
<NA>
936 
16:00~17:00
283 

Length

Max length11
Median length4
Mean length5.6251025
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row16:00~17:00
2nd row16:00~17:00
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 936
76.8%
16:00~17:00 283
 
23.2%

Length

2023-12-13T00:08:35.111209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:08:35.255794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 936
76.8%
16:00~17:00 283
 
23.2%

신청시간5
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
<NA>
1198 
18:00~20:00
 
19
17:00~18:00
 
2

Length

Max length11
Median length4
Mean length4.1205906
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1198
98.3%
18:00~20:00 19
 
1.6%
17:00~18:00 2
 
0.2%

Length

2023-12-13T00:08:35.384265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:08:35.516424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1198
98.3%
18:00~20:00 19
 
1.6%
17:00~18:00 2
 
0.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
Minimum2023-05-22 00:00:00
Maximum2023-05-22 00:00:00
2023-12-13T00:08:35.618653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:08:35.741405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T00:08:35.829521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관구분신청시간5
도서관구분1.0000.000
신청시간50.0001.000
2023-12-13T00:08:35.938144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관구분신청시간5신청시간4신청시간1신청시간2신청시간3
도서관구분1.0000.0001.0001.0001.0001.000
신청시간50.0001.000NaNNaNNaNNaN
신청시간41.000NaN1.0001.000NaN1.000
신청시간11.000NaN1.0001.0001.0001.000
신청시간21.000NaNNaN1.0001.0001.000
신청시간31.000NaN1.0001.0001.0001.000
2023-12-13T00:08:36.089154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관구분신청시간1신청시간2신청시간3신청시간4신청시간5
도서관구분1.0001.0001.0001.0001.0000.000
신청시간11.0001.0001.0001.0001.0000.000
신청시간21.0001.0001.0001.000NaNNaN
신청시간31.0001.0001.0001.0001.000NaN
신청시간41.0001.000NaN1.0001.000NaN
신청시간50.0000.000NaNNaNNaN1.000

Missing values

2023-12-13T00:08:33.712401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:08:33.832123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서관구분신청일신청시간1신청시간2신청시간3신청시간4신청시간5데이터기준일자
0중앙도서관2019-01-02<NA><NA>15:00~16:0016:00~17:00<NA>2023-05-22
1중앙도서관2019-01-03<NA><NA>15:00~16:0016:00~17:00<NA>2023-05-22
2죽정도서관2019-01-0513:00~14:0014:00~15:00<NA><NA><NA>2023-05-22
3죽정도서관2019-01-1213:00~14:0014:00~15:00<NA><NA><NA>2023-05-22
4죽정도서관2019-01-1913:00~14:00<NA><NA><NA><NA>2023-05-22
5중앙도서관2019-01-03<NA><NA><NA>16:00~17:00<NA>2023-05-22
6중앙도서관2019-01-02<NA><NA><NA>16:00~17:00<NA>2023-05-22
7중앙도서관2019-01-07<NA><NA><NA>16:00~17:00<NA>2023-05-22
8중앙도서관2019-01-0713:00~14:0014:00~15:00<NA><NA><NA>2023-05-22
9죽정도서관2019-01-13<NA>14:00~15:00<NA><NA><NA>2023-05-22
도서관구분신청일신청시간1신청시간2신청시간3신청시간4신청시간5데이터기준일자
1209죽정도서관2023-02-0313:00~14:0014:00~15:00<NA><NA><NA>2023-05-22
1210죽정도서관2023-02-04<NA><NA>15:00~16:0016:00~17:00<NA>2023-05-22
1211죽정도서관2023-02-05<NA><NA><NA>16:00~17:0017:00~18:002023-05-22
1212죽정도서관2023-03-24<NA><NA><NA><NA>17:00~18:002023-05-22
1213죽정도서관2023-03-3113:00~14:0014:00~15:00<NA><NA><NA>2023-05-22
1214중앙도서관2023-04-09<NA><NA>15:00~16:00<NA><NA>2023-05-22
1215중앙도서관2023-04-16<NA><NA>15:00~16:00<NA><NA>2023-05-22
1216죽정도서관2023-05-13<NA>14:00~15:0015:00~16:00<NA><NA>2023-05-22
1217죽정도서관2023-05-20<NA>14:00~15:0015:00~16:00<NA><NA>2023-05-22
1218죽정도서관2023-05-2113:00~14:0014:00~15:00<NA><NA><NA>2023-05-22

Duplicate rows

Most frequently occurring

도서관구분신청일신청시간1신청시간2신청시간3신청시간4신청시간5데이터기준일자# duplicates
102중앙도서관2019-11-1313:00~14:0014:00~15:00<NA><NA><NA>2023-05-223
140중앙도서관2021-03-30<NA><NA><NA>16:00~17:00<NA>2023-05-223
0죽정도서관2019-01-13<NA>14:00~15:00<NA><NA><NA>2023-05-222
1죽정도서관2019-02-01<NA><NA>15:00~16:0016:00~17:00<NA>2023-05-222
2죽정도서관2019-02-08<NA><NA>15:00~16:0016:00~17:00<NA>2023-05-222
3죽정도서관2019-03-18<NA>14:00~15:00<NA><NA><NA>2023-05-222
4죽정도서관2019-03-3013:00~14:0014:00~15:00<NA><NA><NA>2023-05-222
5죽정도서관2019-04-0713:00~14:0014:00~15:00<NA><NA><NA>2023-05-222
6죽정도서관2019-04-21<NA><NA>15:00~16:0016:00~17:00<NA>2023-05-222
7죽정도서관2019-05-1813:00~14:00<NA><NA><NA><NA>2023-05-222