Overview

Dataset statistics

Number of variables: 8
Number of observations: 1219
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 157
Duplicate rows (%): 12.9%
Total size in memory: 76.3 KiB
Average record size in memory: 64.1 B
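
The percentage and per-record figures follow from the raw counts. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 B; the variable names are illustrative and not part of the report:

```python
# Raw figures copied from the dataset statistics.
n_rows = 1219
n_duplicate_rows = 157
total_kib = 76.3

# Share of rows flagged as duplicates, as a percentage.
duplicate_pct = round(100 * n_duplicate_rows / n_rows, 1)

# Average bytes per record, assuming 1 KiB = 1024 B.
avg_record_bytes = round(total_kib * 1024 / n_rows, 1)

print(duplicate_pct, avg_record_bytes)
```

Both results agree with the reported 12.9% and 64.1 B after rounding to one decimal place.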

Variable types

Categorical: 6
DateTime: 2

Dataset

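Description: Data on volunteer applications at the Boryeong Municipal Libraries (중앙도서관 [Central Library] and 죽정도서관 [Jukjeong Library]), with fields including the library applied to, the application date, and the application time slots.
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=414&beforeMenuCd=DOM_000000201001001000&publicdatapk=15039886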

Alerts

데이터기준일자 has constant value "2023-05-22" (Constant)
Dataset has 157 (12.9%) duplicate rows (Duplicates)
신청시간4 is highly overall correlated with 도서관구분 and 2 other fields (High correlation)
신청시간3 is highly overall correlated with 도서관구분 and 3 other fields (High correlation)
신청시간1 is highly overall correlated with 도서관구분 and 3 other fields (High correlation)
도서관구분 is highly overall correlated with 신청시간1 and 3 other fields (High correlation)
신청시간2 is highly overall correlated with 도서관구분 and 2 other fields (High correlation)
신청시간5 is highly imbalanced (91.6%) (Imbalance)
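
The report does not state how the imbalance figure is derived. One definition that reproduces the 91.6% shown for 신청시간5 is one minus the Shannon entropy of the class frequencies, normalised by the maximum entropy for that number of classes. The sketch below assumes that definition; `imbalance_score` is a hypothetical helper name, not a function of the profiling tool.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for the class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # A single class has no entropy to normalise; treat it as balanced.
        return 0.0
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(n_classes)


# Counts for 신청시간5: <NA>, 18:00~20:00, 17:00~18:00.
score = imbalance_score([1198, 19, 2])
print(f"{score:.1%}")
```

A score near 1 means almost all rows share one value; a score of 0 means the values are evenly spread.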

Reproduction

Analysis started: 2024-01-09 22:13:18.074900
Analysis finished: 2024-01-09 22:13:18.500922
Duration: 0.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

도서관구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중앙도서관
2nd row: 중앙도서관
3rd row: 죽정도서관
4th row: 죽정도서관
5th row: 죽정도서관

Common Values

Value | Count | Frequency (%)
중앙도서관 | 749 | 61.4%
죽정도서관 | 470 | 38.6%

Length

Histogram of lengths of the category

신청일
Date

Distinct: 566
Distinct (%): 46.4%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB
Minimum: 2019-01-02 00:00:00
Maximum: 2023-05-21 00:00:00
Histogram with fixed size bins (bins=50)
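
To get a sense of what each histogram bin covers, the date range can be divided by the bin count. This is a minimal sketch that assumes the 50 bins are equal-width and span exactly from the minimum to the maximum date; the report does not confirm the binning rule.

```python
from datetime import date

# Minimum and maximum of 신청일 as reported.
first = date(2019, 1, 2)
last = date(2023, 5, 21)
n_bins = 50

# Total span in days and the width of one equal-width bin.
span_days = (last - first).days
bin_width_days = span_days / n_bins

print(span_days, bin_width_days)
```

Under that assumption each bin covers roughly one month of applications.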

신청시간1
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB

Length

Max length: 11
Median length: 4
Mean length: 7.290402
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 13:00~14:00
4th row: 13:00~14:00
5th row: 13:00~14:00

Common Values

Value | Count | Frequency (%)
<NA> | 646 | 53.0%
13:00~14:00 | 573 | 47.0%

Length

Histogram of lengths of the category
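
The length statistics for 신청시간1 appear to count the literal placeholder string `<NA>` (4 characters) as an ordinary value, which would also explain why Missing is 0 for this column. The sketch below, written under that assumption, rebuilds the mean and median length from the value counts.

```python
# (value, count) pairs for 신청시간1, taken from its Common Values figures.
value_counts = [("<NA>", 646), ("13:00~14:00", 573)]

total = sum(count for _, count in value_counts)

# Mean string length, weighting each value's length by its count.
mean_length = sum(len(value) * count for value, count in value_counts) / total

# Median string length: expand to one length per row, sort, take the middle.
lengths = sorted(len(value) for value, count in value_counts for _ in range(count))
median_length = lengths[len(lengths) // 2]

print(total, round(mean_length, 6), median_length)
```

If the placeholder were treated as a true missing value instead, every remaining entry would have length 11 and the column would report 646 missing cells.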

신청시간2
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB

Length

Max length: 11
Median length: 11
Mean length: 7.9622642
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 14:00~15:00
4th row: 14:00~15:00
5th row: <NA>

Common Values

Value | Count | Frequency (%)
14:00~15:00 | 690 | 56.6%
<NA> | 529 | 43.4%

Length

Histogram of lengths of the category

신청시간3
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB

Length

Max length: 11
Median length: 4
Mean length: 6.4807219
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 15:00~16:00
2nd row: 15:00~16:00
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 787 | 64.6%
15:00~16:00 | 432 | 35.4%

Length

Histogram of lengths of the category

신청시간4
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB

Length

Max length: 11
Median length: 4
Mean length: 5.6251025
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 16:00~17:00
2nd row: 16:00~17:00
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 936 | 76.8%
16:00~17:00 | 283 | 23.2%

Length

Histogram of lengths of the category

신청시간5
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB

Length

Max length: 11
Median length: 4
Mean length: 4.1205906
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 1198 | 98.3%
18:00~20:00 | 19 | 1.6%
17:00~18:00 | 2 | 0.2%

Length

Histogram of lengths of the category

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 9.7 KiB
Minimum: 2023-05-22 00:00:00
Maximum: 2023-05-22 00:00:00
Histogram with fixed size bins (bins=1)

Correlations

 | 도서관구분 | 신청시간5
도서관구분 | 1.000 | 0.000
신청시간5 | 0.000 | 1.000

 | 신청시간4 | 신청시간3 | 신청시간1 | 도서관구분 | 신청시간5 | 신청시간2
신청시간4 | 1.000 | 1.000 | 1.000 | 1.000 | NaN | NaN
신청시간3 | 1.000 | 1.000 | 1.000 | 1.000 | NaN | 1.000
신청시간1 | 1.000 | 1.000 | 1.000 | 1.000 | NaN | 1.000
도서관구분 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
신청시간5 | NaN | NaN | NaN | 0.000 | 1.000 | NaN
신청시간2 | NaN | 1.000 | 1.000 | 1.000 | NaN | 1.000

 | 도서관구분 | 신청시간1 | 신청시간2 | 신청시간3 | 신청시간4 | 신청시간5
도서관구분 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
신청시간1 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
신청시간2 | 1.000 | 1.000 | 1.000 | 1.000 | NaN | NaN
신청시간3 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | NaN
신청시간4 | 1.000 | 1.000 | NaN | 1.000 | 1.000 | NaN
신청시간5 | 0.000 | 0.000 | NaN | NaN | NaN | 1.000
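
The report does not say which association measure fills these matrices. For pairs of categorical columns, Cramér's V is a common choice, so the sketch below shows a plain version without bias correction. It is an illustration of the measure, not a reconstruction of the report's settings, and `cramers_v` is a hypothetical helper name.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, x_count in x_totals.items():
        for y, y_count in y_totals.items():
            expected = x_count * y_count / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(x_totals), len(y_totals)) - 1
    if k == 0:
        # A constant column has no defined association.
        return float("nan")
    return sqrt(chi2 / (n * k))


# Toy data: one perfectly associated pair and one independent pair.
perfect = cramers_v(["a", "a", "b", "b"], ["p", "p", "q", "q"])
independent = cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"])
print(perfect, independent)
```

The toy inputs are invented for illustration. Values of 1.000 in a matrix of this kind would mean one column fully determines the other, and NaN would mean the measure could not be computed for that pair.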

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
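
The report counts zero missing cells, yet the time-slot columns contain the text `<NA>`. A plausible reading is that missing values were stored as that literal string before profiling. The sketch below assumes this and shows how the placeholder could be mapped to `None` and counted per column, using the time-slot values of the first five sample rows.

```python
columns = ["신청시간1", "신청시간2", "신청시간3", "신청시간4", "신청시간5"]

# Time-slot cells of sample rows 0 to 4.
rows = [
    ["<NA>", "<NA>", "15:00~16:00", "16:00~17:00", "<NA>"],
    ["<NA>", "<NA>", "15:00~16:00", "16:00~17:00", "<NA>"],
    ["13:00~14:00", "14:00~15:00", "<NA>", "<NA>", "<NA>"],
    ["13:00~14:00", "14:00~15:00", "<NA>", "<NA>", "<NA>"],
    ["13:00~14:00", "<NA>", "<NA>", "<NA>", "<NA>"],
]

PLACEHOLDER = "<NA>"  # assumed marker for "no application in this slot"

# Replace the placeholder text with a real missing value.
cleaned = [[None if cell == PLACEHOLDER else cell for cell in row] for row in rows]

# Count missing cells per column.
missing_per_column = {
    name: sum(row[i] is None for row in cleaned) for i, name in enumerate(columns)
}
print(missing_per_column)
```

Applied to the full dataset, the same conversion would make the nullity plots reflect which slots were actually requested.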

Sample

Index | 도서관구분 | 신청일 | 신청시간1 | 신청시간2 | 신청시간3 | 신청시간4 | 신청시간5 | 데이터기준일자
0 | 중앙도서관 | 2019-01-02 | <NA> | <NA> | 15:00~16:00 | 16:00~17:00 | <NA> | 2023-05-22
1 | 중앙도서관 | 2019-01-03 | <NA> | <NA> | 15:00~16:00 | 16:00~17:00 | <NA> | 2023-05-22
2 | 죽정도서관 | 2019-01-05 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22
3 | 죽정도서관 | 2019-01-12 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22
4 | 죽정도서관 | 2019-01-19 | 13:00~14:00 | <NA> | <NA> | <NA> | <NA> | 2023-05-22
5 | 중앙도서관 | 2019-01-03 | <NA> | <NA> | <NA> | 16:00~17:00 | <NA> | 2023-05-22
6 | 중앙도서관 | 2019-01-02 | <NA> | <NA> | <NA> | 16:00~17:00 | <NA> | 2023-05-22
7 | 중앙도서관 | 2019-01-07 | <NA> | <NA> | <NA> | 16:00~17:00 | <NA> | 2023-05-22
8 | 중앙도서관 | 2019-01-07 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22
9 | 죽정도서관 | 2019-01-13 | <NA> | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22

Index | 도서관구분 | 신청일 | 신청시간1 | 신청시간2 | 신청시간3 | 신청시간4 | 신청시간5 | 데이터기준일자
1209 | 죽정도서관 | 2023-02-03 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22
1210 | 죽정도서관 | 2023-02-04 | <NA> | <NA> | 15:00~16:00 | 16:00~17:00 | <NA> | 2023-05-22
1211 | 죽정도서관 | 2023-02-05 | <NA> | <NA> | <NA> | 16:00~17:00 | 17:00~18:00 | 2023-05-22
1212 | 죽정도서관 | 2023-03-24 | <NA> | <NA> | <NA> | <NA> | 17:00~18:00 | 2023-05-22
1213 | 죽정도서관 | 2023-03-31 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22
1214 | 중앙도서관 | 2023-04-09 | <NA> | <NA> | 15:00~16:00 | <NA> | <NA> | 2023-05-22
1215 | 중앙도서관 | 2023-04-16 | <NA> | <NA> | 15:00~16:00 | <NA> | <NA> | 2023-05-22
1216 | 죽정도서관 | 2023-05-13 | <NA> | 14:00~15:00 | 15:00~16:00 | <NA> | <NA> | 2023-05-22
1217 | 죽정도서관 | 2023-05-20 | <NA> | 14:00~15:00 | 15:00~16:00 | <NA> | <NA> | 2023-05-22
1218 | 죽정도서관 | 2023-05-21 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22

Duplicate rows

Most frequently occurring

Index | 도서관구분 | 신청일 | 신청시간1 | 신청시간2 | 신청시간3 | 신청시간4 | 신청시간5 | 데이터기준일자 | # duplicates
102 | 중앙도서관 | 2019-11-13 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22 | 3
140 | 중앙도서관 | 2021-03-30 | <NA> | <NA> | <NA> | 16:00~17:00 | <NA> | 2023-05-22 | 3
0 | 죽정도서관 | 2019-01-13 | <NA> | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22 | 2
1 | 죽정도서관 | 2019-02-01 | <NA> | <NA> | 15:00~16:00 | 16:00~17:00 | <NA> | 2023-05-22 | 2
2 | 죽정도서관 | 2019-02-08 | <NA> | <NA> | 15:00~16:00 | 16:00~17:00 | <NA> | 2023-05-22 | 2
3 | 죽정도서관 | 2019-03-18 | <NA> | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22 | 2
4 | 죽정도서관 | 2019-03-30 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22 | 2
5 | 죽정도서관 | 2019-04-07 | 13:00~14:00 | 14:00~15:00 | <NA> | <NA> | <NA> | 2023-05-22 | 2
6 | 죽정도서관 | 2019-04-21 | <NA> | <NA> | 15:00~16:00 | 16:00~17:00 | <NA> | 2023-05-22 | 2
7 | 죽정도서관 | 2019-05-18 | 13:00~14:00 | <NA> | <NA> | <NA> | <NA> | 2023-05-22 | 2
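
A duplicate count like the one in the last column can be obtained by grouping identical rows. This is a minimal sketch using the standard library; the row tuples are shortened to four fields and the list is a small illustrative sample, not the full dataset.

```python
from collections import Counter

# Shortened rows: (도서관구분, 신청일, 신청시간1, 신청시간2).
rows = [
    ("중앙도서관", "2019-11-13", "13:00~14:00", "14:00~15:00"),
    ("죽정도서관", "2019-01-13", "<NA>", "14:00~15:00"),
    ("중앙도서관", "2019-11-13", "13:00~14:00", "14:00~15:00"),
    ("중앙도서관", "2019-01-07", "13:00~14:00", "14:00~15:00"),
    ("죽정도서관", "2019-01-13", "<NA>", "14:00~15:00"),
    ("중앙도서관", "2019-11-13", "13:00~14:00", "14:00~15:00"),
]

# Count how often each identical row occurs, then keep only repeated ones.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

# List the repeated rows, most frequent first.
for row, n in sorted(duplicates.items(), key=lambda item: -item[1]):
    print(n, row)
```

Because several volunteers can apply for the same library, date, and slot, repeated rows here may be legitimate records rather than errors, so dropping them should be a deliberate decision.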