Overview

Dataset statistics

Number of variables8
Number of observations1008
Missing cells0
Missing cells (%)0.0%
Duplicate rows126
Duplicate rows (%)12.5%
Total size in memory63.1 KiB
Average record size in memory64.1 B

Variable types

Categorical6
DateTime2

Dataset

Description보령시립도서관(중앙도서관, 죽정도서관) 자원봉사 신청에 대한 데이터로 신청 도서관, 신청일, 신청시간 등의 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=414&beforeMenuCd=DOM_000000201001001000&publicdatapk=15039886

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 126 (12.5%) duplicate rowsDuplicates
신청시간4 is highly overall correlated with 도서관구분 and 2 other fieldsHigh correlation
신청시간3 is highly overall correlated with 도서관구분 and 3 other fieldsHigh correlation
신청시간1 is highly overall correlated with 도서관구분 and 2 other fieldsHigh correlation
도서관구분 is highly overall correlated with 신청시간1 and 4 other fieldsHigh correlation
신청시간5 is highly overall correlated with 도서관구분High correlation
신청시간2 is highly overall correlated with 도서관구분 and 3 other fieldsHigh correlation
신청시간5 is highly imbalanced (96.3%)Imbalance

Reproduction

Analysis started2024-01-09 22:13:15.508255
Analysis finished2024-01-09 22:13:15.972588
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도서관구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
중앙도서관
634 
죽정도서관
374 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중앙도서관
2nd row중앙도서관
3rd row죽정도서관
4th row죽정도서관
5th row죽정도서관

Common Values

ValueCountFrequency (%)
중앙도서관 634
62.9%
죽정도서관 374
37.1%

Length

2024-01-10T07:13:16.022371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:16.095381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중앙도서관 634
62.9%
죽정도서관 374
37.1%
Distinct443
Distinct (%)43.9%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
Minimum2019-01-02 00:00:00
Maximum2021-07-31 00:00:00
2024-01-10T07:13:16.177543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:13:16.286812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

신청시간1
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
<NA>
530 
13:00~14:00
478 

Length

Max length11
Median length4
Mean length7.3194444
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row13:00~14:00
4th row13:00~14:00
5th row13:00~14:00

Common Values

ValueCountFrequency (%)
<NA> 530
52.6%
13:00~14:00 478
47.4%

Length

2024-01-10T07:13:16.400983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:16.479311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 530
52.6%
13:00~14:00 478
47.4%

신청시간2
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
14:00~15:00
576 
<NA>
432 

Length

Max length11
Median length11
Mean length8
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row14:00~15:00
4th row14:00~15:00
5th row<NA>

Common Values

ValueCountFrequency (%)
14:00~15:00 576
57.1%
<NA> 432
42.9%

Length

2024-01-10T07:13:16.562136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:16.665964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
14:00~15:00 576
57.1%
na 432
42.9%

신청시간3
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
<NA>
638 
15:00~16:00
370 

Length

Max length11
Median length4
Mean length6.5694444
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15:00~16:00
2nd row15:00~16:00
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 638
63.3%
15:00~16:00 370
36.7%

Length

2024-01-10T07:13:16.751169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:16.827118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 638
63.3%
15:00~16:00 370
36.7%

신청시간4
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
<NA>
769 
16:00~17:00
239 

Length

Max length11
Median length4
Mean length5.6597222
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row16:00~17:00
2nd row16:00~17:00
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 769
76.3%
16:00~17:00 239
 
23.7%

Length

2024-01-10T07:13:16.907714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:16.985607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 769
76.3%
16:00~17:00 239
 
23.7%

신청시간5
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
<NA>
1004 
18:00~20:00
 
4

Length

Max length11
Median length4
Mean length4.0277778
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1004
99.6%
18:00~20:00 4
 
0.4%

Length

2024-01-10T07:13:17.088558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:17.188818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1004
99.6%
18:00~20:00 4
 
0.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
Minimum2021-07-15 00:00:00
Maximum2021-07-15 00:00:00
2024-01-10T07:13:17.270751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:13:17.342381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2024-01-10T07:13:17.392235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관구분
도서관구분1.000
2024-01-10T07:13:17.470150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청시간4신청시간3신청시간1도서관구분신청시간5신청시간2
신청시간41.0001.000NaN1.000NaN1.000
신청시간31.0001.0001.0001.000NaN1.000
신청시간1NaN1.0001.0001.000NaN1.000
도서관구분1.0001.0001.0001.0001.0001.000
신청시간5NaNNaNNaN1.0001.000NaN
신청시간21.0001.0001.0001.000NaN1.000
2024-01-10T07:13:17.563355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서관구분신청시간1신청시간2신청시간3신청시간4신청시간5
도서관구분1.0001.0001.0001.0001.0001.000
신청시간11.0001.0001.0001.000NaN0.000
신청시간21.0001.0001.0001.0001.0000.000
신청시간31.0001.0001.0001.0001.0000.000
신청시간41.000NaN1.0001.0001.0000.000
신청시간51.0000.0000.0000.0000.0001.000

Missing values

2024-01-10T07:13:15.833935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:13:15.929131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서관구분신청일신청시간1신청시간2신청시간3신청시간4신청시간5데이터기준일자
0중앙도서관2019-01-02<NA><NA>15:00~16:0016:00~17:00<NA>2021-07-15
1중앙도서관2019-01-03<NA><NA>15:00~16:0016:00~17:00<NA>2021-07-15
2죽정도서관2019-01-0513:00~14:0014:00~15:00<NA><NA><NA>2021-07-15
3죽정도서관2019-01-1213:00~14:0014:00~15:00<NA><NA><NA>2021-07-15
4죽정도서관2019-01-1913:00~14:00<NA><NA><NA><NA>2021-07-15
5중앙도서관2019-01-03<NA><NA><NA>16:00~17:00<NA>2021-07-15
6중앙도서관2019-01-02<NA><NA><NA>16:00~17:00<NA>2021-07-15
7중앙도서관2019-01-07<NA><NA><NA>16:00~17:00<NA>2021-07-15
8중앙도서관2019-01-0713:00~14:0014:00~15:00<NA><NA><NA>2021-07-15
9죽정도서관2019-01-13<NA>14:00~15:00<NA><NA><NA>2021-07-15
도서관구분신청일신청시간1신청시간2신청시간3신청시간4신청시간5데이터기준일자
998죽정도서관2021-07-1613:00~14:00<NA><NA><NA><NA>2021-07-15
999죽정도서관2021-07-17<NA>14:00~15:0015:00~16:00<NA><NA>2021-07-15
1000중앙도서관2021-07-1713:00~14:0014:00~15:00<NA><NA><NA>2021-07-15
1001죽정도서관2021-07-1813:00~14:0014:00~15:00<NA><NA><NA>2021-07-15
1002죽정도서관2021-07-2313:00~14:00<NA><NA><NA><NA>2021-07-15
1003죽정도서관2021-07-24<NA>14:00~15:0015:00~16:00<NA><NA>2021-07-15
1004죽정도서관2021-07-2513:00~14:0014:00~15:00<NA><NA><NA>2021-07-15
1005죽정도서관2021-07-3013:00~14:00<NA><NA><NA><NA>2021-07-15
1006중앙도서관2021-07-3113:00~14:00<NA><NA><NA><NA>2021-07-15
1007죽정도서관2021-07-3113:00~14:0014:00~15:00<NA><NA><NA>2021-07-15

Duplicate rows

Most frequently occurring

도서관구분신청일신청시간1신청시간2신청시간3신청시간4신청시간5데이터기준일자# duplicates
89중앙도서관2019-11-1313:00~14:0014:00~15:00<NA><NA><NA>2021-07-153
0죽정도서관2019-01-13<NA>14:00~15:00<NA><NA><NA>2021-07-152
1죽정도서관2019-02-01<NA><NA>15:00~16:0016:00~17:00<NA>2021-07-152
2죽정도서관2019-02-08<NA><NA>15:00~16:0016:00~17:00<NA>2021-07-152
3죽정도서관2019-03-18<NA>14:00~15:00<NA><NA><NA>2021-07-152
4죽정도서관2019-03-3013:00~14:0014:00~15:00<NA><NA><NA>2021-07-152
5죽정도서관2019-04-0713:00~14:0014:00~15:00<NA><NA><NA>2021-07-152
6죽정도서관2019-04-21<NA><NA>15:00~16:0016:00~17:00<NA>2021-07-152
7죽정도서관2019-05-1813:00~14:00<NA><NA><NA><NA>2021-07-152
8죽정도서관2019-05-26<NA><NA>15:00~16:0016:00~17:00<NA>2021-07-152