Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 282
Duplicate rows (%): 2.8%
Total size in memory: 644.5 KiB
Average record size in memory: 66.0 B

Variable types

DateTime: 3
Categorical: 3
Boolean: 1

Dataset

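Description: Bicycle rental and return records for Incheon Grand Park (인천대공원), which is operated by the Incheon Facilities Corporation (인천시설공단). Incheon Grand Park is located at 238 Munemi-ro, Namdong-gu, Incheon.
Author: Incheon Facilities Corporation (인천시설공단)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15040808&srcSe=7661IVAWM27C61E190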

Alerts

Duplicates: Dataset has 282 (2.8%) duplicate rows
High correlation: 대여가격 (rental price) is highly overall correlated with 자전거종류 (bicycle type)
High correlation: 자전거종류 (bicycle type) is highly overall correlated with 대여가격 (rental price)
High correlation: 반납지연 (late return) is highly overall correlated with 추가금액 (extra charge)
High correlation: 추가금액 (extra charge) is highly overall correlated with 반납지연 (late return)
Imbalance: 추가금액 (extra charge) is highly imbalanced (84.4%)
Imbalance: 반납지연 (late return) is highly imbalanced (77.8%)
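
The two imbalance percentages above are consistent with one minus the normalized Shannon entropy of the category counts. The sketch below illustrates that reading; it is an assumption checked only against the figures in this report, not the profiler's own code. The function name `imbalance_score` is hypothetical, and the counts are copied from the 추가금액 and 반납지연 frequency tables in the Variables section.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the counts
    and k is the number of categories (assumed formula, see the text above)."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

extra_charge = imbalance_score([9643, 291, 66])  # 추가금액: <NA>, 2000, 1000
late_return = imbalance_score([9643, 357])       # 반납지연: False, True

print(f"{extra_charge:.1%}")  # 84.4%
print(f"{late_return:.1%}")   # 77.8%
```

A score of 0% would mean all categories are equally frequent; a score near 100% means almost every row falls into a single category.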

Reproduction

Analysis started: 2024-03-18 04:24:02.476755
Analysis finished: 2024-03-18 04:24:04.215842
Duration: 1.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대여날짜 (rental date)
DateTime

Distinct: 328
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2011-01-21 00:00:00
Maximum: 2012-04-26 00:00:00

Histogram with fixed size bins (bins=50)

대여시간 (rental time)
DateTime

Distinct: 541
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2024-03-18 09:00:00
Maximum: 2024-03-18 18:04:00
Note: the column holds a time of day only (see Sample); the 2024-03-18 date part matches the analysis date.

Histogram with fixed size bins (bins=50)

반납시간 (return time)
DateTime

Distinct: 554
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2024-03-18 09:16:00
Maximum: 2024-03-18 19:00:00
Note: the column holds a time of day only (see Sample); the 2024-03-18 date part matches the analysis date.

Histogram with fixed size bins (bins=50)

자전거종류 (bicycle type)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Category counts: 1인용 (1-person) 5656, 2인용 (2-person) 2241, 다인용 (multi-person) 2103

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 다인용
2nd row: 1인용
3rd row: 다인용
4th row: 2인용
5th row: 2인용

Common Values

Value    Count    Frequency (%)
1인용     5656     56.6%
2인용     2241     22.4%
다인용    2103     21.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the value counts listed under Common Values.

대여가격 (rental price)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Category counts: 2000 5656, 5000 2241, 10000 2103

Length

Max length: 5
Median length: 4
Mean length: 4.2103
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 10000
2nd row: 2000
3rd row: 10000
4th row: 5000
5th row: 5000

Common Values

Value    Count    Frequency (%)
2000     5656     56.6%
5000     2241     22.4%
10000    2103     21.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the value counts listed under Common Values.

추가금액 (extra charge)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Category counts: <NA> 9643, 2000 291, 1000 66

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value    Count    Frequency (%)
<NA>     9643     96.4%
2000     291      2.9%
1000     66       0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the value counts listed under Common Values.

반납지연 (late return)
Boolean

HIGH CORRELATION  IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB

Value    Count    Frequency (%)
False    9643     96.4%
True     357      3.6%

Correlations

              자전거종류   대여가격   추가금액   반납지연
자전거종류    1.000        1.000      0.000      0.059
대여가격      1.000        1.000      0.000      0.059
추가금액      0.000        0.000      1.000      NaN
반납지연      0.059        0.059      NaN        1.000

              대여가격   자전거종류   반납지연   추가금액
대여가격      1.000      1.000        0.098      0.000
자전거종류    1.000      1.000        0.098      0.000
반납지연      0.098      0.098        1.000      1.000
추가금액      0.000      0.000        1.000      1.000

              자전거종류   대여가격   추가금액   반납지연
자전거종류    1.000        1.000      0.000      0.098
대여가격      1.000        1.000      0.000      0.098
추가금액      0.000        0.000      1.000      1.000
반납지연      0.098        0.098      1.000      1.000
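
The 1.000 entries between 자전거종류 and 대여가격 are what one would expect if each bicycle type maps to exactly one price, as the matching counts (5656, 2241, 2103) suggest. The sketch below illustrates this with plain, uncorrected Cramér's V. The source does not state which statistic each matrix uses, so this is an illustration rather than a reproduction, and the contingency table assumes the one-to-one mapping instead of reading it from the data. The function name `cramers_v` is hypothetical.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Rows: 1인용, 2인용, 다인용; columns: 2000, 5000, 10000.
# Assumed one-to-one mapping between bicycle type and price.
type_vs_price = [
    [5656, 0, 0],
    [0, 2241, 0],
    [0, 0, 2103],
]
v_perfect = cramers_v(type_vs_price)

# Rows proportional to each other (independent variables) give V = 0.
v_independent = cramers_v([[20, 30], [40, 60]])

print(round(v_perfect, 3), round(v_independent, 3))
```

When one column fully determines the other, as here, the pair carries the same information twice, which is why both variables are flagged with HIGH CORRELATION.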

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Columns: 대여날짜 (rental date), 대여시간 (rental time), 반납시간 (return time), 자전거종류 (bicycle type), 대여가격 (rental price), 추가금액 (extra charge), 반납지연 (late return). The first column is the row index.

Index   대여날짜     대여시간   반납시간   자전거종류   대여가격   추가금액   반납지연
82787   2012-02-29   14:10      15:09      다인용       10000      <NA>       N
61208   2011-09-24   10:58      11:59      1인용        2000       <NA>       N
19786   2011-04-27   15:13      15:59      다인용       10000      <NA>       N
71744   2011-10-22   14:53      15:51      2인용        5000       <NA>       N
89708   2012-04-02   12:07      12:43      2인용        5000       <NA>       N
27725   2011-05-15   13:44      14:46      2인용        5000       <NA>       N
63049   2011-09-27   15:19      16:11      2인용        5000       <NA>       N
69424   2011-10-16   10:54      11:51      다인용       10000      <NA>       N
83094   2012-03-01   12:40      13:32      다인용       10000      <NA>       N
21215   2011-05-01   15:08      15:58      1인용        2000       <NA>       N

Index   대여날짜     대여시간   반납시간   자전거종류   대여가격   추가금액   반납지연
42969   2011-07-10   12:38      13:18      다인용       10000      <NA>       N
17677   2011-04-21   15:27      16:21      2인용        5000       <NA>       N
27073   2011-05-14   15:44      16:43      1인용        2000       <NA>       N
78116   2011-11-13   12:04      13:07      1인용        2000       <NA>       N
95654   2012-04-17   12:04      12:54      1인용        2000       <NA>       N
87160   2012-03-21   12:38      13:37      1인용        2000       <NA>       N
66024   2011-10-04   14:58      15:37      2인용        5000       <NA>       N
1341    2011-02-21   15:06      16:07      1인용        2000       <NA>       N
29517   2011-05-22   10:13      12:57      1인용        2000       <NA>       N
98104   2012-04-20   14:11      15:05      다인용       10000      <NA>       N

Duplicate rows

Most frequently occurring

Index   대여날짜     대여시간   반납시간   자전거종류   대여가격   추가금액   반납지연   # duplicates
236     2011-12-18   15:16      16:13      1인용        2000       <NA>       N          4
44      2011-04-16   9:22       10:29      1인용        2000       <NA>       N          3
154     2011-09-10   12:25      13:24      1인용        2000       <NA>       N          3
194     2011-10-10   15:21      16:03      1인용        2000       <NA>       N          3
0       2011-02-19   13:54      14:52      1인용        2000       <NA>       N          2
1       2011-02-20   15:01      15:42      1인용        2000       <NA>       N          2
2       2011-02-21   14:29      15:28      1인용        2000       <NA>       N          2
3       2011-02-21   15:06      16:07      1인용        2000       <NA>       N          2
4       2011-02-24   14:47      16:02      1인용        2000       <NA>       N          2
5       2011-02-26   10:40      11:37      1인용        2000       <NA>       N          2
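
The "# duplicates" column can be read as the number of times each distinct row occurs. The sketch below shows that kind of counting with the standard library, assuming rows are compared on all seven columns. The `rows` list is made up for illustration: it reuses values from this report but not their real multiplicities. This is not necessarily how the profiler arrives at its figure of 282.

```python
from collections import Counter

# Hypothetical rows:
# (대여날짜, 대여시간, 반납시간, 자전거종류, 대여가격, 추가금액, 반납지연)
rows = [
    ("2011-12-18", "15:16", "16:13", "1인용", 2000, None, "N"),
    ("2011-12-18", "15:16", "16:13", "1인용", 2000, None, "N"),
    ("2011-04-16", "9:22", "10:29", "1인용", 2000, None, "N"),
    ("2011-04-16", "9:22", "10:29", "1인용", 2000, None, "N"),
    ("2011-04-16", "9:22", "10:29", "1인용", 2000, None, "N"),
    ("2012-02-29", "14:10", "15:09", "다인용", 10000, None, "N"),
]

occurrences = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicate_groups = [(row, n) for row, n in occurrences.most_common() if n > 1]

# Rows that would be dropped if only the first occurrence of each were kept.
redundant_rows = sum(n - 1 for _, n in duplicate_groups)

for row, n in duplicate_groups:
    print(row[0], row[1], row[2], "occurs", n, "times")
```

Identical rental date, start time, return time, type, and price can legitimately describe separate rentals made at the same minute, so a duplicate row here is not automatically a data-entry error.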