Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 309
Duplicate rows (%): 3.1%
Total size in memory: 644.5 KiB
Average record size in memory: 66.0 B
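
The two derived figures in this block can be checked by hand from the others. The sketch below assumes that 1 KiB is 1024 bytes and that the report rounds to one decimal place; the variable names are illustrative only.

```python
# Values reported in the Dataset statistics block.
n_rows = 10_000
duplicate_rows = 309
total_size_kib = 644.5

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_rows
duplicate_pct = 100 * duplicate_rows / n_rows

print(f"Average record size: {avg_record_bytes:.1f} B")  # 66.0 B
print(f"Duplicate rows: {duplicate_pct:.1f}%")           # 3.1%
```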

Variable types

DateTime: 3
Categorical: 3
Boolean: 1

Dataset

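Description: Information on bicycle rentals and returns at Incheon Grand Park, which is operated by the Incheon Facilities Corporation. Incheon Grand Park is located at 238 Munemi-ro, Namdong-gu, Incheon.
URL: https://www.data.go.kr/data/15040808/fileData.do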

Alerts

Dataset has 309 (3.1%) duplicate rows [Duplicates]
자전거종류 (bicycle type) is highly correlated overall with 대여가격 (rental price) [High correlation]
추가금액 (additional charge) is highly correlated overall with 반납지연 (late return) [High correlation]
대여가격 (rental price) is highly correlated overall with 자전거종류 (bicycle type) [High correlation]
반납지연 (late return) is highly correlated overall with 추가금액 (additional charge) [High correlation]
추가금액 (additional charge) is highly imbalanced (84.8%) [Imbalance]
반납지연 (late return) is highly imbalanced (78.4%) [Imbalance]
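
The two imbalance percentages are consistent with a score of one minus the normalized Shannon entropy of the class shares. The sketch below is a reconstruction from the counts reported further down in this report, not a statement about how the tool computes the alert; `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Counts taken from the variable sections of this report.
late_return = imbalance_score([9657, 343])       # 반납지연: False, True
extra_charge = imbalance_score([9657, 267, 76])  # 추가금액: <NA>, 2000, 1000

print(f"{late_return:.1%}")   # 78.4%
print(f"{extra_charge:.1%}")  # 84.8%
```

A score of 0% would mean all classes are equally frequent, and a score near 100% means one class dominates.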

Reproduction

Analysis started: 2023-12-12 09:08:59.033533
Analysis finished: 2023-12-12 09:08:59.673668
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대여날짜 (rental date)
DateTime

Distinct: 320
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2011-01-27 00:00:00
Maximum: 2012-04-16 00:00:00
Histogram with fixed size bins (bins=50)

대여시간 (rental time)
DateTime

Distinct: 540
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2023-12-12 09:01:00
Maximum: 2023-12-12 18:01:00
Histogram with fixed size bins (bins=50)

반납시간 (return time)
DateTime

Distinct: 556
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2023-12-12 09:14:00
Maximum: 2023-12-12 19:10:00
Histogram with fixed size bins (bins=50)
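
The minimum and maximum of the two time-of-day variables above carry the date 2023-12-12, the day the analysis ran. That suggests (an assumption, the report does not say so) that time-only values were parsed without a date of their own. A minimal sketch of pairing each time with its rental date instead, using values from sample row 40921 and assuming the bicycle is returned on the day it was rented; `combine` is a hypothetical helper.

```python
from datetime import datetime


def combine(date_text, time_text):
    """Build a full timestamp from a date string and a time-of-day string."""
    return datetime.strptime(f"{date_text} {time_text}", "%Y-%m-%d %H:%M")


# Values from sample row 40921: 대여날짜, 대여시간, 반납시간.
rented_at = combine("2011-06-19", "16:30")
# Assumption: the return happens on the rental date.
returned_at = combine("2011-06-19", "17:40")

duration = returned_at - rented_at
print(duration)  # 1:10:00
```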

자전거종류 (bicycle type)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

1인용 (single-rider): 5845
2인용 (two-rider): 2118
다인용 (multi-rider): 2037

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1인용
2nd row: 1인용
3rd row: 1인용
4th row: 1인용
5th row: 1인용

Common Values

Value   Count  Frequency (%)
1인용   5845   58.5%
2인용   2118   21.2%
다인용  2037   20.4%

Length

Histogram of lengths of the category

대여가격 (rental price)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

2000: 5845
5000: 2118
10000: 2037

Length

Max length: 5
Median length: 4
Mean length: 4.2037
Min length: 4
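
The length statistics follow directly from the value counts, because the variable is treated as text: 2000 and 5000 have four characters and 10000 has five. A small sketch that rebuilds the figures from the counts reported for this variable:

```python
from statistics import median

# Reported counts per value of 대여가격, keyed by the value as text.
counts = {"2000": 5845, "5000": 2118, "10000": 2037}

# One length entry per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
print(mean_length)      # 4.2037
print(median(lengths))  # 4.0
print(min(lengths), max(lengths))  # 4 5
```

The mean exceeds 4 by exactly the share of five-character values, 2037 / 10000.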

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2000
2nd row: 2000
3rd row: 2000
4th row: 2000
5th row: 2000

Common Values

Value  Count  Frequency (%)
2000   5845   58.5%
5000   2118   21.2%
10000  2037   20.4%

Length

Histogram of lengths of the category

추가금액 (additional charge)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

<NA>: 9657
2000: 267
1000: 76

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>   9657   96.6%
2000   267    2.7%
1000   76     0.8%
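
The report lists 0 missing values for this variable while counting <NA> as a category with a length of 4, which suggests that <NA> is stored as literal text rather than as a true missing value. That reading is an assumption. Under it, a minimal sketch of converting the placeholder to a real missing value and the charges to integers could look like this; `PLACEHOLDERS` is a hypothetical name.

```python
from collections import Counter

# Rebuild the column from the reported counts (values assumed to be strings).
values = ["<NA>"] * 9657 + ["2000"] * 267 + ["1000"] * 76

# Assumption: the literal text "<NA>" means no additional charge was recorded.
PLACEHOLDERS = {"<NA>"}
cleaned = [None if v in PLACEHOLDERS else int(v) for v in values]

missing = sum(v is None for v in cleaned)
charges = Counter(v for v in cleaned if v is not None)

print(missing, f"{missing / len(cleaned):.1%}")  # 9657 96.6%
print(charges.most_common())                     # [(2000, 267), (1000, 76)]
```

After such a conversion the variable would be reported with 9657 missing values and two distinct numeric charges.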

Length

Histogram of lengths of the category

반납지연 (late return)
Boolean

HIGH CORRELATION  IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB

Value  Count  Frequency (%)
False  9657   96.6%
True   343    3.4%

Correlations

            자전거종류  대여가격  추가금액  반납지연
자전거종류  1.000       1.000     0.000     0.044
대여가격    1.000       1.000     0.000     0.044
추가금액    0.000       0.000     1.000     NaN
반납지연    0.044       0.044     NaN       1.000

            자전거종류  추가금액  대여가격  반납지연
자전거종류  1.000       0.000     1.000     0.073
추가금액    0.000       1.000     0.000     1.000
대여가격    1.000       0.000     1.000     0.073
반납지연    0.073       1.000     0.073     1.000

            자전거종류  대여가격  추가금액  반납지연
자전거종류  1.000       1.000     0.000     0.073
대여가격    1.000       1.000     0.000     0.073
추가금액    0.000       0.000     1.000     1.000
반납지연    0.073       0.073     1.000     1.000
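
The report does not name the coefficient behind each matrix. As an illustration of why 자전거종류 and 대여가격 reach 1.000, the sketch below computes an uncorrected Cramér's V, chosen here as a common association measure for two categorical variables. It assumes each bicycle type maps to exactly one price, which the matching counts (5845, 2118, 2037) suggest but the report does not show as a cross-tabulation. `cramers_v` is a hypothetical helper.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals))
    return math.sqrt(chi2 / (n * (k - 1)))


# Assumed cross-tabulation: rows are 자전거종류 (1인용, 2인용, 다인용),
# columns are 대여가격 (2000, 5000, 10000), with a one-to-one mapping.
table = [
    [5845, 0, 0],
    [0, 2118, 0],
    [0, 0, 2037],
]
v = cramers_v(table)
print(round(v, 3))  # 1.0
```

When one variable fully determines the other, every observation sits on the diagonal and the measure reaches its maximum.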

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

index  대여날짜    대여시간  반납시간  자전거종류  대여가격  추가금액  반납지연
40921  2011-06-19  16:30     17:40     1인용       2000      <NA>      N
69798  2011-10-16  14:40     15:38     1인용       2000      <NA>      N
12268  2011-04-10  13:07     14:11     1인용       2000      <NA>      N
25771  2011-05-12  17:00     17:37     1인용       2000      <NA>      N
30280  2011-05-22  16:16     17:06     1인용       2000      <NA>      N
84974  2012-03-10  13:52     14:52     1인용       2000      <NA>      N
9187   2011-04-02  15:04     16:06     다인용      10000     <NA>      N
77287  2011-11-11  12:14     13:00     다인용      10000     <NA>      N
32649  2011-05-29  11:17     13:29     1인용       2000      <NA>      N
89825  2012-04-04  13:03     14:36     1인용       2000      <NA>      N

index  대여날짜    대여시간  반납시간  자전거종류  대여가격  추가금액  반납지연
33109  2011-05-29  15:20     16:40     1인용       2000      <NA>      N
38750  2011-06-13  12:24     13:06     1인용       2000      <NA>      N
27013  2011-05-14  15:22     15:27     1인용       2000      2000      Y
58924  2011-09-16  15:43     16:46     1인용       2000      <NA>      N
92225  2012-04-10  13:27     14:18     1인용       2000      <NA>      N
46794  2011-08-05  14:40     15:37     다인용      10000     <NA>      N
87304  2012-03-24  9:49      10:46     2인용       5000      <NA>      N
71767  2011-10-22  15:05     16:08     1인용       2000      <NA>      N
8980   2011-04-02  11:20     12:16     1인용       2000      <NA>      N
34539  2011-06-05  11:17     11:58     1인용       2000      <NA>      N

Duplicate rows

Most frequently occurring

index  대여날짜    대여시간  반납시간  자전거종류  대여가격  추가금액  반납지연  # duplicates
237    2011-10-21  10:25     11:29     1인용       2000      <NA>      N         4
44     2011-04-13  16:22     17:12     1인용       2000      <NA>      N         3
46     2011-04-14  15:54     16:56     1인용       2000      <NA>      N         3
48     2011-04-15  11:29     12:31     1인용       2000      <NA>      N         3
85     2011-05-14  15:22     15:27     1인용       2000      2000      Y         3
103    2011-05-25  9:23      10:10     1인용       2000      <NA>      N         3
202    2011-09-25  16:08     17:01     1인용       2000      <NA>      N         3
203    2011-09-25  16:20     17:19     1인용       2000      <NA>      N         3
236    2011-10-21  10:13     10:58     1인용       2000      <NA>      N         3
250    2011-10-29  14:23     15:15     1인용       2000      <NA>      N         3
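
A ranking like the one above can be produced by counting identical rows. The sketch below uses only the standard library and a handful of rows taken from this report. It assumes that "# duplicates" is the total number of times a row occurs, and the third row is included once purely for contrast.

```python
from collections import Counter

row_a = ("2011-10-21", "10:25", "11:29", "1인용", 2000, "<NA>", "N")
row_b = ("2011-05-14", "15:22", "15:27", "1인용", 2000, "2000", "Y")
row_c = ("2012-03-24", "9:49", "10:46", "2인용", 5000, "<NA>", "N")

# Assumption: multiplicities mirror the "# duplicates" column (4 and 3).
rows = [row_a] * 4 + [row_b] * 3 + [row_c]

counts = Counter(rows)
# Keep only rows that occur more than once, most frequent first.
repeated = [(row, n) for row, n in counts.most_common() if n > 1]

for row, n in repeated:
    print(n, row)
```

Rows that occur once, such as the third one here, drop out of the listing.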