Overview

Dataset statistics

Number of variables: 6
Number of observations: 214
Missing cells: 916
Missing cells (%): 71.3%
Duplicate rows: 1
Duplicate rows (%): 0.5%
Total size in memory: 11.0 KiB
Average record size in memory: 52.6 B
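
The missing-cell percentage can be re-derived from the other figures in this report. The short check below is only a sketch: the per-column counts are copied from the Alerts section, and the variable names are illustrative.

```python
# Figures copied from the dataset statistics.
n_variables = 6
n_observations = 214
missing_cells = 916

total_cells = n_variables * n_observations        # 6 * 214 = 1284
missing_pct = 100 * missing_cells / total_cells   # share of empty cells, in percent

# Per-column missing counts from the Alerts section: two columns with
# 30 missing values each and four columns empty in all 214 rows.
per_column_missing = [30, 30, 214, 214, 214, 214]

print(total_cells, round(missing_pct, 1), sum(per_column_missing))
# prints: 1284 71.3 916
```

Most of the missing cells therefore come from the four fully empty `Unnamed` columns (856 of 916).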

Variable types

DateTime: 1
Text: 1
Unsupported: 4

Dataset

Description: File download
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-14994/F/1/datasetView.do

Alerts

Dataset has 1 (0.5%) duplicate rows [Duplicates]
대여일시 (rental date) has 30 (14.0%) missing values [Missing]
대여건수 (rental count) has 30 (14.0%) missing values [Missing]
Unnamed: 2 has 214 (100.0%) missing values [Missing]
Unnamed: 3 has 214 (100.0%) missing values [Missing]
Unnamed: 4 has 214 (100.0%) missing values [Missing]
Unnamed: 5 has 214 (100.0%) missing values [Missing]
Unnamed: 2 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 3 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
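
The alerts point to two cleaning steps: dropping the columns that are empty in every row, and dropping the rows in which every field is empty. The following is a minimal sketch in plain Python, assuming the data has already been loaded as a list of dictionaries with `None` for missing cells. The helper names `drop_empty_columns` and `drop_empty_rows` are hypothetical and not part of any library.

```python
def drop_empty_columns(rows):
    """Return copies of the rows without columns that are None in every row."""
    if not rows:
        return []
    keep = [col for col in rows[0] if any(r[col] is not None for r in rows)]
    return [{col: r[col] for col in keep} for r in rows]


def drop_empty_rows(rows):
    """Return only the rows that have at least one non-missing value."""
    return [r for r in rows if any(v is not None for v in r.values())]


# Tiny illustrative sample shaped like the report's data (not the full file).
rows = [
    {"대여일시": "2021-07-01", "대여건수": "138,855", "Unnamed: 2": None},
    {"대여일시": "2021-07-02", "대여건수": "137,755", "Unnamed: 2": None},
    {"대여일시": None, "대여건수": None, "Unnamed: 2": None},
]

cleaned = drop_empty_rows(drop_empty_columns(rows))
print(len(cleaned), list(cleaned[0]))
```

Columns are dropped before rows so that a row counts as empty only with respect to the columns that actually carry data.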

Reproduction

Analysis started: 2023-12-11 06:51:45.923766
Analysis finished: 2023-12-11 06:51:46.230595
Duration: 0.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대여일시 (rental date)
Date

MISSING 

Distinct: 184
Distinct (%): 100.0%
Missing: 30
Missing (%): 14.0%
Memory size: 1.8 KiB
Minimum: 2021-07-01 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=50)
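
The minimum, maximum, and distinct count for this variable can be cross-checked against each other. The sketch below is an interpretation rather than something the report states: it only shows that the number of calendar days in the range equals the number of non-missing values, which is consistent with one record per day.

```python
from datetime import date

first_day = date(2021, 7, 1)    # Minimum reported for 대여일시
last_day = date(2021, 12, 31)   # Maximum reported for 대여일시

# Inclusive number of calendar days between the two dates.
days_in_range = (last_day - first_day).days + 1

# Observations minus missing values, from the figures in this report.
non_missing = 214 - 30

print(days_in_range, non_missing)
# prints: 184 184
```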

대여건수 (rental count)
Text

MISSING 

Distinct: 184
Distinct (%): 100.0%
Missing: 30
Missing (%): 14.0%
Memory size: 1.8 KiB

Length

Max length: 9
Median length: 9
Mean length: 8.5652174
Min length: 8

Characters and Unicode

Total characters: 1576
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
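
As an illustration of these character properties, Python's standard `unicodedata` module reports the general category of each code point. The sketch below assumes a value padded with one space on each side, such as `" 138,855 "`. That padding is an assumption inferred from the character counts (368 spaces across 184 values), not something the report shows directly.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category (e.g. Nd, Zs, Po)."""
    return Counter(unicodedata.category(ch) for ch in text)


# Assumed raw value: six digits, one comma, one space on each side.
counts = category_counts(" 138,855 ")
print(dict(counts))
```

Here `Nd` is Decimal Number, `Zs` is Space Separator, and `Po` is Other Punctuation, the same three categories the report lists for this variable.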

Unique

Unique: 184
Unique (%): 100.0%

Sample

1st row 138,855
2nd row 137,755
3rd row 51,624
4th row 34,280
5th row 137,012
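
The variable is typed as Text because each count is stored as a string with a thousands separator. A minimal sketch of converting such strings to integers follows; `parse_count` is a hypothetical helper, and stripping surrounding whitespace is an assumption based on the space characters reported for this column.

```python
def parse_count(text):
    """Convert a string such as ' 138,855 ' to an int; return None if missing."""
    if text is None:
        return None
    cleaned = text.strip().replace(",", "")
    return int(cleaned) if cleaned else None


# The five sample values shown in the report.
samples = ["138,855", "137,755", "51,624", "34,280", "137,012"]
parsed = [parse_count(s) for s in samples]
print(parsed)
# prints: [138855, 137755, 51624, 34280, 137012]
```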
Value               Count  Frequency (%)
83,832              1      0.5%
123,993             1      0.5%
107,707             1      0.5%
123,048             1      0.5%
126,416             1      0.5%
129,282             1      0.5%
111,853             1      0.5%
96,432              1      0.5%
119,911             1      0.5%
128,186             1      0.5%
Other values (174)  174    94.6%

Most occurring characters

Value             Count  Frequency (%)
(space)           368    23.4%
1                 202    12.8%
,                 184    11.7%
8                 109    6.9%
3                 99     6.3%
2                 97     6.2%
5                 93     5.9%
0                 87     5.5%
4                 85     5.4%
6                 85     5.4%
Other values (2)  167    10.6%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     1024   65.0%
Space Separator    368    23.4%
Other Punctuation  184    11.7%

Most frequent character per category

Decimal Number
Value    Count  Frequency (%)
1        202    19.7%
8        109    10.6%
3        99     9.7%
2        97     9.5%
5        93     9.1%
0        87     8.5%
4        85     8.3%
6        85     8.3%
9        84     8.2%
7        83     8.1%

Space Separator
Value    Count  Frequency (%)
(space)  368    100.0%

Other Punctuation
Value    Count  Frequency (%)
,        184    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  1576   100.0%

Most frequent character per script

Common
Value             Count  Frequency (%)
(space)           368    23.4%
1                 202    12.8%
,                 184    11.7%
8                 109    6.9%
3                 99     6.3%
2                 97     6.2%
5                 93     5.9%
0                 87     5.5%
4                 85     5.4%
6                 85     5.4%
Other values (2)  167    10.6%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  1576   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           368    23.4%
1                 202    12.8%
,                 184    11.7%
8                 109    6.9%
3                 99     6.3%
2                 97     6.2%
5                 93     5.9%
0                 87     5.5%
4                 85     5.4%
6                 85     5.4%
Other values (2)  167    10.6%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 214
Missing (%): 100.0%
Memory size: 2.0 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 214
Missing (%): 100.0%
Memory size: 2.0 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 214
Missing (%): 100.0%
Memory size: 2.0 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 214
Missing (%): 100.0%
Memory size: 2.0 KiB

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
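
Nullity correlation can be sketched as the Pearson correlation between per-column missing-value indicators (1 where a cell is missing, 0 otherwise). The code below is an illustrative plain-Python version under that assumption, not the implementation the profiling tool uses. `nullity_correlation` is a hypothetical helper, and the four-row input is made up for the example.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation of the missing-value indicators of two columns.

    Returns None when either column has no variation in missingness
    (all present or all missing), where the correlation is undefined.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)


# Two columns that are missing in exactly the same row, plus an empty column.
dates = ["2021-07-01", "2021-07-02", "2021-07-03", None]
counts = ["138,855", "137,755", "51,624", None]
empty = [None, None, None, None]

print(nullity_correlation(dates, counts))  # perfectly correlated missingness
print(nullity_correlation(dates, empty))   # undefined, reported as None here
```

In this dataset the two populated columns have the same number of missing values (30 each), and the last rows of the sample are empty in both, so a strong positive nullity correlation between them would be expected.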

Sample

     대여일시    대여건수  Unnamed: 2  Unnamed: 3  Unnamed: 4  Unnamed: 5
0    2021-07-01  138,855   <NA>        <NA>        <NA>        <NA>
1    2021-07-02  137,755   <NA>        <NA>        <NA>        <NA>
2    2021-07-03  51,624    <NA>        <NA>        <NA>        <NA>
3    2021-07-04  34,280    <NA>        <NA>        <NA>        <NA>
4    2021-07-05  137,012   <NA>        <NA>        <NA>        <NA>
5    2021-07-06  141,676   <NA>        <NA>        <NA>        <NA>
6    2021-07-07  120,685   <NA>        <NA>        <NA>        <NA>
7    2021-07-08  120,509   <NA>        <NA>        <NA>        <NA>
8    2021-07-09  133,467   <NA>        <NA>        <NA>        <NA>
9    2021-07-10  108,159   <NA>        <NA>        <NA>        <NA>

     대여일시    대여건수  Unnamed: 2  Unnamed: 3  Unnamed: 4  Unnamed: 5
204  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
205  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
206  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
207  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
208  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
209  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
210  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
211  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
212  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>
213  <NA>        <NA>      <NA>        <NA>        <NA>        <NA>

Duplicate rows

Most frequently occurring

   대여일시  대여건수  # duplicates
0  <NA>      <NA>      30
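
The table lists a single distinct row (all values missing) that occurs 30 times. One plausible reading, offered here as an assumption about how the counts relate, is that the "1 duplicate row" in the overview refers to this one distinct row rather than to its 30 occurrences. A minimal sketch of counting repeated rows this way, on made-up sample data:

```python
from collections import Counter

# Illustrative rows as (대여일시, 대여건수) tuples; None marks a missing value.
rows = [
    ("2021-07-01", "138,855"),
    ("2021-07-02", "137,755"),
    (None, None),
    (None, None),
    (None, None),
]

# Count how often each distinct row occurs and keep those seen more than once.
occurrences = Counter(rows)
duplicated = {row: n for row, n in occurrences.items() if n > 1}

print(len(duplicated), duplicated)
# prints: 1 {(None, None): 3}
```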