Overview

Dataset statistics

Number of variables4
Number of observations178
Missing cells0
Missing cells (%)0.0%
Duplicate rows4
Duplicate rows (%)2.2%
Total size in memory5.7 KiB
Average record size in memory32.7 B

Variable types

Text1
Categorical2
DateTime1

Dataset

Description전라남도 여수시 공영자전거 운영 고장처리부품현황(자전거 아이디, 처리 구분, 처리 내용, 등록일자)정보 등을 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15049729/fileData.do

Alerts

Dataset has 4 (2.2%) duplicate rowsDuplicates
구분 is highly imbalanced (50.4%)Imbalance

Reproduction

Analysis started2023-12-12 21:23:45.437189
Analysis finished2023-12-12 21:23:45.751958
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct112
Distinct (%)62.9%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T06:23:46.275310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters1602
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)39.3%

Sample

1st rowYS_000241
2nd rowYS_000141
3rd rowYS_000056
4th rowYS_000126
5th rowYS_000020
ValueCountFrequency (%)
ys_000273 7
 
3.9%
ys_000006 5
 
2.8%
ys_000104 4
 
2.2%
ys_000256 4
 
2.2%
ys_000020 4
 
2.2%
ys_000241 3
 
1.7%
ys_000099 3
 
1.7%
ys_000175 3
 
1.7%
ys_000204 3
 
1.7%
ys_000234 3
 
1.7%
Other values (102) 139
78.1%
2023-12-13T06:23:46.716179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 631
39.4%
Y 178
 
11.1%
S 178
 
11.1%
_ 178
 
11.1%
2 102
 
6.4%
1 74
 
4.6%
6 46
 
2.9%
5 44
 
2.7%
3 40
 
2.5%
4 37
 
2.3%
Other values (3) 94
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1068
66.7%
Uppercase Letter 356
 
22.2%
Connector Punctuation 178
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 631
59.1%
2 102
 
9.6%
1 74
 
6.9%
6 46
 
4.3%
5 44
 
4.1%
3 40
 
3.7%
4 37
 
3.5%
9 36
 
3.4%
7 31
 
2.9%
8 27
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
Y 178
50.0%
S 178
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 178
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1246
77.8%
Latin 356
 
22.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 631
50.6%
_ 178
 
14.3%
2 102
 
8.2%
1 74
 
5.9%
6 46
 
3.7%
5 44
 
3.5%
3 40
 
3.2%
4 37
 
3.0%
9 36
 
2.9%
7 31
 
2.5%
Latin
ValueCountFrequency (%)
Y 178
50.0%
S 178
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1602
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 631
39.4%
Y 178
 
11.1%
S 178
 
11.1%
_ 178
 
11.1%
2 102
 
6.4%
1 74
 
4.6%
6 46
 
2.9%
5 44
 
2.7%
3 40
 
2.5%
4 37
 
2.3%
Other values (3) 94
 
5.9%

구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
교체
120 
수리
56 
교체
 
1
 
1

Length

Max length3
Median length2
Mean length2
Min length1

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row수리
2nd row수리
3rd row수리
4th row교체
5th row교체

Common Values

ValueCountFrequency (%)
교체 120
67.4%
수리 56
31.5%
교체 1
 
0.6%
1
 
0.6%

Length

2023-12-13T06:23:46.899570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:23:47.039840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교체 121
68.0%
수리 56
31.5%
1
 
0.6%

처리 내용
Categorical

Distinct25
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
단말기
23 
브레이크
18 
라이트 교체
18 
타이어 펑크
17 
체인 수리
15 
Other values (20)
87 

Length

Max length10
Median length9
Mean length5.3146067
Min length2

Unique

Unique7 ?
Unique (%)3.9%

Sample

1st row체인 수리
2nd row체인 수리
3rd row체인 수리
4th row라이트 교체
5th row라이트 교체

Common Values

ValueCountFrequency (%)
단말기 23
12.9%
브레이크 18
10.1%
라이트 교체 18
10.1%
타이어 펑크 17
9.6%
체인 수리 15
8.4%
배터리 충전 15
8.4%
크랭크, 페달 14
7.9%
물받이 교체 13
7.3%
바퀴(림) 11
 
6.2%
바구니, 벨 5
 
2.8%
Other values (15) 29
16.3%

Length

2023-12-13T06:23:47.154736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교체 33
 
11.0%
단말기 23
 
7.7%
브레이크 22
 
7.4%
라이트 20
 
6.7%
수리 18
 
6.0%
타이어 17
 
5.7%
펑크 17
 
5.7%
체인 15
 
5.0%
배터리 15
 
5.0%
충전 15
 
5.0%
Other values (22) 104
34.8%
Distinct20
Distinct (%)11.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
Minimum2023-01-02 00:00:00
Maximum2023-04-14 00:00:00
2023-12-13T06:23:47.273818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:23:47.372948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)

Correlations

2023-12-13T06:23:47.444497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분처리 내용등록일자
구분1.0000.6640.000
처리 내용0.6641.0000.664
등록일자0.0000.6641.000
2023-12-13T06:23:47.536633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분처리 내용
구분1.0000.390
처리 내용0.3901.000
2023-12-13T06:23:47.612760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분처리 내용
구분1.0000.390
처리 내용0.3901.000

Missing values

2023-12-13T06:23:45.624795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:23:45.711794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자전거 아이디구분처리 내용등록일자
0YS_000241수리체인 수리2023-01-02
1YS_000141수리체인 수리2023-01-02
2YS_000056수리체인 수리2023-01-02
3YS_000126교체라이트 교체2023-01-02
4YS_000020교체라이트 교체2023-01-03
5YS_000020수리체인 수리2023-01-03
6YS_000114수리체인 수리2023-01-03
7YS_000131수리체인 수리2023-01-03
8YS_000162수리체인 수리2023-01-03
9YS_000211수리앞물받이 수리2023-01-03
자전거 아이디구분처리 내용등록일자
168YS_000175교체기어박스 보호대2023-04-03
169YS_000175교체물받이 교체2023-04-03
170YS_000030교체바구니, 벨2023-04-03
171YS_000129수리패널 충전기 선2023-04-14
172YS_000129교체바퀴(림)2023-04-14
173YS_000224교체타이어 펑크2023-04-14
174YS_000224교체타이어 펑크2023-04-14
175YS_000187교체바퀴(림)2023-04-14
176YS_000137교체안장2023-04-14
177YS_000183교체물받이 교체2023-04-14

Duplicate rows

Most frequently occurring

자전거 아이디구분처리 내용등록일자# duplicates
0YS_000099교체브레이크2023-03-162
1YS_000224교체타이어 펑크2023-04-142
2YS_000234수리체인 수리2023-03-292
3YS_000273교체태양광 패널2023-03-292