Overview

Dataset statistics

Number of variables3
Number of observations8335
Missing cells0
Missing cells (%)0.0%
Duplicate rows333
Duplicate rows (%)4.0%
Total size in memory195.5 KiB
Average record size in memory24.0 B

Variable types

Categorical1
Text1
DateTime1

Dataset

Description대구광역시에 등록된 건설 기계 사업자(건설기계대여업체)가 보유한 건설 기계에 대한 데이터로 건설기계명, 형식, 최초등록일 제공합니다.
URLhttps://www.data.go.kr/data/15113738/fileData.do

Alerts

Dataset has 333 (4.0%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 04:52:09.975545
Analysis finished2023-12-12 04:52:10.323228
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

건설기계명
Categorical

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size65.2 KiB
굴착기
3294 
지게차
1384 
덤프트럭
1040 
콘크리트 믹서트럭
987 
기중기
 
268
Other values (15)
1362 

Length

Max length9
Median length3
Mean length4.0323935
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row불도저
2nd row불도저
3rd row불도저
4th row불도저
5th row불도저

Common Values

ValueCountFrequency (%)
굴착기 3294
39.5%
지게차 1384
16.6%
덤프트럭 1040
 
12.5%
콘크리트 믹서트럭 987
 
11.8%
기중기 268
 
3.2%
롤러 263
 
3.2%
콘크리트 펌프 243
 
2.9%
로더 241
 
2.9%
타워크레인 204
 
2.4%
불도저 105
 
1.3%
Other values (10) 306
 
3.7%

Length

2023-12-12T13:52:10.452050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
굴착기 3294
33.8%
지게차 1384
14.2%
콘크리트 1230
 
12.6%
덤프트럭 1040
 
10.7%
믹서트럭 987
 
10.1%
기중기 268
 
2.8%
롤러 263
 
2.7%
펌프 243
 
2.5%
로더 241
 
2.5%
타워크레인 204
 
2.1%
Other values (15) 586
 
6.0%

형식
Text

Distinct1222
Distinct (%)14.7%
Missing0
Missing (%)0.0%
Memory size65.2 KiB
2023-12-12T13:52:10.802029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length21
Mean length7.9960408
Min length2

Characters and Unicode

Total characters66647
Distinct characters49
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique557 ?
Unique (%)6.7%

Sample

1st rowD6HLGP
2nd rowD6HIILGP
3rd rowD9N
4th rowD6HIILGP
5th rowD3KLGP
ValueCountFrequency (%)
dx140w-5 192
 
2.3%
hd060-ymx0-esha 139
 
1.7%
hd060-mix-mhs 139
 
1.7%
dx55mt-5 137
 
1.6%
hd060p-8mix-mhhb 135
 
1.6%
ec55c 122
 
1.5%
m84sdc504i 120
 
1.4%
dx55w-5 110
 
1.3%
hd060-ymix-mhr 105
 
1.3%
vio17 95
 
1.1%
Other values (1217) 7068
84.5%
2023-12-12T13:52:11.344426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6597
 
9.9%
- 5411
 
8.1%
5 5251
 
7.9%
D 4879
 
7.3%
H 3140
 
4.7%
X 2934
 
4.4%
4 2822
 
4.2%
S 2665
 
4.0%
E 2465
 
3.7%
M 2461
 
3.7%
Other values (39) 28022
42.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 34852
52.3%
Decimal Number 26034
39.1%
Dash Punctuation 5411
 
8.1%
Other Punctuation 136
 
0.2%
Space Separator 72
 
0.1%
Close Punctuation 65
 
0.1%
Open Punctuation 65
 
0.1%
Other Letter 6
 
< 0.1%
Letter Number 5
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
D 4879
14.0%
H 3140
 
9.0%
X 2934
 
8.4%
S 2665
 
7.6%
E 2465
 
7.1%
M 2461
 
7.1%
C 2325
 
6.7%
R 1508
 
4.3%
L 1435
 
4.1%
A 1217
 
3.5%
Other values (16) 9823
28.2%
Decimal Number
ValueCountFrequency (%)
0 6597
25.3%
5 5251
20.2%
4 2822
10.8%
1 2424
 
9.3%
3 2100
 
8.1%
2 1814
 
7.0%
6 1809
 
6.9%
7 1513
 
5.8%
8 1249
 
4.8%
9 455
 
1.7%
Other Letter
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%
Letter Number
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 110
80.9%
/ 26
 
19.1%
Dash Punctuation
ValueCountFrequency (%)
- 5411
100.0%
Space Separator
ValueCountFrequency (%)
72
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 34857
52.3%
Common 31784
47.7%
Hangul 6
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
D 4879
14.0%
H 3140
 
9.0%
X 2934
 
8.4%
S 2665
 
7.6%
E 2465
 
7.1%
M 2461
 
7.1%
C 2325
 
6.7%
R 1508
 
4.3%
L 1435
 
4.1%
A 1217
 
3.5%
Other values (19) 9828
28.2%
Common
ValueCountFrequency (%)
0 6597
20.8%
- 5411
17.0%
5 5251
16.5%
4 2822
8.9%
1 2424
 
7.6%
3 2100
 
6.6%
2 1814
 
5.7%
6 1809
 
5.7%
7 1513
 
4.8%
8 1249
 
3.9%
Other values (7) 794
 
2.5%
Hangul
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 66636
> 99.9%
Hangul 6
 
< 0.1%
Number Forms 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6597
 
9.9%
- 5411
 
8.1%
5 5251
 
7.9%
D 4879
 
7.3%
H 3140
 
4.7%
X 2934
 
4.4%
4 2822
 
4.2%
S 2665
 
4.0%
E 2465
 
3.7%
M 2461
 
3.7%
Other values (33) 28011
42.0%
Hangul
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%
Number Forms
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Distinct4337
Distinct (%)52.0%
Missing0
Missing (%)0.0%
Memory size65.2 KiB
Minimum1981-11-13 00:00:00
Maximum2023-05-10 00:00:00
2023-12-12T13:52:11.586689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:52:11.771650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-12T13:52:10.184603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:52:10.270327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

건설기계명형식최초등록일
0불도저D6HLGP1993-12-29
1불도저D6HIILGP1995-06-03
2불도저D9N1996-08-23
3불도저D6HIILGP1995-11-28
4불도저D3KLGP2009-08-31
5불도저D6H2LGP1997-01-08
6불도저D6H21995-06-12
7불도저D9N1991-04-08
8불도저D9N1989-07-31
9불도저D11R2010-02-16
건설기계명형식최초등록일
8325노면파쇄기W200I2018-11-09
8326노면파쇄기W19002005-08-05
8327노면파쇄기W19002014-05-07
8328노면파쇄기W19002015-01-15
8329노면파쇄기W19002017-09-07
8330노면파쇄기W200I2016-10-18
8331노면파쇄기W210FI2021-09-07
8332노면파쇄기W200I2021-08-13
8333노면파쇄기W200HI2022-01-25
8334노면파쇄기W210XP2023-04-12

Duplicate rows

Most frequently occurring

건설기계명형식최초등록일# duplicates
241콘크리트 믹서트럭HD060-MIX-MHS2003-07-077
121덤프트럭FM84FR3HB2017-03-206
239콘크리트 믹서트럭HD060-MIX-MHS2003-05-206
315콘크리트 펌프DCP32X-5RZ2016-08-016
0공기압축기CPS1070-252015-09-235
95굴착기ROBEX552009-11-305
155덤프트럭M84SDC504I2012-07-305
189지게차D30S-72015-04-295
232콘크리트 믹서트럭HD060-MIX-MHS2002-06-175
252콘크리트 믹서트럭HD060-YMIX-MHR2006-03-175