Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Numeric2
Categorical1
Text2

Dataset

Description본 데이터는 충남 건설기계 등록 현황에 대한 데이터로 최초등록일 건설기계명 형식 사용본거지 등의 항목(최초등록일 건설기계명 형식 사용본거지)을 제공합니다
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=43&beforeMenuCd=DOM_000000201001001000&publicdatapk=15114207

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:39:31.122916
Analysis finished2024-01-09 22:39:31.926350
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9494.7542
Minimum6
Maximum18871
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:39:31.987528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile1005.9
Q14800.5
median9521
Q314212.25
95-th percentile17898.05
Maximum18871
Range18865
Interquartile range (IQR)9411.75

Descriptive statistics

Standard deviation5423.4871
Coefficient of variation (CV)0.57120879
Kurtosis-1.2006758
Mean9494.7542
Median Absolute Deviation (MAD)4705.5
Skewness-0.015456732
Sum94947542
Variance29414212
MonotonicityNot monotonic
2024-01-10T07:39:32.123502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14702 1
 
< 0.1%
16243 1
 
< 0.1%
3139 1
 
< 0.1%
16083 1
 
< 0.1%
18355 1
 
< 0.1%
6720 1
 
< 0.1%
8452 1
 
< 0.1%
4875 1
 
< 0.1%
7429 1
 
< 0.1%
9995 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
11 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
16 1
< 0.1%
26 1
< 0.1%
28 1
< 0.1%
ValueCountFrequency (%)
18871 1
< 0.1%
18869 1
< 0.1%
18868 1
< 0.1%
18863 1
< 0.1%
18860 1
< 0.1%
18859 1
< 0.1%
18858 1
< 0.1%
18855 1
< 0.1%
18847 1
< 0.1%
18845 1
< 0.1%

최초등록일
Real number (ℝ)

Distinct4297
Distinct (%)43.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20145324
Minimum19751230
Maximum20230601
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:39:32.249907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19751230
5-th percentile19970630
Q120111009
median20161106
Q320200730
95-th percentile20221024
Maximum20230601
Range479371
Interquartile range (IQR)89721.5

Descriptive statistics

Standard deviation74350.478
Coefficient of variation (CV)0.0036907066
Kurtosis1.7697344
Mean20145324
Median Absolute Deviation (MAD)40440
Skewness-1.3826252
Sum2.0145324 × 1011
Variance5.5279936 × 109
MonotonicityNot monotonic
2024-01-10T07:39:32.361903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20220127 18
 
0.2%
20211028 18
 
0.2%
20211027 18
 
0.2%
20191230 17
 
0.2%
20181207 16
 
0.2%
20220311 16
 
0.2%
20230330 14
 
0.1%
20200929 14
 
0.1%
20210702 13
 
0.1%
20200624 13
 
0.1%
Other values (4287) 9843
98.4%
ValueCountFrequency (%)
19751230 1
< 0.1%
19760728 1
< 0.1%
19780804 1
< 0.1%
19790314 1
< 0.1%
19800527 1
< 0.1%
19800917 1
< 0.1%
19811207 1
< 0.1%
19820107 1
< 0.1%
19820614 2
< 0.1%
19820708 1
< 0.1%
ValueCountFrequency (%)
20230601 4
< 0.1%
20230531 5
0.1%
20230530 5
0.1%
20230526 3
< 0.1%
20230525 2
 
< 0.1%
20230524 3
< 0.1%
20230523 5
0.1%
20230522 4
< 0.1%
20230519 2
 
< 0.1%
20230518 1
 
< 0.1%

건설기계명
Categorical

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
굴착기
4407 
덤프트럭
1863 
지게차
1790 
콘크리트 믹서트럭
701 
기중기
 
350
Other values (15)
889 

Length

Max length9
Median length3
Mean length3.6774
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row굴착기
2nd row덤프트럭
3rd row굴착기
4th row지게차
5th row굴착기

Common Values

ValueCountFrequency (%)
굴착기 4407
44.1%
덤프트럭 1863
18.6%
지게차 1790
17.9%
콘크리트 믹서트럭 701
 
7.0%
기중기 350
 
3.5%
로더 217
 
2.2%
롤러 196
 
2.0%
콘크리트 펌프 168
 
1.7%
불도저 114
 
1.1%
천공기 54
 
0.5%
Other values (10) 140
 
1.4%

Length

2024-01-10T07:39:32.475738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
굴착기 4407
40.3%
덤프트럭 1863
17.1%
지게차 1790
16.4%
콘크리트 877
 
8.0%
믹서트럭 701
 
6.4%
기중기 350
 
3.2%
로더 217
 
2.0%
롤러 196
 
1.8%
펌프 168
 
1.5%
불도저 114
 
1.0%
Other values (12) 243
 
2.2%

형식
Text

Distinct1370
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:39:32.696298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length7.9646
Min length2

Characters and Unicode

Total characters79646
Distinct characters54
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique626 ?
Unique (%)6.3%

Sample

1st rowU-35-6S
2nd rowG490CB8X4
3rd rowDX55ACE
4th rowD160S-5
5th rowEC60EPRO
ValueCountFrequency (%)
dx55mt-5 284
 
2.8%
dx140w-5 281
 
2.8%
ec60epro 208
 
2.1%
ec55c 160
 
1.6%
hx60amt 146
 
1.5%
d30se-7 136
 
1.4%
hw145 121
 
1.2%
hd150-dus-dhr 119
 
1.2%
hx60mt 118
 
1.2%
dl4mp1 113
 
1.1%
Other values (1369) 8363
83.2%
2024-01-10T07:39:33.032032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7465
 
9.4%
5 6838
 
8.6%
D 6375
 
8.0%
- 6282
 
7.9%
H 3835
 
4.8%
E 3510
 
4.4%
4 3445
 
4.3%
X 3438
 
4.3%
1 3382
 
4.2%
S 2882
 
3.6%
Other values (44) 32194
40.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 41904
52.6%
Decimal Number 31080
39.0%
Dash Punctuation 6282
 
7.9%
Other Punctuation 186
 
0.2%
Open Punctuation 59
 
0.1%
Close Punctuation 59
 
0.1%
Space Separator 49
 
0.1%
Letter Number 13
 
< 0.1%
Other Letter 9
 
< 0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
D 6375
15.2%
H 3835
 
9.2%
E 3510
 
8.4%
X 3438
 
8.2%
S 2882
 
6.9%
M 2447
 
5.8%
C 2402
 
5.7%
R 2101
 
5.0%
L 1643
 
3.9%
A 1563
 
3.7%
Other values (16) 11708
27.9%
Decimal Number
ValueCountFrequency (%)
0 7465
24.0%
5 6838
22.0%
4 3445
11.1%
1 3382
10.9%
3 2322
 
7.5%
7 1981
 
6.4%
6 1911
 
6.1%
2 1544
 
5.0%
8 1352
 
4.4%
9 840
 
2.7%
Other Letter
ValueCountFrequency (%)
3
33.3%
2
22.2%
2
22.2%
1
 
11.1%
1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
. 157
84.4%
/ 28
 
15.1%
" 1
 
0.5%
Letter Number
ValueCountFrequency (%)
6
46.2%
4
30.8%
3
23.1%
Math Symbol
ValueCountFrequency (%)
+ 2
66.7%
× 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 6282
100.0%
Open Punctuation
ValueCountFrequency (%)
( 59
100.0%
Close Punctuation
ValueCountFrequency (%)
) 59
100.0%
Space Separator
ValueCountFrequency (%)
49
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 41917
52.6%
Common 37720
47.4%
Hangul 9
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
D 6375
15.2%
H 3835
 
9.1%
E 3510
 
8.4%
X 3438
 
8.2%
S 2882
 
6.9%
M 2447
 
5.8%
C 2402
 
5.7%
R 2101
 
5.0%
L 1643
 
3.9%
A 1563
 
3.7%
Other values (19) 11721
28.0%
Common
ValueCountFrequency (%)
0 7465
19.8%
5 6838
18.1%
- 6282
16.7%
4 3445
9.1%
1 3382
9.0%
3 2322
 
6.2%
7 1981
 
5.3%
6 1911
 
5.1%
2 1544
 
4.1%
8 1352
 
3.6%
Other values (10) 1198
 
3.2%
Hangul
ValueCountFrequency (%)
3
33.3%
2
22.2%
2
22.2%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79623
> 99.9%
Number Forms 13
 
< 0.1%
Hangul 9
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7465
 
9.4%
5 6838
 
8.6%
D 6375
 
8.0%
- 6282
 
7.9%
H 3835
 
4.8%
E 3510
 
4.4%
4 3445
 
4.3%
X 3438
 
4.3%
1 3382
 
4.2%
S 2882
 
3.6%
Other values (35) 32171
40.4%
Number Forms
ValueCountFrequency (%)
6
46.2%
4
30.8%
3
23.1%
Hangul
ValueCountFrequency (%)
3
33.3%
2
22.2%
2
22.2%
1
 
11.1%
1
 
11.1%
None
ValueCountFrequency (%)
× 1
100.0%
Distinct387
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:39:33.274725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length41
Mean length24.6952
Min length18

Characters and Unicode

Total characters246952
Distinct characters277
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)1.5%

Sample

1st row충청남도 부여군 부여읍 부장대로 57
2nd row충청남도 홍성군 홍성읍 의사로36번길 8
3rd row충청남도 부여군 부여읍 금성로 2
4th row충청남도 보령시 주교면 토정로 1028-1
5th row충청남도 보령시 천변남길 238, 1층(동대동)
ValueCountFrequency (%)
충청남도 10000
 
20.0%
천안시 1887
 
3.8%
아산시 1680
 
3.4%
동남구 1647
 
3.3%
당진시 1137
 
2.3%
서산시 755
 
1.5%
공주시 663
 
1.3%
예산군 652
 
1.3%
홍성군 630
 
1.3%
홍성읍 619
 
1.2%
Other values (827) 30413
60.7%
2024-01-10T07:39:33.642032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40083
 
16.2%
12888
 
5.2%
11467
 
4.6%
10567
 
4.3%
10260
 
4.2%
2 9351
 
3.8%
1 8660
 
3.5%
8473
 
3.4%
7780
 
3.2%
7323
 
3.0%
Other values (267) 120100
48.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 146988
59.5%
Decimal Number 41332
 
16.7%
Space Separator 40083
 
16.2%
Close Punctuation 5834
 
2.4%
Open Punctuation 5834
 
2.4%
Other Punctuation 3576
 
1.4%
Dash Punctuation 3096
 
1.3%
Uppercase Letter 209
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12888
 
8.8%
11467
 
7.8%
10567
 
7.2%
10260
 
7.0%
8473
 
5.8%
7780
 
5.3%
7323
 
5.0%
4819
 
3.3%
4409
 
3.0%
3707
 
2.5%
Other values (246) 65295
44.4%
Decimal Number
ValueCountFrequency (%)
2 9351
22.6%
1 8660
21.0%
3 4643
11.2%
5 3520
 
8.5%
4 3299
 
8.0%
0 3235
 
7.8%
6 2531
 
6.1%
7 2461
 
6.0%
8 2295
 
5.6%
9 1337
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
A 159
76.1%
P 23
 
11.0%
S 23
 
11.0%
D 2
 
1.0%
B 2
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 3574
99.9%
/ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
40083
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5834
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5834
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3096
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 146988
59.5%
Common 99755
40.4%
Latin 209
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12888
 
8.8%
11467
 
7.8%
10567
 
7.2%
10260
 
7.0%
8473
 
5.8%
7780
 
5.3%
7323
 
5.0%
4819
 
3.3%
4409
 
3.0%
3707
 
2.5%
Other values (246) 65295
44.4%
Common
ValueCountFrequency (%)
40083
40.2%
2 9351
 
9.4%
1 8660
 
8.7%
) 5834
 
5.8%
( 5834
 
5.8%
3 4643
 
4.7%
, 3574
 
3.6%
5 3520
 
3.5%
4 3299
 
3.3%
0 3235
 
3.2%
Other values (6) 11722
 
11.8%
Latin
ValueCountFrequency (%)
A 159
76.1%
P 23
 
11.0%
S 23
 
11.0%
D 2
 
1.0%
B 2
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 146988
59.5%
ASCII 99964
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
40083
40.1%
2 9351
 
9.4%
1 8660
 
8.7%
) 5834
 
5.8%
( 5834
 
5.8%
3 4643
 
4.6%
, 3574
 
3.6%
5 3520
 
3.5%
4 3299
 
3.3%
0 3235
 
3.2%
Other values (11) 11931
 
11.9%
Hangul
ValueCountFrequency (%)
12888
 
8.8%
11467
 
7.8%
10567
 
7.2%
10260
 
7.0%
8473
 
5.8%
7780
 
5.3%
7323
 
5.0%
4819
 
3.3%
4409
 
3.0%
3707
 
2.5%
Other values (246) 65295
44.4%

Interactions

2024-01-10T07:39:31.627184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:39:31.468136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:39:31.706748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:39:31.547148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:39:33.727793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번최초등록일건설기계명
순번1.0000.1450.554
최초등록일0.1451.0000.530
건설기계명0.5540.5301.000
2024-01-10T07:39:33.801931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번최초등록일건설기계명
순번1.000-0.0370.206
최초등록일-0.0371.0000.196
건설기계명0.2060.1961.000

Missing values

2024-01-10T07:39:31.800932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:39:31.887387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번최초등록일건설기계명형식사용본거지
147011470220220704굴착기U-35-6S충청남도 부여군 부여읍 부장대로 57
167791678020160412덤프트럭G490CB8X4충청남도 홍성군 홍성읍 의사로36번길 8
144311443220150922굴착기DX55ACE충청남도 부여군 부여읍 금성로 2
5504550520081211지게차D160S-5충청남도 보령시 주교면 토정로 1028-1
5340534120200831굴착기EC60EPRO충청남도 보령시 천변남길 238, 1층(동대동)
8071807220220524지게차30D-9V충청남도 아산시 음봉면 신수리길 161
114411144220210401덤프트럭HD150-YDU1-ESH충청남도 논산시 득안대로 1282(내동)
1202120320221104굴착기EW140EPRO충청남도 천안시 동남구 고재4길 6, 102호(원성동, 예신빌딩)
727320151005굴착기ZX35U-5B충청남도 천안시 동남구 고재4길 6, 102호(원성동, 예신빌딩)
146231462420211119굴착기DX55MT-5충청남도 부여군 부여읍 금성로 2
순번최초등록일건설기계명형식사용본거지
108751087620180816굴착기EW140E충청남도 논산시 먹골1길 1-3(내동)
6381638220180219굴착기DX140W-5충청남도 아산시 시민로 467-5(온천동)
4043404420180713굴착기DX55MT-5충청남도 공주시 번영3로 5(신관동)
9943994420171116지게차30L-7A충청남도 서산시 지곡면 충의로 958
168351683620110401덤프트럭H84SDC544I충청남도 홍성군 홍성읍 의사로36번길 8
1248124920070319로더WA500-3충청남도 천안시 동남구 광교로 33-1(구성동)
161931619420141230굴착기ROBEX55충청남도 홍성군 홍성읍 홍성천길 154
141441414520221011콘크리트 믹서트럭HD060-YMX0-ESHA충청남도 금산군 복수면 수심대길 9
3209321020170313콘크리트 믹서트럭DL4MP1충청남도 천안시 서북구 오성4길 25(두정동)
127661276720140318지게차D45S-7충청남도 당진시 당진중앙1로 272-32, 202호(읍내동)