Overview

Dataset statistics

Number of variables4
Number of observations685
Missing cells99
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.5 KiB
Average record size in memory32.2 B

Variable types

Categorical1
Text3

Dataset

Description경기도 용인시 자동차 정비업 현황(구분, 사업장 상호, 주소, 전화번호)
Author경기도 용인시
URLhttps://www.data.go.kr/data/15044411/fileData.do

Alerts

전화번호 has 99 (14.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 02:27:47.982363
Analysis finished2023-12-12 02:27:48.482919
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
전문(3급)
567 
종합(1급)
75 
소형(2급)
 
43

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종합(1급)
2nd row종합(1급)
3rd row종합(1급)
4th row종합(1급)
5th row종합(1급)

Common Values

ValueCountFrequency (%)
전문(3급) 567
82.8%
종합(1급) 75
 
10.9%
소형(2급) 43
 
6.3%

Length

2023-12-12T11:27:48.558498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:27:48.701669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문(3급 567
82.8%
종합(1급 75
 
10.9%
소형(2급 43
 
6.3%
Distinct663
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T11:27:49.006376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length19
Mean length8.0729927
Min length3

Characters and Unicode

Total characters5530
Distinct characters378
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique644 ?
Unique (%)94.0%

Sample

1st row(주)오토모빌
2nd row한국도로공사 수원정비공장
3rd row(주)광명자동차공업
4th row한국자동차공업(주)
5th row(주)아이엠카케어모터스
ValueCountFrequency (%)
스피드메이트 20
 
2.3%
현대자동차 14
 
1.6%
자동차 8
 
0.9%
기아오토큐 7
 
0.8%
애니카랜드 6
 
0.7%
공업사 5
 
0.6%
카센타 5
 
0.6%
쌍용자동차 5
 
0.6%
현대공업사 4
 
0.5%
모터스 4
 
0.5%
Other values (710) 774
90.8%
2023-12-12T11:27:49.448689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
264
 
4.8%
245
 
4.4%
229
 
4.1%
221
 
4.0%
208
 
3.8%
203
 
3.7%
200
 
3.6%
181
 
3.3%
167
 
3.0%
151
 
2.7%
Other values (368) 3461
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4960
89.7%
Space Separator 167
 
3.0%
Uppercase Letter 113
 
2.0%
Close Punctuation 98
 
1.8%
Open Punctuation 98
 
1.8%
Lowercase Letter 53
 
1.0%
Decimal Number 24
 
0.4%
Other Punctuation 11
 
0.2%
Dash Punctuation 5
 
0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
264
 
5.3%
245
 
4.9%
229
 
4.6%
221
 
4.5%
208
 
4.2%
203
 
4.1%
200
 
4.0%
181
 
3.6%
151
 
3.0%
116
 
2.3%
Other values (312) 2942
59.3%
Uppercase Letter
ValueCountFrequency (%)
S 16
14.2%
O 12
10.6%
T 12
10.6%
M 10
 
8.8%
A 8
 
7.1%
G 7
 
6.2%
C 6
 
5.3%
E 5
 
4.4%
K 5
 
4.4%
P 4
 
3.5%
Other values (12) 28
24.8%
Lowercase Letter
ValueCountFrequency (%)
e 8
15.1%
o 7
13.2%
r 6
11.3%
t 5
9.4%
a 5
9.4%
n 4
7.5%
w 3
 
5.7%
s 3
 
5.7%
d 2
 
3.8%
h 2
 
3.8%
Other values (8) 8
15.1%
Decimal Number
ValueCountFrequency (%)
3 7
29.2%
2 5
20.8%
1 5
20.8%
5 4
16.7%
6 1
 
4.2%
0 1
 
4.2%
4 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 8
72.7%
& 1
 
9.1%
, 1
 
9.1%
/ 1
 
9.1%
Space Separator
ValueCountFrequency (%)
167
100.0%
Close Punctuation
ValueCountFrequency (%)
) 98
100.0%
Open Punctuation
ValueCountFrequency (%)
( 98
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4961
89.7%
Common 403
 
7.3%
Latin 166
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
264
 
5.3%
245
 
4.9%
229
 
4.6%
221
 
4.5%
208
 
4.2%
203
 
4.1%
200
 
4.0%
181
 
3.6%
151
 
3.0%
116
 
2.3%
Other values (313) 2943
59.3%
Latin
ValueCountFrequency (%)
S 16
 
9.6%
O 12
 
7.2%
T 12
 
7.2%
M 10
 
6.0%
e 8
 
4.8%
A 8
 
4.8%
o 7
 
4.2%
G 7
 
4.2%
C 6
 
3.6%
r 6
 
3.6%
Other values (30) 74
44.6%
Common
ValueCountFrequency (%)
167
41.4%
) 98
24.3%
( 98
24.3%
. 8
 
2.0%
3 7
 
1.7%
- 5
 
1.2%
2 5
 
1.2%
1 5
 
1.2%
5 4
 
1.0%
& 1
 
0.2%
Other values (5) 5
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4960
89.7%
ASCII 569
 
10.3%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
264
 
5.3%
245
 
4.9%
229
 
4.6%
221
 
4.5%
208
 
4.2%
203
 
4.1%
200
 
4.0%
181
 
3.6%
151
 
3.0%
116
 
2.3%
Other values (312) 2942
59.3%
ASCII
ValueCountFrequency (%)
167
29.3%
) 98
17.2%
( 98
17.2%
S 16
 
2.8%
O 12
 
2.1%
T 12
 
2.1%
M 10
 
1.8%
. 8
 
1.4%
e 8
 
1.4%
A 8
 
1.4%
Other values (45) 132
23.2%
None
ValueCountFrequency (%)
1
100.0%
Distinct681
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T11:27:49.813412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length28.918248
Min length11

Characters and Unicode

Total characters19809
Distinct characters178
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique677 ?
Unique (%)98.8%

Sample

1st row경기도 용인시 처인구 모현면 갈담리 323번지 1호
2nd row경기도 용인시 기흥구 덕영대로 2062(영덕동, 9외 5)
3rd row경기도 용인시 처인구 백옥대로 1290, 107동(유방동, 485-6외 3)
4th row경기도 용인시 처인구 경안천로 172(고림동, 696-7)
5th row경기도 용인시 기흥구 중부대로56번길 6-1, 304동(영덕동)
ValueCountFrequency (%)
용인시 684
 
17.5%
경기도 590
 
15.1%
처인구 240
 
6.1%
기흥구 236
 
6.0%
수지구 109
 
2.8%
중부대로 50
 
1.3%
모현면 45
 
1.2%
백옥대로 41
 
1.0%
포곡읍 37
 
0.9%
양지면 34
 
0.9%
Other values (1347) 1847
47.2%
2023-12-12T11:27:50.417068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3337
 
16.8%
930
 
4.7%
1 892
 
4.5%
842
 
4.3%
732
 
3.7%
686
 
3.5%
661
 
3.3%
627
 
3.2%
2 608
 
3.1%
602
 
3.0%
Other values (168) 9892
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10507
53.0%
Decimal Number 4170
 
21.1%
Space Separator 3337
 
16.8%
Dash Punctuation 513
 
2.6%
Open Punctuation 482
 
2.4%
Close Punctuation 482
 
2.4%
Other Punctuation 314
 
1.6%
Uppercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
930
 
8.9%
842
 
8.0%
732
 
7.0%
686
 
6.5%
661
 
6.3%
627
 
6.0%
602
 
5.7%
590
 
5.6%
515
 
4.9%
268
 
2.6%
Other values (148) 4054
38.6%
Decimal Number
ValueCountFrequency (%)
1 892
21.4%
2 608
14.6%
3 453
10.9%
5 390
9.4%
4 381
9.1%
6 331
 
7.9%
7 305
 
7.3%
0 274
 
6.6%
8 271
 
6.5%
9 265
 
6.4%
Uppercase Letter
ValueCountFrequency (%)
B 1
25.0%
T 1
25.0%
I 1
25.0%
D 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 312
99.4%
. 2
 
0.6%
Space Separator
ValueCountFrequency (%)
3337
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 513
100.0%
Open Punctuation
ValueCountFrequency (%)
( 482
100.0%
Close Punctuation
ValueCountFrequency (%)
) 482
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10507
53.0%
Common 9298
46.9%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
930
 
8.9%
842
 
8.0%
732
 
7.0%
686
 
6.5%
661
 
6.3%
627
 
6.0%
602
 
5.7%
590
 
5.6%
515
 
4.9%
268
 
2.6%
Other values (148) 4054
38.6%
Common
ValueCountFrequency (%)
3337
35.9%
1 892
 
9.6%
2 608
 
6.5%
- 513
 
5.5%
( 482
 
5.2%
) 482
 
5.2%
3 453
 
4.9%
5 390
 
4.2%
4 381
 
4.1%
6 331
 
3.6%
Other values (6) 1429
15.4%
Latin
ValueCountFrequency (%)
B 1
25.0%
T 1
25.0%
I 1
25.0%
D 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10507
53.0%
ASCII 9302
47.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3337
35.9%
1 892
 
9.6%
2 608
 
6.5%
- 513
 
5.5%
( 482
 
5.2%
) 482
 
5.2%
3 453
 
4.9%
5 390
 
4.2%
4 381
 
4.1%
6 331
 
3.6%
Other values (10) 1433
15.4%
Hangul
ValueCountFrequency (%)
930
 
8.9%
842
 
8.0%
732
 
7.0%
686
 
6.5%
661
 
6.3%
627
 
6.0%
602
 
5.7%
590
 
5.6%
515
 
4.9%
268
 
2.6%
Other values (148) 4054
38.6%

전화번호
Text

MISSING 

Distinct564
Distinct (%)96.2%
Missing99
Missing (%)14.5%
Memory size5.5 KiB
2023-12-12T11:27:50.749634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.018771
Min length9

Characters and Unicode

Total characters7043
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique542 ?
Unique (%)92.5%

Sample

1st row031-323-1600
2nd row031-289-2371
3rd row031-322-4975
4th row031-335-7842
5th row031-273-4700
ValueCountFrequency (%)
031-693-8245 2
 
0.3%
031-336-4947 2
 
0.3%
031-265-3337 2
 
0.3%
031-284-8297 2
 
0.3%
031-8021-7511 2
 
0.3%
031-336-6168 2
 
0.3%
031-338-9114 2
 
0.3%
031-272-3700 2
 
0.3%
031-336-5472 2
 
0.3%
031-265-0798 2
 
0.3%
Other values (554) 566
96.6%
2023-12-12T11:27:51.260036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 1374
19.5%
- 1171
16.6%
0 958
13.6%
1 901
12.8%
2 644
9.1%
8 455
 
6.5%
6 384
 
5.5%
5 335
 
4.8%
4 293
 
4.2%
7 290
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5872
83.4%
Dash Punctuation 1171
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1374
23.4%
0 958
16.3%
1 901
15.3%
2 644
11.0%
8 455
 
7.7%
6 384
 
6.5%
5 335
 
5.7%
4 293
 
5.0%
7 290
 
4.9%
9 238
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 1171
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7043
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1374
19.5%
- 1171
16.6%
0 958
13.6%
1 901
12.8%
2 644
9.1%
8 455
 
6.5%
6 384
 
5.5%
5 335
 
4.8%
4 293
 
4.2%
7 290
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7043
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1374
19.5%
- 1171
16.6%
0 958
13.6%
1 901
12.8%
2 644
9.1%
8 455
 
6.5%
6 384
 
5.5%
5 335
 
4.8%
4 293
 
4.2%
7 290
 
4.1%

Missing values

2023-12-12T11:27:48.356573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:27:48.447374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분사업장 상호(명칭)사업장주소전화번호
0종합(1급)(주)오토모빌경기도 용인시 처인구 모현면 갈담리 323번지 1호031-323-1600
1종합(1급)한국도로공사 수원정비공장경기도 용인시 기흥구 덕영대로 2062(영덕동, 9외 5)031-289-2371
2종합(1급)(주)광명자동차공업경기도 용인시 처인구 백옥대로 1290, 107동(유방동, 485-6외 3)031-322-4975
3종합(1급)한국자동차공업(주)경기도 용인시 처인구 경안천로 172(고림동, 696-7)031-335-7842
4종합(1급)(주)아이엠카케어모터스경기도 용인시 기흥구 중부대로56번길 6-1, 304동(영덕동)031-273-4700
5종합(1급)대청자동차정비(주)경기도 용인시 처인구 경안천로 214-5(고림동)031-332-2347
6종합(1급)도원자동차공업(주)경기도 용인시 처인구 양지면 남평로42번길 9-6(남곡리 489-7)031-338-2177
7종합(1급)한국지엠용인서비스센터(주)경기도 용인시 처인구 고진로 127(고림동, 819-4)031-332-8255
8종합(1급)경기자동차공업사경기도 용인시 기흥구 중부대로 14-9(영덕동, 533-4)031-206-1661
9종합(1급)현대자동차서비스(주) 수원서비스센터경기도 용인시 기흥구 중부대로 30(영덕동, 537-3)031-206-5151
구분사업장 상호(명칭)사업장주소전화번호
675전문(3급)리버티 오토용인시 기흥구 신갈로 85-1(신갈동 38-4)031-284-2866
676전문(3급)송전점 현대자동차용인시 처인구 이동면 백옥대로 20 (송전리 736번지)031-323-5300
677전문(3급)한빛모터스용인시 수지구 현암로 89번길 12-12, 1층(죽전동 1175-10)<NA>
678전문(3급)용인시 자동차 전문정비 협동조합용인시 처인구 고림로 29<NA>
679전문(3급)애니카랜드 보정점용인시 기흥구 용구대로 2469번길 164(보정동 375-16)031-8021-7511
680전문(3급)큐브50모터스용인시 처인구 모현면 파담로 140-9031-333-4290
681전문(3급)오토메카닉용인시 기흥구 서천동로 43번길 5-5(서천동 782-2)031-273-2580
682전문(3급)성실카센타용인시 기흥구 마북로 98(마북동254-2)031-283-8287
683전문(3급)기흥인터내셔널(유)용인시 기흥구 신정로 115(신갈동 378-2)070-7405-8262
684전문(3급)OK타이어 OK공업사용인시 처인구 양지면 중부대로2670번길 6-3 외 1필지1899-3088