Overview

Dataset statistics

Number of variables6
Number of observations105
Missing cells65
Missing cells (%)10.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory50.3 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description부산광역시연제구_자동차관리업체현황_20220218
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025098

Alerts

상태정보 has constant value ""Constant
관리사업유형 is highly imbalanced (92.2%)Imbalance
전화번호 has 65 (61.9%) missing valuesMissing
순번 has unique valuesUnique
사업자 상호(명칭) has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:26:48.031032
Analysis finished2023-12-10 16:26:49.049744
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53
Minimum1
Maximum105
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-11T01:26:49.185145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.2
Q127
median53
Q379
95-th percentile99.8
Maximum105
Range104
Interquartile range (IQR)52

Descriptive statistics

Standard deviation30.454885
Coefficient of variation (CV)0.57462047
Kurtosis-1.2
Mean53
Median Absolute Deviation (MAD)26
Skewness0
Sum5565
Variance927.5
MonotonicityStrictly increasing
2023-12-11T01:26:49.403667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
80 1
 
1.0%
78 1
 
1.0%
77 1
 
1.0%
76 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
Other values (95) 95
90.5%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
105 1
1.0%
104 1
1.0%
103 1
1.0%
102 1
1.0%
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%

상태정보
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
영업
105 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 105
100.0%

Length

2023-12-11T01:26:49.591962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:26:49.714086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 105
100.0%

관리사업유형
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size972.0 B
자동차매매업
104 
자동차종합정비업
 
1

Length

Max length8
Median length6
Mean length6.0190476
Min length6

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row자동차매매업
2nd row자동차매매업
3rd row자동차매매업
4th row자동차매매업
5th row자동차매매업

Common Values

ValueCountFrequency (%)
자동차매매업 104
99.0%
자동차종합정비업 1
 
1.0%

Length

2023-12-11T01:26:49.883826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:26:50.025892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자동차매매업 104
99.0%
자동차종합정비업 1
 
1.0%

전화번호
Text

MISSING 

Distinct40
Distinct (%)100.0%
Missing65
Missing (%)61.9%
Memory size972.0 B
2023-12-11T01:26:50.253096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.925
Min length9

Characters and Unicode

Total characters477
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row051-854-0400
2nd row051-853-8221
3rd row051-861-7577
4th row051-865-3355
5th row051-866-6644
ValueCountFrequency (%)
051-851-8851 1
 
2.5%
051-868-9455 1
 
2.5%
051-866-0080 1
 
2.5%
051-862-2828 1
 
2.5%
051-913-2878 1
 
2.5%
051-863-5577 1
 
2.5%
051-862-8949 1
 
2.5%
051-803-1318 1
 
2.5%
051-715-1221 1
 
2.5%
051-714-0120 1
 
2.5%
Other values (30) 30
75.0%
2023-12-11T01:26:50.771114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 79
16.6%
- 79
16.6%
1 74
15.5%
0 69
14.5%
8 48
10.1%
4 25
 
5.2%
3 23
 
4.8%
7 23
 
4.8%
6 22
 
4.6%
2 20
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 398
83.4%
Dash Punctuation 79
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 79
19.8%
1 74
18.6%
0 69
17.3%
8 48
12.1%
4 25
 
6.3%
3 23
 
5.8%
7 23
 
5.8%
6 22
 
5.5%
2 20
 
5.0%
9 15
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 79
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 477
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 79
16.6%
- 79
16.6%
1 74
15.5%
0 69
14.5%
8 48
10.1%
4 25
 
5.2%
3 23
 
4.8%
7 23
 
4.8%
6 22
 
4.6%
2 20
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 477
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 79
16.6%
- 79
16.6%
1 74
15.5%
0 69
14.5%
8 48
10.1%
4 25
 
5.2%
3 23
 
4.8%
7 23
 
4.8%
6 22
 
4.6%
2 20
 
4.2%
Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2023-12-11T01:26:51.038360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length14
Mean length7.2666667
Min length3

Characters and Unicode

Total characters763
Distinct characters150
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)100.0%

Sample

1st row프로모터스
2nd row진성모터스
3rd row친절한 모터스
4th row(주)연산자동차
5th row(주)kb모터스
ValueCountFrequency (%)
주식회사 7
 
5.8%
모터스 3
 
2.5%
프로모터스 1
 
0.8%
콰트로 1
 
0.8%
kb모터스 1
 
0.8%
주)차앤문모터스 1
 
0.8%
주)아이언모터스 1
 
0.8%
유한회사 1
 
0.8%
저먼오토모빌 1
 
0.8%
정상모터스 1
 
0.8%
Other values (103) 103
85.1%
2023-12-11T01:26:51.431965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
8.1%
57
 
7.5%
56
 
7.3%
55
 
7.2%
( 49
 
6.4%
) 49
 
6.4%
24
 
3.1%
19
 
2.5%
19
 
2.5%
19
 
2.5%
Other values (140) 354
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 623
81.7%
Open Punctuation 49
 
6.4%
Close Punctuation 49
 
6.4%
Uppercase Letter 21
 
2.8%
Space Separator 16
 
2.1%
Decimal Number 2
 
0.3%
Lowercase Letter 2
 
0.3%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
10.0%
57
 
9.1%
56
 
9.0%
55
 
8.8%
24
 
3.9%
19
 
3.0%
19
 
3.0%
19
 
3.0%
17
 
2.7%
14
 
2.2%
Other values (117) 281
45.1%
Uppercase Letter
ValueCountFrequency (%)
K 4
19.0%
B 3
14.3%
C 1
 
4.8%
H 1
 
4.8%
T 1
 
4.8%
A 1
 
4.8%
E 1
 
4.8%
R 1
 
4.8%
O 1
 
4.8%
G 1
 
4.8%
Other values (6) 6
28.6%
Lowercase Letter
ValueCountFrequency (%)
b 1
50.0%
k 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 624
81.8%
Common 116
 
15.2%
Latin 23
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
9.9%
57
 
9.1%
56
 
9.0%
55
 
8.8%
24
 
3.8%
19
 
3.0%
19
 
3.0%
19
 
3.0%
17
 
2.7%
14
 
2.2%
Other values (118) 282
45.2%
Latin
ValueCountFrequency (%)
K 4
17.4%
B 3
 
13.0%
C 1
 
4.3%
H 1
 
4.3%
b 1
 
4.3%
T 1
 
4.3%
A 1
 
4.3%
k 1
 
4.3%
E 1
 
4.3%
R 1
 
4.3%
Other values (8) 8
34.8%
Common
ValueCountFrequency (%)
( 49
42.2%
) 49
42.2%
16
 
13.8%
2 2
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 623
81.7%
ASCII 139
 
18.2%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
10.0%
57
 
9.1%
56
 
9.0%
55
 
8.8%
24
 
3.9%
19
 
3.0%
19
 
3.0%
19
 
3.0%
17
 
2.7%
14
 
2.2%
Other values (117) 281
45.1%
ASCII
ValueCountFrequency (%)
( 49
35.3%
) 49
35.3%
16
 
11.5%
K 4
 
2.9%
B 3
 
2.2%
2 2
 
1.4%
C 1
 
0.7%
H 1
 
0.7%
b 1
 
0.7%
T 1
 
0.7%
Other values (12) 12
 
8.6%
None
ValueCountFrequency (%)
1
100.0%
Distinct76
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Memory size972.0 B
2023-12-11T01:26:51.658351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length36
Mean length28.104762
Min length22

Characters and Unicode

Total characters2951
Distinct characters49
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)63.8%

Sample

1st row부산광역시 연제구 거제천로 174(거제동)
2nd row부산광역시 연제구 중앙대로 1154(연산동)
3rd row부산광역시 연제구 거제천로182번길 3, (연산동)
4th row부산광역시 연제구 거제천로182번길 3, 203동동(연산동)
5th row부산광역시 연제구 거제천로 174(연산동)
ValueCountFrequency (%)
부산광역시 105
20.3%
연제구 105
20.3%
경기장로 61
11.8%
21 28
 
5.4%
15 27
 
5.2%
거제천로 22
 
4.2%
사직오토랜드 15
 
2.9%
연산동 14
 
2.7%
중앙대로 13
 
2.5%
174 12
 
2.3%
Other values (61) 116
22.4%
2023-12-11T01:26:52.007980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
413
 
14.0%
200
 
6.8%
1 158
 
5.4%
148
 
5.0%
148
 
5.0%
114
 
3.9%
105
 
3.6%
( 105
 
3.6%
105
 
3.6%
105
 
3.6%
Other values (39) 1350
45.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1767
59.9%
Decimal Number 467
 
15.8%
Space Separator 413
 
14.0%
Open Punctuation 105
 
3.6%
Close Punctuation 105
 
3.6%
Other Punctuation 93
 
3.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
200
 
11.3%
148
 
8.4%
148
 
8.4%
114
 
6.5%
105
 
5.9%
105
 
5.9%
105
 
5.9%
105
 
5.9%
105
 
5.9%
104
 
5.9%
Other values (24) 528
29.9%
Decimal Number
ValueCountFrequency (%)
1 158
33.8%
2 74
15.8%
5 52
 
11.1%
3 49
 
10.5%
4 42
 
9.0%
0 40
 
8.6%
7 28
 
6.0%
8 13
 
2.8%
6 8
 
1.7%
9 3
 
0.6%
Space Separator
ValueCountFrequency (%)
413
100.0%
Open Punctuation
ValueCountFrequency (%)
( 105
100.0%
Close Punctuation
ValueCountFrequency (%)
) 105
100.0%
Other Punctuation
ValueCountFrequency (%)
, 93
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1767
59.9%
Common 1184
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
200
 
11.3%
148
 
8.4%
148
 
8.4%
114
 
6.5%
105
 
5.9%
105
 
5.9%
105
 
5.9%
105
 
5.9%
105
 
5.9%
104
 
5.9%
Other values (24) 528
29.9%
Common
ValueCountFrequency (%)
413
34.9%
1 158
 
13.3%
( 105
 
8.9%
) 105
 
8.9%
, 93
 
7.9%
2 74
 
6.2%
5 52
 
4.4%
3 49
 
4.1%
4 42
 
3.5%
0 40
 
3.4%
Other values (5) 53
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1767
59.9%
ASCII 1184
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
413
34.9%
1 158
 
13.3%
( 105
 
8.9%
) 105
 
8.9%
, 93
 
7.9%
2 74
 
6.2%
5 52
 
4.4%
3 49
 
4.1%
4 42
 
3.5%
0 40
 
3.4%
Other values (5) 53
 
4.5%
Hangul
ValueCountFrequency (%)
200
 
11.3%
148
 
8.4%
148
 
8.4%
114
 
6.5%
105
 
5.9%
105
 
5.9%
105
 
5.9%
105
 
5.9%
105
 
5.9%
104
 
5.9%
Other values (24) 528
29.9%

Interactions

2023-12-11T01:26:48.352461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:26:52.093502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번관리사업유형전화번호사업장주소
순번1.0000.0001.0000.922
관리사업유형0.0001.0001.0001.000
전화번호1.0001.0001.0001.000
사업장주소0.9221.0001.0001.000
2023-12-11T01:26:52.176233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번관리사업유형
순번1.0000.000
관리사업유형0.0001.000

Missing values

2023-12-11T01:26:48.824449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:26:48.988498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상태정보관리사업유형전화번호사업자 상호(명칭)사업장주소
01영업자동차매매업051-854-0400프로모터스부산광역시 연제구 거제천로 174(거제동)
12영업자동차매매업<NA>진성모터스부산광역시 연제구 중앙대로 1154(연산동)
23영업자동차매매업051-853-8221친절한 모터스부산광역시 연제구 거제천로182번길 3, (연산동)
34영업자동차매매업051-861-7577(주)연산자동차부산광역시 연제구 거제천로182번길 3, 203동동(연산동)
45영업자동차매매업051-865-3355(주)kb모터스부산광역시 연제구 거제천로 174(연산동)
56영업자동차매매업051-866-6644수모터스부산광역시 연제구 거제천로 174, (연산동)
67영업자동차매매업051-853-5700(주)스피드모터스부산광역시 연제구 거제천로 174(거제동)
78영업자동차매매업<NA>동인자동차부산광역시 연제구 거제천로 174(연산동)
89영업자동차매매업051-866-8195중앙자동차매매상사부산광역시 연제구 거제천로 174, (연산동)
910영업자동차매매업051-853-4200한솔자동차매매상사부산광역시 연제구 거제천로 174, (연산동)
순번상태정보관리사업유형전화번호사업자 상호(명칭)사업장주소
9596영업자동차매매업<NA>(주)오토라인모터스부산광역시 연제구 경기장로 21, 311호(거제동)
9697영업자동차매매업<NA>판다모터스부산광역시 연제구 경기장로 21, 210호(거제동)
9798영업자동차매매업<NA>(주)아라모터스부산광역시 연제구 경기장로 21, 301호(거제동)
9899영업자동차매매업<NA>(주)더마니카부산광역시 연제구 경기장로 21, 214호(거제동)
99100영업자동차매매업<NA>신박한모터스부산광역시 연제구 경기장로 21, 316호(거제동)
100101영업자동차매매업<NA>(주)에이스타모터스부산광역시 연제구 경기장로 21, 208호(거제동)
101102영업자동차매매업051-709-6312스타자동차(주)사직오토랜드중고사업부부산광역시 연제구 경기장로 21, 1층(거제동)
102103영업자동차매매업<NA>에코모터스부산광역시 연제구 경기장로 21, 211호(거제동)
103104영업자동차매매업051-714-1300카 빌리지부산광역시 연제구 경기장로 21, 204호(거제동)
104105영업자동차종합정비업051-850-4051현대자동차㈜부산하이테크센터부산광역시 연제구 중앙대로 1168 (거제1동 1470-2)