Overview

Dataset statistics

Number of variables5
Number of observations51
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory43.6 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description경상남도 하동군 자동차정비업 (업체명, 소재지(도로명), 전화번호, 정비사업유형)의 정보를 제공하고 있습니다.
Author경상남도 하동군
URLhttps://www.data.go.kr/data/3063785/fileData.do

Alerts

연번 is highly overall correlated with 정비사업유형High correlation
정비사업유형 is highly overall correlated with 연번High correlation
정비사업유형 is highly imbalanced (56.1%)Imbalance
연번 has unique valuesUnique
업체명 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:11:51.391236
Analysis finished2023-12-12 19:11:51.953068
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-13T04:11:52.036330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.5
Q113.5
median26
Q338.5
95-th percentile48.5
Maximum51
Range50
Interquartile range (IQR)25

Descriptive statistics

Standard deviation14.866069
Coefficient of variation (CV)0.57177187
Kurtosis-1.2
Mean26
Median Absolute Deviation (MAD)13
Skewness0
Sum1326
Variance221
MonotonicityStrictly increasing
2023-12-13T04:11:52.208675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
2.0%
2 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
51 1
2.0%
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%

업체명
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-13T04:11:52.495318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length8.1764706
Min length3

Characters and Unicode

Total characters417
Distinct characters107
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row대광자동차정비
2nd row현대자동차종합정비공업사
3rd row(주)두일1급자동차종합비
4th row하동자동차정비
5th row월드모터스
ValueCountFrequency (%)
하동점 5
 
7.7%
현대자동차 2
 
3.1%
대광자동차정비 1
 
1.5%
하동점기아오토큐 1
 
1.5%
진교점 1
 
1.5%
또바기자동차부분정비공업사 1
 
1.5%
우리카부분정비 1
 
1.5%
애니카랜드 1
 
1.5%
카즈텍3급정비 1
 
1.5%
외형복원 1
 
1.5%
Other values (50) 50
76.9%
2023-12-13T04:11:52.997825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
8.4%
26
 
6.2%
24
 
5.8%
22
 
5.3%
22
 
5.3%
17
 
4.1%
14
 
3.4%
14
 
3.4%
13
 
3.1%
13
 
3.1%
Other values (97) 217
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 397
95.2%
Space Separator 14
 
3.4%
Decimal Number 3
 
0.7%
Other Punctuation 1
 
0.2%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
8.8%
26
 
6.5%
24
 
6.0%
22
 
5.5%
22
 
5.5%
17
 
4.3%
14
 
3.5%
13
 
3.3%
13
 
3.3%
11
 
2.8%
Other values (91) 200
50.4%
Decimal Number
ValueCountFrequency (%)
3 2
66.7%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
14
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 397
95.2%
Common 20
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
8.8%
26
 
6.5%
24
 
6.0%
22
 
5.5%
22
 
5.5%
17
 
4.3%
14
 
3.5%
13
 
3.3%
13
 
3.3%
11
 
2.8%
Other values (91) 200
50.4%
Common
ValueCountFrequency (%)
14
70.0%
3 2
 
10.0%
, 1
 
5.0%
( 1
 
5.0%
) 1
 
5.0%
1 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 397
95.2%
ASCII 20
 
4.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
8.8%
26
 
6.5%
24
 
6.0%
22
 
5.5%
22
 
5.5%
17
 
4.3%
14
 
3.5%
13
 
3.3%
13
 
3.3%
11
 
2.8%
Other values (91) 200
50.4%
ASCII
ValueCountFrequency (%)
14
70.0%
3 2
 
10.0%
, 1
 
5.0%
( 1
 
5.0%
) 1
 
5.0%
1 1
 
5.0%

소재지
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-13T04:11:53.301871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length12.313725
Min length9

Characters and Unicode

Total characters628
Distinct characters73
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row진교면 새미골길 32-2
2nd row하동읍 섬진강대로 2334
3rd row하동읍 화심길 126
4th row고전면 선소길 38-5
5th row금남면 산업로 911-1
ValueCountFrequency (%)
하동읍 17
 
10.9%
진교면 10
 
6.4%
경서대로 7
 
4.5%
군청로 6
 
3.8%
산업로 5
 
3.2%
경충로 5
 
3.2%
금성면 5
 
3.2%
옥종면 4
 
2.6%
금남면 4
 
2.6%
악양면 3
 
1.9%
Other values (79) 90
57.7%
2023-12-13T04:11:53.746610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
105
 
16.7%
34
 
5.4%
1 34
 
5.4%
31
 
4.9%
2 24
 
3.8%
3 21
 
3.3%
18
 
2.9%
18
 
2.9%
6 17
 
2.7%
17
 
2.7%
Other values (63) 309
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 345
54.9%
Decimal Number 166
26.4%
Space Separator 105
 
16.7%
Dash Punctuation 12
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
9.9%
31
 
9.0%
18
 
5.2%
18
 
5.2%
17
 
4.9%
13
 
3.8%
13
 
3.8%
12
 
3.5%
11
 
3.2%
10
 
2.9%
Other values (51) 168
48.7%
Decimal Number
ValueCountFrequency (%)
1 34
20.5%
2 24
14.5%
3 21
12.7%
6 17
10.2%
4 15
9.0%
5 13
 
7.8%
7 11
 
6.6%
8 11
 
6.6%
9 10
 
6.0%
0 10
 
6.0%
Space Separator
ValueCountFrequency (%)
105
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 345
54.9%
Common 283
45.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
9.9%
31
 
9.0%
18
 
5.2%
18
 
5.2%
17
 
4.9%
13
 
3.8%
13
 
3.8%
12
 
3.5%
11
 
3.2%
10
 
2.9%
Other values (51) 168
48.7%
Common
ValueCountFrequency (%)
105
37.1%
1 34
 
12.0%
2 24
 
8.5%
3 21
 
7.4%
6 17
 
6.0%
4 15
 
5.3%
5 13
 
4.6%
- 12
 
4.2%
7 11
 
3.9%
8 11
 
3.9%
Other values (2) 20
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 345
54.9%
ASCII 283
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
105
37.1%
1 34
 
12.0%
2 24
 
8.5%
3 21
 
7.4%
6 17
 
6.0%
4 15
 
5.3%
5 13
 
4.6%
- 12
 
4.2%
7 11
 
3.9%
8 11
 
3.9%
Other values (2) 20
 
7.1%
Hangul
ValueCountFrequency (%)
34
 
9.9%
31
 
9.0%
18
 
5.2%
18
 
5.2%
17
 
4.9%
13
 
3.8%
13
 
3.8%
12
 
3.5%
11
 
3.2%
10
 
2.9%
Other values (51) 168
48.7%
Distinct50
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-13T04:11:54.034947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.941176
Min length9

Characters and Unicode

Total characters609
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)96.1%

Sample

1st row055-883-9173
2nd row055-882-6600
3rd row055-882-2500
4th row055-884-2121
5th row055-882-0456
ValueCountFrequency (%)
055-882-4537 2
 
3.9%
055-883-5446 1
 
2.0%
055-884-0808 1
 
2.0%
055-882-2302 1
 
2.0%
055-883-7358 1
 
2.0%
055-883-0988 1
 
2.0%
055-882-3216 1
 
2.0%
055-884-2063 1
 
2.0%
055-882-7977 1
 
2.0%
055-884-0078 1
 
2.0%
Other values (40) 40
78.4%
2023-12-13T04:11:54.421294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 127
20.9%
5 122
20.0%
- 101
16.6%
0 81
13.3%
2 38
 
6.2%
4 33
 
5.4%
3 32
 
5.3%
1 22
 
3.6%
7 20
 
3.3%
6 19
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 508
83.4%
Dash Punctuation 101
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 127
25.0%
5 122
24.0%
0 81
15.9%
2 38
 
7.5%
4 33
 
6.5%
3 32
 
6.3%
1 22
 
4.3%
7 20
 
3.9%
6 19
 
3.7%
9 14
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 101
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 609
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 127
20.9%
5 122
20.0%
- 101
16.6%
0 81
13.3%
2 38
 
6.2%
4 33
 
5.4%
3 32
 
5.3%
1 22
 
3.6%
7 20
 
3.3%
6 19
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 609
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 127
20.9%
5 122
20.0%
- 101
16.6%
0 81
13.3%
2 38
 
6.2%
4 33
 
5.4%
3 32
 
5.3%
1 22
 
3.6%
7 20
 
3.3%
6 19
 
3.1%

정비사업유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size540.0 B
전문정비
44 
종합정비
소형종합
 
2

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종합정비
2nd row종합정비
3rd row종합정비
4th row종합정비
5th row종합정비

Common Values

ValueCountFrequency (%)
전문정비 44
86.3%
종합정비 5
 
9.8%
소형종합 2
 
3.9%

Length

2023-12-13T04:11:54.581936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:11:54.674667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문정비 44
86.3%
종합정비 5
 
9.8%
소형종합 2
 
3.9%

Interactions

2023-12-13T04:11:51.665009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:11:54.742911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명소재지전화번호정비사업유형
연번1.0001.0001.0000.9450.862
업체명1.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.000
전화번호0.9451.0001.0001.0001.000
정비사업유형0.8621.0001.0001.0001.000
2023-12-13T04:11:54.833635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정비사업유형
연번1.0000.577
정비사업유형0.5771.000

Missing values

2023-12-13T04:11:51.796773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:11:51.912762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명소재지전화번호정비사업유형
01대광자동차정비진교면 새미골길 32-2055-883-9173종합정비
12현대자동차종합정비공업사하동읍 섬진강대로 2334055-882-6600종합정비
23(주)두일1급자동차종합비하동읍 화심길 126055-882-2500종합정비
34하동자동차정비고전면 선소길 38-5055-884-2121종합정비
45월드모터스금남면 산업로 911-1055-882-0456종합정비
56창신자동차정비진교면 민다리안길 15055-883-2251소형종합
67이화모터스적량면 경서대로 501055-882-0521소형종합
78하남정비공업사진교면 들포길 45055-884-0506전문정비
89금성카센타진교면 민다리길 23055-882-4537전문정비
910금성공업사금성면 광포길 60055-882-0956전문정비
연번업체명소재지전화번호정비사업유형
4142해피카하동읍 군청로 1808529-8797전문정비
4243현대자동차 블루핸즈 하동점하동읍 군청로 149055-883-5805전문정비
4344세진모터스진교면 경충로 1145055-883-5222전문정비
4445재우자동차정비악양면 악양서로 325-6055-884-4160전문정비
4546옥종카센타옥종면 주포중앙길 21055-883-7811전문정비
4647북천카마스타북천면 경서대로 2477055-882-1673전문정비
4748옥종종합카센타옥종면 옥종중앙길 87055-882-0059전문정비
4849타이어테크 하동점하동읍 군청로 159055-882-1700전문정비
4950타이어프로 하동점하동읍 군청로 46055-884-0808전문정비
5051북천종합정비3급북천면 옥정리 594번지 6호055-882-8012전문정비