Overview

Dataset statistics

Number of variables: 4
Number of observations: 368
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.6 KiB
Average record size in memory: 32.4 B

Variable types

Categorical: 1
Text: 3

Dataset

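Description: Status of freight transport companies within Yangsan City. The public data includes the business type, trade name, office address, and phone number.
Author: Yangsan-si, Gyeongsangnam-do (경상남도 양산시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15005271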

Alerts

업종 (business type) is highly imbalanced (70.1%) [Imbalance]
상호 (trade name) has unique values [Unique]
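
The 70.1% imbalance figure for 업종 is consistent with one minus the normalized Shannon entropy of the category counts listed in this report (334, 32 and 2). The sketch below assumes that definition. The function name `imbalance_score` is illustrative and is not taken from ydata-profiling.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class
    distribution and k is the number of classes (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Category counts for 업종 as listed in this report.
score = imbalance_score([334, 32, 2])
print(f"{score:.1%}")  # prints 70.1%
```

A perfectly balanced column scores 0 under this definition, and a column dominated by a single class approaches 1.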

Reproduction

Analysis started: 2024-04-17 15:01:00.596747
Analysis finished: 2024-04-17 15:01:00.910727
Duration: 0.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

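업종 (business type)
Categorical

IMBALANCE

Distinct: 3
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 KiB

Value | Count
일반화물 (general freight) | 334
이사화물 (moving freight) | 32
일반 이사화물 (general and moving freight) | 2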

Length

Max length: 7
Median length: 4
Mean length: 4.0163043
Min length: 4
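
The length statistics for 업종 can be re-derived from the three category counts in this report. The sketch below assumes that length means the number of Unicode code points, which is what Python's `len()` returns for a string. The variable names are illustrative.

```python
from statistics import mean, median

# Category counts for 업종 as listed in this report.
counts = {"일반화물": 334, "이사화물": 32, "일반 이사화물": 2}

# Expand to one length per observation. len() counts code points, so each
# Hangul syllable and the space count as one character.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print("max:", max(lengths))
print("median:", median(lengths))
print("mean:", round(mean(lengths), 7))
print("min:", min(lengths))
```

Only the two 일반 이사화물 rows are longer than four characters, which is why the mean sits just above 4.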

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 이사화물
2nd row: 이사화물
3rd row: 이사화물
4th row: 이사화물
5th row: 이사화물

Common Values

Value | Count | Frequency (%)
일반화물 | 334 | 90.8%
이사화물 | 32 | 8.7%
일반 이사화물 | 2 | 0.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
일반화물 | 334 | 90.3%
이사화물 | 34 | 9.2%
일반 | 2 | 0.5%
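
The counts in this table (334, 34, 2) differ from the category counts given earlier (334, 32, 2). They are consistent with counting whitespace-separated tokens, so that each of the two 일반 이사화물 rows contributes one 일반 and one 이사화물. A minimal sketch under that assumption follows. The tokenization rule is inferred from the numbers, not confirmed by the report.

```python
from collections import Counter

# Category counts for 업종 as listed in this report.
category_counts = {"일반화물": 334, "이사화물": 32, "일반 이사화물": 2}

# Count every whitespace-separated token once per row it appears in.
token_counts = Counter()
for value, n in category_counts.items():
    for token in value.split():
        token_counts[token] += n

total = sum(token_counts.values())
for token, n in token_counts.most_common():
    print(token, n, f"{n / total:.1%}")
```

The percentages are taken over 370 tokens, not 368 rows, which explains why 일반화물 drops from 90.8% to 90.3%.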

상호 (trade name)
Text

UNIQUE

Distinct: 368
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 KiB

Length

Max length: 20
Median length: 15
Mean length: 7.173913
Min length: 3

Characters and Unicode

Total characters: 2640
Distinct characters: 236
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
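
As an illustration of these character properties, the sketch below counts the Unicode general category of each character in one 상호 value taken from the sample rows of this report. It uses Python's standard `unicodedata` module, which exposes general categories but not scripts or blocks. It is not the profiler's own implementation, and the helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation,
# Nd = Decimal Number, Pd = Dash Punctuation.
print(category_counts("해림화물(주)"))   # a trade name from the sample rows
print(category_counts("055-388-3535"))  # a phone number from the sample rows
```

The two-letter codes map onto the category names used in the tables of this report, for example `Lo` for Other Letter.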

Unique

Unique: 368
Unique (%): 100.0%

Sample

1st row: 강남익스프레스
2nd row: 한화익스프레스
3rd row: 금호익스프레스
4th row: 롯데익스프레스
5th row: 신세계익스프레스

Value | Count | Frequency (%)
주식회사 | 4 | 1.1%
강남익스프레스 | 1 | 0.3%
주)신한상운 | 1 | 0.3%
주)영주화물 | 1 | 0.3%
주)엠케이로직스 | 1 | 0.3%
주)엘로드로지스 | 1 | 0.3%
주)엔지엔 | 1 | 0.3%
주)에이원로지스 | 1 | 0.3%
주)에스원로지스 | 1 | 0.3%
주)양산위생공사 | 1 | 0.3%
Other values (365) | 365 | 96.6%

Most occurring characters

ValueCountFrequency (%)
) 275
 
10.4%
( 275
 
10.4%
272
 
10.3%
130
 
4.9%
130
 
4.9%
98
 
3.7%
75
 
2.8%
66
 
2.5%
60
 
2.3%
50
 
1.9%
Other values (226) 1209
45.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2046 | 77.5%
Close Punctuation | 275 | 10.4%
Open Punctuation | 275 | 10.4%
Other Symbol | 17 | 0.6%
Uppercase Letter | 14 | 0.5%
Space Separator | 10 | 0.4%
Decimal Number | 2 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
272
 
13.3%
130
 
6.4%
130
 
6.4%
98
 
4.8%
75
 
3.7%
66
 
3.2%
60
 
2.9%
50
 
2.4%
42
 
2.1%
35
 
1.7%
Other values (210) 1088
53.2%
Uppercase Letter
ValueCountFrequency (%)
K 3
21.4%
G 3
21.4%
J 2
14.3%
B 1
 
7.1%
T 1
 
7.1%
C 1
 
7.1%
L 1
 
7.1%
S 1
 
7.1%
F 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 275
100.0%
Open Punctuation
ValueCountFrequency (%)
( 275
100.0%
Other Symbol
ValueCountFrequency (%)
17
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2063 | 78.1%
Common | 563 | 21.3%
Latin | 14 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
272
 
13.2%
130
 
6.3%
130
 
6.3%
98
 
4.8%
75
 
3.6%
66
 
3.2%
60
 
2.9%
50
 
2.4%
42
 
2.0%
35
 
1.7%
Other values (211) 1105
53.6%
Latin
ValueCountFrequency (%)
K 3
21.4%
G 3
21.4%
J 2
14.3%
B 1
 
7.1%
T 1
 
7.1%
C 1
 
7.1%
L 1
 
7.1%
S 1
 
7.1%
F 1
 
7.1%
Common
ValueCountFrequency (%)
) 275
48.8%
( 275
48.8%
10
 
1.8%
1 1
 
0.2%
2 1
 
0.2%
. 1
 
0.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2046 | 77.5%
ASCII | 577 | 21.9%
None | 17 | 0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 275
47.7%
( 275
47.7%
10
 
1.7%
K 3
 
0.5%
G 3
 
0.5%
J 2
 
0.3%
B 1
 
0.2%
T 1
 
0.2%
C 1
 
0.2%
1 1
 
0.2%
Other values (5) 5
 
0.9%
Hangul
ValueCountFrequency (%)
272
 
13.3%
130
 
6.4%
130
 
6.4%
98
 
4.8%
75
 
3.7%
66
 
3.2%
60
 
2.9%
50
 
2.4%
42
 
2.1%
35
 
1.7%
Other values (210) 1088
53.2%
None
ValueCountFrequency (%)
17
100.0%

영업소 주소 (office address)
Text

Distinct: 247
Distinct (%): 67.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 KiB

Length

Max length: 47
Median length: 43
Mean length: 24.834239
Min length: 18

Characters and Unicode

Total characters: 9139
Distinct characters: 230
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 208
Unique (%): 56.5%
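
For this column the report gives 247 distinct values but only 208 unique ones. The difference is that Distinct counts how many different values occur, while Unique counts the values that occur exactly once. The sketch below shows the distinction on shortened addresses standing in for the last ten sample rows of this report. The helper name `distinct_and_unique` is illustrative.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for n in freq.values() if n == 1)
    return distinct, unique

# Shortened addresses of sample rows 358-367 (province and city omitted).
addresses = [
    "옥곡8길 3", "옥곡8길 3", "옥곡8길 3",
    "원동로 1702", "원동로 1702", "남양산2길 20",
    "원동로 1702", "남양산2길 20",
    "신주로 16, 102동 2803호", "남양산2길 20",
]
print(distinct_and_unique(addresses))  # prints (4, 1)
```

Many companies share an address, so the unique count falls well below the distinct count.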

Sample

1st row: 경상남도 양산시 북안북8길 5, 가동 107호 (북부동, 북부시장상가)
2nd row: 경상남도 양산시 물금읍 화합4길 3-17
3rd row: 경상남도 양산시 북정중앙로 46, 2층 (북정동)
4th row: 경상남도 양산시 삼호동부10길 16 (삼호동)
5th row: 경상남도 양산시 평산남로 54 (평산동)

Value | Count | Frequency (%)
경상남도 | 368 | 17.9%
양산시 | 368 | 17.9%
동면 | 100 | 4.9%
물금읍 | 50 | 2.4%
20 | 43 | 2.1%
상북면 | 41 | 2.0%
남양산2길 | 40 | 1.9%
2층 | 22 | 1.1%
양산대로 | 21 | 1.0%
어곡동 | 20 | 1.0%
Other values (441) | 984 | 47.8%

Most occurring characters

ValueCountFrequency (%)
1690
18.5%
518
 
5.7%
474
 
5.2%
465
 
5.1%
428
 
4.7%
376
 
4.1%
370
 
4.0%
370
 
4.0%
302
 
3.3%
1 296
 
3.2%
Other values (220) 3850
42.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5509 | 60.3%
Space Separator | 1690 | 18.5%
Decimal Number | 1364 | 14.9%
Close Punctuation | 168 | 1.8%
Open Punctuation | 168 | 1.8%
Other Punctuation | 137 | 1.5%
Dash Punctuation | 57 | 0.6%
Uppercase Letter | 42 | 0.5%
Lowercase Letter | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
518
 
9.4%
474
 
8.6%
465
 
8.4%
428
 
7.8%
376
 
6.8%
370
 
6.7%
370
 
6.7%
302
 
5.5%
201
 
3.6%
172
 
3.1%
Other values (191) 1833
33.3%
Decimal Number
ValueCountFrequency (%)
1 296
21.7%
2 285
20.9%
0 141
10.3%
3 137
10.0%
4 95
 
7.0%
5 91
 
6.7%
7 91
 
6.7%
8 90
 
6.6%
6 88
 
6.5%
9 50
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
I 11
26.2%
C 11
26.2%
D 11
26.2%
A 5
11.9%
B 1
 
2.4%
S 1
 
2.4%
T 1
 
2.4%
K 1
 
2.4%
Lowercase Letter
ValueCountFrequency (%)
t 1
25.0%
y 1
25.0%
i 1
25.0%
c 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 134
97.8%
. 2
 
1.5%
? 1
 
0.7%
Space Separator
ValueCountFrequency (%)
1690
100.0%
Close Punctuation
ValueCountFrequency (%)
) 168
100.0%
Open Punctuation
ValueCountFrequency (%)
( 168
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5509 | 60.3%
Common | 3584 | 39.2%
Latin | 46 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
518
 
9.4%
474
 
8.6%
465
 
8.4%
428
 
7.8%
376
 
6.8%
370
 
6.7%
370
 
6.7%
302
 
5.5%
201
 
3.6%
172
 
3.1%
Other values (191) 1833
33.3%
Common
ValueCountFrequency (%)
1690
47.2%
1 296
 
8.3%
2 285
 
8.0%
) 168
 
4.7%
( 168
 
4.7%
0 141
 
3.9%
3 137
 
3.8%
, 134
 
3.7%
4 95
 
2.7%
5 91
 
2.5%
Other values (7) 379
 
10.6%
Latin
ValueCountFrequency (%)
I 11
23.9%
C 11
23.9%
D 11
23.9%
A 5
10.9%
B 1
 
2.2%
S 1
 
2.2%
T 1
 
2.2%
K 1
 
2.2%
t 1
 
2.2%
y 1
 
2.2%
Other values (2) 2
 
4.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5509 | 60.3%
ASCII | 3630 | 39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1690
46.6%
1 296
 
8.2%
2 285
 
7.9%
) 168
 
4.6%
( 168
 
4.6%
0 141
 
3.9%
3 137
 
3.8%
, 134
 
3.7%
4 95
 
2.6%
5 91
 
2.5%
Other values (19) 425
 
11.7%
Hangul
ValueCountFrequency (%)
518
 
9.4%
474
 
8.6%
465
 
8.4%
428
 
7.8%
376
 
6.8%
370
 
6.7%
370
 
6.7%
302
 
5.5%
201
 
3.6%
172
 
3.1%
Other values (191) 1833
33.3%

전화번호 (phone number)
Text

Distinct: 125
Distinct (%): 34.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.0 KiB

Length

Max length: 12
Median length: 8
Mean length: 8.5978261
Min length: 4

Characters and Unicode

Total characters: 3164
Distinct characters: 22
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 103
Unique (%): 28.0%

Sample

1st row: 데이터 미집계 (data not collected)
2nd row: 개인정보 (personal information)
3rd row: 개인정보
4th row: 개인정보
5th row: 개인정보

Value | Count | Frequency (%)
데이터 | 96 | 20.7%
미집계 | 96 | 20.7%
개인정보 | 94 | 20.3%
055-381-8989 | 19 | 4.1%
055-388-8822 | 9 | 1.9%
051-317-5561 | 5 | 1.1%
051-314-7272 | 4 | 0.9%
055-365-1277 | 4 | 0.9%
055-381-5311 | 4 | 0.9%
051-311-3782 | 3 | 0.6%
Other values (116) | 130 | 28.0%
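
The 전화번호 column mixes real phone numbers with two placeholder strings, 데이터 미집계 and 개인정보, so Missing is reported as 0 even though many rows carry no number. The sketch below separates the two cases before any further analysis. The regular expression and the label names are assumptions chosen to fit the values visible in this report, not part of the dataset's specification.

```python
import re

# Assumed pattern for values such as 055-381-8989 or 051-317-5561.
PHONE_RE = re.compile(r"^\d{2,3}-\d{3,4}-\d{4}$")

# Placeholder strings seen in this report, mapped to hypothetical labels.
PLACEHOLDERS = {
    "데이터 미집계": "not_collected",
    "개인정보": "withheld",
}

def classify_phone(value):
    """Label a 전화번호 value as a phone number, a known placeholder, or other."""
    value = value.strip()
    if value in PLACEHOLDERS:
        return PLACEHOLDERS[value]
    if PHONE_RE.match(value):
        return "phone"
    return "other"

for sample in ["055-381-8989", "데이터 미집계", "개인정보"]:
    print(sample, "->", classify_phone(sample))
```

Treating the placeholders as missing values would give a more realistic picture of how complete the column is.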

Most occurring characters

ValueCountFrequency (%)
5 393
 
12.4%
- 351
 
11.1%
0 271
 
8.6%
8 215
 
6.8%
3 213
 
6.7%
1 177
 
5.6%
2 114
 
3.6%
7 113
 
3.6%
4 106
 
3.4%
96
 
3.0%
Other values (12) 1115
35.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1765 | 55.8%
Other Letter | 952 | 30.1%
Dash Punctuation | 351 | 11.1%
Space Separator | 96 | 3.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 393
22.3%
0 271
15.4%
8 215
12.2%
3 213
12.1%
1 177
10.0%
2 114
 
6.5%
7 113
 
6.4%
4 106
 
6.0%
6 94
 
5.3%
9 69
 
3.9%
Other Letter
ValueCountFrequency (%)
96
10.1%
96
10.1%
96
10.1%
96
10.1%
96
10.1%
96
10.1%
94
9.9%
94
9.9%
94
9.9%
94
9.9%
Dash Punctuation
ValueCountFrequency (%)
- 351
100.0%
Space Separator
ValueCountFrequency (%)
96
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2212 | 69.9%
Hangul | 952 | 30.1%

Most frequent character per script

Common
ValueCountFrequency (%)
5 393
17.8%
- 351
15.9%
0 271
12.3%
8 215
9.7%
3 213
9.6%
1 177
8.0%
2 114
 
5.2%
7 113
 
5.1%
4 106
 
4.8%
96
 
4.3%
Other values (2) 163
7.4%
Hangul
ValueCountFrequency (%)
96
10.1%
96
10.1%
96
10.1%
96
10.1%
96
10.1%
96
10.1%
94
9.9%
94
9.9%
94
9.9%
94
9.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2212 | 69.9%
Hangul | 952 | 30.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 393
17.8%
- 351
15.9%
0 271
12.3%
8 215
9.7%
3 213
9.6%
1 177
8.0%
2 114
 
5.2%
7 113
 
5.1%
4 106
 
4.8%
96
 
4.3%
Other values (2) 163
7.4%
Hangul
ValueCountFrequency (%)
96
10.1%
96
10.1%
96
10.1%
96
10.1%
96
10.1%
96
10.1%
94
9.9%
94
9.9%
94
9.9%
94
9.9%

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

Index | 업종 | 상호 | 영업소 주소 | 전화번호
0 | 이사화물 | 강남익스프레스 | 경상남도 양산시 북안북8길 5, 가동 107호 (북부동, 북부시장상가) | 데이터 미집계
1 | 이사화물 | 한화익스프레스 | 경상남도 양산시 물금읍 화합4길 3-17 | 개인정보
2 | 이사화물 | 금호익스프레스 | 경상남도 양산시 북정중앙로 46, 2층 (북정동) | 개인정보
3 | 이사화물 | 롯데익스프레스 | 경상남도 양산시 삼호동부10길 16 (삼호동) | 개인정보
4 | 이사화물 | 신세계익스프레스 | 경상남도 양산시 평산남로 54 (평산동) | 개인정보
5 | 이사화물 | 한솔익스프레스 | 경상남도 양산시 동면 금오4길 33 | 개인정보
6 | 이사화물 | 대신익스프레스 | 경상남도 양산시 북안북7길 39 (북부동, 양산아파트 관리동) | 개인정보
7 | 이사화물 | 삼호익스프레스 | 경상남도 양산시 삼성4길 8-11 (북정동) | 055-388-3535
8 | 이사화물 | 삼성익스프레스 | 경상남도 양산시 웅상대로 996-1 (주진동) | 055-387-7000
9 | 이사화물 | 이사천사 | 경상남도 양산시 양주3길 43-14, 왕뚜껑삼겹살 (중부동) | 개인정보

Index | 업종 | 상호 | 영업소 주소 | 전화번호
358 | 일반화물 | 해림화물(주) | 경상남도 양산시 옥곡8길 3 (남부동) | 051-314-7272
359 | 일반화물 | 해성물류(주) | 경상남도 양산시 옥곡8길 3 (남부동) | 055-389-0255
360 | 일반화물 | 흥진기업(주) | 경상남도 양산시 옥곡8길 3 (남부동) | 055-931-7789
361 | 일반화물 | (유)동경종합운수 | 경상남도 양산시 원동면 원동로 1702 | 051-463-3258
362 | 일반화물 | (유)동진운수 | 경상남도 양산시 원동면 원동로 1702 | 051-803-3111
363 | 일반화물 | (유)보영운수 | 경상남도 양산시 동면 남양산2길 20 | 051-317-5561
364 | 일반화물 | (유)송종합물류 | 경상남도 양산시 원동면 원동로 1702 | 051-463-3257
365 | 일반화물 | (유)태용물류 | 경상남도 양산시 동면 남양산2길 20 | 051-317-1120
366 | 일반화물 | (유)하이엘에스 | 경상남도 양산시 물금읍 신주로 16, 102동 2803호 | 데이터 미집계
367 | 일반화물 | (유)해태 | 경상남도 양산시 동면 남양산2길 20 | 055-381-8989