Overview

Dataset statistics

Number of variables6
Number of observations834
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.0 KiB
Average record size in memory49.2 B

Variable types

Categorical3
Text2
Numeric1

Dataset

Description울산광역시에서 등록된 관내 화물 운송업 등록 업체 명, 시도, 시군구, 업종(면허종류), 지번 주소, 보유 대수 등 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15114769/fileData.do

Alerts

시도 has constant value ""Constant
보유대수 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 보유대수High correlation
업종 is highly imbalanced (97.4%)Imbalance
보유대수 is highly skewed (γ1 = 22.9127489)Skewed
보유대수 has 84 (10.1%) zerosZeros

Reproduction

Analysis started2023-12-12 19:31:49.595072
Analysis finished2023-12-12 19:31:50.313254
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
울산광역시
834 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산광역시
2nd row울산광역시
3rd row울산광역시
4th row울산광역시
5th row울산광역시

Common Values

ValueCountFrequency (%)
울산광역시 834
100.0%

Length

2023-12-13T04:31:50.378932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:50.469741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산광역시 834
100.0%

시군구
Categorical

Distinct5
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
남구
309 
울주군
280 
북구
169 
울산 중구
60 
동구
 
16

Length

Max length5
Median length2
Mean length2.5515588
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산 중구
2nd row남구
3rd row남구
4th row남구
5th row남구

Common Values

ValueCountFrequency (%)
남구 309
37.1%
울주군 280
33.6%
북구 169
20.3%
울산 중구 60
 
7.2%
동구 16
 
1.9%

Length

2023-12-13T04:31:50.579006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:50.709602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남구 309
34.6%
울주군 280
31.3%
북구 169
18.9%
울산 60
 
6.7%
중구 60
 
6.7%
동구 16
 
1.8%
Distinct831
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2023-12-13T04:31:50.906010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17
Mean length7.6342926
Min length2

Characters and Unicode

Total characters6367
Distinct characters326
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique828 ?
Unique (%)99.3%

Sample

1st row개인차량
2nd row진양물류(주)
3rd row현대트레일러(주)
4th row(주)진보특수
5th row조운종합물류(주)
ValueCountFrequency (%)
하나스카이 2
 
0.2%
울산지점 2
 
0.2%
울산영업소 2
 
0.2%
울산스카이 2
 
0.2%
일반화물 2
 
0.2%
현대스카이 2
 
0.2%
동성운수 2
 
0.2%
주)제이엠로지스 1
 
0.1%
최병구(개별허가 1
 
0.1%
피케이종합물류(주 1
 
0.1%
Other values (833) 833
98.0%
2023-12-13T04:31:51.289591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 613
 
9.6%
( 613
 
9.6%
454
 
7.1%
197
 
3.1%
194
 
3.0%
160
 
2.5%
139
 
2.2%
138
 
2.2%
125
 
2.0%
123
 
1.9%
Other values (316) 3611
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4759
74.7%
Close Punctuation 613
 
9.6%
Open Punctuation 613
 
9.6%
Decimal Number 332
 
5.2%
Space Separator 16
 
0.3%
Math Symbol 16
 
0.3%
Uppercase Letter 14
 
0.2%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
454
 
9.5%
197
 
4.1%
194
 
4.1%
160
 
3.4%
139
 
2.9%
138
 
2.9%
125
 
2.6%
123
 
2.6%
110
 
2.3%
107
 
2.2%
Other values (289) 3012
63.3%
Decimal Number
ValueCountFrequency (%)
9 56
16.9%
8 52
15.7%
1 50
15.1%
2 37
11.1%
4 28
8.4%
5 27
8.1%
0 26
7.8%
6 21
 
6.3%
7 20
 
6.0%
3 15
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
S 3
21.4%
L 2
14.3%
O 2
14.3%
T 2
14.3%
B 1
 
7.1%
M 1
 
7.1%
J 1
 
7.1%
P 1
 
7.1%
H 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
. 1
25.0%
& 1
25.0%
Math Symbol
ValueCountFrequency (%)
> 8
50.0%
< 8
50.0%
Close Punctuation
ValueCountFrequency (%)
) 613
100.0%
Open Punctuation
ValueCountFrequency (%)
( 613
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4759
74.7%
Common 1594
 
25.0%
Latin 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
454
 
9.5%
197
 
4.1%
194
 
4.1%
160
 
3.4%
139
 
2.9%
138
 
2.9%
125
 
2.6%
123
 
2.6%
110
 
2.3%
107
 
2.2%
Other values (289) 3012
63.3%
Common
ValueCountFrequency (%)
) 613
38.5%
( 613
38.5%
9 56
 
3.5%
8 52
 
3.3%
1 50
 
3.1%
2 37
 
2.3%
4 28
 
1.8%
5 27
 
1.7%
0 26
 
1.6%
6 21
 
1.3%
Other values (8) 71
 
4.5%
Latin
ValueCountFrequency (%)
S 3
21.4%
L 2
14.3%
O 2
14.3%
T 2
14.3%
B 1
 
7.1%
M 1
 
7.1%
J 1
 
7.1%
P 1
 
7.1%
H 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4759
74.7%
ASCII 1608
 
25.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 613
38.1%
( 613
38.1%
9 56
 
3.5%
8 52
 
3.2%
1 50
 
3.1%
2 37
 
2.3%
4 28
 
1.7%
5 27
 
1.7%
0 26
 
1.6%
6 21
 
1.3%
Other values (17) 85
 
5.3%
Hangul
ValueCountFrequency (%)
454
 
9.5%
197
 
4.1%
194
 
4.1%
160
 
3.4%
139
 
2.9%
138
 
2.9%
125
 
2.6%
123
 
2.6%
110
 
2.3%
107
 
2.2%
Other values (289) 3012
63.3%

업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
일반화물
830 
개별화물
 
2
용달화물
 
1
<NA>
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row용달화물
2nd row일반화물
3rd row일반화물
4th row일반화물
5th row일반화물

Common Values

ValueCountFrequency (%)
일반화물 830
99.5%
개별화물 2
 
0.2%
용달화물 1
 
0.1%
<NA> 1
 
0.1%

Length

2023-12-13T04:31:51.431364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:51.524301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반화물 830
99.5%
개별화물 2
 
0.2%
용달화물 1
 
0.1%
na 1
 
0.1%

주소
Text

Distinct793
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2023-12-13T04:31:51.904699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length45
Mean length30.758993
Min length10

Characters and Unicode

Total characters25653
Distinct characters368
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique760 ?
Unique (%)91.1%

Sample

1st row울산 중구 성안동 506-2
2nd row부산 금정구 장전1동 205-2 우남이채룸A
3rd row부산 기장군 기장읍 서부리 77-5
4th row부산 남구 대연1동 891-12
5th row부산 해운대구 우1동 1434 썬프라자719호
ValueCountFrequency (%)
울산광역시 707
 
13.9%
울주군 308
 
6.1%
남구 277
 
5.5%
북구 162
 
3.2%
중구 65
 
1.3%
산업로 45
 
0.9%
2층 40
 
0.8%
온산읍 37
 
0.7%
101동 36
 
0.7%
청량읍 36
 
0.7%
Other values (1674) 3363
66.3%
2023-12-13T04:31:52.542028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4598
 
17.9%
1 1372
 
5.3%
1076
 
4.2%
1022
 
4.0%
0 860
 
3.4%
2 794
 
3.1%
775
 
3.0%
721
 
2.8%
716
 
2.8%
709
 
2.8%
Other values (358) 13010
50.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13735
53.5%
Decimal Number 5746
22.4%
Space Separator 4598
 
17.9%
Open Punctuation 481
 
1.9%
Close Punctuation 475
 
1.9%
Dash Punctuation 371
 
1.4%
Other Punctuation 184
 
0.7%
Uppercase Letter 55
 
0.2%
Lowercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1076
 
7.8%
1022
 
7.4%
775
 
5.6%
721
 
5.2%
716
 
5.2%
709
 
5.2%
574
 
4.2%
508
 
3.7%
405
 
2.9%
389
 
2.8%
Other values (322) 6840
49.8%
Uppercase Letter
ValueCountFrequency (%)
A 21
38.2%
B 11
20.0%
L 6
 
10.9%
H 5
 
9.1%
C 2
 
3.6%
S 2
 
3.6%
N 1
 
1.8%
U 1
 
1.8%
K 1
 
1.8%
G 1
 
1.8%
Other values (4) 4
 
7.3%
Decimal Number
ValueCountFrequency (%)
1 1372
23.9%
0 860
15.0%
2 794
13.8%
3 581
10.1%
6 436
 
7.6%
5 433
 
7.5%
4 432
 
7.5%
7 304
 
5.3%
9 276
 
4.8%
8 258
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 163
88.6%
/ 13
 
7.1%
@ 6
 
3.3%
# 1
 
0.5%
. 1
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
e 6
75.0%
t 1
 
12.5%
h 1
 
12.5%
Space Separator
ValueCountFrequency (%)
4598
100.0%
Open Punctuation
ValueCountFrequency (%)
( 481
100.0%
Close Punctuation
ValueCountFrequency (%)
) 475
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 371
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13735
53.5%
Common 11855
46.2%
Latin 63
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1076
 
7.8%
1022
 
7.4%
775
 
5.6%
721
 
5.2%
716
 
5.2%
709
 
5.2%
574
 
4.2%
508
 
3.7%
405
 
2.9%
389
 
2.8%
Other values (322) 6840
49.8%
Common
ValueCountFrequency (%)
4598
38.8%
1 1372
 
11.6%
0 860
 
7.3%
2 794
 
6.7%
3 581
 
4.9%
( 481
 
4.1%
) 475
 
4.0%
6 436
 
3.7%
5 433
 
3.7%
4 432
 
3.6%
Other values (9) 1393
 
11.8%
Latin
ValueCountFrequency (%)
A 21
33.3%
B 11
17.5%
e 6
 
9.5%
L 6
 
9.5%
H 5
 
7.9%
C 2
 
3.2%
S 2
 
3.2%
t 1
 
1.6%
h 1
 
1.6%
N 1
 
1.6%
Other values (7) 7
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13734
53.5%
ASCII 11918
46.5%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4598
38.6%
1 1372
 
11.5%
0 860
 
7.2%
2 794
 
6.7%
3 581
 
4.9%
( 481
 
4.0%
) 475
 
4.0%
6 436
 
3.7%
5 433
 
3.6%
4 432
 
3.6%
Other values (26) 1456
 
12.2%
Hangul
ValueCountFrequency (%)
1076
 
7.8%
1022
 
7.4%
775
 
5.6%
721
 
5.2%
716
 
5.2%
709
 
5.2%
574
 
4.2%
508
 
3.7%
405
 
2.9%
389
 
2.8%
Other values (321) 6839
49.8%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

보유대수
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct69
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.895683
Minimum0
Maximum2164
Zeros84
Zeros (%)10.1%
Negative0
Negative (%)0.0%
Memory size7.5 KiB
2023-12-13T04:31:52.726370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q35
95-th percentile38.35
Maximum2164
Range2164
Interquartile range (IQR)4

Descriptive statistics

Standard deviation82.695336
Coefficient of variation (CV)7.5897337
Kurtosis571.06984
Mean10.895683
Median Absolute Deviation (MAD)1
Skewness22.912749
Sum9087
Variance6838.5185
MonotonicityNot monotonic
2023-12-13T04:31:52.885070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 386
46.3%
0 84
 
10.1%
2 69
 
8.3%
3 37
 
4.4%
4 32
 
3.8%
5 18
 
2.2%
6 17
 
2.0%
7 17
 
2.0%
8 15
 
1.8%
10 13
 
1.6%
Other values (59) 146
 
17.5%
ValueCountFrequency (%)
0 84
 
10.1%
1 386
46.3%
2 69
 
8.3%
3 37
 
4.4%
4 32
 
3.8%
5 18
 
2.2%
6 17
 
2.0%
7 17
 
2.0%
8 15
 
1.8%
9 12
 
1.4%
ValueCountFrequency (%)
2164 1
0.1%
929 1
0.1%
180 1
0.1%
122 1
0.1%
106 1
0.1%
100 1
0.1%
95 1
0.1%
94 1
0.1%
82 1
0.1%
79 2
0.2%

Interactions

2023-12-13T04:31:50.042056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:31:52.989702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구업종보유대수
시군구1.0000.1100.138
업종0.1101.0000.988
보유대수0.1380.9881.000
2023-12-13T04:31:53.393426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구업종
시군구1.0000.083
업종0.0831.000
2023-12-13T04:31:53.500904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보유대수시군구업종
보유대수1.0000.1040.866
시군구0.1041.0000.083
업종0.8660.0831.000

Missing values

2023-12-13T04:31:50.161251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:31:50.272293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도시군구업체명업종주소보유대수
0울산광역시울산 중구개인차량용달화물울산 중구 성안동 506-22164
1울산광역시남구진양물류(주)일반화물부산 금정구 장전1동 205-2 우남이채룸A11
2울산광역시남구현대트레일러(주)일반화물부산 기장군 기장읍 서부리 77-56
3울산광역시남구(주)진보특수일반화물부산 남구 대연1동 891-128
4울산광역시남구조운종합물류(주)일반화물부산 해운대구 우1동 1434 썬프라자719호22
5울산광역시남구(주)흥국통운일반화물부산광역시 동구 자성로141번길 11 삼환오피스텔 605호18
6울산광역시북구롯데글로벌로지스(주)일반화물서울특별시 중구 통일로 10 (남대문로5가 84-11) 연세대학교 세브란스빌딩 10-11층30
7울산광역시남구(주)농협물류 울산영업소일반화물서울특별시 서대문구 통일로 87 NH농협생명빌딩 동관 7층6
8울산광역시울주군그린물류(박정호)일반화물울산 울주군 언양읍 미연1길 141
9울산광역시남구(주)비엔에프울산영업소일반화물남구 달동 남울산우체국 1314-1 남울산우체국2층1
시도시군구업체명업종주소보유대수
824울산광역시울주군(주)미소일반화물울산광역시 중구 종가로 406-21 ,1139호(복산동,혁신비지니스센터)12
825울산광역시울주군(주)지움일반화물울산광역시 중구 종가로 406-21 ,1139호(복산동,혁신비지니스센터)10
826울산광역시울산 중구울산로베드일반화물울산광역시 중구 종가로 668 우정LH1단지 112동 1006호(서동,우정LH1단지)1
827울산광역시울산 중구길임물류일반화물울산광역시 중구 학성로 1 마제스타워 102동 2141호 (우정동)1
828울산광역시울산 중구민성운수(권기원)일반화물울산광역시 중구 함월22길 24 벽산이빌리지104동 910호1
829울산광역시동구울산광고물협동조합일반화물울산시 동구 꽃바위로 344 방어동2
830울산광역시남구(주)태화특수차<울산영업소>일반화물인천 미추홀구 주안동 966-3 광해리드빌 102동 311호7
831울산광역시남구(주)삼성종합물류일반화물전북 군산시 산북동 1579-51
832울산광역시울주군무한로지스틱스(주)개별화물울주군 청량면 용암리 3111
833울산광역시울산 중구울산지부개별화물울산 중구 성안동 500-4호929