Overview

Dataset statistics

Number of variables: 7
Number of observations: 53
Missing cells: 25
Missing cells (%): 6.7%
Duplicate rows: 1
Duplicate rows (%): 1.9%
Total size in memory: 3.1 KiB
Average record size in memory: 59.5 B
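The two percentages above follow directly from the raw counts: missing cells are measured against all cells (observations × variables), and duplicate rows against all observations. A minimal sketch of that arithmetic, using only the numbers reported here (the helper name `pct` is illustrative and not part of ydata-profiling):

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


n_variables = 7
n_observations = 53
missing_cells = 25
duplicate_rows = 1

# 25 missing cells out of 7 * 53 = 371 cells in total.
missing_pct = pct(missing_cells, n_variables * n_observations)
# 1 duplicate row out of 53 rows.
duplicate_pct = pct(duplicate_rows, n_observations)

print(missing_pct, duplicate_pct)
```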

Variable types

Categorical: 3
Text: 3
DateTime: 1

Dataset

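Description: Permit status data for freight trucking business operators in Icheon-si, Gyeonggi-do. It provides the city/county name (시군명), business type (업종), company name (업체명), location address (소재지주소), phone number (전화번호), and data reference date (데이터기준일자).
Author: Icheon-si, Gyeonggi-do (경기도 이천시)
URL: https://www.data.go.kr/data/15123632/fileData.do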

Alerts

시군명 has constant value "이천시" (Constant)
데이터기준일자 has constant value "2023-09-15" (Constant)
Dataset has 1 (1.9%) duplicate rows (Duplicates)
업종 is highly overall correlated with 보유차량대수 (High correlation)
보유차량대수 is highly overall correlated with 업종 (High correlation)
전화번호 has 25 (47.2%) missing values (Missing)
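The Constant and Missing alerts can be reproduced by hand on a few rows. The sketch below is not the ydata-profiling implementation; it only illustrates the two checks on a small subset of rows and columns taken from the Sample section, with `None` standing in for a missing phone number. The function names are hypothetical.

```python
def constant_columns(rows):
    """Return the columns whose non-missing values are all identical."""
    constant = []
    for col in rows[0]:
        values = {row[col] for row in rows if row[col] is not None}
        if len(values) == 1:
            constant.append(col)
    return constant


def missing_share(rows, col):
    """Return the fraction of rows in which `col` is missing (None)."""
    return sum(row[col] is None for row in rows) / len(rows)


# Four rows (indices 0, 1, 8 and 46) restricted to three columns.
rows = [
    {"시군명": "이천시", "업종": "일반화물", "전화번호": "031-637-4130"},
    {"시군명": "이천시", "업종": "일반화물", "전화번호": "02-598-3001"},
    {"시군명": "이천시", "업종": "일반화물", "전화번호": None},
    {"시군명": "이천시", "업종": "개인화물", "전화번호": None},
]

constant = constant_columns(rows)
phone_missing = missing_share(rows, "전화번호")
print(constant, phone_missing)
```

On the full dataset the same missing-share calculation gives 25 / 53, which is the 47.2% shown in the alert.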

Reproduction

Analysis started: 2023-12-12 09:31:48.499849
Analysis finished: 2023-12-12 09:31:49.023028
Duration: 0.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
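A report of this kind is typically regenerated by loading the source file into pandas and passing the frame to `ProfileReport`. The following is a minimal sketch under stated assumptions: the CSV path, output path, report title and encoding are all hypothetical (the file behind the URL above may use a different encoding, such as CP949), and the exact configuration used for this report lives in config.json, which is not reproduced here.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile the CSV at `csv_path` and write an HTML report to `html_path`."""
    # Imported lazily so this module can be loaded without the dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Icheon-si freight operator permits")
    profile.to_file(html_path)


# Hypothetical usage:
# build_report("icheon_freight_permits.csv", "report.html")
```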

Variables

시군명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 이천시
2nd row: 이천시
3rd row: 이천시
4th row: 이천시
5th row: 이천시

Common Values

Value | Count | Frequency (%)
이천시 | 53 | 100.0%

[Figure: Histogram of lengths of the category]
[Figure: Common Values (Plot)]

업종
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반화물
2nd row: 일반화물
3rd row: 일반화물
4th row: 일반화물
5th row: 일반화물

Common Values

Value | Count | Frequency (%)
일반화물 | 46 | 86.8%
개인화물 | 7 | 13.2%

[Figure: Histogram of lengths of the category]
[Figure: Common Values (Plot)]

업체명
Text

Distinct: 52
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 16
Median length: 14
Mean length: 6.8867925
Min length: 4

Characters and Unicode

Total characters: 365
Distinct characters: 107
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
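As an illustration of the category counts reported for the text columns, the sketch below tallies characters by Unicode general category with Python's standard `unicodedata` module. It is not how ydata-profiling computes these tables; the name mapping covers only the categories that appear in this report, and scripts and blocks are left out because the standard library does not expose them. The two input strings are taken from the Sample rows.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "So": "Other Symbol",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pc": "Connector Punctuation",
}


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


name_counts = category_counts("씨제이대한통운㈜ 이천영업소")
address_counts = category_counts("경기도 이천시 설성면 설가로 250-29")
print(name_counts)
print(address_counts)
```

Hangul syllables fall under Other Letter, while the enclosed form ㈜ is classified as Other Symbol.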

Unique

Unique: 51
Unique (%): 96.2%

Sample

1st row: 씨제이대한통운㈜ 이천영업소
2nd row: 동국상운㈜
3rd row: 동국상운㈜
4th row: ㈜유니리버
5th row: ㈜대통운수

Value | Count | Frequency (%)
이천영업소 | 6 | 10.0%
동국상운㈜ | 2 | 3.3%
주)에스엘다현물류 | 1 | 1.7%
아트로지스㈜ | 1 | 1.7%
㈜설봉메트로 | 1 | 1.7%
㈜가야로직스 | 1 | 1.7%
㈜코어로지스 | 1 | 1.7%
설봉렉카㈜ | 1 | 1.7%
㈜대상로직스 | 1 | 1.7%
㈜케어로지스 | 1 | 1.7%
Other values (44) | 44 | 73.3%

Most occurring characters

ValueCountFrequency (%)
47
 
12.9%
24
 
6.6%
17
 
4.7%
16
 
4.4%
14
 
3.8%
12
 
3.3%
12
 
3.3%
11
 
3.0%
11
 
3.0%
8
 
2.2%
Other values (97) 193
52.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 306 | 83.8%
Other Symbol | 47 | 12.9%
Space Separator | 7 | 1.9%
Close Punctuation | 2 | 0.5%
Open Punctuation | 2 | 0.5%
Connector Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
7.8%
17
 
5.6%
16
 
5.2%
14
 
4.6%
12
 
3.9%
12
 
3.9%
11
 
3.6%
11
 
3.6%
8
 
2.6%
7
 
2.3%
Other values (92) 174
56.9%
Other Symbol
ValueCountFrequency (%)
47
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 353 | 96.7%
Common | 12 | 3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
13.3%
24
 
6.8%
17
 
4.8%
16
 
4.5%
14
 
4.0%
12
 
3.4%
12
 
3.4%
11
 
3.1%
11
 
3.1%
8
 
2.3%
Other values (93) 181
51.3%
Common
ValueCountFrequency (%)
7
58.3%
) 2
 
16.7%
( 2
 
16.7%
_ 1
 
8.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 306 | 83.8%
None | 47 | 12.9%
ASCII | 12 | 3.3%

Most frequent character per block

None
ValueCountFrequency (%)
47
100.0%
Hangul
ValueCountFrequency (%)
24
 
7.8%
17
 
5.6%
16
 
5.2%
14
 
4.6%
12
 
3.9%
12
 
3.9%
11
 
3.6%
11
 
3.6%
8
 
2.6%
7
 
2.3%
Other values (92) 174
56.9%
ASCII
ValueCountFrequency (%)
7
58.3%
) 2
 
16.7%
( 2
 
16.7%
_ 1
 
8.3%

소재지주소
Text

Distinct: 49
Distinct (%): 92.5%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 29
Median length: 26
Mean length: 20.792453
Min length: 14

Characters and Unicode

Total characters: 1102
Distinct characters: 70
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 45
Unique (%): 84.9%

Sample

1st row: 경기도 이천시 대월면 대평로 131
2nd row: 경기도 이천시 설성면 설가로 250-29
3rd row: 경기도 이천시 설성면 설가로 250-29
4th row: 경기도 이천시 마장면 덕평로 626
5th row: 경기도 이천시 마장면 마도로 177

Value | Count | Frequency (%)
경기도 | 52 | 20.2%
이천시 | 52 | 20.2%
부발읍 | 12 | 4.7%
마장면 | 11 | 4.3%
경충대로 | 7 | 2.7%
신둔면 | 6 | 2.3%
대월면 | 5 | 1.9%
덕평로 | 5 | 1.9%
호법면 | 4 | 1.6%
황무로 | 4 | 1.6%
Other values (82) | 99 | 38.5%

Most occurring characters

ValueCountFrequency (%)
204
18.5%
59
 
5.4%
58
 
5.3%
58
 
5.3%
53
 
4.8%
53
 
4.8%
53
 
4.8%
52
 
4.7%
1 44
 
4.0%
3 31
 
2.8%
Other values (60) 437
39.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 658 | 59.7%
Decimal Number | 225 | 20.4%
Space Separator | 204 | 18.5%
Dash Punctuation | 15 | 1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
9.0%
58
 
8.8%
58
 
8.8%
53
 
8.1%
53
 
8.1%
53
 
8.1%
52
 
7.9%
31
 
4.7%
25
 
3.8%
19
 
2.9%
Other values (48) 197
29.9%
Decimal Number
Value | Count | Frequency (%)
1 | 44 | 19.6%
3 | 31 | 13.8%
2 | 30 | 13.3%
4 | 24 | 10.7%
7 | 20 | 8.9%
0 | 19 | 8.4%
5 | 19 | 8.4%
9 | 16 | 7.1%
6 | 11 | 4.9%
8 | 11 | 4.9%
Space Separator
ValueCountFrequency (%)
204
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 658 | 59.7%
Common | 444 | 40.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
9.0%
58
 
8.8%
58
 
8.8%
53
 
8.1%
53
 
8.1%
53
 
8.1%
52
 
7.9%
31
 
4.7%
25
 
3.8%
19
 
2.9%
Other values (48) 197
29.9%
Common
ValueCountFrequency (%)
204
45.9%
1 44
 
9.9%
3 31
 
7.0%
2 30
 
6.8%
4 24
 
5.4%
7 20
 
4.5%
0 19
 
4.3%
5 19
 
4.3%
9 16
 
3.6%
- 15
 
3.4%
Other values (2) 22
 
5.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 658 | 59.7%
ASCII | 444 | 40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
204
45.9%
1 44
 
9.9%
3 31
 
7.0%
2 30
 
6.8%
4 24
 
5.4%
7 20
 
4.5%
0 19
 
4.3%
5 19
 
4.3%
9 16
 
3.6%
- 15
 
3.4%
Other values (2) 22
 
5.0%
Hangul
ValueCountFrequency (%)
59
 
9.0%
58
 
8.8%
58
 
8.8%
53
 
8.1%
53
 
8.1%
53
 
8.1%
52
 
7.9%
31
 
4.7%
25
 
3.8%
19
 
2.9%
Other values (48) 197
29.9%

보유차량대수
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.6037736
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 46 | 86.8%
1 | 7 | 13.2%

[Figure: Histogram of lengths of the category]
[Figure: Common Values (Plot)]

전화번호
Text

MISSING 

Distinct: 27
Distinct (%): 96.4%
Missing: 25
Missing (%): 47.2%
Memory size: 556.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.892857
Min length: 9

Characters and Unicode

Total characters: 333
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 92.9%
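The percentages for 전화번호 use two different denominators. Missing (%) is taken over all 53 rows, whereas the Distinct (%), Unique (%) and Mean length figures are consistent with being taken over the 28 non-missing values only. A short check of that arithmetic, using the counts reported in this section (the variable names are illustrative):

```python
n_observations = 53
n_missing = 25
n_distinct = 27
n_unique = 26           # values that occur exactly once
total_characters = 333

# Rows that actually carry a phone number.
n_present = n_observations - n_missing

missing_pct = round(100 * n_missing / n_observations, 1)
distinct_pct = round(100 * n_distinct / n_present, 1)
unique_pct = round(100 * n_unique / n_present, 1)
mean_length = total_characters / n_present

print(n_present, missing_pct, distinct_pct, unique_pct, mean_length)
```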

Sample

1st row: 031-637-4130
2nd row: 02-598-3001
3rd row: 02-598-3001
4th row: 070-8676-8227
5th row: 031-637-6751

Value | Count | Frequency (%)
02-598-3001 | 2 | 7.1%
031-637-6153 | 1 | 3.6%
031-637-4130 | 1 | 3.6%
031-631-2738 | 1 | 3.6%
031-633-8796 | 1 | 3.6%
031-341-1515 | 1 | 3.6%
1688-9368 | 1 | 3.6%
031-637-2250 | 1 | 3.6%
031-633-2708 | 1 | 3.6%
031-637-8872 | 1 | 3.6%
Other values (17) | 17 | 60.7%

Most occurring characters

Value | Count | Frequency (%)
3 | 56 | 16.8%
- | 55 | 16.5%
0 | 44 | 13.2%
1 | 37 | 11.1%
6 | 37 | 11.1%
7 | 25 | 7.5%
2 | 24 | 7.2%
8 | 20 | 6.0%
5 | 16 | 4.8%
9 | 10 | 3.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 278 | 83.5%
Dash Punctuation | 55 | 16.5%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 56 | 20.1%
0 | 44 | 15.8%
1 | 37 | 13.3%
6 | 37 | 13.3%
7 | 25 | 9.0%
2 | 24 | 8.6%
8 | 20 | 7.2%
5 | 16 | 5.8%
9 | 10 | 3.6%
4 | 9 | 3.2%
Dash Punctuation
Value | Count | Frequency (%)
- | 55 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 333 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 56 | 16.8%
- | 55 | 16.5%
0 | 44 | 13.2%
1 | 37 | 11.1%
6 | 37 | 11.1%
7 | 25 | 7.5%
2 | 24 | 7.2%
8 | 20 | 6.0%
5 | 16 | 4.8%
9 | 10 | 3.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 333 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 56 | 16.8%
- | 55 | 16.5%
0 | 44 | 13.2%
1 | 37 | 11.1%
6 | 37 | 11.1%
7 | 25 | 7.5%
2 | 24 | 7.2%
8 | 20 | 6.0%
5 | 16 | 4.8%
9 | 10 | 3.0%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B
Minimum: 2023-09-15 00:00:00
Maximum: 2023-09-15 00:00:00
[Figure: Histogram with fixed size bins (bins=1)]

Correlations

Variable | 업종 | 업체명 | 소재지주소 | 전화번호
업종 | 1.000 | 1.000 | 0.429 | NaN
업체명 | 1.000 | 1.000 | 1.000 | 1.000
소재지주소 | 0.429 | 1.000 | 1.000 | 1.000
전화번호 | NaN | 1.000 | 1.000 | 1.000

Variable | 업종 | 보유차량대수
업종 | 1.000 | 1.000
보유차량대수 | 1.000 | 1.000
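A coefficient of 1.000 between 업종 and 보유차량대수 is what a perfect one-to-one association between two categorical columns produces. The report does not name the measure here, so the sketch below assumes an uncorrected Cramér's V. It also assumes the cross-tabulation implied by the value counts and the sample rows: all 46 일반화물 rows carry `<NA>` and all 7 개인화물 rows carry `1`. The function name is illustrative.

```python
import math


def cramers_v(table: list[list[int]]) -> float:
    """Uncorrected Cramér's V for a contingency table of counts."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Rows: 일반화물, 개인화물. Columns: <NA>, 1. (Assumed cross-tabulation.)
table = [[46, 0], [0, 7]]
v = cramers_v(table)
print(v)
```

With a diagonal table like this the chi-squared statistic equals the number of rows, so the coefficient reaches its maximum of 1.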

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

Index | 시군명 | 업종 | 업체명 | 소재지주소 | 보유차량대수 | 전화번호 | 데이터기준일자
0 | 이천시 | 일반화물 | 씨제이대한통운㈜ 이천영업소 | 경기도 이천시 대월면 대평로 131 | <NA> | 031-637-4130 | 2023-09-15
1 | 이천시 | 일반화물 | 동국상운㈜ | 경기도 이천시 설성면 설가로 250-29 | <NA> | 02-598-3001 | 2023-09-15
2 | 이천시 | 일반화물 | 동국상운㈜ | 경기도 이천시 설성면 설가로 250-29 | <NA> | 02-598-3001 | 2023-09-15
3 | 이천시 | 일반화물 | ㈜유니리버 | 경기도 이천시 마장면 덕평로 626 | <NA> | 070-8676-8227 | 2023-09-15
4 | 이천시 | 일반화물 | ㈜대통운수 | 경기도 이천시 마장면 마도로 177 | <NA> | 031-637-6751 | 2023-09-15
5 | 이천시 | 일반화물 | ㈜수광물류 | 경기도 이천시 신둔면 마소로 53 | <NA> | 070-7136-7686 | 2023-09-15
6 | 이천시 | 일반화물 | 신성로지스㈜ | 경기도 이천시 대월면 대월로 412 | <NA> | 02-2043-9955 | 2023-09-15
7 | 이천시 | 일반화물 | 정양산업㈜ 이천영업소 | 부산광역시 사하구 원양로 359 | <NA> | 031-634-7172 | 2023-09-15
8 | 이천시 | 일반화물 | 정우물류 | 경기도 이천시 부발읍 경충대로 1804번길 47 | <NA> | <NA> | 2023-09-15
9 | 이천시 | 일반화물 | ㈜진안물류 | 경기도 이천시 신둔면 서이천로 774 | <NA> | 031-631-9591 | 2023-09-15

Index | 시군명 | 업종 | 업체명 | 소재지주소 | 보유차량대수 | 전화번호 | 데이터기준일자
43 | 이천시 | 일반화물 | ㈜로얄운수 | 경기도 이천시 부발읍 중부대로1925번길 19 | <NA> | <NA> | 2023-09-15
44 | 이천시 | 일반화물 | 비즈로지스㈜ | 경기도 이천시 부발읍 경충대로 2314 | <NA> | <NA> | 2023-09-15
45 | 이천시 | 일반화물 | 두강운수협동조합 | 경기도 이천시 호법면 덕평로 217-57 | <NA> | <NA> | 2023-09-15
46 | 이천시 | 개인화물 | 모더스물류㈜ | 경기도 이천시 마장면 이장로311번길 91 | 1 | <NA> | 2023-09-15
47 | 이천시 | 개인화물 | 전국화물㈜ | 경기도 이천시 부발읍 황무로 1839 | 1 | <NA> | 2023-09-15
48 | 이천시 | 개인화물 | ㈜대현물류 | 경기도 이천시 증포로 63 | 1 | <NA> | 2023-09-15
49 | 이천시 | 개인화물 | 설봉스카이㈜ | 경기도 이천시 경충대로 3058 | 1 | <NA> | 2023-09-15
50 | 이천시 | 개인화물 | ㈜에이치에스라인 | 경기도 이천시 마장면 서이천로320번길 27-20 | 1 | <NA> | 2023-09-15
51 | 이천시 | 개인화물 | 케이스카이㈜ | 경기도 이천시 진상미로2232번길 105-13 | 1 | <NA> | 2023-09-15
52 | 이천시 | 개인화물 | ㈜베스트인로직스 | 경기도 이천시 마장면 서이천로 383 | 1 | <NA> | 2023-09-15

Duplicate rows

Most frequently occurring

Index | 시군명 | 업종 | 업체명 | 소재지주소 | 보유차량대수 | 전화번호 | 데이터기준일자 | # duplicates
0 | 이천시 | 일반화물 | 동국상운㈜ | 경기도 이천시 설성면 설가로 250-29 | <NA> | 02-598-3001 | 2023-09-15 | 2
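The duplicate listed above can be found by counting identical rows. The sketch below is a plain-Python illustration rather than the ydata-profiling implementation; it uses the first three sample rows (indices 0 to 2) as tuples, and the function name is hypothetical.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return each row that occurs more than once, with its occurrence count."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


rows = [
    ("이천시", "일반화물", "씨제이대한통운㈜ 이천영업소",
     "경기도 이천시 대월면 대평로 131", "<NA>", "031-637-4130", "2023-09-15"),
    ("이천시", "일반화물", "동국상운㈜",
     "경기도 이천시 설성면 설가로 250-29", "<NA>", "02-598-3001", "2023-09-15"),
    ("이천시", "일반화물", "동국상운㈜",
     "경기도 이천시 설성면 설가로 250-29", "<NA>", "02-598-3001", "2023-09-15"),
]

duplicates = duplicate_rows(rows)
# Rows beyond the first occurrence: this matches "Duplicate rows: 1" in the overview.
extra_rows = sum(n - 1 for n in duplicates.values())
print(duplicates, extra_rows)
```

The "# duplicates" column counts every occurrence (2), while the overview counts only the surplus copy (1).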