Overview

Dataset statistics

Number of variables5
Number of observations86
Missing cells42
Missing cells (%)9.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory42.5 B

Variable types

Text2
Categorical2
Numeric1

Dataset

Description충청북도 진천군 관내 개인 및 법인 일반화물자동차 운송사업자 현황(업체명, 면허종류(일반운송사업), 차량대수, 주소, 현재운영여부)입니다.
URLhttps://www.data.go.kr/data/15114809/fileData.do

Alerts

현재운영여부 has constant value ""Constant
주소 has 42 (48.8%) missing valuesMissing
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:40:10.382763
Analysis finished2023-12-12 08:40:10.890431
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size820.0 B
2023-12-12T17:40:11.065683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length5.7325581
Min length3

Characters and Unicode

Total characters493
Distinct characters143
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)100.0%

Sample

1st row(주)원로지스
2nd row만세물류㈜
3rd row(주)자원로직스
4th row(주)중부철강물류
5th row위드로지스 주식회사
ValueCountFrequency (%)
주식회사 10
 
10.3%
주)원로지스 1
 
1.0%
우이실업 1
 
1.0%
양기순 1
 
1.0%
㈜삼정 1
 
1.0%
㈜화랑운수 1
 
1.0%
진천현대서비스 1
 
1.0%
중부레카 1
 
1.0%
진천종합자동차(주 1
 
1.0%
동서익스프레스 1
 
1.0%
Other values (78) 78
80.4%
2023-12-12T17:40:11.460685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
6.7%
( 23
 
4.7%
) 23
 
4.7%
18
 
3.7%
16
 
3.2%
13
 
2.6%
13
 
2.6%
12
 
2.4%
12
 
2.4%
11
 
2.2%
Other values (133) 319
64.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 420
85.2%
Open Punctuation 23
 
4.7%
Close Punctuation 23
 
4.7%
Space Separator 11
 
2.2%
Other Symbol 10
 
2.0%
Uppercase Letter 5
 
1.0%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
7.9%
18
 
4.3%
16
 
3.8%
13
 
3.1%
13
 
3.1%
12
 
2.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
10
 
2.4%
Other values (123) 273
65.0%
Uppercase Letter
ValueCountFrequency (%)
L 1
20.0%
S 1
20.0%
I 1
20.0%
G 1
20.0%
O 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Other Symbol
ValueCountFrequency (%)
10
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 430
87.2%
Common 58
 
11.8%
Latin 5
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
7.7%
18
 
4.2%
16
 
3.7%
13
 
3.0%
13
 
3.0%
12
 
2.8%
12
 
2.8%
10
 
2.3%
10
 
2.3%
10
 
2.3%
Other values (124) 283
65.8%
Latin
ValueCountFrequency (%)
L 1
20.0%
S 1
20.0%
I 1
20.0%
G 1
20.0%
O 1
20.0%
Common
ValueCountFrequency (%)
( 23
39.7%
) 23
39.7%
11
19.0%
1 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 420
85.2%
ASCII 63
 
12.8%
None 10
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33
 
7.9%
18
 
4.3%
16
 
3.8%
13
 
3.1%
13
 
3.1%
12
 
2.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
10
 
2.4%
Other values (123) 273
65.0%
ASCII
ValueCountFrequency (%)
( 23
36.5%
) 23
36.5%
11
17.5%
1 1
 
1.6%
L 1
 
1.6%
S 1
 
1.6%
I 1
 
1.6%
G 1
 
1.6%
O 1
 
1.6%
None
ValueCountFrequency (%)
10
100.0%

면허종류
Categorical

Distinct2
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size820.0 B
(구)일반화물
70 
일반화물
16 

Length

Max length7
Median length7
Mean length6.4418605
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row(구)일반화물
2nd row(구)일반화물
3rd row(구)일반화물
4th row(구)일반화물
5th row(구)일반화물

Common Values

ValueCountFrequency (%)
(구)일반화물 70
81.4%
일반화물 16
 
18.6%

Length

2023-12-12T17:40:11.586622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:40:11.692950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구)일반화물 70
81.4%
일반화물 16
 
18.6%

주소
Text

MISSING 

Distinct37
Distinct (%)84.1%
Missing42
Missing (%)48.8%
Memory size820.0 B
2023-12-12T17:40:11.913796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length34
Mean length26.590909
Min length19

Characters and Unicode

Total characters1170
Distinct characters115
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)77.3%

Sample

1st row충청북도 진천군 초평면 초평로 1163, LG전자 진천물류센터
2nd row충청북도 진천군 이월면 진안로 89
3rd row충청북도 진천군 덕산읍 습지길 32, 대성물류(주)
4th row충청북도 진천군 진천읍 상신2길 128-7
5th row충청북도 진천군 덕산읍 도장길 215-15
ValueCountFrequency (%)
충청북도 44
17.6%
진천군 44
17.6%
덕산읍 17
 
6.8%
진천읍 8
 
3.2%
도장길 7
 
2.8%
215-15 7
 
2.8%
광혜원면 7
 
2.8%
진광로 5
 
2.0%
문백면 4
 
1.6%
초평면 4
 
1.6%
Other values (87) 103
41.2%
2023-12-12T17:40:12.293691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
206
 
17.6%
69
 
5.9%
59
 
5.0%
51
 
4.4%
44
 
3.8%
44
 
3.8%
44
 
3.8%
44
 
3.8%
1 41
 
3.5%
2 28
 
2.4%
Other values (105) 540
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 716
61.2%
Space Separator 206
 
17.6%
Decimal Number 179
 
15.3%
Dash Punctuation 21
 
1.8%
Other Punctuation 20
 
1.7%
Open Punctuation 13
 
1.1%
Close Punctuation 13
 
1.1%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
9.6%
59
 
8.2%
51
 
7.1%
44
 
6.1%
44
 
6.1%
44
 
6.1%
44
 
6.1%
27
 
3.8%
25
 
3.5%
24
 
3.4%
Other values (88) 285
39.8%
Decimal Number
ValueCountFrequency (%)
1 41
22.9%
2 28
15.6%
5 24
13.4%
3 21
11.7%
4 16
 
8.9%
8 12
 
6.7%
7 11
 
6.1%
6 11
 
6.1%
9 8
 
4.5%
0 7
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
G 1
50.0%
Space Separator
ValueCountFrequency (%)
206
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 716
61.2%
Common 452
38.6%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
9.6%
59
 
8.2%
51
 
7.1%
44
 
6.1%
44
 
6.1%
44
 
6.1%
44
 
6.1%
27
 
3.8%
25
 
3.5%
24
 
3.4%
Other values (88) 285
39.8%
Common
ValueCountFrequency (%)
206
45.6%
1 41
 
9.1%
2 28
 
6.2%
5 24
 
5.3%
- 21
 
4.6%
3 21
 
4.6%
, 20
 
4.4%
4 16
 
3.5%
( 13
 
2.9%
) 13
 
2.9%
Other values (5) 49
 
10.8%
Latin
ValueCountFrequency (%)
L 1
50.0%
G 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 716
61.2%
ASCII 454
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
206
45.4%
1 41
 
9.0%
2 28
 
6.2%
5 24
 
5.3%
- 21
 
4.6%
3 21
 
4.6%
, 20
 
4.4%
4 16
 
3.5%
( 13
 
2.9%
) 13
 
2.9%
Other values (7) 51
 
11.2%
Hangul
ValueCountFrequency (%)
69
 
9.6%
59
 
8.2%
51
 
7.1%
44
 
6.1%
44
 
6.1%
44
 
6.1%
44
 
6.1%
27
 
3.8%
25
 
3.5%
24
 
3.4%
Other values (88) 285
39.8%

차량대수
Real number (ℝ)

Distinct15
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5465116
Minimum1
Maximum37
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-12T17:40:12.410825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34.5
95-th percentile22.5
Maximum37
Range36
Interquartile range (IQR)3.5

Descriptive statistics

Standard deviation7.666055
Coefficient of variation (CV)1.68614
Kurtosis8.4638453
Mean4.5465116
Median Absolute Deviation (MAD)0
Skewness2.9486045
Sum391
Variance58.768399
MonotonicityNot monotonic
2023-12-12T17:40:12.531590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 44
51.2%
2 15
 
17.4%
5 6
 
7.0%
3 5
 
5.8%
7 4
 
4.7%
8 2
 
2.3%
37 2
 
2.3%
16 1
 
1.2%
20 1
 
1.2%
6 1
 
1.2%
Other values (5) 5
 
5.8%
ValueCountFrequency (%)
1 44
51.2%
2 15
 
17.4%
3 5
 
5.8%
5 6
 
7.0%
6 1
 
1.2%
7 4
 
4.7%
8 2
 
2.3%
11 1
 
1.2%
16 1
 
1.2%
20 1
 
1.2%
ValueCountFrequency (%)
37 2
2.3%
30 1
 
1.2%
27 1
 
1.2%
23 1
 
1.2%
21 1
 
1.2%
20 1
 
1.2%
16 1
 
1.2%
11 1
 
1.2%
8 2
2.3%
7 4
4.7%

현재운영여부
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size820.0 B
운영
86 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운영
2nd row운영
3rd row운영
4th row운영
5th row운영

Common Values

ValueCountFrequency (%)
운영 86
100.0%

Length

2023-12-12T17:40:12.683137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:40:12.822026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영 86
100.0%

Interactions

2023-12-12T17:40:10.616896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:40:12.886754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명면허종류주소차량대수
업체명1.0001.0001.0001.000
면허종류1.0001.0000.3950.351
주소1.0000.3951.0000.000
차량대수1.0000.3510.0001.000
2023-12-12T17:40:13.001605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량대수면허종류
차량대수1.0000.334
면허종류0.3341.000

Missing values

2023-12-12T17:40:10.747720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:40:10.849077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명면허종류주소차량대수현재운영여부
0(주)원로지스(구)일반화물충청북도 진천군 초평면 초평로 1163, LG전자 진천물류센터3운영
1만세물류㈜(구)일반화물충청북도 진천군 이월면 진안로 8916운영
2(주)자원로직스(구)일반화물충청북도 진천군 덕산읍 습지길 32, 대성물류(주)2운영
3(주)중부철강물류(구)일반화물충청북도 진천군 진천읍 상신2길 128-78운영
4위드로지스 주식회사(구)일반화물충청북도 진천군 덕산읍 도장길 215-155운영
5태동물류(주)(구)일반화물충청북도 진천군 덕산읍 도장길 215-151운영
6주식회사 대교특수화물(구)일반화물충청북도 진천군 문백면 송강로 114, 대교전국화물37운영
7김용민(구)일반화물<NA>1운영
8주식회사 평안운수(구)일반화물충청북도 진천군 문백면 문진로 468-131운영
9양수호(구)일반화물<NA>2운영
업체명면허종류주소차량대수현재운영여부
76(주)길벚운송일반화물충청북도 진천군 광혜원면 생거진천로 2642-61운영
77(주)비로지스일반화물충청북도 진천군 이월면 생거진천로 1967, 동호휴게소1운영
78주식회사 에스아이엠물류일반화물충청북도 진천군 덕산읍 산수산단3로 47 (주)더큰정성1운영
79이기호일반화물<NA>1운영
80김영남일반화물<NA>1운영
81(주)유원물류일반화물충청북도 진천군 덕산읍 초금로 318-51, 롯데알루미늄(주)진천공장1운영
82삼현 LOGIS일반화물<NA>1운영
83해성운수(주)일반화물충청북도 진천군 덕산읍 도장길 215-1523운영
84(주)영보케미칼일반화물충청북도 진천군 진천읍 금사로 208 (주)영보케미컬27운영
85이용희일반화물<NA>2운영