Overview

Dataset statistics

Number of variables: 6
Number of observations: 69
Missing cells: 42
Missing cells (%): 10.1%
Duplicate rows: 5
Duplicate rows (%): 7.2%
Total size in memory: 3.4 KiB
Average record size in memory: 50.9 B
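
The two percentages above can be checked from the counts. This is a minimal sketch, assuming the missing-cell share is taken over all cells (observations × variables) and the duplicate share over all rows. The variable names are illustrative and not part of the report.

```python
# Counts copied from the dataset statistics in this report.
n_variables = 6
n_observations = 69
missing_cells = 42
duplicate_rows = 5

total_cells = n_variables * n_observations            # 414 cells in total
missing_pct = 100 * missing_cells / total_cells       # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations  # share of duplicate rows

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Both results round to the reported figures (10.1% and 7.2%).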

Variable types

Categorical: 3
Text: 1
Numeric: 1
DateTime: 1

Dataset

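Description: Status of general freight trucking operators in Gongju-si, Chungcheongnam-do. Fields include license type (면허종류), company name (업체명), number of vehicles (차량대수), address (주소), current operating status (현재운영여부), and data reference date (데이터기준일).
URL: https://www.data.go.kr/data/15115245/fileData.do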

Alerts

[Constant] 데이터기준일 has constant value ""
[Duplicates] Dataset has 5 (7.2%) duplicate rows
[High correlation] 현재운영여부 is highly overall correlated with 차량대수 and 2 other fields
[High correlation] 면허종류 is highly overall correlated with 주소 and 1 other fields
[High correlation] 주소 is highly overall correlated with 차량대수 and 2 other fields
[High correlation] 차량대수 is highly overall correlated with 주소 and 1 other fields
[Missing] 업체명 has 14 (20.3%) missing values
[Missing] 차량대수 has 14 (20.3%) missing values
[Missing] 데이터기준일 has 14 (20.3%) missing values

Reproduction

Analysis started: 2023-12-12 15:53:52.420676
Analysis finished: 2023-12-12 15:53:53.324989
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

면허종류
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 684.0 B

(구)일반화물: 38
일반화물: 17
<NA>: 14

Length

Max length: 7
Median length: 7
Mean length: 5.6521739
Min length: 4
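
The length statistics can be reproduced from the category counts in this section. The sketch below assumes the 14 empty rows were counted as the literal 4-character string `<NA>`. That assumption is inferred from the numbers, not confirmed by the report.

```python
from statistics import mean, median

# Category counts copied from the report. "<NA>" is treated as a literal
# 4-character string (an assumption about how the empty rows were handled).
counts = {"(구)일반화물": 38, "일반화물": 17, "<NA>": 14}

# One length per row: repeat each category's length by its count.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

max_len, min_len = max(lengths), min(lengths)
median_len = median(lengths)
mean_len = mean(lengths)

print(max_len, median_len, round(mean_len, 7), min_len)
```

Under that assumption the mean is 390 / 69, which matches the reported 5.6521739. It also explains why the minimum length is 4 even though no real category is shorter than 일반화물 (also 4 characters).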

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반화물
2nd row: 일반화물
3rd row: 일반화물
4th row: 일반화물
5th row: 일반화물

Common Values

Value | Count | Frequency (%)
(구)일반화물 | 38 | 55.1%
일반화물 | 17 | 24.6%
<NA> | 14 | 20.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
구)일반화물 | 38 | 55.1%
일반화물 | 17 | 24.6%
na | 14 | 20.3%

업체명
Text

MISSING 

Distinct: 41
Distinct (%): 74.5%
Missing: 14
Missing (%): 20.3%
Memory size: 684.0 B
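
The percentages in the per-variable summaries use two different denominators. Judging from the figures in this report, Missing (%) is taken over all 69 rows, while Distinct (%) is taken over the non-missing rows only. The sketch below checks that reading against the reported values. The rule is inferred from the numbers, not taken from the library documentation.

```python
n_rows = 69

# column: (distinct, missing, reported distinct %), copied from this report.
summary = {
    "면허종류": (3, 0, 4.3),
    "업체명": (41, 14, 74.5),
    "차량대수": (14, 14, 25.5),
    "주소": (27, 0, 39.1),
    "현재운영여부": (2, 0, 2.9),
    "데이터기준일": (1, 14, 1.8),
}

distinct_pct = {}
for column, (distinct, missing, reported) in summary.items():
    present = n_rows - missing                    # rows with a value
    distinct_pct[column] = round(100 * distinct / present, 1)
    print(column, distinct_pct[column], reported)

missing_pct = round(100 * 14 / n_rows, 1)         # denominator: all rows
print("Missing (%):", missing_pct)
```

For 업체명 this gives 41 / 55 = 74.5%, and the 14 missing values give 14 / 69 = 20.3%.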

Length

Max length: 15
Median length: 11
Mean length: 5.1636364
Min length: 3

Characters and Unicode

Total characters: 284
Distinct characters: 92
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 63.6%

Sample

1st row: 한현렉카
2nd row: (주)행복드림통운
3rd row: (주)중앙이엔비
4th row: 공주개별렉카
5th row: 제이엘물류 주식회사
Value | Count | Frequency (%)
(blank) | 6 | 10.3%
(blank) | 5 | 8.6%
(blank) | 3 | 5.2%
(blank) | 2 | 3.4%
(blank) | 2 | 3.4%
(blank) | 2 | 3.4%
공주물류 | 1 | 1.7%
(blank) | 1 | 1.7%
충남특수렉카 | 1 | 1.7%
대광특수렉카 | 1 | 1.7%
Other values (34) | 34 | 58.6%

Most occurring characters

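Value | Count | Frequency (%)
* | 54 | 19.0%
(blank) | 20 | 7.0%
) | 15 | 5.3%
( | 15 | 5.3%
(blank) | 9 | 3.2%
(blank) | 9 | 3.2%
(blank) | 9 | 3.2%
(blank) | 7 | 2.5%
(blank) | 7 | 2.5%
(blank) | 6 | 2.1%
Other values (82) | 133 | 46.8%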

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 195 | 68.7%
Other Punctuation | 54 | 19.0%
Close Punctuation | 15 | 5.3%
Open Punctuation | 15 | 5.3%
Space Separator | 3 | 1.1%
Decimal Number | 1 | 0.4%
Other Symbol | 1 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(blank) | 20 | 10.3%
(blank) | 9 | 4.6%
(blank) | 9 | 4.6%
(blank) | 9 | 4.6%
(blank) | 7 | 3.6%
(blank) | 7 | 3.6%
(blank) | 6 | 3.1%
(blank) | 5 | 2.6%
(blank) | 5 | 2.6%
(blank) | 4 | 2.1%
Other values (76) | 114 | 58.5%

Other Punctuation
Value | Count | Frequency (%)
* | 54 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 15 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 15 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 3 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(blank) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 196 | 69.0%
Common | 88 | 31.0%

Most frequent character per script

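Hangul
Value | Count | Frequency (%)
(blank) | 20 | 10.2%
(blank) | 9 | 4.6%
(blank) | 9 | 4.6%
(blank) | 9 | 4.6%
(blank) | 7 | 3.6%
(blank) | 7 | 3.6%
(blank) | 6 | 3.1%
(blank) | 5 | 2.6%
(blank) | 5 | 2.6%
(blank) | 4 | 2.0%
Other values (77) | 115 | 58.7%

Common
Value | Count | Frequency (%)
* | 54 | 61.4%
) | 15 | 17.0%
( | 15 | 17.0%
(space) | 3 | 3.4%
2 | 1 | 1.1%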

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 195 | 68.7%
ASCII | 88 | 31.0%
None | 1 | 0.4%

Most frequent character per block

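ASCII
Value | Count | Frequency (%)
* | 54 | 61.4%
) | 15 | 17.0%
( | 15 | 17.0%
(space) | 3 | 3.4%
2 | 1 | 1.1%

Hangul
Value | Count | Frequency (%)
(blank) | 20 | 10.3%
(blank) | 9 | 4.6%
(blank) | 9 | 4.6%
(blank) | 9 | 4.6%
(blank) | 7 | 3.6%
(blank) | 7 | 3.6%
(blank) | 6 | 3.1%
(blank) | 5 | 2.6%
(blank) | 5 | 2.6%
(blank) | 4 | 2.1%
Other values (76) | 114 | 58.5%

None
Value | Count | Frequency (%)
(blank) | 1 | 100.0%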

차량대수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 14
Distinct (%): 25.5%
Missing: 14
Missing (%): 20.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.8363636
Minimum: 1
Maximum: 44
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 753.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 2
Q3: 4.5
95-th percentile: 17.7
Maximum: 44
Range: 43
Interquartile range (IQR): 3.5

Descriptive statistics

Standard deviation: 8.2344436
Coefficient of variation (CV): 1.7026105
Kurtosis: 13.577889
Mean: 4.8363636
Median Absolute Deviation (MAD): 1
Skewness: 3.5869006
Sum: 266
Variance: 67.806061
Monotonicity: Not monotonic
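
Most of these figures can be recomputed from the value counts listed for 차량대수 in this section (14 distinct values, 55 non-missing rows). This sketch uses only Python's `statistics` module. It assumes the reported variance is the sample variance (n − 1 denominator) and that percentiles use linear interpolation. Both assumptions are inferred from the reported numbers. The helper `percentile` is illustrative.

```python
from statistics import mean, median, stdev, variance

# Value -> count, copied from the 차량대수 frequency tables in this report.
freq = {1: 21, 2: 10, 3: 8, 4: 2, 5: 4, 6: 2, 7: 1, 8: 1,
        11: 1, 14: 1, 15: 1, 24: 1, 38: 1, 44: 1}
values = sorted(v for v, n in freq.items() for _ in range(n))

def percentile(sorted_values, q):
    """Percentile (q in [0, 1]) with linear interpolation between ranks."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

total = sum(values)
avg = mean(values)
var = variance(values)                 # sample variance, n - 1 denominator
std = stdev(values)
cv = std / avg                         # coefficient of variation
med = median(values)
mad = median(abs(v - med) for v in values)
q1, q3 = percentile(values, 0.25), percentile(values, 0.75)
p95 = percentile(values, 0.95)

print(total, round(avg, 7), round(var, 6), round(std, 7), round(cv, 7))
print(med, mad, q1, q3, q3 - q1, round(p95, 1))
```

The large gap between the mean (about 4.84) and the median (2) comes from a few large fleets (24, 38, and 44 vehicles). This is consistent with the high skewness reported above.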
Histogram with fixed size bins (bins=14)
Value | Count | Frequency (%)
1 | 21 | 30.4%
2 | 10 | 14.5%
3 | 8 | 11.6%
5 | 4 | 5.8%
4 | 2 | 2.9%
6 | 2 | 2.9%
15 | 1 | 1.4%
44 | 1 | 1.4%
24 | 1 | 1.4%
11 | 1 | 1.4%
Other values (4) | 4 | 5.8%
(Missing) | 14 | 20.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 21 | 30.4%
2 | 10 | 14.5%
3 | 8 | 11.6%
4 | 2 | 2.9%
5 | 4 | 5.8%
6 | 2 | 2.9%
7 | 1 | 1.4%
8 | 1 | 1.4%
11 | 1 | 1.4%
14 | 1 | 1.4%

Maximum 10 values
Value | Count | Frequency (%)
44 | 1 | 1.4%
38 | 1 | 1.4%
24 | 1 | 1.4%
15 | 1 | 1.4%
14 | 1 | 1.4%
11 | 1 | 1.4%
8 | 1 | 1.4%
7 | 1 | 1.4%
6 | 2 | 2.9%
5 | 4 | 5.8%

주소
Categorical

HIGH CORRELATION 

Distinct: 27
Distinct (%): 39.1%
Missing: 0
Missing (%): 0.0%
Memory size: 684.0 B

충청남도 공주시 ********: 28
<NA>: 14
충청남도 공주시 우성면 갓바위길 17-7: 2
충청남도 공주시 장기로 160 (금흥동): 2
충청남도 공주시 탄천면 금백로 1299: 1
Other values (22): 22

Length

Max length: 37
Median length: 31
Mean length: 16.927536
Min length: 4

Unique

Unique: 23
Unique (%): 33.3%

Sample

1st row: 충청남도 공주시 장기로 160 (금흥동)
2nd row: 충청남도 공주시 금흥안터길 32 (금흥동)
3rd row: 충청남도 공주시 탄천면 차돌배기길 72-14
4th row: 충청남도 공주시 웅진로 119 (중학동)
5th row: 충청남도 공주시 이인면 은행안길 19-6

Common Values

Value | Count | Frequency (%)
충청남도 공주시 ******** | 28 | 40.6%
<NA> | 14 | 20.3%
충청남도 공주시 우성면 갓바위길 17-7 | 2 | 2.9%
충청남도 공주시 장기로 160 (금흥동) | 2 | 2.9%
충청남도 공주시 탄천면 금백로 1299 | 1 | 1.4%
충청남도 공주시 우성면 질마고개길 21, 주건축물 제2동 | 1 | 1.4%
충청남도 공주시 웅진로 119 (중학동) | 1 | 1.4%
충청남도 공주시 미나리3길 16-7, 2층 (금성동) | 1 | 1.4%
충청남도 공주시 장골길 19 (금흥동) | 1 | 1.4%
충청남도 공주시 의당면 문화마을길 29-3 | 1 | 1.4%
Other values (17) | 17 | 24.6%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
충청남도 | 55 | 23.1%
공주시 | 55 | 23.1%
(blank) | 28 | 11.8%
na | 14 | 5.9%
신관동 | 5 | 2.1%
금흥동 | 4 | 1.7%
우성면 | 3 | 1.3%
무령로 | 3 | 1.3%
17-7 | 2 | 0.8%
금성동 | 2 | 0.8%
Other values (62) | 67 | 28.2%

현재운영여부
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 684.0 B

운영: 55
<NA>: 14

Length

Max length: 4
Median length: 2
Mean length: 2.4057971
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 운영
2nd row: 운영
3rd row: 운영
4th row: 운영
5th row: 운영

Common Values

Value | Count | Frequency (%)
운영 | 55 | 79.7%
<NA> | 14 | 20.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
운영 | 55 | 79.7%
na | 14 | 20.3%

데이터기준일
Date

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 1.8%
Missing: 14
Missing (%): 20.3%
Memory size: 684.0 B
Minimum: 2023-06-25 00:00:00
Maximum: 2023-06-25 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 면허종류 | 업체명 | 차량대수 | 주소
면허종류 | 1.000 | 0.830 | 0.423 | 0.921
업체명 | 0.830 | 1.000 | 1.000 | 1.000
차량대수 | 0.423 | 1.000 | 1.000 | 0.974
주소 | 0.921 | 1.000 | 0.974 | 1.000

Variable | 현재운영여부 | 면허종류 | 주소
현재운영여부 | 1.000 | 1.000 | 1.000
면허종류 | 1.000 | 1.000 | 0.585
주소 | 1.000 | 0.585 | 1.000

Variable | 차량대수 | 면허종류 | 주소 | 현재운영여부
차량대수 | 1.000 | 0.430 | 0.681 | 1.000
면허종류 | 0.430 | 1.000 | 0.585 | 1.000
주소 | 0.681 | 0.585 | 1.000 | 1.000
현재운영여부 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

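First rows

Index | 면허종류 | 업체명 | 차량대수 | 주소 | 현재운영여부 | 데이터기준일
0 | 일반화물 | 한현렉카 | 5 | 충청남도 공주시 장기로 160 (금흥동) | 운영 | 2023-06-25
1 | 일반화물 | (주)행복드림통운 | 3 | 충청남도 공주시 금흥안터길 32 (금흥동) | 운영 | 2023-06-25
2 | 일반화물 | (주)중앙이엔비 | 5 | 충청남도 공주시 탄천면 차돌배기길 72-14 | 운영 | 2023-06-25
3 | 일반화물 | 공주개별렉카 | 4 | 충청남도 공주시 웅진로 119 (중학동) | 운영 | 2023-06-25
4 | 일반화물 | 제이엘물류 주식회사 | 1 | 충청남도 공주시 이인면 은행안길 19-6 | 운영 | 2023-06-25
5 | 일반화물 | (주)에스케이물류 | 3 | 충청남도 공주시 무령로 424 (신관동) | 운영 | 2023-06-25
6 | 일반화물 | 충남특수렉카 | 2 | 충청남도 공주시 우성면 질마고개길 21, 주건축물 제2동 | 운영 | 2023-06-25
7 | 일반화물 | 김** | 1 | 충청남도 공주시 ******** | 운영 | 2023-06-25
8 | 일반화물 | 양** | 2 | 충청남도 공주시 ******** | 운영 | 2023-06-25
9 | 일반화물 | (주)공주화물운송사 | 15 | 충청남도 공주시 미나리3길 16-7, 2층 (금성동) | 운영 | 2023-06-25

Last rows

Index | 면허종류 | 업체명 | 차량대수 | 주소 | 현재운영여부 | 데이터기준일
59 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
60 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
61 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
62 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
63 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
64 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
65 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
66 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
67 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
68 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>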
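
The last rows shown are empty in every column. That is consistent with the 14 missing values reported for 업체명, 차량대수, and 데이터기준일, and with the `<NA>` category of size 14 in the categorical variables. One possible cleanup step is to drop such rows before profiling. This is a minimal sketch using Python's standard `csv` module. The inline CSV text and the helper name `drop_empty_rows` are hypothetical, and the real file's encoding and layout may differ.

```python
import csv
import io

def drop_empty_rows(rows):
    """Keep only rows in which at least one field has non-blank text."""
    return [row for row in rows
            if any((value or "").strip() for value in row.values())]

# Hypothetical excerpt: two rows from the sample above plus two blank rows.
raw = (
    "면허종류,업체명,차량대수,주소,현재운영여부,데이터기준일\n"
    "일반화물,한현렉카,5,충청남도 공주시 장기로 160 (금흥동),운영,2023-06-25\n"
    "일반화물,공주개별렉카,4,충청남도 공주시 웅진로 119 (중학동),운영,2023-06-25\n"
    ",,,,,\n"
    ",,,,,\n"
)

rows = list(csv.DictReader(io.StringIO(raw)))
kept = drop_empty_rows(rows)
print(len(rows), "->", len(kept))
```

Removing the empty rows first would likely also remove the perfect (1.000) correlations involving 현재운영여부. Once the `<NA>` rows are gone, that column holds a single value.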

Duplicate rows

Most frequently occurring

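Index | 면허종류 | 업체명 | 차량대수 | 주소 | 현재운영여부 | 데이터기준일 | # duplicates
4 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 14
0 | (구)일반화물 | 김** | 1 | 충청남도 공주시 ******** | 운영 | 2023-06-25 | 4
2 | (구)일반화물 | 이** | 1 | 충청남도 공주시 ******** | 운영 | 2023-06-25 | 3
1 | (구)일반화물 | 양** | 1 | 충청남도 공주시 ******** | 운영 | 2023-06-25 | 2
3 | (구)일반화물 | 임** | 1 | 충청남도 공주시 ******** | 运영 | 2023-06-25 | 2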
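
The table lists five distinct row patterns that repeat, which matches the "5 duplicate rows" alert. The sketch below works out how many rows a plain deduplication would remove. The group sizes are copied from the table, and the variable names are illustrative. Because names and addresses are masked with asterisks, rows such as 김** may belong to different operators. Deduplicating them is therefore an assumption that should be checked against the unmasked source.

```python
# Duplicate groups copied from the table above: 업체명 -> occurrences.
# None stands for the all-empty row. The other fields are identical within
# each group, so the masked name is enough to label the group here.
groups = {None: 14, "김**": 4, "이**": 3, "양**": 2, "임**": 2}

n_observations = 69
duplicated_patterns = len(groups)                  # distinct rows that repeat
rows_in_groups = sum(groups.values())              # rows in any repeat group
redundant_rows = rows_in_groups - duplicated_patterns
rows_after_dedup = n_observations - redundant_rows

print(duplicated_patterns, rows_in_groups, redundant_rows, rows_after_dedup)
```

Keeping one copy of each pattern would leave 49 of the 69 rows. The all-empty group alone accounts for 13 of the 20 rows removed.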