Overview

Dataset statistics

Number of variables7
Number of observations1157
Missing cells31
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory66.8 KiB
Average record size in memory59.1 B

Variable types

Numeric3
Categorical2
Text2

Dataset

Description충청북도 일반화물운송업체에 대한 데이터로 조합, 업체명, 면허정보, 면허대수, 보유대수, 주소 등의 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15114800/fileData.do

Alerts

조합 has constant value ""Constant
면허정보 has constant value ""Constant
면허대수 is highly overall correlated with 보유대수High correlation
보유대수 is highly overall correlated with 면허대수High correlation
면허대수 is highly skewed (γ1 = 22.04900724)Skewed
보유대수 is highly skewed (γ1 = 21.92640641)Skewed
순서 has unique valuesUnique
면허대수 has 25 (2.2%) zerosZeros
보유대수 has 25 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-12 23:34:59.791614
Analysis finished2023-12-12 23:35:01.781453
Duration1.99 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순서
Real number (ℝ)

UNIQUE 

Distinct1157
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean579
Minimum1
Maximum1157
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.3 KiB
2023-12-13T08:35:01.859743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile58.8
Q1290
median579
Q3868
95-th percentile1099.2
Maximum1157
Range1156
Interquartile range (IQR)578

Descriptive statistics

Standard deviation334.14144
Coefficient of variation (CV)0.57710093
Kurtosis-1.2
Mean579
Median Absolute Deviation (MAD)289
Skewness0
Sum669903
Variance111650.5
MonotonicityStrictly increasing
2023-12-13T08:35:01.978722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
725 1
 
0.1%
777 1
 
0.1%
776 1
 
0.1%
775 1
 
0.1%
774 1
 
0.1%
773 1
 
0.1%
772 1
 
0.1%
771 1
 
0.1%
770 1
 
0.1%
Other values (1147) 1147
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1157 1
0.1%
1156 1
0.1%
1155 1
0.1%
1154 1
0.1%
1153 1
0.1%
1152 1
0.1%
1151 1
0.1%
1150 1
0.1%
1149 1
0.1%
1148 1
0.1%

조합
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
충북화물협회
1157 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충북화물협회
2nd row충북화물협회
3rd row충북화물협회
4th row충북화물협회
5th row충북화물협회

Common Values

ValueCountFrequency (%)
충북화물협회 1157
100.0%

Length

2023-12-13T08:35:02.094201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:35:02.169976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충북화물협회 1157
100.0%
Distinct1025
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
2023-12-13T08:35:02.407059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length5.505618
Min length2

Characters and Unicode

Total characters6370
Distinct characters329
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique942 ?
Unique (%)81.4%

Sample

1st row(유)아리울물류청주영업소
2nd row(유)에스와이유통
3rd row(유)중원로지스
4th row(유)지산운수
5th row(유)청솔환경
ValueCountFrequency (%)
주식회사 7
 
0.6%
김*수 6
 
0.5%
이*희 5
 
0.4%
김*희 5
 
0.4%
이*순 5
 
0.4%
이*호 5
 
0.4%
이*영 5
 
0.4%
김*준 4
 
0.3%
이*현 4
 
0.3%
박*근 4
 
0.3%
Other values (1029) 1132
95.8%
2023-12-13T08:35:02.810394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 542
 
8.5%
( 542
 
8.5%
) 542
 
8.5%
541
 
8.5%
179
 
2.8%
175
 
2.7%
162
 
2.5%
146
 
2.3%
139
 
2.2%
127
 
2.0%
Other values (319) 3275
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4705
73.9%
Other Punctuation 542
 
8.5%
Open Punctuation 542
 
8.5%
Close Punctuation 542
 
8.5%
Space Separator 25
 
0.4%
Uppercase Letter 8
 
0.1%
Decimal Number 2
 
< 0.1%
Dash Punctuation 2
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
541
 
11.5%
179
 
3.8%
175
 
3.7%
162
 
3.4%
146
 
3.1%
139
 
3.0%
127
 
2.7%
113
 
2.4%
101
 
2.1%
88
 
1.9%
Other values (304) 2934
62.4%
Uppercase Letter
ValueCountFrequency (%)
A 2
25.0%
E 1
12.5%
T 1
12.5%
R 1
12.5%
N 1
12.5%
D 1
12.5%
S 1
12.5%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Other Punctuation
ValueCountFrequency (%)
* 542
100.0%
Open Punctuation
ValueCountFrequency (%)
( 542
100.0%
Close Punctuation
ValueCountFrequency (%)
) 542
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4705
73.9%
Common 1657
 
26.0%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
541
 
11.5%
179
 
3.8%
175
 
3.7%
162
 
3.4%
146
 
3.1%
139
 
3.0%
127
 
2.7%
113
 
2.4%
101
 
2.1%
88
 
1.9%
Other values (304) 2934
62.4%
Common
ValueCountFrequency (%)
* 542
32.7%
( 542
32.7%
) 542
32.7%
25
 
1.5%
1 2
 
0.1%
- 2
 
0.1%
> 1
 
0.1%
< 1
 
0.1%
Latin
ValueCountFrequency (%)
A 2
25.0%
E 1
12.5%
T 1
12.5%
R 1
12.5%
N 1
12.5%
D 1
12.5%
S 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4705
73.9%
ASCII 1665
 
26.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 542
32.6%
( 542
32.6%
) 542
32.6%
25
 
1.5%
A 2
 
0.1%
1 2
 
0.1%
- 2
 
0.1%
E 1
 
0.1%
T 1
 
0.1%
R 1
 
0.1%
Other values (5) 5
 
0.3%
Hangul
ValueCountFrequency (%)
541
 
11.5%
179
 
3.8%
175
 
3.7%
162
 
3.4%
146
 
3.1%
139
 
3.0%
127
 
2.7%
113
 
2.4%
101
 
2.1%
88
 
1.9%
Other values (304) 2934
62.4%

면허정보
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
일반화물
1157 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반화물
2nd row일반화물
3rd row일반화물
4th row일반화물
5th row일반화물

Common Values

ValueCountFrequency (%)
일반화물 1157
100.0%

Length

2023-12-13T08:35:02.930148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:35:03.003999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반화물 1157
100.0%

면허대수
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct65
Distinct (%)5.7%
Missing10
Missing (%)0.9%
Infinite0
Infinite (%)0.0%
Mean6.5640802
Minimum0
Maximum812
Zeros25
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size10.3 KiB
2023-12-13T08:35:03.095566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q34
95-th percentile30
Maximum812
Range812
Interquartile range (IQR)3

Descriptive statistics

Standard deviation27.821891
Coefficient of variation (CV)4.2385057
Kurtosis617.0344
Mean6.5640802
Median Absolute Deviation (MAD)0
Skewness22.049007
Sum7529
Variance774.05763
MonotonicityNot monotonic
2023-12-13T08:35:03.234282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 711
61.5%
2 66
 
5.7%
3 52
 
4.5%
4 39
 
3.4%
5 27
 
2.3%
0 25
 
2.2%
7 23
 
2.0%
6 18
 
1.6%
8 17
 
1.5%
15 11
 
1.0%
Other values (55) 158
 
13.7%
(Missing) 10
 
0.9%
ValueCountFrequency (%)
0 25
 
2.2%
1 711
61.5%
2 66
 
5.7%
3 52
 
4.5%
4 39
 
3.4%
5 27
 
2.3%
6 18
 
1.6%
7 23
 
2.0%
8 17
 
1.5%
9 8
 
0.7%
ValueCountFrequency (%)
812 1
0.1%
159 1
0.1%
158 1
0.1%
157 1
0.1%
152 1
0.1%
151 1
0.1%
81 1
0.1%
72 1
0.1%
71 1
0.1%
70 1
0.1%

보유대수
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct64
Distinct (%)5.6%
Missing10
Missing (%)0.9%
Infinite0
Infinite (%)0.0%
Mean6.5431561
Minimum0
Maximum805
Zeros25
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size10.3 KiB
2023-12-13T08:35:03.360875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q34
95-th percentile30.7
Maximum805
Range805
Interquartile range (IQR)3

Descriptive statistics

Standard deviation27.641444
Coefficient of variation (CV)4.2244818
Kurtosis611.75103
Mean6.5431561
Median Absolute Deviation (MAD)0
Skewness21.926406
Sum7505
Variance764.0494
MonotonicityNot monotonic
2023-12-13T08:35:03.485216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 711
61.5%
2 66
 
5.7%
3 52
 
4.5%
4 40
 
3.5%
5 28
 
2.4%
0 25
 
2.2%
7 22
 
1.9%
6 18
 
1.6%
8 17
 
1.5%
15 11
 
1.0%
Other values (54) 157
 
13.6%
(Missing) 10
 
0.9%
ValueCountFrequency (%)
0 25
 
2.2%
1 711
61.5%
2 66
 
5.7%
3 52
 
4.5%
4 40
 
3.5%
5 28
 
2.4%
6 18
 
1.6%
7 22
 
1.9%
8 17
 
1.5%
9 8
 
0.7%
ValueCountFrequency (%)
805 1
0.1%
159 1
0.1%
158 1
0.1%
157 1
0.1%
152 1
0.1%
151 1
0.1%
81 1
0.1%
72 1
0.1%
70 2
0.2%
67 1
0.1%

주소
Text

Distinct1043
Distinct (%)91.0%
Missing11
Missing (%)1.0%
Memory size9.2 KiB
2023-12-13T08:35:03.719882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length56
Mean length38.458115
Min length21

Characters and Unicode

Total characters44073
Distinct characters376
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique973 ?
Unique (%)84.9%

Sample

1st row(28182)충청북도 청주시 서원구 척산화당로 254
2nd row(27606)충청북도 음성군 사곡길 33
3rd row(27429)충청북도 충주시 중원대로 3482 (봉방동 915)
4th row(28667)충청북도 청주시 서원구 예체로29번길 9 삼익1차아파트 b동 205호
5th row(27006)충청북도 단양군 평동로 111-3
ValueCountFrequency (%)
청주시 538
 
7.4%
제천시 245
 
3.4%
흥덕구 215
 
3.0%
서원구 140
 
1.9%
청원구 107
 
1.5%
음성군 84
 
1.2%
상당구 73
 
1.0%
충주시 72
 
1.0%
진천군 61
 
0.8%
옥천군 33
 
0.5%
Other values (2856) 5692
78.4%
2023-12-13T08:35:04.072276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6759
 
15.3%
1 2520
 
5.7%
2 2467
 
5.6%
1860
 
4.2%
( 1715
 
3.9%
) 1714
 
3.9%
0 1588
 
3.6%
3 1435
 
3.3%
8 1288
 
2.9%
7 1235
 
2.8%
Other values (366) 21492
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18917
42.9%
Decimal Number 14080
31.9%
Space Separator 6759
 
15.3%
Open Punctuation 1715
 
3.9%
Close Punctuation 1714
 
3.9%
Dash Punctuation 855
 
1.9%
Uppercase Letter 16
 
< 0.1%
Lowercase Letter 9
 
< 0.1%
Other Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1860
 
9.8%
1225
 
6.5%
1195
 
6.3%
1122
 
5.9%
938
 
5.0%
890
 
4.7%
727
 
3.8%
567
 
3.0%
559
 
3.0%
465
 
2.5%
Other values (338) 9369
49.5%
Decimal Number
ValueCountFrequency (%)
1 2520
17.9%
2 2467
17.5%
0 1588
11.3%
3 1435
10.2%
8 1288
9.1%
7 1235
8.8%
4 989
 
7.0%
6 965
 
6.9%
5 898
 
6.4%
9 695
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
B 7
43.8%
A 3
18.8%
G 1
 
6.2%
C 1
 
6.2%
H 1
 
6.2%
L 1
 
6.2%
T 1
 
6.2%
P 1
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
b 4
44.4%
e 3
33.3%
j 1
 
11.1%
a 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 6
75.0%
. 2
 
25.0%
Space Separator
ValueCountFrequency (%)
6759
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1715
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1714
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 855
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25131
57.0%
Hangul 18917
42.9%
Latin 25
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1860
 
9.8%
1225
 
6.5%
1195
 
6.3%
1122
 
5.9%
938
 
5.0%
890
 
4.7%
727
 
3.8%
567
 
3.0%
559
 
3.0%
465
 
2.5%
Other values (338) 9369
49.5%
Common
ValueCountFrequency (%)
6759
26.9%
1 2520
 
10.0%
2 2467
 
9.8%
( 1715
 
6.8%
) 1714
 
6.8%
0 1588
 
6.3%
3 1435
 
5.7%
8 1288
 
5.1%
7 1235
 
4.9%
4 989
 
3.9%
Other values (6) 3421
13.6%
Latin
ValueCountFrequency (%)
B 7
28.0%
b 4
16.0%
A 3
12.0%
e 3
12.0%
j 1
 
4.0%
G 1
 
4.0%
C 1
 
4.0%
H 1
 
4.0%
L 1
 
4.0%
a 1
 
4.0%
Other values (2) 2
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25156
57.1%
Hangul 18917
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6759
26.9%
1 2520
 
10.0%
2 2467
 
9.8%
( 1715
 
6.8%
) 1714
 
6.8%
0 1588
 
6.3%
3 1435
 
5.7%
8 1288
 
5.1%
7 1235
 
4.9%
4 989
 
3.9%
Other values (18) 3446
13.7%
Hangul
ValueCountFrequency (%)
1860
 
9.8%
1225
 
6.5%
1195
 
6.3%
1122
 
5.9%
938
 
5.0%
890
 
4.7%
727
 
3.8%
567
 
3.0%
559
 
3.0%
465
 
2.5%
Other values (338) 9369
49.5%

Interactions

2023-12-13T08:35:01.143888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:00.216109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:00.540712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:01.243505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:00.346361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:00.634614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:01.346470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:00.439586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:35:01.039416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:35:04.143973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서면허대수보유대수
순서1.0000.1020.104
면허대수0.1021.0000.999
보유대수0.1040.9991.000
2023-12-13T08:35:04.207925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서면허대수보유대수
순서1.0000.0110.011
면허대수0.0111.0001.000
보유대수0.0111.0001.000

Missing values

2023-12-13T08:35:01.499382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:35:01.620046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:35:01.722418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순서조합업체명면허정보면허대수보유대수주소
01충북화물협회(유)아리울물류청주영업소일반화물4444(28182)충청북도 청주시 서원구 척산화당로 254
12충북화물협회(유)에스와이유통일반화물11(27606)충청북도 음성군 사곡길 33
23충북화물협회(유)중원로지스일반화물1515(27429)충청북도 충주시 중원대로 3482 (봉방동 915)
34충북화물협회(유)지산운수일반화물11(28667)충청북도 청주시 서원구 예체로29번길 9 삼익1차아파트 b동 205호
45충북화물협회(유)청솔환경일반화물55(27006)충청북도 단양군 평동로 111-3
56충북화물협회(유)탑로지스틱일반화물4444(27606)충청북도 음성군 사곡길 37
67충북화물협회(유)한신물류일반화물11(28484)충청북도 청주시 청원구 무심동로 528 4층
78충북화물협회(자)두산운수일반화물1111(28176)충청북도 청주시 흥덕구 저산태성로 289-6
89충북화물협회(합)국제상운일반화물2525(27703)충청북도 음성군 설성로 77
910충북화물협회(합)금강중기추레라일반화물22(27130)충청북도 제천시 의병대로 412
순서조합업체명면허정보면허대수보유대수주소
11471148충북화물협회황*구일반화물11(28389)충청북도 청주시 흥덕구 가경로189번길 52 (가경동 762) 벽산아파트 104-603
11481149충북화물협회(주)효림통운일반화물11(28171)충청북도 청주시 흥덕구 상월곡길 8
11491150충북화물협회효성물류일반화물11(27142)충청북도 제천시 내토로65길 17
11501151충북화물협회(주)효진물류일반화물11(28111)충청북도 청주시 흥덕구 중부로 301 (옥산면 700-15) 203호
11511152충북화물협회흥국통운(주)일반화물4646(395-900)충북 단양군 매포읍 매포길 113-9 205호단양물류기지관
11521153충북화물협회흥덕기업(주)일반화물2525(27002)충청북도 단양군 단양로 2000
11531154충북화물협회흥덕운수(주)일반화물66(24465)강원도 춘천시 강촌로 254
11541155충북화물협회(주)힘찬물류일반화물3030(28360)충청북도 청주시 흥덕구 직지대로240번길 38 (지동동 476) 화물터미널 110호
11551156충북화물협회AD물류일반화물11(27663)충청북도 음성군 대금로 792 가동
11561157충북화물협회(주)E-TRANS일반화물3535(29062)충청북도 옥천군 용방3길 2-1