Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 78 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.6 KiB |
Average record size in memory | 60.7 B |
Variable types
Text | 1 |
---|---|
Categorical | 3 |
Numeric | 3 |
Dataset
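Description | National Police Agency (경찰청) data on vehicle status by engine displacement. It provides the fields 배기량 (engine displacement), 차량대수 (number of vehicles), 승용 (passenger cars), 승합 (vans and buses), 화물 (trucks), 특수 (special-purpose vehicles), and 이륜 (two-wheeled vehicles). Data as of 2020-12-31. |
---|---|
Author | National Police Agency (경찰청) |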
URL | https://www.data.go.kr/data/15065779/fileData.do |
화물 is highly overall correlated with 차량대수 and 1 other field | High correlation |
특수 is highly overall correlated with 차량대수 and 1 other field | High correlation |
이륜 is highly overall correlated with 차량대수 | High correlation |
차량대수 is highly overall correlated with 화물 and 3 other fields | High correlation |
승용 is highly overall correlated with 차량대수 | High correlation |
승합 is highly overall correlated with 화물 and 1 other field | High correlation |
승합 is highly imbalanced (52.9%) | Imbalance |
배기량 has unique values | Unique |
화물 has 67 (85.9%) zeros | Zeros |
특수 has 49 (62.8%) zeros | Zeros |
이륜 has 71 (91.0%) zeros | Zeros |
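The 52.9% imbalance figure for 승합 can be reproduced from the value counts listed in that variable's Common Values table (56 zeros, 4 ones, 3 twos, 2 fours, and 13 values that occur once). The sketch below assumes the score is one minus the Shannon entropy divided by the log of the number of classes. That definition is an assumption that happens to match the reported number, not a statement of how the tool computes it, and the name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # with a single class, treat the score as 0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# 승합 value counts from the report: 0 -> 56, 1 -> 4, 2 -> 3, 4 -> 2,
# plus 13 values that each occur exactly once (17 distinct values, 78 rows).
seunghap_counts = [56, 4, 3, 2] + [1] * 13

score = imbalance_score(seunghap_counts)
print(f"{score:.1%}")
```

A score near 0 would mean the classes are evenly spread; a score near 1 means one class dominates.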
Reproduction
Analysis started | 2023-12-12 04:03:34.421405 |
---|---|
Analysis finished | 2023-12-12 04:03:35.828196 |
Duration | 1.41 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
배기량
Text
UNIQUE
 
Distinct | 78 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 756.0 B |
Value | Count | Frequency (%) |
cc | 78 | 50.0% |
2,696 | 1 | 0.6% |
4,565 | 1 | 0.6% |
3,933 | 1 | 0.6% |
3,907 | 1 | 0.6% |
3,800 | 1 | 0.6% |
3,778 | 1 | 0.6% |
3,773 | 1 | 0.6% |
6,299 | 1 | 0.6% |
3,470 | 1 | 0.6% |
Other values (69) | 69 | 44.2% |
Most occurring characters
Value | Count | Frequency (%) |
c | 156 | 25.6% |
(space) | 78 | 12.8% |
, | 69 | 11.3% |
9 | 54 | 8.9% |
1 | 47 | 7.7% |
0 | 41 | 6.7% |
2 | 33 | 5.4% |
6 | 28 | 4.6% |
7 | 26 | 4.3% |
3 | 23 | 3.8% |
Other values (3) | 54 | 8.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 306 | 50.2% |
Lowercase Letter | 156 | 25.6% |
Space Separator | 78 | 12.8% |
Other Punctuation | 69 | 11.3% |
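The category names in this table correspond to Unicode general categories (Nd, Ll, Zs, Po). As a minimal sketch of how such counts can be obtained, the standard-library `unicodedata` module classifies each character; the helper name `category_counts` and the three-value sample are illustrative, not the report's own code.

```python
import unicodedata
from collections import Counter

# Short Unicode general-category codes and the long names used in the report.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Three 배기량 values that appear in the report.
sample = ["2,696 cc", "124 cc", "12,700 cc"]
counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```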
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
9 | 54 | 17.6% |
1 | 47 | 15.4% |
0 | 41 | 13.4% |
2 | 33 | 10.8% |
6 | 28 | 9.2% |
7 | 26 | 8.5% |
3 | 23 | 7.5% |
5 | 23 | 7.5% |
4 | 18 | 5.9% |
8 | 13 | 4.2% |
Lowercase Letter
Value | Count | Frequency (%) |
c | 156 |
Space Separator
Value | Count | Frequency (%) |
(space) | 78 |
Other Punctuation
Value | Count | Frequency (%) |
, | 69 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 453 | 74.4% |
Latin | 156 | 25.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
(space) | 78 | 17.2% |
, | 69 | 15.2% |
9 | 54 | 11.9% |
1 | 47 | 10.4% |
0 | 41 | 9.1% |
2 | 33 | 7.3% |
6 | 28 | 6.2% |
7 | 26 | 5.7% |
3 | 23 | 5.1% |
5 | 23 | 5.1% |
Other values (2) | 31 | 6.8% |
Latin
Value | Count | Frequency (%) |
c | 156 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 609 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
c | 156 | 25.6% |
(space) | 78 | 12.8% |
, | 69 | 11.3% |
9 | 54 | 8.9% |
1 | 47 | 7.7% |
0 | 41 | 6.7% |
2 | 33 | 5.4% |
6 | 28 | 4.6% |
7 | 26 | 4.3% |
3 | 23 | 3.8% |
Other values (3) | 54 | 8.9% |
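배기량 is typed as Text because every value carries a " cc" suffix and, for values of 1,000 and above, a thousands separator. A minimal sketch of converting these strings to integers before numeric analysis follows; the function name `parse_displacement_cc` is hypothetical, and it assumes every value follows the "digits with optional commas, space, cc" pattern seen in this report.

```python
def parse_displacement_cc(text):
    """Convert a string such as '2,696 cc' to the integer 2696."""
    number, unit = text.strip().rsplit(" ", 1)
    if unit != "cc":
        raise ValueError(f"unexpected unit in {text!r}")
    return int(number.replace(",", ""))

# Values taken from the sample rows of the report.
values = ["0 cc", "995 cc", "1,170 cc", "12,742 cc"]
parsed = [parse_displacement_cc(v) for v in values]
print(parsed)
```

Once parsed, the column can be sorted and binned numerically instead of being compared as strings.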
차량대수
Categorical
HIGH CORRELATION
 
Distinct | 38 |
---|---|
Distinct (%) | 48.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 756.0 B |
1 | 12 |
---|---|
2 | 11 |
7 | 6 |
6 | 4 |
8 | 4 |
Other values (33) | 41 |
Length
Max length | 5 |
---|---|
Median length | 1 |
Mean length | 1.7051282 |
Min length | 1 |
Unique
Unique | 29 |
---|---|
Unique (%) | 37.2% |
Sample
1st row | 6 |
---|---|
2nd row | 9 |
3rd row | 2 |
4th row | 73 |
5th row | 6 |
Common Values
Value | Count | Frequency (%) |
1 | 12 | 15.4% |
2 | 11 | 14.1% |
7 | 6 | 7.7% |
6 | 4 | 5.1% |
8 | 4 | 5.1% |
4 | 4 | 5.1% |
5 | 3 | 3.8% |
9 | 3 | 3.8% |
16 | 2 | 2.6% |
1,463 | 1 | 1.3% |
Other values (28) | 28 | 35.9% |
Length
Value | Count | Frequency (%) |
1 | 12 | 15.4% |
2 | 11 | 14.1% |
7 | 6 | 7.7% |
6 | 4 | 5.1% |
8 | 4 | 5.1% |
4 | 4 | 5.1% |
5 | 3 | 3.8% |
9 | 3 | 3.8% |
16 | 2 | 2.6% |
3,756 | 1 | 1.3% |
Other values (28) | 28 | 35.9% |
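The report distinguishes "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). The sketch below rebuilds the 차량대수 value counts from the Common Values table to show how 38 distinct and 29 unique values arise. The actual once-only values are mostly not listed in the report, so placeholder labels stand in for them.

```python
from collections import Counter

# 차량대수 values that repeat, with counts from the Common Values table.
repeated = {"1": 12, "2": 11, "7": 6, "6": 4, "8": 4,
            "4": 4, "5": 3, "9": 3, "16": 2}
counts = Counter(repeated)

# The remaining 29 values occur once each ("1,463" plus 28 others).
# Placeholder labels are used because the report does not list them all.
for i in range(29):
    counts[f"single_{i}"] = 1

n_rows = sum(counts.values())
distinct = len(counts)                                   # different values
unique = sum(1 for c in counts.values() if c == 1)       # values seen once

print(n_rows, distinct, unique)
print(f"{distinct / n_rows:.1%}", f"{unique / n_rows:.1%}")
```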
승용
Categorical
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 26.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 756.0 B |
0 | 42 |
---|---|
1 | 7 |
2 | 6 |
7 | 4 |
4 | 3 |
Other values (16) | 16 |
Length
Max length | 5 |
---|---|
Median length | 1 |
Mean length | 1.3589744 |
Min length | 1 |
Unique
Unique | 16 |
---|---|
Unique (%) | 20.5% |
Sample
1st row | 0 |
---|---|
2nd row | 7 |
3rd row | 0 |
4th row | 0 |
5th row | 6 |
Common Values
Value | Count | Frequency (%) |
0 | 42 | 53.8% |
1 | 7 | 9.0% |
2 | 6 | 7.7% |
7 | 4 | 5.1% |
4 | 3 | 3.8% |
53 | 1 | 1.3% |
6 | 1 | 1.3% |
156 | 1 | 1.3% |
274 | 1 | 1.3% |
364 | 1 | 1.3% |
Other values (11) | 11 | 14.1% |
Length
Value | Count | Frequency (%) |
0 | 42 | 53.8% |
1 | 7 | 9.0% |
2 | 6 | 7.7% |
7 | 4 | 5.1% |
4 | 3 | 3.8% |
1,463 | 1 | 1.3% |
16 | 1 | 1.3% |
27 | 1 | 1.3% |
115 | 1 | 1.3% |
3,756 | 1 | 1.3% |
Other values (11) | 11 | 14.1% |
승합
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 17 |
---|---|
Distinct (%) | 21.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 756.0 B |
0 | 56 |
---|---|
1 | 4 |
2 | 3 |
4 | 2 |
629 | 1 |
Other values (12) | 12 |
Length
Max length | 5 |
---|---|
Median length | 1 |
Mean length | 1.2051282 |
Min length | 1 |
Unique
Unique | 13 |
---|---|
Unique (%) | 16.7% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 56 | 71.8% |
1 | 4 | 5.1% |
2 | 3 | 3.8% |
4 | 2 | 2.6% |
629 | 1 | 1.3% |
38 | 1 | 1.3% |
624 | 1 | 1.3% |
853 | 1 | 1.3% |
3,808 | 1 | 1.3% |
3 | 1 | 1.3% |
Other values (7) | 7 | 9.0% |
Length
Value | Count | Frequency (%) |
0 | 56 | 71.8% |
1 | 4 | 5.1% |
2 | 3 | 3.8% |
4 | 2 | 2.6% |
6 | 1 | 1.3% |
8 | 1 | 1.3% |
5 | 1 | 1.3% |
48 | 1 | 1.3% |
10 | 1 | 1.3% |
15 | 1 | 1.3% |
Other values (7) | 7 | 9.0% |
화물
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 11 |
---|---|
Distinct (%) | 14.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.487179 |
Minimum | 0 |
---|---|
Maximum | 500 |
Zeros | 67 |
Zeros (%) | 85.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 834.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 23.05 |
Maximum | 500 |
Range | 500 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 59.610586 |
---|---|
Coefficient of variation (CV) | 5.684139 |
Kurtosis | 61.096328 |
Mean | 10.487179 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.5611634 |
Sum | 818 |
Variance | 3553.4219 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 67 | 85.9% |
2 | 2 | 2.6% |
29 | 1 | 1.3% |
120 | 1 | 1.3% |
22 | 1 | 1.3% |
500 | 1 | 1.3% |
7 | 1 | 1.3% |
128 | 1 | 1.3% |
1 | 1 | 1.3% |
4 | 1 | 1.3% |
Value | Count | Frequency (%) |
0 | 67 | 85.9% |
1 | 1 | 1.3% |
2 | 2 | 2.6% |
3 | 1 | 1.3% |
4 | 1 | 1.3% |
7 | 1 | 1.3% |
22 | 1 | 1.3% |
29 | 1 | 1.3% |
120 | 1 | 1.3% |
128 | 1 | 1.3% |
Value | Count | Frequency (%) |
500 | 1 | 1.3% |
128 | 1 | 1.3% |
120 | 1 | 1.3% |
29 | 1 | 1.3% |
22 | 1 | 1.3% |
7 | 1 | 1.3% |
4 | 1 | 1.3% |
3 | 1 | 1.3% |
2 | 2 | 2.6% |
1 | 1 | 1.3% |
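Because 화물 has only 11 distinct values, the whole column can be rebuilt from the frequency tables above and the descriptive statistics re-derived. The sketch below uses the standard-library `statistics` module and assumes the report's standard deviation and variance are the sample (n - 1) versions, which is what matches the reported figures.

```python
import statistics

# 화물 frequency table from the report: value -> count (78 rows in total).
freq = {0: 67, 1: 1, 2: 2, 3: 1, 4: 1, 7: 1,
        22: 1, 29: 1, 120: 1, 128: 1, 500: 1}
data = [value for value, count in freq.items() for _ in range(count)]

n = len(data)
total = sum(data)
mean = total / n
std = statistics.stdev(data)          # sample standard deviation (n - 1)
variance = statistics.variance(data)  # sample variance (n - 1)
cv = std / mean                       # coefficient of variation
zeros_pct = 100 * freq[0] / n

print(n, total, round(mean, 6), round(std, 6), round(cv, 6), round(zeros_pct, 1))
```

The coefficient of variation above 5 reflects how a few large buckets (120, 128, 500) dominate a column that is otherwise almost entirely zero.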
특수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 19.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.474359 |
Minimum | 0 |
---|---|
Maximum | 64 |
Zeros | 49 |
Zeros (%) | 62.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 834.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 11.9 |
Maximum | 64 |
Range | 64 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 10.141498 |
---|---|
Coefficient of variation (CV) | 2.9189552 |
Kurtosis | 23.615804 |
Mean | 3.474359 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.6970205 |
Sum | 271 |
Variance | 102.84998 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 49 | 62.8% |
1 | 7 | 9.0% |
6 | 4 | 5.1% |
2 | 3 | 3.8% |
5 | 3 | 3.8% |
4 | 2 | 2.6% |
7 | 2 | 2.6% |
11 | 1 | 1.3% |
8 | 1 | 1.3% |
9 | 1 | 1.3% |
Other values (5) | 5 | 6.4% |
Value | Count | Frequency (%) |
0 | 49 | 62.8% |
1 | 7 | 9.0% |
2 | 3 | 3.8% |
3 | 1 | 1.3% |
4 | 2 | 2.6% |
5 | 3 | 3.8% |
6 | 4 | 5.1% |
7 | 2 | 2.6% |
8 | 1 | 1.3% |
9 | 1 | 1.3% |
Value | Count | Frequency (%) |
64 | 1 | 1.3% |
53 | 1 | 1.3% |
32 | 1 | 1.3% |
17 | 1 | 1.3% |
11 | 1 | 1.3% |
9 | 1 | 1.3% |
8 | 1 | 1.3% |
7 | 2 | 2.6% |
6 | 4 | 5.1% |
5 | 3 | 3.8% |
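The quantile statistics for 특수 can be checked against the frequency tables. The sketch below assumes quantiles are computed by linear interpolation between the two nearest ranks; that assumption reproduces the reported Q3 of 2 and 95th percentile of 11.9. The helper name `quantile` is illustrative.

```python
def quantile(sorted_data, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_data) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_data) - 1)
    frac = pos - lower
    return sorted_data[lower] + frac * (sorted_data[upper] - sorted_data[lower])

# 특수 frequency table from the report: value -> count (78 rows in total).
freq = {0: 49, 1: 7, 2: 3, 3: 1, 4: 2, 5: 3, 6: 4, 7: 2, 8: 1, 9: 1,
        11: 1, 17: 1, 32: 1, 53: 1, 64: 1}
data = sorted(v for v, c in freq.items() for _ in range(c))

q1 = quantile(data, 0.25)
median = quantile(data, 0.50)
q3 = quantile(data, 0.75)
p95 = quantile(data, 0.95)
iqr = q3 - q1

print(q1, median, q3, round(p95, 2), iqr)
```

The 95th percentile falls between ranks 73 and 74 of the sorted data (values 11 and 17), which is why it is not a whole number.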
이륜
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 8 |
---|---|
Distinct (%) | 10.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.935897 |
Minimum | 0 |
---|---|
Maximum | 951 |
Zeros | 71 |
Zeros (%) | 91.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 834.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 85.75 |
Maximum | 951 |
Range | 951 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 114.07532 |
---|---|
Coefficient of variation (CV) | 5.4487903 |
Kurtosis | 59.251711 |
Mean | 20.935897 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.398674 |
Sum | 1633 |
Variance | 13013.178 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 71 | 91.0% |
2 | 1 | 1.3% |
73 | 1 | 1.3% |
951 | 1 | 1.3% |
200 | 1 | 1.3% |
158 | 1 | 1.3% |
1 | 1 | 1.3% |
248 | 1 | 1.3% |
Value | Count | Frequency (%) |
0 | 71 | 91.0% |
1 | 1 | 1.3% |
2 | 1 | 1.3% |
73 | 1 | 1.3% |
158 | 1 | 1.3% |
200 | 1 | 1.3% |
248 | 1 | 1.3% |
951 | 1 | 1.3% |
Value | Count | Frequency (%) |
951 | 1 | 1.3% |
248 | 1 | 1.3% |
200 | 1 | 1.3% |
158 | 1 | 1.3% |
73 | 1 | 1.3% |
2 | 1 | 1.3% |
1 | 1 | 1.3% |
0 | 71 | 91.0% |
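With 71 of 78 values equal to zero, the median, IQR, and median absolute deviation (MAD) of 이륜 all collapse to 0 even though the maximum is 951. A short sketch, rebuilding the column from the frequency table above, makes this concrete; it takes MAD to be the median of absolute deviations from the median.

```python
import statistics

# 이륜 frequency table from the report: value -> count (78 rows in total).
freq = {0: 71, 1: 1, 2: 1, 73: 1, 158: 1, 200: 1, 248: 1, 951: 1}
data = [v for v, c in freq.items() for _ in range(c)]

median = statistics.median(data)
# Median absolute deviation: the median of |x - median|.
mad = statistics.median([abs(x - median) for x in data])
zeros_pct = 100 * freq[0] / len(data)

print(median, mad, round(zeros_pct, 1), sum(data))
```

Rank-based measures are therefore uninformative for this column; the mean and standard deviation are driven almost entirely by the seven non-zero rows.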
Variable | 배기량 | 차량대수 | 승용 | 승합 | 화물 | 특수 | 이륜 |
---|---|---|---|---|---|---|---|
배기량 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
차량대수 | 1.000 | 1.000 | 0.986 | 0.929 | 1.000 | 0.984 | 1.000 |
승용 | 1.000 | 0.986 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
승합 | 1.000 | 0.929 | 0.000 | 1.000 | 1.000 | 0.852 | 0.000 |
화물 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.986 | 0.000 |
특수 | 1.000 | 0.984 | 0.000 | 0.852 | 0.986 | 1.000 | 0.000 |
이륜 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
Variable | 승용 | 차량대수 | 승합 |
---|---|---|---|
승용 | 1.000 | 0.681 | 0.000 |
차량대수 | 0.681 | 1.000 | 0.456 |
승합 | 0.000 | 0.456 | 1.000 |
Variable | 화물 | 특수 | 이륜 | 차량대수 | 승용 | 승합 |
---|---|---|---|---|---|---|
화물 | 1.000 | 0.403 | -0.127 | 0.730 | 0.000 | 0.902 |
특수 | 0.403 | 1.000 | -0.233 | 0.673 | 0.000 | 0.565 |
이륜 | -0.127 | -0.233 | 1.000 | 0.735 | 0.000 | 0.000 |
차량대수 | 0.730 | 0.673 | 0.735 | 1.000 | 0.681 | 0.456 |
승용 | 0.000 | 0.000 | 0.000 | 0.681 | 1.000 | 0.000 |
승합 | 0.902 | 0.565 | 0.000 | 0.456 | 0.000 | 1.000 |
Row | 배기량 | 차량대수 | 승용 | 승합 | 화물 | 특수 | 이륜 |
---|---|---|---|---|---|---|---|
0 | 0 cc | 6 | 0 | 0 | 0 | 6 | 0 |
1 | 1 cc | 9 | 7 | 0 | 0 | 2 | 0 |
2 | 20 cc | 2 | 0 | 0 | 0 | 0 | 2 |
3 | 100 cc | 73 | 0 | 0 | 0 | 0 | 73 |
4 | 113 cc | 6 | 6 | 0 | 0 | 0 | 0 |
5 | 120 cc | 4 | 0 | 4 | 0 | 0 | 0 |
6 | 124 cc | 951 | 0 | 0 | 0 | 0 | 951 |
7 | 995 cc | 156 | 156 | 0 | 0 | 0 | 0 |
8 | 999 cc | 274 | 274 | 0 | 0 | 0 | 0 |
9 | 1,170 cc | 200 | 0 | 0 | 0 | 0 | 200 |
Row | 배기량 | 차량대수 | 승용 | 승합 | 화물 | 특수 | 이륜 |
---|---|---|---|---|---|---|---|
68 | 7,640 cc | 7 | 0 | 1 | 0 | 6 | 0 |
69 | 9,960 cc | 16 | 0 | 10 | 0 | 6 | 0 |
70 | 10,964 cc | 48 | 0 | 48 | 0 | 0 | 0 |
71 | 11,051 cc | 6 | 0 | 5 | 0 | 1 | 0 |
72 | 11,120 cc | 1 | 0 | 0 | 0 | 1 | 0 |
73 | 11,149 cc | 8 | 0 | 8 | 0 | 0 | 0 |
74 | 12,300 cc | 2 | 0 | 2 | 0 | 0 | 0 |
75 | 12,344 cc | 2 | 0 | 1 | 0 | 1 | 0 |
76 | 12,700 cc | 644 | 0 | 644 | 0 | 0 | 0 |
77 | 12,742 cc | 2 | 0 | 2 | 0 | 0 | 0 |
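In every sample row shown, 차량대수 equals the sum of the five vehicle-type columns, which helps explain why it is flagged as highly correlated with each of them. A minimal consistency check over a few of those rows is sketched below; the helper name `mismatches` is illustrative, and the check covers only rows visible in this report, not the full dataset.

```python
# Rows copied from the two sample tables in the report:
# (배기량, 차량대수, 승용, 승합, 화물, 특수, 이륜)
rows = [
    ("0 cc", 6, 0, 0, 0, 6, 0),
    ("1 cc", 9, 7, 0, 0, 2, 0),
    ("124 cc", 951, 0, 0, 0, 0, 951),
    ("999 cc", 274, 274, 0, 0, 0, 0),
    ("7,640 cc", 7, 0, 1, 0, 6, 0),
    ("9,960 cc", 16, 0, 10, 0, 6, 0),
    ("12,700 cc", 644, 0, 644, 0, 0, 0),
]

def mismatches(rows):
    """Return rows whose total differs from the sum of the five type counts."""
    return [r for r in rows if r[1] != sum(r[2:])]

bad = mismatches(rows)
print(bad)
```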