Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 229 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 15.8 KiB |
Average record size in memory | 70.6 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 6 |
Dataset
Description | * 부문별 이륜차 교통사고(2018) |
---|---|
Author | 도로교통공단 |
URL | https://www.data.go.kr/data/15094171/fileData.do |
발생건수 is highly overall correlated with 부상자수 and 3 other fields | High correlation |
부상자수 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
중상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
경상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
부상신고 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
사망자수 has 56 (24.5%) zeros | Zeros |
중상 has 4 (1.7%) zeros | Zeros |
경상 has 6 (2.6%) zeros | Zeros |
부상신고 has 42 (18.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 08:58:44.904130 |
---|---|
Analysis finished | 2023-12-12 08:58:49.365945 |
Duration | 4.46 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시도
Categorical
Distinct | 17 |
---|---|
Distinct (%) | 7.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
경기 | |
---|---|
서울 | |
경북 | |
전남 | |
강원 | |
Other values (12) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
경기 | 31 | |
서울 | 25 | |
경북 | 23 | |
전남 | 22 | |
강원 | 18 | |
경남 | 18 | |
부산 | 16 | |
충남 | 15 | |
전북 | 14 | 6.1% |
충북 | 11 | 4.8% |
Other values (7) | 36 |
Length
Value | Count | Frequency (%) |
경기 | 31 | |
서울 | 25 | |
경북 | 23 | |
전남 | 22 | |
강원 | 18 | |
경남 | 18 | |
부산 | 16 | |
충남 | 15 | |
전북 | 14 | 6.1% |
충북 | 11 | 4.8% |
Other values (7) | 36 |
시군구
Text
Distinct | 206 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
Value | Count | Frequency (%) |
중구 | 6 | 2.6% |
동구 | 6 | 2.6% |
서구 | 5 | 2.2% |
남구 | 5 | 2.2% |
북구 | 4 | 1.7% |
강서구 | 2 | 0.9% |
고성군 | 2 | 0.9% |
곡성군 | 1 | 0.4% |
화순군 | 1 | 0.4% |
보성군 | 1 | 0.4% |
Other values (196) | 196 |
Most occurring characters
Value | Count | Frequency (%) |
군 | 85 | 12.6% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (123) | 312 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 671 | |
Open Punctuation | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
군 | 85 | 12.7% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (121) | 310 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 671 | |
Common | 2 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
군 | 85 | 12.7% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (121) | 310 |
Common
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 671 | |
ASCII | 2 | 0.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
군 | 85 | 12.7% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (121) | 310 |
ASCII
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
발생건수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 125 |
---|---|
Distinct (%) | 54.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 65.641921 |
Minimum | 1 |
---|---|
Maximum | 329 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 16 |
median | 45 |
Q3 | 93 |
95-th percentile | 201.4 |
Maximum | 329 |
Range | 328 |
Interquartile range (IQR) | 77 |
Descriptive statistics
Standard deviation | 65.670384 |
---|---|
Coefficient of variation (CV) | 1.0004336 |
Kurtosis | 2.5762989 |
Mean | 65.641921 |
Median Absolute Deviation (MAD) | 32 |
Skewness | 1.611056 |
Sum | 15032 |
Variance | 4312.5993 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 9 | 3.9% |
11 | 7 | 3.1% |
13 | 6 | 2.6% |
27 | 5 | 2.2% |
15 | 5 | 2.2% |
34 | 4 | 1.7% |
1 | 4 | 1.7% |
20 | 4 | 1.7% |
60 | 4 | 1.7% |
12 | 4 | 1.7% |
Other values (115) | 177 |
Value | Count | Frequency (%) |
1 | 4 | |
2 | 3 | 1.3% |
3 | 3 | 1.3% |
4 | 1 | 0.4% |
5 | 2 | 0.9% |
6 | 3 | 1.3% |
7 | 3 | 1.3% |
8 | 1 | 0.4% |
9 | 3 | 1.3% |
10 | 9 |
Value | Count | Frequency (%) |
329 | 1 | |
328 | 1 | |
282 | 1 | |
281 | 1 | |
246 | 1 | |
244 | 1 | |
233 | 1 | |
231 | 1 | |
228 | 1 | |
224 | 1 |
사망자수
Real number (ℝ)
ZEROS
 
Distinct | 12 |
---|---|
Distinct (%) | 5.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.790393 |
Minimum | 0 |
---|---|
Maximum | 13 |
Zeros | 56 |
Zeros (%) | 24.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 13 |
Range | 13 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 2.0043143 |
---|---|
Coefficient of variation (CV) | 1.1194829 |
Kurtosis | 8.4568361 |
Mean | 1.790393 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.4277835 |
Sum | 410 |
Variance | 4.0172757 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 73 | |
0 | 56 | |
2 | 46 | |
3 | 24 | 10.5% |
4 | 12 | 5.2% |
5 | 8 | 3.5% |
6 | 4 | 1.7% |
11 | 2 | 0.9% |
13 | 1 | 0.4% |
10 | 1 | 0.4% |
Other values (2) | 2 | 0.9% |
Value | Count | Frequency (%) |
0 | 56 | |
1 | 73 | |
2 | 46 | |
3 | 24 | 10.5% |
4 | 12 | 5.2% |
5 | 8 | 3.5% |
6 | 4 | 1.7% |
7 | 1 | 0.4% |
9 | 1 | 0.4% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
13 | 1 | 0.4% |
11 | 2 | 0.9% |
10 | 1 | 0.4% |
9 | 1 | 0.4% |
7 | 1 | 0.4% |
6 | 4 | 1.7% |
5 | 8 | 3.5% |
4 | 12 | 5.2% |
3 | 24 | |
2 | 46 |
부상자수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 136 |
---|---|
Distinct (%) | 59.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 81.31441 |
Minimum | 0 |
---|---|
Maximum | 397 |
Zeros | 1 |
Zeros (%) | 0.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5 |
Q1 | 19 |
median | 51 |
Q3 | 117 |
95-th percentile | 258.6 |
Maximum | 397 |
Range | 397 |
Interquartile range (IQR) | 98 |
Descriptive statistics
Standard deviation | 83.191107 |
---|---|
Coefficient of variation (CV) | 1.0230795 |
Kurtosis | 2.1191159 |
Mean | 81.31441 |
Median Absolute Deviation (MAD) | 38 |
Skewness | 1.5437767 |
Sum | 18621 |
Variance | 6920.7604 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16 | 6 | 2.6% |
17 | 6 | 2.6% |
13 | 6 | 2.6% |
19 | 5 | 2.2% |
21 | 4 | 1.7% |
1 | 4 | 1.7% |
5 | 4 | 1.7% |
12 | 4 | 1.7% |
14 | 4 | 1.7% |
34 | 4 | 1.7% |
Other values (126) | 182 |
Value | Count | Frequency (%) |
0 | 1 | 0.4% |
1 | 4 | |
2 | 2 | |
3 | 2 | |
4 | 1 | 0.4% |
5 | 4 | |
6 | 1 | 0.4% |
7 | 2 | |
8 | 3 | |
9 | 3 |
Value | Count | Frequency (%) |
397 | 1 | |
386 | 1 | |
360 | 1 | |
344 | 1 | |
314 | 1 | |
310 | 1 | |
304 | 1 | |
301 | 1 | |
291 | 1 | |
280 | 1 |
중상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 63 |
---|---|
Distinct (%) | 27.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 22.620087 |
Minimum | 0 |
---|---|
Maximum | 115 |
Zeros | 4 |
Zeros (%) | 1.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 7 |
median | 17 |
Q3 | 32 |
95-th percentile | 64.6 |
Maximum | 115 |
Range | 115 |
Interquartile range (IQR) | 25 |
Descriptive statistics
Standard deviation | 20.467341 |
---|---|
Coefficient of variation (CV) | 0.90483033 |
Kurtosis | 2.515856 |
Mean | 22.620087 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 1.4981394 |
Sum | 5180 |
Variance | 418.91205 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 12 | 5.2% |
7 | 9 | 3.9% |
6 | 9 | 3.9% |
5 | 9 | 3.9% |
10 | 9 | 3.9% |
29 | 8 | 3.5% |
3 | 7 | 3.1% |
19 | 7 | 3.1% |
14 | 7 | 3.1% |
9 | 7 | 3.1% |
Other values (53) | 145 |
Value | Count | Frequency (%) |
0 | 4 | 1.7% |
1 | 6 | |
2 | 6 | |
3 | 7 | |
4 | 12 | |
5 | 9 | |
6 | 9 | |
7 | 9 | |
8 | 5 | |
9 | 7 |
Value | Count | Frequency (%) |
115 | 1 | |
95 | 1 | |
87 | 1 | |
85 | 1 | |
79 | 2 | |
76 | 1 | |
74 | 1 | |
73 | 1 | |
71 | 2 | |
67 | 1 |
경상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 103 |
---|---|
Distinct (%) | 45.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 45.650655 |
Minimum | 0 |
---|---|
Maximum | 231 |
Zeros | 6 |
Zeros (%) | 2.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 10 |
median | 28 |
Q3 | 64 |
95-th percentile | 157 |
Maximum | 231 |
Range | 231 |
Interquartile range (IQR) | 54 |
Descriptive statistics
Standard deviation | 49.400088 |
---|---|
Coefficient of variation (CV) | 1.0821332 |
Kurtosis | 2.7081058 |
Mean | 45.650655 |
Median Absolute Deviation (MAD) | 22 |
Skewness | 1.6813796 |
Sum | 10454 |
Variance | 2440.3687 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 10 | 4.4% |
3 | 9 | 3.9% |
4 | 8 | 3.5% |
12 | 7 | 3.1% |
8 | 7 | 3.1% |
25 | 6 | 2.6% |
0 | 6 | 2.6% |
13 | 5 | 2.2% |
5 | 5 | 2.2% |
1 | 5 | 2.2% |
Other values (93) | 161 |
Value | Count | Frequency (%) |
0 | 6 | |
1 | 5 | |
2 | 3 | 1.3% |
3 | 9 | |
4 | 8 | |
5 | 5 | |
6 | 4 | |
7 | 4 | |
8 | 7 | |
9 | 4 |
Value | Count | Frequency (%) |
231 | 1 | |
229 | 1 | |
228 | 1 | |
207 | 1 | |
201 | 1 | |
192 | 1 | |
179 | 1 | |
175 | 1 | |
173 | 1 | |
166 | 1 |
부상신고
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 51 |
---|---|
Distinct (%) | 22.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.043668 |
Minimum | 0 |
---|---|
Maximum | 109 |
Zeros | 42 |
Zeros (%) | 18.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 5 |
Q3 | 18 |
95-th percentile | 48 |
Maximum | 109 |
Range | 109 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 18.357269 |
---|---|
Coefficient of variation (CV) | 1.4073701 |
Kurtosis | 6.5104425 |
Mean | 13.043668 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 2.2851343 |
Sum | 2987 |
Variance | 336.98931 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 42 | |
1 | 33 | |
4 | 17 | 7.4% |
3 | 10 | 4.4% |
6 | 10 | 4.4% |
5 | 9 | 3.9% |
8 | 9 | 3.9% |
2 | 8 | 3.5% |
12 | 5 | 2.2% |
10 | 5 | 2.2% |
Other values (41) | 81 |
Value | Count | Frequency (%) |
0 | 42 | |
1 | 33 | |
2 | 8 | 3.5% |
3 | 10 | 4.4% |
4 | 17 | |
5 | 9 | 3.9% |
6 | 10 | 4.4% |
7 | 3 | 1.3% |
8 | 9 | 3.9% |
9 | 1 | 0.4% |
Value | Count | Frequency (%) |
109 | 1 | |
104 | 1 | |
88 | 1 | |
65 | 2 | |
63 | 1 | |
60 | 1 | |
58 | 2 | |
56 | 1 | |
53 | 1 | |
48 | 2 |
시도 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|
시도 | 1.000 | 0.506 | 0.127 | 0.490 | 0.403 | 0.488 | 0.538 |
발생건수 | 0.506 | 1.000 | 0.757 | 0.985 | 0.948 | 0.946 | 0.755 |
사망자수 | 0.127 | 0.757 | 1.000 | 0.709 | 0.838 | 0.732 | 0.409 |
부상자수 | 0.490 | 0.985 | 0.709 | 1.000 | 0.928 | 0.961 | 0.767 |
중상 | 0.403 | 0.948 | 0.838 | 0.928 | 1.000 | 0.922 | 0.678 |
경상 | 0.488 | 0.946 | 0.732 | 0.961 | 0.922 | 1.000 | 0.664 |
부상신고 | 0.538 | 0.755 | 0.409 | 0.767 | 0.678 | 0.664 | 1.000 |
발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | 시도 | |
---|---|---|---|---|---|---|---|
발생건수 | 1.000 | 0.385 | 0.996 | 0.959 | 0.976 | 0.865 | 0.219 |
사망자수 | 0.385 | 1.000 | 0.361 | 0.395 | 0.359 | 0.236 | 0.045 |
부상자수 | 0.996 | 0.361 | 1.000 | 0.956 | 0.983 | 0.865 | 0.210 |
중상 | 0.959 | 0.395 | 0.956 | 1.000 | 0.913 | 0.790 | 0.169 |
경상 | 0.976 | 0.359 | 0.983 | 0.913 | 1.000 | 0.805 | 0.209 |
부상신고 | 0.865 | 0.236 | 0.865 | 0.790 | 0.805 | 1.000 | 0.253 |
시도 | 0.219 | 0.045 | 0.210 | 0.169 | 0.209 | 0.253 | 1.000 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
0 | 서울 | 종로구 | 93 | 1 | 114 | 28 | 52 | 34 |
1 | 서울 | 중구 | 72 | 1 | 87 | 33 | 38 | 16 |
2 | 서울 | 용산구 | 103 | 3 | 118 | 38 | 66 | 14 |
3 | 서울 | 성동구 | 126 | 3 | 150 | 40 | 93 | 17 |
4 | 서울 | 동대문구 | 198 | 1 | 229 | 61 | 137 | 31 |
5 | 서울 | 성북구 | 164 | 3 | 196 | 50 | 104 | 42 |
6 | 서울 | 도봉구 | 79 | 0 | 89 | 26 | 39 | 24 |
7 | 서울 | 은평구 | 96 | 1 | 126 | 23 | 67 | 36 |
8 | 서울 | 서대문구 | 104 | 1 | 121 | 21 | 63 | 37 |
9 | 서울 | 마포구 | 99 | 2 | 129 | 32 | 78 | 19 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
219 | 대전 | 중구 | 29 | 1 | 43 | 10 | 28 | 5 |
220 | 대전 | 서구 | 92 | 3 | 118 | 25 | 85 | 8 |
221 | 대전 | 유성구 | 68 | 1 | 98 | 26 | 69 | 3 |
222 | 대전 | 대덕구 | 27 | 0 | 35 | 7 | 21 | 7 |
223 | 울산 | 중구 | 82 | 0 | 110 | 29 | 39 | 42 |
224 | 울산 | 남구 | 77 | 2 | 90 | 26 | 54 | 10 |
225 | 울산 | 동구 | 112 | 3 | 136 | 49 | 86 | 1 |
226 | 울산 | 북구 | 60 | 2 | 78 | 16 | 43 | 19 |
227 | 울산 | 울주군 | 53 | 2 | 62 | 29 | 27 | 6 |
228 | 세종 | 세종 | 63 | 4 | 75 | 29 | 41 | 5 |