Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 229 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 15.8 KiB |
Average record size in memory | 70.6 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 6 |
Dataset
Description | * 부문별 사망 교통사고(2018) |
---|---|
Author | 도로교통공단 |
URL | https://www.data.go.kr/data/15094168/fileData.do |
발생건수 is highly overall correlated with 사망자수 and 3 other fields | High correlation |
사망자수 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
부상자수 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
중상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
경상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
부상자수 has 29 (12.7%) zeros | Zeros |
중상 has 54 (23.6%) zeros | Zeros |
경상 has 57 (24.9%) zeros | Zeros |
부상신고 has 182 (79.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 21:57:51.671416 |
---|---|
Analysis finished | 2023-12-12 21:57:55.749437 |
Duration | 4.08 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시도
Categorical
Distinct | 17 |
---|---|
Distinct (%) | 7.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
경기 | |
---|---|
서울 | |
경북 | |
전남 | |
강원 | |
Other values (12) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
경기 | 31 | |
서울 | 25 | |
경북 | 23 | |
전남 | 22 | |
강원 | 18 | |
경남 | 18 | |
부산 | 16 | |
충남 | 15 | |
전북 | 14 | 6.1% |
충북 | 11 | 4.8% |
Other values (7) | 36 |
Length
Value | Count | Frequency (%) |
경기 | 31 | |
서울 | 25 | |
경북 | 23 | |
전남 | 22 | |
강원 | 18 | |
경남 | 18 | |
부산 | 16 | |
충남 | 15 | |
전북 | 14 | 6.1% |
충북 | 11 | 4.8% |
Other values (7) | 36 |
시군구
Text
Distinct | 206 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
Value | Count | Frequency (%) |
중구 | 6 | 2.6% |
동구 | 6 | 2.6% |
서구 | 5 | 2.2% |
남구 | 5 | 2.2% |
북구 | 4 | 1.7% |
강서구 | 2 | 0.9% |
고성군 | 2 | 0.9% |
곡성군 | 1 | 0.4% |
화순군 | 1 | 0.4% |
보성군 | 1 | 0.4% |
Other values (196) | 196 |
Most occurring characters
Value | Count | Frequency (%) |
군 | 85 | 12.6% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (123) | 312 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 671 | |
Open Punctuation | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
군 | 85 | 12.7% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (121) | 310 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 671 | |
Common | 2 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
군 | 85 | 12.7% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (121) | 310 |
Common
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 671 | |
ASCII | 2 | 0.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
군 | 85 | 12.7% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
서 | 13 | 1.9% |
Other values (121) | 310 |
ASCII
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
발생건수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 44 |
---|---|
Distinct (%) | 19.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.969432 |
Minimum | 1 |
---|---|
Maximum | 84 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 9 |
median | 13 |
Q3 | 20 |
95-th percentile | 39 |
Maximum | 84 |
Range | 83 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 11.813386 |
---|---|
Coefficient of variation (CV) | 0.73974988 |
Kurtosis | 6.0728713 |
Mean | 15.969432 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 1.9814265 |
Sum | 3657 |
Variance | 139.55608 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 17 | 7.4% |
6 | 16 | 7.0% |
13 | 14 | 6.1% |
10 | 12 | 5.2% |
9 | 12 | 5.2% |
5 | 11 | 4.8% |
14 | 11 | 4.8% |
12 | 10 | 4.4% |
17 | 10 | 4.4% |
8 | 9 | 3.9% |
Other values (34) | 107 |
Value | Count | Frequency (%) |
1 | 5 | 2.2% |
2 | 6 | 2.6% |
3 | 5 | 2.2% |
5 | 11 | |
6 | 16 | |
7 | 4 | 1.7% |
8 | 9 | |
9 | 12 | |
10 | 12 | |
11 | 17 |
Value | Count | Frequency (%) |
84 | 1 | |
64 | 1 | |
57 | 1 | |
52 | 1 | |
51 | 1 | |
48 | 1 | |
42 | 2 | |
41 | 2 | |
40 | 1 | |
39 | 2 |
사망자수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 46 |
---|---|
Distinct (%) | 20.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.510917 |
Minimum | 1 |
---|---|
Maximum | 86 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 9 |
median | 14 |
Q3 | 20 |
95-th percentile | 39.6 |
Maximum | 86 |
Range | 85 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 12.156729 |
---|---|
Coefficient of variation (CV) | 0.73628431 |
Kurtosis | 5.8153256 |
Mean | 16.510917 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.9326029 |
Sum | 3781 |
Variance | 147.78606 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 20 | 8.7% |
6 | 15 | 6.6% |
14 | 14 | 6.1% |
10 | 12 | 5.2% |
13 | 12 | 5.2% |
5 | 10 | 4.4% |
9 | 10 | 4.4% |
15 | 9 | 3.9% |
20 | 9 | 3.9% |
17 | 8 | 3.5% |
Other values (36) | 110 |
Value | Count | Frequency (%) |
1 | 5 | 2.2% |
2 | 6 | 2.6% |
3 | 5 | 2.2% |
5 | 10 | |
6 | 15 | |
7 | 4 | 1.7% |
8 | 8 | 3.5% |
9 | 10 | |
10 | 12 | |
11 | 20 |
Value | Count | Frequency (%) |
86 | 1 | |
64 | 1 | |
59 | 1 | |
54 | 1 | |
53 | 1 | |
51 | 1 | |
45 | 1 | |
42 | 2 | |
41 | 2 | |
40 | 1 |
부상자수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 35 |
---|---|
Distinct (%) | 15.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.3624454 |
Minimum | 0 |
---|---|
Maximum | 53 |
Zeros | 29 |
Zeros (%) | 12.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 5 |
Q3 | 11 |
95-th percentile | 28 |
Maximum | 53 |
Range | 53 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 9.3463843 |
---|---|
Coefficient of variation (CV) | 1.1176616 |
Kurtosis | 4.0384874 |
Mean | 8.3624454 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 1.8621054 |
Sum | 1915 |
Variance | 87.354899 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 29 | |
3 | 25 | 10.9% |
1 | 24 | 10.5% |
4 | 20 | 8.7% |
10 | 14 | 6.1% |
2 | 13 | 5.7% |
8 | 13 | 5.7% |
7 | 11 | 4.8% |
5 | 8 | 3.5% |
6 | 8 | 3.5% |
Other values (25) | 64 |
Value | Count | Frequency (%) |
0 | 29 | |
1 | 24 | |
2 | 13 | |
3 | 25 | |
4 | 20 | |
5 | 8 | 3.5% |
6 | 8 | 3.5% |
7 | 11 | 4.8% |
8 | 13 | |
9 | 5 | 2.2% |
Value | Count | Frequency (%) |
53 | 1 | |
46 | 1 | |
43 | 1 | |
42 | 1 | |
33 | 2 | |
32 | 1 | |
31 | 1 | |
30 | 1 | |
29 | 2 | |
28 | 2 |
중상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 21 |
---|---|
Distinct (%) | 9.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.5196507 |
Minimum | 0 |
---|---|
Maximum | 21 |
Zeros | 54 |
Zeros (%) | 23.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 5 |
95-th percentile | 12 |
Maximum | 21 |
Range | 21 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 4.0192309 |
---|---|
Coefficient of variation (CV) | 1.1419403 |
Kurtosis | 3.5072183 |
Mean | 3.5196507 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.7800639 |
Sum | 806 |
Variance | 16.154217 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 54 | |
1 | 37 | |
2 | 31 | |
3 | 23 | |
4 | 20 | 8.7% |
6 | 15 | 6.6% |
5 | 10 | 4.4% |
8 | 8 | 3.5% |
7 | 8 | 3.5% |
9 | 4 | 1.7% |
Other values (11) | 19 | 8.3% |
Value | Count | Frequency (%) |
0 | 54 | |
1 | 37 | |
2 | 31 | |
3 | 23 | |
4 | 20 | 8.7% |
5 | 10 | 4.4% |
6 | 15 | 6.6% |
7 | 8 | 3.5% |
8 | 8 | 3.5% |
9 | 4 | 1.7% |
Value | Count | Frequency (%) |
21 | 1 | 0.4% |
19 | 1 | 0.4% |
18 | 1 | 0.4% |
17 | 2 | |
16 | 1 | 0.4% |
15 | 1 | 0.4% |
14 | 1 | 0.4% |
13 | 2 | |
12 | 3 | |
11 | 3 |
경상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 23 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.4323144 |
Minimum | 0 |
---|---|
Maximum | 34 |
Zeros | 57 |
Zeros (%) | 24.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 6 |
95-th percentile | 14.6 |
Maximum | 34 |
Range | 34 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 5.8422992 |
---|---|
Coefficient of variation (CV) | 1.3181148 |
Kurtosis | 6.3433578 |
Mean | 4.4323144 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 2.2548477 |
Sum | 1015 |
Variance | 34.13246 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 57 | |
1 | 36 | |
2 | 26 | |
3 | 25 | |
4 | 16 | 7.0% |
7 | 9 | 3.9% |
6 | 8 | 3.5% |
10 | 8 | 3.5% |
13 | 7 | 3.1% |
9 | 6 | 2.6% |
Other values (13) | 31 |
Value | Count | Frequency (%) |
0 | 57 | |
1 | 36 | |
2 | 26 | |
3 | 25 | |
4 | 16 | 7.0% |
5 | 5 | 2.2% |
6 | 8 | 3.5% |
7 | 9 | 3.9% |
8 | 4 | 1.7% |
9 | 6 | 2.6% |
Value | Count | Frequency (%) |
34 | 1 | 0.4% |
32 | 1 | 0.4% |
30 | 1 | 0.4% |
23 | 2 | 0.9% |
21 | 2 | 0.9% |
20 | 1 | 0.4% |
19 | 1 | 0.4% |
15 | 3 | |
14 | 4 | |
13 | 7 |
부상신고
Real number (ℝ)
ZEROS
 
Distinct | 8 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.41048035 |
Minimum | 0 |
---|---|
Maximum | 10 |
Zeros | 182 |
Zeros (%) | 79.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 10 |
Range | 10 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.1344193 |
---|---|
Coefficient of variation (CV) | 2.7636386 |
Kurtosis | 28.191592 |
Mean | 0.41048035 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.6335678 |
Sum | 94 |
Variance | 1.2869072 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 182 | |
1 | 26 | 11.4% |
2 | 13 | 5.7% |
4 | 3 | 1.3% |
6 | 2 | 0.9% |
3 | 1 | 0.4% |
5 | 1 | 0.4% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
0 | 182 | |
1 | 26 | 11.4% |
2 | 13 | 5.7% |
3 | 1 | 0.4% |
4 | 3 | 1.3% |
5 | 1 | 0.4% |
6 | 2 | 0.9% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
10 | 1 | 0.4% |
6 | 2 | 0.9% |
5 | 1 | 0.4% |
4 | 3 | 1.3% |
3 | 1 | 0.4% |
2 | 13 | 5.7% |
1 | 26 | 11.4% |
0 | 182 |
시도 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|
시도 | 1.000 | 0.322 | 0.315 | 0.240 | 0.370 | 0.296 | 0.000 |
발생건수 | 0.322 | 1.000 | 0.998 | 0.735 | 0.610 | 0.863 | 0.683 |
사망자수 | 0.315 | 0.998 | 1.000 | 0.746 | 0.637 | 0.857 | 0.570 |
부상자수 | 0.240 | 0.735 | 0.746 | 1.000 | 0.912 | 0.890 | 0.580 |
중상 | 0.370 | 0.610 | 0.637 | 0.912 | 1.000 | 0.723 | 0.395 |
경상 | 0.296 | 0.863 | 0.857 | 0.890 | 0.723 | 1.000 | 0.475 |
부상신고 | 0.000 | 0.683 | 0.570 | 0.580 | 0.395 | 0.475 | 1.000 |
발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | 시도 | |
---|---|---|---|---|---|---|---|
발생건수 | 1.000 | 0.995 | 0.741 | 0.633 | 0.694 | 0.290 | 0.131 |
사망자수 | 0.995 | 1.000 | 0.750 | 0.652 | 0.693 | 0.287 | 0.133 |
부상자수 | 0.741 | 0.750 | 1.000 | 0.891 | 0.899 | 0.358 | 0.092 |
중상 | 0.633 | 0.652 | 0.891 | 1.000 | 0.648 | 0.225 | 0.150 |
경상 | 0.694 | 0.693 | 0.899 | 0.648 | 1.000 | 0.252 | 0.119 |
부상신고 | 0.290 | 0.287 | 0.358 | 0.225 | 0.252 | 1.000 | 0.000 |
시도 | 0.131 | 0.133 | 0.092 | 0.150 | 0.119 | 0.000 | 1.000 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
0 | 서울 | 종로구 | 6 | 6 | 2 | 0 | 2 | 0 |
1 | 서울 | 중구 | 9 | 9 | 3 | 2 | 0 | 1 |
2 | 서울 | 용산구 | 12 | 14 | 1 | 1 | 0 | 0 |
3 | 서울 | 성동구 | 9 | 9 | 1 | 0 | 1 | 0 |
4 | 서울 | 동대문구 | 14 | 14 | 0 | 0 | 0 | 0 |
5 | 서울 | 성북구 | 14 | 14 | 3 | 2 | 1 | 0 |
6 | 서울 | 도봉구 | 6 | 6 | 0 | 0 | 0 | 0 |
7 | 서울 | 은평구 | 12 | 12 | 1 | 1 | 0 | 0 |
8 | 서울 | 서대문구 | 5 | 5 | 0 | 0 | 0 | 0 |
9 | 서울 | 마포구 | 11 | 11 | 1 | 1 | 0 | 0 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
219 | 대전 | 중구 | 12 | 12 | 2 | 0 | 2 | 0 |
220 | 대전 | 서구 | 22 | 22 | 11 | 3 | 7 | 1 |
221 | 대전 | 유성구 | 21 | 22 | 8 | 2 | 6 | 0 |
222 | 대전 | 대덕구 | 13 | 13 | 3 | 0 | 3 | 0 |
223 | 울산 | 중구 | 10 | 10 | 1 | 1 | 0 | 0 |
224 | 울산 | 남구 | 23 | 23 | 17 | 4 | 13 | 0 |
225 | 울산 | 동구 | 6 | 6 | 0 | 0 | 0 | 0 |
226 | 울산 | 북구 | 15 | 16 | 43 | 11 | 32 | 0 |
227 | 울산 | 울주군 | 23 | 24 | 24 | 11 | 12 | 1 |
228 | 세종 | 세종 | 17 | 20 | 3 | 2 | 1 | 0 |