Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 158 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 10.9 KiB |
Average record size in memory | 70.8 B |
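The two memory figures can be cross-checked against the row count. The sketch below assumes the average record size is simply the total size divided by the number of observations; the report does not state the formula, so treat this as a plausibility check only.

```python
# Figures copied from the Dataset statistics table.
N_OBSERVATIONS = 158
AVG_RECORD_BYTES = 70.8  # reported value, already rounded

# Assumption: average record size = total size in memory / number of observations.
total_kib = AVG_RECORD_BYTES * N_OBSERVATIONS / 1024
print(f"Implied total size: {total_kib:.1f} KiB")
```

Under that assumption the implied total agrees with the reported 10.9 KiB to one decimal place.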
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 6 |
Dataset
Description | Highway traffic accidents by category (2018) (부문별 고속도로 교통사고(2018)) |
---|---|
Author | Korea Road Traffic Authority (도로교통공단) |
URL | https://www.data.go.kr/data/15094161/fileData.do |
Alerts
발생건수 is highly overall correlated with 사망자수 and 4 other fields | High correlation |
사망자수 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
부상자수 is highly overall correlated with 발생건수 and 4 other fields | High correlation |
중상 is highly overall correlated with 발생건수 and 4 other fields | High correlation |
경상 is highly overall correlated with 발생건수 and 4 other fields | High correlation |
부상신고 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
사망자수 has 60 (38.0%) zeros | Zeros |
중상 has 19 (12.0%) zeros | Zeros |
경상 has 6 (3.8%) zeros | Zeros |
부상신고 has 71 (44.9%) zeros | Zeros |
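The zero percentages in these alerts follow directly from the zero counts and the 158 observations. A minimal sketch that recomputes them, using only figures already present in the report:

```python
# Zero counts copied from the alerts; 158 is the reported number of observations.
N_OBSERVATIONS = 158
zero_counts = {"사망자수": 60, "중상": 19, "경상": 6, "부상신고": 71}

# Share of rows that are zero, as a percentage rounded to one decimal place.
zero_share = {
    name: round(100 * count / N_OBSERVATIONS, 1)
    for name, count in zero_counts.items()
}

for name, pct in zero_share.items():
    print(f"{name}: {pct}% zeros")
```

The recomputed shares match the reported 38.0%, 12.0%, 3.8% and 44.9%.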
Reproduction
Analysis started | 2024-04-20 14:55:13.443436 |
---|---|
Analysis finished | 2024-04-20 14:55:22.475848 |
Duration | 9.03 seconds |
Software version | ydata-profiling v4.5.1 |
시도 (province / metropolitan city)
Categorical
Distinct | 16 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 2 |
---|---|
Unique (%) | 1.3% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
경기 | 27 | 17.1% |
경북 | 17 | 10.8% |
전남 | 14 | 8.9% |
경남 | 14 | 8.9% |
충남 | 13 | 8.2% |
전북 | 13 | 8.2% |
충북 | 11 | 7.0% |
서울 | 10 | 6.3% |
강원 | 10 | 6.3% |
인천 | 8 | 5.1% |
Other values (6) | 21 | 13.3% |
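Several frequency cells in the exported table are blank, but each one is just the count divided by the 158 observations. A short sketch that derives the full frequency column from the counts listed above:

```python
# Counts copied from the Common Values table for 시도.
counts = {
    "경기": 27, "경북": 17, "전남": 14, "경남": 14, "충남": 13, "전북": 13,
    "충북": 11, "서울": 10, "강원": 10, "인천": 8, "Other values (6)": 21,
}

total = sum(counts.values())  # should equal the number of observations
frequency = {name: round(100 * c / total, 1) for name, c in counts.items()}

for name, pct in frequency.items():
    print(f"{name} | {counts[name]} | {pct}% |")
```

The counts sum to 158, and the derived values agree with the percentages the report does show (6.3% for 서울 and 강원, 5.1% for 인천).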
시군구 (city / county / district)
Text
Distinct | 150 |
---|---|
Distinct (%) | 94.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Value | Count | Frequency (%) |
동구 | 3 | 1.9% |
서구 | 3 | 1.9% |
북구 | 3 | 1.9% |
강서구 | 2 | 1.3% |
중구 | 2 | 1.3% |
함평군 | 1 | 0.6% |
영천시 | 1 | 0.6% |
영광군 | 1 | 0.6% |
구례군 | 1 | 0.6% |
보성군 | 1 | 0.6% |
Other values (140) | 140 | 88.6% |
Most occurring characters
Value | Count | Frequency (%) |
시 | 68 | 14.5% |
군 | 58 | 12.4% |
구 | 38 | 8.1% |
주 | 18 | 3.8% |
천 | 17 | 3.6% |
양 | 16 | 3.4% |
성 | 16 | 3.4% |
산 | 13 | 2.8% |
동 | 9 | 1.9% |
서 | 9 | 1.9% |
Other values (101) | 206 | 44.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 466 | 99.6% |
Close Punctuation | 1 | 0.2% |
Open Punctuation | 1 | 0.2% |
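The category names in this table correspond to Unicode general categories ("Other Letter" is `Lo`, "Open Punctuation" is `Ps`, "Close Punctuation" is `Pe`). The sketch below shows how such counts can be obtained with Python's standard `unicodedata` module. It is an illustration, not the profiler's own code, and the string `(가상)` is a made-up value used only to exercise the punctuation categories.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters by Unicode general category (e.g. 'Lo', 'Ps', 'Pe')."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

# Two 시군구 values from the sample rows plus one hypothetical string with parentheses.
demo = category_counts(["성북구", "울주군", "(가상)"])
print(demo)

# Totals reported above: 466 letters + 1 ")" + 1 "(" = 468 characters.
total_chars = 466 + 1 + 1
share_of_all = round(100 * 68 / total_chars, 1)  # "시" among all characters
share_of_letters = round(100 * 68 / 466, 1)      # "시" among Other Letter only
print(share_of_all, share_of_letters)
```

The two shares explain why 시 is listed at 14.5% in the character table but 14.6% in the per-category tables: the denominator changes from 468 to 466.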
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 68 | 14.6% |
군 | 58 | 12.4% |
구 | 38 | 8.2% |
주 | 18 | 3.9% |
천 | 17 | 3.6% |
양 | 16 | 3.4% |
성 | 16 | 3.4% |
산 | 13 | 2.8% |
동 | 9 | 1.9% |
서 | 9 | 1.9% |
Other values (99) | 204 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 466 | 99.6% |
Common | 2 | 0.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 68 | 14.6% |
군 | 58 | 12.4% |
구 | 38 | 8.2% |
주 | 18 | 3.9% |
천 | 17 | 3.6% |
양 | 16 | 3.4% |
성 | 16 | 3.4% |
산 | 13 | 2.8% |
동 | 9 | 1.9% |
서 | 9 | 1.9% |
Other values (99) | 204 |
Common
Value | Count | Frequency (%) |
) | 1 | |
( | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 466 | 99.6% |
ASCII | 2 | 0.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 68 | 14.6% |
군 | 58 | 12.4% |
구 | 38 | 8.2% |
주 | 18 | 3.9% |
천 | 17 | 3.6% |
양 | 16 | 3.4% |
성 | 16 | 3.4% |
산 | 13 | 2.8% |
동 | 9 | 1.9% |
서 | 9 | 1.9% |
Other values (99) | 204 |
ASCII
Value | Count | Frequency (%) |
) | 1 | |
( | 1 |
발생건수 (number of accidents)
Real number (ℝ)
Alerts: High correlation
Distinct | 63 |
---|---|
Distinct (%) | 39.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25.816456 |
Minimum | 1 |
---|---|
Maximum | 317 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4.25 |
median | 13.5 |
Q3 | 30.75 |
95-th percentile | 93.3 |
Maximum | 317 |
Range | 316 |
Interquartile range (IQR) | 26.5 |
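The range and IQR rows are derived from the other quantiles. The sketch below recomputes them for 발생건수 and applies the common Tukey rule (Q3 + 1.5 × IQR) as an outlier fence. The fence is a convention chosen here for illustration, not something the report computes.

```python
# Quantile statistics for 발생건수, copied from the table above.
minimum, q1, q3, maximum = 1, 4.25, 30.75, 317

value_range = maximum - minimum
iqr = q3 - q1

# Assumption: Tukey's rule, a common convention for flagging high outliers.
upper_fence = q3 + 1.5 * iqr

# The ten largest 발생건수 values listed in the report.
largest = [317, 197, 152, 118, 110, 105, 102, 95, 93, 88]
above_fence = [v for v in largest if v > upper_fence]

print(value_range, iqr, upper_fence, len(above_fence))
```

Under this rule all ten of the largest listed values lie above the fence, which is consistent with the strong right skew reported for this variable.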
Descriptive statistics
Standard deviation | 38.473556 |
---|---|
Coefficient of variation (CV) | 1.4902726 |
Kurtosis | 22.989331 |
Mean | 25.816456 |
Median Absolute Deviation (MAD) | 10.5 |
Skewness | 3.9891947 |
Sum | 4079 |
Variance | 1480.2145 |
Monotonicity | Not monotonic |
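Several of these descriptive statistics are tied together by simple identities: mean = sum / n, variance = standard deviation squared, and CV = standard deviation / mean. A small sketch that checks the 발생건수 figures against those identities, using only values from the report:

```python
# Values for 발생건수 copied from the report.
n = 158
total = 4079
std = 38.473556

mean = total / n       # reported: 25.816456
variance = std ** 2    # reported: 1480.2145
cv = std / mean        # reported: 1.4902726

print(f"mean={mean:.6f} variance={variance:.4f} cv={cv:.7f}")
```

Small differences in the last digit are possible because the inputs are themselves rounded.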
Common Values
Value | Count | Frequency (%) |
1 | 14 | 8.9% |
2 | 11 | 7.0% |
3 | 10 | 6.3% |
5 | 7 | 4.4% |
8 | 6 | 3.8% |
11 | 6 | 3.8% |
16 | 5 | 3.2% |
9 | 5 | 3.2% |
4 | 5 | 3.2% |
17 | 5 | 3.2% |
Other values (53) | 84 | 53.2% |
Minimum values
Value | Count | Frequency (%) |
1 | 14 | 8.9% |
2 | 11 | 7.0% |
3 | 10 | 6.3% |
4 | 5 | 3.2% |
5 | 7 | 4.4% |
6 | 4 | 2.5% |
7 | 4 | 2.5% |
8 | 6 | 3.8% |
9 | 5 | 3.2% |
10 | 4 | 2.5% |
Maximum values
Value | Count | Frequency (%) |
317 | 1 | 0.6% |
197 | 1 | 0.6% |
152 | 1 | 0.6% |
118 | 1 | 0.6% |
110 | 1 | 0.6% |
105 | 1 | 0.6% |
102 | 1 | 0.6% |
95 | 1 | 0.6% |
93 | 1 | 0.6% |
88 | 1 | 0.6% |
사망자수 (number of deaths)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 9 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.5949367 |
Minimum | 0 |
---|---|
Maximum | 8 |
Zeros | 60 |
Zeros (%) | 38.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 2 |
95-th percentile | 6 |
Maximum | 8 |
Range | 8 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.9353828 |
---|---|
Coefficient of variation (CV) | 1.2134543 |
Kurtosis | 1.9825869 |
Mean | 1.5949367 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.5226984 |
Sum | 252 |
Variance | 3.7457067 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 60 | 38.0% |
1 | 38 | 24.1% |
2 | 21 | 13.3% |
3 | 18 | 11.4% |
5 | 6 | 3.8% |
4 | 6 | 3.8% |
7 | 4 | 2.5% |
8 | 3 | 1.9% |
6 | 2 | 1.3% |
Minimum values
Value | Count | Frequency (%) |
0 | 60 | 38.0% |
1 | 38 | 24.1% |
2 | 21 | 13.3% |
3 | 18 | 11.4% |
4 | 6 | 3.8% |
5 | 6 | 3.8% |
6 | 2 | 1.3% |
7 | 4 | 2.5% |
8 | 3 | 1.9% |
Maximum values
Value | Count | Frequency (%) |
8 | 3 | 1.9% |
7 | 4 | 2.5% |
6 | 2 | 1.3% |
5 | 6 | 3.8% |
4 | 6 | 3.8% |
3 | 18 | 11.4% |
2 | 21 | 13.3% |
1 | 38 | 24.1% |
0 | 60 | 38.0% |
부상자수 (number of injured)
Real number (ℝ)
Alerts: High correlation
Distinct | 88 |
---|---|
Distinct (%) | 55.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56.411392 |
Minimum | 1 |
---|---|
Maximum | 760 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.85 |
Q1 | 8 |
median | 27 |
Q3 | 68.25 |
95-th percentile | 197.3 |
Maximum | 760 |
Range | 759 |
Interquartile range (IQR) | 60.25 |
Descriptive statistics
Standard deviation | 88.042269 |
---|---|
Coefficient of variation (CV) | 1.5607179 |
Kurtosis | 27.068975 |
Mean | 56.411392 |
Median Absolute Deviation (MAD) | 23 |
Skewness | 4.2865339 |
Sum | 8913 |
Variance | 7751.4411 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
2 | 9 | 5.7% |
1 | 8 | 5.1% |
5 | 7 | 4.4% |
4 | 7 | 4.4% |
26 | 5 | 3.2% |
8 | 4 | 2.5% |
17 | 4 | 2.5% |
10 | 4 | 2.5% |
3 | 4 | 2.5% |
51 | 4 | 2.5% |
Other values (78) | 102 | 64.6% |
Minimum values
Value | Count | Frequency (%) |
1 | 8 | 5.1% |
2 | 9 | 5.7% |
3 | 4 | 2.5% |
4 | 7 | 4.4% |
5 | 7 | 4.4% |
6 | 1 | 0.6% |
7 | 2 | 1.3% |
8 | 4 | 2.5% |
9 | 4 | 2.5% |
10 | 4 | 2.5% |
Maximum values
Value | Count | Frequency (%) |
760 | 1 | 0.6% |
368 | 1 | 0.6% |
342 | 1 | 0.6% |
315 | 1 | 0.6% |
281 | 1 | 0.6% |
270 | 1 | 0.6% |
234 | 1 | 0.6% |
199 | 1 | 0.6% |
197 | 1 | 0.6% |
180 | 1 | 0.6% |
중상 (serious injuries)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 39 |
---|---|
Distinct (%) | 24.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.335443 |
Minimum | 0 |
---|---|
Maximum | 138 |
Zeros | 19 |
Zeros (%) | 12.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1.25 |
median | 7 |
Q3 | 16.75 |
95-th percentile | 41.45 |
Maximum | 138 |
Range | 138 |
Interquartile range (IQR) | 15.5 |
Descriptive statistics
Standard deviation | 17.467572 |
---|---|
Coefficient of variation (CV) | 1.4160474 |
Kurtosis | 18.50577 |
Mean | 12.335443 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 3.5412445 |
Sum | 1949 |
Variance | 305.11606 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1 | 21 | 13.3% |
0 | 19 | 12.0% |
5 | 10 | 6.3% |
2 | 9 | 5.7% |
4 | 9 | 5.7% |
9 | 8 | 5.1% |
8 | 7 | 4.4% |
17 | 7 | 4.4% |
3 | 6 | 3.8% |
15 | 5 | 3.2% |
Other values (29) | 57 | 36.1% |
Minimum values
Value | Count | Frequency (%) |
0 | 19 | 12.0% |
1 | 21 | 13.3% |
2 | 9 | 5.7% |
3 | 6 | 3.8% |
4 | 9 | 5.7% |
5 | 10 | 6.3% |
6 | 4 | 2.5% |
7 | 2 | 1.3% |
8 | 7 | 4.4% |
9 | 8 | 5.1% |
Maximum values
Value | Count | Frequency (%) |
138 | 1 | 0.6% |
77 | 1 | 0.6% |
68 | 2 | 1.3% |
67 | 1 | 0.6% |
51 | 1 | 0.6% |
45 | 1 | 0.6% |
44 | 1 | 0.6% |
41 | 1 | 0.6% |
40 | 1 | 0.6% |
39 | 1 | 0.6% |
경상 (minor injuries)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 73 |
---|---|
Distinct (%) | 46.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38.829114 |
Minimum | 0 |
---|---|
Maximum | 494 |
Zeros | 6 |
Zeros (%) | 3.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 5.25 |
median | 19 |
Q3 | 48.75 |
95-th percentile | 131.55 |
Maximum | 494 |
Range | 494 |
Interquartile range (IQR) | 43.5 |
Descriptive statistics
Standard deviation | 59.648567 |
---|---|
Coefficient of variation (CV) | 1.5361815 |
Kurtosis | 22.986021 |
Mean | 38.829114 |
Median Absolute Deviation (MAD) | 16 |
Skewness | 3.9752081 |
Sum | 6135 |
Variance | 3557.9515 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1 | 9 | 5.7% |
3 | 7 | 4.4% |
2 | 7 | 4.4% |
0 | 6 | 3.8% |
8 | 6 | 3.8% |
5 | 6 | 3.8% |
11 | 5 | 3.2% |
7 | 5 | 3.2% |
4 | 5 | 3.2% |
32 | 4 | 2.5% |
Other values (63) | 98 | 62.0% |
Minimum values
Value | Count | Frequency (%) |
0 | 6 | 3.8% |
1 | 9 | 5.7% |
2 | 7 | 4.4% |
3 | 7 | 4.4% |
4 | 5 | 3.2% |
5 | 6 | 3.8% |
6 | 2 | 1.3% |
7 | 5 | 3.2% |
8 | 6 | 3.8% |
9 | 4 | 2.5% |
Maximum values
Value | Count | Frequency (%) |
494 | 1 | 0.6% |
242 | 1 | 0.6% |
236 | 1 | 0.6% |
232 | 1 | 0.6% |
228 | 1 | 0.6% |
166 | 1 | 0.6% |
159 | 1 | 0.6% |
146 | 1 | 0.6% |
129 | 1 | 0.6% |
124 | 1 | 0.6% |
부상신고 (reported injuries)
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 27 |
---|---|
Distinct (%) | 17.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.2468354 |
Minimum | 0 |
---|---|
Maximum | 128 |
Zeros | 71 |
Zeros (%) | 44.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 4 |
95-th percentile | 23.15 |
Maximum | 128 |
Range | 128 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 14.283609 |
---|---|
Coefficient of variation (CV) | 2.7223284 |
Kurtosis | 40.595 |
Mean | 5.2468354 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 5.7497192 |
Sum | 829 |
Variance | 204.02149 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 71 | 44.9% |
1 | 22 | 13.9% |
2 | 13 | 8.2% |
4 | 10 | 6.3% |
3 | 8 | 5.1% |
9 | 4 | 2.5% |
6 | 4 | 2.5% |
5 | 3 | 1.9% |
15 | 3 | 1.9% |
10 | 2 | 1.3% |
Other values (17) | 18 | 11.4% |
Minimum values
Value | Count | Frequency (%) |
0 | 71 | 44.9% |
1 | 22 | 13.9% |
2 | 13 | 8.2% |
3 | 8 | 5.1% |
4 | 10 | 6.3% |
5 | 3 | 1.9% |
6 | 4 | 2.5% |
7 | 1 | 0.6% |
8 | 1 | 0.6% |
9 | 4 | 2.5% |
Maximum values
Value | Count | Frequency (%) |
128 | 1 | 0.6% |
80 | 1 | 0.6% |
64 | 1 | 0.6% |
46 | 1 | 0.6% |
33 | 1 | 0.6% |
30 | 1 | 0.6% |
26 | 1 | 0.6% |
24 | 1 | 0.6% |
23 | 1 | 0.6% |
22 | 1 | 0.6% |
Correlations
Variable | 시도 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 |
---|---|---|---|---|---|---|---|
시도 | 1.000 | 0.000 | 0.000 | 0.133 | 0.000 | 0.289 | 0.000 |
발생건수 | 0.000 | 1.000 | 0.594 | 0.884 | 0.973 | 0.843 | 0.966 |
사망자수 | 0.000 | 0.594 | 1.000 | 0.594 | 0.633 | 0.583 | 0.498 |
부상자수 | 0.133 | 0.884 | 0.594 | 1.000 | 0.918 | 0.985 | 0.802 |
중상 | 0.000 | 0.973 | 0.633 | 0.918 | 1.000 | 0.832 | 0.923 |
경상 | 0.289 | 0.843 | 0.583 | 0.985 | 0.832 | 1.000 | 0.778 |
부상신고 | 0.000 | 0.966 | 0.498 | 0.802 | 0.923 | 0.778 | 1.000 |
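The "High correlation" alerts near the top of the report can be reproduced from this first matrix by counting, for each variable, how many other fields reach a cut-off. The sketch below assumes a cut-off of 0.5. That value is not stated in the report; it is simply the threshold that makes the counts agree with the alerts.

```python
# First correlation matrix, copied row by row from the table above.
names = ["시도", "발생건수", "사망자수", "부상자수", "중상", "경상", "부상신고"]
matrix = [
    [1.000, 0.000, 0.000, 0.133, 0.000, 0.289, 0.000],
    [0.000, 1.000, 0.594, 0.884, 0.973, 0.843, 0.966],
    [0.000, 0.594, 1.000, 0.594, 0.633, 0.583, 0.498],
    [0.133, 0.884, 0.594, 1.000, 0.918, 0.985, 0.802],
    [0.000, 0.973, 0.633, 0.918, 1.000, 0.832, 0.923],
    [0.289, 0.843, 0.583, 0.985, 0.832, 1.000, 0.778],
    [0.000, 0.966, 0.498, 0.802, 0.923, 0.778, 1.000],
]

THRESHOLD = 0.5  # assumption: chosen because it reproduces the alert counts

# For each variable, list the other fields whose coefficient reaches the threshold.
high = {}
for i, name in enumerate(names):
    partners = [names[j] for j, v in enumerate(matrix[i]) if j != i and v >= THRESHOLD]
    if partners:
        high[name] = partners

partner_counts = {name: len(partners) for name, partners in high.items()}
print(partner_counts)
```

With this cut-off, 사망자수 and 부상신고 each have four correlated fields and the other numeric variables have five, matching the "and 3 other fields" and "and 4 other fields" wording of the alerts. 시도 has none.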
Variable | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | 시도 |
---|---|---|---|---|---|---|---|
발생건수 | 1.000 | 0.564 | 0.976 | 0.919 | 0.966 | 0.782 | 0.000 |
사망자수 | 0.564 | 1.000 | 0.544 | 0.584 | 0.537 | 0.328 | 0.000 |
부상자수 | 0.976 | 0.544 | 1.000 | 0.932 | 0.991 | 0.800 | 0.057 |
중상 | 0.919 | 0.584 | 0.932 | 1.000 | 0.888 | 0.684 | 0.000 |
경상 | 0.966 | 0.537 | 0.991 | 0.888 | 1.000 | 0.786 | 0.136 |
부상신고 | 0.782 | 0.328 | 0.800 | 0.684 | 0.786 | 1.000 | 0.000 |
시도 | 0.000 | 0.000 | 0.057 | 0.000 | 0.136 | 0.000 | 1.000 |
First rows
Index | 시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 |
---|---|---|---|---|---|---|---|---|
0 | 서울 | 성북구 | 1 | 0 | 1 | 0 | 0 | 1 |
1 | 서울 | 강서구 | 1 | 0 | 2 | 1 | 1 | 0 |
2 | 서울 | 강남구 | 1 | 0 | 1 | 1 | 0 | 0 |
3 | 서울 | 강동구 | 37 | 1 | 85 | 10 | 60 | 15 |
4 | 서울 | 송파구 | 24 | 1 | 49 | 5 | 38 | 6 |
5 | 서울 | 서초구 | 41 | 1 | 127 | 21 | 76 | 30 |
6 | 서울 | 양천구 | 3 | 0 | 4 | 1 | 3 | 0 |
7 | 서울 | 중랑구 | 3 | 0 | 10 | 0 | 9 | 1 |
8 | 서울 | 노원구 | 6 | 0 | 18 | 6 | 7 | 5 |
9 | 서울 | 금천구 | 3 | 1 | 4 | 0 | 4 | 0 |
Last rows
Index | 시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 |
---|---|---|---|---|---|---|---|---|
148 | 인천 | 계양구 | 76 | 1 | 132 | 29 | 88 | 15 |
149 | 광주 | 북구 | 20 | 1 | 44 | 5 | 38 | 1 |
150 | 광주 | 광산구 | 7 | 0 | 17 | 1 | 15 | 1 |
151 | 대전 | 동구 | 14 | 3 | 29 | 9 | 19 | 1 |
152 | 대전 | 중구 | 1 | 0 | 1 | 1 | 0 | 0 |
153 | 대전 | 서구 | 5 | 1 | 14 | 3 | 11 | 0 |
154 | 대전 | 유성구 | 26 | 2 | 52 | 20 | 32 | 0 |
155 | 대전 | 대덕구 | 21 | 2 | 34 | 9 | 24 | 1 |
156 | 울산 | 울주군 | 47 | 5 | 79 | 18 | 57 | 4 |
157 | 세종 | 세종 | 4 | 0 | 9 | 1 | 8 | 0 |
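In the sample rows, 부상자수 appears to equal 중상 + 경상 + 부상신고, which would explain the very high correlations among these columns. The report does not state this relationship; the sketch below only checks it against the last ten rows and the reported column sums.

```python
# (시군구, 부상자수, 중상, 경상, 부상신고) for rows 148-157, copied from the table above.
rows = [
    ("계양구", 132, 29, 88, 15),
    ("북구", 44, 5, 38, 1),
    ("광산구", 17, 1, 15, 1),
    ("동구", 29, 9, 19, 1),
    ("중구", 1, 1, 0, 0),
    ("서구", 14, 3, 11, 0),
    ("유성구", 52, 20, 32, 0),
    ("대덕구", 34, 9, 24, 1),
    ("울주군", 79, 18, 57, 4),
    ("세종", 9, 1, 8, 0),
]

# Rows where the injured total differs from the sum of the three severity columns.
mismatches = [
    name for name, injured, serious, minor, reported in rows
    if injured != serious + minor + reported
]

# Column sums reported in the descriptive statistics of each variable.
sums_agree = (1949 + 6135 + 829) == 8913

print(mismatches, sums_agree)
```

No sampled row breaks the identity, and the reported column sums agree as well. If the identity holds for the full dataset, 부상자수 is redundant given the three severity columns.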