Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 226 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 15.6 KiB |
Average record size in memory | 70.6 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 5 |
Dataset
Description | * 부문별 뺑소니 교통사고(2018) |
---|---|
Author | 도로교통공단 |
URL | https://www.data.go.kr/data/15094167/fileData.do |
발생건수 is highly overall correlated with 부상자수 and 3 other fields | High correlation |
부상자수 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
중상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
경상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
부상신고 is highly overall correlated with 발생건수 and 4 other fields | High correlation |
사망자수 is highly overall correlated with 부상신고 | High correlation |
부상자수 has 3 (1.3%) zeros | Zeros |
중상 has 18 (8.0%) zeros | Zeros |
경상 has 6 (2.7%) zeros | Zeros |
부상신고 has 138 (61.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-20 17:05:16.447551 |
---|---|
Analysis finished | 2024-04-20 17:05:23.964047 |
Duration | 7.52 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시도
Categorical
Distinct | 17 |
---|---|
Distinct (%) | 7.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
경기 | |
---|---|
서울 | |
전남 | |
경북 | |
강원 | |
Other values (12) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
경기 | 31 | |
서울 | 25 | |
전남 | 22 | |
경북 | 22 | |
강원 | 17 | |
경남 | 17 | |
부산 | 16 | |
충남 | 15 | |
전북 | 14 | 6.2% |
충북 | 11 | 4.9% |
Other values (7) | 36 |
Length
Value | Count | Frequency (%) |
경기 | 31 | |
서울 | 25 | |
전남 | 22 | |
경북 | 22 | |
강원 | 17 | |
경남 | 17 | |
부산 | 16 | |
충남 | 15 | |
전북 | 14 | 6.2% |
충북 | 11 | 4.9% |
Other values (7) | 36 |
시군구
Text
Distinct | 204 |
---|---|
Distinct (%) | 90.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
Value | Count | Frequency (%) |
중구 | 6 | 2.7% |
동구 | 6 | 2.7% |
서구 | 5 | 2.2% |
남구 | 5 | 2.2% |
북구 | 4 | 1.8% |
강서구 | 2 | 0.9% |
구례군 | 1 | 0.4% |
장흥군 | 1 | 0.4% |
화순군 | 1 | 0.4% |
보성군 | 1 | 0.4% |
Other values (194) | 194 |
Most occurring characters
Value | Count | Frequency (%) |
군 | 81 | 12.2% |
시 | 78 | 11.7% |
구 | 74 | 11.1% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
동 | 17 | 2.6% |
성 | 17 | 2.6% |
산 | 16 | 2.4% |
서 | 13 | 2.0% |
Other values (122) | 308 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 662 | |
Open Punctuation | 1 | 0.2% |
Close Punctuation | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
군 | 81 | 12.2% |
시 | 78 | 11.8% |
구 | 74 | 11.2% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
동 | 17 | 2.6% |
성 | 17 | 2.6% |
산 | 16 | 2.4% |
서 | 13 | 2.0% |
Other values (120) | 306 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 662 | |
Common | 2 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
군 | 81 | 12.2% |
시 | 78 | 11.8% |
구 | 74 | 11.2% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
동 | 17 | 2.6% |
성 | 17 | 2.6% |
산 | 16 | 2.4% |
서 | 13 | 2.0% |
Other values (120) | 306 |
Common
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 662 | |
ASCII | 2 | 0.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
군 | 81 | 12.2% |
시 | 78 | 11.8% |
구 | 74 | 11.2% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
동 | 17 | 2.6% |
성 | 17 | 2.6% |
산 | 16 | 2.4% |
서 | 13 | 2.0% |
Other values (120) | 306 |
ASCII
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
발생건수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 83 |
---|---|
Distinct (%) | 36.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33.632743 |
Minimum | 1 |
---|---|
Maximum | 261 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 6 |
median | 18.5 |
Q3 | 39 |
95-th percentile | 120.5 |
Maximum | 261 |
Range | 260 |
Interquartile range (IQR) | 33 |
Descriptive statistics
Standard deviation | 43.232833 |
---|---|
Coefficient of variation (CV) | 1.2854388 |
Kurtosis | 7.7024378 |
Mean | 33.632743 |
Median Absolute Deviation (MAD) | 13.5 |
Skewness | 2.580525 |
Sum | 7601 |
Variance | 1869.0779 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 15 | 6.6% |
5 | 14 | 6.2% |
15 | 11 | 4.9% |
6 | 11 | 4.9% |
2 | 10 | 4.4% |
4 | 6 | 2.7% |
27 | 6 | 2.7% |
14 | 6 | 2.7% |
21 | 5 | 2.2% |
1 | 5 | 2.2% |
Other values (73) | 137 |
Value | Count | Frequency (%) |
1 | 5 | 2.2% |
2 | 10 | |
3 | 15 | |
4 | 6 | 2.7% |
5 | 14 | |
6 | 11 | |
7 | 4 | 1.8% |
8 | 3 | 1.3% |
9 | 4 | 1.8% |
10 | 4 | 1.8% |
Value | Count | Frequency (%) |
261 | 1 | |
233 | 1 | |
224 | 1 | |
182 | 1 | |
174 | 1 | |
166 | 1 | |
165 | 1 | |
161 | 1 | |
158 | 1 | |
143 | 1 |
사망자수
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
0 | |
---|---|
1 | |
2 | 10 |
3 | 9 |
4 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 1 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 150 | |
1 | 56 | 24.8% |
2 | 10 | 4.4% |
3 | 9 | 4.0% |
4 | 1 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 150 | |
1 | 56 | 24.8% |
2 | 10 | 4.4% |
3 | 9 | 4.0% |
4 | 1 | 0.4% |
부상자수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 99 |
---|---|
Distinct (%) | 43.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 49.110619 |
Minimum | 0 |
---|---|
Maximum | 351 |
Zeros | 3 |
Zeros (%) | 1.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3 |
Q1 | 9.25 |
median | 29.5 |
Q3 | 55.75 |
95-th percentile | 183 |
Maximum | 351 |
Range | 351 |
Interquartile range (IQR) | 46.5 |
Descriptive statistics
Standard deviation | 61.390562 |
---|---|
Coefficient of variation (CV) | 1.2500466 |
Kurtosis | 7.0135164 |
Mean | 49.110619 |
Median Absolute Deviation (MAD) | 22.5 |
Skewness | 2.4856907 |
Sum | 11099 |
Variance | 3768.801 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7 | 11 | 4.9% |
4 | 9 | 4.0% |
2 | 8 | 3.5% |
5 | 7 | 3.1% |
20 | 7 | 3.1% |
3 | 7 | 3.1% |
15 | 6 | 2.7% |
6 | 6 | 2.7% |
38 | 5 | 2.2% |
24 | 5 | 2.2% |
Other values (89) | 155 |
Value | Count | Frequency (%) |
0 | 3 | 1.3% |
2 | 8 | |
3 | 7 | |
4 | 9 | |
5 | 7 | |
6 | 6 | |
7 | 11 | |
8 | 4 | 1.8% |
9 | 2 | 0.9% |
10 | 3 | 1.3% |
Value | Count | Frequency (%) |
351 | 1 | |
335 | 1 | |
297 | 1 | |
272 | 1 | |
270 | 1 | |
264 | 1 | |
231 | 1 | |
221 | 1 | |
220 | 1 | |
207 | 1 |
중상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 39 |
---|---|
Distinct (%) | 17.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.2345133 |
Minimum | 0 |
---|---|
Maximum | 61 |
Zeros | 18 |
Zeros (%) | 8.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 6 |
Q3 | 13 |
95-th percentile | 28.75 |
Maximum | 61 |
Range | 61 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 10.403113 |
---|---|
Coefficient of variation (CV) | 1.126547 |
Kurtosis | 6.2874271 |
Mean | 9.2345133 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 2.2341827 |
Sum | 2087 |
Variance | 108.22476 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 23 | 10.2% |
2 | 23 | 10.2% |
0 | 18 | 8.0% |
4 | 18 | 8.0% |
3 | 17 | 7.5% |
7 | 14 | 6.2% |
6 | 11 | 4.9% |
8 | 11 | 4.9% |
12 | 8 | 3.5% |
13 | 8 | 3.5% |
Other values (29) | 75 |
Value | Count | Frequency (%) |
0 | 18 | |
1 | 23 | |
2 | 23 | |
3 | 17 | |
4 | 18 | |
5 | 8 | 3.5% |
6 | 11 | |
7 | 14 | |
8 | 11 | |
9 | 3 | 1.3% |
Value | Count | Frequency (%) |
61 | 1 | |
58 | 1 | |
49 | 1 | |
48 | 1 | |
46 | 1 | |
42 | 1 | |
38 | 1 | |
35 | 1 | |
34 | 2 | |
31 | 1 |
경상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 86 |
---|---|
Distinct (%) | 38.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38.49115 |
Minimum | 0 |
---|---|
Maximum | 301 |
Zeros | 6 |
Zeros (%) | 2.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 7 |
median | 21.5 |
Q3 | 42 |
95-th percentile | 145.75 |
Maximum | 301 |
Range | 301 |
Interquartile range (IQR) | 35 |
Descriptive statistics
Standard deviation | 50.288787 |
---|---|
Coefficient of variation (CV) | 1.3065026 |
Kurtosis | 7.8218684 |
Mean | 38.49115 |
Median Absolute Deviation (MAD) | 16.5 |
Skewness | 2.6149802 |
Sum | 8699 |
Variance | 2528.9621 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 10 | 4.4% |
5 | 10 | 4.4% |
17 | 9 | 4.0% |
18 | 8 | 3.5% |
3 | 8 | 3.5% |
4 | 8 | 3.5% |
1 | 8 | 3.5% |
0 | 6 | 2.7% |
31 | 5 | 2.2% |
12 | 5 | 2.2% |
Other values (76) | 149 |
Value | Count | Frequency (%) |
0 | 6 | |
1 | 8 | |
2 | 10 | |
3 | 8 | |
4 | 8 | |
5 | 10 | |
6 | 4 | 1.8% |
7 | 4 | 1.8% |
8 | 5 | |
9 | 1 | 0.4% |
Value | Count | Frequency (%) |
301 | 1 | |
251 | 1 | |
244 | 1 | |
239 | 1 | |
237 | 1 | |
230 | 1 | |
181 | 1 | |
167 | 2 | |
160 | 1 | |
158 | 1 |
부상신고
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 6.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3849558 |
Minimum | 0 |
---|---|
Maximum | 23 |
Zeros | 138 |
Zeros (%) | 61.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 6.75 |
Maximum | 23 |
Range | 23 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3.1633378 |
---|---|
Coefficient of variation (CV) | 2.2840714 |
Kurtosis | 18.735517 |
Mean | 1.3849558 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.9547966 |
Sum | 313 |
Variance | 10.006706 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 138 | |
1 | 37 | 16.4% |
2 | 18 | 8.0% |
3 | 6 | 2.7% |
4 | 6 | 2.7% |
6 | 5 | 2.2% |
5 | 4 | 1.8% |
8 | 3 | 1.3% |
7 | 2 | 0.9% |
11 | 2 | 0.9% |
Other values (5) | 5 | 2.2% |
Value | Count | Frequency (%) |
0 | 138 | |
1 | 37 | 16.4% |
2 | 18 | 8.0% |
3 | 6 | 2.7% |
4 | 6 | 2.7% |
5 | 4 | 1.8% |
6 | 5 | 2.2% |
7 | 2 | 0.9% |
8 | 3 | 1.3% |
11 | 2 | 0.9% |
Value | Count | Frequency (%) |
23 | 1 | 0.4% |
19 | 1 | 0.4% |
18 | 1 | 0.4% |
16 | 1 | 0.4% |
12 | 1 | 0.4% |
11 | 2 | 0.9% |
8 | 3 | |
7 | 2 | 0.9% |
6 | 5 | |
5 | 4 |
시도 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|
시도 | 1.000 | 0.362 | 0.306 | 0.467 | 0.332 | 0.371 | 0.000 |
발생건수 | 0.362 | 1.000 | 0.494 | 0.929 | 0.851 | 0.916 | 0.812 |
사망자수 | 0.306 | 0.494 | 1.000 | 0.746 | 0.779 | 0.702 | 0.858 |
부상자수 | 0.467 | 0.929 | 0.746 | 1.000 | 0.950 | 0.982 | 0.897 |
중상 | 0.332 | 0.851 | 0.779 | 0.950 | 1.000 | 0.924 | 0.908 |
경상 | 0.371 | 0.916 | 0.702 | 0.982 | 0.924 | 1.000 | 0.878 |
부상신고 | 0.000 | 0.812 | 0.858 | 0.897 | 0.908 | 0.878 | 1.000 |
시도 | 사망자수 | |
---|---|---|
시도 | 1.000 | 0.157 |
사망자수 | 0.157 | 1.000 |
발생건수 | 부상자수 | 중상 | 경상 | 부상신고 | 시도 | 사망자수 | |
---|---|---|---|---|---|---|---|
발생건수 | 1.000 | 0.989 | 0.909 | 0.977 | 0.612 | 0.124 | 0.316 |
부상자수 | 0.989 | 1.000 | 0.906 | 0.991 | 0.606 | 0.198 | 0.398 |
중상 | 0.909 | 0.906 | 1.000 | 0.850 | 0.552 | 0.132 | 0.429 |
경상 | 0.977 | 0.991 | 0.850 | 1.000 | 0.575 | 0.150 | 0.361 |
부상신고 | 0.612 | 0.606 | 0.552 | 0.575 | 1.000 | 0.000 | 0.520 |
시도 | 0.124 | 0.198 | 0.132 | 0.150 | 0.000 | 1.000 | 0.157 |
사망자수 | 0.316 | 0.398 | 0.429 | 0.361 | 0.520 | 0.157 | 1.000 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
0 | 서울 | 종로구 | 17 | 0 | 28 | 6 | 20 | 2 |
1 | 서울 | 중구 | 13 | 0 | 17 | 4 | 11 | 2 |
2 | 서울 | 용산구 | 41 | 1 | 57 | 15 | 42 | 0 |
3 | 서울 | 성동구 | 24 | 0 | 44 | 4 | 40 | 0 |
4 | 서울 | 동대문구 | 33 | 0 | 45 | 12 | 33 | 0 |
5 | 서울 | 성북구 | 29 | 1 | 54 | 8 | 45 | 1 |
6 | 서울 | 도봉구 | 14 | 0 | 22 | 3 | 19 | 0 |
7 | 서울 | 은평구 | 21 | 0 | 33 | 9 | 22 | 2 |
8 | 서울 | 서대문구 | 15 | 0 | 32 | 3 | 27 | 2 |
9 | 서울 | 마포구 | 42 | 0 | 61 | 11 | 47 | 3 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
216 | 대전 | 중구 | 34 | 0 | 49 | 8 | 40 | 1 |
217 | 대전 | 서구 | 69 | 2 | 123 | 16 | 107 | 0 |
218 | 대전 | 유성구 | 57 | 1 | 103 | 13 | 90 | 0 |
219 | 대전 | 대덕구 | 27 | 1 | 46 | 7 | 38 | 1 |
220 | 울산 | 중구 | 37 | 1 | 51 | 10 | 39 | 2 |
221 | 울산 | 남구 | 56 | 3 | 99 | 27 | 71 | 1 |
222 | 울산 | 동구 | 21 | 1 | 24 | 7 | 17 | 0 |
223 | 울산 | 북구 | 41 | 0 | 67 | 14 | 53 | 0 |
224 | 울산 | 울주군 | 41 | 2 | 57 | 17 | 40 | 0 |
225 | 세종 | 세종 | 19 | 0 | 27 | 4 | 23 | 0 |