Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 228 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 15.7 KiB |
Average record size in memory | 70.6 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 5 |
Dataset
Description | 부문별 어린이 교통사고(2018) (Child traffic accidents by category, 2018) |
---|---|
Author | 도로교통공단 (Korea Road Traffic Authority, KoROAD) |
URL | https://www.data.go.kr/data/15094169/fileData.do |
발생건수 is highly overall correlated with 부상자수 and 3 other fields | High correlation |
부상자수 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
중상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
경상 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
부상신고 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
사망자수 is highly imbalanced (59.9%) | Imbalance |
중상 has 16 (7.0%) zeros | Zeros |
부상신고 has 66 (28.9%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 02:28:38.376104 |
---|---|
Analysis finished | 2023-12-12 02:28:41.609805 |
Duration | 3.23 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
시도
Categorical
Distinct | 17 |
---|---|
Distinct (%) | 7.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
경기 | 31 |
---|---|
서울 | 25 |
경북 | 23 |
전남 | 22 |
강원 | 18 |
Other values (12) | 109 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
경기 | 31 | 13.6% |
서울 | 25 | 11.0% |
경북 | 23 | 10.1% |
전남 | 22 | 9.6% |
강원 | 18 | 7.9% |
경남 | 17 | 7.5% |
부산 | 16 | 7.0% |
충남 | 15 | 6.6% |
전북 | 14 | 6.1% |
충북 | 11 | 4.8% |
Other values (7) | 36 | 15.8% |
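Each Frequency (%) entry is the value's count divided by the 228 observations reported under Dataset statistics. A minimal Python sketch of that arithmetic, using counts from the table above (the helper name `frequency_pct` is illustrative and not part of ydata-profiling):

```python
# Number of observations, taken from the "Dataset statistics" section.
N_OBSERVATIONS = 228


def frequency_pct(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format a count as a percentage of the total, to one decimal place."""
    return f"{100 * count / total:.1f}%"


# Counts copied from the Common Values table for 시도.
counts = {"경기": 31, "서울": 25, "전북": 14, "충북": 11}

for region, count in counts.items():
    print(region, count, frequency_pct(count))
```

The same helper applies to every per-variable value table in this report, since all of them share the 228-row denominator.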
시군구
Text
Distinct | 205 |
---|---|
Distinct (%) | 89.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
Value | Count | Frequency (%) |
중구 | 6 | 2.6% |
동구 | 6 | 2.6% |
서구 | 5 | 2.2% |
남구 | 5 | 2.2% |
북구 | 4 | 1.8% |
강서구 | 2 | 0.9% |
고성군 | 2 | 0.9% |
곡성군 | 1 | 0.4% |
화순군 | 1 | 0.4% |
보성군 | 1 | 0.4% |
Other values (195) | 195 | 85.5% |
Most occurring characters
Value | Count | Frequency (%) |
군 | 84 | 12.5% |
시 | 78 | 11.6% |
구 | 74 | 11.0% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
남 | 13 | 1.9% |
Other values (123) | 310 | 46.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 668 | 99.7% |
Open Punctuation | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
군 | 84 | 12.6% |
시 | 78 | 11.7% |
구 | 74 | 11.1% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
남 | 13 | 1.9% |
Other values (121) | 308 | 46.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 668 | 99.7% |
Common | 2 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
군 | 84 | 12.6% |
시 | 78 | 11.7% |
구 | 74 | 11.1% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
남 | 13 | 1.9% |
Other values (121) | 308 | 46.1% |
Common
Value | Count | Frequency (%) |
( | 1 | 50.0% |
) | 1 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 668 | 99.7% |
ASCII | 2 | 0.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
군 | 84 | 12.6% |
시 | 78 | 11.7% |
구 | 74 | 11.1% |
천 | 22 | 3.3% |
주 | 20 | 3.0% |
양 | 18 | 2.7% |
성 | 18 | 2.7% |
동 | 17 | 2.5% |
산 | 16 | 2.4% |
남 | 13 | 1.9% |
Other values (121) | 308 | 46.1% |
ASCII
Value | Count | Frequency (%) |
( | 1 | 50.0% |
) | 1 | 50.0% |
발생건수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 95 |
---|---|
Distinct (%) | 41.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.899123 |
Minimum | 1 |
---|---|
Maximum | 266 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4 |
Q1 | 11 |
median | 28.5 |
Q3 | 58.5 |
95-th percentile | 150.75 |
Maximum | 266 |
Range | 265 |
Interquartile range (IQR) | 47.5 |
Descriptive statistics
Standard deviation | 46.76206 |
---|---|
Coefficient of variation (CV) | 1.0652163 |
Kurtosis | 4.0170525 |
Mean | 43.899123 |
Median Absolute Deviation (MAD) | 19.5 |
Skewness | 1.9270699 |
Sum | 10009 |
Variance | 2186.6902 |
Monotonicity | Not monotonic |
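Several entries in the descriptive statistics follow directly from other figures in the same summary. The sketch below recomputes them from the values reported above for 발생건수. The variable names are illustrative, and the inputs are the rounded figures shown in the report, so small rounding differences are expected.

```python
# Values copied from the 발생건수 summary above.
n = 228                    # Number of observations
total = 10009              # Sum
std = 46.76206             # Standard deviation
q1, q3 = 11, 58.5          # Q1 and Q3
minimum, maximum = 1, 266  # Minimum and Maximum

mean = total / n                 # Mean = Sum / number of observations
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = standard deviation squared
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
value_range = maximum - minimum  # Range = Maximum - Minimum

print(f"mean={mean:.6f} cv={cv:.7f} variance={variance:.4f}")
print(f"iqr={iqr} range={value_range}")
```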
Value | Count | Frequency (%) |
4 | 10 | 4.4% |
9 | 9 | 3.9% |
5 | 8 | 3.5% |
7 | 7 | 3.1% |
6 | 6 | 2.6% |
3 | 6 | 2.6% |
47 | 6 | 2.6% |
21 | 6 | 2.6% |
29 | 5 | 2.2% |
13 | 5 | 2.2% |
Other values (85) | 160 | 70.2% |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
2 | 3 | 1.3% |
3 | 6 | 2.6% |
4 | 10 | 4.4% |
5 | 8 | 3.5% |
6 | 6 | 2.6% |
7 | 7 | 3.1% |
8 | 3 | 1.3% |
9 | 9 | 3.9% |
10 | 2 | 0.9% |
Value | Count | Frequency (%) |
266 | 1 | 0.4% |
217 | 1 | 0.4% |
200 | 1 | 0.4% |
192 | 1 | 0.4% |
188 | 2 | 0.9% |
181 | 2 | 0.9% |
168 | 1 | 0.4% |
162 | 1 | 0.4% |
160 | 1 | 0.4% |
156 | 1 | 0.4% |
사망자수
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.9 KiB |
0 | 198 |
---|---|
1 | 26 |
2 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 198 | 86.8% |
1 | 26 | 11.4% |
2 | 4 | 1.8% |
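The 59.9% imbalance reported for 사망자수 is consistent with a normalized-entropy score: one minus the Shannon entropy of the class frequencies, divided by the maximum entropy possible for the same number of classes. The sketch below assumes that definition. The function name `imbalance_score` is illustrative and is not presented here as the ydata-profiling API.

```python
import math


def imbalance_score(counts: list[int]) -> float:
    """Return 1 - H / H_max for a list of class counts.

    H is the base-2 Shannon entropy of the class frequencies and H_max is
    log2(number of classes). 0 means perfectly balanced; values near 1 mean
    one class dominates.
    """
    n_classes = len(counts)
    if n_classes < 2:
        raise ValueError("at least two classes are required")
    total = sum(counts)
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in counts if c > 0
    )
    return 1 - entropy / math.log2(n_classes)


# Counts of 사망자수 values 0, 1 and 2 from the Common Values table above.
score = imbalance_score([198, 26, 4])
print(f"{score:.1%}")
```

Under this assumed definition, the three counts give a score that rounds to the 59.9% shown in the Imbalance alert.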
부상자수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 108 |
---|---|
Distinct (%) | 47.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 55.013158 |
Minimum | 1 |
---|---|
Maximum | 356 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 14.75 |
median | 35 |
Q3 | 73 |
95-th percentile | 180.65 |
Maximum | 356 |
Range | 355 |
Interquartile range (IQR) | 58.25 |
Descriptive statistics
Standard deviation | 59.317965 |
---|---|
Coefficient of variation (CV) | 1.0782505 |
Kurtosis | 4.7095154 |
Mean | 55.013158 |
Median Absolute Deviation (MAD) | 24.5 |
Skewness | 2.0312527 |
Sum | 12543 |
Variance | 3518.621 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 9 | 3.9% |
5 | 8 | 3.5% |
6 | 7 | 3.1% |
25 | 6 | 2.6% |
15 | 6 | 2.6% |
79 | 5 | 2.2% |
11 | 5 | 2.2% |
4 | 5 | 2.2% |
9 | 4 | 1.8% |
8 | 4 | 1.8% |
Other values (98) | 169 | 74.1% |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
2 | 3 | 1.3% |
3 | 1 | 0.4% |
4 | 5 | 2.2% |
5 | 8 | 3.5% |
6 | 7 | 3.1% |
7 | 4 | 1.8% |
8 | 4 | 1.8% |
9 | 4 | 1.8% |
10 | 9 | 3.9% |
Value | Count | Frequency (%) |
356 | 1 | 0.4% |
268 | 1 | 0.4% |
262 | 1 | 0.4% |
252 | 1 | 0.4% |
240 | 1 | 0.4% |
235 | 1 | 0.4% |
234 | 1 | 0.4% |
220 | 1 | 0.4% |
214 | 1 | 0.4% |
200 | 2 | 0.9% |
중상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 32 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.5701754 |
Minimum | 0 |
---|---|
Maximum | 49 |
Zeros | 16 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 5 |
Q3 | 10 |
95-th percentile | 24.65 |
Maximum | 49 |
Range | 49 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 8.2006439 |
---|---|
Coefficient of variation (CV) | 1.0832832 |
Kurtosis | 5.1248402 |
Mean | 7.5701754 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 2.0817404 |
Sum | 1726 |
Variance | 67.25056 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 29 | 12.7% |
2 | 22 | 9.6% |
3 | 20 | 8.8% |
5 | 19 | 8.3% |
4 | 16 | 7.0% |
0 | 16 | 7.0% |
6 | 15 | 6.6% |
8 | 13 | 5.7% |
7 | 11 | 4.8% |
9 | 9 | 3.9% |
Other values (22) | 58 | 25.4% |
Value | Count | Frequency (%) |
0 | 16 | 7.0% |
1 | 29 | 12.7% |
2 | 22 | 9.6% |
3 | 20 | 8.8% |
4 | 16 | 7.0% |
5 | 19 | 8.3% |
6 | 15 | 6.6% |
7 | 11 | 4.8% |
8 | 13 | 5.7% |
9 | 9 | 3.9% |
Value | Count | Frequency (%) |
49 | 1 | 0.4% |
40 | 2 | 0.9% |
35 | 1 | 0.4% |
33 | 1 | 0.4% |
30 | 1 | 0.4% |
29 | 4 | 1.8% |
28 | 1 | 0.4% |
25 | 1 | 0.4% |
24 | 2 | 0.9% |
23 | 1 | 0.4% |
경상
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 96 |
---|---|
Distinct (%) | 42.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.092105 |
Minimum | 0 |
---|---|
Maximum | 295 |
Zeros | 2 |
Zeros (%) | 0.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3 |
Q1 | 9.75 |
median | 25 |
Q3 | 51.25 |
95-th percentile | 142.6 |
Maximum | 295 |
Range | 295 |
Interquartile range (IQR) | 41.5 |
Descriptive statistics
Standard deviation | 43.874323 |
---|---|
Coefficient of variation (CV) | 1.0943382 |
Kurtosis | 6.0253356 |
Mean | 40.092105 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 2.1478582 |
Sum | 9141 |
Variance | 1924.9562 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 12 | 5.3% |
7 | 9 | 3.9% |
9 | 9 | 3.9% |
17 | 8 | 3.5% |
3 | 8 | 3.5% |
15 | 6 | 2.6% |
16 | 6 | 2.6% |
8 | 6 | 2.6% |
24 | 6 | 2.6% |
33 | 5 | 2.2% |
Other values (86) | 153 | 67.1% |
Value | Count | Frequency (%) |
0 | 2 | 0.9% |
1 | 2 | 0.9% |
2 | 2 | 0.9% |
3 | 8 | 3.5% |
4 | 12 | 5.3% |
5 | 4 | 1.8% |
6 | 3 | 1.3% |
7 | 9 | 3.9% |
8 | 6 | 2.6% |
9 | 9 | 3.9% |
Value | Count | Frequency (%) |
295 | 1 | 0.4% |
195 | 1 | 0.4% |
180 | 1 | 0.4% |
171 | 1 | 0.4% |
161 | 1 | 0.4% |
160 | 1 | 0.4% |
159 | 1 | 0.4% |
153 | 1 | 0.4% |
150 | 1 | 0.4% |
145 | 2 | 0.9% |
부상신고
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 38 |
---|---|
Distinct (%) | 16.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.3508772 |
Minimum | 0 |
---|---|
Maximum | 62 |
Zeros | 66 |
Zeros (%) | 28.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 3 |
Q3 | 9.25 |
95-th percentile | 33.3 |
Maximum | 62 |
Range | 62 |
Interquartile range (IQR) | 9.25 |
Descriptive statistics
Standard deviation | 11.128787 |
---|---|
Coefficient of variation (CV) | 1.5139401 |
Kurtosis | 7.0723518 |
Mean | 7.3508772 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 2.5221099 |
Sum | 1676 |
Variance | 123.84991 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 66 | 28.9% |
1 | 23 | 10.1% |
2 | 15 | 6.6% |
5 | 14 | 6.1% |
3 | 13 | 5.7% |
4 | 12 | 5.3% |
8 | 9 | 3.9% |
7 | 8 | 3.5% |
6 | 8 | 3.5% |
11 | 7 | 3.1% |
Other values (28) | 53 | 23.2% |
Value | Count | Frequency (%) |
0 | 66 | 28.9% |
1 | 23 | 10.1% |
2 | 15 | 6.6% |
3 | 13 | 5.7% |
4 | 12 | 5.3% |
5 | 14 | 6.1% |
6 | 8 | 3.5% |
7 | 8 | 3.5% |
8 | 9 | 3.9% |
9 | 3 | 1.3% |
Value | Count | Frequency (%) |
62 | 1 | 0.4% |
59 | 1 | 0.4% |
57 | 1 | 0.4% |
45 | 1 | 0.4% |
44 | 1 | 0.4% |
43 | 1 | 0.4% |
41 | 2 | 0.9% |
35 | 3 | 1.3% |
34 | 1 | 0.4% |
32 | 1 | 0.4% |
Variable | 시도 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 |
---|---|---|---|---|---|---|---|
시도 | 1.000 | 0.448 | 0.210 | 0.472 | 0.368 | 0.400 | 0.577 |
발생건수 | 0.448 | 1.000 | 0.413 | 0.946 | 0.938 | 0.920 | 0.702 |
사망자수 | 0.210 | 0.413 | 1.000 | 0.565 | 0.372 | 0.395 | 0.426 |
부상자수 | 0.472 | 0.946 | 0.565 | 1.000 | 0.856 | 0.943 | 0.849 |
중상 | 0.368 | 0.938 | 0.372 | 0.856 | 1.000 | 0.829 | 0.572 |
경상 | 0.400 | 0.920 | 0.395 | 0.943 | 0.829 | 1.000 | 0.687 |
부상신고 | 0.577 | 0.702 | 0.426 | 0.849 | 0.572 | 0.687 | 1.000 |
Variable | 시도 | 사망자수 |
---|---|---|
시도 | 1.000 | 0.109 |
사망자수 | 0.109 | 1.000 |
Variable | 발생건수 | 부상자수 | 중상 | 경상 | 부상신고 | 시도 | 사망자수 |
---|---|---|---|---|---|---|---|
발생건수 | 1.000 | 0.990 | 0.876 | 0.975 | 0.795 | 0.189 | 0.275 |
부상자수 | 0.990 | 1.000 | 0.860 | 0.987 | 0.807 | 0.205 | 0.295 |
중상 | 0.876 | 0.860 | 1.000 | 0.815 | 0.613 | 0.149 | 0.237 |
경상 | 0.975 | 0.987 | 0.815 | 1.000 | 0.745 | 0.175 | 0.271 |
부상신고 | 0.795 | 0.807 | 0.613 | 0.745 | 1.000 | 0.266 | 0.212 |
시도 | 0.189 | 0.205 | 0.149 | 0.175 | 0.266 | 1.000 | 0.109 |
사망자수 | 0.275 | 0.295 | 0.237 | 0.271 | 0.212 | 0.109 | 1.000 |
Index | 시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 |
---|---|---|---|---|---|---|---|---|
0 | 서울 | 종로구 | 28 | 0 | 33 | 1 | 27 | 5 |
1 | 서울 | 중구 | 23 | 0 | 25 | 1 | 16 | 8 |
2 | 서울 | 용산구 | 34 | 0 | 39 | 6 | 25 | 8 |
3 | 서울 | 성동구 | 28 | 0 | 31 | 4 | 20 | 7 |
4 | 서울 | 동대문구 | 42 | 0 | 46 | 3 | 33 | 10 |
5 | 서울 | 성북구 | 64 | 0 | 82 | 16 | 57 | 9 |
6 | 서울 | 도봉구 | 34 | 0 | 38 | 2 | 29 | 7 |
7 | 서울 | 은평구 | 56 | 0 | 84 | 5 | 54 | 25 |
8 | 서울 | 서대문구 | 32 | 0 | 36 | 7 | 22 | 7 |
9 | 서울 | 마포구 | 45 | 0 | 50 | 7 | 37 | 6 |
Index | 시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 |
---|---|---|---|---|---|---|---|---|
218 | 대전 | 중구 | 47 | 0 | 60 | 7 | 45 | 8 |
219 | 대전 | 서구 | 117 | 1 | 142 | 18 | 117 | 7 |
220 | 대전 | 유성구 | 96 | 0 | 133 | 13 | 95 | 25 |
221 | 대전 | 대덕구 | 25 | 0 | 34 | 5 | 24 | 5 |
222 | 울산 | 중구 | 26 | 0 | 27 | 10 | 16 | 1 |
223 | 울산 | 남구 | 43 | 0 | 49 | 9 | 36 | 4 |
224 | 울산 | 동구 | 21 | 0 | 21 | 6 | 15 | 0 |
225 | 울산 | 북구 | 29 | 0 | 30 | 6 | 24 | 0 |
226 | 울산 | 울주군 | 35 | 0 | 51 | 11 | 29 | 11 |
227 | 세종 | 세종 | 49 | 0 | 61 | 18 | 39 | 4 |
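In the rows shown above, 부상자수 equals 중상 + 경상 + 부상신고, and the reported column sums agree as well (1726 + 9141 + 1676 = 12543). That relationship would help explain the high correlations among these fields. The sketch below checks it for a handful of the displayed rows only; it has not been verified against all 228 rows, and the tuple layout is an illustrative choice.

```python
# (label, 부상자수, 중상, 경상, 부상신고) copied from the rows shown above.
rows = [
    ("대전 중구", 60, 7, 45, 8),
    ("대전 서구", 142, 18, 117, 7),
    ("대전 유성구", 133, 13, 95, 25),
    ("울산 동구", 21, 6, 15, 0),
    ("울산 울주군", 51, 11, 29, 11),
    ("세종 세종", 61, 18, 39, 4),
]

# Collect any row where the injury total differs from the sum of its parts.
mismatches = [
    label
    for label, injured, serious, minor, reported in rows
    if injured != serious + minor + reported
]

# Column sums as reported in the per-variable "Sum" entries.
sum_injured, sum_serious, sum_minor, sum_reported = 12543, 1726, 9141, 1676
sums_agree = sum_injured == sum_serious + sum_minor + sum_reported

print("row mismatches:", mismatches)
print("column sums agree:", sums_agree)
```

If the identity holds across the full dataset, 부상자수 carries no information beyond its three components, which is worth keeping in mind before using all four fields together in a model.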