Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 53 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.8 KiB |
Average record size in memory | 72.5 B |
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Numeric | 4 |
Dataset
Description | * 부문별 무면허 교통사고(2018) |
---|---|
Author | 도로교통공단 |
URL | https://www.data.go.kr/data/15094166/fileData.do |
발생건수 is highly overall correlated with 부상자수 and 3 other fields | High correlation |
부상자수 is highly overall correlated with 발생건수 and 2 other fields | High correlation |
중상 is highly overall correlated with 부상신고 | High correlation |
경상 is highly overall correlated with 발생건수 and 2 other fields | High correlation |
시도 is highly overall correlated with 발생건수 | High correlation |
부상신고 is highly overall correlated with 발생건수 and 3 other fields | High correlation |
사망자수 is highly imbalanced (83.0%) | Imbalance |
부상신고 is highly imbalanced (54.2%) | Imbalance |
중상 has 38 (71.7%) zeros | Zeros |
경상 has 9 (17.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 22:14:34.215202 |
---|---|
Analysis finished | 2023-12-12 22:14:36.785432 |
Duration | 2.57 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시도
Categorical
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 24.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 556.0 B |
경기 | |
---|---|
서울 | |
강원 | |
대구 | |
경남 | |
Other values (8) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 3.8% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
경기 | 11 | |
서울 | 10 | |
강원 | 5 | |
대구 | 5 | |
경남 | 4 | 7.5% |
충남 | 3 | 5.7% |
전북 | 3 | 5.7% |
인천 | 3 | 5.7% |
대전 | 3 | 5.7% |
경북 | 2 | 3.8% |
Other values (3) | 4 | 7.5% |
Length
Value | Count | Frequency (%) |
경기 | 11 | |
서울 | 10 | |
강원 | 5 | |
대구 | 5 | |
경남 | 4 | 7.5% |
충남 | 3 | 5.7% |
전북 | 3 | 5.7% |
인천 | 3 | 5.7% |
대전 | 3 | 5.7% |
경북 | 2 | 3.8% |
Other values (3) | 4 | 7.5% |
시군구
Text
Distinct | 51 |
---|---|
Distinct (%) | 96.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 556.0 B |
Value | Count | Frequency (%) |
중구 | 2 | 3.8% |
서구 | 2 | 3.8% |
계양구 | 1 | 1.9% |
달서구 | 1 | 1.9% |
청주시 | 1 | 1.9% |
유성구 | 1 | 1.9% |
거제시 | 1 | 1.9% |
천안시 | 1 | 1.9% |
부여군 | 1 | 1.9% |
예산군 | 1 | 1.9% |
Other values (41) | 41 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 25 | 15.7% |
구 | 23 | 14.5% |
군 | 8 | 5.0% |
성 | 6 | 3.8% |
포 | 5 | 3.1% |
서 | 5 | 3.1% |
산 | 5 | 3.1% |
주 | 5 | 3.1% |
동 | 4 | 2.5% |
천 | 4 | 2.5% |
Other values (49) | 69 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 157 | |
Open Punctuation | 1 | 0.6% |
Close Punctuation | 1 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 25 | 15.9% |
구 | 23 | 14.6% |
군 | 8 | 5.1% |
성 | 6 | 3.8% |
포 | 5 | 3.2% |
서 | 5 | 3.2% |
산 | 5 | 3.2% |
주 | 5 | 3.2% |
동 | 4 | 2.5% |
천 | 4 | 2.5% |
Other values (47) | 67 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 157 | |
Common | 2 | 1.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 25 | 15.9% |
구 | 23 | 14.6% |
군 | 8 | 5.1% |
성 | 6 | 3.8% |
포 | 5 | 3.2% |
서 | 5 | 3.2% |
산 | 5 | 3.2% |
주 | 5 | 3.2% |
동 | 4 | 2.5% |
천 | 4 | 2.5% |
Other values (47) | 67 |
Common
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 157 | |
ASCII | 2 | 1.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 25 | 15.9% |
구 | 23 | 14.6% |
군 | 8 | 5.1% |
성 | 6 | 3.8% |
포 | 5 | 3.2% |
서 | 5 | 3.2% |
산 | 5 | 3.2% |
주 | 5 | 3.2% |
동 | 4 | 2.5% |
천 | 4 | 2.5% |
Other values (47) | 67 |
ASCII
Value | Count | Frequency (%) |
( | 1 | |
) | 1 |
발생건수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 11.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9622642 |
Minimum | 1 |
---|---|
Maximum | 25 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 609.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 3.4 |
Maximum | 25 |
Range | 24 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3.3510174 |
---|---|
Coefficient of variation (CV) | 1.70773 |
Kurtosis | 45.057421 |
Mean | 1.9622642 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 6.4995988 |
Sum | 104 |
Variance | 11.229318 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 36 | |
2 | 8 | 15.1% |
3 | 6 | 11.3% |
25 | 1 | 1.9% |
4 | 1 | 1.9% |
5 | 1 | 1.9% |
Value | Count | Frequency (%) |
1 | 36 | |
2 | 8 | 15.1% |
3 | 6 | 11.3% |
4 | 1 | 1.9% |
5 | 1 | 1.9% |
25 | 1 | 1.9% |
Value | Count | Frequency (%) |
25 | 1 | 1.9% |
5 | 1 | 1.9% |
4 | 1 | 1.9% |
3 | 6 | 11.3% |
2 | 8 | 15.1% |
1 | 36 |
사망자수
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 5.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 556.0 B |
0 | |
---|---|
2 | 1 |
1 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 3.8% |
Sample
1st row | 0 |
---|---|
2nd row | 2 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 51 | |
2 | 1 | 1.9% |
1 | 1 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 51 | |
2 | 1 | 1.9% |
1 | 1 | 1.9% |
부상자수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 18.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.1320755 |
Minimum | 1 |
---|---|
Maximum | 38 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 609.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 7.4 |
Maximum | 38 |
Range | 37 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 5.3386751 |
---|---|
Coefficient of variation (CV) | 1.7045167 |
Kurtosis | 36.294939 |
Mean | 3.1320755 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 5.6568301 |
Sum | 166 |
Variance | 28.501451 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 24 | |
2 | 12 | |
4 | 6 | 11.3% |
3 | 4 | 7.5% |
6 | 2 | 3.8% |
8 | 1 | 1.9% |
38 | 1 | 1.9% |
5 | 1 | 1.9% |
7 | 1 | 1.9% |
12 | 1 | 1.9% |
Value | Count | Frequency (%) |
1 | 24 | |
2 | 12 | |
3 | 4 | 7.5% |
4 | 6 | 11.3% |
5 | 1 | 1.9% |
6 | 2 | 3.8% |
7 | 1 | 1.9% |
8 | 1 | 1.9% |
12 | 1 | 1.9% |
38 | 1 | 1.9% |
Value | Count | Frequency (%) |
38 | 1 | 1.9% |
12 | 1 | 1.9% |
8 | 1 | 1.9% |
7 | 1 | 1.9% |
6 | 2 | 3.8% |
5 | 1 | 1.9% |
4 | 6 | 11.3% |
3 | 4 | 7.5% |
2 | 12 | |
1 | 24 |
중상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 11.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.50943396 |
Minimum | 0 |
---|---|
Maximum | 6 |
Zeros | 38 |
Zeros (%) | 71.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 609.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 2.4 |
Maximum | 6 |
Range | 6 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.1201415 |
---|---|
Coefficient of variation (CV) | 2.1987963 |
Kurtosis | 12.019532 |
Mean | 0.50943396 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.2175608 |
Sum | 27 |
Variance | 1.254717 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 38 | |
1 | 10 | 18.9% |
2 | 2 | 3.8% |
6 | 1 | 1.9% |
3 | 1 | 1.9% |
4 | 1 | 1.9% |
Value | Count | Frequency (%) |
0 | 38 | |
1 | 10 | 18.9% |
2 | 2 | 3.8% |
3 | 1 | 1.9% |
4 | 1 | 1.9% |
6 | 1 | 1.9% |
Value | Count | Frequency (%) |
6 | 1 | 1.9% |
4 | 1 | 1.9% |
3 | 1 | 1.9% |
2 | 2 | 3.8% |
1 | 10 | 18.9% |
0 | 38 |
경상
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 18.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.3207547 |
Minimum | 0 |
---|---|
Maximum | 27 |
Zeros | 9 |
Zeros (%) | 17.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 609.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 6.4 |
Maximum | 27 |
Range | 27 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3.8619126 |
---|---|
Coefficient of variation (CV) | 1.6640761 |
Kurtosis | 33.017264 |
Mean | 2.3207547 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 5.290279 |
Sum | 123 |
Variance | 14.914369 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 18 | |
2 | 14 | |
0 | 9 | |
3 | 4 | 7.5% |
4 | 3 | 5.7% |
7 | 1 | 1.9% |
27 | 1 | 1.9% |
6 | 1 | 1.9% |
5 | 1 | 1.9% |
8 | 1 | 1.9% |
Value | Count | Frequency (%) |
0 | 9 | |
1 | 18 | |
2 | 14 | |
3 | 4 | 7.5% |
4 | 3 | 5.7% |
5 | 1 | 1.9% |
6 | 1 | 1.9% |
7 | 1 | 1.9% |
8 | 1 | 1.9% |
27 | 1 | 1.9% |
Value | Count | Frequency (%) |
27 | 1 | 1.9% |
8 | 1 | 1.9% |
7 | 1 | 1.9% |
6 | 1 | 1.9% |
5 | 1 | 1.9% |
4 | 3 | 5.7% |
3 | 4 | 7.5% |
2 | 14 | |
1 | 18 | |
0 | 9 |
부상신고
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 7.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 556.0 B |
0 | |
---|---|
1 | |
5 | 1 |
2 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 3.8% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 42 | |
1 | 9 | 17.0% |
5 | 1 | 1.9% |
2 | 1 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 42 | |
1 | 9 | 17.0% |
5 | 1 | 1.9% |
2 | 1 | 1.9% |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
시도 | 1.000 | 0.845 | 0.783 | 0.000 | 0.349 | 0.000 | 0.000 | 0.413 |
시군구 | 0.845 | 1.000 | 1.000 | 1.000 | 1.000 | 0.983 | 0.929 | 1.000 |
발생건수 | 0.783 | 1.000 | 1.000 | 0.000 | 0.765 | 0.985 | 0.723 | 0.805 |
사망자수 | 0.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.781 | 0.000 | 0.000 |
부상자수 | 0.349 | 1.000 | 0.765 | 0.000 | 1.000 | 0.939 | 0.967 | 0.914 |
중상 | 0.000 | 0.983 | 0.985 | 0.781 | 0.939 | 1.000 | 0.805 | 0.720 |
경상 | 0.000 | 0.929 | 0.723 | 0.000 | 0.967 | 0.805 | 1.000 | 0.896 |
부상신고 | 0.413 | 1.000 | 0.805 | 0.000 | 0.914 | 0.720 | 0.896 | 1.000 |
사망자수 | 시도 | 부상신고 | |
---|---|---|---|
사망자수 | 1.000 | 0.000 | 0.000 |
시도 | 0.000 | 1.000 | 0.218 |
부상신고 | 0.000 | 0.218 | 1.000 |
발생건수 | 부상자수 | 중상 | 경상 | 시도 | 사망자수 | 부상신고 | |
---|---|---|---|---|---|---|---|
발생건수 | 1.000 | 0.743 | 0.382 | 0.579 | 0.564 | 0.000 | 0.860 |
부상자수 | 0.743 | 1.000 | 0.363 | 0.872 | 0.178 | 0.000 | 0.614 |
중상 | 0.382 | 0.363 | 1.000 | 0.022 | 0.000 | 0.445 | 0.540 |
경상 | 0.579 | 0.872 | 0.022 | 1.000 | 0.000 | 0.000 | 0.578 |
시도 | 0.564 | 0.178 | 0.000 | 0.000 | 1.000 | 0.000 | 0.218 |
사망자수 | 0.000 | 0.000 | 0.445 | 0.000 | 0.000 | 1.000 | 0.000 |
부상신고 | 0.860 | 0.614 | 0.540 | 0.578 | 0.218 | 0.000 | 1.000 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
0 | 서울 | 중구 | 1 | 0 | 1 | 0 | 1 | 0 |
1 | 서울 | 용산구 | 3 | 2 | 4 | 2 | 2 | 0 |
2 | 서울 | 성동구 | 1 | 0 | 1 | 0 | 1 | 0 |
3 | 서울 | 마포구 | 1 | 0 | 1 | 0 | 1 | 0 |
4 | 서울 | 영등포구 | 1 | 0 | 1 | 0 | 1 | 0 |
5 | 서울 | 강남구 | 2 | 0 | 2 | 0 | 2 | 0 |
6 | 서울 | 강동구 | 1 | 0 | 2 | 0 | 2 | 0 |
7 | 서울 | 송파구 | 3 | 0 | 4 | 1 | 2 | 1 |
8 | 서울 | 서초구 | 2 | 0 | 3 | 0 | 2 | 1 |
9 | 서울 | 중랑구 | 1 | 0 | 4 | 0 | 4 | 0 |
시도 | 시군구 | 발생건수 | 사망자수 | 부상자수 | 중상 | 경상 | 부상신고 | |
---|---|---|---|---|---|---|---|---|
43 | 대구 | 남구 | 1 | 0 | 3 | 0 | 3 | 0 |
44 | 대구 | 북구 | 3 | 0 | 12 | 4 | 8 | 0 |
45 | 대구 | 수성구 | 2 | 0 | 2 | 1 | 1 | 0 |
46 | 대구 | 달서구 | 1 | 0 | 4 | 0 | 4 | 0 |
47 | 인천 | 중구 | 1 | 0 | 4 | 1 | 3 | 0 |
48 | 인천 | 서구 | 1 | 0 | 1 | 0 | 1 | 0 |
49 | 인천 | 계양구 | 1 | 0 | 1 | 1 | 0 | 0 |
50 | 대전 | 서구 | 1 | 0 | 1 | 0 | 1 | 0 |
51 | 대전 | 유성구 | 2 | 0 | 4 | 0 | 4 | 0 |
52 | 대전 | 대덕구 | 1 | 0 | 1 | 0 | 0 | 1 |