Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 38 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.4 KiB |
Average record size in memory | 64.5 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 5 |
Dataset
Description | 전국 경찰관서에 고소, 고발, 인지 등으로 형사입건된 사건의 발생, 검거, 피의자에 대한 죄종별 분석 현황 |
---|---|
Author | 경찰청 |
URL | https://www.data.go.kr/data/3074481/fileData.do |
자백여부(자백) is highly overall correlated with 자백여부(일부자백) and 4 other fields | High correlation |
자백여부(일부자백) is highly overall correlated with 자백여부(자백) and 3 other fields | High correlation |
자백여부(부인) is highly overall correlated with 자백여부(자백) and 3 other fields | High correlation |
자백여부(묵비) is highly overall correlated with 자백여부(자백) and 4 other fields | High correlation |
미상 is highly overall correlated with 자백여부(자백) and 3 other fields | High correlation |
범죄대분류 is highly overall correlated with 자백여부(자백) and 1 other fields | High correlation |
범죄중분류 has unique values | Unique |
자백여부(자백) has unique values | Unique |
자백여부(일부자백) has unique values | Unique |
미상 has unique values | Unique |
자백여부(묵비) has 6 (15.8%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 03:30:06.938515 |
---|---|
Analysis finished | 2023-12-12 03:30:10.391000 |
Duration | 3.45 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
범죄대분류
Categorical
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 39.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 436.0 B |
지능범죄 | |
---|---|
강력범죄 | |
폭력범죄 | |
풍속범죄 | |
절도범죄 | 1 |
Other values (10) |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 4.0263158 |
Min length | 2 |
Unique
Unique | 11 ? |
---|---|
Unique (%) | 28.9% |
Sample
1st row | 강력범죄 |
---|---|
2nd row | 강력범죄 |
3rd row | 강력범죄 |
4th row | 강력범죄 |
5th row | 강력범죄 |
Common Values
Value | Count | Frequency (%) |
지능범죄 | 9 | |
강력범죄 | 8 | |
폭력범죄 | 8 | |
풍속범죄 | 2 | 5.3% |
절도범죄 | 1 | 2.6% |
특별경제범죄 | 1 | 2.6% |
마약범죄 | 1 | 2.6% |
보건범죄 | 1 | 2.6% |
환경범죄 | 1 | 2.6% |
교통범죄 | 1 | 2.6% |
Other values (5) | 5 |
Length
Value | Count | Frequency (%) |
지능범죄 | 9 | |
강력범죄 | 8 | |
폭력범죄 | 8 | |
풍속범죄 | 2 | 5.3% |
절도범죄 | 1 | 2.6% |
특별경제범죄 | 1 | 2.6% |
마약범죄 | 1 | 2.6% |
보건범죄 | 1 | 2.6% |
환경범죄 | 1 | 2.6% |
교통범죄 | 1 | 2.6% |
Other values (5) | 5 |
범죄중분류
Text
UNIQUE
 
Distinct | 38 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 436.0 B |
Value | Count | Frequency (%) |
살인기수 | 1 | 2.6% |
도박범죄 | 1 | 2.6% |
병역범죄 | 1 | 2.6% |
문서·인장 | 1 | 2.6% |
유가증권인지 | 1 | 2.6% |
사기 | 1 | 2.6% |
횡령 | 1 | 2.6% |
배임 | 1 | 2.6% |
성풍속범죄 | 1 | 2.6% |
마약범죄 | 1 | 2.6% |
Other values (28) | 28 |
Most occurring characters
Value | Count | Frequency (%) |
범 | 12 | 8.5% |
죄 | 12 | 8.5% |
강 | 6 | 4.3% |
기 | 5 | 3.5% |
인 | 5 | 3.5% |
유 | 4 | 2.8% |
· | 4 | 2.8% |
행 | 4 | 2.8% |
제 | 3 | 2.1% |
간 | 3 | 2.1% |
Other values (63) | 83 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 137 | |
Other Punctuation | 4 | 2.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
범 | 12 | 8.8% |
죄 | 12 | 8.8% |
강 | 6 | 4.4% |
기 | 5 | 3.6% |
인 | 5 | 3.6% |
유 | 4 | 2.9% |
행 | 4 | 2.9% |
제 | 3 | 2.2% |
간 | 3 | 2.2% |
등 | 3 | 2.2% |
Other values (62) | 80 |
Other Punctuation
Value | Count | Frequency (%) |
· | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 137 | |
Common | 4 | 2.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
범 | 12 | 8.8% |
죄 | 12 | 8.8% |
강 | 6 | 4.4% |
기 | 5 | 3.6% |
인 | 5 | 3.6% |
유 | 4 | 2.9% |
행 | 4 | 2.9% |
제 | 3 | 2.2% |
간 | 3 | 2.2% |
등 | 3 | 2.2% |
Other values (62) | 80 |
Common
Value | Count | Frequency (%) |
· | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 137 | |
None | 4 | 2.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
범 | 12 | 8.8% |
죄 | 12 | 8.8% |
강 | 6 | 4.4% |
기 | 5 | 3.6% |
인 | 5 | 3.6% |
유 | 4 | 2.9% |
행 | 4 | 2.9% |
제 | 3 | 2.2% |
간 | 3 | 2.2% |
등 | 3 | 2.2% |
Other values (62) | 80 |
None
Value | Count | Frequency (%) |
· | 4 |
자백여부(자백)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 38 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24655.579 |
Minimum | 18 |
---|---|
Maximum | 376010 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 474.0 B |
Quantile statistics
Minimum | 18 |
---|---|
5-th percentile | 26.65 |
Q1 | 288.25 |
median | 1777 |
Q3 | 18681.5 |
95-th percentile | 81805.95 |
Maximum | 376010 |
Range | 375992 |
Interquartile range (IQR) | 18393.25 |
Descriptive statistics
Standard deviation | 64483.966 |
---|---|
Coefficient of variation (CV) | 2.6153905 |
Kurtosis | 25.032967 |
Mean | 24655.579 |
Median Absolute Deviation (MAD) | 1733 |
Skewness | 4.7339463 |
Sum | 936912 |
Variance | 4.1581819 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
229 | 1 | 2.6% |
3486 | 1 | 2.6% |
98 | 1 | 2.6% |
63889 | 1 | 2.6% |
10741 | 1 | 2.6% |
1964 | 1 | 2.6% |
9301 | 1 | 2.6% |
23157 | 1 | 2.6% |
40985 | 1 | 2.6% |
15321 | 1 | 2.6% |
Other values (28) | 28 |
Value | Count | Frequency (%) |
18 | 1 | |
19 | 1 | |
28 | 1 | |
60 | 1 | |
77 | 1 | |
98 | 1 | |
156 | 1 | |
229 | 1 | |
266 | 1 | |
287 | 1 |
Value | Count | Frequency (%) |
376010 | 1 | |
132052 | 1 | |
72939 | 1 | |
63889 | 1 | |
50174 | 1 | |
40985 | 1 | |
38233 | 1 | |
36481 | 1 | |
23157 | 1 | |
19301 | 1 |
자백여부(일부자백)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 38 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4464.4474 |
Minimum | 19 |
---|---|
Maximum | 28225 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 474.0 B |
Quantile statistics
Minimum | 19 |
---|---|
5-th percentile | 22.85 |
Q1 | 191.25 |
median | 744 |
Q3 | 3105.25 |
95-th percentile | 21730.3 |
Maximum | 28225 |
Range | 28206 |
Interquartile range (IQR) | 2914 |
Descriptive statistics
Standard deviation | 7654.8923 |
---|---|
Coefficient of variation (CV) | 1.7146338 |
Kurtosis | 2.4560348 |
Mean | 4464.4474 |
Median Absolute Deviation (MAD) | 680.5 |
Skewness | 1.8896784 |
Sum | 169649 |
Variance | 58597376 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
88 | 1 | 2.6% |
707 | 1 | 2.6% |
63 | 1 | 2.6% |
21460 | 1 | 2.6% |
3106 | 1 | 2.6% |
848 | 1 | 2.6% |
1015 | 1 | 2.6% |
1863 | 1 | 2.6% |
11557 | 1 | 2.6% |
2808 | 1 | 2.6% |
Other values (28) | 28 |
Value | Count | Frequency (%) |
19 | 1 | |
22 | 1 | |
23 | 1 | |
30 | 1 | |
63 | 1 | |
64 | 1 | |
88 | 1 | |
89 | 1 | |
174 | 1 | |
183 | 1 |
Value | Count | Frequency (%) |
28225 | 1 | |
23262 | 1 | |
21460 | 1 | |
19192 | 1 | |
17236 | 1 | |
13415 | 1 | |
11557 | 1 | |
9442 | 1 | |
4461 | 1 | |
3106 | 1 |
자백여부(부인)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 37 |
---|---|
Distinct (%) | 97.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2835.2632 |
Minimum | 4 |
---|---|
Maximum | 22070 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 474.0 B |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 15.8 |
Q1 | 90.75 |
median | 498 |
Q3 | 2819.75 |
95-th percentile | 13112.7 |
Maximum | 22070 |
Range | 22066 |
Interquartile range (IQR) | 2729 |
Descriptive statistics
Standard deviation | 5113.8329 |
---|---|
Coefficient of variation (CV) | 1.8036537 |
Kurtosis | 6.2562182 |
Mean | 2835.2632 |
Median Absolute Deviation (MAD) | 470 |
Skewness | 2.5113242 |
Sum | 107740 |
Variance | 26151287 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
63 | 2 | 5.3% |
555 | 1 | 2.6% |
1783 | 1 | 2.6% |
22070 | 1 | 2.6% |
3031 | 1 | 2.6% |
1096 | 1 | 2.6% |
955 | 1 | 2.6% |
1293 | 1 | 2.6% |
6415 | 1 | 2.6% |
17 | 1 | 2.6% |
Other values (27) | 27 |
Value | Count | Frequency (%) |
4 | 1 | |
9 | 1 | |
17 | 1 | |
39 | 1 | |
45 | 1 | |
62 | 1 | |
63 | 2 | |
88 | 1 | |
90 | 1 | |
93 | 1 |
Value | Count | Frequency (%) |
22070 | 1 | |
18075 | 1 | |
12237 | 1 | |
10829 | 1 | |
7880 | 1 | |
6415 | 1 | |
5597 | 1 | |
4455 | 1 | |
3398 | 1 | |
3031 | 1 |
자백여부(묵비)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 24 |
---|---|
Distinct (%) | 63.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56.315789 |
Minimum | 0 |
---|---|
Maximum | 667 |
Zeros | 6 |
Zeros (%) | 15.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 474.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 9 |
Q3 | 23.75 |
95-th percentile | 247.25 |
Maximum | 667 |
Range | 667 |
Interquartile range (IQR) | 21.75 |
Descriptive statistics
Standard deviation | 128.09458 |
---|---|
Coefficient of variation (CV) | 2.2745767 |
Kurtosis | 14.61121 |
Mean | 56.315789 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 3.6111595 |
Sum | 2140 |
Variance | 16408.222 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6 | |
2 | 4 | 10.5% |
8 | 3 | 7.9% |
9 | 3 | 7.9% |
3 | 3 | 7.9% |
21 | 1 | 2.6% |
667 | 1 | 2.6% |
13 | 1 | 2.6% |
379 | 1 | 2.6% |
1 | 1 | 2.6% |
Other values (14) | 14 |
Value | Count | Frequency (%) |
0 | 6 | |
1 | 1 | 2.6% |
2 | 4 | |
3 | 3 | |
5 | 1 | 2.6% |
8 | 3 | |
9 | 3 | |
10 | 1 | 2.6% |
13 | 1 | 2.6% |
16 | 1 | 2.6% |
Value | Count | Frequency (%) |
667 | 1 | |
379 | 1 | |
224 | 1 | |
197 | 1 | |
142 | 1 | |
104 | 1 | |
96 | 1 | |
68 | 1 | |
45 | 1 | |
24 | 1 |
미상
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 38 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13052.474 |
Minimum | 28 |
---|---|
Maximum | 138867 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 474.0 B |
Quantile statistics
Minimum | 28 |
---|---|
5-th percentile | 67.6 |
Q1 | 253.5 |
median | 851 |
Q3 | 5382.75 |
95-th percentile | 89681.1 |
Maximum | 138867 |
Range | 138839 |
Interquartile range (IQR) | 5129.25 |
Descriptive statistics
Standard deviation | 31756.343 |
---|---|
Coefficient of variation (CV) | 2.4329751 |
Kurtosis | 8.8308127 |
Mean | 13052.474 |
Median Absolute Deviation (MAD) | 780 |
Skewness | 3.0505297 |
Sum | 495994 |
Variance | 1.0084653 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
105 | 1 | 2.6% |
498 | 1 | 2.6% |
326 | 1 | 2.6% |
114530 | 1 | 2.6% |
15076 | 1 | 2.6% |
5437 | 1 | 2.6% |
2651 | 1 | 2.6% |
1427 | 1 | 2.6% |
28549 | 1 | 2.6% |
2546 | 1 | 2.6% |
Other values (28) | 28 |
Value | Count | Frequency (%) |
28 | 1 | |
54 | 1 | |
70 | 1 | |
72 | 1 | |
104 | 1 | |
105 | 1 | |
151 | 1 | |
174 | 1 | |
191 | 1 | |
246 | 1 |
Value | Count | Frequency (%) |
138867 | 1 | |
114530 | 1 | |
85296 | 1 | |
51835 | 1 | |
28549 | 1 | |
15076 | 1 | |
11755 | 1 | |
9200 | 1 | |
7571 | 1 | |
5437 | 1 |
범죄대분류 | 범죄중분류 | 자백여부(자백) | 자백여부(일부자백) | 자백여부(부인) | 자백여부(묵비) | 미상 | |
---|---|---|---|---|---|---|---|
범죄대분류 | 1.000 | 1.000 | 0.937 | 0.749 | 0.000 | 0.863 | 0.803 |
범죄중분류 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
자백여부(자백) | 0.937 | 1.000 | 1.000 | 0.986 | 0.982 | 0.998 | 0.929 |
자백여부(일부자백) | 0.749 | 1.000 | 0.986 | 1.000 | 0.994 | 0.962 | 0.924 |
자백여부(부인) | 0.000 | 1.000 | 0.982 | 0.994 | 1.000 | 0.931 | 0.913 |
자백여부(묵비) | 0.863 | 1.000 | 0.998 | 0.962 | 0.931 | 1.000 | 0.938 |
미상 | 0.803 | 1.000 | 0.929 | 0.924 | 0.913 | 0.938 | 1.000 |
자백여부(자백) | 자백여부(일부자백) | 자백여부(부인) | 자백여부(묵비) | 미상 | 범죄대분류 | |
---|---|---|---|---|---|---|
자백여부(자백) | 1.000 | 0.949 | 0.884 | 0.840 | 0.854 | 0.689 |
자백여부(일부자백) | 0.949 | 1.000 | 0.967 | 0.863 | 0.885 | 0.374 |
자백여부(부인) | 0.884 | 0.967 | 1.000 | 0.852 | 0.904 | 0.000 |
자백여부(묵비) | 0.840 | 0.863 | 0.852 | 1.000 | 0.808 | 0.524 |
미상 | 0.854 | 0.885 | 0.904 | 0.808 | 1.000 | 0.446 |
범죄대분류 | 0.689 | 0.374 | 0.000 | 0.524 | 0.446 | 1.000 |
범죄대분류 | 범죄중분류 | 자백여부(자백) | 자백여부(일부자백) | 자백여부(부인) | 자백여부(묵비) | 미상 | |
---|---|---|---|---|---|---|---|
0 | 강력범죄 | 살인기수 | 229 | 88 | 17 | 8 | 105 |
1 | 강력범죄 | 살인미수등 | 266 | 174 | 62 | 3 | 70 |
2 | 강력범죄 | 강도 | 1270 | 411 | 221 | 2 | 174 |
3 | 강력범죄 | 강간 | 1460 | 1256 | 1484 | 9 | 1308 |
4 | 강력범죄 | 유사강간 | 156 | 89 | 90 | 0 | 54 |
5 | 강력범죄 | 강제추행 | 5181 | 3103 | 3398 | 24 | 1694 |
6 | 강력범죄 | 기타강간·강제추행등 | 422 | 322 | 295 | 3 | 151 |
7 | 강력범죄 | 방화 | 1036 | 216 | 100 | 10 | 104 |
8 | 절도범죄 | 절도범죄 | 72939 | 9442 | 5597 | 96 | 7571 |
9 | 폭력범죄 | 상해 | 36481 | 17236 | 7880 | 68 | 5220 |
범죄대분류 | 범죄중분류 | 자백여부(자백) | 자백여부(일부자백) | 자백여부(부인) | 자백여부(묵비) | 미상 | |
---|---|---|---|---|---|---|---|
28 | 특별경제범죄 | 특별경제범죄 | 40985 | 11557 | 6415 | 142 | 28549 |
29 | 마약범죄 | 마약범죄 | 3486 | 707 | 555 | 18 | 498 |
30 | 보건범죄 | 보건범죄 | 15321 | 2808 | 1402 | 8 | 2546 |
31 | 환경범죄 | 환경범죄 | 2397 | 249 | 88 | 1 | 401 |
32 | 교통범죄 | 교통범죄 | 376010 | 13415 | 4455 | 379 | 138867 |
33 | 노동범죄 | 노동범죄 | 1590 | 220 | 45 | 0 | 276 |
34 | 안보범죄 | 안보범죄 | 19 | 30 | 4 | 9 | 28 |
35 | 선거범죄 | 선거범죄 | 1093 | 548 | 385 | 2 | 772 |
36 | 병역범죄 | 병역범죄 | 16823 | 713 | 153 | 13 | 821 |
37 | 기타 | 기타 | 132052 | 28225 | 18075 | 667 | 51835 |