Overview

Dataset statistics

Number of variables8
Number of observations36
Missing cells36
Missing cells (%)12.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory73.7 B

Variable types

Categorical3
Numeric4
Unsupported1

Dataset

Description광주광역시 자치구별 5대 범죄 현황에 대한 데이터로 (살인/강도/강간,강제추행/절도/폭력) 관련 데이터를 제공합니다.
Author경찰청 광주광역시경찰청
URLhttps://www.data.go.kr/data/15004571/fileData.do

Alerts

강도 is highly overall correlated with 강간-강제추행 and 2 other fieldsHigh correlation
강간-강제추행 is highly overall correlated with 강도 and 2 other fieldsHigh correlation
절도 is highly overall correlated with 강도 and 2 other fieldsHigh correlation
폭력 is highly overall correlated with 강도 and 2 other fieldsHigh correlation
Unnamed: 7 has 36 (100.0%) missing valuesMissing
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
강도 has 11 (30.6%) zerosZeros
강간-강제추행 has 1 (2.8%) zerosZeros
절도 has 6 (16.7%) zerosZeros
폭력 has 1 (2.8%) zerosZeros

Reproduction

Analysis started2023-12-12 03:41:59.077033
Analysis finished2023-12-12 03:42:01.519716
Duration2.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관서명
Categorical

Distinct6
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size420.0 B
광주광역시경찰청
광주동부경찰서
광주서부경찰서
광주북부경찰서
광주광산경찰서

Length

Max length8
Median length7
Mean length7.1666667
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주광역시경찰청
2nd row광주광역시경찰청
3rd row광주광역시경찰청
4th row광주광역시경찰청
5th row광주광역시경찰청

Common Values

ValueCountFrequency (%)
광주광역시경찰청 6
16.7%
광주동부경찰서 6
16.7%
광주서부경찰서 6
16.7%
광주북부경찰서 6
16.7%
광주광산경찰서 6
16.7%
광주남부경찰서 6
16.7%

Length

2023-12-12T12:42:01.643645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:42:01.798321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광주광역시경찰청 6
16.7%
광주동부경찰서 6
16.7%
광주서부경찰서 6
16.7%
광주북부경찰서 6
16.7%
광주광산경찰서 6
16.7%
광주남부경찰서 6
16.7%

구분
Categorical

Distinct6
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size420.0 B
발 생 건 수
검 거 건 수
검 거 인 원
구 속
불 구 속

Length

Max length10
Median length8.5
Mean length8.1666667
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row발 생 건 수
2nd row검 거 건 수
3rd row검 거 인 원
4th row구 속
5th row불 구 속

Common Values

ValueCountFrequency (%)
발 생 건 수 6
16.7%
검 거 건 수 6
16.7%
검 거 인 원 6
16.7%
구 속 6
16.7%
불 구 속 6
16.7%
기 타 6
16.7%

Length

2023-12-12T12:42:01.988062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:42:02.152264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
12
10.5%
12
10.5%
12
10.5%
12
10.5%
12
10.5%
12
10.5%
6
 
5.3%
6
 
5.3%
6
 
5.3%
6
 
5.3%
Other values (3) 18
15.8%

살인
Categorical

Distinct5
Distinct (%)13.9%
Missing0
Missing (%)0.0%
Memory size420.0 B
0
18 
2
1
4
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 18
50.0%
2 6
 
16.7%
1 6
 
16.7%
4 3
 
8.3%
3 3
 
8.3%

Length

2023-12-12T12:42:02.325341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:42:02.517715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 18
50.0%
2 6
 
16.7%
1 6
 
16.7%
4 3
 
8.3%
3 3
 
8.3%

강도
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6111111
Minimum0
Maximum5
Zeros11
Zeros (%)30.6%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-12T12:42:02.668417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q33
95-th percentile4
Maximum5
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.3995464
Coefficient of variation (CV)0.86868398
Kurtosis-0.63876259
Mean1.6111111
Median Absolute Deviation (MAD)1
Skewness0.42044048
Sum58
Variance1.9587302
MonotonicityNot monotonic
2023-12-12T12:42:02.814163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 11
30.6%
2 9
25.0%
3 7
19.4%
1 6
16.7%
4 2
 
5.6%
5 1
 
2.8%
ValueCountFrequency (%)
0 11
30.6%
1 6
16.7%
2 9
25.0%
3 7
19.4%
4 2
 
5.6%
5 1
 
2.8%
ValueCountFrequency (%)
5 1
 
2.8%
4 2
 
5.6%
3 7
19.4%
2 9
25.0%
1 6
16.7%
0 11
30.6%

강간-강제추행
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct32
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean63.694444
Minimum0
Maximum153
Zeros1
Zeros (%)2.8%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-12T12:42:02.993797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2.75
Q125
median60.5
Q392.75
95-th percentile143
Maximum153
Range153
Interquartile range (IQR)67.75

Descriptive statistics

Standard deviation45.88546
Coefficient of variation (CV)0.72039971
Kurtosis-0.90536602
Mean63.694444
Median Absolute Deviation (MAD)35.5
Skewness0.39229392
Sum2293
Variance2105.4754
MonotonicityNot monotonic
2023-12-12T12:42:03.311442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
91 2
 
5.6%
11 2
 
5.6%
27 2
 
5.6%
62 2
 
5.6%
0 1
 
2.8%
100 1
 
2.8%
14 1
 
2.8%
81 1
 
2.8%
35 1
 
2.8%
125 1
 
2.8%
Other values (22) 22
61.1%
ValueCountFrequency (%)
0 1
2.8%
2 1
2.8%
3 1
2.8%
6 1
2.8%
11 2
5.6%
14 1
2.8%
15 1
2.8%
19 1
2.8%
27 2
5.6%
35 1
2.8%
ValueCountFrequency (%)
153 1
2.8%
146 1
2.8%
142 1
2.8%
133 1
2.8%
130 1
2.8%
125 1
2.8%
120 1
2.8%
100 1
2.8%
98 1
2.8%
91 2
5.6%

절도
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct30
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean369.11111
Minimum0
Maximum1465
Zeros6
Zeros (%)16.7%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-12T12:42:03.518676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q124
median260
Q3628.5
95-th percentile1125.5
Maximum1465
Range1465
Interquartile range (IQR)604.5

Descriptive statistics

Standard deviation396.47322
Coefficient of variation (CV)1.0741297
Kurtosis0.33261941
Mean369.11111
Median Absolute Deviation (MAD)260
Skewness1.004129
Sum13288
Variance157191.02
MonotonicityNot monotonic
2023-12-12T12:42:03.759629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
0 6
 
16.7%
24 2
 
5.6%
27 1
 
2.8%
61 1
 
2.8%
232 1
 
2.8%
12 1
 
2.8%
305 1
 
2.8%
371 1
 
2.8%
636 1
 
2.8%
71 1
 
2.8%
Other values (20) 20
55.6%
ValueCountFrequency (%)
0 6
16.7%
12 1
 
2.8%
13 1
 
2.8%
24 2
 
5.6%
25 1
 
2.8%
27 1
 
2.8%
61 1
 
2.8%
70 1
 
2.8%
71 1
 
2.8%
100 1
 
2.8%
ValueCountFrequency (%)
1465 1
2.8%
1184 1
2.8%
1106 1
2.8%
971 1
2.8%
798 1
2.8%
731 1
2.8%
725 1
2.8%
656 1
2.8%
636 1
2.8%
626 1
2.8%

폭력
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct35
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean778.27778
Minimum0
Maximum2388
Zeros1
Zeros (%)2.8%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-12T12:42:03.961524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6
Q128.5
median697.5
Q31256
95-th percentile2020.25
Maximum2388
Range2388
Interquartile range (IQR)1227.5

Descriptive statistics

Standard deviation689.19591
Coefficient of variation (CV)0.88553976
Kurtosis-0.5697749
Mean778.27778
Median Absolute Deviation (MAD)658.5
Skewness0.56255873
Sum28018
Variance474991.01
MonotonicityNot monotonic
2023-12-12T12:42:04.113503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
6 2
 
5.6%
0 1
 
2.8%
11 1
 
2.8%
16 1
 
2.8%
1334 1
 
2.8%
1038 1
 
2.8%
1631 1
 
2.8%
1459 1
 
2.8%
1998 1
 
2.8%
1129 1
 
2.8%
Other values (25) 25
69.4%
ValueCountFrequency (%)
0 1
2.8%
6 2
5.6%
10 1
2.8%
11 1
2.8%
15 1
2.8%
16 1
2.8%
18 1
2.8%
21 1
2.8%
31 1
2.8%
47 1
2.8%
ValueCountFrequency (%)
2388 1
2.8%
2087 1
2.8%
1998 1
2.8%
1775 1
2.8%
1631 1
2.8%
1545 1
2.8%
1459 1
2.8%
1366 1
2.8%
1334 1
2.8%
1230 1
2.8%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing36
Missing (%)100.0%
Memory size456.0 B

Interactions

2023-12-12T12:42:00.861024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:41:59.469249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:41:59.937351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.441038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.974248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:41:59.572832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.062881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.549144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:01.087468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:41:59.702869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.199802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.657641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:01.187185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:41:59.831332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.338874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:42:00.751467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:42:04.704356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관서명구분살인강도강간-강제추행절도폭력
관서명1.0000.0000.6100.6460.2280.3800.592
구분0.0001.0000.0000.7230.6880.6120.322
살인0.6100.0001.0000.5860.6460.7410.750
강도0.6460.7230.5861.0000.8270.7950.532
강간-강제추행0.2280.6880.6460.8271.0000.8960.847
절도0.3800.6120.7410.7950.8961.0000.906
폭력0.5920.3220.7500.5320.8470.9061.000
2023-12-12T12:42:04.873240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분살인관서명
구분1.0000.0000.000
살인0.0001.0000.458
관서명0.0000.4581.000
2023-12-12T12:42:05.011102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강도강간-강제추행절도폭력관서명구분살인
강도1.0000.5490.5680.5390.2740.3310.435
강간-강제추행0.5491.0000.7860.8430.0680.4120.283
절도0.5680.7861.0000.9200.1750.3440.356
폭력0.5390.8430.9201.0000.3270.1370.364
관서명0.2740.0680.1750.3271.0000.0000.458
구분0.3310.4120.3440.1370.0001.0000.000
살인0.4350.2830.3560.3640.4580.0001.000

Missing values

2023-12-12T12:42:01.318824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:42:01.453621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관서명구분살인강도강간-강제추행절도폭력Unnamed: 7
0광주광역시경찰청발 생 건 수00000<NA>
1광주광역시경찰청검 거 건 수0185018<NA>
2광주광역시경찰청검 거 인 원0291047<NA>
3광주광역시경찰청구 속021106<NA>
4광주광역시경찰청불 구 속0053031<NA>
5광주광역시경찰청기 타0027010<NA>
6광주동부경찰서발 생 건 수2166494677<NA>
7광주동부경찰서검 거 건 수2161350642<NA>
8광주동부경찰서검 거 인 원2260279880<NA>
9광주동부경찰서구 속2231321<NA>
관서명구분살인강도강간-강제추행절도폭력Unnamed: 7
26광주광산경찰서검 거 인 원141006561998<NA>
27광주광산경찰서구 속12112411<NA>
28광주광산경찰서불 구 속02625611129<NA>
29광주광산경찰서기 타002771858<NA>
30광주남부경찰서발 생 건 수0162636839<NA>
31광주남부경찰서검 거 건 수0157371718<NA>
32광주남부경찰서검 거 인 원0265305930<NA>
33광주남부경찰서구 속002126<NA>
34광주남부경찰서불 구 속0048232420<NA>
35광주남부경찰서기 타021561504<NA>