Overview

Dataset statistics

Number of variables3
Number of observations216
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 KiB
Average record size in memory25.6 B

Variable types

Categorical2
Numeric1

Dataset

Description성범죄자 지역별 통계
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2702

Alerts

성범죄자 수 is highly overall correlated with 시도명High correlation
시도명 is highly overall correlated with 성범죄자 수High correlation

Reproduction

Analysis started2024-03-13 11:49:47.172312
Analysis finished2024-03-13 11:49:47.749408
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
서울특별시
 
12
경기도
 
12
충청남도
 
12
경상남도
 
12
경상북도
 
12
Other values (14)
156 

Length

Max length7
Median length5
Mean length4.5925926
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row경기도
3rd row경상남도
4th row경상북도
5th row광주광역시

Common Values

ValueCountFrequency (%)
서울특별시 12
 
5.6%
경기도 12
 
5.6%
충청남도 12
 
5.6%
경상남도 12
 
5.6%
경상북도 12
 
5.6%
광주광역시 12
 
5.6%
기타 12
 
5.6%
대구광역시 12
 
5.6%
대전광역시 12
 
5.6%
충청북도 12
 
5.6%
Other values (9) 96
44.4%

Length

2024-03-13T20:49:47.840824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 12
 
5.6%
경기도 12
 
5.6%
제주특별자치도 12
 
5.6%
전라북도 12
 
5.6%
전라남도 12
 
5.6%
인천광역시 12
 
5.6%
울산광역시 12
 
5.6%
세종특별자치시 12
 
5.6%
부산광역시 12
 
5.6%
충청북도 12
 
5.6%
Other values (9) 96
44.4%

성범죄자 수
Real number (ℝ)

HIGH CORRELATION 

Distinct124
Distinct (%)57.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean208.25
Minimum6
Maximum727
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-03-13T20:49:47.992003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile9
Q1119.75
median156.5
Q3229.25
95-th percentile682.75
Maximum727
Range721
Interquartile range (IQR)109.5

Descriptive statistics

Standard deviation175.94291
Coefficient of variation (CV)0.84486394
Kurtosis1.8128683
Mean208.25
Median Absolute Deviation (MAD)59
Skewness1.5994759
Sum44982
Variance30955.909
MonotonicityNot monotonic
2024-03-13T20:49:48.521033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42 6
 
2.8%
178 5
 
2.3%
236 5
 
2.3%
55 5
 
2.3%
143 5
 
2.3%
141 5
 
2.3%
41 4
 
1.9%
139 4
 
1.9%
120 4
 
1.9%
136 4
 
1.9%
Other values (114) 169
78.2%
ValueCountFrequency (%)
6 2
 
0.9%
7 4
1.9%
8 4
1.9%
9 2
 
0.9%
41 4
1.9%
42 6
2.8%
44 2
 
0.9%
55 5
2.3%
56 1
 
0.5%
57 2
 
0.9%
ValueCountFrequency (%)
727 1
0.5%
719 1
0.5%
712 1
0.5%
708 1
0.5%
706 2
0.9%
698 2
0.9%
696 1
0.5%
685 2
0.9%
682 1
0.5%
581 1
0.5%

통계일자
Categorical

Distinct12
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2022-12-01
18 
2023-03-01
18 
2023-04-01
18 
2023-07-01
18 
2023-08-01
18 
Other values (7)
126 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-01
2nd row2022-12-01
3rd row2022-12-01
4th row2022-12-01
5th row2022-12-01

Common Values

ValueCountFrequency (%)
2022-12-01 18
8.3%
2023-03-01 18
8.3%
2023-04-01 18
8.3%
2023-07-01 18
8.3%
2023-08-01 18
8.3%
2023-10-01 18
8.3%
2024-01-01 18
8.3%
2022-11-08 18
8.3%
2023-02-01 18
8.3%
2023-09-01 18
8.3%
Other values (2) 36
16.7%

Length

2024-03-13T20:49:48.711759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-12-01 18
8.3%
2023-03-01 18
8.3%
2023-04-01 18
8.3%
2023-07-01 18
8.3%
2023-08-01 18
8.3%
2023-10-01 18
8.3%
2024-01-01 18
8.3%
2022-11-08 18
8.3%
2023-02-01 18
8.3%
2023-09-01 18
8.3%
Other values (2) 36
16.7%

Interactions

2024-03-13T20:49:47.311867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T20:49:48.818282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명성범죄자 수통계일자
시도명1.0000.9920.000
성범죄자 수0.9921.0000.000
통계일자0.0000.0001.000
2024-03-13T20:49:48.950960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계일자시도명
통계일자1.0000.000
시도명0.0001.000
2024-03-13T20:49:49.070797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성범죄자 수시도명통계일자
성범죄자 수1.0000.9460.000
시도명0.9461.0000.000
통계일자0.0000.0001.000

Missing values

2024-03-13T20:49:47.554833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T20:49:47.706003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명성범죄자 수통계일자
0강원도1292022-12-01
1경기도7272022-12-01
2경상남도2362022-12-01
3경상북도2022022-12-01
4광주광역시1212022-12-01
5기타5512022-12-01
6대구광역시1512022-12-01
7대전광역시832022-12-01
8부산광역시1632022-12-01
9서울특별시4382022-12-01
시도명성범죄자 수통계일자
206부산광역시1782023-06-01
207서울특별시4222023-06-01
208세종특별자치시82023-06-01
209울산광역시572023-06-01
210인천광역시2392023-06-01
211전라남도1382023-06-01
212전라북도1912023-06-01
213제주특별자치도422023-06-01
214충청남도1792023-06-01
215충청북도1422023-06-01