Overview

Dataset statistics

Number of variables4
Number of observations372
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.5 KiB
Average record size in memory34.4 B

Variable types

Categorical2
Numeric2

Dataset

Description주정차 위반 단속실적 집계 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=PK0FSVBLGN760SO14GB612433456&infSeq=1

Alerts

부과건수 is highly overall correlated with 단속건수 and 1 other fieldsHigh correlation
단속건수 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation
시군명 is highly overall correlated with 부과건수 and 1 other fieldsHigh correlation

Reproduction

Analysis started2024-04-20 18:29:53.711175
Analysis finished2024-04-20 18:29:55.255805
Duration1.54 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년월
Categorical

Distinct12
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-01
31 
2023-02
31 
2023-03
31 
2023-04
31 
2023-05
31 
Other values (7)
217 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-01
2nd row2023-02
3rd row2023-03
4th row2023-04
5th row2023-05

Common Values

ValueCountFrequency (%)
2023-01 31
8.3%
2023-02 31
8.3%
2023-03 31
8.3%
2023-04 31
8.3%
2023-05 31
8.3%
2023-06 31
8.3%
2023-07 31
8.3%
2023-08 31
8.3%
2023-09 31
8.3%
2023-10 31
8.3%
Other values (2) 62
16.7%

Length

2024-04-21T03:29:55.309300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2023-01 31
8.3%
2023-02 31
8.3%
2023-03 31
8.3%
2023-04 31
8.3%
2023-05 31
8.3%
2023-06 31
8.3%
2023-07 31
8.3%
2023-08 31
8.3%
2023-09 31
8.3%
2023-10 31
8.3%
Other values (2) 62
16.7%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
수원시
 
12
용인시
 
12
고양시
 
12
화성시
 
12
성남시
 
12
Other values (26)
312 

Length

Max length4
Median length3
Mean length3.0967742
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시
2nd row수원시
3rd row수원시
4th row수원시
5th row수원시

Common Values

ValueCountFrequency (%)
수원시 12
 
3.2%
용인시 12
 
3.2%
고양시 12
 
3.2%
화성시 12
 
3.2%
성남시 12
 
3.2%
부천시 12
 
3.2%
남양주시 12
 
3.2%
안산시 12
 
3.2%
평택시 12
 
3.2%
안양시 12
 
3.2%
Other values (21) 252
67.7%

Length

2024-04-21T03:29:55.405369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 12
 
3.2%
광명시 12
 
3.2%
가평군 12
 
3.2%
과천시 12
 
3.2%
동두천시 12
 
3.2%
여주시 12
 
3.2%
양평군 12
 
3.2%
포천시 12
 
3.2%
의왕시 12
 
3.2%
구리시 12
 
3.2%
Other values (21) 252
67.7%

부과건수
Real number (ℝ)

HIGH CORRELATION 

Distinct368
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10245.075
Minimum127
Maximum39360
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2024-04-21T03:29:55.510273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127
5-th percentile488.3
Q12337
median7070.5
Q314826.5
95-th percentile31276.95
Maximum39360
Range39233
Interquartile range (IQR)12489.5

Descriptive statistics

Standard deviation9781.9673
Coefficient of variation (CV)0.95479701
Kurtosis0.41948886
Mean10245.075
Median Absolute Deviation (MAD)5844
Skewness1.1210781
Sum3811168
Variance95686883
MonotonicityNot monotonic
2024-04-21T03:29:55.625634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
221 2
 
0.5%
205 2
 
0.5%
1004 2
 
0.5%
3757 2
 
0.5%
8605 1
 
0.3%
6368 1
 
0.3%
6125 1
 
0.3%
5947 1
 
0.3%
5343 1
 
0.3%
5381 1
 
0.3%
Other values (358) 358
96.2%
ValueCountFrequency (%)
127 1
0.3%
169 1
0.3%
188 1
0.3%
202 1
0.3%
205 2
0.5%
221 2
0.5%
228 1
0.3%
229 1
0.3%
265 1
0.3%
294 1
0.3%
ValueCountFrequency (%)
39360 1
0.3%
37878 1
0.3%
37749 1
0.3%
37504 1
0.3%
36979 1
0.3%
36462 1
0.3%
36359 1
0.3%
35854 1
0.3%
35323 1
0.3%
34908 1
0.3%

단속건수
Real number (ℝ)

HIGH CORRELATION 

Distinct370
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9673.5027
Minimum118
Maximum41227
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2024-04-21T03:29:55.743958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum118
5-th percentile499.8
Q11520.75
median7184.5
Q314570.25
95-th percentile28637.35
Maximum41227
Range41109
Interquartile range (IQR)13049.5

Descriptive statistics

Standard deviation9320.4614
Coefficient of variation (CV)0.96350429
Kurtosis0.43427857
Mean9673.5027
Median Absolute Deviation (MAD)5882
Skewness1.1017406
Sum3598543
Variance86871000
MonotonicityNot monotonic
2024-04-21T03:29:55.848476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
194 2
 
0.5%
16638 2
 
0.5%
41227 1
 
0.3%
6167 1
 
0.3%
2862 1
 
0.3%
2607 1
 
0.3%
2367 1
 
0.3%
6960 1
 
0.3%
6065 1
 
0.3%
5880 1
 
0.3%
Other values (360) 360
96.8%
ValueCountFrequency (%)
118 1
0.3%
133 1
0.3%
152 1
0.3%
160 1
0.3%
172 1
0.3%
194 2
0.5%
220 1
0.3%
256 1
0.3%
258 1
0.3%
262 1
0.3%
ValueCountFrequency (%)
41227 1
0.3%
37295 1
0.3%
36068 1
0.3%
35887 1
0.3%
35488 1
0.3%
35453 1
0.3%
33724 1
0.3%
33582 1
0.3%
33580 1
0.3%
32988 1
0.3%

Interactions

2024-04-21T03:29:54.992658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:29:54.686647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:29:55.073232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:29:54.812362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:29:55.915468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집계년월시군명부과건수단속건수
집계년월1.0000.0000.0000.000
시군명0.0001.0000.9500.916
부과건수0.0000.9501.0000.955
단속건수0.0000.9160.9551.000
2024-04-21T03:29:55.990293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명집계년월
시군명1.0000.000
집계년월0.0001.000
2024-04-21T03:29:56.054232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과건수단속건수집계년월시군명
부과건수1.0000.9890.0000.714
단속건수0.9891.0000.0000.616
집계년월0.0000.0001.0000.000
시군명0.7140.6160.0001.000

Missing values

2024-04-21T03:29:55.155575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:29:55.224793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년월시군명부과건수단속건수
02023-01수원시3429041227
12023-02수원시3697935453
22023-03수원시3787837295
32023-04수원시3489333724
42023-05수원시3532333582
52023-06수원시3441732942
62023-07수원시3585432436
72023-08수원시3936036068
82023-09수원시3635932481
92023-10수원시3774935887
집계년월시군명부과건수단속건수
3622023-03연천군169160
3632023-04연천군228258
3642023-05연천군127152
3652023-06연천군188194
3662023-07연천군205172
3672023-08연천군312194
3682023-09연천군265220
3692023-10연천군331133
3702023-11연천군205368
3712023-12연천군202305