Overview

Dataset statistics

Number of variables4
Number of observations2093
Missing cells285
Missing cells (%)3.4%
Duplicate rows286
Duplicate rows (%)13.7%
Total size in memory71.7 KiB
Average record size in memory35.1 B

Variable types

DateTime1
Numeric2
Categorical1

Dataset

Description경기도 평택시 교통위반과태료통합민원시스템 DB의 과오납 테이블 정보(과납등록일자, 고지년월, 시군구코드)입니다.
Author경기도 평택시
URLhttps://www.data.go.kr/data/15064012/fileData.do

Alerts

Dataset has 286 (13.7%) duplicate rowsDuplicates
고지기준월 has 285 (13.6%) missing valuesMissing

Reproduction

Analysis started2024-04-21 14:43:43.660993
Analysis finished2024-04-21 14:43:45.009779
Duration1.35 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1007
Distinct (%)48.1%
Missing0
Missing (%)0.0%
Memory size16.5 KiB
Minimum2017-08-03 00:00:00
Maximum2024-01-31 00:00:00
2024-04-21T23:43:45.226988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:45.644538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

고지기준연도
Real number (ℝ)

Distinct21
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.4568
Minimum2003
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size18.5 KiB
2024-04-21T23:43:46.015953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2003
5-th percentile2008
Q12018
median2019
Q32021
95-th percentile2023
Maximum2024
Range21
Interquartile range (IQR)3

Descriptive statistics

Standard deviation4.3503004
Coefficient of variation (CV)0.0021552606
Kurtosis1.6540046
Mean2018.4568
Median Absolute Deviation (MAD)2
Skewness-1.5248057
Sum4224630
Variance18.925113
MonotonicityNot monotonic
2024-04-21T23:43:46.373040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
2019 385
18.4%
2020 266
12.7%
2023 265
12.7%
2018 264
12.6%
2017 218
10.4%
2022 204
9.7%
2021 190
9.1%
2008 165
7.9%
2007 47
 
2.2%
2024 23
 
1.1%
Other values (11) 66
 
3.2%
ValueCountFrequency (%)
2003 3
 
0.1%
2004 1
 
< 0.1%
2006 6
 
0.3%
2007 47
 
2.2%
2008 165
7.9%
2009 15
 
0.7%
2010 3
 
0.1%
2011 5
 
0.2%
2012 3
 
0.1%
2013 2
 
0.1%
ValueCountFrequency (%)
2024 23
 
1.1%
2023 265
12.7%
2022 204
9.7%
2021 190
9.1%
2020 266
12.7%
2019 385
18.4%
2018 264
12.6%
2017 218
10.4%
2016 17
 
0.8%
2015 6
 
0.3%

고지기준월
Real number (ℝ)

MISSING 

Distinct12
Distinct (%)0.7%
Missing285
Missing (%)13.6%
Infinite0
Infinite (%)0.0%
Mean6.9386062
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size18.5 KiB
2024-04-21T23:43:46.712788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.4516332
Coefficient of variation (CV)0.49745339
Kurtosis-1.2211014
Mean6.9386062
Median Absolute Deviation (MAD)3
Skewness-0.17942006
Sum12545
Variance11.913772
MonotonicityNot monotonic
2024-04-21T23:43:47.063574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
11 259
12.4%
7 186
8.9%
10 160
7.6%
9 154
7.4%
5 146
7.0%
6 143
6.8%
3 140
6.7%
12 138
6.6%
2 134
6.4%
1 124
5.9%
Other values (2) 224
10.7%
(Missing) 285
13.6%
ValueCountFrequency (%)
1 124
5.9%
2 134
6.4%
3 140
6.7%
4 110
5.3%
5 146
7.0%
6 143
6.8%
7 186
8.9%
8 114
5.4%
9 154
7.4%
10 160
7.6%
ValueCountFrequency (%)
12 138
6.6%
11 259
12.4%
10 160
7.6%
9 154
7.4%
8 114
5.4%
7 186
8.9%
6 143
6.8%
5 146
7.0%
4 110
5.3%
3 140
6.7%
Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size16.5 KiB
41220
1287 
41222
445 
<NA>
285 
41224
 
76

Length

Max length5
Median length5
Mean length4.8638318
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row41224
2nd row41220
3rd row41222
4th row41220
5th row41222

Common Values

ValueCountFrequency (%)
41220 1287
61.5%
41222 445
 
21.3%
<NA> 285
 
13.6%
41224 76
 
3.6%

Length

2024-04-21T23:43:47.459780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T23:43:47.802527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41220 1287
61.5%
41222 445
 
21.3%
na 285
 
13.6%
41224 76
 
3.6%

Interactions

2024-04-21T23:43:44.385220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:43.867396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:44.574836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:44.115688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T23:43:48.026400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고지기준연도고지기준월시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)
고지기준연도1.0000.4630.447
고지기준월0.4631.0000.365
시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)0.4470.3651.000
2024-04-21T23:43:48.283586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고지기준연도고지기준월시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)
고지기준연도1.000-0.0750.299
고지기준월-0.0751.0000.235
시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)0.2990.2351.000

Missing values

2024-04-21T23:43:44.775010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T23:43:44.928164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과납등록일자고지기준연도고지기준월시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)
02017-08-032017741224
12017-08-082017741220
22017-08-162017741222
32017-08-172017841220
42017-08-172017741222
52017-08-182017741222
62017-08-182017741222
72017-09-1120111141222
82017-09-112017841222
92017-09-122017941224
과납등록일자고지기준연도고지기준월시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)
20832024-01-222024<NA><NA>
20842024-01-232024<NA><NA>
20852024-01-262024<NA><NA>
20862024-01-292024<NA><NA>
20872024-01-292024<NA><NA>
20882024-01-292024<NA><NA>
20892024-01-292024<NA><NA>
20902024-01-302024<NA><NA>
20912024-01-302024<NA><NA>
20922024-01-312024<NA><NA>

Duplicate rows

Most frequently occurring

과납등록일자고지기준연도고지기준월시군구코드(41220-평택시청 41222-송탄출장소 41224-안중출장소)# duplicates
862019-12-142019114122047
912019-12-242019114122040
822019-10-102017124122015
122017-12-052017114122014
2642023-10-042023<NA><NA>13
2782023-12-262023<NA><NA>12
2482023-07-122023<NA><NA>9
462018-10-0220189412208
812019-10-10201710412207
892019-12-23201911412206