Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory732.4 KiB
Average record size in memory75.0 B

Variable types

Numeric2
Categorical4
Text2

Dataset

Description대전광역시 서구 어린이보호구역 불법주정차 단속현황입니다.(순번, 차량구분, 위반연도, 위반월, 위반시간, 단속위치, 단속동명, 신고구분)
URLhttps://www.data.go.kr/data/15104525/fileData.do

Alerts

순번 is highly overall correlated with 위반연도High correlation
위반월 is highly overall correlated with 위반연도High correlation
위반연도 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
차량구분 is highly imbalanced (76.6%)Imbalance
신고구분 is highly imbalanced (73.4%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:07:19.703169
Analysis finished2023-12-12 09:07:21.553822
Duration1.85 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14330.534
Minimum10
Maximum28869
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:07:21.639091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile1452.9
Q17154.75
median14300
Q321414.5
95-th percentile27345.4
Maximum28869
Range28859
Interquartile range (IQR)14259.75

Descriptive statistics

Standard deviation8285.2682
Coefficient of variation (CV)0.57815489
Kurtosis-1.1853452
Mean14330.534
Median Absolute Deviation (MAD)7130
Skewness0.015942795
Sum1.4330534 × 108
Variance68645669
MonotonicityNot monotonic
2023-12-12T18:07:21.813800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17556 1
 
< 0.1%
16356 1
 
< 0.1%
25492 1
 
< 0.1%
20553 1
 
< 0.1%
10660 1
 
< 0.1%
18586 1
 
< 0.1%
5790 1
 
< 0.1%
21829 1
 
< 0.1%
18352 1
 
< 0.1%
2321 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
10 1
< 0.1%
14 1
< 0.1%
17 1
< 0.1%
24 1
< 0.1%
25 1
< 0.1%
36 1
< 0.1%
37 1
< 0.1%
38 1
< 0.1%
39 1
< 0.1%
43 1
< 0.1%
ValueCountFrequency (%)
28869 1
< 0.1%
28868 1
< 0.1%
28866 1
< 0.1%
28865 1
< 0.1%
28864 1
< 0.1%
28854 1
< 0.1%
28851 1
< 0.1%
28850 1
< 0.1%
28843 1
< 0.1%
28834 1
< 0.1%

차량구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
승용
9101 
화물4톤이하
 
612
승합
 
229
화물4톤초과
 
32
건설,중기,특수
 
26

Length

Max length8
Median length2
Mean length2.2732
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row승용
2nd row승용
3rd row승용
4th row승용
5th row승용

Common Values

ValueCountFrequency (%)
승용 9101
91.0%
화물4톤이하 612
 
6.1%
승합 229
 
2.3%
화물4톤초과 32
 
0.3%
건설,중기,특수 26
 
0.3%

Length

2023-12-12T18:07:22.022470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:22.180913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
승용 9101
91.0%
화물4톤이하 612
 
6.1%
승합 229
 
2.3%
화물4톤초과 32
 
0.3%
건설,중기,특수 26
 
0.3%

위반연도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023
5446 
2022
4554 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2022
4th row2023
5th row2022

Common Values

ValueCountFrequency (%)
2023 5446
54.5%
2022 4554
45.5%

Length

2023-12-12T18:07:22.361067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:22.513153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 5446
54.5%
2022 4554
45.5%

위반월
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.5112
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:07:22.643757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q37
95-th percentile11
Maximum12
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.842619
Coefficient of variation (CV)0.51578948
Kurtosis-0.24527673
Mean5.5112
Median Absolute Deviation (MAD)2
Skewness0.61998515
Sum55112
Variance8.0804826
MonotonicityNot monotonic
2023-12-12T18:07:22.766704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
3 1506
15.1%
4 1422
14.2%
5 1403
14.0%
6 1354
13.5%
7 1195
11.9%
2 621
6.2%
1 569
 
5.7%
11 441
 
4.4%
12 437
 
4.4%
10 410
 
4.1%
Other values (2) 642
6.4%
ValueCountFrequency (%)
1 569
 
5.7%
2 621
6.2%
3 1506
15.1%
4 1422
14.2%
5 1403
14.0%
6 1354
13.5%
7 1195
11.9%
8 382
 
3.8%
9 260
 
2.6%
10 410
 
4.1%
ValueCountFrequency (%)
12 437
 
4.4%
11 441
 
4.4%
10 410
 
4.1%
9 260
 
2.6%
8 382
 
3.8%
7 1195
11.9%
6 1354
13.5%
5 1403
14.0%
4 1422
14.2%
3 1506
15.1%
Distinct669
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:07:23.173075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters80000
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)1.0%

Sample

1st row08:26:00
2nd row17:40:00
3rd row08:48:00
4th row10:43:00
5th row09:37:00
ValueCountFrequency (%)
08:34:00 118
 
1.2%
08:33:00 105
 
1.0%
08:35:00 98
 
1.0%
08:32:00 84
 
0.8%
08:23:00 80
 
0.8%
08:36:00 78
 
0.8%
08:40:00 71
 
0.7%
08:37:00 69
 
0.7%
08:39:00 69
 
0.7%
08:38:00 59
 
0.6%
Other values (662) 9172
91.7%
2023-12-12T18:07:23.768595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 25231
31.5%
: 19997
25.0%
1 10189
12.7%
4 4061
 
5.1%
8 3704
 
4.6%
5 3523
 
4.4%
3 3379
 
4.2%
2 3097
 
3.9%
9 2940
 
3.7%
7 1963
 
2.5%
Other values (2) 1916
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60000
75.0%
Other Punctuation 19997
 
25.0%
Space Separator 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 25231
42.1%
1 10189
17.0%
4 4061
 
6.8%
8 3704
 
6.2%
5 3523
 
5.9%
3 3379
 
5.6%
2 3097
 
5.2%
9 2940
 
4.9%
7 1963
 
3.3%
6 1913
 
3.2%
Other Punctuation
ValueCountFrequency (%)
: 19997
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 80000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 25231
31.5%
: 19997
25.0%
1 10189
12.7%
4 4061
 
5.1%
8 3704
 
4.6%
5 3523
 
4.4%
3 3379
 
4.2%
2 3097
 
3.9%
9 2940
 
3.7%
7 1963
 
2.5%
Other values (2) 1916
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 25231
31.5%
: 19997
25.0%
1 10189
12.7%
4 4061
 
5.1%
8 3704
 
4.6%
5 3523
 
4.4%
3 3379
 
4.2%
2 3097
 
3.9%
9 2940
 
3.7%
7 1963
 
2.5%
Other values (2) 1916
 
2.4%
Distinct248
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:07:24.101641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length6
Mean length6.9737
Min length2

Characters and Unicode

Total characters69737
Distinct characters184
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)0.7%

Sample

1st row유천초등학교
2nd row봉산초등학교
3rd row둔산동학원가
4th row은새유치원
5th row백운초등학교
ValueCountFrequency (%)
둔산동학원가 1862
 
17.5%
둔산초등학교 642
 
6.0%
문정초등학교 433
 
4.1%
수밋들공원 386
 
3.6%
봉산초등학교 359
 
3.4%
순복음예광교회(선암초교 359
 
3.4%
은새유치원 337
 
3.2%
만년초등학교 268
 
2.5%
유천초등학교 216
 
2.0%
정림초등학교 214
 
2.0%
Other values (237) 5563
52.3%
2023-12-12T18:07:24.636137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7208
 
10.3%
6410
 
9.2%
5828
 
8.4%
5011
 
7.2%
4192
 
6.0%
3152
 
4.5%
2849
 
4.1%
2276
 
3.3%
2268
 
3.3%
1308
 
1.9%
Other values (174) 29235
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66183
94.9%
Open Punctuation 1307
 
1.9%
Close Punctuation 1307
 
1.9%
Space Separator 644
 
0.9%
Decimal Number 281
 
0.4%
Lowercase Letter 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7208
 
10.9%
6410
 
9.7%
5828
 
8.8%
5011
 
7.6%
4192
 
6.3%
3152
 
4.8%
2849
 
4.3%
2276
 
3.4%
2268
 
3.4%
1308
 
2.0%
Other values (161) 25681
38.8%
Decimal Number
ValueCountFrequency (%)
2 171
60.9%
1 65
 
23.1%
7 31
 
11.0%
8 5
 
1.8%
5 3
 
1.1%
9 2
 
0.7%
4 2
 
0.7%
0 1
 
0.4%
3 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 1307
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1307
100.0%
Space Separator
ValueCountFrequency (%)
644
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66183
94.9%
Common 3539
 
5.1%
Latin 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7208
 
10.9%
6410
 
9.7%
5828
 
8.8%
5011
 
7.6%
4192
 
6.3%
3152
 
4.8%
2849
 
4.3%
2276
 
3.4%
2268
 
3.4%
1308
 
2.0%
Other values (161) 25681
38.8%
Common
ValueCountFrequency (%)
( 1307
36.9%
) 1307
36.9%
644
18.2%
2 171
 
4.8%
1 65
 
1.8%
7 31
 
0.9%
8 5
 
0.1%
5 3
 
0.1%
9 2
 
0.1%
4 2
 
0.1%
Other values (2) 2
 
0.1%
Latin
ValueCountFrequency (%)
e 15
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66183
94.9%
ASCII 3554
 
5.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7208
 
10.9%
6410
 
9.7%
5828
 
8.8%
5011
 
7.6%
4192
 
6.3%
3152
 
4.8%
2849
 
4.3%
2276
 
3.4%
2268
 
3.4%
1308
 
2.0%
Other values (161) 25681
38.8%
ASCII
ValueCountFrequency (%)
( 1307
36.8%
) 1307
36.8%
644
18.1%
2 171
 
4.8%
1 65
 
1.8%
7 31
 
0.9%
e 15
 
0.4%
8 5
 
0.1%
5 3
 
0.1%
9 2
 
0.1%
Other values (3) 4
 
0.1%

단속동명
Categorical

Distinct28
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
둔산동
3708 
관저동
1186 
탄방동
823 
정림동
683 
도마동
459 
Other values (23)
3141 

Length

Max length4
Median length3
Mean length3.0262
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row도마동
2nd row갈마동
3rd row둔산동
4th row탄방동
5th row괴정동

Common Values

ValueCountFrequency (%)
둔산동 3708
37.1%
관저동 1186
 
11.9%
탄방동 823
 
8.2%
정림동 683
 
6.8%
도마동 459
 
4.6%
월평동 459
 
4.6%
갈마동 411
 
4.1%
괴정동 399
 
4.0%
만년동 358
 
3.6%
복수동 289
 
2.9%
Other values (18) 1225
 
12.2%

Length

2023-12-12T18:07:24.829592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
둔산동 3708
37.1%
관저동 1186
 
11.9%
탄방동 823
 
8.2%
정림동 683
 
6.8%
도마동 459
 
4.6%
월평동 459
 
4.6%
갈마동 411
 
4.1%
괴정동 399
 
4.0%
만년동 358
 
3.6%
복수동 289
 
2.9%
Other values (18) 1225
 
12.2%

신고구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고정형CCTV
8916 
주행형CCTV
 
779
안전신문고
 
224
버스장착형CCTV
 
73
PDA
 
8

Length

Max length9
Median length7
Mean length6.9666
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고정형CCTV
2nd row고정형CCTV
3rd row고정형CCTV
4th row고정형CCTV
5th row고정형CCTV

Common Values

ValueCountFrequency (%)
고정형CCTV 8916
89.2%
주행형CCTV 779
 
7.8%
안전신문고 224
 
2.2%
버스장착형CCTV 73
 
0.7%
PDA 8
 
0.1%

Length

2023-12-12T18:07:25.014978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:25.163771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고정형cctv 8916
89.2%
주행형cctv 779
 
7.8%
안전신문고 224
 
2.2%
버스장착형cctv 73
 
0.7%
pda 8
 
0.1%

Interactions

2023-12-12T18:07:20.800042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:07:20.452879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:07:21.117228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:07:20.638043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:07:25.593191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번차량구분위반연도위반월단속동명신고구분
순번1.0000.0650.9970.9530.2610.246
차량구분0.0651.0000.0220.0180.3430.172
위반연도0.9970.0221.0000.7350.2020.143
위반월0.9530.0180.7351.0000.2130.168
단속동명0.2610.3430.2020.2131.0000.699
신고구분0.2460.1720.1430.1680.6991.000
2023-12-12T18:07:25.719451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위반연도단속동명신고구분차량구분
위반연도1.0000.1600.1750.027
단속동명0.1601.0000.4250.172
신고구분0.1750.4251.0000.065
차량구분0.0270.1720.0651.000
2023-12-12T18:07:25.845294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번위반월차량구분위반연도단속동명신고구분
순번1.0000.0750.0270.9480.0960.104
위반월0.0751.0000.0080.5760.0780.070
차량구분0.0270.0081.0000.0270.1720.065
위반연도0.9480.5760.0271.0000.1600.175
단속동명0.0960.0780.1720.1601.0000.425
신고구분0.1040.0700.0650.1750.4251.000

Missing values

2023-12-12T18:07:21.328962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:07:21.477715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번차량구분위반연도위반월위반시간단속위치단속동명신고구분
1755517556승용2023308:26:00유천초등학교도마동고정형CCTV
1668816689승용2023317:40:00봉산초등학교갈마동고정형CCTV
218219승용2022308:48:00둔산동학원가둔산동고정형CCTV
1749517496승용2023310:43:00은새유치원탄방동고정형CCTV
73317332승용2022709:37:00백운초등학교괴정동고정형CCTV
315316승용2022319:52:00둔산동학원가둔산동고정형CCTV
1697416975승용2023314:24:00은새유치원탄방동고정형CCTV
33043305승용2022409:11:00원앙초등학교 나은유치원관저동고정형CCTV
2805828059승용2023718:50:00문정초등학교둔산동고정형CCTV
888889화물4톤이하2022308:46:00도마초등학교도마동안전신문고
순번차량구분위반연도위반월위반시간단속위치단속동명신고구분
1067010671승용20221108:37:00한밭초등학교둔산동고정형CCTV
24612462승용2022419:51:00순복음예광교회(선암초교)관저동고정형CCTV
1858718588승용2023319:24:00내동초등학교내동고정형CCTV
1495814959승용2023214:50:00수밋들공원정림동고정형CCTV
316317승용2022308:30:00목련어린이집내동고정형CCTV
2374623747승용2023519:39:00문정초등학교둔산동고정형CCTV
1626516266승용2023308:23:00신계초등학교2복수동고정형CCTV
2699026991승용2023709:35:00둔산동학원가둔산동고정형CCTV
31853186승용2022414:40:00수미초등학교괴곡동고정형CCTV
1523915240승용2023211:16:00순복음예광교회(선암초교)관저동고정형CCTV