Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows25
Duplicate rows (%)0.2%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Categorical5
Numeric1
DateTime1
Text1

Dataset

Description강남구 주정차위반단속위치 현황은 서울특별시 내의 시군별 주정차 위반 단속 위치 현황. 시군코드, 단속일시정보, 단속장소 등의 정보를 제공합니다.(2021년 10월~2024년 3월)
Author서울특별시 강남구
URLhttps://www.data.go.kr/data/15060599/fileData.do

Alerts

시군명 has constant value ""Constant
관리기관명 has constant value ""Constant
Dataset has 25 (0.2%) duplicate rowsDuplicates
시군코드 is highly overall correlated with 집계년도High correlation
집계년도 is highly overall correlated with 시군코드High correlation

Reproduction

Analysis started2024-04-06 08:12:05.844056
Analysis finished2024-04-06 08:12:07.888875
Duration2.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2021
5760 
2022
4240 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2022
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 5760
57.6%
2022 4240
42.4%

Length

2024-04-06T17:12:08.015271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:08.190662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 5760
57.6%
2022 4240
42.4%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울특별시
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 10000
100.0%

Length

2024-04-06T17:12:08.374474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:08.560185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 10000
100.0%

시군코드
Real number (ℝ)

HIGH CORRELATION 

Distinct5761
Distinct (%)57.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3236348.7
Minimum3220000
Maximum3276780
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T17:12:08.762669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3220000
5-th percentile3220000
Q13220000
median3227318
Q33252230
95-th percentile3272127.7
Maximum3276780
Range56780
Interquartile range (IQR)32230

Descriptive statistics

Standard deviation18796.103
Coefficient of variation (CV)0.0058078116
Kurtosis-0.94452406
Mean3236348.7
Median Absolute Deviation (MAD)7318
Skewness0.72864289
Sum3.2363487 × 1010
Variance3.532935 × 108
MonotonicityNot monotonic
2024-04-06T17:12:09.139041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3220000 4240
42.4%
3263111 1
 
< 0.1%
3234606 1
 
< 0.1%
3238178 1
 
< 0.1%
3220256 1
 
< 0.1%
3267217 1
 
< 0.1%
3267187 1
 
< 0.1%
3222460 1
 
< 0.1%
3276133 1
 
< 0.1%
3252924 1
 
< 0.1%
Other values (5751) 5751
57.5%
ValueCountFrequency (%)
3220000 4240
42.4%
3220028 1
 
< 0.1%
3220036 1
 
< 0.1%
3220038 1
 
< 0.1%
3220077 1
 
< 0.1%
3220082 1
 
< 0.1%
3220086 1
 
< 0.1%
3220115 1
 
< 0.1%
3220125 1
 
< 0.1%
3220129 1
 
< 0.1%
ValueCountFrequency (%)
3276780 1
< 0.1%
3276747 1
< 0.1%
3276741 1
< 0.1%
3276726 1
< 0.1%
3276703 1
< 0.1%
3276700 1
< 0.1%
3276697 1
< 0.1%
3276691 1
< 0.1%
3276690 1
< 0.1%
3276684 1
< 0.1%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
강남구
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강남구
2nd row강남구
3rd row강남구
4th row강남구
5th row강남구

Common Values

ValueCountFrequency (%)
강남구 10000
100.0%

Length

2024-04-06T17:12:09.411862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:09.580629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강남구 10000
100.0%
Distinct9462
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-10-01 00:47:00
Maximum2022-03-22 20:32:00
2024-04-06T17:12:09.750117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:12:10.019444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

단속동
Categorical

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
역삼동
2006 
논현동
1852 
대치동
1470 
신사동
999 
삼성동
978 
Other values (9)
2695 

Length

Max length4
Median length3
Mean length3.0332
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row역삼동
2nd row신사동
3rd row신사동
4th row수서동
5th row논현동

Common Values

ValueCountFrequency (%)
역삼동 2006
20.1%
논현동 1852
18.5%
대치동 1470
14.7%
신사동 999
10.0%
삼성동 978
9.8%
청담동 896
9.0%
압구정동 332
 
3.3%
수서동 283
 
2.8%
도곡동 263
 
2.6%
개포동 257
 
2.6%
Other values (4) 664
 
6.6%

Length

2024-04-06T17:12:10.292095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
역삼동 2006
20.1%
논현동 1852
18.5%
대치동 1470
14.7%
신사동 999
10.0%
삼성동 978
9.8%
청담동 896
9.0%
압구정동 332
 
3.3%
수서동 283
 
2.8%
도곡동 263
 
2.6%
개포동 257
 
2.6%
Other values (4) 664
 
6.6%
Distinct3473
Distinct (%)34.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T17:12:10.802219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length35
Mean length11.4375
Min length1

Characters and Unicode

Total characters114375
Distinct characters301
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2444 ?
Unique (%)24.4%

Sample

1st row777-40
2nd row631
3rd row서울 강남구 신사동 501-5
4th row201-5
5th row7
ValueCountFrequency (%)
주변 4525
 
17.1%
강남구 2051
 
7.7%
서울 1108
 
4.2%
서울특별시 938
 
3.5%
테헤란로 398
 
1.5%
도곡로 364
 
1.4%
논현동 360
 
1.4%
청담동 335
 
1.3%
도산대로 302
 
1.1%
신사동 298
 
1.1%
Other values (3072) 15824
59.7%
2024-04-06T17:12:11.720421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16508
 
14.4%
1 7662
 
6.7%
5693
 
5.0%
5211
 
4.6%
2 5153
 
4.5%
4648
 
4.1%
6 3926
 
3.4%
5 3675
 
3.2%
7 3580
 
3.1%
- 3548
 
3.1%
Other values (291) 54771
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52924
46.3%
Decimal Number 39057
34.1%
Space Separator 16508
 
14.4%
Dash Punctuation 3548
 
3.1%
Open Punctuation 761
 
0.7%
Close Punctuation 761
 
0.7%
Other Punctuation 407
 
0.4%
Math Symbol 310
 
0.3%
Uppercase Letter 80
 
0.1%
Lowercase Letter 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5693
 
10.8%
5211
 
9.8%
4648
 
8.8%
3185
 
6.0%
2500
 
4.7%
2437
 
4.6%
2398
 
4.5%
2311
 
4.4%
2101
 
4.0%
2054
 
3.9%
Other values (244) 20386
38.5%
Uppercase Letter
ValueCountFrequency (%)
S 16
20.0%
K 14
17.5%
R 7
8.8%
T 7
8.8%
P 4
 
5.0%
H 4
 
5.0%
I 4
 
5.0%
G 3
 
3.8%
E 3
 
3.8%
W 3
 
3.8%
Other values (8) 15
18.8%
Decimal Number
ValueCountFrequency (%)
1 7662
19.6%
2 5153
13.2%
6 3926
10.1%
5 3675
9.4%
7 3580
9.2%
4 3504
9.0%
3 3348
8.6%
8 3043
 
7.8%
9 2656
 
6.8%
0 2510
 
6.4%
Lowercase Letter
ValueCountFrequency (%)
r 5
29.4%
a 2
 
11.8%
s 2
 
11.8%
t 2
 
11.8%
y 1
 
5.9%
n 1
 
5.9%
b 1
 
5.9%
e 1
 
5.9%
i 1
 
5.9%
o 1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 236
58.0%
/ 166
40.8%
. 5
 
1.2%
Space Separator
ValueCountFrequency (%)
16508
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3548
100.0%
Open Punctuation
ValueCountFrequency (%)
( 761
100.0%
Close Punctuation
ValueCountFrequency (%)
) 761
100.0%
Math Symbol
ValueCountFrequency (%)
~ 310
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 61352
53.6%
Hangul 52924
46.3%
Latin 99
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5693
 
10.8%
5211
 
9.8%
4648
 
8.8%
3185
 
6.0%
2500
 
4.7%
2437
 
4.6%
2398
 
4.5%
2311
 
4.4%
2101
 
4.0%
2054
 
3.9%
Other values (244) 20386
38.5%
Latin
ValueCountFrequency (%)
S 16
16.2%
K 14
14.1%
R 7
 
7.1%
T 7
 
7.1%
r 5
 
5.1%
P 4
 
4.0%
H 4
 
4.0%
I 4
 
4.0%
G 3
 
3.0%
E 3
 
3.0%
Other values (19) 32
32.3%
Common
ValueCountFrequency (%)
16508
26.9%
1 7662
12.5%
2 5153
 
8.4%
6 3926
 
6.4%
5 3675
 
6.0%
7 3580
 
5.8%
- 3548
 
5.8%
4 3504
 
5.7%
3 3348
 
5.5%
8 3043
 
5.0%
Other values (8) 7405
12.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 61449
53.7%
Hangul 52923
46.3%
Number Forms 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16508
26.9%
1 7662
12.5%
2 5153
 
8.4%
6 3926
 
6.4%
5 3675
 
6.0%
7 3580
 
5.8%
- 3548
 
5.8%
4 3504
 
5.7%
3 3348
 
5.4%
8 3043
 
5.0%
Other values (36) 7502
12.2%
Hangul
ValueCountFrequency (%)
5693
 
10.8%
5211
 
9.8%
4648
 
8.8%
3185
 
6.0%
2500
 
4.7%
2437
 
4.6%
2398
 
4.5%
2311
 
4.4%
2101
 
4.0%
2054
 
3.9%
Other values (243) 20385
38.5%
Number Forms
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

단속구분
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
CCTV
4618 
PDA
3914 
안전신문고(앱)
617 
주행형CCTV
546 
서울스마트불편신고
 
292

Length

Max length9
Median length8
Mean length4.1691
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPDA
2nd rowPDA
3rd row주행형CCTV
4th rowPDA
5th rowPDA

Common Values

ValueCountFrequency (%)
CCTV 4618
46.2%
PDA 3914
39.1%
안전신문고(앱) 617
 
6.2%
주행형CCTV 546
 
5.5%
서울스마트불편신고 292
 
2.9%
고정형CCTV 13
 
0.1%

Length

2024-04-06T17:12:12.003891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:12.222882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
cctv 4618
46.2%
pda 3914
39.1%
안전신문고(앱 617
 
6.2%
주행형cctv 546
 
5.5%
서울스마트불편신고 292
 
2.9%
고정형cctv 13
 
0.1%

Interactions

2024-04-06T17:12:07.218387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:12:12.396936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집계년도시군코드단속동단속구분
집계년도1.000NaN0.0650.040
시군코드NaN1.0000.3790.847
단속동0.0650.3791.0000.481
단속구분0.0400.8470.4811.000
2024-04-06T17:12:12.616872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단속동단속구분집계년도
단속동1.0000.2610.051
단속구분0.2611.0000.028
집계년도0.0510.0281.000
2024-04-06T17:12:12.820134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군코드집계년도단속동단속구분
시군코드1.0000.8870.1270.490
집계년도0.8871.0000.0510.028
단속동0.1270.0511.0000.261
단속구분0.4900.0280.2611.000

Missing values

2024-04-06T17:12:07.515433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:12:07.757956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년도시군명시군코드관리기관명단속일시단속동단속장소단속구분
133652021서울특별시3263111강남구2021-10-24 10:15역삼동777-40PDA
8472021서울특별시3257407강남구2021-10-02 13:24신사동631PDA
688972022서울특별시3220000강남구2022-01-20 11:00신사동서울 강남구 신사동 501-5주행형CCTV
328552021서울특별시3264455강남구2021-11-24 15:51수서동201-5PDA
504322021서울특별시3274804강남구2021-12-22 19:21논현동7PDA
507262021서울특별시3223018강남구2021-12-23 10:07역삼동테헤란로4길 46CCTV
608602022서울특별시3220000강남구2022-01-07 14:31역삼동도곡로3길 26 주변CCTV
172422021서울특별시3237686강남구2021-10-29 21:04역삼동테헤란로 152 주변CCTV
958752022서울특별시3220000강남구2022-03-16 08:52역삼동선릉로69길 20 주변CCTV
60872021서울특별시3242986강남구2021-10-12 20:28역삼동테헤란로 237 주변CCTV
집계년도시군명시군코드관리기관명단속일시단속동단속장소단속구분
829472022서울특별시3220000강남구2022-02-18 23:15역삼동테헤란로 237 주변CCTV
935472022서울특별시3220000강남구2022-03-11 14:31일원동일원로 53 주변CCTV
277932021서울특별시3233047강남구2021-11-16 16:13청담동도산대로 524 주변CCTV
858752022서울특별시3220000강남구2022-02-24 15:50수서동서울특별시 강남구 광평로 295 (수서동, 사이룩스오피스텔)PDA
50862021서울특별시3263237강남구2021-10-10 18:49역삼동722-4PDA
429062021서울특별시3255104강남구2021-12-10 11:36논현동250PDA
481802021서울특별시3253081강남구2021-12-18 20:38삼성동10-18PDA
192682021서울특별시3236866강남구2021-11-02 21:49논현동논현로 647 주변CCTV
111522021서울특별시3240546강남구2021-10-20 19:17역삼동테헤란로52길 6 주변CCTV
682472022서울특별시3220000강남구2022-01-19 09:47대치동1012-82안전신문고(앱)

Duplicate rows

Most frequently occurring

집계년도시군명시군코드관리기관명단속일시단속동단속장소단속구분# duplicates
02022서울특별시3220000강남구2022-01-03 08:20역삼동801-7안전신문고(앱)2
12022서울특별시3220000강남구2022-01-05 15:08신사동661~663-24PDA2
22022서울특별시3220000강남구2022-01-06 20:18신사동659-13~657-36PDA2
32022서울특별시3220000강남구2022-01-10 15:25대치동1003PDA2
42022서울특별시3220000강남구2022-01-12 11:34논현동서울 강남구 논현동 279주행형CCTV2
52022서울특별시3220000강남구2022-01-12 19:20논현동서울 강남구 논현동 279주행형CCTV2
62022서울특별시3220000강남구2022-01-12 19:57논현동159-11~158PDA2
72022서울특별시3220000강남구2022-01-16 13:35논현동175-1PDA2
82022서울특별시3220000강남구2022-01-16 17:19도곡동957-11PDA2
92022서울특별시3220000강남구2022-01-26 12:35청담동서울 강남구 청담동 77-102주행형CCTV2