Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows103
Duplicate rows (%)1.0%
Total size in memory712.9 KiB
Average record size in memory73.0 B

Variable types

DateTime2
Numeric1
Categorical4
Text1

Dataset

Description노원구에서 단속된 주정차 위반건에 대한 데이터로 위반일시, 동명칭, 상세위치, 관련법, 금액 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15083828/fileData.do

Alerts

관리부서 has constant value ""Constant
기준일자 has constant value ""Constant
Dataset has 103 (1.0%) duplicate rowsDuplicates
견인지시 is highly imbalanced (99.9%)Imbalance

Reproduction

Analysis started2023-12-11 23:52:40.242912
Analysis finished2023-12-11 23:52:41.162025
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9585
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-08-01 07:01:00
Maximum2023-06-30 22:10:00
2023-12-12T08:52:41.210945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:52:41.308308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

단속원금
Real number (ℝ)

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44814
Minimum10000
Maximum130000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:52:41.393042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10000
5-th percentile40000
Q140000
median40000
Q340000
95-th percentile120000
Maximum130000
Range120000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation18160.765
Coefficient of variation (CV)0.40524758
Kurtosis13.15548
Mean44814
Median Absolute Deviation (MAD)0
Skewness3.8386583
Sum4.4814 × 108
Variance3.2981339 × 108
MonotonicityNot monotonic
2023-12-12T08:52:41.471719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
40000 8894
88.9%
50000 525
 
5.2%
120000 497
 
5.0%
130000 31
 
0.3%
20000 22
 
0.2%
80000 19
 
0.2%
60000 7
 
0.1%
10000 3
 
< 0.1%
25000 2
 
< 0.1%
ValueCountFrequency (%)
10000 3
 
< 0.1%
20000 22
 
0.2%
25000 2
 
< 0.1%
40000 8894
88.9%
50000 525
 
5.2%
60000 7
 
0.1%
80000 19
 
0.2%
120000 497
 
5.0%
130000 31
 
0.3%
ValueCountFrequency (%)
130000 31
 
0.3%
120000 497
 
5.0%
80000 19
 
0.2%
60000 7
 
0.1%
50000 525
 
5.2%
40000 8894
88.9%
25000 2
 
< 0.1%
20000 22
 
0.2%
10000 3
 
< 0.1%

단속동
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
상계동
4871 
중계동
2171 
공릉동
1599 
하계동
682 
월계동
670 
Other values (2)
 
7

Length

Max length4
Median length3
Mean length3.0007
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row상계동
2nd row중계동
3rd row상계동
4th row상계동
5th row중계동

Common Values

ValueCountFrequency (%)
상계동 4871
48.7%
중계동 2171
21.7%
공릉동 1599
 
16.0%
하계동 682
 
6.8%
월계동 670
 
6.7%
월계동 6
 
0.1%
공릉1동 1
 
< 0.1%

Length

2023-12-12T08:52:41.565377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:52:41.656997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상계동 4871
48.7%
중계동 2171
21.7%
공릉동 1599
 
16.0%
하계동 682
 
6.8%
월계동 676
 
6.8%
공릉1동 1
 
< 0.1%
Distinct2511
Distinct (%)25.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T08:52:41.812130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length24
Mean length19.7558
Min length2

Characters and Unicode

Total characters197558
Distinct characters354
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1498 ?
Unique (%)15.0%

Sample

1st row서울특별시 노원구 동일로227길 79
2nd row서울특별시 노원구 중계로14나길 25
3rd row주정차126, 온수골 사거리
4th row서울 노원구 상계동 72-172 흥안운수주변
5th row주정차009.동일로207길17 중계그린아파트앞
ValueCountFrequency (%)
노원구 4945
 
14.6%
서울 3658
 
10.8%
상계동 2090
 
6.2%
서울특별시 1287
 
3.8%
부근 1071
 
3.2%
공릉동 889
 
2.6%
중계동 818
 
2.4%
주변 444
 
1.3%
하계동 437
 
1.3%
월계동 348
 
1.0%
Other values (2331) 17980
52.9%
2023-12-12T08:52:42.114234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24080
 
12.2%
1 9531
 
4.8%
7010
 
3.5%
2 6888
 
3.5%
6492
 
3.3%
6327
 
3.2%
6288
 
3.2%
0 6049
 
3.1%
5740
 
2.9%
5481
 
2.8%
Other values (344) 113672
57.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 111834
56.6%
Decimal Number 49977
25.3%
Space Separator 24080
 
12.2%
Other Punctuation 4219
 
2.1%
Dash Punctuation 3820
 
1.9%
Open Punctuation 1739
 
0.9%
Close Punctuation 1600
 
0.8%
Uppercase Letter 280
 
0.1%
Lowercase Letter 7
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7010
 
6.3%
6492
 
5.8%
6327
 
5.7%
6288
 
5.6%
5740
 
5.1%
5481
 
4.9%
5194
 
4.6%
5034
 
4.5%
5019
 
4.5%
4156
 
3.7%
Other values (312) 55093
49.3%
Decimal Number
ValueCountFrequency (%)
1 9531
19.1%
2 6888
13.8%
0 6049
12.1%
3 4906
9.8%
5 4830
9.7%
4 4315
8.6%
7 3761
 
7.5%
6 3731
 
7.5%
9 3035
 
6.1%
8 2931
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
D 68
24.3%
L 38
13.6%
P 38
13.6%
G 38
13.6%
K 28
10.0%
T 28
10.0%
C 20
 
7.1%
S 19
 
6.8%
B 2
 
0.7%
A 1
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
s 2
28.6%
k 2
28.6%
c 1
14.3%
a 1
14.3%
b 1
14.3%
Other Punctuation
ValueCountFrequency (%)
. 3382
80.2%
, 837
 
19.8%
Space Separator
ValueCountFrequency (%)
24080
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3820
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1739
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1600
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 111834
56.6%
Common 85437
43.2%
Latin 287
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7010
 
6.3%
6492
 
5.8%
6327
 
5.7%
6288
 
5.6%
5740
 
5.1%
5481
 
4.9%
5194
 
4.6%
5034
 
4.5%
5019
 
4.5%
4156
 
3.7%
Other values (312) 55093
49.3%
Common
ValueCountFrequency (%)
24080
28.2%
1 9531
 
11.2%
2 6888
 
8.1%
0 6049
 
7.1%
3 4906
 
5.7%
5 4830
 
5.7%
4 4315
 
5.1%
- 3820
 
4.5%
7 3761
 
4.4%
6 3731
 
4.4%
Other values (7) 13526
15.8%
Latin
ValueCountFrequency (%)
D 68
23.7%
L 38
13.2%
P 38
13.2%
G 38
13.2%
K 28
9.8%
T 28
9.8%
C 20
 
7.0%
S 19
 
6.6%
s 2
 
0.7%
B 2
 
0.7%
Other values (5) 6
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 111833
56.6%
ASCII 85724
43.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24080
28.1%
1 9531
 
11.1%
2 6888
 
8.0%
0 6049
 
7.1%
3 4906
 
5.7%
5 4830
 
5.6%
4 4315
 
5.0%
- 3820
 
4.5%
7 3761
 
4.4%
6 3731
 
4.4%
Other values (22) 13813
16.1%
Hangul
ValueCountFrequency (%)
7010
 
6.3%
6492
 
5.8%
6327
 
5.7%
6288
 
5.6%
5740
 
5.1%
5481
 
4.9%
5194
 
4.6%
5034
 
4.5%
5019
 
4.5%
4156
 
3.7%
Other values (311) 55092
49.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

위반내용
Categorical

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
주차금지(황색점선)구역
3737 
주정차금지(황색실선)구역
2543 
교통소통장애
919 
보도
789 
횡단보도
762 
Other values (10)
1250 

Length

Max length13
Median length12
Mean length9.4544
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row횡단보도
2nd row주정차금지(황색실선)구역
3rd row주정차금지(황색실선)구역
4th row주차금지(황색점선)구역
5th row주차금지(황색점선)구역

Common Values

ValueCountFrequency (%)
주차금지(황색점선)구역 3737
37.4%
주정차금지(황색실선)구역 2543
25.4%
교통소통장애 919
 
9.2%
보도 789
 
7.9%
횡단보도 762
 
7.6%
도로 모퉁이 422
 
4.2%
버스정류소 262
 
2.6%
주차방법위반 260
 
2.6%
소화전 145
 
1.5%
안전지대 129
 
1.3%
Other values (5) 32
 
0.3%

Length

2023-12-12T08:52:42.231024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주차금지(황색점선)구역 3737
35.9%
주정차금지(황색실선)구역 2543
24.4%
교통소통장애 919
 
8.8%
보도 789
 
7.6%
횡단보도 762
 
7.3%
도로 422
 
4.0%
모퉁이 422
 
4.0%
버스정류소 262
 
2.5%
주차방법위반 260
 
2.5%
소화전 145
 
1.4%
Other values (7) 162
 
1.6%

견인지시
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미견인
9999 
견인처리
 
1

Length

Max length4
Median length3
Mean length3.0001
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row미견인
2nd row미견인
3rd row미견인
4th row미견인
5th row미견인

Common Values

ValueCountFrequency (%)
미견인 9999
> 99.9%
견인처리 1
 
< 0.1%

Length

2023-12-12T08:52:42.330229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:52:42.417426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미견인 9999
> 99.9%
견인처리 1
 
< 0.1%

관리부서
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
교통지도과
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교통지도과
2nd row교통지도과
3rd row교통지도과
4th row교통지도과
5th row교통지도과

Common Values

ValueCountFrequency (%)
교통지도과 10000
100.0%

Length

2023-12-12T08:52:42.501831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:52:42.573071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교통지도과 10000
100.0%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-07-26 00:00:00
Maximum2023-07-26 00:00:00
2023-12-12T08:52:42.635196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:52:42.708598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T08:52:40.916717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:52:42.770514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단속원금단속동위반내용견인지시
단속원금1.0000.0730.3970.000
단속동0.0731.0000.2630.000
위반내용0.3970.2631.0000.000
견인지시0.0000.0000.0001.000
2023-12-12T08:52:42.845013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단속동견인지시위반내용
단속동1.0000.0000.122
견인지시0.0001.0000.000
위반내용0.1220.0001.000
2023-12-12T08:52:42.916397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단속원금단속동위반내용견인지시
단속원금1.0000.0250.1930.000
단속동0.0251.0000.1220.000
위반내용0.1930.1221.0000.000
견인지시0.0000.0000.0001.000

Missing values

2023-12-12T08:52:41.011904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:52:41.114754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단속일시단속원금단속동단속장소위반내용견인지시관리부서기준일자
150132022-10-30 09:5140000상계동서울특별시 노원구 동일로227길 79횡단보도미견인교통지도과2023-07-26
517652023-05-01 16:3040000중계동서울특별시 노원구 중계로14나길 25주정차금지(황색실선)구역미견인교통지도과2023-07-26
472792023-04-14 14:1540000상계동주정차126, 온수골 사거리주정차금지(황색실선)구역미견인교통지도과2023-07-26
442182023-03-31 17:0640000상계동서울 노원구 상계동 72-172 흥안운수주변주차금지(황색점선)구역미견인교통지도과2023-07-26
70972022-09-14 16:4140000중계동주정차009.동일로207길17 중계그린아파트앞주차금지(황색점선)구역미견인교통지도과2023-07-26
260332023-01-03 08:4240000공릉동주정차 120, 노원공릉공공행복주택앞주정차금지(황색실선)구역미견인교통지도과2023-07-26
292402023-01-20 18:2940000공릉동서울 노원구 공릉동 385-3 신도브래뉴주변교통소통장애미견인교통지도과2023-07-26
233252022-12-15 21:5940000월계동서울 노원구 월계동 56-5교통소통장애미견인교통지도과2023-07-26
78822022-09-17 23:2040000상계동서울특별시 노원구 노해로 507 (상계동, 와우쇼핑몰)주차금지(황색점선)구역미견인교통지도과2023-07-26
112032022-10-06 15:5040000상계동주정차115.동일로242길 21주차금지(황색점선)구역미견인교통지도과2023-07-26
단속일시단속원금단속동단속장소위반내용견인지시관리부서기준일자
86362022-09-21 19:03120000중계동주정차053.섬밭로258 중평초등학교주정차금지(황색실선)구역미견인교통지도과2023-07-26
491542023-04-21 16:3440000공릉동주정차005.동일로1114 수협사거리주변주차금지(황색점선)구역미견인교통지도과2023-07-26
250482022-12-27 17:3950000공릉동서울 노원구 공릉동 707-2주차금지(황색점선)구역미견인교통지도과2023-07-26
630802023-06-23 11:4840000중계동서울 노원구 중계동 423-57주정차금지(황색실선)구역미견인교통지도과2023-07-26
397812023-03-14 10:3040000상계동서울 노원구 상계동 176-24 양우아파트주변주정차금지(황색실선)구역미견인교통지도과2023-07-26
627752023-06-22 14:0040000중계동서울 노원구 중계동 511-1주정차금지(황색실선)구역미견인교통지도과2023-07-26
591912023-06-06 16:2640000공릉동공릉동 385-3 부근보도미견인교통지도과2023-07-26
378672023-03-06 17:1940000중계동서울 노원구 중계동 359-9주정차금지(황색실선)구역미견인교통지도과2023-07-26
388152023-03-10 08:03120000상계동서울 노원구 상계동 744주차방법위반미견인교통지도과2023-07-26
263212023-01-04 18:52120000월계동주정차131, 우남아파트주정차금지(황색실선)구역미견인교통지도과2023-07-26

Duplicate rows

Most frequently occurring

단속일시단속원금단속동단속장소위반내용견인지시관리부서기준일자# duplicates
362022-12-06 14:5540000상계동서울 노원구 상계동 739-4보도미견인교통지도과2023-07-263
402022-12-08 18:1040000중계동서울 노원구 중계동 359-16 중계주공5단지주변주정차금지(황색실선)구역미견인교통지도과2023-07-263
02022-08-14 20:4740000상계동서울 노원구 상계동 358-27주정차금지(황색실선)구역미견인교통지도과2023-07-262
12022-08-16 09:2740000중계동서울특별시 노원구 한글비석로15길 125주차금지(황색점선)구역미견인교통지도과2023-07-262
22022-08-17 10:2040000상계동서울 노원구 상계동 173-4주차금지(황색점선)구역미견인교통지도과2023-07-262
32022-08-21 17:5040000상계동서울특별시 노원구 동일로245가길 41 은빛아파트주변주정차금지(황색실선)구역미견인교통지도과2023-07-262
42022-08-27 17:0140000상계동서울특별시 노원구 상계로5길 12주정차금지(황색실선)구역미견인교통지도과2023-07-262
52022-08-30 08:5640000상계동서울 노원구 상계동 169-26보도미견인교통지도과2023-07-262
62022-09-05 18:5340000중계동서울 노원구 중계동 588-5주차금지(황색점선)구역미견인교통지도과2023-07-262
72022-09-06 18:1640000상계동서울 노원구 상계동 37-16주차금지(황색점선)구역미견인교통지도과2023-07-262