Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows196
Duplicate rows (%)2.0%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

DateTime1
Text2
Categorical1

Dataset

Description울산광역시 울주군의 불법주정차 단속정보에 대한 데이터로 위반일자,위반시간, 위반장소명, 견인여부 등의 항목을 제공합니다.(2021년~2023년 8월)
Author울산광역시 울주군
URLhttps://www.data.go.kr/data/15073339/fileData.do

Alerts

견인여부 has constant value ""Constant
Dataset has 196 (2.0%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 20:50:02.651498
Analysis finished2023-12-12 20:50:03.218398
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct864
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-01-01 00:00:00
Maximum2023-06-12 00:00:00
2023-12-13T05:50:03.317674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:50:03.503127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1146
Distinct (%)11.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T05:50:03.970523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.83
Min length4

Characters and Unicode

Total characters48300
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique174 ?
Unique (%)1.7%

Sample

1st row10:12
2nd row11:57
3rd row15:48
4th row16:17
5th row10:19
ValueCountFrequency (%)
14:41 59
 
0.6%
14:35 54
 
0.5%
8:14 53
 
0.5%
14:37 52
 
0.5%
14:46 51
 
0.5%
14:30 51
 
0.5%
14:42 50
 
0.5%
14:36 49
 
0.5%
10:19 48
 
0.5%
14:40 47
 
0.5%
Other values (1136) 9486
94.9%
2023-12-13T05:50:04.545422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 11188
23.2%
: 10000
20.7%
4 4473
 
9.3%
0 4160
 
8.6%
2 3687
 
7.6%
5 3362
 
7.0%
3 3159
 
6.5%
8 2613
 
5.4%
9 2026
 
4.2%
7 1881
 
3.9%
Other values (2) 1751
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38299
79.3%
Other Punctuation 10001
 
20.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 11188
29.2%
4 4473
 
11.7%
0 4160
 
10.9%
2 3687
 
9.6%
5 3362
 
8.8%
3 3159
 
8.2%
8 2613
 
6.8%
9 2026
 
5.3%
7 1881
 
4.9%
6 1750
 
4.6%
Other Punctuation
ValueCountFrequency (%)
: 10000
> 99.9%
; 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 48300
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 11188
23.2%
: 10000
20.7%
4 4473
 
9.3%
0 4160
 
8.6%
2 3687
 
7.6%
5 3362
 
7.0%
3 3159
 
6.5%
8 2613
 
5.4%
9 2026
 
4.2%
7 1881
 
3.9%
Other values (2) 1751
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48300
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 11188
23.2%
: 10000
20.7%
4 4473
 
9.3%
0 4160
 
8.6%
2 3687
 
7.6%
5 3362
 
7.0%
3 3159
 
6.5%
8 2613
 
5.4%
9 2026
 
4.2%
7 1881
 
3.9%
Other values (2) 1751
 
3.6%
Distinct1715
Distinct (%)17.2%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T05:50:04.845482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length31
Mean length10.539754
Min length2

Characters and Unicode

Total characters105387
Distinct characters224
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1183 ?
Unique (%)11.8%

Sample

1st row온양서희스타힐스입구
2nd row울산역시티투어
3rd row온산읍덕남로
4th row천상도서관
5th row울주군 범서읍
ValueCountFrequency (%)
부근 879
 
4.1%
866
 
4.1%
울산역승하차장 853
 
4.0%
언양 709
 
3.3%
온산읍 600
 
2.8%
언양읍 572
 
2.7%
범서읍 557
 
2.6%
불법 512
 
2.4%
주정차 498
 
2.3%
파리바게트 490
 
2.3%
Other values (1597) 14713
69.2%
2023-12-13T05:50:05.330300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11254
 
10.7%
4243
 
4.0%
3001
 
2.8%
2996
 
2.8%
1 2760
 
2.6%
2496
 
2.4%
2493
 
2.4%
2118
 
2.0%
1978
 
1.9%
1975
 
1.9%
Other values (214) 70073
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 79388
75.3%
Decimal Number 11974
 
11.4%
Space Separator 11254
 
10.7%
Dash Punctuation 1944
 
1.8%
Uppercase Letter 614
 
0.6%
Other Punctuation 181
 
0.2%
Lowercase Letter 24
 
< 0.1%
Connector Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4243
 
5.3%
3001
 
3.8%
2996
 
3.8%
2496
 
3.1%
2493
 
3.1%
2118
 
2.7%
1978
 
2.5%
1975
 
2.5%
1955
 
2.5%
1735
 
2.2%
Other values (188) 54398
68.5%
Decimal Number
ValueCountFrequency (%)
1 2760
23.0%
2 1649
13.8%
3 1367
11.4%
6 1047
 
8.7%
5 1039
 
8.7%
0 951
 
7.9%
4 886
 
7.4%
7 830
 
6.9%
9 766
 
6.4%
8 679
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
O 125
20.4%
S 125
20.4%
I 125
20.4%
L 125
20.4%
G 98
16.0%
D 8
 
1.3%
N 8
 
1.3%
Other Punctuation
ValueCountFrequency (%)
# 175
96.7%
, 4
 
2.2%
. 2
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
e 8
33.3%
a 8
33.3%
m 8
33.3%
Space Separator
ValueCountFrequency (%)
11254
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1944
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 79388
75.3%
Common 25361
 
24.1%
Latin 638
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4243
 
5.3%
3001
 
3.8%
2996
 
3.8%
2496
 
3.1%
2493
 
3.1%
2118
 
2.7%
1978
 
2.5%
1975
 
2.5%
1955
 
2.5%
1735
 
2.2%
Other values (188) 54398
68.5%
Common
ValueCountFrequency (%)
11254
44.4%
1 2760
 
10.9%
- 1944
 
7.7%
2 1649
 
6.5%
3 1367
 
5.4%
6 1047
 
4.1%
5 1039
 
4.1%
0 951
 
3.7%
4 886
 
3.5%
7 830
 
3.3%
Other values (6) 1634
 
6.4%
Latin
ValueCountFrequency (%)
O 125
19.6%
S 125
19.6%
I 125
19.6%
L 125
19.6%
G 98
15.4%
e 8
 
1.3%
D 8
 
1.3%
N 8
 
1.3%
a 8
 
1.3%
m 8
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 79388
75.3%
ASCII 25999
 
24.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11254
43.3%
1 2760
 
10.6%
- 1944
 
7.5%
2 1649
 
6.3%
3 1367
 
5.3%
6 1047
 
4.0%
5 1039
 
4.0%
0 951
 
3.7%
4 886
 
3.4%
7 830
 
3.2%
Other values (16) 2272
 
8.7%
Hangul
ValueCountFrequency (%)
4243
 
5.3%
3001
 
3.8%
2996
 
3.8%
2496
 
3.1%
2493
 
3.1%
2118
 
2.7%
1978
 
2.5%
1975
 
2.5%
1955
 
2.5%
1735
 
2.2%
Other values (188) 54398
68.5%

견인여부
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
X
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowX
2nd rowX
3rd rowX
4th rowX
5th rowX

Common Values

ValueCountFrequency (%)
X 10000
100.0%

Length

2023-12-13T05:50:05.461478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:50:05.575708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
x 10000
100.0%

Missing values

2023-12-13T05:50:03.053486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:50:03.162746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위반일자위반시간위반장소명견인여부
448562022-04-0610:12온양서희스타힐스입구X
96912021-04-2511:57울산역시티투어X
175462021-07-2215:48온산읍덕남로X
499752022-05-1116:17천상도서관X
314512021-11-2410:19울주군 범서읍X
315712021-11-2518:36구영 우미린1차 사거리X
600192022-07-2214:34언양읍 파리바게트X
320342021-12-0319:26구영리 푸르지오X
480202022-04-2714:46삼남읍 서향교1길 67-12X
919202023-04-1718:45온산읍덕신리1321-3횡단보도 불법주정차X
위반일자위반시간위반장소명견인여부
538292022-06-0716:30구영초등학교X
866642023-03-1118:08울산역승하차장X
274922021-10-1910:49범서읍 울밀로 2879-8 부X
856782023-03-039:58성동초등학교X
683862022-09-3010:01온산읍 학남리 199 부근X
5072021-01-0818:06범서2차현대아파트X
389032022-02-2317:29울산역승하차장X
812722023-01-2722:58울산역승하차장X
204512021-08-1519:08울산역승하차장X
170262021-07-187:18청량읍청량읍 덕하리 406-12교차로 모퉁이 불법 주정차X

Duplicate rows

Most frequently occurring

위반일자위반시간위반장소명견인여부# duplicates
1222022-09-2114:41온산읍 학남리 199 부근X7
1042022-06-3014:41울주군 온산읍X5
1382022-11-0214:30온산읍 처용리 125 부근X5
1722023-04-0610:33온산읍 학남리 199 부근X5
612021-12-1415:06온산읍 우봉강양로 35 부X4
862022-04-0714:56울주군 온산읍X4
1262022-09-2810:13온산읍 학남리 199 부근X4
1422022-11-1414:34온산읍 학남리 199 부근X4
1922023-06-0910:03온산읍 학남리 199 부근X4
1942023-06-0910:09온산읍 방도리 520-2 부X4