Overview

Dataset statistics

Number of variables7
Number of observations404
Missing cells9
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.4 KiB
Average record size in memory59.3 B

Variable types

Numeric2
Categorical4
Text1

Dataset

Description울산광역시 구군별 불법주정차단속에 대한 연간 누적 정보(불법주정차 단속일, 단속 위치, 단속 건수 등)를 제공하고 있습니다.
Author울산광역시
URLhttps://www.data.go.kr/data/15091256/fileData.do

Alerts

시도 has constant value ""Constant
불법주정차 단속일 has constant value ""Constant
시군구 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
견인 건수 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 시군구 and 1 other fieldsHigh correlation
단속건수 is highly overall correlated with 견인 건수High correlation
견인 건수 is highly imbalanced (97.5%)Imbalance
단속건수 has 9 (2.2%) missing valuesMissing
연번 has unique valuesUnique
단속건수 has 13 (3.2%) zerosZeros

Reproduction

Analysis started2024-03-14 11:26:07.030498
Analysis finished2024-03-14 11:26:09.085700
Duration2.06 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct404
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202.5
Minimum1
Maximum404
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-03-14T20:26:09.299443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21.15
Q1101.75
median202.5
Q3303.25
95-th percentile383.85
Maximum404
Range403
Interquartile range (IQR)201.5

Descriptive statistics

Standard deviation116.769
Coefficient of variation (CV)0.57663705
Kurtosis-1.2
Mean202.5
Median Absolute Deviation (MAD)101
Skewness0
Sum81810
Variance13635
MonotonicityStrictly increasing
2024-03-14T20:26:09.753331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
268 1
 
0.2%
278 1
 
0.2%
277 1
 
0.2%
276 1
 
0.2%
275 1
 
0.2%
274 1
 
0.2%
273 1
 
0.2%
272 1
 
0.2%
271 1
 
0.2%
Other values (394) 394
97.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
404 1
0.2%
403 1
0.2%
402 1
0.2%
401 1
0.2%
400 1
0.2%
399 1
0.2%
398 1
0.2%
397 1
0.2%
396 1
0.2%
395 1
0.2%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
울산광역시
404 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산광역시
2nd row울산광역시
3rd row울산광역시
4th row울산광역시
5th row울산광역시

Common Values

ValueCountFrequency (%)
울산광역시 404
100.0%

Length

2024-03-14T20:26:10.168643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:26:10.337630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
울산광역시 404
100.0%

시군구
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
남구
146 
북구
87 
울주군
75 
중구
70 
동구
26 

Length

Max length3
Median length2
Mean length2.1856436
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
남구 146
36.1%
북구 87
21.5%
울주군 75
18.6%
중구 70
17.3%
동구 26
 
6.4%

Length

2024-03-14T20:26:10.522200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:26:10.725257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남구 146
36.1%
북구 87
21.5%
울주군 75
18.6%
중구 70
17.3%
동구 26
 
6.4%

불법주정차 단속일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-31
404 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-31
2nd row2023-12-31
3rd row2023-12-31
4th row2023-12-31
5th row2023-12-31

Common Values

ValueCountFrequency (%)
2023-12-31 404
100.0%

Length

2024-03-14T20:26:10.932343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:26:11.097686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-31 404
100.0%
Distinct401
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2024-03-14T20:26:12.059216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length11.331683
Min length3

Characters and Unicode

Total characters4578
Distinct characters347
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique398 ?
Unique (%)98.5%

Sample

1st row복산동 홈플러스
2nd row태화동 농협 명정지점
3rd row남외동 경남은행
4th row반구2동 농협구교지점
5th row옥교동 센트럴프라자
ValueCountFrequency (%)
59
 
6.3%
주변 44
 
4.7%
범서읍 21
 
2.3%
부근 20
 
2.1%
언양읍 14
 
1.5%
사거리 14
 
1.5%
교차로 12
 
1.3%
온산읍 10
 
1.1%
삼산로 9
 
1.0%
전하동 9
 
1.0%
Other values (534) 720
77.3%
2024-03-14T20:26:13.531114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
540
 
11.8%
166
 
3.6%
136
 
3.0%
116
 
2.5%
109
 
2.4%
105
 
2.3%
100
 
2.2%
88
 
1.9%
83
 
1.8%
82
 
1.8%
Other values (337) 3053
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3745
81.8%
Space Separator 540
 
11.8%
Decimal Number 85
 
1.9%
Uppercase Letter 80
 
1.7%
Open Punctuation 57
 
1.2%
Close Punctuation 57
 
1.2%
Dash Punctuation 12
 
0.3%
Other Punctuation 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
166
 
4.4%
136
 
3.6%
116
 
3.1%
109
 
2.9%
105
 
2.8%
100
 
2.7%
88
 
2.3%
83
 
2.2%
82
 
2.2%
74
 
2.0%
Other values (305) 2686
71.7%
Uppercase Letter
ValueCountFrequency (%)
K 14
17.5%
X 10
12.5%
T 10
12.5%
S 8
10.0%
G 8
10.0%
C 6
7.5%
B 5
 
6.2%
L 4
 
5.0%
I 3
 
3.8%
V 3
 
3.8%
Other values (6) 9
11.2%
Decimal Number
ValueCountFrequency (%)
2 26
30.6%
1 21
24.7%
5 9
 
10.6%
3 8
 
9.4%
4 6
 
7.1%
6 5
 
5.9%
7 4
 
4.7%
0 2
 
2.4%
9 2
 
2.4%
8 2
 
2.4%
Space Separator
ValueCountFrequency (%)
540
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
w 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3745
81.8%
Common 752
 
16.4%
Latin 81
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
166
 
4.4%
136
 
3.6%
116
 
3.1%
109
 
2.9%
105
 
2.8%
100
 
2.7%
88
 
2.3%
83
 
2.2%
82
 
2.2%
74
 
2.0%
Other values (305) 2686
71.7%
Latin
ValueCountFrequency (%)
K 14
17.3%
X 10
12.3%
T 10
12.3%
S 8
9.9%
G 8
9.9%
C 6
7.4%
B 5
 
6.2%
L 4
 
4.9%
I 3
 
3.7%
V 3
 
3.7%
Other values (7) 10
12.3%
Common
ValueCountFrequency (%)
540
71.8%
( 57
 
7.6%
) 57
 
7.6%
2 26
 
3.5%
1 21
 
2.8%
- 12
 
1.6%
5 9
 
1.2%
3 8
 
1.1%
4 6
 
0.8%
6 5
 
0.7%
Other values (5) 11
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3745
81.8%
ASCII 833
 
18.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
540
64.8%
( 57
 
6.8%
) 57
 
6.8%
2 26
 
3.1%
1 21
 
2.5%
K 14
 
1.7%
- 12
 
1.4%
X 10
 
1.2%
T 10
 
1.2%
5 9
 
1.1%
Other values (22) 77
 
9.2%
Hangul
ValueCountFrequency (%)
166
 
4.4%
136
 
3.6%
116
 
3.1%
109
 
2.9%
105
 
2.8%
100
 
2.7%
88
 
2.3%
83
 
2.2%
82
 
2.2%
74
 
2.0%
Other values (305) 2686
71.7%

단속건수
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct241
Distinct (%)61.0%
Missing9
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean276.51646
Minimum0
Maximum5463
Zeros13
Zeros (%)3.2%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2024-03-14T20:26:13.925540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q128.5
median91
Q3309
95-th percentile1184.2
Maximum5463
Range5463
Interquartile range (IQR)280.5

Descriptive statistics

Standard deviation525.24636
Coefficient of variation (CV)1.8995121
Kurtosis37.340147
Mean276.51646
Median Absolute Deviation (MAD)83
Skewness5.0994999
Sum109224
Variance275883.74
MonotonicityNot monotonic
2024-03-14T20:26:14.334271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 13
 
3.2%
1 9
 
2.2%
2 8
 
2.0%
11 7
 
1.7%
3 6
 
1.5%
25 5
 
1.2%
4 5
 
1.2%
16 5
 
1.2%
62 4
 
1.0%
39 4
 
1.0%
Other values (231) 329
81.4%
(Missing) 9
 
2.2%
ValueCountFrequency (%)
0 13
3.2%
1 9
2.2%
2 8
2.0%
3 6
1.5%
4 5
 
1.2%
5 1
 
0.2%
6 4
 
1.0%
7 2
 
0.5%
8 2
 
0.5%
9 1
 
0.2%
ValueCountFrequency (%)
5463 1
0.2%
4511 1
0.2%
3119 1
0.2%
2388 1
0.2%
2222 1
0.2%
2024 1
0.2%
1758 1
0.2%
1705 1
0.2%
1703 1
0.2%
1675 1
0.2%

견인 건수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
0
403 
<NA>
 
1

Length

Max length4
Median length1
Mean length1.0074257
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 403
99.8%
<NA> 1
 
0.2%

Length

2024-03-14T20:26:14.770737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:26:15.100604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 403
99.8%
na 1
 
0.2%

Interactions

2024-03-14T20:26:07.948990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:26:07.404475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:26:08.213631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:26:07.690947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:26:15.294149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군구단속건수
연번1.0000.9960.000
시군구0.9961.0000.000
단속건수0.0000.0001.000
2024-03-14T20:26:15.534049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구견인 건수
시군구1.0001.000
견인 건수1.0001.000
2024-03-14T20:26:15.767523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번단속건수시군구견인 건수
연번1.0000.1060.8991.000
단속건수0.1061.0000.0001.000
시군구0.8990.0001.0001.000
견인 건수1.0001.0001.0001.000

Missing values

2024-03-14T20:26:08.548349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:26:08.936413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도시군구불법주정차 단속일불법주정차 단속 위치단속건수견인 건수
01울산광역시중구2023-12-31복산동 홈플러스410
12울산광역시중구2023-12-31태화동 농협 명정지점30
23울산광역시중구2023-12-31남외동 경남은행1440
34울산광역시중구2023-12-31반구2동 농협구교지점110
45울산광역시중구2023-12-31옥교동 센트럴프라자620
56울산광역시중구2023-12-31우정동 우정전통시장 입구4600
67울산광역시중구2023-12-31성안동 성안농협지점1830
78울산광역시중구2023-12-31남외동 남외초교 부근370
89울산광역시중구2023-12-31중앙동 롯데시네마 주변1100
910울산광역시중구2023-12-31반구동 학성초교 건너편860
연번시도시군구불법주정차 단속일불법주정차 단속 위치단속건수견인 건수
394395울산광역시울주군2023-12-31삼남읍 삼남초등학교30
395396울산광역시울주군2023-12-31범서읍 호연초등학교730
396397울산광역시울주군2023-12-31범서읍 명지초등학교50
397398울산광역시울주군2023-12-31범서읍 무거초등학교 사거리330
398399울산광역시울주군2023-12-31온양읍 온남초등학교3120
399400울산광역시울주군2023-12-31온산읍 온산초등학교10
400401울산광역시울주군2023-12-31온산읍 덕신초등학교450
401402울산광역시울주군2023-12-31웅촌면 웅촌 초등학교440
402403울산광역시울주군2023-12-31신한중공업600
403404울산광역시울주군2023-12-31덕하역사거리4350