Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows51
Duplicate rows (%)0.5%
Total size in memory468.8 KiB
Average record size in memory48.0 B

Variable types

Categorical3
DateTime1
Text1

Dataset

Description화성시 불법주정차 단속현황(단속기관, 단속일시, 단속구역, 단속구분 등)
Author경기도 화성시
URLhttps://www.data.go.kr/data/15052915/fileData.do

Alerts

단속기관 has constant value ""Constant
단속구분 has constant value ""Constant
Dataset has 51 (0.5%) duplicate rowsDuplicates
단속구역 is highly imbalanced (72.9%)Imbalance

Reproduction

Analysis started2023-12-12 22:28:01.646480
Analysis finished2023-12-12 22:28:02.119563
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

단속기관
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
화성시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row화성시
2nd row화성시
3rd row화성시
4th row화성시
5th row화성시

Common Values

ValueCountFrequency (%)
화성시 10000
100.0%

Length

2023-12-13T07:28:02.177518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:28:02.275585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화성시 10000
100.0%
Distinct9243
Distinct (%)92.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-07-01 07:20:00
Maximum2019-10-31 21:47:00
2023-12-13T07:28:02.400592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:28:02.553061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct89
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T07:28:02.887170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length17.6948
Min length14

Characters and Unicode

Total characters176948
Distinct characters62
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row경기도 화성시 반송동 86-4
2nd row경기도 화성시 기안동 924-4
3rd row경기도 화성시 구문천리 939
4th row경기도 화성시 향남읍 하길리 1471-2
5th row경기도 화성시 병점동 843-9
ValueCountFrequency (%)
경기도 10000
23.7%
화성시 10000
23.7%
병점동 2631
 
6.2%
행정리 1083
 
2.6%
평리 1041
 
2.5%
400-2 1031
 
2.4%
향남읍 1024
 
2.4%
하길리 671
 
1.6%
437-1 665
 
1.6%
남양리 626
 
1.5%
Other values (114) 13352
31.7%
2023-12-13T07:28:03.394736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33200
18.8%
10172
 
5.7%
10080
 
5.7%
10000
 
5.7%
10000
 
5.7%
10000
 
5.7%
10000
 
5.7%
- 8513
 
4.8%
4 5763
 
3.3%
1 5752
 
3.3%
Other values (52) 63468
35.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95394
53.9%
Decimal Number 39841
22.5%
Space Separator 33200
 
18.8%
Dash Punctuation 8513
 
4.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10172
10.7%
10080
10.6%
10000
10.5%
10000
10.5%
10000
10.5%
10000
10.5%
5472
 
5.7%
4570
 
4.8%
2631
 
2.8%
2631
 
2.8%
Other values (40) 19838
20.8%
Decimal Number
ValueCountFrequency (%)
4 5763
14.5%
1 5752
14.4%
2 5529
13.9%
3 4892
12.3%
6 3735
9.4%
7 3460
8.7%
8 3349
8.4%
9 2801
7.0%
0 2482
6.2%
5 2078
 
5.2%
Space Separator
ValueCountFrequency (%)
33200
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8513
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95394
53.9%
Common 81554
46.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10172
10.7%
10080
10.6%
10000
10.5%
10000
10.5%
10000
10.5%
10000
10.5%
5472
 
5.7%
4570
 
4.8%
2631
 
2.8%
2631
 
2.8%
Other values (40) 19838
20.8%
Common
ValueCountFrequency (%)
33200
40.7%
- 8513
 
10.4%
4 5763
 
7.1%
1 5752
 
7.1%
2 5529
 
6.8%
3 4892
 
6.0%
6 3735
 
4.6%
7 3460
 
4.2%
8 3349
 
4.1%
9 2801
 
3.4%
Other values (2) 4560
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95394
53.9%
ASCII 81554
46.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33200
40.7%
- 8513
 
10.4%
4 5763
 
7.1%
1 5752
 
7.1%
2 5529
 
6.8%
3 4892
 
6.0%
6 3735
 
4.6%
7 3460
 
4.2%
8 3349
 
4.1%
9 2801
 
3.4%
Other values (2) 4560
 
5.6%
Hangul
ValueCountFrequency (%)
10172
10.7%
10080
10.6%
10000
10.5%
10000
10.5%
10000
10.5%
10000
10.5%
5472
 
5.7%
4570
 
4.8%
2631
 
2.8%
2631
 
2.8%
Other values (40) 19838
20.8%

단속구역
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
주정차금지구역
9535 
어린이보호구역
 
465

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주정차금지구역
2nd row주정차금지구역
3rd row주정차금지구역
4th row주정차금지구역
5th row주정차금지구역

Common Values

ValueCountFrequency (%)
주정차금지구역 9535
95.3%
어린이보호구역 465
 
4.7%

Length

2023-12-13T07:28:03.535694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:28:03.657551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주정차금지구역 9535
95.3%
어린이보호구역 465
 
4.7%

단속구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고정형CCTV
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고정형CCTV
2nd row고정형CCTV
3rd row고정형CCTV
4th row고정형CCTV
5th row고정형CCTV

Common Values

ValueCountFrequency (%)
고정형CCTV 10000
100.0%

Length

2023-12-13T07:28:03.775784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:28:03.891694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고정형cctv 10000
100.0%

Correlations

2023-12-13T07:28:03.976273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단속장소단속구역
단속장소1.0000.727
단속구역0.7271.000

Missing values

2023-12-13T07:28:01.972813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:28:02.076200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단속기관단속일시단속장소단속구역단속구분
13438화성시2019-07-08 15:57경기도 화성시 반송동 86-4주정차금지구역고정형CCTV
13306화성시2019-10-31 17:11경기도 화성시 기안동 924-4주정차금지구역고정형CCTV
1025화성시2019-08-15 9:04경기도 화성시 구문천리 939주정차금지구역고정형CCTV
7693화성시2019-10-29 19:52경기도 화성시 향남읍 하길리 1471-2주정차금지구역고정형CCTV
11324화성시2019-09-11 16:50경기도 화성시 병점동 843-9주정차금지구역고정형CCTV
5200화성시2019-09-16 9:10경기도 화성시 동화리 597-3주정차금지구역고정형CCTV
8070화성시2019-07-03 18:12경기도 화성시 병점동 400-2주정차금지구역고정형CCTV
3345화성시2019-09-02 17:02경기도 화성시 구문천리 939주정차금지구역고정형CCTV
7764화성시2019-10-30 16:41경기도 화성시 우정읍 조암리 351-9주정차금지구역고정형CCTV
2088화성시2019-08-23 20:38경기도 화성시 향남읍 하길리 1493주정차금지구역고정형CCTV
단속기관단속일시단속장소단속구역단속구분
5208화성시2019-09-16 9:27경기도 화성시 남양읍 남양리 1267-7주정차금지구역고정형CCTV
872화성시2019-08-12 14:21경기도 화성시 평리 86-2주정차금지구역고정형CCTV
363화성시2019-07-23 15:01경기도 화성시 평리 86-2주정차금지구역고정형CCTV
6505화성시2019-10-08 14:13경기도 화성시 구장리 528-5주정차금지구역고정형CCTV
12399화성시2019-10-05 21:30경기도 화성시 반월동 864-1주정차금지구역고정형CCTV
14396화성시2019-10-22 17:31경기도 화성시 오산동 967-506주정차금지구역고정형CCTV
10520화성시2019-08-23 14:27경기도 화성시 병점동 348-4주정차금지구역고정형CCTV
5370화성시2019-09-17 9:17경기도 화성시 남양리 1276-2주정차금지구역고정형CCTV
8217화성시2019-07-06 12:33경기도 화성시 병점동 843-9주정차금지구역고정형CCTV
2081화성시2019-08-23 20:01경기도 화성시 행정리 437-1주정차금지구역고정형CCTV

Duplicate rows

Most frequently occurring

단속기관단속일시단속장소단속구역단속구분# duplicates
0화성시2019-07-03 14:56경기도 화성시 평리 86-2주정차금지구역고정형CCTV2
1화성시2019-07-07 7:12경기도 화성시 병점동 400-2주정차금지구역고정형CCTV2
2화성시2019-07-10 19:51경기도 화성시 상리 22-38주정차금지구역고정형CCTV2
3화성시2019-07-15 14:04경기도 화성시 능동 1065-3주정차금지구역고정형CCTV2
4화성시2019-07-22 19:17경기도 화성시 능동 1065-3주정차금지구역고정형CCTV2
5화성시2019-07-31 20:59경기도 화성시 병점동 843-9주정차금지구역고정형CCTV2
6화성시2019-08-03 16:37경기도 화성시 병점동 843-9주정차금지구역고정형CCTV2
7화성시2019-08-04 15:47경기도 화성시 병점동 400-2주정차금지구역고정형CCTV2
8화성시2019-08-06 14:52경기도 화성시 평리 86-2주정차금지구역고정형CCTV2
9화성시2019-08-08 10:05경기도 화성시 병점동 384-2주정차금지구역고정형CCTV2