Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows162
Duplicate rows (%)1.6%
Total size in memory546.9 KiB
Average record size in memory56.0 B

Variable types

DateTime3
Categorical2
Text1

Dataset

Description경상남도 거제시 불법주정차단속정보현황(단속일자, 단속시간, 시군구, 단속장소명, 단속구분, 기준일자)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15087238

Alerts

시군구 구분 has constant value ""Constant
기준일자 has constant value ""Constant
Dataset has 162 (1.6%) duplicate rowsDuplicates
단속구분(단속장비) is highly imbalanced (60.6%)Imbalance

Reproduction

Analysis started2023-12-11 00:52:25.375059
Analysis finished2023-12-11 00:52:26.004717
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct365
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-01 00:00:00
Maximum2020-12-31 00:00:00
2023-12-11T09:52:26.071559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:26.213050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1131
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-11 00:01:00
Maximum2023-12-11 23:56:00
2023-12-11T09:52:26.370068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:26.488498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

시군구 구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
거제시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거제시
2nd row거제시
3rd row거제시
4th row거제시
5th row거제시

Common Values

ValueCountFrequency (%)
거제시 10000
100.0%

Length

2023-12-11T09:52:26.596465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:52:26.677311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거제시 10000
100.0%
Distinct1533
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T09:52:26.912779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length12.4287
Min length6

Characters and Unicode

Total characters124287
Distinct characters408
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique971 ?
Unique (%)9.7%

Sample

1st row고현동엠파크앞CCTV
2nd row옥포동 옥포대첩로5길
3rd row옥포 공영주차장 앞 CCTV
4th row수월사거리CCTV
5th row장평동 장평4로
ValueCountFrequency (%)
고현동 1975
 
9.5%
cctv 1755
 
8.4%
장평동 872
 
4.2%
고현홍콩반점사거리cctv 847
 
4.1%
장평 818
 
3.9%
802
 
3.9%
옥포 739
 
3.6%
아주동 733
 
3.5%
옥포동 717
 
3.5%
아주 605
 
2.9%
Other values (1527) 10910
52.5%
2023-12-11T09:52:27.367526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10773
 
8.7%
C 10760
 
8.7%
5529
 
4.4%
T 5389
 
4.3%
5381
 
4.3%
V 5287
 
4.3%
4385
 
3.5%
( 3338
 
2.7%
) 3262
 
2.6%
2988
 
2.4%
Other values (398) 67195
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 76807
61.8%
Uppercase Letter 21469
 
17.3%
Space Separator 10773
 
8.7%
Decimal Number 7682
 
6.2%
Open Punctuation 3338
 
2.7%
Close Punctuation 3262
 
2.6%
Dash Punctuation 750
 
0.6%
Lowercase Letter 167
 
0.1%
Other Punctuation 38
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5529
 
7.2%
5381
 
7.0%
4385
 
5.7%
2988
 
3.9%
2673
 
3.5%
2450
 
3.2%
2428
 
3.2%
2408
 
3.1%
2403
 
3.1%
2362
 
3.1%
Other values (369) 43800
57.0%
Uppercase Letter
ValueCountFrequency (%)
C 10760
50.1%
T 5389
25.1%
V 5287
24.6%
S 10
 
< 0.1%
K 9
 
< 0.1%
G 9
 
< 0.1%
M 1
 
< 0.1%
U 1
 
< 0.1%
L 1
 
< 0.1%
J 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 2153
28.0%
2 1297
16.9%
3 963
12.5%
9 540
 
7.0%
0 538
 
7.0%
6 478
 
6.2%
4 461
 
6.0%
7 430
 
5.6%
5 429
 
5.6%
8 393
 
5.1%
Other Punctuation
ValueCountFrequency (%)
37
97.4%
. 1
 
2.6%
Space Separator
ValueCountFrequency (%)
10773
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3338
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3262
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 750
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 167
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 76807
61.8%
Common 25844
 
20.8%
Latin 21636
 
17.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5529
 
7.2%
5381
 
7.0%
4385
 
5.7%
2988
 
3.9%
2673
 
3.5%
2450
 
3.2%
2428
 
3.2%
2408
 
3.1%
2403
 
3.1%
2362
 
3.1%
Other values (369) 43800
57.0%
Common
ValueCountFrequency (%)
10773
41.7%
( 3338
 
12.9%
) 3262
 
12.6%
1 2153
 
8.3%
2 1297
 
5.0%
3 963
 
3.7%
- 750
 
2.9%
9 540
 
2.1%
0 538
 
2.1%
6 478
 
1.8%
Other values (7) 1752
 
6.8%
Latin
ValueCountFrequency (%)
C 10760
49.7%
T 5389
24.9%
V 5287
24.4%
e 167
 
0.8%
S 10
 
< 0.1%
K 9
 
< 0.1%
G 9
 
< 0.1%
M 1
 
< 0.1%
U 1
 
< 0.1%
L 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 76807
61.8%
ASCII 47443
38.2%
None 37
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10773
22.7%
C 10760
22.7%
T 5389
11.4%
V 5287
11.1%
( 3338
 
7.0%
) 3262
 
6.9%
1 2153
 
4.5%
2 1297
 
2.7%
3 963
 
2.0%
- 750
 
1.6%
Other values (18) 3471
 
7.3%
Hangul
ValueCountFrequency (%)
5529
 
7.2%
5381
 
7.0%
4385
 
5.7%
2988
 
3.9%
2673
 
3.5%
2450
 
3.2%
2428
 
3.2%
2408
 
3.1%
2403
 
3.1%
2362
 
3.1%
Other values (369) 43800
57.0%
None
ValueCountFrequency (%)
37
100.0%

단속구분(단속장비)
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
CCTV
9222 
현장단속
 
778

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCCTV
2nd rowCCTV
3rd rowCCTV
4th rowCCTV
5th rowCCTV

Common Values

ValueCountFrequency (%)
CCTV 9222
92.2%
현장단속 778
 
7.8%

Length

2023-12-11T09:52:27.493238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:52:27.585758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
cctv 9222
92.2%
현장단속 778
 
7.8%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-09-06 00:00:00
Maximum2021-09-06 00:00:00
2023-12-11T09:52:27.671664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:27.759101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-11T09:52:25.824275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:52:25.953932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단속일자단속시간시군구 구분단속장소명단속구분(단속장비)기준일자
346752020-10-1522:22거제시고현동엠파크앞CCTVCCTV2021-09-06
126732020-03-2715:04거제시옥포동 옥포대첩로5길CCTV2021-09-06
349172020-10-1714:32거제시옥포 공영주차장 앞 CCTVCCTV2021-09-06
232512020-07-0318:09거제시수월사거리CCTVCCTV2021-09-06
343172020-10-1315:14거제시장평동 장평4로CCTV2021-09-06
125412020-03-2613:30거제시아주 카페베네CCTVCCTV2021-09-06
102662020-03-0914:59거제시장평 다이소맞은편CCTVCCTV2021-09-06
382362020-11-1213:23거제시고현 신화인아파트CCTVCCTV2021-09-06
398042020-11-269:43거제시고현동 거제중앙로31길CCTV2021-09-06
316512020-09-1515:53거제시수월동 수양로(복선)CCTV2021-09-06
단속일자단속시간시군구 구분단속장소명단속구분(단속장비)기준일자
325012020-09-2215:49거제시아주동 아주1로(횡단)CCTV2021-09-06
28762020-11-2314:48거제시고현동 부산복집(곡)현장단속2021-09-06
263872020-07-2717:52거제시고현중앙공영주차장뒤CCTCCTV2021-09-06
54412020-01-2815:43거제시아주동 아주1로2길CCTV2021-09-06
98152020-03-0614:47거제시고현동 중곡로(버스)CCTV2021-09-06
419302020-12-1519:27거제시아주 덕산2차사거리 CCTVCCTV2021-09-06
305462020-09-0410:15거제시아주동 아주1로4길CCTV2021-09-06
352412020-10-2011:55거제시고현버스터미널앞사거리CCTV2021-09-06
332432020-09-2819:39거제시아주 덕산2차사거리 CCTVCCTV2021-09-06
213602020-06-1918:29거제시아주 덕산2차사거리 CCTVCCTV2021-09-06

Duplicate rows

Most frequently occurring

단속일자단속시간시군구 구분단속장소명단속구분(단속장비)기준일자# duplicates
532020-04-109:44거제시장평동 장평1로10길CCTV2021-09-064
672020-06-0815:07거제시장평동 장평1로10길CCTV2021-09-064
962020-09-1014:58거제시장평동 장평3로7길CCTV2021-09-064
102020-01-1715:09거제시장평동 피솔길CCTV2021-09-063
112020-01-2210:52거제시장평동 피솔길CCTV2021-09-063
162020-01-2915:04거제시연초면 연하해안로CCTV2021-09-063
312020-02-2716:21거제시아주동 용소2길CCTV2021-09-063
452020-03-3115:01거제시아주동 아주1로4길CCTV2021-09-063
492020-04-0810:52거제시장평동 피솔길CCTV2021-09-063
592020-05-1521:11거제시고현동 고현로11길CCTV2021-09-063