Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows202
Duplicate rows (%)2.0%
Total size in memory546.9 KiB
Average record size in memory56.0 B

Variable types

Categorical2
Text1
DateTime3

Dataset

Description경상남도 거제시 불법주정차단속정보현황(단속일자, 단속시간, 시군구, 단속장소명, 단속구분, 기준일자)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15087238

Alerts

시군명 has constant value ""Constant
기준일자 has constant value ""Constant
Dataset has 202 (2.0%) duplicate rowsDuplicates
단속구분 is highly imbalanced (59.3%)Imbalance

Reproduction

Analysis started2023-12-11 00:52:12.341712
Analysis finished2023-12-11 00:52:13.770586
Duration1.43 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경상남도 거제시
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도 거제시
2nd row경상남도 거제시
3rd row경상남도 거제시
4th row경상남도 거제시
5th row경상남도 거제시

Common Values

ValueCountFrequency (%)
경상남도 거제시 10000
100.0%

Length

2023-12-11T09:52:13.850191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:52:13.954588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상남도 10000
50.0%
거제시 10000
50.0%
Distinct1531
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T09:52:14.263613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length12.4181
Min length6

Characters and Unicode

Total characters124181
Distinct characters406
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique966 ?
Unique (%)9.7%

Sample

1st row장평동 장평1로(인도)
2nd row고현동 펫츠아일랜드
3rd row장평동 장평3로7길
4th row고현 버스터미널CCTV
5th row장평동 피솔길
ValueCountFrequency (%)
고현동 1997
 
9.7%
cctv 1779
 
8.6%
장평동 861
 
4.2%
고현홍콩반점사거리cctv 847
 
4.1%
장평 758
 
3.7%
옥포동 750
 
3.6%
아주동 739
 
3.6%
728
 
3.5%
옥포 689
 
3.3%
아주 629
 
3.0%
Other values (1527) 10916
52.8%
2023-12-11T09:52:14.784349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 10791
 
8.7%
10693
 
8.6%
5571
 
4.5%
5414
 
4.4%
T 5401
 
4.3%
V 5273
 
4.2%
4436
 
3.6%
( 3281
 
2.6%
) 3190
 
2.6%
2895
 
2.3%
Other values (396) 67236
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 76967
62.0%
Uppercase Letter 21495
 
17.3%
Space Separator 10693
 
8.6%
Decimal Number 7608
 
6.1%
Open Punctuation 3281
 
2.6%
Close Punctuation 3190
 
2.6%
Dash Punctuation 720
 
0.6%
Lowercase Letter 190
 
0.2%
Other Punctuation 37
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5571
 
7.2%
5414
 
7.0%
4436
 
5.8%
2895
 
3.8%
2661
 
3.5%
2476
 
3.2%
2443
 
3.2%
2429
 
3.2%
2394
 
3.1%
2321
 
3.0%
Other values (369) 43927
57.1%
Decimal Number
ValueCountFrequency (%)
1 2069
27.2%
2 1268
16.7%
3 955
12.6%
9 535
 
7.0%
0 511
 
6.7%
6 470
 
6.2%
7 469
 
6.2%
4 463
 
6.1%
5 453
 
6.0%
8 415
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
C 10791
50.2%
T 5401
25.1%
V 5273
24.5%
K 10
 
< 0.1%
S 8
 
< 0.1%
G 8
 
< 0.1%
L 2
 
< 0.1%
J 1
 
< 0.1%
P 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
35
94.6%
. 1
 
2.7%
& 1
 
2.7%
Space Separator
ValueCountFrequency (%)
10693
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3281
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3190
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 720
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 190
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 76967
62.0%
Common 25529
 
20.6%
Latin 21685
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5571
 
7.2%
5414
 
7.0%
4436
 
5.8%
2895
 
3.8%
2661
 
3.5%
2476
 
3.2%
2443
 
3.2%
2429
 
3.2%
2394
 
3.1%
2321
 
3.0%
Other values (369) 43927
57.1%
Common
ValueCountFrequency (%)
10693
41.9%
( 3281
 
12.9%
) 3190
 
12.5%
1 2069
 
8.1%
2 1268
 
5.0%
3 955
 
3.7%
- 720
 
2.8%
9 535
 
2.1%
0 511
 
2.0%
6 470
 
1.8%
Other values (7) 1837
 
7.2%
Latin
ValueCountFrequency (%)
C 10791
49.8%
T 5401
24.9%
V 5273
24.3%
e 190
 
0.9%
K 10
 
< 0.1%
S 8
 
< 0.1%
G 8
 
< 0.1%
L 2
 
< 0.1%
J 1
 
< 0.1%
P 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 76967
62.0%
ASCII 47179
38.0%
None 35
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 10791
22.9%
10693
22.7%
T 5401
11.4%
V 5273
11.2%
( 3281
 
7.0%
) 3190
 
6.8%
1 2069
 
4.4%
2 1268
 
2.7%
3 955
 
2.0%
- 720
 
1.5%
Other values (16) 3538
 
7.5%
Hangul
ValueCountFrequency (%)
5571
 
7.2%
5414
 
7.0%
4436
 
5.8%
2895
 
3.8%
2661
 
3.5%
2476
 
3.2%
2443
 
3.2%
2429
 
3.2%
2394
 
3.1%
2321
 
3.0%
Other values (369) 43927
57.1%
None
ValueCountFrequency (%)
35
100.0%
Distinct365
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-01 00:00:00
Maximum2020-12-31 00:00:00
2023-12-11T09:52:14.944449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:15.136940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1140
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-11 00:01:00
Maximum2023-12-11 23:58:00
2023-12-11T09:52:15.308244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:15.497504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

단속구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
CCTV
9186 
현장단속
 
814

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCCTV
2nd row현장단속
3rd rowCCTV
4th rowCCTV
5th rowCCTV

Common Values

ValueCountFrequency (%)
CCTV 9186
91.9%
현장단속 814
 
8.1%

Length

2023-12-11T09:52:15.693722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:52:15.782129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
cctv 9186
91.9%
현장단속 814
 
8.1%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-09-05 00:00:00
Maximum2022-09-05 00:00:00
2023-12-11T09:52:15.866865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:52:15.989664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-11T09:52:13.562841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:52:13.701105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명단속장소명단속일자단속시간단속구분기준일자
29575경상남도 거제시장평동 장평1로(인도)2020-08-2615:57CCTV2022-09-05
988경상남도 거제시고현동 펫츠아일랜드2020-06-0111:03현장단속2022-09-05
34207경상남도 거제시장평동 장평3로7길2020-10-1215:27CCTV2022-09-05
8852경상남도 거제시고현 버스터미널CCTV2020-02-2718:44CCTV2022-09-05
22337경상남도 거제시장평동 피솔길2020-06-2610:31CCTV2022-09-05
16380경상남도 거제시장평 다이소맞은편CCTV2020-04-2615:22CCTV2022-09-05
34964경상남도 거제시옥포옥금당삼거리CCTV2020-10-1812:59CCTV2022-09-05
39450경상남도 거제시장평동 장평3로5길2020-11-2310:39CCTV2022-09-05
15565경상남도 거제시고현사거리CCTV2020-04-1916:59CCTV2022-09-05
2060경상남도 거제시고현동 루키버드2020-09-1715:58현장단속2022-09-05
시군명단속장소명단속일자단속시간단속구분기준일자
12085경상남도 거제시고현동 서문로2020-03-2315:22CCTV2022-09-05
3768경상남도 거제시아주동 아주1로2020-01-0614:32CCTV2022-09-05
18699경상남도 거제시아주동 아주로2020-05-2014:51CCTV2022-09-05
4258경상남도 거제시옥포 국산사거리CCTV2020-01-1115:33CCTV2022-09-05
30250경상남도 거제시옥포 e편한 사거리 CCTV2020-08-3119:20CCTV2022-09-05
37152경상남도 거제시양정동 제산로2020-11-0315:03CCTV2022-09-05
41757경상남도 거제시장평 다이소맞은편CCTV2020-12-1418:56CCTV2022-09-05
37470경상남도 거제시아주동 아주1로2길2020-11-0615:26CCTV2022-09-05
37725경상남도 거제시아주 덕산2차사거리 CCTV2020-11-0814:35CCTV2022-09-05
14263경상남도 거제시고현 신화인아파트CCTV2020-04-0909:48CCTV2022-09-05

Duplicate rows

Most frequently occurring

시군명단속장소명단속일자단속시간단속구분기준일자# duplicates
201경상남도 거제시장평동 피솔길2020-12-3114:35CCTV2022-09-055
130경상남도 거제시장평동 장평1로10길2020-03-1210:10CCTV2022-09-054
138경상남도 거제시장평동 장평1로10길2020-04-2815:14CCTV2022-09-054
163경상남도 거제시장평동 장평3로7길2020-08-1814:53CCTV2022-09-054
165경상남도 거제시장평동 장평3로7길2020-08-2110:11CCTV2022-09-054
175경상남도 거제시장평동 장평3로7길2020-10-1215:27CCTV2022-09-054
2경상남도 거제시고현동 33-36(시민신고)2020-06-0317:20CCTV2022-09-053
3경상남도 거제시고현동 41-8(시민신고)(인)2020-08-0400:03CCTV2022-09-053
19경상남도 거제시고현동 고현로9길2020-02-1410:57CCTV2022-09-053
37경상남도 거제시상동동 상동3길2020-03-1810:34CCTV2022-09-053