Overview

Dataset statistics

Number of variables: 4
Number of observations: 2980
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 10
Duplicate rows (%): 0.3%
Total size in memory: 93.3 KiB
Average record size in memory: 32.0 B
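
The percentages in this report are simple ratios over the 2980 observations, so they can be cross-checked from the raw counts. The sketch below recomputes a few of them; the helper name `pct` and the `checks` dictionary are illustrative and not part of the report or of ydata-profiling.

```python
# Cross-check a few reported percentages against the reported counts.
N_ROWS = 2980  # "Number of observations" from the overview


def pct(count: int, total: int = N_ROWS) -> float:
    """Share of `total` as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


# label -> (count reported, percentage reported)
checks = {
    "duplicate rows": (10, 0.3),
    "민원구분 most common value": (2757, 92.5),
    "민원구분 least common value": (223, 7.5),
    "업소명 distinct": (2706, 90.8),
    "업소명 unique": (2552, 85.6),
    "업소도로명주소 distinct": (2571, 86.3),
    "업소도로명주소 unique": (2523, 84.7),
    "지정일자 distinct": (2168, 72.8),
}

for label, (count, reported) in checks.items():
    status = "ok" if pct(count) == reported else "MISMATCH"
    print(f"{label}: {count}/{N_ROWS} = {pct(count)}% (reported {reported}%) {status}")
```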

Variable types

Categorical: 1
Text: 2
DateTime: 1

Dataset

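Description: Tobacco retailer designation status for Changwon City as of August 1, 2023 (business name, address, designation date), provided as the attached data file.
Author: 경상남도 창원시 (Changwon City, Gyeongsangnam-do)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15021325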

Alerts

Dataset has 10 (0.3%) duplicate rows [Duplicates]
민원구분 is highly imbalanced (61.6%) [Imbalance]
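
The 61.6% imbalance figure can be reproduced from the two value counts of 민원구분 (2757 and 223). The sketch below assumes the score is one minus the Shannon entropy normalized by the maximum entropy for that number of classes; this assumption matches the reported value, but the function name and the single-class convention are illustrative.

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 means balanced, 1 means one class dominates fully."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # convention chosen for this sketch: a single class has no imbalance
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Value counts of 민원구분 taken from the report.
score = imbalance_score([2757, 223])
print(f"{score:.1%}")  # 61.6%
```

A perfectly balanced two-class column would score 0%, so 61.6% reflects the 92.5% / 7.5% split.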

Reproduction

Analysis started: 2023-12-10 23:49:20.515015
Analysis finished: 2023-12-10 23:49:21.284573
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
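
A report of this kind can be regenerated with ydata-profiling roughly as follows. This is a minimal sketch: the CSV path, output file name, title and encoding are assumptions, since the report does not record how the source file was loaded, and the exact `config.json` settings are not reproduced here.

```python
def build_report(csv_path: str, out_html: str = "report.html", encoding: str = "utf-8"):
    """Profile a CSV export of the dataset and write an HTML report.

    `csv_path`, `out_html` and `encoding` are hypothetical; Korean public
    data files are sometimes encoded as cp949 rather than UTF-8.
    """
    # Imported lazily so this function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the designation-date column so it is profiled as DateTime.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["지정일자"])
    report = ProfileReport(df, title="Changwon tobacco retailer designations")
    report.to_file(out_html)
    return report
```

Calling `build_report("retailers.csv")` (a hypothetical file name) would write `report.html` next to the script.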

Variables

민원구분 (civil-application category)
Categorical

Alert: IMBALANCE

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 23.4 KiB
제7조의3제2항에따른경우 (case under Article 7-3, Paragraph 2): 2757
제7조의3제3항에따른경우 (case under Article 7-3, Paragraph 3): 223

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제7조의3제2항에따른경우
2nd row: 제7조의3제2항에따른경우
3rd row: 제7조의3제2항에따른경우
4th row: 제7조의3제2항에따른경우
5th row: 제7조의3제2항에따른경우

Common Values

Value | Count | Frequency (%)
제7조의3제2항에따른경우 | 2757 | 92.5%
제7조의3제3항에따른경우 | 223 | 7.5%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

업소명 (business name)
Text

Distinct: 2706
Distinct (%): 90.8%
Missing: 0
Missing (%): 0.0%
Memory size: 23.4 KiB

Length

Max length: 30
Median length: 25
Mean length: 8.1191275
Min length: 1

Characters and Unicode

Total characters: 24195
Distinct characters: 645
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
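
As a small illustration of these character properties, the sketch below uses Python's standard `unicodedata` module to count the general category of each character in one business name from this dataset. The mapping from two-letter category codes to the long names used in this report is written out by hand and covers only the categories that appear here.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general categories -> long names as shown in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Nl": "Letter Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
    "So": "Other Symbol",
}


def category_counts(text: str) -> Counter:
    """Count characters of `text` by Unicode general category (long name)."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


counts = category_counts("지에스(GS)25")
for name, n in counts.most_common():
    print(f"{name}: {n}")
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for both text variables.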

Unique

Unique: 2552
Unique (%): 85.6%

Sample

1st row: 바다상회
2nd row: 창원식자재마트 신마산점
3rd row: 이마트24 진해다인2차점
4th row: 까까주까 구암점
5th row: 씨유 진해풍호마루점

Words

Value | Count | Frequency (%)
씨유 | 198 | 4.9%
세븐일레븐 | 123 | 3.0%
지에스(gs)25 | 89 | 2.2%
이마트24 | 80 | 2.0%
gs25 | 69 | 1.7%
주)코리아세븐 | 49 | 1.2%
지에스25 | 39 | 1.0%
슈퍼 | 19 | 0.5%
365할인마트 | 19 | 0.5%
미니스톱 | 18 | 0.4%
Other values (2789) | 3356 | 82.7%

Most occurring characters

ValueCountFrequency (%)
1373
 
5.7%
1086
 
4.5%
1050
 
4.3%
694
 
2.9%
646
 
2.7%
2 534
 
2.2%
512
 
2.1%
505
 
2.1%
480
 
2.0%
444
 
1.8%
Other values (635) 16871
69.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 20144 | 83.3%
Decimal Number | 1219 | 5.0%
Space Separator | 1086 | 4.5%
Uppercase Letter | 880 | 3.6%
Open Punctuation | 370 | 1.5%
Close Punctuation | 370 | 1.5%
Lowercase Letter | 87 | 0.4%
Other Punctuation | 33 | 0.1%
Dash Punctuation | 5 | < 0.1%
Letter Number | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1373
 
6.8%
1050
 
5.2%
694
 
3.4%
646
 
3.2%
512
 
2.5%
505
 
2.5%
480
 
2.4%
444
 
2.2%
349
 
1.7%
344
 
1.7%
Other values (570) 13747
68.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 305 | 34.7%
G | 303 | 34.4%
C | 55 | 6.2%
U | 33 | 3.8%
K | 20 | 2.3%
N | 18 | 2.0%
L | 15 | 1.7%
H | 14 | 1.6%
E | 13 | 1.5%
D | 12 | 1.4%
Other values (15) | 92 | 10.5%

Lowercase Letter
Value | Count | Frequency (%)
e | 22 | 25.3%
o | 12 | 13.8%
f | 11 | 12.6%
u | 5 | 5.7%
p | 5 | 5.7%
c | 5 | 5.7%
s | 5 | 5.7%
l | 4 | 4.6%
a | 4 | 4.6%
r | 2 | 2.3%
Other values (9) | 12 | 13.8%

Decimal Number
Value | Count | Frequency (%)
2 | 534 | 43.8%
5 | 410 | 33.6%
4 | 139 | 11.4%
3 | 41 | 3.4%
1 | 36 | 3.0%
6 | 34 | 2.8%
0 | 10 | 0.8%
9 | 7 | 0.6%
7 | 6 | 0.5%
8 | 2 | 0.2%

Other Punctuation
Value | Count | Frequency (%)
. | 21 | 63.6%
& | 7 | 21.2%
/ | 2 | 6.1%
' | 1 | 3.0%
: | 1 | 3.0%
! | 1 | 3.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1086 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 370 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 370 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 20143 | 83.3%
Common | 3083 | 12.7%
Latin | 968 | 4.0%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1373
 
6.8%
1050
 
5.2%
694
 
3.4%
646
 
3.2%
512
 
2.5%
505
 
2.5%
480
 
2.4%
444
 
2.2%
349
 
1.7%
344
 
1.7%
Other values (569) 13746
68.2%

Latin
Value | Count | Frequency (%)
S | 305 | 31.5%
G | 303 | 31.3%
C | 55 | 5.7%
U | 33 | 3.4%
e | 22 | 2.3%
K | 20 | 2.1%
N | 18 | 1.9%
L | 15 | 1.5%
H | 14 | 1.4%
E | 13 | 1.3%
Other values (35) | 170 | 17.6%

Common
Value | Count | Frequency (%)
(space) | 1086 | 35.2%
2 | 534 | 17.3%
5 | 410 | 13.3%
( | 370 | 12.0%
) | 370 | 12.0%
4 | 139 | 4.5%
3 | 41 | 1.3%
1 | 36 | 1.2%
6 | 34 | 1.1%
. | 21 | 0.7%
Other values (10) | 42 | 1.4%

Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 20143 | 83.3%
ASCII | 4050 | 16.7%
Number Forms | 1 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1373
 
6.8%
1050
 
5.2%
694
 
3.4%
646
 
3.2%
512
 
2.5%
505
 
2.5%
480
 
2.4%
444
 
2.2%
349
 
1.7%
344
 
1.7%
Other values (569) 13746
68.2%

ASCII
Value | Count | Frequency (%)
(space) | 1086 | 26.8%
2 | 534 | 13.2%
5 | 410 | 10.1%
( | 370 | 9.1%
) | 370 | 9.1%
S | 305 | 7.5%
G | 303 | 7.5%
4 | 139 | 3.4%
C | 55 | 1.4%
3 | 41 | 1.0%
Other values (54) | 437 | 10.8%

Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

업소도로명주소 (business road-name address)
Text

Distinct: 2571
Distinct (%): 86.3%
Missing: 0
Missing (%): 0.0%
Memory size: 23.4 KiB

Length

Max length: 77
Median length: 64
Mean length: 30.961409
Min length: 1

Characters and Unicode

Total characters: 92265
Distinct characters: 508
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2523
Unique (%): 84.7%

Sample

1st row: 경상남도 창원시 진해구 속천로 120-2 (안곡동)
2nd row: 경상남도 창원시 마산합포구 해안대로 111 (월남동2가)
3rd row: 경상남도 창원시 진해구 신항2로 114 (용원동. 다인로얄팰리스부산 신항2차) 115호 (용원동)
4th row: 경상남도 창원시 마산회원구 구암북12길 20 (구암동)
5th row: 경상남도 창원시 진해구 진해대로1047번길 17-1 (풍호동)

Words

Value | Count | Frequency (%)
창원시 | 2616 | 14.0%
경상남도 | 2615 | 14.0%
의창구 | 569 | 3.1%
성산구 | 552 | 3.0%
진해구 | 539 | 2.9%
1층 | 526 | 2.8%
마산회원구 | 478 | 2.6%
마산합포구 | 477 | 2.6%
101호 | 152 | 0.8%
내서읍 | 115 | 0.6%
Other values (3229) | 9986 | 53.6%

Most occurring characters

ValueCountFrequency (%)
16900
 
18.3%
1 3872
 
4.2%
3697
 
4.0%
3450
 
3.7%
3240
 
3.5%
3143
 
3.4%
3136
 
3.4%
2774
 
3.0%
2721
 
2.9%
2716
 
2.9%
Other values (498) 46616
50.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 55340 | 60.0%
Space Separator | 16900 | 18.3%
Decimal Number | 12508 | 13.6%
Close Punctuation | 2425 | 2.6%
Open Punctuation | 2425 | 2.6%
Other Punctuation | 2064 | 2.2%
Dash Punctuation | 342 | 0.4%
Uppercase Letter | 194 | 0.2%
Lowercase Letter | 50 | 0.1%
Math Symbol | 13 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3697
 
6.7%
3450
 
6.2%
3240
 
5.9%
3143
 
5.7%
3136
 
5.7%
2774
 
5.0%
2721
 
4.9%
2716
 
4.9%
2711
 
4.9%
2114
 
3.8%
Other values (439) 25638
46.3%

Uppercase Letter
Value | Count | Frequency (%)
S | 29 | 14.9%
A | 24 | 12.4%
B | 22 | 11.3%
C | 16 | 8.2%
G | 15 | 7.7%
T | 12 | 6.2%
L | 11 | 5.7%
N | 10 | 5.2%
K | 9 | 4.6%
X | 9 | 4.6%
Other values (15) | 37 | 19.1%

Lowercase Letter
Value | Count | Frequency (%)
e | 8 | 16.0%
o | 7 | 14.0%
r | 6 | 12.0%
m | 6 | 12.0%
i | 4 | 8.0%
n | 4 | 8.0%
a | 4 | 8.0%
t | 4 | 8.0%
p | 3 | 6.0%
v | 1 | 2.0%
Other values (3) | 3 | 6.0%

Decimal Number
Value | Count | Frequency (%)
1 | 3872 | 31.0%
2 | 1601 | 12.8%
0 | 1323 | 10.6%
3 | 1227 | 9.8%
4 | 931 | 7.4%
5 | 885 | 7.1%
6 | 775 | 6.2%
7 | 714 | 5.7%
8 | 613 | 4.9%
9 | 567 | 4.5%

Other Punctuation
Value | Count | Frequency (%)
. | 2004 | 97.1%
· | 53 | 2.6%
: | 3 | 0.1%
& | 2 | 0.1%
@ | 2 | 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 16900 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2425 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2425 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 342 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 13 | 100.0%

Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 55340 | 60.0%
Common | 36681 | 39.8%
Latin | 244 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3697
 
6.7%
3450
 
6.2%
3240
 
5.9%
3143
 
5.7%
3136
 
5.7%
2774
 
5.0%
2721
 
4.9%
2716
 
4.9%
2711
 
4.9%
2114
 
3.8%
Other values (439) 25638
46.3%

Latin
Value | Count | Frequency (%)
S | 29 | 11.9%
A | 24 | 9.8%
B | 22 | 9.0%
C | 16 | 6.6%
G | 15 | 6.1%
T | 12 | 4.9%
L | 11 | 4.5%
N | 10 | 4.1%
K | 9 | 3.7%
X | 9 | 3.7%
Other values (28) | 87 | 35.7%

Common
Value | Count | Frequency (%)
(space) | 16900 | 46.1%
1 | 3872 | 10.6%
) | 2425 | 6.6%
( | 2425 | 6.6%
. | 2004 | 5.5%
2 | 1601 | 4.4%
0 | 1323 | 3.6%
3 | 1227 | 3.3%
4 | 931 | 2.5%
5 | 885 | 2.4%
Other values (11) | 3088 | 8.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 55340 | 60.0%
ASCII | 36868 | 40.0%
None | 53 | 0.1%
CJK Compat | 4 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 16900 | 45.8%
1 | 3872 | 10.5%
) | 2425 | 6.6%
( | 2425 | 6.6%
. | 2004 | 5.4%
2 | 1601 | 4.3%
0 | 1323 | 3.6%
3 | 1227 | 3.3%
4 | 931 | 2.5%
5 | 885 | 2.4%
Other values (47) | 3275 | 8.9%

Hangul
ValueCountFrequency (%)
3697
 
6.7%
3450
 
6.2%
3240
 
5.9%
3143
 
5.7%
3136
 
5.7%
2774
 
5.0%
2721
 
4.9%
2716
 
4.9%
2711
 
4.9%
2114
 
3.8%
Other values (439) 25638
46.3%

None
Value | Count | Frequency (%)
· | 53 | 100.0%

CJK Compat
ValueCountFrequency (%)
4
100.0%

지정일자 (designation date)
DateTime

Distinct: 2168
Distinct (%): 72.8%
Missing: 0
Missing (%): 0.0%
Memory size: 23.4 KiB
Minimum: 1967-11-16 00:00:00
Maximum: 2023-08-01 00:00:00

Histogram with fixed size bins (bins=50)
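
Fixed-size binning of a date range means splitting the span between the minimum and maximum into equally wide intervals. The sketch below shows the idea for the reported range and bin count using only the standard library; the names `edges` and `bin_index` are illustrative, and this is not necessarily how the plotting library computes its bins.

```python
from datetime import datetime

N_BINS = 50
start = datetime(1967, 11, 16)  # reported minimum
end = datetime(2023, 8, 1)      # reported maximum

# Every bin has the same width: the full span divided by the bin count.
width = (end - start) / N_BINS
edges = [start + i * width for i in range(N_BINS + 1)]


def bin_index(ts: datetime) -> int:
    """Return the 0-based bin containing `ts`; the last bin includes the maximum."""
    if not (start <= ts <= end):
        raise ValueError("timestamp outside the histogram range")
    return min(int((ts - start) / width), N_BINS - 1)


print(f"bin width: {width.days} days")
print("bin of 1980-12-20:", bin_index(datetime(1980, 12, 20)))
```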

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows

Row | 민원구분 | 업소명 | 업소도로명주소 | 지정일자
0 | 제7조의3제2항에따른경우 | 바다상회 | 경상남도 창원시 진해구 속천로 120-2 (안곡동) | 2023-08-01
1 | 제7조의3제2항에따른경우 | 창원식자재마트 신마산점 | 경상남도 창원시 마산합포구 해안대로 111 (월남동2가) | 2023-07-31
2 | 제7조의3제2항에따른경우 | 이마트24 진해다인2차점 | 경상남도 창원시 진해구 신항2로 114 (용원동. 다인로얄팰리스부산 신항2차) 115호 (용원동) | 2023-07-28
3 | 제7조의3제2항에따른경우 | 까까주까 구암점 | 경상남도 창원시 마산회원구 구암북12길 20 (구암동) | 2023-07-28
4 | 제7조의3제2항에따른경우 | 씨유 진해풍호마루점 | 경상남도 창원시 진해구 진해대로1047번길 17-1 (풍호동) | 2023-07-21
5 | 제7조의3제2항에따른경우 | 농업회사법인(주)쌀스토리 | 경상남도 창원시 성산구 창원대로 524. 2층 (대원동) | 2023-07-19
6 | 제7조의3제2항에따른경우 | 보니따 속천 | 경상남도 창원시 진해구 속천로 30. 1층 (태평동) | 2023-07-17
7 | 제7조의3제2항에따른경우 | 퀸마트(석동점) | 경상남도 창원시 진해구 석동로 67. 퀸마트(석동점) (석동) | 2023-07-13
8 | 제7조의3제3항에따른경우 | (주)정진홈푸드 창원지방법원점 | 경상남도 창원시 성산구 창이대로 681. 창원지방법원 지하1층 (사파동) | 2023-07-13
9 | 제7조의3제2항에따른경우 | 지에스(GS)25 성산반딧불점 | 경상남도 창원시 성산구 반지로 31. 1층 (반지동) | 2023-07-13

Last rows

Row | 민원구분 | 업소명 | 업소도로명주소 | 지정일자
2970 | 제7조의3제2항에따른경우 | 슈퍼 |  | 1980-12-20
2971 | 제7조의3제2항에따른경우 | 세븐일레븐 마산SK뷰 후문점 | 경상남도 창원시 마산합포구 월영남18길 68 (월영동) | 1980-12-20
2972 | 제7조의3제2항에따른경우 | 잡화 | 경상남도 창원시 마산합포구 산호동2길 20 (산호동) | 1980-12-18
2973 | 제7조의3제2항에따른경우 | 잡화 |  | 1980-12-15
2974 | 제7조의3제2항에따른경우 | 잡화 |  | 1980-06-04
2975 | 제7조의3제2항에따른경우 | 연쇄슈퍼 | 경상남도 창원시 마산합포구 교방천남길 348 (오동동) | 1977-07-27
2976 | 제7조의3제2항에따른경우 | 구멍가게 |  | 1974-02-25
2977 | 제7조의3제2항에따른경우 | 슈퍼 |  | 1973-09-12
2978 | 제7조의3제2항에따른경우 | 신롯데상회 |  | 1971-05-05
2979 | 제7조의3제2항에따른경우 | 잡화 |  | 1967-11-16

Duplicate rows

Most frequently occurring

Row | 민원구분 | 업소명 | 업소도로명주소 | 지정일자 | # duplicates
0 | 제7조의3제2항에따른경우 | GS25 창원봉림점 | 경상남도 창원시 의창구 창이대로309번길 2. 103호 (봉곡동) | 2022-08-09 | 2
1 | 제7조의3제2항에따른경우 | GS25성산창곡로점 | 경상남도 창원시 성산구 창곡로 54. 3동 101.102호 (신촌동) | 2022-08-08 | 2
2 | 제7조의3제2항에따른경우 | 대산유통 | 경상남도 창원시 의창구 대산면 진산대로287번길 6-7 | 2022-08-09 | 2
3 | 제7조의3제2항에따른경우 | 마산식당 |  | 1998-10-30 | 2
4 | 제7조의3제2항에따른경우 | 씨유 마산합성제일점 | 경상남도 창원시 마산회원구 합성서9길 12 (합성동) | 2022-08-10 | 2
5 | 제7조의3제2항에따른경우 | 씨유창원팔용힐스테이트점 | 경상남도 창원시 의창구 창원대로397번길 11. 203동 1층 151호 (팔용동. 힐스테이트 아티움시티) | 2022-08-09 | 2
6 | 제7조의3제2항에따른경우 | 지에스25진해구청점 | 경상남도 창원시 진해구 천자로 411. 경동프라자 105.105-1호 (풍호동) | 2022-08-08 | 2
7 | 제7조의3제2항에따른경우 | 카페73 | 경상남도 창원시 성산구 창원대로 442 (대원동) | 2022-08-10 | 2
8 | 제7조의3제2항에따른경우 | 행복25편의점 | 경상남도 창원시 의창구 사림로 67. 지하층 1호 (사림동) | 2022-08-04 | 2
9 | 제7조의3제3항에따른경우 | 행복식자재마트 극동점 | 경상남도 창원시 의창구 팔용로 438. 팔용종합상가 지하층 (팔용동) | 2022-08-09 | 2
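
Duplicate rows are rows whose values match in every column. The sketch below counts them with the standard library on a toy sample built from two rows of this dataset plus one non-duplicated row; the `rows` list is illustrative only. It assumes the report counts each repeated row once, which is consistent with the ten rows listed here each occurring twice.

```python
from collections import Counter

# Toy sample: (민원구분, 업소명, 업소도로명주소, 지정일자)
rows = [
    ("제7조의3제2항에따른경우", "카페73", "경상남도 창원시 성산구 창원대로 442 (대원동)", "2022-08-10"),
    ("제7조의3제2항에따른경우", "마산식당", "", "1998-10-30"),
    ("제7조의3제2항에따른경우", "카페73", "경상남도 창원시 성산구 창원대로 442 (대원동)", "2022-08-10"),
    ("제7조의3제2항에따른경우", "바다상회", "경상남도 창원시 진해구 속천로 120-2 (안곡동)", "2023-08-01"),
    ("제7조의3제2항에따른경우", "마산식당", "", "1998-10-30"),
]

occurrences = Counter(rows)

# Rows that appear more than once, with how often each appears.
duplicated = {row: n for row, n in occurrences.items() if n > 1}
n_duplicate_rows = len(duplicated)                       # distinct repeated rows
n_extra_copies = sum(n - 1 for n in duplicated.values())  # rows a de-duplication would drop

for row, n in duplicated.items():
    print(row[1], "appears", n, "times")
print("distinct duplicated rows:", n_duplicate_rows)
print("extra copies:", n_extra_copies)
```

When every repeated row occurs exactly twice, as in the table above, both counts coincide.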