Overview

Dataset statistics

Number of variables6
Number of observations614
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory29.5 KiB
Average record size in memory49.2 B

Variable types

Numeric1
Categorical2
Text2
DateTime1

Dataset

Description여수시 관내 담배소매인 지정 자료 제공
Author전라남도 여수시
URLhttps://www.data.go.kr/data/3079992/fileData.do

Alerts

영업구분 is highly imbalanced (98.3%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:33:47.869187
Analysis finished2023-12-12 10:33:48.712277
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct614
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean307.5
Minimum1
Maximum614
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.5 KiB
2023-12-12T19:33:48.797945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile31.65
Q1154.25
median307.5
Q3460.75
95-th percentile583.35
Maximum614
Range613
Interquartile range (IQR)306.5

Descriptive statistics

Standard deviation177.39081
Coefficient of variation (CV)0.57688069
Kurtosis-1.2
Mean307.5
Median Absolute Deviation (MAD)153.5
Skewness0
Sum188805
Variance31467.5
MonotonicityStrictly increasing
2023-12-12T19:33:48.953706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
414 1
 
0.2%
407 1
 
0.2%
408 1
 
0.2%
409 1
 
0.2%
410 1
 
0.2%
411 1
 
0.2%
412 1
 
0.2%
413 1
 
0.2%
415 1
 
0.2%
Other values (604) 604
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
614 1
0.2%
613 1
0.2%
612 1
0.2%
611 1
0.2%
610 1
0.2%
609 1
0.2%
608 1
0.2%
607 1
0.2%
606 1
0.2%
605 1
0.2%

민원구분
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
제7조의3제2항에따른경우
536 
제7조의3제3항에따른경우
78 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제7조의3제2항에따른경우
2nd row제7조의3제2항에따른경우
3rd row제7조의3제2항에따른경우
4th row제7조의3제3항에따른경우
5th row제7조의3제2항에따른경우

Common Values

ValueCountFrequency (%)
제7조의3제2항에따른경우 536
87.3%
제7조의3제3항에따른경우 78
 
12.7%

Length

2023-12-12T19:33:49.135471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:33:49.254484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제7조의3제2항에따른경우 536
87.3%
제7조의3제3항에따른경우 78
 
12.7%
Distinct609
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-12T19:33:49.582728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length16
Mean length9.3697068
Min length2

Characters and Unicode

Total characters5753
Distinct characters381
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique604 ?
Unique (%)98.4%

Sample

1st row주공할인마트
2nd row솔마트
3rd row남해화학 구판장
4th row일레븐마트 선경점
5th row지에스(GS)25 죽림부영점
ValueCountFrequency (%)
세븐일레븐 60
 
6.4%
씨유 49
 
5.2%
이마트24 37
 
4.0%
지에스(gs)25 35
 
3.7%
지에스25 22
 
2.4%
gs25 13
 
1.4%
주)코리아세븐 7
 
0.7%
주식회사 7
 
0.7%
씨유(cu 7
 
0.7%
여수 5
 
0.5%
Other values (637) 692
74.1%
2023-12-12T19:33:50.460250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
391
 
6.8%
349
 
6.1%
336
 
5.8%
321
 
5.6%
164
 
2.9%
2 162
 
2.8%
162
 
2.8%
148
 
2.6%
117
 
2.0%
5 114
 
2.0%
Other values (371) 3489
60.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4644
80.7%
Decimal Number 327
 
5.7%
Space Separator 321
 
5.6%
Uppercase Letter 247
 
4.3%
Open Punctuation 95
 
1.7%
Close Punctuation 95
 
1.7%
Lowercase Letter 16
 
0.3%
Other Symbol 4
 
0.1%
Dash Punctuation 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
391
 
8.4%
349
 
7.5%
336
 
7.2%
164
 
3.5%
162
 
3.5%
148
 
3.2%
117
 
2.5%
100
 
2.2%
90
 
1.9%
87
 
1.9%
Other values (330) 2700
58.1%
Uppercase Letter
ValueCountFrequency (%)
S 81
32.8%
G 79
32.0%
C 29
 
11.7%
U 25
 
10.1%
E 6
 
2.4%
R 6
 
2.4%
D 4
 
1.6%
V 3
 
1.2%
L 3
 
1.2%
H 3
 
1.2%
Other values (8) 8
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
a 3
18.8%
n 2
12.5%
e 2
12.5%
s 2
12.5%
g 2
12.5%
v 1
 
6.2%
i 1
 
6.2%
t 1
 
6.2%
p 1
 
6.2%
u 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
2 162
49.5%
5 114
34.9%
4 47
 
14.4%
3 2
 
0.6%
6 1
 
0.3%
7 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
# 1
50.0%
Space Separator
ValueCountFrequency (%)
321
100.0%
Open Punctuation
ValueCountFrequency (%)
( 95
100.0%
Close Punctuation
ValueCountFrequency (%)
) 95
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4648
80.8%
Common 842
 
14.6%
Latin 263
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
391
 
8.4%
349
 
7.5%
336
 
7.2%
164
 
3.5%
162
 
3.5%
148
 
3.2%
117
 
2.5%
100
 
2.2%
90
 
1.9%
87
 
1.9%
Other values (331) 2704
58.2%
Latin
ValueCountFrequency (%)
S 81
30.8%
G 79
30.0%
C 29
 
11.0%
U 25
 
9.5%
E 6
 
2.3%
R 6
 
2.3%
D 4
 
1.5%
V 3
 
1.1%
L 3
 
1.1%
H 3
 
1.1%
Other values (18) 24
 
9.1%
Common
ValueCountFrequency (%)
321
38.1%
2 162
19.2%
5 114
 
13.5%
( 95
 
11.3%
) 95
 
11.3%
4 47
 
5.6%
3 2
 
0.2%
- 2
 
0.2%
6 1
 
0.1%
. 1
 
0.1%
Other values (2) 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4644
80.7%
ASCII 1105
 
19.2%
None 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
391
 
8.4%
349
 
7.5%
336
 
7.2%
164
 
3.5%
162
 
3.5%
148
 
3.2%
117
 
2.5%
100
 
2.2%
90
 
1.9%
87
 
1.9%
Other values (330) 2700
58.1%
ASCII
ValueCountFrequency (%)
321
29.0%
2 162
14.7%
5 114
 
10.3%
( 95
 
8.6%
) 95
 
8.6%
S 81
 
7.3%
G 79
 
7.1%
4 47
 
4.3%
C 29
 
2.6%
U 25
 
2.3%
Other values (30) 57
 
5.2%
None
ValueCountFrequency (%)
4
100.0%
Distinct610
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
2023-12-12T19:33:50.850833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length53
Mean length27.105863
Min length1

Characters and Unicode

Total characters16643
Distinct characters287
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique609 ?
Unique (%)99.2%

Sample

1st row전라남도 여수시 여서2로 38. 1호 상가 (여서동)
2nd row전라남도 여수시 소호5길 6-5 (소호동)
3rd row전라남도 여수시 여수산단로 1384 (낙포동)
4th row전라남도 여수시 양지1길 3. 상가동 지하층 8호 (미평동. 선경아파트)
5th row전라남도 여수시 소라면 죽림로 11
ValueCountFrequency (%)
전라남도 609
 
16.7%
여수시 609
 
16.7%
1층 156
 
4.3%
돌산읍 60
 
1.6%
학동 48
 
1.3%
소라면 35
 
1.0%
문수동 35
 
1.0%
상가동 35
 
1.0%
웅천동 34
 
0.9%
여서동 31
 
0.8%
Other values (871) 1999
54.8%
2023-12-12T19:33:51.482567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3048
18.3%
1 789
 
4.7%
752
 
4.5%
738
 
4.4%
677
 
4.1%
650
 
3.9%
644
 
3.9%
634
 
3.8%
616
 
3.7%
613
 
3.7%
Other values (277) 7482
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9578
57.5%
Space Separator 3048
 
18.3%
Decimal Number 2507
 
15.1%
Close Punctuation 485
 
2.9%
Open Punctuation 485
 
2.9%
Other Punctuation 390
 
2.3%
Dash Punctuation 123
 
0.7%
Uppercase Letter 17
 
0.1%
Math Symbol 6
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
752
 
7.9%
738
 
7.7%
677
 
7.1%
650
 
6.8%
644
 
6.7%
634
 
6.6%
616
 
6.4%
613
 
6.4%
407
 
4.2%
203
 
2.1%
Other values (250) 3644
38.0%
Decimal Number
ValueCountFrequency (%)
1 789
31.5%
2 355
14.2%
3 243
 
9.7%
0 237
 
9.5%
4 185
 
7.4%
5 171
 
6.8%
6 165
 
6.6%
9 122
 
4.9%
7 120
 
4.8%
8 120
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
B 8
47.1%
N 2
 
11.8%
C 2
 
11.8%
A 2
 
11.8%
D 1
 
5.9%
S 1
 
5.9%
J 1
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
b 1
25.0%
e 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 388
99.5%
@ 2
 
0.5%
Space Separator
ValueCountFrequency (%)
3048
100.0%
Close Punctuation
ValueCountFrequency (%)
) 485
100.0%
Open Punctuation
ValueCountFrequency (%)
( 485
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 123
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9578
57.5%
Common 7044
42.3%
Latin 21
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
752
 
7.9%
738
 
7.7%
677
 
7.1%
650
 
6.8%
644
 
6.7%
634
 
6.6%
616
 
6.4%
613
 
6.4%
407
 
4.2%
203
 
2.1%
Other values (250) 3644
38.0%
Common
ValueCountFrequency (%)
3048
43.3%
1 789
 
11.2%
) 485
 
6.9%
( 485
 
6.9%
. 388
 
5.5%
2 355
 
5.0%
3 243
 
3.4%
0 237
 
3.4%
4 185
 
2.6%
5 171
 
2.4%
Other values (7) 658
 
9.3%
Latin
ValueCountFrequency (%)
B 8
38.1%
N 2
 
9.5%
c 2
 
9.5%
C 2
 
9.5%
A 2
 
9.5%
D 1
 
4.8%
b 1
 
4.8%
e 1
 
4.8%
S 1
 
4.8%
J 1
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9578
57.5%
ASCII 7065
42.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3048
43.1%
1 789
 
11.2%
) 485
 
6.9%
( 485
 
6.9%
. 388
 
5.5%
2 355
 
5.0%
3 243
 
3.4%
0 237
 
3.4%
4 185
 
2.6%
5 171
 
2.4%
Other values (17) 679
 
9.6%
Hangul
ValueCountFrequency (%)
752
 
7.9%
738
 
7.7%
677
 
7.1%
650
 
6.8%
644
 
6.7%
634
 
6.6%
616
 
6.4%
613
 
6.4%
407
 
4.2%
203
 
2.1%
Other values (250) 3644
38.0%
Distinct489
Distinct (%)79.6%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
Minimum2014-01-03 00:00:00
Maximum2023-08-18 00:00:00
2023-12-12T19:33:51.659046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:33:51.815687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
정상영업
613 
영업정지
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row정상영업
2nd row정상영업
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 613
99.8%
영업정지 1
 
0.2%

Length

2023-12-12T19:33:52.031925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:33:52.133628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 613
99.8%
영업정지 1
 
0.2%

Interactions

2023-12-12T19:33:48.332060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:33:52.216773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번민원구분영업구분
연번1.0000.0760.015
민원구분0.0761.0000.000
영업구분0.0150.0001.000
2023-12-12T19:33:52.324594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업구분민원구분
영업구분1.0000.000
민원구분0.0001.000
2023-12-12T19:33:52.417637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번민원구분영업구분
연번1.0000.0580.010
민원구분0.0581.0000.000
영업구분0.0100.0001.000

Missing values

2023-12-12T19:33:48.517157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:33:48.659141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번민원구분업소명업소도로명주소지정일자영업구분
01제7조의3제2항에따른경우주공할인마트전라남도 여수시 여서2로 38. 1호 상가 (여서동)2023-08-18정상영업
12제7조의3제2항에따른경우솔마트전라남도 여수시 소호5길 6-5 (소호동)2023-08-18정상영업
23제7조의3제2항에따른경우남해화학 구판장전라남도 여수시 여수산단로 1384 (낙포동)2023-08-16정상영업
34제7조의3제3항에따른경우일레븐마트 선경점전라남도 여수시 양지1길 3. 상가동 지하층 8호 (미평동. 선경아파트)2023-08-14정상영업
45제7조의3제2항에따른경우지에스(GS)25 죽림부영점전라남도 여수시 소라면 죽림로 112023-08-14정상영업
56제7조의3제2항에따른경우여수대박복권전라남도 여수시 돌산읍 강남9길 242023-08-04정상영업
67제7조의3제2항에따른경우세븐일레븐 여수돌산라온점전라남도 여수시 돌산읍 강남해안로 1012023-08-04정상영업
78제7조의3제2항에따른경우샤크전자담배전라남도 여수시 미평로 39 (미평동)2023-07-26정상영업
89제7조의3제3항에따른경우지에스(GS)25 여수돌산점전라남도 여수시 돌산읍 강남8길 332023-07-21정상영업
910제7조의3제2항에따른경우이마트24 여수문수중앙점전라남도 여수시 문수북6길 14. 1층 (문수동)2023-07-18정상영업
연번민원구분업소명업소도로명주소지정일자영업구분
604605제7조의3제2항에따른경우GS25 미평원룸점전라남도 여수시 미평10길 30 (미평동)2014-05-01정상영업
605606제7조의3제2항에따른경우뷰티크레딧 동신전라남도 여수시 서교2길 6 (서교동)2014-04-28정상영업
606607제7조의3제2항에따른경우GS25여수센트럴전라남도 여수시 시청동1길 29. 106호 (학동)2014-04-21정상영업
607608제7조의3제2항에따른경우우리슈퍼전라남도 여수시 여문2로 15. 상가동 101호 (문수동. 문수주공아파트)2014-04-14정상영업
608609제7조의3제3항에따른경우여수공장사우회전라남도 여수시 여수산단로 951 (적량동)2014-03-31정상영업
609610제7조의3제3항에따른경우여수공장사우회전라남도 여수시 여수산단로 1121 (월내동)2014-03-31정상영업
610611제7조의3제2항에따른경우세븐일레븐여수여서점전라남도 여수시 여서1로 56 (여서동)2014-03-13정상영업
611612제7조의3제2항에따른경우부영함바웅천점2014-02-10정상영업
612613제7조의3제3항에따른경우GS25 여수여문전라남도 여수시 여문문화1길 16 (여서동)2014-01-16정상영업
613614제7조의3제2항에따른경우파밀리에마트전라남도 여수시 쌍봉로 32. 상가2동 121.122호 (학동. 신동아파밀리에)2014-01-03정상영업