Overview

Dataset statistics

Number of variables6
Number of observations157
Missing cells84
Missing cells (%)8.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.6 KiB
Average record size in memory49.8 B

Variable types

Numeric1
Categorical1
Text3
DateTime1

Dataset

Description인천광역시 동구 담배소매인지정현황 데이터로, 민원구분, 업소명, 업소도로명주소, 업소전화번호, 지정일자 등 항목을 게시하였습니다.
URLhttps://www.data.go.kr/data/15045278/fileData.do

Alerts

업소전화번호 has 84 (53.5%) missing valuesMissing
연번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:13:08.683674
Analysis finished2023-12-12 07:13:09.360333
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct157
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean79.121019
Minimum1
Maximum159
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-12T16:13:09.427555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.8
Q140
median79
Q3118
95-th percentile150.2
Maximum159
Range158
Interquartile range (IQR)78

Descriptive statistics

Standard deviation45.664517
Coefficient of variation (CV)0.57714773
Kurtosis-1.1842367
Mean79.121019
Median Absolute Deviation (MAD)39
Skewness0.01344368
Sum12422
Variance2085.2481
MonotonicityStrictly increasing
2023-12-12T16:13:09.574776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
109 1
 
0.6%
102 1
 
0.6%
103 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
110 1
 
0.6%
Other values (147) 147
93.6%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%
154 1
0.6%
153 1
0.6%
151 1
0.6%
150 1
0.6%
149 1
0.6%

민원구분
Categorical

Distinct3
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
제7조의3제2항에따른경우
90 
<NA>
46 
제7조의3제3항에따른경우
21 

Length

Max length13
Median length13
Mean length10.363057
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제7조의3제3항에따른경우
2nd row제7조의3제3항에따른경우
3rd row제7조의3제3항에따른경우
4th row제7조의3제2항에따른경우
5th row제7조의3제2항에따른경우

Common Values

ValueCountFrequency (%)
제7조의3제2항에따른경우 90
57.3%
<NA> 46
29.3%
제7조의3제3항에따른경우 21
 
13.4%

Length

2023-12-12T16:13:09.785336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:13:09.924945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제7조의3제2항에따른경우 90
57.3%
na 46
29.3%
제7조의3제3항에따른경우 21
 
13.4%

업소명
Text

UNIQUE 

Distinct157
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-12T16:13:10.234813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length6.9044586
Min length2

Characters and Unicode

Total characters1084
Distinct characters252
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique157 ?
Unique (%)100.0%

Sample

1st row(주)풀무원푸드앤컬처
2nd row송림장갑
3rd row(주)현대그린푸드
4th row은성 종합유통
5th row세븐일레븐 송림우정점
ValueCountFrequency (%)
세븐일레븐 9
 
4.3%
지에스25 8
 
3.8%
씨유 7
 
3.3%
송림점 3
 
1.4%
gs25 3
 
1.4%
이마트24 2
 
1.0%
인천송현점 2
 
1.0%
cu 2
 
1.0%
담배가게 1
 
0.5%
주)한국유통 1
 
0.5%
Other values (171) 171
81.8%
2023-12-12T16:13:10.709745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
5.0%
52
 
4.8%
35
 
3.2%
30
 
2.8%
29
 
2.7%
24
 
2.2%
21
 
1.9%
20
 
1.8%
19
 
1.8%
19
 
1.8%
Other values (242) 781
72.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 941
86.8%
Space Separator 52
 
4.8%
Decimal Number 38
 
3.5%
Uppercase Letter 21
 
1.9%
Lowercase Letter 12
 
1.1%
Close Punctuation 9
 
0.8%
Open Punctuation 9
 
0.8%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
5.7%
35
 
3.7%
30
 
3.2%
29
 
3.1%
24
 
2.6%
21
 
2.2%
20
 
2.1%
19
 
2.0%
19
 
2.0%
19
 
2.0%
Other values (219) 671
71.3%
Lowercase Letter
ValueCountFrequency (%)
a 2
16.7%
e 2
16.7%
h 2
16.7%
m 1
8.3%
i 1
8.3%
l 1
8.3%
u 1
8.3%
o 1
8.3%
b 1
8.3%
Uppercase Letter
ValueCountFrequency (%)
S 6
28.6%
G 5
23.8%
C 3
14.3%
T 2
 
9.5%
D 2
 
9.5%
U 2
 
9.5%
Y 1
 
4.8%
Decimal Number
ValueCountFrequency (%)
2 19
50.0%
5 15
39.5%
4 4
 
10.5%
Space Separator
ValueCountFrequency (%)
52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 941
86.8%
Common 110
 
10.1%
Latin 33
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
5.7%
35
 
3.7%
30
 
3.2%
29
 
3.1%
24
 
2.6%
21
 
2.2%
20
 
2.1%
19
 
2.0%
19
 
2.0%
19
 
2.0%
Other values (219) 671
71.3%
Latin
ValueCountFrequency (%)
S 6
18.2%
G 5
15.2%
C 3
9.1%
a 2
 
6.1%
e 2
 
6.1%
h 2
 
6.1%
T 2
 
6.1%
D 2
 
6.1%
U 2
 
6.1%
m 1
 
3.0%
Other values (6) 6
18.2%
Common
ValueCountFrequency (%)
52
47.3%
2 19
 
17.3%
5 15
 
13.6%
) 9
 
8.2%
( 9
 
8.2%
4 4
 
3.6%
. 2
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 941
86.8%
ASCII 143
 
13.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
5.7%
35
 
3.7%
30
 
3.2%
29
 
3.1%
24
 
2.6%
21
 
2.2%
20
 
2.1%
19
 
2.0%
19
 
2.0%
19
 
2.0%
Other values (219) 671
71.3%
ASCII
ValueCountFrequency (%)
52
36.4%
2 19
 
13.3%
5 15
 
10.5%
) 9
 
6.3%
( 9
 
6.3%
S 6
 
4.2%
G 5
 
3.5%
4 4
 
2.8%
C 3
 
2.1%
. 2
 
1.4%
Other values (13) 19
 
13.3%
Distinct156
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-12T16:13:11.031630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length45
Mean length29.917197
Min length20

Characters and Unicode

Total characters4697
Distinct characters150
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)98.7%

Sample

1st row인천광역시 동구 인중로 489, HD현대인프라코어 (화수동)
2nd row인천광역시 동구 방축로37번길 30, 인천산업용품유통단지 3동 137호 (송현동)
3rd row인천광역시 동구 인중로 489, HD현대인프라코어 (화수동)
4th row인천광역시 동구 송현로 11-1 (송현동)
5th row인천광역시 동구 연송로 4, 1층 (송림동)
ValueCountFrequency (%)
인천광역시 157
 
16.5%
동구 157
 
16.5%
송림동 60
 
6.3%
송현동 40
 
4.2%
1층 32
 
3.4%
화수동 23
 
2.4%
방축로37번길 13
 
1.4%
인천산업용품유통단지 11
 
1.2%
화도진로 10
 
1.1%
송현로 10
 
1.1%
Other values (262) 439
46.1%
2023-12-12T16:13:11.461277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
795
 
16.9%
354
 
7.5%
1 204
 
4.3%
185
 
3.9%
180
 
3.8%
( 161
 
3.4%
) 161
 
3.4%
159
 
3.4%
159
 
3.4%
159
 
3.4%
Other values (140) 2180
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2741
58.4%
Space Separator 795
 
16.9%
Decimal Number 686
 
14.6%
Open Punctuation 161
 
3.4%
Close Punctuation 161
 
3.4%
Other Punctuation 112
 
2.4%
Dash Punctuation 22
 
0.5%
Uppercase Letter 19
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
354
 
12.9%
185
 
6.7%
180
 
6.6%
159
 
5.8%
159
 
5.8%
159
 
5.8%
158
 
5.8%
157
 
5.7%
157
 
5.7%
82
 
3.0%
Other values (119) 991
36.2%
Decimal Number
ValueCountFrequency (%)
1 204
29.7%
3 97
14.1%
2 77
 
11.2%
0 74
 
10.8%
4 54
 
7.9%
7 53
 
7.7%
8 34
 
5.0%
5 33
 
4.8%
9 32
 
4.7%
6 28
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
B 8
42.1%
A 5
26.3%
D 2
 
10.5%
H 2
 
10.5%
C 1
 
5.3%
G 1
 
5.3%
Space Separator
ValueCountFrequency (%)
795
100.0%
Open Punctuation
ValueCountFrequency (%)
( 161
100.0%
Close Punctuation
ValueCountFrequency (%)
) 161
100.0%
Other Punctuation
ValueCountFrequency (%)
, 112
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2741
58.4%
Common 1937
41.2%
Latin 19
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
354
 
12.9%
185
 
6.7%
180
 
6.6%
159
 
5.8%
159
 
5.8%
159
 
5.8%
158
 
5.8%
157
 
5.7%
157
 
5.7%
82
 
3.0%
Other values (119) 991
36.2%
Common
ValueCountFrequency (%)
795
41.0%
1 204
 
10.5%
( 161
 
8.3%
) 161
 
8.3%
, 112
 
5.8%
3 97
 
5.0%
2 77
 
4.0%
0 74
 
3.8%
4 54
 
2.8%
7 53
 
2.7%
Other values (5) 149
 
7.7%
Latin
ValueCountFrequency (%)
B 8
42.1%
A 5
26.3%
D 2
 
10.5%
H 2
 
10.5%
C 1
 
5.3%
G 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2741
58.4%
ASCII 1956
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
795
40.6%
1 204
 
10.4%
( 161
 
8.2%
) 161
 
8.2%
, 112
 
5.7%
3 97
 
5.0%
2 77
 
3.9%
0 74
 
3.8%
4 54
 
2.8%
7 53
 
2.7%
Other values (11) 168
 
8.6%
Hangul
ValueCountFrequency (%)
354
 
12.9%
185
 
6.7%
180
 
6.6%
159
 
5.8%
159
 
5.8%
159
 
5.8%
158
 
5.8%
157
 
5.7%
157
 
5.7%
82
 
3.0%
Other values (119) 991
36.2%

업소전화번호
Text

MISSING 

Distinct73
Distinct (%)100.0%
Missing84
Missing (%)53.5%
Memory size1.4 KiB
2023-12-12T16:13:11.724498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters876
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)100.0%

Sample

1st row032-589-5386
2nd row032-773-0388
3rd row032-773-8545
4th row032-772-2011
5th row032-589-5135
ValueCountFrequency (%)
032-772-6036 1
 
1.4%
032-772-8889 1
 
1.4%
032-762-4030 1
 
1.4%
032-762-7755 1
 
1.4%
032-773-4903 1
 
1.4%
032-765-5872 1
 
1.4%
032-765-4836 1
 
1.4%
032-589-1273 1
 
1.4%
032-777-8928 1
 
1.4%
032-764-4081 1
 
1.4%
Other values (63) 63
86.3%
2023-12-12T16:13:12.125928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 146
16.7%
2 127
14.5%
7 127
14.5%
3 118
13.5%
0 111
12.7%
6 61
7.0%
5 48
 
5.5%
8 46
 
5.3%
4 40
 
4.6%
1 26
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 730
83.3%
Dash Punctuation 146
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 127
17.4%
7 127
17.4%
3 118
16.2%
0 111
15.2%
6 61
8.4%
5 48
 
6.6%
8 46
 
6.3%
4 40
 
5.5%
1 26
 
3.6%
9 26
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 146
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 876
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 146
16.7%
2 127
14.5%
7 127
14.5%
3 118
13.5%
0 111
12.7%
6 61
7.0%
5 48
 
5.5%
8 46
 
5.3%
4 40
 
4.6%
1 26
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 876
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 146
16.7%
2 127
14.5%
7 127
14.5%
3 118
13.5%
0 111
12.7%
6 61
7.0%
5 48
 
5.5%
8 46
 
5.3%
4 40
 
4.6%
1 26
 
3.0%
Distinct147
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum1975-07-01 00:00:00
Maximum2023-06-01 00:00:00
2023-12-12T16:13:12.297376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:13:12.457571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T16:13:09.026948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:13:12.570081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번민원구분업소전화번호
연번1.0000.3921.000
민원구분0.3921.0001.000
업소전화번호1.0001.0001.000
2023-12-12T16:13:12.688068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번민원구분
연번1.0000.267
민원구분0.2671.000

Missing values

2023-12-12T16:13:09.177338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:13:09.313562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번민원구분업소명업소도로명주소업소전화번호지정일자
01제7조의3제3항에따른경우(주)풀무원푸드앤컬처인천광역시 동구 인중로 489, HD현대인프라코어 (화수동)<NA>2023-06-01
12제7조의3제3항에따른경우송림장갑인천광역시 동구 방축로37번길 30, 인천산업용품유통단지 3동 137호 (송현동)032-589-53862023-05-31
23제7조의3제3항에따른경우(주)현대그린푸드인천광역시 동구 인중로 489, HD현대인프라코어 (화수동)<NA>2023-05-25
34제7조의3제2항에따른경우은성 종합유통인천광역시 동구 송현로 11-1 (송현동)<NA>2023-05-09
45제7조의3제2항에따른경우세븐일레븐 송림우정점인천광역시 동구 연송로 4, 1층 (송림동)<NA>2023-04-06
56제7조의3제2항에따른경우(주)코리아세븐 인천금곡점인천광역시 동구 금곡로 42, 1층 (금곡동)<NA>2023-04-03
67제7조의3제2항에따른경우동구슈퍼인천광역시 동구 샛골로162번길 5 (송림동)<NA>2023-03-14
78제7조의3제3항에따른경우에이스전기조명인천광역시 동구 방축로 105, 인천산업용품유통단지 21동 136호 (송림동)<NA>2023-03-13
89제7조의3제3항에따른경우마트앤커피인천광역시 동구 방축로83번길 23, 인천산업용품유통단지 B동 B20호 (송림동)<NA>2023-01-31
910제7조의3제2항에따른경우감동인천광역시 동구 화도진로 77 (화평동)<NA>2023-01-10
연번민원구분업소명업소도로명주소업소전화번호지정일자
147149<NA>독서마당인천광역시 동구 안송로 4 (송림동)032-762-37221998-11-21
148150<NA>영화식품인천광역시 동구 샛골로 197 (송현동)032-763-73131998-01-01
149151<NA>우정양복점인천광역시 동구 화도진로35번길 4-1 (송현동)032-773-13701998-12-09
150153<NA>대신상회인천광역시 동구 화수부두로 3-1 (화수동)<NA>1998-11-10
151154<NA>남영수출포장인천광역시 동구 방축로37번길 30, 32동 134호 (송현동,인천산업용품유통센타)032-589-06001997-12-11
152155<NA>금성상사인천광역시 동구 방축로37번길 30, 5동 131호 (송현동,인천산업용품유통센타)032-882-08981997-08-13
153156<NA>S.D마트인천광역시 동구 화수로 17, 삼두2차아파트 상가 (송현동)032-764-18501997-10-29
154157<NA>쌍우물슈퍼인천광역시 동구 쌍우물로 75 (화수동)032-772-47231984-02-08
155158<NA>북청미니슈퍼인천광역시 동구 샛골로 136 (송림동)032-766-20571978-03-15
156159<NA>인일빌딩인천광역시 동구 샛골로102번길 22 (송림동)032-763-24721975-07-01