Overview

Dataset statistics

Number of variables: 5
Number of observations: 320
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.9 KiB
Average record size in memory: 41.4 B

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

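Description: Tobacco retail business data for Hongseong-gun, Chungcheongnam-do. Columns: 연번 (serial number), 민원구분 (petition type), 업소명 (business name), 업소지번주소 (lot-number address), 업소도로명주소 (road-name address).
URL: https://www.data.go.kr/data/15113316/fileData.do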

Alerts

NO is highly overall correlated with 민원구분 (High correlation)
민원구분 is highly overall correlated with NO (High correlation)
NO has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 10:02:22.779161
Analysis finished: 2023-12-12 10:02:23.674886
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
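
The reported duration follows from the two timestamps. A minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of the report.
started = datetime.fromisoformat("2023-12-12 10:02:22.779161")
finished = datetime.fromisoformat("2023-12-12 10:02:23.674886")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

print(f"{duration:.6f} s")        # exact difference of the two timestamps
print(f"{duration:.1f} seconds")  # rounded the way the report displays it
```

Rounding the difference to one decimal place gives the 0.9 seconds shown in the report.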

Variables

NO
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 320
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 160.5
Minimum: 1
Maximum: 320
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 16.95
Q1: 80.75
Median: 160.5
Q3: 240.25
95th percentile: 304.05
Maximum: 320
Range: 319
Interquartile range (IQR): 159.5
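
These quantiles are consistent with linear interpolation over the integers 1 through 320. The sketch below assumes that NO holds exactly those values (this matches the reported minimum, maximum, distinct count and sum) and that quantiles are interpolated linearly between neighbouring order statistics. `quantile` is a hypothetical helper written for this illustration, not a ydata-profiling function.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring sorted values."""
    position = q * (len(sorted_values) - 1)   # fractional index into the data
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Assumption: NO is the serial number 1, 2, ..., 320.
values = list(range(1, 321))

p5 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
value_range = values[-1] - values[0]
iqr = q3 - q1

for label, value in [("5th percentile", p5), ("Q1", q1), ("Median", median),
                     ("Q3", q3), ("95th percentile", p95),
                     ("Range", value_range), ("IQR", iqr)]:
    print(f"{label}: {round(value, 2)}")
```

Under these assumptions the fractional index for the 5th percentile is 0.05 × 319 = 15.95, which lands between the values 16 and 17 and yields 16.95.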

Descriptive statistics

Standard deviation: 92.520268
Coefficient of variation (CV): 0.57645027
Kurtosis: -1.2
Mean: 160.5
Median Absolute Deviation (MAD): 80
Skewness: 0
Sum: 51360
Variance: 8560
Monotonicity: Strictly increasing
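
The descriptive statistics can be cross-checked with the standard library. This is a sketch under the assumption that NO is the sequence 1 to 320 and that the variance is the sample variance (divisor n - 1). The kurtosis here is computed from population moments, which may differ slightly from the estimator the profiler uses.

```python
import statistics

# Assumption: NO is the serial number 1, 2, ..., 320.
values = list(range(1, 321))
n = len(values)

total = sum(values)
mean = total / n
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Moment-based skewness and excess kurtosis (population moments).
m2 = sum((v - mean) ** 2 for v in values) / n
m3 = sum((v - mean) ** 3 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n
skewness = m3 / m2 ** 1.5
excess_kurtosis = m4 / m2 ** 2 - 3

# A strictly increasing column has every value larger than its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, variance, round(std, 6), round(cv, 8))
print(mad, round(skewness, 1), round(excess_kurtosis, 1), strictly_increasing)
```

For an evenly spaced sequence the sample variance reduces to n(n + 1)/12, which is 320 × 321 / 12 = 8560 and matches the report.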
Histogram with fixed size bins (bins=50)
Common Values
Value               Count  Frequency (%)
1                   1      0.3%
162                 1      0.3%
220                 1      0.3%
219                 1      0.3%
218                 1      0.3%
217                 1      0.3%
216                 1      0.3%
215                 1      0.3%
214                 1      0.3%
213                 1      0.3%
Other values (310)  310    96.9%

Minimum 10 values
Value               Count  Frequency (%)
1                   1      0.3%
2                   1      0.3%
3                   1      0.3%
4                   1      0.3%
5                   1      0.3%
6                   1      0.3%
7                   1      0.3%
8                   1      0.3%
9                   1      0.3%
10                  1      0.3%

Maximum 10 values
Value               Count  Frequency (%)
320                 1      0.3%
319                 1      0.3%
318                 1      0.3%
317                 1      0.3%
316                 1      0.3%
315                 1      0.3%
314                 1      0.3%
313                 1      0.3%
312                 1      0.3%
311                 1      0.3%

민원구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 13
Median length: 13
Mean length: 8.725
Min length: 1
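
The length statistics can be reproduced from the category counts reported for this variable (193, 114 and 13 rows). In the sketch below, `placeholder` is a hypothetical stand-in for the one-character value whose text is not visible in this report. Only its length matters here.

```python
import statistics

label_2 = "제7조의3제2항에따른경우"   # 193 rows
label_3 = "제7조의3제3항에따른경우"   # 13 rows
placeholder = "-"                  # hypothetical one-character value, 114 rows

values = [label_2] * 193 + [placeholder] * 114 + [label_3] * 13
lengths = [len(v) for v in values]

max_length = max(lengths)
min_length = min(lengths)
mean_length = statistics.mean(lengths)
median_length = statistics.median(lengths)

print(max_length, median_length, mean_length, min_length)
```

The mean works out as (206 × 13 + 114 × 1) / 320 = 8.725, and because more than half of the rows carry a 13-character label, the median length is 13.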

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제7조의3제3항에따른경우
2nd row: 제7조의3제2항에따른경우
3rd row: 제7조의3제2항에따른경우
4th row: 제7조의3제2항에따른경우
5th row: 제7조의3제2항에따른경우

Common Values

Value                   Count  Frequency (%)
제7조의3제2항에따른경우  193    60.3%
(not shown)             114    35.6%
제7조의3제3항에따른경우  13     4.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value                   Count  Frequency (%)
제7조의3제2항에따른경우  193    93.7%
제7조의3제3항에따른경우  13     6.3%
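
The two frequency tables for 민원구분 use different denominators. A short sketch of the arithmetic, assuming the first table divides by all 320 rows and the second only by the 206 rows that carry one of the two labels (the key `"(not shown)"` is a placeholder for the value whose text is not visible in this report):

```python
counts = {
    "제7조의3제2항에따른경우": 193,
    "(not shown)": 114,  # placeholder key for the value without visible text
    "제7조의3제3항에따른경우": 13,
}

# Shares over all rows (first table).
total_rows = sum(counts.values())
share_all = {k: f"{100 * v / total_rows:.1f}%" for k, v in counts.items()}

# Shares over labelled rows only (second table).
labelled = {k: v for k, v in counts.items() if k != "(not shown)"}
total_labelled = sum(labelled.values())
share_labelled = {k: f"{100 * v / total_labelled:.1f}%" for k, v in labelled.items()}

print(total_rows, share_all)
print(total_labelled, share_labelled)
```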

업소명
Text

Distinct: 312
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 26
Median length: 19
Mean length: 7.759375
Min length: 1

Characters and Unicode

Total characters: 2483
Distinct characters: 290
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
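
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The sample string is one of the business names from this dataset. The `CATEGORY_NAMES` mapping from two-letter codes to the long names used in this report is written for this example and covers only the categories that appear here.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(text):
    """Count characters per Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

sample = "GS25 홍성행복점"
counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates the text variables in this report.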

Unique

Unique: 307
Unique (%): 95.9%

Sample

1st row: 생활용품DC마트
2nd row: 지에스25 홍성우체국점
3rd row: 세븐일레븐 홍성옥암점
4th row: 산단편의점
5th row: 주식회사 제이에스리
Value               Count  Frequency (%)
씨유                40     8.7%
세븐일레븐          20     4.4%
이마트24            16     3.5%
gs25                10     2.2%
지에스25            8      1.7%
매점                4      0.9%
하나로마트          4      0.9%
홍성점              3      0.7%
(not shown)         3      0.7%
주식회사            3      0.7%
Other values (327)  348    75.8%
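
The rows of this table look like word counts rather than whole business names: brand tokens such as 씨유 appear on their own and "gs25" is lower-cased. The sketch below assumes the words come from splitting each name on whitespace and lower-casing it. That tokenisation is inferred from the table, not confirmed by the report. The names are sample rows from this dataset.

```python
from collections import Counter

names = [
    "생활용품DC마트",
    "지에스25 홍성우체국점",
    "세븐일레븐 홍성옥암점",
    "산단편의점",
    "주식회사 제이에스리",
    "GS25 홍성행복점",
]

# Assumed tokenisation: whitespace split, then lower-case.
words = Counter(word.lower() for name in names for word in name.split())
print(words.most_common(3))

# Shares in the table are relative to the total number of words, not rows.
table_counts = [40, 20, 16, 10, 8, 4, 4, 3, 3, 3, 348]
total_words = sum(table_counts)
share_top = f"{100 * table_counts[0] / total_words:.1f}%"
print(total_words, share_top)
```

The counts in the table sum to 459 words across 320 rows, so the 8.7% for 씨유 is 40 / 459.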

Most occurring characters

Value               Count  Frequency (%)
(space)             147    5.9%
(not shown)         146    5.9%
(not shown)         117    4.7%
(not shown)         110    4.4%
(not shown)         74     3.0%
(not shown)         65     2.6%
(not shown)         55     2.2%
2                   48     1.9%
(not shown)         47     1.9%
(not shown)         41     1.7%
Other values (280)  1633   65.8%

Most occurring categories

Value               Count  Frequency (%)
Other Letter        2141   86.2%
Space Separator     147    5.9%
Decimal Number      102    4.1%
Uppercase Letter    53     2.1%
Close Punctuation   15     0.6%
Open Punctuation    15     0.6%
Other Punctuation   9      0.4%
Dash Punctuation    1      < 0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(not shown)         146    6.8%
(not shown)         117    5.5%
(not shown)         110    5.1%
(not shown)         74     3.5%
(not shown)         65     3.0%
(not shown)         55     2.6%
(not shown)         47     2.2%
(not shown)         41     1.9%
(not shown)         38     1.8%
(not shown)         34     1.6%
Other values (259)  1414   66.0%

Uppercase Letter
Value               Count  Frequency (%)
S                   17     32.1%
G                   17     32.1%
C                   8      15.1%
U                   4      7.5%
D                   4      7.5%
B                   1      1.9%
T                   1      1.9%
I                   1      1.9%

Decimal Number
Value               Count  Frequency (%)
2                   48     47.1%
5                   28     27.5%
4                   19     18.6%
8                   2      2.0%
9                   2      2.0%
3                   2      2.0%
1                   1      1.0%

Other Punctuation
Value               Count  Frequency (%)
.                   7      77.8%
/                   2      22.2%

Space Separator
Value               Count  Frequency (%)
(space)             147    100.0%

Close Punctuation
Value               Count  Frequency (%)
)                   15     100.0%

Open Punctuation
Value               Count  Frequency (%)
(                   15     100.0%

Dash Punctuation
Value               Count  Frequency (%)
-                   1      100.0%

Most occurring scripts

Value               Count  Frequency (%)
Hangul              2141   86.2%
Common              289    11.6%
Latin               53     2.1%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(not shown)         146    6.8%
(not shown)         117    5.5%
(not shown)         110    5.1%
(not shown)         74     3.5%
(not shown)         65     3.0%
(not shown)         55     2.6%
(not shown)         47     2.2%
(not shown)         41     1.9%
(not shown)         38     1.8%
(not shown)         34     1.6%
Other values (259)  1414   66.0%

Common
Value               Count  Frequency (%)
(space)             147    50.9%
2                   48     16.6%
5                   28     9.7%
4                   19     6.6%
)                   15     5.2%
(                   15     5.2%
.                   7      2.4%
8                   2      0.7%
9                   2      0.7%
3                   2      0.7%
Other values (3)    4      1.4%

Latin
Value               Count  Frequency (%)
S                   17     32.1%
G                   17     32.1%
C                   8      15.1%
U                   4      7.5%
D                   4      7.5%
B                   1      1.9%
T                   1      1.9%
I                   1      1.9%

Most occurring blocks

Value               Count  Frequency (%)
Hangul              2141   86.2%
ASCII               342    13.8%

Most frequent character per block

ASCII
Value               Count  Frequency (%)
(space)             147    43.0%
2                   48     14.0%
5                   28     8.2%
4                   19     5.6%
S                   17     5.0%
G                   17     5.0%
)                   15     4.4%
(                   15     4.4%
C                   8      2.3%
.                   7      2.0%
Other values (11)   21     6.1%

Hangul
Value               Count  Frequency (%)
(not shown)         146    6.8%
(not shown)         117    5.5%
(not shown)         110    5.1%
(not shown)         74     3.5%
(not shown)         65     3.0%
(not shown)         55     2.6%
(not shown)         47     2.2%
(not shown)         41     1.9%
(not shown)         38     1.8%
(not shown)         34     1.6%
Other values (259)  1414   66.0%

업소지번주소
Text

Distinct: 273
Distinct (%): 85.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 49
Median length: 42
Mean length: 22.6125
Min length: 1

Characters and Unicode

Total characters: 7236
Distinct characters: 180
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 266
Unique (%): 83.1%

Sample

1st row: 충청남도 홍성군 홍성읍 오관리 578-1 생활용품D.C마트
2nd row: 충청남도 홍성군 홍성읍 옥암리 18-6
3rd row: 충청남도 홍성군 홍성읍 옥암리 959
4th row: 충청남도 홍성군 갈산면 취생리 671
5th row: 충청남도 홍성군 홍북읍 봉신리 292-11
Value               Count  Frequency (%)
충청남도            278    16.8%
홍성군              278    16.8%
홍성읍              118    7.1%
오관리              41     2.5%
홍북읍              36     2.2%
(not shown)         33     2.0%
1호                 32     1.9%
광천읍              30     1.8%
신경리              21     1.3%
남장리              20     1.2%
Other values (422)  768    46.4%

Most occurring characters

Value               Count  Frequency (%)
(space)             1600   22.1%
(not shown)         447    6.2%
(not shown)         412    5.7%
(not shown)         307    4.2%
(not shown)         283    3.9%
(not shown)         282    3.9%
(not shown)         278    3.8%
(not shown)         278    3.8%
(not shown)         275    3.8%
1                   229    3.2%
Other values (170)  2845   39.3%

Most occurring categories

Value               Count  Frequency (%)
Other Letter        4414   61.0%
Space Separator     1600   22.1%
Decimal Number      1144   15.8%
Dash Punctuation    47     0.6%
Lowercase Letter    13     0.2%
Other Punctuation   10     0.1%
Uppercase Letter    6      0.1%
Close Punctuation   1      < 0.1%
Open Punctuation    1      < 0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(not shown)         447    10.1%
(not shown)         412    9.3%
(not shown)         307    7.0%
(not shown)         283    6.4%
(not shown)         282    6.4%
(not shown)         278    6.3%
(not shown)         278    6.3%
(not shown)         275    6.2%
(not shown)         194    4.4%
(not shown)         189    4.3%
Other values (142)  1469   33.3%

Decimal Number
Value               Count  Frequency (%)
1                   229    20.0%
2                   143    12.5%
3                   133    11.6%
4                   122    10.7%
5                   120    10.5%
6                   96     8.4%
0                   91     8.0%
8                   82     7.2%
9                   70     6.1%
7                   58     5.1%

Lowercase Letter
Value               Count  Frequency (%)
e                   3      23.1%
y                   2      15.4%
a                   2      15.4%
r                   2      15.4%
t                   1      7.7%
v                   1      7.7%
d                   1      7.7%
m                   1      7.7%

Uppercase Letter
Value               Count  Frequency (%)
I                   2      33.3%
C                   1      16.7%
D                   1      16.7%
H                   1      16.7%
L                   1      16.7%

Space Separator
Value               Count  Frequency (%)
(space)             1600   100.0%

Dash Punctuation
Value               Count  Frequency (%)
-                   47     100.0%

Other Punctuation
Value               Count  Frequency (%)
.                   10     100.0%

Close Punctuation
Value               Count  Frequency (%)
)                   1      100.0%

Open Punctuation
Value               Count  Frequency (%)
(                   1      100.0%

Most occurring scripts

Value               Count  Frequency (%)
Hangul              4414   61.0%
Common              2803   38.7%
Latin               19     0.3%
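
The script shares and the mean length reported for this variable follow from the character counts. A small arithmetic sketch, where the only inputs are the counts from the table and the 320 observations of the dataset:

```python
script_counts = {"Hangul": 4414, "Common": 2803, "Latin": 19}
n_rows = 320

# The script counts add up to the total number of characters.
total_chars = sum(script_counts.values())

# Share of each script among all characters, formatted like the report.
shares = {name: f"{100 * count / total_chars:.1f}%"
          for name, count in script_counts.items()}

# Mean length is total characters divided by the number of rows.
mean_length = total_chars / n_rows

print(total_chars, shares, mean_length)
```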

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(not shown)         447    10.1%
(not shown)         412    9.3%
(not shown)         307    7.0%
(not shown)         283    6.4%
(not shown)         282    6.4%
(not shown)         278    6.3%
(not shown)         278    6.3%
(not shown)         275    6.2%
(not shown)         194    4.4%
(not shown)         189    4.3%
Other values (142)  1469   33.3%

Common
Value               Count  Frequency (%)
(space)             1600   57.1%
1                   229    8.2%
2                   143    5.1%
3                   133    4.7%
4                   122    4.4%
5                   120    4.3%
6                   96     3.4%
0                   91     3.2%
8                   82     2.9%
9                   70     2.5%
Other values (5)    117    4.2%

Latin
Value               Count  Frequency (%)
e                   3      15.8%
y                   2      10.5%
a                   2      10.5%
r                   2      10.5%
I                   2      10.5%
t                   1      5.3%
v                   1      5.3%
d                   1      5.3%
m                   1      5.3%
C                   1      5.3%
Other values (3)    3      15.8%

Most occurring blocks

Value               Count  Frequency (%)
Hangul              4414   61.0%
ASCII               2822   39.0%

Most frequent character per block

ASCII
Value               Count  Frequency (%)
(space)             1600   56.7%
1                   229    8.1%
2                   143    5.1%
3                   133    4.7%
4                   122    4.3%
5                   120    4.3%
6                   96     3.4%
0                   91     3.2%
8                   82     2.9%
9                   70     2.5%
Other values (18)   136    4.8%

Hangul
Value               Count  Frequency (%)
(not shown)         447    10.1%
(not shown)         412    9.3%
(not shown)         307    7.0%
(not shown)         283    6.4%
(not shown)         282    6.4%
(not shown)         278    6.3%
(not shown)         278    6.3%
(not shown)         275    6.2%
(not shown)         194    4.4%
(not shown)         189    4.3%
Other values (142)  1469   33.3%

업소도로명주소
Text

Distinct: 252
Distinct (%): 78.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 55
Median length: 49
Mean length: 20.50625
Min length: 1

Characters and Unicode

Total characters: 6562
Distinct characters: 197
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 247
Unique (%): 77.2%

Sample

1st row: 충청남도 홍성군 홍성읍 내포로 110. 생활용품D.C마트
2nd row: 충청남도 홍성군 홍성읍 내포로 112
3rd row: 충청남도 홍성군 홍성읍 내포로146번길 30-4
4th row: 충청남도 홍성군 갈산면 산단로388번길 81
5th row: 충청남도 홍성군 홍북읍 도청대로 221
Value               Count  Frequency (%)
충청남도            255    17.7%
홍성군              255    17.7%
홍성읍              113    7.9%
1층                 27     1.9%
광천읍              27     1.9%
홍북읍              22     1.5%
홍북면              18     1.3%
서부면              18     1.3%
충서로              17     1.2%
갈산면              14     1.0%
Other values (390)  673    46.8%

Most occurring characters

Value               Count  Frequency (%)
(space)             1253   19.1%
(not shown)         456    6.9%
(not shown)         389    5.9%
(not shown)         296    4.5%
(not shown)         292    4.4%
(not shown)         274    4.2%
1                   267    4.1%
(not shown)         264    4.0%
(not shown)         255    3.9%
(not shown)         214    3.3%
Other values (187)  2602   39.7%

Most occurring categories

Value               Count  Frequency (%)
Other Letter        4005   61.0%
Space Separator     1253   19.1%
Decimal Number      1088   16.6%
Other Punctuation   91     1.4%
Dash Punctuation    36     0.5%
Close Punctuation   32     0.5%
Open Punctuation    32     0.5%
Lowercase Letter    13     0.2%
Uppercase Letter    12     0.2%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(not shown)         456    11.4%
(not shown)         389    9.7%
(not shown)         296    7.4%
(not shown)         292    7.3%
(not shown)         274    6.8%
(not shown)         264    6.6%
(not shown)         255    6.4%
(not shown)         214    5.3%
(not shown)         162    4.0%
(not shown)         120    3.0%
Other values (157)  1283   32.0%

Decimal Number
Value               Count  Frequency (%)
1                   267    24.5%
2                   155    14.2%
0                   110    10.1%
3                   110    10.1%
6                   87     8.0%
4                   82     7.5%
5                   78     7.2%
9                   70     6.4%
7                   66     6.1%
8                   63     5.8%

Lowercase Letter
Value               Count  Frequency (%)
e                   3      23.1%
a                   2      15.4%
r                   2      15.4%
y                   2      15.4%
d                   1      7.7%
v                   1      7.7%
t                   1      7.7%
m                   1      7.7%

Uppercase Letter
Value               Count  Frequency (%)
A                   3      25.0%
H                   2      16.7%
I                   2      16.7%
L                   2      16.7%
D                   1      8.3%
C                   1      8.3%
S                   1      8.3%

Space Separator
Value               Count  Frequency (%)
(space)             1253   100.0%

Other Punctuation
Value               Count  Frequency (%)
.                   91     100.0%

Dash Punctuation
Value               Count  Frequency (%)
-                   36     100.0%

Close Punctuation
Value               Count  Frequency (%)
)                   32     100.0%

Open Punctuation
Value               Count  Frequency (%)
(                   32     100.0%

Most occurring scripts

Value               Count  Frequency (%)
Hangul              4005   61.0%
Common              2532   38.6%
Latin               25     0.4%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(not shown)         456    11.4%
(not shown)         389    9.7%
(not shown)         296    7.4%
(not shown)         292    7.3%
(not shown)         274    6.8%
(not shown)         264    6.6%
(not shown)         255    6.4%
(not shown)         214    5.3%
(not shown)         162    4.0%
(not shown)         120    3.0%
Other values (157)  1283   32.0%

Common
Value               Count  Frequency (%)
(space)             1253   49.5%
1                   267    10.5%
2                   155    6.1%
0                   110    4.3%
3                   110    4.3%
.                   91     3.6%
6                   87     3.4%
4                   82     3.2%
5                   78     3.1%
9                   70     2.8%
Other values (5)    229    9.0%

Latin
Value               Count  Frequency (%)
e                   3      12.0%
A                   3      12.0%
H                   2      8.0%
a                   2      8.0%
r                   2      8.0%
y                   2      8.0%
I                   2      8.0%
L                   2      8.0%
d                   1      4.0%
v                   1      4.0%
Other values (5)    5      20.0%

Most occurring blocks

Value               Count  Frequency (%)
Hangul              4005   61.0%
ASCII               2557   39.0%

Most frequent character per block

ASCII
Value               Count  Frequency (%)
(space)             1253   49.0%
1                   267    10.4%
2                   155    6.1%
0                   110    4.3%
3                   110    4.3%
.                   91     3.6%
6                   87     3.4%
4                   82     3.2%
5                   78     3.1%
9                   70     2.7%
Other values (20)   254    9.9%

Hangul
Value               Count  Frequency (%)
(not shown)         456    11.4%
(not shown)         389    9.7%
(not shown)         296    7.4%
(not shown)         292    7.3%
(not shown)         274    6.8%
(not shown)         264    6.6%
(not shown)         255    6.4%
(not shown)         214    5.3%
(not shown)         162    4.0%
(not shown)         120    3.0%
Other values (157)  1283   32.0%

Interactions


Correlations

          NO     민원구분
NO        1.000  0.787
민원구분  0.787  1.000

          NO     민원구분
NO        1.000  0.665
민원구분  0.665  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | NO | 민원구분 | 업소명 | 업소지번주소 | 업소도로명주소
0 | 1 | 제7조의3제3항에따른경우 | 생활용품DC마트 | 충청남도 홍성군 홍성읍 오관리 578-1 생활용품D.C마트 | 충청남도 홍성군 홍성읍 내포로 110. 생활용품D.C마트
1 | 2 | 제7조의3제2항에따른경우 | 지에스25 홍성우체국점 | 충청남도 홍성군 홍성읍 옥암리 18-6 | 충청남도 홍성군 홍성읍 내포로 112
2 | 3 | 제7조의3제2항에따른경우 | 세븐일레븐 홍성옥암점 | 충청남도 홍성군 홍성읍 옥암리 959 | 충청남도 홍성군 홍성읍 내포로146번길 30-4
3 | 4 | 제7조의3제2항에따른경우 | 산단편의점 | 충청남도 홍성군 갈산면 취생리 671 | 충청남도 홍성군 갈산면 산단로388번길 81
4 | 5 | 제7조의3제2항에따른경우 | 주식회사 제이에스리 | 충청남도 홍성군 홍북읍 봉신리 292-11 | 충청남도 홍성군 홍북읍 도청대로 221
5 | 6 | 제7조의3제2항에따른경우 | 왕마트주식회사 | 충청남도 홍성군 홍성읍 소향리 75 | 충청남도 홍성군 홍성읍 충서로 1510
6 | 7 | 제7조의3제2항에따른경우 | 마스코트 | 충청남도 홍성군 홍성읍 오관리 321-4 | 충청남도 홍성군 홍성읍 조양로 180
7 | 8 | 제7조의3제2항에따른경우 | GS25 홍성행복점 | 충청남도 홍성군 홍북읍 신경리 1372 | 충청남도 홍성군 홍북읍 홍학로 124. 상가동 101. 102호
8 | 9 | 제7조의3제2항에따른경우 | 씨유홍성의료원점 | 충청남도 홍성군 홍성읍 고암리 580-1 | 충청남도 홍성군 홍성읍 조양로 236
9 | 10 | 제7조의3제2항에따른경우 | 이마트24 내포한울점 | 충청남도 홍성군 홍북읍 신경리 1372 상가동 1호. 2호 | 충청남도 홍성군 홍북읍 홍예로 163. 상가동 1호. 2호

Last rows

Index | NO | 민원구분 | 업소명 | 업소지번주소 | 업소도로명주소
310 | 311 |  | 운곡회관 | 충청남도 홍성군 갈산면 운곡리 367호 |
311 | 312 |  | 갈산소리사 | 충청남도 홍성군 갈산면 상촌리 242번지 8 호 |
312 | 313 |  | 교황슈퍼 | 충청남도 홍성군 결성면 교항리 177호 |
313 | 314 |  | 용호초등 | 충청남도 홍성군 결성면 용호리 321번지 |
314 | 315 |  | 갈산주유소 | 충청남도 홍성군 갈산면 상촌리 163번지 2 호 |
315 | 316 |  | 구항연쇄점 | 충청남도 홍성군 구항면 오봉리 672번지 | 충청남도 홍성군 구항면 구항길 65
316 | 317 |  | 박영화 | 충청남도 홍성군 서부면 이호리 259번지 1호 | 충청남도 홍성군 서부면 이호길 45
317 | 318 |  | 부평슈퍼 | 충청남도 홍성군 금마면 부평리 433번지 1호 | 충청남도 홍성군 금마면 광금북로 504
318 | 319 |  | . | 충청남도 홍성군 홍성읍 오관리 397번지 1호 3통 | 충청남도 홍성군 홍성읍 아문길 60
319 | 320 |  | 대흥청과 | 충청남도 홍성군 홍성읍 오관리 126번지 9호 | 충청남도 홍성군 홍성읍 조양로 103