Overview

Dataset statistics

Number of variables5
Number of observations182
Missing cells10
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory41.7 B

Variable types

Text3
Numeric1
Categorical1

Dataset

Description충청남도 보령시 문화유통업(노래연습장, 인터넷컴퓨터게임시설제공업, 청소년게임제공업, 일반게임제공업, 게임제작업) 상호, 우편번호, 소재지, 업종 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=423&beforeMenuCd=DOM_000000201001001000&publicdatapk=15037841

Alerts

우편번호 has 8 (4.4%) missing valuesMissing
영업소도로명소재지 has 2 (1.1%) missing valuesMissing

Reproduction

Analysis started2024-01-09 21:48:43.934568
Analysis finished2024-01-09 21:48:44.470841
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct176
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-01-10T06:48:44.626392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length6.7032967
Min length3

Characters and Unicode

Total characters1220
Distinct characters242
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)94.5%

Sample

1st row신나는노래연습장
2nd row내마음의노래방
3rd row은하수노래방
4th row새천년노래방
5th row바다 필 노래연습장
ValueCountFrequency (%)
노래연습장 10
 
4.8%
토이랜드 4
 
1.9%
pc방 3
 
1.4%
시유pc방 2
 
1.0%
뽑기세상 2
 
1.0%
라이브노래연습장 2
 
1.0%
코인노래연습장 2
 
1.0%
노다지pc 1
 
0.5%
조은피씨방 1
 
0.5%
라이또pc방 1
 
0.5%
Other values (182) 182
86.7%
2024-01-10T06:48:44.941878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
9.1%
109
 
8.9%
90
 
7.4%
84
 
6.9%
83
 
6.8%
C 40
 
3.3%
P 39
 
3.2%
39
 
3.2%
29
 
2.4%
28
 
2.3%
Other values (232) 568
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1076
88.2%
Uppercase Letter 109
 
8.9%
Space Separator 28
 
2.3%
Lowercase Letter 5
 
0.4%
Decimal Number 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
10.3%
109
 
10.1%
90
 
8.4%
84
 
7.8%
83
 
7.7%
39
 
3.6%
29
 
2.7%
23
 
2.1%
21
 
2.0%
21
 
2.0%
Other values (208) 466
43.3%
Uppercase Letter
ValueCountFrequency (%)
C 40
36.7%
P 39
35.8%
O 4
 
3.7%
Y 3
 
2.8%
N 3
 
2.8%
M 3
 
2.8%
D 2
 
1.8%
U 2
 
1.8%
L 2
 
1.8%
G 2
 
1.8%
Other values (8) 9
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
i 2
40.0%
g 1
20.0%
n 1
20.0%
x 1
20.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Decimal Number
ValueCountFrequency (%)
8 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1076
88.2%
Latin 114
 
9.3%
Common 30
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
10.3%
109
 
10.1%
90
 
8.4%
84
 
7.8%
83
 
7.7%
39
 
3.6%
29
 
2.7%
23
 
2.1%
21
 
2.0%
21
 
2.0%
Other values (208) 466
43.3%
Latin
ValueCountFrequency (%)
C 40
35.1%
P 39
34.2%
O 4
 
3.5%
Y 3
 
2.6%
N 3
 
2.6%
M 3
 
2.6%
D 2
 
1.8%
U 2
 
1.8%
L 2
 
1.8%
i 2
 
1.8%
Other values (12) 14
 
12.3%
Common
ValueCountFrequency (%)
28
93.3%
8 2
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1076
88.2%
ASCII 144
 
11.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
111
 
10.3%
109
 
10.1%
90
 
8.4%
84
 
7.8%
83
 
7.7%
39
 
3.6%
29
 
2.7%
23
 
2.1%
21
 
2.0%
21
 
2.0%
Other values (208) 466
43.3%
ASCII
ValueCountFrequency (%)
C 40
27.8%
P 39
27.1%
28
19.4%
O 4
 
2.8%
Y 3
 
2.1%
N 3
 
2.1%
M 3
 
2.1%
D 2
 
1.4%
U 2
 
1.4%
L 2
 
1.4%
Other values (14) 18
12.5%

우편번호
Real number (ℝ)

MISSING 

Distinct40
Distinct (%)23.0%
Missing8
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean33468.092
Minimum33411
Maximum33521
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-01-10T06:48:45.050410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33411
5-th percentile33433.65
Q133458
median33469
Q333487
95-th percentile33496.95
Maximum33521
Range110
Interquartile range (IQR)29

Descriptive statistics

Standard deviation21.59112
Coefficient of variation (CV)0.00064512552
Kurtosis0.26357514
Mean33468.092
Median Absolute Deviation (MAD)14
Skewness-0.067444107
Sum5823448
Variance466.17647
MonotonicityNot monotonic
2024-01-10T06:48:45.150108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
33470 16
 
8.8%
33466 16
 
8.8%
33469 16
 
8.8%
33489 15
 
8.2%
33443 10
 
5.5%
33487 9
 
4.9%
33488 9
 
4.9%
33471 6
 
3.3%
33434 6
 
3.3%
33435 5
 
2.7%
Other values (30) 66
36.3%
(Missing) 8
 
4.4%
ValueCountFrequency (%)
33411 2
 
1.1%
33415 1
 
0.5%
33430 1
 
0.5%
33432 1
 
0.5%
33433 4
 
2.2%
33434 6
3.3%
33435 5
2.7%
33436 2
 
1.1%
33438 1
 
0.5%
33443 10
5.5%
ValueCountFrequency (%)
33521 2
 
1.1%
33520 4
 
2.2%
33509 2
 
1.1%
33508 1
 
0.5%
33491 1
 
0.5%
33490 4
 
2.2%
33489 15
8.2%
33488 9
4.9%
33487 9
4.9%
33482 2
 
1.1%
Distinct174
Distinct (%)96.7%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2024-01-10T06:48:45.432107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length36.5
Mean length26.116667
Min length19

Characters and Unicode

Total characters4701
Distinct characters180
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique168 ?
Unique (%)93.3%

Sample

1st row충청남도 보령시 작은오랏5길 37 (동대동)
2nd row충청남도 보령시 웅천읍 장터중앙길 252
3rd row충청남도 보령시 대천항2길 67 (신흑동)
4th row충청남도 보령시 대해로 897-5 (신흑동)
5th row충청남도 보령시 해수욕장4길 46 (신흑동)
ValueCountFrequency (%)
충청남도 180
18.1%
보령시 180
18.1%
동대동 67
 
6.7%
신흑동 40
 
4.0%
대천동 36
 
3.6%
1층 29
 
2.9%
죽정동 10
 
1.0%
2층 10
 
1.0%
69 9
 
0.9%
작은오랏2길 9
 
0.9%
Other values (224) 427
42.8%
2024-01-10T06:48:46.042558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
820
 
17.4%
246
 
5.2%
189
 
4.0%
186
 
4.0%
186
 
4.0%
184
 
3.9%
184
 
3.9%
183
 
3.9%
180
 
3.8%
) 173
 
3.7%
Other values (170) 2170
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2819
60.0%
Space Separator 820
 
17.4%
Decimal Number 594
 
12.6%
Close Punctuation 173
 
3.7%
Open Punctuation 173
 
3.7%
Other Punctuation 84
 
1.8%
Dash Punctuation 32
 
0.7%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
246
 
8.7%
189
 
6.7%
186
 
6.6%
186
 
6.6%
184
 
6.5%
184
 
6.5%
183
 
6.5%
180
 
6.4%
134
 
4.8%
116
 
4.1%
Other values (150) 1031
36.6%
Decimal Number
ValueCountFrequency (%)
1 149
25.1%
2 77
13.0%
6 61
10.3%
3 58
 
9.8%
5 51
 
8.6%
4 49
 
8.2%
7 47
 
7.9%
8 40
 
6.7%
0 35
 
5.9%
9 27
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
A 2
33.3%
G 1
16.7%
L 1
16.7%
F 1
16.7%
M 1
16.7%
Space Separator
ValueCountFrequency (%)
820
100.0%
Close Punctuation
ValueCountFrequency (%)
) 173
100.0%
Open Punctuation
ValueCountFrequency (%)
( 173
100.0%
Other Punctuation
ValueCountFrequency (%)
, 84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2819
60.0%
Common 1876
39.9%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
246
 
8.7%
189
 
6.7%
186
 
6.6%
186
 
6.6%
184
 
6.5%
184
 
6.5%
183
 
6.5%
180
 
6.4%
134
 
4.8%
116
 
4.1%
Other values (150) 1031
36.6%
Common
ValueCountFrequency (%)
820
43.7%
) 173
 
9.2%
( 173
 
9.2%
1 149
 
7.9%
, 84
 
4.5%
2 77
 
4.1%
6 61
 
3.3%
3 58
 
3.1%
5 51
 
2.7%
4 49
 
2.6%
Other values (5) 181
 
9.6%
Latin
ValueCountFrequency (%)
A 2
33.3%
G 1
16.7%
L 1
16.7%
F 1
16.7%
M 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2819
60.0%
ASCII 1882
40.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
820
43.6%
) 173
 
9.2%
( 173
 
9.2%
1 149
 
7.9%
, 84
 
4.5%
2 77
 
4.1%
6 61
 
3.2%
3 58
 
3.1%
5 51
 
2.7%
4 49
 
2.6%
Other values (10) 187
 
9.9%
Hangul
ValueCountFrequency (%)
246
 
8.7%
189
 
6.7%
186
 
6.6%
186
 
6.6%
184
 
6.5%
184
 
6.5%
183
 
6.5%
180
 
6.4%
134
 
4.8%
116
 
4.1%
Other values (150) 1031
36.6%
Distinct166
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2024-01-10T06:48:46.344373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length33
Mean length19.901099
Min length17

Characters and Unicode

Total characters3622
Distinct characters81
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)85.2%

Sample

1st row충청남도 보령시 동대동 1781
2nd row충청남도 보령시 웅천읍 대창리 682
3rd row충청남도 보령시 신흑동 950-19
4th row충청남도 보령시 신흑동 1996
5th row충청남도 보령시 신흑동 1928
ValueCountFrequency (%)
충청남도 182
23.7%
보령시 182
23.7%
동대동 67
 
8.7%
신흑동 40
 
5.2%
대천동 37
 
4.8%
죽정동 10
 
1.3%
웅천읍 9
 
1.2%
명천동 8
 
1.0%
궁촌동 7
 
0.9%
시티타워 6
 
0.8%
Other values (187) 220
28.6%
2024-01-10T06:48:46.765519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
766
21.1%
236
 
6.5%
188
 
5.2%
187
 
5.2%
184
 
5.1%
184
 
5.1%
182
 
5.0%
182
 
5.0%
182
 
5.0%
1 177
 
4.9%
Other values (71) 1154
31.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1943
53.6%
Decimal Number 809
22.3%
Space Separator 766
 
21.1%
Dash Punctuation 101
 
2.8%
Close Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
236
12.1%
188
9.7%
187
9.6%
184
9.5%
184
9.5%
182
9.4%
182
9.4%
182
9.4%
112
5.8%
58
 
3.0%
Other values (56) 248
12.8%
Decimal Number
ValueCountFrequency (%)
1 177
21.9%
2 114
14.1%
4 78
9.6%
8 77
9.5%
6 75
9.3%
3 72
8.9%
9 61
 
7.5%
7 60
 
7.4%
0 54
 
6.7%
5 41
 
5.1%
Space Separator
ValueCountFrequency (%)
766
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 101
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
H 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1943
53.6%
Common 1678
46.3%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
236
12.1%
188
9.7%
187
9.6%
184
9.5%
184
9.5%
182
9.4%
182
9.4%
182
9.4%
112
5.8%
58
 
3.0%
Other values (56) 248
12.8%
Common
ValueCountFrequency (%)
766
45.6%
1 177
 
10.5%
2 114
 
6.8%
- 101
 
6.0%
4 78
 
4.6%
8 77
 
4.6%
6 75
 
4.5%
3 72
 
4.3%
9 61
 
3.6%
7 60
 
3.6%
Other values (4) 97
 
5.8%
Latin
ValueCountFrequency (%)
H 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1943
53.6%
ASCII 1679
46.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
766
45.6%
1 177
 
10.5%
2 114
 
6.8%
- 101
 
6.0%
4 78
 
4.6%
8 77
 
4.6%
6 75
 
4.5%
3 72
 
4.3%
9 61
 
3.6%
7 60
 
3.6%
Other values (5) 98
 
5.8%
Hangul
ValueCountFrequency (%)
236
12.1%
188
9.7%
187
9.6%
184
9.5%
184
9.5%
182
9.4%
182
9.4%
182
9.4%
112
5.8%
58
 
3.0%
Other values (56) 248
12.8%

업종
Categorical

Distinct6
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
노래연습장업
105 
인터넷컴퓨터게임시설제공업
40 
청소년게임제공업
24 
일반게임제공업
 
9
복합유통게임제공업
 
2

Length

Max length13
Median length6
Mean length7.8846154
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노래연습장업
2nd row노래연습장업
3rd row노래연습장업
4th row노래연습장업
5th row노래연습장업

Common Values

ValueCountFrequency (%)
노래연습장업 105
57.7%
인터넷컴퓨터게임시설제공업 40
 
22.0%
청소년게임제공업 24
 
13.2%
일반게임제공업 9
 
4.9%
복합유통게임제공업 2
 
1.1%
게임물제작업 2
 
1.1%

Length

2024-01-10T06:48:46.876653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:48:46.964398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노래연습장업 105
57.7%
인터넷컴퓨터게임시설제공업 40
 
22.0%
청소년게임제공업 24
 
13.2%
일반게임제공업 9
 
4.9%
복합유통게임제공업 2
 
1.1%
게임물제작업 2
 
1.1%

Interactions

2024-01-10T06:48:44.203067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:48:47.043884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호업종
우편번호1.0000.464
업종0.4641.000
2024-01-10T06:48:47.125852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호업종
우편번호1.0000.266
업종0.2661.000

Missing values

2024-01-10T06:48:44.283832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:48:44.354877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T06:48:44.431533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호명우편번호영업소도로명소재지영업소지번소재지업종
0신나는노래연습장33470충청남도 보령시 작은오랏5길 37 (동대동)충청남도 보령시 동대동 1781노래연습장업
1내마음의노래방33520충청남도 보령시 웅천읍 장터중앙길 252충청남도 보령시 웅천읍 대창리 682노래연습장업
2은하수노래방33490충청남도 보령시 대천항2길 67 (신흑동)충청남도 보령시 신흑동 950-19노래연습장업
3새천년노래방33487충청남도 보령시 대해로 897-5 (신흑동)충청남도 보령시 신흑동 1996노래연습장업
4바다 필 노래연습장33488충청남도 보령시 해수욕장4길 46 (신흑동)충청남도 보령시 신흑동 1928노래연습장업
5열창노래방33470충청남도 보령시 작은오랏3길 58 (동대동)충청남도 보령시 동대동 1860노래연습장업
6소라노래연습장33508충청남도 보령시 웅천읍 열린바다1길 41충청남도 보령시 웅천읍 관당리 818-2노래연습장업
7열린음악회노래연습장33470충청남도 보령시 작은오랏4길 15-22 (동대동)충청남도 보령시 동대동 1707노래연습장업
8행운노래연습장33462충청남도 보령시 구장터로 9 (대천동)충청남도 보령시 대천동 210-18노래연습장업
9주공노래연습장33471충청남도 보령시 주공로 22 (동대동)충청남도 보령시 동대동 1936노래연습장업
상호명우편번호영업소도로명소재지영업소지번소재지업종
172광장오락실33489충청남도 보령시 머드광장로 6 (신흑동)충청남도 보령시 신흑동 2277-2청소년게임제공업
173게임존33489충청남도 보령시 해수욕장8길 61 (신흑동)충청남도 보령시 신흑동 2272-7청소년게임제공업
174광장게임존33489충청남도 보령시 머드광장로 6 (신흑동)충청남도 보령시 신흑동 2277-2청소년게임제공업
175토이랜드33478충청남도 보령시 한내로터리길 33-1 (동대동)충청남도 보령시 동대동 411청소년게임제공업
176드림오락실33470충청남도 보령시 작은오랏6길 26 (동대동)충청남도 보령시 동대동 1854청소년게임제공업
177뽑기세상33466충청남도 보령시 목장3길 22 (대천동)충청남도 보령시 대천동 618-375청소년게임제공업
178게임타운33488충청남도 보령시 해수욕장5길 17, 1층 (신흑동)충청남도 보령시 신흑동 1974청소년게임제공업
179힐링오락실33487충청남도 보령시 대해로 885, 1층 (신흑동)충청남도 보령시 신흑동 2004청소년게임제공업
180뾰로롱 토이스토리33489충청남도 보령시 머드로 189, 1층 A-3호 (신흑동)충청남도 보령시 신흑동 2210-2청소년게임제공업
181JOY33487충청남도 보령시 해수욕장3길 11-10, 한화콘도 지하층 (신흑동)충청남도 보령시 신흑동 2017 한화콘도청소년게임제공업