Overview

Dataset statistics

Number of variables: 5
Number of observations: 161
Missing cells: 21
Missing cells (%): 2.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.6 KiB
Average record size in memory: 41.8 B
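
The missing-cell percentage follows from the counts above: the table holds 5 × 161 cells, of which 21 are empty. A minimal check of that arithmetic (the variable names are illustrative, not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 161
missing_cells = 21

total_cells = n_variables * n_observations        # every row has one cell per variable
missing_pct = 100 * missing_cells / total_cells   # share of cells without a value

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```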

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

Description: Provides data on beauty businesses located in Taean-gun (serial number, beauty business type, business name, business address (road name), phone number).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=446&beforeMenuCd=DOM_000000201001001000&publicdatapk=15006944

Alerts

연번 is highly overall correlated with 업종명 [High correlation]
업종명 is highly overall correlated with 연번 [High correlation]
소재지전화 has 21 (13.0%) missing values [Missing]
연번 has unique values [Unique]

Reproduction

Analysis started: 2024-01-09 20:18:39.453224
Analysis finished: 2024-01-09 20:18:39.929265
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
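
A report like this one can be regenerated along the following lines. This is a sketch only: the CSV path is hypothetical, the report title is a placeholder, and the source file may need an explicit `encoding` argument that the report does not record.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (both paths are hypothetical)."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(html_path)
```

Calling `build_report("beauty_businesses.csv")` (a made-up file name) would write `report.html` to the working directory.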

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 161
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 81
Minimum: 1
Maximum: 161
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 9
Q1: 41
Median: 81
Q3: 121
95-th percentile: 153
Maximum: 161
Range: 160
Interquartile range (IQR): 80

Descriptive statistics

Standard deviation: 46.620811
Coefficient of variation (CV): 0.57556557
Kurtosis: -1.2
Mean: 81
Median Absolute Deviation (MAD): 40
Skewness: 0
Sum: 13041
Variance: 2173.5
Monotonicity: Strictly increasing
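
These figures are what one would expect if 연번 is simply the running number 1 to 161. The sketch below recomputes several of them under that assumption, using only the Python standard library; the inclusive quantile method is chosen because it interpolates between the observed minimum and maximum.

```python
import statistics

# Assumption: 연번 is the integer sequence 1..161 (161 distinct values,
# minimum 1, maximum 161, strictly increasing, sum 13041).
values = list(range(1, 162))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad, q3 - q1)
```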
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.6%
122 | 1 | 0.6%
104 | 1 | 0.6%
105 | 1 | 0.6%
106 | 1 | 0.6%
107 | 1 | 0.6%
108 | 1 | 0.6%
109 | 1 | 0.6%
110 | 1 | 0.6%
111 | 1 | 0.6%
Other values (151) | 151 | 93.8%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 1 | 0.6%
6 | 1 | 0.6%
7 | 1 | 0.6%
8 | 1 | 0.6%
9 | 1 | 0.6%
10 | 1 | 0.6%

Maximum 10 values
Value | Count | Frequency (%)
161 | 1 | 0.6%
160 | 1 | 0.6%
159 | 1 | 0.6%
158 | 1 | 0.6%
157 | 1 | 0.6%
156 | 1 | 0.6%
155 | 1 | 0.6%
154 | 1 | 0.6%
153 | 1 | 0.6%
152 | 1 | 0.6%

업종명
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

미용업: 70
일반미용업: 59
피부미용업: 18
종합미용업: 6
네일미용업: 6
Other values (2): 2

Length

Max length: 12
Median length: 5
Mean length: 4.2173913
Min length: 3

Unique

Unique: 2
Unique (%): 1.2%

Sample

1st row: 미용업
2nd row: 미용업
3rd row: 미용업
4th row: 미용업
5th row: 미용업

Common Values

Value | Count | Frequency (%)
미용업 | 70 | 43.5%
일반미용업 | 59 | 36.6%
피부미용업 | 18 | 11.2%
종합미용업 | 6 | 3.7%
네일미용업 | 6 | 3.7%
일반미용업, 네일미용업 | 1 | 0.6%
피부미용업, 네일미용업 | 1 | 0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
미용업 | 70 | 42.9%
일반미용업 | 60 | 36.8%
피부미용업 | 19 | 11.7%
네일미용업 | 8 | 4.9%
종합미용업 | 6 | 3.7%
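
The counts in this last table differ from the Common Values table, apparently because it counts individual words rather than whole rows: the two combined categories each contribute one occurrence to both of their component words. The sketch below reproduces the figures under that assumption; the splitting rule (commas and whitespace) is a guess at the tokenisation, not something the report states.

```python
from collections import Counter

# Row counts per category, copied from the Common Values table.
category_counts = {
    "미용업": 70,
    "일반미용업": 59,
    "피부미용업": 18,
    "종합미용업": 6,
    "네일미용업": 6,
    "일반미용업, 네일미용업": 1,
    "피부미용업, 네일미용업": 1,
}

# Assumption: a "word" is any piece left after splitting on commas and spaces.
word_counts = Counter()
for category, rows in category_counts.items():
    for word in category.replace(",", " ").split():
        word_counts[word] += rows

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word}\t{count}\t{100 * count / total_words:.1f}%")
```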

업소명
Text

Distinct: 159
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 12
Median length: 11
Mean length: 5.2111801
Min length: 2

Characters and Unicode

Total characters: 839
Distinct characters: 235
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
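
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one business name from this variable; the helper name and the small lookup table are illustrative, and scripts and blocks are not covered because the standard library does not provide them.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in the example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Ps": "Open Punctuation",
}

def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)

# One value taken from this variable's frequency table.
counts = category_counts("살롱미(salon美")
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables and the Han character both fall under "Other Letter", which is why that category dominates the tables for this variable.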

Unique

Unique: 157
Unique (%): 97.5%

Sample

1st row: 대지미용실
2nd row: 코스모스미용실
3rd row: 서울미용실
4th row: 중앙헤어숍
5th row: 보라미용실

Value | Count | Frequency (%)
댕기머리 | 2 | 1.2%
현대미용실 | 2 | 1.2%
로즈네일 | 1 | 0.6%
hm헤어 | 1 | 0.6%
one헤어 | 1 | 0.6%
대지미용실 | 1 | 0.6%
살롱미(salon美 | 1 | 0.6%
m헤어 | 1 | 0.6%
예쁘다헤어샵 | 1 | 0.6%
숙헤어샵 | 1 | 0.6%
Other values (151) | 151 | 92.6%

Most occurring characters

ValueCountFrequency (%)
63
 
7.5%
60
 
7.2%
54
 
6.4%
48
 
5.7%
47
 
5.6%
27
 
3.2%
18
 
2.1%
16
 
1.9%
15
 
1.8%
12
 
1.4%
Other values (225) 479
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 805
95.9%
Lowercase Letter 14
 
1.7%
Decimal Number 4
 
0.5%
Uppercase Letter 4
 
0.5%
Close Punctuation 3
 
0.4%
Open Punctuation 3
 
0.4%
Other Punctuation 3
 
0.4%
Space Separator 2
 
0.2%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
7.8%
60
 
7.5%
54
 
6.7%
48
 
6.0%
47
 
5.8%
27
 
3.4%
18
 
2.2%
16
 
2.0%
15
 
1.9%
12
 
1.5%
Other values (204) 445
55.3%
Lowercase Letter
ValueCountFrequency (%)
n 4
28.6%
o 3
21.4%
a 1
 
7.1%
l 1
 
7.1%
g 1
 
7.1%
u 1
 
7.1%
i 1
 
7.1%
s 1
 
7.1%
e 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
0 2
50.0%
8 1
25.0%
2 1
25.0%
Uppercase Letter
ValueCountFrequency (%)
M 2
50.0%
H 1
25.0%
Y 1
25.0%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
# 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 804
95.8%
Latin 18
 
2.1%
Common 16
 
1.9%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
 
7.8%
60
 
7.5%
54
 
6.7%
48
 
6.0%
47
 
5.8%
27
 
3.4%
18
 
2.2%
16
 
2.0%
15
 
1.9%
12
 
1.5%
Other values (203) 444
55.2%
Latin
ValueCountFrequency (%)
n 4
22.2%
o 3
16.7%
M 2
11.1%
a 1
 
5.6%
l 1
 
5.6%
H 1
 
5.6%
g 1
 
5.6%
u 1
 
5.6%
Y 1
 
5.6%
i 1
 
5.6%
Other values (2) 2
11.1%
Common
ValueCountFrequency (%)
) 3
18.8%
( 3
18.8%
0 2
12.5%
2
12.5%
& 2
12.5%
8 1
 
6.2%
2 1
 
6.2%
# 1
 
6.2%
- 1
 
6.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 804
95.8%
ASCII 34
 
4.1%
CJK 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
63
 
7.8%
60
 
7.5%
54
 
6.7%
48
 
6.0%
47
 
5.8%
27
 
3.4%
18
 
2.2%
16
 
2.0%
15
 
1.9%
12
 
1.5%
Other values (203) 444
55.2%
ASCII
ValueCountFrequency (%)
n 4
 
11.8%
o 3
 
8.8%
) 3
 
8.8%
( 3
 
8.8%
0 2
 
5.9%
2
 
5.9%
M 2
 
5.9%
& 2
 
5.9%
a 1
 
2.9%
l 1
 
2.9%
Other values (11) 11
32.4%
CJK
ValueCountFrequency (%)
1
100.0%

영업소 주소(도로명)
Text

Distinct: 157
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 45
Median length: 42
Mean length: 22.521739
Min length: 18

Characters and Unicode

Total characters: 3626
Distinct characters: 104
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 154
Unique (%): 95.7%

Sample

1st row: 충청남도 태안군 태안읍 경이정1길 26
2nd row: 충청남도 태안군 태안읍 시장2길 39-12
3rd row: 충청남도 태안군 원북면 상리길 15-4
4th row: 충청남도 태안군 태안읍 독샘로 40
5th row: 충청남도 태안군 안면읍 장터로 114-5

Value | Count | Frequency (%)
충청남도 | 161 | 18.5%
태안군 | 161 | 18.5%
태안읍 | 131 | 15.0%
중앙로 | 25 | 2.9%
1층 | 19 | 2.2%
안면읍 | 19 | 2.2%
독샘로 | 15 | 1.7%
장터로 | 14 | 1.6%
2층 | 9 | 1.0%
후곡로 | 8 | 0.9%
Other values (205) | 310 | 35.6%

Most occurring characters

ValueCountFrequency (%)
711
19.6%
316
 
8.7%
295
 
8.1%
1 183
 
5.0%
170
 
4.7%
170
 
4.7%
165
 
4.6%
161
 
4.4%
161
 
4.4%
150
 
4.1%
Other values (94) 1144
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2223
61.3%
Space Separator 711
 
19.6%
Decimal Number 582
 
16.1%
Dash Punctuation 54
 
1.5%
Other Punctuation 45
 
1.2%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Math Symbol 2
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
316
14.2%
295
13.3%
170
 
7.6%
170
 
7.6%
165
 
7.4%
161
 
7.2%
161
 
7.2%
150
 
6.7%
90
 
4.0%
71
 
3.2%
Other values (77) 474
21.3%
Decimal Number
ValueCountFrequency (%)
1 183
31.4%
2 84
14.4%
4 62
 
10.7%
3 61
 
10.5%
0 41
 
7.0%
5 36
 
6.2%
7 33
 
5.7%
8 31
 
5.3%
9 27
 
4.6%
6 24
 
4.1%
Space Separator
ValueCountFrequency (%)
711
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%
Other Punctuation
ValueCountFrequency (%)
, 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2223
61.3%
Common 1402
38.7%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
316
14.2%
295
13.3%
170
 
7.6%
170
 
7.6%
165
 
7.4%
161
 
7.2%
161
 
7.2%
150
 
6.7%
90
 
4.0%
71
 
3.2%
Other values (77) 474
21.3%
Common
ValueCountFrequency (%)
711
50.7%
1 183
 
13.1%
2 84
 
6.0%
4 62
 
4.4%
3 61
 
4.4%
- 54
 
3.9%
, 45
 
3.2%
0 41
 
2.9%
5 36
 
2.6%
7 33
 
2.4%
Other values (6) 92
 
6.6%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2223
61.3%
ASCII 1403
38.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
711
50.7%
1 183
 
13.0%
2 84
 
6.0%
4 62
 
4.4%
3 61
 
4.3%
- 54
 
3.8%
, 45
 
3.2%
0 41
 
2.9%
5 36
 
2.6%
7 33
 
2.4%
Other values (7) 93
 
6.6%
Hangul
ValueCountFrequency (%)
316
14.2%
295
13.3%
170
 
7.6%
170
 
7.6%
165
 
7.4%
161
 
7.2%
161
 
7.2%
150
 
6.7%
90
 
4.0%
71
 
3.2%
Other values (77) 474
21.3%

소재지전화
Text

MISSING 

Distinct: 140
Distinct (%): 100.0%
Missing: 21
Missing (%): 13.0%
Memory size: 1.4 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.157143
Min length: 12

Characters and Unicode

Total characters: 1702
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 140
Unique (%): 100.0%

Sample

1st row: 041-674-2177
2nd row: 041-674-7617
3rd row: 041-672-5089
4th row: 041-674-2223
5th row: 041-674-2343

Value | Count | Frequency (%)
041-673-2583 | 1 | 0.7%
041-674-9776 | 1 | 0.7%
041-672-0134 | 1 | 0.7%
041-673-1787 | 1 | 0.7%
070-7738-4500 | 1 | 0.7%
041-674-1170 | 1 | 0.7%
041-674-8260 | 1 | 0.7%
041-674-9959 | 1 | 0.7%
041-672-0423 | 1 | 0.7%
041-674-3613 | 1 | 0.7%
Other values (130) | 130 | 92.9%

Most occurring characters

ValueCountFrequency (%)
- 280
16.5%
0 224
13.2%
4 211
12.4%
7 198
11.6%
1 193
11.3%
6 179
10.5%
5 115
6.8%
2 95
 
5.6%
3 86
 
5.1%
8 63
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1422
83.5%
Dash Punctuation 280
 
16.5%
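
The category totals are consistent with the other figures reported for 소재지전화. A short arithmetic check, using only numbers stated in this report (the variable names are illustrative):

```python
# Figures copied from the 소재지전화 statistics in this report.
non_missing = 161 - 21        # observations minus missing values
digits, dashes = 1422, 280    # Decimal Number and Dash Punctuation counts

total_chars = digits + dashes             # should equal "Total characters"
dashes_per_number = dashes / non_missing  # separators per phone number
mean_length = total_chars / non_missing   # should equal "Mean length"

print(total_chars, dashes_per_number, round(mean_length, 6))
```

Exactly two dashes per value suggests every non-missing entry follows a three-part layout such as 041-674-2177, although the report itself does not verify the format.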

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 224
15.8%
4 211
14.8%
7 198
13.9%
1 193
13.6%
6 179
12.6%
5 115
8.1%
2 95
6.7%
3 86
 
6.0%
8 63
 
4.4%
9 58
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 280
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1702
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 280
16.5%
0 224
13.2%
4 211
12.4%
7 198
11.6%
1 193
11.3%
6 179
10.5%
5 115
6.8%
2 95
 
5.6%
3 86
 
5.1%
8 63
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1702
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 280
16.5%
0 224
13.2%
4 211
12.4%
7 198
11.6%
1 193
11.3%
6 179
10.5%
5 115
6.8%
2 95
 
5.6%
3 86
 
5.1%
8 63
 
3.7%

Interactions


Correlations

 | 연번 | 업종명
연번 | 1.000 | 0.842
업종명 | 0.842 | 1.000

 | 연번 | 업종명
연번 | 1.000 | 0.636
업종명 | 0.636 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
index | 연번 | 업종명 | 업소명 | 영업소 주소(도로명) | 소재지전화
0 | 1 | 미용업 | 대지미용실 | 충청남도 태안군 태안읍 경이정1길 26 | 041-674-2177
1 | 2 | 미용업 | 코스모스미용실 | 충청남도 태안군 태안읍 시장2길 39-12 | 041-674-7617
2 | 3 | 미용업 | 서울미용실 | 충청남도 태안군 원북면 상리길 15-4 | 041-672-5089
3 | 4 | 미용업 | 중앙헤어숍 | 충청남도 태안군 태안읍 독샘로 40 | 041-674-2223
4 | 5 | 미용업 | 보라미용실 | 충청남도 태안군 안면읍 장터로 114-5 | 041-674-2343
5 | 6 | 미용업 | 가고파미용실 | 충청남도 태안군 남면 남면로 93 | 041-673-1915
6 | 7 | 미용업 | 내머리어때 | 충청남도 태안군 태안읍 시장1길 35-15 | 041-674-3439
7 | 8 | 미용업 | 쎄븐미용실 | 충청남도 태안군 안면읍 장터로 120-3 | 041-673-4320
8 | 9 | 미용업 | 현대미용실 | 충청남도 태안군 태안읍 능샘2길 40 | 041-674-3867
9 | 10 | 미용업 | 새미용실 | 충청남도 태안군 태안읍 독샘로 37-11 | 041-673-2583

Last rows
index | 연번 | 업종명 | 업소명 | 영업소 주소(도로명) | 소재지전화
151 | 152 | 종합미용업 | 서이헤어 | 충청남도 태안군 태안읍 중앙로 72, 2층 | 041-675-1030
152 | 153 | 네일미용업 | 제이네일 | 충청남도 태안군 태안읍 동문5길 11-4 | <NA>
153 | 154 | 네일미용업 | 에스네일엔뷰티 | 충청남도 태안군 태안읍 동백로 236 | 0507-1309-1476
154 | 155 | 네일미용업 | 네일봄 | 충청남도 태안군 태안읍 동문2길 12-6, 101호 | 0507-1488-8003
155 | 156 | 네일미용업 | 케하 | 충청남도 태안군 태안읍 대지길 2, 1층 | <NA>
156 | 157 | 네일미용업 | 보름뷰티 | 충청남도 태안군 태안읍 샘골로 14, 1층 | 0507-1319-0496
157 | 158 | 네일미용업 | 로즈네일 | 충청남도 태안군 태안읍 후곡로 109, 나동 1호 | 0507-1411-0406
158 | 159 | 일반미용업, 네일미용업 | 분홍손톱 염색공주 | 충청남도 태안군 태안읍 동문3길 17 | <NA>
159 | 160 | 피부미용업, 네일미용업 | 갈바닉피부사랑방 | 충청남도 태안군 안면읍 장터로 90 | <NA>
160 | 161 | 피부미용업 | 로뎀나무 | 충청남도 태안군 태안읍 후곡로 121, 2층 | <NA>
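
The last ten sample rows show how the missing 소재지전화 values cluster near the end of the file. The sketch below counts them; the (연번, phone) pairs are transcribed by hand from those rows, with `None` standing in for `<NA>`, and cover only this sample rather than the full dataset.

```python
# (연번, 소재지전화) pairs from the last ten sample rows; None stands for <NA>.
last_rows = [
    (152, "041-675-1030"),
    (153, None),
    (154, "0507-1309-1476"),
    (155, "0507-1488-8003"),
    (156, None),
    (157, "0507-1319-0496"),
    (158, "0507-1411-0406"),
    (159, None),
    (160, None),
    (161, None),
]

# Collect the serial numbers of rows that have no phone number.
missing = [serial for serial, phone in last_rows if phone is None]
print(f"{len(missing)} of {len(last_rows)} rows lack a phone number: {missing}")
```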