Overview

Dataset statistics

Number of variables5
Number of observations136
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory42.0 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description태백시에 개설된 공중위생업소 (일반미용, 종합미용, 피부미용, 네일아트, 화장분장)로 업소명, 소재지 주소, 연락처를 포함한 데이터를 제공합니다.
Author강원특별자치도 태백시
URLhttps://www.data.go.kr/data/15099728/fileData.do

Alerts

업태명 is highly imbalanced (70.0%)Imbalance
연번 has unique valuesUnique
소재지전화 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:18:44.221944
Analysis finished2023-12-12 20:18:44.747318
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct136
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.5
Minimum1
Maximum136
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T05:18:44.823904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.75
Q134.75
median68.5
Q3102.25
95-th percentile129.25
Maximum136
Range135
Interquartile range (IQR)67.5

Descriptive statistics

Standard deviation39.403892
Coefficient of variation (CV)0.57523929
Kurtosis-1.2
Mean68.5
Median Absolute Deviation (MAD)34
Skewness0
Sum9316
Variance1552.6667
MonotonicityStrictly increasing
2023-12-13T05:18:44.977319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
89 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
96 1
 
0.7%
70 1
 
0.7%
Other values (126) 126
92.6%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%
128 1
0.7%
127 1
0.7%
Distinct132
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T05:18:45.252539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length5.2279412
Min length2

Characters and Unicode

Total characters711
Distinct characters209
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)94.1%

Sample

1st row제일미용실
2nd row현주미용실
3rd row수정미용실
4th row서울미용실
5th row현대미용실
ValueCountFrequency (%)
magic헤어 2
 
1.4%
봄날헤어살롱 2
 
1.4%
강남별이 2
 
1.4%
윤헤어 2
 
1.4%
헤어카페 1
 
0.7%
제일미용실 1
 
0.7%
헤어스토리 1
 
0.7%
헤어미인 1
 
0.7%
깍쇠 1
 
0.7%
헤어재인 1
 
0.7%
Other values (124) 124
89.9%
2023-12-13T05:18:45.713861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
64
 
9.0%
64
 
9.0%
48
 
6.8%
36
 
5.1%
36
 
5.1%
21
 
3.0%
14
 
2.0%
11
 
1.5%
10
 
1.4%
10
 
1.4%
Other values (199) 397
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 669
94.1%
Lowercase Letter 21
 
3.0%
Uppercase Letter 7
 
1.0%
Decimal Number 4
 
0.6%
Other Punctuation 4
 
0.6%
Open Punctuation 2
 
0.3%
Space Separator 2
 
0.3%
Close Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
9.6%
64
 
9.6%
48
 
7.2%
36
 
5.4%
36
 
5.4%
21
 
3.1%
14
 
2.1%
11
 
1.6%
10
 
1.5%
10
 
1.5%
Other values (176) 355
53.1%
Lowercase Letter
ValueCountFrequency (%)
a 4
19.0%
i 3
14.3%
n 3
14.3%
r 2
9.5%
y 2
9.5%
c 2
9.5%
g 2
9.5%
h 1
 
4.8%
o 1
 
4.8%
z 1
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
M 2
28.6%
B 1
14.3%
F 1
14.3%
W 1
14.3%
C 1
14.3%
K 1
14.3%
Decimal Number
ValueCountFrequency (%)
0 2
50.0%
1 2
50.0%
Other Punctuation
ValueCountFrequency (%)
. 2
50.0%
# 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 669
94.1%
Latin 28
 
3.9%
Common 14
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
9.6%
64
 
9.6%
48
 
7.2%
36
 
5.4%
36
 
5.4%
21
 
3.1%
14
 
2.1%
11
 
1.6%
10
 
1.5%
10
 
1.5%
Other values (176) 355
53.1%
Latin
ValueCountFrequency (%)
a 4
14.3%
i 3
10.7%
n 3
10.7%
r 2
 
7.1%
y 2
 
7.1%
M 2
 
7.1%
c 2
 
7.1%
g 2
 
7.1%
B 1
 
3.6%
F 1
 
3.6%
Other values (6) 6
21.4%
Common
ValueCountFrequency (%)
0 2
14.3%
( 2
14.3%
1 2
14.3%
2
14.3%
) 2
14.3%
. 2
14.3%
# 2
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 669
94.1%
ASCII 42
 
5.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
64
 
9.6%
64
 
9.6%
48
 
7.2%
36
 
5.4%
36
 
5.4%
21
 
3.1%
14
 
2.1%
11
 
1.6%
10
 
1.5%
10
 
1.5%
Other values (176) 355
53.1%
ASCII
ValueCountFrequency (%)
a 4
 
9.5%
i 3
 
7.1%
n 3
 
7.1%
0 2
 
4.8%
( 2
 
4.8%
1 2
 
4.8%
r 2
 
4.8%
y 2
 
4.8%
2
 
4.8%
) 2
 
4.8%
Other values (13) 18
42.9%
Distinct135
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T05:18:46.002189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length33
Mean length25.573529
Min length20

Characters and Unicode

Total characters3478
Distinct characters91
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)98.5%

Sample

1st row강원특별자치도 태백시 상장남길 43 (황지동)
2nd row강원특별자치도 태백시 시장북길 24 (황지동)
3rd row강원특별자치도 태백시 장성로 31 (장성동)
4th row강원특별자치도 태백시 장성로 40 (장성동)
5th row강원특별자치도 태백시 연지로 22 (황지동)
ValueCountFrequency (%)
강원특별자치도 120
19.0%
태백시 120
19.0%
황지동 92
14.6%
황지로 33
 
5.2%
장성동 16
 
2.5%
장성로 11
 
1.7%
번영로 10
 
1.6%
1층 8
 
1.3%
먹거리길 7
 
1.1%
시장북길 5
 
0.8%
Other values (163) 209
33.1%
2023-12-13T05:18:46.503931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
495
 
14.2%
161
 
4.6%
153
 
4.4%
150
 
4.3%
144
 
4.1%
144
 
4.1%
140
 
4.0%
139
 
4.0%
138
 
4.0%
137
 
3.9%
Other values (81) 1677
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2267
65.2%
Space Separator 495
 
14.2%
Decimal Number 394
 
11.3%
Open Punctuation 136
 
3.9%
Close Punctuation 136
 
3.9%
Dash Punctuation 27
 
0.8%
Other Punctuation 22
 
0.6%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
161
 
7.1%
153
 
6.7%
150
 
6.6%
144
 
6.4%
144
 
6.4%
140
 
6.2%
139
 
6.1%
138
 
6.1%
137
 
6.0%
136
 
6.0%
Other values (65) 825
36.4%
Decimal Number
ValueCountFrequency (%)
1 105
26.6%
2 66
16.8%
3 43
10.9%
6 37
 
9.4%
4 33
 
8.4%
5 26
 
6.6%
0 24
 
6.1%
8 23
 
5.8%
7 19
 
4.8%
9 18
 
4.6%
Space Separator
ValueCountFrequency (%)
495
100.0%
Open Punctuation
ValueCountFrequency (%)
( 136
100.0%
Close Punctuation
ValueCountFrequency (%)
) 136
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Other Punctuation
ValueCountFrequency (%)
, 22
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2267
65.2%
Common 1210
34.8%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
161
 
7.1%
153
 
6.7%
150
 
6.6%
144
 
6.4%
144
 
6.4%
140
 
6.2%
139
 
6.1%
138
 
6.1%
137
 
6.0%
136
 
6.0%
Other values (65) 825
36.4%
Common
ValueCountFrequency (%)
495
40.9%
( 136
 
11.2%
) 136
 
11.2%
1 105
 
8.7%
2 66
 
5.5%
3 43
 
3.6%
6 37
 
3.1%
4 33
 
2.7%
- 27
 
2.2%
5 26
 
2.1%
Other values (5) 106
 
8.8%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2267
65.2%
ASCII 1211
34.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
495
40.9%
( 136
 
11.2%
) 136
 
11.2%
1 105
 
8.7%
2 66
 
5.5%
3 43
 
3.6%
6 37
 
3.1%
4 33
 
2.7%
- 27
 
2.2%
5 26
 
2.1%
Other values (6) 107
 
8.8%
Hangul
ValueCountFrequency (%)
161
 
7.1%
153
 
6.7%
150
 
6.6%
144
 
6.4%
144
 
6.4%
140
 
6.2%
139
 
6.1%
138
 
6.1%
137
 
6.0%
136
 
6.0%
Other values (65) 825
36.4%

소재지전화
Text

UNIQUE 

Distinct136
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T05:18:46.844503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.727941
Min length12

Characters and Unicode

Total characters1867
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)100.0%

Sample

1st row 033- 552-3891
2nd row 033- 552-2791
3rd row 033- 581-6116
4th row 033- 582-9909
5th row 033- 552-3172
ValueCountFrequency (%)
033 105
35.1%
552 15
 
5.0%
553 14
 
4.7%
010 13
 
4.3%
554 10
 
3.3%
582 3
 
1.0%
581 2
 
0.7%
1070 2
 
0.7%
552-2791 1
 
0.3%
1239 1
 
0.3%
Other values (133) 133
44.5%
2023-12-13T05:18:47.338762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 343
18.4%
5 277
14.8%
- 272
14.6%
0 228
12.2%
222
11.9%
2 104
 
5.6%
8 91
 
4.9%
1 89
 
4.8%
4 72
 
3.9%
7 64
 
3.4%
Other values (2) 105
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1373
73.5%
Dash Punctuation 272
 
14.6%
Space Separator 222
 
11.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 343
25.0%
5 277
20.2%
0 228
16.6%
2 104
 
7.6%
8 91
 
6.6%
1 89
 
6.5%
4 72
 
5.2%
7 64
 
4.7%
6 53
 
3.9%
9 52
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 272
100.0%
Space Separator
ValueCountFrequency (%)
222
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1867
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 343
18.4%
5 277
14.8%
- 272
14.6%
0 228
12.2%
222
11.9%
2 104
 
5.6%
8 91
 
4.9%
1 89
 
4.8%
4 72
 
3.9%
7 64
 
3.4%
Other values (2) 105
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1867
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 343
18.4%
5 277
14.8%
- 272
14.6%
0 228
12.2%
222
11.9%
2 104
 
5.6%
8 91
 
4.9%
1 89
 
4.8%
4 72
 
3.9%
7 64
 
3.4%
Other values (2) 105
 
5.6%

업태명
Categorical

IMBALANCE 

Distinct5
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
일반미용업
121 
피부미용업
 
6
네일아트업
 
5
종합미용업
 
3
일반미용업, 네일미용업, 화장ㆍ분장 미용업
 
1

Length

Max length23
Median length5
Mean length5.1323529
Min length5

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row일반미용업
2nd row일반미용업
3rd row일반미용업
4th row일반미용업
5th row일반미용업

Common Values

ValueCountFrequency (%)
일반미용업 121
89.0%
피부미용업 6
 
4.4%
네일아트업 5
 
3.7%
종합미용업 3
 
2.2%
일반미용업, 네일미용업, 화장ㆍ분장 미용업 1
 
0.7%

Length

2023-12-13T05:18:47.510241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:18:47.643567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반미용업 122
87.8%
피부미용업 6
 
4.3%
네일아트업 5
 
3.6%
종합미용업 3
 
2.2%
네일미용업 1
 
0.7%
화장ㆍ분장 1
 
0.7%
미용업 1
 
0.7%

Interactions

2023-12-13T05:18:44.469471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:18:47.743493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업태명
연번1.0000.728
업태명0.7281.000
2023-12-13T05:18:47.831375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업태명
연번1.0000.389
업태명0.3891.000

Missing values

2023-12-13T05:18:44.591457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:18:44.707560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명영업소주소(도로명)소재지전화업태명
01제일미용실강원특별자치도 태백시 상장남길 43 (황지동)033- 552-3891일반미용업
12현주미용실강원특별자치도 태백시 시장북길 24 (황지동)033- 552-2791일반미용업
23수정미용실강원특별자치도 태백시 장성로 31 (장성동)033- 581-6116일반미용업
34서울미용실강원특별자치도 태백시 장성로 40 (장성동)033- 582-9909일반미용업
45현대미용실강원특별자치도 태백시 연지로 22 (황지동)033- 552-3172일반미용업
56은행미용실강원특별자치도 태백시 먹거리길 30 (황지동)033- 552-7023일반미용업
67오늘헤어강원특별자치도 태백시 황지로 136 (황지동)033- 552-8468일반미용업
78백조미용실강원특별자치도 태백시 황지로 257-1 (황지동)033- 553-4029일반미용업
89정희미용실강원특별자치도 태백시 동태백로 375 (철암동)033- 582-7176일반미용업
910정현미용실강원특별자치도 태백시 장성1길 180 (장성동)033- 581-8565일반미용업
연번업소명영업소주소(도로명)소재지전화업태명
126127Magic헤어강원특별자치도태백시먹거리길86(황지동)033-554-7770일반미용업
127128윤헤어#강원특별자치도태백시장성로40-1(장성동)033-581-5817일반미용업
128129봄날헤어살롱강원특별자치도태백시연지로20,1층(황지동)033-645-9727일반미용업
129130나비네일강원특별자치도태백시번영로366,1층(황지동)033-552-8149네일아트업
130131퀸스네일강원특별자치도태백시해지개길13(황지동)033-554-5050네일아트업
131132오늘도네일강원특별자치도태백시황지로237-1(황지동)033-553-8007네일아트업
132133K네일강원특별자치도태백시서황지로62,2층(황지동)033-552-5104네일아트업
133134아드리네일강원특별자치도태백시장성로21(장성동)033-581-4144네일아트업
134135제이헤어샵강원특별자치도태백시서황지로76,2층(황지동)033-554-1006일반미용업
135136강남별이강원특별자치도태백시황지로156(황지동)033-554-7780일반미용업