Overview

Dataset statistics

Number of variables5
Number of observations184
Missing cells24
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.5 KiB
Average record size in memory41.7 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description2022년 12월 20일 기준 보은군 공중위생업소에 대한 데이터로 업종명, 업소명, 영업소 주소, 전화번호 정보를 제공합니다.
Author충청북도 보은군
URLhttps://www.data.go.kr/data/15006968/fileData.do

Alerts

연번 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번High correlation
소재지전화 has 24 (13.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:35:57.011179
Analysis finished2023-12-12 16:35:57.596280
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct184
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean92.5
Minimum1
Maximum184
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T01:35:57.677802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.15
Q146.75
median92.5
Q3138.25
95-th percentile174.85
Maximum184
Range183
Interquartile range (IQR)91.5

Descriptive statistics

Standard deviation53.260367
Coefficient of variation (CV)0.57578775
Kurtosis-1.2
Mean92.5
Median Absolute Deviation (MAD)46
Skewness0
Sum17020
Variance2836.6667
MonotonicityStrictly increasing
2023-12-13T01:35:57.830334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
128 1
 
0.5%
119 1
 
0.5%
120 1
 
0.5%
121 1
 
0.5%
122 1
 
0.5%
123 1
 
0.5%
124 1
 
0.5%
125 1
 
0.5%
126 1
 
0.5%
Other values (174) 174
94.6%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
184 1
0.5%
183 1
0.5%
182 1
0.5%
181 1
0.5%
180 1
0.5%
179 1
0.5%
178 1
0.5%
177 1
0.5%
176 1
0.5%
175 1
0.5%

업종명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
숙박업(일반)
69 
미용업
43 
일반미용업
15 
이용업
13 
세탁업
12 
Other values (6)
32 

Length

Max length7
Median length5
Mean length5.0978261
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 69
37.5%
미용업 43
23.4%
일반미용업 15
 
8.2%
이용업 13
 
7.1%
세탁업 12
 
6.5%
건물위생관리업 8
 
4.3%
종합미용업 7
 
3.8%
목욕장업 6
 
3.3%
피부미용업 6
 
3.3%
숙박업(생활) 3
 
1.6%

Length

2023-12-13T01:35:57.992353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
숙박업(일반 69
37.5%
미용업 43
23.4%
일반미용업 15
 
8.2%
이용업 13
 
7.1%
세탁업 12
 
6.5%
건물위생관리업 8
 
4.3%
종합미용업 7
 
3.8%
목욕장업 6
 
3.3%
피부미용업 6
 
3.3%
숙박업(생활 3
 
1.6%
Distinct183
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T01:35:58.342302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length12
Mean length5.451087
Min length2

Characters and Unicode

Total characters1003
Distinct characters264
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique182 ?
Unique (%)98.9%

Sample

1st row그랜드호텔
2nd row속리산송림펜션
3rd row비바호텔&펜션
4th row해운펜션
5th row어래모텔
ValueCountFrequency (%)
현미용실 2
 
1.0%
미용실 2
 
1.0%
헤어샵 2
 
1.0%
소울메이트헤어샵 1
 
0.5%
o2 1
 
0.5%
포인트 1
 
0.5%
우리 1
 
0.5%
헤어스튜디오 1
 
0.5%
수빈이네 1
 
0.5%
제이스헤어샵 1
 
0.5%
Other values (196) 196
93.8%
2023-12-13T01:35:58.855176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36
 
3.6%
34
 
3.4%
34
 
3.4%
31
 
3.1%
29
 
2.9%
27
 
2.7%
27
 
2.7%
26
 
2.6%
25
 
2.5%
20
 
2.0%
Other values (254) 714
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 930
92.7%
Space Separator 25
 
2.5%
Lowercase Letter 15
 
1.5%
Uppercase Letter 12
 
1.2%
Other Punctuation 7
 
0.7%
Close Punctuation 5
 
0.5%
Decimal Number 5
 
0.5%
Open Punctuation 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
3.9%
34
 
3.7%
34
 
3.7%
31
 
3.3%
29
 
3.1%
27
 
2.9%
27
 
2.9%
26
 
2.8%
20
 
2.2%
18
 
1.9%
Other values (226) 648
69.7%
Lowercase Letter
ValueCountFrequency (%)
o 3
20.0%
t 3
20.0%
b 2
13.3%
i 1
 
6.7%
a 1
 
6.7%
n 1
 
6.7%
d 1
 
6.7%
y 1
 
6.7%
r 1
 
6.7%
s 1
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
K 3
25.0%
O 2
16.7%
I 1
 
8.3%
J 1
 
8.3%
W 1
 
8.3%
M 1
 
8.3%
A 1
 
8.3%
R 1
 
8.3%
T 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
. 2
28.6%
# 2
28.6%
& 2
28.6%
, 1
14.3%
Decimal Number
ValueCountFrequency (%)
8 4
80.0%
2 1
 
20.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 930
92.7%
Common 46
 
4.6%
Latin 27
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
3.9%
34
 
3.7%
34
 
3.7%
31
 
3.3%
29
 
3.1%
27
 
2.9%
27
 
2.9%
26
 
2.8%
20
 
2.2%
18
 
1.9%
Other values (226) 648
69.7%
Latin
ValueCountFrequency (%)
o 3
 
11.1%
t 3
 
11.1%
K 3
 
11.1%
O 2
 
7.4%
b 2
 
7.4%
I 1
 
3.7%
i 1
 
3.7%
a 1
 
3.7%
n 1
 
3.7%
d 1
 
3.7%
Other values (9) 9
33.3%
Common
ValueCountFrequency (%)
25
54.3%
) 5
 
10.9%
( 4
 
8.7%
8 4
 
8.7%
. 2
 
4.3%
# 2
 
4.3%
& 2
 
4.3%
, 1
 
2.2%
2 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 930
92.7%
ASCII 73
 
7.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
36
 
3.9%
34
 
3.7%
34
 
3.7%
31
 
3.3%
29
 
3.1%
27
 
2.9%
27
 
2.9%
26
 
2.8%
20
 
2.2%
18
 
1.9%
Other values (226) 648
69.7%
ASCII
ValueCountFrequency (%)
25
34.2%
) 5
 
6.8%
( 4
 
5.5%
8 4
 
5.5%
o 3
 
4.1%
t 3
 
4.1%
K 3
 
4.1%
O 2
 
2.7%
. 2
 
2.7%
b 2
 
2.7%
Other values (18) 20
27.4%
Distinct172
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T01:35:59.107719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length33
Mean length22.429348
Min length18

Characters and Unicode

Total characters4127
Distinct characters116
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique162 ?
Unique (%)88.0%

Sample

1st row충청북도 보은군 속리산면 사내4길 6-9
2nd row충청북도 보은군 속리산면 사내7길 11-1
3rd row충청북도 보은군 속리산면 사내6길 6-9
4th row충청북도 보은군 속리산면 사내6길 12-5
5th row충청북도 보은군 속리산면 사내2길 78-5
ValueCountFrequency (%)
충청북도 184
19.0%
보은군 184
19.0%
보은읍 114
 
11.8%
속리산면 44
 
4.6%
삼산남로 23
 
2.4%
보은로 22
 
2.3%
사내6길 14
 
1.4%
삼산로 13
 
1.3%
보청대로 10
 
1.0%
삼산로1길 9
 
0.9%
Other values (213) 350
36.2%
2023-12-13T01:35:59.489291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
783
19.0%
335
 
8.1%
325
 
7.9%
196
 
4.7%
188
 
4.6%
184
 
4.5%
184
 
4.5%
184
 
4.5%
1 150
 
3.6%
128
 
3.1%
Other values (106) 1470
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2646
64.1%
Space Separator 783
 
19.0%
Decimal Number 581
 
14.1%
Dash Punctuation 75
 
1.8%
Other Punctuation 36
 
0.9%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
335
12.7%
325
12.3%
196
 
7.4%
188
 
7.1%
184
 
7.0%
184
 
7.0%
184
 
7.0%
128
 
4.8%
118
 
4.5%
114
 
4.3%
Other values (91) 690
26.1%
Decimal Number
ValueCountFrequency (%)
1 150
25.8%
2 72
12.4%
3 63
10.8%
6 60
 
10.3%
4 59
 
10.2%
5 41
 
7.1%
8 38
 
6.5%
9 37
 
6.4%
0 34
 
5.9%
7 27
 
4.6%
Space Separator
ValueCountFrequency (%)
783
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Other Punctuation
ValueCountFrequency (%)
, 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2646
64.1%
Common 1481
35.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
335
12.7%
325
12.3%
196
 
7.4%
188
 
7.1%
184
 
7.0%
184
 
7.0%
184
 
7.0%
128
 
4.8%
118
 
4.5%
114
 
4.3%
Other values (91) 690
26.1%
Common
ValueCountFrequency (%)
783
52.9%
1 150
 
10.1%
- 75
 
5.1%
2 72
 
4.9%
3 63
 
4.3%
6 60
 
4.1%
4 59
 
4.0%
5 41
 
2.8%
8 38
 
2.6%
9 37
 
2.5%
Other values (5) 103
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2646
64.1%
ASCII 1481
35.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
783
52.9%
1 150
 
10.1%
- 75
 
5.1%
2 72
 
4.9%
3 63
 
4.3%
6 60
 
4.1%
4 59
 
4.0%
5 41
 
2.8%
8 38
 
2.6%
9 37
 
2.5%
Other values (5) 103
 
7.0%
Hangul
ValueCountFrequency (%)
335
12.7%
325
12.3%
196
 
7.4%
188
 
7.1%
184
 
7.0%
184
 
7.0%
184
 
7.0%
128
 
4.8%
118
 
4.5%
114
 
4.3%
Other values (91) 690
26.1%

소재지전화
Text

MISSING 

Distinct157
Distinct (%)98.1%
Missing24
Missing (%)13.0%
Memory size1.6 KiB
2023-12-13T01:35:59.794281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.99375
Min length13

Characters and Unicode

Total characters2239
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique154 ?
Unique (%)96.2%

Sample

1st row043 -542 -2500
2nd row043 -543 -3941
3rd row043 -544 -7888
4th row043 -543 -3754
5th row043 -543 -3882
ValueCountFrequency (%)
043 148
34.6%
543 58
 
13.6%
542 31
 
7.2%
544 25
 
5.8%
0435 2
 
0.5%
4000 2
 
0.5%
542-2211 2
 
0.5%
5070 2
 
0.5%
5955 2
 
0.5%
5651 1
 
0.2%
Other values (155) 155
36.2%
2023-12-13T01:36:00.436542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 415
18.5%
- 320
14.3%
305
13.6%
3 291
13.0%
0 265
11.8%
5 214
9.6%
2 113
 
5.0%
7 73
 
3.3%
8 70
 
3.1%
1 68
 
3.0%
Other values (2) 105
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1614
72.1%
Dash Punctuation 320
 
14.3%
Space Separator 305
 
13.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 415
25.7%
3 291
18.0%
0 265
16.4%
5 214
13.3%
2 113
 
7.0%
7 73
 
4.5%
8 70
 
4.3%
1 68
 
4.2%
6 55
 
3.4%
9 50
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 320
100.0%
Space Separator
ValueCountFrequency (%)
305
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2239
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 415
18.5%
- 320
14.3%
305
13.6%
3 291
13.0%
0 265
11.8%
5 214
9.6%
2 113
 
5.0%
7 73
 
3.3%
8 70
 
3.1%
1 68
 
3.0%
Other values (2) 105
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2239
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 415
18.5%
- 320
14.3%
305
13.6%
3 291
13.0%
0 265
11.8%
5 214
9.6%
2 113
 
5.0%
7 73
 
3.3%
8 70
 
3.1%
1 68
 
3.0%
Other values (2) 105
 
4.7%

Interactions

2023-12-13T01:35:57.321782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:36:00.515885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.899
업종명0.8991.000
2023-12-13T01:36:00.579679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.671
업종명0.6711.000

Missing values

2023-12-13T01:35:57.453334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:35:57.555666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명영업소 주소(도로명)소재지전화
01숙박업(일반)그랜드호텔충청북도 보은군 속리산면 사내4길 6-9043 -542 -2500
12숙박업(일반)속리산송림펜션충청북도 보은군 속리산면 사내7길 11-1043 -543 -3941
23숙박업(일반)비바호텔&펜션충청북도 보은군 속리산면 사내6길 6-9043 -544 -7888
34숙박업(일반)해운펜션충청북도 보은군 속리산면 사내6길 12-5043 -543 -3754
45숙박업(일반)어래모텔충청북도 보은군 속리산면 사내2길 78-5043 -543 -3882
56숙박업(일반)청심호텔충청북도 보은군 속리산면 사내5길 6-9043 -544 -1255
67숙박업(일반)레이크힐스호텔속리산충청북도 보은군 속리산면 법주사로 305043 -542 -5281
78숙박업(일반)속리여관충청북도 보은군 속리산면 사내6길 5-2, 모텔043 -543 -5070
89숙박업(일반)속리산펜션모텔충청북도 보은군 속리산면 사내6길 5-10043 -544 -3844
910숙박업(일반)아모르충청북도 보은군 속리산면 사내2길 12-16043 -543 -6767
연번업종명업소명영업소 주소(도로명)소재지전화
174175종합미용업K&J story충청북도 보은군 보은읍 보은로 160-20507-1324-6678
175176종합미용업애니헤어샾충청북도 보은군 보은읍 풍취길 32-19<NA>
176177종합미용업헤어하우스 결충청북도 보은군 보은읍 삼산로2길 8-1, 주점043 -544 -0025
177178종합미용업네일은 혜윰충청북도 보은군 보은읍 삼산로3길 11, 통일시장<NA>
178179종합미용업빗앤붓(bit and boot)충청북도 보은군 보은읍 보은로 112<NA>
179180숙박업(생활)해찬솔펜션충청북도 보은군 속리산면 비룡동관로 63043 -542 -9444
180181숙박업(생활)써니힐충청북도 보은군 속리산면 사내2길 44, 민박<NA>
181182숙박업(생활)다님펜션충청북도 보은군 속리산면 사내6길 10, 여관<NA>
182183네일미용업네일 #충청북도 보은군 보은읍 남부로 4303-12, 사무실<NA>
183184네일미용업봄네일충청북도 보은군 보은읍 보은로 158-1, 공인중개업소<NA>