Overview

Dataset statistics

Number of variables: 5
Number of observations: 66
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 43.0 B
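
The two memory figures above can be checked against each other: the average record size times the number of observations should give the total size. A minimal Python sketch of that arithmetic follows; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_observations = 66
avg_record_bytes = 43.0

# Total size is the per-record average times the row count.
total_bytes = n_observations * avg_record_bytes
total_kib = total_bytes / 1024  # 1 KiB = 1024 bytes

print(f"{total_bytes:.0f} B = {total_kib:.1f} KiB")
```

Rounded to one decimal place, the result agrees with the reported total size.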

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

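Description: Status information on public hygiene businesses (beauty, barber, laundry, lodging, and sanitation management service businesses) in Yeongwol-gun, Gangwon
Author: 강원도 영월군 (Yeongwol-gun, Gangwon-do)
URL: https://www.data.go.kr/data/15007034/fileData.do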

Alerts

업종명 has constant value "미용업(일반)" (Constant)
연번 is highly overall correlated with 위생관리 등급 (High correlation)
위생관리 등급 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
업소명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 15:44:04.210350
Analysis finished: 2023-12-12 15:44:04.646389
Duration: 0.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 66
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33.5
Minimum: 1
Maximum: 66
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 726.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.25
Q1: 17.25
Median: 33.5
Q3: 49.75
95-th percentile: 62.75
Maximum: 66
Range: 65
Interquartile range (IQR): 32.5

Descriptive statistics

Standard deviation: 19.196354
Coefficient of variation (CV): 0.57302549
Kurtosis: -1.2
Mean: 33.5
Median Absolute Deviation (MAD): 16.5
Skewness: 0
Sum: 2211
Variance: 368.5
Monotonicity: Strictly increasing
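
Most of the quantile and descriptive statistics above follow directly if 연번 is the sequence 1 to 66, which the minimum, maximum, distinct count, and monotonicity all suggest. The sketch below recomputes them with Python's standard `statistics` module under that assumption; it is not how ydata-profiling itself computes them.

```python
import statistics

# Assumption: 연번 is the sequence 1..66, as the report's minimum,
# maximum, distinct count, and monotonicity suggest.
values = list(range(1, 67))

mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Linear-interpolation quantiles that include both endpoints.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentiles[0], twentiles[-1]

print(sum(values), mean, variance, round(std, 6), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

The `inclusive` method is chosen because it interpolates linearly between the minimum and maximum, which is what reproduces the reported percentiles for this column.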
[Figure: histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
1 | 1 | 1.5%
51 | 1 | 1.5%
37 | 1 | 1.5%
38 | 1 | 1.5%
39 | 1 | 1.5%
40 | 1 | 1.5%
41 | 1 | 1.5%
42 | 1 | 1.5%
43 | 1 | 1.5%
44 | 1 | 1.5%
Other values (56) | 56 | 84.8%
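Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.5%
2 | 1 | 1.5%
3 | 1 | 1.5%
4 | 1 | 1.5%
5 | 1 | 1.5%
6 | 1 | 1.5%
7 | 1 | 1.5%
8 | 1 | 1.5%
9 | 1 | 1.5%
10 | 1 | 1.5%
Maximum 10 values
Value | Count | Frequency (%)
66 | 1 | 1.5%
65 | 1 | 1.5%
64 | 1 | 1.5%
63 | 1 | 1.5%
62 | 1 | 1.5%
61 | 1 | 1.5%
60 | 1 | 1.5%
59 | 1 | 1.5%
58 | 1 | 1.5%
57 | 1 | 1.5%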

업종명 (business type)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B
미용업(일반) (beauty salon, general): 66

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 미용업(일반)
2nd row: 미용업(일반)
3rd row: 미용업(일반)
4th row: 미용업(일반)
5th row: 미용업(일반)

Common Values

Value | Count | Frequency (%)
미용업(일반) | 66 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
미용업(일반) | 66 | 100.0%

업소명 (business name)
Text

UNIQUE 

Distinct: 66
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

Length

Max length: 8
Median length: 7
Mean length: 5.1363636
Min length: 2

Characters and Unicode

Total characters: 339
Distinct characters: 112
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 66
Unique (%): 100.0%

Sample

1st row: 화신미용실
2nd row: 심미용실
3rd row: 이나은헤어
4th row: 지연미용실
5th row: 뉴스헤어라인
Value | Count | Frequency (%)
화신미용실 | 1 | 1.5%
꽃샘미용실 | 1 | 1.5%
선미헤어라인 | 1 | 1.5%
헤어짱 | 1 | 1.5%
영란헤어샵 | 1 | 1.5%
원헤어 | 1 | 1.5%
천일미용실 | 1 | 1.5%
중앙미용실 | 1 | 1.5%
수희미용실 | 1 | 1.5%
최아영헤어 | 1 | 1.5%
Other values (56) | 56 | 84.8%

Most occurring characters

Value | Count | Frequency (%)
 | 33 | 9.7%
 | 33 | 9.7%
 | 30 | 8.8%
 | 26 | 7.7%
 | 24 | 7.1%
 | 15 | 4.4%
 | 10 | 2.9%
 | 9 | 2.7%
 | 8 | 2.4%
 | 7 | 2.1%
Other values (102) | 144 | 42.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 339 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 33 | 9.7%
 | 33 | 9.7%
 | 30 | 8.8%
 | 26 | 7.7%
 | 24 | 7.1%
 | 15 | 4.4%
 | 10 | 2.9%
 | 9 | 2.7%
 | 8 | 2.4%
 | 7 | 2.1%
Other values (102) | 144 | 42.5%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 339 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 33 | 9.7%
 | 33 | 9.7%
 | 30 | 8.8%
 | 26 | 7.7%
 | 24 | 7.1%
 | 15 | 4.4%
 | 10 | 2.9%
 | 9 | 2.7%
 | 8 | 2.4%
 | 7 | 2.1%
Other values (102) | 144 | 42.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 339 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 33 | 9.7%
 | 33 | 9.7%
 | 30 | 8.8%
 | 26 | 7.7%
 | 24 | 7.1%
 | 15 | 4.4%
 | 10 | 2.9%
 | 9 | 2.7%
 | 8 | 2.4%
 | 7 | 2.1%
Other values (102) | 144 | 42.5%

업소소재지(지번) (business location, lot-number address)
Text

Distinct: 64
Distinct (%): 97.0%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

Length

Max length: 26
Median length: 25
Mean length: 23.393939
Min length: 20

Characters and Unicode

Total characters: 1544
Distinct characters: 46
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
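
As an illustration of these character properties, the sketch below classifies the characters of one sample address by Unicode general category using Python's standard `unicodedata` module. The address is taken from the report's sample rows with the leading space omitted. The module does not expose script or block properties, so only categories are shown, and this is not necessarily how ydata-profiling derives its counts.

```python
import unicodedata
from collections import Counter

# One address from the report's sample rows (leading space omitted).
address = "영월군 주천면 주천리 1228번지 4호"

# General category of each code point: Lo = Other Letter,
# Zs = Space Separator, Nd = Decimal Number.
categories = Counter(unicodedata.category(ch) for ch in address)

print(len(address), dict(categories))
```

The three category codes correspond to the three categories the report lists for this variable: Other Letter, Space Separator, and Decimal Number.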

Unique

Unique: 62
Unique (%): 93.9%

Sample

1st row: 영월군 주천면 주천리 1228번지 4호
2nd row: 영월군 영월읍 하송리 253번지 5호
3rd row: 영월군 영월읍 하송리 363번지 3호
4th row: 영월군 한반도면 쌍용리 609번지 2호
5th row: 영월군 영월읍 하송리 373번지 1호
Value | Count | Frequency (%)
영월군 | 66 | 20.2%
영월읍 | 51 | 15.6%
영흥리 | 31 | 9.5%
하송리 | 17 | 5.2%
1호 | 9 | 2.8%
5호 | 8 | 2.4%
947번지 | 7 | 2.1%
2호 | 7 | 2.1%
주천리 | 5 | 1.5%
주천면 | 5 | 1.5%
Other values (86) | 121 | 37.0%

Most occurring characters

Value | Count | Frequency (%)
(space) | 475 | 30.8%
 | 148 | 9.6%
 | 117 | 7.6%
 | 66 | 4.3%
 | 66 | 4.3%
 | 66 | 4.3%
 | 66 | 4.3%
 | 63 | 4.1%
 | 53 | 3.4%
1 | 47 | 3.0%
Other values (36) | 377 | 24.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 787 | 51.0%
Space Separator | 475 | 30.8%
Decimal Number | 282 | 18.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 148 | 18.8%
 | 117 | 14.9%
 | 66 | 8.4%
 | 66 | 8.4%
 | 66 | 8.4%
 | 66 | 8.4%
 | 63 | 8.0%
 | 53 | 6.7%
 | 31 | 3.9%
 | 17 | 2.2%
Other values (25) | 94 | 11.9%
Decimal Number
Value | Count | Frequency (%)
1 | 47 | 16.7%
9 | 38 | 13.5%
4 | 36 | 12.8%
5 | 32 | 11.3%
2 | 29 | 10.3%
6 | 26 | 9.2%
3 | 24 | 8.5%
8 | 22 | 7.8%
7 | 20 | 7.1%
0 | 8 | 2.8%
Space Separator
Value | Count | Frequency (%)
(space) | 475 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 787 | 51.0%
Common | 757 | 49.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 148 | 18.8%
 | 117 | 14.9%
 | 66 | 8.4%
 | 66 | 8.4%
 | 66 | 8.4%
 | 66 | 8.4%
 | 63 | 8.0%
 | 53 | 6.7%
 | 31 | 3.9%
 | 17 | 2.2%
Other values (25) | 94 | 11.9%
Common
Value | Count | Frequency (%)
(space) | 475 | 62.7%
1 | 47 | 6.2%
9 | 38 | 5.0%
4 | 36 | 4.8%
5 | 32 | 4.2%
2 | 29 | 3.8%
6 | 26 | 3.4%
3 | 24 | 3.2%
8 | 22 | 2.9%
7 | 20 | 2.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 787 | 51.0%
ASCII | 757 | 49.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 475 | 62.7%
1 | 47 | 6.2%
9 | 38 | 5.0%
4 | 36 | 4.8%
5 | 32 | 4.2%
2 | 29 | 3.8%
6 | 26 | 3.4%
3 | 24 | 3.2%
8 | 22 | 2.9%
7 | 20 | 2.6%
Hangul
Value | Count | Frequency (%)
 | 148 | 18.8%
 | 117 | 14.9%
 | 66 | 8.4%
 | 66 | 8.4%
 | 66 | 8.4%
 | 66 | 8.4%
 | 63 | 8.0%
 | 53 | 6.7%
 | 31 | 3.9%
 | 17 | 2.2%
Other values (25) | 94 | 11.9%

위생관리 등급 (hygiene management grade)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B
황색(우수) (yellow, excellent): 31
녹색(최우수) (green, best): 24
백색(일반관리) (white, general management): 11

Length

Max length: 8
Median length: 7
Mean length: 6.6969697
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 녹색(최우수)
2nd row: 녹색(최우수)
3rd row: 녹색(최우수)
4th row: 녹색(최우수)
5th row: 녹색(최우수)

Common Values

Value | Count | Frequency (%)
황색(우수) | 31 | 47.0%
녹색(최우수) | 24 | 36.4%
백색(일반관리) | 11 | 16.7%
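
The frequency column is each count divided by the 66 observations. A short Python sketch that recomputes the percentages from the counts above (the variable names are illustrative):

```python
from collections import Counter

# Counts copied from the Common Values table above.
grade_counts = Counter({"황색(우수)": 31, "녹색(최우수)": 24, "백색(일반관리)": 11})
total = sum(grade_counts.values())

# Share of each grade, formatted to one decimal place like the report.
shares = {
    grade: f"{100 * n / total:.1f}%"
    for grade, n in grade_counts.most_common()
}
print(total, shares)
```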

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
황색(우수) | 31 | 47.0%
녹색(최우수) | 24 | 36.4%
백색(일반관리) | 11 | 16.7%

Interactions

[Figure: interactions plot]

Correlations

Variable | 연번 | 업소명 | 업소소재지(지번) | 위생관리 등급
연번 | 1.000 | 1.000 | 0.887 | 0.938
업소명 | 1.000 | 1.000 | 1.000 | 1.000
업소소재지(지번) | 0.887 | 1.000 | 1.000 | 0.929
위생관리 등급 | 0.938 | 1.000 | 0.929 | 1.000

Variable | 연번 | 위생관리 등급
연번 | 1.000 | 0.860
위생관리 등급 | 0.860 | 1.000
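
One plausible reason for the strong association between 연번 and 위생관리 등급 is that the file is sorted by grade: the first rows shown in the report are all 녹색(최우수) and the last rows are all 백색(일반관리). The sketch below assumes the rows form three contiguous blocks in the order green, yellow, white, with the counts reported for each grade, and computes a correlation ratio (eta) for that layout. Both the block layout and the choice of metric are assumptions; the report does not state which correlation measure it uses, so the result is not expected to reproduce the coefficients above.

```python
# Assumption (not confirmed by the report): rows are sorted by grade, so
# 연번 1-24 are green, 25-55 yellow, and 56-66 white.
groups = {
    "녹색(최우수)": list(range(1, 25)),
    "황색(우수)": list(range(25, 56)),
    "백색(일반관리)": list(range(56, 67)),
}

all_values = [v for vs in groups.values() for v in vs]
grand_mean = sum(all_values) / len(all_values)

# Correlation ratio: share of total variation explained by the groups.
ss_between = sum(
    len(vs) * (sum(vs) / len(vs) - grand_mean) ** 2 for vs in groups.values()
)
ss_total = sum((v - grand_mean) ** 2 for v in all_values)
eta = (ss_between / ss_total) ** 0.5

print(round(eta, 3))
```

Under these assumptions the serial number carries almost all of the grade information, which suggests the alert reflects row ordering rather than a meaningful relationship.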

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 업종명 | 업소명 | 업소소재지(지번) | 위생관리 등급
0 | 1 | 미용업(일반) | 화신미용실 | 영월군 주천면 주천리 1228번지 4호 | 녹색(최우수)
1 | 2 | 미용업(일반) | 심미용실 | 영월군 영월읍 하송리 253번지 5호 | 녹색(최우수)
2 | 3 | 미용업(일반) | 이나은헤어 | 영월군 영월읍 하송리 363번지 3호 | 녹색(최우수)
3 | 4 | 미용업(일반) | 지연미용실 | 영월군 한반도면 쌍용리 609번지 2호 | 녹색(최우수)
4 | 5 | 미용업(일반) | 뉴스헤어라인 | 영월군 영월읍 하송리 373번지 1호 | 녹색(최우수)
5 | 6 | 미용업(일반) | 댕기머리 | 영월군 주천면 주천리 1256번지 10호 | 녹색(최우수)
6 | 7 | 미용업(일반) | 미화 | 영월군 영월읍 영흥리 947번지 58호 | 녹색(최우수)
7 | 8 | 미용업(일반) | 미앤미헤어샵 | 영월군 영월읍 영흥리 936번지 | 녹색(최우수)
8 | 9 | 미용업(일반) | 승미네미용실 | 영월군 영월읍 하송리 389번지 1호 | 녹색(최우수)
9 | 10 | 미용업(일반) | 은하헤어 | 영월군 북면 마차리 1156번지 19호 | 녹색(최우수)

Last rows
Index | 연번 | 업종명 | 업소명 | 업소소재지(지번) | 위생관리 등급
56 | 57 | 미용업(일반) | 용헤어샵 | 영월군 영월읍 영흥리 913번지 5호 | 백색(일반관리)
57 | 58 | 미용업(일반) | 가위소리미용실 | 영월군 영월읍 영흥리 947번지 27호 | 백색(일반관리)
58 | 59 | 미용업(일반) | 샬롬미용실 | 영월군 영월읍 영흥리 996번지 1호 | 백색(일반관리)
59 | 60 | 미용업(일반) | 까까머리 | 영월군 영월읍 영흥리 944번지 8호 | 백색(일반관리)
60 | 61 | 미용업(일반) | 샬롬헤어프라자 | 영월군 영월읍 덕포리 574번지 9호 | 백색(일반관리)
61 | 62 | 미용업(일반) | 오드리헤어샵 | 영월군 영월읍 영흥리 954번지 9호 | 백색(일반관리)
62 | 63 | 미용업(일반) | 찬찬헤어 | 영월군 영월읍 영흥리 928번지 9호 | 백색(일반관리)
63 | 64 | 미용업(일반) | 우리머리방 | 영월군 상동읍 내덕리 656번지 50호 | 백색(일반관리)
64 | 65 | 미용업(일반) | 선미헤어라인 | 영월군 영월읍 영흥리 971번지 23호 | 백색(일반관리)
65 | 66 | 미용업(일반) | 준헤어샵 | 영월군 영월읍 영흥리 801번지 7호 | 백색(일반관리)