Overview

Dataset statistics

Number of variables4
Number of observations60
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory35.2 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description전라북도 고창군에 위치한 숙박업소의 위생 관리 등급 현황입니다. 업소명, 위생등급, 소재지 도로명주소에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15055391/fileData.do

Alerts

연번 is highly overall correlated with 등급High correlation
등급 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:08:29.867017
Analysis finished2023-12-12 23:08:30.263511
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct60
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.5
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2023-12-13T08:08:30.333008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.95
Q115.75
median30.5
Q345.25
95-th percentile57.05
Maximum60
Range59
Interquartile range (IQR)29.5

Descriptive statistics

Standard deviation17.464249
Coefficient of variation (CV)0.57259833
Kurtosis-1.2
Mean30.5
Median Absolute Deviation (MAD)15
Skewness0
Sum1830
Variance305
MonotonicityStrictly increasing
2023-12-13T08:08:30.472103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.7%
32 1
 
1.7%
34 1
 
1.7%
35 1
 
1.7%
36 1
 
1.7%
37 1
 
1.7%
38 1
 
1.7%
39 1
 
1.7%
40 1
 
1.7%
41 1
 
1.7%
Other values (50) 50
83.3%
ValueCountFrequency (%)
1 1
1.7%
2 1
1.7%
3 1
1.7%
4 1
1.7%
5 1
1.7%
6 1
1.7%
7 1
1.7%
8 1
1.7%
9 1
1.7%
10 1
1.7%
ValueCountFrequency (%)
60 1
1.7%
59 1
1.7%
58 1
1.7%
57 1
1.7%
56 1
1.7%
55 1
1.7%
54 1
1.7%
53 1
1.7%
52 1
1.7%
51 1
1.7%

업소명
Text

UNIQUE 

Distinct60
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
2023-12-13T08:08:30.696000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length6.6333333
Min length3

Characters and Unicode

Total characters398
Distinct characters105
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)100.0%

Sample

1st row뉴프린스관광모텔
2nd row모양성호텔
3rd row강선달힐링센터
4th row파머스빌리지
5th row탑모텔
ValueCountFrequency (%)
보다더펜션 4
 
5.9%
여관 3
 
4.4%
뉴프린스관광모텔 1
 
1.5%
서해안모텔 1
 
1.5%
선운산호텔 1
 
1.5%
금오장여관 1
 
1.5%
리온 1
 
1.5%
도담 1
 
1.5%
늘봄 1
 
1.5%
가람 1
 
1.5%
Other values (53) 53
77.9%
2023-12-13T08:08:31.145798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
5.5%
21
 
5.3%
20
 
5.0%
18
 
4.5%
18
 
4.5%
18
 
4.5%
18
 
4.5%
17
 
4.3%
17
 
4.3%
15
 
3.8%
Other values (95) 214
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 365
91.7%
Decimal Number 25
 
6.3%
Space Separator 8
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
6.0%
21
 
5.8%
20
 
5.5%
18
 
4.9%
18
 
4.9%
18
 
4.9%
18
 
4.9%
17
 
4.7%
17
 
4.7%
15
 
4.1%
Other values (84) 181
49.6%
Decimal Number
ValueCountFrequency (%)
1 10
40.0%
2 2
 
8.0%
6 2
 
8.0%
7 2
 
8.0%
5 2
 
8.0%
4 2
 
8.0%
3 2
 
8.0%
8 1
 
4.0%
9 1
 
4.0%
0 1
 
4.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 365
91.7%
Common 33
 
8.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
6.0%
21
 
5.8%
20
 
5.5%
18
 
4.9%
18
 
4.9%
18
 
4.9%
18
 
4.9%
17
 
4.7%
17
 
4.7%
15
 
4.1%
Other values (84) 181
49.6%
Common
ValueCountFrequency (%)
1 10
30.3%
8
24.2%
2 2
 
6.1%
6 2
 
6.1%
7 2
 
6.1%
5 2
 
6.1%
4 2
 
6.1%
3 2
 
6.1%
8 1
 
3.0%
9 1
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 365
91.7%
ASCII 33
 
8.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
6.0%
21
 
5.8%
20
 
5.5%
18
 
4.9%
18
 
4.9%
18
 
4.9%
18
 
4.9%
17
 
4.7%
17
 
4.7%
15
 
4.1%
Other values (84) 181
49.6%
ASCII
ValueCountFrequency (%)
1 10
30.3%
8
24.2%
2 2
 
6.1%
6 2
 
6.1%
7 2
 
6.1%
5 2
 
6.1%
4 2
 
6.1%
3 2
 
6.1%
8 1
 
3.0%
9 1
 
3.0%

등급
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
최우수
29 
우수
23 
일반

Length

Max length3
Median length2
Mean length2.4833333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row최우수
2nd row최우수
3rd row최우수
4th row최우수
5th row최우수

Common Values

ValueCountFrequency (%)
최우수 29
48.3%
우수 23
38.3%
일반 8
 
13.3%

Length

2023-12-13T08:08:31.273334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:31.371019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
최우수 29
48.3%
우수 23
38.3%
일반 8
 
13.3%
Distinct56
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size612.0 B
2023-12-13T08:08:31.593759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length21.766667
Min length18

Characters and Unicode

Total characters1306
Distinct characters80
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)90.0%

Sample

1st row전라북도 고창군 고창읍 성산8길 4
2nd row전라북도 고창군 고창읍 중앙로 331
3rd row전라북도 고창군 상하면 구시포해변길 160-3
4th row전라북도 고창군 상하면 상하농원길 6-36
5th row전라북도 고창군 고창읍 보릿골로 125
ValueCountFrequency (%)
전라북도 60
20.0%
고창군 60
20.0%
고창읍 36
 
12.0%
석정2로 17
 
5.7%
흥덕면 5
 
1.7%
아산면 4
 
1.3%
부안면 4
 
1.3%
복분자로 4
 
1.3%
513 4
 
1.3%
상하면 4
 
1.3%
Other values (86) 102
34.0%
2023-12-13T08:08:31.983104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
240
18.4%
97
 
7.4%
96
 
7.4%
60
 
4.6%
60
 
4.6%
60
 
4.6%
60
 
4.6%
60
 
4.6%
2 55
 
4.2%
40
 
3.1%
Other values (70) 478
36.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 812
62.2%
Space Separator 240
 
18.4%
Decimal Number 228
 
17.5%
Dash Punctuation 26
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
97
11.9%
96
11.8%
60
 
7.4%
60
 
7.4%
60
 
7.4%
60
 
7.4%
60
 
7.4%
40
 
4.9%
36
 
4.4%
24
 
3.0%
Other values (58) 219
27.0%
Decimal Number
ValueCountFrequency (%)
2 55
24.1%
1 37
16.2%
7 28
12.3%
3 28
12.3%
0 26
11.4%
4 17
 
7.5%
5 11
 
4.8%
6 11
 
4.8%
8 10
 
4.4%
9 5
 
2.2%
Space Separator
ValueCountFrequency (%)
240
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 812
62.2%
Common 494
37.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
97
11.9%
96
11.8%
60
 
7.4%
60
 
7.4%
60
 
7.4%
60
 
7.4%
60
 
7.4%
40
 
4.9%
36
 
4.4%
24
 
3.0%
Other values (58) 219
27.0%
Common
ValueCountFrequency (%)
240
48.6%
2 55
 
11.1%
1 37
 
7.5%
7 28
 
5.7%
3 28
 
5.7%
- 26
 
5.3%
0 26
 
5.3%
4 17
 
3.4%
5 11
 
2.2%
6 11
 
2.2%
Other values (2) 15
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 812
62.2%
ASCII 494
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
240
48.6%
2 55
 
11.1%
1 37
 
7.5%
7 28
 
5.7%
3 28
 
5.7%
- 26
 
5.3%
0 26
 
5.3%
4 17
 
3.4%
5 11
 
2.2%
6 11
 
2.2%
Other values (2) 15
 
3.0%
Hangul
ValueCountFrequency (%)
97
11.9%
96
11.8%
60
 
7.4%
60
 
7.4%
60
 
7.4%
60
 
7.4%
60
 
7.4%
40
 
4.9%
36
 
4.4%
24
 
3.0%
Other values (58) 219
27.0%

Interactions

2023-12-13T08:08:30.063742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:08:32.059025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업소명등급소재지도로명주소
연번1.0001.0000.9330.987
업소명1.0001.0001.0001.000
등급0.9331.0001.0001.000
소재지도로명주소0.9871.0001.0001.000
2023-12-13T08:08:32.131843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등급
연번1.0000.853
등급0.8531.000

Missing values

2023-12-13T08:08:30.161198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:08:30.233629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명등급소재지도로명주소
01뉴프린스관광모텔최우수전라북도 고창군 고창읍 성산8길 4
12모양성호텔최우수전라북도 고창군 고창읍 중앙로 331
23강선달힐링센터최우수전라북도 고창군 상하면 구시포해변길 160-3
34파머스빌리지최우수전라북도 고창군 상하면 상하농원길 6-36
45탑모텔최우수전라북도 고창군 고창읍 보릿골로 125
56아리랑호텔최우수전라북도 고창군 고창읍 월곡6길 6
67그린파크장여관최우수전라북도 고창군 고창읍 화신1길 23
78호텔석정힐최우수전라북도 고창군 고창읍 방장로 12
89석정힐링카운티14동최우수전라북도 고창군 고창읍 석정2로 207-13
910석정힐링카운티9동최우수전라북도 고창군 고창읍 석정2로 207-46
연번업소명등급소재지도로명주소
5051에덴파크여관우수전라북도 고창군 성내면 선운대로 3968
5152송악모텔우수전라북도 고창군 아산면 선운사로 100
5253신영장 여관일반전라북도 고창군 고창읍 중앙로 201
5354대산모텔일반전라북도 고창군 대산면 대산길 6
5455월드모텔일반전라북도 고창군 대산면 대성로 221
5556한양여관일반전라북도 고창군 고창읍 남정10길 17
5657평화장여인숙일반전라북도 고창군 고창읍 남정10길 20
5758나주여인숙일반전라북도 고창군 고창읍 남정7길 3
5859성덕여인숙일반전라북도 고창군 고창읍 남정6길 30-2
5960은성여인숙일반전라북도 고창군 고창읍 동리로 62-12