Overview

Dataset statistics

Number of variables4
Number of observations115
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory34.1 B

Variable types

Categorical1
Text2
Numeric1

Dataset

Description경상북도 관광펜션업 등록현황에 대한 데이터로시도명, 시군명, 시설명, 주소, 객실수 등의 정보가 포함되어 있습니다.
Author경상북도
URLhttps://www.data.go.kr/data/15063106/fileData.do

Alerts

관광펜션명 has unique valuesUnique

Reproduction

Analysis started2023-12-23 07:40:18.217030
Analysis finished2023-12-23 07:40:20.846079
Duration2.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구
Categorical

Distinct13
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
경주시
29 
포항시
18 
문경시
18 
영덕군
14 
울릉군
11 
Other values (8)
25 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique3 ?
Unique (%)2.6%

Sample

1st row포항시
2nd row포항시
3rd row포항시
4th row포항시
5th row포항시

Common Values

ValueCountFrequency (%)
경주시 29
25.2%
포항시 18
15.7%
문경시 18
15.7%
영덕군 14
12.2%
울릉군 11
 
9.6%
청도군 7
 
6.1%
칠곡군 6
 
5.2%
울진군 4
 
3.5%
안동시 3
 
2.6%
영주시 2
 
1.7%
Other values (3) 3
 
2.6%

Length

2023-12-23T07:40:21.285808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경주시 29
25.2%
포항시 18
15.7%
문경시 18
15.7%
영덕군 14
12.2%
울릉군 11
 
9.6%
청도군 7
 
6.1%
칠곡군 6
 
5.2%
울진군 4
 
3.5%
안동시 3
 
2.6%
영주시 2
 
1.7%
Other values (3) 3
 
2.6%

관광펜션명
Text

UNIQUE 

Distinct115
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-23T07:40:22.604245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length12
Mean length6.9478261
Min length2

Characters and Unicode

Total characters799
Distinct characters217
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)100.0%

Sample

1st row라메르펜션리조트
2nd row네이처 풀빌라
3rd row씨 캐슬
4th row다모디
5th row송라 아쿠아
ValueCountFrequency (%)
펜션 7
 
3.9%
관광펜션 5
 
2.8%
풀빌라 5
 
2.8%
프레젠트 4
 
2.2%
마리벨317 3
 
1.7%
비클래시 3
 
1.7%
칠곡왜관점 3
 
1.7%
3
 
1.7%
경주 3
 
1.7%
포항 3
 
1.7%
Other values (135) 140
78.2%
2023-12-23T07:40:24.338860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
64
 
8.0%
47
 
5.9%
44
 
5.5%
22
 
2.8%
20
 
2.5%
15
 
1.9%
15
 
1.9%
15
 
1.9%
13
 
1.6%
12
 
1.5%
Other values (207) 532
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 646
80.9%
Space Separator 64
 
8.0%
Uppercase Letter 46
 
5.8%
Decimal Number 29
 
3.6%
Open Punctuation 7
 
0.9%
Close Punctuation 7
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
7.3%
44
 
6.8%
22
 
3.4%
20
 
3.1%
15
 
2.3%
15
 
2.3%
15
 
2.3%
13
 
2.0%
12
 
1.9%
12
 
1.9%
Other values (182) 431
66.7%
Uppercase Letter
ValueCountFrequency (%)
I 5
10.9%
E 5
10.9%
A 4
 
8.7%
S 4
 
8.7%
L 4
 
8.7%
V 4
 
8.7%
D 2
 
4.3%
O 2
 
4.3%
T 2
 
4.3%
H 2
 
4.3%
Other values (6) 12
26.1%
Decimal Number
ValueCountFrequency (%)
1 9
31.0%
0 6
20.7%
7 6
20.7%
3 5
17.2%
2 2
 
6.9%
9 1
 
3.4%
Space Separator
ValueCountFrequency (%)
64
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 646
80.9%
Common 107
 
13.4%
Latin 46
 
5.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
7.3%
44
 
6.8%
22
 
3.4%
20
 
3.1%
15
 
2.3%
15
 
2.3%
15
 
2.3%
13
 
2.0%
12
 
1.9%
12
 
1.9%
Other values (182) 431
66.7%
Latin
ValueCountFrequency (%)
I 5
10.9%
E 5
10.9%
A 4
 
8.7%
S 4
 
8.7%
L 4
 
8.7%
V 4
 
8.7%
D 2
 
4.3%
O 2
 
4.3%
T 2
 
4.3%
H 2
 
4.3%
Other values (6) 12
26.1%
Common
ValueCountFrequency (%)
64
59.8%
1 9
 
8.4%
( 7
 
6.5%
) 7
 
6.5%
0 6
 
5.6%
7 6
 
5.6%
3 5
 
4.7%
2 2
 
1.9%
9 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 646
80.9%
ASCII 153
 
19.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
64
41.8%
1 9
 
5.9%
( 7
 
4.6%
) 7
 
4.6%
0 6
 
3.9%
7 6
 
3.9%
3 5
 
3.3%
I 5
 
3.3%
E 5
 
3.3%
A 4
 
2.6%
Other values (15) 35
22.9%
Hangul
ValueCountFrequency (%)
47
 
7.3%
44
 
6.8%
22
 
3.4%
20
 
3.1%
15
 
2.3%
15
 
2.3%
15
 
2.3%
13
 
2.0%
12
 
1.9%
12
 
1.9%
Other values (182) 431
66.7%

객실수
Real number (ℝ)

Distinct18
Distinct (%)15.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5130435
Minimum1
Maximum27
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-23T07:40:25.266774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q36
95-th percentile15
Maximum27
Range26
Interquartile range (IQR)5

Descriptive statistics

Standard deviation5.1560988
Coefficient of variation (CV)1.1424882
Kurtosis5.8634383
Mean4.5130435
Median Absolute Deviation (MAD)2
Skewness2.2916858
Sum519
Variance26.585355
MonotonicityNot monotonic
2023-12-23T07:40:25.966879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1 42
36.5%
3 17
14.8%
2 12
 
10.4%
5 7
 
6.1%
4 7
 
6.1%
6 7
 
6.1%
7 5
 
4.3%
15 4
 
3.5%
8 3
 
2.6%
12 2
 
1.7%
Other values (8) 9
 
7.8%
ValueCountFrequency (%)
1 42
36.5%
2 12
 
10.4%
3 17
14.8%
4 7
 
6.1%
5 7
 
6.1%
6 7
 
6.1%
7 5
 
4.3%
8 3
 
2.6%
9 1
 
0.9%
11 1
 
0.9%
ValueCountFrequency (%)
27 2
1.7%
19 1
 
0.9%
18 1
 
0.9%
16 1
 
0.9%
15 4
3.5%
14 1
 
0.9%
13 1
 
0.9%
12 2
1.7%
11 1
 
0.9%
9 1
 
0.9%

주소
Text

Distinct112
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-23T07:40:27.106782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length22.652174
Min length11

Characters and Unicode

Total characters2605
Distinct characters162
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)95.7%

Sample

1st row경상북도 포항시 남구 호미곶면 일출로568번길 19-16
2nd row경상북도 포항시 북구 청하면 해안로 2092
3rd row경상북도 포항시 북구 청하면 해안로2000번길 3
4th row경상북도 포항시 남구 호미곶면 관암일출길 29
5th row경상북도 포항시 북구 송라면 보경로196번길 41-84
ValueCountFrequency (%)
경상북도 105
 
17.7%
경주시 29
 
4.9%
포항시 18
 
3.0%
문경시 18
 
3.0%
영덕군 14
 
2.4%
울릉군 11
 
1.9%
남구 9
 
1.5%
북구 9
 
1.5%
울릉읍 8
 
1.4%
청도군 7
 
1.2%
Other values (243) 364
61.5%
2023-12-23T07:40:28.994408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
478
 
18.3%
161
 
6.2%
118
 
4.5%
115
 
4.4%
109
 
4.2%
1 88
 
3.4%
75
 
2.9%
2 71
 
2.7%
68
 
2.6%
59
 
2.3%
Other values (152) 1263
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1576
60.5%
Space Separator 478
 
18.3%
Decimal Number 437
 
16.8%
Dash Punctuation 50
 
1.9%
Close Punctuation 23
 
0.9%
Open Punctuation 23
 
0.9%
Other Punctuation 14
 
0.5%
Uppercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
161
 
10.2%
118
 
7.5%
115
 
7.3%
109
 
6.9%
75
 
4.8%
68
 
4.3%
59
 
3.7%
59
 
3.7%
57
 
3.6%
38
 
2.4%
Other values (133) 717
45.5%
Decimal Number
ValueCountFrequency (%)
1 88
20.1%
2 71
16.2%
6 46
10.5%
3 46
10.5%
4 43
9.8%
9 35
 
8.0%
5 34
 
7.8%
8 32
 
7.3%
0 22
 
5.0%
7 20
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
D 1
25.0%
C 1
25.0%
A 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
478
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Other Punctuation
ValueCountFrequency (%)
, 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1576
60.5%
Common 1025
39.3%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
161
 
10.2%
118
 
7.5%
115
 
7.3%
109
 
6.9%
75
 
4.8%
68
 
4.3%
59
 
3.7%
59
 
3.7%
57
 
3.6%
38
 
2.4%
Other values (133) 717
45.5%
Common
ValueCountFrequency (%)
478
46.6%
1 88
 
8.6%
2 71
 
6.9%
- 50
 
4.9%
6 46
 
4.5%
3 46
 
4.5%
4 43
 
4.2%
9 35
 
3.4%
5 34
 
3.3%
8 32
 
3.1%
Other values (5) 102
 
10.0%
Latin
ValueCountFrequency (%)
D 1
25.0%
C 1
25.0%
A 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1576
60.5%
ASCII 1029
39.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
478
46.5%
1 88
 
8.6%
2 71
 
6.9%
- 50
 
4.9%
6 46
 
4.5%
3 46
 
4.5%
4 43
 
4.2%
9 35
 
3.4%
5 34
 
3.3%
8 32
 
3.1%
Other values (9) 106
 
10.3%
Hangul
ValueCountFrequency (%)
161
 
10.2%
118
 
7.5%
115
 
7.3%
109
 
6.9%
75
 
4.8%
68
 
4.3%
59
 
3.7%
59
 
3.7%
57
 
3.6%
38
 
2.4%
Other values (133) 717
45.5%

Interactions

2023-12-23T07:40:19.606100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-23T07:40:29.448735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구객실수
시군구1.0000.605
객실수0.6051.000
2023-12-23T07:40:29.961736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
객실수시군구
객실수1.0000.332
시군구0.3321.000

Missing values

2023-12-23T07:40:20.400002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T07:40:20.746119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구관광펜션명객실수주소
0포항시라메르펜션리조트14경상북도 포항시 남구 호미곶면 일출로568번길 19-16
1포항시네이처 풀빌라8경상북도 포항시 북구 청하면 해안로 2092
2포항시씨 캐슬9경상북도 포항시 북구 청하면 해안로2000번길 3
3포항시다모디5경상북도 포항시 남구 호미곶면 관암일출길 29
4포항시송라 아쿠아8경상북도 포항시 북구 송라면 보경로196번길 41-84
5포항시보드레펜션4경상북도 포항시 남구 장기면 동해안로 2788
6포항시190워터프런트27경상북도 포항시 남구 장기면 동해안로 3952-42
7포항시더 헤이븐11경상북도 포항시 남구 장기면 동해안로 3952-44
8포항시슬로우오션 풀빌라 펜션8경상북도 포항시 북구 송라면 동해대로3218번길 39
9포항시케렌시아2경상북도 포항시 남구 호미곶면 호미로 1504-7
시군구관광펜션명객실수주소
105울릉군(주)울릉드림1경상북도 울릉군 울릉읍 사동2길 190
106울릉군해오름관광펜션1경상북도 울릉군 울릉읍 도동1길 35-10, 해오름펜션
107울릉군나리관광펜션1경상북도 울릉군 북면 천부3길 217-37
108울릉군쉐르빌관광펜션1경상북도 울릉군 울릉읍 울릉순환로 590-9
109울릉군추억관광펜션1경상북도 울릉군 북면 울릉순환로 2626 (현포해양박물관)
110울릉군하얀고래펜션1경상북도 울릉군 울릉읍 간령길 81-2
111울릉군그때그기펜션1경상북도 울릉군 울릉읍 간령길 93-9
112울릉군진미펜션1경상북도 울릉군 울릉읍 울릉순환로 538
113울릉군섬지기펜션1경상북도 울릉군 울릉읍 봉래1길 19-32, 1동 3층
114울릉군아라펜션1경상북도 울릉군 울릉읍 간령길 119