Overview

Dataset statistics

Number of variables: 4
Number of observations: 104
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.4 KiB
Average record size in memory: 33.3 B

Variable types

Categorical: 2
Text: 2

Dataset

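Description: Provides data on the status of tourist pension businesses (관광펜션업) located in Gangwon Special Self-Governing Province (강원특별자치도). Provided fields: 시군구명 (city/county/district name), 사업장명 (business name), 도로명전체주소 (full road-name address), 문화체육업종 (culture and sports business category).
URL: https://www.data.go.kr/data/3045495/fileData.do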

Alerts

문화체육업종명 has constant value "관광펜션업" (Constant)
사업장명 has unique values (Unique)
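
The two alerts above can be illustrated with a minimal sketch. It assumes the simplest possible rule: a column is "Constant" when it has exactly one distinct value and "Unique" when every value is distinct. The helper name `column_alerts` and the `rows` list are hypothetical; the rows are copied from the Sample section of this report, and the check is not claimed to be how ydata-profiling itself decides.

```python
# Hypothetical helper: flags constant and all-unique columns.
# Rows are taken from the Sample section of this report (indices 0, 1, 2, 6, 7).
rows = [
    {"시군구명": "춘천시", "사업장명": "아이플러스 펜션", "문화체육업종명": "관광펜션업"},
    {"시군구명": "춘천시", "사업장명": "2020스파엔풀빌라", "문화체육업종명": "관광펜션업"},
    {"시군구명": "춘천시", "사업장명": "리버스토리 1210", "문화체육업종명": "관광펜션업"},
    {"시군구명": "원주시", "사업장명": "자작나무숲펜션", "문화체육업종명": "관광펜션업"},
    {"시군구명": "원주시", "사업장명": "또아리하우스", "문화체육업종명": "관광펜션업"},
]


def column_alerts(rows):
    """Return {column: alert} for constant and all-unique columns."""
    alerts = {}
    for col in rows[0]:
        values = [row[col] for row in rows]
        distinct = set(values)
        if len(distinct) == 1:
            alerts[col] = "Constant"   # one distinct value across all rows
        elif len(distinct) == len(values):
            alerts[col] = "Unique"     # no value repeats
    return alerts


alerts = column_alerts(rows)
print(alerts)
```

On these five rows 시군구명 gets no alert because it has two distinct values, neither constant nor fully unique.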

Reproduction

Analysis started: 2023-12-12 18:14:20.539853
Analysis finished: 2023-12-12 18:14:20.934620
Duration: 0.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군구명
Categorical

Distinct: 16
Distinct (%): 15.4%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Value | Count
홍천군 | 29
양양군 | 14
강릉시 | 12
평창군 | 8
고성군 | 7
Other values (11) | 34

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 2
Unique (%): 1.9%

Sample

1st row: 춘천시
2nd row: 춘천시
3rd row: 춘천시
4th row: 춘천시
5th row: 춘천시

Common Values

Value | Count | Frequency (%)
홍천군 | 29 | 27.9%
양양군 | 14 | 13.5%
강릉시 | 12 | 11.5%
평창군 | 8 | 7.7%
고성군 | 7 | 6.7%
춘천시 | 6 | 5.8%
원주시 | 4 | 3.8%
삼척시 | 4 | 3.8%
정선군 | 4 | 3.8%
철원군 | 4 | 3.8%
Other values (6) | 12 | 11.5%
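
The frequency column is each count divided by the 104 observations. A minimal sketch of that arithmetic, assuming the counts listed above and rounding to one decimal place (the names `counts` and `freq` are illustrative only):

```python
# Counts copied from the Common Values table for 시군구명.
counts = {
    "홍천군": 29, "양양군": 14, "강릉시": 12, "평창군": 8, "고성군": 7,
    "춘천시": 6, "원주시": 4, "삼척시": 4, "정선군": 4, "철원군": 4,
    "Other values (6)": 12,
}

total = sum(counts.values())  # should equal the number of observations

# Percentage share of each value, rounded to one decimal place.
freq = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in freq.items():
    print(f"{value}: {counts[value]} ({pct}%)")
```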

Length

[Figure: histogram of lengths of the category]

사업장명
Text

UNIQUE 

Distinct: 104
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 15
Median length: 12
Mean length: 6.5
Min length: 2

Characters and Unicode

Total characters: 676
Distinct characters: 201
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
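
As a rough illustration of such character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The five names are the sample rows of this variable; the helper name `category_counts` is hypothetical, and this is not claimed to be the profiler's own implementation. `Lo` corresponds to Other Letter, `Zs` to Space Separator, and `Nd` to Decimal Number.

```python
import unicodedata
from collections import Counter

# The five sample values of 사업장명 shown in this report.
names = [
    "아이플러스 펜션",
    "2020스파엔풀빌라",
    "리버스토리 1210",
    "썸원스페이지",
    "포레스트문",
]


def category_counts(values):
    """Count characters per Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(names)
print(counts)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category tables for this variable.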

Unique

Unique: 104
Unique (%): 100.0%

Sample

1st row: 아이플러스 펜션
2nd row: 2020스파엔풀빌라
3rd row: 리버스토리 1210
4th row: 썸원스페이지
5th row: 포레스트문

Value | Count | Frequency (%)
펜션 | 7 | 4.8%
관광펜션 | 6 | 4.1%
토리아이풀빌라 | 5 | 3.4%
오캄럭스클럽 | 4 | 2.7%
빌리지 | 2 | 1.4%
바다에서 | 2 | 1.4%
떠오르는 | 2 | 1.4%
1 | 2 | 1.4%
풀빌라 | 2 | 1.4%
라포레 | 2 | 1.4%
Other values (111) | 113 | 76.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 44 | 6.5%
 | 40 | 5.9%
 | 40 | 5.9%
 | 32 | 4.7%
 | 29 | 4.3%
 | 20 | 3.0%
 | 14 | 2.1%
 | 14 | 2.1%
 | 13 | 1.9%
 | 13 | 1.9%
Other values (191) | 417 | 61.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 588 | 87.0%
Space Separator | 44 | 6.5%
Decimal Number | 33 | 4.9%
Uppercase Letter | 3 | 0.4%
Close Punctuation | 2 | 0.3%
Open Punctuation | 2 | 0.3%
Letter Number | 1 | 0.1%
Other Punctuation | 1 | 0.1%
Dash Punctuation | 1 | 0.1%
Other Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 40 | 6.8%
 | 40 | 6.8%
 | 32 | 5.4%
 | 29 | 4.9%
 | 20 | 3.4%
 | 14 | 2.4%
 | 14 | 2.4%
 | 13 | 2.2%
 | 13 | 2.2%
 | 11 | 1.9%
Other values (172) | 362 | 61.6%

Decimal Number
Value | Count | Frequency (%)
1 | 9 | 27.3%
2 | 7 | 21.2%
0 | 4 | 12.1%
3 | 3 | 9.1%
8 | 3 | 9.1%
9 | 2 | 6.1%
6 | 2 | 6.1%
7 | 2 | 6.1%
4 | 1 | 3.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 33.3%
A | 1 | 33.3%
H | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 44 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Letter Number
Value | Count | Frequency (%)
 | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Other Symbol
Value | Count | Frequency (%)
 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 589 | 87.1%
Common | 83 | 12.3%
Latin | 4 | 0.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 40 | 6.8%
 | 40 | 6.8%
 | 32 | 5.4%
 | 29 | 4.9%
 | 20 | 3.4%
 | 14 | 2.4%
 | 14 | 2.4%
 | 13 | 2.2%
 | 13 | 2.2%
 | 11 | 1.9%
Other values (173) | 363 | 61.6%

Common
Value | Count | Frequency (%)
(space) | 44 | 53.0%
1 | 9 | 10.8%
2 | 7 | 8.4%
0 | 4 | 4.8%
3 | 3 | 3.6%
8 | 3 | 3.6%
) | 2 | 2.4%
( | 2 | 2.4%
9 | 2 | 2.4%
6 | 2 | 2.4%
Other values (4) | 5 | 6.0%

Latin
Value | Count | Frequency (%)
B | 1 | 25.0%
 | 1 | 25.0%
A | 1 | 25.0%
H | 1 | 25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 588 | 87.0%
ASCII | 86 | 12.7%
Number Forms | 1 | 0.1%
None | 1 | 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 44 | 51.2%
1 | 9 | 10.5%
2 | 7 | 8.1%
0 | 4 | 4.7%
3 | 3 | 3.5%
8 | 3 | 3.5%
) | 2 | 2.3%
( | 2 | 2.3%
9 | 2 | 2.3%
6 | 2 | 2.3%
Other values (7) | 8 | 9.3%

Hangul
Value | Count | Frequency (%)
 | 40 | 6.8%
 | 40 | 6.8%
 | 32 | 5.4%
 | 29 | 4.9%
 | 20 | 3.4%
 | 14 | 2.4%
 | 14 | 2.4%
 | 13 | 2.2%
 | 13 | 2.2%
 | 11 | 1.9%
Other values (172) | 362 | 61.6%

Number Forms
Value | Count | Frequency (%)
 | 1 | 100.0%

None
Value | Count | Frequency (%)
 | 1 | 100.0%

도로명전체주소
Text

Distinct: 101
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 17
Median length: 15
Mean length: 12.855769
Min length: 7

Characters and Unicode

Total characters: 1337
Distinct characters: 154
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 98
Unique (%): 94.2%

Sample

1st row: 남산면 풀무골1길 22-1
2nd row: 남산면 서천길 49
3rd row: 서면 박사로 1214
4th row: 신동면 삼포길 155
5th row: 서면 월송길 381-17

Value | Count | Frequency (%)
서면 | 23 | 7.5%
봉평면 | 5 | 1.6%
토성면 | 5 | 1.6%
굴업솔골길 | 5 | 1.6%
설밀길 | 5 | 1.6%
현북면 | 4 | 1.3%
북방면 | 4 | 1.3%
손양면 | 3 | 1.0%
화촌면 | 3 | 1.0%
팔봉강변길 | 3 | 1.0%
Other values (213) | 247 | 80.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 204 | 15.3%
1 | 93 | 7.0%
 | 85 | 6.4%
 | 72 | 5.4%
2 | 61 | 4.6%
- | 57 | 4.3%
9 | 40 | 3.0%
 | 40 | 3.0%
3 | 39 | 2.9%
5 | 39 | 2.9%
Other values (144) | 607 | 45.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 676 | 50.6%
Decimal Number | 400 | 29.9%
Space Separator | 204 | 15.3%
Dash Punctuation | 57 | 4.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 85 | 12.6%
 | 72 | 10.7%
 | 40 | 5.9%
 | 31 | 4.6%
 | 14 | 2.1%
 | 12 | 1.8%
 | 12 | 1.8%
 | 11 | 1.6%
 | 11 | 1.6%
 | 11 | 1.6%
Other values (132) | 377 | 55.8%

Decimal Number
Value | Count | Frequency (%)
1 | 93 | 23.2%
2 | 61 | 15.2%
9 | 40 | 10.0%
3 | 39 | 9.8%
5 | 39 | 9.8%
7 | 31 | 7.8%
6 | 30 | 7.5%
0 | 27 | 6.8%
4 | 26 | 6.5%
8 | 14 | 3.5%

Space Separator
Value | Count | Frequency (%)
(space) | 204 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 57 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 676 | 50.6%
Common | 661 | 49.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 85 | 12.6%
 | 72 | 10.7%
 | 40 | 5.9%
 | 31 | 4.6%
 | 14 | 2.1%
 | 12 | 1.8%
 | 12 | 1.8%
 | 11 | 1.6%
 | 11 | 1.6%
 | 11 | 1.6%
Other values (132) | 377 | 55.8%

Common
Value | Count | Frequency (%)
(space) | 204 | 30.9%
1 | 93 | 14.1%
2 | 61 | 9.2%
- | 57 | 8.6%
9 | 40 | 6.1%
3 | 39 | 5.9%
5 | 39 | 5.9%
7 | 31 | 4.7%
6 | 30 | 4.5%
0 | 27 | 4.1%
Other values (2) | 40 | 6.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 676 | 50.6%
ASCII | 661 | 49.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 204 | 30.9%
1 | 93 | 14.1%
2 | 61 | 9.2%
- | 57 | 8.6%
9 | 40 | 6.1%
3 | 39 | 5.9%
5 | 39 | 5.9%
7 | 31 | 4.7%
6 | 30 | 4.5%
0 | 27 | 4.1%
Other values (2) | 40 | 6.1%

Hangul
Value | Count | Frequency (%)
 | 85 | 12.6%
 | 72 | 10.7%
 | 40 | 5.9%
 | 31 | 4.6%
 | 14 | 2.1%
 | 12 | 1.8%
 | 12 | 1.8%
 | 11 | 1.6%
 | 11 | 1.6%
 | 11 | 1.6%
Other values (132) | 377 | 55.8%

문화체육업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Value | Count
관광펜션업 | 104

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 관광펜션업
2nd row: 관광펜션업
3rd row: 관광펜션업
4th row: 관광펜션업
5th row: 관광펜션업

Common Values

Value | Count | Frequency (%)
관광펜션업 | 104 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 시군구명 | 사업장명 | 도로명전체주소 | 문화체육업종명
0 | 춘천시 | 아이플러스 펜션 | 남산면 풀무골1길 22-1 | 관광펜션업
1 | 춘천시 | 2020스파엔풀빌라 | 남산면 서천길 49 | 관광펜션업
2 | 춘천시 | 리버스토리 1210 | 서면 박사로 1214 | 관광펜션업
3 | 춘천시 | 썸원스페이지 | 신동면 삼포길 155 | 관광펜션업
4 | 춘천시 | 포레스트문 | 서면 월송길 381-17 | 관광펜션업
5 | 춘천시 | 산뜰 | 신북읍 영서로 3337 | 관광펜션업
6 | 원주시 | 자작나무숲펜션 | 지정면 구재로 40-13 | 관광펜션업
7 | 원주시 | 또아리하우스 | 판부면 금대리 1298 | 관광펜션업
8 | 원주시 | 지니의 캠프 | 지정면 장지길 69-22 | 관광펜션업
9 | 원주시 | 젤코바480 | 신림면 치악로 480-11 | 관광펜션업

Index | 시군구명 | 사업장명 | 도로명전체주소 | 문화체육업종명
94 | 양양군 | 지중해 풀빌라 | 서면 남대천로 917-12 | 관광펜션업
95 | 양양군 | 리틀포레스트 | 강현면 복골길 126 | 관광펜션업
96 | 양양군 | 몬띠마르 | 강현면 진미로 510-36 | 관광펜션업
97 | 양양군 | 스테이비욘드 | 양양읍 거마천로 350-57 | 관광펜션업
98 | 양양군 | 요트랑 | 손양면 문화마을길 6 | 관광펜션업
99 | 양양군 | 아름다운 펜션 | 현남면 북죽로 233 | 관광펜션업
100 | 양양군 | 트리플펜션&글램핑 | 손양면 선사유적로 316-54 | 관광펜션업
101 | 양양군 | 탄비치 | 현북면 하조대해안길 77 | 관광펜션업
102 | 양양군 | 발렌타인 | 손양면 상왕도리 742 | 관광펜션업
103 | 양양군 | 스테이다정 | 현남면 임호정리 70 | 관광펜션업