Overview

Dataset statistics

Number of variables4
Number of observations59
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory34.2 B

Variable types

Categorical2
Text2

Dataset

Description강원특별자치 한옥(전통가옥)체험업(숙박체험)에 대한 자료를 제공합니다 - 제공데이터 : 시군구명, 사업장명, 도로명전체주소, 문화체육업종명
URLhttps://www.data.go.kr/data/3045499/fileData.do

Alerts

문화체육업종명 has constant value ""Constant
사업장명 has unique valuesUnique
도로명전체주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:34:15.314906
Analysis finished2023-12-12 08:34:15.811266
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구명
Categorical

Distinct14
Distinct (%)23.7%
Missing0
Missing (%)0.0%
Memory size604.0 B
강릉시
24 
고성군
홍천군
춘천시
동해시
Other values (9)
13 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique5 ?
Unique (%)8.5%

Sample

1st row춘천시
2nd row춘천시
3rd row춘천시
4th row춘천시
5th row강릉시

Common Values

ValueCountFrequency (%)
강릉시 24
40.7%
고성군 9
 
15.3%
홍천군 5
 
8.5%
춘천시 4
 
6.8%
동해시 4
 
6.8%
영월군 2
 
3.4%
화천군 2
 
3.4%
인제군 2
 
3.4%
양양군 2
 
3.4%
태백시 1
 
1.7%
Other values (4) 4
 
6.8%

Length

2023-12-12T17:34:15.885636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강릉시 24
40.7%
고성군 9
 
15.3%
홍천군 5
 
8.5%
춘천시 4
 
6.8%
동해시 4
 
6.8%
영월군 2
 
3.4%
화천군 2
 
3.4%
인제군 2
 
3.4%
양양군 2
 
3.4%
태백시 1
 
1.7%
Other values (4) 4
 
6.8%

사업장명
Text

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size604.0 B
2023-12-12T17:34:16.166402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length5.7288136
Min length2

Characters and Unicode

Total characters338
Distinct characters147
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row춘천고택
2nd row스테이 한량
3rd row스테이 그믐
4th row스테이 자하
5th row강릉선교장
ValueCountFrequency (%)
스테이 3
 
3.7%
고향의 2
 
2.5%
춘천고택 1
 
1.2%
우구정 1
 
1.2%
가옥(조견당 1
 
1.2%
김종길 1
 
1.2%
1
 
1.2%
온양의 1
 
1.2%
1
 
1.2%
가을 1
 
1.2%
Other values (68) 68
84.0%
2023-12-12T17:34:16.600240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
6.5%
14
 
4.1%
12
 
3.6%
12
 
3.6%
12
 
3.6%
9
 
2.7%
7
 
2.1%
6
 
1.8%
6
 
1.8%
( 5
 
1.5%
Other values (137) 233
68.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 279
82.5%
Space Separator 22
 
6.5%
Lowercase Letter 20
 
5.9%
Open Punctuation 5
 
1.5%
Close Punctuation 5
 
1.5%
Other Punctuation 3
 
0.9%
Uppercase Letter 3
 
0.9%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
5.0%
12
 
4.3%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (119) 191
68.5%
Lowercase Letter
ValueCountFrequency (%)
e 4
20.0%
n 3
15.0%
a 3
15.0%
i 2
10.0%
d 2
10.0%
s 2
10.0%
r 2
10.0%
o 1
 
5.0%
v 1
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
F 1
33.3%
P 1
33.3%
C 1
33.3%
Other Punctuation
ValueCountFrequency (%)
? 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
22
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 279
82.5%
Common 36
 
10.7%
Latin 23
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
5.0%
12
 
4.3%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (119) 191
68.5%
Latin
ValueCountFrequency (%)
e 4
17.4%
n 3
13.0%
a 3
13.0%
i 2
8.7%
d 2
8.7%
s 2
8.7%
r 2
8.7%
F 1
 
4.3%
P 1
 
4.3%
o 1
 
4.3%
Other values (2) 2
8.7%
Common
ValueCountFrequency (%)
22
61.1%
( 5
 
13.9%
) 5
 
13.9%
? 2
 
5.6%
& 1
 
2.8%
- 1
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 279
82.5%
ASCII 59
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22
37.3%
( 5
 
8.5%
) 5
 
8.5%
e 4
 
6.8%
n 3
 
5.1%
a 3
 
5.1%
i 2
 
3.4%
d 2
 
3.4%
s 2
 
3.4%
? 2
 
3.4%
Other values (8) 9
15.3%
Hangul
ValueCountFrequency (%)
14
 
5.0%
12
 
4.3%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
Other values (119) 191
68.5%
Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size604.0 B
2023-12-12T17:34:16.940731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length11.576271
Min length6

Characters and Unicode

Total characters683
Distinct characters108
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row신동면 솟발1길 44
2nd row중앙로67번길 42-1
3rd row법원뒷길 35-7
4th row소양로211번길 16-1
5th row운정길 63
ValueCountFrequency (%)
죽왕면 9
 
6.1%
왕곡마을길 9
 
6.1%
서면 5
 
3.4%
한치골길 4
 
2.7%
10 2
 
1.4%
천곡1길 2
 
1.4%
초당원길 2
 
1.4%
11 2
 
1.4%
986 1
 
0.7%
984 1
 
0.7%
Other values (111) 111
75.0%
2023-12-12T17:34:17.322988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
90
 
13.2%
54
 
7.9%
1 53
 
7.8%
2 33
 
4.8%
- 28
 
4.1%
3 27
 
4.0%
25
 
3.7%
21
 
3.1%
4 20
 
2.9%
18
 
2.6%
Other values (98) 314
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 342
50.1%
Decimal Number 219
32.1%
Space Separator 90
 
13.2%
Dash Punctuation 28
 
4.1%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
15.8%
25
 
7.3%
21
 
6.1%
18
 
5.3%
16
 
4.7%
14
 
4.1%
10
 
2.9%
9
 
2.6%
9
 
2.6%
8
 
2.3%
Other values (84) 158
46.2%
Decimal Number
ValueCountFrequency (%)
1 53
24.2%
2 33
15.1%
3 27
12.3%
4 20
 
9.1%
0 17
 
7.8%
7 17
 
7.8%
6 15
 
6.8%
8 14
 
6.4%
5 13
 
5.9%
9 10
 
4.6%
Space Separator
ValueCountFrequency (%)
90
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 342
50.1%
Common 341
49.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
15.8%
25
 
7.3%
21
 
6.1%
18
 
5.3%
16
 
4.7%
14
 
4.1%
10
 
2.9%
9
 
2.6%
9
 
2.6%
8
 
2.3%
Other values (84) 158
46.2%
Common
ValueCountFrequency (%)
90
26.4%
1 53
15.5%
2 33
 
9.7%
- 28
 
8.2%
3 27
 
7.9%
4 20
 
5.9%
0 17
 
5.0%
7 17
 
5.0%
6 15
 
4.4%
8 14
 
4.1%
Other values (4) 27
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 342
50.1%
ASCII 341
49.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
90
26.4%
1 53
15.5%
2 33
 
9.7%
- 28
 
8.2%
3 27
 
7.9%
4 20
 
5.9%
0 17
 
5.0%
7 17
 
5.0%
6 15
 
4.4%
8 14
 
4.1%
Other values (4) 27
 
7.9%
Hangul
ValueCountFrequency (%)
54
 
15.8%
25
 
7.3%
21
 
6.1%
18
 
5.3%
16
 
4.7%
14
 
4.1%
10
 
2.9%
9
 
2.6%
9
 
2.6%
8
 
2.3%
Other values (84) 158
46.2%

문화체육업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size604.0 B
한옥체험업
59 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한옥체험업
2nd row한옥체험업
3rd row한옥체험업
4th row한옥체험업
5th row한옥체험업

Common Values

ValueCountFrequency (%)
한옥체험업 59
100.0%

Length

2023-12-12T17:34:17.447551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:34:17.553504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한옥체험업 59
100.0%

Correlations

2023-12-12T17:34:17.627996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명사업장명도로명전체주소
시군구명1.0001.0001.000
사업장명1.0001.0001.000
도로명전체주소1.0001.0001.000

Missing values

2023-12-12T17:34:15.659401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:34:15.767816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구명사업장명도로명전체주소문화체육업종명
0춘천시춘천고택신동면 솟발1길 44한옥체험업
1춘천시스테이 한량중앙로67번길 42-1한옥체험업
2춘천시스테이 그믐법원뒷길 35-7한옥체험업
3춘천시스테이 자하소양로211번길 16-1한옥체험업
4강릉시강릉선교장운정길 63한옥체험업
5강릉시경복궁펜션주문진읍 주문진서당길 33한옥체험업
6강릉시석가헌모산로 70번길 30한옥체험업
7강릉시고전한옥난곡길 7-8한옥체험업
8강릉시Pine & Friends(우송재)팔송길15번길 10한옥체험업
9강릉시관광펜션 휴심저동골길 21한옥체험업
시군구명사업장명도로명전체주소문화체육업종명
49고성군큰상나말집죽왕면 왕곡마을길 42-5한옥체험업
50고성군한고개집죽왕면 왕곡마을길 13-7한옥체험업
51고성군큰백촌집죽왕면 왕곡마을길 48-10한옥체험업
52고성군진부집죽왕면 왕곡마을길 47-3한옥체험업
53고성군여섯째집죽왕면 왕곡마을길 41한옥체험업
54고성군성천집죽왕면 왕곡마을길 42-3한옥체험업
55고성군갈벌집죽왕면 왕곡마을길 13-4한옥체험업
56고성군장은상나말집죽왕면 왕곡마을길 38한옥체험업
57양양군어성전현북면 남대천로 1670-61한옥체험업
58양양군양양가옥서면 설악로 1389한옥체험업