Overview

Dataset statistics

Number of variables5
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory46.0 B

Variable types

Numeric2
Categorical1
Text2

Dataset

Description대구시 구군에 등록되어있는 관광객 이용시설업 중 한옥체험업 현황 자료입니다.(구·군, 상호(업체명), 소재지, 객실수)
Author대구광역시
URLhttps://www.data.go.kr/data/15035952/fileData.do

Alerts

연번 is highly overall correlated with 구군High correlation
구군 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업체명 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:05:44.780453
Analysis finished2024-04-21 01:05:46.840573
Duration2.06 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2024-04-21T10:05:46.895542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.6
Q19
median17
Q325
95-th percentile31.4
Maximum33
Range32
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.6695398
Coefficient of variation (CV)0.56879646
Kurtosis-1.2
Mean17
Median Absolute Deviation (MAD)8
Skewness0
Sum561
Variance93.5
MonotonicityStrictly increasing
2024-04-21T10:05:47.014714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1
 
3.0%
26 1
 
3.0%
20 1
 
3.0%
21 1
 
3.0%
22 1
 
3.0%
23 1
 
3.0%
24 1
 
3.0%
25 1
 
3.0%
27 1
 
3.0%
2 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1 1
3.0%
2 1
3.0%
3 1
3.0%
4 1
3.0%
5 1
3.0%
6 1
3.0%
7 1
3.0%
8 1
3.0%
9 1
3.0%
10 1
3.0%
ValueCountFrequency (%)
33 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
29 1
3.0%
28 1
3.0%
27 1
3.0%
26 1
3.0%
25 1
3.0%
24 1
3.0%

구군
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Memory size396.0 B
중구
13 
동구
달성군
남구
북구
 
1
Other values (3)

Length

Max length3
Median length2
Mean length2.2121212
Min length2

Unique

Unique4 ?
Unique (%)12.1%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
중구 13
39.4%
동구 9
27.3%
달성군 4
 
12.1%
남구 3
 
9.1%
북구 1
 
3.0%
수성구 1
 
3.0%
달서구 1
 
3.0%
군위군 1
 
3.0%

Length

2024-04-21T10:05:47.119785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:05:47.225182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중구 13
39.4%
동구 9
27.3%
달성군 4
 
12.1%
남구 3
 
9.1%
북구 1
 
3.0%
수성구 1
 
3.0%
달서구 1
 
3.0%
군위군 1
 
3.0%

업체명
Text

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2024-04-21T10:05:47.398248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length6.2727273
Min length2

Characters and Unicode

Total characters207
Distinct characters98
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)100.0%

Sample

1st row서문한옥게스트하우스
2nd row㈜소스
3rd row한옥1957
4th row한옥게스트하우스 '安(Ahn)'
5th row퍼센트17-7
ValueCountFrequency (%)
스테이 3
 
7.1%
대구전통문화센터 2
 
4.8%
전통한옥 2
 
4.8%
서문한옥게스트하우스 1
 
2.4%
사)영남선비문화수련원(구암서원 1
 
2.4%
동계정 1
 
2.4%
수구당 1
 
2.4%
소화 1
 
2.4%
대구펜션 1
 
2.4%
원향 1
 
2.4%
Other values (28) 28
66.7%
2024-04-21T10:05:47.714306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
4.8%
9
 
4.3%
9
 
4.3%
8
 
3.9%
6
 
2.9%
6
 
2.9%
6
 
2.9%
6
 
2.9%
6
 
2.9%
5
 
2.4%
Other values (88) 136
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 170
82.1%
Decimal Number 11
 
5.3%
Space Separator 9
 
4.3%
Lowercase Letter 5
 
2.4%
Close Punctuation 3
 
1.4%
Open Punctuation 3
 
1.4%
Other Punctuation 2
 
1.0%
Uppercase Letter 2
 
1.0%
Dash Punctuation 1
 
0.5%
Other Symbol 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
5.9%
9
 
5.3%
8
 
4.7%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
Other values (69) 103
60.6%
Decimal Number
ValueCountFrequency (%)
1 3
27.3%
7 3
27.3%
9 2
18.2%
5 1
 
9.1%
4 1
 
9.1%
2 1
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
y 1
20.0%
a 1
20.0%
t 1
20.0%
n 1
20.0%
h 1
20.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
A 1
50.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
' 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 170
82.1%
Common 29
 
14.0%
Latin 7
 
3.4%
Han 1
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
5.9%
9
 
5.3%
8
 
4.7%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
Other values (69) 103
60.6%
Common
ValueCountFrequency (%)
9
31.0%
) 3
 
10.3%
1 3
 
10.3%
( 3
 
10.3%
7 3
 
10.3%
9 2
 
6.9%
' 2
 
6.9%
- 1
 
3.4%
5 1
 
3.4%
4 1
 
3.4%
Latin
ValueCountFrequency (%)
y 1
14.3%
a 1
14.3%
t 1
14.3%
S 1
14.3%
n 1
14.3%
h 1
14.3%
A 1
14.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 169
81.6%
ASCII 36
 
17.4%
CJK 1
 
0.5%
None 1
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
5.9%
9
 
5.3%
8
 
4.7%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
5
 
3.0%
5
 
3.0%
Other values (68) 102
60.4%
ASCII
ValueCountFrequency (%)
9
25.0%
) 3
 
8.3%
1 3
 
8.3%
( 3
 
8.3%
7 3
 
8.3%
9 2
 
5.6%
' 2
 
5.6%
- 1
 
2.8%
y 1
 
2.8%
a 1
 
2.8%
Other values (8) 8
22.2%
CJK
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%

소재지
Text

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2024-04-21T10:05:47.921989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length29
Mean length24.121212
Min length21

Characters and Unicode

Total characters796
Distinct characters82
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)100.0%

Sample

1st row대구광역시 중구 큰장로24길 26(대신동)
2nd row대구광역시 중구 명덕로321-49(대봉동)
3rd row대구광역시 중구 국채보상로101길 20-2 (인교동)
4th row대구광역시 중구 국채보상로149길 98 (동인동3가)
5th row대구광역시 중구 달구벌대로446길 28-8 (대봉동)
ValueCountFrequency (%)
대구광역시 33
22.9%
중구 13
 
9.0%
동구 9
 
6.2%
옻골로 7
 
4.9%
달성군 4
 
2.8%
대봉동 3
 
2.1%
남구 3
 
2.1%
인교동 2
 
1.4%
육신사길 2
 
1.4%
서성로13길 2
 
1.4%
Other values (63) 66
45.8%
2024-04-21T10:05:48.230521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
13.9%
66
 
8.3%
45
 
5.7%
39
 
4.9%
34
 
4.3%
33
 
4.1%
33
 
4.1%
1 29
 
3.6%
( 28
 
3.5%
) 28
 
3.5%
Other values (72) 350
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 476
59.8%
Decimal Number 136
 
17.1%
Space Separator 111
 
13.9%
Open Punctuation 28
 
3.5%
Close Punctuation 28
 
3.5%
Dash Punctuation 17
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
13.9%
45
 
9.5%
39
 
8.2%
34
 
7.1%
33
 
6.9%
33
 
6.9%
28
 
5.9%
21
 
4.4%
13
 
2.7%
12
 
2.5%
Other values (58) 152
31.9%
Decimal Number
ValueCountFrequency (%)
1 29
21.3%
2 22
16.2%
4 18
13.2%
3 14
10.3%
5 11
 
8.1%
9 10
 
7.4%
0 9
 
6.6%
8 9
 
6.6%
7 8
 
5.9%
6 6
 
4.4%
Space Separator
ValueCountFrequency (%)
111
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 476
59.8%
Common 320
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
13.9%
45
 
9.5%
39
 
8.2%
34
 
7.1%
33
 
6.9%
33
 
6.9%
28
 
5.9%
21
 
4.4%
13
 
2.7%
12
 
2.5%
Other values (58) 152
31.9%
Common
ValueCountFrequency (%)
111
34.7%
1 29
 
9.1%
( 28
 
8.8%
) 28
 
8.8%
2 22
 
6.9%
4 18
 
5.6%
- 17
 
5.3%
3 14
 
4.4%
5 11
 
3.4%
9 10
 
3.1%
Other values (4) 32
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 476
59.8%
ASCII 320
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111
34.7%
1 29
 
9.1%
( 28
 
8.8%
) 28
 
8.8%
2 22
 
6.9%
4 18
 
5.6%
- 17
 
5.3%
3 14
 
4.4%
5 11
 
3.4%
9 10
 
3.1%
Other values (4) 32
 
10.0%
Hangul
ValueCountFrequency (%)
66
13.9%
45
 
9.5%
39
 
8.2%
34
 
7.1%
33
 
6.9%
33
 
6.9%
28
 
5.9%
21
 
4.4%
13
 
2.7%
12
 
2.5%
Other values (58) 152
31.9%

객실수(실)
Real number (ℝ)

Distinct7
Distinct (%)21.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0606061
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2024-04-21T10:05:48.334355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile6.4
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.784296
Coefficient of variation (CV)0.58298779
Kurtosis-0.39743971
Mean3.0606061
Median Absolute Deviation (MAD)1
Skewness0.64105261
Sum101
Variance3.1837121
MonotonicityNot monotonic
2024-04-21T10:05:48.415679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
3 8
24.2%
1 8
24.2%
2 6
18.2%
5 5
15.2%
4 3
 
9.1%
7 2
 
6.1%
6 1
 
3.0%
ValueCountFrequency (%)
1 8
24.2%
2 6
18.2%
3 8
24.2%
4 3
 
9.1%
5 5
15.2%
6 1
 
3.0%
7 2
 
6.1%
ValueCountFrequency (%)
7 2
 
6.1%
6 1
 
3.0%
5 5
15.2%
4 3
 
9.1%
3 8
24.2%
2 6
18.2%
1 8
24.2%

Interactions

2024-04-21T10:05:46.524598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:05:46.315884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:05:46.601406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:05:46.440264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:05:48.480090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구군업체명소재지객실수(실)
연번1.0000.7911.0001.0000.200
구군0.7911.0001.0001.0000.426
업체명1.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.000
객실수(실)0.2000.4261.0001.0001.000
2024-04-21T10:05:48.564420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번객실수(실)구군
연번1.0000.0660.508
객실수(실)0.0661.0000.223
구군0.5080.2231.000

Missing values

2024-04-21T10:05:46.706313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:05:46.801224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구군업체명소재지객실수(실)
01중구서문한옥게스트하우스대구광역시 중구 큰장로24길 26(대신동)7
12중구㈜소스대구광역시 중구 명덕로321-49(대봉동)4
23중구한옥1957대구광역시 중구 국채보상로101길 20-2 (인교동)5
34중구한옥게스트하우스 '安(Ahn)'대구광역시 중구 국채보상로149길 98 (동인동3가)3
45중구퍼센트17-7대구광역시 중구 달구벌대로446길 28-8 (대봉동)4
56중구Stay지안대구광역시 중구 달구벌대로447길 72-3 (삼덕동3가)1
67중구애가대구광역시 중구 남산로2길 11-11 (남산동)5
78중구스테이 너와대구광역시 중구 동덕로 1-38 (대봉동)1
89중구한옥1942대구광역시 중구 서성로13길 57 (인교동)3
910중구모가대구광역시 중구 달구벌대로447길34-3 (삼덕동3가)2
연번구군업체명소재지객실수(실)
2324남구대구펜션 원향대구광역시 남구 이천로28길 27-4(이천동)1
2425남구스테이 모노대구광역시 남구 명덕시장길 106-4(대명동)1
2526북구(사)영남선비문화수련원(구암서원)대구광역시 북구 연암공원로17길 20(산격동)3
2627수성구월드컵장미한옥대구광역시 수성구 월드컵로5안길 22(삼덕동)2
2728달서구대구전통문화센터 병암서원대구광역시 달서구 새방로 21(용산동)5
2829달성군묘골 전통한옥대구광역시 달성군 하빈면 육신사길 554
2930달성군육신사 전통한옥대구광역시 달성군 하빈면 육신사길 645
3031달성군대니골 니암고택대구광역시 달성군 구지면 구지서로60길 41-55
3132달성군한훤당대구광역시 달성군 현풍면 지동1길 433
3233군위군군위남천고택대구광역시 군위군 부계면 한밤5길 197