Overview

Dataset statistics

Number of variables6
Number of observations24
Missing cells9
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory54.5 B

Variable types

Categorical2
Text3
Numeric1

Dataset

Description대구광역시_한옥 체험업체 운영현황_20191105
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15054191&dataSetDetailId=15054191180f2c0b9e639&provdMethod=FILE

Alerts

시도 has constant value ""Constant
전화 has 9 (37.5%) missing valuesMissing
가옥명 has unique valuesUnique

Reproduction

Analysis started2024-04-17 03:34:41.072991
Analysis finished2024-04-17 03:34:41.480540
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
대구광역시
24 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시
2nd row대구광역시
3rd row대구광역시
4th row대구광역시
5th row대구광역시

Common Values

ValueCountFrequency (%)
대구광역시 24
100.0%

Length

2024-04-17T12:34:41.525686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T12:34:41.590443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 24
100.0%

시군구
Categorical

Distinct6
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size324.0 B
중구
11 
동구
달성군
달서구
북구
 
1

Length

Max length3
Median length2
Mean length2.2916667
Min length2

Unique

Unique2 ?
Unique (%)8.3%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
중구 11
45.8%
동구 5
20.8%
달성군 4
 
16.7%
달서구 2
 
8.3%
북구 1
 
4.2%
수성구 1
 
4.2%

Length

2024-04-17T12:34:41.662757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T12:34:41.756103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중구 11
45.8%
동구 5
20.8%
달성군 4
 
16.7%
달서구 2
 
8.3%
북구 1
 
4.2%
수성구 1
 
4.2%

가옥명
Text

UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size324.0 B
2024-04-17T12:34:41.921225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length6.3333333
Min length2

Characters and Unicode

Total characters152
Distinct characters78
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row 옛 구암서원
2nd row아제
3rd row공감한옥게스트하우스
4th row더한옥&스파
5th row애가
ValueCountFrequency (%)
애가 2
 
6.5%
전통한옥 2
 
6.5%
병암서원 2
 
6.5%
1
 
3.2%
금전고택 1
 
3.2%
한훤당 1
 
3.2%
니암고택 1
 
3.2%
대니골 1
 
3.2%
육신사 1
 
3.2%
묘골 1
 
3.2%
Other values (18) 18
58.1%
2024-04-17T12:34:42.183040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
5.3%
8
 
5.3%
7
 
4.6%
6
 
3.9%
6
 
3.9%
5
 
3.3%
5
 
3.3%
5
 
3.3%
4
 
2.6%
4
 
2.6%
Other values (68) 94
61.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 131
86.2%
Space Separator 8
 
5.3%
Decimal Number 5
 
3.3%
Open Punctuation 3
 
2.0%
Close Punctuation 3
 
2.0%
Other Symbol 1
 
0.7%
Other Punctuation 1
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
6.1%
7
 
5.3%
6
 
4.6%
6
 
4.6%
5
 
3.8%
5
 
3.8%
5
 
3.8%
4
 
3.1%
4
 
3.1%
4
 
3.1%
Other values (58) 77
58.8%
Decimal Number
ValueCountFrequency (%)
2 1
20.0%
1 1
20.0%
9 1
20.0%
5 1
20.0%
7 1
20.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132
86.8%
Common 20
 
13.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
6.1%
7
 
5.3%
6
 
4.5%
6
 
4.5%
5
 
3.8%
5
 
3.8%
5
 
3.8%
4
 
3.0%
4
 
3.0%
4
 
3.0%
Other values (59) 78
59.1%
Common
ValueCountFrequency (%)
8
40.0%
( 3
 
15.0%
) 3
 
15.0%
& 1
 
5.0%
2 1
 
5.0%
1 1
 
5.0%
9 1
 
5.0%
5 1
 
5.0%
7 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 131
86.2%
ASCII 20
 
13.2%
None 1
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8
40.0%
( 3
 
15.0%
) 3
 
15.0%
& 1
 
5.0%
2 1
 
5.0%
1 1
 
5.0%
9 1
 
5.0%
5 1
 
5.0%
7 1
 
5.0%
Hangul
ValueCountFrequency (%)
8
 
6.1%
7
 
5.3%
6
 
4.6%
6
 
4.6%
5
 
3.8%
5
 
3.8%
5
 
3.8%
4
 
3.1%
4
 
3.1%
4
 
3.1%
Other values (58) 77
58.8%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct23
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size324.0 B
2024-04-17T12:34:42.363640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length23.208333
Min length19

Characters and Unicode

Total characters557
Distinct characters77
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)91.7%

Sample

1st row대구광역시 중구 국채보상로 492-58 (동산동)
2nd row대구광역시 중구 경상감영길 43-9 (서내동)
3rd row대구광역시 중구 진골목길 9 (종로2가)
4th row대구광역시 중구 서성로16길 46-5 (북내동)
5th row대구광역시 중구 서성로 16길 46-4 (북내동)
ValueCountFrequency (%)
대구광역시 20
 
18.3%
중구 11
 
10.1%
동구 5
 
4.6%
옻골로 4
 
3.7%
달성군 4
 
3.7%
대구시 4
 
3.7%
북내동 3
 
2.8%
육신사길 2
 
1.8%
하빈면 2
 
1.8%
새방로 2
 
1.8%
Other values (48) 52
47.7%
2024-04-17T12:34:42.628822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
 
15.3%
46
 
8.3%
27
 
4.8%
26
 
4.7%
24
 
4.3%
20
 
3.6%
20
 
3.6%
20
 
3.6%
) 20
 
3.6%
( 20
 
3.6%
Other values (67) 249
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 333
59.8%
Decimal Number 87
 
15.6%
Space Separator 85
 
15.3%
Close Punctuation 20
 
3.6%
Open Punctuation 20
 
3.6%
Dash Punctuation 12
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
13.8%
27
 
8.1%
26
 
7.8%
24
 
7.2%
20
 
6.0%
20
 
6.0%
20
 
6.0%
14
 
4.2%
11
 
3.3%
8
 
2.4%
Other values (53) 117
35.1%
Decimal Number
ValueCountFrequency (%)
1 18
20.7%
2 14
16.1%
4 13
14.9%
5 10
11.5%
6 10
11.5%
9 7
 
8.0%
8 5
 
5.7%
3 4
 
4.6%
0 4
 
4.6%
7 2
 
2.3%
Space Separator
ValueCountFrequency (%)
85
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 333
59.8%
Common 224
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
13.8%
27
 
8.1%
26
 
7.8%
24
 
7.2%
20
 
6.0%
20
 
6.0%
20
 
6.0%
14
 
4.2%
11
 
3.3%
8
 
2.4%
Other values (53) 117
35.1%
Common
ValueCountFrequency (%)
85
37.9%
) 20
 
8.9%
( 20
 
8.9%
1 18
 
8.0%
2 14
 
6.2%
4 13
 
5.8%
- 12
 
5.4%
5 10
 
4.5%
6 10
 
4.5%
9 7
 
3.1%
Other values (4) 15
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 333
59.8%
ASCII 224
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
85
37.9%
) 20
 
8.9%
( 20
 
8.9%
1 18
 
8.0%
2 14
 
6.2%
4 13
 
5.8%
- 12
 
5.4%
5 10
 
4.5%
6 10
 
4.5%
9 7
 
3.1%
Other values (4) 15
 
6.7%
Hangul
ValueCountFrequency (%)
46
 
13.8%
27
 
8.1%
26
 
7.8%
24
 
7.2%
20
 
6.0%
20
 
6.0%
20
 
6.0%
14
 
4.2%
11
 
3.3%
8
 
2.4%
Other values (53) 117
35.1%

전화
Text

MISSING 

Distinct11
Distinct (%)73.3%
Missing9
Missing (%)37.5%
Memory size324.0 B
2024-04-17T12:34:42.773209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)53.3%

Sample

1st row053-428-9900
2nd row053-252-6336
3rd row053-255-7173
4th row053-424-2237
5th row053-428-9901
ValueCountFrequency (%)
053-983-1040 3
20.0%
053-428-9900 2
13.3%
053-586-4672 2
13.3%
053-252-6336 1
 
6.7%
053-255-7173 1
 
6.7%
053-424-2237 1
 
6.7%
053-428-9901 1
 
6.7%
053-959-7200 1
 
6.7%
053-638-6171 1
 
6.7%
053-611-9910 1
 
6.7%
2024-04-17T12:34:42.998292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 30
16.7%
0 29
16.1%
3 23
12.8%
5 21
11.7%
9 14
7.8%
1 14
7.8%
2 12
 
6.7%
8 10
 
5.6%
4 10
 
5.6%
6 10
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 150
83.3%
Dash Punctuation 30
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 29
19.3%
3 23
15.3%
5 21
14.0%
9 14
9.3%
1 14
9.3%
2 12
8.0%
8 10
 
6.7%
4 10
 
6.7%
6 10
 
6.7%
7 7
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 30
16.7%
0 29
16.1%
3 23
12.8%
5 21
11.7%
9 14
7.8%
1 14
7.8%
2 12
 
6.7%
8 10
 
5.6%
4 10
 
5.6%
6 10
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 30
16.7%
0 29
16.1%
3 23
12.8%
5 21
11.7%
9 14
7.8%
1 14
7.8%
2 12
 
6.7%
8 10
 
5.6%
4 10
 
5.6%
6 10
 
5.6%

객실수
Real number (ℝ)

Distinct7
Distinct (%)29.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.9583333
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2024-04-17T12:34:43.086437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q12.75
median4
Q35
95-th percentile6.85
Maximum7
Range6
Interquartile range (IQR)2.25

Descriptive statistics

Standard deviation1.6805581
Coefficient of variation (CV)0.42456203
Kurtosis-0.8405664
Mean3.9583333
Median Absolute Deviation (MAD)1
Skewness0.13112472
Sum95
Variance2.8242754
MonotonicityNot monotonic
2024-04-17T12:34:43.174106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
5 6
25.0%
2 5
20.8%
4 4
16.7%
3 4
16.7%
6 2
 
8.3%
7 2
 
8.3%
1 1
 
4.2%
ValueCountFrequency (%)
1 1
 
4.2%
2 5
20.8%
3 4
16.7%
4 4
16.7%
5 6
25.0%
6 2
 
8.3%
7 2
 
8.3%
ValueCountFrequency (%)
7 2
 
8.3%
6 2
 
8.3%
5 6
25.0%
4 4
16.7%
3 4
16.7%
2 5
20.8%
1 1
 
4.2%

Interactions

2024-04-17T12:34:41.270285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T12:34:43.238154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구가옥명주소전화객실수
시군구1.0001.0001.0000.9280.000
가옥명1.0001.0001.0001.0001.000
주소1.0001.0001.0000.9051.000
전화0.9281.0000.9051.0000.898
객실수0.0001.0001.0000.8981.000
2024-04-17T12:34:43.312125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
객실수시군구
객실수1.0000.000
시군구0.0001.000

Missing values

2024-04-17T12:34:41.360966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T12:34:41.444540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도시군구가옥명주소전화객실수
0대구광역시중구옛 구암서원대구광역시 중구 국채보상로 492-58 (동산동)053-428-99006
1대구광역시중구아제대구광역시 중구 경상감영길 43-9 (서내동)<NA>5
2대구광역시중구공감한옥게스트하우스대구광역시 중구 진골목길 9 (종로2가)<NA>4
3대구광역시중구더한옥&스파대구광역시 중구 서성로16길 46-5 (북내동)<NA>7
4대구광역시중구애가대구광역시 중구 서성로 16길 46-4 (북내동)<NA>4
5대구광역시중구잔치대구광역시 중구 종로 45-28 (종로1가)053-252-63361
6대구광역시중구라온대구광역시 중구 동덕로 16-3 (대봉동)<NA>2
7대구광역시중구애가 2대구광역시 중구 서성로16길 46-4 (북내동)<NA>3
8대구광역시중구서문한옥게스트하우스대구광역시 중구 큰장로24길 26(대신동)<NA>7
9대구광역시중구㈜소스대구광역시 중구 명덕로321-49(대봉동)053-255-71734
시도시군구가옥명주소전화객실수
14대구광역시동구금전고택대구광역시 동구 옻골로 195-2(둔산동)053-983-10402
15대구광역시동구춘제대구광역시 동구 옻골로 198(둔산동)053-983-10402
16대구광역시북구(사) 영남선비문화수련원(구암서원)대구광역시 북구 연암공원로17길 20(산격동)053-959-72003
17대구광역시달서구대구전통문화센터 병암서원대구광역시 달서구 새방로 21(용산동)053-428-99005
18대구광역시달서구병암서원대구광역시 달서구 새방로 21(용산동)053-638-61715
19대구광역시달성군묘골 전통한옥대구시 달성군 하빈면 육신사길 55053-586-46724
20대구광역시달성군육신사 전통한옥대구시 달성군 하빈면 육신사길 64053-586-46725
21대구광역시달성군대니골 니암고택대구시 달성군 구지면 구지서로60길 41-5053-611-99105
22대구광역시달성군한훤당대구시 달성군 현풍면 지동1길 43053-611-11983
23대구광역시수성구월드컵장미한옥대구광역시 수성구 월드컵로5안길 22(삼덕동)<NA>2