Overview

Dataset statistics

Number of variables5
Number of observations594
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.3 KiB
Average record size in memory40.2 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description경상북도내 한옥체험업으로 등록된 업체에 관한 자료로 시군명, 한옥명, 주소, 대표자, 등록일자와 관련된 자료를 제공합니다.
Author경상북도
URLhttps://www.data.go.kr/data/3083301/fileData.do

Reproduction

Analysis started2024-03-14 22:56:16.505615
Analysis finished2024-03-14 22:56:17.900855
Duration1.4 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

Distinct20
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
경주시
256 
안동시
166 
영주시
31 
영덕군
 
24
예천군
 
18
Other values (15)
99 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row포항시
2nd row포항시
3rd row경주시
4th row경주시
5th row경주시

Common Values

ValueCountFrequency (%)
경주시 256
43.1%
안동시 166
27.9%
영주시 31
 
5.2%
영덕군 24
 
4.0%
예천군 18
 
3.0%
고령군 18
 
3.0%
청송군 16
 
2.7%
봉화군 12
 
2.0%
영양군 10
 
1.7%
성주군 8
 
1.3%
Other values (10) 35
 
5.9%

Length

2024-03-15T07:56:18.118413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경주시 256
43.1%
안동시 166
27.9%
영주시 31
 
5.2%
영덕군 24
 
4.0%
예천군 18
 
3.0%
고령군 18
 
3.0%
청송군 16
 
2.7%
봉화군 12
 
2.0%
영양군 10
 
1.7%
성주군 8
 
1.3%
Other values (10) 35
 
5.9%
Distinct582
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2024-03-15T07:56:19.240352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length20
Mean length5.6245791
Min length1

Characters and Unicode

Total characters3341
Distinct characters413
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique572 ?
Unique (%)96.3%

Sample

1st row새마을 인성교육관
2nd row포항전통문화체험관
3rd row남산댁
4th row락희원
5th row옥산정사 독락당
ValueCountFrequency (%)
경주 33
 
3.9%
한옥스테이 20
 
2.4%
한옥 19
 
2.3%
스테이 16
 
1.9%
신라 10
 
1.2%
펜션 9
 
1.1%
한옥호텔 8
 
1.0%
풀빌라 7
 
0.8%
한옥펜션 6
 
0.7%
6
 
0.7%
Other values (648) 706
84.0%
2024-03-15T07:56:20.905507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
246
 
7.4%
142
 
4.3%
137
 
4.1%
86
 
2.6%
74
 
2.2%
73
 
2.2%
67
 
2.0%
64
 
1.9%
61
 
1.8%
55
 
1.6%
Other values (403) 2336
69.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2940
88.0%
Space Separator 246
 
7.4%
Lowercase Letter 34
 
1.0%
Open Punctuation 31
 
0.9%
Close Punctuation 31
 
0.9%
Decimal Number 26
 
0.8%
Uppercase Letter 26
 
0.8%
Other Punctuation 5
 
0.1%
Other Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
4.8%
137
 
4.7%
86
 
2.9%
74
 
2.5%
73
 
2.5%
67
 
2.3%
64
 
2.2%
61
 
2.1%
55
 
1.9%
54
 
1.8%
Other values (357) 2127
72.3%
Lowercase Letter
ValueCountFrequency (%)
o 5
14.7%
a 4
11.8%
t 3
8.8%
r 3
8.8%
n 3
8.8%
e 3
8.8%
k 2
 
5.9%
h 2
 
5.9%
p 2
 
5.9%
x 1
 
2.9%
Other values (6) 6
17.6%
Uppercase Letter
ValueCountFrequency (%)
O 4
15.4%
C 3
11.5%
K 3
11.5%
A 3
11.5%
T 2
7.7%
L 2
7.7%
N 2
7.7%
D 1
 
3.8%
W 1
 
3.8%
H 1
 
3.8%
Other values (4) 4
15.4%
Decimal Number
ValueCountFrequency (%)
1 6
23.1%
2 5
19.2%
0 4
15.4%
9 3
11.5%
8 2
 
7.7%
6 2
 
7.7%
3 2
 
7.7%
4 1
 
3.8%
7 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 3
60.0%
; 1
 
20.0%
& 1
 
20.0%
Space Separator
ValueCountFrequency (%)
246
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2923
87.5%
Common 339
 
10.1%
Latin 60
 
1.8%
Han 19
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
4.9%
137
 
4.7%
86
 
2.9%
74
 
2.5%
73
 
2.5%
67
 
2.3%
64
 
2.2%
61
 
2.1%
55
 
1.9%
54
 
1.8%
Other values (339) 2110
72.2%
Latin
ValueCountFrequency (%)
o 5
 
8.3%
a 4
 
6.7%
O 4
 
6.7%
C 3
 
5.0%
K 3
 
5.0%
A 3
 
5.0%
t 3
 
5.0%
r 3
 
5.0%
n 3
 
5.0%
e 3
 
5.0%
Other values (20) 26
43.3%
Han
ValueCountFrequency (%)
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (9) 9
47.4%
Common
ValueCountFrequency (%)
246
72.6%
( 31
 
9.1%
) 31
 
9.1%
1 6
 
1.8%
2 5
 
1.5%
0 4
 
1.2%
9 3
 
0.9%
, 3
 
0.9%
8 2
 
0.6%
6 2
 
0.6%
Other values (5) 6
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2921
87.4%
ASCII 399
 
11.9%
CJK 16
 
0.5%
CJK Compat Ideographs 3
 
0.1%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
246
61.7%
( 31
 
7.8%
) 31
 
7.8%
1 6
 
1.5%
o 5
 
1.3%
2 5
 
1.3%
a 4
 
1.0%
0 4
 
1.0%
O 4
 
1.0%
9 3
 
0.8%
Other values (35) 60
 
15.0%
Hangul
ValueCountFrequency (%)
142
 
4.9%
137
 
4.7%
86
 
2.9%
74
 
2.5%
73
 
2.5%
67
 
2.3%
64
 
2.2%
61
 
2.1%
55
 
1.9%
54
 
1.8%
Other values (338) 2108
72.2%
None
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (6) 6
37.5%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

주소
Text

Distinct582
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2024-03-15T07:56:22.007926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length32
Mean length23.616162
Min length16

Characters and Unicode

Total characters14028
Distinct characters261
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique579 ?
Unique (%)97.5%

Sample

1st row경상북도 포항시 북구 기계면 새마을발상지길 130
2nd row경상북도 포항시 북구 기북면 덕동문화길 7
3rd row경상북도 경주시 강동면 양동마을길 147-6
4th row경상북도 경주시 포석로1050번길 43 (황남동)
5th row경상북도 경주시 안강읍 옥산서원길 300-3
ValueCountFrequency (%)
경상북도 592
 
19.7%
경주시 256
 
8.5%
안동시 166
 
5.5%
황남동 46
 
1.5%
사정동 46
 
1.5%
풍천면 42
 
1.4%
영주시 31
 
1.0%
포석로 29
 
1.0%
영덕군 24
 
0.8%
태화동 22
 
0.7%
Other values (932) 1748
58.2%
2024-03-15T07:56:23.532964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2421
 
17.3%
865
 
6.2%
622
 
4.4%
613
 
4.4%
610
 
4.3%
508
 
3.6%
1 485
 
3.5%
479
 
3.4%
473
 
3.4%
- 364
 
2.6%
Other values (251) 6588
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8399
59.9%
Space Separator 2421
 
17.3%
Decimal Number 2250
 
16.0%
Dash Punctuation 364
 
2.6%
Open Punctuation 287
 
2.0%
Close Punctuation 287
 
2.0%
Other Punctuation 20
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
865
 
10.3%
622
 
7.4%
613
 
7.3%
610
 
7.3%
508
 
6.0%
479
 
5.7%
473
 
5.6%
308
 
3.7%
274
 
3.3%
216
 
2.6%
Other values (236) 3431
40.9%
Decimal Number
ValueCountFrequency (%)
1 485
21.6%
2 305
13.6%
3 270
12.0%
5 204
9.1%
4 191
 
8.5%
6 178
 
7.9%
0 160
 
7.1%
9 156
 
6.9%
8 155
 
6.9%
7 146
 
6.5%
Space Separator
ValueCountFrequency (%)
2421
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 364
100.0%
Open Punctuation
ValueCountFrequency (%)
( 287
100.0%
Close Punctuation
ValueCountFrequency (%)
) 287
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8399
59.9%
Common 5629
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
865
 
10.3%
622
 
7.4%
613
 
7.3%
610
 
7.3%
508
 
6.0%
479
 
5.7%
473
 
5.6%
308
 
3.7%
274
 
3.3%
216
 
2.6%
Other values (236) 3431
40.9%
Common
ValueCountFrequency (%)
2421
43.0%
1 485
 
8.6%
- 364
 
6.5%
2 305
 
5.4%
( 287
 
5.1%
) 287
 
5.1%
3 270
 
4.8%
5 204
 
3.6%
4 191
 
3.4%
6 178
 
3.2%
Other values (5) 637
 
11.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8399
59.9%
ASCII 5629
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2421
43.0%
1 485
 
8.6%
- 364
 
6.5%
2 305
 
5.4%
( 287
 
5.1%
) 287
 
5.1%
3 270
 
4.8%
5 204
 
3.6%
4 191
 
3.4%
6 178
 
3.2%
Other values (5) 637
 
11.3%
Hangul
ValueCountFrequency (%)
865
 
10.3%
622
 
7.4%
613
 
7.3%
610
 
7.3%
508
 
6.0%
479
 
5.7%
473
 
5.6%
308
 
3.7%
274
 
3.3%
216
 
2.6%
Other values (236) 3431
40.9%
Distinct520
Distinct (%)87.5%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2024-03-15T07:56:24.723961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length3
Mean length3.1464646
Min length2

Characters and Unicode

Total characters1869
Distinct characters193
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique471 ?
Unique (%)79.3%

Sample

1st row김완용
2nd row포항시(이강덕)
3rd row권오본
4th row이상문
5th row이해철
ValueCountFrequency (%)
조재환 11
 
1.8%
7
 
1.1%
박윤정 5
 
0.8%
손동우 5
 
0.8%
5
 
0.8%
1 5
 
0.8%
김규리 4
 
0.6%
김인자 4
 
0.6%
이성화 3
 
0.5%
김미경 3
 
0.5%
Other values (517) 565
91.6%
2024-03-15T07:56:26.399662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
135
 
7.2%
96
 
5.1%
66
 
3.5%
52
 
2.8%
42
 
2.2%
37
 
2.0%
37
 
2.0%
36
 
1.9%
33
 
1.8%
33
 
1.8%
Other values (183) 1302
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1830
97.9%
Space Separator 23
 
1.2%
Decimal Number 7
 
0.4%
Open Punctuation 4
 
0.2%
Close Punctuation 4
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
135
 
7.4%
96
 
5.2%
66
 
3.6%
52
 
2.8%
42
 
2.3%
37
 
2.0%
37
 
2.0%
36
 
2.0%
33
 
1.8%
33
 
1.8%
Other values (178) 1263
69.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Decimal Number
ValueCountFrequency (%)
1 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1830
97.9%
Common 39
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
135
 
7.4%
96
 
5.2%
66
 
3.6%
52
 
2.8%
42
 
2.3%
37
 
2.0%
37
 
2.0%
36
 
2.0%
33
 
1.8%
33
 
1.8%
Other values (178) 1263
69.0%
Common
ValueCountFrequency (%)
23
59.0%
1 7
 
17.9%
( 4
 
10.3%
) 4
 
10.3%
, 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1830
97.9%
ASCII 39
 
2.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
135
 
7.4%
96
 
5.2%
66
 
3.6%
52
 
2.8%
42
 
2.3%
37
 
2.0%
37
 
2.0%
36
 
2.0%
33
 
1.8%
33
 
1.8%
Other values (178) 1263
69.0%
ASCII
ValueCountFrequency (%)
23
59.0%
1 7
 
17.9%
( 4
 
10.3%
) 4
 
10.3%
, 1
 
2.6%
Distinct441
Distinct (%)74.2%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
Minimum2010-01-27 00:00:00
Maximum2023-12-28 00:00:00
2024-03-15T07:56:26.828495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:56:27.244208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2024-03-15T07:56:17.314717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T07:56:17.777487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명한옥명주소대표자등록일자
0포항시새마을 인성교육관경상북도 포항시 북구 기계면 새마을발상지길 130김완용2014-12-30
1포항시포항전통문화체험관경상북도 포항시 북구 기북면 덕동문화길 7포항시(이강덕)2016-02-07
2경주시남산댁경상북도 경주시 강동면 양동마을길 147-6권오본2010-05-26
3경주시락희원경상북도 경주시 포석로1050번길 43 (황남동)이상문2010-06-25
4경주시옥산정사 독락당경상북도 경주시 안강읍 옥산서원길 300-3이해철2010-06-25
5경주시유연재경상북도 경주시 강동면 양동마을안길 7-4권순원2010-07-13
6경주시사랑채경상북도 경주시 포석로1068번길 23 (황남동, 사랑채)추종원2010-07-13
7경주시향단경상북도 경주시 강동면 양동마을길 121-75이난희2010-09-20
8경주시야선재경상북도 경주시 남산예길 103 (남산동)박정희2011-03-08
9경주시수리뫼경상북도 경주시 내남면 포석로 110-34 (용산서원)박미숙2011-03-14
시군명한옥명주소대표자등록일자
584봉화군토향고택경상북도 봉화읍 바래미1길 43김성윤2012-07-30
585봉화군추 원 재경상북도 봉화읍 충재길 87-21권용철2012-07-30
586봉화군만산고택경상북도 봉화군 춘양면 서동길 21-19강연정2012-07-30
587봉화군계서종택경상북도 봉화군 물야면 계서당길 34-1성기호2013-01-16
588봉화군성 암 재경상북도 봉화군 춘양면 서동길 19-18강지연2013-06-11
589봉화군만회고택경상북도 봉화군 봉화읍 바래미1길 51김시원2013-08-09
590봉화군소강고택경상북도 봉화군 봉화읍 바래미길 22김호진2014-07-28
591봉화군문 행 당경상북도 봉화군 봉화읍 충재길 87-65권도현(이경선)2015-02-16
592봉화군기헌고택경상북도 봉화군 법전면 경체정길 18정영림(강석우)2015-04-13
593봉화군망와고택경상북도 봉화군 물야면 오록2길 48김기홍2017-07-20