Overview

Dataset statistics

Number of variables: 6
Number of observations: 235
Missing cells: 57
Missing cells (%): 4.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.4 KiB
Average record size in memory: 49.6 B
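
As a quick sanity check, the missing-cell percentage can be reproduced from the other counts. The sketch below assumes the percentage is the number of missing cells divided by the total number of cells (variables × observations); the variable names are illustrative only.

```python
# Counts taken from the dataset statistics above.
n_variables = 6
n_observations = 235
missing_cells = 57

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```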

Variable types

Categorical: 4
Text: 1
Numeric: 1

Dataset

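Description: Survey results on traditional buildings located in Jeju Special Self-Governing Province, providing information such as region, building name, and year of construction.
Author: Jeju Special Self-Governing Province (제주특별자치도)
URL: https://www.data.go.kr/data/15069608/fileData.do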

Alerts

구분 has constant value "전통건축" (Constant)
데이터기준일자 has constant value "2021-10-15" (Constant)
건축년도 is highly overall correlated with 건축년도 비고 (High correlation)
건축년도 비고 is highly overall correlated with 건축년도 (High correlation)
건축년도 비고 is highly imbalanced (62.5%) (Imbalance)
건축년도 has 57 (24.3%) missing values (Missing)
건축물명 has unique values (Unique)
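
To make the Constant, Unique and Missing alerts concrete, here is a minimal sketch of how such labels could be derived from a column of values. The helper `simple_alerts` and its threshold are hypothetical, and the rules are simplified assumptions rather than the checks that ydata-profiling actually runs.

```python
def simple_alerts(values, missing_threshold=0.0):
    """Return alert labels for one column; None marks a missing value.

    Hypothetical helper: the rules below are simplified assumptions,
    not the profiler's real implementation.
    """
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    distinct = set(present)
    alerts = []
    if len(distinct) == 1:
        alerts.append("Constant")   # every present value is the same
    if present and n_missing == 0 and len(distinct) == len(present):
        alerts.append("Unique")     # no value repeats
    if values and n_missing / len(values) > missing_threshold:
        alerts.append("Missing")    # share of missing values exceeds the threshold
    return alerts


# Tiny made-up columns that mirror the shape of the alerts above.
print(simple_alerts(["전통건축"] * 4))
print(simple_alerts([1694, 1700, None, None]))
print(simple_alerts(["a", "b", "c", "d"]))
```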

Reproduction

Analysis started: 2023-12-12 23:22:41.029529
Analysis finished: 2023-12-12 23:22:41.567026
Duration: 0.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지역
Categorical

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
제주시: 156
서귀포시: 79

Length

Max length: 4
Median length: 3
Mean length: 3.3361702
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제주시
2nd row: 제주시
3rd row: 제주시
4th row: 제주시
5th row: 제주시

Common Values

Value | Count | Frequency (%)
제주시 | 156 | 66.4%
서귀포시 | 79 | 33.6%

Length

Histogram of lengths of the category

건축물명
Text

UNIQUE 

Distinct: 235
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 19
Median length: 16
Mean length: 12.229787
Min length: 4

Characters and Unicode

Total characters: 2874
Distinct characters: 212
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
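
As an illustration of this kind of analysis, the sketch below counts Unicode general categories for one value of this variable using Python's standard `unicodedata` module. It only covers the category property; the standard library does not expose script or block names, so those parts of the report are not reproduced here.

```python
import unicodedata
from collections import Counter

# One value of this variable, taken from the report's sample rows.
text = "화북일동 4271-1번지 단독주택"

# unicodedata.category returns the two-letter general category,
# e.g. "Lo" (Other Letter), "Nd" (Decimal Number), "Zs" (Space Separator).
category_counts = Counter(unicodedata.category(ch) for ch in text)

for category, count in category_counts.most_common():
    print(category, count)
```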

Unique

Unique: 235
Unique (%): 100.0%

Sample

1st row: 화북일동 4271-1번지 단독주택
2nd row: 화복일동 4067번지 단독주택
3rd row: 화북일동 4051-1번지 단독주택
4th row: 화북일동 4284번지 단독주택
5th row: 화북일동 4049-1 번지 단독주택

Value | Count | Frequency (%)
주택 | 150 | 23.1%
단독주택 | 37 | 5.7%
화북일동 | 13 | 2.0%
성읍리 | 10 | 1.5%
번지 | 9 | 1.4%
가옥 | 9 | 1.4%
조천리 | 9 | 1.4%
한림리 | 6 | 0.9%
대림리 | 6 | 0.9%
귀덕리 | 6 | 0.9%
Other values (323) | 393 | 60.6%

Most occurring characters

Value | Count | Frequency (%)
(space) | 413 | 14.4%
 | 199 | 6.9%
 | 194 | 6.8%
1 | 182 | 6.3%
 | 128 | 4.5%
2 | 127 | 4.4%
- | 95 | 3.3%
 | 94 | 3.3%
 | 89 | 3.1%
4 | 76 | 2.6%
Other values (202) | 1277 | 44.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1540 | 53.6%
Decimal Number | 784 | 27.3%
Space Separator | 413 | 14.4%
Dash Punctuation | 95 | 3.3%
Close Punctuation | 21 | 0.7%
Open Punctuation | 21 | 0.7%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 199 | 12.9%
 | 194 | 12.6%
 | 128 | 8.3%
 | 94 | 6.1%
 | 89 | 5.8%
 | 72 | 4.7%
 | 37 | 2.4%
 | 37 | 2.4%
 | 23 | 1.5%
 | 20 | 1.3%
Other values (188) | 647 | 42.0%

Decimal Number
Value | Count | Frequency (%)
1 | 182 | 23.2%
2 | 127 | 16.2%
4 | 76 | 9.7%
3 | 75 | 9.6%
5 | 63 | 8.0%
6 | 56 | 7.1%
9 | 53 | 6.8%
7 | 52 | 6.6%
8 | 51 | 6.5%
0 | 49 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 413 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 95 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 21 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1466 | 51.0%
Common | 1334 | 46.4%
Han | 74 | 2.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 199 | 13.6%
 | 194 | 13.2%
 | 128 | 8.7%
 | 94 | 6.4%
 | 89 | 6.1%
 | 72 | 4.9%
 | 37 | 2.5%
 | 37 | 2.5%
 | 23 | 1.6%
 | 20 | 1.4%
Other values (138) | 573 | 39.1%

Han
Value | Count | Frequency (%)
 | 5 | 6.8%
 | 5 | 6.8%
 | 3 | 4.1%
 | 3 | 4.1%
 | 3 | 4.1%
 | 2 | 2.7%
 | 2 | 2.7%
 | 2 | 2.7%
 | 2 | 2.7%
 | 2 | 2.7%
Other values (40) | 45 | 60.8%

Common
Value | Count | Frequency (%)
(space) | 413 | 31.0%
1 | 182 | 13.6%
2 | 127 | 9.5%
- | 95 | 7.1%
4 | 76 | 5.7%
3 | 75 | 5.6%
5 | 63 | 4.7%
6 | 56 | 4.2%
9 | 53 | 4.0%
7 | 52 | 3.9%
Other values (4) | 142 | 10.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1466 | 51.0%
ASCII | 1334 | 46.4%
CJK | 73 | 2.5%
CJK Compat Ideographs | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 413 | 31.0%
1 | 182 | 13.6%
2 | 127 | 9.5%
- | 95 | 7.1%
4 | 76 | 5.7%
3 | 75 | 5.6%
5 | 63 | 4.7%
6 | 56 | 4.2%
9 | 53 | 4.0%
7 | 52 | 3.9%
Other values (4) | 142 | 10.6%

Hangul
Value | Count | Frequency (%)
 | 199 | 13.6%
 | 194 | 13.2%
 | 128 | 8.7%
 | 94 | 6.4%
 | 89 | 6.1%
 | 72 | 4.9%
 | 37 | 2.5%
 | 37 | 2.5%
 | 23 | 1.6%
 | 20 | 1.4%
Other values (138) | 573 | 39.1%

CJK
Value | Count | Frequency (%)
 | 5 | 6.8%
 | 5 | 6.8%
 | 3 | 4.1%
 | 3 | 4.1%
 | 3 | 4.1%
 | 2 | 2.7%
 | 2 | 2.7%
 | 2 | 2.7%
 | 2 | 2.7%
 | 2 | 2.7%
Other values (39) | 44 | 60.3%

CJK Compat Ideographs
Value | Count | Frequency (%)
 | 1 | 100.0%

구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
전통건축: 235

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전통건축
2nd row: 전통건축
3rd row: 전통건축
4th row: 전통건축
5th row: 전통건축

Common Values

Value | Count | Frequency (%)
전통건축 | 235 | 100.0%

Length

Histogram of lengths of the category

건축년도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 49
Distinct (%): 27.5%
Missing: 57
Missing (%): 24.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 1922.382
Minimum: 1694
Maximum: 2016
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 1694
5-th percentile: 1834.65
Q1: 1928.5
Median: 1933
Q3: 1940
95-th percentile: 1945
Maximum: 2016
Range: 322
Interquartile range (IQR): 11.5

Descriptive statistics

Standard deviation: 46.860294
Coefficient of variation (CV): 0.024376161
Kurtosis: 13.367607
Mean: 1922.382
Median Absolute Deviation (MAD): 7
Skewness: -3.5562284
Sum: 342184
Variance: 2195.8871
Monotonicity: Increasing
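
Several of these figures follow from one another, which makes them easy to cross-check. The sketch below recomputes the mean, coefficient of variation, variance, IQR and range from the other reported values, assuming the usual definitions (mean = sum / non-missing count, CV = standard deviation / mean, variance = standard deviation squared). The variable names are illustrative.

```python
# Values reported for 건축년도 in this report.
n_total, n_missing = 235, 57
total_sum = 342184
std = 46.860294
q1, q3 = 1928.5, 1940
minimum, maximum = 1694, 2016

n_present = n_total - n_missing   # rows with a recorded year
mean = total_sum / n_present      # sum divided by the non-missing count
cv = std / mean                   # coefficient of variation
variance = std ** 2
iqr = q3 - q1
value_range = maximum - minimum

print(n_present, round(mean, 3), round(cv, 6), round(variance, 2), iqr, value_range)
```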
Histogram with fixed size bins (bins=49)

Value | Count | Frequency (%)
1940 | 22 | 9.4%
1933 | 21 | 8.9%
1930 | 17 | 7.2%
1945 | 17 | 7.2%
1937 | 11 | 4.7%
1935 | 10 | 4.3%
1925 | 9 | 3.8%
1934 | 6 | 2.6%
1932 | 6 | 2.6%
1920 | 6 | 2.6%
Other values (39) | 53 | 22.6%
(Missing) | 57 | 24.3%

Value | Count | Frequency (%)
1694 | 1 | 0.4%
1700 | 2 | 0.9%
1710 | 1 | 0.4%
1721 | 1 | 0.4%
1730 | 1 | 0.4%
1790 | 1 | 0.4%
1800 | 1 | 0.4%
1810 | 1 | 0.4%
1839 | 1 | 0.4%
1864 | 1 | 0.4%

Value | Count | Frequency (%)
2016 | 1 | 0.4%
1974 | 1 | 0.4%
1969 | 1 | 0.4%
1967 | 1 | 0.4%
1952 | 1 | 0.4%
1950 | 1 | 0.4%
1947 | 1 | 0.4%
1945 | 17 | 7.2%
1943 | 2 | 0.9%
1941 | 2 | 0.9%

건축년도 비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 14
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
특이사항없음: 178
건축년도 미상: 22
조선시대: 17
건축년도 1930년대 이전: 4
건축년도 1940 이전: 3
Other values (9): 11

Length

Max length: 18
Median length: 6
Mean length: 6.4425532
Min length: 4

Unique

Unique: 7
Unique (%): 3.0%

Sample

1st row: 특이사항없음
2nd row: 특이사항없음
3rd row: 특이사항없음
4th row: 특이사항없음
5th row: 특이사항없음

Common Values

Value | Count | Frequency (%)
특이사항없음 | 178 | 75.7%
건축년도 미상 | 22 | 9.4%
조선시대 | 17 | 7.2%
건축년도 1930년대 이전 | 4 | 1.7%
건축년도 1940 이전 | 3 | 1.3%
고려시대 | 2 | 0.9%
건축년도 1930년대 | 2 | 0.9%
건축년도 1373 예측 | 1 | 0.4%
건축년도 1900년대 | 1 | 0.4%
건축년도 1930(1961) | 1 | 0.4%
Other values (4) | 4 | 1.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
특이사항없음 | 178 | 62.9%
건축년도 | 38 | 13.4%
미상 | 22 | 7.8%
조선시대 | 17 | 6.0%
이전 | 7 | 2.5%
1930년대 | 6 | 2.1%
1940 | 3 | 1.1%
고려시대 | 2 | 0.7%
1945 | 1 | 0.4%
1800년대 | 1 | 0.4%
Other values (8) | 8 | 2.8%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
2021-10-15: 235

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-10-15
2nd row: 2021-10-15
3rd row: 2021-10-15
4th row: 2021-10-15
5th row: 2021-10-15

Common Values

Value | Count | Frequency (%)
2021-10-15 | 235 | 100.0%

Length

Histogram of lengths of the category

Correlations

 | 지역 | 건축년도 | 건축년도 비고
지역 | 1.000 | 0.106 | 0.000
건축년도 | 0.106 | 1.000 | NaN
건축년도 비고 | 0.000 | NaN | 1.000

 | 지역 | 건축년도 비고
지역 | 1.000 | 0.000
건축년도 비고 | 0.000 | 1.000

 | 건축년도 | 지역 | 건축년도 비고
건축년도 | 1.000 | 0.094 | 1.000
지역 | 0.094 | 1.000 | 0.000
건축년도 비고 | 1.000 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
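
Since the plots themselves are not reproduced here, the sketch below shows what the two views summarise, using three rows from this report's sample data (with `None` standing in for `<NA>`). The text rendering with `#` and `.` is an illustrative assumption, not the profiler's own output.

```python
# Three rows from the report's sample data; None stands for <NA>.
columns = ["지역", "건축물명", "건축년도"]
rows = [
    ("제주시", "화북일동 4271-1번지 단독주택", 1694),
    ("제주시", "조천정미소", None),
    ("서귀포시", "한봉일 가옥", None),
]

# Nullity by column: how many values are present in each column.
present_per_column = {
    name: sum(row[i] is not None for row in rows)
    for i, name in enumerate(columns)
}

# Nullity matrix: one line per row, "#" for a present cell, "." for a missing one.
matrix_lines = [
    "".join("#" if cell is not None else "." for cell in row)
    for row in rows
]

print(present_per_column)
print("\n".join(matrix_lines))
```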

Sample

 | 지역 | 건축물명 | 구분 | 건축년도 | 건축년도 비고 | 데이터기준일자
0 | 제주시 | 화북일동 4271-1번지 단독주택 | 전통건축 | 1694 | 특이사항없음 | 2021-10-15
1 | 제주시 | 화복일동 4067번지 단독주택 | 전통건축 | 1700 | 특이사항없음 | 2021-10-15
2 | 제주시 | 화북일동 4051-1번지 단독주택 | 전통건축 | 1700 | 특이사항없음 | 2021-10-15
3 | 제주시 | 화북일동 4284번지 단독주택 | 전통건축 | 1710 | 특이사항없음 | 2021-10-15
4 | 제주시 | 화북일동 4049-1 번지 단독주택 | 전통건축 | 1721 | 특이사항없음 | 2021-10-15
5 | 제주시 | 화북일동 4252-1번지 단독주택 | 전통건축 | 1730 | 특이사항없음 | 2021-10-15
6 | 제주시 | 화북일동 1817-1번지 단독주택 | 전통건축 | 1790 | 특이사항없음 | 2021-10-15
7 | 제주시 | 화북일동 1833-1번지 단독주택 | 전통건축 | 1800 | 특이사항없음 | 2021-10-15
8 | 제주시 | 화북일동 1520-1번지 단독주택 | 전통건축 | 1810 | 특이사항없음 | 2021-10-15
9 | 제주시 | 화북일동 1475-1번지 단독주택 | 전통건축 | 1839 | 특이사항없음 | 2021-10-15

 | 지역 | 건축물명 | 구분 | 건축년도 | 건축년도 비고 | 데이터기준일자
225 | 제주시 | 조천정미소 | 전통건축 | <NA> | 건축년도 1930(1954) | 2021-10-15
226 | 서귀포시 | 천미연대(川尾烟臺) | 전통건축 | <NA> | 조선시대 | 2021-10-15
227 | 제주시 | 최영장군 사당 | 전통건축 | <NA> | 건축년도 미상 | 2021-10-15
228 | 제주시 | 하귀2리 2027-1 주택 | 전통건축 | <NA> | 건축년도 1930년대 이전 | 2021-10-15
229 | 제주시 | 한림리 1445-6 주택 | 전통건축 | <NA> | 건축년도 미상 | 2021-10-15
230 | 서귀포시 | 한봉일 가옥 | 전통건축 | <NA> | 건축년도 미상 | 2021-10-15
231 | 제주시 | 한수리 892 주택 | 전통건축 | <NA> | 건축년도 1800년대 추정 | 2021-10-15
232 | 제주시 | 해신사(海神祠) | 전통건축 | <NA> | 조선시대 | 2021-10-15
233 | 제주시 | 향사당(鄕射堂) | 전통건축 | <NA> | 조선시대 | 2021-10-15
234 | 서귀포시 | 협자연대(俠子烟臺) | 전통건축 | <NA> | 조선시대 | 2021-10-15