Overview

Dataset statistics

Number of variables6
Number of observations39
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory52.3 B

Variable types

Numeric1
Categorical2
Text2
DateTime1

Dataset

Description연번,지정,문화재명,소 재 지,지 정 별,지정일자
Author강북구
URLhttps://data.seoul.go.kr/dataList/OA-11635/S/1/datasetView.do

Alerts

연번 is highly overall correlated with 지정High correlation
지정 is highly overall correlated with 연번High correlation
문화재명 has unique valuesUnique
지 정 별 has unique valuesUnique

Reproduction

Analysis started2024-04-20 17:53:09.201416
Analysis finished2024-04-20 17:53:10.415512
Duration1.21 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION 

Distinct33
Distinct (%)84.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.153846
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size479.0 B
2024-04-21T02:53:10.605403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.9
Q110.5
median18
Q323.5
95-th percentile31.1
Maximum33
Range32
Interquartile range (IQR)13

Descriptive statistics

Standard deviation8.8809034
Coefficient of variation (CV)0.51772083
Kurtosis-0.83379538
Mean17.153846
Median Absolute Deviation (MAD)7
Skewness-0.054546958
Sum669
Variance78.870445
MonotonicityNot monotonic
2024-04-21T02:53:11.011764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
18 7
 
17.9%
33 1
 
2.6%
4 1
 
2.6%
7 1
 
2.6%
6 1
 
2.6%
29 1
 
2.6%
17 1
 
2.6%
16 1
 
2.6%
5 1
 
2.6%
31 1
 
2.6%
Other values (23) 23
59.0%
ValueCountFrequency (%)
1 1
2.6%
2 1
2.6%
3 1
2.6%
4 1
2.6%
5 1
2.6%
6 1
2.6%
7 1
2.6%
8 1
2.6%
9 1
2.6%
10 1
2.6%
ValueCountFrequency (%)
33 1
2.6%
32 1
2.6%
31 1
2.6%
30 1
2.6%
29 1
2.6%
28 1
2.6%
27 1
2.6%
26 1
2.6%
25 1
2.6%
24 1
2.6%

지정
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size440.0 B
유형
23 
등록
문화재자료
 
2
기념물
 
2
보물
 
2
Other values (3)

Length

Max length5
Median length2
Mean length2.2051282
Min length2

Unique

Unique3 ?
Unique (%)7.7%

Sample

1st row문화재자료
2nd row기념물
3rd row유형
4th row유형
5th row문화재자료

Common Values

ValueCountFrequency (%)
유형 23
59.0%
등록 7
 
17.9%
문화재자료 2
 
5.1%
기념물 2
 
5.1%
보물 2
 
5.1%
무형 1
 
2.6%
명승 1
 
2.6%
사적 1
 
2.6%

Length

2024-04-21T02:53:11.451756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T02:53:11.816063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유형 23
59.0%
등록 7
 
17.9%
문화재자료 2
 
5.1%
기념물 2
 
5.1%
보물 2
 
5.1%
무형 1
 
2.6%
명승 1
 
2.6%
사적 1
 
2.6%

문화재명
Text

UNIQUE 

Distinct39
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size440.0 B
2024-04-21T02:53:12.653727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length17
Mean length10.846154
Min length3

Characters and Unicode

Total characters423
Distinct characters140
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)100.0%

Sample

1st row송계별업 터
2nd row사릉 석물 채석장 터
3rd row현수제승법수
4th row도선사 석조관음보살좌상
5th row천지명양수륙재의찬요
ValueCountFrequency (%)
화계사 11
 
10.7%
서울 7
 
6.8%
묘소 6
 
5.8%
5
 
4.9%
도선사 5
 
4.9%
명부전 3
 
2.9%
삼각산 2
 
1.9%
2
 
1.9%
2
 
1.9%
일괄 2
 
1.9%
Other values (58) 58
56.3%
2024-04-21T02:53:13.942804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
15.4%
22
 
5.2%
13
 
3.1%
12
 
2.8%
11
 
2.6%
10
 
2.4%
8
 
1.9%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (130) 258
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 344
81.3%
Space Separator 65
 
15.4%
Close Punctuation 6
 
1.4%
Open Punctuation 6
 
1.4%
Dash Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
6.4%
13
 
3.8%
12
 
3.5%
11
 
3.2%
10
 
2.9%
8
 
2.3%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
Other values (125) 237
68.9%
Space Separator
ValueCountFrequency (%)
65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
? 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 326
77.1%
Common 79
 
18.7%
Han 18
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
6.7%
13
 
4.0%
12
 
3.7%
11
 
3.4%
10
 
3.1%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.1%
Other values (111) 219
67.2%
Han
ValueCountFrequency (%)
4
22.2%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%
Common
ValueCountFrequency (%)
65
82.3%
) 6
 
7.6%
( 6
 
7.6%
- 1
 
1.3%
? 1
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 326
77.1%
ASCII 79
 
18.7%
CJK 18
 
4.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
65
82.3%
) 6
 
7.6%
( 6
 
7.6%
- 1
 
1.3%
? 1
 
1.3%
Hangul
ValueCountFrequency (%)
22
 
6.7%
13
 
4.0%
12
 
3.7%
11
 
3.4%
10
 
3.1%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.1%
Other values (111) 219
67.2%
CJK
ValueCountFrequency (%)
4
22.2%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%

소 재 지
Categorical

Distinct15
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Memory size440.0 B
화계사길 117(수유동)
11 
삼양로173길 504 (우이동)
10 
수유동 산127-1
4.19로28길 101(운가사)
우이동 산 68-1 외1
Other values (10)
10 

Length

Max length28
Median length20
Mean length14.384615
Min length9

Unique

Unique10 ?
Unique (%)25.6%

Sample

1st row수유동 산127-1
2nd row수유동 산 127-1, 산 86-1
3rd row4.19로28길 101(운가사)
4th row삼양로173길 504(우이동)
5th row4.19로28길 101(운가사)

Common Values

ValueCountFrequency (%)
화계사길 117(수유동) 11
28.2%
삼양로173길 504 (우이동) 10
25.6%
수유동 산127-1 4
 
10.3%
4.19로28길 101(운가사) 2
 
5.1%
우이동 산 68-1 외1 2
 
5.1%
수유동 산 127-1, 산 86-1 1
 
2.6%
삼양로173길 504(우이동) 1
 
2.6%
우이동 106-1 1
 
2.6%
수유동 산127-1외 1 1
 
2.6%
수유동 산127-4 1
 
2.6%
Other values (5) 5
12.8%

Length

2024-04-21T02:53:14.188014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
우이동 15
14.6%
삼양로173길 12
11.7%
화계사길 11
10.7%
117(수유동 11
10.7%
504 10
9.7%
수유동 8
 
7.8%
산127-1 4
 
3.9%
4
 
3.9%
외1 2
 
1.9%
1 2
 
1.9%
Other values (21) 24
23.3%

지 정 별
Text

UNIQUE 

Distinct39
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size440.0 B
2024-04-21T02:53:14.912729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0769231
Min length3

Characters and Unicode

Total characters198
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)100.0%

Sample

1st row제75호
2nd row제44호
3rd row제433호
4th row제396호
5th row제67호
ValueCountFrequency (%)
제75호 1
 
2.6%
제514호 1
 
2.6%
제259-2호 1
 
2.6%
제259호 1
 
2.6%
제259-1호 1
 
2.6%
제259-3호 1
 
2.6%
제259-4호 1
 
2.6%
제259-5호 1
 
2.6%
제259-6호 1
 
2.6%
제42호 1
 
2.6%
Other values (29) 29
74.4%
2024-04-21T02:53:16.069330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
19.7%
39
19.7%
5 19
9.6%
3 16
8.1%
2 16
8.1%
1 15
 
7.6%
9 14
 
7.1%
6 9
 
4.5%
4 8
 
4.0%
8 8
 
4.0%
Other values (3) 15
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 113
57.1%
Other Letter 78
39.4%
Dash Punctuation 7
 
3.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 19
16.8%
3 16
14.2%
2 16
14.2%
1 15
13.3%
9 14
12.4%
6 9
8.0%
4 8
7.1%
8 8
7.1%
0 5
 
4.4%
7 3
 
2.7%
Other Letter
ValueCountFrequency (%)
39
50.0%
39
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 120
60.6%
Hangul 78
39.4%

Most frequent character per script

Common
ValueCountFrequency (%)
5 19
15.8%
3 16
13.3%
2 16
13.3%
1 15
12.5%
9 14
11.7%
6 9
7.5%
4 8
6.7%
8 8
6.7%
- 7
 
5.8%
0 5
 
4.2%
Hangul
ValueCountFrequency (%)
39
50.0%
39
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120
60.6%
Hangul 78
39.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
50.0%
39
50.0%
ASCII
ValueCountFrequency (%)
5 19
15.8%
3 16
13.3%
2 16
13.3%
1 15
12.5%
9 14
11.7%
6 9
7.5%
4 8
6.7%
8 8
6.7%
- 7
 
5.8%
0 5
 
4.2%
Distinct20
Distinct (%)51.3%
Missing0
Missing (%)0.0%
Memory size440.0 B
Minimum1968-12-05 00:00:00
Maximum2019-08-08 00:00:00
2024-04-21T02:53:16.269978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T02:53:16.478645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)

Interactions

2024-04-21T02:53:09.606210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T02:53:16.641799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지정문화재명소 재 지지 정 별지정일자
연번1.0000.7921.0000.8571.0000.962
지정0.7921.0001.0000.8411.0000.992
문화재명1.0001.0001.0001.0001.0001.000
소 재 지0.8570.8411.0001.0001.0000.961
지 정 별1.0001.0001.0001.0001.0001.000
지정일자0.9620.9921.0000.9611.0001.000
2024-04-21T02:53:16.810505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정소 재 지
지정1.0000.489
소 재 지0.4891.000
2024-04-21T02:53:16.946706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지정소 재 지
연번1.0000.5160.457
지정0.5161.0000.489
소 재 지0.4570.4891.000

Missing values

2024-04-21T02:53:09.939254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T02:53:10.283797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지정문화재명소 재 지지 정 별지정일자
033문화재자료송계별업 터수유동 산127-1제75호2019.08.08.
131기념물사릉 석물 채석장 터수유동 산 127-1, 산 86-1제44호2019.08.08.
228유형현수제승법수4.19로28길 101(운가사)제433호2018.12.13.
327유형도선사 석조관음보살좌상삼양로173길 504(우이동)제396호2016.12.08.
432문화재자료천지명양수륙재의찬요4.19로28길 101(운가사)제67호2016.10.06.
522유형화계사 탑다라니판화계사길 117(수유동)제388호2016.08.04.
623유형화계사 아미타후불도화계사길 117(수유동)제389호2016.08.04.
724유형화계사 명부전 지장보살도화계사길 117(수유동)제390호2016.08.04.
820유형화계사 아미타괘불도 및 오여래도화계사길 117(수유동)제386호2016.08.04.
925유형화계사 명부전 십대왕도화계사길 117(수유동)제391호2016.08.04.
연번지정문화재명소 재 지지 정 별지정일자
2917유형도선사 석 독성상삼양로173길 504 (우이동)제192호2004.09.30.
3016유형도선사 목 아미타불?대세지보살상삼양로173길 504 (우이동)제191호2004.09.30.
314명승삼각산우이동 산 68-1 외1제10호2003.10.31.
325등록서울 창녕위궁 재사월계로 173 (번동)제40호2002.09.13.
3315유형본원정사 목 보살좌상(지장보살)삼각산로 1 (수유동)제136호2001.09.15.
341보물사인비구 제작 동종-서울 화계사 동종화계사길 117(수유동)제11-5호2000.02.15.
3514유형화계사 대웅전화계사길 117(수유동)제65호1985.12.05.
3613유형도선사 마애불입상삼양로173길 504 (우이동)제34호1977.09.05.
3712유형봉황각삼양로173길 107-12 (우이동)제2호1969.09.18.
383사적북한산성우이동 산 68-1 외1제162호1968.12.05.