Overview

Dataset statistics

Number of variables5
Number of observations76
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory41.7 B

Variable types

Text3
DateTime1
Categorical1

Dataset

Description괴산군의 문화관광 보물 보유 현황입니다. 문화재 명칭, 지정별, 문화재 위치, 문화재 지정일, 데이터기준일 등의 항목을 제공합니다.
Author충청북도 괴산군
URLhttps://www.data.go.kr/data/3070899/fileData.do

Alerts

데이터기준일 has constant value ""Constant
지정별 has unique valuesUnique
문 화 재 명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:36:18.850332
Analysis finished2023-12-12 12:36:19.516742
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지정별
Text

UNIQUE 

Distinct76
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-12T21:36:19.700974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length6.5657895
Min length5

Characters and Unicode

Total characters499
Distinct characters31
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)100.0%

Sample

1st row보물 97호
2nd row보물 433호
3rd row보물 566호
4th row보물1295호
5th row보물1299호
ValueCountFrequency (%)
지유 25
 
16.9%
문자 14
 
9.5%
지기 11
 
7.4%
천기 7
 
4.7%
민속 4
 
2.7%
보물 3
 
2.0%
13호 2
 
1.4%
12호 2
 
1.4%
30호 2
 
1.4%
22호 2
 
1.4%
Other values (72) 76
51.4%
2023-12-12T21:36:20.123719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
76
15.2%
72
14.4%
1 50
 
10.0%
36
 
7.2%
2 30
 
6.0%
3 27
 
5.4%
25
 
5.0%
18
 
3.6%
4 16
 
3.2%
6 15
 
3.0%
Other values (21) 134
26.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 228
45.7%
Decimal Number 199
39.9%
Space Separator 72
 
14.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
76
33.3%
36
15.8%
25
 
11.0%
18
 
7.9%
14
 
6.1%
14
 
6.1%
7
 
3.1%
7
 
3.1%
7
 
3.1%
6
 
2.6%
Other values (10) 18
 
7.9%
Decimal Number
ValueCountFrequency (%)
1 50
25.1%
2 30
15.1%
3 27
13.6%
4 16
 
8.0%
6 15
 
7.5%
7 15
 
7.5%
0 14
 
7.0%
5 13
 
6.5%
9 13
 
6.5%
8 6
 
3.0%
Space Separator
ValueCountFrequency (%)
72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 271
54.3%
Hangul 228
45.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
76
33.3%
36
15.8%
25
 
11.0%
18
 
7.9%
14
 
6.1%
14
 
6.1%
7
 
3.1%
7
 
3.1%
7
 
3.1%
6
 
2.6%
Other values (10) 18
 
7.9%
Common
ValueCountFrequency (%)
72
26.6%
1 50
18.5%
2 30
11.1%
3 27
 
10.0%
4 16
 
5.9%
6 15
 
5.5%
7 15
 
5.5%
0 14
 
5.2%
5 13
 
4.8%
9 13
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 271
54.3%
Hangul 228
45.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
76
33.3%
36
15.8%
25
 
11.0%
18
 
7.9%
14
 
6.1%
14
 
6.1%
7
 
3.1%
7
 
3.1%
7
 
3.1%
6
 
2.6%
Other values (10) 18
 
7.9%
ASCII
ValueCountFrequency (%)
72
26.6%
1 50
18.5%
2 30
11.1%
3 27
 
10.0%
4 16
 
5.9%
6 15
 
5.5%
7 15
 
5.5%
0 14
 
5.2%
5 13
 
4.8%
9 13
 
4.8%

문 화 재 명
Text

UNIQUE 

Distinct76
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-12T21:36:20.419608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length10.618421
Min length4

Characters and Unicode

Total characters807
Distinct characters268
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)100.0%

Sample

1st row槐山 院豊里 磨崖二佛 坐像
2nd row槐山 覺淵寺 石造毘盧遮那佛坐像
3rd row柳根肖像-71세상
4th row槐山 覺淵寺 通一大師塔碑
5th row槐山 寶安寺 三層石塔
ValueCountFrequency (%)
槐山 57
27.1%
괴산 7
 
3.3%
覺淵寺 6
 
2.9%
자생지 4
 
1.9%
미선나무 3
 
1.4%
墓所 3
 
1.4%
空林寺 3
 
1.4%
古宅 2
 
1.0%
辛景行 2
 
1.0%
古家 2
 
1.0%
Other values (114) 121
57.6%
2023-12-12T21:36:20.943334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
135
 
16.7%
64
 
7.9%
61
 
7.6%
15
 
1.9%
12
 
1.5%
9
 
1.1%
9
 
1.1%
8
 
1.0%
8
 
1.0%
8
 
1.0%
Other values (258) 478
59.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 641
79.4%
Space Separator 135
 
16.7%
Decimal Number 12
 
1.5%
Close Punctuation 7
 
0.9%
Open Punctuation 7
 
0.9%
Other Symbol 2
 
0.2%
Other Punctuation 2
 
0.2%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
10.0%
61
 
9.5%
15
 
2.3%
12
 
1.9%
9
 
1.4%
9
 
1.4%
8
 
1.2%
8
 
1.2%
8
 
1.2%
7
 
1.1%
Other values (244) 440
68.6%
Decimal Number
ValueCountFrequency (%)
7 3
25.0%
1 3
25.0%
3 2
16.7%
2 1
 
8.3%
5 1
 
8.3%
0 1
 
8.3%
6 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
. 1
50.0%
Space Separator
ValueCountFrequency (%)
135
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 528
65.4%
Common 166
 
20.6%
Hangul 113
 
14.0%

Most frequent character per script

Han
ValueCountFrequency (%)
64
 
12.1%
61
 
11.6%
15
 
2.8%
12
 
2.3%
9
 
1.7%
9
 
1.7%
8
 
1.5%
7
 
1.3%
7
 
1.3%
7
 
1.3%
Other values (187) 329
62.3%
Hangul
ValueCountFrequency (%)
8
 
7.1%
8
 
7.1%
7
 
6.2%
7
 
6.2%
7
 
6.2%
5
 
4.4%
5
 
4.4%
4
 
3.5%
4
 
3.5%
3
 
2.7%
Other values (47) 55
48.7%
Common
ValueCountFrequency (%)
135
81.3%
) 7
 
4.2%
( 7
 
4.2%
7 3
 
1.8%
1 3
 
1.8%
3 2
 
1.2%
2
 
1.2%
, 1
 
0.6%
2 1
 
0.6%
5 1
 
0.6%
Other values (4) 4
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
CJK 513
63.6%
ASCII 164
 
20.3%
Hangul 113
 
14.0%
CJK Compat Ideographs 15
 
1.9%
CJK Compat 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
135
82.3%
) 7
 
4.3%
( 7
 
4.3%
7 3
 
1.8%
1 3
 
1.8%
3 2
 
1.2%
, 1
 
0.6%
2 1
 
0.6%
5 1
 
0.6%
. 1
 
0.6%
Other values (3) 3
 
1.8%
CJK
ValueCountFrequency (%)
64
 
12.5%
61
 
11.9%
15
 
2.9%
12
 
2.3%
9
 
1.8%
9
 
1.8%
8
 
1.6%
7
 
1.4%
7
 
1.4%
7
 
1.4%
Other values (178) 314
61.2%
Hangul
ValueCountFrequency (%)
8
 
7.1%
8
 
7.1%
7
 
6.2%
7
 
6.2%
7
 
6.2%
5
 
4.4%
5
 
4.4%
4
 
3.5%
4
 
3.5%
3
 
2.7%
Other values (47) 55
48.7%
CJK Compat Ideographs
ValueCountFrequency (%)
3
20.0%
3
20.0%
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
CJK Compat
ValueCountFrequency (%)
2
100.0%
Distinct70
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-12T21:36:21.350379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length12.644737
Min length8

Characters and Unicode

Total characters961
Distinct characters107
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)85.5%

Sample

1st row연풍면 원풍리 124-2
2nd row칠성면 각연길 451
3rd row소수면 몽촌리 114-1
4th row칠성면 태성리 산7-1
5th row청안면 효근리 385-2
ValueCountFrequency (%)
청천면 13
 
5.6%
괴산읍 12
 
5.1%
칠성면 11
 
4.7%
청안면 11
 
4.7%
불정면 8
 
3.4%
연풍면 6
 
2.6%
장연면 5
 
2.1%
사담리 4
 
1.7%
삼방리 3
 
1.3%
괴산로 3
 
1.3%
Other values (131) 158
67.5%
2023-12-12T21:36:21.896223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
159
 
16.5%
61
 
6.3%
1 55
 
5.7%
38
 
4.0%
36
 
3.7%
3 35
 
3.6%
- 34
 
3.5%
33
 
3.4%
2 33
 
3.4%
32
 
3.3%
Other values (97) 445
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 539
56.1%
Decimal Number 228
23.7%
Space Separator 159
 
16.5%
Dash Punctuation 34
 
3.5%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
11.3%
38
 
7.1%
36
 
6.7%
33
 
6.1%
32
 
5.9%
21
 
3.9%
19
 
3.5%
16
 
3.0%
16
 
3.0%
15
 
2.8%
Other values (84) 252
46.8%
Decimal Number
ValueCountFrequency (%)
1 55
24.1%
3 35
15.4%
2 33
14.5%
4 30
13.2%
5 23
10.1%
8 14
 
6.1%
0 12
 
5.3%
6 9
 
3.9%
7 9
 
3.9%
9 8
 
3.5%
Space Separator
ValueCountFrequency (%)
159
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 539
56.1%
Common 422
43.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
11.3%
38
 
7.1%
36
 
6.7%
33
 
6.1%
32
 
5.9%
21
 
3.9%
19
 
3.5%
16
 
3.0%
16
 
3.0%
15
 
2.8%
Other values (84) 252
46.8%
Common
ValueCountFrequency (%)
159
37.7%
1 55
 
13.0%
3 35
 
8.3%
- 34
 
8.1%
2 33
 
7.8%
4 30
 
7.1%
5 23
 
5.5%
8 14
 
3.3%
0 12
 
2.8%
6 9
 
2.1%
Other values (3) 18
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 539
56.1%
ASCII 422
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
159
37.7%
1 55
 
13.0%
3 35
 
8.3%
- 34
 
8.1%
2 33
 
7.8%
4 30
 
7.1%
5 23
 
5.5%
8 14
 
3.3%
0 12
 
2.8%
6 9
 
2.1%
Other values (3) 18
 
4.3%
Hangul
ValueCountFrequency (%)
61
 
11.3%
38
 
7.1%
36
 
6.7%
33
 
6.1%
32
 
5.9%
21
 
3.9%
19
 
3.5%
16
 
3.0%
16
 
3.0%
15
 
2.8%
Other values (84) 252
46.8%
Distinct54
Distinct (%)71.1%
Missing0
Missing (%)0.0%
Memory size740.0 B
Minimum1962-12-03 00:00:00
Maximum2021-07-28 00:00:00
2023-12-12T21:36:22.036219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:36:22.197346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
2021-08-01
76 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-08-01
2nd row2021-08-01
3rd row2021-08-01
4th row2021-08-01
5th row2021-08-01

Common Values

ValueCountFrequency (%)
2021-08-01 76
100.0%

Length

2023-12-12T21:36:22.353518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:36:22.472762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-08-01 76
100.0%

Correlations

2023-12-12T21:36:22.552730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정별문 화 재 명위 치지 정 일
지정별1.0001.0001.0001.000
문 화 재 명1.0001.0001.0001.000
위 치1.0001.0001.0000.993
지 정 일1.0001.0000.9931.000

Missing values

2023-12-12T21:36:19.362276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:36:19.470367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지정별문 화 재 명위 치지 정 일데이터기준일
0보물 97호槐山 院豊里 磨崖二佛 坐像연풍면 원풍리 124-21963-01-212021-08-01
1보물 433호槐山 覺淵寺 石造毘盧遮那佛坐像칠성면 각연길 4511966-02-282021-08-01
2보물 566호柳根肖像-71세상소수면 몽촌리 114-11972-07-062021-08-01
3보물1295호槐山 覺淵寺 通一大師塔碑칠성면 태성리 산7-11999-06-232021-08-01
4보물1299호槐山 寶安寺 三層石塔청안면 효근리 385-22000-08-042021-08-01
5보물1370호槐山 覺淵寺 通一大師塔칠성면 태성리 산7-12003-03-142021-08-01
6보물1380호辛景行 難功臣敎書및關聯文籍청주 상당 명암로 393 국립청주박물관2003-08-212021-08-01
7사적 401호槐山 彌勒山城청천면 고성리 산8-371997-12-162021-08-01
8사적 417호槐山 尤庵 宋時烈 關聯遺蹟청천면 화양동길 188 외1999-12-292021-08-01
9천기 147호괴산 송덕리 미선나무 자생지장연면 송덕리 산58-131962-12-032021-08-01
지정별문 화 재 명위 치지 정 일데이터기준일
66문자 22호槐山 寶安寺 石造藥師如來坐像청안면 효근1길 3-81998-01-092021-08-01
67문자 27호槐山 芝莊里 石造如來坐像불정면 지장리 산41-21998-11-202021-08-01
68문자 30호槐山 彩雲庵 大雄殿청천면 화양동2길 302000-05-042021-08-01
69문자 35호槐山 空林寺 僧塔청천면 사담리 산2-12002-01-112021-08-01
70문자 57호槐山 平康全氏 孝烈閣소수면 정용월곡길 22007-03-092021-08-01
71문자 61호槐山 醉墨堂괴산읍 충민사길 452007-09-072021-08-01
72문자 62호槐山 金時讓 神道碑괴산읍 능촌리 산242008-01-042021-08-01
73문자 73호槐山 張潭 忠臣閣청안면 조천리 산3-12010-03-122021-08-01
74등록 144호槐山郡守 官舍 (3동 173㎡)괴산읍 읍내로13길 122004-12-312021-08-01
75등록 354호괴산중학교구본관(1동,725.06㎡)괴산읍 괴산로 35272007-09-212021-08-01