Overview

Dataset statistics

Number of variables5
Number of observations3614
Missing cells2123
Missing cells (%)11.7%
Duplicate rows25
Duplicate rows (%)0.7%
Total size in memory141.3 KiB
Average record size in memory40.0 B

Variable types

Unsupported1
Text3
Categorical1

Dataset

Description2003년 지역별 전시회 등 시각예술 관련 정보(행사명, 장소, 일시, 부문 등 포함)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076499/fileData.do

Alerts

Dataset has 25 (0.7%) duplicate rowsDuplicates
비고 has 2123 (58.7%) missing valuesMissing
일시 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 23:43:16.398793
Analysis finished2023-12-12 23:43:17.461570
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일시
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size28.4 KiB
Distinct3455
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size28.4 KiB
2023-12-13T08:43:17.684969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length7.0199225
Min length2

Characters and Unicode

Total characters25370
Distinct characters713
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3312 ?
Unique (%)91.6%

Sample

1st row김 태은전
2nd row한봉림 도예전
3rd row오원희 서양화전
4th row윤귀원전
5th row오원희 전
ValueCountFrequency (%)
136
 
2.2%
사진전 103
 
1.6%
한국화전 95
 
1.5%
작품전 88
 
1.4%
도예전 69
 
1.1%
조각전 53
 
0.8%
도자전 51
 
0.8%
서양화전 29
 
0.5%
2인전 28
 
0.4%
2003 28
 
0.4%
Other values (4428) 5639
89.2%
2023-12-13T08:43:18.116878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3672
 
14.5%
2705
 
10.7%
514
 
2.0%
488
 
1.9%
437
 
1.7%
417
 
1.6%
369
 
1.5%
357
 
1.4%
309
 
1.2%
288
 
1.1%
Other values (703) 15814
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21554
85.0%
Space Separator 2705
 
10.7%
Decimal Number 672
 
2.6%
Other Punctuation 144
 
0.6%
Lowercase Letter 94
 
0.4%
Uppercase Letter 88
 
0.3%
Dash Punctuation 74
 
0.3%
Close Punctuation 19
 
0.1%
Open Punctuation 18
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3672
 
17.0%
514
 
2.4%
488
 
2.3%
437
 
2.0%
417
 
1.9%
369
 
1.7%
357
 
1.7%
309
 
1.4%
288
 
1.3%
287
 
1.3%
Other values (631) 14416
66.9%
Uppercase Letter
ValueCountFrequency (%)
A 10
 
11.4%
F 9
 
10.2%
S 7
 
8.0%
T 6
 
6.8%
M 6
 
6.8%
C 5
 
5.7%
I 5
 
5.7%
O 4
 
4.5%
K 4
 
4.5%
E 4
 
4.5%
Other values (14) 28
31.8%
Lowercase Letter
ValueCountFrequency (%)
i 11
11.7%
n 10
10.6%
e 9
9.6%
t 8
 
8.5%
a 7
 
7.4%
o 6
 
6.4%
l 6
 
6.4%
r 5
 
5.3%
c 5
 
5.3%
s 5
 
5.3%
Other values (11) 22
23.4%
Decimal Number
ValueCountFrequency (%)
2 170
25.3%
0 147
21.9%
3 108
16.1%
1 89
13.2%
4 32
 
4.8%
6 29
 
4.3%
5 25
 
3.7%
8 25
 
3.7%
9 24
 
3.6%
7 23
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 84
58.3%
, 29
 
20.1%
& 12
 
8.3%
7
 
4.9%
' 6
 
4.2%
! 2
 
1.4%
2
 
1.4%
; 1
 
0.7%
/ 1
 
0.7%
Dash Punctuation
ValueCountFrequency (%)
- 73
98.6%
1
 
1.4%
Close Punctuation
ValueCountFrequency (%)
17
89.5%
) 2
 
10.5%
Open Punctuation
ValueCountFrequency (%)
16
88.9%
( 2
 
11.1%
Space Separator
ValueCountFrequency (%)
2705
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21534
84.9%
Common 3634
 
14.3%
Latin 182
 
0.7%
Han 20
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3672
 
17.1%
514
 
2.4%
488
 
2.3%
437
 
2.0%
417
 
1.9%
369
 
1.7%
357
 
1.7%
309
 
1.4%
288
 
1.3%
287
 
1.3%
Other values (613) 14396
66.9%
Latin
ValueCountFrequency (%)
i 11
 
6.0%
n 10
 
5.5%
A 10
 
5.5%
F 9
 
4.9%
e 9
 
4.9%
t 8
 
4.4%
a 7
 
3.8%
S 7
 
3.8%
T 6
 
3.3%
o 6
 
3.3%
Other values (35) 99
54.4%
Common
ValueCountFrequency (%)
2705
74.4%
2 170
 
4.7%
0 147
 
4.0%
3 108
 
3.0%
1 89
 
2.4%
. 84
 
2.3%
- 73
 
2.0%
4 32
 
0.9%
, 29
 
0.8%
6 29
 
0.8%
Other values (17) 168
 
4.6%
Han
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (8) 8
40.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21534
84.9%
ASCII 3773
 
14.9%
None 33
 
0.1%
CJK 20
 
0.1%
Punctuation 10
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3672
 
17.1%
514
 
2.4%
488
 
2.3%
437
 
2.0%
417
 
1.9%
369
 
1.7%
357
 
1.7%
309
 
1.4%
288
 
1.3%
287
 
1.3%
Other values (613) 14396
66.9%
ASCII
ValueCountFrequency (%)
2705
71.7%
2 170
 
4.5%
0 147
 
3.9%
3 108
 
2.9%
1 89
 
2.4%
. 84
 
2.2%
- 73
 
1.9%
4 32
 
0.8%
, 29
 
0.8%
6 29
 
0.8%
Other values (57) 307
 
8.1%
None
ValueCountFrequency (%)
17
51.5%
16
48.5%
Punctuation
ValueCountFrequency (%)
7
70.0%
2
 
20.0%
1
 
10.0%
CJK
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (8) 8
40.0%

장소
Text

Distinct358
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size28.4 KiB
2023-12-13T08:43:18.386139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length27
Mean length6.060321
Min length3

Characters and Unicode

Total characters21902
Distinct characters320
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)4.2%

Sample

1st row갤러리 창
2nd row통인화랑
3rd row경인미술관
4th row경인미술관
5th row경인미술관
ValueCountFrequency (%)
갤러리 598
 
12.4%
인사갤러리 142
 
3.0%
인사아트센터 141
 
2.9%
스페이스 116
 
2.4%
미술관 116
 
2.4%
관훈갤러리 115
 
2.4%
경인미술관 110
 
2.3%
라메르갤러리 92
 
1.9%
가나아트 87
 
1.8%
세종문화회관 86
 
1.8%
Other values (430) 3209
66.7%
2023-12-13T08:43:18.755595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1879
 
8.6%
1839
 
8.4%
1839
 
8.4%
1199
 
5.5%
765
 
3.5%
729
 
3.3%
699
 
3.2%
698
 
3.2%
562
 
2.6%
519
 
2.4%
Other values (310) 11174
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20574
93.9%
Space Separator 1199
 
5.5%
Uppercase Letter 83
 
0.4%
Other Punctuation 19
 
0.1%
Decimal Number 17
 
0.1%
Lowercase Letter 6
 
< 0.1%
Final Punctuation 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1879
 
9.1%
1839
 
8.9%
1839
 
8.9%
765
 
3.7%
729
 
3.5%
699
 
3.4%
698
 
3.4%
562
 
2.7%
519
 
2.5%
502
 
2.4%
Other values (280) 10543
51.2%
Uppercase Letter
ValueCountFrequency (%)
L 22
26.5%
F 21
25.3%
G 21
25.3%
S 5
 
6.0%
P 4
 
4.8%
T 4
 
4.8%
A 3
 
3.6%
K 2
 
2.4%
D 1
 
1.2%
Decimal Number
ValueCountFrequency (%)
5 6
35.3%
1 3
17.6%
2 2
 
11.8%
0 1
 
5.9%
7 1
 
5.9%
9 1
 
5.9%
8 1
 
5.9%
6 1
 
5.9%
3 1
 
5.9%
Other Punctuation
ValueCountFrequency (%)
' 5
26.3%
. 5
26.3%
/ 4
21.1%
, 2
 
10.5%
2
 
10.5%
& 1
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
m 2
33.3%
k 2
33.3%
p 2
33.3%
Space Separator
ValueCountFrequency (%)
1199
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20574
93.9%
Common 1239
 
5.7%
Latin 89
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1879
 
9.1%
1839
 
8.9%
1839
 
8.9%
765
 
3.7%
729
 
3.5%
699
 
3.4%
698
 
3.4%
562
 
2.7%
519
 
2.5%
502
 
2.4%
Other values (280) 10543
51.2%
Common
ValueCountFrequency (%)
1199
96.8%
5 6
 
0.5%
' 5
 
0.4%
. 5
 
0.4%
/ 4
 
0.3%
1 3
 
0.2%
3
 
0.2%
, 2
 
0.2%
2 2
 
0.2%
2
 
0.2%
Other values (8) 8
 
0.6%
Latin
ValueCountFrequency (%)
L 22
24.7%
F 21
23.6%
G 21
23.6%
S 5
 
5.6%
P 4
 
4.5%
T 4
 
4.5%
A 3
 
3.4%
m 2
 
2.2%
k 2
 
2.2%
p 2
 
2.2%
Other values (2) 3
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20574
93.9%
ASCII 1323
 
6.0%
Punctuation 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1879
 
9.1%
1839
 
8.9%
1839
 
8.9%
765
 
3.7%
729
 
3.5%
699
 
3.4%
698
 
3.4%
562
 
2.7%
519
 
2.5%
502
 
2.4%
Other values (280) 10543
51.2%
ASCII
ValueCountFrequency (%)
1199
90.6%
L 22
 
1.7%
F 21
 
1.6%
G 21
 
1.6%
5 6
 
0.5%
' 5
 
0.4%
S 5
 
0.4%
. 5
 
0.4%
P 4
 
0.3%
/ 4
 
0.3%
Other values (18) 31
 
2.3%
Punctuation
ValueCountFrequency (%)
3
60.0%
2
40.0%

비고
Text

MISSING 

Distinct1418
Distinct (%)95.1%
Missing2123
Missing (%)58.7%
Memory size28.4 KiB
2023-12-13T08:43:18.999085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length97
Median length65
Mean length24.112005
Min length3

Characters and Unicode

Total characters35951
Distinct characters759
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1382 ?
Unique (%)92.7%

Sample

1st row공간을 이미지로 활용한 설치작품
2nd row전통 민화를 소재로 한 그림
3rd row철을 이용한 평면작품
4th row강현식, 김승학, 김영순, 김인선, 김학곤, 박충호, 손기종, 손영락, 최영식 등 9명 참여
5th row김영수, 강태화, 신인덕, 유재옥, 이은경, 김순진, 이은숙, 김혜선 등 8명 참여
ValueCountFrequency (%)
347
 
3.9%
참여 218
 
2.4%
전시 203
 
2.3%
그림 119
 
1.3%
82
 
0.9%
설치작품 80
 
0.9%
이용한 80
 
0.9%
작품 73
 
0.8%
작품전 64
 
0.7%
한지에 56
 
0.6%
Other values (4569) 7605
85.2%
2023-12-13T08:43:19.379908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7437
 
20.7%
, 1845
 
5.1%
926
 
2.6%
651
 
1.8%
592
 
1.6%
569
 
1.6%
560
 
1.6%
550
 
1.5%
434
 
1.2%
400
 
1.1%
Other values (749) 21987
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25453
70.8%
Space Separator 7437
 
20.7%
Other Punctuation 1956
 
5.4%
Decimal Number 1046
 
2.9%
Uppercase Letter 36
 
0.1%
Close Punctuation 6
 
< 0.1%
Open Punctuation 6
 
< 0.1%
Math Symbol 4
 
< 0.1%
Initial Punctuation 3
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
926
 
3.6%
651
 
2.6%
592
 
2.3%
569
 
2.2%
560
 
2.2%
550
 
2.2%
434
 
1.7%
400
 
1.6%
369
 
1.4%
357
 
1.4%
Other values (709) 20045
78.8%
Uppercase Letter
ValueCountFrequency (%)
F 4
11.1%
D 4
11.1%
C 4
11.1%
M 3
8.3%
P 3
8.3%
R 3
8.3%
A 3
8.3%
J 2
 
5.6%
B 2
 
5.6%
V 2
 
5.6%
Other values (6) 6
16.7%
Decimal Number
ValueCountFrequency (%)
1 217
20.7%
0 189
18.1%
2 132
12.6%
3 128
12.2%
5 85
 
8.1%
4 79
 
7.6%
6 62
 
5.9%
8 58
 
5.5%
7 53
 
5.1%
9 43
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 1845
94.3%
. 65
 
3.3%
' 37
 
1.9%
5
 
0.3%
& 2
 
0.1%
: 1
 
0.1%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
7437
100.0%
Close Punctuation
ValueCountFrequency (%)
6
100.0%
Open Punctuation
ValueCountFrequency (%)
6
100.0%
Math Symbol
ValueCountFrequency (%)
= 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25449
70.8%
Common 10462
29.1%
Latin 36
 
0.1%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
926
 
3.6%
651
 
2.6%
592
 
2.3%
569
 
2.2%
560
 
2.2%
550
 
2.2%
434
 
1.7%
400
 
1.6%
369
 
1.4%
357
 
1.4%
Other values (705) 20041
78.7%
Common
ValueCountFrequency (%)
7437
71.1%
, 1845
 
17.6%
1 217
 
2.1%
0 189
 
1.8%
2 132
 
1.3%
3 128
 
1.2%
5 85
 
0.8%
4 79
 
0.8%
. 65
 
0.6%
6 62
 
0.6%
Other values (14) 223
 
2.1%
Latin
ValueCountFrequency (%)
F 4
11.1%
D 4
11.1%
C 4
11.1%
M 3
8.3%
P 3
8.3%
R 3
8.3%
A 3
8.3%
J 2
 
5.6%
B 2
 
5.6%
V 2
 
5.6%
Other values (6) 6
16.7%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25449
70.8%
ASCII 10475
29.1%
None 17
 
< 0.1%
Punctuation 6
 
< 0.1%
CJK 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7437
71.0%
, 1845
 
17.6%
1 217
 
2.1%
0 189
 
1.8%
2 132
 
1.3%
3 128
 
1.2%
5 85
 
0.8%
4 79
 
0.8%
. 65
 
0.6%
6 62
 
0.6%
Other values (24) 236
 
2.3%
Hangul
ValueCountFrequency (%)
926
 
3.6%
651
 
2.6%
592
 
2.3%
569
 
2.2%
560
 
2.2%
550
 
2.2%
434
 
1.7%
400
 
1.6%
369
 
1.4%
357
 
1.4%
Other values (705) 20041
78.7%
None
ValueCountFrequency (%)
6
35.3%
6
35.3%
5
29.4%
Punctuation
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

부문
Categorical

Distinct15
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size28.4 KiB
회화
1664 
종합
539 
디자인
315 
공예
230 
조소
214 
Other values (10)
652 

Length

Max length6
Median length2
Mean length2.144715
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row신매체
2nd row공예
3rd row회화
4th row회화
5th row회화

Common Values

ValueCountFrequency (%)
회화 1664
46.0%
종합 539
 
14.9%
디자인 315
 
8.7%
공예 230
 
6.4%
조소 214
 
5.9%
사진 207
 
5.7%
신매체 196
 
5.4%
서예 123
 
3.4%
판화 98
 
2.7%
건축 21
 
0.6%
Other values (5) 7
 
0.2%

Length

2023-12-13T08:43:19.506432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
회화 1668
46.1%
종합 539
 
14.9%
디자인 317
 
8.8%
공예 231
 
6.4%
조소 214
 
5.9%
사진 207
 
5.7%
신매체 196
 
5.4%
서예 123
 
3.4%
판화 98
 
2.7%
건축 21
 
0.6%
Other values (2) 3
 
0.1%

Missing values

2023-12-13T08:43:17.349427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:43:17.426021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일시행사명장소비고부문
01.1-7김 태은전갤러리 창공간을 이미지로 활용한 설치작품신매체
11.1-7한봉림 도예전통인화랑<NA>공예
21.2-7오원희 서양화전경인미술관<NA>회화
31.2-7윤귀원전경인미술관전통 민화를 소재로 한 그림회화
41.2-7오원희 전경인미술관<NA>회화
51.3-9박정우 염색전라메르갤러리<NA>디자인
61.3-14박기웅전인사아트프라자철을 이용한 평면작품종합
71.3-14김영식전인사아트프라자<NA>회화
81.3-14한국의 산하전라메르갤러리강현식, 김승학, 김영순, 김인선, 김학곤, 박충호, 손기종, 손영락, 최영식 등 9명 참여회화
91.3-19김을전도올갤러리<NA>회화
일시행사명장소비고부문
360412.24-2004.2.15빛과 색채의 탐험전예술의 전당 미술관강형구, 김영진, 박광성, 송성진, 주성혜, 김강용, 홍지윤, 서용선 등 총 47명 참여종합
360512.25-2004.2.1아트 북 아트 2003전국립현대미술관한국, 미국, 영국, 프랑스, 스페인, 독일 등 20여개국의 500여종의 아트북 및 북 아트와 미술작품전종합
360612.26-31 12.27-2004.1.4정상경 인형전 이인애전롯데월드화랑 갤러리 조<NA>디자인 회화
360712.27-2004.1.6국제일러스트 협회전한전프라자갤러리<NA>디자인
360812.27-204.2.29강세황전예술의 전당 서예관서울서예박물관이 선정한 한국서예사 특별전의 작가전서예
360912.29-20041.25김상길 사진전정미소갤러리<NA>사진
361012.31-2004.1.5정인경전한서 갤러리이라크 전쟁을 풍자 만화로 표현한 작품회화
361112.31-2004.1.6협업전갤러리 가이아프로젝트 그룹 이동시점의 전시로서 타분야의 전문가들과 공동작업을 시도한 전시종합
361212.31-2004.1.13김예진 한복전라메르갤러리<NA>디자인
361312.31-2004.1.13하이브리드 이론전공화랑아토마우스로 대변되는 이동기의 작품과 컴퓨터 프린트 기법의 김태중의 작품이 어우러진 전시종합

Duplicate rows

Most frequently occurring

행사명장소비고부문# duplicates
0권영석전동덕아트갤러리<NA>회화2
1김미경전인사갤러리<NA>회화2
2김미자전롯데월드화랑<NA>회화2
3김순겸전가산화랑<NA>회화2
4김영순전갤러리 창<NA>회화2
5김재권전조선화랑<NA>회화2
6김태영전청화랑<NA>회화2
7김형권전우림화랑<NA>회화2
8문인표전우림화랑<NA>회화2
9박상미전삼정아트스페이스화선지에 수묵화회화2