Overview

Dataset statistics

Number of variables5
Number of observations281
Missing cells30
Missing cells (%)2.1%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory11.1 KiB
Average record size in memory40.5 B

Variable types

Unsupported1
Text3
Categorical1

Dataset

Description2001년 지역별 공연 및 행사 등 무용 공연예술 관련 정보(공연명, 일시, 부문 등 포함)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076441/fileData.do

Alerts

Dataset has 1 (0.4%) duplicate rowsDuplicates
비고 has 30 (10.7%) missing valuesMissing
일 시 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 14:03:19.020027
Analysis finished2023-12-12 14:03:19.786056
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일 시
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.3 KiB
Distinct277
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T23:03:20.054654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length12.676157
Min length5

Characters and Unicode

Total characters3562
Distinct characters382
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique273 ?
Unique (%)97.2%

Sample

1st row국립국악원 무용단 교사연 수공연
2nd row일본무용공연
3rd row국립국악원무용단 〈청소년 국악문화강좌 공연〉
4th row바리시 니코프 화이트오크무용단
5th row김영희 무트댄스
ValueCountFrequency (%)
49
 
6.3%
공연 45
 
5.8%
2001 24
 
3.1%
정기공연 21
 
2.7%
18
 
2.3%
국립발레단 8
 
1.0%
있는 7
 
0.9%
국립국악원무용단 6
 
0.8%
제4회 6
 
0.8%
기념공연 6
 
0.8%
Other values (488) 589
75.6%
2023-12-12T23:03:20.513557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
498
 
14.0%
125
 
3.5%
115
 
3.2%
112
 
3.1%
102
 
2.9%
95
 
2.7%
95
 
2.7%
0 82
 
2.3%
71
 
2.0%
58
 
1.6%
Other values (372) 2209
62.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2612
73.3%
Space Separator 498
 
14.0%
Decimal Number 217
 
6.1%
Uppercase Letter 71
 
2.0%
Lowercase Letter 64
 
1.8%
Open Punctuation 29
 
0.8%
Close Punctuation 27
 
0.8%
Dash Punctuation 24
 
0.7%
Other Punctuation 18
 
0.5%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
125
 
4.8%
115
 
4.4%
112
 
4.3%
102
 
3.9%
95
 
3.6%
95
 
3.6%
71
 
2.7%
58
 
2.2%
57
 
2.2%
53
 
2.0%
Other values (309) 1729
66.2%
Uppercase Letter
ValueCountFrequency (%)
A 8
 
11.3%
C 8
 
11.3%
D 7
 
9.9%
O 5
 
7.0%
R 5
 
7.0%
M 4
 
5.6%
E 4
 
5.6%
T 3
 
4.2%
S 3
 
4.2%
N 3
 
4.2%
Other values (13) 21
29.6%
Lowercase Letter
ValueCountFrequency (%)
e 9
14.1%
a 7
10.9%
o 6
9.4%
i 6
9.4%
c 5
 
7.8%
n 5
 
7.8%
t 4
 
6.2%
v 3
 
4.7%
r 3
 
4.7%
s 3
 
4.7%
Other values (8) 13
20.3%
Decimal Number
ValueCountFrequency (%)
0 82
37.8%
1 53
24.4%
2 52
24.0%
4 9
 
4.1%
3 7
 
3.2%
5 6
 
2.8%
8 4
 
1.8%
6 2
 
0.9%
7 2
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 12
66.7%
2
 
11.1%
. 2
 
11.1%
& 2
 
11.1%
Open Punctuation
ValueCountFrequency (%)
27
93.1%
1
 
3.4%
( 1
 
3.4%
Close Punctuation
ValueCountFrequency (%)
25
92.6%
1
 
3.7%
) 1
 
3.7%
Space Separator
ValueCountFrequency (%)
498
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Math Symbol
ValueCountFrequency (%)
> 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2612
73.3%
Common 815
 
22.9%
Latin 135
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
125
 
4.8%
115
 
4.4%
112
 
4.3%
102
 
3.9%
95
 
3.6%
95
 
3.6%
71
 
2.7%
58
 
2.2%
57
 
2.2%
53
 
2.0%
Other values (309) 1729
66.2%
Latin
ValueCountFrequency (%)
e 9
 
6.7%
A 8
 
5.9%
C 8
 
5.9%
D 7
 
5.2%
a 7
 
5.2%
o 6
 
4.4%
i 6
 
4.4%
c 5
 
3.7%
n 5
 
3.7%
O 5
 
3.7%
Other values (31) 69
51.1%
Common
ValueCountFrequency (%)
498
61.1%
0 82
 
10.1%
1 53
 
6.5%
2 52
 
6.4%
27
 
3.3%
25
 
3.1%
- 24
 
2.9%
, 12
 
1.5%
4 9
 
1.1%
3 7
 
0.9%
Other values (12) 26
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2612
73.3%
ASCII 894
 
25.1%
None 54
 
1.5%
Punctuation 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
498
55.7%
0 82
 
9.2%
1 53
 
5.9%
2 52
 
5.8%
- 24
 
2.7%
, 12
 
1.3%
e 9
 
1.0%
4 9
 
1.0%
A 8
 
0.9%
C 8
 
0.9%
Other values (48) 139
 
15.5%
Hangul
ValueCountFrequency (%)
125
 
4.8%
115
 
4.4%
112
 
4.3%
102
 
3.9%
95
 
3.6%
95
 
3.6%
71
 
2.7%
58
 
2.2%
57
 
2.2%
53
 
2.0%
Other values (309) 1729
66.2%
None
ValueCountFrequency (%)
27
50.0%
25
46.3%
1
 
1.9%
1
 
1.9%
Punctuation
ValueCountFrequency (%)
2
100.0%

장소
Text

Distinct76
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T23:03:20.713532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length26
Mean length9.0391459
Min length3

Characters and Unicode

Total characters2540
Distinct characters151
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)17.8%

Sample

1st row국립국악원 우면당
2nd row일본공보원
3rd row국립국악원 우면당
4th rowLG 아트센터
5th row문예회관 소극장
ValueCountFrequency (%)
국립국악원 56
 
10.6%
국립극장 49
 
9.3%
문예회관 39
 
7.4%
대극장 39
 
7.4%
예술의전당 34
 
6.5%
우면당 34
 
6.5%
달오름극장 27
 
5.1%
예악당 22
 
4.2%
토월극장 20
 
3.8%
세종문화회관 16
 
3.0%
Other values (83) 190
36.1%
2023-12-12T23:03:21.016280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
245
 
9.6%
202
 
8.0%
198
 
7.8%
171
 
6.7%
112
 
4.4%
106
 
4.2%
96
 
3.8%
81
 
3.2%
78
 
3.1%
76
 
3.0%
Other values (141) 1175
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2258
88.9%
Space Separator 245
 
9.6%
Uppercase Letter 16
 
0.6%
Other Punctuation 12
 
0.5%
Decimal Number 5
 
0.2%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
202
 
8.9%
198
 
8.8%
171
 
7.6%
112
 
5.0%
106
 
4.7%
96
 
4.3%
81
 
3.6%
78
 
3.5%
76
 
3.4%
66
 
2.9%
Other values (131) 1072
47.5%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
0 1
 
20.0%
2 1
 
20.0%
Other Punctuation
ValueCountFrequency (%)
, 11
91.7%
. 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
G 8
50.0%
L 8
50.0%
Space Separator
ValueCountFrequency (%)
245
100.0%
Close Punctuation
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2258
88.9%
Common 266
 
10.5%
Latin 16
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
202
 
8.9%
198
 
8.8%
171
 
7.6%
112
 
5.0%
106
 
4.7%
96
 
4.3%
81
 
3.6%
78
 
3.5%
76
 
3.4%
66
 
2.9%
Other values (131) 1072
47.5%
Common
ValueCountFrequency (%)
245
92.1%
, 11
 
4.1%
1 3
 
1.1%
2
 
0.8%
2
 
0.8%
. 1
 
0.4%
0 1
 
0.4%
2 1
 
0.4%
Latin
ValueCountFrequency (%)
G 8
50.0%
L 8
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2258
88.9%
ASCII 278
 
10.9%
None 4
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
245
88.1%
, 11
 
4.0%
G 8
 
2.9%
L 8
 
2.9%
1 3
 
1.1%
. 1
 
0.4%
0 1
 
0.4%
2 1
 
0.4%
Hangul
ValueCountFrequency (%)
202
 
8.9%
198
 
8.8%
171
 
7.6%
112
 
5.0%
106
 
4.7%
96
 
4.3%
81
 
3.6%
78
 
3.5%
76
 
3.4%
66
 
2.9%
Other values (131) 1072
47.5%
None
ValueCountFrequency (%)
2
50.0%
2
50.0%

비고
Text

MISSING 

Distinct251
Distinct (%)100.0%
Missing30
Missing (%)10.7%
Memory size2.3 KiB
2023-12-12T23:03:21.287954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length576
Median length115
Mean length31.40239
Min length4

Characters and Unicode

Total characters7882
Distinct characters599
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique251 ?
Unique (%)100.0%

Sample

1st row〈살풀이〉,〈춘앵전〉등
2nd row〈부채춤〉등
3rd row〈콘체르토〉등
4th row안수연〈눈물〉, 김숙〈Sometime〉, 안소연〈눈을 뜨고 꿈을 꿀 수 있을까〉, 안정연〈표적〉
5th row주최 - (사)한국현대무용진흥회 경연참가자 공연 17일 : 최경실〈미소〉, 홍은주〈난지도블루스〉, 장종윤〈머리와 밧줄〉 18일 : 전호진〈오페라와 춤〉, 이경은〈모모와 함께 - 동행〉, 류석훈〈항해 111>
ValueCountFrequency (%)
101
 
7.3%
주최 23
 
1.7%
안무 13
 
0.9%
위한 7
 
0.5%
2001 6
 
0.4%
무대예술지원작품 6
 
0.4%
선정작품 6
 
0.4%
공연 5
 
0.4%
5
 
0.4%
국립국악원 5
 
0.4%
Other values (1057) 1210
87.2%
2023-12-12T23:03:21.754419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1132
 
14.4%
393
 
5.0%
386
 
4.9%
, 360
 
4.6%
166
 
2.1%
- 135
 
1.7%
90
 
1.1%
1 78
 
1.0%
77
 
1.0%
2 73
 
0.9%
Other values (589) 4992
63.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4632
58.8%
Space Separator 1132
 
14.4%
Other Punctuation 446
 
5.7%
Open Punctuation 433
 
5.5%
Close Punctuation 428
 
5.4%
Decimal Number 263
 
3.3%
Lowercase Letter 191
 
2.4%
Uppercase Letter 186
 
2.4%
Dash Punctuation 135
 
1.7%
Math Symbol 30
 
0.4%
Other values (3) 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
166
 
3.6%
90
 
1.9%
77
 
1.7%
70
 
1.5%
68
 
1.5%
66
 
1.4%
66
 
1.4%
65
 
1.4%
64
 
1.4%
63
 
1.4%
Other values (510) 3837
82.8%
Uppercase Letter
ValueCountFrequency (%)
A 23
 
12.4%
O 16
 
8.6%
R 12
 
6.5%
L 12
 
6.5%
N 11
 
5.9%
I 11
 
5.9%
T 10
 
5.4%
S 10
 
5.4%
D 9
 
4.8%
P 8
 
4.3%
Other values (14) 64
34.4%
Lowercase Letter
ValueCountFrequency (%)
e 35
18.3%
o 20
10.5%
r 19
9.9%
a 13
 
6.8%
i 12
 
6.3%
n 12
 
6.3%
t 11
 
5.8%
m 10
 
5.2%
d 7
 
3.7%
l 7
 
3.7%
Other values (11) 45
23.6%
Other Punctuation
ValueCountFrequency (%)
, 360
80.7%
: 49
 
11.0%
. 10
 
2.2%
8
 
1.8%
8
 
1.8%
4
 
0.9%
? 3
 
0.7%
@ 2
 
0.4%
! 1
 
0.2%
& 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 78
29.7%
2 73
27.8%
0 45
17.1%
3 13
 
4.9%
7 10
 
3.8%
5 10
 
3.8%
9 10
 
3.8%
6 9
 
3.4%
4 8
 
3.0%
8 7
 
2.7%
Close Punctuation
ValueCountFrequency (%)
386
90.2%
35
 
8.2%
) 6
 
1.4%
} 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
393
90.8%
35
 
8.1%
( 5
 
1.2%
Math Symbol
ValueCountFrequency (%)
> 18
60.0%
< 12
40.0%
Space Separator
ValueCountFrequency (%)
1132
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 135
100.0%
Control
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4629
58.7%
Common 2873
36.5%
Latin 377
 
4.8%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
166
 
3.6%
90
 
1.9%
77
 
1.7%
70
 
1.5%
68
 
1.5%
66
 
1.4%
66
 
1.4%
65
 
1.4%
64
 
1.4%
63
 
1.4%
Other values (507) 3834
82.8%
Latin
ValueCountFrequency (%)
e 35
 
9.3%
A 23
 
6.1%
o 20
 
5.3%
r 19
 
5.0%
O 16
 
4.2%
a 13
 
3.4%
i 12
 
3.2%
R 12
 
3.2%
n 12
 
3.2%
L 12
 
3.2%
Other values (35) 203
53.8%
Common
ValueCountFrequency (%)
1132
39.4%
393
 
13.7%
386
 
13.4%
, 360
 
12.5%
- 135
 
4.7%
1 78
 
2.7%
2 73
 
2.5%
: 49
 
1.7%
0 45
 
1.6%
35
 
1.2%
Other values (24) 187
 
6.5%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4629
58.7%
ASCII 2380
30.2%
None 853
 
10.8%
Punctuation 17
 
0.2%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1132
47.6%
, 360
 
15.1%
- 135
 
5.7%
1 78
 
3.3%
2 73
 
3.1%
: 49
 
2.1%
0 45
 
1.9%
e 35
 
1.5%
A 23
 
1.0%
o 20
 
0.8%
Other values (61) 430
 
18.1%
None
ValueCountFrequency (%)
393
46.1%
386
45.3%
35
 
4.1%
35
 
4.1%
4
 
0.5%
Hangul
ValueCountFrequency (%)
166
 
3.6%
90
 
1.9%
77
 
1.7%
70
 
1.5%
68
 
1.5%
66
 
1.4%
66
 
1.4%
65
 
1.4%
64
 
1.4%
63
 
1.4%
Other values (507) 3834
82.8%
Punctuation
ValueCountFrequency (%)
8
47.1%
8
47.1%
1
 
5.9%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

부문
Categorical

Distinct10
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
한국무용
82 
현대무용
55 
발레
50 
전통무용
40 
종합
38 
Other values (5)
16 

Length

Max length5
Median length4
Mean length3.3772242
Min length2

Unique

Unique2 ?
Unique (%)0.7%

Sample

1st row전통무용
2nd row외국무용
3rd row전통무용
4th row현대무용
5th row한국무용

Common Values

ValueCountFrequency (%)
한국무용 82
29.2%
현대무용 55
19.6%
발레 50
17.8%
전통무용 40
14.2%
종합 38
13.5%
외국무용 7
 
2.5%
재즈댄스 4
 
1.4%
퍼포먼스 3
 
1.1%
댄스뮤지컬 1
 
0.4%
<NA> 1
 
0.4%

Length

2023-12-12T23:03:21.898484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:03:22.002971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국무용 82
29.2%
현대무용 55
19.6%
발레 50
17.8%
전통무용 40
14.2%
종합 38
13.5%
외국무용 7
 
2.5%
재즈댄스 4
 
1.4%
퍼포먼스 3
 
1.1%
댄스뮤지컬 1
 
0.4%
na 1
 
0.4%

Correlations

2023-12-12T23:03:22.098441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장소부문
장소1.0000.855
부문0.8551.000

Missing values

2023-12-12T23:03:19.638037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:03:19.744901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일 시공연명장소비고부문
01.4국립국악원 무용단 교사연 수공연국립국악원 우면당〈살풀이〉,〈춘앵전〉등전통무용
11.16일본무용공연일본공보원<NA>외국무용
21.2국립국악원무용단 〈청소년 국악문화강좌 공연〉국립국악원 우면당〈부채춤〉등전통무용
32.9 -11바리시 니코프 화이트오크무용단LG 아트센터〈콘체르토〉등현대무용
42.16김영희 무트댄스문예회관 소극장안수연〈눈물〉, 김숙〈Sometime〉, 안소연〈눈을 뜨고 꿈을 꿀 수 있을까〉, 안정연〈표적〉한국무용
52.17 -18제4회 한국 안무가페스티벌문예회관 대극장주최 - (사)한국현대무용진흥회 경연참가자 공연 17일 : 최경실〈미소〉, 홍은주〈난지도블루스〉, 장종윤〈머리와 밧줄〉 18일 : 전호진〈오페라와 춤〉, 이경은〈모모와 함께 - 동행〉, 류석훈〈항해 111>종합
62.20 - 21제4회 한국 안무가페스티벌문예회관 대극장20일 : 이윤경〈기우는 달〉, 일본 바네토무용단〈시간짜기 스웨터〉, 21일 : 박은화 (Turing 111>, 윤성주〈월〉, 조윤라〈지나간 기억의 그림자〉, 손관중〈적 (기다림)〉, 윤미라〈꽃등 11>종합
72.22무사 창단공연섬유센터홀〈약속〉.〈천공〉등한국무용
82.22 - 27김나리, 최정민, 류재미 공연덕원갤러리〈나비처럼 날아서 벌처럼 쏘다〉퍼포먼스
93.2국립국악원무용단 춤공연 〈쌍가인전목단〉국립국악원 예악당국악FM방송 개국축하공연전통무용
일 시공연명장소비고부문
27112.17 -18이매방 춤공연국립극장 해오름극장〈승무〉등전통무용
27212.18 - 25국립발레단 송년공연예술의전당 오페라극장〈호두까기 인형〉. 안무-유리 그리가로비치발레
27312.18염현주의 우리춤국립국악원 우면당〈태평무〉,〈교방무고〉등전통무용
27412.18한일고전예능제2001국립국악원 예악당〈무산향〉,〈나가나타야시마〉등종합
27512.192001유정숙의 춤국립국악원 우면당<NA>한국무용
27612.21 - 22최현 춤전2001호암아트홀〈비파연〉, 20()1문화관광부 무대예술지원작품한국무용
27712.21 - 26유니버설발레단세종문화회관 대극장〈호두까기인형〉, 안무-바실리 바이노넨발레
27812.26한명옥의 살풀이한전아츠풀센터2001복합적 전통무용 II한국무용
27912.29-30육완순의 춤문예회관 대극장〈학아 학아〉, 20이무대공연작품지원 선정작품현대무용
28012.29 - 302001이해준의 춤문예회관 소극장문예진흥원 신진무용가지 원 선정작품현대무용

Duplicate rows

Most frequently occurring

공연명장소비고부문# duplicates
0국립발레단-해설이 있는 발레국립극장 달오름극장<NA>발레2