Overview

Dataset statistics

Number of variables13
Number of observations4324
Missing cells8123
Missing cells (%)14.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory439.3 KiB
Average record size in memory104.0 B

Variable types

Text7
Categorical4
DateTime2

Dataset

Description국립극장에서 공연 되었거나 공연 할 모든 공연정보에 대한 데이터로 공연명, 시작일 및 종료일, 제작기관 등에 대한 데이터
URLhttps://www.data.go.kr/data/3034757/fileData.do

Alerts

전속단체 is highly overall correlated with 제작구분High correlation
제작구분 is highly overall correlated with 전속단체High correlation
전속단체 is highly imbalanced (69.4%)Imbalance
시간 has 519 (12.0%) missing valuesMissing
티켓가격 has 566 (13.1%) missing valuesMissing
관람연령 has 2382 (55.1%) missing valuesMissing
러닝타임 has 2115 (48.9%) missing valuesMissing
주최 has 2541 (58.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:51:01.915931
Analysis finished2023-12-12 19:51:04.707053
Duration2.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3942
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
2023-12-13T04:51:05.039235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length45
Mean length17.257401
Min length1

Characters and Unicode

Total characters74621
Distinct characters1208
Distinct categories15 ?
Distinct scripts5 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3745 ?
Unique (%)86.6%

Sample

1st row관현악시리즈Ⅳ <부재(不在)>
2nd row윤진철 X 김동언 <불문율>
3rd row2023 국립극장 <완창판소리> 6월
4th row국립무용단 <산조>
5th row우리 읍내
ValueCountFrequency (%)
국립극장 257
 
1.8%
242
 
1.7%
토요문화광장 193
 
1.3%
정오의 183
 
1.3%
음악회 168
 
1.1%
완창판소리 113
 
0.8%
111
 
0.8%
콘서트 93
 
0.6%
국립무용단 89
 
0.6%
있는 84
 
0.6%
Other values (6367) 13095
89.5%
2023-12-13T04:51:05.621806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10381
 
13.9%
. 3743
 
5.0%
< 1562
 
2.1%
> 1559
 
2.1%
0 1293
 
1.7%
2 1204
 
1.6%
1103
 
1.5%
1018
 
1.4%
962
 
1.3%
1 959
 
1.3%
Other values (1198) 50837
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45435
60.9%
Space Separator 10381
 
13.9%
Decimal Number 4637
 
6.2%
Other Punctuation 4573
 
6.1%
Math Symbol 3137
 
4.2%
Lowercase Letter 2995
 
4.0%
Uppercase Letter 1638
 
2.2%
Close Punctuation 520
 
0.7%
Open Punctuation 518
 
0.7%
Dash Punctuation 406
 
0.5%
Other values (5) 381
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1103
 
2.4%
1018
 
2.2%
962
 
2.1%
936
 
2.1%
801
 
1.8%
754
 
1.7%
742
 
1.6%
735
 
1.6%
690
 
1.5%
684
 
1.5%
Other values (1088) 37010
81.5%
Lowercase Letter
ValueCountFrequency (%)
e 431
14.4%
a 281
 
9.4%
i 260
 
8.7%
o 253
 
8.4%
n 242
 
8.1%
r 217
 
7.2%
t 215
 
7.2%
l 165
 
5.5%
s 130
 
4.3%
h 98
 
3.3%
Other values (16) 703
23.5%
Uppercase Letter
ValueCountFrequency (%)
S 137
 
8.4%
T 137
 
8.4%
A 131
 
8.0%
N 124
 
7.6%
O 98
 
6.0%
C 93
 
5.7%
I 88
 
5.4%
B 86
 
5.3%
E 86
 
5.3%
L 84
 
5.1%
Other values (16) 574
35.0%
Other Punctuation
ValueCountFrequency (%)
. 3743
81.8%
, 314
 
6.9%
" 124
 
2.7%
' 112
 
2.4%
! 70
 
1.5%
& 58
 
1.3%
: 57
 
1.2%
· 44
 
1.0%
/ 20
 
0.4%
? 13
 
0.3%
Other values (5) 18
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 1293
27.9%
2 1204
26.0%
1 959
20.7%
3 238
 
5.1%
5 230
 
5.0%
4 188
 
4.1%
6 174
 
3.8%
9 123
 
2.7%
7 120
 
2.6%
8 108
 
2.3%
Letter Number
ValueCountFrequency (%)
25
35.2%
15
21.1%
15
21.1%
10
 
14.1%
1
 
1.4%
1
 
1.4%
1
 
1.4%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Math Symbol
ValueCountFrequency (%)
< 1562
49.8%
> 1559
49.7%
+ 8
 
0.3%
~ 6
 
0.2%
× 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 418
80.7%
[ 88
 
17.0%
8
 
1.5%
2
 
0.4%
2
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 418
80.4%
] 88
 
16.9%
8
 
1.5%
4
 
0.8%
2
 
0.4%
Initial Punctuation
ValueCountFrequency (%)
59
60.2%
39
39.8%
Final Punctuation
ValueCountFrequency (%)
59
61.5%
37
38.5%
Space Separator
ValueCountFrequency (%)
10381
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 406
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 115
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45023
60.3%
Common 24482
32.8%
Latin 4704
 
6.3%
Han 411
 
0.6%
Hiragana 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1103
 
2.4%
1018
 
2.3%
962
 
2.1%
936
 
2.1%
801
 
1.8%
754
 
1.7%
742
 
1.6%
735
 
1.6%
690
 
1.5%
684
 
1.5%
Other values (854) 36598
81.3%
Han
ValueCountFrequency (%)
25
 
6.1%
16
 
3.9%
8
 
1.9%
8
 
1.9%
7
 
1.7%
6
 
1.5%
6
 
1.5%
5
 
1.2%
5
 
1.2%
5
 
1.2%
Other values (223) 320
77.9%
Latin
ValueCountFrequency (%)
e 431
 
9.2%
a 281
 
6.0%
i 260
 
5.5%
o 253
 
5.4%
n 242
 
5.1%
r 217
 
4.6%
t 215
 
4.6%
l 165
 
3.5%
S 137
 
2.9%
T 137
 
2.9%
Other values (52) 2366
50.3%
Common
ValueCountFrequency (%)
10381
42.4%
. 3743
 
15.3%
< 1562
 
6.4%
> 1559
 
6.4%
0 1293
 
5.3%
2 1204
 
4.9%
1 959
 
3.9%
( 418
 
1.7%
) 418
 
1.7%
- 406
 
1.7%
Other values (38) 2539
 
10.4%
Hiragana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45019
60.3%
ASCII 28844
38.7%
CJK 377
 
0.5%
Punctuation 198
 
0.3%
None 72
 
0.1%
Number Forms 71
 
0.1%
CJK Compat Ideographs 34
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Hiragana 1
 
< 0.1%
Misc Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10381
36.0%
. 3743
 
13.0%
< 1562
 
5.4%
> 1559
 
5.4%
0 1293
 
4.5%
2 1204
 
4.2%
1 959
 
3.3%
e 431
 
1.5%
( 418
 
1.4%
) 418
 
1.4%
Other values (76) 6876
23.8%
Hangul
ValueCountFrequency (%)
1103
 
2.5%
1018
 
2.3%
962
 
2.1%
936
 
2.1%
801
 
1.8%
754
 
1.7%
742
 
1.6%
735
 
1.6%
690
 
1.5%
684
 
1.5%
Other values (852) 36594
81.3%
Punctuation
ValueCountFrequency (%)
59
29.8%
59
29.8%
39
19.7%
37
18.7%
4
 
2.0%
None
ValueCountFrequency (%)
· 44
61.1%
8
 
11.1%
8
 
11.1%
4
 
5.6%
2
 
2.8%
2
 
2.8%
2
 
2.8%
× 2
 
2.8%
CJK Compat Ideographs
ValueCountFrequency (%)
25
73.5%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Number Forms
ValueCountFrequency (%)
25
35.2%
15
21.1%
15
21.1%
10
 
14.1%
1
 
1.4%
1
 
1.4%
1
 
1.4%
1
 
1.4%
1
 
1.4%
1
 
1.4%
CJK
ValueCountFrequency (%)
16
 
4.2%
8
 
2.1%
8
 
2.1%
7
 
1.9%
6
 
1.6%
6
 
1.6%
5
 
1.3%
5
 
1.3%
5
 
1.3%
5
 
1.3%
Other values (214) 306
81.2%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
2
50.0%
Hiragana
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Distinct298
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
2023-12-13T04:51:06.100752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length4
Mean length4.506013
Min length2

Characters and Unicode

Total characters19484
Distinct characters394
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique252 ?
Unique (%)5.8%

Sample

1st row국립극장
2nd row국립극장
3rd row국립극장
4th row국립극장
5th row국립극장
ValueCountFrequency (%)
국립극장 3961
87.7%
사단법인 34
 
0.8%
재)전통공연예술진흥재단 10
 
0.2%
8
 
0.2%
재)국립오페라단 6
 
0.1%
조직위원회 5
 
0.1%
순헌무용단 5
 
0.1%
재)국립합창단 5
 
0.1%
국립오페라단 5
 
0.1%
한국소극장오페라연합 5
 
0.1%
Other values (376) 475
 
10.5%
2023-12-13T04:51:06.735837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4058
20.8%
3991
20.5%
3986
20.5%
3981
20.4%
196
 
1.0%
150
 
0.8%
91
 
0.5%
85
 
0.4%
) 81
 
0.4%
( 77
 
0.4%
Other values (384) 2788
14.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18891
97.0%
Space Separator 196
 
1.0%
Close Punctuation 81
 
0.4%
Open Punctuation 77
 
0.4%
Uppercase Letter 73
 
0.4%
Lowercase Letter 70
 
0.4%
Other Punctuation 60
 
0.3%
Decimal Number 22
 
0.1%
Other Symbol 9
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4058
21.5%
3991
21.1%
3986
21.1%
3981
21.1%
150
 
0.8%
91
 
0.5%
85
 
0.4%
75
 
0.4%
71
 
0.4%
66
 
0.3%
Other values (328) 2337
12.4%
Uppercase Letter
ValueCountFrequency (%)
C 13
17.8%
M 9
12.3%
T 9
12.3%
A 6
8.2%
S 6
8.2%
B 4
 
5.5%
E 4
 
5.5%
D 4
 
5.5%
K 3
 
4.1%
U 3
 
4.1%
Other values (9) 12
16.4%
Lowercase Letter
ValueCountFrequency (%)
e 10
14.3%
o 8
11.4%
l 7
10.0%
r 6
8.6%
u 5
 
7.1%
t 5
 
7.1%
a 5
 
7.1%
w 4
 
5.7%
g 3
 
4.3%
n 3
 
4.3%
Other values (7) 14
20.0%
Decimal Number
ValueCountFrequency (%)
2 6
27.3%
1 6
27.3%
0 3
13.6%
3 2
 
9.1%
5 2
 
9.1%
7 1
 
4.5%
6 1
 
4.5%
9 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 30
50.0%
. 19
31.7%
/ 8
 
13.3%
: 2
 
3.3%
& 1
 
1.7%
Space Separator
ValueCountFrequency (%)
196
100.0%
Close Punctuation
ValueCountFrequency (%)
) 81
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18898
97.0%
Common 441
 
2.3%
Latin 143
 
0.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4058
21.5%
3991
21.1%
3986
21.1%
3981
21.1%
150
 
0.8%
91
 
0.5%
85
 
0.4%
75
 
0.4%
71
 
0.4%
66
 
0.3%
Other values (327) 2344
12.4%
Latin
ValueCountFrequency (%)
C 13
 
9.1%
e 10
 
7.0%
M 9
 
6.3%
T 9
 
6.3%
o 8
 
5.6%
l 7
 
4.9%
A 6
 
4.2%
S 6
 
4.2%
r 6
 
4.2%
u 5
 
3.5%
Other values (26) 64
44.8%
Common
ValueCountFrequency (%)
196
44.4%
) 81
18.4%
( 77
 
17.5%
, 30
 
6.8%
. 19
 
4.3%
/ 8
 
1.8%
2 6
 
1.4%
1 6
 
1.4%
- 3
 
0.7%
0 3
 
0.7%
Other values (9) 12
 
2.7%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18889
96.9%
ASCII 582
 
3.0%
None 9
 
< 0.1%
CJK 2
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4058
21.5%
3991
21.1%
3986
21.1%
3981
21.1%
150
 
0.8%
91
 
0.5%
85
 
0.4%
75
 
0.4%
71
 
0.4%
66
 
0.3%
Other values (326) 2335
12.4%
ASCII
ValueCountFrequency (%)
196
33.7%
) 81
13.9%
( 77
 
13.2%
, 30
 
5.2%
. 19
 
3.3%
C 13
 
2.2%
e 10
 
1.7%
M 9
 
1.5%
T 9
 
1.5%
/ 8
 
1.4%
Other values (43) 130
22.3%
None
ValueCountFrequency (%)
9
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

장르
Categorical

Distinct14
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
기타
1594 
국악
727 
무용
656 
연극
522 
콘서트
266 
Other values (9)
559 

Length

Max length4
Median length2
Mean length2.1345976
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국악
2nd row콘서트
3rd row국악
4th row무용
5th row연극

Common Values

ValueCountFrequency (%)
기타 1594
36.9%
국악 727
16.8%
무용 656
15.2%
연극 522
 
12.1%
콘서트 266
 
6.2%
뮤지컬 179
 
4.1%
창극 128
 
3.0%
오페라 78
 
1.8%
발레 72
 
1.7%
클래식 44
 
1.0%
Other values (4) 58
 
1.3%

Length

2023-12-13T04:51:07.302082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
기타 1594
36.9%
국악 727
16.8%
무용 656
15.2%
연극 522
 
12.1%
콘서트 266
 
6.2%
뮤지컬 179
 
4.1%
창극 128
 
3.0%
오페라 78
 
1.8%
발레 72
 
1.7%
클래식 44
 
1.0%
Other values (4) 58
 
1.3%

전속단체
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
<NA>
3510 
국립국악관현악단
 
250
국립창극단
 
194
국립무용단
 
150
국립극장
 
136
Other values (8)
 
84

Length

Max length17
Median length4
Mean length4.2682701
Min length1

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row국립국악관현악단
2nd row<NA>
3rd row국립창극단
4th row국립무용단
5th row국립극장

Common Values

ValueCountFrequency (%)
<NA> 3510
81.2%
국립국악관현악단 250
 
5.8%
국립창극단 194
 
4.5%
국립무용단 150
 
3.5%
국립극장 136
 
3.1%
75
 
1.7%
공연예술박물관 3
 
0.1%
들숨 1
 
< 0.1%
평양검무전승보존회 1
 
< 0.1%
중구 1
 
< 0.1%
Other values (3) 3
 
0.1%

Length

2023-12-13T04:51:07.494625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 3510
82.6%
국립국악관현악단 250
 
5.9%
국립창극단 194
 
4.6%
국립무용단 150
 
3.5%
국립극장 138
 
3.2%
공연예술박물관 3
 
0.1%
들숨 1
 
< 0.1%
평양검무전승보존회 1
 
< 0.1%
중구 1
 
< 0.1%
주홍콩한국문화원/국립국악관현악단 1
 
< 0.1%
Other values (2) 2
 
< 0.1%
Distinct3078
Distinct (%)71.2%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
Minimum2001-01-01 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T04:51:07.672486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:51:07.846907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2945
Distinct (%)68.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
Minimum2001-01-04 00:00:00
Maximum2023-06-30 00:00:00
2023-12-13T04:51:08.048690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:51:08.246400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

제작구분
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
대관공연
1934 
공동주최
1261 
기획-극장
536 
기획-관현악단
242 
기획-창극단
 
179
Other values (3)
 
172

Length

Max length7
Median length4
Mean length4.4010176
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기획-관현악단
2nd row기획-극장
3rd row기획-창극단
4th row기획-무용단
5th row기획-극장

Common Values

ValueCountFrequency (%)
대관공연 1934
44.7%
공동주최 1261
29.2%
기획-극장 536
 
12.4%
기획-관현악단 242
 
5.6%
기획-창극단 179
 
4.1%
기획-무용단 113
 
2.6%
기타 56
 
1.3%
<NA> 3
 
0.1%

Length

2023-12-13T04:51:08.461362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:51:08.629034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대관공연 1934
44.7%
공동주최 1261
29.2%
기획-극장 536
 
12.4%
기획-관현악단 242
 
5.6%
기획-창극단 179
 
4.1%
기획-무용단 113
 
2.6%
기타 56
 
1.3%
na 3
 
0.1%

시간
Text

MISSING 

Distinct1633
Distinct (%)42.9%
Missing519
Missing (%)12.0%
Memory size33.9 KiB
2023-12-13T04:51:09.024114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length173
Median length108
Mean length16.314323
Min length1

Characters and Unicode

Total characters62076
Distinct characters332
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1444 ?
Unique (%)38.0%

Sample

1st row19:30
2nd row19:30
3rd row15:00
4th row금 19:30, 토·일 15:00
5th row목·금 19:30, 토·일 15:00
ValueCountFrequency (%)
오후 1968
 
13.0%
953
 
6.3%
7시 666
 
4.4%
평일 508
 
3.3%
8시 502
 
3.3%
30분 491
 
3.2%
4시 440
 
2.9%
3시 389
 
2.6%
348
 
2.3%
19:30 290
 
1.9%
Other values (1695) 8638
56.9%
2023-12-13T04:51:09.748930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12013
19.4%
0 5389
 
8.7%
4413
 
7.1%
2827
 
4.6%
3 2729
 
4.4%
: 2728
 
4.4%
1 2719
 
4.4%
2585
 
4.2%
2411
 
3.9%
, 2014
 
3.2%
Other values (322) 22248
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21923
35.3%
Decimal Number 17935
28.9%
Space Separator 12014
19.4%
Other Punctuation 6459
 
10.4%
Lowercase Letter 1362
 
2.2%
Close Punctuation 896
 
1.4%
Open Punctuation 879
 
1.4%
Math Symbol 332
 
0.5%
Dash Punctuation 183
 
0.3%
Uppercase Letter 79
 
0.1%
Other values (4) 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4413
20.1%
2827
12.9%
2585
11.8%
2411
11.0%
1129
 
5.1%
925
 
4.2%
647
 
3.0%
599
 
2.7%
578
 
2.6%
514
 
2.3%
Other values (259) 5295
24.2%
Other Punctuation
ValueCountFrequency (%)
: 2728
42.2%
, 2014
31.2%
/ 1004
 
15.5%
* 257
 
4.0%
. 230
 
3.6%
· 164
 
2.5%
34
 
0.5%
! 12
 
0.2%
& 7
 
0.1%
' 4
 
0.1%
Other values (3) 5
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
m 650
47.7%
p 628
46.1%
l 50
 
3.7%
a 21
 
1.5%
o 3
 
0.2%
n 2
 
0.1%
r 2
 
0.1%
s 1
 
0.1%
b 1
 
0.1%
c 1
 
0.1%
Other values (3) 3
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 5389
30.0%
3 2729
15.2%
1 2719
15.2%
7 1839
 
10.3%
4 1111
 
6.2%
2 1041
 
5.8%
8 959
 
5.3%
5 798
 
4.4%
6 697
 
3.9%
9 653
 
3.6%
Math Symbol
ValueCountFrequency (%)
~ 234
70.5%
| 51
 
15.4%
> 16
 
4.8%
< 14
 
4.2%
7
 
2.1%
5
 
1.5%
4
 
1.2%
= 1
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
P 32
40.5%
M 32
40.5%
I 8
 
10.1%
O 2
 
2.5%
X 2
 
2.5%
D 1
 
1.3%
B 1
 
1.3%
K 1
 
1.3%
Space Separator
ValueCountFrequency (%)
12013
> 99.9%
  1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 888
99.1%
] 8
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 871
99.1%
[ 8
 
0.9%
Dash Punctuation
ValueCountFrequency (%)
- 183
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38712
62.4%
Hangul 21917
35.3%
Latin 1441
 
2.3%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4413
20.1%
2827
12.9%
2585
11.8%
2411
11.0%
1129
 
5.2%
925
 
4.2%
647
 
3.0%
599
 
2.7%
578
 
2.6%
514
 
2.3%
Other values (253) 5289
24.1%
Common
ValueCountFrequency (%)
12013
31.0%
0 5389
13.9%
3 2729
 
7.0%
: 2728
 
7.0%
1 2719
 
7.0%
, 2014
 
5.2%
7 1839
 
4.8%
4 1111
 
2.9%
2 1041
 
2.7%
/ 1004
 
2.6%
Other values (32) 6125
15.8%
Latin
ValueCountFrequency (%)
m 650
45.1%
p 628
43.6%
l 50
 
3.5%
P 32
 
2.2%
M 32
 
2.2%
a 21
 
1.5%
I 8
 
0.6%
o 3
 
0.2%
n 2
 
0.1%
r 2
 
0.1%
Other values (11) 13
 
0.9%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39932
64.3%
Hangul 21878
35.2%
None 176
 
0.3%
Compat Jamo 39
 
0.1%
Punctuation 36
 
0.1%
Math Operators 7
 
< 0.1%
CJK 6
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12013
30.1%
0 5389
13.5%
3 2729
 
6.8%
: 2728
 
6.8%
1 2719
 
6.8%
, 2014
 
5.0%
7 1839
 
4.6%
4 1111
 
2.8%
2 1041
 
2.6%
/ 1004
 
2.5%
Other values (42) 7345
18.4%
Hangul
ValueCountFrequency (%)
4413
20.2%
2827
12.9%
2585
11.8%
2411
11.0%
1129
 
5.2%
925
 
4.2%
647
 
3.0%
599
 
2.7%
578
 
2.6%
514
 
2.3%
Other values (252) 5250
24.0%
None
ValueCountFrequency (%)
· 164
93.2%
5
 
2.8%
4
 
2.3%
1
 
0.6%
  1
 
0.6%
1
 
0.6%
Compat Jamo
ValueCountFrequency (%)
39
100.0%
Punctuation
ValueCountFrequency (%)
34
94.4%
1
 
2.8%
1
 
2.8%
Math Operators
ValueCountFrequency (%)
7
100.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

장소
Categorical

Distinct27
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
달오름극장
998 
해오름극장
704 
별오름극장
512 
국립극장 KB 국민은행 청소년 하늘극장.
407 
KB청소년 하늘극장
391 
Other values (22)
1312 

Length

Max length22
Median length5
Mean length7.8073543
Min length2

Unique

Unique7 ?
Unique (%)0.2%

Sample

1st row해오름극장
2nd row하늘극장
3rd row하늘극장
4th row해오름극장
5th row달오름극장

Common Values

ValueCountFrequency (%)
달오름극장 998
23.1%
해오름극장 704
16.3%
별오름극장 512
11.8%
국립극장 KB 국민은행 청소년 하늘극장. 407
9.4%
KB청소년 하늘극장 391
 
9.0%
기타 301
 
7.0%
국립극장 해오름극장. 287
 
6.6%
국립극장 달오름극장. 268
 
6.2%
하늘극장 237
 
5.5%
국립극장 별오름극장. 163
 
3.8%
Other values (17) 56
 
1.3%

Length

2023-12-13T04:51:09.945875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
달오름극장 1266
17.9%
국립극장 1131
16.0%
하늘극장 1035
14.6%
해오름극장 991
14.0%
별오름극장 675
9.5%
kb 407
 
5.8%
국민은행 407
 
5.8%
청소년 407
 
5.8%
kb청소년 391
 
5.5%
기타 301
 
4.3%
Other values (24) 67
 
0.9%

티켓가격
Text

MISSING 

Distinct1479
Distinct (%)39.4%
Missing566
Missing (%)13.1%
Memory size33.9 KiB
2023-12-13T04:51:10.301791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length131
Median length96
Mean length18.415647
Min length1

Characters and Unicode

Total characters69206
Distinct characters351
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1234 ?
Unique (%)32.8%

Sample

1st rowR석 50,000원 / S석 30,000원 / A석 20,000원
2nd row전석 30,000원
3rd row전석 20,000원
4th rowVIP석 70,000원 / R석 50,000원 / S석 30,000원 / A석 20,000원
5th rowR석 40,000원 / S석 30,000원
ValueCountFrequency (%)
1346
 
9.1%
전석 1155
 
7.8%
20,000원 704
 
4.8%
s석 617
 
4.2%
r석 604
 
4.1%
30,000원 569
 
3.8%
2만원 503
 
3.4%
무료 406
 
2.7%
3만원 406
 
2.7%
a석 395
 
2.7%
Other values (1332) 8091
54.7%
2023-12-13T04:51:10.867038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 14413
20.8%
12276
17.7%
5336
 
7.7%
, 4636
 
6.7%
4062
 
5.9%
2206
 
3.2%
2 1909
 
2.8%
5 1490
 
2.2%
/ 1446
 
2.1%
1 1436
 
2.1%
Other values (341) 19996
28.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22545
32.6%
Decimal Number 21823
31.5%
Space Separator 12278
17.7%
Other Punctuation 7039
 
10.2%
Uppercase Letter 3878
 
5.6%
Open Punctuation 572
 
0.8%
Close Punctuation 571
 
0.8%
Dash Punctuation 228
 
0.3%
Lowercase Letter 178
 
0.3%
Math Symbol 75
 
0.1%
Other values (3) 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5336
23.7%
4062
18.0%
2206
 
9.8%
1243
 
5.5%
463
 
2.1%
460
 
2.0%
451
 
2.0%
378
 
1.7%
377
 
1.7%
369
 
1.6%
Other values (264) 7200
31.9%
Lowercase Letter
ValueCountFrequency (%)
l 62
34.8%
e 14
 
7.9%
b 11
 
6.2%
o 11
 
6.2%
i 10
 
5.6%
t 10
 
5.6%
r 9
 
5.1%
n 8
 
4.5%
a 7
 
3.9%
c 6
 
3.4%
Other values (12) 30
16.9%
Uppercase Letter
ValueCountFrequency (%)
S 972
25.1%
R 931
24.0%
A 696
17.9%
I 370
 
9.5%
V 331
 
8.5%
P 330
 
8.5%
B 196
 
5.1%
C 25
 
0.6%
T 7
 
0.2%
O 6
 
0.2%
Other values (6) 14
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 4636
65.9%
/ 1446
 
20.5%
: 515
 
7.3%
* 325
 
4.6%
. 55
 
0.8%
% 27
 
0.4%
· 19
 
0.3%
\ 9
 
0.1%
" 5
 
0.1%
1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 14413
66.0%
2 1909
 
8.7%
5 1490
 
6.8%
1 1436
 
6.6%
3 1414
 
6.5%
8 341
 
1.6%
4 329
 
1.5%
7 300
 
1.4%
6 149
 
0.7%
9 42
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 18
24.0%
< 17
22.7%
~ 13
17.3%
| 11
14.7%
5
 
6.7%
+ 4
 
5.3%
4
 
5.3%
= 3
 
4.0%
Space Separator
ValueCountFrequency (%)
12276
> 99.9%
  2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 565
98.8%
[ 7
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 564
98.8%
] 7
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 228
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 42605
61.6%
Hangul 22544
32.6%
Latin 4056
 
5.9%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5336
23.7%
4062
18.0%
2206
 
9.8%
1243
 
5.5%
463
 
2.1%
460
 
2.0%
451
 
2.0%
378
 
1.7%
377
 
1.7%
369
 
1.6%
Other values (263) 7199
31.9%
Common
ValueCountFrequency (%)
0 14413
33.8%
12276
28.8%
, 4636
 
10.9%
2 1909
 
4.5%
5 1490
 
3.5%
/ 1446
 
3.4%
1 1436
 
3.4%
3 1414
 
3.3%
( 565
 
1.3%
) 564
 
1.3%
Other values (29) 2456
 
5.8%
Latin
ValueCountFrequency (%)
S 972
24.0%
R 931
23.0%
A 696
17.2%
I 370
 
9.1%
V 331
 
8.2%
P 330
 
8.1%
B 196
 
4.8%
l 62
 
1.5%
C 25
 
0.6%
e 14
 
0.3%
Other values (28) 129
 
3.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 46626
67.4%
Hangul 22436
32.4%
Compat Jamo 108
 
0.2%
None 25
 
< 0.1%
Arrows 5
 
< 0.1%
Punctuation 5
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14413
30.9%
12276
26.3%
, 4636
 
9.9%
2 1909
 
4.1%
5 1490
 
3.2%
/ 1446
 
3.1%
1 1436
 
3.1%
3 1414
 
3.0%
S 972
 
2.1%
R 931
 
2.0%
Other values (60) 5703
 
12.2%
Hangul
ValueCountFrequency (%)
5336
23.8%
4062
18.1%
2206
 
9.8%
1243
 
5.5%
463
 
2.1%
460
 
2.1%
451
 
2.0%
378
 
1.7%
377
 
1.7%
369
 
1.6%
Other values (261) 7091
31.6%
Compat Jamo
ValueCountFrequency (%)
107
99.1%
1
 
0.9%
None
ValueCountFrequency (%)
· 19
76.0%
4
 
16.0%
  2
 
8.0%
Arrows
ValueCountFrequency (%)
5
100.0%
Punctuation
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
CJK
ValueCountFrequency (%)
1
100.0%

관람연령
Text

MISSING 

Distinct120
Distinct (%)6.2%
Missing2382
Missing (%)55.1%
Memory size33.9 KiB
2023-12-13T04:51:11.107710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length24
Mean length7.3182286
Min length2

Characters and Unicode

Total characters14212
Distinct characters51
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)2.9%

Sample

1st row초등학생 이상 관람가
2nd row초등학생 이상 관람가
3rd row8세 이상 관람가
4th row8세 이상 관람가
5th row8세 이상 관람가
ValueCountFrequency (%)
이상 1048
26.1%
8세 676
16.8%
관람가 576
14.3%
초등학생이상 497
12.4%
관람 313
 
7.8%
전체관람 118
 
2.9%
90
 
2.2%
초등학생 78
 
1.9%
7세 63
 
1.6%
48개월이상 53
 
1.3%
Other values (72) 509
12.7%
2023-12-13T04:51:11.527429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2082
14.6%
1811
12.7%
1808
12.7%
1020
 
7.2%
1016
 
7.1%
1015
 
7.1%
8 779
 
5.5%
647
 
4.6%
641
 
4.5%
597
 
4.2%
Other values (41) 2796
19.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10615
74.7%
Space Separator 2082
 
14.6%
Decimal Number 1488
 
10.5%
Close Punctuation 13
 
0.1%
Open Punctuation 13
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1811
17.1%
1808
17.0%
1020
9.6%
1016
9.6%
1015
9.6%
647
 
6.1%
641
 
6.0%
597
 
5.6%
577
 
5.4%
575
 
5.4%
Other values (28) 908
8.6%
Decimal Number
ValueCountFrequency (%)
8 779
52.4%
1 157
 
10.6%
7 115
 
7.7%
4 104
 
7.0%
3 91
 
6.1%
5 88
 
5.9%
6 76
 
5.1%
2 61
 
4.1%
0 17
 
1.1%
Space Separator
ValueCountFrequency (%)
2082
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10615
74.7%
Common 3597
 
25.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1811
17.1%
1808
17.0%
1020
9.6%
1016
9.6%
1015
9.6%
647
 
6.1%
641
 
6.0%
597
 
5.6%
577
 
5.4%
575
 
5.4%
Other values (28) 908
8.6%
Common
ValueCountFrequency (%)
2082
57.9%
8 779
 
21.7%
1 157
 
4.4%
7 115
 
3.2%
4 104
 
2.9%
3 91
 
2.5%
5 88
 
2.4%
6 76
 
2.1%
2 61
 
1.7%
0 17
 
0.5%
Other values (3) 27
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10615
74.7%
ASCII 3597
 
25.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2082
57.9%
8 779
 
21.7%
1 157
 
4.4%
7 115
 
3.2%
4 104
 
2.9%
3 91
 
2.5%
5 88
 
2.4%
6 76
 
2.1%
2 61
 
1.7%
0 17
 
0.5%
Other values (3) 27
 
0.8%
Hangul
ValueCountFrequency (%)
1811
17.1%
1808
17.0%
1020
9.6%
1016
9.6%
1015
9.6%
647
 
6.1%
641
 
6.0%
597
 
5.6%
577
 
5.4%
575
 
5.4%
Other values (28) 908
8.6%

러닝타임
Text

MISSING 

Distinct320
Distinct (%)14.5%
Missing2115
Missing (%)48.9%
Memory size33.9 KiB
2023-12-13T04:51:11.844154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length52
Mean length4.4495247
Min length2

Characters and Unicode

Total characters9829
Distinct characters99
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique218 ?
Unique (%)9.9%

Sample

1st row70분(휴식 20분)
2nd row150분
3rd row300분
4th row60분
5th row130분
ValueCountFrequency (%)
90분 181
 
6.9%
70 179
 
6.8%
90 175
 
6.7%
70분 146
 
5.5%
60분 118
 
4.5%
100분 111
 
4.2%
100 103
 
3.9%
80분 95
 
3.6%
60 87
 
3.3%
포함 85
 
3.2%
Other values (269) 1351
51.3%
2023-12-13T04:51:12.354856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2283
23.2%
1242
12.6%
1126
11.5%
1 879
 
8.9%
9 391
 
4.0%
7 388
 
3.9%
2 366
 
3.7%
5 292
 
3.0%
6 249
 
2.5%
( 244
 
2.5%
Other values (89) 2369
24.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5260
53.5%
Other Letter 2873
29.2%
Space Separator 1126
 
11.5%
Open Punctuation 244
 
2.5%
Close Punctuation 243
 
2.5%
Other Punctuation 50
 
0.5%
Lowercase Letter 12
 
0.1%
Math Symbol 11
 
0.1%
Dash Punctuation 7
 
0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1242
43.2%
225
 
7.8%
144
 
5.0%
136
 
4.7%
136
 
4.7%
91
 
3.2%
91
 
3.2%
85
 
3.0%
78
 
2.7%
77
 
2.7%
Other values (64) 568
19.8%
Decimal Number
ValueCountFrequency (%)
0 2283
43.4%
1 879
 
16.7%
9 391
 
7.4%
7 388
 
7.4%
2 366
 
7.0%
5 292
 
5.6%
6 249
 
4.7%
8 220
 
4.2%
3 110
 
2.1%
4 82
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 17
34.0%
/ 17
34.0%
: 15
30.0%
! 1
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
R 1
33.3%
S 1
33.3%
A 1
33.3%
Math Symbol
ValueCountFrequency (%)
~ 10
90.9%
| 1
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
p 6
50.0%
m 6
50.0%
Space Separator
ValueCountFrequency (%)
1126
100.0%
Open Punctuation
ValueCountFrequency (%)
( 244
100.0%
Close Punctuation
ValueCountFrequency (%)
) 243
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6941
70.6%
Hangul 2873
29.2%
Latin 15
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1242
43.2%
225
 
7.8%
144
 
5.0%
136
 
4.7%
136
 
4.7%
91
 
3.2%
91
 
3.2%
85
 
3.0%
78
 
2.7%
77
 
2.7%
Other values (64) 568
19.8%
Common
ValueCountFrequency (%)
0 2283
32.9%
1126
16.2%
1 879
 
12.7%
9 391
 
5.6%
7 388
 
5.6%
2 366
 
5.3%
5 292
 
4.2%
6 249
 
3.6%
( 244
 
3.5%
) 243
 
3.5%
Other values (10) 480
 
6.9%
Latin
ValueCountFrequency (%)
p 6
40.0%
m 6
40.0%
R 1
 
6.7%
S 1
 
6.7%
A 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6956
70.8%
Hangul 2873
29.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2283
32.8%
1126
16.2%
1 879
 
12.6%
9 391
 
5.6%
7 388
 
5.6%
2 366
 
5.3%
5 292
 
4.2%
6 249
 
3.6%
( 244
 
3.5%
) 243
 
3.5%
Other values (15) 495
 
7.1%
Hangul
ValueCountFrequency (%)
1242
43.2%
225
 
7.8%
144
 
5.0%
136
 
4.7%
136
 
4.7%
91
 
3.2%
91
 
3.2%
85
 
3.0%
78
 
2.7%
77
 
2.7%
Other values (64) 568
19.8%

주최
Text

MISSING 

Distinct591
Distinct (%)33.1%
Missing2541
Missing (%)58.8%
Memory size33.9 KiB
2023-12-13T04:51:12.677761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length41
Mean length7.239484
Min length2

Characters and Unicode

Total characters12908
Distinct characters501
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique485 ?
Unique (%)27.2%

Sample

1st row국립극장
2nd row국립극장
3rd row국립극장
4th row국립극장
5th row국립극장
ValueCountFrequency (%)
국립극장 939
41.0%
국립중앙극장 122
 
5.3%
사단법인 47
 
2.1%
극단 41
 
1.8%
문화체육관광부 21
 
0.9%
국립발레단 15
 
0.7%
사)서울국제문화교류회 13
 
0.6%
꾸러기예술단 12
 
0.5%
무용단 12
 
0.5%
한국장애인문화예술단체총연합회 10
 
0.4%
Other values (741) 1060
46.2%
2023-12-13T04:51:13.251974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1398
 
10.8%
1227
 
9.5%
1201
 
9.3%
1154
 
8.9%
523
 
4.1%
358
 
2.8%
, 235
 
1.8%
201
 
1.6%
182
 
1.4%
) 174
 
1.3%
Other values (491) 6255
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11276
87.4%
Space Separator 523
 
4.1%
Other Punctuation 257
 
2.0%
Uppercase Letter 224
 
1.7%
Lowercase Letter 203
 
1.6%
Close Punctuation 174
 
1.3%
Open Punctuation 163
 
1.3%
Decimal Number 62
 
0.5%
Other Symbol 14
 
0.1%
Dash Punctuation 6
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1398
 
12.4%
1227
 
10.9%
1201
 
10.7%
1154
 
10.2%
358
 
3.2%
201
 
1.8%
182
 
1.6%
147
 
1.3%
144
 
1.3%
141
 
1.3%
Other values (423) 5123
45.4%
Uppercase Letter
ValueCountFrequency (%)
A 26
11.6%
T 20
 
8.9%
C 19
 
8.5%
S 18
 
8.0%
E 17
 
7.6%
N 16
 
7.1%
M 15
 
6.7%
D 11
 
4.9%
B 10
 
4.5%
O 9
 
4.0%
Other values (14) 63
28.1%
Lowercase Letter
ValueCountFrequency (%)
e 26
12.8%
a 21
10.3%
o 20
9.9%
n 17
 
8.4%
c 17
 
8.4%
m 13
 
6.4%
t 12
 
5.9%
r 11
 
5.4%
h 11
 
5.4%
i 10
 
4.9%
Other values (10) 45
22.2%
Decimal Number
ValueCountFrequency (%)
1 21
33.9%
2 10
16.1%
5 10
16.1%
3 9
14.5%
0 5
 
8.1%
4 2
 
3.2%
8 2
 
3.2%
7 2
 
3.2%
6 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 235
91.4%
/ 9
 
3.5%
. 6
 
2.3%
& 3
 
1.2%
' 2
 
0.8%
· 1
 
0.4%
: 1
 
0.4%
Space Separator
ValueCountFrequency (%)
523
100.0%
Close Punctuation
ValueCountFrequency (%)
) 174
100.0%
Open Punctuation
ValueCountFrequency (%)
( 163
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11288
87.4%
Common 1191
 
9.2%
Latin 427
 
3.3%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1398
 
12.4%
1227
 
10.9%
1201
 
10.6%
1154
 
10.2%
358
 
3.2%
201
 
1.8%
182
 
1.6%
147
 
1.3%
144
 
1.3%
141
 
1.2%
Other values (422) 5135
45.5%
Latin
ValueCountFrequency (%)
A 26
 
6.1%
e 26
 
6.1%
a 21
 
4.9%
T 20
 
4.7%
o 20
 
4.7%
C 19
 
4.4%
S 18
 
4.2%
n 17
 
4.0%
E 17
 
4.0%
c 17
 
4.0%
Other values (34) 226
52.9%
Common
ValueCountFrequency (%)
523
43.9%
, 235
19.7%
) 174
 
14.6%
( 163
 
13.7%
1 21
 
1.8%
2 10
 
0.8%
5 10
 
0.8%
3 9
 
0.8%
/ 9
 
0.8%
. 6
 
0.5%
Other values (13) 31
 
2.6%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11273
87.3%
ASCII 1614
 
12.5%
None 15
 
0.1%
Punctuation 3
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1398
 
12.4%
1227
 
10.9%
1201
 
10.7%
1154
 
10.2%
358
 
3.2%
201
 
1.8%
182
 
1.6%
147
 
1.3%
144
 
1.3%
141
 
1.3%
Other values (420) 5120
45.4%
ASCII
ValueCountFrequency (%)
523
32.4%
, 235
14.6%
) 174
 
10.8%
( 163
 
10.1%
A 26
 
1.6%
e 26
 
1.6%
a 21
 
1.3%
1 21
 
1.3%
T 20
 
1.2%
o 20
 
1.2%
Other values (54) 385
23.9%
None
ValueCountFrequency (%)
14
93.3%
· 1
 
6.7%
Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-13T04:51:13.396656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장르전속단체제작구분장소
장르1.0000.8670.5750.625
전속단체0.8671.0000.8560.806
제작구분0.5750.8561.0000.628
장소0.6250.8060.6281.000
2023-12-13T04:51:13.516546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장르제작구분장소전속단체
장르1.0000.3120.2200.424
제작구분0.3121.0000.3270.651
장소0.2200.3271.0000.366
전속단체0.4240.6510.3661.000
2023-12-13T04:51:13.640098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장르전속단체제작구분장소
장르1.0000.4240.3120.220
전속단체0.4241.0000.6510.366
제작구분0.3120.6511.0000.327
장소0.2200.3660.3271.000

Missing values

2023-12-13T04:51:04.089347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:51:04.353817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T04:51:04.569182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

공연명기획사장르전속단체공연시작일공연종료일제작구분시간장소티켓가격관람연령러닝타임주최
0관현악시리즈Ⅳ <부재(不在)>국립극장국악국립국악관현악단2023-06-302023-06-30기획-관현악단19:30해오름극장R석 50,000원 / S석 30,000원 / A석 20,000원초등학생 이상 관람가70분(휴식 20분)국립극장
1윤진철 X 김동언 <불문율>국립극장콘서트<NA>2023-06-302023-06-30기획-극장19:30하늘극장전석 30,000원초등학생 이상 관람가150분국립극장
22023 국립극장 <완창판소리> 6월국립극장국악국립창극단2023-06-242023-06-24기획-창극단15:00하늘극장전석 20,000원8세 이상 관람가300분국립극장
3국립무용단 <산조>국립극장무용국립무용단2023-06-232023-06-25기획-무용단금 19:30, 토·일 15:00해오름극장VIP석 70,000원 / R석 50,000원 / S석 30,000원 / A석 20,000원8세 이상 관람가60분국립극장
4우리 읍내국립극장연극국립극장2023-06-222023-06-25기획-극장목·금 19:30, 토·일 15:00달오름극장R석 40,000원 / S석 30,000원8세 이상 관람가130분국립극장
5꾸러기음악회-신나는 여름사단법인꾸러기예술단클래식<NA>2023-06-172023-06-18대관공연15:30하늘극장R석 50,000원, S석 40,000원36개월 이상 입장가100분(휴식 15분)사단법인꾸러기예술단
6국립무용단 <산조> 오픈 클래스국립극장무용국립무용단2023-06-102023-06-10기획-무용단15:00해오름 제 2연습실전석 10,000원8세 이상 관람가60분국립극장
7만추지절(萬秋之節) 남산골을 거닐다.컬쳐스테이지무용<NA>2023-06-102023-06-10대관공연19:00하늘극장전석 30,000원8세 이상 관람가120분컬쳐스테이지
8국립창극단 <베니스의 상인들>국립극장창극국립창극단2023-06-082023-06-11기획-창극단목·금 19:30, 토·일 15:00해오름극장VIP석 80,000원 / R석 60,000원 / S석 40,000원 / A석 20,000원8세 이상 관람가160분(중간휴식 포함)국립극장
9만해 한용운 선생 79주기 추모예술제재단법인 선학원행사<NA>2023-06-042023-06-04대관공연15:00하늘극장전석 초대8세 이상 관람가120분재단법인 선학원
공연명기획사장르전속단체공연시작일공연종료일제작구분시간장소티켓가격관람연령러닝타임주최
4314전국 무용지도교사 연수.국립극장기타<NA>2001-01-082001-01-20공동주최09:00 - 17:00달오름극장<NA><NA><NA>
4315제71회 정기연주회.국립극장기타<NA>2001-01-082001-02-08대관공연오후 7시 30분해오름극장<NA><NA><NA>
4316국립극장과 함께하는 남산 문화탐방.국립극장연극<NA>2001-01-082001-12-31공동주최기타<NA><NA><NA>
4317개막공연 FAMFAM-소리미로.국립극장연극<NA>2001-01-082001-02-08대관공연오후 7:30해오름극장R석 30,000원 S석 20,000원 A석 10,000원<NA><NA><NA>
4318제5회 서울소극장오페라페스티벌.국립극장연극<NA>2001-01-082001-02-08공동주최해오름극장<NA><NA><NA>
4319셰익스피어 난장 - 뮤지컬 십이야.국립극장연극<NA>2001-01-082001-02-08대관공연KB청소년 하늘극장<NA><NA><NA>
4320셰익스피어 난장 - 뮤지컬 십이야.국립극장연극<NA>2001-01-082001-02-08대관공연5. 13(목) - 16(일)19:30 (*우천시 공연 없음)KB청소년 하늘극장성인(25,000원), 학생(15,000원)<NA><NA><NA>
4321타오르는 어둠 속에서.국립극장연극<NA>2001-01-082001-02-08공동주최해오름극장일반 - 20,000원 / 대학생 - 15,000원 / 청소년 - 10,000원<NA><NA><NA>
4322마법의 성.국립극장뮤지컬<NA>2001-01-032001-01-07공동주최오후1시, 3시달오름극장으뜸석15,000원 /일반석10,000원<NA><NA><NA>
4323지저스 크라이스트 수퍼스타.국립극장뮤지컬<NA>2001-01-012001-01-13공동주최4시/7시((첫날 오후 7시 1회 공연)해오름극장R석(50,000원) S석(40,000원) A석(30,000원) B석(20,000원)<NA><NA><NA>