Overview

Dataset statistics

Number of variables15
Number of observations10000
Missing cells65363
Missing cells (%)43.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 MiB
Average record size in memory134.0 B

Variable types

Numeric4
Text9
Unsupported2

Dataset

Description1999-2006 국내 공연예술 활동 정보(공연명, 단체명, 공연기간, 주최, 장소, 장르, 지역 정보 등 포함)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15097637/fileData.do

Alerts

연번 is highly overall correlated with 연도High correlation
연도 is highly overall correlated with 연번High correlation
공연횟수 is highly overall correlated with 공연기간High correlation
공연기간 is highly overall correlated with 공연횟수High correlation
단체명 has 6056 (60.6%) missing valuesMissing
공연횟수 has 9869 (98.7%) missing valuesMissing
주최 has 9333 (93.3%) missing valuesMissing
지역1 has 10000 (100.0%) missing valuesMissing
지역2 has 10000 (100.0%) missing valuesMissing
비고 has 3113 (31.1%) missing valuesMissing
has 8861 (88.6%) missing valuesMissing
장르 has 8020 (80.2%) missing valuesMissing
연번 has unique valuesUnique
지역1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
지역2 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 06:23:29.869096
Analysis finished2023-12-12 06:23:36.379534
Duration6.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26767.676
Minimum5
Maximum53301
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:23:36.484550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile2777.45
Q113223.25
median26851.5
Q340233
95-th percentile50702.15
Maximum53301
Range53296
Interquartile range (IQR)27009.75

Descriptive statistics

Standard deviation15485.662
Coefficient of variation (CV)0.57852096
Kurtosis-1.2172711
Mean26767.676
Median Absolute Deviation (MAD)13494.5
Skewness-0.0040762773
Sum2.6767676 × 108
Variance2.3980572 × 108
MonotonicityNot monotonic
2023-12-12T15:23:36.668124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37473 1
 
< 0.1%
51570 1
 
< 0.1%
48504 1
 
< 0.1%
15755 1
 
< 0.1%
41712 1
 
< 0.1%
28071 1
 
< 0.1%
26734 1
 
< 0.1%
34564 1
 
< 0.1%
42011 1
 
< 0.1%
51768 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
5 1
< 0.1%
14 1
< 0.1%
22 1
< 0.1%
28 1
< 0.1%
29 1
< 0.1%
33 1
< 0.1%
42 1
< 0.1%
43 1
< 0.1%
44 1
< 0.1%
45 1
< 0.1%
ValueCountFrequency (%)
53301 1
< 0.1%
53291 1
< 0.1%
53288 1
< 0.1%
53286 1
< 0.1%
53285 1
< 0.1%
53273 1
< 0.1%
53268 1
< 0.1%
53267 1
< 0.1%
53266 1
< 0.1%
53263 1
< 0.1%

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2001.3181
Minimum1999
Maximum2004
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:23:36.838378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1999
5-th percentile1999
Q12000
median2001
Q32003
95-th percentile2004
Maximum2004
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.646142
Coefficient of variation (CV)0.00082252889
Kurtosis-1.1765407
Mean2001.3181
Median Absolute Deviation (MAD)1
Skewness0.12567404
Sum20013181
Variance2.7097834
MonotonicityNot monotonic
2023-12-12T15:23:36.987238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2000 1926
19.3%
2002 1843
18.4%
1999 1756
17.6%
2001 1714
17.1%
2003 1507
15.1%
2004 1254
12.5%
ValueCountFrequency (%)
1999 1756
17.6%
2000 1926
19.3%
2001 1714
17.1%
2002 1843
18.4%
2003 1507
15.1%
2004 1254
12.5%
ValueCountFrequency (%)
2004 1254
12.5%
2003 1507
15.1%
2002 1843
18.4%
2001 1714
17.1%
2000 1926
19.3%
1999 1756
17.6%
Distinct8902
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T15:23:37.420145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length92
Median length59
Mean length15.6919
Min length1

Characters and Unicode

Total characters156919
Distinct characters1157
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8282 ?
Unique (%)82.8%

Sample

1st row2002 부산합창올림픽 만남의 콘서트 - 청소년합창단 글로리아
2nd row대구시립교향악단 가정음악회
3rd row청소년 음악회
4th row장세정 현대 피아노음악 콘서트
5th row해설이 있는 전통예술로의 여행
ValueCountFrequency (%)
정기연주회 965
 
2.8%
연주회 687
 
2.0%
독주회 653
 
1.9%
피아노 489
 
1.4%
공연 459
 
1.3%
446
 
1.3%
음악회 349
 
1.0%
위한 271
 
0.8%
독창회 229
 
0.7%
오케스트라 226
 
0.7%
Other values (12729) 29930
86.2%
2023-12-12T15:23:38.056238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24719
 
15.8%
6441
 
4.1%
4323
 
2.8%
4052
 
2.6%
3582
 
2.3%
2488
 
1.6%
2352
 
1.5%
2071
 
1.3%
1975
 
1.3%
1846
 
1.2%
Other values (1147) 103070
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120066
76.5%
Space Separator 24719
 
15.8%
Decimal Number 6144
 
3.9%
Dash Punctuation 1117
 
0.7%
Uppercase Letter 882
 
0.6%
Lowercase Letter 800
 
0.5%
Math Symbol 666
 
0.4%
Other Punctuation 662
 
0.4%
Final Punctuation 580
 
0.4%
Initial Punctuation 449
 
0.3%
Other values (4) 834
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6441
 
5.4%
4323
 
3.6%
4052
 
3.4%
3582
 
3.0%
2488
 
2.1%
2352
 
2.0%
2071
 
1.7%
1975
 
1.6%
1846
 
1.5%
1837
 
1.5%
Other values (1040) 89099
74.2%
Uppercase Letter
ValueCountFrequency (%)
S 149
16.9%
B 130
14.7%
K 97
11.0%
C 88
10.0%
M 55
 
6.2%
A 45
 
5.1%
T 34
 
3.9%
F 29
 
3.3%
I 25
 
2.8%
E 24
 
2.7%
Other values (16) 206
23.4%
Lowercase Letter
ValueCountFrequency (%)
o 89
11.1%
a 84
10.5%
e 81
10.1%
n 79
9.9%
i 65
 
8.1%
r 56
 
7.0%
t 48
 
6.0%
s 44
 
5.5%
l 39
 
4.9%
c 33
 
4.1%
Other values (15) 182
22.8%
Other Punctuation
ValueCountFrequency (%)
, 324
48.9%
127
 
19.2%
. 48
 
7.3%
& 37
 
5.6%
! 28
 
4.2%
" 28
 
4.2%
' 28
 
4.2%
? 15
 
2.3%
10
 
1.5%
/ 6
 
0.9%
Other values (6) 11
 
1.7%
Decimal Number
ValueCountFrequency (%)
0 1625
26.4%
2 1332
21.7%
1 835
13.6%
3 502
 
8.2%
9 471
 
7.7%
4 433
 
7.0%
5 349
 
5.7%
7 202
 
3.3%
6 200
 
3.3%
8 195
 
3.2%
Math Symbol
ValueCountFrequency (%)
314
47.1%
314
47.1%
> 15
 
2.3%
< 12
 
1.8%
+ 6
 
0.9%
= 3
 
0.5%
~ 1
 
0.2%
± 1
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 1103
98.7%
7
 
0.6%
6
 
0.5%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
182
44.4%
102
24.9%
64
 
15.6%
( 62
 
15.1%
Close Punctuation
ValueCountFrequency (%)
179
43.8%
103
25.2%
) 64
 
15.6%
63
 
15.4%
Letter Number
ValueCountFrequency (%)
6
42.9%
6
42.9%
1
 
7.1%
1
 
7.1%
Final Punctuation
ValueCountFrequency (%)
575
99.1%
5
 
0.9%
Initial Punctuation
ValueCountFrequency (%)
444
98.9%
5
 
1.1%
Space Separator
ValueCountFrequency (%)
24719
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 119949
76.4%
Common 35157
 
22.4%
Latin 1696
 
1.1%
Han 117
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6441
 
5.4%
4323
 
3.6%
4052
 
3.4%
3582
 
3.0%
2488
 
2.1%
2352
 
2.0%
2071
 
1.7%
1975
 
1.6%
1846
 
1.5%
1837
 
1.5%
Other values (963) 88982
74.2%
Han
ValueCountFrequency (%)
9
 
7.7%
6
 
5.1%
6
 
5.1%
4
 
3.4%
4
 
3.4%
3
 
2.6%
3
 
2.6%
3
 
2.6%
2
 
1.7%
2
 
1.7%
Other values (67) 75
64.1%
Latin
ValueCountFrequency (%)
S 149
 
8.8%
B 130
 
7.7%
K 97
 
5.7%
o 89
 
5.2%
C 88
 
5.2%
a 84
 
5.0%
e 81
 
4.8%
n 79
 
4.7%
i 65
 
3.8%
r 56
 
3.3%
Other values (45) 778
45.9%
Common
ValueCountFrequency (%)
24719
70.3%
0 1625
 
4.6%
2 1332
 
3.8%
- 1103
 
3.1%
1 835
 
2.4%
575
 
1.6%
3 502
 
1.4%
9 471
 
1.3%
444
 
1.3%
4 433
 
1.2%
Other values (42) 3118
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 119945
76.4%
ASCII 34334
 
21.9%
None 1330
 
0.8%
Punctuation 1174
 
0.7%
CJK 117
 
0.1%
Number Forms 14
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Jamo 2
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24719
72.0%
0 1625
 
4.7%
2 1332
 
3.9%
- 1103
 
3.2%
1 835
 
2.4%
3 502
 
1.5%
9 471
 
1.4%
4 433
 
1.3%
5 349
 
1.0%
, 324
 
0.9%
Other values (72) 2641
 
7.7%
Hangul
ValueCountFrequency (%)
6441
 
5.4%
4323
 
3.6%
4052
 
3.4%
3582
 
3.0%
2488
 
2.1%
2352
 
2.0%
2071
 
1.7%
1975
 
1.6%
1846
 
1.5%
1837
 
1.5%
Other values (959) 88978
74.2%
Punctuation
ValueCountFrequency (%)
575
49.0%
444
37.8%
127
 
10.8%
10
 
0.9%
7
 
0.6%
5
 
0.4%
5
 
0.4%
1
 
0.1%
None
ValueCountFrequency (%)
314
23.6%
314
23.6%
182
13.7%
179
13.5%
103
 
7.7%
102
 
7.7%
64
 
4.8%
63
 
4.7%
6
 
0.5%
1
 
0.1%
Other values (2) 2
 
0.2%
CJK
ValueCountFrequency (%)
9
 
7.7%
6
 
5.1%
6
 
5.1%
4
 
3.4%
4
 
3.4%
3
 
2.6%
3
 
2.6%
3
 
2.6%
2
 
1.7%
2
 
1.7%
Other values (67) 75
64.1%
Number Forms
ValueCountFrequency (%)
6
42.9%
6
42.9%
1
 
7.1%
1
 
7.1%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

단체명
Text

MISSING 

Distinct2798
Distinct (%)70.9%
Missing6056
Missing (%)60.6%
Memory size156.2 KiB
2023-12-12T15:23:38.380415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length210
Median length177
Mean length14.386156
Min length1

Characters and Unicode

Total characters56739
Distinct characters790
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2370 ?
Unique (%)60.1%

Sample

1st row양영욱(색소폰)
2nd row잉에 로자(피아노), 신선미(소프라노)
3rd row임형주
4th row티엔에스컴퍼니
5th row부여군 충남국악단
ValueCountFrequency (%)
극단 162
 
1.6%
155
 
1.5%
합창단 111
 
1.1%
102
 
1.0%
(피아노) 83
 
0.8%
78
 
0.8%
오케스트라 78
 
0.8%
국립국악원 46
 
0.5%
44
 
0.4%
앙상블 43
 
0.4%
Other values (5930) 9175
91.0%
2023-12-12T15:23:38.852493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6144
 
10.8%
, 2861
 
5.0%
2455
 
4.3%
2429
 
4.3%
1390
 
2.4%
1099
 
1.9%
841
 
1.5%
821
 
1.4%
766
 
1.4%
762
 
1.3%
Other values (780) 37171
65.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41083
72.4%
Space Separator 6144
 
10.8%
Other Punctuation 3094
 
5.5%
Close Punctuation 2476
 
4.4%
Open Punctuation 2474
 
4.4%
Lowercase Letter 684
 
1.2%
Uppercase Letter 492
 
0.9%
Decimal Number 178
 
0.3%
Math Symbol 29
 
0.1%
Final Punctuation 28
 
< 0.1%
Other values (4) 57
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1390
 
3.4%
1099
 
2.7%
841
 
2.0%
821
 
2.0%
766
 
1.9%
762
 
1.9%
744
 
1.8%
730
 
1.8%
722
 
1.8%
594
 
1.4%
Other values (690) 32614
79.4%
Lowercase Letter
ValueCountFrequency (%)
a 92
13.5%
i 90
13.2%
n 70
10.2%
o 57
 
8.3%
u 48
 
7.0%
h 40
 
5.8%
s 37
 
5.4%
r 37
 
5.4%
e 34
 
5.0%
t 27
 
3.9%
Other values (16) 152
22.2%
Uppercase Letter
ValueCountFrequency (%)
S 72
14.6%
B 55
11.2%
K 49
 
10.0%
M 42
 
8.5%
A 35
 
7.1%
C 34
 
6.9%
N 23
 
4.7%
T 20
 
4.1%
Y 18
 
3.7%
P 15
 
3.0%
Other values (16) 129
26.2%
Decimal Number
ValueCountFrequency (%)
0 40
22.5%
2 35
19.7%
4 27
15.2%
1 24
13.5%
5 14
 
7.9%
6 12
 
6.7%
7 10
 
5.6%
3 7
 
3.9%
8 7
 
3.9%
9 2
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 2861
92.5%
129
 
4.2%
/ 31
 
1.0%
. 31
 
1.0%
: 20
 
0.6%
& 15
 
0.5%
5
 
0.2%
? 1
 
< 0.1%
" 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
2429
98.2%
( 39
 
1.6%
3
 
0.1%
[ 2
 
0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
11
37.9%
11
37.9%
< 3
 
10.3%
> 3
 
10.3%
~ 1
 
3.4%
Close Punctuation
ValueCountFrequency (%)
2455
99.2%
) 20
 
0.8%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
6144
100.0%
Final Punctuation
ValueCountFrequency (%)
28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Initial Punctuation
ValueCountFrequency (%)
27
100.0%
Spacing Mark
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41074
72.4%
Common 14477
 
25.5%
Latin 1176
 
2.1%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1390
 
3.4%
1099
 
2.7%
841
 
2.0%
821
 
2.0%
766
 
1.9%
762
 
1.9%
744
 
1.8%
730
 
1.8%
722
 
1.8%
594
 
1.4%
Other values (680) 32605
79.4%
Latin
ValueCountFrequency (%)
a 92
 
7.8%
i 90
 
7.7%
S 72
 
6.1%
n 70
 
6.0%
o 57
 
4.8%
B 55
 
4.7%
K 49
 
4.2%
u 48
 
4.1%
M 42
 
3.6%
h 40
 
3.4%
Other values (42) 561
47.7%
Common
ValueCountFrequency (%)
6144
42.4%
, 2861
19.8%
2455
 
17.0%
2429
 
16.8%
129
 
0.9%
0 40
 
0.3%
( 39
 
0.3%
2 35
 
0.2%
/ 31
 
0.2%
. 31
 
0.2%
Other values (26) 283
 
2.0%
Han
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41071
72.4%
ASCII 10553
 
18.6%
None 4919
 
8.7%
Punctuation 184
 
0.3%
CJK 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6144
58.2%
, 2861
27.1%
a 92
 
0.9%
i 90
 
0.9%
S 72
 
0.7%
n 70
 
0.7%
o 57
 
0.5%
B 55
 
0.5%
K 49
 
0.5%
u 48
 
0.5%
Other values (67) 1015
 
9.6%
None
ValueCountFrequency (%)
2455
49.9%
2429
49.4%
11
 
0.2%
11
 
0.2%
5
 
0.1%
3
 
0.1%
2
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1390
 
3.4%
1099
 
2.7%
841
 
2.0%
821
 
2.0%
766
 
1.9%
762
 
1.9%
744
 
1.8%
730
 
1.8%
722
 
1.8%
594
 
1.4%
Other values (678) 32602
79.4%
Punctuation
ValueCountFrequency (%)
129
70.1%
28
 
15.2%
27
 
14.7%
CJK
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%
Distinct2075
Distinct (%)20.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T15:23:39.232892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters100000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique249 ?
Unique (%)2.5%

Sample

1st row2002-10-24
2nd row1999-03-18
3rd row1999-03-27
4th row2003-12-05
5th row2003-09-17
ValueCountFrequency (%)
1999-09-04 20
 
0.2%
2000-05-27 19
 
0.2%
2002-09-28 19
 
0.2%
2002-09-07 19
 
0.2%
2000-11-18 19
 
0.2%
2002-05-04 17
 
0.2%
2002-10-05 16
 
0.2%
2000-11-13 16
 
0.2%
2001-09-15 15
 
0.1%
2000-09-23 15
 
0.1%
Other values (2065) 9825
98.2%
2023-12-12T15:23:39.803479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 30185
30.2%
- 20000
20.0%
2 15761
15.8%
1 12724
12.7%
9 7178
 
7.2%
3 3579
 
3.6%
4 3168
 
3.2%
5 2082
 
2.1%
6 1849
 
1.8%
8 1789
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
80.0%
Dash Punctuation 20000
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 30185
37.7%
2 15761
19.7%
1 12724
15.9%
9 7178
 
9.0%
3 3579
 
4.5%
4 3168
 
4.0%
5 2082
 
2.6%
6 1849
 
2.3%
8 1789
 
2.2%
7 1685
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 30185
30.2%
- 20000
20.0%
2 15761
15.8%
1 12724
12.7%
9 7178
 
7.2%
3 3579
 
3.6%
4 3168
 
3.2%
5 2082
 
2.1%
6 1849
 
1.8%
8 1789
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 30185
30.2%
- 20000
20.0%
2 15761
15.8%
1 12724
12.7%
9 7178
 
7.2%
3 3579
 
3.6%
4 3168
 
3.2%
5 2082
 
2.1%
6 1849
 
1.8%
8 1789
 
1.8%
Distinct2083
Distinct (%)20.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T15:23:40.247024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters100000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique258 ?
Unique (%)2.6%

Sample

1st row2002-10-24
2nd row1999-03-18
3rd row1999-03-27
4th row2003-12-05
5th row2003-09-17
ValueCountFrequency (%)
1999-09-04 21
 
0.2%
2000-05-27 19
 
0.2%
2002-09-28 19
 
0.2%
2000-11-18 19
 
0.2%
2000-10-01 16
 
0.2%
2000-11-13 16
 
0.2%
2000-10-22 16
 
0.2%
2002-09-07 16
 
0.2%
2001-11-02 15
 
0.1%
2002-04-28 15
 
0.1%
Other values (2073) 9828
98.3%
2023-12-12T15:23:40.734091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 30067
30.1%
- 20000
20.0%
2 15821
15.8%
1 12747
12.7%
9 7139
 
7.1%
3 3661
 
3.7%
4 3114
 
3.1%
5 2057
 
2.1%
6 1883
 
1.9%
8 1841
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
80.0%
Dash Punctuation 20000
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 30067
37.6%
2 15821
19.8%
1 12747
15.9%
9 7139
 
8.9%
3 3661
 
4.6%
4 3114
 
3.9%
5 2057
 
2.6%
6 1883
 
2.4%
8 1841
 
2.3%
7 1670
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 20000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 30067
30.1%
- 20000
20.0%
2 15821
15.8%
1 12747
12.7%
9 7139
 
7.1%
3 3661
 
3.7%
4 3114
 
3.1%
5 2057
 
2.1%
6 1883
 
1.9%
8 1841
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 30067
30.1%
- 20000
20.0%
2 15821
15.8%
1 12747
12.7%
9 7139
 
7.1%
3 3661
 
3.7%
4 3114
 
3.1%
5 2057
 
2.1%
6 1883
 
1.9%
8 1841
 
1.8%

공연횟수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct9
Distinct (%)6.9%
Missing9869
Missing (%)98.7%
Infinite0
Infinite (%)0.0%
Mean1.9389313
Minimum1
Maximum23
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:23:40.885144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile5
Maximum23
Range22
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.6333825
Coefficient of variation (CV)1.3581618
Kurtosis41.375876
Mean1.9389313
Median Absolute Deviation (MAD)0
Skewness5.9369116
Sum254
Variance6.9347035
MonotonicityNot monotonic
2023-12-12T15:23:41.020021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 86
 
0.9%
2 22
 
0.2%
3 13
 
0.1%
5 4
 
< 0.1%
7 2
 
< 0.1%
23 1
 
< 0.1%
6 1
 
< 0.1%
4 1
 
< 0.1%
18 1
 
< 0.1%
(Missing) 9869
98.7%
ValueCountFrequency (%)
1 86
0.9%
2 22
 
0.2%
3 13
 
0.1%
4 1
 
< 0.1%
5 4
 
< 0.1%
6 1
 
< 0.1%
7 2
 
< 0.1%
18 1
 
< 0.1%
23 1
 
< 0.1%
ValueCountFrequency (%)
23 1
 
< 0.1%
18 1
 
< 0.1%
7 2
 
< 0.1%
6 1
 
< 0.1%
5 4
 
< 0.1%
4 1
 
< 0.1%
3 13
 
0.1%
2 22
 
0.2%
1 86
0.9%

공연기간
Real number (ℝ)

HIGH CORRELATION 

Distinct117
Distinct (%)1.2%
Missing37
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean3.0853157
Minimum-359
Maximum366
Zeros0
Zeros (%)0.0%
Negative14
Negative (%)0.1%
Memory size166.0 KiB
2023-12-12T15:23:41.160300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-359
5-th percentile1
Q11
median1
Q31
95-th percentile10
Maximum366
Range725
Interquartile range (IQR)0

Descriptive statistics

Standard deviation17.147568
Coefficient of variation (CV)5.5578003
Kurtosis222.06909
Mean3.0853157
Median Absolute Deviation (MAD)0
Skewness3.7301042
Sum30739
Variance294.0391
MonotonicityNot monotonic
2023-12-12T15:23:41.315419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 8185
81.8%
2 637
 
6.4%
3 243
 
2.4%
4 128
 
1.3%
6 80
 
0.8%
5 79
 
0.8%
10 51
 
0.5%
9 31
 
0.3%
13 30
 
0.3%
30 24
 
0.2%
Other values (107) 475
 
4.8%
(Missing) 37
 
0.4%
ValueCountFrequency (%)
-359 1
< 0.1%
-331 1
< 0.1%
-327 1
< 0.1%
-313 1
< 0.1%
-305 1
< 0.1%
-304 1
< 0.1%
-300 1
< 0.1%
-213 1
< 0.1%
-28 1
< 0.1%
-21 1
< 0.1%
ValueCountFrequency (%)
366 1
< 0.1%
359 1
< 0.1%
356 1
< 0.1%
298 1
< 0.1%
281 1
< 0.1%
267 1
< 0.1%
260 2
< 0.1%
259 1
< 0.1%
248 1
< 0.1%
246 1
< 0.1%

주최
Text

MISSING 

Distinct325
Distinct (%)48.7%
Missing9333
Missing (%)93.3%
Memory size156.2 KiB
2023-12-12T15:23:41.638535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length152
Median length35
Mean length8.041979
Min length2

Characters and Unicode

Total characters5364
Distinct characters354
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique249 ?
Unique (%)37.3%

Sample

1st row흥덕문화의 집
2nd row국립오페라단
3rd row강원도립국악관현악단
4th row품리프러덕션
5th row한국문화재보호재단
ValueCountFrequency (%)
전주 53
 
5.1%
국립국악원 52
 
5.0%
전통문화센터 47
 
4.5%
한국문화재보호재단 25
 
2.4%
음연 20
 
1.9%
한국국악협회 14
 
1.3%
전주세계소리축제 14
 
1.3%
오페라단 13
 
1.2%
조직위원회 13
 
1.2%
상설무대 12
 
1.2%
Other values (484) 777
74.7%
2023-12-12T15:23:42.086233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
373
 
7.0%
266
 
5.0%
186
 
3.5%
163
 
3.0%
147
 
2.7%
143
 
2.7%
142
 
2.6%
125
 
2.3%
123
 
2.3%
109
 
2.0%
Other values (344) 3587
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4720
88.0%
Space Separator 373
 
7.0%
Decimal Number 103
 
1.9%
Other Punctuation 58
 
1.1%
Uppercase Letter 49
 
0.9%
Open Punctuation 19
 
0.4%
Close Punctuation 19
 
0.4%
Lowercase Letter 9
 
0.2%
Dash Punctuation 7
 
0.1%
Math Symbol 3
 
0.1%
Other values (2) 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
266
 
5.6%
186
 
3.9%
163
 
3.5%
147
 
3.1%
143
 
3.0%
142
 
3.0%
125
 
2.6%
123
 
2.6%
109
 
2.3%
98
 
2.1%
Other values (302) 3218
68.2%
Uppercase Letter
ValueCountFrequency (%)
S 11
22.4%
B 10
20.4%
M 8
16.3%
K 7
14.3%
C 5
10.2%
P 2
 
4.1%
A 1
 
2.0%
Y 1
 
2.0%
J 1
 
2.0%
T 1
 
2.0%
Other values (2) 2
 
4.1%
Decimal Number
ValueCountFrequency (%)
0 48
46.6%
2 26
25.2%
4 13
 
12.6%
3 7
 
6.8%
1 6
 
5.8%
5 2
 
1.9%
9 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 39
67.2%
" 9
 
15.5%
& 4
 
6.9%
/ 3
 
5.2%
' 2
 
3.4%
. 1
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
u 3
33.3%
i 2
22.2%
e 1
 
11.1%
p 1
 
11.1%
o 1
 
11.1%
r 1
 
11.1%
Math Symbol
ValueCountFrequency (%)
1
33.3%
1
33.3%
+ 1
33.3%
Open Punctuation
ValueCountFrequency (%)
18
94.7%
( 1
 
5.3%
Close Punctuation
ValueCountFrequency (%)
17
89.5%
) 2
 
10.5%
Space Separator
ValueCountFrequency (%)
373
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4718
88.0%
Common 586
 
10.9%
Latin 58
 
1.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
266
 
5.6%
186
 
3.9%
163
 
3.5%
147
 
3.1%
143
 
3.0%
142
 
3.0%
125
 
2.6%
123
 
2.6%
109
 
2.3%
98
 
2.1%
Other values (300) 3216
68.2%
Common
ValueCountFrequency (%)
373
63.7%
0 48
 
8.2%
, 39
 
6.7%
2 26
 
4.4%
18
 
3.1%
17
 
2.9%
4 13
 
2.2%
" 9
 
1.5%
3 7
 
1.2%
- 7
 
1.2%
Other values (14) 29
 
4.9%
Latin
ValueCountFrequency (%)
S 11
19.0%
B 10
17.2%
M 8
13.8%
K 7
12.1%
C 5
8.6%
u 3
 
5.2%
P 2
 
3.4%
i 2
 
3.4%
A 1
 
1.7%
Y 1
 
1.7%
Other values (8) 8
13.8%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4718
88.0%
ASCII 603
 
11.2%
None 37
 
0.7%
Punctuation 4
 
0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
373
61.9%
0 48
 
8.0%
, 39
 
6.5%
2 26
 
4.3%
4 13
 
2.2%
S 11
 
1.8%
B 10
 
1.7%
" 9
 
1.5%
M 8
 
1.3%
3 7
 
1.2%
Other values (26) 59
 
9.8%
Hangul
ValueCountFrequency (%)
266
 
5.6%
186
 
3.9%
163
 
3.5%
147
 
3.1%
143
 
3.0%
142
 
3.0%
125
 
2.6%
123
 
2.6%
109
 
2.3%
98
 
2.1%
Other values (300) 3216
68.2%
None
ValueCountFrequency (%)
18
48.6%
17
45.9%
1
 
2.7%
1
 
2.7%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

지역1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

지역2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

장소
Text

Distinct3077
Distinct (%)31.0%
Missing74
Missing (%)0.7%
Memory size156.2 KiB
2023-12-12T15:23:42.800912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length133
Median length57
Mean length9.7650615
Min length2

Characters and Unicode

Total characters96928
Distinct characters576
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2194 ?
Unique (%)22.1%

Sample

1st row부산 금곡청소년수련원
2nd row대구시 민회관 대강당
3rd row예술의전당 리사이틀홀
4th row오퍼스홀
5th row홍덕문화의 집
ValueCountFrequency (%)
예술의전당 1016
 
5.1%
대극장 747
 
3.8%
소극장 673
 
3.4%
대공연장 541
 
2.7%
리사이틀홀 458
 
2.3%
국립국악원 436
 
2.2%
콘서트홀 397
 
2.0%
세종문화회관 352
 
1.8%
대강당 340
 
1.7%
부산문화회관 286
 
1.4%
Other values (3085) 14562
73.5%
2023-12-12T15:23:43.438880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9893
 
10.2%
3943
 
4.1%
3709
 
3.8%
3558
 
3.7%
3550
 
3.7%
3542
 
3.7%
3082
 
3.2%
2817
 
2.9%
2528
 
2.6%
2522
 
2.6%
Other values (566) 57784
59.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85640
88.4%
Space Separator 9893
 
10.2%
Uppercase Letter 544
 
0.6%
Decimal Number 465
 
0.5%
Other Punctuation 246
 
0.3%
Dash Punctuation 49
 
0.1%
Close Punctuation 35
 
< 0.1%
Open Punctuation 35
 
< 0.1%
Lowercase Letter 11
 
< 0.1%
Initial Punctuation 3
 
< 0.1%
Other values (3) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3943
 
4.6%
3709
 
4.3%
3558
 
4.2%
3550
 
4.1%
3542
 
4.1%
3082
 
3.6%
2817
 
3.3%
2528
 
3.0%
2522
 
2.9%
2439
 
2.8%
Other values (514) 53950
63.0%
Uppercase Letter
ValueCountFrequency (%)
B 128
23.5%
K 89
16.4%
S 89
16.4%
L 53
9.7%
C 52
9.6%
G 52
9.6%
M 47
 
8.6%
O 7
 
1.3%
A 7
 
1.3%
U 5
 
0.9%
Other values (8) 15
 
2.8%
Decimal Number
ValueCountFrequency (%)
1 102
21.9%
2 97
20.9%
0 72
15.5%
3 47
10.1%
5 40
 
8.6%
8 36
 
7.7%
4 25
 
5.4%
7 21
 
4.5%
6 14
 
3.0%
9 11
 
2.4%
Lowercase Letter
ValueCountFrequency (%)
c 2
18.2%
a 2
18.2%
e 2
18.2%
t 1
9.1%
p 1
9.1%
m 1
9.1%
h 1
9.1%
r 1
9.1%
Other Punctuation
ValueCountFrequency (%)
, 107
43.5%
. 64
26.0%
/ 43
17.5%
: 18
 
7.3%
11
 
4.5%
' 3
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 48
98.0%
1
 
2.0%
Open Punctuation
ValueCountFrequency (%)
34
97.1%
( 1
 
2.9%
Space Separator
ValueCountFrequency (%)
9893
100.0%
Close Punctuation
ValueCountFrequency (%)
35
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Control
ValueCountFrequency (%)
3
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85641
88.4%
Common 10732
 
11.1%
Latin 555
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3943
 
4.6%
3709
 
4.3%
3558
 
4.2%
3550
 
4.1%
3542
 
4.1%
3082
 
3.6%
2817
 
3.3%
2528
 
3.0%
2522
 
2.9%
2439
 
2.8%
Other values (515) 53951
63.0%
Latin
ValueCountFrequency (%)
B 128
23.1%
K 89
16.0%
S 89
16.0%
L 53
9.5%
C 52
9.4%
G 52
9.4%
M 47
 
8.5%
O 7
 
1.3%
A 7
 
1.3%
U 5
 
0.9%
Other values (16) 26
 
4.7%
Common
ValueCountFrequency (%)
9893
92.2%
, 107
 
1.0%
1 102
 
1.0%
2 97
 
0.9%
0 72
 
0.7%
. 64
 
0.6%
- 48
 
0.4%
3 47
 
0.4%
/ 43
 
0.4%
5 40
 
0.4%
Other values (15) 219
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85640
88.4%
ASCII 11200
 
11.6%
None 71
 
0.1%
Punctuation 17
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9893
88.3%
B 128
 
1.1%
, 107
 
1.0%
1 102
 
0.9%
2 97
 
0.9%
K 89
 
0.8%
S 89
 
0.8%
0 72
 
0.6%
. 64
 
0.6%
L 53
 
0.5%
Other values (35) 506
 
4.5%
Hangul
ValueCountFrequency (%)
3943
 
4.6%
3709
 
4.3%
3558
 
4.2%
3550
 
4.1%
3542
 
4.1%
3082
 
3.6%
2817
 
3.3%
2528
 
3.0%
2522
 
2.9%
2439
 
2.8%
Other values (514) 53950
63.0%
None
ValueCountFrequency (%)
35
49.3%
34
47.9%
1
 
1.4%
1
 
1.4%
Punctuation
ValueCountFrequency (%)
11
64.7%
3
 
17.6%
3
 
17.6%

비고
Text

MISSING 

Distinct5960
Distinct (%)86.5%
Missing3113
Missing (%)31.1%
Memory size156.2 KiB
2023-12-12T15:23:43.846546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length993
Median length244
Mean length29.100479
Min length1

Characters and Unicode

Total characters200415
Distinct characters1219
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5560 ?
Unique (%)80.7%

Sample

1st row호소가와 <방의 소리>, 슈톡하우젠 <피아노곡 11>, 료노디 <즉홍연주 1>, 림 <피아노곡 4>, 메시앙 <‘아기예수를 바라보는 20개의 시선’ 중에서> 외
2nd row권재은의 ‘서도소리 그 내력을 따라’
3rd row신애정(피) 김태형(플)
4th row슈베르트 <바위의 목동>, 베르디 <오페라 ‘해적’ 중 ‘저를 구해주세요’>, 바흐 <영국 모음곡 제4번>, 모차르트 <황혼의 느낌>, <루이세의 사랑의 편지>, <클로에를 위한 노래>, 슈만 <유모레스크, op. 20> 외
5th row<sally garden> <ave maria> 등
ValueCountFrequency (%)
1712
 
4.1%
986
 
2.3%
관객수 617
 
1.5%
524
 
1.2%
위한 417
 
1.0%
369
 
0.9%
op 367
 
0.9%
지휘 236
 
0.6%
베토벤 212
 
0.5%
모차르트 210
 
0.5%
Other values (17881) 36503
86.6%
2023-12-12T15:23:44.405966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35271
 
17.6%
, 10521
 
5.2%
3795
 
1.9%
3775
 
1.9%
2681
 
1.3%
2659
 
1.3%
2650
 
1.3%
2263
 
1.1%
2140
 
1.1%
1934
 
1.0%
Other values (1209) 132726
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 121287
60.5%
Space Separator 35271
 
17.6%
Other Punctuation 13697
 
6.8%
Math Symbol 7756
 
3.9%
Decimal Number 7107
 
3.5%
Lowercase Letter 3689
 
1.8%
Close Punctuation 3382
 
1.7%
Open Punctuation 3375
 
1.7%
Uppercase Letter 1575
 
0.8%
Final Punctuation 1278
 
0.6%
Other values (6) 1998
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2659
 
2.2%
2263
 
1.9%
2140
 
1.8%
1934
 
1.6%
1881
 
1.6%
1753
 
1.4%
1749
 
1.4%
1622
 
1.3%
1505
 
1.2%
1470
 
1.2%
Other values (1094) 102311
84.4%
Lowercase Letter
ValueCountFrequency (%)
o 672
18.2%
p 446
12.1%
a 327
8.9%
e 293
 
7.9%
n 255
 
6.9%
i 236
 
6.4%
r 228
 
6.2%
t 180
 
4.9%
s 136
 
3.7%
b 115
 
3.1%
Other values (16) 801
21.7%
Uppercase Letter
ValueCountFrequency (%)
D 149
 
9.5%
B 148
 
9.4%
C 143
 
9.1%
A 115
 
7.3%
K 113
 
7.2%
S 109
 
6.9%
F 102
 
6.5%
E 97
 
6.2%
G 85
 
5.4%
O 68
 
4.3%
Other values (16) 446
28.3%
Other Punctuation
ValueCountFrequency (%)
, 10521
76.8%
. 1517
 
11.1%
: 1064
 
7.8%
/ 234
 
1.7%
213
 
1.6%
' 40
 
0.3%
20
 
0.1%
! 14
 
0.1%
14
 
0.1%
13
 
0.1%
Other values (8) 47
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 1301
18.3%
1 1248
17.6%
0 1119
15.7%
3 714
10.0%
5 668
9.4%
4 586
8.2%
6 404
 
5.7%
8 383
 
5.4%
9 347
 
4.9%
7 337
 
4.7%
Math Symbol
ValueCountFrequency (%)
3795
48.9%
3775
48.7%
> 93
 
1.2%
< 80
 
1.0%
+ 5
 
0.1%
~ 4
 
0.1%
4
 
0.1%
Open Punctuation
ValueCountFrequency (%)
2650
78.5%
402
 
11.9%
226
 
6.7%
( 92
 
2.7%
3
 
0.1%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
2681
79.3%
401
 
11.9%
225
 
6.7%
) 72
 
2.1%
3
 
0.1%
Letter Number
ValueCountFrequency (%)
7
53.8%
3
23.1%
2
 
15.4%
1
 
7.7%
Dash Punctuation
ValueCountFrequency (%)
- 591
97.7%
9
 
1.5%
5
 
0.8%
Final Punctuation
ValueCountFrequency (%)
1274
99.7%
4
 
0.3%
Initial Punctuation
ValueCountFrequency (%)
1237
99.8%
2
 
0.2%
Control
ValueCountFrequency (%)
137
99.3%
1
 
0.7%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
35271
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 121155
60.5%
Common 73851
36.8%
Latin 5277
 
2.6%
Han 132
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2659
 
2.2%
2263
 
1.9%
2140
 
1.8%
1934
 
1.6%
1881
 
1.6%
1753
 
1.4%
1749
 
1.4%
1622
 
1.3%
1505
 
1.2%
1470
 
1.2%
Other values (986) 102179
84.3%
Han
ValueCountFrequency (%)
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (98) 106
80.3%
Common
ValueCountFrequency (%)
35271
47.8%
, 10521
 
14.2%
3795
 
5.1%
3775
 
5.1%
2681
 
3.6%
2650
 
3.6%
. 1517
 
2.1%
2 1301
 
1.8%
1274
 
1.7%
1 1248
 
1.7%
Other values (49) 9818
 
13.3%
Latin
ValueCountFrequency (%)
o 672
 
12.7%
p 446
 
8.5%
a 327
 
6.2%
e 293
 
5.6%
n 255
 
4.8%
i 236
 
4.5%
r 228
 
4.3%
t 180
 
3.4%
D 149
 
2.8%
B 148
 
2.8%
Other values (46) 2343
44.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 121155
60.5%
ASCII 62153
31.0%
None 14208
 
7.1%
Punctuation 2748
 
1.4%
CJK 132
 
0.1%
Number Forms 13
 
< 0.1%
Arrows 4
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35271
56.7%
, 10521
 
16.9%
. 1517
 
2.4%
2 1301
 
2.1%
1 1248
 
2.0%
0 1119
 
1.8%
: 1064
 
1.7%
3 714
 
1.1%
o 672
 
1.1%
5 668
 
1.1%
Other values (76) 8058
 
13.0%
None
ValueCountFrequency (%)
3795
26.7%
3775
26.6%
2681
18.9%
2650
18.7%
402
 
2.8%
401
 
2.8%
226
 
1.6%
225
 
1.6%
20
 
0.1%
14
 
0.1%
Other values (5) 19
 
0.1%
Hangul
ValueCountFrequency (%)
2659
 
2.2%
2263
 
1.9%
2140
 
1.8%
1934
 
1.6%
1881
 
1.6%
1753
 
1.4%
1749
 
1.4%
1622
 
1.3%
1505
 
1.2%
1470
 
1.2%
Other values (986) 102179
84.3%
Punctuation
ValueCountFrequency (%)
1274
46.4%
1237
45.0%
213
 
7.8%
13
 
0.5%
5
 
0.2%
4
 
0.1%
2
 
0.1%
Number Forms
ValueCountFrequency (%)
7
53.8%
3
23.1%
2
 
15.4%
1
 
7.7%
CJK
ValueCountFrequency (%)
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (98) 106
80.3%
Arrows
ValueCountFrequency (%)
4
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%


Text

MISSING 

Distinct975
Distinct (%)85.6%
Missing8861
Missing (%)88.6%
Memory size156.2 KiB
2023-12-12T15:23:44.812645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length31
Mean length8.5443371
Min length2

Characters and Unicode

Total characters9732
Distinct characters458
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique857 ?
Unique (%)75.2%

Sample

1st row오은희 작/배해일 연출
2nd row윤형섭, 성준현
3rd row엄인희, 강영걸
4th row유수민 외 출연
5th row몰리에르 작/김정옥 연출
ValueCountFrequency (%)
연출 175
 
6.9%
작/연출 33
 
1.3%
이윤택 32
 
1.3%
29
 
1.1%
공동창작 26
 
1.0%
셰익스피어 24
 
1.0%
박근형 23
 
0.9%
오태석 18
 
0.7%
장진 16
 
0.6%
이만희 13
 
0.5%
Other values (1225) 2134
84.6%
2023-12-12T15:23:45.409109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1386
 
14.2%
, 680
 
7.0%
370
 
3.8%
306
 
3.1%
260
 
2.7%
240
 
2.5%
238
 
2.4%
/ 206
 
2.1%
150
 
1.5%
148
 
1.5%
Other values (448) 5748
59.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7190
73.9%
Space Separator 1386
 
14.2%
Other Punctuation 911
 
9.4%
Lowercase Letter 179
 
1.8%
Uppercase Letter 44
 
0.5%
Decimal Number 8
 
0.1%
Close Punctuation 7
 
0.1%
Open Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
370
 
5.1%
306
 
4.3%
260
 
3.6%
240
 
3.3%
238
 
3.3%
150
 
2.1%
148
 
2.1%
124
 
1.7%
114
 
1.6%
104
 
1.4%
Other values (393) 5136
71.4%
Lowercase Letter
ValueCountFrequency (%)
a 22
12.3%
e 21
11.7%
r 19
10.6%
n 19
10.6%
o 17
9.5%
t 13
 
7.3%
i 11
 
6.1%
l 7
 
3.9%
g 6
 
3.4%
v 6
 
3.4%
Other values (13) 38
21.2%
Uppercase Letter
ValueCountFrequency (%)
S 6
13.6%
R 5
11.4%
T 5
11.4%
E 5
11.4%
J 4
9.1%
L 3
 
6.8%
B 2
 
4.5%
K 2
 
4.5%
G 2
 
4.5%
W 2
 
4.5%
Other values (6) 8
18.2%
Decimal Number
ValueCountFrequency (%)
2 2
25.0%
7 2
25.0%
6 1
12.5%
8 1
12.5%
1 1
12.5%
3 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 680
74.6%
/ 206
 
22.6%
15
 
1.6%
. 9
 
1.0%
' 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
5
71.4%
) 2
 
28.6%
Open Punctuation
ValueCountFrequency (%)
5
71.4%
( 2
 
28.6%
Space Separator
ValueCountFrequency (%)
1386
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7184
73.8%
Common 2319
 
23.8%
Latin 223
 
2.3%
Han 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
370
 
5.2%
306
 
4.3%
260
 
3.6%
240
 
3.3%
238
 
3.3%
150
 
2.1%
148
 
2.1%
124
 
1.7%
114
 
1.6%
104
 
1.4%
Other values (387) 5130
71.4%
Latin
ValueCountFrequency (%)
a 22
 
9.9%
e 21
 
9.4%
r 19
 
8.5%
n 19
 
8.5%
o 17
 
7.6%
t 13
 
5.8%
i 11
 
4.9%
l 7
 
3.1%
S 6
 
2.7%
g 6
 
2.7%
Other values (29) 82
36.8%
Common
ValueCountFrequency (%)
1386
59.8%
, 680
29.3%
/ 206
 
8.9%
15
 
0.6%
. 9
 
0.4%
5
 
0.2%
5
 
0.2%
) 2
 
0.1%
2 2
 
0.1%
7 2
 
0.1%
Other values (6) 7
 
0.3%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7184
73.8%
ASCII 2517
 
25.9%
Punctuation 15
 
0.2%
None 10
 
0.1%
CJK 6
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1386
55.1%
, 680
27.0%
/ 206
 
8.2%
a 22
 
0.9%
e 21
 
0.8%
r 19
 
0.8%
n 19
 
0.8%
o 17
 
0.7%
t 13
 
0.5%
i 11
 
0.4%
Other values (42) 123
 
4.9%
Hangul
ValueCountFrequency (%)
370
 
5.2%
306
 
4.3%
260
 
3.6%
240
 
3.3%
238
 
3.3%
150
 
2.1%
148
 
2.1%
124
 
1.7%
114
 
1.6%
104
 
1.4%
Other values (387) 5130
71.4%
Punctuation
ValueCountFrequency (%)
15
100.0%
None
ValueCountFrequency (%)
5
50.0%
5
50.0%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

장르
Text

MISSING 

Distinct67
Distinct (%)3.4%
Missing8020
Missing (%)80.2%
Memory size156.2 KiB
2023-12-12T15:23:45.622498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length2
Mean length2.7934343
Min length1

Characters and Unicode

Total characters5531
Distinct characters71
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)1.2%

Sample

1st row혼합
2nd row전통
3rd row현대무용
4th row전통, 판소리
5th row창작
ValueCountFrequency (%)
혼합 461
21.8%
전통 370
17.5%
창작 228
10.8%
한국무용 169
 
8.0%
현대무용 111
 
5.3%
발레 104
 
4.9%
종합 101
 
4.8%
전통무용 62
 
2.9%
타악 53
 
2.5%
극음악 50
 
2.4%
Other values (41) 402
19.0%
2023-12-12T15:23:45.967983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
562
 
10.2%
468
 
8.5%
468
 
8.5%
461
 
8.3%
393
 
7.1%
393
 
7.1%
295
 
5.3%
294
 
5.3%
231
 
4.2%
223
 
4.0%
Other values (61) 1743
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5272
95.3%
Space Separator 132
 
2.4%
Other Punctuation 127
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
562
 
10.7%
468
 
8.9%
468
 
8.9%
461
 
8.7%
393
 
7.5%
393
 
7.5%
295
 
5.6%
294
 
5.6%
231
 
4.4%
223
 
4.2%
Other values (58) 1484
28.1%
Other Punctuation
ValueCountFrequency (%)
, 126
99.2%
/ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
132
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5272
95.3%
Common 259
 
4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
562
 
10.7%
468
 
8.9%
468
 
8.9%
461
 
8.7%
393
 
7.5%
393
 
7.5%
295
 
5.6%
294
 
5.6%
231
 
4.4%
223
 
4.2%
Other values (58) 1484
28.1%
Common
ValueCountFrequency (%)
132
51.0%
, 126
48.6%
/ 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5272
95.3%
ASCII 259
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
562
 
10.7%
468
 
8.9%
468
 
8.9%
461
 
8.7%
393
 
7.5%
393
 
7.5%
295
 
5.6%
294
 
5.6%
231
 
4.4%
223
 
4.2%
Other values (58) 1484
28.1%
ASCII
ValueCountFrequency (%)
132
51.0%
, 126
48.6%
/ 1
 
0.4%

Interactions

2023-12-12T15:23:35.205724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:33.716410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.253631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.727106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:35.305109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:33.857715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.369552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.866780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:35.431672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:33.985595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.492538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:35.001014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:35.537091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.112217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:34.608140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:23:35.094809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:23:46.100010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연도공연횟수공연기간장르
연번1.0000.934NaN0.1210.775
연도0.9341.000NaN0.0250.704
공연횟수NaNNaN1.0000.7600.000
공연기간0.1210.0250.7601.0000.204
장르0.7750.7040.0000.2041.000
2023-12-12T15:23:46.233806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연도공연횟수공연기간
연번1.0000.9850.2070.028
연도0.9851.000NaN-0.005
공연횟수0.207NaN1.0000.920
공연기간0.028-0.0050.9201.000

Missing values

2023-12-12T15:23:35.723406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:23:35.997595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:23:36.263437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번연도공연명단체명시작일종료일공연횟수공연기간주최지역1지역2장소비고장르
374723747320022002 부산합창올림픽 만남의 콘서트 - 청소년합창단 글로리아<NA>2002-10-242002-10-24<NA>1<NA><NA><NA>부산 금곡청소년수련원<NA><NA><NA>
426242631999대구시립교향악단 가정음악회<NA>1999-03-181999-03-18<NA>1<NA><NA><NA>대구시 민회관 대강당<NA><NA><NA>
319131921999청소년 음악회<NA>1999-03-271999-03-27<NA>1<NA><NA><NA>예술의전당 리사이틀홀<NA><NA><NA>
43215432162003장세정 현대 피아노음악 콘서트양영욱(색소폰)2003-12-052003-12-05<NA>1<NA><NA><NA>오퍼스홀호소가와 <방의 소리>, 슈톡하우젠 <피아노곡 11>, 료노디 <즉홍연주 1>, 림 <피아노곡 4>, 메시앙 <‘아기예수를 바라보는 20개의 시선’ 중에서> 외<NA><NA>
39928399292003해설이 있는 전통예술로의 여행<NA>2003-09-172003-09-17<NA>1흥덕문화의 집<NA><NA>홍덕문화의 집권재은의 ‘서도소리 그 내력을 따라’<NA><NA>
25164251652001메조소프라노 이지선 독창회<NA>2001-08-162001-08-16<NA>1<NA><NA><NA>연강홀신애정(피) 김태형(플)<NA><NA>
41829418302003잉에 로자, 신선미 두 오 리사이틀잉에 로자(피아노), 신선미(소프라노)2003-08-152003-08-15<NA>1<NA><NA><NA>금호리사이트홀슈베르트 <바위의 목동>, 베르디 <오페라 ‘해적’ 중 ‘저를 구해주세요’>, 바흐 <영국 모음곡 제4번>, 모차르트 <황혼의 느낌>, <루이세의 사랑의 편지>, <클로에를 위한 노래>, 슈만 <유모레스크, op. 20> 외<NA><NA>
24316243172001조영방 피아노 독주회<NA>2001-05-172001-05-17<NA>1<NA><NA><NA>이원문화센터<NA><NA><NA>
51897518982004임형주 콘서트임형주2004-12-302004-12-31<NA>2<NA><NA><NA>광주문예회관 대극장<sally garden> <ave maria> 등<NA><NA>
36965369662002가족 오페라 <봄봄봄><NA>2002-11-162002-11-17<NA>2국립오페라단<NA><NA>군포시민회관 대공연장관객수 : 151<NA><NA>
연번연도공연명단체명시작일종료일공연횟수공연기간주최지역1지역2장소비고장르
17483174842000한양대 음대 콘서트콰이어 연주회<NA>2000-11-082000-11-08<NA>1<NA><NA><NA>울산문예회관 대공연장<NA><NA><NA>
26551265522001한국현대음악작곡연구회 제4회 작곡 발표회<NA>2001-06-052001-06-05<NA>1<NA><NA><NA>국립극장 달오름극장홍순만 안정모 전인평 정명혜 윤해종 정부기 김영식 (작곡)<NA><NA>
17657176582000쏠티와 함께샬롬 노래선교단2000-01-132000-01-13<NA>1<NA><NA><NA>광주문예회관 대극장<NA><NA><NA>
29580295812002남원시립국악단 상설공연남원시립국악단2002-09-142002-09-14<NA>1<NA><NA><NA>춘향멀티프라자가야금병창 <방아타령>, 살풀이, 남도민요<성주풀이, 남원산성, 진도아리랑>, 바람의 유희 외<NA>혼합
51376513772004세라믹 팔레스홀 개관 1주년 기념 뮤직페스티벌-니콜라스 추마 첸코 바이올린 독주회니콜라스 추마첸코(바이올린)2004-10-122004-10-12<NA>1세라믹팔레스홀<NA><NA>세라믹 팔레스홀<NA><NA><NA>
43044430452003안명주 플루트 독주회<NA>2003-10-242003-10-24<NA>1<NA><NA><NA>서울 예술의전당 리사이틀홀<NA><NA><NA>
34985349862002이진이 피아노 독주회<NA>2002-05-312002-05-31<NA>1<NA><NA><NA>예술의전당 리사이틀홀관객수 : 240<NA><NA>
7557561999전주시립예술단 제 87회 정기연주회<NA>1999-11-151999-11-15<NA>1<NA><NA><NA>전주덕진예술회관전주시립예술단<NA><NA>
46026460272003유종선발레2003<NA>2003-10-042003-10-0421<NA><NA><NA>춘천문화예술회관<졸업무도회>(안무 데이비드 리싱)<NA>발레
42601426022003김혜란 바이올린 독주회김세정 (피아노)2003-04-302003-04-30<NA>1<NA><NA><NA>서울 예술의전당 리사이틀홀쉬니트게, 슈만, 그리그<NA><NA>