Overview

Dataset statistics

Number of variables7
Number of observations188
Missing cells12
Missing cells (%)0.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.6 KiB
Average record size in memory57.7 B

Variable types

Text4
Categorical2
Numeric1

Dataset

Description국립중앙극장 공연예술자료 분류유형에 대한 데이터로 코드번호, 코드명, 코드전체명, 최상위코드명, 레벨 등의 정보를 제공합니다.
Author문화체육관광부 국립중앙극장
URLhttps://www.data.go.kr/data/15090167/fileData.do

Alerts

최상위코드 is highly overall correlated with 최상위코드명High correlation
최상위코드명 is highly overall correlated with 최상위코드High correlation
차상위전체코드명 has 11 (5.9%) missing valuesMissing
코드번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:43:45.275453
Analysis finished2023-12-12 13:43:45.958410
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

코드번호
Text

UNIQUE 

Distinct188
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T22:43:46.259335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length6.1702128
Min length2

Characters and Unicode

Total characters1160
Distinct characters27
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique188 ?
Unique (%)100.0%

Sample

1st rowROOT
2nd rowDn
3rd rowMs
4th rowPl
5th rowDo
ValueCountFrequency (%)
root 1
 
0.5%
ms0103 1
 
0.5%
ms010303 1
 
0.5%
dn0403 1
 
0.5%
dn0501 1
 
0.5%
dn0502 1
 
0.5%
dn0503 1
 
0.5%
dn9901 1
 
0.5%
dn9999 1
 
0.5%
ms0101 1
 
0.5%
Other values (178) 178
94.7%
2023-12-12T22:43:46.777660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 358
30.9%
1 136
 
11.7%
M 110
 
9.5%
s 110
 
9.5%
2 105
 
9.1%
9 60
 
5.2%
3 58
 
5.0%
D 38
 
3.3%
n 31
 
2.7%
5 20
 
1.7%
Other values (17) 134
 
11.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 782
67.4%
Uppercase Letter 191
 
16.5%
Lowercase Letter 187
 
16.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 358
45.8%
1 136
 
17.4%
2 105
 
13.4%
9 60
 
7.7%
3 58
 
7.4%
5 20
 
2.6%
6 18
 
2.3%
4 16
 
2.0%
7 6
 
0.8%
8 5
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
M 110
57.6%
D 38
 
19.9%
E 15
 
7.9%
P 14
 
7.3%
G 5
 
2.6%
S 5
 
2.6%
O 2
 
1.0%
T 1
 
0.5%
R 1
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
s 110
58.8%
n 31
 
16.6%
l 14
 
7.5%
r 11
 
5.9%
h 9
 
4.8%
d 8
 
4.3%
v 3
 
1.6%
o 1
 
0.5%

Most occurring scripts

ValueCountFrequency (%)
Common 782
67.4%
Latin 378
32.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 110
29.1%
s 110
29.1%
D 38
 
10.1%
n 31
 
8.2%
E 15
 
4.0%
P 14
 
3.7%
l 14
 
3.7%
r 11
 
2.9%
h 9
 
2.4%
d 8
 
2.1%
Other values (7) 18
 
4.8%
Common
ValueCountFrequency (%)
0 358
45.8%
1 136
 
17.4%
2 105
 
13.4%
9 60
 
7.7%
3 58
 
7.4%
5 20
 
2.6%
6 18
 
2.3%
4 16
 
2.0%
7 6
 
0.8%
8 5
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1160
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 358
30.9%
1 136
 
11.7%
M 110
 
9.5%
s 110
 
9.5%
2 105
 
9.1%
9 60
 
5.2%
3 58
 
5.0%
D 38
 
3.3%
n 31
 
2.7%
5 20
 
1.7%
Other values (17) 134
 
11.6%
Distinct151
Distinct (%)80.3%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T22:43:47.084936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length2.9946809
Min length1

Characters and Unicode

Total characters563
Distinct characters162
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique139 ?
Unique (%)73.9%

Sample

1st row분류체계
2nd row무용
3rd row음악
4th row연희
5th row기증
ValueCountFrequency (%)
기타 21
 
11.1%
종합 5
 
2.6%
듀엣 3
 
1.6%
트리오 3
 
1.6%
4중창 3
 
1.6%
관악기 2
 
1.1%
피아노 2
 
1.1%
4중주 2
 
1.1%
건반악기 2
 
1.1%
타악기 2
 
1.1%
Other values (141) 145
76.3%
2023-12-12T22:43:47.466929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
6.0%
33
 
5.9%
24
 
4.3%
20
 
3.6%
19
 
3.4%
18
 
3.2%
15
 
2.7%
12
 
2.1%
12
 
2.1%
12
 
2.1%
Other values (152) 364
64.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 554
98.4%
Decimal Number 5
 
0.9%
Space Separator 2
 
0.4%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
6.1%
33
 
6.0%
24
 
4.3%
20
 
3.6%
19
 
3.4%
18
 
3.2%
15
 
2.7%
12
 
2.2%
12
 
2.2%
12
 
2.2%
Other values (148) 355
64.1%
Decimal Number
ValueCountFrequency (%)
4 5
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 554
98.4%
Common 9
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
6.1%
33
 
6.0%
24
 
4.3%
20
 
3.6%
19
 
3.4%
18
 
3.2%
15
 
2.7%
12
 
2.2%
12
 
2.2%
12
 
2.2%
Other values (148) 355
64.1%
Common
ValueCountFrequency (%)
4 5
55.6%
2
 
22.2%
( 1
 
11.1%
) 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 554
98.4%
ASCII 9
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
6.1%
33
 
6.0%
24
 
4.3%
20
 
3.6%
19
 
3.4%
18
 
3.2%
15
 
2.7%
12
 
2.2%
12
 
2.2%
12
 
2.2%
Other values (148) 355
64.1%
ASCII
ValueCountFrequency (%)
4 5
55.6%
2
 
22.2%
( 1
 
11.1%
) 1
 
11.1%
Distinct187
Distinct (%)100.0%
Missing1
Missing (%)0.5%
Memory size1.6 KiB
2023-12-12T22:43:47.812753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length10.983957
Min length2

Characters and Unicode

Total characters2054
Distinct characters160
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique187 ?
Unique (%)100.0%

Sample

1st row무용
2nd row음악
3rd row연희
4th row기증
5th row연극
ValueCountFrequency (%)
음악 110
 
19.0%
서양성악 31
 
5.3%
기타 31
 
5.3%
무용 26
 
4.5%
서양기악 24
 
4.1%
중창 16
 
2.8%
연희 14
 
2.4%
한국기악 12
 
2.1%
한국무용 12
 
2.1%
연극 11
 
1.9%
Other values (140) 293
50.5%
2023-12-12T22:43:48.309892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
393
19.1%
241
 
11.7%
137
 
6.7%
86
 
4.2%
69
 
3.4%
68
 
3.3%
66
 
3.2%
65
 
3.2%
62
 
3.0%
60
 
2.9%
Other values (150) 807
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1654
80.5%
Space Separator 393
 
19.1%
Decimal Number 5
 
0.2%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
241
 
14.6%
137
 
8.3%
86
 
5.2%
69
 
4.2%
68
 
4.1%
66
 
4.0%
65
 
3.9%
62
 
3.7%
60
 
3.6%
59
 
3.6%
Other values (146) 741
44.8%
Space Separator
ValueCountFrequency (%)
393
100.0%
Decimal Number
ValueCountFrequency (%)
4 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1654
80.5%
Common 400
 
19.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
241
 
14.6%
137
 
8.3%
86
 
5.2%
69
 
4.2%
68
 
4.1%
66
 
4.0%
65
 
3.9%
62
 
3.7%
60
 
3.6%
59
 
3.6%
Other values (146) 741
44.8%
Common
ValueCountFrequency (%)
393
98.2%
4 5
 
1.2%
) 1
 
0.2%
( 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1654
80.5%
ASCII 400
 
19.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
393
98.2%
4 5
 
1.2%
) 1
 
0.2%
( 1
 
0.2%
Hangul
ValueCountFrequency (%)
241
 
14.6%
137
 
8.3%
86
 
5.2%
69
 
4.2%
68
 
4.1%
66
 
4.0%
65
 
3.9%
62
 
3.7%
60
 
3.6%
59
 
3.6%
Other values (146) 741
44.8%

최상위코드명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
음악
110 
무용
26 
연희
14 
연극
 
11
교육
 
8
Other values (6)
19 

Length

Max length4
Median length2
Mean length2.0638298
Min length2

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row<NA>
2nd row무용
3rd row음악
4th row연희
5th row기증

Common Values

ValueCountFrequency (%)
음악 110
58.5%
무용 26
 
13.8%
연희 14
 
7.4%
연극 11
 
5.9%
교육 8
 
4.3%
기타 5
 
2.7%
공연일반 5
 
2.7%
전시 4
 
2.1%
행사 3
 
1.6%
<NA> 1
 
0.5%

Length

2023-12-12T22:43:48.482891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
음악 110
58.5%
무용 26
 
13.8%
연희 14
 
7.4%
연극 11
 
5.9%
교육 8
 
4.3%
기타 5
 
2.7%
공연일반 5
 
2.7%
전시 4
 
2.1%
행사 3
 
1.6%
na 1
 
0.5%

최상위코드
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
Ms
110 
Dn
26 
Pl
14 
Dr
 
11
Ed
 
8
Other values (6)
19 

Length

Max length4
Median length2
Mean length2.0106383
Min length2

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row<NA>
2nd rowDn
3rd rowMs
4th rowPl
5th rowDo

Common Values

ValueCountFrequency (%)
Ms 110
58.5%
Dn 26
 
13.8%
Pl 14
 
7.4%
Dr 11
 
5.9%
Ed 8
 
4.3%
Gn 5
 
2.7%
Sh 5
 
2.7%
Eh 4
 
2.1%
Ev 3
 
1.6%
<NA> 1
 
0.5%

Length

2023-12-12T22:43:48.620168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ms 110
58.5%
dn 26
 
13.8%
pl 14
 
7.4%
dr 11
 
5.9%
ed 8
 
4.3%
gn 5
 
2.7%
sh 5
 
2.7%
eh 4
 
2.1%
ev 3
 
1.6%
na 1
 
0.5%
Distinct52
Distinct (%)29.4%
Missing11
Missing (%)5.9%
Memory size1.6 KiB
2023-12-12T22:43:48.841290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length7.4463277
Min length2

Characters and Unicode

Total characters1318
Distinct characters47
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)5.1%

Sample

1st row공연일반
2nd row기타
3rd row교육
4th row교육
5th row교육
ValueCountFrequency (%)
음악 109
27.9%
서양음악 30
 
7.7%
무용 25
 
6.4%
서양기악 23
 
5.9%
중창 15
 
3.8%
연희 13
 
3.3%
한국기악 11
 
2.8%
한국무용 11
 
2.8%
기타 10
 
2.6%
연극 10
 
2.6%
Other values (35) 134
34.3%
2023-12-12T22:43:49.269440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
214
16.2%
208
15.8%
157
11.9%
55
 
4.2%
55
 
4.2%
52
 
3.9%
50
 
3.8%
50
 
3.8%
50
 
3.8%
47
 
3.6%
Other values (37) 380
28.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1104
83.8%
Space Separator 214
 
16.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
208
18.8%
157
14.2%
55
 
5.0%
55
 
5.0%
52
 
4.7%
50
 
4.5%
50
 
4.5%
50
 
4.5%
47
 
4.3%
37
 
3.4%
Other values (36) 343
31.1%
Space Separator
ValueCountFrequency (%)
214
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1104
83.8%
Common 214
 
16.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
208
18.8%
157
14.2%
55
 
5.0%
55
 
5.0%
52
 
4.7%
50
 
4.5%
50
 
4.5%
50
 
4.5%
47
 
4.3%
37
 
3.4%
Other values (36) 343
31.1%
Common
ValueCountFrequency (%)
214
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1104
83.8%
ASCII 214
 
16.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
214
100.0%
Hangul
ValueCountFrequency (%)
208
18.8%
157
14.2%
55
 
5.0%
55
 
5.0%
52
 
4.7%
50
 
4.5%
50
 
4.5%
50
 
4.5%
47
 
4.3%
37
 
3.4%
Other values (36) 343
31.1%

레벨
Real number (ℝ)

Distinct6
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0744681
Minimum0
Maximum5
Zeros1
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T22:43:49.411608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.116137
Coefficient of variation (CV)0.36303417
Kurtosis-0.48633382
Mean3.0744681
Median Absolute Deviation (MAD)1
Skewness0.13169742
Sum578
Variance1.2457617
MonotonicityIncreasing
2023-12-12T22:43:49.543211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
3 70
37.2%
2 48
25.5%
4 33
17.6%
5 26
 
13.8%
1 10
 
5.3%
0 1
 
0.5%
ValueCountFrequency (%)
0 1
 
0.5%
1 10
 
5.3%
2 48
25.5%
3 70
37.2%
4 33
17.6%
5 26
 
13.8%
ValueCountFrequency (%)
5 26
 
13.8%
4 33
17.6%
3 70
37.2%
2 48
25.5%
1 10
 
5.3%
0 1
 
0.5%

Interactions

2023-12-12T22:43:45.575935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:43:49.644120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
최상위코드명최상위코드차상위전체코드명레벨
최상위코드명1.0001.0001.0000.685
최상위코드1.0001.0001.0000.685
차상위전체코드명1.0001.0001.0001.000
레벨0.6850.6851.0001.000
2023-12-12T22:43:49.764277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
최상위코드최상위코드명
최상위코드1.0001.000
최상위코드명1.0001.000
2023-12-12T22:43:49.856176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
레벨최상위코드명최상위코드
레벨1.0000.3470.347
최상위코드명0.3471.0001.000
최상위코드0.3471.0001.000

Missing values

2023-12-12T22:43:45.667286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:43:45.800446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:43:45.898604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

코드번호코드명코드전체명최상위코드명최상위코드차상위전체코드명레벨
0ROOT분류체계<NA><NA><NA><NA>0
1Dn무용무용무용Dn<NA>1
2Ms음악음악음악Ms<NA>1
3Pl연희연희연희Pl<NA>1
4Do기증기증기증Do<NA>1
5Dr연극연극연극Dr<NA>1
6Gn기타기타기타Gn<NA>1
7Ev행사행사행사Ev<NA>1
8Eh전시전시전시Eh<NA>1
9Ed교육교육교육Ed<NA>1
코드번호코드명코드전체명최상위코드명최상위코드차상위전체코드명레벨
178Ms02010301음악 서양기악 독주 타악기 북음악Ms음악 서양기악 독주 타악기5
179Ms02010201플롯음악 서양기악 독주 관악기 플롯음악Ms음악 서양기악 독주 관악기5
180Ms02010101바이올린음악 서양기악 독주 현악기 바이올린음악Ms음악 서양기악 독주 현악기5
181Ms01020399기타음악 서양성악 중창 혼성중창 기타음악Ms음악 서양음악 중창 혼성중창5
182Ms010203034중창음악 서양성악 중창 혼성중창 4중창음악Ms음악 서양음악 중창 혼성중창5
183Ms01020302트리오음악 서양성악 중창 혼성중창 트리오음악Ms음악 서양음악 중창 혼성중창5
184Ms01020301듀엣음악 서양성악 중창 혼성중창 듀엣음악Ms음악 서양음악 중창 혼성중창5
185Ms01020202트리오음악 서양성악 중창 여성중창 트리오음악Ms음악 서양음악 중창 여성중창5
186Ms010202034중창음악 서양성악 중창 여성중창 4중창음악Ms음악 서양음악 중창 여성중창5
187Ms01020299기타음악 서양성악 중창 여성중창 기타음악Ms음악 서양음악 중창 여성중창5