Overview

Dataset statistics

Number of variables5
Number of observations120
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory45.1 B

Variable types

Text1
Numeric4

Dataset

Description2014-2019년 문예진흥기금 공모사업 중 공연예술 분야 "올해의 신작" 지원 사업의 참여예술인 현황(예: 기획/제작/스태프 참여예술인 수, 출연진 참여예술인 수)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076410/fileData.do

Alerts

참여예술인수_기획제작스태프(명) is highly overall correlated with 참여예술인수(명)High correlation
참여예술인수_출연진(명) is highly overall correlated with 참여예술인수(명)High correlation
참여예술인수(명) is highly overall correlated with 참여예술인수_기획제작스태프(명) and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 19:12:17.573509
Analysis finished2023-12-12 19:12:19.602541
Duration2.03 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct106
Distinct (%)88.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T04:12:19.866445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters600
Distinct characters120
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)76.7%

Sample

1st row*나**단
2nd row*경**단
3rd row*러**단
4th row*울**터
5th row*선**단
ValueCountFrequency (%)
컴**니 2
 
1.7%
s**y 2
 
1.7%
하**소 2
 
1.7%
블**티 2
 
1.7%
단**화 2
 
1.7%
발**단 2
 
1.7%
벨**단 2
 
1.7%
이**트 2
 
1.7%
울**터 2
 
1.7%
단**수 2
 
1.7%
Other values (96) 100
83.3%
2023-12-13T04:12:20.668945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 360
60.0%
44
 
7.3%
8
 
1.3%
7
 
1.2%
6
 
1.0%
4
 
0.7%
4
 
0.7%
y 4
 
0.7%
4
 
0.7%
o 4
 
0.7%
Other values (110) 155
25.8%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 361
60.2%
Other Letter 215
35.8%
Lowercase Letter 13
 
2.2%
Uppercase Letter 10
 
1.7%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
20.5%
8
 
3.7%
7
 
3.3%
6
 
2.8%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
3
 
1.4%
Other values (94) 127
59.1%
Lowercase Letter
ValueCountFrequency (%)
y 4
30.8%
o 4
30.8%
n 1
 
7.7%
i 1
 
7.7%
r 1
 
7.7%
t 1
 
7.7%
a 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
C 3
30.0%
S 2
20.0%
D 2
20.0%
P 1
 
10.0%
R 1
 
10.0%
J 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
* 360
99.7%
! 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 362
60.3%
Hangul 215
35.8%
Latin 23
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
20.5%
8
 
3.7%
7
 
3.3%
6
 
2.8%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
3
 
1.4%
Other values (94) 127
59.1%
Latin
ValueCountFrequency (%)
y 4
17.4%
o 4
17.4%
C 3
13.0%
S 2
8.7%
D 2
8.7%
P 1
 
4.3%
n 1
 
4.3%
i 1
 
4.3%
r 1
 
4.3%
t 1
 
4.3%
Other values (3) 3
13.0%
Common
ValueCountFrequency (%)
* 360
99.4%
1 1
 
0.3%
! 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 385
64.2%
Hangul 215
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 360
93.5%
y 4
 
1.0%
o 4
 
1.0%
C 3
 
0.8%
S 2
 
0.5%
D 2
 
0.5%
P 1
 
0.3%
n 1
 
0.3%
1 1
 
0.3%
i 1
 
0.3%
Other values (6) 6
 
1.6%
Hangul
ValueCountFrequency (%)
44
 
20.5%
8
 
3.7%
7
 
3.3%
6
 
2.8%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
3
 
1.4%
Other values (94) 127
59.1%

사업연도
Real number (ℝ)

Distinct6
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.5917
Minimum2014
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T04:12:20.831991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014
Q12015
median2017
Q32018
95-th percentile2019
Maximum2019
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6423791
Coefficient of variation (CV)0.00081443313
Kurtosis-1.1746027
Mean2016.5917
Median Absolute Deviation (MAD)1
Skewness-0.026263594
Sum241991
Variance2.697409
MonotonicityIncreasing
2023-12-13T04:12:20.969184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2016 22
18.3%
2017 22
18.3%
2015 21
17.5%
2018 20
16.7%
2019 20
16.7%
2014 15
12.5%
ValueCountFrequency (%)
2014 15
12.5%
2015 21
17.5%
2016 22
18.3%
2017 22
18.3%
2018 20
16.7%
2019 20
16.7%
ValueCountFrequency (%)
2019 20
16.7%
2018 20
16.7%
2017 22
18.3%
2016 22
18.3%
2015 21
17.5%
2014 15
12.5%

참여예술인수_기획제작스태프(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct30
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.116667
Minimum2
Maximum64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T04:12:21.100895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4
Q18
median12
Q317
95-th percentile27.05
Maximum64
Range62
Interquartile range (IQR)9

Descriptive statistics

Standard deviation9.2383775
Coefficient of variation (CV)0.65443052
Kurtosis8.7565762
Mean14.116667
Median Absolute Deviation (MAD)4
Skewness2.404424
Sum1694
Variance85.347619
MonotonicityNot monotonic
2023-12-13T04:12:21.254481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
15 10
 
8.3%
12 10
 
8.3%
9 10
 
8.3%
8 8
 
6.7%
6 7
 
5.8%
4 7
 
5.8%
16 7
 
5.8%
11 7
 
5.8%
7 6
 
5.0%
10 6
 
5.0%
Other values (20) 42
35.0%
ValueCountFrequency (%)
2 1
 
0.8%
3 1
 
0.8%
4 7
5.8%
5 1
 
0.8%
6 7
5.8%
7 6
5.0%
8 8
6.7%
9 10
8.3%
10 6
5.0%
11 7
5.8%
ValueCountFrequency (%)
64 1
 
0.8%
47 2
1.7%
44 1
 
0.8%
30 1
 
0.8%
28 1
 
0.8%
27 1
 
0.8%
26 2
1.7%
24 3
2.5%
23 4
3.3%
22 3
2.5%

참여예술인수_출연진(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct31
Distinct (%)25.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.916667
Minimum2
Maximum120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T04:12:21.405620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile3.95
Q16
median9.5
Q314.25
95-th percentile26.05
Maximum120
Range118
Interquartile range (IQR)8.25

Descriptive statistics

Standard deviation13.482378
Coefficient of variation (CV)1.043797
Kurtosis34.606083
Mean12.916667
Median Absolute Deviation (MAD)3.5
Skewness5.0158427
Sum1550
Variance181.77451
MonotonicityNot monotonic
2023-12-13T04:12:21.559527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
4 11
 
9.2%
7 11
 
9.2%
8 9
 
7.5%
9 9
 
7.5%
6 9
 
7.5%
14 7
 
5.8%
11 7
 
5.8%
10 7
 
5.8%
5 5
 
4.2%
3 5
 
4.2%
Other values (21) 40
33.3%
ValueCountFrequency (%)
2 1
 
0.8%
3 5
4.2%
4 11
9.2%
5 5
4.2%
6 9
7.5%
7 11
9.2%
8 9
7.5%
9 9
7.5%
10 7
5.8%
11 7
5.8%
ValueCountFrequency (%)
120 1
 
0.8%
60 1
 
0.8%
51 1
 
0.8%
50 1
 
0.8%
30 1
 
0.8%
27 1
 
0.8%
26 2
1.7%
25 1
 
0.8%
24 3
2.5%
23 1
 
0.8%

참여예술인수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct46
Distinct (%)38.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.025
Minimum7
Maximum167
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T04:12:21.720108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile9.95
Q117
median22.5
Q332
95-th percentile56.45
Maximum167
Range160
Interquartile range (IQR)15

Descriptive statistics

Standard deviation18.996887
Coefficient of variation (CV)0.70293755
Kurtosis24.780887
Mean27.025
Median Absolute Deviation (MAD)7.5
Skewness3.9738932
Sum3243
Variance360.88172
MonotonicityNot monotonic
2023-12-13T04:12:21.904539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
19 9
 
7.5%
21 8
 
6.7%
12 5
 
4.2%
32 5
 
4.2%
23 5
 
4.2%
28 4
 
3.3%
22 4
 
3.3%
17 4
 
3.3%
31 4
 
3.3%
16 4
 
3.3%
Other values (36) 68
56.7%
ValueCountFrequency (%)
7 3
2.5%
8 2
 
1.7%
9 1
 
0.8%
10 2
 
1.7%
11 3
2.5%
12 5
4.2%
13 2
 
1.7%
14 3
2.5%
15 3
2.5%
16 4
3.3%
ValueCountFrequency (%)
167 1
0.8%
81 1
0.8%
75 1
0.8%
73 1
0.8%
68 1
0.8%
65 1
0.8%
56 1
0.8%
49 1
0.8%
48 1
0.8%
45 1
0.8%

Interactions

2023-12-13T04:12:18.982098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:17.771089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.224073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.596962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:19.082973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:17.874188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.317363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.684771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:19.195037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:17.987340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.411856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.787695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:19.307111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.092301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.507597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:12:18.884723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:12:22.037203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도참여예술인수_기획제작스태프(명)참여예술인수_출연진(명)참여예술인수(명)
사업연도1.0000.2400.0000.148
참여예술인수_기획제작스태프(명)0.2401.0000.5690.773
참여예술인수_출연진(명)0.0000.5691.0000.814
참여예술인수(명)0.1480.7730.8141.000
2023-12-13T04:12:22.184397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도참여예술인수_기획제작스태프(명)참여예술인수_출연진(명)참여예술인수(명)
사업연도1.0000.3390.0470.237
참여예술인수_기획제작스태프(명)0.3391.0000.3360.796
참여예술인수_출연진(명)0.0470.3361.0000.783
참여예술인수(명)0.2370.7960.7831.000

Missing values

2023-12-13T04:12:19.445755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:12:19.551324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공연단체명사업연도참여예술인수_기획제작스태프(명)참여예술인수_출연진(명)참여예술인수(명)
0*나**단2014347
1*경**단201491019
2*러**단2014151025
3*울**터201462430
4*선**단201491120
5*스**창20149211
6*경**영20144913
7*o**r2014639
8*이**쳐2014162238
9*능**부2014181230
공연단체명사업연도참여예술인수_기획제작스태프(명)참여예술인수_출연진(명)참여예술인수(명)
110*정**옥20198917
111*단**수2019151429
112*단**고2019301141
113*금**스2019231437
114*처**인2019171027
115*단**리201913821
116*이**트201944448
117*컴**니201919827
118*단**희201926531
119*단**자201913821