Overview

Dataset statistics

Number of variables5
Number of observations409
Missing cells18
Missing cells (%)0.9%
Duplicate rows46
Duplicate rows (%)11.2%
Total size in memory17.7 KiB
Average record size in memory44.3 B

Variable types

Text1
Numeric4

Dataset

Description2014-2019년 문예진흥기금 공모사업 중 문학 분야 "문예지발간" 지원 사업의 홍보실적(예: 언론보도 실적, 행사 개최횟수, 행사 참가 예술인 수)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076440/fileData.do

Alerts

Dataset has 46 (11.2%) duplicate rowsDuplicates
행사참가예술인수(명) has 18 (4.4%) missing valuesMissing
언론보도실적(건) has 162 (39.6%) zerosZeros
행사개최실적(건) has 49 (12.0%) zerosZeros
행사참가예술인수(명) has 49 (12.0%) zerosZeros

Reproduction

Analysis started2023-12-12 00:27:19.575937
Analysis finished2023-12-12 00:27:21.530125
Duration1.95 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct65
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-12T09:27:21.684443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters2045
Distinct characters70
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)2.4%

Sample

1st row*제**부
2nd row*국**회
3rd row*국**회
4th row*국**회
5th row*국**회
ValueCountFrequency (%)
국**회 90
22.0%
대**학 28
 
6.8%
디**원 22
 
5.4%
비**비 14
 
3.4%
학**상 13
 
3.2%
아**원 12
 
2.9%
서**성 11
 
2.7%
시**아 9
 
2.2%
조**학 9
 
2.2%
간**선 7
 
1.7%
Other values (55) 194
47.4%
2023-12-12T09:27:22.435490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 1227
60.0%
104
 
5.1%
103
 
5.0%
98
 
4.8%
34
 
1.7%
32
 
1.6%
28
 
1.4%
23
 
1.1%
23
 
1.1%
22
 
1.1%
Other values (60) 351
 
17.2%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 1227
60.0%
Other Letter 804
39.3%
Close Punctuation 8
 
0.4%
Decimal Number 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
12.9%
103
 
12.8%
98
 
12.2%
34
 
4.2%
32
 
4.0%
28
 
3.5%
23
 
2.9%
23
 
2.9%
22
 
2.7%
22
 
2.7%
Other values (57) 315
39.2%
Other Punctuation
ValueCountFrequency (%)
* 1227
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Decimal Number
ValueCountFrequency (%)
1 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1241
60.7%
Hangul 804
39.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
12.9%
103
 
12.8%
98
 
12.2%
34
 
4.2%
32
 
4.0%
28
 
3.5%
23
 
2.9%
23
 
2.9%
22
 
2.7%
22
 
2.7%
Other values (57) 315
39.2%
Common
ValueCountFrequency (%)
* 1227
98.9%
) 8
 
0.6%
1 6
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1241
60.7%
Hangul 804
39.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 1227
98.9%
) 8
 
0.6%
1 6
 
0.5%
Hangul
ValueCountFrequency (%)
104
 
12.9%
103
 
12.8%
98
 
12.2%
34
 
4.2%
32
 
4.0%
28
 
3.5%
23
 
2.9%
23
 
2.9%
22
 
2.7%
22
 
2.7%
Other values (57) 315
39.2%

사업연도
Real number (ℝ)

Distinct6
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.5037
Minimum2014
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-12T09:27:22.576728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014
Q12014
median2017
Q32019
95-th percentile2019
Maximum2019
Range5
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.1581154
Coefficient of variation (CV)0.0010702264
Kurtosis-1.7904888
Mean2016.5037
Median Absolute Deviation (MAD)2
Skewness-0.089571651
Sum824750
Variance4.657462
MonotonicityIncreasing
2023-12-12T09:27:22.702046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2014 147
35.9%
2019 112
27.4%
2018 91
22.2%
2015 31
 
7.6%
2016 15
 
3.7%
2017 13
 
3.2%
ValueCountFrequency (%)
2014 147
35.9%
2015 31
 
7.6%
2016 15
 
3.7%
2017 13
 
3.2%
2018 91
22.2%
2019 112
27.4%
ValueCountFrequency (%)
2019 112
27.4%
2018 91
22.2%
2017 13
 
3.2%
2016 15
 
3.7%
2015 31
 
7.6%
2014 147
35.9%

언론보도실적(건)
Real number (ℝ)

ZEROS 

Distinct17
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3863081
Minimum0
Maximum26
Zeros162
Zeros (%)39.6%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-12T09:27:22.845559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q34
95-th percentile10
Maximum26
Range26
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.2625937
Coefficient of variation (CV)1.367214
Kurtosis10.199206
Mean2.3863081
Median Absolute Deviation (MAD)2
Skewness2.5785219
Sum976
Variance10.644518
MonotonicityNot monotonic
2023-12-12T09:27:22.950350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0 162
39.6%
2 73
17.8%
4 51
 
12.5%
1 36
 
8.8%
3 29
 
7.1%
6 13
 
3.2%
5 11
 
2.7%
10 8
 
2.0%
11 7
 
1.7%
7 5
 
1.2%
Other values (7) 14
 
3.4%
ValueCountFrequency (%)
0 162
39.6%
1 36
 
8.8%
2 73
17.8%
3 29
 
7.1%
4 51
 
12.5%
5 11
 
2.7%
6 13
 
3.2%
7 5
 
1.2%
8 4
 
1.0%
9 2
 
0.5%
ValueCountFrequency (%)
26 1
 
0.2%
20 1
 
0.2%
16 2
 
0.5%
13 2
 
0.5%
12 2
 
0.5%
11 7
1.7%
10 8
2.0%
9 2
 
0.5%
8 4
1.0%
7 5
1.2%

행사개최실적(건)
Real number (ℝ)

ZEROS 

Distinct11
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1687042
Minimum0
Maximum11
Zeros49
Zeros (%)12.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-12T09:27:23.047239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35
95-th percentile9
Maximum11
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.5492018
Coefficient of variation (CV)0.80449348
Kurtosis1.3117565
Mean3.1687042
Median Absolute Deviation (MAD)2
Skewness1.121822
Sum1296
Variance6.4984299
MonotonicityNot monotonic
2023-12-12T09:27:23.152628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 77
18.8%
5 62
15.2%
2 61
14.9%
4 58
14.2%
3 58
14.2%
0 49
12.0%
11 13
 
3.2%
8 9
 
2.2%
9 9
 
2.2%
7 7
 
1.7%
ValueCountFrequency (%)
0 49
12.0%
1 77
18.8%
2 61
14.9%
3 58
14.2%
4 58
14.2%
5 62
15.2%
6 6
 
1.5%
7 7
 
1.7%
8 9
 
2.2%
9 9
 
2.2%
ValueCountFrequency (%)
11 13
 
3.2%
9 9
 
2.2%
8 9
 
2.2%
7 7
 
1.7%
6 6
 
1.5%
5 62
15.2%
4 58
14.2%
3 58
14.2%
2 61
14.9%
1 77
18.8%

행사참가예술인수(명)
Real number (ℝ)

MISSING  ZEROS 

Distinct82
Distinct (%)21.0%
Missing18
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean85.048593
Minimum0
Maximum5000
Zeros49
Zeros (%)12.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-12T09:27:23.277293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14
median44
Q3100
95-th percentile204
Maximum5000
Range5000
Interquartile range (IQR)96

Descriptive statistics

Standard deviation314.64585
Coefficient of variation (CV)3.6996009
Kurtosis191.83497
Mean85.048593
Median Absolute Deviation (MAD)42
Skewness13.375782
Sum33254
Variance99002.01
MonotonicityNot monotonic
2023-12-12T09:27:23.413013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 49
 
12.0%
50 23
 
5.6%
1 20
 
4.9%
150 20
 
4.9%
100 20
 
4.9%
80 18
 
4.4%
10 16
 
3.9%
120 14
 
3.4%
3 13
 
3.2%
2 12
 
2.9%
Other values (72) 186
45.5%
(Missing) 18
 
4.4%
ValueCountFrequency (%)
0 49
12.0%
1 20
4.9%
2 12
 
2.9%
3 13
 
3.2%
4 11
 
2.7%
5 5
 
1.2%
6 5
 
1.2%
7 4
 
1.0%
8 5
 
1.2%
10 16
 
3.9%
ValueCountFrequency (%)
5000 1
 
0.2%
3584 1
 
0.2%
420 1
 
0.2%
400 1
 
0.2%
300 7
1.7%
260 1
 
0.2%
250 1
 
0.2%
240 2
 
0.5%
230 1
 
0.2%
226 1
 
0.2%

Interactions

2023-12-12T09:27:20.918598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:19.811687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.199922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.554866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:21.005725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:19.918474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.284993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.648367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:21.137280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.021086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.368559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.740217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:21.268654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.115718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.463777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:27:20.833569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:27:23.498090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명사업연도언론보도실적(건)행사개최실적(건)행사참가예술인수(명)
문학단체명1.0000.5500.7570.8530.914
사업연도0.5501.0000.2630.6080.195
언론보도실적(건)0.7570.2631.0000.5510.000
행사개최실적(건)0.8530.6080.5511.0000.000
행사참가예술인수(명)0.9140.1950.0000.0001.000
2023-12-12T09:27:23.595319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도언론보도실적(건)행사개최실적(건)행사참가예술인수(명)
사업연도1.000-0.088-0.310-0.161
언론보도실적(건)-0.0881.0000.256-0.026
행사개최실적(건)-0.3100.2561.0000.110
행사참가예술인수(명)-0.161-0.0260.1101.000

Missing values

2023-12-12T09:27:21.398930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:27:21.492759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

문학단체명사업연도언론보도실적(건)행사개최실적(건)행사참가예술인수(명)
0*제**부2014200
1*국**회201404208
2*국**회201404175
3*국**회20140487
4*국**회201402300
5*국**회201402300
6*1**학20144540
7*1**학2014457
8*1**학2014457
9*1**학2014457
문학단체명사업연도언론보도실적(건)행사개최실적(건)행사참가예술인수(명)
399*국**연201901150
400*국**연201923100
401*대**학2019063
402*대**학2019063
403*대**학2019063
404*대**학2019064
405*대**학2019063
406*대**학2019064
407*국**회201921120
408*천**학2019000

Duplicate rows

Most frequently occurring

문학단체명사업연도언론보도실적(건)행사개최실적(건)행사참가예술인수(명)# duplicates
25*디**원201441119
27*디**원20192916
12*국**회20173004
22*대**학20190634
26*디**원2014411<NA>4
29*로**상201405104
0*1**학20144573
4*국**회2014051053
24*동**론2014641303
1*간**계201435802