Overview

Dataset statistics

Number of variables6
Number of observations64
Missing cells75
Missing cells (%)19.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory54.1 B

Variable types

Categorical2
Numeric4

Dataset

Description2014-2019년 문예진흥기금 공모사업 중 문학 분야 "집필공간운영" 지원 사업의 분야별 사업성과(예: 입주 작가 수, 입주율, 입주일수)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076473/fileData.do

Alerts

입주작가수(명) is highly overall correlated with 입주율(%) and 1 other fieldsHigh correlation
입주율(%) is highly overall correlated with 입주작가수(명) and 1 other fieldsHigh correlation
입주일수(일) is highly overall correlated with 입주작가수(명) and 1 other fieldsHigh correlation
입주작가수(명) has 23 (35.9%) missing valuesMissing
입주율(%) has 23 (35.9%) missing valuesMissing
입주일수(일) has 29 (45.3%) missing valuesMissing
입주작가수(명) has 6 (9.4%) zerosZeros
입주율(%) has 6 (9.4%) zerosZeros
입주일수(일) has 6 (9.4%) zerosZeros

Reproduction

Analysis started2023-12-12 17:07:14.444477
Analysis finished2023-12-12 17:07:16.701785
Duration2.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

문학단체명
Categorical

Distinct7
Distinct (%)10.9%
Missing0
Missing (%)0.0%
Memory size644.0 B
*지**단
23 
*을**집
12 
*악**원
10 
*버**집
*1**학
Other values (2)

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row*악**원
2nd row*을**집
3rd row*1**학
4th row*지**단
5th row*날**날

Common Values

ValueCountFrequency (%)
*지**단 23
35.9%
*을**집 12
18.8%
*악**원 10
15.6%
*버**집 9
 
14.1%
*1**학 5
 
7.8%
*날**날 4
 
6.2%
*산**꽃 1
 
1.6%

Length

2023-12-13T02:07:16.775500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:07:16.899447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지**단 23
35.9%
을**집 12
18.8%
악**원 10
15.6%
버**집 9
 
14.1%
1**학 5
 
7.8%
날**날 4
 
6.2%
산**꽃 1
 
1.6%

사업연도
Real number (ℝ)

Distinct6
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.5938
Minimum2014
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size708.0 B
2023-12-13T02:07:17.053239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014.15
Q12017
median2018
Q32019
95-th percentile2019
Maximum2019
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.6204913
Coefficient of variation (CV)0.00080318018
Kurtosis-0.40870115
Mean2017.5938
Median Absolute Deviation (MAD)1
Skewness-0.90131313
Sum129126
Variance2.6259921
MonotonicityIncreasing
2023-12-13T02:07:17.157888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2019 28
43.8%
2018 11
 
17.2%
2017 10
 
15.6%
2015 6
 
9.4%
2016 5
 
7.8%
2014 4
 
6.2%
ValueCountFrequency (%)
2014 4
 
6.2%
2015 6
 
9.4%
2016 5
 
7.8%
2017 10
 
15.6%
2018 11
 
17.2%
2019 28
43.8%
ValueCountFrequency (%)
2019 28
43.8%
2018 11
 
17.2%
2017 10
 
15.6%
2016 5
 
7.8%
2015 6
 
9.4%
2014 4
 
6.2%

분야
Categorical

Distinct8
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size644.0 B
<NA>
23 
희곡
시(조)
소설
아동문학
Other values (3)
17 

Length

Max length4
Median length4
Mean length3.09375
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 23
35.9%
희곡 6
 
9.4%
시(조) 6
 
9.4%
소설 6
 
9.4%
아동문학 6
 
9.4%
산문 6
 
9.4%
평론 6
 
9.4%
기타 5
 
7.8%

Length

2023-12-13T02:07:17.323412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:07:17.482427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 23
35.9%
희곡 6
 
9.4%
시(조 6
 
9.4%
소설 6
 
9.4%
아동문학 6
 
9.4%
산문 6
 
9.4%
평론 6
 
9.4%
기타 5
 
7.8%

입주작가수(명)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct15
Distinct (%)36.6%
Missing23
Missing (%)35.9%
Infinite0
Infinite (%)0.0%
Mean5.6097561
Minimum0
Maximum23
Zeros6
Zeros (%)9.4%
Negative0
Negative (%)0.0%
Memory size708.0 B
2023-12-13T02:07:17.610391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median4
Q38
95-th percentile17
Maximum23
Range23
Interquartile range (IQR)7

Descriptive statistics

Standard deviation5.8261396
Coefficient of variation (CV)1.0385727
Kurtosis1.2897249
Mean5.6097561
Median Absolute Deviation (MAD)3
Skewness1.3305698
Sum230
Variance33.943902
MonotonicityNot monotonic
2023-12-13T02:07:17.731482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 7
 
10.9%
0 6
 
9.4%
12 4
 
6.2%
3 4
 
6.2%
4 3
 
4.7%
7 3
 
4.7%
2 3
 
4.7%
9 2
 
3.1%
6 2
 
3.1%
5 2
 
3.1%
Other values (5) 5
 
7.8%
(Missing) 23
35.9%
ValueCountFrequency (%)
0 6
9.4%
1 7
10.9%
2 3
4.7%
3 4
6.2%
4 3
4.7%
5 2
 
3.1%
6 2
 
3.1%
7 3
4.7%
8 1
 
1.6%
9 2
 
3.1%
ValueCountFrequency (%)
23 1
 
1.6%
20 1
 
1.6%
17 1
 
1.6%
16 1
 
1.6%
12 4
6.2%
9 2
3.1%
8 1
 
1.6%
7 3
4.7%
6 2
3.1%
5 2
3.1%

입주율(%)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct30
Distinct (%)73.2%
Missing23
Missing (%)35.9%
Infinite0
Infinite (%)0.0%
Mean14.534146
Minimum0
Maximum59
Zeros6
Zeros (%)9.4%
Negative0
Negative (%)0.0%
Memory size708.0 B
2023-12-13T02:07:17.867870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13.6
median10
Q318.8
95-th percentile42.8
Maximum59
Range59
Interquartile range (IQR)15.2

Descriptive statistics

Standard deviation14.566702
Coefficient of variation (CV)1.0022399
Kurtosis1.1257115
Mean14.534146
Median Absolute Deviation (MAD)7.9
Skewness1.2714265
Sum595.9
Variance212.1888
MonotonicityNot monotonic
2023-12-13T02:07:18.020120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
0.0 6
 
9.4%
2.1 3
 
4.7%
14.3 2
 
3.1%
25.0 2
 
3.1%
18.8 2
 
3.1%
9.6 2
 
3.1%
1.9 1
 
1.6%
11.5 1
 
1.6%
42.8 1
 
1.6%
10.0 1
 
1.6%
Other values (20) 20
31.2%
(Missing) 23
35.9%
ValueCountFrequency (%)
0.0 6
9.4%
1.9 1
 
1.6%
2.1 3
4.7%
3.6 1
 
1.6%
4.0 1
 
1.6%
4.2 1
 
1.6%
5.8 1
 
1.6%
6.2 1
 
1.6%
6.5 1
 
1.6%
7.2 1
 
1.6%
ValueCountFrequency (%)
59.0 1
1.6%
44.2 1
1.6%
42.8 1
1.6%
41.7 1
1.6%
35.4 1
1.6%
35.0 1
1.6%
31.3 1
1.6%
25.0 2
3.1%
23.1 1
1.6%
18.8 2
3.1%

입주일수(일)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct27
Distinct (%)77.1%
Missing29
Missing (%)45.3%
Infinite0
Infinite (%)0.0%
Mean300.6
Minimum0
Maximum1438
Zeros6
Zeros (%)9.4%
Negative0
Negative (%)0.0%
Memory size708.0 B
2023-12-13T02:07:18.181470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q164.5
median148
Q3482.5
95-th percentile846.1
Maximum1438
Range1438
Interquartile range (IQR)418

Descriptive statistics

Standard deviation329.67954
Coefficient of variation (CV)1.0967383
Kurtosis2.9260291
Mean300.6
Median Absolute Deviation (MAD)148
Skewness1.6250048
Sum10521
Variance108688.6
MonotonicityNot monotonic
2023-12-13T02:07:18.311884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
0 6
 
9.4%
135 3
 
4.7%
148 2
 
3.1%
30 1
 
1.6%
144 1
 
1.6%
793 1
 
1.6%
218 1
 
1.6%
60 1
 
1.6%
61 1
 
1.6%
116 1
 
1.6%
Other values (17) 17
26.6%
(Missing) 29
45.3%
ValueCountFrequency (%)
0 6
9.4%
30 1
 
1.6%
60 1
 
1.6%
61 1
 
1.6%
68 1
 
1.6%
116 1
 
1.6%
121 1
 
1.6%
135 3
4.7%
144 1
 
1.6%
148 2
 
3.1%
ValueCountFrequency (%)
1438 1
1.6%
949 1
1.6%
802 1
1.6%
793 1
1.6%
718 1
1.6%
591 1
1.6%
549 1
1.6%
538 1
1.6%
531 1
1.6%
434 1
1.6%

Interactions

2023-12-13T02:07:15.877532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:14.687563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.115400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.497817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.983565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:14.783524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.218693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.598943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:16.090184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:14.883930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.297199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.686959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:16.221802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:14.996816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.389312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:07:15.780107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:07:18.402925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명사업연도분야입주작가수(명)입주율(%)입주일수(일)
문학단체명1.0000.3270.0000.0000.0000.000
사업연도0.3271.0000.0000.4760.0000.000
분야0.0000.0001.0000.3520.6510.574
입주작가수(명)0.0000.4760.3521.0000.7490.956
입주율(%)0.0000.0000.6510.7491.0000.922
입주일수(일)0.0000.0000.5740.9560.9221.000
2023-12-13T02:07:18.506722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명분야
문학단체명1.0000.000
분야0.0001.000
2023-12-13T02:07:18.594707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도입주작가수(명)입주율(%)입주일수(일)문학단체명분야
사업연도1.000-0.225-0.109-0.2020.1550.000
입주작가수(명)-0.2251.0000.9350.9530.0000.173
입주율(%)-0.1090.9351.0000.9250.0000.411
입주일수(일)-0.2020.9530.9251.0000.0000.338
문학단체명0.1550.0000.0000.0001.0000.000
분야0.0000.1730.4110.3380.0001.000

Missing values

2023-12-13T02:07:16.381873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:07:16.517305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T02:07:16.641405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

문학단체명사업연도분야입주작가수(명)입주율(%)입주일수(일)
0*악**원2014<NA><NA><NA><NA>
1*을**집2014<NA><NA><NA><NA>
2*1**학2014<NA><NA><NA><NA>
3*지**단2014<NA><NA><NA><NA>
4*날**날2015<NA><NA><NA><NA>
5*악**원2015<NA><NA><NA><NA>
6*산**꽃2015<NA><NA><NA><NA>
7*을**집2015<NA><NA><NA><NA>
8*1**학2015<NA><NA><NA><NA>
9*지**단2015<NA><NA><NA><NA>
문학단체명사업연도분야입주작가수(명)입주율(%)입주일수(일)
54*버**집2019평론00.00
55*버**집2019희곡00.00
56*버**집2019기타414.3135
57*악**원2019기타110.0135
58*악**원2019희곡00.00
59*악**원2019평론14.060
60*악**원2019산문00.00
61*악**원2019아동문학216.0218
62*악**원2019소설759.0793
63*악**원2019시(조)211.0144