Overview

Dataset statistics

Number of variables: 5
Number of observations: 29
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.4 KiB
Average record size in memory: 48.6 B

Variable types

Categorical: 4
Numeric: 1

Dataset

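Description: Outcomes of the "Writing Space Operation" (집필공간운영) support program in the literature field, part of the 2014-2019 Culture and Arts Promotion Fund (문예진흥기금) open-call programs (e.g., number of resident writers, residency days, number of works created)
Author: Arts Council Korea (한국문화예술위원회)
URL: https://www.data.go.kr/data/15076476/fileData.do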

Alerts

High correlation: 성과_입주작가수(명) is highly overall correlated with 사업연도 and 3 other fields
High correlation: 문학단체명 is highly overall correlated with 성과_입주작가수(명) and 2 other fields
High correlation: 성과_작가입주일수(일) is highly overall correlated with 사업연도 and 3 other fields
High correlation: 성과_작품창작수(건) is highly overall correlated with 사업연도 and 3 other fields
High correlation: 사업연도 is highly overall correlated with 성과_입주작가수(명) and 2 other fields
Imbalance: 성과_입주작가수(명) is highly imbalanced (58.9%)
Imbalance: 성과_작가입주일수(일) is highly imbalanced (58.9%)
Imbalance: 성과_작품창작수(건) is highly imbalanced (58.9%)
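
The 58.9% imbalance figure can be reproduced from the value counts of the three outcome columns (one value occurring 24 times and five values occurring once each). The sketch below assumes the score is one minus the Shannon entropy normalized by the maximum entropy for the number of classes. That reading is inferred from the numbers in this report, and the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    The score is 0 when all classes are equally frequent and grows
    toward 1 as a single class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


# Counts taken from the Common Values tables of the outcome columns:
# "<NA>" appears 24 times, each of the five numeric values once.
outcome_counts = [24, 1, 1, 1, 1, 1]
print(f"{imbalance_score(outcome_counts):.1%}")
```

Under this assumption the three outcome columns share one score because they have identical count profiles, even though the values themselves differ.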

Reproduction

Analysis started: 2023-12-12 12:39:55.528710
Analysis finished: 2023-12-12 12:39:56.028185
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

문학단체명 (literary organization name)
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 24.1%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
Top values: *을**집, *지**단, *1**학, *악**원, *날**날; other values (2)

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 1
Unique (%): 3.4%

Sample

1st row: *악**원
2nd row: *을**집
3rd row: *1**학
4th row: *지**단
5th row: *날**날

Common Values

Value | Count | Frequency (%)
*을**집 | 6 | 20.7%
*지**단 | 6 | 20.7%
*1**학 | 5 | 17.2%
*악**원 | 4 | 13.8%
*날**날 | 4 | 13.8%
*버**집 | 3 | 10.3%
*산**꽃 | 1 | 3.4%
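
The "Distinct" and "Unique" figures for this column measure different things: distinct counts how many different values occur, while unique counts values that occur exactly once. A minimal sketch that recomputes both from the counts in the Common Values table (the helper name `distinct_and_unique` is hypothetical):

```python
def distinct_and_unique(value_counts):
    """Return (number of different values, number of values seen exactly once)."""
    distinct = len(value_counts)
    unique = sum(1 for count in value_counts.values() if count == 1)
    return distinct, unique


# Counts copied from the Common Values table for 문학단체명.
org_counts = {
    "*을**집": 6,
    "*지**단": 6,
    "*1**학": 5,
    "*악**원": 4,
    "*날**날": 4,
    "*버**집": 3,
    "*산**꽃": 1,
}

n_rows = sum(org_counts.values())
distinct, unique = distinct_and_unique(org_counts)
print(n_rows, distinct, unique)
print(f"{distinct / n_rows:.1%} distinct, {unique / n_rows:.1%} unique")
```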

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values; the counts match the Common Values table above.

사업연도 (program year)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.4483
Minimum: 2014
Maximum: 2019
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 393.0 B

Quantile statistics

Minimum: 2014
5-th percentile: 2014
Q1: 2015
Median: 2016
Q3: 2018
95-th percentile: 2019
Maximum: 2019
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.660168
Coefficient of variation (CV): 0.00082331294
Kurtosis: -1.1983475
Mean: 2016.4483
Median Absolute Deviation (MAD): 1
Skewness: 0.072131122
Sum: 58477
Variance: 2.7561576
Monotonicity: Increasing

Figure: Histogram with fixed size bins (bins=6)
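
Values by frequency

Value | Count | Frequency (%)
2015 | 6 | 20.7%
2016 | 5 | 17.2%
2017 | 5 | 17.2%
2018 | 5 | 17.2%
2014 | 4 | 13.8%
2019 | 4 | 13.8%

Smallest values (ascending)

Value | Count | Frequency (%)
2014 | 4 | 13.8%
2015 | 6 | 20.7%
2016 | 5 | 17.2%
2017 | 5 | 17.2%
2018 | 5 | 17.2%
2019 | 4 | 13.8%

Largest values (descending)

Value | Count | Frequency (%)
2019 | 4 | 13.8%
2018 | 5 | 17.2%
2017 | 5 | 17.2%
2016 | 5 | 17.2%
2015 | 6 | 20.7%
2014 | 4 | 13.8%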
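
The descriptive and quantile statistics for 사업연도 can be re-derived from the value counts alone. The sketch below uses Python's `statistics` module and assumes the report uses the sample variance (divisor n - 1) and linearly interpolated quantiles. Both assumptions are consistent with the printed figures.

```python
import statistics

# Year counts copied from the value count table for 사업연도.
year_counts = {2014: 4, 2015: 6, 2016: 5, 2017: 5, 2018: 5, 2019: 4}

# Expand the counts back into the 29 individual observations.
years = sorted(year for year, count in year_counts.items() for _ in range(count))

total = sum(years)
mean = statistics.mean(years)
variance = statistics.variance(years)  # sample variance, divisor n - 1
std = statistics.stdev(years)
cv = std / mean                        # coefficient of variation
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")

print(len(years), total)
print(f"mean={mean:.4f} variance={variance:.7f} std={std:.6f} cv={cv:.11f}")
print(f"Q1={q1:.0f} median={median:.0f} Q3={q3:.0f} IQR={q3 - q1:.0f}")
```

The very small coefficient of variation is expected: the standard deviation of about 1.66 years is divided by a mean of about 2016, so the ratio says little about a calendar-year column.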

성과_입주작가수(명) (outcome: number of resident writers, persons)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
Top values: <NA> (24), 48 (1), 44 (1), 52 (1), 28 (1)

Length

Max length: 4
Median length: 4
Mean length: 3.6551724
Min length: 2

Unique

Unique: 5
Unique (%): 17.2%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 24 | 82.8%
48 | 1 | 3.4%
44 | 1 | 3.4%
52 | 1 | 3.4%
28 | 1 | 3.4%
13 | 1 | 3.4%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values; the counts match the Common Values table above.

성과_작가입주일수(일) (outcome: writer residency days, days)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
Top values: <NA> (24), 2717 (1), 22293 (1), 2989 (1), 1172 (1)

Length

Max length: 5
Median length: 4
Mean length: 4.0344828
Min length: 4

Unique

Unique: 5
Unique (%): 17.2%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 24 | 82.8%
2717 | 1 | 3.4%
22293 | 1 | 3.4%
2989 | 1 | 3.4%
1172 | 1 | 3.4%
1350 | 1 | 3.4%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values; the counts match the Common Values table above.

성과_작품창작수(건) (outcome: number of works created, items)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
Top values: <NA> (24), 395 (1), 214 (1), 316 (1), 208 (1)

Length

Max length: 4
Median length: 4
Mean length: 3.7931034
Min length: 2

Unique

Unique: 5
Unique (%): 17.2%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 24 | 82.8%
395 | 1 | 3.4%
214 | 1 | 3.4%
316 | 1 | 3.4%
208 | 1 | 3.4%
96 | 1 | 3.4%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values; the counts match the Common Values table above.

Interactions

Figure: Interactions plot

Correlations

Variable | 문학단체명 | 사업연도 | 성과_입주작가수(명) | 성과_작가입주일수(일) | 성과_작품창작수(건)
문학단체명 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000
사업연도 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
성과_입주작가수(명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
성과_작가입주일수(일) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
성과_작품창작수(건) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 성과_입주작가수(명) | 문학단체명 | 성과_작가입주일수(일) | 성과_작품창작수(건)
성과_입주작가수(명) | 1.000 | 1.000 | 1.000 | 1.000
문학단체명 | 1.000 | 1.000 | 1.000 | 1.000
성과_작가입주일수(일) | 1.000 | 1.000 | 1.000 | 1.000
성과_작품창작수(건) | 1.000 | 1.000 | 1.000 | 1.000

Variable | 사업연도 | 문학단체명 | 성과_입주작가수(명) | 성과_작가입주일수(일) | 성과_작품창작수(건)
사업연도 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000
문학단체명 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
성과_입주작가수(명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
성과_작가입주일수(일) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
성과_작품창작수(건) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
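
The text of this report does not state which association measure produced each matrix. The matrix that contains only the four categorical variables is consistent with a categorical measure such as Cramér's V, and the sketch below uses that measure as an assumption to show why two outcome columns score 1.000. The function `cramers_v` is a hypothetical helper that computes the uncorrected statistic. The two columns are rebuilt from the value counts and the last rows of the sample: 24 rows hold "<NA>" in both columns, and the remaining 5 rows pair one distinct value with another.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-square over every cell of the contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Rows 0-23 are "<NA>" in both columns; rows 24-28 hold the values below.
writers = ["<NA>"] * 24 + ["48", "44", "52", "28", "13"]
works = ["<NA>"] * 24 + ["395", "214", "316", "208", "96"]

print(round(cramers_v(writers, works), 3))
```

Each label in one column maps to exactly one label in the other, so the association is perfect. The score reflects the shared pattern of "<NA>" placeholders and one-off values rather than a numeric relationship between the outcomes.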

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

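First rows

Index | 문학단체명 | 사업연도 | 성과_입주작가수(명) | 성과_작가입주일수(일) | 성과_작품창작수(건)
0 | *악**원 | 2014 | <NA> | <NA> | <NA>
1 | *을**집 | 2014 | <NA> | <NA> | <NA>
2 | *1**학 | 2014 | <NA> | <NA> | <NA>
3 | *지**단 | 2014 | <NA> | <NA> | <NA>
4 | *날**날 | 2015 | <NA> | <NA> | <NA>
5 | *악**원 | 2015 | <NA> | <NA> | <NA>
6 | *산**꽃 | 2015 | <NA> | <NA> | <NA>
7 | *을**집 | 2015 | <NA> | <NA> | <NA>
8 | *1**학 | 2015 | <NA> | <NA> | <NA>
9 | *지**단 | 2015 | <NA> | <NA> | <NA>

Last rows

Index | 문학단체명 | 사업연도 | 성과_입주작가수(명) | 성과_작가입주일수(일) | 성과_작품창작수(건)
19 | *을**집 | 2017 | <NA> | <NA> | <NA>
20 | *버**집 | 2018 | <NA> | <NA> | <NA>
21 | *날**날 | 2018 | <NA> | <NA> | <NA>
22 | *을**집 | 2018 | <NA> | <NA> | <NA>
23 | *1**학 | 2018 | <NA> | <NA> | <NA>
24 | *지**단 | 2018 | 48 | 2717 | 395
25 | *을**집 | 2019 | 44 | 22293 | 214
26 | *지**단 | 2019 | 52 | 2989 | 316
27 | *버**집 | 2019 | 28 | 1172 | 208
28 | *악**원 | 2019 | 13 | 1350 | 96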
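
The report lists 0 missing cells, yet the three outcome columns contain the text "<NA>" in most rows, which suggests the placeholder was read as an ordinary category. The sketch below shows one way to turn such rows into typed records where the placeholder becomes a real missing value. The names `Row`, `parse_count`, and `parse_row` are hypothetical, and the input is the last six rows of the sample rather than the full dataset.

```python
from typing import NamedTuple, Optional


class Row(NamedTuple):
    org: str                 # 문학단체명
    year: int                # 사업연도
    writers: Optional[int]   # 성과_입주작가수(명)
    days: Optional[int]      # 성과_작가입주일수(일)
    works: Optional[int]     # 성과_작품창작수(건)


def parse_count(text: str) -> Optional[int]:
    """Convert a count cell to int, treating the "<NA>" placeholder as missing."""
    text = text.strip()
    return None if text == "<NA>" else int(text)


def parse_row(org: str, year: str, writers: str, days: str, works: str) -> Row:
    return Row(org, int(year), parse_count(writers), parse_count(days), parse_count(works))


# Rows 23-28 of the sample, as text.
raw_rows = [
    ("*1**학", "2018", "<NA>", "<NA>", "<NA>"),
    ("*지**단", "2018", "48", "2717", "395"),
    ("*을**집", "2019", "44", "22293", "214"),
    ("*지**단", "2019", "52", "2989", "316"),
    ("*버**집", "2019", "28", "1172", "208"),
    ("*악**원", "2019", "13", "1350", "96"),
]

rows = [parse_row(*raw) for raw in raw_rows]
missing_cells = sum(
    value is None for row in rows for value in (row.writers, row.days, row.works)
)
print(rows[1])
print(missing_cells)
```

With the outcome columns parsed as numbers, a profiling run would treat them as numeric variables with missing values instead of as imbalanced categoricals.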