Overview

Dataset statistics

Number of variables5
Number of observations29
Missing cells9
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory48.6 B

Variable types

Categorical1
Numeric4

Dataset

Description2014-2019년 문예진흥기금 공모사업 중 문학 분야 "집필공간운영" 지원 사업의 공모 및 선정 내역(예: 입주 선정 예정 인원, 입주 신청 인원, 입주 선정 인원)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076469/fileData.do

Alerts

입주선정예정인원(명) is highly overall correlated with 입주신청인원(명) and 1 other fieldsHigh correlation
입주신청인원(명) is highly overall correlated with 입주선정예정인원(명) and 1 other fieldsHigh correlation
입주선정인원(명) is highly overall correlated with 입주선정예정인원(명) and 1 other fieldsHigh correlation
입주선정예정인원(명) has 4 (13.8%) missing valuesMissing
입주신청인원(명) has 1 (3.4%) missing valuesMissing
입주선정인원(명) has 4 (13.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 20:46:41.148436
Analysis finished2023-12-12 20:46:42.962955
Duration1.81 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

문학단체명
Categorical

Distinct7
Distinct (%)24.1%
Missing0
Missing (%)0.0%
Memory size364.0 B
*을**집
*지**단
*1**학
*악**원
*날**날
Other values (2)

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row*악**원
2nd row*을**집
3rd row*1**학
4th row*지**단
5th row*날**날

Common Values

ValueCountFrequency (%)
*을**집 6
20.7%
*지**단 6
20.7%
*1**학 5
17.2%
*악**원 4
13.8%
*날**날 4
13.8%
*버**집 3
10.3%
*산**꽃 1
 
3.4%

Length

2023-12-13T05:46:43.032908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:46:43.148990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
을**집 6
20.7%
지**단 6
20.7%
1**학 5
17.2%
악**원 4
13.8%
날**날 4
13.8%
버**집 3
10.3%
산**꽃 1
 
3.4%

사업연도
Real number (ℝ)

Distinct6
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.4483
Minimum2014
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T05:46:43.269308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014
Q12015
median2016
Q32018
95-th percentile2019
Maximum2019
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.660168
Coefficient of variation (CV)0.00082331294
Kurtosis-1.1983475
Mean2016.4483
Median Absolute Deviation (MAD)1
Skewness0.072131122
Sum58477
Variance2.7561576
MonotonicityIncreasing
2023-12-13T05:46:43.389041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2015 6
20.7%
2016 5
17.2%
2017 5
17.2%
2018 5
17.2%
2014 4
13.8%
2019 4
13.8%
ValueCountFrequency (%)
2014 4
13.8%
2015 6
20.7%
2016 5
17.2%
2017 5
17.2%
2018 5
17.2%
2019 4
13.8%
ValueCountFrequency (%)
2019 4
13.8%
2018 5
17.2%
2017 5
17.2%
2016 5
17.2%
2015 6
20.7%
2014 4
13.8%

입주선정예정인원(명)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct15
Distinct (%)60.0%
Missing4
Missing (%)13.8%
Infinite0
Infinite (%)0.0%
Mean35.2
Minimum7
Maximum56
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T05:46:43.571033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile16.6
Q128
median40
Q344
95-th percentile46.6
Maximum56
Range49
Interquartile range (IQR)16

Descriptive statistics

Standard deviation11.870833
Coefficient of variation (CV)0.33723956
Kurtosis-0.15616212
Mean35.2
Median Absolute Deviation (MAD)5
Skewness-0.68574894
Sum880
Variance140.91667
MonotonicityNot monotonic
2023-12-13T05:46:43.686196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
40 5
17.2%
44 4
13.8%
45 3
10.3%
28 2
 
6.9%
35 1
 
3.4%
7 1
 
3.4%
37 1
 
3.4%
56 1
 
3.4%
25 1
 
3.4%
21 1
 
3.4%
Other values (5) 5
17.2%
(Missing) 4
13.8%
ValueCountFrequency (%)
7 1
3.4%
16 1
3.4%
19 1
3.4%
20 1
3.4%
21 1
3.4%
25 1
3.4%
28 2
6.9%
30 1
3.4%
35 1
3.4%
37 1
3.4%
ValueCountFrequency (%)
56 1
 
3.4%
47 1
 
3.4%
45 3
10.3%
44 4
13.8%
40 5
17.2%
37 1
 
3.4%
35 1
 
3.4%
30 1
 
3.4%
28 2
 
6.9%
25 1
 
3.4%

입주신청인원(명)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct24
Distinct (%)85.7%
Missing1
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean71
Minimum7
Maximum147
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T05:46:43.809024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile22.35
Q139
median65
Q399.25
95-th percentile129.5
Maximum147
Range140
Interquartile range (IQR)60.25

Descriptive statistics

Standard deviation36.987485
Coefficient of variation (CV)0.5209505
Kurtosis-0.71223105
Mean71
Median Absolute Deviation (MAD)28
Skewness0.34880408
Sum1988
Variance1368.0741
MonotonicityNot monotonic
2023-12-13T05:46:43.940602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
39 2
 
6.9%
115 2
 
6.9%
65 2
 
6.9%
62 2
 
6.9%
23 1
 
3.4%
32 1
 
3.4%
60 1
 
3.4%
95 1
 
3.4%
70 1
 
3.4%
90 1
 
3.4%
Other values (14) 14
48.3%
ValueCountFrequency (%)
7 1
3.4%
22 1
3.4%
23 1
3.4%
31 1
3.4%
32 1
3.4%
35 1
3.4%
39 2
6.9%
57 1
3.4%
60 1
3.4%
61 1
3.4%
ValueCountFrequency (%)
147 1
3.4%
133 1
3.4%
123 1
3.4%
117 1
3.4%
115 2
6.9%
112 1
3.4%
95 1
3.4%
90 1
3.4%
78 1
3.4%
70 1
3.4%

입주선정인원(명)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct19
Distinct (%)76.0%
Missing4
Missing (%)13.8%
Infinite0
Infinite (%)0.0%
Mean35.12
Minimum7
Maximum62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T05:46:44.043663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile16.6
Q127
median35
Q345
95-th percentile51.4
Maximum62
Range55
Interquartile range (IQR)18

Descriptive statistics

Standard deviation13.115894
Coefficient of variation (CV)0.37345939
Kurtosis-0.38807797
Mean35.12
Median Absolute Deviation (MAD)10
Skewness-0.15298166
Sum878
Variance172.02667
MonotonicityNot monotonic
2023-12-13T05:46:44.147137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
21 2
 
6.9%
48 2
 
6.9%
44 2
 
6.9%
30 2
 
6.9%
45 2
 
6.9%
35 2
 
6.9%
31 1
 
3.4%
27 1
 
3.4%
42 1
 
3.4%
49 1
 
3.4%
Other values (9) 9
31.0%
(Missing) 4
13.8%
ValueCountFrequency (%)
7 1
3.4%
16 1
3.4%
19 1
3.4%
21 2
6.9%
22 1
3.4%
27 1
3.4%
28 1
3.4%
30 2
6.9%
31 1
3.4%
35 2
6.9%
ValueCountFrequency (%)
62 1
3.4%
52 1
3.4%
49 1
3.4%
48 2
6.9%
45 2
6.9%
44 2
6.9%
42 1
3.4%
39 1
3.4%
38 1
3.4%
35 2
6.9%

Interactions

2023-12-13T05:46:42.317965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.313285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.642533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.981359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:42.397228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.391521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.737155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:42.066140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:42.474294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.467742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.815697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:42.153760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:42.585982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.560426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:41.894223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:46:42.238420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:46:44.235656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명사업연도입주선정예정인원(명)입주신청인원(명)입주선정인원(명)
문학단체명1.0000.0000.7030.7690.652
사업연도0.0001.0000.5670.0000.450
입주선정예정인원(명)0.7030.5671.0000.9000.895
입주신청인원(명)0.7690.0000.9001.0000.901
입주선정인원(명)0.6520.4500.8950.9011.000
2023-12-13T05:46:44.331597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도입주선정예정인원(명)입주신청인원(명)입주선정인원(명)문학단체명
사업연도1.0000.127-0.152-0.0650.000
입주선정예정인원(명)0.1271.0000.7320.9490.397
입주신청인원(명)-0.1520.7321.0000.8470.447
입주선정인원(명)-0.0650.9490.8471.0000.345
문학단체명0.0000.3970.4470.3451.000

Missing values

2023-12-13T05:46:42.691144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:46:42.804833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:46:42.899259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

문학단체명사업연도입주선정예정인원(명)입주신청인원(명)입주선정인원(명)
0*악**원2014283930
1*을**집2014356638
2*1**학20142814735
3*지**단20144511545
4*날**날2015777
5*악**원201537<NA>31
6*산**꽃2015<NA>5727
7*을**집2015406542
8*1**학20154413349
9*지**단20155611562
문학단체명사업연도입주선정예정인원(명)입주신청인원(명)입주선정인원(명)
19*을**집2017406739
20*버**집2018303921
21*날**날2018193119
22*을**집20184061<NA>
23*1**학201844123<NA>
24*지**단2018459048
25*을**집2019<NA>70<NA>
26*지**단2019479552
27*버**집2019<NA>6028
28*악**원2019<NA>32<NA>