Overview

Dataset statistics

Number of variables6
Number of observations181
Missing cells539
Missing cells (%)49.6%
Duplicate rows9
Duplicate rows (%)5.0%
Total size in memory9.5 KiB
Average record size in memory53.7 B

Variable types

Text1
Numeric5

Dataset

Description2014-2019년 문예진흥기금 공모사업 중 문학 분야 "문예지발간" 지원 사업의 모니터링 실시 현황(예: 모니터링 회의 개최 횟수, 모니터링 추진 실적 변동율, 모니터링 회의 참여 인원 등)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076413/fileData.do

Alerts

Dataset has 9 (5.0%) duplicate rowsDuplicates
금년_모니터링회의개최횟수(회) is highly overall correlated with 작년_모니터링회의개최횟수(회)High correlation
작년_모니터링회의개최횟수(회) is highly overall correlated with 금년_모니터링회의개최횟수(회)High correlation
금년_모니터링회의개최횟수(회) has 134 (74.0%) missing valuesMissing
작년_모니터링회의개최횟수(회) has 134 (74.0%) missing valuesMissing
모니터링추진실적변동율(%) has 134 (74.0%) missing valuesMissing
모니터링회의참여인원(명) has 137 (75.7%) missing valuesMissing
금년_모니터링회의개최횟수(회) has 4 (2.2%) zerosZeros
작년_모니터링회의개최횟수(회) has 5 (2.8%) zerosZeros
모니터링추진실적변동율(%) has 28 (15.5%) zerosZeros
모니터링회의참여인원(명) has 4 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-12 20:44:25.585229
Analysis finished2023-12-12 20:44:28.716236
Duration3.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct62
Distinct (%)34.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T05:44:28.878641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters905
Distinct characters68
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)10.5%

Sample

1st row*제**부
2nd row*국**회
3rd row*1**학
4th row*학**네
5th row*학**상
ValueCountFrequency (%)
국**회 45
24.9%
대**학 8
 
4.4%
제**부 5
 
2.8%
학**네 4
 
2.2%
학**상 4
 
2.2%
비**비 4
 
2.2%
학**사 4
 
2.2%
음**음 4
 
2.2%
년**작 3
 
1.7%
학**당 3
 
1.7%
Other values (52) 97
53.6%
2023-12-13T05:44:29.229077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 543
60.0%
54
 
6.0%
48
 
5.3%
41
 
4.5%
17
 
1.9%
11
 
1.2%
10
 
1.1%
9
 
1.0%
9
 
1.0%
8
 
0.9%
Other values (58) 155
 
17.1%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 543
60.0%
Other Letter 360
39.8%
Decimal Number 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
15.0%
48
 
13.3%
41
 
11.4%
17
 
4.7%
11
 
3.1%
10
 
2.8%
9
 
2.5%
9
 
2.5%
8
 
2.2%
7
 
1.9%
Other values (56) 146
40.6%
Other Punctuation
ValueCountFrequency (%)
* 543
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 545
60.2%
Hangul 360
39.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
15.0%
48
 
13.3%
41
 
11.4%
17
 
4.7%
11
 
3.1%
10
 
2.8%
9
 
2.5%
9
 
2.5%
8
 
2.2%
7
 
1.9%
Other values (56) 146
40.6%
Common
ValueCountFrequency (%)
* 543
99.6%
1 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 545
60.2%
Hangul 360
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 543
99.6%
1 2
 
0.4%
Hangul
ValueCountFrequency (%)
54
 
15.0%
48
 
13.3%
41
 
11.4%
17
 
4.7%
11
 
3.1%
10
 
2.8%
9
 
2.5%
9
 
2.5%
8
 
2.2%
7
 
1.9%
Other values (56) 146
40.6%

사업연도
Real number (ℝ)

Distinct6
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.7624
Minimum2014
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T05:44:29.358467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014
Q12014
median2018
Q32019
95-th percentile2019
Maximum2019
Range5
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.0395867
Coefficient of variation (CV)0.0010113173
Kurtosis-1.5893876
Mean2016.7624
Median Absolute Deviation (MAD)1
Skewness-0.35284142
Sum365034
Variance4.1599141
MonotonicityIncreasing
2023-12-13T05:44:29.473017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2014 51
28.2%
2018 50
27.6%
2019 47
26.0%
2015 14
 
7.7%
2017 13
 
7.2%
2016 6
 
3.3%
ValueCountFrequency (%)
2014 51
28.2%
2015 14
 
7.7%
2016 6
 
3.3%
2017 13
 
7.2%
2018 50
27.6%
2019 47
26.0%
ValueCountFrequency (%)
2019 47
26.0%
2018 50
27.6%
2017 13
 
7.2%
2016 6
 
3.3%
2015 14
 
7.7%
2014 51
28.2%

금년_모니터링회의개최횟수(회)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct13
Distinct (%)27.7%
Missing134
Missing (%)74.0%
Infinite0
Infinite (%)0.0%
Mean8.1914894
Minimum0
Maximum52
Zeros4
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T05:44:29.575157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14
median4
Q310
95-th percentile31.1
Maximum52
Range52
Interquartile range (IQR)6

Descriptive statistics

Standard deviation10.330702
Coefficient of variation (CV)1.2611506
Kurtosis9.2648722
Mean8.1914894
Median Absolute Deviation (MAD)2
Skewness2.9679157
Sum385
Variance106.7234
MonotonicityNot monotonic
2023-12-13T05:44:29.687069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
4 18
 
9.9%
12 7
 
3.9%
6 6
 
3.3%
0 4
 
2.2%
5 3
 
1.7%
10 2
 
1.1%
2 1
 
0.6%
35 1
 
0.6%
43 1
 
0.6%
1 1
 
0.6%
Other values (3) 3
 
1.7%
(Missing) 134
74.0%
ValueCountFrequency (%)
0 4
 
2.2%
1 1
 
0.6%
2 1
 
0.6%
3 1
 
0.6%
4 18
9.9%
5 3
 
1.7%
6 6
 
3.3%
10 2
 
1.1%
12 7
 
3.9%
22 1
 
0.6%
ValueCountFrequency (%)
52 1
 
0.6%
43 1
 
0.6%
35 1
 
0.6%
22 1
 
0.6%
12 7
 
3.9%
10 2
 
1.1%
6 6
 
3.3%
5 3
 
1.7%
4 18
9.9%
3 1
 
0.6%

작년_모니터링회의개최횟수(회)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct13
Distinct (%)27.7%
Missing134
Missing (%)74.0%
Infinite0
Infinite (%)0.0%
Mean7.4255319
Minimum0
Maximum52
Zeros5
Zeros (%)2.8%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T05:44:29.819638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13
median4
Q37
95-th percentile31.2
Maximum52
Range52
Interquartile range (IQR)4

Descriptive statistics

Standard deviation10.530999
Coefficient of variation (CV)1.4182148
Kurtosis9.4167674
Mean7.4255319
Median Absolute Deviation (MAD)2
Skewness3.0221016
Sum349
Variance110.90194
MonotonicityNot monotonic
2023-12-13T05:44:30.062372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
4 19
 
10.5%
12 6
 
3.3%
2 6
 
3.3%
0 5
 
2.8%
3 3
 
1.7%
10 1
 
0.6%
36 1
 
0.6%
43 1
 
0.6%
5 1
 
0.6%
6 1
 
0.6%
Other values (3) 3
 
1.7%
(Missing) 134
74.0%
ValueCountFrequency (%)
0 5
 
2.8%
2 6
 
3.3%
3 3
 
1.7%
4 19
10.5%
5 1
 
0.6%
6 1
 
0.6%
8 1
 
0.6%
10 1
 
0.6%
12 6
 
3.3%
20 1
 
0.6%
ValueCountFrequency (%)
52 1
 
0.6%
43 1
 
0.6%
36 1
 
0.6%
20 1
 
0.6%
12 6
 
3.3%
10 1
 
0.6%
8 1
 
0.6%
6 1
 
0.6%
5 1
 
0.6%
4 19
10.5%

모니터링추진실적변동율(%)
Real number (ℝ)

MISSING  ZEROS 

Distinct9
Distinct (%)19.1%
Missing134
Missing (%)74.0%
Infinite0
Infinite (%)0.0%
Mean24.797872
Minimum-2.8
Maximum150
Zeros28
Zeros (%)15.5%
Negative1
Negative (%)0.6%
Memory size1.7 KiB
2023-12-13T05:44:30.177702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2.8
5-th percentile0
Q10
median0
Q350
95-th percentile100
Maximum150
Range152.8
Interquartile range (IQR)50

Descriptive statistics

Standard deviation38.588574
Coefficient of variation (CV)1.5561244
Kurtosis1.5166101
Mean24.797872
Median Absolute Deviation (MAD)0
Skewness1.5112957
Sum1165.5
Variance1489.078
MonotonicityNot monotonic
2023-12-13T05:44:30.299384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0.0 28
 
15.5%
50.0 7
 
3.9%
100.0 5
 
2.8%
33.3 2
 
1.1%
-2.8 1
 
0.6%
25.0 1
 
0.6%
10.0 1
 
0.6%
66.7 1
 
0.6%
150.0 1
 
0.6%
(Missing) 134
74.0%
ValueCountFrequency (%)
-2.8 1
 
0.6%
0.0 28
15.5%
10.0 1
 
0.6%
25.0 1
 
0.6%
33.3 2
 
1.1%
50.0 7
 
3.9%
66.7 1
 
0.6%
100.0 5
 
2.8%
150.0 1
 
0.6%
ValueCountFrequency (%)
150.0 1
 
0.6%
100.0 5
 
2.8%
66.7 1
 
0.6%
50.0 7
 
3.9%
33.3 2
 
1.1%
25.0 1
 
0.6%
10.0 1
 
0.6%
0.0 28
15.5%
-2.8 1
 
0.6%

모니터링회의참여인원(명)
Real number (ℝ)

MISSING  ZEROS 

Distinct21
Distinct (%)47.7%
Missing137
Missing (%)75.7%
Infinite0
Infinite (%)0.0%
Mean18.363636
Minimum0
Maximum100
Zeros4
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T05:44:30.440469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16.5
median9.5
Q315.5
95-th percentile77.9
Maximum100
Range100
Interquartile range (IQR)9

Descriptive statistics

Standard deviation24.432242
Coefficient of variation (CV)1.3304686
Kurtosis5.0486926
Mean18.363636
Median Absolute Deviation (MAD)5
Skewness2.3851185
Sum808
Variance596.93446
MonotonicityNot monotonic
2023-12-13T05:44:30.581177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
8 5
 
2.8%
4 4
 
2.2%
0 4
 
2.2%
12 3
 
1.7%
7 3
 
1.7%
9 3
 
1.7%
5 3
 
1.7%
15 3
 
1.7%
10 3
 
1.7%
13 2
 
1.1%
Other values (11) 11
 
6.1%
(Missing) 137
75.7%
ValueCountFrequency (%)
0 4
2.2%
4 4
2.2%
5 3
1.7%
7 3
1.7%
8 5
2.8%
9 3
1.7%
10 3
1.7%
12 3
1.7%
13 2
 
1.1%
15 3
1.7%
ValueCountFrequency (%)
100 1
0.6%
99 1
0.6%
80 1
0.6%
66 1
0.6%
60 1
0.6%
31 1
0.6%
30 1
0.6%
26 1
0.6%
23 1
0.6%
20 1
0.6%

Interactions

2023-12-13T05:44:28.039406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.143276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.689756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.167830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.639772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:28.119181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.261016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.791926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.263305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.724954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:28.191146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.371426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.881305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.348508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.800695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:28.264603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.480492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.972138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.437073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.877720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:28.337702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:26.595569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.083431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.551852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:44:27.962515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:44:30.674642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명사업연도금년_모니터링회의개최횟수(회)작년_모니터링회의개최횟수(회)모니터링추진실적변동율(%)모니터링회의참여인원(명)
문학단체명1.0000.0000.9810.9790.7840.000
사업연도0.0001.000NaNNaNNaNNaN
금년_모니터링회의개최횟수(회)0.981NaN1.0000.9960.0000.000
작년_모니터링회의개최횟수(회)0.979NaN0.9961.0000.0000.000
모니터링추진실적변동율(%)0.784NaN0.0000.0001.0000.258
모니터링회의참여인원(명)0.000NaN0.0000.0000.2581.000
2023-12-13T05:44:30.811803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도금년_모니터링회의개최횟수(회)작년_모니터링회의개최횟수(회)모니터링추진실적변동율(%)모니터링회의참여인원(명)
사업연도1.000NaNNaNNaNNaN
금년_모니터링회의개최횟수(회)NaN1.0000.897-0.0530.173
작년_모니터링회의개최횟수(회)NaN0.8971.000-0.4210.186
모니터링추진실적변동율(%)NaN-0.053-0.4211.0000.027
모니터링회의참여인원(명)NaN0.1730.1860.0271.000

Missing values

2023-12-13T05:44:28.428735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:44:28.522517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:44:28.630913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

문학단체명사업연도금년_모니터링회의개최횟수(회)작년_모니터링회의개최횟수(회)모니터링추진실적변동율(%)모니터링회의참여인원(명)
0*제**부2014<NA><NA><NA><NA>
1*국**회2014<NA><NA><NA><NA>
2*1**학2014<NA><NA><NA><NA>
3*학**네2014<NA><NA><NA><NA>
4*학**상2014<NA><NA><NA><NA>
5*음**사2014<NA><NA><NA><NA>
6*천**학2014<NA><NA><NA><NA>
7*행**사2014<NA><NA><NA><NA>
8*년**작2014<NA><NA><NA><NA>
9*간**선2014<NA><NA><NA><NA>
문학단체명사업연도금년_모니터링회의개최횟수(회)작년_모니터링회의개최횟수(회)모니터링추진실적변동율(%)모니터링회의참여인원(명)
171*서**망2019222010.08
172*와**시201942100.020
173*국**회20195366.730
174*시**아2019104150.012
175*국**회2019000.00
176*행**사201912120.0<NA>
177*국**연201942100.07
178*대**학201912120.09
179*국**회2019440.013
180*천**학20196450.015

Duplicate rows

Most frequently occurring

문학단체명사업연도금년_모니터링회의개최횟수(회)작년_모니터링회의개최횟수(회)모니터링추진실적변동율(%)모니터링회의참여인원(명)# duplicates
0*국**회2014<NA><NA><NA><NA>10
3*국**회2017<NA><NA><NA><NA>10
4*국**회2018<NA><NA><NA><NA>10
2*국**회2016<NA><NA><NA><NA>5
1*국**회2015<NA><NA><NA><NA>2
5*국**회2019000.002
6*대**학2014<NA><NA><NA><NA>2
7*대**학2015<NA><NA><NA><NA>2
8*대**학2018<NA><NA><NA><NA>2