Overview

Dataset statistics

Number of variables11
Number of observations37
Missing cells199
Missing cells (%)48.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory93.6 B

Variable types

Text1
Categorical1
DateTime3
Boolean5
Numeric1

Dataset

Description2014, 2015, 2018, 2019년 문예진흥기금 공모사업 중 문학 분야 "문학행사 및 연구" 지원 사업의 세부내용(연구사업, 예: 연구시작일, 연구종료일, 부대행사 참여자 수 등)
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15076466/fileData.do

Alerts

부대행사성격_워크숍 has constant value ""Constant
부대행사성격_기타 has constant value ""Constant
사업연도 is highly overall correlated with 부대행사성격_세미나 and 2 other fieldsHigh correlation
부대행사성격_세미나 is highly overall correlated with 사업연도High correlation
부대행사성격_간담회 is highly overall correlated with 사업연도High correlation
부대행사성격_심포지엄 is highly overall correlated with 사업연도High correlation
연구시작일 has 19 (51.4%) missing valuesMissing
연구종료일 has 19 (51.4%) missing valuesMissing
부대행사성격_세미나 has 31 (83.8%) missing valuesMissing
부대행사성격_심포지엄 has 31 (83.8%) missing valuesMissing
부대행사성격_워크숍 has 31 (83.8%) missing valuesMissing
부대행사성격_간담회 has 31 (83.8%) missing valuesMissing
부대행사성격_기타 has 31 (83.8%) missing valuesMissing
부대행사일시 has 6 (16.2%) missing valuesMissing
부대행사참여자수(명) has 5 (13.5%) zerosZeros

Reproduction

Analysis started2023-12-12 06:13:02.460568
Analysis finished2023-12-12 06:13:03.449713
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct19
Distinct (%)51.4%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-12T15:13:03.545310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters185
Distinct characters28
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)37.8%

Sample

1st row*국**회
2nd row*린**회
3rd row*린**회
4th row*국**회
5th row*오**촌
ValueCountFrequency (%)
국**회 13
35.1%
주**의 3
 
8.1%
린**회 3
 
8.1%
b**회 2
 
5.4%
국**관 2
 
5.4%
소**회 1
 
2.7%
청**회 1
 
2.7%
린**대 1
 
2.7%
림**회 1
 
2.7%
조**사 1
 
2.7%
Other values (9) 9
24.3%
2023-12-12T15:13:03.806209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 111
60.0%
22
 
11.9%
15
 
8.1%
4
 
2.2%
3
 
1.6%
3
 
1.6%
3
 
1.6%
B 2
 
1.1%
2
 
1.1%
2
 
1.1%
Other values (18) 18
 
9.7%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 111
60.0%
Other Letter 72
38.9%
Uppercase Letter 2
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
30.6%
15
20.8%
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
Other values (16) 16
22.2%
Other Punctuation
ValueCountFrequency (%)
* 111
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 111
60.0%
Hangul 72
38.9%
Latin 2
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
30.6%
15
20.8%
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
Other values (16) 16
22.2%
Common
ValueCountFrequency (%)
* 111
100.0%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 113
61.1%
Hangul 72
38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 111
98.2%
B 2
 
1.8%
Hangul
ValueCountFrequency (%)
22
30.6%
15
20.8%
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
Other values (16) 16
22.2%

사업연도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size428.0 B
2014
18 
2015
2019
2018

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2014
2nd row2014
3rd row2014
4th row2014
5th row2014

Common Values

ValueCountFrequency (%)
2014 18
48.6%
2015 9
24.3%
2019 6
 
16.2%
2018 4
 
10.8%

Length

2023-12-12T15:13:03.929280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:13:04.027180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014 18
48.6%
2015 9
24.3%
2019 6
 
16.2%
2018 4
 
10.8%

연구시작일
Date

MISSING 

Distinct14
Distinct (%)77.8%
Missing19
Missing (%)51.4%
Memory size428.0 B
Minimum2013-10-30 00:00:00
Maximum2019-05-01 00:00:00
2023-12-12T15:13:04.122705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:13:04.218899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)

연구종료일
Date

MISSING 

Distinct11
Distinct (%)61.1%
Missing19
Missing (%)51.4%
Memory size428.0 B
Minimum2014-10-10 00:00:00
Maximum2019-12-31 00:00:00
2023-12-12T15:13:04.312128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:13:04.403778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)

부대행사성격_세미나
Boolean

HIGH CORRELATION  MISSING 

Distinct2
Distinct (%)33.3%
Missing31
Missing (%)83.8%
Memory size206.0 B
False
True
 
2
(Missing)
31 
ValueCountFrequency (%)
False 4
 
10.8%
True 2
 
5.4%
(Missing) 31
83.8%
2023-12-12T15:13:04.492825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

부대행사성격_심포지엄
Boolean

HIGH CORRELATION  MISSING 

Distinct2
Distinct (%)33.3%
Missing31
Missing (%)83.8%
Memory size206.0 B
True
False
 
1
(Missing)
31 
ValueCountFrequency (%)
True 5
 
13.5%
False 1
 
2.7%
(Missing) 31
83.8%
2023-12-12T15:13:04.580457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

부대행사성격_워크숍
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)16.7%
Missing31
Missing (%)83.8%
Memory size206.0 B
False
(Missing)
31 
ValueCountFrequency (%)
False 6
 
16.2%
(Missing) 31
83.8%
2023-12-12T15:13:04.665948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

부대행사성격_간담회
Boolean

HIGH CORRELATION  MISSING 

Distinct2
Distinct (%)33.3%
Missing31
Missing (%)83.8%
Memory size206.0 B
False
True
 
1
(Missing)
31 
ValueCountFrequency (%)
False 5
 
13.5%
True 1
 
2.7%
(Missing) 31
83.8%
2023-12-12T15:13:04.759842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

부대행사성격_기타
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)16.7%
Missing31
Missing (%)83.8%
Memory size206.0 B
False
(Missing)
31 
ValueCountFrequency (%)
False 6
 
16.2%
(Missing) 31
83.8%
2023-12-12T15:13:04.873603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

부대행사일시
Date

MISSING 

Distinct31
Distinct (%)100.0%
Missing6
Missing (%)16.2%
Memory size428.0 B
Minimum2014-05-10 00:00:00
Maximum2019-12-07 00:00:00
2023-12-12T15:13:04.995973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:13:05.131568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)

부대행사참여자수(명)
Real number (ℝ)

ZEROS 

Distinct26
Distinct (%)70.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean378.67568
Minimum0
Maximum1360
Zeros5
Zeros (%)13.5%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-12T15:13:05.533071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1103
median203
Q3865
95-th percentile991.8
Maximum1360
Range1360
Interquartile range (IQR)762

Descriptive statistics

Standard deviation396.98979
Coefficient of variation (CV)1.0483636
Kurtosis-0.49957276
Mean378.67568
Median Absolute Deviation (MAD)111
Skewness0.9944744
Sum14011
Variance157600.89
MonotonicityNot monotonic
2023-12-12T15:13:05.674010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
0 5
 
13.5%
150 4
 
10.8%
203 3
 
8.1%
100 2
 
5.4%
308 2
 
5.4%
995 1
 
2.7%
21 1
 
2.7%
92 1
 
2.7%
120 1
 
2.7%
103 1
 
2.7%
Other values (16) 16
43.2%
ValueCountFrequency (%)
0 5
13.5%
21 1
 
2.7%
92 1
 
2.7%
100 2
 
5.4%
103 1
 
2.7%
120 1
 
2.7%
129 1
 
2.7%
132 1
 
2.7%
150 4
10.8%
171 1
 
2.7%
ValueCountFrequency (%)
1360 1
2.7%
995 1
2.7%
991 1
2.7%
988 1
2.7%
986 1
2.7%
980 1
2.7%
967 1
2.7%
890 1
2.7%
875 1
2.7%
865 1
2.7%

Interactions

2023-12-12T15:13:02.840387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:13:05.796271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문학단체명사업연도연구시작일연구종료일부대행사성격_세미나부대행사성격_심포지엄부대행사성격_간담회부대행사일시부대행사참여자수(명)
문학단체명1.0000.0000.9310.9581.0001.0001.0001.0000.000
사업연도0.0001.0001.0001.000NaNNaNNaN1.0000.430
연구시작일0.9311.0001.0000.9211.0000.0000.0001.0000.962
연구종료일0.9581.0000.9211.0000.3461.0000.0001.0000.796
부대행사성격_세미나1.000NaN1.0000.3461.0000.0000.0001.0000.000
부대행사성격_심포지엄1.000NaN0.0001.0000.0001.0000.0001.0000.000
부대행사성격_간담회1.000NaN0.0000.0000.0000.0001.0001.0000.000
부대행사일시1.0001.0001.0001.0001.0001.0001.0001.0001.000
부대행사참여자수(명)0.0000.4300.9620.7960.0000.0000.0001.0001.000
2023-12-12T15:13:05.940516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도부대행사성격_세미나부대행사성격_간담회부대행사성격_심포지엄
사업연도1.0001.0001.0001.000
부대행사성격_세미나1.0001.0000.0000.000
부대행사성격_간담회1.0000.0001.0000.000
부대행사성격_심포지엄1.0000.0000.0001.000
2023-12-12T15:13:06.033843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부대행사참여자수(명)사업연도부대행사성격_세미나부대행사성격_심포지엄부대행사성격_간담회
부대행사참여자수(명)1.0000.2880.0000.0000.000
사업연도0.2881.0001.0001.0001.000
부대행사성격_세미나0.0001.0001.0000.0000.000
부대행사성격_심포지엄0.0001.0000.0001.0000.000
부대행사성격_간담회0.0001.0000.0000.0001.000

Missing values

2023-12-12T15:13:03.015224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:13:03.194189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:13:03.341543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

문학단체명사업연도연구시작일연구종료일부대행사성격_세미나부대행사성격_심포지엄부대행사성격_워크숍부대행사성격_간담회부대행사성격_기타부대행사일시부대행사참여자수(명)
0*국**회2014<NA><NA><NA><NA><NA><NA><NA>2014-06-03995
1*린**회20142014-01-012014-12-31<NA><NA><NA><NA><NA>2014-07-05203
2*린**회20142014-01-012014-12-31<NA><NA><NA><NA><NA>2014-08-11203
3*국**회2014<NA><NA><NA><NA><NA><NA><NA>2014-06-28967
4*오**촌20142014-03-042014-11-30<NA><NA><NA><NA><NA>2014-08-30875
5*서**요2014<NA><NA><NA><NA><NA><NA><NA><NA>0
6*국**회2014<NA><NA><NA><NA><NA><NA><NA>2014-06-27132
7*우**터2014<NA><NA><NA><NA><NA><NA><NA>2014-10-04991
8*국**회2014<NA><NA><NA><NA><NA><NA><NA>2014-08-09980
9*국**회2014<NA><NA><NA><NA><NA><NA><NA><NA>1360
문학단체명사업연도연구시작일연구종료일부대행사성격_세미나부대행사성격_심포지엄부대행사성격_워크숍부대행사성격_간담회부대행사성격_기타부대행사일시부대행사참여자수(명)
27*국**회20182018-05-012018-12-31<NA><NA><NA><NA><NA><NA>204
28*B**회20182018-06-012018-12-31<NA><NA><NA><NA><NA>2018-11-17103
29*국**회20182018-07-172018-12-31<NA><NA><NA><NA><NA>2018-08-30150
30*주**의20182018-09-012018-12-31<NA><NA><NA><NA><NA><NA>0
31*B**회20192019-02-012019-10-31YYNYN2019-11-16100
32*림**회20192019-02-012019-12-07YNNNN2019-12-07120
33*주**의20192019-05-012019-12-31NYNNN2019-06-15150
34*주**의20192019-05-012019-12-31NYNNN2019-09-07150
35*린**대20192019-03-012019-10-31NYNNN2019-09-220
36*지**션2019<NA><NA>NYNNN2019-06-2992