Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 181 |
Missing cells | 288 |
Missing cells (%) | 19.9% |
Duplicate rows | 5 |
Duplicate rows (%) | 2.8% |
Total size in memory | 12.3 KiB |
Average record size in memory | 69.7 B |
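The percentages in the table above can be recovered from the raw counts. The following is a minimal sketch of that arithmetic, assuming missing cells are divided by the total cell count (observations × variables) and duplicate rows by the number of observations. The variable names are illustrative and do not come from the report.

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 181
missing_cells = 288
duplicate_rows = 5

# Assumption: percentages are taken over all cells and all rows respectively.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
```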
Variable types
Text | 1 |
---|---|
Numeric | 5 |
Categorical | 2 |
Dataset
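Description | Annual publication records of literary magazines (e.g., publication cycle, issues per year, annual production cost) funded by the "문예지발간" (literary magazine publication) support program in the literature category of the 문예진흥기금 (Culture and Arts Promotion Fund) open-call grants, 2014-2019 |
---|---|
Author | 한국문화예술위원회 (Arts Council Korea) |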
URL | https://www.data.go.kr/data/15076420/fileData.do |
Alerts
Dataset has 5 (2.8%) duplicate rows | Duplicates |
종이책_연간발간횟수(회) is highly overall correlated with 종이책_연간제작비총액(원) and 1 other field | High correlation |
종이책_연간제작비총액(원) is highly overall correlated with 종이책_연간발간횟수(회) | High correlation |
전자책웹진_연간발간횟수(회) is highly overall correlated with 전자책웹진_연간제작비총액(원) and 1 other field | High correlation |
전자책웹진_연간제작비총액(원) is highly overall correlated with 전자책웹진_연간발간횟수(회) and 1 other field | High correlation |
종이책_주기 is highly overall correlated with 종이책_연간발간횟수(회) | High correlation |
전자책웹진_주기 is highly overall correlated with 전자책웹진_연간발간횟수(회) and 1 other field | High correlation |
종이책_연간발간횟수(회) has 71 (39.2%) missing values | Missing |
종이책_연간제작비총액(원) has 72 (39.8%) missing values | Missing |
전자책웹진_연간발간횟수(회) has 71 (39.2%) missing values | Missing |
전자책웹진_연간제작비총액(원) has 74 (40.9%) missing values | Missing |
전자책웹진_연간발간횟수(회) has 96 (53.0%) zeros | Zeros |
전자책웹진_연간제작비총액(원) has 96 (53.0%) zeros | Zeros |
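As a consistency check, the per-column missing counts listed above can be reconciled with the dataset-level "Missing cells" total. The sketch below is plain arithmetic on the reported numbers. The dictionary name and the derived share of zeros among non-missing values are illustrative additions, not figures taken from the report.

```python
# Missing-value counts per column, copied from the "Missing" alerts.
missing_by_column = {
    "종이책_연간발간횟수(회)": 71,
    "종이책_연간제작비총액(원)": 72,
    "전자책웹진_연간발간횟수(회)": 71,
    "전자책웹진_연간제작비총액(원)": 74,
}
n_observations = 181

# Sum of the per-column counts; compare with "Missing cells" (288).
total_missing = sum(missing_by_column.values())
print(total_missing)

# Derived figure (not in the report): zeros as a share of the values that
# are actually present in 전자책웹진_연간발간횟수(회).
zeros = 96
present = n_observations - missing_by_column["전자책웹진_연간발간횟수(회)"]
zero_share_of_present = 100 * zeros / present
print(f"{zero_share_of_present:.1f}% of {present} non-missing values are zero")
```

The totals agree only because the two 주기 columns report 0 missing values: their 71 `<NA>` entries are counted as a category rather than as missing cells.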
Reproduction
Analysis started | 2023-12-12 14:09:37.743219 |
---|---|
Analysis finished | 2023-12-12 14:09:41.375577 |
Duration | 3.63 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
문학단체명
Text
Distinct | 62 |
---|---|
Distinct (%) | 34.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.5 KiB |
Value | Count | Frequency (%) |
국**회 | 45 | 24.9% |
대**학 | 8 | 4.4% |
제**부 | 5 | 2.8% |
학**네 | 4 | 2.2% |
학**상 | 4 | 2.2% |
비**비 | 4 | 2.2% |
학**사 | 4 | 2.2% |
음**음 | 4 | 2.2% |
년**작 | 3 | 1.7% |
학**당 | 3 | 1.7% |
Other values (52) | 97 | 53.6% |
Most occurring characters
Value | Count | Frequency (%) |
* | 543 | 60.0% |
국 | 54 | 6.0% |
회 | 48 | 5.3% |
학 | 41 | 4.5% |
사 | 17 | 1.9% |
음 | 11 | 1.2% |
대 | 10 | 1.1% |
시 | 9 | 1.0% |
서 | 9 | 1.0% |
비 | 8 | 0.9% |
Other values (58) | 155 | 17.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Punctuation | 543 | 60.0% |
Other Letter | 360 | 39.8% |
Decimal Number | 2 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
국 | 54 | 15.0% |
회 | 48 | 13.3% |
학 | 41 | 11.4% |
사 | 17 | 4.7% |
음 | 11 | 3.1% |
대 | 10 | 2.8% |
시 | 9 | 2.5% |
서 | 9 | 2.5% |
비 | 8 | 2.2% |
린 | 7 | 1.9% |
Other values (56) | 146 | 40.6% |
Other Punctuation
Value | Count | Frequency (%) |
* | 543 | 100.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 545 | 60.2% |
Hangul | 360 | 39.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
국 | 54 | 15.0% |
회 | 48 | 13.3% |
학 | 41 | 11.4% |
사 | 17 | 4.7% |
음 | 11 | 3.1% |
대 | 10 | 2.8% |
시 | 9 | 2.5% |
서 | 9 | 2.5% |
비 | 8 | 2.2% |
린 | 7 | 1.9% |
Other values (56) | 146 | 40.6% |
Common
Value | Count | Frequency (%) |
* | 543 | 99.6% |
1 | 2 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 545 | 60.2% |
Hangul | 360 | 39.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 543 | 99.6% |
1 | 2 | 0.4% |
Hangul
Value | Count | Frequency (%) |
국 | 54 | 15.0% |
회 | 48 | 13.3% |
학 | 41 | 11.4% |
사 | 17 | 4.7% |
음 | 11 | 3.1% |
대 | 10 | 2.8% |
시 | 9 | 2.5% |
서 | 9 | 2.5% |
비 | 8 | 2.2% |
린 | 7 | 1.9% |
Other values (56) | 146 | 40.6% |
사업연도
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2016.7624 |
Minimum | 2014 |
---|---|
Maximum | 2019 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 KiB |
Quantile statistics
Minimum | 2014 |
---|---|
5-th percentile | 2014 |
Q1 | 2014 |
median | 2018 |
Q3 | 2019 |
95-th percentile | 2019 |
Maximum | 2019 |
Range | 5 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.0395867 |
---|---|
Coefficient of variation (CV) | 0.0010113173 |
Kurtosis | -1.5893876 |
Mean | 2016.7624 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.35284142 |
Sum | 365034 |
Variance | 4.1599141 |
Monotonicity | Increasing |
Common values
Value | Count | Frequency (%) |
2014 | 51 | 28.2% |
2018 | 50 | 27.6% |
2019 | 47 | 26.0% |
2015 | 14 | 7.7% |
2017 | 13 | 7.2% |
2016 | 6 | 3.3% |
Minimum 10 values
Value | Count | Frequency (%) |
2014 | 51 | 28.2% |
2015 | 14 | 7.7% |
2016 | 6 | 3.3% |
2017 | 13 | 7.2% |
2018 | 50 | 27.6% |
2019 | 47 | 26.0% |
Maximum 10 values
Value | Count | Frequency (%) |
2019 | 47 | 26.0% |
2018 | 50 | 27.6% |
2017 | 13 | 7.2% |
2016 | 6 | 3.3% |
2015 | 14 | 7.7% |
2014 | 51 | 28.2% |
종이책_주기
Categorical
HIGH CORRELATION
Distinct | 5 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.5 KiB |
<NA> | 71 |
---|---|
계간 | 65 |
월간 | 17 |
반년간 | 16 |
격월간 | 12 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 2.9392265 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 71 | 39.2% |
계간 | 65 | 35.9% |
월간 | 17 | 9.4% |
반년간 | 16 | 8.8% |
격월간 | 12 | 6.6% |
종이책_연간발간횟수(회)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 6 |
---|---|
Distinct (%) | 5.5% |
Missing | 71 |
Missing (%) | 39.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.0181818 |
Minimum | 2 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2 |
Q1 | 4 |
median | 4 |
Q3 | 5.5 |
95-th percentile | 12 |
Maximum | 12 |
Range | 10 |
Interquartile range (IQR) | 1.5 |
Descriptive statistics
Standard deviation | 2.936583 |
---|---|
Coefficient of variation (CV) | 0.58518864 |
Kurtosis | 1.4704865 |
Mean | 5.0181818 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.5979992 |
Sum | 552 |
Variance | 8.6235196 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
4 | 64 | 35.4% |
2 | 16 | 8.8% |
12 | 14 | 7.7% |
6 | 11 | 6.1% |
8 | 3 | 1.7% |
3 | 2 | 1.1% |
(Missing) | 71 | 39.2% |
Minimum 10 values
Value | Count | Frequency (%) |
2 | 16 | 8.8% |
3 | 2 | 1.1% |
4 | 64 | 35.4% |
6 | 11 | 6.1% |
8 | 3 | 1.7% |
12 | 14 | 7.7% |
Maximum 10 values
Value | Count | Frequency (%) |
12 | 14 | 7.7% |
8 | 3 | 1.7% |
6 | 11 | 6.1% |
4 | 64 | 35.4% |
3 | 2 | 1.1% |
2 | 16 | 8.8% |
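Because this variable has only six distinct values, its summary statistics can be recomputed from the frequency table alone. The sketch below uses only the Python standard library and assumes a sample variance (n - 1 denominator) and linearly interpolated quartiles, choices that are consistent with the reported figures but are not stated in the report itself. `value_counts` is an illustrative name.

```python
import statistics

# Value -> count, copied from the frequency table for
# 종이책_연간발간횟수(회); the 71 missing rows are excluded.
value_counts = {4: 64, 2: 16, 12: 14, 6: 11, 8: 3, 3: 2}

# Expand the frequency table back into a sorted list of observations.
values = sorted(v for v, c in value_counts.items() for _ in range(c))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(len(values), total)   # 110 552
print(round(mean, 7))       # 5.0181818
print(round(variance, 7))   # 8.6235196
print(round(cv, 6))
print(q1, median, q3)       # 4.0 4.0 5.5
```

The recomputed sum, mean, variance, and quartiles line up with the Sum, Mean, Variance, Q1, median, and Q3 rows reported for this variable.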
종이책_연간제작비총액(원)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 92 |
---|---|
Distinct (%) | 84.4% |
Missing | 72 |
Missing (%) | 39.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 63559351 |
Minimum | 7800000 |
---|---|
Maximum | 2.7009535 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 KiB |
Quantile statistics
Minimum | 7800000 |
---|---|
5-th percentile | 18584800 |
Q1 | 28000000 |
median | 40000000 |
Q3 | 80000000 |
95-th percentile | 1.8744 × 10^8 |
Maximum | 2.7009535 × 10^8 |
Range | 2.6229535 × 10^8 |
Interquartile range (IQR) | 52000000 |
Descriptive statistics
Standard deviation | 55805942 |
---|---|
Coefficient of variation (CV) | 0.87801309 |
Kurtosis | 3.0232201 |
Mean | 63559351 |
Median Absolute Deviation (MAD) | 15600000 |
Skewness | 1.8224337 |
Sum | 6.9279693 × 10^9 |
Variance | 3.1143032 × 10^15 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
40000000 | 3 | 1.7% |
29400000 | 2 | 1.1% |
28000000 | 2 | 1.1% |
26000000 | 2 | 1.1% |
40500000 | 2 | 1.1% |
112400000 | 2 | 1.1% |
38400000 | 2 | 1.1% |
44000000 | 2 | 1.1% |
34800000 | 2 | 1.1% |
96000000 | 2 | 1.1% |
Other values (82) | 88 | 48.6% |
(Missing) | 72 | 39.8% |
Minimum 10 values
Value | Count | Frequency (%) |
7800000 | 1 | 0.6% |
12000000 | 1 | 0.6% |
13000000 | 1 | 0.6% |
16600000 | 1 | 0.6% |
17400000 | 1 | 0.6% |
18308000 | 1 | 0.6% |
19000000 | 1 | 0.6% |
19744000 | 1 | 0.6% |
20000000 | 1 | 0.6% |
21000000 | 2 | 1.1% |
Maximum 10 values
Value | Count | Frequency (%) |
270095352 | 1 | 0.6% |
254876832 | 1 | 0.6% |
247841796 | 1 | 0.6% |
193808000 | 1 | 0.6% |
190000000 | 1 | 0.6% |
188400000 | 1 | 0.6% |
186000000 | 1 | 0.6% |
184000000 | 1 | 0.6% |
182400000 | 1 | 0.6% |
150000000 | 1 | 0.6% |
전자책웹진_주기
Categorical
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.5 KiB |
미분류 | 96 |
---|---|
<NA> | 71 |
계간 | 10 |
월간 | 2 |
반년간 | 1 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.3259669 |
Min length | 2 |
Unique
Unique | 2 |
---|---|
Unique (%) | 1.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
미분류 | 96 | 53.0% |
<NA> | 71 | 39.2% |
계간 | 10 | 5.5% |
월간 | 2 | 1.1% |
반년간 | 1 | 0.6% |
격월간 | 1 | 0.6% |
전자책웹진_연간발간횟수(회)
Real number (ℝ)
HIGH CORRELATION
MISSING
ZEROS
Distinct | 6 |
---|---|
Distinct (%) | 5.5% |
Missing | 71 |
Missing (%) | 39.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.64545455 |
Minimum | 0 |
---|---|
Maximum | 12 |
Zeros | 96 |
Zeros (%) | 53.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4 |
Maximum | 12 |
Range | 12 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.9981017 |
---|---|
Coefficient of variation (CV) | 3.0956505 |
Kurtosis | 18.356677 |
Mean | 0.64545455 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.9952305 |
Sum | 71 |
Variance | 3.9924103 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 96 | 53.0% |
4 | 9 | 5.0% |
12 | 2 | 1.1% |
3 | 1 | 0.6% |
2 | 1 | 0.6% |
6 | 1 | 0.6% |
(Missing) | 71 | 39.2% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 96 | 53.0% |
2 | 1 | 0.6% |
3 | 1 | 0.6% |
4 | 9 | 5.0% |
6 | 1 | 0.6% |
12 | 2 | 1.1% |
Maximum 10 values
Value | Count | Frequency (%) |
12 | 2 | 1.1% |
6 | 1 | 0.6% |
4 | 9 | 5.0% |
3 | 1 | 0.6% |
2 | 1 | 0.6% |
0 | 96 | 53.0% |
전자책웹진_연간제작비총액(원)
Real number (ℝ)
HIGH CORRELATION
MISSING
ZEROS
Distinct | 11 |
---|---|
Distinct (%) | 10.3% |
Missing | 74 |
Missing (%) | 40.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4220484.1 |
Minimum | 0 |
---|---|
Maximum | 2.478418 × 10^8 |
Zeros | 96 |
Zeros (%) | 53.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 15282000 |
Maximum | 2.478418 × 10^8 |
Range | 2.478418 × 10^8 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 25381183 |
---|---|
Coefficient of variation (CV) | 6.0138086 |
Kurtosis | 82.078935 |
Mean | 4220484.1 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.6993272 |
Sum | 4.515918 × 10^8 |
Variance | 6.4420447 × 10^14 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 96 | 53.0% |
5440000 | 2 | 1.1% |
63600000 | 1 | 0.6% |
247841796 | 1 | 0.6% |
450000 | 1 | 0.6% |
42000000 | 1 | 0.6% |
19500000 | 1 | 0.6% |
1200000 | 1 | 0.6% |
20000000 | 1 | 0.6% |
46000000 | 1 | 0.6% |
(Missing) | 74 | 40.9% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 96 | 53.0% |
120000 | 1 | 0.6% |
450000 | 1 | 0.6% |
1200000 | 1 | 0.6% |
5440000 | 2 | 1.1% |
19500000 | 1 | 0.6% |
20000000 | 1 | 0.6% |
42000000 | 1 | 0.6% |
46000000 | 1 | 0.6% |
63600000 | 1 | 0.6% |
Maximum 10 values
Value | Count | Frequency (%) |
247841796 | 1 | 0.6% |
63600000 | 1 | 0.6% |
46000000 | 1 | 0.6% |
42000000 | 1 | 0.6% |
20000000 | 1 | 0.6% |
19500000 | 1 | 0.6% |
5440000 | 2 | 1.1% |
1200000 | 1 | 0.6% |
450000 | 1 | 0.6% |
120000 | 1 | 0.6% |
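The "Distinct (%)" figures for the four numeric columns with missing values do not match a simple division by the 181 observations. A minimal sketch, assuming the percentage is taken over non-missing values only, reproduces the reported numbers. The `columns` mapping is an illustrative structure filled with counts copied from the variable overviews.

```python
n_observations = 181

# (distinct, missing) pairs copied from the variable overviews.
columns = {
    "종이책_연간발간횟수(회)": (6, 71),
    "종이책_연간제작비총액(원)": (92, 72),
    "전자책웹진_연간발간횟수(회)": (6, 71),
    "전자책웹진_연간제작비총액(원)": (11, 74),
}

distinct_pct = {}
for name, (distinct, missing) in columns.items():
    non_missing = n_observations - missing
    # Assumption: Distinct (%) is computed over non-missing values only.
    distinct_pct[name] = round(100 * distinct / non_missing, 1)
    print(f"{name}: {distinct}/{non_missing} = {distinct_pct[name]}%")
```

For comparison, dividing 6 distinct values by all 181 rows would give 3.3%, not the 5.5% shown for the two 연간발간횟수 columns.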
Variable | 문학단체명 | 사업연도 | 종이책_주기 | 종이책_연간발간횟수(회) | 종이책_연간제작비총액(원) | 전자책웹진_주기 | 전자책웹진_연간발간횟수(회) | 전자책웹진_연간제작비총액(원) |
---|---|---|---|---|---|---|---|---|
문학단체명 | 1.000 | 0.000 | 0.852 | 0.620 | 0.701 | 0.000 | 0.187 | 0.545 |
사업연도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.242 | 0.063 | 0.000 | 0.000 |
종이책_주기 | 0.852 | 0.000 | 1.000 | 0.999 | 0.611 | 0.230 | 0.250 | 0.000 |
종이책_연간발간횟수(회) | 0.620 | 0.000 | 0.999 | 1.000 | 0.625 | 0.198 | 0.701 | 0.000 |
종이책_연간제작비총액(원) | 0.701 | 0.242 | 0.611 | 0.625 | 1.000 | 0.355 | 0.352 | 0.368 |
전자책웹진_주기 | 0.000 | 0.063 | 0.230 | 0.198 | 0.355 | 1.000 | 1.000 | 0.921 |
전자책웹진_연간발간횟수(회) | 0.187 | 0.000 | 0.250 | 0.701 | 0.352 | 1.000 | 1.000 | 0.698 |
전자책웹진_연간제작비총액(원) | 0.545 | 0.000 | 0.000 | 0.000 | 0.368 | 0.921 | 0.698 | 1.000 |
Variable | 종이책_주기 | 전자책웹진_주기 |
---|---|---|
종이책_주기 | 1.000 | 0.187 |
전자책웹진_주기 | 0.187 | 1.000 |
Variable | 사업연도 | 종이책_연간발간횟수(회) | 종이책_연간제작비총액(원) | 전자책웹진_연간발간횟수(회) | 전자책웹진_연간제작비총액(원) | 종이책_주기 | 전자책웹진_주기 |
---|---|---|---|---|---|---|---|
사업연도 | 1.000 | 0.011 | 0.033 | -0.039 | -0.096 | 0.000 | 0.043 |
종이책_연간발간횟수(회) | 0.011 | 1.000 | 0.697 | -0.036 | -0.075 | 0.954 | 0.114 |
종이책_연간제작비총액(원) | 0.033 | 0.697 | 1.000 | 0.195 | 0.197 | 0.435 | 0.210 |
전자책웹진_연간발간횟수(회) | -0.039 | -0.036 | 0.195 | 1.000 | 0.998 | 0.160 | 0.995 |
전자책웹진_연간제작비총액(원) | -0.096 | -0.075 | 0.197 | 0.998 | 1.000 | 0.000 | 0.628 |
종이책_주기 | 0.000 | 0.954 | 0.435 | 0.160 | 0.000 | 1.000 | 0.187 |
전자책웹진_주기 | 0.043 | 0.114 | 0.210 | 0.995 | 0.628 | 0.187 | 1.000 |
First rows
Index | 문학단체명 | 사업연도 | 종이책_주기 | 종이책_연간발간횟수(회) | 종이책_연간제작비총액(원) | 전자책웹진_주기 | 전자책웹진_연간발간횟수(회) | 전자책웹진_연간제작비총액(원) |
---|---|---|---|---|---|---|---|---|
0 | *제**부 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1 | *국**회 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | *1**학 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | *학**네 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | *학**상 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | *음**사 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | *천**학 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | *행**사 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8 | *년**작 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | *간**선 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Last rows
Index | 문학단체명 | 사업연도 | 종이책_주기 | 종이책_연간발간횟수(회) | 종이책_연간제작비총액(원) | 전자책웹진_주기 | 전자책웹진_연간발간횟수(회) | 전자책웹진_연간제작비총액(원) |
---|---|---|---|---|---|---|---|---|
171 | *서**망 | 2019 | 계간 | 4 | 44000000 | 미분류 | 0 | 0 |
172 | *와**시 | 2019 | 계간 | 4 | 19744000 | 미분류 | 0 | 0 |
173 | *국**회 | 2019 | 계간 | 4 | 26000000 | 미분류 | 0 | 0 |
174 | *시**아 | 2019 | 계간 | 4 | 68800000 | 미분류 | 0 | 0 |
175 | *국**회 | 2019 | 월간 | 8 | 66400000 | 미분류 | 0 | 0 |
176 | *행**사 | 2019 | 격월간 | 6 | 127920000 | 미분류 | 0 | 0 |
177 | *국**연 | 2019 | 월간 | 12 | 96000000 | 미분류 | 0 | 0 |
178 | *대**학 | 2019 | 월간 | 12 | 188400000 | 미분류 | 0 | 0 |
179 | *국**회 | 2019 | 격월간 | 6 | 34000000 | 격월간 | 6 | <NA> |
180 | *천**학 | 2019 | 계간 | 4 | 28000000 | 미분류 | 0 | 0 |
Most frequently occurring
Index | 문학단체명 | 사업연도 | 종이책_주기 | 종이책_연간발간횟수(회) | 종이책_연간제작비총액(원) | 전자책웹진_주기 | 전자책웹진_연간발간횟수(회) | 전자책웹진_연간제작비총액(원) | # duplicates |
---|---|---|---|---|---|---|---|---|---|
0 | *국**회 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 10 |
2 | *국**회 | 2016 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 5 |
1 | *국**회 | 2015 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |
3 | *대**학 | 2014 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |
4 | *대**학 | 2015 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |