Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 34 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.7 KiB |
Average record size in memory | 80.9 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Categorical | 3 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 성균관대학교 산학협력단 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=663752c0-2fb1-11ea-94b6-73a02796bba4 |
연월일 has constant value "" | Constant |
일간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 and 1 other fields | High correlation |
긍정언급량 is highly overall correlated with 부정언급량 and 4 other fields | High correlation |
부정언급량 is highly overall correlated with 긍정언급량 and 5 other fields | High correlation |
중립언급량 is highly overall correlated with 긍정언급량 and 5 other fields | High correlation |
총언급량 is highly overall correlated with 긍정언급량 and 5 other fields | High correlation |
환경플랫폼 하위 도메인명 is highly overall correlated with 일간언급량연번 and 5 other fields | High correlation |
도메인 하위 카테고리명 is highly overall correlated with 긍정언급량 and 4 other fields | High correlation |
SNS 채널명 is highly overall correlated with 일간언급량연번 and 3 other fields | High correlation |
일간언급량연번 has unique values | Unique |
긍정언급량 has 3 (8.8%) zeros | Zeros |
부정언급량 has 1 (2.9%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 13:25:40.298929 |
---|---|
Analysis finished | 2023-12-10 13:25:44.889776 |
Duration | 4.59 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
일간언급량연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 34 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.5 |
Minimum | 1 |
---|---|
Maximum | 34 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.65 |
Q1 | 9.25 |
median | 17.5 |
Q3 | 25.75 |
95-th percentile | 32.35 |
Maximum | 34 |
Range | 33 |
Interquartile range (IQR) | 16.5 |
Descriptive statistics
Standard deviation | 9.9582462 |
---|---|
Coefficient of variation (CV) | 0.56904264 |
Kurtosis | -1.2 |
Mean | 17.5 |
Median Absolute Deviation (MAD) | 8.5 |
Skewness | 0 |
Sum | 595 |
Variance | 99.166667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 2.9% |
27 | 1 | 2.9% |
21 | 1 | 2.9% |
22 | 1 | 2.9% |
23 | 1 | 2.9% |
24 | 1 | 2.9% |
25 | 1 | 2.9% |
26 | 1 | 2.9% |
28 | 1 | 2.9% |
19 | 1 | 2.9% |
Other values (24) | 24 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
34 | 1 | |
33 | 1 | |
32 | 1 | |
31 | 1 | |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 |
연월일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 404.0 B |
Minimum | 2021-01-01 00:00:00 |
---|---|
Maximum | 2021-01-01 00:00:00 |
환경플랫폼 하위 도메인명
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 8.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 404.0 B |
자연환경 | |
---|---|
물환경 | |
생활환경 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.6470588 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 물환경 |
---|---|
2nd row | 물환경 |
3rd row | 물환경 |
4th row | 물환경 |
5th row | 물환경 |
Common Values
Value | Count | Frequency (%) |
자연환경 | 14 | |
물환경 | 12 | |
생활환경 | 8 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
자연환경 | 14 | |
물환경 | 12 | |
생활환경 | 8 |
도메인 하위 카테고리명
Categorical
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 44.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 404.0 B |
폐기물 | |
---|---|
화학물질 | |
생태계 | |
지질 | |
물재난 | 2 |
Other values (10) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 2.8235294 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 물재난 |
---|---|
2nd row | 상수도 |
3rd row | 지하수 |
4th row | 하수도 |
5th row | 하천 |
Common Values
Value | Count | Frequency (%) |
폐기물 | 3 | 8.8% |
화학물질 | 3 | 8.8% |
생태계 | 3 | 8.8% |
지질 | 3 | 8.8% |
물재난 | 2 | 5.9% |
상수도 | 2 | 5.9% |
지하수 | 2 | 5.9% |
하수도 | 2 | 5.9% |
하천 | 2 | 5.9% |
호소 | 2 | 5.9% |
Other values (5) | 10 |
Length
Value | Count | Frequency (%) |
폐기물 | 3 | 8.8% |
화학물질 | 3 | 8.8% |
생태계 | 3 | 8.8% |
지질 | 3 | 8.8% |
물재난 | 2 | 5.9% |
상수도 | 2 | 5.9% |
지하수 | 2 | 5.9% |
하수도 | 2 | 5.9% |
하천 | 2 | 5.9% |
호소 | 2 | 5.9% |
Other values (5) | 10 |
SNS 채널명
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 8.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 404.0 B |
All | |
---|---|
blog | |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 3.9117647 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | All |
---|---|
2nd row | All |
3rd row | All |
4th row | All |
5th row | All |
Common Values
Value | Count | Frequency (%) |
All | 15 | |
blog | 15 | |
4 | 11.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
all | 15 | |
blog | 15 | |
4 | 11.8% |
긍정언급량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 18 |
---|---|
Distinct (%) | 52.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 68.411765 |
Minimum | 0 |
---|---|
Maximum | 170 |
Zeros | 3 |
Zeros (%) | 8.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 50 |
median | 65 |
Q3 | 85 |
95-th percentile | 135.55 |
Maximum | 170 |
Range | 170 |
Interquartile range (IQR) | 35 |
Descriptive statistics
Standard deviation | 42.643069 |
---|---|
Coefficient of variation (CV) | 0.62332948 |
Kurtosis | 0.41788385 |
Mean | 68.411765 |
Median Absolute Deviation (MAD) | 20 |
Skewness | 0.43478972 |
Sum | 2326 |
Variance | 1818.4314 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 3 | 8.8% |
60 | 2 | 5.9% |
170 | 2 | 5.9% |
117 | 2 | 5.9% |
65 | 2 | 5.9% |
79 | 2 | 5.9% |
106 | 2 | 5.9% |
50 | 2 | 5.9% |
114 | 2 | 5.9% |
78 | 2 | 5.9% |
Other values (8) | 13 |
Value | Count | Frequency (%) |
0 | 3 | |
2 | 1 | 2.9% |
17 | 2 | |
41 | 2 | |
50 | 2 | |
54 | 1 | 2.9% |
56 | 1 | 2.9% |
57 | 2 | |
60 | 2 | |
65 | 2 |
Value | Count | Frequency (%) |
170 | 2 | |
117 | 2 | |
114 | 2 | |
106 | 2 | |
85 | 2 | |
79 | 2 | |
78 | 2 | |
68 | 2 | |
65 | 2 | |
60 | 2 |
부정언급량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 18 |
---|---|
Distinct (%) | 52.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 32.411765 |
Minimum | 0 |
---|---|
Maximum | 82 |
Zeros | 1 |
Zeros (%) | 2.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 13 |
median | 25.5 |
Q3 | 55 |
95-th percentile | 73.85 |
Maximum | 82 |
Range | 82 |
Interquartile range (IQR) | 42 |
Descriptive statistics
Standard deviation | 24.556821 |
---|---|
Coefficient of variation (CV) | 0.75765145 |
Kurtosis | -0.86999948 |
Mean | 32.411765 |
Median Absolute Deviation (MAD) | 14.5 |
Skewness | 0.56660183 |
Sum | 1102 |
Variance | 603.03743 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 4 | |
61 | 4 | |
1 | 3 | 8.8% |
13 | 2 | 5.9% |
55 | 2 | 5.9% |
70 | 2 | 5.9% |
39 | 2 | 5.9% |
30 | 2 | 5.9% |
24 | 2 | 5.9% |
15 | 2 | 5.9% |
Other values (8) | 9 |
Value | Count | Frequency (%) |
0 | 1 | 2.9% |
1 | 3 | |
11 | 4 | |
13 | 2 | |
15 | 2 | |
18 | 2 | |
24 | 2 | |
25 | 1 | 2.9% |
26 | 1 | 2.9% |
30 | 2 |
Value | Count | Frequency (%) |
82 | 1 | 2.9% |
81 | 1 | 2.9% |
70 | 2 | |
61 | 4 | |
55 | 2 | |
39 | 2 | |
35 | 1 | 2.9% |
34 | 1 | 2.9% |
30 | 2 | |
26 | 1 | 2.9% |
중립언급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 23 |
---|---|
Distinct (%) | 67.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2625.5294 |
Minimum | 8 |
---|---|
Maximum | 5659 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 8 |
---|---|
5-th percentile | 19.65 |
Q1 | 1790 |
median | 2749 |
Q3 | 3618 |
95-th percentile | 4906.8 |
Maximum | 5659 |
Range | 5651 |
Interquartile range (IQR) | 1828 |
Descriptive statistics
Standard deviation | 1538.3109 |
---|---|
Coefficient of variation (CV) | 0.58590504 |
Kurtosis | -0.53085278 |
Mean | 2625.5294 |
Median Absolute Deviation (MAD) | 917 |
Skewness | -0.025348967 |
Sum | 89268 |
Variance | 2366400.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1790 | 2 | 5.9% |
1832 | 2 | 5.9% |
1349 | 2 | 5.9% |
2357 | 2 | 5.9% |
876 | 2 | 5.9% |
2088 | 2 | 5.9% |
3657 | 2 | 5.9% |
4333 | 2 | 5.9% |
3618 | 2 | 5.9% |
4512 | 2 | 5.9% |
Other values (13) | 14 |
Value | Count | Frequency (%) |
8 | 1 | |
19 | 1 | |
20 | 1 | |
40 | 1 | |
876 | 2 | |
1349 | 2 | |
1790 | 2 | |
1832 | 2 | |
2088 | 2 | |
2357 | 2 |
Value | Count | Frequency (%) |
5659 | 1 | |
5640 | 1 | |
4512 | 2 | |
4333 | 2 | |
3657 | 2 | |
3618 | 2 | |
3527 | 1 | |
3507 | 1 | |
3393 | 1 | |
3353 | 1 |
총언급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 23 |
---|---|
Distinct (%) | 67.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2726.3529 |
Minimum | 10 |
---|---|
Maximum | 5911 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 20.65 |
Q1 | 1863 |
median | 2853 |
Q3 | 3793 |
95-th percentile | 5099.3 |
Maximum | 5911 |
Range | 5901 |
Interquartile range (IQR) | 1930 |
Descriptive statistics
Standard deviation | 1601.798 |
---|---|
Coefficient of variation (CV) | 0.58752409 |
Kurtosis | -0.51076232 |
Mean | 2726.3529 |
Median Absolute Deviation (MAD) | 950 |
Skewness | -0.0077123172 |
Sum | 92696 |
Variance | 2565757 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1863 | 2 | 5.9% |
1912 | 2 | 5.9% |
1401 | 2 | 5.9% |
2432 | 2 | 5.9% |
904 | 2 | 5.9% |
2171 | 2 | 5.9% |
3803 | 2 | 5.9% |
4520 | 2 | 5.9% |
3793 | 2 | 5.9% |
4673 | 2 | 5.9% |
Other values (13) | 14 |
Value | Count | Frequency (%) |
10 | 1 | |
20 | 1 | |
21 | 1 | |
41 | 1 | |
904 | 2 | |
1401 | 2 | |
1863 | 2 | |
1912 | 2 | |
2171 | 2 | |
2432 | 2 |
Value | Count | Frequency (%) |
5911 | 1 | |
5891 | 1 | |
4673 | 2 | |
4520 | 2 | |
3803 | 2 | |
3793 | 2 | |
3640 | 1 | |
3619 | 1 | |
3498 | 1 | |
3457 | 1 |
일간언급량연번 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | |
---|---|---|---|---|---|---|---|---|
일간언급량연번 | 1.000 | 0.814 | 0.000 | 0.965 | 0.636 | 0.562 | 0.655 | 0.655 |
환경플랫폼 하위 도메인명 | 0.814 | 1.000 | 1.000 | 0.000 | 0.761 | 0.903 | 0.964 | 0.964 |
도메인 하위 카테고리명 | 0.000 | 1.000 | 1.000 | 0.000 | 0.956 | 0.961 | 0.963 | 0.963 |
SNS 채널명 | 0.965 | 0.000 | 0.000 | 1.000 | 0.717 | 0.867 | 0.867 | 0.867 |
긍정언급량 | 0.636 | 0.761 | 0.956 | 0.717 | 1.000 | 0.871 | 0.922 | 0.922 |
부정언급량 | 0.562 | 0.903 | 0.961 | 0.867 | 0.871 | 1.000 | 0.966 | 0.966 |
중립언급량 | 0.655 | 0.964 | 0.963 | 0.867 | 0.922 | 0.966 | 1.000 | 1.000 |
총언급량 | 0.655 | 0.964 | 0.963 | 0.867 | 0.922 | 0.966 | 1.000 | 1.000 |
SNS 채널명 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | |
---|---|---|---|
SNS 채널명 | 1.000 | 0.000 | 0.000 |
환경플랫폼 하위 도메인명 | 0.000 | 1.000 | 0.783 |
도메인 하위 카테고리명 | 0.000 | 0.783 | 1.000 |
일간언급량연번 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | |
---|---|---|---|---|---|---|---|---|
일간언급량연번 | 1.000 | -0.115 | -0.113 | -0.095 | -0.095 | 0.623 | 0.000 | 0.853 |
긍정언급량 | -0.115 | 1.000 | 0.905 | 0.939 | 0.939 | 0.606 | 0.654 | 0.412 |
부정언급량 | -0.113 | 0.905 | 1.000 | 0.940 | 0.940 | 0.573 | 0.716 | 0.524 |
중립언급량 | -0.095 | 0.939 | 0.940 | 1.000 | 1.000 | 0.693 | 0.722 | 0.524 |
총언급량 | -0.095 | 0.939 | 0.940 | 1.000 | 1.000 | 0.693 | 0.722 | 0.524 |
환경플랫폼 하위 도메인명 | 0.623 | 0.606 | 0.573 | 0.693 | 0.693 | 1.000 | 0.783 | 0.000 |
도메인 하위 카테고리명 | 0.000 | 0.654 | 0.716 | 0.722 | 0.722 | 0.783 | 1.000 | 0.000 |
SNS 채널명 | 0.853 | 0.412 | 0.524 | 0.524 | 0.524 | 0.000 | 0.000 | 1.000 |
일간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2021-01-01 | 물환경 | 물재난 | All | 60 | 13 | 1790 | 1863 |
1 | 2 | 2021-01-01 | 물환경 | 상수도 | All | 50 | 30 | 1832 | 1912 |
2 | 3 | 2021-01-01 | 물환경 | 지하수 | All | 41 | 11 | 1349 | 1401 |
3 | 4 | 2021-01-01 | 물환경 | 하수도 | All | 57 | 18 | 2357 | 2432 |
4 | 5 | 2021-01-01 | 물환경 | 하천 | All | 17 | 11 | 876 | 904 |
5 | 6 | 2021-01-01 | 물환경 | 호소 | All | 68 | 15 | 2088 | 2171 |
6 | 7 | 2021-01-01 | 생활환경 | 대기 | All | 85 | 61 | 3657 | 3803 |
7 | 8 | 2021-01-01 | 생활환경 | 폐기물 | All | 78 | 35 | 3527 | 3640 |
8 | 9 | 2021-01-01 | 생활환경 | 화학물질 | All | 170 | 82 | 5659 | 5911 |
9 | 10 | 2021-01-01 | 자연환경 | 기상변화 | All | 114 | 61 | 3618 | 3793 |
일간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | |
---|---|---|---|---|---|---|---|---|---|
24 | 25 | 2021-01-01 | 자연환경 | 기상변화 | blog | 114 | 61 | 3618 | 3793 |
25 | 26 | 2021-01-01 | 자연환경 | 기후변화 | blog | 106 | 55 | 4512 | 4673 |
26 | 27 | 2021-01-01 | 자연환경 | 생태계 | blog | 54 | 24 | 2886 | 2964 |
27 | 28 | 2021-01-01 | 자연환경 | 지질 | blog | 79 | 25 | 3353 | 3457 |
28 | 29 | 2021-01-01 | 자연환경 | 지형 | blog | 65 | 39 | 2749 | 2853 |
29 | 30 | 2021-01-01 | 자연환경 | 토양 | blog | 117 | 70 | 4333 | 4520 |
30 | 31 | 2021-01-01 | 생활환경 | 폐기물 | 0 | 1 | 20 | 21 | |
31 | 32 | 2021-01-01 | 생활환경 | 화학물질 | 0 | 1 | 19 | 20 | |
32 | 33 | 2021-01-01 | 자연환경 | 생태계 | 2 | 0 | 8 | 10 | |
33 | 34 | 2021-01-01 | 자연환경 | 지질 | 0 | 1 | 40 | 41 |