Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.4 KiB |
Average record size in memory | 81.4 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Categorical | 2 |
Text | 1 |
Dataset
Description | Sample data |
---|---|
Author | 성균관대학교 산학협력단 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4 |
연월일 has constant value "2020-10-05" | Constant |
주간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 | High correlation |
긍정언급량 is highly overall correlated with 부정언급량 and 2 other fields | High correlation |
부정언급량 is highly overall correlated with 긍정언급량 and 3 other fields | High correlation |
중립언급량 is highly overall correlated with 긍정언급량 and 2 other fields | High correlation |
총언급량 is highly overall correlated with 긍정언급량 and 2 other fields | High correlation |
환경플랫폼 하위 도메인명 is highly overall correlated with 주간언급량연번 and 1 other field | High correlation |
주간언급량연번 has unique values | Unique |
Reproduction
Analysis started | 2024-04-17 04:41:36.317411 |
---|---|
Analysis finished | 2024-04-17 04:41:38.313971 |
Duration | 2 seconds |
Software version | ydata-profiling v4.5.1 |
주간언급량연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
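The dispersion figures above can be checked by hand, because the column is a serial number with 30 unique values running from 1 to 30. The following is a minimal sketch using only the Python standard library. It assumes that the reported variance is the sample variance (n - 1 denominator) and that MAD is the median of absolute deviations from the median; both assumptions reproduce the reported numbers.

```python
import math
import statistics

# 주간언급량연번 has 30 unique values with minimum 1 and maximum 30,
# so the column can be rebuilt exactly as the integers 1..30.
values = list(range(1, 31))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median of the absolute deviations from the median.
mad = statistics.median(abs(v - median) for v in values)
# "Strictly increasing" means every value exceeds the one before it.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(f"mean={mean} variance={variance} std={std:.7f} cv={cv:.8f} mad={mad}")
```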
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
연월일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2020-10-05 00:00:00 |
---|---|
Maximum | 2020-10-05 00:00:00 |
환경플랫폼 하위 도메인명
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
물환경 | |
---|---|
자연환경 | |
생활환경 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.6 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 물환경 |
---|---|
2nd row | 물환경 |
3rd row | 물환경 |
4th row | 물환경 |
5th row | 물환경 |
Common Values
Value | Count | Frequency (%) |
물환경 | 12 | |
자연환경 | 12 | |
생활환경 | 6 |
도메인 하위 카테고리명
Text
Distinct | 15 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
물재난 | 2 | 6.7% |
상수도 | 2 | 6.7% |
지하수 | 2 | 6.7% |
하수도 | 2 | 6.7% |
하천 | 2 | 6.7% |
호소 | 2 | 6.7% |
대기 | 2 | 6.7% |
폐기물 | 2 | 6.7% |
화학물질 | 2 | 6.7% |
기상변화 | 2 | 6.7% |
Other values (5) | 10 |
Most occurring characters
Value | Count | Frequency (%) |
기 | 8 | 9.5% |
지 | 6 | 7.1% |
화 | 6 | 7.1% |
하 | 6 | 7.1% |
물 | 6 | 7.1% |
수 | 6 | 7.1% |
도 | 4 | 4.8% |
질 | 4 | 4.8% |
상 | 4 | 4.8% |
변 | 4 | 4.8% |
Other values (15) | 30 |
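The character counts above can be reproduced from the category names themselves. The sketch below assumes the 15 distinct names are the ten listed under the common values plus the five that appear only in the sample rows of this report (기후변화, 생태계, 지질, 지형, 토양), each occurring twice.

```python
from collections import Counter
import unicodedata

# The 15 distinct category names found in this report.
categories = [
    "물재난", "상수도", "지하수", "하수도", "하천", "호소",
    "대기", "폐기물", "화학물질", "기상변화",
    "기후변화", "생태계", "지질", "지형", "토양",
]

# Each category occurs twice in the 30-row dataset.
text = "".join(categories) * 2
char_counts = Counter(text)
total = sum(char_counts.values())

# Unicode general category of every character; "Lo" means "Other Letter".
general_categories = {unicodedata.category(ch) for ch in char_counts}

# Print the three most frequent characters with their share of all characters.
for ch, count in char_counts.most_common(3):
    print(ch, count, f"{100 * count / total:.1f}%")
```

Under these assumptions the total comes to 84 characters, all in the "Other Letter" category, which agrees with the category, script and block summaries.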
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 84 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 84 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 84 |
SNS 채널명
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
All | |
---|---|
blog |
Length
Max length | 4 |
---|---|
Median length | 3.5 |
Mean length | 3.5 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | All |
---|---|
2nd row | blog |
3rd row | All |
4th row | blog |
5th row | All |
Common Values
Value | Count | Frequency (%) |
All | 15 | |
blog | 15 |
긍정언급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1094.7333 |
Minimum | 167 |
---|---|
Maximum | 2598 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 167 |
---|---|
5-th percentile | 171.95 |
Q1 | 323.5 |
median | 825 |
Q3 | 1895.25 |
95-th percentile | 2570.1 |
Maximum | 2598 |
Range | 2431 |
Interquartile range (IQR) | 1571.75 |
Descriptive statistics
Standard deviation | 873.42019 |
---|---|
Coefficient of variation (CV) | 0.7978383 |
Kurtosis | -1.0010992 |
Mean | 1094.7333 |
Median Absolute Deviation (MAD) | 567 |
Skewness | 0.71394811 |
Sum | 32842 |
Variance | 762862.82 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
718 | 2 | 6.7% |
302 | 2 | 6.7% |
178 | 2 | 6.7% |
230 | 2 | 6.7% |
2598 | 2 | 6.7% |
828 | 2 | 6.7% |
825 | 2 | 6.7% |
2536 | 2 | 6.7% |
812 | 2 | 6.7% |
970 | 2 | 6.7% |
Other values (5) | 10 |
Value | Count | Frequency (%) |
167 | 2 | |
178 | 2 | |
230 | 2 | |
302 | 2 | |
388 | 2 | |
718 | 2 | |
812 | 2 | |
825 | 2 | |
828 | 2 | |
970 | 2 |
Value | Count | Frequency (%) |
2598 | 2 | |
2536 | 2 | |
2414 | 2 | |
2063 | 2 | |
1392 | 2 | |
970 | 2 | |
828 | 2 | |
825 | 2 | |
812 | 2 | |
718 | 2 |
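The quantile statistics for 긍정언급량 can be recomputed from the two extreme-value tables, which together list all 15 distinct values, each with a count of 2. The sketch below assumes linear interpolation between the two nearest ranks; that is an assumption about the method, although it reproduces every quantile reported here. The helper name `quantile` is illustrative only.

```python
import math

# The 15 distinct values of 긍정언급량 from the extreme-value tables;
# each occurs twice, which gives the 30 observations.
distinct = [167, 178, 230, 302, 388, 718, 812, 825, 828, 970,
            1392, 2063, 2414, 2536, 2598]
values = sorted(distinct * 2)


def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    pos = (len(sorted_values) - 1) * q
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


q05, q1, q2, q3, q95 = (quantile(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95))
iqr = q3 - q1
print(sum(values), q05, q1, q2, q3, q95, iqr)
```

For example, Q1 sits at position 29 × 0.25 = 7.25 in the sorted list, a quarter of the way from 302 to 388, which gives 323.5.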
부정언급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 523.13333 |
Minimum | 62 |
---|---|
Maximum | 1317 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 62 |
---|---|
5-th percentile | 77.3 |
Q1 | 162.75 |
median | 498 |
Q3 | 850.5 |
95-th percentile | 1190.55 |
Maximum | 1317 |
Range | 1255 |
Interquartile range (IQR) | 687.75 |
Descriptive statistics
Standard deviation | 390.02049 |
---|---|
Coefficient of variation (CV) | 0.745547 |
Kurtosis | -0.90946045 |
Mean | 523.13333 |
Median Absolute Deviation (MAD) | 344 |
Skewness | 0.54887541 |
Sum | 15694 |
Variance | 152115.98 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
335 | 2 | 6.7% |
189 | 2 | 6.7% |
62 | 2 | 6.7% |
154 | 2 | 6.7% |
960 | 2 | 6.7% |
231 | 2 | 6.7% |
508 | 2 | 6.7% |
1317 | 2 | 6.7% |
498 | 2 | 6.7% |
616 | 2 | 6.7% |
Other values (5) | 10 |
Value | Count | Frequency (%) |
62 | 2 | |
96 | 2 | |
151 | 2 | |
154 | 2 | |
189 | 2 | |
231 | 2 | |
335 | 2 | |
498 | 2 | |
508 | 2 | |
616 | 2 |
Value | Count | Frequency (%) |
1317 | 2 | |
1036 | 2 | |
960 | 2 | |
854 | 2 | |
840 | 2 | |
616 | 2 | |
508 | 2 | |
498 | 2 | |
335 | 2 | |
231 | 2 |
중립언급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 45294 |
Minimum | 9205 |
---|---|
Maximum | 113384 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 9205 |
---|---|
5-th percentile | 10061.8 |
Q1 | 14814.5 |
median | 29522 |
Q3 | 80401 |
95-th percentile | 107806.7 |
Maximum | 113384 |
Range | 104179 |
Interquartile range (IQR) | 65586.5 |
Descriptive statistics
Standard deviation | 36297.693 |
---|---|
Coefficient of variation (CV) | 0.80137972 |
Kurtosis | -0.93331191 |
Mean | 45294 |
Median Absolute Deviation (MAD) | 18234 |
Skewness | 0.82227257 |
Sum | 1358820 |
Variance | 1.3175225 × 10^9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
29522 | 2 | 6.7% |
13155 | 2 | 6.7% |
9205 | 2 | 6.7% |
11109 | 2 | 6.7% |
113384 | 2 | 6.7% |
29132 | 2 | 6.7% |
30767 | 2 | 6.7% |
99689 | 2 | 6.7% |
26880 | 2 | 6.7% |
37308 | 2 | 6.7% |
Other values (5) | 10 |
Value | Count | Frequency (%) |
9205 | 2 | |
11109 | 2 | |
11288 | 2 | |
13155 | 2 | |
19793 | 2 | |
26880 | 2 | |
29132 | 2 | |
29522 | 2 | |
30767 | 2 | |
37308 | 2 |
Value | Count | Frequency (%) |
113384 | 2 | |
100990 | 2 | |
99689 | 2 | |
87208 | 2 | |
59980 | 2 | |
37308 | 2 | |
30767 | 2 | |
29522 | 2 | |
29132 | 2 | |
26880 | 2 |
총언급량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 46911.867 |
Minimum | 9445 |
---|---|
Maximum | 116942 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 9445 |
---|---|
5-th percentile | 10366.6 |
Q1 | 15317.5 |
median | 30575 |
Q3 | 83146.75 |
95-th percentile | 111316.1 |
Maximum | 116942 |
Range | 107497 |
Interquartile range (IQR) | 67829.25 |
Descriptive statistics
Standard deviation | 37531.204 |
---|---|
Coefficient of variation (CV) | 0.80003647 |
Kurtosis | -0.94339141 |
Mean | 46911.867 |
Median Absolute Deviation (MAD) | 19024 |
Skewness | 0.81559851 |
Sum | 1407356 |
Variance | 1.4085913 × 10^9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
30575 | 2 | 6.7% |
13646 | 2 | 6.7% |
9445 | 2 | 6.7% |
11493 | 2 | 6.7% |
116942 | 2 | 6.7% |
30191 | 2 | 6.7% |
32100 | 2 | 6.7% |
103542 | 2 | 6.7% |
28190 | 2 | 6.7% |
38894 | 2 | 6.7% |
Other values (5) | 10 |
Value | Count | Frequency (%) |
9445 | 2 | |
11493 | 2 | |
11551 | 2 | |
13646 | 2 | |
20332 | 2 | |
28190 | 2 | |
30191 | 2 | |
30575 | 2 | |
32100 | 2 | |
38894 | 2 |
Value | Count | Frequency (%) |
116942 | 2 | |
104440 | 2 | |
103542 | 2 | |
90125 | 2 | |
62212 | 2 | |
38894 | 2 | |
32100 | 2 | |
30575 | 2 | |
30191 | 2 | |
28190 | 2 |
Variable | 주간언급량연번 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 |
---|---|---|---|---|---|---|---|---|
주간언급량연번 | 1.000 | 1.000 | 0.969 | 0.000 | 0.750 | 0.811 | 0.675 | 0.675 |
환경플랫폼 하위 도메인명 | 1.000 | 1.000 | 1.000 | 0.000 | 0.811 | 0.862 | 0.513 | 0.513 |
도메인 하위 카테고리명 | 0.969 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 |
SNS 채널명 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
긍정언급량 | 0.750 | 0.811 | 1.000 | 0.000 | 1.000 | 0.921 | 0.927 | 0.927 |
부정언급량 | 0.811 | 0.862 | 1.000 | 0.000 | 0.921 | 1.000 | 0.814 | 0.814 |
중립언급량 | 0.675 | 0.513 | 1.000 | 0.000 | 0.927 | 0.814 | 1.000 | 1.000 |
총언급량 | 0.675 | 0.513 | 1.000 | 0.000 | 0.927 | 0.814 | 1.000 | 1.000 |
Variable | 환경플랫폼 하위 도메인명 | SNS 채널명 |
---|---|---|
환경플랫폼 하위 도메인명 | 1.000 | 0.000 |
SNS 채널명 | 0.000 | 1.000 |
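The 0.000 between the two categorical variables is what an association measure such as Cramér's V returns when every domain is split evenly across the two channels. The sketch below computes plain (uncorrected) Cramér's V. Treating this matrix as Cramér's V is an assumption, and so is the contingency table: the report gives only the marginal counts (12, 12, 6 and 15, 15), and the even split per domain is inferred from the alternating All/blog pattern in the sample rows.

```python
import math

# Assumed contingency table: rows are domains, columns are (All, blog).
# Only the marginal totals come from the report; the cell counts are inferred.
table = {
    "물환경": (6, 6),
    "자연환경": (6, 6),
    "생활환경": (3, 3),
}


def cramers_v(rows):
    """Plain (uncorrected) Cramér's V for an r x c table of counts."""
    n = sum(sum(r) for r in rows)
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(col) for col in zip(*rows)]
    chi2 = 0.0
    for i, r in enumerate(rows):
        for j, observed in enumerate(r):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(rows), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


v = cramers_v(list(table.values()))
print(f"{v:.3f}")
```

For contrast, a table in which each row uses only one column, such as `[(10, 0), (0, 10)]`, yields 1.0 with the same function.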
Variable | 주간언급량연번 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | 환경플랫폼 하위 도메인명 | SNS 채널명 |
---|---|---|---|---|---|---|---|
주간언급량연번 | 1.000 | 0.210 | 0.221 | 0.271 | 0.271 | 0.861 | 0.000 |
긍정언급량 | 0.210 | 1.000 | 0.954 | 0.968 | 0.968 | 0.409 | 0.000 |
부정언급량 | 0.221 | 0.954 | 1.000 | 0.957 | 0.957 | 0.739 | 0.000 |
중립언급량 | 0.271 | 0.968 | 0.957 | 1.000 | 1.000 | 0.398 | 0.000 |
총언급량 | 0.271 | 0.968 | 0.957 | 1.000 | 1.000 | 0.398 | 0.000 |
환경플랫폼 하위 도메인명 | 0.861 | 0.409 | 0.739 | 0.398 | 0.398 | 1.000 | 0.000 |
SNS 채널명 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
Index | 주간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2020-10-05 | 물환경 | 물재난 | All | 718 | 335 | 29522 | 30575 |
1 | 2 | 2020-10-05 | 물환경 | 물재난 | blog | 718 | 335 | 29522 | 30575 |
2 | 3 | 2020-10-05 | 물환경 | 상수도 | All | 302 | 189 | 13155 | 13646 |
3 | 4 | 2020-10-05 | 물환경 | 상수도 | blog | 302 | 189 | 13155 | 13646 |
4 | 5 | 2020-10-05 | 물환경 | 지하수 | All | 178 | 62 | 9205 | 9445 |
5 | 6 | 2020-10-05 | 물환경 | 지하수 | blog | 178 | 62 | 9205 | 9445 |
6 | 7 | 2020-10-05 | 물환경 | 하수도 | All | 230 | 154 | 11109 | 11493 |
7 | 8 | 2020-10-05 | 물환경 | 하수도 | blog | 230 | 154 | 11109 | 11493 |
8 | 9 | 2020-10-05 | 물환경 | 하천 | All | 2598 | 960 | 113384 | 116942 |
9 | 10 | 2020-10-05 | 물환경 | 하천 | blog | 2598 | 960 | 113384 | 116942 |
Index | 주간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 |
---|---|---|---|---|---|---|---|---|---|
20 | 21 | 2020-10-05 | 자연환경 | 기후변화 | All | 1392 | 840 | 59980 | 62212 |
21 | 22 | 2020-10-05 | 자연환경 | 기후변화 | blog | 1392 | 840 | 59980 | 62212 |
22 | 23 | 2020-10-05 | 자연환경 | 생태계 | All | 2063 | 854 | 87208 | 90125 |
23 | 24 | 2020-10-05 | 자연환경 | 생태계 | blog | 2063 | 854 | 87208 | 90125 |
24 | 25 | 2020-10-05 | 자연환경 | 지질 | All | 388 | 151 | 19793 | 20332 |
25 | 26 | 2020-10-05 | 자연환경 | 지질 | blog | 388 | 151 | 19793 | 20332 |
26 | 27 | 2020-10-05 | 자연환경 | 지형 | All | 2414 | 1036 | 100990 | 104440 |
27 | 28 | 2020-10-05 | 자연환경 | 지형 | blog | 2414 | 1036 | 100990 | 104440 |
28 | 29 | 2020-10-05 | 자연환경 | 토양 | All | 167 | 96 | 11288 | 11551 |
29 | 30 | 2020-10-05 | 자연환경 | 토양 | blog | 167 | 96 | 11288 | 11551 |
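Two patterns in the sample rows help explain the correlation results: 총언급량 appears to be the sum of the three sentiment counts, and the All and blog rows of a category carry identical counts. The sketch below checks both on the first six sample rows only. The tuple layout and variable names are illustrative, and nothing here is verified for the rows this report does not display.

```python
# (category, channel, positive, negative, neutral, total) for the first
# six sample rows of the report.
rows = [
    ("물재난", "All", 718, 335, 29522, 30575),
    ("물재난", "blog", 718, 335, 29522, 30575),
    ("상수도", "All", 302, 189, 13155, 13646),
    ("상수도", "blog", 302, 189, 13155, 13646),
    ("지하수", "All", 178, 62, 9205, 9445),
    ("지하수", "blog", 178, 62, 9205, 9445),
]

# Does the total equal positive + negative + neutral on every row?
totals_match = all(
    pos + neg + neu == total for _, _, pos, neg, neu, total in rows
)

# Do the "All" and "blog" rows of each category carry identical counts?
by_category = {}
for category, channel, *counts in rows:
    by_category.setdefault(category, {})[channel] = tuple(counts)
channels_identical = all(c["All"] == c["blog"] for c in by_category.values())

print(totals_match, channels_identical)
```

If the same holds for all 30 rows, it would account for the zero correlation between SNS 채널명 and every count column.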