Overview

Dataset statistics

Number of variables9
Number of observations34
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory80.9 B

Variable types

Numeric5
DateTime1
Categorical3

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=663752c0-2fb1-11ea-94b6-73a02796bba4

Alerts

연월일 has constant value ""Constant
일간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 and 1 other fieldsHigh correlation
긍정언급량 is highly overall correlated with 부정언급량 and 4 other fieldsHigh correlation
부정언급량 is highly overall correlated with 긍정언급량 and 5 other fieldsHigh correlation
중립언급량 is highly overall correlated with 긍정언급량 and 5 other fieldsHigh correlation
총언급량 is highly overall correlated with 긍정언급량 and 5 other fieldsHigh correlation
환경플랫폼 하위 도메인명 is highly overall correlated with 일간언급량연번 and 5 other fieldsHigh correlation
도메인 하위 카테고리명 is highly overall correlated with 긍정언급량 and 4 other fieldsHigh correlation
SNS 채널명 is highly overall correlated with 일간언급량연번 and 3 other fieldsHigh correlation
일간언급량연번 has unique valuesUnique
긍정언급량 has 3 (8.8%) zerosZeros
부정언급량 has 1 (2.9%) zerosZeros

Reproduction

Analysis started2023-12-10 13:25:40.298929
Analysis finished2023-12-10 13:25:44.889776
Duration4.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일간언급량연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.5
Minimum1
Maximum34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T22:25:45.002001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.65
Q19.25
median17.5
Q325.75
95-th percentile32.35
Maximum34
Range33
Interquartile range (IQR)16.5

Descriptive statistics

Standard deviation9.9582462
Coefficient of variation (CV)0.56904264
Kurtosis-1.2
Mean17.5
Median Absolute Deviation (MAD)8.5
Skewness0
Sum595
Variance99.166667
MonotonicityStrictly increasing
2023-12-10T22:25:45.244900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
1 1
 
2.9%
27 1
 
2.9%
21 1
 
2.9%
22 1
 
2.9%
23 1
 
2.9%
24 1
 
2.9%
25 1
 
2.9%
26 1
 
2.9%
28 1
 
2.9%
19 1
 
2.9%
Other values (24) 24
70.6%
ValueCountFrequency (%)
1 1
2.9%
2 1
2.9%
3 1
2.9%
4 1
2.9%
5 1
2.9%
6 1
2.9%
7 1
2.9%
8 1
2.9%
9 1
2.9%
10 1
2.9%
ValueCountFrequency (%)
34 1
2.9%
33 1
2.9%
32 1
2.9%
31 1
2.9%
30 1
2.9%
29 1
2.9%
28 1
2.9%
27 1
2.9%
26 1
2.9%
25 1
2.9%

연월일
Date

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
Minimum2021-01-01 00:00:00
Maximum2021-01-01 00:00:00
2023-12-10T22:25:45.420956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:45.583993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

환경플랫폼 하위 도메인명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
자연환경
14 
물환경
12 
생활환경

Length

Max length4
Median length4
Mean length3.6470588
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물환경
2nd row물환경
3rd row물환경
4th row물환경
5th row물환경

Common Values

ValueCountFrequency (%)
자연환경 14
41.2%
물환경 12
35.3%
생활환경 8
23.5%

Length

2023-12-10T22:25:45.804809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:25:45.958456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자연환경 14
41.2%
물환경 12
35.3%
생활환경 8
23.5%

도메인 하위 카테고리명
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)44.1%
Missing0
Missing (%)0.0%
Memory size404.0 B
폐기물
화학물질
생태계
지질
물재난
 
2
Other values (10)
20 

Length

Max length4
Median length3
Mean length2.8235294
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물재난
2nd row상수도
3rd row지하수
4th row하수도
5th row하천

Common Values

ValueCountFrequency (%)
폐기물 3
 
8.8%
화학물질 3
 
8.8%
생태계 3
 
8.8%
지질 3
 
8.8%
물재난 2
 
5.9%
상수도 2
 
5.9%
지하수 2
 
5.9%
하수도 2
 
5.9%
하천 2
 
5.9%
호소 2
 
5.9%
Other values (5) 10
29.4%

Length

2023-12-10T22:25:46.151076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
폐기물 3
 
8.8%
화학물질 3
 
8.8%
생태계 3
 
8.8%
지질 3
 
8.8%
물재난 2
 
5.9%
상수도 2
 
5.9%
지하수 2
 
5.9%
하수도 2
 
5.9%
하천 2
 
5.9%
호소 2
 
5.9%
Other values (5) 10
29.4%

SNS 채널명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
All
15 
blog
15 
twitter

Length

Max length7
Median length4
Mean length3.9117647
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAll
2nd rowAll
3rd rowAll
4th rowAll
5th rowAll

Common Values

ValueCountFrequency (%)
All 15
44.1%
blog 15
44.1%
twitter 4
 
11.8%

Length

2023-12-10T22:25:46.397208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:25:46.631908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
all 15
44.1%
blog 15
44.1%
twitter 4
 
11.8%

긍정언급량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)52.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.411765
Minimum0
Maximum170
Zeros3
Zeros (%)8.8%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T22:25:46.851549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q150
median65
Q385
95-th percentile135.55
Maximum170
Range170
Interquartile range (IQR)35

Descriptive statistics

Standard deviation42.643069
Coefficient of variation (CV)0.62332948
Kurtosis0.41788385
Mean68.411765
Median Absolute Deviation (MAD)20
Skewness0.43478972
Sum2326
Variance1818.4314
MonotonicityNot monotonic
2023-12-10T22:25:47.033782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 3
 
8.8%
60 2
 
5.9%
170 2
 
5.9%
117 2
 
5.9%
65 2
 
5.9%
79 2
 
5.9%
106 2
 
5.9%
50 2
 
5.9%
114 2
 
5.9%
78 2
 
5.9%
Other values (8) 13
38.2%
ValueCountFrequency (%)
0 3
8.8%
2 1
 
2.9%
17 2
5.9%
41 2
5.9%
50 2
5.9%
54 1
 
2.9%
56 1
 
2.9%
57 2
5.9%
60 2
5.9%
65 2
5.9%
ValueCountFrequency (%)
170 2
5.9%
117 2
5.9%
114 2
5.9%
106 2
5.9%
85 2
5.9%
79 2
5.9%
78 2
5.9%
68 2
5.9%
65 2
5.9%
60 2
5.9%

부정언급량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)52.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.411765
Minimum0
Maximum82
Zeros1
Zeros (%)2.9%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T22:25:47.217360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q113
median25.5
Q355
95-th percentile73.85
Maximum82
Range82
Interquartile range (IQR)42

Descriptive statistics

Standard deviation24.556821
Coefficient of variation (CV)0.75765145
Kurtosis-0.86999948
Mean32.411765
Median Absolute Deviation (MAD)14.5
Skewness0.56660183
Sum1102
Variance603.03743
MonotonicityNot monotonic
2023-12-10T22:25:47.383923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
11 4
11.8%
61 4
11.8%
1 3
 
8.8%
13 2
 
5.9%
55 2
 
5.9%
70 2
 
5.9%
39 2
 
5.9%
30 2
 
5.9%
24 2
 
5.9%
15 2
 
5.9%
Other values (8) 9
26.5%
ValueCountFrequency (%)
0 1
 
2.9%
1 3
8.8%
11 4
11.8%
13 2
5.9%
15 2
5.9%
18 2
5.9%
24 2
5.9%
25 1
 
2.9%
26 1
 
2.9%
30 2
5.9%
ValueCountFrequency (%)
82 1
 
2.9%
81 1
 
2.9%
70 2
5.9%
61 4
11.8%
55 2
5.9%
39 2
5.9%
35 1
 
2.9%
34 1
 
2.9%
30 2
5.9%
26 1
 
2.9%

중립언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)67.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2625.5294
Minimum8
Maximum5659
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T22:25:47.596162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile19.65
Q11790
median2749
Q33618
95-th percentile4906.8
Maximum5659
Range5651
Interquartile range (IQR)1828

Descriptive statistics

Standard deviation1538.3109
Coefficient of variation (CV)0.58590504
Kurtosis-0.53085278
Mean2625.5294
Median Absolute Deviation (MAD)917
Skewness-0.025348967
Sum89268
Variance2366400.5
MonotonicityNot monotonic
2023-12-10T22:25:47.772342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
1790 2
 
5.9%
1832 2
 
5.9%
1349 2
 
5.9%
2357 2
 
5.9%
876 2
 
5.9%
2088 2
 
5.9%
3657 2
 
5.9%
4333 2
 
5.9%
3618 2
 
5.9%
4512 2
 
5.9%
Other values (13) 14
41.2%
ValueCountFrequency (%)
8 1
2.9%
19 1
2.9%
20 1
2.9%
40 1
2.9%
876 2
5.9%
1349 2
5.9%
1790 2
5.9%
1832 2
5.9%
2088 2
5.9%
2357 2
5.9%
ValueCountFrequency (%)
5659 1
2.9%
5640 1
2.9%
4512 2
5.9%
4333 2
5.9%
3657 2
5.9%
3618 2
5.9%
3527 1
2.9%
3507 1
2.9%
3393 1
2.9%
3353 1
2.9%

총언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)67.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2726.3529
Minimum10
Maximum5911
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T22:25:48.011292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile20.65
Q11863
median2853
Q33793
95-th percentile5099.3
Maximum5911
Range5901
Interquartile range (IQR)1930

Descriptive statistics

Standard deviation1601.798
Coefficient of variation (CV)0.58752409
Kurtosis-0.51076232
Mean2726.3529
Median Absolute Deviation (MAD)950
Skewness-0.0077123172
Sum92696
Variance2565757
MonotonicityNot monotonic
2023-12-10T22:25:48.402665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
1863 2
 
5.9%
1912 2
 
5.9%
1401 2
 
5.9%
2432 2
 
5.9%
904 2
 
5.9%
2171 2
 
5.9%
3803 2
 
5.9%
4520 2
 
5.9%
3793 2
 
5.9%
4673 2
 
5.9%
Other values (13) 14
41.2%
ValueCountFrequency (%)
10 1
2.9%
20 1
2.9%
21 1
2.9%
41 1
2.9%
904 2
5.9%
1401 2
5.9%
1863 2
5.9%
1912 2
5.9%
2171 2
5.9%
2432 2
5.9%
ValueCountFrequency (%)
5911 1
2.9%
5891 1
2.9%
4673 2
5.9%
4520 2
5.9%
3803 2
5.9%
3793 2
5.9%
3640 1
2.9%
3619 1
2.9%
3498 1
2.9%
3457 1
2.9%

Interactions

2023-12-10T22:25:43.867015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:41.088563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:41.863018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.543884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:43.153738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:44.027024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:41.237991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.023331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.663823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:43.300577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:44.159959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:41.385264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.169284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.798067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:43.445260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:44.283868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:41.542134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.287416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.904717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:43.602374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:44.409912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:41.724660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:42.418841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:43.025670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:25:43.729630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:25:48.719318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간언급량연번환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
일간언급량연번1.0000.8140.0000.9650.6360.5620.6550.655
환경플랫폼 하위 도메인명0.8141.0001.0000.0000.7610.9030.9640.964
도메인 하위 카테고리명0.0001.0001.0000.0000.9560.9610.9630.963
SNS 채널명0.9650.0000.0001.0000.7170.8670.8670.867
긍정언급량0.6360.7610.9560.7171.0000.8710.9220.922
부정언급량0.5620.9030.9610.8670.8711.0000.9660.966
중립언급량0.6550.9640.9630.8670.9220.9661.0001.000
총언급량0.6550.9640.9630.8670.9220.9661.0001.000
2023-12-10T22:25:49.072707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SNS 채널명환경플랫폼 하위 도메인명도메인 하위 카테고리명
SNS 채널명1.0000.0000.000
환경플랫폼 하위 도메인명0.0001.0000.783
도메인 하위 카테고리명0.0000.7831.000
2023-12-10T22:25:49.748156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간언급량연번긍정언급량부정언급량중립언급량총언급량환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명
일간언급량연번1.000-0.115-0.113-0.095-0.0950.6230.0000.853
긍정언급량-0.1151.0000.9050.9390.9390.6060.6540.412
부정언급량-0.1130.9051.0000.9400.9400.5730.7160.524
중립언급량-0.0950.9390.9401.0001.0000.6930.7220.524
총언급량-0.0950.9390.9401.0001.0000.6930.7220.524
환경플랫폼 하위 도메인명0.6230.6060.5730.6930.6931.0000.7830.000
도메인 하위 카테고리명0.0000.6540.7160.7220.7220.7831.0000.000
SNS 채널명0.8530.4120.5240.5240.5240.0000.0001.000

Missing values

2023-12-10T22:25:44.589052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:25:44.808233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
012021-01-01물환경물재난All601317901863
122021-01-01물환경상수도All503018321912
232021-01-01물환경지하수All411113491401
342021-01-01물환경하수도All571823572432
452021-01-01물환경하천All1711876904
562021-01-01물환경호소All681520882171
672021-01-01생활환경대기All856136573803
782021-01-01생활환경폐기물All783535273640
892021-01-01생활환경화학물질All1708256595911
9102021-01-01자연환경기상변화All1146136183793
일간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
24252021-01-01자연환경기상변화blog1146136183793
25262021-01-01자연환경기후변화blog1065545124673
26272021-01-01자연환경생태계blog542428862964
27282021-01-01자연환경지질blog792533533457
28292021-01-01자연환경지형blog653927492853
29302021-01-01자연환경토양blog1177043334520
30312021-01-01생활환경폐기물twitter012021
31322021-01-01생활환경화학물질twitter011920
32332021-01-01자연환경생태계twitter20810
33342021-01-01자연환경지질twitter014041