Overview

Dataset statistics

Number of variables9
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory79.9 B

Variable types

Numeric5
Categorical4

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4

Alerts

연월일 has constant value ""Constant
주간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 and 1 other fieldsHigh correlation
긍정언급량 is highly overall correlated with 부정언급량 and 2 other fieldsHigh correlation
부정언급량 is highly overall correlated with 긍정언급량 and 2 other fieldsHigh correlation
중립언급량 is highly overall correlated with 긍정언급량 and 2 other fieldsHigh correlation
총언급량 is highly overall correlated with 긍정언급량 and 2 other fieldsHigh correlation
환경플랫폼 하위 도메인명 is highly overall correlated with 주간언급량연번 and 1 other fieldsHigh correlation
도메인 하위 카테고리명 is highly overall correlated with 주간언급량연번 and 1 other fieldsHigh correlation
주간언급량연번 has unique valuesUnique
긍정언급량 has unique valuesUnique
중립언급량 has unique valuesUnique
총언급량 has unique valuesUnique

Reproduction

Analysis started2024-04-17 04:41:31.468056
Analysis finished2024-04-17 04:41:33.506609
Duration2.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

주간언급량연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2024-04-17T13:41:33.579254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2024-04-17T13:41:33.711296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

연월일
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
2020-01-06
45 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-01-06
2nd row2020-01-06
3rd row2020-01-06
4th row2020-01-06
5th row2020-01-06

Common Values

ValueCountFrequency (%)
2020-01-06 45
100.0%

Length

2024-04-17T13:41:33.827672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T13:41:33.901703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-01-06 45
100.0%

환경플랫폼 하위 도메인명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
물환경
18 
자연환경
18 
생활환경

Length

Max length4
Median length4
Mean length3.6
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물환경
2nd row물환경
3rd row물환경
4th row물환경
5th row물환경

Common Values

ValueCountFrequency (%)
물환경 18
40.0%
자연환경 18
40.0%
생활환경 9
20.0%

Length

2024-04-17T13:41:33.985152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T13:41:34.082822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물환경 18
40.0%
자연환경 18
40.0%
생활환경 9
20.0%

도메인 하위 카테고리명
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size492.0 B
물재난
 
3
상수도
 
3
지하수
 
3
하수도
 
3
하천
 
3
Other values (10)
30 

Length

Max length4
Median length3
Mean length2.8
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물재난
2nd row물재난
3rd row물재난
4th row상수도
5th row상수도

Common Values

ValueCountFrequency (%)
물재난 3
 
6.7%
상수도 3
 
6.7%
지하수 3
 
6.7%
하수도 3
 
6.7%
하천 3
 
6.7%
호소 3
 
6.7%
대기 3
 
6.7%
폐기물 3
 
6.7%
화학물질 3
 
6.7%
기상변화 3
 
6.7%
Other values (5) 15
33.3%

Length

2024-04-17T13:41:34.209137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
물재난 3
 
6.7%
상수도 3
 
6.7%
지하수 3
 
6.7%
하수도 3
 
6.7%
하천 3
 
6.7%
호소 3
 
6.7%
대기 3
 
6.7%
폐기물 3
 
6.7%
화학물질 3
 
6.7%
기상변화 3
 
6.7%
Other values (5) 15
33.3%

SNS 채널명
Categorical

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
All
15 
blog
15 
twitter
15 

Length

Max length7
Median length4
Mean length4.6666667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAll
2nd rowblog
3rd rowtwitter
4th rowAll
5th rowblog

Common Values

ValueCountFrequency (%)
All 15
33.3%
blog 15
33.3%
twitter 15
33.3%

Length

2024-04-17T13:41:34.330779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T13:41:34.421022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
all 15
33.3%
blog 15
33.3%
twitter 15
33.3%

긍정언급량
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean627.28889
Minimum21
Maximum2302
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2024-04-17T13:41:34.515584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum21
5-th percentile32.2
Q1113
median331
Q3837
95-th percentile2219.8
Maximum2302
Range2281
Interquartile range (IQR)724

Descriptive statistics

Standard deviation679.64321
Coefficient of variation (CV)1.0834613
Kurtosis0.71449977
Mean627.28889
Median Absolute Deviation (MAD)252
Skewness1.3246712
Sum28228
Variance461914.89
MonotonicityNot monotonic
2024-04-17T13:41:34.615557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1016 1
 
2.2%
79 1
 
2.2%
512 1
 
2.2%
58 1
 
2.2%
372 1
 
2.2%
210 1
 
2.2%
162 1
 
2.2%
2229 1
 
2.2%
1694 1
 
2.2%
535 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
21 1
2.2%
22 1
2.2%
32 1
2.2%
33 1
2.2%
58 1
2.2%
76 1
2.2%
79 1
2.2%
86 1
2.2%
106 1
2.2%
107 1
2.2%
ValueCountFrequency (%)
2302 1
2.2%
2275 1
2.2%
2229 1
2.2%
2183 1
2.2%
1694 1
2.2%
1512 1
2.2%
1501 1
2.2%
1379 1
2.2%
1167 1
2.2%
1135 1
2.2%

부정언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct40
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean407.06667
Minimum13
Maximum1957
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2024-04-17T13:41:34.713224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum13
5-th percentile19.4
Q166
median160
Q3563
95-th percentile1377.6
Maximum1957
Range1944
Interquartile range (IQR)497

Descriptive statistics

Standard deviation472.98613
Coefficient of variation (CV)1.1619378
Kurtosis1.7693962
Mean407.06667
Median Absolute Deviation (MAD)127
Skewness1.5101569
Sum18318
Variance223715.88
MonotonicityNot monotonic
2024-04-17T13:41:35.063920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
49 3
 
6.7%
111 2
 
4.4%
57 2
 
4.4%
13 2
 
4.4%
33 1
 
2.2%
233 1
 
2.2%
90 1
 
2.2%
143 1
 
2.2%
1391 1
 
2.2%
951 1
 
2.2%
Other values (30) 30
66.7%
ValueCountFrequency (%)
13 2
4.4%
16 1
 
2.2%
33 1
 
2.2%
36 1
 
2.2%
39 1
 
2.2%
49 3
6.7%
57 2
4.4%
66 1
 
2.2%
70 1
 
2.2%
82 1
 
2.2%
ValueCountFrequency (%)
1957 1
2.2%
1486 1
2.2%
1391 1
2.2%
1324 1
2.2%
1044 1
2.2%
962 1
2.2%
951 1
2.2%
913 1
2.2%
897 1
2.2%
761 1
2.2%

중립언급량
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19966.356
Minimum655
Maximum82860
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2024-04-17T13:41:35.175293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum655
5-th percentile1119.2
Q14201
median9245
Q327337
95-th percentile69569.2
Maximum82860
Range82205
Interquartile range (IQR)23136

Descriptive statistics

Standard deviation21733.885
Coefficient of variation (CV)1.0885254
Kurtosis1.3095038
Mean19966.356
Median Absolute Deviation (MAD)7399
Skewness1.4503174
Sum898486
Variance4.7236176 × 108
MonotonicityNot monotonic
2024-04-17T13:41:35.275393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
27337 1
 
2.2%
2524 1
 
2.2%
13595 1
 
2.2%
1645 1
 
2.2%
16307 1
 
2.2%
8370 1
 
2.2%
7937 1
 
2.2%
72479 1
 
2.2%
54375 1
 
2.2%
18104 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
655 1
2.2%
954 1
2.2%
1036 1
2.2%
1452 1
2.2%
1645 1
2.2%
1924 1
2.2%
2524 1
2.2%
2947 1
2.2%
3338 1
2.2%
3602 1
2.2%
ValueCountFrequency (%)
82860 1
2.2%
74424 1
2.2%
72479 1
2.2%
57930 1
2.2%
54375 1
2.2%
50513 1
2.2%
47874 1
2.2%
39155 1
2.2%
35269 1
2.2%
34020 1
2.2%

총언급량
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21000.711
Minimum689
Maximum87092
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2024-04-17T13:41:35.387976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum689
5-th percentile1163.6
Q14414
median9888
Q329075
95-th percentile73166.6
Maximum87092
Range86403
Interquartile range (IQR)24661

Descriptive statistics

Standard deviation22866.971
Coefficient of variation (CV)1.0888665
Kurtosis1.2911924
Mean21000.711
Median Absolute Deviation (MAD)7849
Skewness1.4457069
Sum945032
Variance5.2289838 × 108
MonotonicityNot monotonic
2024-04-17T13:41:35.490000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
29075 1
 
2.2%
2636 1
 
2.2%
14364 1
 
2.2%
1769 1
 
2.2%
16912 1
 
2.2%
8670 1
 
2.2%
8242 1
 
2.2%
76099 1
 
2.2%
57020 1
 
2.2%
19079 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
689 1
2.2%
1002 1
2.2%
1071 1
2.2%
1534 1
2.2%
1769 1
2.2%
2039 1
2.2%
2636 1
2.2%
3069 1
2.2%
3500 1
2.2%
3758 1
2.2%
ValueCountFrequency (%)
87092 1
2.2%
78212 1
2.2%
76099 1
2.2%
61437 1
2.2%
57020 1
2.2%
52938 1
2.2%
50272 1
2.2%
41284 1
2.2%
36928 1
2.2%
36160 1
2.2%

Interactions

2024-04-17T13:41:32.994639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:31.729932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.021141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.323617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.680725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:33.055933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:31.783856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.076121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.386768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.744396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:33.119843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:31.838941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.131485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.452418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.803048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:33.205674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:31.906417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.196784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.532864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.870869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:33.277121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:31.963546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.261636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.613208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:32.933414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T13:41:35.562113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주간언급량연번환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
주간언급량연번1.0001.0000.9840.0000.4850.3210.2850.432
환경플랫폼 하위 도메인명1.0001.0001.0000.0000.0000.1900.0000.000
도메인 하위 카테고리명0.9841.0001.0000.0000.5240.4320.6000.616
SNS 채널명0.0000.0000.0001.0000.6200.0000.2770.330
긍정언급량0.4850.0000.5240.6201.0000.9330.9550.895
부정언급량0.3210.1900.4320.0000.9331.0000.9740.943
중립언급량0.2850.0000.6000.2770.9550.9741.0001.000
총언급량0.4320.0000.6160.3300.8950.9431.0001.000
2024-04-17T13:41:35.652884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명
환경플랫폼 하위 도메인명1.0000.8450.000
도메인 하위 카테고리명0.8451.0000.000
SNS 채널명0.0000.0001.000
2024-04-17T13:41:35.727818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주간언급량연번긍정언급량부정언급량중립언급량총언급량환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명
주간언급량연번1.000-0.0380.012-0.022-0.0290.9130.8050.000
긍정언급량-0.0381.0000.9640.9900.9910.0000.2030.310
부정언급량0.0120.9641.0000.9740.9740.0440.1490.000
중립언급량-0.0220.9900.9741.0001.0000.0000.2690.122
총언급량-0.0290.9910.9741.0001.0000.0000.2620.194
환경플랫폼 하위 도메인명0.9130.0000.0440.0000.0001.0000.8450.000
도메인 하위 카테고리명0.8050.2030.1490.2690.2620.8451.0000.000
SNS 채널명0.0000.3100.0000.1220.1940.0000.0001.000

Missing values

2024-04-17T13:41:33.355688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T13:41:33.462938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

주간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
012020-01-06물환경물재난All10167222733729075
122020-01-06물환경물재난blog6724231809219187
232020-01-06물환경물재난twitter34429992459888
342020-01-06물환경상수도All3089683558759
452020-01-06물환경상수도blog2325764316720
562020-01-06물환경상수도twitter763919242039
672020-01-06물환경지하수All1074936023758
782020-01-06물환경지하수blog863629473069
892020-01-06물환경지하수twitter2113655689
9102020-01-06물환경하수도All1737062406483
주간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
35362020-01-06자연환경생태계twitter334914521534
36372020-01-06자연환경지질All15018974787450272
37382020-01-06자연환경지질blog6643862104022090
38392020-01-06자연환경지질twitter8375112683428182
39402020-01-06자연환경지형All4502421664417336
40412020-01-06자연환경지형blog3311481244312922
41422020-01-06자연환경지형twitter1199442014414
42432020-01-06자연환경토양All29712777718195
43442020-01-06자연환경토양blog26511168177193
44452020-01-06자연환경토양twitter32169541002