Overview

Dataset statistics

Number of variables: 9
Number of observations: 41
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.2 KiB
Average record size in memory: 80.2 B

Variable types

Numeric: 5
DateTime: 1
Categorical: 3

Dataset

Description: Sample data
Author: Sungkyunkwan University Industry-Academic Cooperation Foundation (성균관대학교 산학협력단)
URL: https://www.bigdata-environment.kr/user/data_market/detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4

Alerts

연월일 has constant value "2021-01-04" (Constant)
주간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 and 1 other field (High correlation)
긍정언급량 is highly overall correlated with 부정언급량 and 4 other fields (High correlation)
부정언급량 is highly overall correlated with 긍정언급량 and 4 other fields (High correlation)
중립언급량 is highly overall correlated with 긍정언급량 and 5 other fields (High correlation)
총언급량 is highly overall correlated with 긍정언급량 and 5 other fields (High correlation)
환경플랫폼 하위 도메인명 is highly overall correlated with 주간언급량연번 and 3 other fields (High correlation)
도메인 하위 카테고리명 is highly overall correlated with 주간언급량연번 and 5 other fields (High correlation)
SNS 채널명 is highly overall correlated with 긍정언급량 and 3 other fields (High correlation)
주간언급량연번 has unique values (Unique)
긍정언급량 has 3 (7.3%) zeros (Zeros)
부정언급량 has 2 (4.9%) zeros (Zeros)

Reproduction

Analysis started: 2024-04-17 04:41:21.455146
Analysis finished: 2024-04-17 04:41:23.735025
Duration: 2.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
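
The reported duration can be cross-checked from the two timestamps. This is a minimal sketch using only the Python standard library; the variable names are illustrative and do not come from the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2024-04-17 04:41:21.455146")
finished = datetime.fromisoformat("2024-04-17 04:41:23.735025")

# Elapsed time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```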

Variables

주간언급량연번 (weekly mention count serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 41
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21
Minimum: 1
Maximum: 41
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 11
Median: 21
Q3: 31
95-th percentile: 39
Maximum: 41
Range: 40
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 11.979149
Coefficient of variation (CV): 0.57043565
Kurtosis: -1.2
Mean: 21
Median Absolute Deviation (MAD): 10
Skewness: 0
Sum: 861
Variance: 143.5
Monotonicity: Strictly increasing
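
Because this variable takes each integer from 1 to 41 exactly once, its summary statistics can be recomputed directly. The sketch below uses Python's `statistics` module and assumes the sample (n - 1) variance, which is the convention that matches the reported figures.

```python
import statistics

# The serial number runs from 1 to 41 with no gaps or repeats.
values = list(range(1, 42))

total = sum(values)                       # Sum
mean = statistics.mean(values)            # Mean
variance = statistics.variance(values)    # Sample variance (n - 1 denominator)
std = statistics.stdev(values)            # Sample standard deviation
cv = std / mean                           # Coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), mad)
```

The same recomputation is not possible for the mention-count variables, since the report lists only part of their values.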
[Figure: histogram with fixed-size bins (bins=41)]

Common values

Value | Count | Frequency (%)
1 | 1 | 2.4%
32 | 1 | 2.4%
24 | 1 | 2.4%
25 | 1 | 2.4%
26 | 1 | 2.4%
27 | 1 | 2.4%
28 | 1 | 2.4%
29 | 1 | 2.4%
30 | 1 | 2.4%
31 | 1 | 2.4%
Other values (31) | 31 | 75.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.4%
2 | 1 | 2.4%
3 | 1 | 2.4%
4 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
7 | 1 | 2.4%
8 | 1 | 2.4%
9 | 1 | 2.4%
10 | 1 | 2.4%

Maximum 10 values

Value | Count | Frequency (%)
41 | 1 | 2.4%
40 | 1 | 2.4%
39 | 1 | 2.4%
38 | 1 | 2.4%
37 | 1 | 2.4%
36 | 1 | 2.4%
35 | 1 | 2.4%
34 | 1 | 2.4%
33 | 1 | 2.4%
32 | 1 | 2.4%

연월일 (date: year, month, day)
Date

CONSTANT 

Distinct: 1
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
Minimum: 2021-01-04 00:00:00
Maximum: 2021-01-04 00:00:00
[Figure: histogram with fixed-size bins (bins=1)]

환경플랫폼 하위 도메인명 (environment platform subdomain name)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 7.3%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
Category counts: 자연환경 17, 물환경 15, 생활환경 9

Length

Max length: 4
Median length: 4
Mean length: 3.6341463
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 물환경
2nd row: 물환경
3rd row: 물환경
4th row: 물환경
5th row: 물환경

Common Values

Value | Count | Frequency (%)
자연환경 | 17 | 41.5%
물환경 | 15 | 36.6%
생활환경 | 9 | 22.0%
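
Both the Frequency (%) column and the mean length reported for this variable follow from the category counts. The sketch below is illustrative only: it assumes the length statistic counts characters, so that 물환경 has length 3 and the other two categories have length 4.

```python
# Category counts copied from the Common Values table.
counts = {"자연환경": 17, "물환경": 15, "생활환경": 9}

n_rows = sum(counts.values())  # 41 observations

# Share of each category, as a percentage rounded to one decimal place.
shares = {name: round(100 * count / n_rows, 1) for name, count in counts.items()}

# Mean label length, weighting each label's character count by its frequency.
mean_length = sum(len(name) * count for name, count in counts.items()) / n_rows

print(shares)
print(round(mean_length, 7))
```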

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

도메인 하위 카테고리명 (domain subcategory name)
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 36.6%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
Top categories: 지하수, 하수도, 하천, 대기, 폐기물; other values (10): 26

Length

Max length: 4
Median length: 3
Mean length: 2.8292683
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 물재난
2nd row: 물재난
3rd row: 상수도
4th row: 상수도
5th row: 지하수

Common Values

Value | Count | Frequency (%)
지하수 | 3 | 7.3%
하수도 | 3 | 7.3%
하천 | 3 | 7.3%
대기 | 3 | 7.3%
폐기물 | 3 | 7.3%
화학물질 | 3 | 7.3%
기상변화 | 3 | 7.3%
기후변화 | 3 | 7.3%
생태계 | 3 | 7.3%
지질 | 3 | 7.3%
Other values (5) | 11 | 26.8%

Length

[Figure: histogram of lengths of the category]

SNS 채널명 (SNS channel name)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 7.3%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
Category counts: All 15, blog 15, twitter 11

Length

Max length: 7
Median length: 4
Mean length: 4.4390244
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: All
2nd row: blog
3rd row: All
4th row: blog
5th row: All

Common Values

Value | Count | Frequency (%)
All | 15 | 36.6%
blog | 15 | 36.6%
twitter | 11 | 26.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

긍정언급량 (positive mention count)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 30
Distinct (%): 73.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 454.29268
Minimum: 0
Maximum: 1068
Zeros: 3
Zeros (%): 7.3%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 16
Median: 467
Q3: 785
95-th percentile: 845
Maximum: 1068
Range: 1068
Interquartile range (IQR): 769

Descriptive statistics

Standard deviation: 339.90471
Coefficient of variation (CV): 0.74820644
Kurtosis: -1.2356635
Mean: 454.29268
Median Absolute Deviation (MAD): 339
Skewness: -0.036876584
Sum: 18626
Variance: 115535.21
Monotonicity: Not monotonic
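
Several of these figures are derived from the others: the range and IQR are differences, the mean is the sum divided by the number of observations, the CV is the standard deviation divided by the mean, and the variance is the standard deviation squared. A short sketch that recomputes them from the reported values (the inputs are rounded as printed in the report, so results agree only up to rounding):

```python
# Values copied from the 긍정언급량 (positive mention count) summary.
n_observations = 41
minimum, maximum = 0, 1068
q1, q3 = 16, 785
total = 18626
std = 339.90471

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
mean = total / n_observations     # Mean
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(value_range, iqr, round(mean, 5), round(cv, 6), round(variance, 1))
```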
[Figure: histogram with fixed-size bins (bins=30)]

Common values

Value | Count | Frequency (%)
0 | 3 | 7.3%
828 | 2 | 4.9%
278 | 2 | 4.9%
845 | 2 | 4.9%
1 | 2 | 4.9%
467 | 2 | 4.9%
2 | 2 | 4.9%
304 | 2 | 4.9%
319 | 2 | 4.9%
550 | 2 | 4.9%
Other values (20) | 20 | 48.8%

Minimum 10 values

Value | Count | Frequency (%)
0 | 3 | 7.3%
1 | 2 | 4.9%
2 | 2 | 4.9%
5 | 1 | 2.4%
7 | 1 | 2.4%
8 | 1 | 2.4%
16 | 1 | 2.4%
278 | 2 | 4.9%
304 | 2 | 4.9%
319 | 2 | 4.9%

Maximum 10 values

Value | Count | Frequency (%)
1068 | 1 | 2.4%
1066 | 1 | 2.4%
845 | 2 | 4.9%
828 | 2 | 4.9%
815 | 1 | 2.4%
814 | 1 | 2.4%
811 | 1 | 2.4%
806 | 1 | 2.4%
785 | 1 | 2.4%
777 | 1 | 2.4%

부정언급량 (negative mention count)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 32
Distinct (%): 78.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 227.5122
Minimum: 0
Maximum: 644
Zeros: 2
Zeros (%): 4.9%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 18
Median: 230
Q3: 373
95-th percentile: 511
Maximum: 644
Range: 644
Interquartile range (IQR): 355

Descriptive statistics

Standard deviation: 186.57574
Coefficient of variation (CV): 0.82006918
Kurtosis: -0.64621517
Mean: 227.5122
Median Absolute Deviation (MAD): 148
Skewness: 0.41472885
Sum: 9328
Variance: 34810.506
Monotonicity: Not monotonic
[Figure: histogram with fixed-size bins (bins=32)]

Common values

Value | Count | Frequency (%)
430 | 2 | 4.9%
0 | 2 | 4.9%
230 | 2 | 4.9%
82 | 2 | 4.9%
294 | 2 | 4.9%
2 | 2 | 4.9%
1 | 2 | 4.9%
182 | 2 | 4.9%
417 | 2 | 4.9%
6 | 1 | 2.4%
Other values (22) | 22 | 53.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 2 | 4.9%
1 | 2 | 4.9%
2 | 2 | 4.9%
4 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
16 | 1 | 2.4%
18 | 1 | 2.4%
82 | 2 | 4.9%
131 | 1 | 2.4%

Maximum 10 values

Value | Count | Frequency (%)
644 | 1 | 2.4%
640 | 1 | 2.4%
511 | 1 | 2.4%
509 | 1 | 2.4%
430 | 2 | 4.9%
417 | 2 | 4.9%
394 | 1 | 2.4%
378 | 1 | 2.4%
373 | 1 | 2.4%
368 | 1 | 2.4%

중립언급량 (neutral mention count)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 37
Distinct (%): 90.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18387.268
Minimum: 6
Maximum: 37872
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B

Quantile statistics

Minimum: 6
5-th percentile: 17
Q1: 511
Median: 22570
Q3: 30284
95-th percentile: 36174
Maximum: 37872
Range: 37866
Interquartile range (IQR): 29773

Descriptive statistics

Standard deviation: 13666.581
Coefficient of variation (CV): 0.74326329
Kurtosis: -1.5385482
Mean: 18387.268
Median Absolute Deviation (MAD): 10790
Skewness: -0.20244944
Sum: 753878
Variance: 1.8677545 × 10^8
Monotonicity: Not monotonic
[Figure: histogram with fixed-size bins (bins=37)]

Common values

Value | Count | Frequency (%)
30284 | 2 | 4.9%
8789 | 2 | 4.9%
17662 | 2 | 4.9%
36174 | 2 | 4.9%
32490 | 1 | 2.4%
29326 | 1 | 2.4%
29302 | 1 | 2.4%
24 | 1 | 2.4%
32412 | 1 | 2.4%
32150 | 1 | 2.4%
Other values (27) | 27 | 65.9%

Minimum 10 values

Value | Count | Frequency (%)
6 | 1 | 2.4%
8 | 1 | 2.4%
17 | 1 | 2.4%
23 | 1 | 2.4%
24 | 1 | 2.4%
73 | 1 | 2.4%
105 | 1 | 2.4%
262 | 1 | 2.4%
323 | 1 | 2.4%
405 | 1 | 2.4%

Maximum 10 values

Value | Count | Frequency (%)
37872 | 1 | 2.4%
37799 | 1 | 2.4%
36174 | 2 | 4.9%
32490 | 1 | 2.4%
32412 | 1 | 2.4%
32167 | 1 | 2.4%
32150 | 1 | 2.4%
31183 | 1 | 2.4%
30778 | 1 | 2.4%
30284 | 2 | 4.9%

총언급량 (total mention count)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 37
Distinct (%): 90.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19069.073
Minimum: 7
Maximum: 39584
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 501.0 B

Quantile statistics

Minimum: 7
5-th percentile: 18
Q1: 545
Median: 23330
Q3: 31542
95-th percentile: 37436
Maximum: 39584
Range: 39577
Interquartile range (IQR): 30997

Descriptive statistics

Standard deviation: 14170.793
Coefficient of variation (CV): 0.74312962
Kurtosis: -1.5305933
Mean: 19069.073
Median Absolute Deviation (MAD): 11080
Skewness: -0.19897731
Sum: 781832
Variance: 2.0081137 × 10^8
Monotonicity: Not monotonic
[Figure: histogram with fixed-size bins (bins=37)]

Common values

Value | Count | Frequency (%)
31542 | 2 | 4.9%
9149 | 2 | 4.9%
18359 | 2 | 4.9%
37436 | 2 | 4.9%
33531 | 1 | 2.4%
30652 | 1 | 2.4%
30625 | 1 | 2.4%
27 | 1 | 2.4%
33484 | 1 | 2.4%
33210 | 1 | 2.4%
Other values (27) | 27 | 65.9%

Minimum 10 values

Value | Count | Frequency (%)
7 | 1 | 2.4%
10 | 1 | 2.4%
18 | 1 | 2.4%
24 | 1 | 2.4%
27 | 1 | 2.4%
79 | 1 | 2.4%
107 | 1 | 2.4%
274 | 1 | 2.4%
337 | 1 | 2.4%
426 | 1 | 2.4%

Maximum 10 values

Value | Count | Frequency (%)
39584 | 1 | 2.4%
39505 | 1 | 2.4%
37436 | 2 | 4.9%
33531 | 1 | 2.4%
33484 | 1 | 2.4%
33210 | 1 | 2.4%
33194 | 1 | 2.4%
32388 | 1 | 2.4%
31962 | 1 | 2.4%
31542 | 2 | 4.9%

Interactions

[Figures: 25 pairwise interaction plots]

Correlations

[Figure: correlation heatmap]

 | 주간언급량연번 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량
주간언급량연번 | 1.000 | 0.958 | 0.971 | 0.000 | 0.541 | 0.694 | 0.775 | 0.775
환경플랫폼 하위 도메인명 | 0.958 | 1.000 | 1.000 | 0.000 | 0.526 | 0.493 | 0.842 | 0.842
도메인 하위 카테고리명 | 0.971 | 1.000 | 1.000 | 0.000 | 0.874 | 0.915 | 0.925 | 0.925
SNS 채널명 | 0.000 | 0.000 | 0.000 | 1.000 | 0.722 | 0.892 | 0.886 | 0.886
긍정언급량 | 0.541 | 0.526 | 0.874 | 0.722 | 1.000 | 0.918 | 0.878 | 0.878
부정언급량 | 0.694 | 0.493 | 0.915 | 0.892 | 0.918 | 1.000 | 0.950 | 0.950
중립언급량 | 0.775 | 0.842 | 0.925 | 0.886 | 0.878 | 0.950 | 1.000 | 1.000
총언급량 | 0.775 | 0.842 | 0.925 | 0.886 | 0.878 | 0.950 | 1.000 | 1.000

[Figure: correlation heatmap]

 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명
환경플랫폼 하위 도메인명 | 1.000 | 0.827 | 0.000
도메인 하위 카테고리명 | 0.827 | 1.000 | 0.000
SNS 채널명 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

 | 주간언급량연번 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명
주간언급량연번 | 1.000 | 0.134 | 0.028 | 0.153 | 0.151 | 0.809 | 0.746 | 0.000
긍정언급량 | 0.134 | 1.000 | 0.960 | 0.957 | 0.959 | 0.387 | 0.558 | 0.607
부정언급량 | 0.028 | 0.960 | 1.000 | 0.906 | 0.914 | 0.218 | 0.619 | 0.570
중립언급량 | 0.153 | 0.957 | 0.906 | 1.000 | 0.998 | 0.507 | 0.643 | 0.562
총언급량 | 0.151 | 0.959 | 0.914 | 0.998 | 1.000 | 0.507 | 0.643 | 0.562
환경플랫폼 하위 도메인명 | 0.809 | 0.387 | 0.218 | 0.507 | 0.507 | 1.000 | 0.827 | 0.000
도메인 하위 카테고리명 | 0.746 | 0.558 | 0.619 | 0.643 | 0.643 | 0.827 | 1.000 | 0.000
SNS 채널명 | 0.000 | 0.607 | 0.570 | 0.562 | 0.562 | 0.000 | 0.000 | 1.000
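
The "High correlation" alerts appear to be derived from the last matrix above: counting the off-diagonal entries at or above 0.5 in each row reproduces the number of fields named in each alert. The 0.5 cutoff is an assumption inferred from these figures, not something the report states. A minimal sketch:

```python
# Variable order and coefficients copied from the last correlation matrix.
names = [
    "주간언급량연번", "긍정언급량", "부정언급량", "중립언급량",
    "총언급량", "환경플랫폼 하위 도메인명", "도메인 하위 카테고리명", "SNS 채널명",
]
matrix = [
    [1.000, 0.134, 0.028, 0.153, 0.151, 0.809, 0.746, 0.000],
    [0.134, 1.000, 0.960, 0.957, 0.959, 0.387, 0.558, 0.607],
    [0.028, 0.960, 1.000, 0.906, 0.914, 0.218, 0.619, 0.570],
    [0.153, 0.957, 0.906, 1.000, 0.998, 0.507, 0.643, 0.562],
    [0.151, 0.959, 0.914, 0.998, 1.000, 0.507, 0.643, 0.562],
    [0.809, 0.387, 0.218, 0.507, 0.507, 1.000, 0.827, 0.000],
    [0.746, 0.558, 0.619, 0.643, 0.643, 0.827, 1.000, 0.000],
    [0.000, 0.607, 0.570, 0.562, 0.562, 0.000, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, chosen because it matches the alert counts

# For each variable, list the other variables at or above the cutoff.
partners = {
    name: [names[j] for j, value in enumerate(row) if j != i and value >= THRESHOLD]
    for i, (name, row) in enumerate(zip(names, matrix))
}

for name, others in partners.items():
    print(f"{name}: {len(others)} highly correlated fields")
```

Under this assumption 주간언급량연번 has 2 partners, matching the alert that names one field "and 1 other", while 중립언급량 and 총언급량 each have 6.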

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

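First rows

 | 주간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량
0 | 1 | 2021-01-04 | 물환경 | 물재난 | All | 828 | 430 | 30284 | 31542
1 | 2 | 2021-01-04 | 물환경 | 물재난 | blog | 828 | 430 | 30284 | 31542
2 | 3 | 2021-01-04 | 물환경 | 상수도 | All | 467 | 230 | 17662 | 18359
3 | 4 | 2021-01-04 | 물환경 | 상수도 | blog | 467 | 230 | 17662 | 18359
4 | 5 | 2021-01-04 | 물환경 | 지하수 | All | 304 | 132 | 9984 | 10420
5 | 6 | 2021-01-04 | 물환경 | 지하수 | blog | 304 | 131 | 9961 | 10396
6 | 7 | 2021-01-04 | 물환경 | 지하수 | twitter | 0 | 1 | 23 | 24
7 | 8 | 2021-01-04 | 물환경 | 하수도 | All | 550 | 263 | 30143 | 30956
8 | 9 | 2021-01-04 | 물환경 | 하수도 | blog | 550 | 262 | 30137 | 30949
9 | 10 | 2021-01-04 | 물환경 | 하수도 | twitter | 0 | 1 | 6 | 7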
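
Last rows

 | 주간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량
31 | 32 | 2021-01-04 | 자연환경 | 생태계 | blog | 777 | 250 | 32167 | 33194
32 | 33 | 2021-01-04 | 자연환경 | 생태계 | twitter | 8 | 6 | 323 | 337
33 | 34 | 2021-01-04 | 자연환경 | 지질 | All | 612 | 294 | 23059 | 23965
34 | 35 | 2021-01-04 | 자연환경 | 지질 | blog | 611 | 294 | 23042 | 23947
35 | 36 | 2021-01-04 | 자연환경 | 지질 | twitter | 1 | 0 | 17 | 18
36 | 37 | 2021-01-04 | 자연환경 | 지형 | All | 477 | 182 | 23211 | 23870
37 | 38 | 2021-01-04 | 자연환경 | 지형 | blog | 475 | 182 | 23203 | 23860
38 | 39 | 2021-01-04 | 자연환경 | 지형 | twitter | 2 | 0 | 8 | 10
39 | 40 | 2021-01-04 | 자연환경 | 토양 | All | 845 | 417 | 36174 | 37436
40 | 41 | 2021-01-04 | 자연환경 | 토양 | blog | 845 | 417 | 36174 | 37436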
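
The sample rows suggest two structural relationships that would help explain the high correlations among the mention counts. In every row shown, 총언급량 equals the sum of the positive, negative and neutral counts, and for each category the "All" row equals the "blog" row plus the "twitter" row. The sketch below checks both on rows copied from the sample, using only categories for which all three channels are visible. Whether the channel relationship holds for rows not shown is an assumption.

```python
# (category, channel, positive, negative, neutral, total), copied from the sample rows.
rows = [
    ("지하수", "All", 304, 132, 9984, 10420),
    ("지하수", "blog", 304, 131, 9961, 10396),
    ("지하수", "twitter", 0, 1, 23, 24),
    ("하수도", "All", 550, 263, 30143, 30956),
    ("하수도", "blog", 550, 262, 30137, 30949),
    ("하수도", "twitter", 0, 1, 6, 7),
    ("지질", "All", 612, 294, 23059, 23965),
    ("지질", "blog", 611, 294, 23042, 23947),
    ("지질", "twitter", 1, 0, 17, 18),
    ("지형", "All", 477, 182, 23211, 23870),
    ("지형", "blog", 475, 182, 23203, 23860),
    ("지형", "twitter", 2, 0, 8, 10),
]

# Check 1: total = positive + negative + neutral in every row.
totals_match = all(pos + neg + neu == total for _, _, pos, neg, neu, total in rows)

# Check 2: within each category, All = blog + twitter for every count column.
by_category = {}
for category, channel, *counts in rows:
    by_category.setdefault(category, {})[channel] = counts
channels_match = all(
    [b + t for b, t in zip(ch["blog"], ch["twitter"])] == ch["All"]
    for ch in by_category.values()
)

# Check 3: the column sums reported in the Variables section are consistent with check 1.
column_sums = {"긍정언급량": 18626, "부정언급량": 9328, "중립언급량": 753878, "총언급량": 781832}
sums_match = (
    column_sums["긍정언급량"] + column_sums["부정언급량"] + column_sums["중립언급량"]
    == column_sums["총언급량"]
)

print(totals_match, channels_match, sums_match)
```

If the total is indeed the sum of the other three counts, it carries no independent information, which is worth keeping in mind before using all four columns together in a model.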