Overview

Dataset statistics

Number of variables: 9
Number of observations: 42
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.3 KiB
Average record size in memory: 80.1 B

Variable types

Numeric: 5
Categorical: 4

Dataset

Description: Sample data
Author: Sungkyunkwan University Research & Business Foundation (성균관대학교 산학협력단)
URL: https://www.bigdata-environment.kr/user/data_market/detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4

Alerts

연월일 (date) has constant value "2021-04-05" [Constant]
주간언급량연번 (weekly mention serial number) is highly overall correlated with 환경플랫폼 하위 도메인명 (environment platform sub-domain name) and 1 other field [High correlation]
긍정언급량 (positive mention count) is highly overall correlated with 부정언급량 (negative mention count) and 4 other fields [High correlation]
부정언급량 is highly overall correlated with 긍정언급량 and 2 other fields [High correlation]
중립언급량 (neutral mention count) is highly overall correlated with 긍정언급량 and 5 other fields [High correlation]
총언급량 (total mention count) is highly overall correlated with 긍정언급량 and 5 other fields [High correlation]
환경플랫폼 하위 도메인명 is highly overall correlated with 주간언급량연번 and 3 other fields [High correlation]
도메인 하위 카테고리명 (domain sub-category name) is highly overall correlated with 주간언급량연번 and 4 other fields [High correlation]
SNS 채널명 (SNS channel name) is highly overall correlated with 긍정언급량 and 2 other fields [High correlation]
주간언급량연번 has unique values [Unique]
부정언급량 has 2 (4.8%) zeros [Zeros]

Reproduction

Analysis started: 2024-04-17 04:41:14.597771
Analysis finished: 2024-04-17 04:41:18.542335
Duration: 3.94 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
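
A report with this structure could be regenerated along the following lines. This is a minimal sketch, not the exact invocation used for this report: the function name `build_report`, the report title, and the input file path are hypothetical, and the `dataset` metadata simply mirrors the Dataset fields listed above. It assumes `pandas` and `ydata-profiling` are installed.

```python
from pathlib import Path


def build_report(csv_path: str, html_path: str = "report.html") -> Path:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported inside the function so that defining it does not require
    # pandas or ydata-profiling to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(
        df,
        title="Weekly mention counts",  # hypothetical title
        dataset={
            "description": "샘플 데이터",
            "author": "성균관대학교 산학협력단",
            "url": (
                "https://www.bigdata-environment.kr/user/data_market/"
                "detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4"
            ),
        },
    )
    profile.to_file(html_path)
    return Path(html_path)
```

Calling `build_report("weekly_mentions.csv")` (a hypothetical file name) would write `report.html` to the working directory.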

Variables

주간언급량연번 (weekly mention serial number)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 42
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.5
Minimum: 1
Maximum: 42
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.05
Q1: 11.25
Median: 21.5
Q3: 31.75
95-th percentile: 39.95
Maximum: 42
Range: 41
Interquartile range (IQR): 20.5

Descriptive statistics

Standard deviation: 12.267844
Coefficient of variation (CV): 0.5705974
Kurtosis: -1.2
Mean: 21.5
Median Absolute Deviation (MAD): 10.5
Skewness: 0
Sum: 903
Variance: 150.5
Monotonicity: Strictly increasing
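
The figures above are consistent with 주간언급량연번 being the sequence 1, 2, ..., 42, so several of them can be checked directly. A minimal sketch using only the Python standard library follows; the use of the sample variance (n - 1 denominator) and of the inclusive, linearly interpolated quantile method are assumptions about how the report computes these values.

```python
import statistics

# Assumption: the column holds the integers 1..42 (consistent with the
# minimum, maximum, distinct count, and sum reported above).
values = list(range(1, 43))

total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean  # coefficient of variation
mad = statistics.median([abs(v - median) for v in values])

# Quartiles with linear interpolation over the closed range of the data.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print(total, mean, median, variance, round(std, 6), round(cv, 7), mad, q1, q3, iqr)
```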
[Figure: histogram with fixed size bins (bins=42)]

Common values
Value | Count | Frequency (%)
1 | 1 | 2.4%
33 | 1 | 2.4%
25 | 1 | 2.4%
26 | 1 | 2.4%
27 | 1 | 2.4%
28 | 1 | 2.4%
29 | 1 | 2.4%
30 | 1 | 2.4%
31 | 1 | 2.4%
32 | 1 | 2.4%
Other values (32) | 32 | 76.2%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.4%
2 | 1 | 2.4%
3 | 1 | 2.4%
4 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
7 | 1 | 2.4%
8 | 1 | 2.4%
9 | 1 | 2.4%
10 | 1 | 2.4%

Maximum 10 values
Value | Count | Frequency (%)
42 | 1 | 2.4%
41 | 1 | 2.4%
40 | 1 | 2.4%
39 | 1 | 2.4%
38 | 1 | 2.4%
37 | 1 | 2.4%
36 | 1 | 2.4%
35 | 1 | 2.4%
34 | 1 | 2.4%
33 | 1 | 2.4%

연월일 (date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
2021-04-05: 42

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-04-05
2nd row: 2021-04-05
3rd row: 2021-04-05
4th row: 2021-04-05
5th row: 2021-04-05

Common Values

Value | Count | Frequency (%)
2021-04-05 | 42 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

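환경플랫폼 하위 도메인명 (environment platform sub-domain name)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
자연환경 (natural environment): 18
물환경 (water environment): 16
생활환경 (living environment): 8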

Length

Max length: 4
Median length: 4
Mean length: 3.6190476
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 물환경 (water environment)
2nd row: 물환경
3rd row: 물환경
4th row: 물환경
5th row: 물환경

Common Values

Value | Count | Frequency (%)
자연환경 (natural environment) | 18 | 42.9%
물환경 (water environment) | 16 | 38.1%
생활환경 (living environment) | 8 | 19.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

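도메인 하위 카테고리명 (domain sub-category name)
Categorical

HIGH CORRELATION

Distinct: 15
Distinct (%): 35.7%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
물재난 (water disasters): 3
지하수 (groundwater): 3
하수도 (sewerage): 3
하천 (rivers): 3
대기 (air): 3
Other values (10): 27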

Length

Max length: 4
Median length: 3
Mean length: 2.7857143
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 물재난 (water disasters)
2nd row: 물재난
3rd row: 물재난
4th row: 상수도 (water supply)
5th row: 상수도

Common Values

Value | Count | Frequency (%)
물재난 (water disasters) | 3 | 7.1%
지하수 (groundwater) | 3 | 7.1%
하수도 (sewerage) | 3 | 7.1%
하천 (rivers) | 3 | 7.1%
대기 (air) | 3 | 7.1%
폐기물 (waste) | 3 | 7.1%
기상변화 (weather change) | 3 | 7.1%
기후변화 (climate change) | 3 | 7.1%
생태계 (ecosystem) | 3 | 7.1%
지질 (geology) | 3 | 7.1%
Other values (5) | 12 | 28.6%

Length

[Figure: histogram of lengths of the category]

SNS 채널명 (SNS channel name)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
All: 15
blog: 15
twitter: 12

Length

Max length: 7
Median length: 4
Mean length: 4.5
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: All
2nd row: blog
3rd row: twitter
4th row: All
5th row: blog

Common Values

Value | Count | Frequency (%)
All | 15 | 35.7%
blog | 15 | 35.7%
twitter | 12 | 28.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

긍정언급량 (positive mention count)
Real number (ℝ)

HIGH CORRELATION

Distinct: 34
Distinct (%): 81.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 269.47619
Minimum: 1
Maximum: 784
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 13.25
Median: 286.5
Q3: 398
95-th percentile: 598
Maximum: 784
Range: 783
Interquartile range (IQR): 384.75

Descriptive statistics

Standard deviation: 221.75412
Coefficient of variation (CV): 0.82290803
Kurtosis: -0.40135752
Mean: 269.47619
Median Absolute Deviation (MAD): 167.5
Skewness: 0.46844785
Sum: 11318
Variance: 49174.89
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=34)]

Common values
Value | Count | Frequency (%)
1 | 5 | 11.9%
598 | 2 | 4.8%
156 | 2 | 4.8%
2 | 2 | 4.8%
295 | 2 | 4.8%
490 | 1 | 2.4%
784 | 1 | 2.4%
779 | 1 | 2.4%
5 | 1 | 2.4%
457 | 1 | 2.4%
Other values (24) | 24 | 57.1%

Minimum 10 values
Value | Count | Frequency (%)
1 | 5 | 11.9%
2 | 2 | 4.8%
3 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
12 | 1 | 2.4%
17 | 1 | 2.4%
156 | 2 | 4.8%
193 | 1 | 2.4%
196 | 1 | 2.4%

Maximum 10 values
Value | Count | Frequency (%)
784 | 1 | 2.4%
779 | 1 | 2.4%
598 | 2 | 4.8%
560 | 1 | 2.4%
559 | 1 | 2.4%
491 | 1 | 2.4%
490 | 1 | 2.4%
457 | 1 | 2.4%
451 | 1 | 2.4%
401 | 1 | 2.4%

부정언급량 (negative mention count)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 32
Distinct (%): 76.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 116
Minimum: 0
Maximum: 347
Zeros: 2
Zeros (%): 4.8%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 15.25
Median: 103.5
Q3: 171.25
95-th percentile: 332
Maximum: 347
Range: 347
Interquartile range (IQR): 156

Descriptive statistics

Standard deviation: 108.87496
Coefficient of variation (CV): 0.93857721
Kurtosis: -0.44019902
Mean: 116
Median Absolute Deviation (MAD): 88
Skewness: 0.78079622
Sum: 4872
Variance: 11853.756
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=32)]

Common values
Value | Count | Frequency (%)
1 | 4 | 9.5%
144 | 2 | 4.8%
0 | 2 | 4.8%
4 | 2 | 4.8%
24 | 2 | 4.8%
332 | 2 | 4.8%
232 | 2 | 4.8%
108 | 2 | 4.8%
104 | 1 | 2.4%
114 | 1 | 2.4%
Other values (22) | 22 | 52.4%

Minimum 10 values
Value | Count | Frequency (%)
0 | 2 | 4.8%
1 | 4 | 9.5%
3 | 1 | 2.4%
4 | 2 | 4.8%
7 | 1 | 2.4%
15 | 1 | 2.4%
16 | 1 | 2.4%
24 | 2 | 4.8%
58 | 1 | 2.4%
59 | 1 | 2.4%

Maximum 10 values
Value | Count | Frequency (%)
347 | 1 | 2.4%
343 | 1 | 2.4%
332 | 2 | 4.8%
280 | 1 | 2.4%
276 | 1 | 2.4%
232 | 2 | 4.8%
222 | 1 | 2.4%
215 | 1 | 2.4%
175 | 1 | 2.4%
160 | 1 | 2.4%

중립언급량 (neutral mention count)
Real number (ℝ)

HIGH CORRELATION

Distinct: 38
Distinct (%): 90.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10347.286
Minimum: 12
Maximum: 33657
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 12
5-th percentile: 21.05
Q1: 776
Median: 9399
Q3: 15052.5
95-th percentile: 23856.95
Maximum: 33657
Range: 33645
Interquartile range (IQR): 14276.5

Descriptive statistics

Standard deviation: 8915.768
Coefficient of variation (CV): 0.86165283
Kurtosis: 0.38361236
Mean: 10347.286
Median Absolute Deviation (MAD): 7897
Skewness: 0.74916188
Sum: 434586
Variance: 79490919
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=38)]

Common values
Value | Count | Frequency (%)
20241 | 2 | 4.8%
9272 | 2 | 4.8%
12 | 2 | 4.8%
4723 | 2 | 4.8%
512 | 1 | 2.4%
34 | 1 | 2.4%
33657 | 1 | 2.4%
33511 | 1 | 2.4%
146 | 1 | 2.4%
17554 | 1 | 2.4%
Other values (28) | 28 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
12 | 2 | 4.8%
21 | 1 | 2.4%
22 | 1 | 2.4%
25 | 1 | 2.4%
34 | 1 | 2.4%
36 | 1 | 2.4%
103 | 1 | 2.4%
146 | 1 | 2.4%
512 | 1 | 2.4%
642 | 1 | 2.4%

Maximum 10 values
Value | Count | Frequency (%)
33657 | 1 | 2.4%
33511 | 1 | 2.4%
23858 | 1 | 2.4%
23837 | 1 | 2.4%
20241 | 2 | 4.8%
17584 | 1 | 2.4%
17554 | 1 | 2.4%
17550 | 1 | 2.4%
17042 | 1 | 2.4%
15347 | 1 | 2.4%

총언급량 (total mention count)
Real number (ℝ)

HIGH CORRELATION

Distinct: 38
Distinct (%): 90.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10732.762
Minimum: 14
Maximum: 34788
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 14
5-th percentile: 22.2
Q1: 804.75
Median: 9804
Q3: 15670.75
95-th percentile: 24648.9
Maximum: 34788
Range: 34774
Interquartile range (IQR): 14866

Descriptive statistics

Standard deviation: 9235.9114
Coefficient of variation (CV): 0.86053446
Kurtosis: 0.34733169
Mean: 10732.762
Median Absolute Deviation (MAD): 8078
Skewness: 0.73900527
Sum: 450776
Variance: 85302060
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=38)]

Common values
Value | Count | Frequency (%)
21171 | 2 | 4.8%
9675 | 2 | 4.8%
14 | 2 | 4.8%
4903 | 2 | 4.8%
534 | 1 | 2.4%
39 | 1 | 2.4%
34788 | 1 | 2.4%
34633 | 1 | 2.4%
155 | 1 | 2.4%
18149 | 1 | 2.4%
Other values (28) | 28 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
14 | 2 | 4.8%
22 | 1 | 2.4%
26 | 1 | 2.4%
27 | 1 | 2.4%
38 | 1 | 2.4%
39 | 1 | 2.4%
108 | 1 | 2.4%
155 | 1 | 2.4%
534 | 1 | 2.4%
674 | 1 | 2.4%

Maximum 10 values
Value | Count | Frequency (%)
34788 | 1 | 2.4%
34633 | 1 | 2.4%
24650 | 1 | 2.4%
24628 | 1 | 2.4%
21171 | 2 | 4.8%
18355 | 1 | 2.4%
18316 | 1 | 2.4%
18149 | 1 | 2.4%
17615 | 1 | 2.4%
15970 | 1 | 2.4%

Interactions

[Figures: 25 pairwise interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

Variable | 주간언급량연번 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량
주간언급량연번 | 1.000 | 0.931 | 0.962 | 0.000 | 0.767 | 0.785 | 0.695 | 0.733
환경플랫폼 하위 도메인명 | 0.931 | 1.000 | 1.000 | 0.000 | 0.736 | 0.458 | 0.890 | 0.928
도메인 하위 카테고리명 | 0.962 | 1.000 | 1.000 | 0.000 | 0.907 | 0.847 | 0.909 | 0.918
SNS 채널명 | 0.000 | 0.000 | 0.000 | 1.000 | 0.894 | 0.698 | 0.893 | 0.888
긍정언급량 | 0.767 | 0.736 | 0.907 | 0.894 | 1.000 | 0.883 | 0.978 | 0.980
부정언급량 | 0.785 | 0.458 | 0.847 | 0.698 | 0.883 | 1.000 | 0.891 | 0.877
중립언급량 | 0.695 | 0.890 | 0.909 | 0.893 | 0.978 | 0.891 | 1.000 | 1.000
총언급량 | 0.733 | 0.928 | 0.918 | 0.888 | 0.980 | 0.877 | 1.000 | 1.000

[Figure: correlation heatmap]

Variable | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명
환경플랫폼 하위 도메인명 | 1.000 | 0.832 | 0.000
도메인 하위 카테고리명 | 0.832 | 1.000 | 0.000
SNS 채널명 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 주간언급량연번 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명
주간언급량연번 | 1.000 | 0.113 | 0.128 | 0.186 | 0.184 | 0.822 | 0.719 | 0.000
긍정언급량 | 0.113 | 1.000 | 0.980 | 0.977 | 0.978 | 0.401 | 0.603 | 0.574
부정언급량 | 0.128 | 0.980 | 1.000 | 0.972 | 0.974 | 0.272 | 0.465 | 0.498
중립언급량 | 0.186 | 0.977 | 0.972 | 1.000 | 1.000 | 0.570 | 0.609 | 0.573
총언급량 | 0.184 | 0.978 | 0.974 | 1.000 | 1.000 | 0.629 | 0.629 | 0.566
환경플랫폼 하위 도메인명 | 0.822 | 0.401 | 0.272 | 0.570 | 0.629 | 1.000 | 0.832 | 0.000
도메인 하위 카테고리명 | 0.719 | 0.603 | 0.465 | 0.609 | 0.629 | 0.832 | 1.000 | 0.000
SNS 채널명 | 0.000 | 0.574 | 0.498 | 0.573 | 0.566 | 0.000 | 0.000 | 1.000
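
The high-correlation alerts can be cross-checked against the last correlation matrix above. The sketch below counts, for each variable, how many other variables have a coefficient above 0.5. The 0.5 cutoff is an assumption (it is not stated anywhere in this report); it was chosen because the resulting counts agree with the "and N other fields" wording in the Alerts section. The helper name `high_correlation_counts` is hypothetical.

```python
names = [
    "주간언급량연번", "긍정언급량", "부정언급량", "중립언급량",
    "총언급량", "환경플랫폼 하위 도메인명", "도메인 하위 카테고리명", "SNS 채널명",
]

# Values copied from the last correlation matrix in this section.
matrix = [
    [1.000, 0.113, 0.128, 0.186, 0.184, 0.822, 0.719, 0.000],
    [0.113, 1.000, 0.980, 0.977, 0.978, 0.401, 0.603, 0.574],
    [0.128, 0.980, 1.000, 0.972, 0.974, 0.272, 0.465, 0.498],
    [0.186, 0.977, 0.972, 1.000, 1.000, 0.570, 0.609, 0.573],
    [0.184, 0.978, 0.974, 1.000, 1.000, 0.629, 0.629, 0.566],
    [0.822, 0.401, 0.272, 0.570, 0.629, 1.000, 0.832, 0.000],
    [0.719, 0.603, 0.465, 0.609, 0.629, 0.832, 1.000, 0.000],
    [0.000, 0.574, 0.498, 0.573, 0.566, 0.000, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not taken from the report


def high_correlation_counts(names, matrix, threshold):
    """Count, per variable, the other variables whose coefficient exceeds threshold."""
    counts = {}
    for i, name in enumerate(names):
        counts[name] = sum(
            1 for j, value in enumerate(matrix[i]) if j != i and value > threshold
        )
    return counts


counts = high_correlation_counts(names, matrix, THRESHOLD)
for name, count in counts.items():
    print(f"{name}: {count}")
```

Under this assumption, 긍정언급량 is paired with five other variables, which matches the alert "부정언급량 and 4 other fields".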

Missing values

[Figure: nullity by column]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

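First rows
Index | 주간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량
0 | 1 | 2021-04-05 | 물환경 | 물재난 | All | 346 | 144 | 12853 | 13343
1 | 2 | 2021-04-05 | 물환경 | 물재난 | blog | 344 | 144 | 12828 | 13316
2 | 3 | 2021-04-05 | 물환경 | 물재난 | twitter | 2 | 0 | 25 | 27
3 | 4 | 2021-04-05 | 물환경 | 상수도 | All | 295 | 108 | 9272 | 9675
4 | 5 | 2021-04-05 | 물환경 | 상수도 | blog | 295 | 108 | 9272 | 9675
5 | 6 | 2021-04-05 | 물환경 | 지하수 | All | 196 | 59 | 6429 | 6684
6 | 7 | 2021-04-05 | 물환경 | 지하수 | blog | 193 | 58 | 6407 | 6658
7 | 8 | 2021-04-05 | 물환경 | 지하수 | twitter | 3 | 1 | 22 | 26
8 | 9 | 2021-04-05 | 물환경 | 하수도 | All | 206 | 80 | 10655 | 10941
9 | 10 | 2021-04-05 | 물환경 | 하수도 | blog | 205 | 79 | 10643 | 10927

Last rows
Index | 주간언급량연번 | 연월일 | 환경플랫폼 하위 도메인명 | 도메인 하위 카테고리명 | SNS 채널명 | 긍정언급량 | 부정언급량 | 중립언급량 | 총언급량
32 | 33 | 2021-04-05 | 자연환경 | 생태계 | twitter | 6 | 16 | 512 | 534
33 | 34 | 2021-04-05 | 자연환경 | 지질 | All | 291 | 115 | 9417 | 9823
34 | 35 | 2021-04-05 | 자연환경 | 지질 | blog | 290 | 114 | 9381 | 9785
35 | 36 | 2021-04-05 | 자연환경 | 지질 | twitter | 1 | 1 | 36 | 38
36 | 37 | 2021-04-05 | 자연환경 | 지형 | All | 287 | 104 | 12587 | 12978
37 | 38 | 2021-04-05 | 자연환경 | 지형 | blog | 286 | 103 | 12575 | 12964
38 | 39 | 2021-04-05 | 자연환경 | 지형 | twitter | 1 | 1 | 12 | 14
39 | 40 | 2021-04-05 | 자연환경 | 토양 | All | 560 | 232 | 23858 | 24650
40 | 41 | 2021-04-05 | 자연환경 | 토양 | blog | 559 | 232 | 23837 | 24628
41 | 42 | 2021-04-05 | 자연환경 | 토양 | twitter | 1 | 0 | 21 | 22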
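
In the sample rows, 총언급량 is consistent with being the sum of 긍정언급량, 부정언급량, and 중립언급량. A minimal check over a few of the rows shown above follows; the values are copied from the sample, while the tuple layout and the helper name `total_matches` are only for illustration.

```python
# Each tuple: (주간언급량연번, 긍정언급량, 부정언급량, 중립언급량, 총언급량)
sample_rows = [
    (1, 346, 144, 12853, 13343),
    (2, 344, 144, 12828, 13316),
    (3, 2, 0, 25, 27),
    (4, 295, 108, 9272, 9675),
    (40, 560, 232, 23858, 24650),
    (42, 1, 0, 21, 22),
]


def total_matches(row):
    """Return True when positive + negative + neutral equals the total."""
    _, positive, negative, neutral, total = row
    return positive + negative + neutral == total


# Serial numbers of rows whose total does not equal the sum of its parts.
mismatches = [row[0] for row in sample_rows if not total_matches(row)]
print("rows where total != positive + negative + neutral:", mismatches)
```

This relationship would also explain why 중립언급량 and 총언급량 show a correlation of 1.000: the neutral count dominates the total in every row sampled here.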