Overview

Dataset statistics

Number of variables9
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory81.3 B

Variable types

Numeric5
Categorical3
Text1

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4

Alerts

연월일 has constant value ""Constant
주간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명High correlation
긍정언급량 is highly overall correlated with 부정언급량 and 2 other fieldsHigh correlation
부정언급량 is highly overall correlated with 긍정언급량 and 3 other fieldsHigh correlation
중립언급량 is highly overall correlated with 긍정언급량 and 3 other fieldsHigh correlation
총언급량 is highly overall correlated with 긍정언급량 and 3 other fieldsHigh correlation
환경플랫폼 하위 도메인명 is highly overall correlated with 주간언급량연번 and 3 other fieldsHigh correlation
주간언급량연번 has unique valuesUnique

Reproduction

Analysis started2024-04-20 22:49:16.243595
Analysis finished2024-04-20 22:49:24.478658
Duration8.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

주간언급량연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-04-21T07:49:24.648746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q18.25
median15.5
Q322.75
95-th percentile28.55
Maximum30
Range29
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation8.8034084
Coefficient of variation (CV)0.56796183
Kurtosis-1.2
Mean15.5
Median Absolute Deviation (MAD)7.5
Skewness0
Sum465
Variance77.5
MonotonicityStrictly increasing
2024-04-21T07:49:25.040165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 1
 
3.3%
17 1
 
3.3%
30 1
 
3.3%
29 1
 
3.3%
28 1
 
3.3%
27 1
 
3.3%
26 1
 
3.3%
25 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1 1
3.3%
2 1
3.3%
3 1
3.3%
4 1
3.3%
5 1
3.3%
6 1
3.3%
7 1
3.3%
8 1
3.3%
9 1
3.3%
10 1
3.3%
ValueCountFrequency (%)
30 1
3.3%
29 1
3.3%
28 1
3.3%
27 1
3.3%
26 1
3.3%
25 1
3.3%
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%

연월일
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size368.0 B
2020-07-06
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-07-06
2nd row2020-07-06
3rd row2020-07-06
4th row2020-07-06
5th row2020-07-06

Common Values

ValueCountFrequency (%)
2020-07-06 30
100.0%

Length

2024-04-21T07:49:25.441227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:49:25.739971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-07-06 30
100.0%

환경플랫폼 하위 도메인명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size368.0 B
물환경
12 
자연환경
12 
생활환경

Length

Max length4
Median length4
Mean length3.6
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물환경
2nd row물환경
3rd row물환경
4th row물환경
5th row물환경

Common Values

ValueCountFrequency (%)
물환경 12
40.0%
자연환경 12
40.0%
생활환경 6
20.0%

Length

2024-04-21T07:49:26.076688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:49:26.553497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물환경 12
40.0%
자연환경 12
40.0%
생활환경 6
20.0%
Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size368.0 B
2024-04-21T07:49:27.079120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.8
Min length2

Characters and Unicode

Total characters84
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물재난
2nd row물재난
3rd row상수도
4th row상수도
5th row지하수
ValueCountFrequency (%)
물재난 2
 
6.7%
상수도 2
 
6.7%
지하수 2
 
6.7%
하수도 2
 
6.7%
하천 2
 
6.7%
호소 2
 
6.7%
대기 2
 
6.7%
폐기물 2
 
6.7%
화학물질 2
 
6.7%
기상변화 2
 
6.7%
Other values (5) 10
33.3%
2024-04-21T07:49:27.883159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
9.5%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
Other values (15) 30
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
9.5%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
Other values (15) 30
35.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 84
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
9.5%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
Other values (15) 30
35.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 84
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
9.5%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
6
 
7.1%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
Other values (15) 30
35.7%

SNS 채널명
Categorical

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size368.0 B
All
15 
blog
15 

Length

Max length4
Median length3.5
Mean length3.5
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAll
2nd rowblog
3rd rowAll
4th rowblog
5th rowAll

Common Values

ValueCountFrequency (%)
All 15
50.0%
blog 15
50.0%

Length

2024-04-21T07:49:28.103473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:49:28.276109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
all 15
50.0%
blog 15
50.0%

긍정언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1023.6
Minimum157
Maximum2474
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-04-21T07:49:28.448093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum157
5-th percentile181.75
Q1368
median717
Q31684.25
95-th percentile2378.6
Maximum2474
Range2317
Interquartile range (IQR)1316.25

Descriptive statistics

Standard deviation749.75258
Coefficient of variation (CV)0.73246638
Kurtosis-0.90787577
Mean1023.6
Median Absolute Deviation (MAD)505
Skewness0.63325099
Sum30708
Variance562128.94
MonotonicityNot monotonic
2024-04-21T07:49:28.663683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
659 2
 
6.7%
342 2
 
6.7%
212 2
 
6.7%
279 2
 
6.7%
1721 2
 
6.7%
797 2
 
6.7%
717 2
 
6.7%
2262 2
 
6.7%
699 2
 
6.7%
1574 2
 
6.7%
Other values (5) 10
33.3%
ValueCountFrequency (%)
157 2
6.7%
212 2
6.7%
279 2
6.7%
342 2
6.7%
446 2
6.7%
659 2
6.7%
699 2
6.7%
717 2
6.7%
797 2
6.7%
1273 2
6.7%
ValueCountFrequency (%)
2474 2
6.7%
2262 2
6.7%
1742 2
6.7%
1721 2
6.7%
1574 2
6.7%
1273 2
6.7%
797 2
6.7%
717 2
6.7%
699 2
6.7%
659 2
6.7%

부정언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean531.73333
Minimum60
Maximum1351
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-04-21T07:49:28.857711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60
5-th percentile90.6
Q1179.75
median449
Q3794
95-th percentile1231.75
Maximum1351
Range1291
Interquartile range (IQR)614.25

Descriptive statistics

Standard deviation384.83377
Coefficient of variation (CV)0.72373452
Kurtosis-0.57236492
Mean531.73333
Median Absolute Deviation (MAD)276
Skewness0.6697119
Sum15952
Variance148097.03
MonotonicityNot monotonic
2024-04-21T07:49:29.067765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
470 2
 
6.7%
168 2
 
6.7%
128 2
 
6.7%
173 2
 
6.7%
682 2
 
6.7%
286 2
 
6.7%
449 2
 
6.7%
1351 2
 
6.7%
418 2
 
6.7%
1086 2
 
6.7%
Other values (5) 10
33.3%
ValueCountFrequency (%)
60 2
6.7%
128 2
6.7%
168 2
6.7%
173 2
6.7%
200 2
6.7%
286 2
6.7%
418 2
6.7%
449 2
6.7%
470 2
6.7%
682 2
6.7%
ValueCountFrequency (%)
1351 2
6.7%
1086 2
6.7%
965 2
6.7%
818 2
6.7%
722 2
6.7%
682 2
6.7%
470 2
6.7%
449 2
6.7%
418 2
6.7%
286 2
6.7%

중립언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41912.2
Minimum6899
Maximum108002
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-04-21T07:49:29.273263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6899
5-th percentile8592.8
Q116612
median27263
Q365200
95-th percentile101648.45
Maximum108002
Range101103
Interquartile range (IQR)48588

Descriptive statistics

Standard deviation31368.126
Coefficient of variation (CV)0.7484247
Kurtosis-0.53561888
Mean41912.2
Median Absolute Deviation (MAD)16600
Skewness0.82282766
Sum1257366
Variance9.839593 × 108
MonotonicityNot monotonic
2024-04-21T07:49:29.498030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
31180 2
 
6.7%
13257 2
 
6.7%
10663 2
 
6.7%
15089 2
 
6.7%
72894 2
 
6.7%
27134 2
 
6.7%
27263 2
 
6.7%
93883 2
 
6.7%
22151 2
 
6.7%
65047 2
 
6.7%
Other values (5) 10
33.3%
ValueCountFrequency (%)
6899 2
6.7%
10663 2
6.7%
13257 2
6.7%
15089 2
6.7%
21181 2
6.7%
22151 2
6.7%
27134 2
6.7%
27263 2
6.7%
31180 2
6.7%
48789 2
6.7%
ValueCountFrequency (%)
108002 2
6.7%
93883 2
6.7%
72894 2
6.7%
65251 2
6.7%
65047 2
6.7%
48789 2
6.7%
31180 2
6.7%
27263 2
6.7%
27134 2
6.7%
22151 2
6.7%

총언급량
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43467.533
Minimum7116
Maximum111441
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size398.0 B
2024-04-21T07:49:29.693089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7116
5-th percentile8865.15
Q117112.5
median28429
Q367713
95-th percentile105165.75
Maximum111441
Range104325
Interquartile range (IQR)50600.5

Descriptive statistics

Standard deviation32462.173
Coefficient of variation (CV)0.74681425
Kurtosis-0.56105166
Mean43467.533
Median Absolute Deviation (MAD)17426
Skewness0.81214269
Sum1304026
Variance1.0537927 × 109
MonotonicityNot monotonic
2024-04-21T07:49:29.937673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
32309 2
 
6.7%
13767 2
 
6.7%
11003 2
 
6.7%
15541 2
 
6.7%
75297 2
 
6.7%
28217 2
 
6.7%
28429 2
 
6.7%
97496 2
 
6.7%
23268 2
 
6.7%
67707 2
 
6.7%
Other values (5) 10
33.3%
ValueCountFrequency (%)
7116 2
6.7%
11003 2
6.7%
13767 2
6.7%
15541 2
6.7%
21827 2
6.7%
23268 2
6.7%
28217 2
6.7%
28429 2
6.7%
32309 2
6.7%
50880 2
6.7%
ValueCountFrequency (%)
111441 2
6.7%
97496 2
6.7%
75297 2
6.7%
67715 2
6.7%
67707 2
6.7%
50880 2
6.7%
32309 2
6.7%
28429 2
6.7%
28217 2
6.7%
23268 2
6.7%

Interactions

2024-04-21T07:49:22.578527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:17.807343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:19.019837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:20.197209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:21.412594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:22.819346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:18.055217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:19.262203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:20.448175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:21.652707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:23.044917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:18.290223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:19.496786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:20.683691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:21.881268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:23.298692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:18.545953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:19.742417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:20.936779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:22.125849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:23.524936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:18.782590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:19.970930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:21.173078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:49:22.353289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T07:49:30.088843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주간언급량연번환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
주간언급량연번1.0001.0000.9690.0000.7430.8210.7640.764
환경플랫폼 하위 도메인명1.0001.0001.0000.0000.8990.8180.8330.833
도메인 하위 카테고리명0.9691.0001.0000.0001.0001.0001.0001.000
SNS 채널명0.0000.0000.0001.0000.0000.0000.0000.000
긍정언급량0.7430.8991.0000.0001.0000.8800.9670.967
부정언급량0.8210.8181.0000.0000.8801.0000.9850.985
중립언급량0.7640.8331.0000.0000.9670.9851.0001.000
총언급량0.7640.8331.0000.0000.9670.9851.0001.000
2024-04-21T07:49:30.292821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
SNS 채널명환경플랫폼 하위 도메인명
SNS 채널명1.0000.000
환경플랫폼 하위 도메인명0.0001.000
2024-04-21T07:49:30.478456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주간언급량연번긍정언급량부정언급량중립언급량총언급량환경플랫폼 하위 도메인명SNS 채널명
주간언급량연번1.0000.2960.2420.2140.2140.8610.000
긍정언급량0.2961.0000.9180.9680.9680.4820.000
부정언급량0.2420.9181.0000.9460.9460.6760.000
중립언급량0.2140.9680.9461.0001.0000.6060.000
총언급량0.2140.9680.9461.0001.0000.6060.000
환경플랫폼 하위 도메인명0.8610.4820.6760.6060.6061.0000.000
SNS 채널명0.0000.0000.0000.0000.0000.0001.000

Missing values

2024-04-21T07:49:23.847308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T07:49:24.299216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

주간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
012020-07-06물환경물재난All6594703118032309
122020-07-06물환경물재난blog6594703118032309
232020-07-06물환경상수도All3421681325713767
342020-07-06물환경상수도blog3421681325713767
452020-07-06물환경지하수All2121281066311003
562020-07-06물환경지하수blog2121281066311003
672020-07-06물환경하수도All2791731508915541
782020-07-06물환경하수도blog2791731508915541
892020-07-06물환경하천All17216827289475297
9102020-07-06물환경하천blog17216827289475297
주간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
20212020-07-06자연환경기후변화All12738184878950880
21222020-07-06자연환경기후변화blog12738184878950880
22232020-07-06자연환경생태계All17427226525167715
23242020-07-06자연환경생태계blog17427226525167715
24252020-07-06자연환경지질All4462002118121827
25262020-07-06자연환경지질blog4462002118121827
26272020-07-06자연환경지형All2474965108002111441
27282020-07-06자연환경지형blog2474965108002111441
28292020-07-06자연환경토양All1576068997116
29302020-07-06자연환경토양blog1576068997116