Overview

Dataset statistics

Number of variables9
Number of observations90
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory78.5 B

Variable types

Numeric5
DateTime1
Categorical3

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=98e41c80-2fd4-11ea-94b6-73a02796bba4

Alerts

연월일 has constant value ""Constant
환경플랫폼 하위 도메인명 is highly overall correlated with 주간언급량연번 and 1 other fieldsHigh correlation
도메인 하위 카테고리명 is highly overall correlated with 주간언급량연번 and 1 other fieldsHigh correlation
주간언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 and 1 other fieldsHigh correlation
긍정언급량 is highly overall correlated with 부정언급량 and 2 other fieldsHigh correlation
부정언급량 is highly overall correlated with 긍정언급량 and 2 other fieldsHigh correlation
중립언급량 is highly overall correlated with 긍정언급량 and 2 other fieldsHigh correlation
총언급량 is highly overall correlated with 긍정언급량 and 2 other fieldsHigh correlation
주간언급량연번 has unique valuesUnique
긍정언급량 has 3 (3.3%) zerosZeros
부정언급량 has 5 (5.6%) zerosZeros
중립언급량 has 3 (3.3%) zerosZeros
총언급량 has 1 (1.1%) zerosZeros

Reproduction

Analysis started2024-04-17 04:41:42.254002
Analysis finished2024-04-17 04:41:44.365721
Duration2.11 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

주간언급량연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct90
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.5
Minimum1
Maximum90
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size942.0 B
2024-04-17T13:41:44.423638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.45
Q123.25
median45.5
Q367.75
95-th percentile85.55
Maximum90
Range89
Interquartile range (IQR)44.5

Descriptive statistics

Standard deviation26.124701
Coefficient of variation (CV)0.57416925
Kurtosis-1.2
Mean45.5
Median Absolute Deviation (MAD)22.5
Skewness0
Sum4095
Variance682.5
MonotonicityStrictly increasing
2024-04-17T13:41:44.541588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
69 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
Other values (80) 80
88.9%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%

연월일
Date

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size852.0 B
Minimum2017-01-02 00:00:00
Maximum2017-01-02 00:00:00
2024-04-17T13:41:44.627444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:44.702674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

환경플랫폼 하위 도메인명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size852.0 B
물환경
36 
자연환경
36 
생활환경
18 

Length

Max length4
Median length4
Mean length3.6
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물환경
2nd row물환경
3rd row물환경
4th row물환경
5th row물환경

Common Values

ValueCountFrequency (%)
물환경 36
40.0%
자연환경 36
40.0%
생활환경 18
20.0%

Length

2024-04-17T13:41:44.797582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T13:41:44.887552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물환경 36
40.0%
자연환경 36
40.0%
생활환경 18
20.0%

도메인 하위 카테고리명
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size852.0 B
하천
 
6
호소
 
6
지하수
 
6
상수도
 
6
하수도
 
6
Other values (10)
60 

Length

Max length4
Median length3
Mean length2.8
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row하천
2nd row하천
3rd row하천
4th row하천
5th row하천

Common Values

ValueCountFrequency (%)
하천 6
 
6.7%
호소 6
 
6.7%
지하수 6
 
6.7%
상수도 6
 
6.7%
하수도 6
 
6.7%
물재난 6
 
6.7%
대기 6
 
6.7%
폐기물 6
 
6.7%
화학물질 6
 
6.7%
기상변화 6
 
6.7%
Other values (5) 30
33.3%

Length

2024-04-17T13:41:44.983913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
하천 6
 
6.7%
호소 6
 
6.7%
지하수 6
 
6.7%
상수도 6
 
6.7%
하수도 6
 
6.7%
물재난 6
 
6.7%
대기 6
 
6.7%
폐기물 6
 
6.7%
화학물질 6
 
6.7%
기상변화 6
 
6.7%
Other values (5) 30
33.3%

SNS 채널명
Categorical

Distinct6
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size852.0 B
All
15 
Twitter
15 
Facebook
15 
Instagram
15 
blog
15 

Length

Max length9
Median length7.5
Mean length6.6666667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAll
2nd rowTwitter
3rd rowFacebook
4th rowInstagram
5th rowblog

Common Values

ValueCountFrequency (%)
All 15
16.7%
Twitter 15
16.7%
Facebook 15
16.7%
Instagram 15
16.7%
blog 15
16.7%
community 15
16.7%

Length

2024-04-17T13:41:45.092956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T13:41:45.188748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
all 15
16.7%
twitter 15
16.7%
facebook 15
16.7%
instagram 15
16.7%
blog 15
16.7%
community 15
16.7%

긍정언급량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct75
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean569.42222
Minimum0
Maximum4916
Zeros3
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size942.0 B
2024-04-17T13:41:45.299714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q114
median124.5
Q3578.25
95-th percentile2494.95
Maximum4916
Range4916
Interquartile range (IQR)564.25

Descriptive statistics

Standard deviation1016.4919
Coefficient of variation (CV)1.7851286
Kurtosis7.3636124
Mean569.42222
Median Absolute Deviation (MAD)121.5
Skewness2.6593394
Sum51248
Variance1033255.7
MonotonicityNot monotonic
2024-04-17T13:41:45.413332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 4
 
4.4%
6 3
 
3.3%
14 3
 
3.3%
0 3
 
3.3%
1 3
 
3.3%
4 2
 
2.2%
25 2
 
2.2%
12 2
 
2.2%
35 2
 
2.2%
4916 1
 
1.1%
Other values (65) 65
72.2%
ValueCountFrequency (%)
0 3
3.3%
1 3
3.3%
2 1
 
1.1%
3 4
4.4%
4 2
2.2%
6 3
3.3%
7 1
 
1.1%
9 1
 
1.1%
10 1
 
1.1%
12 2
2.2%
ValueCountFrequency (%)
4916 1
1.1%
4519 1
1.1%
4458 1
1.1%
3152 1
1.1%
2733 1
1.1%
2204 1
1.1%
2163 1
1.1%
2053 1
1.1%
1927 1
1.1%
1911 1
1.1%

부정언급량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct69
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean236.06667
Minimum0
Maximum2398
Zeros5
Zeros (%)5.6%
Negative0
Negative (%)0.0%
Memory size942.0 B
2024-04-17T13:41:45.519390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.45
Q17.25
median55.5
Q3234.75
95-th percentile1188.1
Maximum2398
Range2398
Interquartile range (IQR)227.5

Descriptive statistics

Standard deviation450.87886
Coefficient of variation (CV)1.9099641
Kurtosis10.906167
Mean236.06667
Median Absolute Deviation (MAD)53.5
Skewness3.1543414
Sum21246
Variance203291.75
MonotonicityNot monotonic
2024-04-17T13:41:45.628555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 5
 
5.6%
0 5
 
5.6%
2 4
 
4.4%
3 2
 
2.2%
37 2
 
2.2%
13 2
 
2.2%
90 2
 
2.2%
17 2
 
2.2%
52 2
 
2.2%
6 2
 
2.2%
Other values (59) 62
68.9%
ValueCountFrequency (%)
0 5
5.6%
1 5
5.6%
2 4
4.4%
3 2
 
2.2%
4 1
 
1.1%
5 2
 
2.2%
6 2
 
2.2%
7 2
 
2.2%
8 1
 
1.1%
9 1
 
1.1%
ValueCountFrequency (%)
2398 1
1.1%
2314 1
1.1%
1670 1
1.1%
1307 1
1.1%
1189 1
1.1%
1187 1
1.1%
851 1
1.1%
835 1
1.1%
685 1
1.1%
634 1
1.1%

중립언급량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct79
Distinct (%)87.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1163.2444
Minimum0
Maximum13760
Zeros3
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size942.0 B
2024-04-17T13:41:45.729949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q130.75
median123
Q3794
95-th percentile7303.95
Maximum13760
Range13760
Interquartile range (IQR)763.25

Descriptive statistics

Standard deviation2669.7574
Coefficient of variation (CV)2.2950957
Kurtosis10.086447
Mean1163.2444
Median Absolute Deviation (MAD)119.5
Skewness3.1915029
Sum104692
Variance7127604.5
MonotonicityNot monotonic
2024-04-17T13:41:46.107510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 3
 
3.3%
0 3
 
3.3%
44 2
 
2.2%
8 2
 
2.2%
69 2
 
2.2%
13 2
 
2.2%
10 2
 
2.2%
39 2
 
2.2%
3 2
 
2.2%
536 1
 
1.1%
Other values (69) 69
76.7%
ValueCountFrequency (%)
0 3
3.3%
1 1
 
1.1%
3 2
2.2%
4 3
3.3%
6 1
 
1.1%
7 1
 
1.1%
8 2
2.2%
10 2
2.2%
12 1
 
1.1%
13 2
2.2%
ValueCountFrequency (%)
13760 1
1.1%
11802 1
1.1%
11266 1
1.1%
8647 1
1.1%
8343 1
1.1%
6034 1
1.1%
5598 1
1.1%
5534 1
1.1%
4288 1
1.1%
3897 1
1.1%

총언급량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct87
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1968.7333
Minimum0
Maximum20677
Zeros1
Zeros (%)1.1%
Negative0
Negative (%)0.0%
Memory size942.0 B
2024-04-17T13:41:46.219413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5.9
Q156.5
median434
Q31750.5
95-th percentile10776.45
Maximum20677
Range20677
Interquartile range (IQR)1694

Descriptive statistics

Standard deviation3987.2994
Coefficient of variation (CV)2.0253121
Kurtosis9.734399
Mean1968.7333
Median Absolute Deviation (MAD)416.5
Skewness3.0933187
Sum177186
Variance15898556
MonotonicityNot monotonic
2024-04-17T13:41:46.323642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 2
 
2.2%
41 2
 
2.2%
17 2
 
2.2%
18496 1
 
1.1%
1891 1
 
1.1%
640 1
 
1.1%
3384 1
 
1.1%
589 1
 
1.1%
91 1
 
1.1%
5171 1
 
1.1%
Other values (77) 77
85.6%
ValueCountFrequency (%)
0 1
1.1%
1 1
1.1%
2 1
1.1%
4 1
1.1%
5 1
1.1%
7 1
1.1%
8 1
1.1%
11 1
1.1%
13 1
1.1%
14 1
1.1%
ValueCountFrequency (%)
20677 1
1.1%
18496 1
1.1%
15042 1
1.1%
14775 1
1.1%
11514 1
1.1%
9875 1
1.1%
9166 1
1.1%
7630 1
1.1%
5171 1
1.1%
5135 1
1.1%

Interactions

2024-04-17T13:41:43.870772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.540986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.855152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.187635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.539106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.927393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.594814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.914785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.249815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.599685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.986526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.656278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.984315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.315820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.673742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:44.055826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.730579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.057232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.386607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.737348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:44.132473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:42.800674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.130826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.470055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T13:41:43.808324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T13:41:46.396049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주간언급량연번환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
주간언급량연번1.0001.0000.9870.0000.2810.0340.1980.000
환경플랫폼 하위 도메인명1.0001.0001.0000.0000.0000.0000.0000.000
도메인 하위 카테고리명0.9871.0001.0000.0000.3490.1340.3210.125
SNS 채널명0.0000.0000.0001.0000.3910.2790.2480.426
긍정언급량0.2810.0000.3490.3911.0000.9560.8640.851
부정언급량0.0340.0000.1340.2790.9561.0000.9240.909
중립언급량0.1980.0000.3210.2480.8640.9241.0000.926
총언급량0.0000.0000.1250.4260.8510.9090.9261.000
2024-04-17T13:41:46.487493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명
환경플랫폼 하위 도메인명1.0000.9280.000
도메인 하위 카테고리명0.9281.0000.000
SNS 채널명0.0000.0001.000
2024-04-17T13:41:46.564710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주간언급량연번긍정언급량부정언급량중립언급량총언급량환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명
주간언급량연번1.000-0.124-0.086-0.162-0.1290.9590.8550.000
긍정언급량-0.1241.0000.9420.8770.9590.0000.1440.225
부정언급량-0.0860.9421.0000.9310.9760.0000.0340.154
중립언급량-0.1620.8770.9311.0000.9690.0000.1300.135
총언급량-0.1290.9590.9760.9691.0000.0000.0220.221
환경플랫폼 하위 도메인명0.9590.0000.0000.0000.0001.0000.9280.000
도메인 하위 카테고리명0.8550.1440.0340.1300.0220.9281.0000.000
SNS 채널명0.0000.2250.1540.1350.2210.0000.0001.000

Missing values

2024-04-17T13:41:44.214315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T13:41:44.319839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

주간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
012017-01-02물환경하천All491623141126618496
122017-01-02물환경하천Twitter18641307834311514
232017-01-02물환경하천Facebook2173058
342017-01-02물환경하천Instagram90813511342177
452017-01-02물환경하천blog19276348693430
562017-01-02물환경하천community1962318901317
672017-01-02물환경호소All273383555989166
782017-01-02물환경호소Twitter80043838975135
892017-01-02물환경호소Facebook1242541
9102017-01-02물환경호소Instagram748788631689
주간언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명긍정언급량부정언급량중립언급량총언급량
80812017-01-02자연환경토양Facebook1102
81822017-01-02자연환경토양Instagram102820
82832017-01-02자연환경토양blog58112898807
83842017-01-02자연환경토양community251017
84852017-01-02자연환경생태계All1465880284
85862017-01-02자연환경생태계Twitter164451
86872017-01-02자연환경생태계Facebook0000
87882017-01-02자연환경생태계Instagram3238
88892017-01-02자연환경생태계blog1394527211
89902017-01-02자연환경생태계community35614