Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory67.3 B

Variable types

Numeric1
Categorical6
Text1

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=8a03ac90-2fc9-11ea-94b6-73a02796bba4

Alerts

연월일 has constant value ""Constant
SNS 채널명 has constant value ""Constant
도메인 하위 카테고리명 is highly overall correlated with 일간지역언급량연번 and 1 other fieldsHigh correlation
환경플랫폼 하위 도메인명 is highly overall correlated with 일간지역언급량연번 and 1 other fieldsHigh correlation
일간지역언급량연번 is highly overall correlated with 환경플랫폼 하위 도메인명 and 1 other fieldsHigh correlation
일간지역언급량연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:45:51.807174
Analysis finished2023-12-10 13:45:53.105396
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일간지역언급량연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:45:53.233307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T22:45:53.487857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

연월일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020-10-01
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-10-01
2nd row2020-10-01
3rd row2020-10-01
4th row2020-10-01
5th row2020-10-01

Common Values

ValueCountFrequency (%)
2020-10-01 100
100.0%

Length

2023-12-10T22:45:53.732723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:53.893714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-10-01 100
100.0%

환경플랫폼 하위 도메인명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
물환경
63 
생활환경
27 
자연환경
10 

Length

Max length4
Median length3
Mean length3.37
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물환경
2nd row물환경
3rd row물환경
4th row물환경
5th row물환경

Common Values

ValueCountFrequency (%)
물환경 63
63.0%
생활환경 27
27.0%
자연환경 10
 
10.0%

Length

2023-12-10T22:45:54.078745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:54.255845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물환경 63
63.0%
생활환경 27
27.0%
자연환경 10
 
10.0%

도메인 하위 카테고리명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
하천
35 
폐기물
27 
호소
13 
상수도
하수도
Other values (4)
11 

Length

Max length4
Median length3
Mean length2.61
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row상수도
2nd row상수도
3rd row상수도
4th row상수도
5th row상수도

Common Values

ValueCountFrequency (%)
하천 35
35.0%
폐기물 27
27.0%
호소 13
 
13.0%
상수도 9
 
9.0%
하수도 5
 
5.0%
기후변화 5
 
5.0%
기상변화 4
 
4.0%
지하수 1
 
1.0%
생태계 1
 
1.0%

Length

2023-12-10T22:45:54.463331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:54.693894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하천 35
35.0%
폐기물 27
27.0%
호소 13
 
13.0%
상수도 9
 
9.0%
하수도 5
 
5.0%
기후변화 5
 
5.0%
기상변화 4
 
4.0%
지하수 1
 
1.0%
생태계 1
 
1.0%

SNS 채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
All
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAll
2nd rowAll
3rd rowAll
4th rowAll
5th rowAll

Common Values

ValueCountFrequency (%)
All 100
100.0%

Length

2023-12-10T22:45:54.907713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:55.052043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
all 100
100.0%

시도명
Categorical

Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경기
24 
서울
11 
경북
부산
경남
Other values (11)
41 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구
2nd row대전
3rd row부산
4th row부산
5th row서울

Common Values

ValueCountFrequency (%)
경기 24
24.0%
서울 11
11.0%
경북 9
 
9.0%
부산 8
 
8.0%
경남 7
 
7.0%
인천 6
 
6.0%
충남 6
 
6.0%
대구 5
 
5.0%
강원 5
 
5.0%
대전 4
 
4.0%
Other values (6) 15
15.0%

Length

2023-12-10T22:45:55.234313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 24
24.0%
서울 11
11.0%
경북 9
 
9.0%
부산 8
 
8.0%
경남 7
 
7.0%
인천 6
 
6.0%
충남 6
 
6.0%
대구 5
 
5.0%
강원 5
 
5.0%
대전 4
 
4.0%
Other values (6) 15
15.0%
Distinct67
Distinct (%)67.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T22:45:55.598662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.81
Min length2

Characters and Unicode

Total characters281
Distinct characters68
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)53.0%

Sample

1st row중구
2nd row중구
3rd row강서구
4th row중구
5th row강서구
ValueCountFrequency (%)
동구 12
 
12.0%
중구 6
 
6.0%
서구 5
 
5.0%
강서구 4
 
4.0%
논산시 2
 
2.0%
연천군 2
 
2.0%
고양시 2
 
2.0%
서귀포시 2
 
2.0%
송파구 2
 
2.0%
양천구 2
 
2.0%
Other values (57) 61
61.0%
2023-12-10T22:45:56.289281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46
16.4%
33
 
11.7%
24
 
8.5%
17
 
6.0%
13
 
4.6%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.1%
Other values (58) 102
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 281
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
16.4%
33
 
11.7%
24
 
8.5%
17
 
6.0%
13
 
4.6%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.1%
Other values (58) 102
36.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 281
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
16.4%
33
 
11.7%
24
 
8.5%
17
 
6.0%
13
 
4.6%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.1%
Other values (58) 102
36.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 281
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
46
16.4%
33
 
11.7%
24
 
8.5%
17
 
6.0%
13
 
4.6%
12
 
4.3%
12
 
4.3%
9
 
3.2%
7
 
2.5%
6
 
2.1%
Other values (58) 102
36.3%
Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
48 
3
26 
4
13 
2
11 
11
 
2

Length

Max length2
Median length1
Mean length1.02
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 48
48.0%
3 26
26.0%
4 13
 
13.0%
2 11
 
11.0%
11 2
 
2.0%

Length

2023-12-10T22:45:56.539045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:56.705827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 48
48.0%
3 26
26.0%
4 13
 
13.0%
2 11
 
11.0%
11 2
 
2.0%

Interactions

2023-12-10T22:45:52.526544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:45:56.858222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간지역언급량연번환경플랫폼 하위 도메인명도메인 하위 카테고리명시도명시군구명일간시도언급량
일간지역언급량연번1.0000.9650.8950.6780.9040.648
환경플랫폼 하위 도메인명0.9651.0001.0000.0000.8310.504
도메인 하위 카테고리명0.8951.0001.0000.4840.7410.669
시도명0.6780.0000.4841.0000.8260.486
시군구명0.9040.8310.7410.8261.0000.846
일간시도언급량0.6480.5040.6690.4860.8461.000
2023-12-10T22:45:57.009262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명도메인 하위 카테고리명일간시도언급량환경플랫폼 하위 도메인명
시도명1.0000.2100.2540.000
도메인 하위 카테고리명0.2101.0000.4570.969
일간시도언급량0.2540.4571.0000.435
환경플랫폼 하위 도메인명0.0000.9690.4351.000
2023-12-10T22:45:57.167111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간지역언급량연번환경플랫폼 하위 도메인명도메인 하위 카테고리명시도명일간시도언급량
일간지역언급량연번1.0000.9330.6860.3300.314
환경플랫폼 하위 도메인명0.9331.0000.9690.0000.435
도메인 하위 카테고리명0.6860.9691.0000.2100.457
시도명0.3300.0000.2101.0000.254
일간시도언급량0.3140.4350.4570.2541.000

Missing values

2023-12-10T22:45:52.744292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:45:52.978183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일간지역언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명시도명시군구명일간시도언급량
012020-10-01물환경상수도All대구중구1
122020-10-01물환경상수도All대전중구1
232020-10-01물환경상수도All부산강서구1
342020-10-01물환경상수도All부산중구1
452020-10-01물환경상수도All서울강서구1
562020-10-01물환경상수도All서울중구1
672020-10-01물환경상수도All울산중구1
782020-10-01물환경상수도All인천중구1
892020-10-01물환경상수도All충남논산시1
9102020-10-01물환경지하수All제주서귀포시1
일간지역언급량연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명시도명시군구명일간시도언급량
90912020-10-01자연환경기상변화All강원강릉시1
91922020-10-01자연환경기상변화All경남밀양시2
92932020-10-01자연환경기상변화All경북구미시11
93942020-10-01자연환경기상변화All경북칠곡군11
94952020-10-01자연환경기후변화All광주서구4
95962020-10-01자연환경기후변화All대구서구4
96972020-10-01자연환경기후변화All대전서구4
97982020-10-01자연환경기후변화All부산서구4
98992020-10-01자연환경기후변화All인천서구4
991002020-10-01자연환경생태계All경기고양시1