Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.5 KiB
Average record size in memory76.3 B

Variable types

Numeric2
DateTime1
Categorical5
Text1

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=67a5e680-e843-11ea-835f-5b142183dc74

Alerts

연월일 has constant value ""Constant
환경플랫폼 하위 도메인명 has constant value ""Constant
도메인 하위 카테고리명 has constant value ""Constant
SNS 채널명 has constant value ""Constant
일간연관어언급량 has constant value ""Constant
일간연관어연번 has unique valuesUnique
연관어명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:30:22.649872
Analysis finished2023-12-10 12:30:24.048011
Duration1.4 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일간연관어연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:30:24.284427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T21:30:24.552081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

연월일
Date

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2020-07-31 00:00:00
Maximum2020-07-31 00:00:00
2023-12-10T21:30:24.723606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:30:24.909240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
생활환경
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row생활환경
2nd row생활환경
3rd row생활환경
4th row생활환경
5th row생활환경

Common Values

ValueCountFrequency (%)
생활환경 100
100.0%

Length

2023-12-10T21:30:25.184039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:25.502156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생활환경 100
100.0%

도메인 하위 카테고리명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
대기
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기
2nd row대기
3rd row대기
4th row대기
5th row대기

Common Values

ValueCountFrequency (%)
대기 100
100.0%

Length

2023-12-10T21:30:25.695456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:25.856739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기 100
100.0%

SNS 채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
report
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowreport
2nd rowreport
3rd rowreport
4th rowreport
5th rowreport

Common Values

ValueCountFrequency (%)
report 100
100.0%

Length

2023-12-10T21:30:26.012142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:26.181761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
report 100
100.0%

단어속성명
Categorical

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
속성
46 
라이프
28 
장소
12 
기타
단체
 
2
Other values (3)

Length

Max length6
Median length2
Mean length2.32
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row속성
2nd row라이프
3rd row라이프
4th row라이프
5th row속성

Common Values

ValueCountFrequency (%)
속성 46
46.0%
라이프 28
28.0%
장소 12
 
12.0%
기타 7
 
7.0%
단체 2
 
2.0%
인물 2
 
2.0%
상품 2
 
2.0%
엔터테인먼트 1
 
1.0%

Length

2023-12-10T21:30:26.357951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:26.590461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
속성 46
46.0%
라이프 28
28.0%
장소 12
 
12.0%
기타 7
 
7.0%
단체 2
 
2.0%
인물 2
 
2.0%
상품 2
 
2.0%
엔터테인먼트 1
 
1.0%

연관어명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T21:30:27.112460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length2
Mean length2.46
Min length2

Characters and Unicode

Total characters246
Distinct characters134
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row가축
2nd row개그
3rd row건강
4th row검색
5th row과제
ValueCountFrequency (%)
가축 1
 
1.0%
연구실 1
 
1.0%
인프라 1
 
1.0%
인체 1
 
1.0%
인증 1
 
1.0%
인쇄물 1
 
1.0%
인쇄 1
 
1.0%
이하 1
 
1.0%
이천시 1
 
1.0%
이온 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T21:30:27.938713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
3.7%
9
 
3.7%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (124) 192
78.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 246
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
3.7%
9
 
3.7%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (124) 192
78.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 246
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
3.7%
9
 
3.7%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (124) 192
78.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 246
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
3.7%
9
 
3.7%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (124) 192
78.0%

일간연관어언급량
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T21:30:28.178688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:28.330820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

일간연관어단어량
Real number (ℝ)

Distinct10
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.03
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:30:28.462832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile8.05
Maximum11
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.2040138
Coefficient of variation (CV)1.0857211
Kurtosis6.8858756
Mean2.03
Median Absolute Deviation (MAD)0
Skewness2.7396007
Sum203
Variance4.8576768
MonotonicityNot monotonic
2023-12-10T21:30:28.636940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 65
65.0%
2 18
 
18.0%
3 6
 
6.0%
4 2
 
2.0%
10 2
 
2.0%
9 2
 
2.0%
7 2
 
2.0%
8 1
 
1.0%
11 1
 
1.0%
5 1
 
1.0%
ValueCountFrequency (%)
1 65
65.0%
2 18
 
18.0%
3 6
 
6.0%
4 2
 
2.0%
5 1
 
1.0%
7 2
 
2.0%
8 1
 
1.0%
9 2
 
2.0%
10 2
 
2.0%
11 1
 
1.0%
ValueCountFrequency (%)
11 1
 
1.0%
10 2
 
2.0%
9 2
 
2.0%
8 1
 
1.0%
7 2
 
2.0%
5 1
 
1.0%
4 2
 
2.0%
3 6
 
6.0%
2 18
 
18.0%
1 65
65.0%

Interactions

2023-12-10T21:30:23.337057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:30:23.048432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:30:23.484825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:30:23.185524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:30:28.784580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번단어속성명연관어명일간연관어단어량
일간연관어연번1.0000.1341.0000.161
단어속성명0.1341.0001.0000.000
연관어명1.0001.0001.0001.000
일간연관어단어량0.1610.0001.0001.000
2023-12-10T21:30:28.945292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번일간연관어단어량단어속성명
일간연관어연번1.000-0.0900.055
일간연관어단어량-0.0901.0000.000
단어속성명0.0550.0001.000

Missing values

2023-12-10T21:30:23.703953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:30:23.946782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
012020-07-31생활환경대기report속성가축12
122020-07-31생활환경대기report라이프개그11
232020-07-31생활환경대기report라이프건강13
342020-07-31생활환경대기report라이프검색13
452020-07-31생활환경대기report속성과제11
562020-07-31생활환경대기report라이프과학12
672020-07-31생활환경대기report라이프관리18
782020-07-31생활환경대기report장소광역시11
892020-07-31생활환경대기report라이프교통11
9102020-07-31생활환경대기report라이프국가11
일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
90912020-07-31생활환경대기report라이프축산업11
91922020-07-31생활환경대기report속성측정11
92932020-07-31생활환경대기report상품콩기름11
93942020-07-31생활환경대기report라이프탄소11
94952020-07-31생활환경대기report속성평가12
95962020-07-31생활환경대기report장소평택시11
96972020-07-31생활환경대기report라이프포럼11
97982020-07-31생활환경대기report속성표지11
98992020-07-31생활환경대기report라이프프로젝트11
991002020-07-31생활환경대기report장소한국15