Overview

Dataset statistics

Number of variables9
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory80.3 B

Variable types

Numeric1
Categorical7
Text1

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=7a4a32b0-e842-11ea-835f-5b142183dc74

Alerts

연월일 has constant value ""Constant
환경플랫폼 하위 도메인명 has constant value ""Constant
도메인 하위 카테고리명 has constant value ""Constant
SNS 채널명 has constant value ""Constant
일간연관어언급량 has constant value ""Constant
일간연관어연번 has unique valuesUnique
연관어명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:09:47.219076
Analysis finished2023-12-10 13:09:48.018950
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일간연관어연번
Real number (ℝ)

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13
Minimum1
Maximum25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-10T22:09:48.139195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.2
Q17
median13
Q319
95-th percentile23.8
Maximum25
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.3598007
Coefficient of variation (CV)0.56613852
Kurtosis-1.2
Mean13
Median Absolute Deviation (MAD)6
Skewness0
Sum325
Variance54.166667
MonotonicityStrictly increasing
2023-12-10T22:09:48.405987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
1 1
 
4.0%
2 1
 
4.0%
25 1
 
4.0%
24 1
 
4.0%
23 1
 
4.0%
22 1
 
4.0%
21 1
 
4.0%
20 1
 
4.0%
19 1
 
4.0%
18 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
1 1
4.0%
2 1
4.0%
3 1
4.0%
4 1
4.0%
5 1
4.0%
6 1
4.0%
7 1
4.0%
8 1
4.0%
9 1
4.0%
10 1
4.0%
ValueCountFrequency (%)
25 1
4.0%
24 1
4.0%
23 1
4.0%
22 1
4.0%
21 1
4.0%
20 1
4.0%
19 1
4.0%
18 1
4.0%
17 1
4.0%
16 1
4.0%

연월일
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2020-11-30
25 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-11-30
2nd row2020-11-30
3rd row2020-11-30
4th row2020-11-30
5th row2020-11-30

Common Values

ValueCountFrequency (%)
2020-11-30 25
100.0%

Length

2023-12-10T22:09:48.594294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:48.756834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-11-30 25
100.0%
Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
생활환경
25 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row생활환경
2nd row생활환경
3rd row생활환경
4th row생활환경
5th row생활환경

Common Values

ValueCountFrequency (%)
생활환경 25
100.0%

Length

2023-12-10T22:09:48.904680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:49.031559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생활환경 25
100.0%

도메인 하위 카테고리명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
대기
25 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기
2nd row대기
3rd row대기
4th row대기
5th row대기

Common Values

ValueCountFrequency (%)
대기 25
100.0%

Length

2023-12-10T22:09:49.228642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:49.361143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기 25
100.0%

SNS 채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
paper
25 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowpaper
2nd rowpaper
3rd rowpaper
4th rowpaper
5th rowpaper

Common Values

ValueCountFrequency (%)
paper 25
100.0%

Length

2023-12-10T22:09:49.522132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:49.677934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
paper 25
100.0%

단어속성명
Categorical

Distinct6
Distinct (%)24.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
속성
10 
라이프
기타
인물
 
1
엔터테인먼트
 
1

Length

Max length6
Median length2
Mean length2.48
Min length2

Unique

Unique3 ?
Unique (%)12.0%

Sample

1st row라이프
2nd row라이프
3rd row라이프
4th row인물
5th row엔터테인먼트

Common Values

ValueCountFrequency (%)
속성 10
40.0%
라이프 8
32.0%
기타 4
 
16.0%
인물 1
 
4.0%
엔터테인먼트 1
 
4.0%
단체 1
 
4.0%

Length

2023-12-10T22:09:49.873806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:50.061885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
속성 10
40.0%
라이프 8
32.0%
기타 4
 
16.0%
인물 1
 
4.0%
엔터테인먼트 1
 
4.0%
단체 1
 
4.0%

연관어명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-10T22:09:50.365325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length2
Mean length2.56
Min length2

Characters and Unicode

Total characters64
Distinct characters55
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row계약
2nd row교통수단
3rd row구독
4th row김준범
5th row미디어
ValueCountFrequency (%)
계약 1
 
4.0%
원저 1
 
4.0%
환경공학 1
 
4.0%
형사 1
 
4.0%
출처 1
 
4.0%
체결 1
 
4.0%
책임 1
 
4.0%
전송 1
 
4.0%
저작물 1
 
4.0%
저작 1
 
4.0%
Other values (15) 15
60.0%
2023-12-10T22:09:51.076881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1
 
1.6%
1
 
1.6%
Other values (45) 45
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 64
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1
 
1.6%
1
 
1.6%
Other values (45) 45
70.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 64
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1
 
1.6%
1
 
1.6%
Other values (45) 45
70.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 64
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1
 
1.6%
1
 
1.6%
Other values (45) 45
70.3%

일간연관어언급량
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
1
25 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 25
100.0%

Length

2023-12-10T22:09:51.325499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:51.467474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 25
100.0%
Distinct5
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
1
13 
2
10
 
1
5
 
1
3
 
1

Length

Max length2
Median length1
Mean length1.04
Min length1

Unique

Unique3 ?
Unique (%)12.0%

Sample

1st row1
2nd row2
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 13
52.0%
2 9
36.0%
10 1
 
4.0%
5 1
 
4.0%
3 1
 
4.0%

Length

2023-12-10T22:09:51.612769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:09:51.781313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 13
52.0%
2 9
36.0%
10 1
 
4.0%
5 1
 
4.0%
3 1
 
4.0%

Interactions

2023-12-10T22:09:47.529365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:09:51.884033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번단어속성명연관어명일간연관어단어량
일간연관어연번1.0000.6841.0000.390
단어속성명0.6841.0001.0000.587
연관어명1.0001.0001.0001.000
일간연관어단어량0.3900.5871.0001.000
2023-12-10T22:09:52.018534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어단어량단어속성명
일간연관어단어량1.0000.427
단어속성명0.4271.000
2023-12-10T22:09:52.132900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번단어속성명일간연관어단어량
일간연관어연번1.0000.3600.000
단어속성명0.3601.0000.427
일간연관어단어량0.0000.4271.000

Missing values

2023-12-10T22:09:47.694780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:09:47.919218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
012020-11-30생활환경대기paper라이프계약11
122020-11-30생활환경대기paper라이프교통수단12
232020-11-30생활환경대기paper라이프구독11
342020-11-30생활환경대기paper인물김준범12
452020-11-30생활환경대기paper엔터테인먼트미디어11
562020-11-30생활환경대기paper라이프미세먼지12
672020-11-30생활환경대기paper속성발자국12
782020-11-30생활환경대기paper속성법령11
892020-11-30생활환경대기paper라이프보증11
9102020-11-30생활환경대기paper속성복제11
일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
15162020-11-30생활환경대기paper기타재령12
16172020-11-30생활환경대기paper기타저작12
17182020-11-30생활환경대기paper속성저작물15
18192020-11-30생활환경대기paper속성전송11
19202020-11-30생활환경대기paper속성책임12
20212020-11-30생활환경대기paper라이프체결11
21222020-11-30생활환경대기paper속성출처11
22232020-11-30생활환경대기paper라이프형사11
23242020-11-30생활환경대기paper기타환경공학13
24252020-11-30생활환경대기paper속성회지12