Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory76.3 B

Variable types

Numeric2
DateTime1
Categorical5
Text1

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=67a5e680-e843-11ea-835f-5b142183dc74

Alerts

연월일 has constant value ""Constant
환경플랫폼 하위 도메인명 has constant value ""Constant
도메인 하위 카테고리명 has constant value ""Constant
SNS 채널명 has constant value ""Constant
일간연관어언급량 has constant value ""Constant
일간연관어연번 has unique valuesUnique
연관어명 has unique valuesUnique

Reproduction

Analysis started2024-04-21 03:31:38.841814
Analysis finished2024-04-21 03:31:40.135334
Duration1.29 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일간연관어연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T12:31:40.268717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2024-04-21T12:31:40.519861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

연월일
Date

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
Minimum2020-02-15 00:00:00
Maximum2020-02-15 00:00:00
2024-04-21T12:31:40.712167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T12:31:40.866771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
생활환경
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row생활환경
2nd row생활환경
3rd row생활환경
4th row생활환경
5th row생활환경

Common Values

ValueCountFrequency (%)
생활환경 100
100.0%

Length

2024-04-21T12:31:41.051304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T12:31:41.203468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생활환경 100
100.0%

도메인 하위 카테고리명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
대기
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기
2nd row대기
3rd row대기
4th row대기
5th row대기

Common Values

ValueCountFrequency (%)
대기 100
100.0%

Length

2024-04-21T12:31:41.362998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T12:31:41.520109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기 100
100.0%

SNS 채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
report
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowreport
2nd rowreport
3rd rowreport
4th rowreport
5th rowreport

Common Values

ValueCountFrequency (%)
report 100
100.0%

Length

2024-04-21T12:31:41.679045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T12:31:41.836326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
report 100
100.0%

단어속성명
Categorical

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
속성
54 
라이프
23 
기타
상품
 
4
인물
 
4
Other values (4)

Length

Max length6
Median length2
Mean length2.35
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row속성
2nd row속성
3rd row속성
4th row라이프
5th row상품

Common Values

ValueCountFrequency (%)
속성 54
54.0%
라이프 23
23.0%
기타 8
 
8.0%
상품 4
 
4.0%
인물 4
 
4.0%
사회이슈 2
 
2.0%
장소 2
 
2.0%
엔터테인먼트 2
 
2.0%
단체 1
 
1.0%

Length

2024-04-21T12:31:42.024654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T12:31:42.246944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
속성 54
54.0%
라이프 23
23.0%
기타 8
 
8.0%
상품 4
 
4.0%
인물 4
 
4.0%
사회이슈 2
 
2.0%
장소 2
 
2.0%
엔터테인먼트 2
 
2.0%
단체 1
 
1.0%

연관어명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
2024-04-21T12:31:43.304096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length2
Mean length2.37
Min length2

Characters and Unicode

Total characters237
Distinct characters124
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row가격
2nd row가시
3rd row갱신
4th row건설
5th row경유
ValueCountFrequency (%)
가격 1
 
1.0%
세부 1
 
1.0%
안개 1
 
1.0%
심화 1
 
1.0%
신차 1
 
1.0%
시사 1
 
1.0%
시그널 1
 
1.0%
승용차 1
 
1.0%
수소 1
 
1.0%
수도 1
 
1.0%
Other values (90) 90
90.0%
2024-04-21T12:31:44.620598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
3.8%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (114) 184
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 237
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
3.8%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (114) 184
77.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 237
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
3.8%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (114) 184
77.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 237
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
3.8%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
4
 
1.7%
Other values (114) 184
77.6%

일간연관어언급량
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2024-04-21T12:31:44.843017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T12:31:44.996551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

일간연관어단어량
Real number (ℝ)

Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.06
Minimum1
Maximum54
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T12:31:45.146734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q34
95-th percentile23.15
Maximum54
Range53
Interquartile range (IQR)3

Descriptive statistics

Standard deviation8.8040624
Coefficient of variation (CV)1.7399333
Kurtosis12.914538
Mean5.06
Median Absolute Deviation (MAD)1
Skewness3.4015323
Sum506
Variance77.511515
MonotonicityNot monotonic
2024-04-21T12:31:45.356191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1 46
46.0%
2 17
 
17.0%
3 10
 
10.0%
4 4
 
4.0%
8 3
 
3.0%
5 3
 
3.0%
9 3
 
3.0%
14 2
 
2.0%
10 2
 
2.0%
7 1
 
1.0%
Other values (9) 9
 
9.0%
ValueCountFrequency (%)
1 46
46.0%
2 17
 
17.0%
3 10
 
10.0%
4 4
 
4.0%
5 3
 
3.0%
7 1
 
1.0%
8 3
 
3.0%
9 3
 
3.0%
10 2
 
2.0%
11 1
 
1.0%
ValueCountFrequency (%)
54 1
1.0%
40 1
1.0%
35 1
1.0%
32 1
1.0%
26 1
1.0%
23 1
1.0%
20 1
1.0%
18 1
1.0%
14 2
2.0%
11 1
1.0%

Interactions

2024-04-21T12:31:39.472083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T12:31:39.195290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T12:31:39.609484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T12:31:39.331881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T12:31:45.502594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번단어속성명연관어명일간연관어단어량
일간연관어연번1.0000.0001.0000.198
단어속성명0.0001.0001.0000.000
연관어명1.0001.0001.0001.000
일간연관어단어량0.1980.0001.0001.000
2024-04-21T12:31:45.657940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번일간연관어단어량단어속성명
일간연관어연번1.0000.1180.000
일간연관어단어량0.1181.0000.000
단어속성명0.0000.0001.000

Missing values

2024-04-21T12:31:39.801787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T12:31:40.042177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
012020-02-15생활환경대기report속성가격12
122020-02-15생활환경대기report속성가시11
232020-02-15생활환경대기report속성갱신11
342020-02-15생활환경대기report라이프건설11
452020-02-15생활환경대기report상품경유132
562020-02-15생활환경대기report상품경유차14
672020-02-15생활환경대기report라이프계산11
782020-02-15생활환경대기report라이프고시11
892020-02-15생활환경대기report인물고아11
9102020-02-15생활환경대기report속성고정12
일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
90912020-02-15생활환경대기report속성유종11
91922020-02-15생활환경대기report속성유지14
92932020-02-15생활환경대기report속성육상19
93942020-02-15생활환경대기report인물이승민11
94952020-02-15생활환경대기report속성인체11
95962020-02-15생활환경대기report라이프인프라11
96972020-02-15생활환경대기report속성일몰18
97982020-02-15생활환경대기report라이프일산화탄소11
98992020-02-15생활환경대기report상품자동차15
991002020-02-15생활환경대기report속성장기12