Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.5 KiB
Average record size in memory76.3 B

Variable types

Numeric2
Categorical6
Text1

Dataset

Description샘플 데이터
Author성균관대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=b9de2350-e842-11ea-a837-83d4a69b8aa7

Alerts

연월일 has constant value ""Constant
환경플랫폼 하위 도메인명 has constant value ""Constant
도메인 하위 카테고리명 has constant value ""Constant
SNS 채널명 has constant value ""Constant
일간연관어연번 has unique valuesUnique
연관어명 has unique valuesUnique

Reproduction

Analysis started2024-04-22 00:29:49.657198
Analysis finished2024-04-22 00:29:50.555810
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일간연관어연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T09:29:50.645212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2024-04-22T09:29:50.813122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

연월일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020-07-01
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-07-01
2nd row2020-07-01
3rd row2020-07-01
4th row2020-07-01
5th row2020-07-01

Common Values

ValueCountFrequency (%)
2020-07-01 100
100.0%

Length

2024-04-22T09:29:50.952068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:29:51.057810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-07-01 100
100.0%
Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
물환경
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물환경
2nd row물환경
3rd row물환경
4th row물환경
5th row물환경

Common Values

ValueCountFrequency (%)
물환경 100
100.0%

Length

2024-04-22T09:29:51.164099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:29:51.284586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물환경 100
100.0%

도메인 하위 카테고리명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
지하수
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지하수
2nd row지하수
3rd row지하수
4th row지하수
5th row지하수

Common Values

ValueCountFrequency (%)
지하수 100
100.0%

Length

2024-04-22T09:29:51.417293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:29:51.520989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하수 100
100.0%

SNS 채널명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
patent
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowpatent
2nd rowpatent
3rd rowpatent
4th rowpatent
5th rowpatent

Common Values

ValueCountFrequency (%)
patent 100
100.0%

Length

2024-04-22T09:29:51.623291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:29:51.718653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
patent 100
100.0%

단어속성명
Categorical

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
속성
57 
기타
15 
라이프
10 
상품
장소
 
5
Other values (2)

Length

Max length3
Median length2
Mean length2.13
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row라이프
3rd row속성
4th row속성
5th row기타

Common Values

ValueCountFrequency (%)
속성 57
57.0%
기타 15
 
15.0%
라이프 10
 
10.0%
상품 7
 
7.0%
장소 5
 
5.0%
인물 3
 
3.0%
브랜드 3
 
3.0%

Length

2024-04-22T09:29:51.815741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:29:51.930807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
속성 57
57.0%
기타 15
 
15.0%
라이프 10
 
10.0%
상품 7
 
7.0%
장소 5
 
5.0%
인물 3
 
3.0%
브랜드 3
 
3.0%

연관어명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-04-22T09:29:52.208166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length2
Mean length2.31
Min length2

Characters and Unicode

Total characters231
Distinct characters147
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row가열로
2nd row가이드
3rd row각도
4th row간격
5th row개구부
ValueCountFrequency (%)
가열로 1
 
1.0%
물질 1
 
1.0%
밸브 1
 
1.0%
배합 1
 
1.0%
배출 1
 
1.0%
방지 1
 
1.0%
발생 1
 
1.0%
발명 1
 
1.0%
반스 1
 
1.0%
바실 1
 
1.0%
Other values (90) 90
90.0%
2024-04-22T09:29:52.677686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (137) 185
80.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 231
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (137) 185
80.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 231
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (137) 185
80.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 231
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
4
 
1.7%
Other values (137) 185
80.1%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
88 
2
12 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 88
88.0%
2 12
 
12.0%

Length

2024-04-22T09:29:52.825171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:29:52.939252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 88
88.0%
2 12
 
12.0%

일간연관어단어량
Real number (ℝ)

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.76
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T09:29:53.079342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q36
95-th percentile33.1
Maximum51
Range50
Interquartile range (IQR)4

Descriptive statistics

Standard deviation10.408932
Coefficient of variation (CV)1.5397828
Kurtosis8.0129208
Mean6.76
Median Absolute Deviation (MAD)2
Skewness2.8699556
Sum676
Variance108.34586
MonotonicityNot monotonic
2024-04-22T09:29:53.280835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
2 24
24.0%
1 24
24.0%
4 8
 
8.0%
3 7
 
7.0%
5 7
 
7.0%
6 6
 
6.0%
8 4
 
4.0%
7 3
 
3.0%
9 3
 
3.0%
18 2
 
2.0%
Other values (11) 12
12.0%
ValueCountFrequency (%)
1 24
24.0%
2 24
24.0%
3 7
 
7.0%
4 8
 
8.0%
5 7
 
7.0%
6 6
 
6.0%
7 3
 
3.0%
8 4
 
4.0%
9 3
 
3.0%
11 1
 
1.0%
ValueCountFrequency (%)
51 1
1.0%
46 2
2.0%
45 1
1.0%
35 1
1.0%
33 1
1.0%
26 1
1.0%
24 1
1.0%
19 1
1.0%
18 2
2.0%
15 1
1.0%

Interactions

2024-04-22T09:29:50.097000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:29:49.913939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:29:50.190121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:29:50.002830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-22T09:29:53.415656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번단어속성명연관어명일간연관어언급량일간연관어단어량
일간연관어연번1.0000.0001.0000.0410.122
단어속성명0.0001.0001.0000.0470.000
연관어명1.0001.0001.0001.0001.000
일간연관어언급량0.0410.0471.0001.0000.214
일간연관어단어량0.1220.0001.0000.2141.000
2024-04-22T09:29:53.552776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어언급량단어속성명
일간연관어언급량1.0000.042
단어속성명0.0421.000
2024-04-22T09:29:53.646625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일간연관어연번일간연관어단어량단어속성명일간연관어언급량
일간연관어연번1.000-0.0100.0000.000
일간연관어단어량-0.0101.0000.0000.000
단어속성명0.0000.0001.0000.042
일간연관어언급량0.0000.0000.0421.000

Missing values

2024-04-22T09:29:50.325649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T09:29:50.494415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
012020-07-01물환경지하수patent기타가열로12
122020-07-01물환경지하수patent라이프가이드11
232020-07-01물환경지하수patent속성각도13
342020-07-01물환경지하수patent속성간격23
452020-07-01물환경지하수patent기타개구부11
562020-07-01물환경지하수patent기타개폐12
672020-07-01물환경지하수patent기타거르다13
782020-07-01물환경지하수patent라이프거름18
892020-07-01물환경지하수patent기타거름망146
9102020-07-01물환경지하수patent라이프건식11
일간연관어연번연월일환경플랫폼 하위 도메인명도메인 하위 카테고리명SNS 채널명단어속성명연관어명일간연관어언급량일간연관어단어량
90912020-07-01물환경지하수patent속성색도135
91922020-07-01물환경지하수patent장소서울특별시25
92932020-07-01물환경지하수patent속성석회12
93942020-07-01물환경지하수patent속성선유12
94952020-07-01물환경지하수patent라이프선행22
95962020-07-01물환경지하수patent속성설치11
96972020-07-01물환경지하수patent라이프세척11
97982020-07-01물환경지하수patent속성송부11
98992020-07-01물환경지하수patent속성송풍16
991002020-07-01물환경지하수patent속성수분15