Overview

Dataset statistics

Number of variables: 7
Number of observations: 200
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.0 KiB
Average record size in memory: 61.7 B

Variable types

Numeric: 3
Text: 1
Categorical: 3

Dataset

Description: Sample
Author: 충북대학교 (Chungbuk National University)
URL: https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=CBUISSUEWORD

Alerts

(unnamed variable) has constant value "6": Constant
주차수 has constant value "1": Constant
인덱스값 is highly overall correlated with 빈도차례값 and 1 other field: High correlation
빈도차례값 is highly overall correlated with 인덱스값 and 1 other field: High correlation
이슈어빈도값 is highly overall correlated with 인덱스값 and 1 other field: High correlation
인덱스값 has unique values: Unique
빈도차례값 has unique values: Unique
이슈어값 has unique values: Unique
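
The Constant and Unique alerts can be reproduced by counting distinct values per column. The following is a minimal sketch, not the profiler's own implementation: the helper name `classify_column` is hypothetical, the rows are a handful transcribed from the Sample section rather than the full 200 records, and the assignment of the constant "6" to the unnamed variable and "1" to 주차수 is an assumption based on the variable order in this report.

```python
# Rows transcribed from the report's Sample section (a subset, not the full data).
# Assumption: the constant "6" belongs to the unnamed variable and "1" to 주차수.
columns = ["인덱스값", "빈도차례값", "이슈어값", "이슈어빈도값",
           "(unnamed)", "주차수", "주요분류명"]
rows = [
    (1, 1, "농협은행", 220, "6", "1", "기관/단체"),
    (2, 2, "교육", 160, "6", "1", "기타"),
    (4, 4, "방제", 127, "6", "1", "정책"),
    (194, 194, "협약", 23, "6", "1", "정책"),
    (195, 195, "축사", 23, "6", "1", "생활문화"),
    (197, 197, "데이터", 23, "6", "1", "정책"),
]


def classify_column(values):
    """Return 'Constant', 'Unique', or None based on the distinct count."""
    distinct = len(set(values))
    if distinct == 1:
        return "Constant"
    if distinct == len(values):
        return "Unique"
    return None


# Apply the rule column by column.
alerts = {
    name: classify_column([row[i] for row in rows])
    for i, name in enumerate(columns)
}

for name, alert in alerts.items():
    print(name, alert)
```

On this subset, 이슈어빈도값 and 주요분류명 receive no alert because they contain repeated values without being constant.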

Reproduction

Analysis started: 2023-12-10 06:46:21.157946
Analysis finished: 2023-12-10 06:46:22.844322
Duration: 1.69 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

인덱스값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.5
Minimum: 1
Maximum: 200
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 10.95
Q1: 50.75
Median: 100.5
Q3: 150.25
95th percentile: 190.05
Maximum: 200
Range: 199
Interquartile range (IQR): 99.5
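
The quantile figures above are consistent with linear interpolation between neighbouring order statistics, which is the default convention in NumPy and pandas. The sketch below assumes that convention and that 인덱스값 holds the integers 1 to 200, as the Minimum, Maximum, Distinct, and Monotonicity entries suggest. The helper name `quantile` is hypothetical.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between adjacent order statistics."""
    position = (len(sorted_values) - 1) * q
    lower = int(position)
    fraction = position - lower
    if lower + 1 >= len(sorted_values):
        return float(sorted_values[lower])
    step = sorted_values[lower + 1] - sorted_values[lower]
    return sorted_values[lower] + fraction * step


# Assumption: the column is the strictly increasing sequence 1..200.
values = list(range(1, 201))

p5 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

print(round(p5, 2), q1, median, q3, round(p95, 2), iqr)
```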

Descriptive statistics

Standard deviation: 57.879185
Coefficient of variation (CV): 0.57591228
Kurtosis: -1.2
Mean: 100.5
Median Absolute Deviation (MAD): 50
Skewness: 0
Sum: 20100
Variance: 3350
Monotonicity: Strictly increasing
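
Several of these descriptive statistics can be checked with the Python standard library. This is a sketch under the assumption that 인덱스값 is the sequence 1 to 200 and that the variance and standard deviation use the sample (n - 1) denominator, which matches the reported 3350 and 57.879185. Kurtosis and skewness are left out because their estimator conventions vary.

```python
import statistics

# Assumption: the column is the strictly increasing sequence 1..200.
values = list(range(1, 201))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
center = statistics.median(values)
mad = statistics.median(abs(x - center) for x in values)

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```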
Histogram with fixed size bins (bins=50)
Value                 Count   Frequency (%)
1                     1       0.5%
139                   1       0.5%
129                   1       0.5%
130                   1       0.5%
131                   1       0.5%
132                   1       0.5%
133                   1       0.5%
134                   1       0.5%
135                   1       0.5%
136                   1       0.5%
Other values (190)    190     95.0%

Value                 Count   Frequency (%)
1                     1       0.5%
2                     1       0.5%
3                     1       0.5%
4                     1       0.5%
5                     1       0.5%
6                     1       0.5%
7                     1       0.5%
8                     1       0.5%
9                     1       0.5%
10                    1       0.5%

Value                 Count   Frequency (%)
200                   1       0.5%
199                   1       0.5%
198                   1       0.5%
197                   1       0.5%
196                   1       0.5%
195                   1       0.5%
194                   1       0.5%
193                   1       0.5%
192                   1       0.5%
191                   1       0.5%

빈도차례값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.5
Minimum: 1
Maximum: 200
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 10.95
Q1: 50.75
Median: 100.5
Q3: 150.25
95th percentile: 190.05
Maximum: 200
Range: 199
Interquartile range (IQR): 99.5

Descriptive statistics

Standard deviation: 57.879185
Coefficient of variation (CV): 0.57591228
Kurtosis: -1.2
Mean: 100.5
Median Absolute Deviation (MAD): 50
Skewness: 0
Sum: 20100
Variance: 3350
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
Value                 Count   Frequency (%)
1                     1       0.5%
139                   1       0.5%
129                   1       0.5%
130                   1       0.5%
131                   1       0.5%
132                   1       0.5%
133                   1       0.5%
134                   1       0.5%
135                   1       0.5%
136                   1       0.5%
Other values (190)    190     95.0%

Value                 Count   Frequency (%)
1                     1       0.5%
2                     1       0.5%
3                     1       0.5%
4                     1       0.5%
5                     1       0.5%
6                     1       0.5%
7                     1       0.5%
8                     1       0.5%
9                     1       0.5%
10                    1       0.5%

Value                 Count   Frequency (%)
200                   1       0.5%
199                   1       0.5%
198                   1       0.5%
197                   1       0.5%
196                   1       0.5%
195                   1       0.5%
194                   1       0.5%
193                   1       0.5%
192                   1       0.5%
191                   1       0.5%

이슈어값
Text

UNIQUE 

Distinct: 200
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 9
Median length: 8
Mean length: 3.04
Min length: 2

Characters and Unicode

Total characters: 608
Distinct characters: 232
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
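
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module on the five sample values of 이슈어값 listed in this report. The general category `Lo` corresponds to "Other Letter". The standard library does not expose the Unicode Script property, so taking the first word of the character name is only a stand-in for the script and is an assumption of this example, not the profiler's method.

```python
import unicodedata
from collections import Counter

# The five sample values of 이슈어값 shown in this report.
words = ["농협은행", "교육", "수확", "방제", "한우"]
chars = [ch for word in words for ch in word]

# General category of each code point ("Lo" means Other Letter).
categories = Counter(unicodedata.category(ch) for ch in chars)

# Stand-in for the script: first word of the official character name,
# e.g. "HANGUL SYLLABLE NONG" -> "HANGUL". This is a heuristic only.
name_prefixes = Counter(unicodedata.name(ch).split()[0] for ch in chars)

total_characters = len(chars)
mean_length = total_characters / len(words)

print(categories, name_prefixes, total_characters, mean_length)
```

The same ratio applied to the full column, 608 total characters over 200 values, gives the reported mean length of 3.04.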

Unique

Unique: 200
Unique (%): 100.0%

Sample

1st row: 농협은행
2nd row: 교육
3rd row: 수확
4th row: 방제
5th row: 한우
Value                 Count   Frequency (%)
농협은행              1       0.5%
가축분뇨              1       0.5%
학교                  1       0.5%
한국농수산대학        1       0.5%
여성농업              1       0.5%
경로당                1       0.5%
로컬푸드직매장        1       0.5%
도매시장              1       0.5%
소비촉진              1       0.5%
검사                  1       0.5%
Other values (190)    190     95.0%

Most occurring characters

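Value                 Count   Frequency (%)
(not extracted)       20      3.3%
(not extracted)       19      3.1%
(not extracted)       19      3.1%
(not extracted)       12      2.0%
(not extracted)       11      1.8%
(not extracted)       10      1.6%
(not extracted)       10      1.6%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
Other values (222)    480     78.9%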

Most occurring categories

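Value                 Count   Frequency (%)
Other Letter          608     100.0%

Most frequent character per category

Other Letter
Value                 Count   Frequency (%)
(not extracted)       20      3.3%
(not extracted)       19      3.1%
(not extracted)       19      3.1%
(not extracted)       12      2.0%
(not extracted)       11      1.8%
(not extracted)       10      1.6%
(not extracted)       10      1.6%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
Other values (222)    480     78.9%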

Most occurring scripts

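Value                 Count   Frequency (%)
Hangul                608     100.0%

Most frequent character per script

Hangul
Value                 Count   Frequency (%)
(not extracted)       20      3.3%
(not extracted)       19      3.1%
(not extracted)       19      3.1%
(not extracted)       12      2.0%
(not extracted)       11      1.8%
(not extracted)       10      1.6%
(not extracted)       10      1.6%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
Other values (222)    480     78.9%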

Most occurring blocks

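Value                 Count   Frequency (%)
Hangul                608     100.0%

Most frequent character per block

Hangul
Value                 Count   Frequency (%)
(not extracted)       20      3.3%
(not extracted)       19      3.1%
(not extracted)       19      3.1%
(not extracted)       12      2.0%
(not extracted)       11      1.8%
(not extracted)       10      1.6%
(not extracted)       10      1.6%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
(not extracted)       9       1.5%
Other values (222)    480     78.9%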

이슈어빈도값
Real number (ℝ)

HIGH CORRELATION 

Distinct: 57
Distinct (%): 28.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 41.695
Minimum: 23
Maximum: 220
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 23
5th percentile: 24
Q1: 27
Median: 32
Q3: 46.25
95th percentile: 92.2
Maximum: 220
Range: 197
Interquartile range (IQR): 19.25

Descriptive statistics

Standard deviation: 25.679939
Coefficient of variation (CV): 0.61589973
Kurtosis: 14.670683
Mean: 41.695
Median Absolute Deviation (MAD): 7.5
Skewness: 3.2508143
Sum: 8339
Variance: 659.45927
Monotonicity: Decreasing
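
The Monotonicity entry distinguishes "Strictly increasing" (as reported for 인덱스값) from plain "Decreasing" (reported here, where tied values occur). A minimal sketch of such a check follows. The function name `monotonicity` and the labels other than those two are assumptions, and the frequency values are only the first and last ten rows from the Sample section, not the whole column.

```python
def monotonicity(values):
    """Classify a sequence by comparing each element with its successor."""
    pairs = list(zip(values, values[1:]))
    if all(a < b for a, b in pairs):
        return "Strictly increasing"
    if all(a > b for a, b in pairs):
        return "Strictly decreasing"
    if all(a <= b for a, b in pairs):
        return "Increasing"
    if all(a >= b for a, b in pairs):
        return "Decreasing"
    return "Not monotonic"


# 이슈어빈도값 for the first and last ten rows of the Sample section.
head = [220, 160, 129, 127, 126, 115, 111, 106, 97, 96]
tail = [24, 24, 24, 23, 23, 23, 23, 23, 23, 23]

frequency_label = monotonicity(head + tail)
index_label = monotonicity(list(range(1, 201)))

print(frequency_label, index_label)
```

The repeated values 24 and 23 in the tail are what rule out the strict variant for this column.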
Histogram with fixed size bins (bins=50)
Value                 Count   Frequency (%)
24                    16      8.0%
25                    16      8.0%
29                    12      6.0%
27                    11      5.5%
28                    11      5.5%
31                    11      5.5%
26                    8       4.0%
23                    7       3.5%
51                    6       3.0%
32                    6       3.0%
Other values (47)     96      48.0%

Value                 Count   Frequency (%)
23                    7       3.5%
24                    16      8.0%
25                    16      8.0%
26                    8       4.0%
27                    11      5.5%
28                    11      5.5%
29                    12      6.0%
30                    5       2.5%
31                    11      5.5%
32                    6       3.0%

Value                 Count   Frequency (%)
220                   1       0.5%
160                   1       0.5%
129                   1       0.5%
127                   1       0.5%
126                   1       0.5%
115                   1       0.5%
111                   1       0.5%
106                   1       0.5%
97                    1       0.5%
96                    1       0.5%


Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value   Count
6       200

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 6
2nd row: 6
3rd row: 6
4th row: 6
5th row: 6

Common Values

Value   Count   Frequency (%)
6       200     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


주차수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value   Count
1       200

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
1       200     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


주요분류명
Categorical

Distinct: 13
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value               Count
경제                31
정책                28
생활문화            22
농업환경            19
지역/장소           17
Other values (8)    83

Length

Max length: 5
Median length: 4
Mean length: 2.98
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기관/단체
2nd row: 기타
3rd row: 생활문화
4th row: 정책
5th row: 생물

Common Values

Value               Count   Frequency (%)
경제                31      15.5%
정책                28      14.0%
생활문화            22      11.0%
농업환경            19      9.5%
지역/장소           17      8.5%
(blank)             17      8.5%
기관/단체           16      8.0%
농작물              16      8.0%
식품                9       4.5%
기타                8       4.0%
Other values (3)    17      8.5%
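
The Frequency (%) column is each count divided by the 200 observations, and the "Other values" row is whatever remains after the ten listed categories. The sketch below checks that arithmetic using the counts from the table above. The key `"(blank)"` stands for the category whose label does not appear in the extracted table and is a placeholder chosen for this example.

```python
# Counts copied from the Common Values table for 주요분류명.
# "(blank)" is a placeholder for the category whose label is not shown.
counts = {
    "경제": 31,
    "정책": 28,
    "생활문화": 22,
    "농업환경": 19,
    "지역/장소": 17,
    "(blank)": 17,
    "기관/단체": 16,
    "농작물": 16,
    "식품": 9,
    "기타": 8,
}
total_observations = 200

# Percentage share of each listed category.
percentages = {
    name: 100 * count / total_observations for name, count in counts.items()
}

# Observations not covered by the ten listed categories.
other_count = total_observations - sum(counts.values())
other_percentage = 100 * other_count / total_observations

for name, pct in percentages.items():
    print(f"{name}: {pct:.1f}%")
print(f"Other values: {other_count} ({other_percentage:.1f}%)")
```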

Length

Histogram of lengths of the category

Interactions


Correlations

                  인덱스값    빈도차례값    이슈어빈도값    주요분류명
인덱스값          1.000       1.000         0.740           0.263
빈도차례값        1.000       1.000         0.740           0.263
이슈어빈도값      0.740       0.740         1.000           0.201
주요분류명        0.263       0.263         0.201           1.000
                  인덱스값    빈도차례값    이슈어빈도값    주요분류명
인덱스값          1.000       1.000         -0.999          0.109
빈도차례값        1.000       1.000         -0.999          0.109
이슈어빈도값      -0.999      -0.999        1.000           0.089
주요분류명        0.109       0.109         0.089           1.000
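
The report does not state which coefficient each matrix uses. One possible reading, offered only as an assumption, is that the second matrix is rank-based: a rank correlation approaches -1 whenever one column rises while the other never increases, which fits the -0.999 between 인덱스값 and 이슈어빈도값. The sketch below compares a linear (Pearson) and a rank (Spearman) coefficient on the first ten rows of the Sample section. The helper names are hypothetical, and the result for ten rows is not expected to equal the figures computed on all 200.

```python
import math


def pearson(xs, ys):
    """Linear correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def average_ranks(values):
    """1-based ranks, with tied values sharing the mean of their positions."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Rank correlation: Pearson applied to the average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# First ten rows of the Sample section: 인덱스값 and 이슈어빈도값.
index_values = list(range(1, 11))
frequencies = [220, 160, 129, 127, 126, 115, 111, 106, 97, 96]

linear = pearson(index_values, frequencies)
rank_based = spearman(index_values, frequencies)

print(round(linear, 3), round(rank_based, 3))
```

On these rows the rank-based value is -1 because the frequencies fall at every step, while the linear value is weaker because the drop from 220 to 160 is much steeper than the later ones.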

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row   인덱스값   빈도차례값   이슈어값      이슈어빈도값   (unnamed)   주차수   주요분류명
0     1          1            농협은행      220            6           1        기관/단체
1     2          2            교육          160            6           1        기타
2     3          3            수확          129            6           1        생활문화
3     4          4            방제          127            6           1        정책
4     5          5            한우          126            6           1        생물
5     6          6            건강          115            6           1        생활문화
6     7          7            예방          111            6           1        정책
7     8          8            소비자        106            6           1        경제
8     9          9            서울          97             6           1        지역/장소
9     10         10           환경          96             6           1        농업환경

Row   인덱스값   빈도차례값   이슈어값      이슈어빈도값   (unnamed)   주차수   주요분류명
190   191        191          한번에측조    24             6           1        농자재
191   192        192          곰장어        24             6           1        생물
192   193        193          첨지          24             6           1        기타
193   194        194          협약          23             6           1        정책
194   195        195          축사          23             6           1        생활문화
195   196        196          기술자료집    23             6           1        영농기술
196   197        197          데이터        23             6           1        정책
197   198        198          이은혜        23             6           1        기타
198   199        199          안전성        23             6           1        (blank)
199   200        200          감자역병      23             6           1        (blank)