Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/394df779-6187-4594-9f63-273dc934aa51 |
수집년월 has constant value "" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
단어빈도 has unique values | Unique |
매개중심성 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:51:41.779029 |
---|---|
Analysis finished | 2023-12-10 13:51:45.797347 |
Duration | 4.02 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
수집년월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
2010-01 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
2010-01 | 30 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2010-01 | 30 |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
경기도 | 1 | 3.3% |
투자 | 1 | 3.3% |
건설 | 1 | 3.3% |
수도권 | 1 | 3.3% |
용인 | 1 | 3.3% |
산업단지 | 1 | 3.3% |
구제역 | 1 | 3.3% |
세종시 | 1 | 3.3% |
정보 | 1 | 3.3% |
수원 | 1 | 3.3% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
업 | 4 | 6.0% |
경 | 3 | 4.5% |
수 | 3 | 4.5% |
도 | 3 | 4.5% |
산 | 3 | 4.5% |
지 | 3 | 4.5% |
기 | 3 | 4.5% |
시 | 3 | 4.5% |
정 | 2 | 3.0% |
부 | 2 | 3.0% |
Other values (34) | 38 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 67 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 4 | 6.0% |
경 | 3 | 4.5% |
수 | 3 | 4.5% |
도 | 3 | 4.5% |
산 | 3 | 4.5% |
지 | 3 | 4.5% |
기 | 3 | 4.5% |
시 | 3 | 4.5% |
정 | 2 | 3.0% |
부 | 2 | 3.0% |
Other values (34) | 38 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 67 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 4 | 6.0% |
경 | 3 | 4.5% |
수 | 3 | 4.5% |
도 | 3 | 4.5% |
산 | 3 | 4.5% |
지 | 3 | 4.5% |
기 | 3 | 4.5% |
시 | 3 | 4.5% |
정 | 2 | 3.0% |
부 | 2 | 3.0% |
Other values (34) | 38 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 67 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
업 | 4 | 6.0% |
경 | 3 | 4.5% |
수 | 3 | 4.5% |
도 | 3 | 4.5% |
산 | 3 | 4.5% |
지 | 3 | 4.5% |
기 | 3 | 4.5% |
시 | 3 | 4.5% |
정 | 2 | 3.0% |
부 | 2 | 3.0% |
Other values (34) | 38 |
단어빈도
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1616.8667 |
Minimum | 657 |
---|---|
Maximum | 11759 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 657 |
---|---|
5-th percentile | 685.25 |
Q1 | 805.5 |
median | 1060.5 |
Q3 | 1358 |
95-th percentile | 4053.8 |
Maximum | 11759 |
Range | 11102 |
Interquartile range (IQR) | 552.5 |
Descriptive statistics
Standard deviation | 2082.9194 |
---|---|
Coefficient of variation (CV) | 1.2882444 |
Kurtosis | 20.693361 |
Mean | 1616.8667 |
Median Absolute Deviation (MAD) | 281.5 |
Skewness | 4.3500994 |
Sum | 48506 |
Variance | 4338553.2 |
Monotonicity | Strictly decreasing |
Value | Count | Frequency (%) |
11759 | 1 | 3.3% |
968 | 1 | 3.3% |
657 | 1 | 3.3% |
674 | 1 | 3.3% |
699 | 1 | 3.3% |
711 | 1 | 3.3% |
751 | 1 | 3.3% |
775 | 1 | 3.3% |
800 | 1 | 3.3% |
805 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
657 | 1 | |
674 | 1 | |
699 | 1 | |
711 | 1 | |
751 | 1 | |
775 | 1 | |
800 | 1 | |
805 | 1 | |
807 | 1 | |
842 | 1 |
Value | Count | Frequency (%) |
11759 | 1 | |
4376 | 1 | |
3660 | 1 | |
1770 | 1 | |
1662 | 1 | |
1621 | 1 | |
1608 | 1 | |
1359 | 1 | |
1355 | 1 | |
1338 | 1 |
단어중요도
Real number (ℝ)
Distinct | 26 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.025666667 |
Minimum | 0.0213 |
---|---|
Maximum | 0.0377 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0213 |
---|---|
5-th percentile | 0.02238 |
Q1 | 0.023625 |
median | 0.02495 |
Q3 | 0.0268 |
95-th percentile | 0.030995 |
Maximum | 0.0377 |
Range | 0.0164 |
Interquartile range (IQR) | 0.003175 |
Descriptive statistics
Standard deviation | 0.0032387027 |
---|---|
Coefficient of variation (CV) | 0.12618322 |
Kurtosis | 5.7261215 |
Mean | 0.025666667 |
Median Absolute Deviation (MAD) | 0.00155 |
Skewness | 2.0024725 |
Sum | 0.77 |
Variance | 1.0489195 × 10-5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.025 | 2 | 6.7% |
0.0245 | 2 | 6.7% |
0.0249 | 2 | 6.7% |
0.0234 | 2 | 6.7% |
0.0285 | 1 | 3.3% |
0.023 | 1 | 3.3% |
0.0226 | 1 | 3.3% |
0.027 | 1 | 3.3% |
0.0377 | 1 | 3.3% |
0.0305 | 1 | 3.3% |
Other values (16) | 16 |
Value | Count | Frequency (%) |
0.0213 | 1 | |
0.0222 | 1 | |
0.0226 | 1 | |
0.0229 | 1 | |
0.023 | 1 | |
0.0234 | 2 | |
0.0236 | 1 | |
0.0237 | 1 | |
0.0238 | 1 | |
0.0245 | 2 |
Value | Count | Frequency (%) |
0.0377 | 1 | |
0.0314 | 1 | |
0.0305 | 1 | |
0.0285 | 1 | |
0.028 | 1 | |
0.0271 | 1 | |
0.027 | 1 | |
0.0269 | 1 | |
0.0265 | 1 | |
0.0263 | 1 |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.08759 |
Minimum | 0.0248 |
---|---|
Maximum | 0.3135 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0248 |
---|---|
5-th percentile | 0.031595 |
Q1 | 0.052625 |
median | 0.0716 |
Q3 | 0.104175 |
95-th percentile | 0.193025 |
Maximum | 0.3135 |
Range | 0.2887 |
Interquartile range (IQR) | 0.05155 |
Descriptive statistics
Standard deviation | 0.058639126 |
---|---|
Coefficient of variation (CV) | 0.66947284 |
Kurtosis | 7.223329 |
Mean | 0.08759 |
Median Absolute Deviation (MAD) | 0.02105 |
Skewness | 2.3891575 |
Sum | 2.6277 |
Variance | 0.0034385471 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.1053 | 2 | 6.7% |
0.0582 | 2 | 6.7% |
0.3135 | 1 | 3.3% |
0.0542 | 1 | 3.3% |
0.039 | 1 | 3.3% |
0.0607 | 1 | 3.3% |
0.0476 | 1 | 3.3% |
0.0344 | 1 | 3.3% |
0.0248 | 1 | 3.3% |
0.0293 | 1 | 3.3% |
Other values (18) | 18 |
Value | Count | Frequency (%) |
0.0248 | 1 | |
0.0293 | 1 | |
0.0344 | 1 | |
0.039 | 1 | |
0.0476 | 1 | |
0.0511 | 1 | |
0.0516 | 1 | |
0.0521 | 1 | |
0.0542 | 1 | |
0.0582 | 2 |
Value | Count | Frequency (%) |
0.3135 | 1 | |
0.2117 | 1 | |
0.1702 | 1 | |
0.1261 | 1 | |
0.1114 | 1 | |
0.1053 | 2 | |
0.1043 | 1 | |
0.1038 | 1 | |
0.0932 | 1 | |
0.0891 | 1 |
매개중심성
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.026326667 |
Minimum | 0.0016 |
---|---|
Maximum | 0.2069 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0016 |
---|---|
5-th percentile | 0.00284 |
Q1 | 0.008325 |
median | 0.0147 |
Q3 | 0.02445 |
95-th percentile | 0.088035 |
Maximum | 0.2069 |
Range | 0.2053 |
Interquartile range (IQR) | 0.016125 |
Descriptive statistics
Standard deviation | 0.040007326 |
---|---|
Coefficient of variation (CV) | 1.5196503 |
Kurtosis | 15.010897 |
Mean | 0.026326667 |
Median Absolute Deviation (MAD) | 0.00885 |
Skewness | 3.6625745 |
Sum | 0.7898 |
Variance | 0.0016005862 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.2069 | 1 | 3.3% |
0.0093 | 1 | 3.3% |
0.0035 | 1 | 3.3% |
0.0105 | 1 | 3.3% |
0.0045 | 1 | 3.3% |
0.0016 | 1 | 3.3% |
0.0066 | 1 | 3.3% |
0.0036 | 1 | 3.3% |
0.0023 | 1 | 3.3% |
0.0141 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
0.0016 | 1 | |
0.0023 | 1 | |
0.0035 | 1 | |
0.0036 | 1 | |
0.0045 | 1 | |
0.0059 | 1 | |
0.0066 | 1 | |
0.008 | 1 | |
0.0093 | 1 | |
0.0097 | 1 |
Value | Count | Frequency (%) |
0.2069 | 1 | |
0.1059 | 1 | |
0.0662 | 1 | |
0.0404 | 1 | |
0.0308 | 1 | |
0.03 | 1 | |
0.0255 | 1 | |
0.0247 | 1 | |
0.0237 | 1 | |
0.0236 | 1 |
분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.353 | 0.172 | 0.641 | 0.725 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.353 | 1.000 | 1.000 | 0.000 | 1.000 | 0.991 |
단어중요도 | 0.172 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 |
연결정도중심성 | 0.641 | 1.000 | 1.000 | 0.000 | 1.000 | 0.943 |
매개중심성 | 0.725 | 1.000 | 0.991 | 0.000 | 0.943 | 1.000 |
분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | 0.254 | -0.885 | -0.907 |
단어빈도 | -1.000 | 1.000 | -0.254 | 0.885 | 0.907 |
단어중요도 | 0.254 | -0.254 | 1.000 | -0.217 | -0.242 |
연결정도중심성 | -0.885 | 0.885 | -0.217 | 1.000 | 0.975 |
매개중심성 | -0.907 | 0.907 | -0.242 | 0.975 | 1.000 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 경기도 | 11759 | 0.0229 | 0.3135 | 0.2069 |
1 | 2 | 2010-01 | 투자 | 4376 | 0.0234 | 0.1702 | 0.0662 |
2 | 3 | 2010-01 | 산업 | 3660 | 0.0246 | 0.2117 | 0.1059 |
3 | 4 | 2010-01 | 지역 | 1770 | 0.0245 | 0.1261 | 0.0404 |
4 | 5 | 2010-01 | 사업 | 1662 | 0.0269 | 0.1114 | 0.0237 |
5 | 6 | 2010-01 | 자금 | 1621 | 0.0222 | 0.0886 | 0.0236 |
6 | 7 | 2010-01 | 서울 | 1608 | 0.0245 | 0.0932 | 0.0247 |
7 | 8 | 2010-01 | 수출 | 1359 | 0.0237 | 0.1053 | 0.0308 |
8 | 9 | 2010-01 | 경기 | 1355 | 0.0238 | 0.1038 | 0.0205 |
9 | 10 | 2010-01 | 지원 | 1338 | 0.0314 | 0.1043 | 0.0255 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 아파트 | 842 | 0.0254 | 0.0658 | 0.0151 |
21 | 22 | 2010-01 | 수원 | 807 | 0.0263 | 0.0516 | 0.008 |
22 | 23 | 2010-01 | 정보 | 805 | 0.0271 | 0.0759 | 0.0141 |
23 | 24 | 2010-01 | 세종시 | 800 | 0.0305 | 0.0293 | 0.0023 |
24 | 25 | 2010-01 | 구제역 | 775 | 0.0377 | 0.0248 | 0.0036 |
25 | 26 | 2010-01 | 산업단지 | 751 | 0.027 | 0.0582 | 0.0066 |
26 | 27 | 2010-01 | 용인 | 711 | 0.025 | 0.0344 | 0.0016 |
27 | 28 | 2010-01 | 수도권 | 699 | 0.0226 | 0.0476 | 0.0045 |
28 | 29 | 2010-01 | 건설 | 674 | 0.0249 | 0.0607 | 0.0105 |
29 | 30 | 2010-01 | 정부 | 657 | 0.023 | 0.039 | 0.0035 |