Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/a131598b-2d55-4fb4-a600-226e7a65243a |
수집년월 has constant value "" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
매개중심성 has 1 (3.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 14:15:23.345232 |
---|---|
Analysis finished | 2023-12-10 14:15:28.038042 |
Duration | 4.69 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
수집년월
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2010-01-01 00:00:00 |
---|---|
Maximum | 2010-01-01 00:00:00 |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
경기도 | 1 | 3.3% |
지원 | 1 | 3.3% |
평택 | 1 | 3.3% |
업체 | 1 | 3.3% |
직업 | 1 | 3.3% |
시장 | 1 | 3.3% |
기업 | 1 | 3.3% |
수원 | 1 | 3.3% |
보증 | 1 | 3.3% |
거주 | 1 | 3.3% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
업 | 9 | 12.9% |
기 | 4 | 5.7% |
자 | 3 | 4.3% |
경 | 2 | 2.9% |
소 | 2 | 2.9% |
보 | 2 | 2.9% |
사 | 2 | 2.9% |
영 | 2 | 2.9% |
상 | 2 | 2.9% |
공 | 2 | 2.9% |
Other values (39) | 40 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 70 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 9 | 12.9% |
기 | 4 | 5.7% |
자 | 3 | 4.3% |
경 | 2 | 2.9% |
소 | 2 | 2.9% |
보 | 2 | 2.9% |
사 | 2 | 2.9% |
영 | 2 | 2.9% |
상 | 2 | 2.9% |
공 | 2 | 2.9% |
Other values (39) | 40 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 70 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 9 | 12.9% |
기 | 4 | 5.7% |
자 | 3 | 4.3% |
경 | 2 | 2.9% |
소 | 2 | 2.9% |
보 | 2 | 2.9% |
사 | 2 | 2.9% |
영 | 2 | 2.9% |
상 | 2 | 2.9% |
공 | 2 | 2.9% |
Other values (39) | 40 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 70 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
업 | 9 | 12.9% |
기 | 4 | 5.7% |
자 | 3 | 4.3% |
경 | 2 | 2.9% |
소 | 2 | 2.9% |
보 | 2 | 2.9% |
사 | 2 | 2.9% |
영 | 2 | 2.9% |
상 | 2 | 2.9% |
공 | 2 | 2.9% |
Other values (39) | 40 |
단어빈도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 83.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 101.03333 |
Minimum | 36 |
---|---|
Maximum | 601 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 36 |
---|---|
5-th percentile | 39.25 |
Q1 | 44 |
median | 52 |
Q3 | 96 |
95-th percentile | 239.15 |
Maximum | 601 |
Range | 565 |
Interquartile range (IQR) | 52 |
Descriptive statistics
Standard deviation | 114.30947 |
---|---|
Coefficient of variation (CV) | 1.1314035 |
Kurtosis | 12.516858 |
Mean | 101.03333 |
Median Absolute Deviation (MAD) | 12.5 |
Skewness | 3.2384667 |
Sum | 3031 |
Variance | 13066.654 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
44 | 3 | 10.0% |
42 | 2 | 6.7% |
43 | 2 | 6.7% |
50 | 2 | 6.7% |
601 | 1 | 3.3% |
245 | 1 | 3.3% |
36 | 1 | 3.3% |
37 | 1 | 3.3% |
46 | 1 | 3.3% |
48 | 1 | 3.3% |
Other values (15) | 15 |
Value | Count | Frequency (%) |
36 | 1 | 3.3% |
37 | 1 | 3.3% |
42 | 2 | |
43 | 2 | |
44 | 3 | |
46 | 1 | 3.3% |
48 | 1 | 3.3% |
49 | 1 | 3.3% |
50 | 2 | |
51 | 1 | 3.3% |
Value | Count | Frequency (%) |
601 | 1 | |
245 | 1 | |
232 | 1 | |
227 | 1 | |
211 | 1 | |
179 | 1 | |
104 | 1 | |
102 | 1 | |
78 | 1 | |
68 | 1 |
단어중요도
Real number (ℝ)
Distinct | 27 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.027476667 |
Minimum | 0.0208 |
---|---|
Maximum | 0.0452 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0208 |
---|---|
5-th percentile | 0.0209 |
Q1 | 0.023 |
median | 0.02555 |
Q3 | 0.031425 |
95-th percentile | 0.03734 |
Maximum | 0.0452 |
Range | 0.0244 |
Interquartile range (IQR) | 0.008425 |
Descriptive statistics
Standard deviation | 0.0062350336 |
---|---|
Coefficient of variation (CV) | 0.22692103 |
Kurtosis | 0.65724068 |
Mean | 0.027476667 |
Median Absolute Deviation (MAD) | 0.0036 |
Skewness | 1.1220029 |
Sum | 0.8243 |
Variance | 3.8875644 × 10-5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0236 | 2 | 6.7% |
0.0303 | 2 | 6.7% |
0.0209 | 2 | 6.7% |
0.0228 | 1 | 3.3% |
0.0334 | 1 | 3.3% |
0.0364 | 1 | 3.3% |
0.0217 | 1 | 3.3% |
0.0241 | 1 | 3.3% |
0.0214 | 1 | 3.3% |
0.0222 | 1 | 3.3% |
Other values (17) | 17 |
Value | Count | Frequency (%) |
0.0208 | 1 | |
0.0209 | 2 | |
0.0214 | 1 | |
0.0217 | 1 | |
0.0222 | 1 | |
0.0228 | 1 | |
0.0229 | 1 | |
0.0233 | 1 | |
0.0236 | 2 | |
0.0241 | 1 |
Value | Count | Frequency (%) |
0.0452 | 1 | |
0.0377 | 1 | |
0.0369 | 1 | |
0.0364 | 1 | |
0.0353 | 1 | |
0.0342 | 1 | |
0.0334 | 1 | |
0.0318 | 1 | |
0.0303 | 2 | |
0.0266 | 1 |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 20 |
---|---|
Distinct (%) | 66.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.04012 |
Minimum | 0.002 |
---|---|
Maximum | 0.1677 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.002 |
---|---|
5-th percentile | 0.008045 |
Q1 | 0.0207 |
median | 0.0279 |
Q3 | 0.040875 |
95-th percentile | 0.129455 |
Maximum | 0.1677 |
Range | 0.1657 |
Interquartile range (IQR) | 0.020175 |
Descriptive statistics
Standard deviation | 0.038480442 |
---|---|
Coefficient of variation (CV) | 0.95913365 |
Kurtosis | 4.4109334 |
Mean | 0.04012 |
Median Absolute Deviation (MAD) | 0.0114 |
Skewness | 2.1564782 |
Sum | 1.2036 |
Variance | 0.0014807444 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0227 | 3 | 10.0% |
0.0207 | 3 | 10.0% |
0.031 | 2 | 6.7% |
0.0124 | 2 | 6.7% |
0.0414 | 2 | 6.7% |
0.0372 | 2 | 6.7% |
0.0393 | 2 | 6.7% |
0.0248 | 2 | 6.7% |
0.0103 | 1 | 3.3% |
0.0165 | 1 | 3.3% |
Other values (10) | 10 |
Value | Count | Frequency (%) |
0.002 | 1 | 3.3% |
0.0062 | 1 | 3.3% |
0.0103 | 1 | 3.3% |
0.0124 | 2 | |
0.0165 | 1 | 3.3% |
0.0186 | 1 | 3.3% |
0.0207 | 3 | |
0.0227 | 3 | |
0.0248 | 2 | |
0.031 | 2 |
Value | Count | Frequency (%) |
0.1677 | 1 | |
0.1304 | 1 | |
0.1283 | 1 | |
0.0724 | 1 | |
0.0703 | 1 | |
0.0434 | 1 | |
0.0414 | 2 | |
0.0393 | 2 | |
0.0372 | 2 | |
0.0351 | 1 |
매개중심성
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.046796667 |
Minimum | 0 |
---|---|
Maximum | 0.2605 |
Zeros | 1 |
Zeros (%) | 3.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.00227 |
Q1 | 0.0131 |
median | 0.0232 |
Q3 | 0.03765 |
95-th percentile | 0.24042 |
Maximum | 0.2605 |
Range | 0.2605 |
Interquartile range (IQR) | 0.02455 |
Descriptive statistics
Standard deviation | 0.071261918 |
---|---|
Coefficient of variation (CV) | 1.522799 |
Kurtosis | 4.7628937 |
Mean | 0.046796667 |
Median Absolute Deviation (MAD) | 0.01235 |
Skewness | 2.4093897 |
Sum | 1.4039 |
Variance | 0.005078261 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0131 | 2 | 6.7% |
0.2605 | 1 | 3.3% |
0.035 | 1 | 3.3% |
0.0043 | 1 | 3.3% |
0.0085 | 1 | 3.3% |
0.0266 | 1 | 3.3% |
0.0037 | 1 | 3.3% |
0.036 | 1 | 3.3% |
0.0263 | 1 | 3.3% |
0.0011 | 1 | 3.3% |
Other values (19) | 19 |
Value | Count | Frequency (%) |
0.0 | 1 | |
0.0011 | 1 | |
0.0037 | 1 | |
0.0041 | 1 | |
0.0043 | 1 | |
0.0085 | 1 | |
0.0113 | 1 | |
0.0131 | 2 | |
0.0134 | 1 | |
0.0135 | 1 |
Value | Count | Frequency (%) |
0.2605 | 1 | |
0.2451 | 1 | |
0.2347 | 1 | |
0.1075 | 1 | |
0.0705 | 1 | |
0.0427 | 1 | |
0.0383 | 1 | |
0.0382 | 1 | |
0.036 | 1 | |
0.035 | 1 |
분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.857 | 0.185 | 0.463 | 0.615 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.857 | 1.000 | 1.000 | 0.000 | 0.787 | 0.930 |
단어중요도 | 0.185 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 |
연결정도중심성 | 0.463 | 1.000 | 0.787 | 0.000 | 1.000 | 0.832 |
매개중심성 | 0.615 | 1.000 | 0.930 | 0.000 | 0.832 | 1.000 |
분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -0.999 | -0.260 | -0.779 | -0.657 |
단어빈도 | -0.999 | 1.000 | 0.268 | 0.781 | 0.661 |
단어중요도 | -0.260 | 0.268 | 1.000 | 0.241 | 0.044 |
연결정도중심성 | -0.779 | 0.781 | 0.241 | 1.000 | 0.926 |
매개중심성 | -0.657 | 0.661 | 0.044 | 0.926 | 1.000 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 경기도 | 601 | 0.0236 | 0.1677 | 0.2605 |
1 | 2 | 2010-01 | 지원 | 245 | 0.0369 | 0.0724 | 0.0705 |
2 | 3 | 2010-01 | 창업 | 232 | 0.0377 | 0.1304 | 0.2451 |
3 | 4 | 2010-01 | 프랜차이즈 | 227 | 0.0247 | 0.1283 | 0.2347 |
4 | 5 | 2010-01 | 소상공인 | 211 | 0.0255 | 0.0434 | 0.0277 |
5 | 6 | 2010-01 | 자영업 | 179 | 0.0229 | 0.0703 | 0.1075 |
6 | 7 | 2010-01 | 중소기업 | 104 | 0.0303 | 0.0227 | 0.0113 |
7 | 8 | 2010-01 | 자금 | 102 | 0.0262 | 0.0414 | 0.0383 |
8 | 9 | 2010-01 | 사업 | 78 | 0.0256 | 0.0372 | 0.034 |
9 | 10 | 2010-01 | 경기 | 68 | 0.026 | 0.0393 | 0.0219 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 운영 | 46 | 0.0242 | 0.0186 | 0.0134 |
21 | 22 | 2010-01 | 거주 | 44 | 0.0208 | 0.002 | 0.0 |
22 | 23 | 2010-01 | 보증 | 44 | 0.0222 | 0.0207 | 0.0131 |
23 | 24 | 2010-01 | 수원 | 44 | 0.0236 | 0.0062 | 0.0011 |
24 | 25 | 2010-01 | 기업 | 43 | 0.0214 | 0.0227 | 0.0263 |
25 | 26 | 2010-01 | 시장 | 43 | 0.0241 | 0.0372 | 0.036 |
26 | 27 | 2010-01 | 직업 | 42 | 0.0217 | 0.0124 | 0.0037 |
27 | 28 | 2010-01 | 업체 | 42 | 0.0209 | 0.0248 | 0.0266 |
28 | 29 | 2010-01 | 평택 | 37 | 0.0364 | 0.0165 | 0.0085 |
29 | 30 | 2010-01 | 사업자 | 36 | 0.0334 | 0.0124 | 0.0043 |