Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | Sample data |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/a76867e4-78ae-4335-a08d-51a054c243fe |
Alerts
수집년월 has constant value "2010-01" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
단어연결중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
단어매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
단어매개중심성 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:44:47.391501 |
---|---|
Analysis finished | 2023-12-10 13:44:52.135109 |
Duration | 4.74 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
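The figures above can be checked by hand, because the distinct count (30), minimum (1), maximum (30) and strict monotonicity imply that 분석인덱스 is simply the integers 1 through 30. The sketch below recomputes them with Python's standard library only. The helper name `linear_quantile` is illustrative, and the sketch assumes the sample (n - 1) variance and linear interpolation between order statistics, which is what the reported values are consistent with; the report itself does not state its estimators.

```python
import math
import statistics

# Assumption: the column holds exactly the integers 1..30, as the report's
# distinct count, minimum, maximum and monotonicity imply.
values = list(range(1, 31))


def linear_quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
p05 = linear_quantile(values, 0.05)
q1 = linear_quantile(values, 0.25)
q3 = linear_quantile(values, 0.75)
p95 = linear_quantile(values, 0.95)

print(f"sum={sum(values)} mean={mean} variance={variance}")
print(f"std={std_dev:.7f} cv={cv:.8f} median={median} mad={mad}")
print(f"p05={p05:.2f} q1={q1:.2f} q3={q3:.2f} p95={p95:.2f} iqr={q3 - q1:.2f}")
```

Because the column is only a row counter, these statistics describe the ranking position of each keyword rather than a measured quantity.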
Common Values
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
2 | 1 | 3.3% |
3 | 1 | 3.3% |
4 | 1 | 3.3% |
5 | 1 | 3.3% |
6 | 1 | 3.3% |
7 | 1 | 3.3% |
8 | 1 | 3.3% |
9 | 1 | 3.3% |
10 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
22 | 1 | 3.3% |
21 | 1 | 3.3% |
수집년월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
2010-01 | 30 | 100.0% |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
경기도 | 1 | 3.3% |
창업 | 1 | 3.3% |
채용 | 1 | 3.3% |
성남 | 1 | 3.3% |
성공 | 1 | 3.3% |
시장 | 1 | 3.3% |
상권 | 1 | 3.3% |
여성창업 | 1 | 3.3% |
취업 | 1 | 3.3% |
회사 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
업 | 8 | 11.8% |
기 | 5 | 7.4% |
성 | 4 | 5.9% |
공 | 3 | 4.4% |
소 | 2 | 2.9% |
처 | 2 | 2.9% |
벤 | 2 | 2.9% |
장 | 2 | 2.9% |
사 | 2 | 2.9% |
원 | 2 | 2.9% |
Other values (33) | 36 | 52.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 68 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 68 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 68 | 100.0% |
단어빈도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 150.3 |
Minimum | 58 |
---|---|
Maximum | 943 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 58 |
---|---|
5-th percentile | 58.45 |
Q1 | 64.5 |
median | 85.5 |
Q3 | 123 |
95-th percentile | 585.75 |
Maximum | 943 |
Range | 885 |
Interquartile range (IQR) | 58.5 |
Descriptive statistics
Standard deviation | 205.56401 |
---|---|
Coefficient of variation (CV) | 1.3676913 |
Kurtosis | 10.908823 |
Mean | 150.3 |
Median Absolute Deviation (MAD) | 24.5 |
Skewness | 3.3799693 |
Sum | 4509 |
Variance | 42256.562 |
Monotonicity | Decreasing |
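Several of the 단어빈도 figures are derived from others in the same tables, so they can be cross-checked with simple arithmetic: mean = sum / observations, range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, and variance = standard deviation squared. The following is a minimal sketch of that check using only values copied from the report; the dictionary names are illustrative.

```python
import math

# Figures copied from the 단어빈도 tables above.
n_obs = 30
total = 4509
minimum, maximum = 58, 943
q1, q3 = 64.5, 123
std_dev = 205.56401
reported = {
    "mean": 150.3,
    "range": 885,
    "iqr": 58.5,
    "cv": 1.3676913,
    "variance": 42256.562,
}

# Recompute each figure from the more basic ones.
derived = {
    "mean": total / n_obs,
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std_dev / (total / n_obs),
    "variance": std_dev ** 2,
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.8g} reported={reported[name]} consistent={ok}")
```

The mean (150.3) sits well above the median (85.5), which agrees with the strongly positive skewness: the two largest counts, 943 and 822, pull the mean upward.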
Common Values
Value | Count | Frequency (%) |
78 | 3 | 10.0% |
58 | 2 | 6.7% |
60 | 2 | 6.7% |
943 | 1 | 3.3% |
86 | 1 | 3.3% |
59 | 1 | 3.3% |
61 | 1 | 3.3% |
63 | 1 | 3.3% |
64 | 1 | 3.3% |
66 | 1 | 3.3% |
Other values (16) | 16 | 53.3% |
Minimum 10 values
Value | Count | Frequency (%) |
58 | 2 | 6.7% |
59 | 1 | 3.3% |
60 | 2 | 6.7% |
61 | 1 | 3.3% |
63 | 1 | 3.3% |
64 | 1 | 3.3% |
66 | 1 | 3.3% |
68 | 1 | 3.3% |
78 | 3 | 10.0% |
79 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
943 | 1 | 3.3% |
822 | 1 | 3.3% |
297 | 1 | 3.3% |
192 | 1 | 3.3% |
146 | 1 | 3.3% |
129 | 1 | 3.3% |
127 | 1 | 3.3% |
124 | 1 | 3.3% |
120 | 1 | 3.3% |
110 | 1 | 3.3% |
단어중요도
Real number (ℝ)
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.028243333 |
Minimum | 0.0193 |
---|---|
Maximum | 0.0581 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0193 |
---|---|
5-th percentile | 0.022425 |
Q1 | 0.023975 |
median | 0.0261 |
Q3 | 0.030475 |
95-th percentile | 0.03901 |
Maximum | 0.0581 |
Range | 0.0388 |
Interquartile range (IQR) | 0.0065 |
Descriptive statistics
Standard deviation | 0.0073097801 |
---|---|
Coefficient of variation (CV) | 0.25881435 |
Kurtosis | 9.0713251 |
Mean | 0.028243333 |
Median Absolute Deviation (MAD) | 0.00325 |
Skewness | 2.5863775 |
Sum | 0.8473 |
Variance | 5.3432885 × 10⁻⁵ |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.0227 | 2 | 6.7% |
0.0239 | 1 | 3.3% |
0.0306 | 1 | 3.3% |
0.0247 | 1 | 3.3% |
0.0391 | 1 | 3.3% |
0.0389 | 1 | 3.3% |
0.025 | 1 | 3.3% |
0.0236 | 1 | 3.3% |
0.0581 | 1 | 3.3% |
0.0325 | 1 | 3.3% |
Other values (19) | 19 | 63.3% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0193 | 1 | 3.3% |
0.0222 | 1 | 3.3% |
0.0227 | 2 | 6.7% |
0.0228 | 1 | 3.3% |
0.0236 | 1 | 3.3% |
0.0238 | 1 | 3.3% |
0.0239 | 1 | 3.3% |
0.0242 | 1 | 3.3% |
0.0243 | 1 | 3.3% |
0.0246 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
0.0581 | 1 | 3.3% |
0.0391 | 1 | 3.3% |
0.0389 | 1 | 3.3% |
0.0325 | 1 | 3.3% |
0.0324 | 1 | 3.3% |
0.0322 | 1 | 3.3% |
0.0307 | 1 | 3.3% |
0.0306 | 1 | 3.3% |
0.0301 | 1 | 3.3% |
0.0299 | 1 | 3.3% |
단어연결중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 23 |
---|---|
Distinct (%) | 76.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.044363333 |
Minimum | 0.0102 |
---|---|
Maximum | 0.2032 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0102 |
---|---|
5-th percentile | 0.012275 |
Q1 | 0.0203 |
median | 0.03265 |
Q3 | 0.0421 |
95-th percentile | 0.138965 |
Maximum | 0.2032 |
Range | 0.193 |
Interquartile range (IQR) | 0.0218 |
Descriptive statistics
Standard deviation | 0.044101822 |
---|---|
Coefficient of variation (CV) | 0.99410523 |
Kurtosis | 7.8747526 |
Mean | 0.044363333 |
Median Absolute Deviation (MAD) | 0.01235 |
Skewness | 2.7811191 |
Sum | 1.3309 |
Variance | 0.0019449707 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.0189 | 4 | 13.3% |
0.0203 | 2 | 6.7% |
0.029 | 2 | 6.7% |
0.0348 | 2 | 6.7% |
0.0421 | 2 | 6.7% |
0.1814 | 1 | 3.3% |
0.0116 | 1 | 3.3% |
0.0232 | 1 | 3.3% |
0.0218 | 1 | 3.3% |
0.0102 | 1 | 3.3% |
Other values (13) | 13 | 43.3% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0102 | 1 | 3.3% |
0.0116 | 1 | 3.3% |
0.0131 | 1 | 3.3% |
0.0189 | 4 | 13.3% |
0.0203 | 2 | 6.7% |
0.0218 | 1 | 3.3% |
0.0232 | 1 | 3.3% |
0.029 | 2 | 6.7% |
0.0305 | 1 | 3.3% |
0.0319 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
0.2032 | 1 | 3.3% |
0.1814 | 1 | 3.3% |
0.0871 | 1 | 3.3% |
0.0726 | 1 | 3.3% |
0.0668 | 1 | 3.3% |
0.0522 | 1 | 3.3% |
0.0493 | 1 | 3.3% |
0.0421 | 2 | 6.7% |
0.0406 | 1 | 3.3% |
0.0377 | 1 | 3.3% |
단어매개중심성
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.041036667 |
Minimum | 0.0014 |
---|---|
Maximum | 0.3231 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0014 |
---|---|
5-th percentile | 0.004845 |
Q1 | 0.010325 |
median | 0.0214 |
Q3 | 0.032775 |
95-th percentile | 0.17722 |
Maximum | 0.3231 |
Range | 0.3217 |
Interquartile range (IQR) | 0.02245 |
Descriptive statistics
Standard deviation | 0.070343382 |
---|---|
Coefficient of variation (CV) | 1.7141593 |
Kurtosis | 11.102241 |
Mean | 0.041036667 |
Median Absolute Deviation (MAD) | 0.01175 |
Skewness | 3.3415843 |
Sum | 1.2311 |
Variance | 0.0049481914 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.2521 | 1 | 3.3% |
0.0334 | 1 | 3.3% |
0.0086 | 1 | 3.3% |
0.0136 | 1 | 3.3% |
0.0087 | 1 | 3.3% |
0.0073 | 1 | 3.3% |
0.0169 | 1 | 3.3% |
0.0071 | 1 | 3.3% |
0.0099 | 1 | 3.3% |
0.021 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0014 | 1 | 3.3% |
0.0039 | 1 | 3.3% |
0.006 | 1 | 3.3% |
0.0071 | 1 | 3.3% |
0.0073 | 1 | 3.3% |
0.0086 | 1 | 3.3% |
0.0087 | 1 | 3.3% |
0.0099 | 1 | 3.3% |
0.0116 | 1 | 3.3% |
0.0122 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
0.3231 | 1 | 3.3% |
0.2521 | 1 | 3.3% |
0.0857 | 1 | 3.3% |
0.0718 | 1 | 3.3% |
0.0583 | 1 | 3.3% |
0.0363 | 1 | 3.3% |
0.0356 | 1 | 3.3% |
0.0334 | 1 | 3.3% |
0.0309 | 1 | 3.3% |
0.0254 | 1 | 3.3% |
Variable | 분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.437 | 0.529 | 0.402 | 0.299 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.437 | 1.000 | 1.000 | 0.000 | 0.897 | 0.988 |
단어중요도 | 0.529 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 |
단어연결중심성 | 0.402 | 1.000 | 0.897 | 0.000 | 1.000 | 1.000 |
단어매개중심성 | 0.299 | 1.000 | 0.988 | 0.000 | 1.000 | 1.000 |
Variable | 분석인덱스 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -0.999 | 0.103 | -0.744 | -0.787 |
단어빈도 | -0.999 | 1.000 | -0.102 | 0.743 | 0.784 |
단어중요도 | 0.103 | -0.102 | 1.000 | 0.074 | 0.008 |
단어연결중심성 | -0.744 | 0.743 | 0.074 | 1.000 | 0.970 |
단어매개중심성 | -0.787 | 0.784 | 0.008 | 0.970 | 1.000 |
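The report does not say which coefficient produced the signed matrix above. As an illustration only, the sketch below computes a rank-based (Spearman-style) correlation between 분석인덱스 and 단어빈도, using just the 20 rows that are visible in the sample tables of this report (rows 1 to 10 and 21 to 30). Treating the matrix as rank-based is an assumption, the function names are illustrative, and because a third of the rows are missing the result is not expected to reproduce the reported value exactly.

```python
from statistics import mean


def average_ranks(values):
    """Rank values ascending (1-based), giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# The 20 rows shown in this report; rows 11 to 20 are not available here.
index = list(range(1, 11)) + list(range(21, 31))
frequency = [943, 822, 297, 192, 146, 129, 127, 124, 120, 110,
             68, 66, 64, 63, 61, 60, 60, 59, 58, 58]

rho = spearman(index, frequency)
print(f"Rank correlation on the 20 visible rows: {rho:.4f}")
```

A value very close to -1 is what one would expect here: the rows are ordered by descending 단어빈도, and only tied counts (60 and 58 each appear twice) keep the relationship from being perfectly monotonic.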
First rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 경기도 | 943 | 0.0239 | 0.1814 | 0.2521 |
1 | 2 | 2010-01 | 창업 | 822 | 0.0277 | 0.2032 | 0.3231 |
2 | 3 | 2010-01 | 지원 | 297 | 0.0307 | 0.0871 | 0.0857 |
3 | 4 | 2010-01 | 기업 | 192 | 0.027 | 0.0726 | 0.0718 |
4 | 5 | 2010-01 | 사업 | 146 | 0.0228 | 0.0493 | 0.0363 |
5 | 6 | 2010-01 | 자금 | 129 | 0.0256 | 0.029 | 0.0218 |
6 | 7 | 2010-01 | 교육 | 127 | 0.0299 | 0.0668 | 0.0583 |
7 | 8 | 2010-01 | 서울 | 124 | 0.0238 | 0.0377 | 0.0309 |
8 | 9 | 2010-01 | 벤처기업 | 120 | 0.0227 | 0.0421 | 0.0239 |
9 | 10 | 2010-01 | 중소기업 | 110 | 0.0293 | 0.0334 | 0.0174 |
Last rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 대학 | 68 | 0.0266 | 0.0348 | 0.0226 |
21 | 22 | 2010-01 | 회사 | 66 | 0.0222 | 0.0189 | 0.0122 |
22 | 23 | 2010-01 | 취업 | 64 | 0.0243 | 0.0348 | 0.021 |
23 | 24 | 2010-01 | 여성창업 | 63 | 0.0325 | 0.0203 | 0.0099 |
24 | 25 | 2010-01 | 상권 | 61 | 0.0581 | 0.0189 | 0.0071 |
25 | 26 | 2010-01 | 시장 | 60 | 0.0236 | 0.029 | 0.0169 |
26 | 27 | 2010-01 | 성공 | 60 | 0.025 | 0.0189 | 0.0073 |
27 | 28 | 2010-01 | 성남 | 59 | 0.0389 | 0.0218 | 0.0087 |
28 | 29 | 2010-01 | 채용 | 58 | 0.0391 | 0.0203 | 0.0136 |
29 | 30 | 2010-01 | 공장 | 58 | 0.0247 | 0.0232 | 0.0086 |