Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/60706d8d-6cf1-4ecc-827c-27c72fd2e212 |
수집년월 has constant value "" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
단어중요도 is highly overall correlated with 매개중심성 | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 3 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 14:03:28.079105 |
---|---|
Analysis finished | 2023-12-10 14:03:32.823849 |
Duration | 4.74 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
수집년월
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2010-01-01 00:00:00 |
---|---|
Maximum | 2010-01-01 00:00:00 |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
고양 | 1 | 3.3% |
경기 | 1 | 3.3% |
사랑 | 1 | 3.3% |
주택 | 1 | 3.3% |
용인 | 1 | 3.3% |
사업 | 1 | 3.3% |
도시 | 1 | 3.3% |
문화 | 1 | 3.3% |
시민 | 1 | 3.3% |
인천 | 1 | 3.3% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 4 | 6.1% |
도 | 4 | 6.1% |
주 | 3 | 4.5% |
양 | 3 | 4.5% |
부 | 3 | 4.5% |
정 | 2 | 3.0% |
인 | 2 | 3.0% |
수 | 2 | 3.0% |
사 | 2 | 3.0% |
남 | 2 | 3.0% |
Other values (35) | 39 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 4 | 6.1% |
도 | 4 | 6.1% |
주 | 3 | 4.5% |
양 | 3 | 4.5% |
부 | 3 | 4.5% |
정 | 2 | 3.0% |
인 | 2 | 3.0% |
수 | 2 | 3.0% |
사 | 2 | 3.0% |
남 | 2 | 3.0% |
Other values (35) | 39 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 4 | 6.1% |
도 | 4 | 6.1% |
주 | 3 | 4.5% |
양 | 3 | 4.5% |
부 | 3 | 4.5% |
정 | 2 | 3.0% |
인 | 2 | 3.0% |
수 | 2 | 3.0% |
사 | 2 | 3.0% |
남 | 2 | 3.0% |
Other values (35) | 39 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 4 | 6.1% |
도 | 4 | 6.1% |
주 | 3 | 4.5% |
양 | 3 | 4.5% |
부 | 3 | 4.5% |
정 | 2 | 3.0% |
인 | 2 | 3.0% |
수 | 2 | 3.0% |
사 | 2 | 3.0% |
남 | 2 | 3.0% |
Other values (35) | 39 |
단어빈도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 737.16667 |
Minimum | 270 |
---|---|
Maximum | 7690 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 270 |
---|---|
5-th percentile | 275.8 |
Q1 | 333 |
median | 373 |
Q3 | 466.75 |
95-th percentile | 1488.15 |
Maximum | 7690 |
Range | 7420 |
Interquartile range (IQR) | 133.75 |
Descriptive statistics
Standard deviation | 1353.0852 |
---|---|
Coefficient of variation (CV) | 1.8355214 |
Kurtosis | 26.252003 |
Mean | 737.16667 |
Median Absolute Deviation (MAD) | 54 |
Skewness | 5.0066251 |
Sum | 22115 |
Variance | 1830839.5 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
373 | 2 | 6.7% |
333 | 2 | 6.7% |
366 | 1 | 3.3% |
270 | 1 | 3.3% |
274 | 1 | 3.3% |
278 | 1 | 3.3% |
290 | 1 | 3.3% |
317 | 1 | 3.3% |
321 | 1 | 3.3% |
327 | 1 | 3.3% |
Other values (18) | 18 |
Value | Count | Frequency (%) |
270 | 1 | |
274 | 1 | |
278 | 1 | |
290 | 1 | |
317 | 1 | |
321 | 1 | |
327 | 1 | |
333 | 2 | |
345 | 1 | |
349 | 1 |
Value | Count | Frequency (%) |
7690 | 1 | |
1530 | 1 | |
1437 | 1 | |
1183 | 1 | |
838 | 1 | |
686 | 1 | |
503 | 1 | |
468 | 1 | |
463 | 1 | |
448 | 1 |
단어중요도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.02641 |
Minimum | 0.0213 |
---|---|
Maximum | 0.0452 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0213 |
---|---|
5-th percentile | 0.02195 |
Q1 | 0.023825 |
median | 0.02505 |
Q3 | 0.02675 |
95-th percentile | 0.03469 |
Maximum | 0.0452 |
Range | 0.0239 |
Interquartile range (IQR) | 0.002925 |
Descriptive statistics
Standard deviation | 0.0048875881 |
---|---|
Coefficient of variation (CV) | 0.18506581 |
Kurtosis | 7.5764826 |
Mean | 0.02641 |
Median Absolute Deviation (MAD) | 0.00135 |
Skewness | 2.515336 |
Sum | 0.7923 |
Variance | 2.3888517 × 10-5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0248 | 2 | 6.7% |
0.0263 | 2 | 6.7% |
0.0253 | 2 | 6.7% |
0.0225 | 2 | 6.7% |
0.0304 | 1 | 3.3% |
0.0239 | 1 | 3.3% |
0.0247 | 1 | 3.3% |
0.0245 | 1 | 3.3% |
0.0213 | 1 | 3.3% |
0.0257 | 1 | 3.3% |
Other values (16) | 16 |
Value | Count | Frequency (%) |
0.0213 | 1 | |
0.0215 | 1 | |
0.0225 | 2 | |
0.0232 | 1 | |
0.0235 | 1 | |
0.0236 | 1 | |
0.0238 | 1 | |
0.0239 | 1 | |
0.0243 | 1 | |
0.0245 | 1 |
Value | Count | Frequency (%) |
0.0452 | 1 | |
0.0382 | 1 | |
0.0304 | 1 | |
0.0302 | 1 | |
0.0301 | 1 | |
0.029 | 1 | |
0.0288 | 1 | |
0.0269 | 1 | |
0.0263 | 2 | |
0.0257 | 1 |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 70.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.078446667 |
Minimum | 0.0349 |
---|---|
Maximum | 0.3688 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0349 |
---|---|
5-th percentile | 0.0349 |
Q1 | 0.045875 |
median | 0.06375 |
Q3 | 0.07645 |
95-th percentile | 0.15432 |
Maximum | 0.3688 |
Range | 0.3339 |
Interquartile range (IQR) | 0.030575 |
Descriptive statistics
Standard deviation | 0.063053878 |
---|---|
Coefficient of variation (CV) | 0.80378021 |
Kurtosis | 16.00459 |
Mean | 0.078446667 |
Median Absolute Deviation (MAD) | 0.01315 |
Skewness | 3.6849288 |
Sum | 2.3534 |
Variance | 0.0039757915 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0611 | 3 | 10.0% |
0.0349 | 3 | 10.0% |
0.0769 | 2 | 6.7% |
0.0437 | 2 | 6.7% |
0.0751 | 2 | 6.7% |
0.0402 | 2 | 6.7% |
0.0664 | 2 | 6.7% |
0.0681 | 1 | 3.3% |
0.0629 | 1 | 3.3% |
0.0576 | 1 | 3.3% |
Other values (11) | 11 |
Value | Count | Frequency (%) |
0.0349 | 3 | |
0.0402 | 2 | |
0.0419 | 1 | 3.3% |
0.0437 | 2 | |
0.0524 | 1 | 3.3% |
0.0559 | 1 | 3.3% |
0.0576 | 1 | 3.3% |
0.0611 | 3 | |
0.0629 | 1 | 3.3% |
0.0646 | 1 | 3.3% |
Value | Count | Frequency (%) |
0.3688 | 1 | |
0.159 | 1 | |
0.1486 | 1 | |
0.1381 | 1 | |
0.0926 | 1 | |
0.0804 | 1 | |
0.0769 | 2 | |
0.0751 | 2 | |
0.0699 | 1 | |
0.0681 | 1 |
매개중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 27 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.032993333 |
Minimum | 0.0035 |
---|---|
Maximum | 0.4215 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0035 |
---|---|
5-th percentile | 0.004595 |
Q1 | 0.010125 |
median | 0.0141 |
Q3 | 0.02135 |
95-th percentile | 0.06902 |
Maximum | 0.4215 |
Range | 0.418 |
Interquartile range (IQR) | 0.011225 |
Descriptive statistics
Standard deviation | 0.075435599 |
---|---|
Coefficient of variation (CV) | 2.2863891 |
Kurtosis | 26.519329 |
Mean | 0.032993333 |
Median Absolute Deviation (MAD) | 0.00565 |
Skewness | 5.0350678 |
Sum | 0.9898 |
Variance | 0.0056905296 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0141 | 3 | 10.0% |
0.006 | 2 | 6.7% |
0.4215 | 1 | 3.3% |
0.0118 | 1 | 3.3% |
0.0041 | 1 | 3.3% |
0.0115 | 1 | 3.3% |
0.0102 | 1 | 3.3% |
0.0081 | 1 | 3.3% |
0.0217 | 1 | 3.3% |
0.0161 | 1 | 3.3% |
Other values (17) | 17 |
Value | Count | Frequency (%) |
0.0035 | 1 | |
0.0041 | 1 | |
0.0052 | 1 | |
0.006 | 2 | |
0.0081 | 1 | |
0.0093 | 1 | |
0.0101 | 1 | |
0.0102 | 1 | |
0.0103 | 1 | |
0.0112 | 1 |
Value | Count | Frequency (%) |
0.4215 | 1 | |
0.071 | 1 | |
0.0666 | 1 | |
0.0597 | 1 | |
0.0375 | 1 | |
0.0297 | 1 | |
0.0284 | 1 | |
0.0217 | 1 | |
0.0203 | 1 | |
0.0194 | 1 |
분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.575 | 0.000 | 0.596 | 0.590 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.575 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
단어중요도 | 0.000 | 1.000 | 0.000 | 1.000 | 0.265 | 0.000 |
연결정도중심성 | 0.596 | 1.000 | 1.000 | 0.265 | 1.000 | 1.000 |
매개중심성 | 0.590 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | -0.047 | -0.639 | -0.566 |
단어빈도 | -1.000 | 1.000 | 0.051 | 0.639 | 0.568 |
단어중요도 | -0.047 | 0.051 | 1.000 | -0.026 | 0.530 |
연결정도중심성 | -0.639 | 0.639 | -0.026 | 1.000 | 0.513 |
매개중심성 | -0.566 | 0.568 | 0.530 | 0.513 | 1.000 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 고양 | 7690 | 0.0304 | 0.3688 | 0.4215 |
1 | 2 | 2010-01 | 경기 | 1530 | 0.0251 | 0.159 | 0.071 |
2 | 3 | 2010-01 | 경기도 | 1437 | 0.0238 | 0.1486 | 0.0666 |
3 | 4 | 2010-01 | 서울 | 1183 | 0.0248 | 0.1381 | 0.0597 |
4 | 5 | 2010-01 | 분양 | 838 | 0.0302 | 0.0559 | 0.0203 |
5 | 6 | 2010-01 | 파주 | 686 | 0.0263 | 0.0804 | 0.0194 |
6 | 7 | 2010-01 | 아파트 | 503 | 0.0253 | 0.0751 | 0.0284 |
7 | 8 | 2010-01 | 수도권 | 468 | 0.0225 | 0.0419 | 0.0093 |
8 | 9 | 2010-01 | 수원 | 463 | 0.0225 | 0.0926 | 0.0141 |
9 | 10 | 2010-01 | 시장 | 448 | 0.0253 | 0.0611 | 0.0149 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 김포 | 345 | 0.0256 | 0.0769 | 0.0101 |
21 | 22 | 2010-01 | 인천 | 333 | 0.0232 | 0.0664 | 0.0141 |
22 | 23 | 2010-01 | 시민 | 333 | 0.029 | 0.0437 | 0.0112 |
23 | 24 | 2010-01 | 문화 | 327 | 0.0269 | 0.0611 | 0.0161 |
24 | 25 | 2010-01 | 도시 | 321 | 0.0288 | 0.0576 | 0.0141 |
25 | 26 | 2010-01 | 사업 | 317 | 0.0257 | 0.0437 | 0.0217 |
26 | 27 | 2010-01 | 용인 | 290 | 0.0213 | 0.0629 | 0.0081 |
27 | 28 | 2010-01 | 주택 | 278 | 0.0245 | 0.0402 | 0.0102 |
28 | 29 | 2010-01 | 사랑 | 274 | 0.0263 | 0.0349 | 0.0115 |
29 | 30 | 2010-01 | 신도시 | 270 | 0.0247 | 0.0349 | 0.0041 |