Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/ebd237d7-a8ef-476e-bcc4-6dba6cabeb7d |
수집년월 has constant value "" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:58:52.905258 |
---|---|
Analysis finished | 2023-12-10 13:58:58.177552 |
Duration | 5.27 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
수집년월
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2010-01-01 00:00:00 |
---|---|
Maximum | 2010-01-01 00:00:00 |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
맛집 | 1 | 3.3% |
경기도 | 1 | 3.3% |
화성 | 1 | 3.3% |
파주 | 1 | 3.3% |
배달 | 1 | 3.3% |
양평 | 1 | 3.3% |
전문점 | 1 | 3.3% |
가평 | 1 | 3.3% |
부천 | 1 | 3.3% |
안산 | 1 | 3.3% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
양 | 4 | 6.1% |
주 | 3 | 4.5% |
음 | 3 | 4.5% |
식 | 3 | 4.5% |
점 | 3 | 4.5% |
맛 | 2 | 3.0% |
안 | 2 | 3.0% |
남 | 2 | 3.0% |
성 | 2 | 3.0% |
평 | 2 | 3.0% |
Other values (37) | 40 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
양 | 4 | 6.1% |
주 | 3 | 4.5% |
음 | 3 | 4.5% |
식 | 3 | 4.5% |
점 | 3 | 4.5% |
맛 | 2 | 3.0% |
안 | 2 | 3.0% |
남 | 2 | 3.0% |
성 | 2 | 3.0% |
평 | 2 | 3.0% |
Other values (37) | 40 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
양 | 4 | 6.1% |
주 | 3 | 4.5% |
음 | 3 | 4.5% |
식 | 3 | 4.5% |
점 | 3 | 4.5% |
맛 | 2 | 3.0% |
안 | 2 | 3.0% |
남 | 2 | 3.0% |
성 | 2 | 3.0% |
평 | 2 | 3.0% |
Other values (37) | 40 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
양 | 4 | 6.1% |
주 | 3 | 4.5% |
음 | 3 | 4.5% |
식 | 3 | 4.5% |
점 | 3 | 4.5% |
맛 | 2 | 3.0% |
안 | 2 | 3.0% |
남 | 2 | 3.0% |
성 | 2 | 3.0% |
평 | 2 | 3.0% |
Other values (37) | 40 |
단어빈도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 27 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 252.86667 |
Minimum | 74 |
---|---|
Maximum | 1728 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 74 |
---|---|
5-th percentile | 78 |
Q1 | 87.25 |
median | 110.5 |
Q3 | 162.25 |
95-th percentile | 1335.2 |
Maximum | 1728 |
Range | 1654 |
Interquartile range (IQR) | 75 |
Descriptive statistics
Standard deviation | 424.64222 |
---|---|
Coefficient of variation (CV) | 1.6793128 |
Kurtosis | 9.1440692 |
Mean | 252.86667 |
Median Absolute Deviation (MAD) | 27.5 |
Skewness | 3.1545059 |
Sum | 7586 |
Variance | 180321.02 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
78 | 2 | 6.7% |
84 | 2 | 6.7% |
110 | 2 | 6.7% |
1728 | 1 | 3.3% |
1724 | 1 | 3.3% |
74 | 1 | 3.3% |
79 | 1 | 3.3% |
82 | 1 | 3.3% |
87 | 1 | 3.3% |
88 | 1 | 3.3% |
Other values (17) | 17 |
Value | Count | Frequency (%) |
74 | 1 | |
78 | 2 | |
79 | 1 | |
82 | 1 | |
84 | 2 | |
87 | 1 | |
88 | 1 | |
96 | 1 | |
105 | 1 | |
107 | 1 |
Value | Count | Frequency (%) |
1728 | 1 | |
1724 | 1 | |
860 | 1 | |
231 | 1 | |
214 | 1 | |
209 | 1 | |
192 | 1 | |
165 | 1 | |
154 | 1 | |
141 | 1 |
단어중요도
Real number (ℝ)
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.03024 |
Minimum | 0.0227 |
---|---|
Maximum | 0.0419 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0227 |
---|---|
5-th percentile | 0.02369 |
Q1 | 0.0255 |
median | 0.0307 |
Q3 | 0.03385 |
95-th percentile | 0.04062 |
Maximum | 0.0419 |
Range | 0.0192 |
Interquartile range (IQR) | 0.00835 |
Descriptive statistics
Standard deviation | 0.005469199 |
---|---|
Coefficient of variation (CV) | 0.18085976 |
Kurtosis | -0.5623063 |
Mean | 0.03024 |
Median Absolute Deviation (MAD) | 0.00455 |
Skewness | 0.53047443 |
Sum | 0.9072 |
Variance | 2.9912138 × 10-5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.032 | 2 | 6.7% |
0.0261 | 2 | 6.7% |
0.0417 | 1 | 3.3% |
0.0393 | 1 | 3.3% |
0.0348 | 1 | 3.3% |
0.0343 | 1 | 3.3% |
0.0227 | 1 | 3.3% |
0.0286 | 1 | 3.3% |
0.034 | 1 | 3.3% |
0.0329 | 1 | 3.3% |
Other values (18) | 18 |
Value | Count | Frequency (%) |
0.0227 | 1 | |
0.0236 | 1 | |
0.0238 | 1 | |
0.0241 | 1 | |
0.0244 | 1 | |
0.0247 | 1 | |
0.0252 | 1 | |
0.0253 | 1 | |
0.0261 | 2 | |
0.0262 | 1 |
Value | Count | Frequency (%) |
0.0419 | 1 | |
0.0417 | 1 | |
0.0393 | 1 | |
0.036 | 1 | |
0.0359 | 1 | |
0.0348 | 1 | |
0.0343 | 1 | |
0.034 | 1 | |
0.0334 | 1 | |
0.0329 | 1 |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 23 |
---|---|
Distinct (%) | 76.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.04471 |
Minimum | 0.0084 |
---|---|
Maximum | 0.2661 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0084 |
---|---|
5-th percentile | 0.0112 |
Q1 | 0.0182 |
median | 0.0266 |
Q3 | 0.04025 |
95-th percentile | 0.1764 |
Maximum | 0.2661 |
Range | 0.2577 |
Interquartile range (IQR) | 0.02205 |
Descriptive statistics
Standard deviation | 0.057816167 |
---|---|
Coefficient of variation (CV) | 1.2931373 |
Kurtosis | 8.5146653 |
Mean | 0.04471 |
Median Absolute Deviation (MAD) | 0.0084 |
Skewness | 2.9426553 |
Sum | 1.3413 |
Variance | 0.0033427092 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0294 | 3 | 10.0% |
0.0182 | 3 | 10.0% |
0.0112 | 2 | 6.7% |
0.0196 | 2 | 6.7% |
0.0266 | 2 | 6.7% |
0.2661 | 1 | 3.3% |
0.0224 | 1 | 3.3% |
0.0154 | 1 | 3.3% |
0.0546 | 1 | 3.3% |
0.021 | 1 | 3.3% |
Other values (13) | 13 |
Value | Count | Frequency (%) |
0.0084 | 1 | 3.3% |
0.0112 | 2 | |
0.0126 | 1 | 3.3% |
0.0154 | 1 | 3.3% |
0.0168 | 1 | 3.3% |
0.0182 | 3 | |
0.0196 | 2 | |
0.021 | 1 | 3.3% |
0.0224 | 1 | 3.3% |
0.0252 | 1 | 3.3% |
Value | Count | Frequency (%) |
0.2661 | 1 | |
0.2016 | 1 | |
0.1456 | 1 | |
0.0574 | 1 | |
0.0546 | 1 | |
0.0518 | 1 | |
0.0434 | 1 | |
0.042 | 1 | |
0.035 | 1 | |
0.0336 | 1 |
매개중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.040926667 |
Minimum | 0.0002 |
---|---|
Maximum | 0.3772 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0002 |
---|---|
5-th percentile | 0.00058 |
Q1 | 0.003075 |
median | 0.01125 |
Q3 | 0.031325 |
95-th percentile | 0.226185 |
Maximum | 0.3772 |
Range | 0.377 |
Interquartile range (IQR) | 0.02825 |
Descriptive statistics
Standard deviation | 0.084462822 |
---|---|
Coefficient of variation (CV) | 2.0637601 |
Kurtosis | 9.8558337 |
Mean | 0.040926667 |
Median Absolute Deviation (MAD) | 0.01005 |
Skewness | 3.1500579 |
Sum | 1.2278 |
Variance | 0.0071339682 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0014 | 2 | 6.7% |
0.2688 | 1 | 3.3% |
0.0002 | 1 | 3.3% |
0.0077 | 1 | 3.3% |
0.0122 | 1 | 3.3% |
0.0222 | 1 | 3.3% |
0.0532 | 1 | 3.3% |
0.0008 | 1 | 3.3% |
0.0082 | 1 | 3.3% |
0.0067 | 1 | 3.3% |
Other values (19) | 19 |
Value | Count | Frequency (%) |
0.0002 | 1 | |
0.0004 | 1 | |
0.0008 | 1 | |
0.001 | 1 | |
0.0014 | 2 | |
0.0029 | 1 | |
0.003 | 1 | |
0.0033 | 1 | |
0.0057 | 1 | |
0.0067 | 1 |
Value | Count | Frequency (%) |
0.3772 | 1 | |
0.2688 | 1 | |
0.1741 | 1 | |
0.0532 | 1 | |
0.0464 | 1 | |
0.0399 | 1 | |
0.0367 | 1 | |
0.0327 | 1 | |
0.0272 | 1 | |
0.0254 | 1 |
분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.681 | 0.531 | 0.396 | 0.310 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.681 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
단어중요도 | 0.531 | 1.000 | 0.000 | 1.000 | 0.000 | 0.243 |
연결정도중심성 | 0.396 | 1.000 | 1.000 | 0.000 | 1.000 | 0.992 |
매개중심성 | 0.310 | 1.000 | 1.000 | 0.243 | 0.992 | 1.000 |
분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | 0.301 | -0.587 | -0.554 |
단어빈도 | -1.000 | 1.000 | -0.304 | 0.591 | 0.560 |
단어중요도 | 0.301 | -0.304 | 1.000 | -0.269 | -0.362 |
연결정도중심성 | -0.587 | 0.591 | -0.269 | 1.000 | 0.899 |
매개중심성 | -0.554 | 0.560 | -0.362 | 0.899 | 1.000 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 맛집 | 1728 | 0.0417 | 0.2661 | 0.3772 |
1 | 2 | 2010-01 | 경기도 | 1724 | 0.0252 | 0.2016 | 0.2688 |
2 | 3 | 2010-01 | 음식점 | 860 | 0.0253 | 0.1456 | 0.1741 |
3 | 4 | 2010-01 | 음식 | 231 | 0.0247 | 0.0434 | 0.0327 |
4 | 5 | 2010-01 | 수원 | 214 | 0.0319 | 0.0294 | 0.0254 |
5 | 6 | 2010-01 | 게 | 209 | 0.0238 | 0.0294 | 0.0272 |
6 | 7 | 2010-01 | 서울 | 192 | 0.0261 | 0.035 | 0.0203 |
7 | 8 | 2010-01 | 요리 | 165 | 0.0359 | 0.0574 | 0.0464 |
8 | 9 | 2010-01 | 성남 | 154 | 0.027 | 0.0182 | 0.003 |
9 | 10 | 2010-01 | 카페 | 141 | 0.0262 | 0.0336 | 0.0399 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 여행 | 96 | 0.0261 | 0.0308 | 0.0162 |
21 | 22 | 2010-01 | 안산 | 88 | 0.0304 | 0.021 | 0.0067 |
22 | 23 | 2010-01 | 부천 | 87 | 0.0329 | 0.0196 | 0.0082 |
23 | 24 | 2010-01 | 가평 | 84 | 0.034 | 0.0112 | 0.0008 |
24 | 25 | 2010-01 | 전문점 | 84 | 0.0286 | 0.0546 | 0.0532 |
25 | 26 | 2010-01 | 양평 | 82 | 0.032 | 0.0196 | 0.0014 |
26 | 27 | 2010-01 | 배달 | 79 | 0.0227 | 0.0294 | 0.0222 |
27 | 28 | 2010-01 | 파주 | 78 | 0.0343 | 0.0266 | 0.0122 |
28 | 29 | 2010-01 | 화성 | 78 | 0.0348 | 0.0154 | 0.0077 |
29 | 30 | 2010-01 | 의왕 | 74 | 0.0393 | 0.0112 | 0.0002 |