Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | Sample data |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/285f43c8-94ad-4f8d-aafe-bd05a2042162 |
수집년월 has constant value "2010-01" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 1 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 1 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 연결정도중심성 | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 14:03:46.055696 |
---|---|
Analysis finished | 2023-12-10 14:03:51.193324 |
Duration | 5.14 seconds |
Software version | ydata-profiling v4.5.1 |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
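The quantile figures above can be reproduced by hand. The sketch below assumes that 분석인덱스 holds the integers 1 to 30 (the minimum, maximum, distinct count, and sum all suggest this, but the report does not list every value) and that percentiles are linearly interpolated between the sample minimum and maximum, which is what Python's `statistics.quantiles` does with `method="inclusive"`.

```python
import statistics

# Assumption: the column is exactly 1, 2, ..., 30.
values = [float(i) for i in range(1, 31)]

# Quartiles with linear interpolation between the sample min and max.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Splitting into 20 groups gives cut points at 5%, 10%, ..., 95%.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95, iqr, value_range)
```

Under these assumptions the computed values agree with the table: Q1 8.25, median 15.5, Q3 22.75, and an IQR of 14.5.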
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
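As a cross-check on the descriptive statistics above, here is a minimal sketch using only the Python standard library. It assumes the column is exactly the integers 1 to 30 (not stated outright in the report) and that the variance is the sample variance with an n - 1 denominator, which is the only choice consistent with the reported 77.5.

```python
import math
import statistics

# Assumption: the column is exactly 1, 2, ..., 30.
values = [float(i) for i in range(1, 31)]

total = sum(values)
mean = statistics.mean(values)

# Sample variance (divides by n - 1) and its square root.
variance = statistics.variance(values)
std_dev = math.sqrt(variance)

# Coefficient of variation: standard deviation relative to the mean.
cv = std_dev / mean

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, std_dev, cv, mad)
```

The same relationships (CV = standard deviation / mean, standard deviation = square root of variance) hold for the other numeric columns in this report.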
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
2 | 1 | 3.3% |
3 | 1 | 3.3% |
4 | 1 | 3.3% |
5 | 1 | 3.3% |
6 | 1 | 3.3% |
7 | 1 | 3.3% |
8 | 1 | 3.3% |
9 | 1 | 3.3% |
10 | 1 | 3.3% |
Value | Count | Frequency (%) |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
22 | 1 | 3.3% |
21 | 1 | 3.3% |
수집년월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
2010-01 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
2010-01 | 30 | 100.0% |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
경기도 | 1 | 3.3% |
축제 | 1 | 3.3% |
인천 | 1 | 3.3% |
영상 | 1 | 3.3% |
도시 | 1 | 3.3% |
사진 | 1 | 3.3% |
광주 | 1 | 3.3% |
세계 | 1 | 3.3% |
뉴스 | 1 | 3.3% |
전국 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
천 | 3 | 4.7% |
도 | 3 | 4.7% |
울 | 3 | 4.7% |
원 | 2 | 3.1% |
양 | 2 | 3.1% |
주 | 2 | 3.1% |
행 | 2 | 3.1% |
시 | 2 | 3.1% |
평 | 2 | 3.1% |
겨 | 2 | 3.1% |
Other values (38) | 41 | 64.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 64 | 100.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
천 | 3 | 4.7% |
도 | 3 | 4.7% |
울 | 3 | 4.7% |
원 | 2 | 3.1% |
양 | 2 | 3.1% |
주 | 2 | 3.1% |
행 | 2 | 3.1% |
시 | 2 | 3.1% |
평 | 2 | 3.1% |
겨 | 2 | 3.1% |
Other values (38) | 41 | 64.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 64 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
천 | 3 | 4.7% |
도 | 3 | 4.7% |
울 | 3 | 4.7% |
원 | 2 | 3.1% |
양 | 2 | 3.1% |
주 | 2 | 3.1% |
행 | 2 | 3.1% |
시 | 2 | 3.1% |
평 | 2 | 3.1% |
겨 | 2 | 3.1% |
Other values (38) | 41 | 64.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 64 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
천 | 3 | 4.7% |
도 | 3 | 4.7% |
울 | 3 | 4.7% |
원 | 2 | 3.1% |
양 | 2 | 3.1% |
주 | 2 | 3.1% |
행 | 2 | 3.1% |
시 | 2 | 3.1% |
평 | 2 | 3.1% |
겨 | 2 | 3.1% |
Other values (38) | 41 | 64.1% |
단어빈도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 183.83333 |
Minimum | 63 |
---|---|
Maximum | 1367 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 63 |
---|---|
5-th percentile | 65.9 |
Q1 | 82.25 |
median | 101 |
Q3 | 136 |
95-th percentile | 728.05 |
Maximum | 1367 |
Range | 1304 |
Interquartile range (IQR) | 53.75 |
Descriptive statistics
Standard deviation | 295.36548 |
---|---|
Coefficient of variation (CV) | 1.6067025 |
Kurtosis | 12.392562 |
Mean | 183.83333 |
Median Absolute Deviation (MAD) | 24 |
Skewness | 3.6277361 |
Sum | 5515 |
Variance | 87240.764 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
98 | 2 | 6.7% |
1147 | 1 | 3.3% |
63 | 1 | 3.3% |
65 | 1 | 3.3% |
67 | 1 | 3.3% |
69 | 1 | 3.3% |
73 | 1 | 3.3% |
78 | 1 | 3.3% |
80 | 1 | 3.3% |
82 | 1 | 3.3% |
Other values (19) | 19 | 63.3% |
Value | Count | Frequency (%) |
63 | 1 | 3.3% |
65 | 1 | 3.3% |
67 | 1 | 3.3% |
69 | 1 | 3.3% |
73 | 1 | 3.3% |
78 | 1 | 3.3% |
80 | 1 | 3.3% |
82 | 1 | 3.3% |
83 | 1 | 3.3% |
85 | 1 | 3.3% |
Value | Count | Frequency (%) |
1367 | 1 | 3.3% |
1147 | 1 | 3.3% |
216 | 1 | 3.3% |
191 | 1 | 3.3% |
152 | 1 | 3.3% |
145 | 1 | 3.3% |
140 | 1 | 3.3% |
138 | 1 | 3.3% |
130 | 1 | 3.3% |
126 | 1 | 3.3% |
단어중요도
Real number (ℝ)
Distinct | 26 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.029373333 |
Minimum | 0.0215 |
---|---|
Maximum | 0.0676 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0215 |
---|---|
5-th percentile | 0.023405 |
Q1 | 0.024725 |
median | 0.0274 |
Q3 | 0.031475 |
95-th percentile | 0.038615 |
Maximum | 0.0676 |
Range | 0.0461 |
Interquartile range (IQR) | 0.00675 |
Descriptive statistics
Standard deviation | 0.0084077933 |
---|---|
Coefficient of variation (CV) | 0.28623899 |
Kurtosis | 15.057312 |
Mean | 0.029373333 |
Median Absolute Deviation (MAD) | 0.003 |
Skewness | 3.4785794 |
Sum | 0.8812 |
Variance | 7.0690989 × 10⁻⁵ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0244 | 2 | 6.7% |
0.0275 | 2 | 6.7% |
0.0239 | 2 | 6.7% |
0.0319 | 2 | 6.7% |
0.0342 | 1 | 3.3% |
0.0273 | 1 | 3.3% |
0.0247 | 1 | 3.3% |
0.0243 | 1 | 3.3% |
0.023 | 1 | 3.3% |
0.0311 | 1 | 3.3% |
Other values (16) | 16 | 53.3% |
Value | Count | Frequency (%) |
0.0215 | 1 | 3.3% |
0.023 | 1 | 3.3% |
0.0239 | 2 | 6.7% |
0.0243 | 1 | 3.3% |
0.0244 | 2 | 6.7% |
0.0247 | 1 | 3.3% |
0.0248 | 1 | 3.3% |
0.0251 | 1 | 3.3% |
0.0256 | 1 | 3.3% |
0.0259 | 1 | 3.3% |
Value | Count | Frequency (%) |
0.0676 | 1 | 3.3% |
0.041 | 1 | 3.3% |
0.0357 | 1 | 3.3% |
0.0342 | 1 | 3.3% |
0.0324 | 1 | 3.3% |
0.0319 | 2 | 6.7% |
0.0316 | 1 | 3.3% |
0.0311 | 1 | 3.3% |
0.0303 | 1 | 3.3% |
0.0302 | 1 | 3.3% |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 53.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.063306667 |
Minimum | 0.0231 |
---|---|
Maximum | 0.3333 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0231 |
---|---|
5-th percentile | 0.02571 |
Q1 | 0.0347 |
median | 0.04195 |
Q3 | 0.0521 |
95-th percentile | 0.22097 |
Maximum | 0.3333 |
Range | 0.3102 |
Interquartile range (IQR) | 0.0174 |
Descriptive statistics
Standard deviation | 0.073508915 |
---|---|
Coefficient of variation (CV) | 1.161156 |
Kurtosis | 10.59939 |
Mean | 0.063306667 |
Median Absolute Deviation (MAD) | 0.01015 |
Skewness | 3.3428024 |
Sum | 1.8992 |
Variance | 0.0054035606 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0405 | 4 | 13.3% |
0.0521 | 3 | 10.0% |
0.0434 | 3 | 10.0% |
0.0347 | 3 | 10.0% |
0.0289 | 3 | 10.0% |
0.0231 | 2 | 6.7% |
0.0318 | 2 | 6.7% |
0.0463 | 2 | 6.7% |
0.0869 | 1 | 3.3% |
0.0492 | 1 | 3.3% |
Other values (6) | 6 | 20.0% |
Value | Count | Frequency (%) |
0.0231 | 2 | 6.7% |
0.0289 | 3 | 10.0% |
0.0318 | 2 | 6.7% |
0.0347 | 3 | 10.0% |
0.0376 | 1 | 3.3% |
0.0405 | 4 | 13.3% |
0.0434 | 3 | 10.0% |
0.0463 | 2 | 6.7% |
0.0492 | 1 | 3.3% |
0.0521 | 3 | 10.0% |
Value | Count | Frequency (%) |
0.3333 | 1 | 3.3% |
0.3188 | 1 | 3.3% |
0.1014 | 1 | 3.3% |
0.0869 | 1 | 3.3% |
0.0753 | 1 | 3.3% |
0.055 | 1 | 3.3% |
0.0521 | 3 | 10.0% |
0.0492 | 1 | 3.3% |
0.0463 | 2 | 6.7% |
0.0434 | 3 | 10.0% |
매개중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.04229 |
Minimum | 0.0014 |
---|---|
Maximum | 0.3883 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0014 |
---|---|
5-th percentile | 0.003825 |
Q1 | 0.0103 |
median | 0.01425 |
Q3 | 0.022825 |
95-th percentile | 0.237745 |
Maximum | 0.3883 |
Range | 0.3869 |
Interquartile range (IQR) | 0.012525 |
Descriptive statistics
Standard deviation | 0.092114709 |
---|---|
Coefficient of variation (CV) | 2.1781676 |
Kurtosis | 11.340519 |
Mean | 0.04229 |
Median Absolute Deviation (MAD) | 0.00515 |
Skewness | 3.4855595 |
Sum | 1.2687 |
Variance | 0.0084851196 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0115 | 2 | 6.7% |
0.3628 | 1 | 3.3% |
0.0097 | 1 | 3.3% |
0.0236 | 1 | 3.3% |
0.0135 | 1 | 3.3% |
0.0096 | 1 | 3.3% |
0.0185 | 1 | 3.3% |
0.0014 | 1 | 3.3% |
0.0109 | 1 | 3.3% |
0.0061 | 1 | 3.3% |
Other values (19) | 19 | 63.3% |
Value | Count | Frequency (%) |
0.0014 | 1 | 3.3% |
0.0036 | 1 | 3.3% |
0.0041 | 1 | 3.3% |
0.0043 | 1 | 3.3% |
0.0061 | 1 | 3.3% |
0.0096 | 1 | 3.3% |
0.0097 | 1 | 3.3% |
0.0101 | 1 | 3.3% |
0.0109 | 1 | 3.3% |
0.0114 | 1 | 3.3% |
Value | Count | Frequency (%) |
0.3883 | 1 | 3.3% |
0.3628 | 1 | 3.3% |
0.0849 | 1 | 3.3% |
0.0505 | 1 | 3.3% |
0.0413 | 1 | 3.3% |
0.0295 | 1 | 3.3% |
0.0254 | 1 | 3.3% |
0.0236 | 1 | 3.3% |
0.0205 | 1 | 3.3% |
0.0199 | 1 | 3.3% |
| | 분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.353 | 0.000 | 0.622 | 0.607 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.353 | 1.000 | 1.000 | 0.675 | 0.935 | 0.854 |
단어중요도 | 0.000 | 1.000 | 0.675 | 1.000 | 0.328 | 0.000 |
연결정도중심성 | 0.622 | 1.000 | 0.935 | 0.328 | 1.000 | 0.976 |
매개중심성 | 0.607 | 1.000 | 0.854 | 0.000 | 0.976 | 1.000 |
| | 분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | -0.231 | -0.531 | -0.340 |
단어빈도 | -1.000 | 1.000 | 0.230 | 0.532 | 0.337 |
단어중요도 | -0.231 | 0.230 | 1.000 | 0.137 | -0.034 |
연결정도중심성 | -0.531 | 0.532 | 0.137 | 1.000 | 0.706 |
매개중심성 | -0.340 | 0.337 | -0.034 | 0.706 | 1.000 |
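The report does not label the matrix above. The -1.000 between 분석인덱스 and 단어빈도, one column strictly increasing and the other decreasing, is what a rank correlation would produce, so the following sketch assumes a Spearman-style coefficient. It is a plain-Python illustration and not the profiler's own code; `average_ranks`, `pearson`, and `spearman` are hypothetical helper names, and only the first ten sample rows shown in this report are used, so figures for the full 30 rows may differ.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# First ten sample rows from this report (분석인덱스 and 단어빈도).
index = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
frequency = [1367, 1147, 216, 191, 152, 145, 140, 138, 130, 126]

rho = spearman(index, frequency)
print(round(rho, 3))
```

Because 단어빈도 falls at every step across these ten rows, the ranks are perfectly reversed and the coefficient sits at the negative extreme, in line with the -1.000 entry.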
| | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 경기도 | 1367 | 0.0244 | 0.3188 | 0.3628 |
1 | 2 | 2010-01 | 축제 | 1147 | 0.0281 | 0.3333 | 0.3883 |
2 | 3 | 2010-01 | 가평 | 216 | 0.041 | 0.055 | 0.0141 |
3 | 4 | 2010-01 | 문화 | 191 | 0.0259 | 0.1014 | 0.0849 |
4 | 5 | 2010-01 | 시장 | 152 | 0.0316 | 0.0405 | 0.0115 |
5 | 6 | 2010-01 | 서울 | 145 | 0.0262 | 0.0753 | 0.0413 |
6 | 7 | 2010-01 | 포천 | 140 | 0.0324 | 0.0405 | 0.0043 |
7 | 8 | 2010-01 | 부천 | 138 | 0.0302 | 0.0434 | 0.0114 |
8 | 9 | 2010-01 | 예술 | 130 | 0.0291 | 0.0347 | 0.0101 |
9 | 10 | 2010-01 | 행사 | 126 | 0.0251 | 0.0521 | 0.0295 |
| | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 공연 | 85 | 0.0357 | 0.0463 | 0.0199 |
21 | 22 | 2010-01 | 전국 | 83 | 0.0239 | 0.0521 | 0.0183 |
22 | 23 | 2010-01 | 뉴스 | 82 | 0.0244 | 0.0289 | 0.0144 |
23 | 24 | 2010-01 | 세계 | 80 | 0.0248 | 0.0318 | 0.0061 |
24 | 25 | 2010-01 | 광주 | 78 | 0.0311 | 0.0521 | 0.0109 |
25 | 26 | 2010-01 | 사진 | 73 | 0.023 | 0.0231 | 0.0014 |
26 | 27 | 2010-01 | 도시 | 69 | 0.0243 | 0.0434 | 0.0185 |
27 | 28 | 2010-01 | 영상 | 67 | 0.0247 | 0.0231 | 0.0096 |
28 | 29 | 2010-01 | 인천 | 65 | 0.0273 | 0.0405 | 0.0135 |
29 | 30 | 2010-01 | 양평 | 63 | 0.0342 | 0.0347 | 0.0236 |