Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | Sample data |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/432d9660-ac1e-4714-9fdb-cd03c131cff1 |
Alerts
수집년월 has constant value "2010-01" | Constant |
---|---|
분석인덱스 is highly overall correlated with 단어빈도 and 3 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 3 other fields | High correlation |
단어중요도 is highly overall correlated with 분석인덱스 and 1 other field | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 14:03:10.676577 |
---|---|
Analysis finished | 2023-12-10 14:03:14.882132 |
Duration | 4.21 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
분석인덱스 (analysis index)
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
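The figures reported for 분석인덱스 can be checked by hand, because the value tables suggest the column is simply the integers 1 through 30. The sketch below uses only Python's standard library. It assumes a sample (n − 1) variance and linear interpolation between order statistics for the percentiles; these choices reproduce the numbers above, but the report itself does not state which estimators it uses.

```python
import math
import statistics

# Assumption: the column holds the integers 1..30 (30 distinct values,
# minimum 1, maximum 30, sum 465, strictly increasing).
values = list(range(1, 31))

mean = statistics.fmean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                      # coefficient of variation

# Median absolute deviation: median of the distances from the median.
mad = statistics.median([abs(v - median) for v in values])

# "inclusive" interpolates linearly between the observed minimum and maximum.
quartiles = statistics.quantiles(values, n=4, method="inclusive")
q1, q3 = quartiles[0], quartiles[2]
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1

print(f"sum={sum(values)} mean={mean} median={median}")
print(f"variance={variance} std={std_dev:.7f} cv={cv:.8f} mad={mad}")
print(f"p5={p5:.2f} q1={q1} q3={q3} p95={p95:.2f} iqr={iqr}")
```

The same pattern applies to the other numeric columns, although their full value lists are not reproduced in this report.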
Common values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | 3.3% |
2 | 1 | 3.3% |
3 | 1 | 3.3% |
4 | 1 | 3.3% |
5 | 1 | 3.3% |
6 | 1 | 3.3% |
7 | 1 | 3.3% |
8 | 1 | 3.3% |
9 | 1 | 3.3% |
10 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
22 | 1 | 3.3% |
21 | 1 | 3.3% |
수집년월 (collection year-month)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
2010-01 | 30 | 100.0% |
키워드명 (keyword)
Text
UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
---|---|---|
편의점 | 1 | 3.3% |
서점 | 1 | 3.3% |
학교 | 1 | 3.3% |
친구 | 1 | 3.3% |
공부 | 1 | 3.3% |
이용 | 1 | 3.3% |
구매 | 1 | 3.3% |
온라인 | 1 | 3.3% |
여행 | 1 | 3.3% |
근처 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
---|---|---|
구 | 4 | 5.7% |
점 | 4 | 5.7% |
서 | 3 | 4.3% |
인 | 3 | 4.3% |
이 | 3 | 4.3% |
아 | 2 | 2.9% |
터 | 2 | 2.9% |
넷 | 2 | 2.9% |
마 | 2 | 2.9% |
가 | 2 | 2.9% |
Other values (42) | 43 | 61.4% |
Most occurring categories
Value | Count | Frequency (%) |
---|---|---|
Other Letter | 70 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
---|---|---|
Hangul | 70 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
---|---|---|
Hangul | 70 | 100.0% |
All 70 characters fall into a single category (Other Letter), script (Hangul), and block (Hangul), so the most frequent characters per category, per script, and per block are identical to the list under "Most occurring characters".
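The "Other Letter" label corresponds to the Unicode general category `Lo`, which is where Hangul syllables are classified. As a minimal sketch of how such character counts can be derived, the snippet below tallies characters, categories, and scripts for three keywords taken from the sample rows (편의점, 서점, 문구점). It covers only those three keywords, not the full 30-keyword column, so the counts will not match the 70-character totals above. The script is approximated from the first word of the Unicode character name, which is an assumption made here because the standard library offers no direct script lookup; it is not necessarily how the profiling tool classifies scripts.

```python
import unicodedata
from collections import Counter

# Three keywords from the sample rows; the full column has 30 keywords.
keywords = ["편의점", "서점", "문구점"]
chars = [ch for word in keywords for ch in word]

# How often each character occurs across the keywords.
char_counts = Counter(chars)

# Unicode general category of each character ("Lo" = Letter, other).
category_counts = Counter(unicodedata.category(ch) for ch in chars)

# Rough script guess: first word of the Unicode name, e.g. "HANGUL SYLLABLE JEOM".
script_counts = Counter(unicodedata.name(ch).split()[0] for ch in chars)

print(char_counts.most_common(3))
print(dict(category_counts))
print(dict(script_counts))
```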
단어빈도 (word frequency)
Real number (ℝ)
HIGH CORRELATION
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2292.8 |
Minimum | 758 |
Maximum | 13694 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 758 |
---|---|
5-th percentile | 789.8 |
Q1 | 930.5 |
median | 1114.5 |
Q3 | 1467.5 |
95-th percentile | 11174.9 |
Maximum | 13694 |
Range | 12936 |
Interquartile range (IQR) | 537 |
Descriptive statistics
Standard deviation | 3339.96 |
---|---|
Coefficient of variation (CV) | 1.4567167 |
Kurtosis | 7.1344904 |
Mean | 2292.8 |
Median Absolute Deviation (MAD) | 252.5 |
Skewness | 2.8638163 |
Sum | 68784 |
Variance | 11155333 |
Monotonicity | Decreasing |
Common values
Value | Count | Frequency (%) |
---|---|---|
1043 | 2 | 6.7% |
13694 | 1 | 3.3% |
12551 | 1 | 3.3% |
758 | 1 | 3.3% |
761 | 1 | 3.3% |
825 | 1 | 3.3% |
855 | 1 | 3.3% |
869 | 1 | 3.3% |
882 | 1 | 3.3% |
913 | 1 | 3.3% |
Other values (19) | 19 | 63.3% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
758 | 1 | 3.3% |
761 | 1 | 3.3% |
825 | 1 | 3.3% |
855 | 1 | 3.3% |
869 | 1 | 3.3% |
882 | 1 | 3.3% |
913 | 1 | 3.3% |
929 | 1 | 3.3% |
935 | 1 | 3.3% |
951 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
13694 | 1 | 3.3% |
12551 | 1 | 3.3% |
9493 | 1 | 3.3% |
2757 | 1 | 3.3% |
2024 | 1 | 3.3% |
1955 | 1 | 3.3% |
1801 | 1 | 3.3% |
1477 | 1 | 3.3% |
1439 | 1 | 3.3% |
1432 | 1 | 3.3% |
단어중요도 (word importance)
Real number (ℝ)
HIGH CORRELATION
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.026363333 |
Minimum | 0.0209 |
Maximum | 0.0371 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0209 |
---|---|
5-th percentile | 0.022445 |
Q1 | 0.023975 |
median | 0.02475 |
Q3 | 0.0282 |
95-th percentile | 0.033985 |
Maximum | 0.0371 |
Range | 0.0162 |
Interquartile range (IQR) | 0.004225 |
Descriptive statistics
Standard deviation | 0.0040046941 |
---|---|
Coefficient of variation (CV) | 0.15190394 |
Kurtosis | 0.81087938 |
Mean | 0.026363333 |
Median Absolute Deviation (MAD) | 0.0015 |
Skewness | 1.2350555 |
Sum | 0.7909 |
Variance | 1.6037575 × 10⁻⁵ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.0244 | 2 | 6.7% |
0.0242 | 2 | 6.7% |
0.0295 | 1 | 3.3% |
0.0325 | 1 | 3.3% |
0.0263 | 1 | 3.3% |
0.0225 | 1 | 3.3% |
0.0233 | 1 | 3.3% |
0.0235 | 1 | 3.3% |
0.0254 | 1 | 3.3% |
0.0224 | 1 | 3.3% |
Other values (18) | 18 | 60.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0209 | 1 | 3.3% |
0.0224 | 1 | 3.3% |
0.0225 | 1 | 3.3% |
0.023 | 1 | 3.3% |
0.0232 | 1 | 3.3% |
0.0233 | 1 | 3.3% |
0.0235 | 1 | 3.3% |
0.0239 | 1 | 3.3% |
0.0242 | 2 | 6.7% |
0.0243 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0371 | 1 | 3.3% |
0.0352 | 1 | 3.3% |
0.0325 | 1 | 3.3% |
0.0323 | 1 | 3.3% |
0.0317 | 1 | 3.3% |
0.0304 | 1 | 3.3% |
0.0295 | 1 | 3.3% |
0.0287 | 1 | 3.3% |
0.0267 | 1 | 3.3% |
0.0263 | 1 | 3.3% |
연결정도중심성 (degree centrality)
Real number (ℝ)
HIGH CORRELATION
Distinct | 25 |
---|---|
Distinct (%) | 83.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.071833333 |
Minimum | 0.0268 |
Maximum | 0.2848 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0268 |
---|---|
5-th percentile | 0.02933 |
Q1 | 0.0435 |
median | 0.054 |
Q3 | 0.0719 |
95-th percentile | 0.195955 |
Maximum | 0.2848 |
Range | 0.258 |
Interquartile range (IQR) | 0.0284 |
Descriptive statistics
Standard deviation | 0.056906913 |
---|---|
Coefficient of variation (CV) | 0.79220761 |
Kurtosis | 7.1722051 |
Mean | 0.071833333 |
Median Absolute Deviation (MAD) | 0.01195 |
Skewness | 2.6498933 |
Sum | 2.155 |
Variance | 0.0032383968 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.0468 | 3 | 10.0% |
0.0435 | 2 | 6.7% |
0.054 | 2 | 6.7% |
0.0612 | 2 | 6.7% |
0.065 | 1 | 3.3% |
0.0268 | 1 | 3.3% |
0.0287 | 1 | 3.3% |
0.0392 | 1 | 3.3% |
0.0425 | 1 | 3.3% |
0.0416 | 1 | 3.3% |
Other values (15) | 15 | 50.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0268 | 1 | 3.3% |
0.0287 | 1 | 3.3% |
0.0301 | 1 | 3.3% |
0.0382 | 1 | 3.3% |
0.0392 | 1 | 3.3% |
0.0416 | 1 | 3.3% |
0.0425 | 1 | 3.3% |
0.0435 | 2 | 6.7% |
0.0459 | 1 | 3.3% |
0.0468 | 3 | 10.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.2848 | 1 | 3.3% |
0.214 | 1 | 3.3% |
0.1739 | 1 | 3.3% |
0.1218 | 1 | 3.3% |
0.0827 | 1 | 3.3% |
0.0736 | 1 | 3.3% |
0.0731 | 1 | 3.3% |
0.0726 | 1 | 3.3% |
0.0698 | 1 | 3.3% |
0.065 | 1 | 3.3% |
매개중심성 (betweenness centrality)
Real number (ℝ)
HIGH CORRELATION
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.025093333 |
Minimum | 0.005 |
Maximum | 0.2153 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.005 |
---|---|
5-th percentile | 0.0061 |
Q1 | 0.00745 |
median | 0.0117 |
Q3 | 0.0185 |
95-th percentile | 0.10042 |
Maximum | 0.2153 |
Range | 0.2103 |
Interquartile range (IQR) | 0.01105 |
Descriptive statistics
Standard deviation | 0.042782327 |
---|---|
Coefficient of variation (CV) | 1.704928 |
Kurtosis | 14.144417 |
Mean | 0.025093333 |
Median Absolute Deviation (MAD) | 0.00505 |
Skewness | 3.6208492 |
Sum | 0.7528 |
Variance | 0.0018303275 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.0061 | 2 | 6.7% |
0.0073 | 2 | 6.7% |
0.0145 | 1 | 3.3% |
0.0087 | 1 | 3.3% |
0.0082 | 1 | 3.3% |
0.0071 | 1 | 3.3% |
0.005 | 1 | 3.3% |
0.0064 | 1 | 3.3% |
0.0169 | 1 | 3.3% |
0.0129 | 1 | 3.3% |
Other values (18) | 18 | 60.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.005 | 1 | 3.3% |
0.0061 | 2 | 6.7% |
0.0064 | 1 | 3.3% |
0.0068 | 1 | 3.3% |
0.0071 | 1 | 3.3% |
0.0073 | 2 | 6.7% |
0.0079 | 1 | 3.3% |
0.0082 | 1 | 3.3% |
0.0085 | 1 | 3.3% |
0.0087 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.2153 | 1 | 3.3% |
0.1114 | 1 | 3.3% |
0.087 | 1 | 3.3% |
0.0387 | 1 | 3.3% |
0.0208 | 1 | 3.3% |
0.0206 | 1 | 3.3% |
0.0203 | 1 | 3.3% |
0.0188 | 1 | 3.3% |
0.0176 | 1 | 3.3% |
0.0171 | 1 | 3.3% |
Correlations
Variable | 분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.624 | 0.123 | 0.377 | 0.437 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.624 | 1.000 | 1.000 | 0.791 | 1.000 | 1.000 |
단어중요도 | 0.123 | 1.000 | 0.791 | 1.000 | 0.663 | 0.891 |
연결정도중심성 | 0.377 | 1.000 | 1.000 | 0.663 | 1.000 | 1.000 |
매개중심성 | 0.437 | 1.000 | 1.000 | 0.891 | 1.000 | 1.000 |
Variable | 분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | -0.540 | -0.649 | -0.653 |
단어빈도 | -1.000 | 1.000 | 0.538 | 0.653 | 0.654 |
단어중요도 | -0.540 | 0.538 | 1.000 | 0.288 | 0.245 |
연결정도중심성 | -0.649 | 0.653 | 0.288 | 1.000 | 0.890 |
매개중심성 | -0.653 | 0.654 | 0.245 | 0.890 | 1.000 |
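The report does not label the two matrices above. The −1.000 between 분석인덱스 and 단어빈도 in the second one is what a rank-based coefficient such as Spearman's would give when one column rises while the other falls, so the sketch below illustrates that mechanism in plain Python. It is an assumption that the matrix is rank-based, and the data are only the first ten sample rows from this report, so the result illustrates the idea rather than reproducing the 30-row figures. The helper names `average_ranks`, `pearson`, and `spearman` are hypothetical.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y)

def spearman(xs, ys):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))

# First ten sample rows of the report: 분석인덱스 and 단어빈도.
index = list(range(1, 11))
frequency = [13694, 12551, 9493, 2757, 2024, 1955, 1801, 1477, 1439, 1432]

rho = spearman(index, frequency)
print(f"{rho:.3f}")
```

Because the frequencies fall strictly as the index rises in these ten rows, every rank pair is perfectly reversed and the coefficient is −1 up to floating-point rounding.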
First rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 편의점 | 13694 | 0.0295 | 0.2848 | 0.2153 |
1 | 2 | 2010-01 | 서점 | 12551 | 0.0262 | 0.214 | 0.1114 |
2 | 3 | 2010-01 | 책 | 9493 | 0.0317 | 0.1739 | 0.087 |
3 | 4 | 2010-01 | 슈퍼마켓 | 2757 | 0.0267 | 0.1218 | 0.0387 |
4 | 5 | 2010-01 | 구입 | 2024 | 0.0242 | 0.054 | 0.0124 |
5 | 6 | 2010-01 | 문구점 | 1955 | 0.0304 | 0.0698 | 0.0206 |
6 | 7 | 2010-01 | 아이 | 1801 | 0.0249 | 0.0535 | 0.0136 |
7 | 8 | 2010-01 | 가격 | 1477 | 0.0251 | 0.0578 | 0.0171 |
8 | 9 | 2010-01 | 택배 | 1439 | 0.0352 | 0.0382 | 0.0068 |
9 | 10 | 2010-01 | 아르바이트 | 1432 | 0.0323 | 0.0468 | 0.0079 |
Last rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 가지 | 951 | 0.0209 | 0.0435 | 0.0093 |
21 | 22 | 2010-01 | 근처 | 935 | 0.0224 | 0.0616 | 0.0129 |
22 | 23 | 2010-01 | 여행 | 929 | 0.0242 | 0.0612 | 0.0169 |
23 | 24 | 2010-01 | 온라인 | 913 | 0.0254 | 0.0459 | 0.0061 |
24 | 25 | 2010-01 | 구매 | 882 | 0.0235 | 0.0416 | 0.0064 |
25 | 26 | 2010-01 | 이용 | 869 | 0.0233 | 0.0425 | 0.005 |
26 | 27 | 2010-01 | 공부 | 855 | 0.0225 | 0.0392 | 0.0073 |
27 | 28 | 2010-01 | 친구 | 825 | 0.0244 | 0.0287 | 0.0071 |
28 | 29 | 2010-01 | 학교 | 761 | 0.0263 | 0.0468 | 0.0082 |
29 | 30 | 2010-01 | 엄마 | 758 | 0.0244 | 0.0268 | 0.0087 |