Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | Sample data |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/a35d29be-f7f8-4ebb-87ec-0265ea6983d1 |
수집년월 has constant value "2010-01" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
매개중심성 has 7 (23.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 13:44:52.043655 |
---|---|
Analysis finished | 2023-12-10 13:44:56.639664 |
Duration | 4.6 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
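A report of this kind can be regenerated from the source table with ydata-profiling, the tool named in the Software version row. The following is a minimal sketch, not the script that produced this report: the function name `build_profile`, the CSV path and the output file name are hypothetical, and the settings stored in config.json are not reproduced, so details of the output may differ.

```python
from pathlib import Path


def build_profile(csv_path: str, html_path: str = "profile.html") -> Path:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even without the optional packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # csv_path is a hypothetical local copy of the dataset described above.
    df = pd.read_csv(csv_path)

    # Default settings are assumed; the original config.json is not available.
    report = ProfileReport(df, title="Sample data")

    out = Path(html_path)
    report.to_file(out)
    return out
```

Calling `build_profile("sample.csv")` with a local export of the dataset would write `profile.html` next to the script, provided pandas and ydata-profiling are installed.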
분석인덱스 (analysis index)
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
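Because 분석인덱스 takes every integer from 1 to 30 exactly once, the statistics above can be checked by hand. The sketch below recomputes them with the Python standard library, assuming the report uses the sample (n - 1) variance and linear interpolation for percentiles; the helper `percentile` is written for this illustration and is not part of any library.

```python
import statistics

values = list(range(1, 31))  # 분석인덱스: each integer from 1 to 30 appears once

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)


def percentile(sorted_values, p):
    """Linear-interpolation percentile on already sorted data (0 <= p <= 1)."""
    position = p * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


q1, q3 = percentile(values, 0.25), percentile(values, 0.75)

print(sum(values), mean, variance, round(std_dev, 7), round(cv, 8))
print(median, mad, q1, q3, q3 - q1)
```

Under those assumptions the recomputed sum, mean, variance, standard deviation, CV, MAD, quartiles and IQR agree with the figures listed in the tables above.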
Common Values
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
2 | 1 | 3.3% |
3 | 1 | 3.3% |
4 | 1 | 3.3% |
5 | 1 | 3.3% |
6 | 1 | 3.3% |
7 | 1 | 3.3% |
8 | 1 | 3.3% |
9 | 1 | 3.3% |
10 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
22 | 1 | 3.3% |
21 | 1 | 3.3% |
수집년월 (collection year-month)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
2010-01 | 30 | 100.0% |
키워드명 (keyword name)
Text
UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
경기도 | 1 | 3.3% |
데이트 | 1 | 3.3% |
남양주 | 1 | 3.3% |
분위기 | 1 | 3.3% |
고양 | 1 | 3.3% |
겨울 | 1 | 3.3% |
양평 | 1 | 3.3% |
공원 | 1 | 3.3% |
결혼 | 1 | 3.3% |
일산 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
이 | 4 | 5.6% |
양 | 3 | 4.2% |
스 | 2 | 2.8% |
가 | 2 | 2.8% |
울 | 2 | 2.8% |
평 | 2 | 2.8% |
기 | 2 | 2.8% |
남 | 2 | 2.8% |
천 | 2 | 2.8% |
코 | 2 | 2.8% |
Other values (45) | 49 | 68.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 72 | 100.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 4 | 5.6% |
양 | 3 | 4.2% |
스 | 2 | 2.8% |
가 | 2 | 2.8% |
울 | 2 | 2.8% |
평 | 2 | 2.8% |
기 | 2 | 2.8% |
남 | 2 | 2.8% |
천 | 2 | 2.8% |
코 | 2 | 2.8% |
Other values (45) | 49 | 68.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 72 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 4 | 5.6% |
양 | 3 | 4.2% |
스 | 2 | 2.8% |
가 | 2 | 2.8% |
울 | 2 | 2.8% |
평 | 2 | 2.8% |
기 | 2 | 2.8% |
남 | 2 | 2.8% |
천 | 2 | 2.8% |
코 | 2 | 2.8% |
Other values (45) | 49 | 68.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 72 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 4 | 5.6% |
양 | 3 | 4.2% |
스 | 2 | 2.8% |
가 | 2 | 2.8% |
울 | 2 | 2.8% |
평 | 2 | 2.8% |
기 | 2 | 2.8% |
남 | 2 | 2.8% |
천 | 2 | 2.8% |
코 | 2 | 2.8% |
Other values (45) | 49 | 68.1% |
단어빈도 (word frequency)
Real number (ℝ)
HIGH CORRELATION
Distinct | 22 |
---|---|
Distinct (%) | 73.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 42.3 |
Minimum | 14 |
---|---|
Maximum | 291 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 14 |
---|---|
5-th percentile | 15 |
Q1 | 17.25 |
median | 24.5 |
Q3 | 38.5 |
95-th percentile | 143.3 |
Maximum | 291 |
Range | 277 |
Interquartile range (IQR) | 21.25 |
Descriptive statistics
Standard deviation | 58.925523 |
---|---|
Coefficient of variation (CV) | 1.3930384 |
Kurtosis | 12.861769 |
Mean | 42.3 |
Median Absolute Deviation (MAD) | 8.5 |
Skewness | 3.5732089 |
Sum | 1269 |
Variance | 3472.2172 |
Monotonicity | Decreasing |
Common Values
Value | Count | Frequency (%) |
15 | 4 | 13.3% |
25 | 2 | 6.7% |
16 | 2 | 6.7% |
22 | 2 | 6.7% |
30 | 2 | 6.7% |
24 | 2 | 6.7% |
291 | 1 | 3.3% |
14 | 1 | 3.3% |
17 | 1 | 3.3% |
18 | 1 | 3.3% |
Other values (12) | 12 | 40.0% |
Minimum 10 values
Value | Count | Frequency (%) |
14 | 1 | 3.3% |
15 | 4 | 13.3% |
16 | 2 | 6.7% |
17 | 1 | 3.3% |
18 | 1 | 3.3% |
19 | 1 | 3.3% |
22 | 2 | 6.7% |
23 | 1 | 3.3% |
24 | 2 | 6.7% |
25 | 2 | 6.7% |
Maximum 10 values
Value | Count | Frequency (%) |
291 | 1 | 3.3% |
209 | 1 | 3.3% |
63 | 1 | 3.3% |
60 | 1 | 3.3% |
50 | 1 | 3.3% |
42 | 1 | 3.3% |
41 | 1 | 3.3% |
40 | 1 | 3.3% |
34 | 1 | 3.3% |
30 | 2 | 6.7% |
단어중요도 (word importance)
Real number (ℝ)
Distinct | 27 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.03012 |
Minimum | 0.018 |
---|---|
Maximum | 0.0621 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.018 |
---|---|
5-th percentile | 0.01912 |
Q1 | 0.0232 |
median | 0.0261 |
Q3 | 0.03105 |
95-th percentile | 0.05691 |
Maximum | 0.0621 |
Range | 0.0441 |
Interquartile range (IQR) | 0.00785 |
Descriptive statistics
Standard deviation | 0.01159695 |
---|---|
Coefficient of variation (CV) | 0.38502489 |
Kurtosis | 1.9535983 |
Mean | 0.03012 |
Median Absolute Deviation (MAD) | 0.0046 |
Skewness | 1.6179432 |
Sum | 0.9036 |
Variance | 0.00013448924 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.0238 | 2 | 6.7% |
0.0261 | 2 | 6.7% |
0.0231 | 2 | 6.7% |
0.0242 | 1 | 3.3% |
0.0222 | 1 | 3.3% |
0.0287 | 1 | 3.3% |
0.0311 | 1 | 3.3% |
0.0309 | 1 | 3.3% |
0.0282 | 1 | 3.3% |
0.018 | 1 | 3.3% |
Other values (17) | 17 | 56.7% |
Minimum 10 values
Value | Count | Frequency (%) |
0.018 | 1 | 3.3% |
0.0184 | 1 | 3.3% |
0.02 | 1 | 3.3% |
0.0201 | 1 | 3.3% |
0.0209 | 1 | 3.3% |
0.0222 | 1 | 3.3% |
0.0231 | 2 | 6.7% |
0.0235 | 1 | 3.3% |
0.0237 | 1 | 3.3% |
0.0238 | 2 | 6.7% |
Maximum 10 values
Value | Count | Frequency (%) |
0.0621 | 1 | 3.3% |
0.057 | 1 | 3.3% |
0.0568 | 1 | 3.3% |
0.0472 | 1 | 3.3% |
0.0391 | 1 | 3.3% |
0.0376 | 1 | 3.3% |
0.0332 | 1 | 3.3% |
0.0311 | 1 | 3.3% |
0.0309 | 1 | 3.3% |
0.0308 | 1 | 3.3% |
연결정도중심성 (degree centrality)
Real number (ℝ)
HIGH CORRELATION
Distinct | 13 |
---|---|
Distinct (%) | 43.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.052496667 |
Minimum | 0.0069 |
---|---|
Maximum | 0.3402 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0069 |
---|---|
5-th percentile | 0.0069 |
Q1 | 0.0208 |
median | 0.0347 |
Q3 | 0.067675 |
95-th percentile | 0.141625 |
Maximum | 0.3402 |
Range | 0.3333 |
Interquartile range (IQR) | 0.046875 |
Descriptive statistics
Standard deviation | 0.064849276 |
---|---|
Coefficient of variation (CV) | 1.2353027 |
Kurtosis | 13.604207 |
Mean | 0.052496667 |
Median Absolute Deviation (MAD) | 0.0174 |
Skewness | 3.3661246 |
Sum | 1.5749 |
Variance | 0.0042054286 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.0208 | 6 | 20.0% |
0.0347 | 4 | 13.3% |
0.0416 | 4 | 13.3% |
0.0069 | 4 | 13.3% |
0.0763 | 3 | 10.0% |
0.0138 | 2 | 6.7% |
0.3402 | 1 | 3.3% |
0.1666 | 1 | 3.3% |
0.0694 | 1 | 3.3% |
0.1111 | 1 | 3.3% |
Other values (3) | 3 | 10.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0069 | 4 | 13.3% |
0.0138 | 2 | 6.7% |
0.0208 | 6 | 20.0% |
0.0277 | 1 | 3.3% |
0.0347 | 4 | 13.3% |
0.0416 | 4 | 13.3% |
0.0625 | 1 | 3.3% |
0.0694 | 1 | 3.3% |
0.0763 | 3 | 10.0% |
0.0833 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
0.3402 | 1 | 3.3% |
0.1666 | 1 | 3.3% |
0.1111 | 1 | 3.3% |
0.0833 | 1 | 3.3% |
0.0763 | 3 | 10.0% |
0.0694 | 1 | 3.3% |
0.0625 | 1 | 3.3% |
0.0416 | 4 | 13.3% |
0.0347 | 4 | 13.3% |
0.0277 | 1 | 3.3% |
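Each Frequency (%) entry is the count for a value divided by the 30 observations. The sketch below applies that to 연결정도중심성, whose 13 distinct values and their counts can all be read from the tables above; weighting each value by its count also gives a cross-check against the reported Sum of 1.5749. The variable names are chosen for this illustration only.

```python
# Count of each distinct 연결정도중심성 value, read from the tables above.
counts = {
    0.0069: 4, 0.0138: 2, 0.0208: 6, 0.0277: 1, 0.0347: 4, 0.0416: 4,
    0.0625: 1, 0.0694: 1, 0.0763: 3, 0.0833: 1, 0.1111: 1, 0.1666: 1,
    0.3402: 1,
}

n_observations = sum(counts.values())

# Frequency (%) is each count as a share of all observations, to one decimal.
frequency_pct = {
    value: round(100 * count / n_observations, 1)
    for value, count in counts.items()
}

# Cross-check against the reported Sum by weighting each value by its count.
total = sum(value * count for value, count in counts.items())

# List the values from most to least frequent.
for value, count in sorted(counts.items(), key=lambda item: -item[1]):
    print(f"{value:<7} {count:>2} {frequency_pct[value]:>5}%")
```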
매개중심성 (betweenness centrality)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 24 |
---|---|
Distinct (%) | 80.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.052826667 |
Minimum | 0 |
---|---|
Maximum | 0.4702 |
Zeros | 7 |
Zeros (%) | 23.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.0006 |
median | 0.0297 |
Q3 | 0.0637 |
95-th percentile | 0.159935 |
Maximum | 0.4702 |
Range | 0.4702 |
Interquartile range (IQR) | 0.0631 |
Descriptive statistics
Standard deviation | 0.091216041 |
---|---|
Coefficient of variation (CV) | 1.7267045 |
Kurtosis | 15.692992 |
Mean | 0.052826667 |
Median Absolute Deviation (MAD) | 0.02965 |
Skewness | 3.6556033 |
Sum | 1.5848 |
Variance | 0.0083203662 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0.0 | 7 | 23.3% |
0.0566 | 1 | 3.3% |
0.0586 | 1 | 3.3% |
0.0001 | 1 | 3.3% |
0.0319 | 1 | 3.3% |
0.0721 | 1 | 3.3% |
0.0102 | 1 | 3.3% |
0.0021 | 1 | 3.3% |
0.0137 | 1 | 3.3% |
0.0592 | 1 | 3.3% |
Other values (14) | 14 | 46.7% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0 | 7 | 23.3% |
0.0001 | 1 | 3.3% |
0.0021 | 1 | 3.3% |
0.0065 | 1 | 3.3% |
0.0077 | 1 | 3.3% |
0.0086 | 1 | 3.3% |
0.0102 | 1 | 3.3% |
0.0137 | 1 | 3.3% |
0.0275 | 1 | 3.3% |
0.0319 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
0.4702 | 1 | 3.3% |
0.2102 | 1 | 3.3% |
0.0985 | 1 | 3.3% |
0.0983 | 1 | 3.3% |
0.0888 | 1 | 3.3% |
0.0721 | 1 | 3.3% |
0.0679 | 1 | 3.3% |
0.0652 | 1 | 3.3% |
0.0592 | 1 | 3.3% |
0.0586 | 1 | 3.3% |
Variable | 분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.729 | 0.333 | 0.465 | 0.000 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.729 | 1.000 | 1.000 | 0.000 | 0.970 | 0.869 |
단어중요도 | 0.333 | 1.000 | 0.000 | 1.000 | 0.439 | 0.220 |
연결정도중심성 | 0.465 | 1.000 | 0.970 | 0.439 | 1.000 | 0.912 |
매개중심성 | 0.000 | 1.000 | 0.869 | 0.220 | 0.912 | 1.000 |
Variable | 분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -0.998 | -0.057 | -0.666 | -0.604 |
단어빈도 | -0.998 | 1.000 | 0.046 | 0.657 | 0.592 |
단어중요도 | -0.057 | 0.046 | 1.000 | 0.260 | 0.270 |
연결정도중심성 | -0.666 | 0.657 | 0.260 | 1.000 | 0.855 |
매개중심성 | -0.604 | 0.592 | 0.270 | 0.855 | 1.000 |
First rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 경기도 | 291 | 0.0242 | 0.3402 | 0.4702 |
1 | 2 | 2010-01 | 데이트 | 209 | 0.0237 | 0.1666 | 0.2102 |
2 | 3 | 2010-01 | 연인 | 63 | 0.0235 | 0.0694 | 0.0584 |
3 | 4 | 2010-01 | 데이트코스 | 60 | 0.0238 | 0.0763 | 0.0679 |
4 | 5 | 2010-01 | 펜션 | 50 | 0.0621 | 0.1111 | 0.0983 |
5 | 6 | 2010-01 | 맛집 | 42 | 0.0391 | 0.0763 | 0.0652 |
6 | 7 | 2010-01 | 서울 | 41 | 0.0209 | 0.0625 | 0.0354 |
7 | 8 | 2010-01 | 장소 | 40 | 0.0261 | 0.0347 | 0.0275 |
8 | 9 | 2010-01 | 헤이리예술마을 | 34 | 0.057 | 0.0416 | 0.0985 |
9 | 10 | 2010-01 | 가평 | 30 | 0.0472 | 0.0347 | 0.0065 |
Last rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 영화 | 19 | 0.0201 | 0.0138 | 0.0021 |
21 | 22 | 2010-01 | 일산 | 18 | 0.0308 | 0.0347 | 0.0102 |
22 | 23 | 2010-01 | 결혼 | 17 | 0.018 | 0.0069 | 0.0 |
23 | 24 | 2010-01 | 공원 | 16 | 0.0282 | 0.0416 | 0.0721 |
24 | 25 | 2010-01 | 양평 | 16 | 0.0309 | 0.0208 | 0.0 |
25 | 26 | 2010-01 | 겨울 | 15 | 0.0311 | 0.0347 | 0.0319 |
26 | 27 | 2010-01 | 고양 | 15 | 0.0231 | 0.0208 | 0.0001 |
27 | 28 | 2010-01 | 분위기 | 15 | 0.0238 | 0.0069 | 0.0 |
28 | 29 | 2010-01 | 남양주 | 15 | 0.0287 | 0.0208 | 0.0 |
29 | 30 | 2010-01 | 코스 | 14 | 0.0261 | 0.0416 | 0.0586 |
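The strongly negative coefficient between 분석인덱스 and 단어빈도 in the second correlation matrix reflects that the rows are ordered by descending word frequency. As an illustration only, the sketch below computes a rank (Spearman) correlation on rows 0 to 9 of the sample. Which coefficient the report used is an assumption here, and since the report's matrices cover all 30 rows, these ten-row results are not expected to match the reported figures.

```python
def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        average_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            result[order[position]] = average_rank
        start = end + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    spread_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return covariance / (spread_x * spread_y)


def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the rank-transformed data."""
    return pearson(ranks(x), ranks(y))


# Rows 0 to 9 of the sample, copied from the table above.
analysis_index = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
word_frequency = [291, 209, 63, 60, 50, 42, 41, 40, 34, 30]
degree_centrality = [0.3402, 0.1666, 0.0694, 0.0763, 0.1111,
                     0.0763, 0.0625, 0.0347, 0.0416, 0.0347]

print(round(spearman(analysis_index, word_frequency), 3))
print(round(spearman(word_frequency, degree_centrality), 3))
```

Within these ten rows the word frequency falls strictly as the index rises, so the first coefficient is a perfect negative rank correlation; the second is positive but below 1 because the centrality values do not follow the frequency order exactly.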