Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
DateTime | 1 |
Text | 1 |
Dataset
Description | 샘플 데이터 (Sample data) |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/17d039bb-8711-4042-987d-0cbcedbd3070 |
수집년월 has constant value "" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 2 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
단어빈도 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:43:57.954194 |
---|---|
Analysis finished | 2023-12-10 13:44:02.929841 |
Duration | 4.98 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
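A report with this metadata could plausibly be regenerated along the following lines. This is only a sketch: the CSV path, the report title, and the helper name `build_report` are hypothetical, and the original `config.json` is not reproduced here. The imports sit inside the function so the snippet can be loaded even where pandas or ydata-profiling is not installed.

```python
# Dataset metadata as shown in the "Dataset" table of this report.
DATASET_META = {
    "description": "샘플 데이터",  # "Sample data"
    "author": "더아이엠씨",
    "url": "https://bigdata-region.kr/#/dataset/17d039bb-8711-4042-987d-0cbcedbd3070",
}


def build_report(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily: both packages are third-party dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is a CSV whose 수집년월 column parses as a date.
    df = pd.read_csv(csv_path, parse_dates=["수집년월"])
    report = ProfileReport(df, title="Profiling Report", dataset=DATASET_META)
    report.to_file(html_path)
    return report
```

Calling `build_report("sample.csv")` would then write `report.html` next to the script; the file name `sample.csv` is a placeholder.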
분석인덱스
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
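The quantile values above are consistent with linear interpolation between order statistics, which is the default method in NumPy and pandas. The sketch below reproduces them in plain Python, assuming 분석인덱스 holds the integers 1 to 30 (as the minimum, maximum, distinct count, and strict monotonicity suggest); the helper name `quantile` is illustrative.

```python
def quantile(values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    data = sorted(values)
    pos = q * (len(data) - 1)          # fractional zero-based position
    lo = int(pos)
    hi = min(lo + 1, len(data) - 1)
    return data[lo] + (data[hi] - data[lo]) * (pos - lo)


index = list(range(1, 31))             # assumed content of 분석인덱스

p5 = round(quantile(index, 0.05), 2)
q1 = round(quantile(index, 0.25), 2)
median = round(quantile(index, 0.50), 2)
q3 = round(quantile(index, 0.75), 2)
p95 = round(quantile(index, 0.95), 2)
iqr = round(q3 - q1, 2)

print(p5, q1, median, q3, p95, iqr)
```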
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
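As a cross-check, the descriptive statistics above can be recomputed with the Python standard library, again assuming 분석인덱스 is the sequence 1 to 30. The figures match the sample (n − 1) variance and standard deviation rather than the population versions.

```python
import statistics

index = list(range(1, 31))                       # assumed content of 분석인덱스

total = sum(index)
mean = statistics.mean(index)
variance = statistics.variance(index)            # sample variance (n - 1 denominator)
std = statistics.stdev(index)                    # sample standard deviation
cv = std / mean                                  # coefficient of variation
median = statistics.median(index)
mad = statistics.median(abs(x - median) for x in index)  # median absolute deviation

print(total, mean, variance, round(std, 7), round(cv, 8), mad)
```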
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
수집년월
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2010-01-01 00:00:00 |
---|---|
Maximum | 2010-01-01 00:00:00 |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
미용업 | 1 | 3.3% |
독서실 | 1 | 3.3% |
운영 | 1 | 3.3% |
메이크업 | 1 | 3.3% |
시설 | 1 | 3.3% |
헤어스타일 | 1 | 3.3% |
가격 | 1 | 3.3% |
여자 | 1 | 3.3% |
아이 | 1 | 3.3% |
정보 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
리 | 3 | 4.2% |
용 | 3 | 4.2% |
부 | 3 | 4.2% |
미 | 2 | 2.8% |
피 | 2 | 2.8% |
이 | 2 | 2.8% |
아 | 2 | 2.8% |
일 | 2 | 2.8% |
타 | 2 | 2.8% |
스 | 2 | 2.8% |
Other values (40) | 49 | 68.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 72 | 100.0% |
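"Other Letter" corresponds to the Unicode general category `Lo`, which covers every Hangul syllable. The sketch below shows how such a category and script breakdown can be derived with the standard `unicodedata` module. It uses only the 20 keywords visible in the sample rows of this report, so its totals will not equal the 72 characters counted over all 30 keywords, and taking the first word of the Unicode character name as a script label is a simplifying assumption.

```python
import unicodedata
from collections import Counter

# Keywords taken from the first and last sample rows (20 of 30).
keywords = [
    "미용업", "독서실", "머리", "세탁소", "공부", "헤어", "피부관리실", "펌",
    "남자", "학원", "친구", "정보", "아이", "여자", "가격", "헤어스타일",
    "시설", "메이크업", "운영", "아파트",
]


def char_profile(words):
    """Count characters by general category and by first word of their Unicode name."""
    categories = Counter()
    scripts = Counter()
    for ch in "".join(words):
        categories[unicodedata.category(ch)] += 1          # e.g. "Lo"
        scripts[unicodedata.name(ch).split()[0]] += 1      # e.g. "HANGUL"
    return categories, scripts


categories, scripts = char_profile(keywords)
print(categories, scripts)
```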
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
리 | 3 | 4.2% |
용 | 3 | 4.2% |
부 | 3 | 4.2% |
미 | 2 | 2.8% |
피 | 2 | 2.8% |
이 | 2 | 2.8% |
아 | 2 | 2.8% |
일 | 2 | 2.8% |
타 | 2 | 2.8% |
스 | 2 | 2.8% |
Other values (40) | 49 | 68.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 72 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
리 | 3 | 4.2% |
용 | 3 | 4.2% |
부 | 3 | 4.2% |
미 | 2 | 2.8% |
피 | 2 | 2.8% |
이 | 2 | 2.8% |
아 | 2 | 2.8% |
일 | 2 | 2.8% |
타 | 2 | 2.8% |
스 | 2 | 2.8% |
Other values (40) | 49 | 68.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 72 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
리 | 3 | 4.2% |
용 | 3 | 4.2% |
부 | 3 | 4.2% |
미 | 2 | 2.8% |
피 | 2 | 2.8% |
이 | 2 | 2.8% |
아 | 2 | 2.8% |
일 | 2 | 2.8% |
타 | 2 | 2.8% |
스 | 2 | 2.8% |
Other values (40) | 49 | 68.1% |
단어빈도
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1224.8667 |
Minimum | 377 |
---|---|
Maximum | 10112 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 377 |
---|---|
5-th percentile | 391.75 |
Q1 | 510.5 |
median | 594 |
Q3 | 789.5 |
95-th percentile | 3997.3 |
Maximum | 10112 |
Range | 9735 |
Interquartile range (IQR) | 279 |
Descriptive statistics
Standard deviation | 1920.2619 |
---|---|
Coefficient of variation (CV) | 1.5677314 |
Kurtosis | 16.519948 |
Mean | 1224.8667 |
Median Absolute Deviation (MAD) | 144 |
Skewness | 3.8573026 |
Sum | 36746 |
Variance | 3687405.8 |
Monotonicity | Strictly decreasing |
Value | Count | Frequency (%) |
10112 | 1 | 3.3% |
554 | 1 | 3.3% |
377 | 1 | 3.3% |
385 | 1 | 3.3% |
400 | 1 | 3.3% |
408 | 1 | 3.3% |
426 | 1 | 3.3% |
438 | 1 | 3.3% |
462 | 1 | 3.3% |
510 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Value | Count | Frequency (%) |
377 | 1 | |
385 | 1 | |
400 | 1 | |
408 | 1 | |
426 | 1 | |
438 | 1 | |
462 | 1 | |
510 | 1 | |
512 | 1 | |
526 | 1 |
Value | Count | Frequency (%) |
10112 | 1 | |
4135 | 1 | |
3829 | 1 | |
2613 | 1 | |
1220 | 1 | |
1162 | 1 | |
936 | 1 | |
800 | 1 | |
758 | 1 | |
712 | 1 |
단어중요도
Real number (ℝ)
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.032093333 |
Minimum | 0.0229 |
---|---|
Maximum | 0.0457 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0229 |
---|---|
5-th percentile | 0.024115 |
Q1 | 0.027225 |
median | 0.0308 |
Q3 | 0.036125 |
95-th percentile | 0.044815 |
Maximum | 0.0457 |
Range | 0.0228 |
Interquartile range (IQR) | 0.0089 |
Descriptive statistics
Standard deviation | 0.0065752374 |
---|---|
Coefficient of variation (CV) | 0.20487861 |
Kurtosis | -0.4954991 |
Mean | 0.032093333 |
Median Absolute Deviation (MAD) | 0.0043 |
Skewness | 0.64984389 |
Sum | 0.9628 |
Variance | 4.3233747 × 10⁻⁵ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0304 | 2 | 6.7% |
0.0457 | 1 | 3.3% |
0.027 | 1 | 3.3% |
0.0229 | 1 | 3.3% |
0.0312 | 1 | 3.3% |
0.0245 | 1 | 3.3% |
0.0454 | 1 | 3.3% |
0.0238 | 1 | 3.3% |
0.0397 | 1 | 3.3% |
0.028 | 1 | 3.3% |
Other values (19) | 19 | 63.3% |
Value | Count | Frequency (%) |
0.0229 | 1 | |
0.0238 | 1 | |
0.0245 | 1 | |
0.0248 | 1 | |
0.0252 | 1 | |
0.0253 | 1 | |
0.027 | 1 | |
0.0271 | 1 | |
0.0276 | 1 | |
0.0278 | 1 |
Value | Count | Frequency (%) |
0.0457 | 1 | |
0.0454 | 1 | |
0.0441 | 1 | |
0.0412 | 1 | |
0.0397 | 1 | |
0.0392 | 1 | |
0.0368 | 1 | |
0.0363 | 1 | |
0.0356 | 1 | |
0.0343 | 1 |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 83.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.06674 |
Minimum | 0.0229 |
---|---|
Maximum | 0.3193 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0229 |
---|---|
5-th percentile | 0.02588 |
Q1 | 0.037725 |
median | 0.05135 |
Q3 | 0.0659 |
95-th percentile | 0.152435 |
Maximum | 0.3193 |
Range | 0.2964 |
Interquartile range (IQR) | 0.028175 |
Descriptive statistics
Standard deviation | 0.057930551 |
---|---|
Coefficient of variation (CV) | 0.86800345 |
Kurtosis | 12.612644 |
Mean | 0.06674 |
Median Absolute Deviation (MAD) | 0.01455 |
Skewness | 3.2546054 |
Sum | 2.0022 |
Variance | 0.0033559487 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.1002 | 2 | 6.7% |
0.0309 | 2 | 6.7% |
0.065 | 2 | 6.7% |
0.0538 | 2 | 6.7% |
0.0408 | 2 | 6.7% |
0.3193 | 1 | 3.3% |
0.0446 | 1 | 3.3% |
0.047 | 1 | 3.3% |
0.0285 | 1 | 3.3% |
0.0396 | 1 | 3.3% |
Other values (15) | 15 | 50.0% |
Value | Count | Frequency (%) |
0.0229 | 1 | |
0.0248 | 1 | |
0.0272 | 1 | |
0.0285 | 1 | |
0.0309 | 2 | |
0.0365 | 1 | |
0.0371 | 1 | |
0.0396 | 1 | |
0.0408 | 2 | |
0.0439 | 1 |
Value | Count | Frequency (%) |
0.3193 | 1 | |
0.1739 | 1 | |
0.1262 | 1 | |
0.1002 | 2 | |
0.0941 | 1 | |
0.0699 | 1 | |
0.0662 | 1 | |
0.065 | 2 | |
0.0588 | 1 | |
0.0563 | 1 |
매개중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.028493333 |
Minimum | 0.0019 |
---|---|
Maximum | 0.2874 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0019 |
---|---|
5-th percentile | 0.004445 |
Q1 | 0.006225 |
median | 0.01095 |
Q3 | 0.024875 |
95-th percentile | 0.09643 |
Maximum | 0.2874 |
Range | 0.2855 |
Interquartile range (IQR) | 0.01865 |
Descriptive statistics
Standard deviation | 0.054149741 |
---|---|
Coefficient of variation (CV) | 1.9004355 |
Kurtosis | 19.154689 |
Mean | 0.028493333 |
Median Absolute Deviation (MAD) | 0.00585 |
Skewness | 4.1715176 |
Sum | 0.8548 |
Variance | 0.0029321944 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0069 | 2 | 6.7% |
0.0097 | 2 | 6.7% |
0.0242 | 1 | 3.3% |
0.0063 | 1 | 3.3% |
0.0059 | 1 | 3.3% |
0.0152 | 1 | 3.3% |
0.0019 | 1 | 3.3% |
0.0062 | 1 | 3.3% |
0.0099 | 1 | 3.3% |
0.017 | 1 | 3.3% |
Other values (18) | 18 | 60.0% |
Value | Count | Frequency (%) |
0.0019 | 1 | |
0.0044 | 1 | |
0.0045 | 1 | |
0.0053 | 1 | |
0.0057 | 1 | |
0.0058 | 1 | |
0.0059 | 1 | |
0.0062 | 1 | |
0.0063 | 1 | |
0.0069 | 2 |
Value | Count | Frequency (%) |
0.2874 | 1 | |
0.1156 | 1 | |
0.073 | 1 | |
0.0372 | 1 | |
0.0344 | 1 | |
0.033 | 1 | |
0.0278 | 1 | |
0.0251 | 1 | |
0.0242 | 1 | |
0.0224 | 1 |
Correlations
 | 분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.624 | 0.000 | 0.459 | 0.379 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.624 | 1.000 | 1.000 | 0.000 | 0.987 | 0.917 |
단어중요도 | 0.000 | 1.000 | 0.000 | 1.000 | 0.555 | 0.705 |
연결정도중심성 | 0.459 | 1.000 | 0.987 | 0.555 | 1.000 | 1.000 |
매개중심성 | 0.379 | 1.000 | 0.917 | 0.705 | 1.000 | 1.000 |
 | 분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | -0.462 | -0.680 | -0.586 |
단어빈도 | -1.000 | 1.000 | 0.462 | 0.680 | 0.586 |
단어중요도 | -0.462 | 0.462 | 1.000 | 0.487 | 0.286 |
연결정도중심성 | -0.680 | 0.680 | 0.487 | 1.000 | 0.932 |
매개중심성 | -0.586 | 0.586 | 0.286 | 0.932 | 1.000 |
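The −1.000 entry between 분석인덱스 and 단어빈도 is what a rank correlation yields when one series is strictly increasing and the other strictly decreasing. The matrix above is not labelled, so treating it as Spearman's coefficient is an assumption. The sketch below checks the idea on the 20 rows visible in the sample tables (the middle ten rows are not shown in this report), using the no-ties formula ρ = 1 − 6Σd² / (n(n² − 1)).

```python
def ranks(values):
    """Rank values from 1 (smallest) to n; assumes no tied values."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0] * len(values)
    for rank, pos in enumerate(order, start=1):
        result[pos] = rank
    return result


def spearman(x, y):
    """Spearman rank correlation for series without ties."""
    n = len(x)
    d2 = sum((rx - ry) ** 2 for rx, ry in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n * n - 1))


# 분석인덱스 and 단어빈도 from the first and last sample rows (20 of 30).
index = list(range(1, 11)) + list(range(21, 31))
frequency = [10112, 4135, 3829, 2613, 1220, 1162, 936, 800, 758, 712,
             526, 512, 510, 462, 438, 426, 408, 400, 385, 377]

rho = spearman(index, frequency)
print(rho)
```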
First rows
 | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 미용업 | 10112 | 0.0457 | 0.3193 | 0.2874 |
1 | 2 | 2010-01 | 독서실 | 4135 | 0.0304 | 0.1739 | 0.1156 |
2 | 3 | 2010-01 | 머리 | 3829 | 0.0356 | 0.1002 | 0.0344 |
3 | 4 | 2010-01 | 세탁소 | 2613 | 0.0325 | 0.1262 | 0.073 |
4 | 5 | 2010-01 | 공부 | 1220 | 0.0278 | 0.0309 | 0.0057 |
5 | 6 | 2010-01 | 헤어 | 1162 | 0.0368 | 0.1002 | 0.033 |
6 | 7 | 2010-01 | 피부관리실 | 936 | 0.0363 | 0.0941 | 0.0372 |
7 | 8 | 2010-01 | 펌 | 800 | 0.0412 | 0.0563 | 0.0097 |
8 | 9 | 2010-01 | 남자 | 758 | 0.0441 | 0.0551 | 0.012 |
9 | 10 | 2010-01 | 학원 | 712 | 0.0271 | 0.0699 | 0.0224 |
Last rows
 | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 친구 | 526 | 0.0253 | 0.0229 | 0.0044 |
21 | 22 | 2010-01 | 정보 | 512 | 0.0318 | 0.0538 | 0.017 |
22 | 23 | 2010-01 | 아이 | 510 | 0.028 | 0.0408 | 0.0099 |
23 | 24 | 2010-01 | 여자 | 462 | 0.0397 | 0.0408 | 0.0062 |
24 | 25 | 2010-01 | 가격 | 438 | 0.0238 | 0.0309 | 0.0069 |
25 | 26 | 2010-01 | 헤어스타일 | 426 | 0.0454 | 0.0365 | 0.0019 |
26 | 27 | 2010-01 | 시설 | 408 | 0.0245 | 0.0489 | 0.0152 |
27 | 28 | 2010-01 | 메이크업 | 400 | 0.0312 | 0.0396 | 0.0059 |
28 | 29 | 2010-01 | 운영 | 385 | 0.0229 | 0.0285 | 0.0063 |
29 | 30 | 2010-01 | 아파트 | 377 | 0.027 | 0.047 | 0.0097 |