Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/ba12aa77-caf3-4080-8b34-a3e26b658e2a |
수집년월 has constant value "" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 1 other fields | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 1 other fields | High correlation |
연결정도중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
매개중심성 is highly overall correlated with 연결정도중심성 | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:54:30.547224 |
---|---|
Analysis finished | 2023-12-10 13:54:36.433279 |
Duration | 5.89 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
분석인덱스
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
30 | 1 | |
29 | 1 | |
28 | 1 | |
27 | 1 | |
26 | 1 | |
25 | 1 | |
24 | 1 | |
23 | 1 | |
22 | 1 | |
21 | 1 |
수집년월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
2010-01 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
2010-01 | 30 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2010-01 | 30 |
키워드명
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
성남 | 1 | 3.3% |
경기도 | 1 | 3.3% |
차 | 1 | 3.3% |
시간 | 1 | 3.3% |
지원 | 1 | 3.3% |
지만 | 1 | 3.3% |
분양 | 1 | 3.3% |
채용 | 1 | 3.3% |
수도권 | 1 | 3.3% |
안양 | 1 | 3.3% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
남 | 3 | 4.5% |
거 | 3 | 4.5% |
성 | 2 | 3.0% |
래 | 2 | 3.0% |
트 | 2 | 3.0% |
아 | 2 | 3.0% |
인 | 2 | 3.0% |
용 | 2 | 3.0% |
지 | 2 | 3.0% |
주 | 2 | 3.0% |
Other values (38) | 45 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 67 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
남 | 3 | 4.5% |
거 | 3 | 4.5% |
성 | 2 | 3.0% |
래 | 2 | 3.0% |
트 | 2 | 3.0% |
아 | 2 | 3.0% |
인 | 2 | 3.0% |
용 | 2 | 3.0% |
지 | 2 | 3.0% |
주 | 2 | 3.0% |
Other values (38) | 45 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 67 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
남 | 3 | 4.5% |
거 | 3 | 4.5% |
성 | 2 | 3.0% |
래 | 2 | 3.0% |
트 | 2 | 3.0% |
아 | 2 | 3.0% |
인 | 2 | 3.0% |
용 | 2 | 3.0% |
지 | 2 | 3.0% |
주 | 2 | 3.0% |
Other values (38) | 45 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 67 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
남 | 3 | 4.5% |
거 | 3 | 4.5% |
성 | 2 | 3.0% |
래 | 2 | 3.0% |
트 | 2 | 3.0% |
아 | 2 | 3.0% |
인 | 2 | 3.0% |
용 | 2 | 3.0% |
지 | 2 | 3.0% |
주 | 2 | 3.0% |
Other values (38) | 45 |
단어빈도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1522.0667 |
Minimum | 542 |
---|---|
Maximum | 17543 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 542 |
---|---|
5-th percentile | 551.5 |
Q1 | 620.75 |
median | 754.5 |
Q3 | 1048 |
95-th percentile | 2550.8 |
Maximum | 17543 |
Range | 17001 |
Interquartile range (IQR) | 427.25 |
Descriptive statistics
Standard deviation | 3080.9102 |
---|---|
Coefficient of variation (CV) | 2.0241625 |
Kurtosis | 27.686469 |
Mean | 1522.0667 |
Median Absolute Deviation (MAD) | 145 |
Skewness | 5.1819247 |
Sum | 45662 |
Variance | 9492007.8 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
629 | 2 | 6.7% |
843 | 2 | 6.7% |
17543 | 1 | 3.3% |
690 | 1 | 3.3% |
542 | 1 | 3.3% |
547 | 1 | 3.3% |
557 | 1 | 3.3% |
562 | 1 | 3.3% |
586 | 1 | 3.3% |
607 | 1 | 3.3% |
Other values (18) | 18 |
Value | Count | Frequency (%) |
542 | 1 | |
547 | 1 | |
557 | 1 | |
562 | 1 | |
586 | 1 | |
607 | 1 | |
608 | 1 | |
618 | 1 | |
629 | 2 | |
651 | 1 |
Value | Count | Frequency (%) |
17543 | 1 | |
2630 | 1 | |
2454 | 1 | |
2444 | 1 | |
1476 | 1 | |
1452 | 1 | |
1317 | 1 | |
1098 | 1 | |
898 | 1 | |
885 | 1 |
단어중요도
Real number (ℝ)
Distinct | 26 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.026816667 |
Minimum | 0.0199 |
---|---|
Maximum | 0.0352 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0199 |
---|---|
5-th percentile | 0.022215 |
Q1 | 0.024225 |
median | 0.0254 |
Q3 | 0.029975 |
95-th percentile | 0.03327 |
Maximum | 0.0352 |
Range | 0.0153 |
Interquartile range (IQR) | 0.00575 |
Descriptive statistics
Standard deviation | 0.0039440119 |
---|---|
Coefficient of variation (CV) | 0.14707316 |
Kurtosis | -0.60710912 |
Mean | 0.026816667 |
Median Absolute Deviation (MAD) | 0.0016 |
Skewness | 0.63577004 |
Sum | 0.8045 |
Variance | 1.555523 × 10-5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0254 | 2 | 6.7% |
0.0251 | 2 | 6.7% |
0.0246 | 2 | 6.7% |
0.0239 | 2 | 6.7% |
0.0339 | 1 | 3.3% |
0.0325 | 1 | 3.3% |
0.0226 | 1 | 3.3% |
0.0243 | 1 | 3.3% |
0.0271 | 1 | 3.3% |
0.0199 | 1 | 3.3% |
Other values (16) | 16 |
Value | Count | Frequency (%) |
0.0199 | 1 | |
0.0219 | 1 | |
0.0226 | 1 | |
0.0234 | 1 | |
0.0236 | 1 | |
0.0239 | 2 | |
0.0242 | 1 | |
0.0243 | 1 | |
0.0246 | 2 | |
0.0251 | 2 |
Value | Count | Frequency (%) |
0.0352 | 1 | |
0.0339 | 1 | |
0.0325 | 1 | |
0.0323 | 1 | |
0.0322 | 1 | |
0.032 | 1 | |
0.0319 | 1 | |
0.03 | 1 | |
0.0299 | 1 | |
0.0271 | 1 |
연결정도중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 83.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.092486667 |
Minimum | 0.0146 |
---|---|
Maximum | 0.5256 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0146 |
---|---|
5-th percentile | 0.02538 |
Q1 | 0.04275 |
median | 0.0717 |
Q3 | 0.10135 |
95-th percentile | 0.193065 |
Maximum | 0.5256 |
Range | 0.511 |
Interquartile range (IQR) | 0.0586 |
Descriptive statistics
Standard deviation | 0.094449748 |
---|---|
Coefficient of variation (CV) | 1.0212256 |
Kurtosis | 15.626865 |
Mean | 0.092486667 |
Median Absolute Deviation (MAD) | 0.0308 |
Skewness | 3.5727661 |
Sum | 2.7746 |
Variance | 0.008920755 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0717 | 3 | 10.0% |
0.1288 | 2 | 6.7% |
0.0571 | 2 | 6.7% |
0.0409 | 2 | 6.7% |
0.5256 | 1 | 3.3% |
0.0278 | 1 | 3.3% |
0.0146 | 1 | 3.3% |
0.0527 | 1 | 3.3% |
0.0893 | 1 | 3.3% |
0.0512 | 1 | 3.3% |
Other values (15) | 15 |
Value | Count | Frequency (%) |
0.0146 | 1 | |
0.0234 | 1 | |
0.0278 | 1 | |
0.0307 | 1 | |
0.0336 | 1 | |
0.038 | 1 | |
0.0409 | 2 | |
0.0483 | 1 | |
0.0512 | 1 | |
0.0527 | 1 |
Value | Count | Frequency (%) |
0.5256 | 1 | |
0.2049 | 1 | |
0.1786 | 1 | |
0.1639 | 1 | |
0.1317 | 1 | |
0.1288 | 2 | |
0.1039 | 1 | |
0.0937 | 1 | |
0.0893 | 1 | |
0.0863 | 1 |
매개중심성
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.030993333 |
Minimum | 0.0006 |
---|---|
Maximum | 0.4857 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0006 |
---|---|
5-th percentile | 0.001095 |
Q1 | 0.005325 |
median | 0.0116 |
Q3 | 0.018675 |
95-th percentile | 0.05446 |
Maximum | 0.4857 |
Range | 0.4851 |
Interquartile range (IQR) | 0.01335 |
Descriptive statistics
Standard deviation | 0.087197932 |
---|---|
Coefficient of variation (CV) | 2.8134415 |
Kurtosis | 28.035383 |
Mean | 0.030993333 |
Median Absolute Deviation (MAD) | 0.00655 |
Skewness | 5.2238194 |
Sum | 0.9298 |
Variance | 0.0076034793 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0061 | 2 | 6.7% |
0.0006 | 2 | 6.7% |
0.4857 | 1 | 3.3% |
0.032 | 1 | 3.3% |
0.011 | 1 | 3.3% |
0.0191 | 1 | 3.3% |
0.0159 | 1 | 3.3% |
0.0049 | 1 | 3.3% |
0.0043 | 1 | 3.3% |
0.007 | 1 | 3.3% |
Other values (18) | 18 |
Value | Count | Frequency (%) |
0.0006 | 2 | |
0.0017 | 1 | |
0.0023 | 1 | |
0.0038 | 1 | |
0.0043 | 1 | |
0.0049 | 1 | |
0.0052 | 1 | |
0.0057 | 1 | |
0.0061 | 2 | |
0.007 | 1 |
Value | Count | Frequency (%) |
0.4857 | 1 | |
0.0604 | 1 | |
0.0472 | 1 | |
0.0446 | 1 | |
0.0415 | 1 | |
0.032 | 1 | |
0.0195 | 1 | |
0.0191 | 1 | |
0.0174 | 1 | |
0.0164 | 1 |
분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.575 | 0.000 | 0.449 | 0.281 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.575 | 1.000 | 1.000 | 0.339 | 0.871 | 0.972 |
단어중요도 | 0.000 | 1.000 | 0.339 | 1.000 | 0.338 | 0.301 |
연결정도중심성 | 0.449 | 1.000 | 0.871 | 0.338 | 1.000 | 0.812 |
매개중심성 | 0.281 | 1.000 | 0.972 | 0.301 | 0.812 | 1.000 |
분석인덱스 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | -0.235 | -0.564 | -0.465 |
단어빈도 | -1.000 | 1.000 | 0.240 | 0.566 | 0.463 |
단어중요도 | -0.235 | 0.240 | 1.000 | 0.048 | -0.016 |
연결정도중심성 | -0.564 | 0.566 | 0.048 | 1.000 | 0.791 |
매개중심성 | -0.465 | 0.463 | -0.016 | 0.791 | 1.000 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 성남 | 17543 | 0.0339 | 0.5256 | 0.4857 |
1 | 2 | 2010-01 | 경기도 | 2630 | 0.0236 | 0.1639 | 0.0415 |
2 | 3 | 2010-01 | 서울 | 2454 | 0.0255 | 0.2049 | 0.0604 |
3 | 4 | 2010-01 | 경기 | 2444 | 0.0254 | 0.1786 | 0.0472 |
4 | 5 | 2010-01 | 수원 | 1476 | 0.0251 | 0.1288 | 0.0146 |
5 | 6 | 2010-01 | 광주 | 1452 | 0.0299 | 0.1317 | 0.0195 |
6 | 7 | 2010-01 | 정보 | 1317 | 0.0246 | 0.1288 | 0.0446 |
7 | 8 | 2010-01 | 판매 | 1098 | 0.032 | 0.0571 | 0.0164 |
8 | 9 | 2010-01 | 하남 | 898 | 0.0266 | 0.0761 | 0.0023 |
9 | 10 | 2010-01 | 용인 | 885 | 0.0253 | 0.1039 | 0.013 |
분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 연결정도중심성 | 매개중심성 | |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 직거래 | 629 | 0.0239 | 0.0409 | 0.0052 |
21 | 22 | 2010-01 | 안양 | 629 | 0.0242 | 0.0863 | 0.007 |
22 | 23 | 2010-01 | 수도권 | 618 | 0.0234 | 0.0541 | 0.0043 |
23 | 24 | 2010-01 | 채용 | 608 | 0.0352 | 0.0483 | 0.0049 |
24 | 25 | 2010-01 | 분양 | 607 | 0.0319 | 0.0512 | 0.0061 |
25 | 26 | 2010-01 | 지만 | 586 | 0.0199 | 0.0717 | 0.0159 |
26 | 27 | 2010-01 | 지원 | 562 | 0.0271 | 0.0893 | 0.0191 |
27 | 28 | 2010-01 | 시간 | 557 | 0.0243 | 0.0527 | 0.011 |
28 | 29 | 2010-01 | 차 | 547 | 0.0246 | 0.0717 | 0.032 |
29 | 30 | 2010-01 | 구입 | 542 | 0.0226 | 0.0146 | 0.0006 |