Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 30 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 65.4 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | Sample data |
---|---|
Author | 더아이엠씨 |
URL | https://bigdata-region.kr/#/dataset/9ac907cb-1f48-454c-9282-752d0d324c25 |
Alerts
수집년월 has constant value "2010-01" | Constant |
분석인덱스 is highly overall correlated with 단어빈도 and 1 other field | High correlation |
단어빈도 is highly overall correlated with 분석인덱스 and 1 other field | High correlation |
단어연결중심성 is highly overall correlated with 단어매개중심성 | High correlation |
단어매개중심성 is highly overall correlated with 분석인덱스 and 2 other fields | High correlation |
분석인덱스 has unique values | Unique |
키워드명 has unique values | Unique |
단어빈도 has unique values | Unique |
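The Constant and Unique alerts above can be reproduced by counting distinct values per column. The following is a minimal sketch under stated assumptions, not ydata-profiling's own implementation: the `column_alerts` helper is hypothetical, and the rows are the first five records of the sample shown in this report (four of the seven columns).

```python
# Rows copied from the first five sample records in this report.
rows = [
    {"분석인덱스": 1, "수집년월": "2010-01", "키워드명": "수원", "단어빈도": 46884},
    {"분석인덱스": 2, "수집년월": "2010-01", "키워드명": "서울", "단어빈도": 5257},
    {"분석인덱스": 3, "수집년월": "2010-01", "키워드명": "경기", "단어빈도": 5185},
    {"분석인덱스": 4, "수집년월": "2010-01", "키워드명": "경기도", "단어빈도": 4577},
    {"분석인덱스": 5, "수집년월": "2010-01", "키워드명": "정보", "단어빈도": 3335},
]


def column_alerts(records):
    """Hypothetical helper: flag columns that are constant or entirely unique."""
    alerts = {}
    for column in records[0]:
        values = [record[column] for record in records]
        distinct = len(set(values))
        if distinct == 1:
            alerts[column] = "Constant"
        elif distinct == len(values):
            alerts[column] = "Unique"
    return alerts


alerts = column_alerts(rows)
for column, alert in alerts.items():
    print(column, alert)
```

On these rows, 수집년월 is flagged as Constant and the other three columns as Unique, which matches the alerts listed for the full dataset.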
Reproduction
Analysis started | 2023-12-10 14:07:02.588576 |
---|---|
Analysis finished | 2023-12-10 14:07:07.665154 |
Duration | 5.08 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
분석인덱스 (analysis index)
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5 |
Minimum | 1 |
---|---|
Maximum | 30 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.45 |
Q1 | 8.25 |
median | 15.5 |
Q3 | 22.75 |
95-th percentile | 28.55 |
Maximum | 30 |
Range | 29 |
Interquartile range (IQR) | 14.5 |
Descriptive statistics
Standard deviation | 8.8034084 |
---|---|
Coefficient of variation (CV) | 0.56796183 |
Kurtosis | -1.2 |
Mean | 15.5 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 0 |
Sum | 465 |
Variance | 77.5 |
Monotonicity | Strictly increasing |
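Because 분석인덱스 takes each integer from 1 to 30 exactly once, the figures above can be checked by hand or with the standard library. The sketch below assumes the report uses the sample variance (n - 1 denominator) and linear-interpolation percentiles; the `percentile` helper is illustrative and not part of ydata-profiling.

```python
import math
import statistics

values = list(range(1, 31))  # each integer from 1 to 30 appears once


def percentile(sorted_values, q):
    """Linear-interpolation percentile for q in [0, 1] (illustrative helper)."""
    position = (len(sorted_values) - 1) * q
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    low_value = sorted_values[lower]
    return low_value + (sorted_values[upper] - low_value) * fraction


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)

print(mean, variance, round(std_dev, 7), round(cv, 8))
print(median, mad, p5, q1, q3, p95, q3 - q1)
```

Under these assumptions the computed values agree with the table: mean 15.5, variance 77.5, MAD 7.5, Q1 8.25, Q3 22.75, and IQR 14.5.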
Common values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | 3.3% |
17 | 1 | 3.3% |
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | 3.3% |
2 | 1 | 3.3% |
3 | 1 | 3.3% |
4 | 1 | 3.3% |
5 | 1 | 3.3% |
6 | 1 | 3.3% |
7 | 1 | 3.3% |
8 | 1 | 3.3% |
9 | 1 | 3.3% |
10 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
30 | 1 | 3.3% |
29 | 1 | 3.3% |
28 | 1 | 3.3% |
27 | 1 | 3.3% |
26 | 1 | 3.3% |
25 | 1 | 3.3% |
24 | 1 | 3.3% |
23 | 1 | 3.3% |
22 | 1 | 3.3% |
21 | 1 | 3.3% |
수집년월 (collection year-month)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010-01 |
---|---|
2nd row | 2010-01 |
3rd row | 2010-01 |
4th row | 2010-01 |
5th row | 2010-01 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
2010-01 | 30 | 100.0% |
키워드명 (keyword name)
Text
UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
---|---|---|
수원 | 1 | 3.3% |
서울 | 1 | 3.3% |
부산 | 1 | 3.3% |
지원 | 1 | 3.3% |
수원화성 | 1 | 3.3% |
채용 | 1 | 3.3% |
대학 | 1 | 3.3% |
이사 | 1 | 3.3% |
아파트 | 1 | 3.3% |
구입 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Most occurring characters
Value | Count | Frequency (%) |
---|---|---|
성 | 4 | 6.2% |
거 | 3 | 4.7% |
원 | 3 | 4.7% |
수 | 2 | 3.1% |
화 | 2 | 3.1% |
양 | 2 | 3.1% |
산 | 2 | 3.1% |
용 | 2 | 3.1% |
인 | 2 | 3.1% |
래 | 2 | 3.1% |
Other values (37) | 40 | 62.5% |
Most occurring categories
Value | Count | Frequency (%) |
---|---|---|
Other Letter | 64 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
---|---|---|
Hangul | 64 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
---|---|---|
Hangul | 64 | 100.0% |
All 64 characters fall into a single category, script, and block, so the most frequent characters per category, per script, and per block are the same as in the Most occurring characters table.
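The character, category, and script counts for 키워드명 can be approximated with the standard library. This is a minimal sketch, not the report's own method: it covers only the 20 keywords visible in the sample rows of this report (not all 30), so the totals differ from the 64 characters reported, and it infers the script from the first word of each Unicode character name, which is a simplifying assumption.

```python
import unicodedata
from collections import Counter

# Keywords visible in the sample rows of this report (20 of the 30 values).
keywords = [
    "수원", "서울", "경기", "경기도", "정보", "판매", "거래", "인천", "가격", "희망",
    "화성", "구입", "아파트", "이사", "대학", "채용", "수원화성", "지원", "부산", "취업",
]

characters = "".join(keywords)
char_counts = Counter(characters)

# "Lo" is the Unicode general category "Other Letter".
category_counts = Counter(unicodedata.category(ch) for ch in characters)

# Assumption: the first word of the Unicode name stands in for the script.
script_counts = Counter(unicodedata.name(ch).split()[0] for ch in characters)

print(char_counts.most_common(5))
print(category_counts)
print(script_counts)
```

Every character in this subset is classified as `Lo` and named as a Hangul syllable, consistent with the single Other Letter category and Hangul script reported for the full column.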
단어빈도 (word frequency)
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3551.5667 |
Minimum | 1095 |
---|---|
Maximum | 46884 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 1095 |
---|---|
5-th percentile | 1131.8 |
Q1 | 1311.75 |
median | 1652 |
Q3 | 2224.5 |
95-th percentile | 5224.6 |
Maximum | 46884 |
Range | 45789 |
Interquartile range (IQR) | 912.75 |
Descriptive statistics
Standard deviation | 8261.821 |
---|---|
Coefficient of variation (CV) | 2.3262469 |
Kurtosis | 28.76724 |
Mean | 3551.5667 |
Median Absolute Deviation (MAD) | 423.5 |
Skewness | 5.3183342 |
Sum | 106547 |
Variance | 68257685 |
Monotonicity | Strictly decreasing |
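The large skewness and kurtosis of 단어빈도 come from a handful of very large values. One way to see this is Tukey's 1.5 × IQR rule applied to the quartiles reported above. The rule is an assumption chosen for illustration and is not a setting of the report; the value list is the ten largest values of 단어빈도 given in this report.

```python
# Quartiles, sum, and the ten largest values as reported for 단어빈도.
q1, q3 = 1311.75, 2224.5
total = 106547
largest_values = [46884, 5257, 5185, 4577, 3335, 2747, 2367, 2240, 2178, 2118]

iqr = q3 - q1
upper_fence = q3 + 1.5 * iqr  # Tukey's rule, chosen here for illustration
outliers = [v for v in largest_values if v > upper_fence]
top_share = largest_values[0] / total  # share of the sum held by the maximum

print(iqr)                 # 912.75
print(upper_fence)         # 3593.625
print(outliers)            # [46884, 5257, 5185, 4577]
print(f"{top_share:.1%}")  # 44.0%
```

All remaining values are below the ten listed, so under this rule four of the 30 observations are flagged, and the maximum alone accounts for roughly 44% of the column sum.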
Common values
Value | Count | Frequency (%) |
---|---|---|
46884 | 1 | 3.3% |
1592 | 1 | 3.3% |
1095 | 1 | 3.3% |
1121 | 1 | 3.3% |
1145 | 1 | 3.3% |
1215 | 1 | 3.3% |
1230 | 1 | 3.3% |
1280 | 1 | 3.3% |
1289 | 1 | 3.3% |
1300 | 1 | 3.3% |
Other values (20) | 20 | 66.7% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1095 | 1 | 3.3% |
1121 | 1 | 3.3% |
1145 | 1 | 3.3% |
1215 | 1 | 3.3% |
1230 | 1 | 3.3% |
1280 | 1 | 3.3% |
1289 | 1 | 3.3% |
1300 | 1 | 3.3% |
1347 | 1 | 3.3% |
1421 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
46884 | 1 | 3.3% |
5257 | 1 | 3.3% |
5185 | 1 | 3.3% |
4577 | 1 | 3.3% |
3335 | 1 | 3.3% |
2747 | 1 | 3.3% |
2367 | 1 | 3.3% |
2240 | 1 | 3.3% |
2178 | 1 | 3.3% |
2118 | 1 | 3.3% |
단어중요도 (word importance)
Real number (ℝ)
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.030116667 |
Minimum | 0.0218 |
---|---|
Maximum | 0.0659 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0218 |
---|---|
5-th percentile | 0.023425 |
Q1 | 0.0259 |
median | 0.02835 |
Q3 | 0.0318 |
95-th percentile | 0.040785 |
Maximum | 0.0659 |
Range | 0.0441 |
Interquartile range (IQR) | 0.0059 |
Descriptive statistics
Standard deviation | 0.0082837224 |
---|---|
Coefficient of variation (CV) | 0.27505443 |
Kurtosis | 12.089939 |
Mean | 0.030116667 |
Median Absolute Deviation (MAD) | 0.00315 |
Skewness | 3.0921976 |
Sum | 0.9035 |
Variance | 6.8620057 × 10⁻⁵ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.0241 | 2 | 6.7% |
0.0259 | 2 | 6.7% |
0.0349 | 1 | 3.3% |
0.031 | 1 | 3.3% |
0.0346 | 1 | 3.3% |
0.0263 | 1 | 3.3% |
0.026 | 1 | 3.3% |
0.0659 | 1 | 3.3% |
0.0315 | 1 | 3.3% |
0.0323 | 1 | 3.3% |
Other values (18) | 18 | 60.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0218 | 1 | 3.3% |
0.0232 | 1 | 3.3% |
0.0237 | 1 | 3.3% |
0.0241 | 2 | 6.7% |
0.025 | 1 | 3.3% |
0.0252 | 1 | 3.3% |
0.0259 | 2 | 6.7% |
0.026 | 1 | 3.3% |
0.0263 | 1 | 3.3% |
0.0266 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0659 | 1 | 3.3% |
0.0456 | 1 | 3.3% |
0.0349 | 1 | 3.3% |
0.0346 | 1 | 3.3% |
0.034 | 1 | 3.3% |
0.0331 | 1 | 3.3% |
0.0323 | 1 | 3.3% |
0.0319 | 1 | 3.3% |
0.0315 | 1 | 3.3% |
0.0314 | 1 | 3.3% |
단어연결중심성 (word degree centrality)
Real number (ℝ)
HIGH CORRELATION
Distinct | 28 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.12185667 |
Minimum | 0.0404 |
---|---|
Maximum | 0.6805 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0404 |
---|---|
5-th percentile | 0.046185 |
Q1 | 0.078525 |
median | 0.0899 |
Q3 | 0.1199 |
95-th percentile | 0.218415 |
Maximum | 0.6805 |
Range | 0.6401 |
Interquartile range (IQR) | 0.041375 |
Descriptive statistics
Standard deviation | 0.11519501 |
---|---|
Coefficient of variation (CV) | 0.945332 |
Kurtosis | 20.221054 |
Mean | 0.12185667 |
Median Absolute Deviation (MAD) | 0.01565 |
Skewness | 4.2137084 |
Sum | 3.6557 |
Variance | 0.013269889 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.1199 | 2 | 6.7% |
0.0834 | 2 | 6.7% |
0.0756 | 1 | 3.3% |
0.0638 | 1 | 3.3% |
0.0886 | 1 | 3.3% |
0.099 | 1 | 3.3% |
0.0808 | 1 | 3.3% |
0.0873 | 1 | 3.3% |
0.0821 | 1 | 3.3% |
0.1003 | 1 | 3.3% |
Other values (18) | 18 | 60.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0404 | 1 | 3.3% |
0.0456 | 1 | 3.3% |
0.0469 | 1 | 3.3% |
0.0638 | 1 | 3.3% |
0.0677 | 1 | 3.3% |
0.073 | 1 | 3.3% |
0.0756 | 1 | 3.3% |
0.0782 | 1 | 3.3% |
0.0795 | 1 | 3.3% |
0.0808 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.6805 | 1 | 3.3% |
0.219 | 1 | 3.3% |
0.2177 | 1 | 3.3% |
0.2112 | 1 | 3.3% |
0.1538 | 1 | 3.3% |
0.1382 | 1 | 3.3% |
0.1329 | 1 | 3.3% |
0.1199 | 2 | 6.7% |
0.1043 | 1 | 3.3% |
0.1003 | 1 | 3.3% |
단어매개중심성 (word betweenness centrality)
Real number (ℝ)
HIGH CORRELATION
Distinct | 29 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.028126667 |
Minimum | 0.0015 |
---|---|
Maximum | 0.5605 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 0.0015 |
---|---|
5-th percentile | 0.002135 |
Q1 | 0.004975 |
median | 0.008 |
Q3 | 0.011275 |
95-th percentile | 0.032165 |
Maximum | 0.5605 |
Range | 0.559 |
Interquartile range (IQR) | 0.0063 |
Descriptive statistics
Standard deviation | 0.10087176 |
---|---|
Coefficient of variation (CV) | 3.5863389 |
Kurtosis | 29.576721 |
Mean | 0.028126667 |
Median Absolute Deviation (MAD) | 0.00325 |
Skewness | 5.422132 |
Sum | 0.8438 |
Variance | 0.010175112 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0.0052 | 2 | 6.7% |
0.5605 | 1 | 3.3% |
0.0323 | 1 | 3.3% |
0.0023 | 1 | 3.3% |
0.0033 | 1 | 3.3% |
0.007 | 1 | 3.3% |
0.0112 | 1 | 3.3% |
0.0049 | 1 | 3.3% |
0.0047 | 1 | 3.3% |
0.0072 | 1 | 3.3% |
Other values (19) | 19 | 63.3% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.0015 | 1 | 3.3% |
0.002 | 1 | 3.3% |
0.0023 | 1 | 3.3% |
0.0033 | 1 | 3.3% |
0.0034 | 1 | 3.3% |
0.0043 | 1 | 3.3% |
0.0047 | 1 | 3.3% |
0.0049 | 1 | 3.3% |
0.0052 | 2 | 6.7% |
0.0056 | 1 | 3.3% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
0.5605 | 1 | 3.3% |
0.0323 | 1 | 3.3% |
0.032 | 1 | 3.3% |
0.0298 | 1 | 3.3% |
0.0151 | 1 | 3.3% |
0.0126 | 1 | 3.3% |
0.0115 | 1 | 3.3% |
0.0113 | 1 | 3.3% |
0.0112 | 1 | 3.3% |
0.0109 | 1 | 3.3% |
Correlations
Variable | 분석인덱스 | 키워드명 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|---|
분석인덱스 | 1.000 | 1.000 | 0.000 | 0.000 | 0.590 | 0.159 |
키워드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
단어빈도 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.653 |
단어중요도 | 0.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 |
단어연결중심성 | 0.590 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 |
단어매개중심성 | 0.159 | 1.000 | 0.653 | 0.000 | 1.000 | 1.000 |
Variable | 분석인덱스 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|
분석인덱스 | 1.000 | -1.000 | 0.198 | -0.423 | -0.706 |
단어빈도 | -1.000 | 1.000 | -0.198 | 0.423 | 0.706 |
단어중요도 | 0.198 | -0.198 | 1.000 | -0.060 | -0.268 |
단어연결중심성 | -0.423 | 0.423 | -0.060 | 1.000 | 0.601 |
단어매개중심성 | -0.706 | 0.706 | -0.268 | 0.601 | 1.000 |
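The -1.000 entry between 분석인덱스 and 단어빈도 is what a rank-based coefficient gives when one variable is strictly increasing and the other strictly decreasing. The sketch below assumes that this matrix is a Spearman rank correlation, which the report text does not state. It uses only the 20 (분석인덱스, 단어빈도) pairs visible in the sample rows of this report, and the helper functions are illustrative.

```python
import math

# (분석인덱스, 단어빈도) pairs from the 20 sample rows shown in this report.
pairs = [
    (1, 46884), (2, 5257), (3, 5185), (4, 4577), (5, 3335),
    (6, 2747), (7, 2367), (8, 2240), (9, 2178), (10, 2118),
    (21, 1421), (22, 1347), (23, 1300), (24, 1289), (25, 1280),
    (26, 1230), (27, 1215), (28, 1145), (29, 1121), (30, 1095),
]


def ranks(values):
    """Rank from 1 (smallest); assumes no ties, which holds for this data."""
    ordered = sorted(values)
    return [ordered.index(v) + 1 for v in values]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


index_values = [p[0] for p in pairs]
frequency_values = [p[1] for p in pairs]

rho = spearman(index_values, frequency_values)
raw_pearson = pearson(index_values, frequency_values)
print(round(rho, 3))  # -1.0
print(raw_pearson)    # negative, but well short of -1
```

The rank correlation is exactly -1 on these pairs, while the Pearson coefficient on the raw values is much weaker because the single very large frequency dominates the linear fit.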
First rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|---|---|
0 | 1 | 2010-01 | 수원 | 46884 | 0.0349 | 0.6805 | 0.5605 |
1 | 2 | 2010-01 | 서울 | 5257 | 0.0259 | 0.2177 | 0.0323 |
2 | 3 | 2010-01 | 경기 | 5185 | 0.0283 | 0.2112 | 0.0298 |
3 | 4 | 2010-01 | 경기도 | 4577 | 0.0268 | 0.219 | 0.032 |
4 | 5 | 2010-01 | 정보 | 3335 | 0.0259 | 0.1538 | 0.0151 |
5 | 6 | 2010-01 | 판매 | 2747 | 0.0319 | 0.0912 | 0.0115 |
6 | 7 | 2010-01 | 거래 | 2367 | 0.0269 | 0.0677 | 0.0077 |
7 | 8 | 2010-01 | 인천 | 2240 | 0.0241 | 0.1329 | 0.0087 |
8 | 9 | 2010-01 | 가격 | 2178 | 0.0286 | 0.0795 | 0.0102 |
9 | 10 | 2010-01 | 희망 | 2118 | 0.0331 | 0.0456 | 0.0015 |
Last rows
Row | 분석인덱스 | 수집년월 | 키워드명 | 단어빈도 | 단어중요도 | 단어연결중심성 | 단어매개중심성 |
---|---|---|---|---|---|---|---|
20 | 21 | 2010-01 | 화성 | 1421 | 0.0284 | 0.1199 | 0.0056 |
21 | 22 | 2010-01 | 구입 | 1347 | 0.0237 | 0.0469 | 0.0084 |
22 | 23 | 2010-01 | 아파트 | 1300 | 0.0306 | 0.1003 | 0.0072 |
23 | 24 | 2010-01 | 이사 | 1289 | 0.0456 | 0.0821 | 0.0047 |
24 | 25 | 2010-01 | 대학 | 1280 | 0.0323 | 0.0873 | 0.0049 |
25 | 26 | 2010-01 | 채용 | 1230 | 0.0315 | 0.0834 | 0.0112 |
26 | 27 | 2010-01 | 수원화성 | 1215 | 0.0659 | 0.0808 | 0.0052 |
27 | 28 | 2010-01 | 지원 | 1145 | 0.026 | 0.099 | 0.007 |
28 | 29 | 2010-01 | 부산 | 1121 | 0.0263 | 0.0886 | 0.0033 |
29 | 30 | 2010-01 | 취업 | 1095 | 0.0346 | 0.0638 | 0.0023 |