Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 23 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.7 KiB |
Average record size in memory | 75.7 B |
Variable types
Text | 1 |
---|---|
Numeric | 6 |
Categorical | 1 |
Dataset
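Description | Registered foreign residents in Seo-gu, Incheon Metropolitan City, by dong (neighborhood) and nationality: Taiwan, United States, Japan, Philippines, and China, plus the data reference date. |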
---|---|
Author | Seo-gu, Incheon Metropolitan City (인천광역시 서구) |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15068522&srcSe=7661IVAWM27C61E190 |
데이터기준일자 has constant value "2023-02-28" | Constant |
타이완(대만) is highly overall correlated with 미국 and 2 other fields | High correlation |
미국 is highly overall correlated with 타이완(대만) and 2 other fields | High correlation |
일본 is highly overall correlated with 타이완(대만) and 2 other fields | High correlation |
필리핀 is highly overall correlated with 기타 | High correlation |
중국 is highly overall correlated with 타이완(대만) and 3 other fields | High correlation |
기타 is highly overall correlated with 필리핀 and 1 other field | High correlation |
구분 has unique values | Unique |
기타 has unique values | Unique |
타이완(대만) has 3 (13.0%) zeros | Zeros |
미국 has 3 (13.0%) zeros | Zeros |
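The Constant, Unique, and Zeros alerts above can be illustrated with a few per-column checks. The following is a minimal sketch under simplified rules, not ydata-profiling's exact alert logic or thresholds; `column_alerts` is a hypothetical helper, and the 타이완(대만) values are reconstructed from that variable's value counts in this report.

```python
def column_alerts(values):
    """Return simplified Constant / Unique / Zeros alerts for one column.

    Assumption: these rules are an illustration only and may differ from
    the thresholds ydata-profiling applies internally.
    """
    n = len(values)
    distinct = len(set(values))
    alerts = []
    if distinct == 1:
        alerts.append("Constant")
    elif distinct == n:
        alerts.append("Unique")
    zeros = sum(1 for v in values if v == 0)
    if zeros:
        alerts.append(f"Zeros: {zeros} ({zeros / n:.1%})")
    return alerts


# 23 values of 타이완(대만), rebuilt from the value counts in this report.
taiwan = [0] * 3 + [1] * 2 + [2] + [3] * 3 + [4] * 3 + [5] * 2 + [6, 7, 11, 16, 17, 21, 26, 31, 39]
# 데이터기준일자 holds the same date in every row.
reference_date = ["2023-02-28"] * 23

print(column_alerts(taiwan))
print(column_alerts(reference_date))
```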
Reproduction
Analysis started | 2024-01-28 07:16:46.257138 |
---|---|
Analysis finished | 2024-01-28 07:16:48.910376 |
Duration | 2.65 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
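A report of this kind could be regenerated along the following lines. This is a sketch only: `build_profile`, the file paths, and the title are hypothetical, and the source CSV may need an explicit `encoding` argument that the report does not state.

```python
def build_profile(csv_path, html_path="report.html"):
    """Profile a CSV file and write the report as HTML.

    Assumption: csv_path points to a local copy of the dataset; the
    report itself does not say how the data was loaded.
    """
    # Imports live inside the function so the sketch can be read and
    # imported without pandas or ydata-profiling installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Seo-gu registered foreigners (hypothetical title)")
    report.to_file(html_path)
    return report


# Example call (hypothetical file name):
# build_profile("seogu_foreigners.csv")
```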
구분
Text
UNIQUE
 
Distinct | 23 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 316.0 B |
Value | Count | Frequency (%) |
검암경서동 | 1 | 4.3% |
가좌1동 | 1 | 4.3% |
마전동 | 1 | 4.3% |
오류왕길동 | 1 | 4.3% |
당하동 | 1 | 4.3% |
원당동 | 1 | 4.3% |
불로대곡동 | 1 | 4.3% |
검단동 | 1 | 4.3% |
가좌4동 | 1 | 4.3% |
가좌3동 | 1 | 4.3% |
Other values (13) | 13 | 56.5% |
Most occurring characters
Value | Count | Frequency (%) |
동 | 23 | 25.6% |
가 | 7 | 7.8% |
좌 | 4 | 4.4% |
라 | 4 | 4.4% |
1 | 4 | 4.4% |
2 | 4 | 4.4% |
3 | 4 | 4.4% |
석 | 3 | 3.3% |
정 | 3 | 3.3% |
남 | 3 | 3.3% |
Other values (26) | 31 | 34.4% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 77 | 85.6% |
Decimal Number | 13 | 14.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 23 | 29.9% |
가 | 7 | 9.1% |
좌 | 4 | 5.2% |
라 | 4 | 5.2% |
석 | 3 | 3.9% |
정 | 3 | 3.9% |
남 | 3 | 3.9% |
청 | 3 | 3.9% |
당 | 2 | 2.6% |
원 | 2 | 2.6% |
Other values (22) | 23 | 29.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 4 | 30.8% |
2 | 4 | 30.8% |
3 | 4 | 30.8% |
4 | 1 | 7.7% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 77 | 85.6% |
Common | 13 | 14.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 23 | 29.9% |
가 | 7 | 9.1% |
좌 | 4 | 5.2% |
라 | 4 | 5.2% |
석 | 3 | 3.9% |
정 | 3 | 3.9% |
남 | 3 | 3.9% |
청 | 3 | 3.9% |
당 | 2 | 2.6% |
원 | 2 | 2.6% |
Other values (22) | 23 | 29.9% |
Common
Value | Count | Frequency (%) |
1 | 4 | 30.8% |
2 | 4 | 30.8% |
3 | 4 | 30.8% |
4 | 1 | 7.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 77 | 85.6% |
ASCII | 13 | 14.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 23 | 29.9% |
가 | 7 | 9.1% |
좌 | 4 | 5.2% |
라 | 4 | 5.2% |
석 | 3 | 3.9% |
정 | 3 | 3.9% |
남 | 3 | 3.9% |
청 | 3 | 3.9% |
당 | 2 | 2.6% |
원 | 2 | 2.6% |
Other values (22) | 23 | 29.9% |
ASCII
Value | Count | Frequency (%) |
1 | 4 | 30.8% |
2 | 4 | 30.8% |
3 | 4 | 30.8% |
4 | 1 | 7.7% |
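The category tables above group each character of 구분 by its Unicode general category. A minimal sketch of that grouping using Python's standard `unicodedata` module follows; the helper name `category_counts` is hypothetical, and only a handful of names from this report are used, so the totals differ from the full-column figures.

```python
import unicodedata
from collections import Counter


def category_counts(names):
    """Count characters by Unicode general category across all names."""
    return Counter(unicodedata.category(ch) for name in names for ch in name)


# A few 구분 values taken from this report (not the full column).
names = ["청라1동", "청라2동", "청라3동", "가좌4동", "검단동"]
counts = category_counts(names)

# "Lo" is Other Letter (Hangul syllables), "Nd" is Decimal Number.
print(counts["Lo"], counts["Nd"])
```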
타이완(대만)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 15 |
---|---|
Distinct (%) | 65.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.0869565 |
Minimum | 0 |
---|---|
Maximum | 39 |
Zeros | 3 |
Zeros (%) | 13.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2.5 |
median | 4 |
Q3 | 13.5 |
95-th percentile | 30.5 |
Maximum | 39 |
Range | 39 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 10.833085 |
---|---|
Coefficient of variation (CV) | 1.1921577 |
Kurtosis | 1.6515054 |
Mean | 9.0869565 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 1.5561107 |
Sum | 209 |
Variance | 117.35573 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 3 | 13.0% |
4 | 3 | 13.0% |
0 | 3 | 13.0% |
1 | 2 | 8.7% |
5 | 2 | 8.7% |
26 | 1 | 4.3% |
21 | 1 | 4.3% |
39 | 1 | 4.3% |
31 | 1 | 4.3% |
17 | 1 | 4.3% |
Other values (5) | 5 | 21.7% |
Value | Count | Frequency (%) |
0 | 3 | 13.0% |
1 | 2 | 8.7% |
2 | 1 | 4.3% |
3 | 3 | 13.0% |
4 | 3 | 13.0% |
5 | 2 | 8.7% |
6 | 1 | 4.3% |
7 | 1 | 4.3% |
11 | 1 | 4.3% |
16 | 1 | 4.3% |
Value | Count | Frequency (%) |
39 | 1 | 4.3% |
31 | 1 | 4.3% |
26 | 1 | 4.3% |
21 | 1 | 4.3% |
17 | 1 | 4.3% |
16 | 1 | 4.3% |
11 | 1 | 4.3% |
7 | 1 | 4.3% |
6 | 1 | 4.3% |
5 | 2 | 8.7% |
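The value counts for 타이완(대만) determine the full set of 23 values, so the summary statistics can be rechecked. The sketch below assumes the quartiles use linear interpolation (`method="inclusive"` in Python's `statistics` module) and that the standard deviation is the sample (n - 1) form; both assumptions reproduce the figures reported above.

```python
import statistics

# Value -> count, taken from the 타이완(대만) frequency tables in this report.
counts = {0: 3, 1: 2, 2: 1, 3: 3, 4: 3, 5: 2, 6: 1, 7: 1, 11: 1,
          16: 1, 17: 1, 21: 1, 26: 1, 31: 1, 39: 1}
values = sorted(v for v, c in counts.items() for _ in range(c))

mean = statistics.mean(values)
std = statistics.stdev(values)  # sample standard deviation (n - 1)
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
cv = std / mean  # coefficient of variation

print(len(values), sum(values), mean, std, q1, median, q3, iqr, mad, cv)
```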
미국
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 12 |
---|---|
Distinct (%) | 52.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.6956522 |
Minimum | 0 |
---|---|
Maximum | 50 |
Zeros | 3 |
Zeros (%) | 13.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 6 |
95-th percentile | 23.2 |
Maximum | 50 |
Range | 50 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 11.133221 |
---|---|
Coefficient of variation (CV) | 1.6627538 |
Kurtosis | 10.653197 |
Mean | 6.6956522 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 3.06871 |
Sum | 154 |
Variance | 123.94862 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 6 | 26.1% |
0 | 3 | 13.0% |
2 | 2 | 8.7% |
6 | 2 | 8.7% |
3 | 2 | 8.7% |
4 | 2 | 8.7% |
11 | 1 | 4.3% |
16 | 1 | 4.3% |
24 | 1 | 4.3% |
50 | 1 | 4.3% |
Other values (2) | 2 | 8.7% |
Value | Count | Frequency (%) |
0 | 3 | 13.0% |
1 | 6 | 26.1% |
2 | 2 | 8.7% |
3 | 2 | 8.7% |
4 | 2 | 8.7% |
5 | 1 | 4.3% |
6 | 2 | 8.7% |
11 | 1 | 4.3% |
12 | 1 | 4.3% |
16 | 1 | 4.3% |
Value | Count | Frequency (%) |
50 | 1 | 4.3% |
24 | 1 | 4.3% |
16 | 1 | 4.3% |
12 | 1 | 4.3% |
11 | 1 | 4.3% |
6 | 2 | 8.7% |
5 | 1 | 4.3% |
4 | 2 | 8.7% |
3 | 2 | 8.7% |
2 | 2 | 8.7% |
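The skewness reported for 미국 (3.06871) is consistent with the bias-adjusted sample skewness. As a sketch, assuming that formula and using the 23 values reconstructed from the value counts above:

```python
import math

# Value -> count, taken from the 미국 frequency tables in this report.
counts = {0: 3, 1: 6, 2: 2, 3: 2, 4: 2, 5: 1, 6: 2,
          11: 1, 12: 1, 16: 1, 24: 1, 50: 1}
values = [v for v, c in counts.items() for _ in range(c)]

n = len(values)
mean = sum(values) / n
# Sample standard deviation (n - 1 in the denominator).
s = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
# Adjusted skewness: n / ((n - 1)(n - 2)) * sum(((x - mean) / s) ** 3).
skew = n / ((n - 1) * (n - 2)) * sum(((v - mean) / s) ** 3 for v in values)

print(round(skew, 5))
```

The single value of 50 dominates the cubed deviations, which is why the skewness is so large relative to the other variables.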
일본
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 15 |
---|---|
Distinct (%) | 65.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.304348 |
Minimum | 1 |
---|---|
Maximum | 32 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 6 |
median | 9 |
Q3 | 14 |
95-th percentile | 29.7 |
Maximum | 32 |
Range | 31 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 8.9464812 |
---|---|
Coefficient of variation (CV) | 0.79141949 |
Kurtosis | 0.43326805 |
Mean | 11.304348 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 1.159687 |
Sum | 260 |
Variance | 80.039526 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 4 | 17.4% |
11 | 2 | 8.7% |
10 | 2 | 8.7% |
7 | 2 | 8.7% |
2 | 2 | 8.7% |
4 | 2 | 8.7% |
27 | 1 | 4.3% |
16 | 1 | 4.3% |
21 | 1 | 4.3% |
30 | 1 | 4.3% |
Other values (5) | 5 | 21.7% |
Value | Count | Frequency (%) |
1 | 1 | 4.3% |
2 | 2 | 8.7% |
4 | 2 | 8.7% |
6 | 4 | 17.4% |
7 | 2 | 8.7% |
9 | 1 | 4.3% |
10 | 2 | 8.7% |
11 | 2 | 8.7% |
12 | 1 | 4.3% |
16 | 1 | 4.3% |
Value | Count | Frequency (%) |
32 | 1 | 4.3% |
30 | 1 | 4.3% |
27 | 1 | 4.3% |
21 | 1 | 4.3% |
20 | 1 | 4.3% |
16 | 1 | 4.3% |
12 | 1 | 4.3% |
11 | 2 | 8.7% |
10 | 2 | 8.7% |
9 | 1 | 4.3% |
필리핀
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 73.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.826087 |
Minimum | 4 |
---|---|
Maximum | 215 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 4 |
Q1 | 6.5 |
median | 12 |
Q3 | 38 |
95-th percentile | 163 |
Maximum | 215 |
Range | 211 |
Interquartile range (IQR) | 31.5 |
Descriptive statistics
Standard deviation | 55.885965 |
---|---|
Coefficient of variation (CV) | 1.4774451 |
Kurtosis | 4.6685902 |
Mean | 37.826087 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 2.2648995 |
Sum | 870 |
Variance | 3123.2411 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 4 | 17.4% |
12 | 2 | 8.7% |
6 | 2 | 8.7% |
8 | 2 | 8.7% |
71 | 1 | 4.3% |
168 | 1 | 4.3% |
24 | 1 | 4.3% |
33 | 1 | 4.3% |
51 | 1 | 4.3% |
18 | 1 | 4.3% |
Other values (7) | 7 | 30.4% |
Value | Count | Frequency (%) |
4 | 4 | 17.4% |
6 | 2 | 8.7% |
7 | 1 | 4.3% |
8 | 2 | 8.7% |
10 | 1 | 4.3% |
12 | 2 | 8.7% |
15 | 1 | 4.3% |
18 | 1 | 4.3% |
24 | 1 | 4.3% |
29 | 1 | 4.3% |
Value | Count | Frequency (%) |
215 | 1 | 4.3% |
168 | 1 | 4.3% |
118 | 1 | 4.3% |
71 | 1 | 4.3% |
51 | 1 | 4.3% |
43 | 1 | 4.3% |
33 | 1 | 4.3% |
29 | 1 | 4.3% |
24 | 1 | 4.3% |
18 | 1 | 4.3% |
중국
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 95.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 58.782609 |
Minimum | 7 |
---|---|
Maximum | 123 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 7 |
---|---|
5-th percentile | 12 |
Q1 | 32.5 |
median | 55 |
Q3 | 83.5 |
95-th percentile | 102.6 |
Maximum | 123 |
Range | 116 |
Interquartile range (IQR) | 51 |
Descriptive statistics
Standard deviation | 31.615526 |
---|---|
Coefficient of variation (CV) | 0.5378381 |
Kurtosis | -0.85270134 |
Mean | 58.782609 |
Median Absolute Deviation (MAD) | 27 |
Skewness | 0.17184608 |
Sum | 1352 |
Variance | 999.5415 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
82 | 2 | 8.7% |
123 | 1 | 4.3% |
64 | 1 | 4.3% |
33 | 1 | 4.3% |
63 | 1 | 4.3% |
50 | 1 | 4.3% |
30 | 1 | 4.3% |
32 | 1 | 4.3% |
103 | 1 | 4.3% |
26 | 1 | 4.3% |
Other values (12) | 12 | 52.2% |
Value | Count | Frequency (%) |
7 | 1 | 4.3% |
11 | 1 | 4.3% |
21 | 1 | 4.3% |
26 | 1 | 4.3% |
30 | 1 | 4.3% |
32 | 1 | 4.3% |
33 | 1 | 4.3% |
41 | 1 | 4.3% |
44 | 1 | 4.3% |
48 | 1 | 4.3% |
Value | Count | Frequency (%) |
123 | 1 | 4.3% |
103 | 1 | 4.3% |
99 | 1 | 4.3% |
89 | 1 | 4.3% |
88 | 1 | 4.3% |
85 | 1 | 4.3% |
82 | 2 | 8.7% |
76 | 1 | 4.3% |
64 | 1 | 4.3% |
63 | 1 | 4.3% |
기타
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 23 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 433.3913 |
Minimum | 54 |
---|---|
Maximum | 1855 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 54 |
---|---|
5-th percentile | 65.1 |
Q1 | 177 |
median | 294 |
Q3 | 543.5 |
95-th percentile | 1123.5 |
Maximum | 1855 |
Range | 1801 |
Interquartile range (IQR) | 366.5 |
Descriptive statistics
Standard deviation | 433.0598 |
---|---|
Coefficient of variation (CV) | 0.9992351 |
Kurtosis | 4.3561238 |
Mean | 433.3913 |
Median Absolute Deviation (MAD) | 154 |
Skewness | 2.0026909 |
Sum | 9968 |
Variance | 187540.79 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
739 | 1 | 4.3% |
371 | 1 | 4.3% |
241 | 1 | 4.3% |
102 | 1 | 4.3% |
1855 | 1 | 4.3% |
226 | 1 | 4.3% |
294 | 1 | 4.3% |
311 | 1 | 4.3% |
1074 | 1 | 4.3% |
140 | 1 | 4.3% |
Other values (13) | 13 | 56.5% |
Value | Count | Frequency (%) |
54 | 1 | 4.3% |
61 | 1 | 4.3% |
102 | 1 | 4.3% |
106 | 1 | 4.3% |
140 | 1 | 4.3% |
166 | 1 | 4.3% |
188 | 1 | 4.3% |
216 | 1 | 4.3% |
222 | 1 | 4.3% |
226 | 1 | 4.3% |
Value | Count | Frequency (%) |
1855 | 1 | 4.3% |
1129 | 1 | 4.3% |
1074 | 1 | 4.3% |
785 | 1 | 4.3% |
739 | 1 | 4.3% |
600 | 1 | 4.3% |
487 | 1 | 4.3% |
371 | 1 | 4.3% |
311 | 1 | 4.3% |
303 | 1 | 4.3% |
데이터기준일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 4.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 316.0 B |
2023-02-28 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-02-28 |
---|---|
2nd row | 2023-02-28 |
3rd row | 2023-02-28 |
4th row | 2023-02-28 |
5th row | 2023-02-28 |
Common Values
Value | Count | Frequency (%) |
2023-02-28 | 23 | 100.0% |
Variable | 구분 | 타이완(대만) | 미국 | 일본 | 필리핀 | 중국 | 기타 |
---|---|---|---|---|---|---|---|
구분 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
타이완(대만) | 1.000 | 1.000 | 0.833 | 0.946 | 0.000 | 0.586 | 0.000 |
미국 | 1.000 | 0.833 | 1.000 | 0.609 | 0.000 | 0.654 | 0.000 |
일본 | 1.000 | 0.946 | 0.609 | 1.000 | 0.372 | 0.130 | 0.000 |
필리핀 | 1.000 | 0.000 | 0.000 | 0.372 | 1.000 | 0.603 | 0.961 |
중국 | 1.000 | 0.586 | 0.654 | 0.130 | 0.603 | 1.000 | 0.550 |
기타 | 1.000 | 0.000 | 0.000 | 0.000 | 0.961 | 0.550 | 1.000 |
Variable | 타이완(대만) | 미국 | 일본 | 필리핀 | 중국 | 기타 |
---|---|---|---|---|---|---|
타이완(대만) | 1.000 | 0.642 | 0.690 | -0.205 | 0.683 | 0.076 |
미국 | 0.642 | 1.000 | 0.679 | -0.155 | 0.506 | 0.132 |
일본 | 0.690 | 0.679 | 1.000 | -0.211 | 0.590 | 0.062 |
필리핀 | -0.205 | -0.155 | -0.211 | 1.000 | 0.317 | 0.735 |
중국 | 0.683 | 0.506 | 0.590 | 0.317 | 1.000 | 0.573 |
기타 | 0.076 | 0.132 | 0.062 | 0.735 | 0.573 | 1.000 |
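The High correlation alerts listed earlier can be cross-checked against the second correlation matrix above. The sketch below assumes a cutoff of 0.5 on the absolute coefficient; that threshold is not stated in this report, but it reproduces the number of correlated fields named in each alert. `high_correlations` is a hypothetical helper.

```python
fields = ["타이완(대만)", "미국", "일본", "필리핀", "중국", "기타"]
# Second correlation matrix from this report, rows in the same order as `fields`.
matrix = [
    [1.000, 0.642, 0.690, -0.205, 0.683, 0.076],
    [0.642, 1.000, 0.679, -0.155, 0.506, 0.132],
    [0.690, 0.679, 1.000, -0.211, 0.590, 0.062],
    [-0.205, -0.155, -0.211, 1.000, 0.317, 0.735],
    [0.683, 0.506, 0.590, 0.317, 1.000, 0.573],
    [0.076, 0.132, 0.062, 0.735, 0.573, 1.000],
]
THRESHOLD = 0.5  # assumption: not stated in the report


def high_correlations(fields, matrix, threshold=THRESHOLD):
    """Map each field to the other fields whose |r| meets the threshold."""
    result = {}
    for i, name in enumerate(fields):
        partners = [fields[j] for j, r in enumerate(matrix[i])
                    if j != i and abs(r) >= threshold]
        if partners:
            result[name] = partners
    return result


flagged = high_correlations(fields, matrix)
for name, partners in flagged.items():
    print(name, "->", ", ".join(partners))
```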
Index | 구분 | 타이완(대만) | 미국 | 일본 | 필리핀 | 중국 | 기타 | 데이터기준일자 |
---|---|---|---|---|---|---|---|---|
0 | 검암경서동 | 26 | 11 | 27 | 71 | 123 | 739 | 2023-02-28 |
1 | 연희동 | 21 | 2 | 16 | 10 | 85 | 371 | 2023-02-28 |
2 | 청라1동 | 39 | 16 | 21 | 6 | 82 | 188 | 2023-02-28 |
3 | 청라2동 | 31 | 24 | 30 | 8 | 89 | 216 | 2023-02-28 |
4 | 청라3동 | 17 | 50 | 11 | 15 | 76 | 222 | 2023-02-28 |
5 | 가정1동 | 6 | 6 | 10 | 8 | 88 | 298 | 2023-02-28 |
6 | 가정2동 | 1 | 1 | 1 | 4 | 7 | 54 | 2023-02-28 |
7 | 가정3동 | 3 | 0 | 7 | 6 | 21 | 106 | 2023-02-28 |
8 | 신현원창동 | 2 | 1 | 7 | 43 | 55 | 600 | 2023-02-28 |
9 | 석남1동 | 7 | 2 | 6 | 29 | 99 | 487 | 2023-02-28 |
Index | 구분 | 타이완(대만) | 미국 | 일본 | 필리핀 | 중국 | 기타 | 데이터기준일자 |
---|---|---|---|---|---|---|---|---|
13 | 가좌2동 | 0 | 0 | 6 | 12 | 11 | 61 | 2023-02-28 |
14 | 가좌3동 | 1 | 0 | 6 | 18 | 48 | 303 | 2023-02-28 |
15 | 가좌4동 | 0 | 1 | 4 | 51 | 26 | 140 | 2023-02-28 |
16 | 검단동 | 4 | 3 | 10 | 33 | 103 | 1074 | 2023-02-28 |
17 | 불로대곡동 | 3 | 5 | 6 | 24 | 32 | 311 | 2023-02-28 |
18 | 원당동 | 4 | 12 | 9 | 4 | 30 | 294 | 2023-02-28 |
19 | 당하동 | 5 | 4 | 20 | 4 | 50 | 226 | 2023-02-28 |
20 | 오류왕길동 | 0 | 6 | 11 | 168 | 63 | 1855 | 2023-02-28 |
21 | 마전동 | 11 | 4 | 12 | 4 | 33 | 102 | 2023-02-28 |
22 | 아라동 | 16 | 3 | 32 | 12 | 82 | 241 | 2023-02-28 |