Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 31 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.0 KiB |
Average record size in memory | 65.3 B |
Variable types
Text | 1 |
---|---|
Categorical | 2 |
Numeric | 4 |
Dataset
Description | 재외공관에서 근로계약이 체결된 외국인근로자가 별다른 절차없이 국내 입국에 필요한 사증을 받을 수 있도록 국내에서 입국을 허락한 인정서인 고용허가제에 대한 국가별 사증발급인정서 발급 현황을 국가별, 성별로 제공 |
---|---|
URL | https://www.data.go.kr/data/3075820/fileData.do |
E9_01(제조업) is highly overall correlated with E9_03(농업) and 1 other fields | High correlation |
E9_02(건설업) is highly overall correlated with E9_03(농업) | High correlation |
E9_03(농업) is highly overall correlated with E9_01(제조업) and 1 other fields | High correlation |
성별 is highly overall correlated with E9_01(제조업) | High correlation |
E9_05(서비스업) is highly imbalanced (65.0%) | Imbalance |
E9_01(제조업) has unique values | Unique |
E9_02(건설업) has 26 (83.9%) zeros | Zeros |
E9_03(농업) has 20 (64.5%) zeros | Zeros |
E9_04(어업) has 24 (77.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 02:48:57.088603 |
---|---|
Analysis finished | 2023-12-12 02:48:59.474110 |
Duration | 2.39 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
국적
Text
Distinct | 16 |
---|---|
Distinct (%) | 51.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
Value | Count | Frequency (%) |
네팔 | 2 | 6.5% |
인도네시아 | 2 | 6.5% |
베트남 | 2 | 6.5% |
캄보디아 | 2 | 6.5% |
필리핀 | 2 | 6.5% |
스리랑카 | 2 | 6.5% |
타이 | 2 | 6.5% |
방글라데시 | 2 | 6.5% |
우즈베키스탄 | 2 | 6.5% |
파키스탄 | 2 | 6.5% |
Other values (6) | 11 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 8 | 6.8% |
키 | 6 | 5.1% |
네 | 4 | 3.4% |
리 | 4 | 3.4% |
국 | 4 | 3.4% |
르 | 4 | 3.4% |
탄 | 4 | 3.4% |
즈 | 4 | 3.4% |
라 | 4 | 3.4% |
시 | 4 | 3.4% |
Other values (35) | 71 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 117 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 8 | 6.8% |
키 | 6 | 5.1% |
네 | 4 | 3.4% |
리 | 4 | 3.4% |
국 | 4 | 3.4% |
르 | 4 | 3.4% |
탄 | 4 | 3.4% |
즈 | 4 | 3.4% |
라 | 4 | 3.4% |
시 | 4 | 3.4% |
Other values (35) | 71 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 117 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 8 | 6.8% |
키 | 6 | 5.1% |
네 | 4 | 3.4% |
리 | 4 | 3.4% |
국 | 4 | 3.4% |
르 | 4 | 3.4% |
탄 | 4 | 3.4% |
즈 | 4 | 3.4% |
라 | 4 | 3.4% |
시 | 4 | 3.4% |
Other values (35) | 71 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 117 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 8 | 6.8% |
키 | 6 | 5.1% |
네 | 4 | 3.4% |
리 | 4 | 3.4% |
국 | 4 | 3.4% |
르 | 4 | 3.4% |
탄 | 4 | 3.4% |
즈 | 4 | 3.4% |
라 | 4 | 3.4% |
시 | 4 | 3.4% |
Other values (35) | 71 |
성별
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 6.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
남성 | |
---|---|
여성 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남성 |
---|---|
2nd row | 여성 |
3rd row | 남성 |
4th row | 여성 |
5th row | 남성 |
Common Values
Value | Count | Frequency (%) |
남성 | 16 | |
여성 | 15 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
남성 | 16 | |
여성 | 15 |
E9_01(제조업)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 31 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2325.0968 |
Minimum | 2 |
---|---|
Maximum | 8934 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 12 |
Q1 | 96.5 |
median | 560 |
Q3 | 4648.5 |
95-th percentile | 8363.5 |
Maximum | 8934 |
Range | 8932 |
Interquartile range (IQR) | 4552 |
Descriptive statistics
Standard deviation | 3094.4732 |
---|---|
Coefficient of variation (CV) | 1.3309008 |
Kurtosis | -0.41902481 |
Mean | 2325.0968 |
Median Absolute Deviation (MAD) | 535 |
Skewness | 1.094198 |
Sum | 72078 |
Variance | 9575764.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8934 | 1 | 3.2% |
320 | 1 | 3.2% |
6 | 1 | 3.2% |
164 | 1 | 3.2% |
52 | 1 | 3.2% |
653 | 1 | 3.2% |
141 | 1 | 3.2% |
571 | 1 | 3.2% |
25 | 1 | 3.2% |
862 | 1 | 3.2% |
Other values (21) | 21 |
Value | Count | Frequency (%) |
2 | 1 | |
6 | 1 | |
18 | 1 | |
19 | 1 | |
25 | 1 | |
45 | 1 | |
46 | 1 | |
52 | 1 | |
141 | 1 | |
164 | 1 |
Value | Count | Frequency (%) |
8934 | 1 | |
8864 | 1 | |
7863 | 1 | |
7299 | 1 | |
6659 | 1 | |
6188 | 1 | |
5896 | 1 | |
5719 | 1 | |
3578 | 1 | |
3424 | 1 |
E9_02(건설업)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 19.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 57.774194 |
Minimum | 0 |
---|---|
Maximum | 1015 |
Zeros | 26 |
Zeros (%) | 83.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 326.5 |
Maximum | 1015 |
Range | 1015 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 195.58335 |
---|---|
Coefficient of variation (CV) | 3.3853065 |
Kurtosis | 20.338606 |
Mean | 57.774194 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.3383672 |
Sum | 1791 |
Variance | 38252.847 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 26 | |
65 | 1 | 3.2% |
331 | 1 | 3.2% |
322 | 1 | 3.2% |
58 | 1 | 3.2% |
1015 | 1 | 3.2% |
Value | Count | Frequency (%) |
0 | 26 | |
58 | 1 | 3.2% |
65 | 1 | 3.2% |
322 | 1 | 3.2% |
331 | 1 | 3.2% |
1015 | 1 | 3.2% |
Value | Count | Frequency (%) |
1015 | 1 | 3.2% |
331 | 1 | 3.2% |
322 | 1 | 3.2% |
65 | 1 | 3.2% |
58 | 1 | 3.2% |
0 | 26 |
E9_03(농업)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 12 |
---|---|
Distinct (%) | 38.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 433.29032 |
Minimum | 0 |
---|---|
Maximum | 4147 |
Zeros | 20 |
Zeros (%) | 64.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 489.5 |
95-th percentile | 2140 |
Maximum | 4147 |
Range | 4147 |
Interquartile range (IQR) | 489.5 |
Descriptive statistics
Standard deviation | 937.53806 |
---|---|
Coefficient of variation (CV) | 2.1637641 |
Kurtosis | 8.6712569 |
Mean | 433.29032 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.8477487 |
Sum | 13432 |
Variance | 878977.61 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 20 | |
4147 | 1 | 3.2% |
1419 | 1 | 3.2% |
727 | 1 | 3.2% |
326 | 1 | 3.2% |
1357 | 1 | 3.2% |
2861 | 1 | 3.2% |
653 | 1 | 3.2% |
6 | 1 | 3.2% |
1071 | 1 | 3.2% |
Other values (2) | 2 | 6.5% |
Value | Count | Frequency (%) |
0 | 20 | |
3 | 1 | 3.2% |
6 | 1 | 3.2% |
326 | 1 | 3.2% |
653 | 1 | 3.2% |
727 | 1 | 3.2% |
862 | 1 | 3.2% |
1071 | 1 | 3.2% |
1357 | 1 | 3.2% |
1419 | 1 | 3.2% |
Value | Count | Frequency (%) |
4147 | 1 | |
2861 | 1 | |
1419 | 1 | |
1357 | 1 | |
1071 | 1 | |
862 | 1 | |
727 | 1 | |
653 | 1 | |
326 | 1 | |
6 | 1 |
E9_04(어업)
Real number (ℝ)
ZEROS
 
Distinct | 8 |
---|---|
Distinct (%) | 25.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 232 |
Minimum | 0 |
---|---|
Maximum | 3185 |
Zeros | 24 |
Zeros (%) | 77.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 1605 |
Maximum | 3185 |
Range | 3185 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 701.3391 |
---|---|
Coefficient of variation (CV) | 3.0230134 |
Kurtosis | 11.68435 |
Mean | 232 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.3929277 |
Sum | 7192 |
Variance | 491876.53 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 24 | |
3185 | 1 | 3.2% |
3 | 1 | 3.2% |
2108 | 1 | 3.2% |
4 | 1 | 3.2% |
1102 | 1 | 3.2% |
1 | 1 | 3.2% |
789 | 1 | 3.2% |
Value | Count | Frequency (%) |
0 | 24 | |
1 | 1 | 3.2% |
3 | 1 | 3.2% |
4 | 1 | 3.2% |
789 | 1 | 3.2% |
1102 | 1 | 3.2% |
2108 | 1 | 3.2% |
3185 | 1 | 3.2% |
Value | Count | Frequency (%) |
3185 | 1 | 3.2% |
2108 | 1 | 3.2% |
1102 | 1 | 3.2% |
789 | 1 | 3.2% |
4 | 1 | 3.2% |
3 | 1 | 3.2% |
1 | 1 | 3.2% |
0 | 24 |
E9_05(서비스업)
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 16.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
0 | |
---|---|
77 | 1 |
27 | 1 |
2 | 1 |
1 | 1 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.0645161 |
Min length | 1 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | 12.9% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 27 | |
77 | 1 | 3.2% |
27 | 1 | 3.2% |
2 | 1 | 3.2% |
1 | 1 | 3.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 27 | |
77 | 1 | 3.2% |
27 | 1 | 3.2% |
2 | 1 | 3.2% |
1 | 1 | 3.2% |
국적 | 성별 | E9_01(제조업) | E9_02(건설업) | E9_03(농업) | E9_04(어업) | E9_05(서비스업) | |
---|---|---|---|---|---|---|---|
국적 | 1.000 | 0.000 | 0.694 | 0.681 | 0.809 | 0.000 | 0.223 |
성별 | 0.000 | 1.000 | 0.790 | 0.119 | 0.000 | 0.087 | 0.015 |
E9_01(제조업) | 0.694 | 0.790 | 1.000 | 0.896 | 0.643 | 0.519 | 0.348 |
E9_02(건설업) | 0.681 | 0.119 | 0.896 | 1.000 | 0.893 | 0.000 | 0.000 |
E9_03(농업) | 0.809 | 0.000 | 0.643 | 0.893 | 1.000 | 0.000 | 0.000 |
E9_04(어업) | 0.000 | 0.087 | 0.519 | 0.000 | 0.000 | 1.000 | 0.000 |
E9_05(서비스업) | 0.223 | 0.015 | 0.348 | 0.000 | 0.000 | 0.000 | 1.000 |
E9_05(서비스업) | 성별 | |
---|---|---|
E9_05(서비스업) | 1.000 | 0.000 |
성별 | 0.000 | 1.000 |
E9_01(제조업) | E9_02(건설업) | E9_03(농업) | E9_04(어업) | 성별 | E9_05(서비스업) | |
---|---|---|---|---|---|---|
E9_01(제조업) | 1.000 | 0.472 | 0.506 | 0.232 | 0.540 | 0.188 |
E9_02(건설업) | 0.472 | 1.000 | 0.534 | 0.176 | 0.187 | 0.000 |
E9_03(농업) | 0.506 | 0.534 | 1.000 | 0.018 | 0.000 | 0.000 |
E9_04(어업) | 0.232 | 0.176 | 0.018 | 1.000 | 0.076 | 0.000 |
성별 | 0.540 | 0.187 | 0.000 | 0.076 | 1.000 | 0.000 |
E9_05(서비스업) | 0.188 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
국적 | 성별 | E9_01(제조업) | E9_02(건설업) | E9_03(농업) | E9_04(어업) | E9_05(서비스업) | |
---|---|---|---|---|---|---|---|
0 | 네팔 | 남성 | 8934 | 0 | 4147 | 0 | 0 |
1 | 네팔 | 여성 | 320 | 0 | 1419 | 0 | 0 |
2 | 인도네시아 | 남성 | 8864 | 0 | 0 | 3185 | 0 |
3 | 인도네시아 | 여성 | 271 | 0 | 0 | 3 | 0 |
4 | 베트남 | 남성 | 7863 | 65 | 727 | 2108 | 0 |
5 | 베트남 | 여성 | 832 | 0 | 326 | 4 | 0 |
6 | 캄보디아 | 남성 | 6188 | 331 | 1357 | 0 | 0 |
7 | 캄보디아 | 여성 | 560 | 0 | 2861 | 0 | 0 |
8 | 필리핀 | 남성 | 7299 | 0 | 0 | 0 | 0 |
9 | 필리핀 | 여성 | 464 | 0 | 0 | 0 | 0 |
국적 | 성별 | E9_01(제조업) | E9_02(건설업) | E9_03(농업) | E9_04(어업) | E9_05(서비스업) | |
---|---|---|---|---|---|---|---|
21 | 티모르민주공화국 | 남성 | 256 | 0 | 0 | 789 | 0 |
22 | 티모르민주공화국 | 여성 | 19 | 0 | 0 | 0 | 0 |
23 | 키르기즈 | 남성 | 862 | 0 | 0 | 0 | 0 |
24 | 키르기즈 | 여성 | 25 | 0 | 0 | 0 | 0 |
25 | 몽골 | 남성 | 571 | 0 | 0 | 0 | 27 |
26 | 몽골 | 여성 | 141 | 0 | 0 | 0 | 2 |
27 | 라오스 | 남성 | 653 | 0 | 0 | 0 | 0 |
28 | 라오스 | 여성 | 52 | 0 | 0 | 0 | 0 |
29 | 중국 | 남성 | 164 | 0 | 0 | 0 | 1 |
30 | 중국 | 여성 | 6 | 0 | 0 | 0 | 0 |