Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 109 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.7 KiB |
Average record size in memory | 63.2 B |
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 5 |
Dataset
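Description | Earned income percentile data (top 1% broken into 1,000-quantile steps). Fields: number of persons (인원, persons); total salary (총급여액, 100 million KRW); earned income amount (근로소득금액, 100 million KRW); income deductions (소득공제액, 100 million KRW) = earned income deduction + personal deductions + pension insurance premium deduction + special income deductions + other income deductions - amount exceeding the income deduction limit; tax base (과세표준, 100 million KRW); assessed tax (결정세액, 100 million KRW) |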
---|---|
Author | National Tax Service (국세청) |
URL | https://www.data.go.kr/data/15082063/fileData.do |
Alerts
총급여 is highly overall correlated with 근로소득금액 and 3 other fields | High correlation |
근로소득금액 is highly overall correlated with 총급여 and 3 other fields | High correlation |
소득공제액 is highly overall correlated with 총급여 and 3 other fields | High correlation |
과세표준 is highly overall correlated with 총급여 and 3 other fields | High correlation |
결정세액 is highly overall correlated with 총급여 and 3 other fields | High correlation |
구분 has unique values | Unique |
총급여 has unique values | Unique |
근로소득금액 has unique values | Unique |
소득공제액 has unique values | Unique |
과세표준 has 7 (6.4%) zeros | Zeros |
결정세액 has 18 (16.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-14 17:24:55.852982 |
---|---|
Analysis finished | 2024-03-14 17:25:05.107741 |
Duration | 9.25 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
구분
Text
UNIQUE
 
Distinct | 109 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1000.0 B |
Length
Max length | 9 |
---|---|
Median length | 6 |
Mean length | 6.2110092 |
Min length | 5 |
Characters and Unicode
Total characters | 677 |
---|---|
Distinct characters | 16 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 109 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 상위 0.1% 내 |
---|---|
2nd row | 상위 0.2% 내 |
3rd row | 상위 0.3% 내 |
4th row | 상위 0.4% 내 |
5th row | 상위 0.5% 내 |
Value | Count | Frequency (%) |
상위 | 10 | 7.8% |
내 | 10 | 7.8% |
상위60%내 | 1 | 0.8% |
상위71%내 | 1 | 0.8% |
상위70%내 | 1 | 0.8% |
상위69%내 | 1 | 0.8% |
상위68%내 | 1 | 0.8% |
상위67%내 | 1 | 0.8% |
상위66%내 | 1 | 0.8% |
상위65%내 | 1 | 0.8% |
Other values (101) | 101 | 78.3% |
Most occurring characters
Value | Count | Frequency (%) |
상 | 109 | 16.1% |
위 | 109 | 16.1% |
% | 109 | 16.1% |
내 | 109 | 16.1% |
1 | 22 | 3.2% |
0 | 21 | 3.1% |
4 | 21 | 3.1% |
5 | 21 | 3.1% |
3 | 21 | 3.1% |
6 | 21 | 3.1% |
Other values (6) | 114 | 16.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 327 | 48.3% |
Decimal Number | 211 | 31.2% |
Other Punctuation | 119 | 17.6% |
Space Separator | 20 | 3.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 22 | 10.4% |
0 | 21 | 10.0% |
4 | 21 | 10.0% |
5 | 21 | 10.0% |
3 | 21 | 10.0% |
6 | 21 | 10.0% |
7 | 21 | 10.0% |
8 | 21 | 10.0% |
9 | 21 | 10.0% |
2 | 21 | 10.0% |
Other Letter
Value | Count | Frequency (%) |
상 | 109 | 33.3% |
위 | 109 | 33.3% |
내 | 109 | 33.3% |
Other Punctuation
Value | Count | Frequency (%) |
% | 109 | 91.6% |
. | 10 | 8.4% |
Space Separator
Value | Count | Frequency (%) |
(space) | 20 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 350 | 51.7% |
Hangul | 327 | 48.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
% | 109 | 31.1% |
1 | 22 | 6.3% |
0 | 21 | 6.0% |
4 | 21 | 6.0% |
5 | 21 | 6.0% |
3 | 21 | 6.0% |
6 | 21 | 6.0% |
7 | 21 | 6.0% |
8 | 21 | 6.0% |
9 | 21 | 6.0% |
Other values (3) | 51 | 14.6% |
Hangul
Value | Count | Frequency (%) |
상 | 109 | 33.3% |
위 | 109 | 33.3% |
내 | 109 | 33.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 350 | 51.7% |
Hangul | 327 | 48.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
상 | 109 | 33.3% |
위 | 109 | 33.3% |
내 | 109 | 33.3% |
ASCII
Value | Count | Frequency (%) |
% | 109 | 31.1% |
1 | 22 | 6.3% |
0 | 21 | 6.0% |
4 | 21 | 6.0% |
5 | 21 | 6.0% |
3 | 21 | 6.0% |
6 | 21 | 6.0% |
7 | 21 | 6.0% |
8 | 21 | 6.0% |
9 | 21 | 6.0% |
Other values (3) | 51 | 14.6% |
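The character statistics above show that 구분 labels are not formatted uniformly: only 20 spaces and 10 decimal points occur across 109 labels, which matches the samples ("상위 0.1% 내" for the top 1% rows, "상위91%내" elsewhere). A minimal sketch of how the labels could be normalized into a numeric percentile before sorting or plotting is given here. The function name `parse_percentile` and the regular expression are illustrative assumptions, not part of the report or the source dataset.

```python
import re

# Accepts both spellings seen in the report: "상위 0.1% 내" and "상위91%내".
# Whitespace around the number is optional; the decimal part is optional.
LABEL_RE = re.compile(r"^상위\s*(\d+(?:\.\d+)?)%\s*내$")


def parse_percentile(label: str) -> float:
    """Return the percentile figure embedded in a 구분 label (hypothetical helper)."""
    match = LABEL_RE.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized label: {label!r}")
    return float(match.group(1))


labels = ["상위 0.1% 내", "상위 1.0% 내", "상위60%내", "상위100%내"]
print([parse_percentile(s) for s in labels])  # [0.1, 1.0, 60.0, 100.0]
```

Raising on an unrecognized label, rather than returning a default, keeps a malformed row from silently ending up at the wrong position.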
인원
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 3.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1000.0 B |
Value | Count |
---|---|
205396 | 85 |
205397 | 14 |
20540 | 6 |
20539 | 4 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.9082569 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20539 |
---|---|
2nd row | 20540 |
3rd row | 20539 |
4th row | 20540 |
5th row | 20540 |
Common Values
Value | Count | Frequency (%) |
205396 | 85 | 78.0% |
205397 | 14 | 12.8% |
20540 | 6 | 5.5% |
20539 | 4 | 3.7% |
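Because 인원 takes only four distinct values, the Common Values table is enough to recompute several of the figures reported for this variable. The sketch below uses only the counts shown above and treats each value as text, which is how the length statistics appear to have been derived; that reading is an assumption. The variable names are illustrative.

```python
# (value as stored text, number of rows) copied from the Common Values table.
common_values = [("205396", 85), ("205397", 14), ("20540", 6), ("20539", 4)]

rows = sum(count for _, count in common_values)
total_persons = sum(int(value) * count for value, count in common_values)
mean_length = sum(len(value) * count for value, count in common_values) / rows

print(rows)                   # 109, the number of observations
print(total_persons)          # 20539614, the sum of the 인원 column
print(round(mean_length, 7))  # 5.9082569, the reported mean length
```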
총급여
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 109 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 79400.495 |
Minimum | 433 |
---|---|
Maximum | 339553 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 433 |
---|---|
5-th percentile | 7833.6 |
Q1 | 39637 |
median | 61515 |
Q3 | 102613 |
95-th percentile | 203910.4 |
Maximum | 339553 |
Range | 339120 |
Interquartile range (IQR) | 62976 |
Descriptive statistics
Standard deviation | 63459.94 |
---|---|
Coefficient of variation (CV) | 0.79923859 |
Kurtosis | 2.8759159 |
Mean | 79400.495 |
Median Absolute Deviation (MAD) | 29692 |
Skewness | 1.5574539 |
Sum | 8654654 |
Variance | 4.027164 × 10⁹ |
Monotonicity | Not monotonic |
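Several of the 총급여 statistics above are simple functions of one another, so they can be cross-checked from the reported figures alone. The sketch below assumes the usual definitions (mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum); the inputs are the rounded values printed in the report, so results agree only up to rounding.

```python
# Figures copied from the 총급여 tables in this report.
n = 109
total = 8654654
std = 63459.94
q1, q3 = 39637, 102613
minimum, maximum = 433, 339553

mean = total / n        # arithmetic mean
cv = std / mean         # coefficient of variation
variance = std ** 2     # variance from the standard deviation
iqr = q3 - q1           # interquartile range
value_range = maximum - minimum

print(f"{mean:.3f}")       # 79400.495
print(f"{cv:.6f}")         # 0.799239
print(f"{variance:.6e}")   # 4.027164e+09
print(iqr, value_range)    # 62976 339120
```

The same checks apply to the other numeric variables by substituting their reported values.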
Common values
Value | Count | Frequency (%) |
202921 | 1 | 0.9% |
52961 | 1 | 0.9% |
42689 | 1 | 0.9% |
44141 | 1 | 0.9% |
45387 | 1 | 0.9% |
46648 | 1 | 0.9% |
47242 | 1 | 0.9% |
47853 | 1 | 0.9% |
48801 | 1 | 0.9% |
49330 | 1 | 0.9% |
Other values (99) | 99 | 90.8% |
Minimum 10 values
Value | Count | Frequency (%) |
433 | 1 | 0.9% |
1781 | 1 | 0.9% |
3183 | 1 | 0.9% |
4429 | 1 | 0.9% |
5809 | 1 | 0.9% |
7302 | 1 | 0.9% |
8631 | 1 | 0.9% |
10117 | 1 | 0.9% |
11685 | 1 | 0.9% |
13001 | 1 | 0.9% |
Maximum 10 values
Value | Count | Frequency (%) |
339553 | 1 | 0.9% |
285277 | 1 | 0.9% |
254864 | 1 | 0.9% |
234303 | 1 | 0.9% |
217721 | 1 | 0.9% |
204570 | 1 | 0.9% |
202921 | 1 | 0.9% |
194353 | 1 | 0.9% |
185584 | 1 | 0.9% |
177859 | 1 | 0.9% |
근로소득금액
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 109 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 61085.771 |
Minimum | 130 |
---|---|
Maximum | 306645 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 130 |
---|---|
5-th percentile | 2350.2 |
Q1 | 24244 |
median | 44377 |
Q3 | 79906 |
95-th percentile | 182047.6 |
Maximum | 306645 |
Range | 306515 |
Interquartile range (IQR) | 55662 |
Descriptive statistics
Standard deviation | 57531.038 |
---|---|
Coefficient of variation (CV) | 0.94180752 |
Kurtosis | 3.6520725 |
Mean | 61085.771 |
Median Absolute Deviation (MAD) | 26464 |
Skewness | 1.7737694 |
Sum | 6658349 |
Variance | 3.3098204 × 10⁹ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
198917 | 1 | 0.9% |
34234 | 1 | 0.9% |
25502 | 1 | 0.9% |
26736 | 1 | 0.9% |
27796 | 1 | 0.9% |
28868 | 1 | 0.9% |
29373 | 1 | 0.9% |
29892 | 1 | 0.9% |
30697 | 1 | 0.9% |
31148 | 1 | 0.9% |
Other values (99) | 99 | 90.8% |
Minimum 10 values
Value | Count | Frequency (%) |
130 | 1 | 0.9% |
534 | 1 | 0.9% |
955 | 1 | 0.9% |
1329 | 1 | 0.9% |
1743 | 1 | 0.9% |
2191 | 1 | 0.9% |
2589 | 1 | 0.9% |
3070 | 1 | 0.9% |
3930 | 1 | 0.9% |
4720 | 1 | 0.9% |
Maximum 10 values
Value | Count | Frequency (%) |
306645 | 1 | 0.9% |
253399 | 1 | 0.9% |
223582 | 1 | 0.9% |
203432 | 1 | 0.9% |
198917 | 1 | 0.9% |
187180 | 1 | 0.9% |
174349 | 1 | 0.9% |
164610 | 1 | 0.9% |
156280 | 1 | 0.9% |
148941 | 1 | 0.9% |
소득공제액
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 109 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33994.339 |
Minimum | 433 |
---|---|
Maximum | 74304 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 433 |
---|---|
5-th percentile | 7460.8 |
Q1 | 18570 |
median | 32954 |
Q3 | 48147 |
95-th percentile | 64354.2 |
Maximum | 74304 |
Range | 73871 |
Interquartile range (IQR) | 29577 |
Descriptive statistics
Standard deviation | 18823.669 |
---|---|
Coefficient of variation (CV) | 0.55372953 |
Kurtosis | -0.91732801 |
Mean | 33994.339 |
Median Absolute Deviation (MAD) | 14655 |
Skewness | 0.13945614 |
Sum | 3705383 |
Variance | 3.5433053 × 10⁸ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
13206 | 1 | 0.9% |
30843 | 1 | 0.9% |
26317 | 1 | 0.9% |
27165 | 1 | 0.9% |
27558 | 1 | 0.9% |
28201 | 1 | 0.9% |
28919 | 1 | 0.9% |
28683 | 1 | 0.9% |
29902 | 1 | 0.9% |
28829 | 1 | 0.9% |
Other values (99) | 99 | 90.8% |
Minimum 10 values
Value | Count | Frequency (%) |
433 | 1 | 0.9% |
1781 | 1 | 0.9% |
3183 | 1 | 0.9% |
4429 | 1 | 0.9% |
5809 | 1 | 0.9% |
7302 | 1 | 0.9% |
7699 | 1 | 0.9% |
7748 | 1 | 0.9% |
7837 | 1 | 0.9% |
7916 | 1 | 0.9% |
Maximum 10 values
Value | Count | Frequency (%) |
74304 | 1 | 0.9% |
70644 | 1 | 0.9% |
68309 | 1 | 0.9% |
67732 | 1 | 0.9% |
65952 | 1 | 0.9% |
64603 | 1 | 0.9% |
63981 | 1 | 0.9% |
63804 | 1 | 0.9% |
63075 | 1 | 0.9% |
62261 | 1 | 0.9% |
과세표준
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 103 |
---|---|
Distinct (%) | 94.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 45406.183 |
Minimum | 0 |
---|---|
Maximum | 265250 |
Zeros | 7 |
Zeros (%) | 6.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 14992 |
median | 31576 |
Q3 | 57727 |
95-th percentile | 147048.6 |
Maximum | 265250 |
Range | 265250 |
Interquartile range (IQR) | 42735 |
Descriptive statistics
Standard deviation | 48556.587 |
---|---|
Coefficient of variation (CV) | 1.0693827 |
Kurtosis | 5.1204881 |
Mean | 45406.183 |
Median Absolute Deviation (MAD) | 20365 |
Skewness | 2.086993 |
Sum | 4949274 |
Variance | 2.3577421 × 10⁹ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 7 | 6.4% |
189715 | 1 | 0.9% |
24533 | 1 | 0.9% |
19170 | 1 | 0.9% |
18899 | 1 | 0.9% |
20501 | 1 | 0.9% |
20294 | 1 | 0.9% |
20908 | 1 | 0.9% |
21499 | 1 | 0.9% |
22119 | 1 | 0.9% |
Other values (93) | 93 | 85.3% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 7 | 6.4% |
21 | 1 | 0.9% |
357 | 1 | 0.9% |
1055 | 1 | 0.9% |
1758 | 1 | 0.9% |
2390 | 1 | 0.9% |
3114 | 1 | 0.9% |
3863 | 1 | 0.9% |
4563 | 1 | 0.9% |
5326 | 1 | 0.9% |
Maximum 10 values
Value | Count | Frequency (%) |
265250 | 1 | 0.9% |
214634 | 1 | 0.9% |
189715 | 1 | 0.9% |
186554 | 1 | 0.9% |
166571 | 1 | 0.9% |
151769 | 1 | 0.9% |
139968 | 1 | 0.9% |
130372 | 1 | 0.9% |
121780 | 1 | 0.9% |
114784 | 1 | 0.9% |
결정세액
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 92 |
---|---|
Distinct (%) | 84.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5427.1651 |
Minimum | 0 |
---|---|
Maximum | 72145 |
Zeros | 18 |
Zeros (%) | 16.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 135 |
median | 827 |
Q3 | 6185 |
95-th percentile | 23494.4 |
Maximum | 72145 |
Range | 72145 |
Interquartile range (IQR) | 6050 |
Descriptive statistics
Standard deviation | 10937.154 |
---|---|
Coefficient of variation (CV) | 2.0152609 |
Kurtosis | 17.441005 |
Mean | 5427.1651 |
Median Absolute Deviation (MAD) | 827 |
Skewness | 3.7838279 |
Sum | 591561 |
Variance | 1.1962133 × 10⁸ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 18 | 16.5% |
72145 | 1 | 0.9% |
599 | 1 | 0.9% |
322 | 1 | 0.9% |
351 | 1 | 0.9% |
370 | 1 | 0.9% |
394 | 1 | 0.9% |
423 | 1 | 0.9% |
454 | 1 | 0.9% |
498 | 1 | 0.9% |
Other values (82) | 82 | 75.2% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 18 | 16.5% |
1 | 1 | 0.9% |
6 | 1 | 0.9% |
15 | 1 | 0.9% |
32 | 1 | 0.9% |
52 | 1 | 0.9% |
70 | 1 | 0.9% |
86 | 1 | 0.9% |
106 | 1 | 0.9% |
122 | 1 | 0.9% |
Maximum 10 values
Value | Count | Frequency (%) |
72145 | 1 | 0.9% |
58046 | 1 | 0.9% |
40658 | 1 | 0.9% |
31378 | 1 | 0.9% |
25771 | 1 | 0.9% |
24304 | 1 | 0.9% |
22280 | 1 | 0.9% |
19567 | 1 | 0.9% |
17492 | 1 | 0.9% |
17314 | 1 | 0.9% |
 | 인원 | 총급여 | 근로소득금액 | 소득공제액 | 과세표준 | 결정세액 |
---|---|---|---|---|---|---|
인원 | 1.000 | 0.000 | 0.000 | 0.483 | 0.098 | 0.759 |
총급여 | 0.000 | 1.000 | 0.983 | 0.935 | 0.993 | 0.916 |
근로소득금액 | 0.000 | 0.983 | 1.000 | 0.951 | 0.979 | 0.942 |
소득공제액 | 0.483 | 0.935 | 0.951 | 1.000 | 0.903 | 0.654 |
과세표준 | 0.098 | 0.993 | 0.979 | 0.903 | 1.000 | 0.916 |
결정세액 | 0.759 | 0.916 | 0.942 | 0.654 | 0.916 | 1.000 |
 | 총급여 | 근로소득금액 | 소득공제액 | 과세표준 | 결정세액 | 인원 |
---|---|---|---|---|---|---|
총급여 | 1.000 | 0.988 | 0.908 | 0.964 | 0.873 | 0.000 |
근로소득금액 | 0.988 | 1.000 | 0.852 | 0.993 | 0.933 | 0.000 |
소득공제액 | 0.908 | 0.852 | 1.000 | 0.795 | 0.637 | 0.289 |
과세표준 | 0.964 | 0.993 | 0.795 | 1.000 | 0.966 | 0.050 |
결정세액 | 0.873 | 0.933 | 0.637 | 0.966 | 1.000 | 0.417 |
인원 | 0.000 | 0.000 | 0.289 | 0.050 | 0.417 | 1.000 |
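The "High correlation" alerts near the top of the report each name one field "and 3 other fields", i.e. four correlated partners per numeric variable, and raise no alert for 인원. The sketch below shows one way those counts can be recovered from the second correlation matrix above by thresholding. The cutoff of 0.5 is an assumption made for illustration; the report itself does not state which threshold or which matrix produced the alerts. The helper name `high_correlations` is hypothetical.

```python
fields = ["총급여", "근로소득금액", "소득공제액", "과세표준", "결정세액", "인원"]

# Second correlation matrix from this report, rows and columns in `fields` order.
matrix = [
    [1.000, 0.988, 0.908, 0.964, 0.873, 0.000],
    [0.988, 1.000, 0.852, 0.993, 0.933, 0.000],
    [0.908, 0.852, 1.000, 0.795, 0.637, 0.289],
    [0.964, 0.993, 0.795, 1.000, 0.966, 0.050],
    [0.873, 0.933, 0.637, 0.966, 1.000, 0.417],
    [0.000, 0.000, 0.289, 0.050, 0.417, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not taken from the report


def high_correlations(field, threshold=THRESHOLD):
    """List the other fields whose absolute correlation with `field` meets the cutoff."""
    row = matrix[fields.index(field)]
    return [
        other
        for other, value in zip(fields, row)
        if other != field and abs(value) >= threshold
    ]


for field in fields:
    partners = high_correlations(field)
    print(field, len(partners), partners)
```

Under this assumed cutoff each of the five numeric fields has four partners and 인원 has none, which is consistent with the alerts. Applying the same cutoff to the first matrix would pair 인원 with 결정세액 (0.759), which the alerts do not report.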
First rows
 | 구분 | 인원 | 총급여 | 근로소득금액 | 소득공제액 | 과세표준 | 결정세액 |
---|---|---|---|---|---|---|---|
0 | 상위 0.1% 내 | 20539 | 202921 | 198917 | 13206 | 189715 | 72145 |
1 | 상위 0.2% 내 | 20540 | 85586 | 81555 | 9801 | 75785 | 24304 |
2 | 상위 0.3% 내 | 20539 | 67201 | 63297 | 8978 | 58224 | 17492 |
3 | 상위 0.4% 내 | 20540 | 57912 | 54184 | 8497 | 49414 | 14199 |
4 | 상위 0.5% 내 | 20540 | 51903 | 48285 | 8201 | 43702 | 12073 |
5 | 상위 0.6% 내 | 20539 | 47924 | 44377 | 8001 | 39923 | 10666 |
6 | 상위 0.7% 내 | 20540 | 44818 | 41328 | 7916 | 36901 | 9548 |
7 | 상위 0.8% 내 | 20539 | 42476 | 39033 | 7837 | 34639 | 8696 |
8 | 상위 0.9% 내 | 20540 | 40673 | 37260 | 7748 | 32925 | 8066 |
9 | 상위 1.0% 내 | 20540 | 39154 | 35769 | 7699 | 31455 | 7521 |
Last rows
 | 구분 | 인원 | 총급여 | 근로소득금액 | 소득공제액 | 과세표준 | 결정세액 |
---|---|---|---|---|---|---|---|
99 | 상위91%내 | 205396 | 13001 | 4720 | 11947 | 1055 | 0 |
100 | 상위92%내 | 205396 | 11685 | 3930 | 11328 | 357 | 0 |
101 | 상위93%내 | 205397 | 10117 | 3070 | 10096 | 21 | 0 |
102 | 상위94%내 | 205396 | 8631 | 2589 | 8631 | 0 | 0 |
103 | 상위95%내 | 205396 | 7302 | 2191 | 7302 | 0 | 0 |
104 | 상위96%내 | 205396 | 5809 | 1743 | 5809 | 0 | 0 |
105 | 상위97%내 | 205396 | 4429 | 1329 | 4429 | 0 | 0 |
106 | 상위98%내 | 205396 | 3183 | 955 | 3183 | 0 | 0 |
107 | 상위99%내 | 205396 | 1781 | 534 | 1781 | 0 | 0 |
108 | 상위100%내 | 205397 | 433 | 130 | 433 | 0 | 0 |