Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 64 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.5 KiB |
Average record size in memory | 72.1 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 6 |
Dataset
Description | 국세환급현황을 국세통계로 제공 - 지역별 등 구분하여 제공(종합소득세, 법인세, 부가가치세, 양도소득세, 상속 · 증여세 등) |
---|---|
URL | https://www.data.go.kr/data/3059447/fileData.do |
종합소득세(백만원) is highly overall correlated with 법인세(백만원) and 4 other fields | High correlation |
법인세(백만원) is highly overall correlated with 종합소득세(백만원) and 5 other fields | High correlation |
부가가치세(백만원) is highly overall correlated with 종합소득세(백만원) and 4 other fields | High correlation |
양도소득세(백만원) is highly overall correlated with 종합소득세(백만원) and 4 other fields | High correlation |
상속_증여세(백만원) is highly overall correlated with 종합소득세(백만원) and 5 other fields | High correlation |
기타(백만원) is highly overall correlated with 종합소득세(백만원) and 4 other fields | High correlation |
시도별 is highly overall correlated with 법인세(백만원) and 1 other fields | High correlation |
법인세(백만원) has 11 (17.2%) zeros | Zeros |
부가가치세(백만원) has 1 (1.6%) zeros | Zeros |
양도소득세(백만원) has 12 (18.8%) zeros | Zeros |
상속_증여세(백만원) has 15 (23.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 05:12:50.263039 |
---|---|
Analysis finished | 2023-12-12 05:12:54.428611 |
Duration | 4.17 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구분
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 644.0 B |
발생액 | |
---|---|
지급액 | |
충당액 | |
미처리 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 발생액 |
---|---|
2nd row | 발생액 |
3rd row | 발생액 |
4th row | 발생액 |
5th row | 발생액 |
Common Values
Value | Count | Frequency (%) |
발생액 | 16 | |
지급액 | 16 | |
충당액 | 16 | |
미처리 | 16 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
발생액 | 16 | |
지급액 | 16 | |
충당액 | 16 | |
미처리 | 16 |
시도별
Categorical
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 25.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 644.0 B |
서울 | 4 |
---|---|
인천 | 4 |
경기 | 4 |
강원 | 4 |
대전 | 4 |
Other values (11) |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 4.1875 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 인천 |
3rd row | 경기 |
4th row | 강원 |
5th row | 대전 |
Common Values
Value | Count | Frequency (%) |
서울 | 4 | 6.2% |
인천 | 4 | 6.2% |
경기 | 4 | 6.2% |
강원 | 4 | 6.2% |
대전 | 4 | 6.2% |
충북 | 4 | 6.2% |
충남 세종 | 4 | 6.2% |
광주 | 4 | 6.2% |
전북 | 4 | 6.2% |
전남 | 4 | 6.2% |
Other values (6) | 24 |
Length
Value | Count | Frequency (%) |
서울 | 4 | 5.9% |
전북 | 4 | 5.9% |
경남 | 4 | 5.9% |
울산 | 4 | 5.9% |
부산 | 4 | 5.9% |
경북 | 4 | 5.9% |
대구 | 4 | 5.9% |
전남 | 4 | 5.9% |
광주 | 4 | 5.9% |
인천 | 4 | 5.9% |
Other values (7) | 28 |
종합소득세(백만원)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 62 |
---|---|
Distinct (%) | 96.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 99223.469 |
Minimum | 16 |
---|---|
Maximum | 931873 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 708.0 B |
Quantile statistics
Minimum | 16 |
---|---|
5-th percentile | 28.15 |
Q1 | 1572 |
median | 35067.5 |
Q3 | 85679 |
95-th percentile | 756353.35 |
Maximum | 931873 |
Range | 931857 |
Interquartile range (IQR) | 84107 |
Descriptive statistics
Standard deviation | 213505.86 |
---|---|
Coefficient of variation (CV) | 2.1517677 |
Kurtosis | 10.266552 |
Mean | 99223.469 |
Median Absolute Deviation (MAD) | 34979.5 |
Skewness | 3.3263994 |
Sum | 6350302 |
Variance | 4.5584752 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19 | 2 | 3.1% |
40 | 2 | 3.1% |
890505 | 1 | 1.6% |
10222 | 1 | 1.6% |
40292 | 1 | 1.6% |
3850 | 1 | 1.6% |
3609 | 1 | 1.6% |
2426 | 1 | 1.6% |
4009 | 1 | 1.6% |
3354 | 1 | 1.6% |
Other values (52) | 52 |
Value | Count | Frequency (%) |
16 | 1 | |
19 | 2 | |
28 | 1 | |
29 | 1 | |
31 | 1 | |
37 | 1 | |
40 | 2 | |
50 | 1 | |
65 | 1 | |
67 | 1 |
Value | Count | Frequency (%) |
931873 | 1 | |
891297 | 1 | |
890505 | 1 | |
855586 | 1 | |
194035 | 1 | |
185354 | 1 | |
183582 | 1 | |
178837 | 1 | |
138371 | 1 | |
137571 | 1 |
법인세(백만원)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 54 |
---|---|
Distinct (%) | 84.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 234234.22 |
Minimum | -26 |
---|---|
Maximum | 4720013 |
Zeros | 11 |
Zeros (%) | 17.2% |
Negative | 1 |
Negative (%) | 1.6% |
Memory size | 708.0 B |
Quantile statistics
Minimum | -26 |
---|---|
5-th percentile | 0 |
Q1 | 1092.5 |
median | 42579 |
Q3 | 108220.75 |
95-th percentile | 976003.15 |
Maximum | 4720013 |
Range | 4720039 |
Interquartile range (IQR) | 107128.25 |
Descriptive statistics
Standard deviation | 827957.04 |
---|---|
Coefficient of variation (CV) | 3.5347399 |
Kurtosis | 25.754439 |
Mean | 234234.22 |
Median Absolute Deviation (MAD) | 42579 |
Skewness | 5.0787231 |
Sum | 14990990 |
Variance | 6.8551285 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 11 | 17.2% |
4720013 | 1 | 1.6% |
16018 | 1 | 1.6% |
133930 | 1 | 1.6% |
27479 | 1 | 1.6% |
95323 | 1 | 1.6% |
6302 | 1 | 1.6% |
56233 | 1 | 1.6% |
1458 | 1 | 1.6% |
1745 | 1 | 1.6% |
Other values (44) | 44 |
Value | Count | Frequency (%) |
-26 | 1 | 1.6% |
0 | 11 | |
1 | 1 | 1.6% |
2 | 1 | 1.6% |
19 | 1 | 1.6% |
35 | 1 | 1.6% |
1445 | 1 | 1.6% |
1458 | 1 | 1.6% |
1745 | 1 | 1.6% |
2276 | 1 | 1.6% |
Value | Count | Frequency (%) |
4720013 | 1 | |
4624716 | 1 | |
1167172 | 1 | |
1110904 | 1 | |
211565 | 1 | |
210147 | 1 | |
203846 | 1 | |
201528 | 1 | |
181024 | 1 | |
164345 | 1 |
부가가치세(백만원)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 60 |
---|---|
Distinct (%) | 93.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2613283.8 |
Minimum | 0 |
---|---|
Maximum | 26045597 |
Zeros | 1 |
Zeros (%) | 1.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 708.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 2696.5 |
median | 221309.5 |
Q3 | 2584809 |
95-th percentile | 20232945 |
Maximum | 26045597 |
Range | 26045597 |
Interquartile range (IQR) | 2582112.5 |
Descriptive statistics
Standard deviation | 5909891.6 |
---|---|
Coefficient of variation (CV) | 2.261481 |
Kurtosis | 10.33577 |
Mean | 2613283.8 |
Median Absolute Deviation (MAD) | 221306.5 |
Skewness | 3.3204768 |
Sum | 1.6725016 × 108 |
Variance | 3.4926819 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 2 | 3.1% |
4 | 2 | 3.1% |
3 | 2 | 3.1% |
6 | 2 | 3.1% |
7244 | 1 | 1.6% |
6378 | 1 | 1.6% |
8343 | 1 | 1.6% |
9259 | 1 | 1.6% |
11152 | 1 | 1.6% |
26045597 | 1 | 1.6% |
Other values (50) | 50 |
Value | Count | Frequency (%) |
0 | 1 | |
3 | 2 | |
4 | 2 | |
5 | 1 | |
6 | 2 | |
7 | 1 | |
12 | 2 | |
13 | 1 | |
14 | 1 | |
15 | 1 |
Value | Count | Frequency (%) |
26045597 | 1 | |
25982185 | 1 | |
22985252 | 1 | |
22872310 | 1 | |
5276543 | 1 | |
5265378 | 1 | |
4324226 | 1 | |
4307729 | 1 | |
4166444 | 1 | |
4156663 | 1 |
양도소득세(백만원)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 52 |
---|---|
Distinct (%) | 81.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13354.75 |
Minimum | 0 |
---|---|
Maximum | 195308 |
Zeros | 12 |
Zeros (%) | 18.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 708.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 57.75 |
median | 4072 |
Q3 | 10260.75 |
95-th percentile | 82255.6 |
Maximum | 195308 |
Range | 195308 |
Interquartile range (IQR) | 10203 |
Descriptive statistics
Standard deviation | 36442.201 |
---|---|
Coefficient of variation (CV) | 2.728782 |
Kurtosis | 17.920123 |
Mean | 13354.75 |
Median Absolute Deviation (MAD) | 4071.5 |
Skewness | 4.2133699 |
Sum | 854704 |
Variance | 1.328034 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 12 | 18.8% |
1 | 2 | 3.1% |
195308 | 1 | 1.6% |
166 | 1 | 1.6% |
5343 | 1 | 1.6% |
9369 | 1 | 1.6% |
2033 | 1 | 1.6% |
7050 | 1 | 1.6% |
416 | 1 | 1.6% |
5201 | 1 | 1.6% |
Other values (42) | 42 |
Value | Count | Frequency (%) |
0 | 12 | |
1 | 2 | 3.1% |
2 | 1 | 1.6% |
45 | 1 | 1.6% |
62 | 1 | 1.6% |
113 | 1 | 1.6% |
152 | 1 | 1.6% |
166 | 1 | 1.6% |
241 | 1 | 1.6% |
275 | 1 | 1.6% |
Value | Count | Frequency (%) |
195308 | 1 | |
188255 | 1 | |
98949 | 1 | |
93748 | 1 | |
17132 | 1 | |
16495 | 1 | |
14471 | 1 | |
14195 | 1 | |
12829 | 1 | |
12745 | 1 |
상속_증여세(백만원)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 50 |
---|---|
Distinct (%) | 78.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5210.6562 |
Minimum | 0 |
---|---|
Maximum | 94774 |
Zeros | 15 |
Zeros (%) | 23.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 708.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 10.25 |
median | 831.5 |
Q3 | 3101.75 |
95-th percentile | 21103.25 |
Maximum | 94774 |
Range | 94774 |
Interquartile range (IQR) | 3091.5 |
Descriptive statistics
Standard deviation | 16049.73 |
---|---|
Coefficient of variation (CV) | 3.0801744 |
Kurtosis | 24.68086 |
Mean | 5210.6562 |
Median Absolute Deviation (MAD) | 831.5 |
Skewness | 4.9156729 |
Sum | 333482 |
Variance | 2.5759384 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 15 | 23.4% |
94774 | 1 | 1.6% |
53 | 1 | 1.6% |
6539 | 1 | 1.6% |
4748 | 1 | 1.6% |
5990 | 1 | 1.6% |
1449 | 1 | 1.6% |
9471 | 1 | 1.6% |
175 | 1 | 1.6% |
1331 | 1 | 1.6% |
Other values (40) | 40 |
Value | Count | Frequency (%) |
0 | 15 | |
2 | 1 | 1.6% |
13 | 1 | 1.6% |
14 | 1 | 1.6% |
24 | 1 | 1.6% |
32 | 1 | 1.6% |
53 | 1 | 1.6% |
149 | 1 | 1.6% |
151 | 1 | 1.6% |
175 | 1 | 1.6% |
Value | Count | Frequency (%) |
94774 | 1 | |
85304 | 1 | |
24487 | 1 | |
23156 | 1 | |
9471 | 1 | |
8593 | 1 | |
8462 | 1 | |
8442 | 1 | |
6539 | 1 | |
6180 | 1 |
기타(백만원)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 60 |
---|---|
Distinct (%) | 93.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 288281.12 |
Minimum | 1 |
---|---|
Maximum | 2383468 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 708.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 6427.5 |
median | 135252.5 |
Q3 | 338885.5 |
95-th percentile | 1413202.6 |
Maximum | 2383468 |
Range | 2383467 |
Interquartile range (IQR) | 332458 |
Descriptive statistics
Standard deviation | 487011.27 |
---|---|
Coefficient of variation (CV) | 1.6893623 |
Kurtosis | 9.7733562 |
Mean | 288281.12 |
Median Absolute Deviation (MAD) | 135242.5 |
Skewness | 3.0385736 |
Sum | 18449992 |
Variance | 2.3717998 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 3 | 4.7% |
6 | 2 | 3.1% |
1 | 2 | 3.1% |
2383468 | 1 | 1.6% |
20830 | 1 | 1.6% |
149048 | 1 | 1.6% |
179180 | 1 | 1.6% |
11328 | 1 | 1.6% |
13932 | 1 | 1.6% |
23186 | 1 | 1.6% |
Other values (50) | 50 |
Value | Count | Frequency (%) |
1 | 2 | |
3 | 1 | 1.6% |
5 | 3 | |
6 | 2 | |
8 | 1 | 1.6% |
12 | 1 | 1.6% |
13 | 1 | 1.6% |
26 | 1 | 1.6% |
27 | 1 | 1.6% |
48 | 1 | 1.6% |
Value | Count | Frequency (%) |
2383468 | 1 | |
2261743 | 1 | |
1735719 | 1 | |
1556451 | 1 | |
601462 | 1 | |
591118 | 1 | |
567014 | 1 | |
544400 | 1 | |
523543 | 1 | |
464840 | 1 |
구분 | 시도별 | 종합소득세(백만원) | 법인세(백만원) | 부가가치세(백만원) | 양도소득세(백만원) | 상속_증여세(백만원) | 기타(백만원) | |
---|---|---|---|---|---|---|---|---|
구분 | 1.000 | 0.000 | 0.561 | 0.000 | 0.275 | 0.000 | 0.000 | 0.622 |
시도별 | 0.000 | 1.000 | 0.707 | 0.750 | 0.780 | 0.624 | 0.750 | 0.567 |
종합소득세(백만원) | 0.561 | 0.707 | 1.000 | 0.662 | 0.745 | 0.878 | 0.662 | 0.849 |
법인세(백만원) | 0.000 | 0.750 | 0.662 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
부가가치세(백만원) | 0.275 | 0.780 | 0.745 | 1.000 | 1.000 | 0.832 | 1.000 | 0.874 |
양도소득세(백만원) | 0.000 | 0.624 | 0.878 | 1.000 | 0.832 | 1.000 | 1.000 | 1.000 |
상속_증여세(백만원) | 0.000 | 0.750 | 0.662 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
기타(백만원) | 0.622 | 0.567 | 0.849 | 1.000 | 0.874 | 1.000 | 1.000 | 1.000 |
시도별 | 구분 | |
---|---|---|
시도별 | 1.000 | 0.000 |
구분 | 0.000 | 1.000 |
종합소득세(백만원) | 법인세(백만원) | 부가가치세(백만원) | 양도소득세(백만원) | 상속_증여세(백만원) | 기타(백만원) | 구분 | 시도별 | |
---|---|---|---|---|---|---|---|---|
종합소득세(백만원) | 1.000 | 0.957 | 0.953 | 0.952 | 0.900 | 0.965 | 0.245 | 0.359 |
법인세(백만원) | 0.957 | 1.000 | 0.960 | 0.927 | 0.901 | 0.953 | 0.000 | 0.503 |
부가가치세(백만원) | 0.953 | 0.960 | 1.000 | 0.929 | 0.876 | 0.972 | 0.224 | 0.485 |
양도소득세(백만원) | 0.952 | 0.927 | 0.929 | 1.000 | 0.892 | 0.927 | 0.000 | 0.297 |
상속_증여세(백만원) | 0.900 | 0.901 | 0.876 | 0.892 | 1.000 | 0.866 | 0.000 | 0.503 |
기타(백만원) | 0.965 | 0.953 | 0.972 | 0.927 | 0.866 | 1.000 | 0.443 | 0.280 |
구분 | 0.245 | 0.000 | 0.224 | 0.000 | 0.000 | 0.443 | 1.000 | 0.000 |
시도별 | 0.359 | 0.503 | 0.485 | 0.297 | 0.503 | 0.280 | 0.000 | 1.000 |
구분 | 시도별 | 종합소득세(백만원) | 법인세(백만원) | 부가가치세(백만원) | 양도소득세(백만원) | 상속_증여세(백만원) | 기타(백만원) | |
---|---|---|---|---|---|---|---|---|
0 | 발생액 | 서울 | 890505 | 4720013 | 26045597 | 195308 | 94774 | 2383468 |
1 | 발생액 | 인천 | 194035 | 210147 | 4324226 | 12745 | 3065 | 601462 |
2 | 발생액 | 경기 | 931873 | 1167172 | 22985252 | 98949 | 24487 | 1735719 |
3 | 발생액 | 강원 | 56797 | 81360 | 795452 | 5623 | 1203 | 275970 |
4 | 발생액 | 대전 | 79434 | 85231 | 912469 | 11642 | 1265 | 248208 |
5 | 발생액 | 충북 | 65947 | 71908 | 1930144 | 14471 | 1366 | 293024 |
6 | 발생액 | 충남 세종 | 109659 | 181024 | 5276543 | 17132 | 4438 | 448081 |
7 | 발생액 | 광주 | 91911 | 107972 | 963594 | 7682 | 2167 | 277465 |
8 | 발생액 | 전북 | 64764 | 65610 | 1160761 | 6138 | 601 | 355871 |
9 | 발생액 | 전남 | 56521 | 134198 | 3416977 | 4917 | 433 | 337580 |
구분 | 시도별 | 종합소득세(백만원) | 법인세(백만원) | 부가가치세(백만원) | 양도소득세(백만원) | 상속_증여세(백만원) | 기타(백만원) | |
---|---|---|---|---|---|---|---|---|
54 | 미처리 | 충남 세종 | 65 | 0 | 12 | 0 | 0 | 26 |
55 | 미처리 | 광주 | 109 | 0 | 6 | 0 | 2 | 5 |
56 | 미처리 | 전북 | 29 | 0 | 3 | 0 | 0 | 6 |
57 | 미처리 | 전남 | 50 | 0 | 14 | 0 | 0 | 1 |
58 | 미처리 | 대구 | 19 | 0 | 3 | 45 | 0 | 3 |
59 | 미처리 | 경북 | 19 | 19 | 12 | 1 | 0 | 5 |
60 | 미처리 | 부산 | 40 | 2 | 33 | 1 | 0 | 27 |
61 | 미처리 | 울산 | 28 | 0 | 6 | 0 | 0 | 6 |
62 | 미처리 | 경남 | 37 | 0 | 7 | 0 | 0 | 13 |
63 | 미처리 | 제주 | 16 | 0 | 0 | 0 | 0 | 8 |