Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells1212
Missing cells (%)1.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Numeric3
Categorical2
DateTime2

Dataset

Description고용노동부에서 제공하는 순번, 징수유형, 징수결의일자, 총부과금액, 수납일자, 수납금액, 징수여부 순으로 나열된 과태료 징수부과 내역 CSV파일 데이터자료입니다.
Author고용노동부
URLhttps://www.data.go.kr/data/15071639/fileData.do

Alerts

총부과금액 is highly overall correlated with 수납금액High correlation
수납금액 is highly overall correlated with 총부과금액High correlation
수납일자 has 1212 (12.1%) missing valuesMissing
총부과금액 is highly skewed (γ1 = 89.48966673)Skewed
순번 has unique valuesUnique
수납금액 has 1212 (12.1%) zerosZeros

Reproduction

Analysis started2023-12-12 16:04:21.349145
Analysis finished2023-12-12 16:04:23.007583
Duration1.66 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50119.652
Minimum8
Maximum99997
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:04:23.077283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile4771.7
Q125123
median50753.5
Q374968.25
95-th percentile94713.45
Maximum99997
Range99989
Interquartile range (IQR)49845.25

Descriptive statistics

Standard deviation28870.434
Coefficient of variation (CV)0.57603022
Kurtosis-1.1969063
Mean50119.652
Median Absolute Deviation (MAD)24946.5
Skewness-0.017539772
Sum5.0119652 × 108
Variance8.3350196 × 108
MonotonicityNot monotonic
2023-12-13T01:04:23.225481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
98550 1
 
< 0.1%
56661 1
 
< 0.1%
49742 1
 
< 0.1%
45314 1
 
< 0.1%
36113 1
 
< 0.1%
88146 1
 
< 0.1%
84397 1
 
< 0.1%
3588 1
 
< 0.1%
89437 1
 
< 0.1%
16685 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
8 1
< 0.1%
48 1
< 0.1%
53 1
< 0.1%
60 1
< 0.1%
91 1
< 0.1%
106 1
< 0.1%
114 1
< 0.1%
124 1
< 0.1%
126 1
< 0.1%
136 1
< 0.1%
ValueCountFrequency (%)
99997 1
< 0.1%
99977 1
< 0.1%
99969 1
< 0.1%
99962 1
< 0.1%
99956 1
< 0.1%
99930 1
< 0.1%
99920 1
< 0.1%
99877 1
< 0.1%
99875 1
< 0.1%
99870 1
< 0.1%

징수유형
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
자진납부 징수요청
7375 
정식납부(가산금 포함) 징수요청
2456 
<NA>
 
169

Length

Max length17
Median length9
Mean length10.8803
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자진납부 징수요청
2nd row자진납부 징수요청
3rd row자진납부 징수요청
4th row자진납부 징수요청
5th row정식납부(가산금 포함) 징수요청

Common Values

ValueCountFrequency (%)
자진납부 징수요청 7375
73.8%
정식납부(가산금 포함) 징수요청 2456
 
24.6%
<NA> 169
 
1.7%

Length

2023-12-13T01:04:23.391268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:04:23.495998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
징수요청 9831
44.1%
자진납부 7375
33.1%
정식납부(가산금 2456
 
11.0%
포함 2456
 
11.0%
na 169
 
0.8%
Distinct284
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-03 00:00:00
Maximum2022-12-19 00:00:00
2023-12-13T01:04:23.623333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:23.780734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

총부과금액
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct419
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean559860.94
Minimum0
Maximum7.856 × 108
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:04:23.998787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile12000
Q124000
median40000
Q380000
95-th percentile2640000
Maximum7.856 × 108
Range7.856 × 108
Interquartile range (IQR)56000

Descriptive statistics

Standard deviation8157428.8
Coefficient of variation (CV)14.570455
Kurtosis8582.8939
Mean559860.94
Median Absolute Deviation (MAD)20000
Skewness89.489667
Sum5.5986094 × 109
Variance6.6543645 × 1013
MonotonicityNot monotonic
2023-12-13T01:04:24.189953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
24000 1875
18.8%
12000 1368
13.7%
40000 1321
13.2%
30000 676
 
6.8%
48000 528
 
5.3%
15000 473
 
4.7%
50000 425
 
4.2%
60000 226
 
2.3%
800000 196
 
2.0%
72000 188
 
1.9%
Other values (409) 2724
27.2%
ValueCountFrequency (%)
0 2
 
< 0.1%
9600 1
 
< 0.1%
12000 1368
13.7%
15000 473
 
4.7%
20000 11
 
0.1%
24000 1875
18.8%
25000 6
 
0.1%
26250 2
 
< 0.1%
26550 3
 
< 0.1%
26850 1
 
< 0.1%
ValueCountFrequency (%)
785600000 1
< 0.1%
87608000 1
< 0.1%
68000000 1
< 0.1%
60105760 1
< 0.1%
56000000 1
< 0.1%
38120000 1
< 0.1%
36000000 1
< 0.1%
34800000 1
< 0.1%
32127890 1
< 0.1%
28000000 2
< 0.1%

수납일자
Date

MISSING 

Distinct424
Distinct (%)4.8%
Missing1212
Missing (%)12.1%
Memory size156.2 KiB
Minimum2020-11-06 00:00:00
Maximum2023-10-05 00:00:00
2023-12-13T01:04:24.369010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:24.556188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수납금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct522
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean428958.48
Minimum0
Maximum87608000
Zeros1212
Zeros (%)12.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:04:24.726291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q112000
median30900
Q364000
95-th percentile2400000
Maximum87608000
Range87608000
Interquartile range (IQR)52000

Descriptive statistics

Standard deviation2137003.2
Coefficient of variation (CV)4.9818416
Kurtosis529.34637
Mean428958.48
Median Absolute Deviation (MAD)18900
Skewness18.179443
Sum4.2895848 × 109
Variance4.5667826 × 1012
MonotonicityNot monotonic
2023-12-13T01:04:24.900703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
24000 1824
18.2%
12000 1334
13.3%
40000 1294
12.9%
0 1212
12.1%
48000 515
 
5.1%
30000 303
 
3.0%
15000 195
 
1.9%
72000 183
 
1.8%
800000 183
 
1.8%
50000 169
 
1.7%
Other values (512) 2788
27.9%
ValueCountFrequency (%)
0 1212
12.1%
9600 1
 
< 0.1%
10000 1
 
< 0.1%
12000 1334
13.3%
15000 195
 
1.9%
15450 66
 
0.7%
15630 10
 
0.1%
15810 7
 
0.1%
15990 1
 
< 0.1%
16170 2
 
< 0.1%
ValueCountFrequency (%)
87608000 1
< 0.1%
68000000 1
< 0.1%
60105760 1
< 0.1%
56000000 1
< 0.1%
38120000 1
< 0.1%
36000000 1
< 0.1%
34800000 1
< 0.1%
32127890 1
< 0.1%
28000000 1
< 0.1%
26985580 1
< 0.1%

징수여부
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
징수완료
8784 
징수진행
1216 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row징수완료
2nd row징수완료
3rd row징수완료
4th row징수완료
5th row징수완료

Common Values

ValueCountFrequency (%)
징수완료 8784
87.8%
징수진행 1216
 
12.2%

Length

2023-12-13T01:04:25.042617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:04:25.158326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
징수완료 8784
87.8%
징수진행 1216
 
12.2%

Interactions

2023-12-13T01:04:22.480392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:21.871488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:22.175088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:22.587804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:21.969483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:22.274970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:22.703315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:22.089101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:04:22.383317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:04:25.237857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번징수유형총부과금액수납금액징수여부
순번1.0000.1150.0000.0350.144
징수유형0.1151.0000.0070.0470.698
총부과금액0.0000.0071.0000.7880.014
수납금액0.0350.0470.7881.0000.023
징수여부0.1440.6980.0140.0231.000
2023-12-13T01:04:25.340826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
징수여부징수유형
징수여부1.0000.492
징수유형0.4921.000
2023-12-13T01:04:25.517514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번총부과금액수납금액징수유형징수여부
순번1.0000.016-0.0080.0880.110
총부과금액0.0161.0000.7140.0120.023
수납금액-0.0080.7141.0000.0350.017
징수유형0.0880.0120.0351.0000.492
징수여부0.1100.0230.0170.4921.000

Missing values

2023-12-13T01:04:22.829509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:04:22.958444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번징수유형징수결의일자총부과금액수납일자수납금액징수여부
9854998550자진납부 징수요청2022-12-14480002023-01-1048000징수완료
6146761468자진납부 징수요청2022-08-24240002022-09-0624000징수완료
7791277913자진납부 징수요청2022-10-1964000002022-11-046400000징수완료
6780667807자진납부 징수요청2022-09-19480002022-10-1948000징수완료
7842578426정식납부(가산금 포함) 징수요청2022-10-21900002022-11-0990000징수완료
7841778418정식납부(가산금 포함) 징수요청2022-10-21300002022-12-2030000징수완료
9818698187자진납부 징수요청2022-12-13240002023-01-2624000징수완료
1455414555자진납부 징수요청2022-03-07400002022-03-2340000징수완료
6027860279정식납부(가산금 포함) 징수요청2022-08-1950000<NA>0징수진행
4754847549정식납부(가산금 포함) 징수요청2022-07-04500002022-07-0850000징수완료
순번징수유형징수결의일자총부과금액수납일자수납금액징수여부
42134214자진납부 징수요청2022-01-18400002022-01-2740000징수완료
20502051자진납부 징수요청2022-01-11240002022-01-2424000징수완료
5012350124자진납부 징수요청2022-07-12240002022-08-1124000징수완료
23872388정식납부(가산금 포함) 징수요청2022-01-12600002022-01-1363960징수완료
4398943990정식납부(가산금 포함) 징수요청2022-06-21300002022-08-0430000징수완료
2273122732<NA>2022-04-072100000<NA>0징수진행
3684836849<NA>2022-05-274725002022-05-27796500징수완료
8195981960자진납부 징수요청2022-10-28480002022-11-1148000징수완료
7936879369자진납부 징수요청2022-10-24400002022-11-1040000징수완료
5960자진납부 징수요청2022-01-0320000002022-01-072000000징수완료