Overview

Dataset statistics

Number of variables5
Number of observations44
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory47.0 B

Variable types

Categorical2
Numeric3

Dataset

Description규모별 산업재해 신청건수 및 승인건수, 승인율 현황 데이터입니다.(ex 5인 미만, 5~30인 미만 등) 2018년부터 2022년까지 데이터
URLhttps://www.data.go.kr/data/15095101/fileData.do

Alerts

신청 is highly overall correlated with 승인 and 2 other fieldsHigh correlation
승인 is highly overall correlated with 신청 and 2 other fieldsHigh correlation
승인율(퍼센트) is highly overall correlated with 신청 and 1 other fieldsHigh correlation
사업장 규모 is highly overall correlated with 신청 and 1 other fieldsHigh correlation
신청 has 2 (4.5%) zerosZeros
승인 has 2 (4.5%) zerosZeros
승인율(퍼센트) has 2 (4.5%) zerosZeros

Reproduction

Analysis started2023-12-12 18:05:40.043582
Analysis finished2023-12-12 18:05:41.560886
Duration1.52 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

Distinct5
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Memory size484.0 B
2018
2019
2020
2021
2022

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 9
20.5%
2019 9
20.5%
2020 9
20.5%
2021 9
20.5%
2022 8
18.2%

Length

2023-12-13T03:05:41.632951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:05:41.777805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 9
20.5%
2019 9
20.5%
2020 9
20.5%
2021 9
20.5%
2022 8
18.2%

사업장 규모
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size484.0 B
5인 미만
5~30인 미만
30~50인 미만
50~100인 미만
100~300인 미만
Other values (4)
19 

Length

Max length12
Median length10
Mean length8.8636364
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5인 미만
2nd row5~30인 미만
3rd row30~50인 미만
4th row50~100인 미만
5th row100~300인 미만

Common Values

ValueCountFrequency (%)
5인 미만 5
11.4%
5~30인 미만 5
11.4%
30~50인 미만 5
11.4%
50~100인 미만 5
11.4%
100~300인 미만 5
11.4%
300~500인 미만 5
11.4%
500~1000인 미만 5
11.4%
1000인 이상 5
11.4%
분류 안됨 4
9.1%

Length

2023-12-13T03:05:41.939826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:05:42.111465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 35
39.8%
5인 5
 
5.7%
5~30인 5
 
5.7%
30~50인 5
 
5.7%
50~100인 5
 
5.7%
100~300인 5
 
5.7%
300~500인 5
 
5.7%
500~1000인 5
 
5.7%
1000인 5
 
5.7%
이상 5
 
5.7%
Other values (2) 8
 
9.1%

신청
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct43
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14913.295
Minimum0
Maximum40392
Zeros2
Zeros (%)4.5%
Negative0
Negative (%)0.0%
Memory size528.0 B
2023-12-13T03:05:42.292706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile237.3
Q15814.25
median9871
Q321539.25
95-th percentile39223.05
Maximum40392
Range40392
Interquartile range (IQR)15725

Descriptive statistics

Standard deviation12888.765
Coefficient of variation (CV)0.8642466
Kurtosis-0.52981573
Mean14913.295
Median Absolute Deviation (MAD)4688
Skewness0.97057947
Sum656185
Variance1.6612026 × 108
MonotonicityNot monotonic
2023-12-13T03:05:42.466744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
0 2
 
4.5%
30838 1
 
2.3%
37478 1
 
2.3%
5884 1
 
2.3%
14830 1
 
2.3%
36204 1
 
2.3%
40392 1
 
2.3%
10321 1
 
2.3%
10758 1
 
2.3%
12252 1
 
2.3%
Other values (33) 33
75.0%
ValueCountFrequency (%)
0 2
4.5%
120 1
2.3%
902 1
2.3%
3464 1
2.3%
4144 1
2.3%
4329 1
2.3%
4330 1
2.3%
4833 1
2.3%
5454 1
2.3%
5605 1
2.3%
ValueCountFrequency (%)
40392 1
2.3%
40351 1
2.3%
39531 1
2.3%
37478 1
2.3%
37384 1
2.3%
36795 1
2.3%
36204 1
2.3%
33885 1
2.3%
33717 1
2.3%
30838 1
2.3%

승인
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct43
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13539.705
Minimum0
Maximum37346
Zeros2
Zeros (%)4.5%
Negative0
Negative (%)0.0%
Memory size528.0 B
2023-12-13T03:05:42.655423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile49.25
Q14902
median8802
Q319198.75
95-th percentile36607.05
Maximum37346
Range37346
Interquartile range (IQR)14296.75

Descriptive statistics

Standard deviation12074.896
Coefficient of variation (CV)0.89181384
Kurtosis-0.5121331
Mean13539.705
Median Absolute Deviation (MAD)4273.5
Skewness0.9892711
Sum595747
Variance1.4580311 × 108
MonotonicityNot monotonic
2023-12-13T03:05:42.826936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
0 2
 
4.5%
29005 1
 
2.3%
35168 1
 
2.3%
4997 1
 
2.3%
13136 1
 
2.3%
33480 1
 
2.3%
37346 1
 
2.3%
9404 1
 
2.3%
9763 1
 
2.3%
10714 1
 
2.3%
Other values (33) 33
75.0%
ValueCountFrequency (%)
0 2
4.5%
32 1
2.3%
147 1
2.3%
2963 1
2.3%
3442 1
2.3%
3448 1
2.3%
3692 1
2.3%
4132 1
2.3%
4589 1
2.3%
4617 1
2.3%
ValueCountFrequency (%)
37346 1
2.3%
37069 1
2.3%
36861 1
2.3%
35168 1
2.3%
34755 1
2.3%
33921 1
2.3%
33480 1
2.3%
31336 1
2.3%
31279 1
2.3%
29005 1
2.3%

승인율(퍼센트)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct39
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81.888636
Minimum0
Maximum94.1
Zeros2
Zeros (%)4.5%
Negative0
Negative (%)0.0%
Memory size528.0 B
2023-12-13T03:05:42.987944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile17.86
Q185.2
median89.2
Q391.95
95-th percentile93.17
Maximum94.1
Range94.1
Interquartile range (IQR)6.75

Descriptive statistics

Standard deviation23.284691
Coefficient of variation (CV)0.28434582
Kurtosis7.5091166
Mean81.888636
Median Absolute Deviation (MAD)3.3
Skewness-2.9413861
Sum3603.1
Variance542.17684
MonotonicityNot monotonic
2023-12-13T03:05:43.140106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
92.5 4
 
9.1%
85.5 2
 
4.5%
0.0 2
 
4.5%
94.1 1
 
2.3%
89.1 1
 
2.3%
84.9 1
 
2.3%
88.6 1
 
2.3%
91.1 1
 
2.3%
90.8 1
 
2.3%
87.4 1
 
2.3%
Other values (29) 29
65.9%
ValueCountFrequency (%)
0.0 2
4.5%
16.3 1
2.3%
26.7 1
2.3%
79.6 1
2.3%
82.4 1
2.3%
82.7 1
2.3%
83.0 1
2.3%
83.1 1
2.3%
84.1 1
2.3%
84.9 1
2.3%
ValueCountFrequency (%)
94.1 1
 
2.3%
93.8 1
 
2.3%
93.2 1
 
2.3%
93.0 1
 
2.3%
92.8 1
 
2.3%
92.5 4
9.1%
92.2 1
 
2.3%
92.1 1
 
2.3%
91.9 1
 
2.3%
91.6 1
 
2.3%

Interactions

2023-12-13T03:05:41.060339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:40.257516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:40.694362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:41.159477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:40.450242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:40.825824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:41.262035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:40.570724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:05:40.950956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:05:43.243352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도사업장 규모신청승인승인율(퍼센트)
연도1.0000.0000.0000.0000.000
사업장 규모0.0001.0000.9490.9340.716
신청0.0000.9491.0000.9980.650
승인0.0000.9340.9981.0000.419
승인율(퍼센트)0.0000.7160.6500.4191.000
2023-12-13T03:05:43.348754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장 규모연도
사업장 규모1.0000.000
연도0.0001.000
2023-12-13T03:05:43.451416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청승인승인율(퍼센트)연도사업장 규모
신청1.0001.0000.7970.0000.630
승인1.0001.0000.7990.0000.584
승인율(퍼센트)0.7970.7991.0000.0000.488
연도0.0000.0000.0001.0000.000
사업장 규모0.6300.5840.4880.0001.000

Missing values

2023-12-13T03:05:41.390136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:05:41.515671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도사업장 규모신청승인승인율(퍼센트)
020185인 미만308382900594.1
120185~30인 미만374783516893.8
2201830~50인 미만8625798092.5
3201850~100인 미만8734804292.1
42018100~300인 미만9755875089.7
52018300~500인 미만3464296385.5
62018500~1000인 미만4330344879.6
720181000인 이상10561939889.0
82018분류 안됨90214716.3
920195인 미만337173127992.8
연도사업장 규모신청승인승인율(퍼센트)
3420211000인 이상205161827189.1
352021분류 안됨000.0
3620225인 미만367953392192.2
3720225~30인 미만403513706991.9
38202230~50인 미만110161002991.0
39202250~100인 미만112461005889.4
402022100~300인 미만134131173487.5
412022300~500인 미만5454458984.1
422022500~1000인 미만7978660182.7
4320221000인 이상246092198289.3