Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 478.5 KiB
Average record size in memory: 49.0 B
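
The average record size follows from the two figures above. A minimal sketch, assuming the report simply divides the total in-memory size by the number of observations (the function name is hypothetical):

```python
# Figures copied from the "Dataset statistics" block above.
TOTAL_SIZE_KIB = 478.5
N_OBSERVATIONS = 10_000


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Convert KiB to bytes (1 KiB = 1024 B) and divide by the row count."""
    return total_kib * 1024 / n_rows


avg_bytes = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B")
```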

Variable types

Numeric: 1
Text: 2
Categorical: 2

Dataset

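Description: Provides information on approved initial medical care claims under industrial accident compensation insurance, including occupation, accident/disease/commuting classification, and industry name (major category).
URL: https://www.data.go.kr/data/15121552/fileData.do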

Alerts

연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 14:08:07.011334
Analysis finished: 2023-12-12 14:08:07.556984
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
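
The reported duration can be checked against the two timestamps. A minimal sketch using only the standard library; the format string is an assumption about how the timestamps were written:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block; FMT is an assumed format.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 14:08:07.011334", FMT)
finished = datetime.strptime("2023-12-12 14:08:07.556984", FMT)

# Subtracting two datetimes yields a timedelta; convert it to seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```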

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50053.55
Minimum: 4
Maximum: 99992
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4
5-th percentile: 4811.95
Q1: 25171.25
median: 50121
Q3: 74819
95-th percentile: 94970.25
Maximum: 99992
Range: 99988
Interquartile range (IQR): 49647.75
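
The range and IQR are derived from the other quantiles in this block. A short sketch of that arithmetic, using the reported values as inputs:

```python
# Quantile figures copied from the "Quantile statistics" block above.
minimum = 4
q1 = 25171.25
q3 = 74819
maximum = 99992

value_range = maximum - minimum  # spread between the two extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(value_range, iqr)
```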

Descriptive statistics

Standard deviation: 28856.502
Coefficient of variation (CV): 0.5765126
Kurtosis: -1.1929484
Mean: 50053.55
Median Absolute Deviation (MAD): 24809.5
Skewness: -0.012152427
Sum: 5.005355 × 10^8
Variance: 8.3269771 × 10^8
Monotonicity: Not monotonic
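
Several of these figures are tied together by simple identities: the CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. A minimal consistency check, assuming the rounded values printed in the report as inputs:

```python
import math

# Values copied from the report; they are rounded, so comparisons use tolerances.
STD = 28856.502
MEAN = 50053.55
N = 10_000

cv = STD / MEAN       # coefficient of variation
variance = STD ** 2   # variance is the square of the standard deviation
total = MEAN * N      # sum of all values

print(f"CV={cv:.7f}  variance={variance:.7e}  sum={total:.6e}")
```

Agreement to within rounding suggests the block was extracted intact; it is not a substitute for recomputing the statistics from the raw data.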
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
75151 | 1 | < 0.1%
30738 | 1 | < 0.1%
38566 | 1 | < 0.1%
69974 | 1 | < 0.1%
41152 | 1 | < 0.1%
1080 | 1 | < 0.1%
12584 | 1 | < 0.1%
71122 | 1 | < 0.1%
30853 | 1 | < 0.1%
95998 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values
Value | Count | Frequency (%)
4 | 1 | < 0.1%
5 | 1 | < 0.1%
8 | 1 | < 0.1%
20 | 1 | < 0.1%
33 | 1 | < 0.1%
54 | 1 | < 0.1%
65 | 1 | < 0.1%
71 | 1 | < 0.1%
81 | 1 | < 0.1%
86 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
99992 | 1 | < 0.1%
99989 | 1 | < 0.1%
99985 | 1 | < 0.1%
99976 | 1 | < 0.1%
99969 | 1 | < 0.1%
99955 | 1 | < 0.1%
99952 | 1 | < 0.1%
99934 | 1 | < 0.1%
99931 | 1 | < 0.1%
99869 | 1 | < 0.1%

직종코드
Text

Distinct: 160
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 5
Median length: 3
Mean length: 3.1278
Min length: 3

Characters and Unicode

Total characters: 31278
Distinct characters: 13
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 19
Unique (%): 0.2%

Sample

1st row: 910
2nd row: 910
3rd row: 819
4th row: 892
5th row: 131

Value | Count | Frequency (%)
910 | 1407 | 14.1%
930 | 1103 | 11.0%
772 | 561 | 5.6%
441 | 497 | 5.0%
e0960 | 460 | 4.6%
941 | 379 | 3.8%
999 | 379 | 3.8%
799 | 367 | 3.7%
442 | 297 | 3.0%
922 | 224 | 2.2%
Other values (150) | 4326 | 43.3%

Most occurring characters

Value | Count | Frequency (%)
9 | 7137 | 22.8%
1 | 4480 | 14.3%
0 | 3959 | 12.7%
4 | 3353 | 10.7%
2 | 3091 | 9.9%
7 | 2813 | 9.0%
3 | 2795 | 8.9%
5 | 1181 | 3.8%
8 | 922 | 2.9%
6 | 907 | 2.9%
Other values (3) | 640 | 2.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 30638 | 98.0%
Uppercase Letter | 640 | 2.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
9 | 7137 | 23.3%
1 | 4480 | 14.6%
0 | 3959 | 12.9%
4 | 3353 | 10.9%
2 | 3091 | 10.1%
7 | 2813 | 9.2%
3 | 2795 | 9.1%
5 | 1181 | 3.9%
8 | 922 | 3.0%
6 | 907 | 3.0%

Uppercase Letter
Value | Count | Frequency (%)
E | 637 | 99.5%
A | 2 | 0.3%
S | 1 | 0.2%

Most occurring scripts

Value | Count | Frequency (%)
Common | 30638 | 98.0%
Latin | 640 | 2.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
9 | 7137 | 23.3%
1 | 4480 | 14.6%
0 | 3959 | 12.9%
4 | 3353 | 10.9%
2 | 3091 | 10.1%
7 | 2813 | 9.2%
3 | 2795 | 9.1%
5 | 1181 | 3.9%
8 | 922 | 3.0%
6 | 907 | 3.0%

Latin
Value | Count | Frequency (%)
E | 637 | 99.5%
A | 2 | 0.3%
S | 1 | 0.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 31278 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
9 | 7137 | 22.8%
1 | 4480 | 14.3%
0 | 3959 | 12.7%
4 | 3353 | 10.7%
2 | 3091 | 9.9%
7 | 2813 | 9.0%
3 | 2795 | 8.9%
5 | 1181 | 3.8%
8 | 922 | 2.9%
6 | 907 | 2.9%
Other values (3) | 640 | 2.0%

직종명
Text

Distinct: 187
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 32
Median length: 24
Mean length: 11.7846
Min length: 2

Characters and Unicode

Total characters: 117846
Distinct characters: 224
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
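
As an illustration of such character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the report's own implementation, and the helper name `category_counts` is hypothetical. The module returns two-letter codes: `Lo` is Other Letter, `Zs` is Space Separator, `Po` is Other Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values listed for this variable.
sample = "연구.교육 및 법률 관련 관리자"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Script and block membership are not exposed by `unicodedata`, so reproducing those parts of the report would need another data source.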

Unique

Unique: 26
Unique (%): 0.3%

Sample

1st row: 건설 및 광업 단순 종사원
2nd row: 건설 및 광업 단순 종사원
3rd row: 기타 식품가공관련 기계조작원
4th row: 인쇄 및 사진현상 관련 기계조작원
5th row: 연구.교육 및 법률 관련 관리자

Value | Count | Frequency (%)
 | 4090 | 12.2%
종사원 | 3470 | 10.4%
단순 | 3447 | 10.3%
종사자 | 2148 | 6.4%
건설 | 1455 | 4.4%
광업 | 1407 | 4.2%
제조관련 | 1314 | 3.9%
기타 | 1249 | 3.7%
기능 | 836 | 2.5%
관련 | 798 | 2.4%
Other values (274) | 13210 | 39.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 23424 | 19.9%
 | 7402 | 6.3%
 | 6371 | 5.4%
 | 6298 | 5.3%
 | 5037 | 4.3%
 | 4325 | 3.7%
 | 4243 | 3.6%
 | 4131 | 3.5%
 | 3470 | 2.9%
 | 3447 | 2.9%
Other values (214) | 49698 | 42.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 92235 | 78.3%
Space Separator | 23424 | 19.9%
Other Punctuation | 791 | 0.7%
Open Punctuation | 686 | 0.6%
Close Punctuation | 686 | 0.6%
Decimal Number | 24 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 7402 | 8.0%
 | 6371 | 6.9%
 | 6298 | 6.8%
 | 5037 | 5.5%
 | 4325 | 4.7%
 | 4243 | 4.6%
 | 4131 | 4.5%
 | 3470 | 3.8%
 | 3447 | 3.7%
 | 3191 | 3.5%
Other values (202) | 44320 | 48.1%

Decimal Number
Value | Count | Frequency (%)
0 | 9 | 37.5%
2 | 9 | 37.5%
3 | 3 | 12.5%
1 | 3 | 12.5%

Other Punctuation
Value | Count | Frequency (%)
. | 523 | 66.1%
· | 226 | 28.6%
, | 42 | 5.3%

Open Punctuation
Value | Count | Frequency (%)
( | 662 | 96.5%
[ | 24 | 3.5%

Close Punctuation
Value | Count | Frequency (%)
) | 662 | 96.5%
] | 24 | 3.5%

Space Separator
Value | Count | Frequency (%)
(space) | 23424 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 92235 | 78.3%
Common | 25611 | 21.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 7402 | 8.0%
 | 6371 | 6.9%
 | 6298 | 6.8%
 | 5037 | 5.5%
 | 4325 | 4.7%
 | 4243 | 4.6%
 | 4131 | 4.5%
 | 3470 | 3.8%
 | 3447 | 3.7%
 | 3191 | 3.5%
Other values (202) | 44320 | 48.1%

Common
Value | Count | Frequency (%)
(space) | 23424 | 91.5%
( | 662 | 2.6%
) | 662 | 2.6%
. | 523 | 2.0%
· | 226 | 0.9%
, | 42 | 0.2%
[ | 24 | 0.1%
] | 24 | 0.1%
0 | 9 | < 0.1%
2 | 9 | < 0.1%
Other values (2) | 6 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 92235 | 78.3%
ASCII | 25385 | 21.5%
None | 226 | 0.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 23424 | 92.3%
( | 662 | 2.6%
) | 662 | 2.6%
. | 523 | 2.1%
, | 42 | 0.2%
[ | 24 | 0.1%
] | 24 | 0.1%
0 | 9 | < 0.1%
2 | 9 | < 0.1%
3 | 3 | < 0.1%

Hangul
Value | Count | Frequency (%)
 | 7402 | 8.0%
 | 6371 | 6.9%
 | 6298 | 6.8%
 | 5037 | 5.5%
 | 4325 | 4.7%
 | 4243 | 4.6%
 | 4131 | 4.5%
 | 3470 | 3.8%
 | 3447 | 3.7%
 | 3191 | 3.5%
Other values (202) | 44320 | 48.1%

None
Value | Count | Frequency (%)
· | 226 | 100.0%

재해발생형태
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

사고: 8037
질병: 1318
출퇴근: 645

Length

Max length: 3
Median length: 2
Mean length: 2.0645
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사고
2nd row: 사고
3rd row: 사고
4th row: 사고
5th row: 사고

Common Values

Value | Count | Frequency (%)
사고 | 8037 | 80.4%
질병 | 1318 | 13.2%
출퇴근 | 645 | 6.5%
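
The counts in this table also account for the mean length of 2.0645 reported for this variable. A minimal sketch, assuming length is measured in characters and each category is weighted by its count:

```python
# Counts copied from the "Common Values" table above.
category_counts = {"사고": 8037, "질병": 1318, "출퇴근": 645}

total = sum(category_counts.values())

# Share of each category as a percentage of all observations.
shares = {value: 100 * n / total for value, n in category_counts.items()}

# Count-weighted mean of the category label lengths, in characters.
mean_length = sum(len(value) * n for value, n in category_counts.items()) / total

for value, share in shares.items():
    print(f"{value}: {share:.1f}%")
print(f"mean length: {mean_length:.4f}")
```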

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사고 | 8037 | 80.4%
질병 | 1318 | 13.2%
출퇴근 | 645 | 6.5%

산재업종(대분류)
Categorical

Distinct: 10
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

기타의사업: 4005
제조업: 2379
건설업: 2249
운수·창고및통신업: 940
광업: 246
Other values (5): 181

Length

Max length: 11
Median length: 9
Mean length: 4.3516
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건설업
2nd row: 건설업
3rd row: 기타의사업
4th row: 제조업
5th row: 기타의사업

Common Values

Value | Count | Frequency (%)
기타의사업 | 4005 | 40.1%
제조업 | 2379 | 23.8%
건설업 | 2249 | 22.5%
운수·창고및통신업 | 940 | 9.4%
광업 | 246 | 2.5%
임업 | 72 | 0.7%
금융및보험업 | 53 | 0.5%
농업 | 42 | 0.4%
전기·가스및상수도사업 | 9 | 0.1%
어업 | 5 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
기타의사업 | 4005 | 40.1%
제조업 | 2379 | 23.8%
건설업 | 2249 | 22.5%
운수·창고및통신업 | 940 | 9.4%
광업 | 246 | 2.5%
임업 | 72 | 0.7%
금융및보험업 | 53 | 0.5%
농업 | 42 | 0.4%
전기·가스및상수도사업 | 9 | 0.1%
어업 | 5 | < 0.1%

Interactions


Correlations

 | 연번 | 재해발생형태 | 산재업종(대분류)
연번 | 1.000 | 0.045 | 0.078
재해발생형태 | 0.045 | 1.000 | 0.457
산재업종(대분류) | 0.078 | 0.457 | 1.000

 | 산재업종(대분류) | 재해발생형태
산재업종(대분류) | 1.000 | 0.309
재해발생형태 | 0.309 | 1.000

 | 연번 | 재해발생형태 | 산재업종(대분류)
연번 | 1.000 | 0.026 | 0.024
재해발생형태 | 0.026 | 1.000 | 0.309
산재업종(대분류) | 0.024 | 0.309 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 연번 | 직종코드 | 직종명 | 재해발생형태 | 산재업종(대분류)
75150 | 75151 | 910 | 건설 및 광업 단순 종사원 | 사고 | 건설업
85765 | 85766 | 910 | 건설 및 광업 단순 종사원 | 사고 | 건설업
47152 | 47153 | 819 | 기타 식품가공관련 기계조작원 | 사고 | 기타의사업
37524 | 37525 | 892 | 인쇄 및 사진현상 관련 기계조작원 | 사고 | 제조업
77746 | 77747 | 131 | 연구.교육 및 법률 관련 관리자 | 사고 | 기타의사업
27862 | 27863 | 799 | 기타 기능관련 종사자 | 사고 | 제조업
28227 | 28228 | 910 | 건설 및 광업 단순 종사원 | 사고 | 건설업
9256 | 9257 | 910 | 건설 및 광업 단순 종사원 | 사고 | 건설업
91402 | 91403 | 773 | 건축마감관련 기능 종사자 | 사고 | 건설업
22013 | 22014 | 620 | 임업관련 종사자 | 사고 | 임업

 | 연번 | 직종코드 | 직종명 | 재해발생형태 | 산재업종(대분류)
89041 | 89042 | 431 | 운송 서비스 종사자 | 사고 | 운수·창고및통신업
95498 | 95499 | 441 | 주방장 및 조리사 | 사고 | 기타의사업
17223 | 17224 | 773 | 건축마감관련 기능 종사자 | 사고 | 건설업
80864 | 80865 | 149 | 기타 건설.전기 및 생산 관련 관리자 | 사고 | 제조업
218 | 219 | 941 | 청소원 및 환경 미화원 | 출퇴근 | 기타의사업
72805 | 72806 | 930 | 제조관련 단순 종사원 | 사고 | 제조업
89415 | 89416 | 772 | 건설관련 기능 종사자 | 사고 | 건설업
19353 | 19354 | 521 | 매장 판매 종사자 | 사고 | 기타의사업
23143 | 23144 | 612 | 원예 및 조경 종사자 | 사고 | 건설업
64634 | 64635 | 730 | 목재.가구.악기 및 간판 관련 기능 종사자 | 사고 | 건설업