Overview

Dataset statistics

Number of variables4
Number of observations124
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory33.1 B

Variable types

Categorical2
Text2

Dataset

Description산업재해 발생형태 코드 목록입니다.(구분코드, 구분명칭, 발생형태코드, 발생형태명(떨어짐, 끼임, 깔림, 무너짐, 폭발파열 등 ))
URLhttps://www.data.go.kr/data/15049607/fileData.do

Alerts

구분명칭 is highly overall correlated with 구분코드High correlation
구분코드 is highly overall correlated with 구분명칭High correlation
발생형태코드 has unique valuesUnique
발생형태명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:56:55.759869
Analysis finished2023-12-12 18:56:56.416477
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분코드
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)33.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
1
6
 
7
7
 
7
4
 
7
41
 
6
Other values (37)
88 

Length

Max length2
Median length2
Mean length1.5483871
Min length1

Unique

Unique22 ?
Unique (%)17.7%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9
 
7.3%
6 7
 
5.6%
7 7
 
5.6%
4 7
 
5.6%
41 6
 
4.8%
52 6
 
4.8%
8 6
 
4.8%
2 6
 
4.8%
54 5
 
4.0%
9 5
 
4.0%
Other values (32) 60
48.4%

Length

2023-12-13T03:56:56.552099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 9
 
7.3%
7 7
 
5.6%
4 7
 
5.6%
6 7
 
5.6%
41 6
 
4.8%
52 6
 
4.8%
8 6
 
4.8%
2 6
 
4.8%
54 5
 
4.0%
9 5
 
4.0%
Other values (32) 60
48.4%

구분명칭
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)33.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
떨어짐(높이가 있는 곳에서 사람이 떨어짐)
무너짐(건축물이나 쌓여진 물체가 무너짐)
 
7
끼임(기계설비에 끼이거나 감김)
 
7
부딪힘(물체에 부딪힘)
 
7
체육행사 등의 사고
 
6
Other values (37)
88 

Length

Max length23
Median length17
Mean length10.435484
Min length2

Unique

Unique22 ?
Unique (%)17.7%

Sample

1st row떨어짐(높이가 있는 곳에서 사람이 떨어짐)
2nd row떨어짐(높이가 있는 곳에서 사람이 떨어짐)
3rd row떨어짐(높이가 있는 곳에서 사람이 떨어짐)
4th row떨어짐(높이가 있는 곳에서 사람이 떨어짐)
5th row떨어짐(높이가 있는 곳에서 사람이 떨어짐)

Common Values

ValueCountFrequency (%)
떨어짐(높이가 있는 곳에서 사람이 떨어짐) 9
 
7.3%
무너짐(건축물이나 쌓여진 물체가 무너짐) 7
 
5.6%
끼임(기계설비에 끼이거나 감김) 7
 
5.6%
부딪힘(물체에 부딪힘) 7
 
5.6%
체육행사 등의 사고 6
 
4.8%
유기화합물 6
 
4.8%
절단·베임·찔림 6
 
4.8%
넘어짐(사람이 미끄러지거나 넘어짐) 6
 
4.8%
금속류 5
 
4.0%
감전 5
 
4.0%
Other values (32) 60
48.4%

Length

2023-12-13T03:56:56.789730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
떨어짐(높이가 9
 
3.4%
사람이 9
 
3.4%
떨어짐 9
 
3.4%
있는 9
 
3.4%
곳에서 9
 
3.4%
무너짐(건축물이나 7
 
2.6%
쌓여진 7
 
2.6%
물체가 7
 
2.6%
무너짐 7
 
2.6%
끼임(기계설비에 7
 
2.6%
Other values (55) 185
69.8%

발생형태코드
Text

UNIQUE 

Distinct124
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T03:56:57.369204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.5322581
Min length1

Characters and Unicode

Total characters438
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique124 ?
Unique (%)100.0%

Sample

1st row101
2nd row102
3rd row103
4th row104
5th row105
ValueCountFrequency (%)
101 1
 
0.8%
4103 1
 
0.8%
5102 1
 
0.8%
5101 1
 
0.8%
4909 1
 
0.8%
4309 1
 
0.8%
4303 1
 
0.8%
4302 1
 
0.8%
4301 1
 
0.8%
4209 1
 
0.8%
Other values (114) 114
91.9%
2023-12-13T03:56:58.179036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 128
29.2%
1 77
17.6%
4 44
 
10.0%
2 44
 
10.0%
3 38
 
8.7%
5 33
 
7.5%
9 31
 
7.1%
6 19
 
4.3%
8 13
 
3.0%
7 10
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 437
99.8%
Uppercase Letter 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 128
29.3%
1 77
17.6%
4 44
 
10.1%
2 44
 
10.1%
3 38
 
8.7%
5 33
 
7.6%
9 31
 
7.1%
6 19
 
4.3%
8 13
 
3.0%
7 10
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
Z 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 437
99.8%
Latin 1
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 128
29.3%
1 77
17.6%
4 44
 
10.1%
2 44
 
10.1%
3 38
 
8.7%
5 33
 
7.6%
9 31
 
7.1%
6 19
 
4.3%
8 13
 
3.0%
7 10
 
2.3%
Latin
ValueCountFrequency (%)
Z 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 438
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 128
29.2%
1 77
17.6%
4 44
 
10.0%
2 44
 
10.0%
3 38
 
8.7%
5 33
 
7.5%
9 31
 
7.1%
6 19
 
4.3%
8 13
 
3.0%
7 10
 
2.3%

발생형태명
Text

UNIQUE 

Distinct124
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T03:56:58.893549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length15
Mean length9.4516129
Min length2

Characters and Unicode

Total characters1172
Distinct characters215
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique124 ?
Unique (%)100.0%

Sample

1st row상세정보부족 떨어짐
2nd row계단, 사다리에서 떨어짐
3rd row개구부 등 지면에서 떨어짐
4th row재료더미 및 적재물에서 떨어짐
5th row지붕에서 떨어짐
ValueCountFrequency (%)
기타 18
 
6.3%
상세정보부족 15
 
5.3%
떨어짐 9
 
3.2%
부딪힘 7
 
2.5%
무너짐 7
 
2.5%
물체에 6
 
2.1%
넘어짐 6
 
2.1%
6
 
2.1%
의한 5
 
1.8%
동물상해 4
 
1.4%
Other values (154) 201
70.8%
2023-12-13T03:56:59.731602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
 
13.7%
32
 
2.7%
32
 
2.7%
29
 
2.5%
28
 
2.4%
25
 
2.1%
22
 
1.9%
22
 
1.9%
· 20
 
1.7%
20
 
1.7%
Other values (205) 782
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 972
82.9%
Space Separator 160
 
13.7%
Other Punctuation 28
 
2.4%
Close Punctuation 6
 
0.5%
Open Punctuation 6
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
3.3%
32
 
3.3%
29
 
3.0%
28
 
2.9%
25
 
2.6%
22
 
2.3%
22
 
2.3%
20
 
2.1%
19
 
2.0%
18
 
1.9%
Other values (199) 725
74.6%
Other Punctuation
ValueCountFrequency (%)
· 20
71.4%
, 7
 
25.0%
. 1
 
3.6%
Space Separator
ValueCountFrequency (%)
160
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 972
82.9%
Common 200
 
17.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
3.3%
32
 
3.3%
29
 
3.0%
28
 
2.9%
25
 
2.6%
22
 
2.3%
22
 
2.3%
20
 
2.1%
19
 
2.0%
18
 
1.9%
Other values (199) 725
74.6%
Common
ValueCountFrequency (%)
160
80.0%
· 20
 
10.0%
, 7
 
3.5%
) 6
 
3.0%
( 6
 
3.0%
. 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 972
82.9%
ASCII 180
 
15.4%
None 20
 
1.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
88.9%
, 7
 
3.9%
) 6
 
3.3%
( 6
 
3.3%
. 1
 
0.6%
Hangul
ValueCountFrequency (%)
32
 
3.3%
32
 
3.3%
29
 
3.0%
28
 
2.9%
25
 
2.6%
22
 
2.3%
22
 
2.3%
20
 
2.1%
19
 
2.0%
18
 
1.9%
Other values (199) 725
74.6%
None
ValueCountFrequency (%)
· 20
100.0%

Correlations

2023-12-13T03:56:59.894689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분코드구분명칭
구분코드1.0001.000
구분명칭1.0001.000
2023-12-13T03:57:00.047126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분명칭구분코드
구분명칭1.0001.000
구분코드1.0001.000
2023-12-13T03:57:00.203642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분코드구분명칭
구분코드1.0001.000
구분명칭1.0001.000

Missing values

2023-12-13T03:56:56.194598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:56:56.357168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분코드구분명칭발생형태코드발생형태명
01떨어짐(높이가 있는 곳에서 사람이 떨어짐)101상세정보부족 떨어짐
11떨어짐(높이가 있는 곳에서 사람이 떨어짐)102계단, 사다리에서 떨어짐
21떨어짐(높이가 있는 곳에서 사람이 떨어짐)103개구부 등 지면에서 떨어짐
31떨어짐(높이가 있는 곳에서 사람이 떨어짐)104재료더미 및 적재물에서 떨어짐
41떨어짐(높이가 있는 곳에서 사람이 떨어짐)105지붕에서 떨어짐
51떨어짐(높이가 있는 곳에서 사람이 떨어짐)106비계 등 가설구조물에서 떨어짐
61떨어짐(높이가 있는 곳에서 사람이 떨어짐)107건물 대들보나 철골 등 기타 구조물에서 떨어짐
71떨어짐(높이가 있는 곳에서 사람이 떨어짐)108운송수단 또는 기계 등 설비에서 떨어짐
81떨어짐(높이가 있는 곳에서 사람이 떨어짐)109기타 떨어짐
92넘어짐(사람이 미끄러지거나 넘어짐)201상세정보부족 넘어짐
구분코드구분명칭발생형태코드발생형태명
11482뇌혈관질환8201뇌혈관질환
11583심혈관질환8301심장질환
11686요통8601비사고성·작업관련성요통
11786요통8602사고성요통
11887수근관증후군8701수근관증후군
11989기타 근골격계질환8901신체에 과도한 부담을 주는 작업
12091간질환9101간질환
12192스트레스성질환9201정신질환
12299작업관련성질병 기타9909작업관련성질병 기타
123Z분류불능Z분류불능