Overview

Dataset statistics

Number of variables: 4
Number of observations: 117
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.0 KiB
Average record size in memory: 35.1 B

Variable types

Categorical: 2
Numeric: 1
Text: 1

Dataset

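Description: A list of classification codes for machinery operated at workplaces. It consists of the columns 기인물명코드 (causative agent code), 기인물명 (causative agent name), 기인물명상세코드 (causative agent detail code), and 기인물명상세 (causative agent detail).
URL: https://www.data.go.kr/data/15072603/fileData.do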

Alerts

기인물명코드 has constant value "1" (Constant)
기인물명상세코드 is highly overall correlated with 기인물명 (High correlation)
기인물명 is highly overall correlated with 기인물명상세코드 (High correlation)
기인물명상세코드 has unique values (Unique)
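The Constant and Unique alerts follow from the distinct counts reported for each variable. The sketch below is a minimal illustration, assuming a column is flagged Constant when it has exactly one distinct value and Unique when every observation is distinct. The function name `classify_alerts` is hypothetical and not part of ydata-profiling.

```python
# Distinct counts per column, as reported in the Variables section.
N_OBSERVATIONS = 117
DISTINCT = {
    "기인물명코드": 1,
    "기인물명": 21,
    "기인물명상세코드": 117,
    "기인물명상세": 103,
}


def classify_alerts(distinct: dict, n_rows: int) -> dict:
    """Assumed rule: 1 distinct value -> Constant, n_rows distinct -> Unique."""
    alerts = {}
    for column, n_distinct in distinct.items():
        if n_distinct == 1:
            alerts[column] = "Constant"
        elif n_distinct == n_rows:
            alerts[column] = "Unique"
    return alerts


alerts = classify_alerts(DISTINCT, N_OBSERVATIONS)
for column, label in alerts.items():
    print(f"{column}: {label}")
```

Under this rule, only 기인물명코드 and 기인물명상세코드 are flagged, which matches the Constant and Unique alerts listed here.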

Reproduction

Analysis started: 2023-12-12 14:09:26.344031
Analysis finished: 2023-12-12 14:09:26.842491
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
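A report of this kind can be regenerated roughly as follows. This is a sketch under stated assumptions: the dataset is taken to be a CSV file downloaded from the URL above, the file path, output name, and report title are hypothetical, and the text encoding may need to be set explicitly in `read_csv`. `ProfileReport` and `to_file` are the ydata-profiling entry points used here.

```python
from pathlib import Path


def build_report(csv_path: str, out_html: str = "report.html") -> Path:
    """Profile a CSV file and write an HTML report (sketch, see assumptions)."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is a CSV readable with default settings.
    df = pd.read_csv(csv_path)

    # Hypothetical title; the original report's title is not shown.
    report = ProfileReport(df, title="Causative agent classification codes")
    report.to_file(out_html)
    return Path(out_html)
```

Calling `build_report("codes.csv")` with a locally saved copy of the data (the file name is a placeholder) would write `report.html` to the working directory.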

Variables

기인물명코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB
Value counts: 1 (117)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 117 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

기인물명
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 17.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB
Value counts: 건설용기계 (32), 일반동력기계 (11), 동력크레인 (9), 가설건축구조물 (9), 운반차량 (8), Other values (16) (48)

Length

Max length: 9
Median length: 7
Mean length: 4.9316239
Min length: 2

Unique

Unique: 7
Unique (%): 6.0%
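Distinct and Unique measure different things: this column holds 21 distinct values, but only 7 of them occur exactly once. The sketch below illustrates the difference on a made-up list (the sample values are hypothetical, not the real column), assuming Unique counts the values that appear a single time and that both percentages are relative to the number of rows.

```python
from collections import Counter

# Hypothetical sample, not the real column contents.
values = ["건설용기계", "건설용기계", "운반차량", "운반차량", "적재물", "산업용로봇"]

counts = Counter(values)
n_distinct = len(counts)                              # number of different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(n_distinct, n_unique)
print(f"{n_distinct / len(values):.1%}", f"{n_unique / len(values):.1%}")

# The same arithmetic with the reported figures: 21 and 7 out of 117 rows.
distinct_pct = f"{21 / 117:.1%}"
unique_pct = f"{7 / 117:.1%}"
print(distinct_pct, unique_pct)
```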

Sample

1st row: 일반동력기계
2nd row: 일반동력기계
3rd row: 일반동력기계
4th row: 일반동력기계
5th row: 일반동력기계

Common Values

Value | Count | Frequency (%)
건설용기계 | 32 | 27.4%
일반동력기계 | 11 | 9.4%
동력크레인 | 9 | 7.7%
가설건축구조물 | 9 | 7.7%
운반차량 | 8 | 6.8%
목재가공용기계 | 6 | 5.1%
환경 | 6 | 5.1%
유해위험물 | 6 | 5.1%
재료 | 4 | 3.4%
인력기계용구 | 4 | 3.4%
Other values (11) | 22 | 18.8%
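The frequency column is each count divided by the 117 observations, and the "Other values" row is whatever the ten listed values do not cover. A minimal sketch of that arithmetic, using the counts from the table above; the helper name `share` is hypothetical.

```python
TOTAL_ROWS = 117  # number of observations in the report

# Top-10 counts for 기인물명, copied from the Common Values table.
top_counts = {
    "건설용기계": 32, "일반동력기계": 11, "동력크레인": 9, "가설건축구조물": 9,
    "운반차량": 8, "목재가공용기계": 6, "환경": 6, "유해위험물": 6,
    "재료": 4, "인력기계용구": 4,
}

# Rows not covered by the listed values fall into "Other values".
other_count = TOTAL_ROWS - sum(top_counts.values())


def share(count: int) -> float:
    """Percentage of all rows, rounded to one decimal as in the report."""
    return round(100 * count / TOTAL_ROWS, 1)


for name, count in top_counts.items():
    print(f"{name}: {count} ({share(count)}%)")
print(f"Other values: {other_count} ({share(other_count)}%)")
```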

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
건설용기계 | 32 | 27.4%
일반동력기계 | 11 | 9.4%
동력크레인 | 9 | 7.7%
가설건축구조물 | 9 | 7.7%
운반차량 | 8 | 6.8%
목재가공용기계 | 6 | 5.1%
환경 | 6 | 5.1%
유해위험물 | 6 | 5.1%
동력운반기 | 4 | 3.4%
압력용기 | 4 | 3.4%
Other values (11) | 22 | 18.8%

기인물명상세코드
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 117
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 732.02564
Minimum: 100
Maximum: 2100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 100
5-th percentile: 105.8
Q1: 219
Median: 500
Q3: 1303
95-th percentile: 1901.2
Maximum: 2100
Range: 2000
Interquartile range (IQR): 1084
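Range and IQR are derived from the other quantile figures rather than measured separately. A short check, using only the values reported here:

```python
# Quantile figures for 기인물명상세코드 as reported.
minimum, q1, median, q3, maximum = 100, 219, 500, 1303, 2100

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(f"Range: {value_range}")
print(f"IQR: {iqr}")
```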

Descriptive statistics

Standard deviation: 602.62226
Coefficient of variation (CV): 0.82322562
Kurtosis: -0.96601214
Mean: 732.02564
Median Absolute Deviation (MAD): 296
Skewness: 0.70958845
Sum: 85647
Variance: 363153.59
Monotonicity: Not monotonic
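Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below re-derives them from the reported numbers; small differences are expected because the inputs are rounded.

```python
# Reported figures for 기인물명상세코드.
n = 117
total = 85647
std = 602.62226

mean = total / n      # should be close to the reported 732.02564
variance = std ** 2   # should be close to the reported 363153.59
cv = std / mean       # should be close to the reported 0.82322562

print(f"mean={mean:.5f} variance={variance:.2f} cv={cv:.8f}")
```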
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
100 | 1 | 0.9%
802 | 1 | 0.9%
1302 | 1 | 0.9%
1301 | 1 | 0.9%
1300 | 1 | 0.9%
1200 | 1 | 0.9%
1201 | 1 | 0.9%
1203 | 1 | 0.9%
1202 | 1 | 0.9%
1100 | 1 | 0.9%
Other values (107) | 107 | 91.5%

Minimum 10 values

Value | Count | Frequency (%)
100 | 1 | 0.9%
101 | 1 | 0.9%
102 | 1 | 0.9%
103 | 1 | 0.9%
104 | 1 | 0.9%
105 | 1 | 0.9%
106 | 1 | 0.9%
107 | 1 | 0.9%
108 | 1 | 0.9%
109 | 1 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
2100 | 1 | 0.9%
2000 | 1 | 0.9%
1905 | 1 | 0.9%
1904 | 1 | 0.9%
1903 | 1 | 0.9%
1902 | 1 | 0.9%
1901 | 1 | 0.9%
1900 | 1 | 0.9%
1800 | 1 | 0.9%
1700 | 1 | 0.9%

기인물명상세
Text

Distinct: 103
Distinct (%): 88.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 10
Median length: 8
Mean length: 4.1623932
Min length: 1

Characters and Unicode

Total characters: 487
Distinct characters: 184
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
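The category names used in this section (Other Letter, Space Separator, Other Punctuation) are Unicode general categories, which Python exposes through the standard `unicodedata` module. The sketch below classifies the characters of two values that appear in this column. Detecting Hangul through the character name is a simplification chosen for this example, not the method the profiling tool uses.

```python
import unicodedata
from collections import Counter

# Two values from the 기인물명상세 column as shown in this report,
# normalized to NFC so each Hangul syllable is one code point.
samples = [unicodedata.normalize("NFC", s) for s in ["프레스 및 전단기", "로,요등"]]

# General category per character:
# Lo = Other Letter, Zs = Space Separator, Po = Other Punctuation.
categories = Counter(
    unicodedata.category(ch) for text in samples for ch in text
)

# Rough script check based on the official character name.
hangul = sum(
    1
    for text in samples
    for ch in text
    if unicodedata.name(ch, "").startswith("HANGUL")
)

print(dict(categories), hangul)
```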

Unique

Unique: 102
Unique (%): 87.2%

Sample

1st row: 원동기
2nd row: 동력전도장치
3rd row: 프레스 및 전단기
4th row: 사출성형기
5th row: 로울러기

Words

Value | Count | Frequency (%)
기타 | 15 | 12.4%
(not captured) | 2 | 1.7%
공기압축기 | 1 | 0.8%
압력용기 | 1 | 0.8%
송배전선 | 1 | 0.8%
전력설비 | 1 | 0.8%
조명설비 | 1 | 0.8%
로,요등 | 1 | 0.8%
건조설비 | 1 | 0.8%
화학설비 | 1 | 0.8%
Other values (96) | 96 | 79.3%

Most occurring characters

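Value | Count | Frequency (%)
(not captured) | 44 | 9.0%
(not captured) | 18 | 3.7%
(not captured) | 17 | 3.5%
(not captured) | 12 | 2.5%
(not captured) | 10 | 2.1%
(not captured) | 9 | 1.8%
(not captured) | 9 | 1.8%
(not captured) | 8 | 1.6%
(not captured) | 8 | 1.6%
(not captured) | 7 | 1.4%
Other values (174) | 345 | 70.8%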

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 481 | 98.8%
Space Separator | 5 | 1.0%
Other Punctuation | 1 | 0.2%

Most frequent character per category

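Other Letter

Value | Count | Frequency (%)
(not captured) | 44 | 9.1%
(not captured) | 18 | 3.7%
(not captured) | 17 | 3.5%
(not captured) | 12 | 2.5%
(not captured) | 10 | 2.1%
(not captured) | 9 | 1.9%
(not captured) | 9 | 1.9%
(not captured) | 8 | 1.7%
(not captured) | 8 | 1.7%
(not captured) | 7 | 1.5%
Other values (172) | 339 | 70.5%

Space Separator

Value | Count | Frequency (%)
(space) | 5 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
, | 1 | 100.0%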

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 481 | 98.8%
Common | 6 | 1.2%

Most frequent character per script

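Hangul

Value | Count | Frequency (%)
(not captured) | 44 | 9.1%
(not captured) | 18 | 3.7%
(not captured) | 17 | 3.5%
(not captured) | 12 | 2.5%
(not captured) | 10 | 2.1%
(not captured) | 9 | 1.9%
(not captured) | 9 | 1.9%
(not captured) | 8 | 1.7%
(not captured) | 8 | 1.7%
(not captured) | 7 | 1.5%
Other values (172) | 339 | 70.5%

Common

Value | Count | Frequency (%)
(space) | 5 | 83.3%
, | 1 | 16.7%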

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 481 | 98.8%
ASCII | 6 | 1.2%

Most frequent character per block

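Hangul

Value | Count | Frequency (%)
(not captured) | 44 | 9.1%
(not captured) | 18 | 3.7%
(not captured) | 17 | 3.5%
(not captured) | 12 | 2.5%
(not captured) | 10 | 2.1%
(not captured) | 9 | 1.9%
(not captured) | 9 | 1.9%
(not captured) | 8 | 1.7%
(not captured) | 8 | 1.7%
(not captured) | 7 | 1.5%
Other values (172) | 339 | 70.5%

ASCII

Value | Count | Frequency (%)
(space) | 5 | 83.3%
, | 1 | 16.7%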

Interactions


Correlations

[Figure: correlation heatmap]

Variable | 기인물명 | 기인물명상세코드
기인물명 | 1.000 | 0.986
기인물명상세코드 | 0.986 | 1.000

[Figure: correlation heatmap]

Variable | 기인물명상세코드 | 기인물명
기인물명상세코드 | 1.000 | 0.864
기인물명 | 0.864 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

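First rows

Row | 기인물명코드 | 기인물명 | 기인물명상세코드 | 기인물명상세
0 | 1 | 일반동력기계 | 100 | 원동기
1 | 1 | 일반동력기계 | 101 | 동력전도장치
2 | 1 | 일반동력기계 | 102 | 프레스 및 전단기
3 | 1 | 일반동력기계 | 103 | 사출성형기
4 | 1 | 일반동력기계 | 104 | 로울러기
5 | 1 | 일반동력기계 | 110 | 기타
6 | 1 | 일반동력기계 | 106 | 선반
7 | 1 | 일반동력기계 | 107 | 드릴머신
8 | 1 | 일반동력기계 | 108 | 혼합기 및 분석기
9 | 1 | 일반동력기계 | 109 | 절곡기

Last rows

Row | 기인물명코드 | 기인물명 | 기인물명상세코드 | 기인물명상세
107 | 1 | 적재물 | 1700 | 적재물
108 | 1 | 산업용로봇 | 1800 | 산업용로봇
109 | 1 | 환경 | 1900 | 지반암석
110 | 1 | 환경 | 1901 |
111 | 1 | 환경 | 1902 | 이상환경
112 | 1 | 환경 | 1903 | 산소결핍
113 | 1 | 환경 | 1904 | 고온저온환경
114 | 1 | 환경 | 1905 | 기타
115 | 1 | 기타 | 2000 | 기타
116 | 1 | 기인물없음분류불능 | 2100 | 기인물없음분류불능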
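The sample rows hint at why 기인물명상세코드 and 기인물명 are flagged as highly correlated: the leading digits of the detail code appear to identify the category. The sketch below checks that on a handful of the sample rows. The "hundreds prefix equals category" rule is an assumption inferred from the sample, not something the report states, and it is not verified against all 117 rows.

```python
from collections import defaultdict

# (기인물명, 기인물명상세코드) pairs copied from the first and last sample rows.
rows = [
    ("일반동력기계", 100), ("일반동력기계", 101), ("일반동력기계", 102),
    ("일반동력기계", 110), ("적재물", 1700), ("산업용로봇", 1800),
    ("환경", 1900), ("환경", 1905), ("기타", 2000),
    ("기인물없음분류불능", 2100),
]

# Assumed rule: the detail code divided by 100 identifies the category.
names_by_prefix = defaultdict(set)
for name, detail_code in rows:
    names_by_prefix[detail_code // 100].add(name)

# True when every prefix maps to exactly one category name.
consistent = all(len(names) == 1 for names in names_by_prefix.values())

for prefix, names in sorted(names_by_prefix.items()):
    print(prefix, sorted(names))
print("consistent:", consistent)
```

If the rule holds across the full file, the category could be derived from the detail code alone, which would make 기인물명 redundant for modelling purposes.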