Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 207 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 10.4 KiB |
Average record size in memory | 51.6 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 한국산업안전보건공단에서 제공하는 KOSHA_MS 인증 현황에 관한 데이터로 사업장명, 현장명, 인증년도, 인증결정일, 인증번호, 인증유효기간, 진행상태에 관한 데이터를 확인하실 수 있습니다. |
---|---|
Author | 한국산업안전보건공단 |
URL | https://www.data.go.kr/data/15102716/fileData.do |
구분 is highly overall correlated with 비고 | High correlation |
비고 is highly overall correlated with 연번 and 3 other fields | High correlation |
연번 is highly overall correlated with 인증번호 and 2 other fields | High correlation |
인증번호 is highly overall correlated with 연번 and 2 other fields | High correlation |
인증연도 is highly overall correlated with 연번 and 2 other fields | High correlation |
연번 has unique values | Unique |
인증번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 01:40:13.222680 |
---|---|
Analysis finished | 2023-12-12 01:40:14.731078 |
Duration | 1.51 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 207 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 104 |
Minimum | 1 |
---|---|
Maximum | 207 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 11.3 |
Q1 | 52.5 |
median | 104 |
Q3 | 155.5 |
95-th percentile | 196.7 |
Maximum | 207 |
Range | 206 |
Interquartile range (IQR) | 103 |
Descriptive statistics
Standard deviation | 59.899917 |
---|---|
Coefficient of variation (CV) | 0.57596074 |
Kurtosis | -1.2 |
Mean | 104 |
Median Absolute Deviation (MAD) | 52 |
Skewness | 0 |
Sum | 21528 |
Variance | 3588 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.5% |
2 | 1 | 0.5% |
133 | 1 | 0.5% |
134 | 1 | 0.5% |
135 | 1 | 0.5% |
136 | 1 | 0.5% |
137 | 1 | 0.5% |
138 | 1 | 0.5% |
139 | 1 | 0.5% |
140 | 1 | 0.5% |
Other values (197) | 197 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
207 | 1 | |
206 | 1 | |
205 | 1 | |
204 | 1 | |
203 | 1 | |
202 | 1 | |
201 | 1 | |
200 | 1 | |
199 | 1 | |
198 | 1 |
구분
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
전문건설업체 | |
---|---|
종합건설업체 | |
발주기관 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.7874396 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 종합건설업체 |
---|---|
2nd row | 종합건설업체 |
3rd row | 종합건설업체 |
4th row | 종합건설업체 |
5th row | 발주기관 |
Common Values
Value | Count | Frequency (%) |
전문건설업체 | 144 | |
종합건설업체 | 41 | 19.8% |
발주기관 | 22 | 10.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
전문건설업체 | 144 | |
종합건설업체 | 41 | 19.8% |
발주기관 | 22 | 10.6% |
인증번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 207 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1549.8599 |
Minimum | 435 |
---|---|
Maximum | 3317 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 435 |
---|---|
5-th percentile | 445.3 |
Q1 | 613 |
median | 1286 |
Q3 | 2422.5 |
95-th percentile | 3041.7 |
Maximum | 3317 |
Range | 2882 |
Interquartile range (IQR) | 1809.5 |
Descriptive statistics
Standard deviation | 921.57404 |
---|---|
Coefficient of variation (CV) | 0.59461764 |
Kurtosis | -1.2711936 |
Mean | 1549.8599 |
Median Absolute Deviation (MAD) | 819 |
Skewness | 0.31916469 |
Sum | 320821 |
Variance | 849298.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
435 | 1 | 0.5% |
436 | 1 | 0.5% |
1968 | 1 | 0.5% |
1969 | 1 | 0.5% |
2010 | 1 | 0.5% |
2011 | 1 | 0.5% |
2163 | 1 | 0.5% |
2164 | 1 | 0.5% |
2165 | 1 | 0.5% |
2166 | 1 | 0.5% |
Other values (197) | 197 |
Value | Count | Frequency (%) |
435 | 1 | |
436 | 1 | |
437 | 1 | |
438 | 1 | |
439 | 1 | |
440 | 1 | |
441 | 1 | |
442 | 1 | |
443 | 1 | |
444 | 1 |
Value | Count | Frequency (%) |
3317 | 1 | |
3316 | 1 | |
3315 | 1 | |
3314 | 1 | |
3313 | 1 | |
3312 | 1 | |
3311 | 1 | |
3310 | 1 | |
3044 | 1 | |
3043 | 1 |
사업장명
Text
Distinct | 203 |
---|---|
Distinct (%) | 98.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
건설부문 | 3 | 1.4% |
삼성물산㈜ | 2 | 0.9% |
sk건설(주 | 2 | 0.9% |
현대건설㈜ | 2 | 0.9% |
㈜신성이엔지 | 2 | 0.9% |
보림토건(주 | 2 | 0.9% |
무경설비㈜ | 1 | 0.5% |
일광전설㈜ | 1 | 0.5% |
티엔에스㈜ | 1 | 0.5% |
sk | 1 | 0.5% |
Other values (204) | 204 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 122 | 8.0% |
) | 119 | 7.8% |
( | 119 | 7.8% |
㈜ | 70 | 4.6% |
건 | 61 | 4.0% |
설 | 55 | 3.6% |
이 | 37 | 2.4% |
기 | 28 | 1.8% |
엔 | 27 | 1.8% |
공 | 24 | 1.6% |
Other values (192) | 868 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1159 | |
Close Punctuation | 119 | 7.8% |
Open Punctuation | 119 | 7.8% |
Other Symbol | 70 | 4.6% |
Uppercase Letter | 42 | 2.7% |
Space Separator | 17 | 1.1% |
Other Punctuation | 4 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 122 | 10.5% |
건 | 61 | 5.3% |
설 | 55 | 4.7% |
이 | 37 | 3.2% |
기 | 28 | 2.4% |
엔 | 27 | 2.3% |
공 | 24 | 2.1% |
한 | 23 | 2.0% |
업 | 21 | 1.8% |
전 | 19 | 1.6% |
Other values (172) | 742 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 9 | |
C | 7 | |
E | 6 | |
K | 4 | |
G | 4 | |
M | 2 | 4.8% |
J | 2 | 4.8% |
L | 2 | 4.8% |
D | 1 | 2.4% |
F | 1 | 2.4% |
Other values (4) | 4 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 | |
& | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 119 |
Open Punctuation
Value | Count | Frequency (%) |
( | 119 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 70 |
Space Separator
Value | Count | Frequency (%) |
17 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1229 | |
Common | 259 | 16.9% |
Latin | 42 | 2.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 122 | 9.9% |
㈜ | 70 | 5.7% |
건 | 61 | 5.0% |
설 | 55 | 4.5% |
이 | 37 | 3.0% |
기 | 28 | 2.3% |
엔 | 27 | 2.2% |
공 | 24 | 2.0% |
한 | 23 | 1.9% |
업 | 21 | 1.7% |
Other values (173) | 761 |
Latin
Value | Count | Frequency (%) |
S | 9 | |
C | 7 | |
E | 6 | |
K | 4 | |
G | 4 | |
M | 2 | 4.8% |
J | 2 | 4.8% |
L | 2 | 4.8% |
D | 1 | 2.4% |
F | 1 | 2.4% |
Other values (4) | 4 |
Common
Value | Count | Frequency (%) |
) | 119 | |
( | 119 | |
17 | 6.6% | |
. | 2 | 0.8% |
& | 2 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1159 | |
ASCII | 301 | 19.7% |
None | 70 | 4.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 122 | 10.5% |
건 | 61 | 5.3% |
설 | 55 | 4.7% |
이 | 37 | 3.2% |
기 | 28 | 2.4% |
엔 | 27 | 2.3% |
공 | 24 | 2.1% |
한 | 23 | 2.0% |
업 | 21 | 1.8% |
전 | 19 | 1.6% |
Other values (172) | 742 |
ASCII
Value | Count | Frequency (%) |
) | 119 | |
( | 119 | |
17 | 5.6% | |
S | 9 | 3.0% |
C | 7 | 2.3% |
E | 6 | 2.0% |
K | 4 | 1.3% |
G | 4 | 1.3% |
M | 2 | 0.7% |
J | 2 | 0.7% |
Other values (9) | 12 | 4.0% |
None
Value | Count | Frequency (%) |
㈜ | 70 |
인증연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2013.3285 |
Minimum | 2002 |
---|---|
Maximum | 2022 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 2002 |
---|---|
5-th percentile | 2006.3 |
Q1 | 2010 |
median | 2012 |
Q3 | 2017 |
95-th percentile | 2021 |
Maximum | 2022 |
Range | 20 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 4.7895831 |
---|---|
Coefficient of variation (CV) | 0.0023789377 |
Kurtosis | -0.67914146 |
Mean | 2013.3285 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.051684071 |
Sum | 416759 |
Variance | 22.940106 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2012 | 25 | |
2017 | 22 | 10.6% |
2010 | 16 | 7.7% |
2011 | 16 | 7.7% |
2009 | 15 | 7.2% |
2018 | 11 | 5.3% |
2008 | 11 | 5.3% |
2015 | 11 | 5.3% |
2020 | 10 | 4.8% |
2007 | 10 | 4.8% |
Other values (11) | 60 |
Value | Count | Frequency (%) |
2002 | 3 | 1.4% |
2003 | 3 | 1.4% |
2004 | 1 | 0.5% |
2005 | 2 | 1.0% |
2006 | 2 | 1.0% |
2007 | 10 | |
2008 | 11 | |
2009 | 15 | |
2010 | 16 | |
2011 | 16 |
Value | Count | Frequency (%) |
2022 | 8 | 3.9% |
2021 | 8 | 3.9% |
2020 | 10 | |
2019 | 7 | 3.4% |
2018 | 11 | |
2017 | 22 | |
2016 | 10 | |
2015 | 11 | |
2014 | 7 | 3.4% |
2013 | 9 |
비고
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
<NA> | |
---|---|
취소 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.1497585 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 취소 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | 취소 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 119 | |
취소 | 88 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 119 | |
취소 | 88 |
연번 | 구분 | 인증번호 | 인증연도 | |
---|---|---|---|---|
연번 | 1.000 | 0.334 | 0.976 | 0.979 |
구분 | 0.334 | 1.000 | 0.000 | 0.356 |
인증번호 | 0.976 | 0.000 | 1.000 | 0.974 |
인증연도 | 0.979 | 0.356 | 0.974 | 1.000 |
구분 | 비고 | |
---|---|---|
구분 | 1.000 | 1.000 |
비고 | 1.000 | 1.000 |
연번 | 인증번호 | 인증연도 | 구분 | 비고 | |
---|---|---|---|---|---|
연번 | 1.000 | 1.000 | 0.997 | 0.206 | 1.000 |
인증번호 | 1.000 | 1.000 | 0.997 | 0.000 | 1.000 |
인증연도 | 0.997 | 0.997 | 1.000 | 0.254 | 1.000 |
구분 | 0.206 | 0.000 | 0.254 | 1.000 | 1.000 |
비고 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
연번 | 구분 | 인증번호 | 사업장명 | 인증연도 | 비고 | |
---|---|---|---|---|---|---|
0 | 1 | 종합건설업체 | 435 | 삼성물산㈜ 건설부문 | 2002 | 취소 |
1 | 2 | 종합건설업체 | 436 | 롯데건설㈜ | 2002 | <NA> |
2 | 3 | 종합건설업체 | 437 | ㈜태영건설 | 2002 | <NA> |
3 | 4 | 종합건설업체 | 438 | ㈜포스코건설 | 2003 | 취소 |
4 | 5 | 발주기관 | 439 | 한국도로공사 | 2003 | <NA> |
5 | 6 | 발주기관 | 440 | 한국서부발전㈜ 청송양수건설처 | 2003 | 취소 |
6 | 7 | 종합건설업체 | 441 | 현대건설㈜ | 2004 | 취소 |
7 | 8 | 종합건설업체 | 442 | ㈜한진중공업 건설부문 | 2005 | <NA> |
8 | 9 | 발주기관 | 443 | 한국수력원자력㈜ 예천양수건설처 | 2005 | 취소 |
9 | 10 | 발주기관 | 444 | 인천국제공항공사 | 2006 | <NA> |
연번 | 구분 | 인증번호 | 사업장명 | 인증연도 | 비고 | |
---|---|---|---|---|---|---|
197 | 198 | 전문건설업체 | 3043 | 동남통신건설㈜ | 2021 | <NA> |
198 | 199 | 전문건설업체 | 3044 | (주)세광통신 | 2021 | <NA> |
199 | 200 | 발주기관 | 3310 | 국가철도공단 | 2022 | <NA> |
200 | 201 | 종합건설업체 | 3311 | 금강주택(주) | 2022 | <NA> |
201 | 202 | 종합건설업체 | 3312 | 우미건설(주) | 2022 | <NA> |
202 | 203 | 종합건설업체 | 3313 | 삼환기업(주) | 2022 | <NA> |
203 | 204 | 전문건설업체 | 3314 | (주)창원기전 | 2022 | <NA> |
204 | 205 | 전문건설업체 | 3315 | (유)엠케이지 | 2022 | <NA> |
205 | 206 | 전문건설업체 | 3316 | (주)윈하이텍 | 2022 | <NA> |
206 | 207 | 전문건설업체 | 3317 | 동서통신(주) | 2022 | <NA> |