Dataset statistics
Number of variables | 33 |
---|---|
Number of observations | 10000 |
Missing cells | 167011 |
Missing cells (%) | 50.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.8 MiB |
Average record size in memory | 296.0 B |
Variable types
Numeric | 9 |
---|---|
Text | 4 |
Categorical | 2 |
Boolean | 2 |
DateTime | 2 |
Unsupported | 14 |
Dataset
Description | 기초안전보건교육 실시계획 정보 |
---|---|
Author | 한국산업안전보건공단 |
URL | https://www.data.go.kr/data/15066093/fileData.do |
DEL_YN has constant value "" | Constant |
LCTRUM_AR is highly imbalanced (61.2%) | Imbalance |
PROCESS_STTUS is highly imbalanced (97.8%) | Imbalance |
UNIQ_ID has 8733 (87.3%) missing values | Missing |
LCTRUM_ZIP has 9205 (92.0%) missing values | Missing |
LCTRUM_ADRES2 has 443 (4.4%) missing values | Missing |
MNG_LABOFFICE has 558 (5.6%) missing values | Missing |
MNG_AGENT has 558 (5.6%) missing values | Missing |
FRST_REGISTER_PNTTM has 473 (4.7%) missing values | Missing |
LAST_UPDUSR_PNTTM has 7041 (70.4%) missing values | Missing |
FILE_STRE_COURS1 has 10000 (100.0%) missing values | Missing |
STRE_FILE_NM1 has 10000 (100.0%) missing values | Missing |
ORIGNL_FILE_NM1 has 10000 (100.0%) missing values | Missing |
FILE_STRE_COURS2 has 10000 (100.0%) missing values | Missing |
STRE_FILE_NM2 has 10000 (100.0%) missing values | Missing |
ORIGNL_FILE_NM2 has 10000 (100.0%) missing values | Missing |
FILE_STRE_COURS3 has 10000 (100.0%) missing values | Missing |
STRE_FILE_NM3 has 10000 (100.0%) missing values | Missing |
ORIGNL_FILE_NM3 has 10000 (100.0%) missing values | Missing |
FILE_STRE_COURS4 has 10000 (100.0%) missing values | Missing |
STRE_FILE_NM4 has 10000 (100.0%) missing values | Missing |
ORIGNL_FILE_NM4 has 10000 (100.0%) missing values | Missing |
PYMNT_DCSN_DE has 10000 (100.0%) missing values | Missing |
RCOGN_NMPR has 10000 (100.0%) missing values | Missing |
EDC_YEAR is highly skewed (γ1 = 48.39583084) | Skewed |
EDC_DT is highly skewed (γ1 = 48.42075221) | Skewed |
FILE_STRE_COURS1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
STRE_FILE_NM1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ORIGNL_FILE_NM1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
FILE_STRE_COURS2 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
STRE_FILE_NM2 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ORIGNL_FILE_NM2 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
FILE_STRE_COURS3 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
STRE_FILE_NM3 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ORIGNL_FILE_NM3 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
FILE_STRE_COURS4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
STRE_FILE_NM4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
ORIGNL_FILE_NM4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PYMNT_DCSN_DE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
RCOGN_NMPR is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 08:53:01.289324 |
---|---|
Analysis finished | 2023-12-12 08:53:02.827354 |
Duration | 1.54 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
EDC_YEAR
Real number (ℝ)
SKEWED
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2013.3657 |
Minimum | 2009 |
---|---|
Maximum | 2414 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2009 |
---|---|
5-th percentile | 2010 |
Q1 | 2013 |
median | 2014 |
Q3 | 2014 |
95-th percentile | 2014 |
Maximum | 2414 |
Range | 405 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 8.1006497 |
---|---|
Coefficient of variation (CV) | 0.0040234368 |
Kurtosis | 2391.8465 |
Mean | 2013.3657 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 48.395831 |
Sum | 20133657 |
Variance | 65.620526 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2014 | 4919 | |
2013 | 3288 | |
2012 | 1092 | 10.9% |
2009 | 397 | 4.0% |
2010 | 142 | 1.4% |
2015 | 139 | 1.4% |
2011 | 19 | 0.2% |
2414 | 4 | < 0.1% |
Value | Count | Frequency (%) |
2009 | 397 | 4.0% |
2010 | 142 | 1.4% |
2011 | 19 | 0.2% |
2012 | 1092 | 10.9% |
2013 | 3288 | |
2014 | 4919 | |
2015 | 139 | 1.4% |
2414 | 4 | < 0.1% |
Value | Count | Frequency (%) |
2414 | 4 | < 0.1% |
2015 | 139 | 1.4% |
2014 | 4919 | |
2013 | 3288 | |
2012 | 1092 | 10.9% |
2011 | 19 | 0.2% |
2010 | 142 | 1.4% |
2009 | 397 | 4.0% |
UNIQ_ID
Real number (ℝ)
MISSING
 
Distinct | 1267 |
---|---|
Distinct (%) | 100.0% |
Missing | 8733 |
Missing (%) | 87.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17586.539 |
Minimum | 13 |
---|---|
Maximum | 43496 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 13 |
---|---|
5-th percentile | 596.4 |
Q1 | 3787.5 |
median | 16114 |
Q3 | 29536 |
95-th percentile | 41053.7 |
Maximum | 43496 |
Range | 43483 |
Interquartile range (IQR) | 25748.5 |
Descriptive statistics
Standard deviation | 13935.751 |
---|---|
Coefficient of variation (CV) | 0.79241012 |
Kurtosis | -1.234343 |
Mean | 17586.539 |
Median Absolute Deviation (MAD) | 12429 |
Skewness | 0.3590532 |
Sum | 22282145 |
Variance | 1.9420517 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21186 | 1 | < 0.1% |
7732 | 1 | < 0.1% |
34255 | 1 | < 0.1% |
4123 | 1 | < 0.1% |
27512 | 1 | < 0.1% |
33996 | 1 | < 0.1% |
17267 | 1 | < 0.1% |
2299 | 1 | < 0.1% |
3595 | 1 | < 0.1% |
3458 | 1 | < 0.1% |
Other values (1257) | 1257 | 12.6% |
(Missing) | 8733 |
Value | Count | Frequency (%) |
13 | 1 | |
18 | 1 | |
20 | 1 | |
27 | 1 | |
37 | 1 | |
39 | 1 | |
63 | 1 | |
65 | 1 | |
70 | 1 | |
73 | 1 |
Value | Count | Frequency (%) |
43496 | 1 | |
43453 | 1 | |
43382 | 1 | |
43378 | 1 | |
43352 | 1 | |
43291 | 1 | |
43283 | 1 | |
43281 | 1 | |
43270 | 1 | |
43235 | 1 |
EDC_DT
Real number (ℝ)
SKEWED
 
Distinct | 1097 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20134392 |
Minimum | 20090714 |
---|---|
Maximum | 24141212 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20090714 |
---|---|
5-th percentile | 20101123 |
Q1 | 20130418 |
median | 20140108 |
Q3 | 20140723 |
95-th percentile | 20141205 |
Maximum | 24141212 |
Range | 4050498 |
Interquartile range (IQR) | 10305.25 |
Descriptive statistics
Standard deviation | 81002.295 |
---|---|
Coefficient of variation (CV) | 0.0040230813 |
Kurtosis | 2393.4802 |
Mean | 20134392 |
Median Absolute Deviation (MAD) | 8887 |
Skewness | 48.420752 |
Sum | 2.0134392 × 1011 |
Variance | 6.5613718 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20141013 | 32 | 0.3% |
20141114 | 32 | 0.3% |
20141007 | 32 | 0.3% |
20141107 | 30 | 0.3% |
20141202 | 30 | 0.3% |
20141209 | 28 | 0.3% |
20141021 | 28 | 0.3% |
20140822 | 28 | 0.3% |
20140813 | 27 | 0.3% |
20141104 | 27 | 0.3% |
Other values (1087) | 9706 |
Value | Count | Frequency (%) |
20090714 | 1 | < 0.1% |
20090718 | 2 | |
20090720 | 1 | < 0.1% |
20090721 | 3 | |
20090723 | 2 | |
20090724 | 1 | < 0.1% |
20090725 | 1 | < 0.1% |
20090728 | 1 | < 0.1% |
20090729 | 2 | |
20090730 | 3 |
Value | Count | Frequency (%) |
24141212 | 2 | |
24141208 | 2 | |
20150331 | 1 | |
20150327 | 1 | |
20150326 | 1 | |
20150324 | 1 | |
20150320 | 1 | |
20150316 | 2 | |
20150311 | 1 | |
20150306 | 1 |
EDC_TIME_S
Real number (ℝ)
Distinct | 37 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1102.449 |
Minimum | 600 |
---|---|
Maximum | 1900 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 600 |
---|---|
5-th percentile | 800 |
Q1 | 800 |
median | 1300 |
Q3 | 1300 |
95-th percentile | 1400 |
Maximum | 1900 |
Range | 1300 |
Interquartile range (IQR) | 500 |
Descriptive statistics
Standard deviation | 263.09679 |
---|---|
Coefficient of variation (CV) | 0.23864758 |
Kurtosis | -1.4448171 |
Mean | 1102.449 |
Median Absolute Deviation (MAD) | 100 |
Skewness | -0.021136498 |
Sum | 11024490 |
Variance | 69219.919 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1300 | 3238 | |
800 | 2320 | |
1330 | 1106 | 11.1% |
830 | 1088 | 10.9% |
1400 | 931 | 9.3% |
900 | 841 | 8.4% |
730 | 209 | 2.1% |
1800 | 49 | 0.5% |
700 | 33 | 0.3% |
1700 | 25 | 0.2% |
Other values (27) | 160 | 1.6% |
Value | Count | Frequency (%) |
600 | 1 | < 0.1% |
700 | 33 | 0.3% |
720 | 1 | < 0.1% |
730 | 209 | 2.1% |
740 | 1 | < 0.1% |
750 | 4 | < 0.1% |
800 | 2320 | |
810 | 2 | < 0.1% |
815 | 1 | < 0.1% |
820 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1900 | 8 | 0.1% |
1830 | 18 | 0.2% |
1810 | 1 | < 0.1% |
1800 | 49 | |
1730 | 19 | 0.2% |
1710 | 1 | < 0.1% |
1700 | 25 | |
1600 | 5 | 0.1% |
1530 | 5 | 0.1% |
1500 | 16 | 0.2% |
EDC_TIME_E
Real number (ℝ)
Distinct | 37 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1502.429 |
Minimum | 1000 |
---|---|
Maximum | 2300 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 1200 |
Q1 | 1200 |
median | 1700 |
Q3 | 1700 |
95-th percentile | 1800 |
Maximum | 2300 |
Range | 1300 |
Interquartile range (IQR) | 500 |
Descriptive statistics
Standard deviation | 263.08177 |
---|---|
Coefficient of variation (CV) | 0.17510429 |
Kurtosis | -1.44463 |
Mean | 1502.429 |
Median Absolute Deviation (MAD) | 100 |
Skewness | -0.021051632 |
Sum | 15024290 |
Variance | 69212.016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1700 | 3238 | |
1200 | 2320 | |
1730 | 1106 | 11.1% |
1230 | 1088 | 10.9% |
1800 | 930 | 9.3% |
1300 | 841 | 8.4% |
1130 | 209 | 2.1% |
2200 | 49 | 0.5% |
1100 | 33 | 0.3% |
2100 | 25 | 0.2% |
Other values (27) | 161 | 1.6% |
Value | Count | Frequency (%) |
1000 | 1 | < 0.1% |
1100 | 33 | 0.3% |
1120 | 1 | < 0.1% |
1130 | 209 | 2.1% |
1140 | 1 | < 0.1% |
1150 | 4 | < 0.1% |
1200 | 2320 | |
1210 | 2 | < 0.1% |
1215 | 1 | < 0.1% |
1220 | 1 | < 0.1% |
Value | Count | Frequency (%) |
2300 | 8 | 0.1% |
2230 | 18 | 0.2% |
2210 | 1 | < 0.1% |
2200 | 49 | |
2130 | 19 | 0.2% |
2110 | 1 | < 0.1% |
2100 | 25 | |
2000 | 5 | 0.1% |
1930 | 5 | 0.1% |
1900 | 16 | 0.2% |
EDC_SBJECT
Text
Distinct | 120 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 13.7012 |
Min length | 6 |
Characters and Unicode
Total characters | 137012 |
---|---|
Distinct characters | 78 |
Distinct categories | 9 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 49 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | 건설안전및근로자건강관리 |
---|---|
2nd row | 건설안전 및 보건교육 |
3rd row | 건설안전 및 보건위생 |
4th row | 건설안전 및 근로자 건강관리 |
5th row | 건설안전 및 근로자 건강관리 |
Value | Count | Frequency (%) |
및 | 7607 | |
건설안전 | 7477 | |
건강관리 | 5442 | |
근로자 | 5238 | |
근로자건강관리 | 1448 | 4.5% |
기초안전보건교육 | 1127 | 3.5% |
건설업 | 914 | 2.9% |
보건위생 | 561 | 1.8% |
건설업기초안전보건교육 | 417 | 1.3% |
건설안전및근로자건강관리 | 367 | 1.1% |
Other values (93) | 1379 | 4.3% |
Most occurring characters
Value | Count | Frequency (%) |
22024 | ||
건 | 19494 | |
안 | 9911 | 7.2% |
전 | 9911 | 7.2% |
설 | 9618 | 7.0% |
및 | 8242 | 6.0% |
강 | 7380 | 5.4% |
관 | 7376 | 5.4% |
리 | 7375 | 5.4% |
로 | 7085 | 5.2% |
Other values (68) | 28596 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 114858 | |
Space Separator | 22024 | 16.1% |
Open Punctuation | 46 | < 0.1% |
Close Punctuation | 46 | < 0.1% |
Decimal Number | 24 | < 0.1% |
Other Punctuation | 6 | < 0.1% |
Control | 6 | < 0.1% |
Math Symbol | 1 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
건 | 19494 | |
안 | 9911 | |
전 | 9911 | |
설 | 9618 | |
및 | 8242 | 7.2% |
강 | 7380 | 6.4% |
관 | 7376 | 6.4% |
리 | 7375 | 6.4% |
로 | 7085 | 6.2% |
근 | 7085 | 6.2% |
Other values (58) | 21381 |
Decimal Number
Value | Count | Frequency (%) |
2 | 11 | |
1 | 9 | |
3 | 4 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
22024 |
Open Punctuation
Value | Count | Frequency (%) |
( | 46 |
Close Punctuation
Value | Count | Frequency (%) |
) | 46 |
Other Punctuation
Value | Count | Frequency (%) |
, | 6 |
Control
Value | Count | Frequency (%) |
6 |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 114858 | |
Common | 22154 | 16.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
건 | 19494 | |
안 | 9911 | |
전 | 9911 | |
설 | 9618 | |
및 | 8242 | 7.2% |
강 | 7380 | 6.4% |
관 | 7376 | 6.4% |
리 | 7375 | 6.4% |
로 | 7085 | 6.2% |
근 | 7085 | 6.2% |
Other values (58) | 21381 |
Common
Value | Count | Frequency (%) |
22024 | ||
( | 46 | 0.2% |
) | 46 | 0.2% |
2 | 11 | < 0.1% |
1 | 9 | < 0.1% |
, | 6 | < 0.1% |
6 | < 0.1% | |
3 | 4 | < 0.1% |
+ | 1 | < 0.1% |
- | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 114858 | |
ASCII | 22154 | 16.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
22024 | ||
( | 46 | 0.2% |
) | 46 | 0.2% |
2 | 11 | < 0.1% |
1 | 9 | < 0.1% |
, | 6 | < 0.1% |
6 | < 0.1% | |
3 | 4 | < 0.1% |
+ | 1 | < 0.1% |
- | 1 | < 0.1% |
Hangul
Value | Count | Frequency (%) |
건 | 19494 | |
안 | 9911 | |
전 | 9911 | |
설 | 9618 | |
및 | 8242 | 7.2% |
강 | 7380 | 6.4% |
관 | 7376 | 6.4% |
리 | 7375 | 6.4% |
로 | 7085 | 6.2% |
근 | 7085 | 6.2% |
Other values (58) | 21381 |
EDC_CNT
Real number (ℝ)
Distinct | 39 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.9234 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 30 |
Q1 | 50 |
median | 50 |
Q3 | 50 |
95-th percentile | 50 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 8.2490689 |
---|---|
Coefficient of variation (CV) | 0.17213029 |
Kurtosis | 9.2358424 |
Mean | 47.9234 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -1.1978276 |
Sum | 479234 |
Variance | 68.047137 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50 | 8581 | |
30 | 449 | 4.5% |
20 | 374 | 3.7% |
40 | 331 | 3.3% |
70 | 85 | 0.9% |
60 | 52 | 0.5% |
100 | 25 | 0.2% |
45 | 16 | 0.2% |
80 | 16 | 0.2% |
15 | 9 | 0.1% |
Other values (29) | 62 | 0.6% |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
5 | 1 | < 0.1% |
7 | 1 | < 0.1% |
8 | 1 | < 0.1% |
10 | 9 | 0.1% |
12 | 2 | < 0.1% |
15 | 9 | 0.1% |
18 | 3 | < 0.1% |
20 | 374 | |
25 | 6 | 0.1% |
Value | Count | Frequency (%) |
100 | 25 | 0.2% |
98 | 1 | < 0.1% |
90 | 3 | < 0.1% |
80 | 16 | 0.2% |
78 | 1 | < 0.1% |
75 | 2 | < 0.1% |
70 | 85 | |
68 | 1 | < 0.1% |
65 | 4 | < 0.1% |
60 | 52 |
LCTRUM_AR
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
40 | |
---|---|
<NA> | 584 |
20 | 362 |
10 | 327 |
30 | 232 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.1168 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 40 |
---|---|
2nd row | 40 |
3rd row | 40 |
4th row | 40 |
5th row | 40 |
Common Values
Value | Count | Frequency (%) |
40 | 8495 | |
<NA> | 584 | 5.8% |
20 | 362 | 3.6% |
10 | 327 | 3.3% |
30 | 232 | 2.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
40 | 8495 | |
na | 584 | 5.8% |
20 | 362 | 3.6% |
10 | 327 | 3.3% |
30 | 232 | 2.3% |
EDC_PLACE
Text
Distinct | 2198 |
---|---|
Distinct (%) | 22.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
강의실1 | 3361 | 21.2% |
현장 | 579 | 3.6% |
교육장 | 552 | 3.5% |
등록강의실 | 546 | 3.4% |
강의실 | 432 | 2.7% |
동화안전기술원 | 262 | 1.6% |
강의실2 | 228 | 1.4% |
신축공사 | 205 | 1.3% |
본강의장 | 198 | 1.2% |
아파트 | 178 | 1.1% |
Other values (3036) | 9348 |
Most occurring characters
Value | Count | Frequency (%) |
5955 | 7.0% | |
강 | 5507 | 6.5% |
의 | 5409 | 6.4% |
실 | 5227 | 6.1% |
1 | 4094 | 4.8% |
장 | 3162 | 3.7% |
교 | 1858 | 2.2% |
전 | 1781 | 2.1% |
육 | 1729 | 2.0% |
건 | 1661 | 2.0% |
Other values (538) | 48685 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66711 | |
Space Separator | 5955 | 7.0% |
Decimal Number | 5902 | 6.9% |
Uppercase Letter | 2977 | 3.5% |
Lowercase Letter | 934 | 1.1% |
Close Punctuation | 855 | 1.0% |
Open Punctuation | 849 | 1.0% |
Dash Punctuation | 494 | 0.6% |
Other Punctuation | 260 | 0.3% |
Other Symbol | 61 | 0.1% |
Other values (4) | 70 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
강 | 5507 | 8.3% |
의 | 5409 | 8.1% |
실 | 5227 | 7.8% |
장 | 3162 | 4.7% |
교 | 1858 | 2.8% |
전 | 1781 | 2.7% |
육 | 1729 | 2.6% |
건 | 1661 | 2.5% |
설 | 1615 | 2.4% |
안 | 1476 | 2.2% |
Other values (462) | 37286 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 344 | |
S | 304 | 10.2% |
K | 267 | 9.0% |
R | 214 | 7.2% |
P | 195 | 6.6% |
L | 182 | 6.1% |
T | 181 | 6.1% |
C | 158 | 5.3% |
D | 147 | 4.9% |
A | 141 | 4.7% |
Other values (15) | 844 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 171 | |
o | 92 | |
r | 82 | |
t | 80 | |
c | 73 | |
p | 64 | 6.9% |
a | 63 | 6.7% |
j | 63 | 6.7% |
n | 62 | 6.6% |
s | 33 | 3.5% |
Other values (13) | 151 |
Decimal Number
Value | Count | Frequency (%) |
1 | 4094 | |
2 | 742 | 12.6% |
3 | 241 | 4.1% |
5 | 240 | 4.1% |
4 | 190 | 3.2% |
6 | 156 | 2.6% |
0 | 79 | 1.3% |
7 | 71 | 1.2% |
8 | 54 | 0.9% |
9 | 35 | 0.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 164 | |
/ | 39 | 15.0% |
. | 31 | 11.9% |
# | 14 | 5.4% |
& | 7 | 2.7% |
; | 3 | 1.2% |
' | 1 | 0.4% |
· | 1 | 0.4% |
Control
Value | Count | Frequency (%) |
9 | ||
1 | 10.0% |
Space Separator
Value | Count | Frequency (%) |
5955 |
Close Punctuation
Value | Count | Frequency (%) |
) | 855 |
Open Punctuation
Value | Count | Frequency (%) |
( | 849 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 494 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 61 |
Math Symbol
Value | Count | Frequency (%) |
~ | 47 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 9 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66772 | |
Common | 14381 | 16.9% |
Latin | 3915 | 4.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
강 | 5507 | 8.2% |
의 | 5409 | 8.1% |
실 | 5227 | 7.8% |
장 | 3162 | 4.7% |
교 | 1858 | 2.8% |
전 | 1781 | 2.7% |
육 | 1729 | 2.6% |
건 | 1661 | 2.5% |
설 | 1615 | 2.4% |
안 | 1476 | 2.2% |
Other values (463) | 37347 |
Latin
Value | Count | Frequency (%) |
B | 344 | 8.8% |
S | 304 | 7.8% |
K | 267 | 6.8% |
R | 214 | 5.5% |
P | 195 | 5.0% |
L | 182 | 4.6% |
T | 181 | 4.6% |
e | 171 | 4.4% |
C | 158 | 4.0% |
D | 147 | 3.8% |
Other values (39) | 1752 |
Common
Value | Count | Frequency (%) |
5955 | ||
1 | 4094 | |
) | 855 | 5.9% |
( | 849 | 5.9% |
2 | 742 | 5.2% |
- | 494 | 3.4% |
3 | 241 | 1.7% |
5 | 240 | 1.7% |
4 | 190 | 1.3% |
, | 164 | 1.1% |
Other values (16) | 557 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66710 | |
ASCII | 18291 | 21.5% |
None | 62 | 0.1% |
Number Forms | 4 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5955 | ||
1 | 4094 | |
) | 855 | 4.7% |
( | 849 | 4.6% |
2 | 742 | 4.1% |
- | 494 | 2.7% |
B | 344 | 1.9% |
S | 304 | 1.7% |
K | 267 | 1.5% |
3 | 241 | 1.3% |
Other values (63) | 4146 |
Hangul
Value | Count | Frequency (%) |
강 | 5507 | 8.3% |
의 | 5409 | 8.1% |
실 | 5227 | 7.8% |
장 | 3162 | 4.7% |
교 | 1858 | 2.8% |
전 | 1781 | 2.7% |
육 | 1729 | 2.6% |
건 | 1661 | 2.5% |
설 | 1615 | 2.4% |
안 | 1476 | 2.2% |
Other values (461) | 37285 |
None
Value | Count | Frequency (%) |
㈜ | 61 | |
· | 1 | 1.6% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 4 |
Compat Jamo
Value | Count | Frequency (%) |
ㅅ | 1 |
EDC_TYPE
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
10 | |
---|---|
20 | |
<NA> | 562 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.1124 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 10 |
---|---|
2nd row | 10 |
3rd row | 10 |
4th row | 20 |
5th row | 10 |
Common Values
Value | Count | Frequency (%) |
10 | 6764 | |
20 | 2674 | 26.7% |
<NA> | 562 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
10 | 6764 | |
20 | 2674 | 26.7% |
na | 562 | 5.6% |
LCTRUM_ZIP
Real number (ℝ)
MISSING
 
Distinct | 439 |
---|---|
Distinct (%) | 55.2% |
Missing | 9205 |
Missing (%) | 92.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 441711.78 |
Minimum | 0 |
---|---|
Maximum | 791944 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 133796.6 |
Q1 | 339811.5 |
median | 443270 |
Q3 | 607925 |
95-th percentile | 741855.6 |
Maximum | 791944 |
Range | 791944 |
Interquartile range (IQR) | 268113.5 |
Descriptive statistics
Standard deviation | 187907.29 |
---|---|
Coefficient of variation (CV) | 0.42540701 |
Kurtosis | -0.75514287 |
Mean | 441711.78 |
Median Absolute Deviation (MAD) | 122316 |
Skewness | -0.16781649 |
Sum | 3.5116086 × 108 |
Variance | 3.5309148 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
406840 | 34 | 0.3% |
430016 | 18 | 0.2% |
461200 | 13 | 0.1% |
336841 | 13 | 0.1% |
152050 | 10 | 0.1% |
613100 | 9 | 0.1% |
540978 | 9 | 0.1% |
138170 | 9 | 0.1% |
445810 | 9 | 0.1% |
619952 | 9 | 0.1% |
Other values (429) | 662 | 6.6% |
(Missing) | 9205 |
Value | Count | Frequency (%) |
0 | 1 | |
100051 | 1 | |
100052 | 1 | |
100101 | 1 | |
100192 | 1 | |
100400 | 1 | |
100440 | 1 | |
100450 | 2 | |
100802 | 1 | |
110062 | 1 |
Value | Count | Frequency (%) |
791944 | 1 | < 0.1% |
791052 | 1 | < 0.1% |
791050 | 6 | |
790827 | 1 | < 0.1% |
790785 | 4 | |
790704 | 1 | < 0.1% |
790360 | 5 | |
790150 | 1 | < 0.1% |
780900 | 1 | < 0.1% |
780892 | 1 | < 0.1% |
LCTRUM_ADRES1
Text
Distinct | 1479 |
---|---|
Distinct (%) | 14.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 44 |
---|---|
Median length | 35 |
Mean length | 11.6224 |
Min length | 5 |
Characters and Unicode
Total characters | 116224 |
---|---|
Distinct characters | 364 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 831 ? |
---|---|
Unique (%) | 8.3% |
Sample
1st row | 부산 동구 범일2동 |
---|---|
2nd row | 인천 계양구 작전동 |
3rd row | 경기 오산시 외삼미동 |
4th row | 서울 강서구 |
5th row | 경기 수원시 장안구 영화동 |
Value | Count | Frequency (%) |
경기 | 2666 | 8.1% |
서울 | 1968 | 6.0% |
화성시 | 706 | 2.1% |
부산 | 641 | 1.9% |
북구 | 571 | 1.7% |
충남 | 561 | 1.7% |
인천 | 554 | 1.7% |
대구 | 483 | 1.5% |
수원시 | 461 | 1.4% |
경남 | 415 | 1.3% |
Other values (1622) | 24003 |
Most occurring characters
Value | Count | Frequency (%) |
24030 | ||
동 | 9114 | 7.8% |
구 | 7396 | 6.4% |
시 | 5119 | 4.4% |
경 | 3714 | 3.2% |
기 | 3177 | 2.7% |
서 | 2975 | 2.6% |
남 | 2356 | 2.0% |
산 | 2355 | 2.0% |
울 | 2331 | 2.0% |
Other values (354) | 53657 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 89090 | |
Space Separator | 24030 | 20.7% |
Decimal Number | 2852 | 2.5% |
Dash Punctuation | 209 | 0.2% |
Uppercase Letter | 28 | < 0.1% |
Open Punctuation | 4 | < 0.1% |
Close Punctuation | 4 | < 0.1% |
Lowercase Letter | 3 | < 0.1% |
Other Punctuation | 2 | < 0.1% |
Control | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9114 | 10.2% |
구 | 7396 | 8.3% |
시 | 5119 | 5.7% |
경 | 3714 | 4.2% |
기 | 3177 | 3.6% |
서 | 2975 | 3.3% |
남 | 2356 | 2.6% |
산 | 2355 | 2.6% |
울 | 2331 | 2.6% |
천 | 1685 | 1.9% |
Other values (324) | 48868 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 7 | |
B | 4 | |
C | 3 | |
L | 3 | |
M | 3 | |
E | 2 | 7.1% |
T | 2 | 7.1% |
R | 1 | 3.6% |
W | 1 | 3.6% |
O | 1 | 3.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 691 | |
3 | 618 | |
2 | 507 | |
4 | 245 | 8.6% |
5 | 186 | 6.5% |
7 | 176 | 6.2% |
6 | 116 | 4.1% |
9 | 108 | 3.8% |
0 | 108 | 3.8% |
8 | 97 | 3.4% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 1 | |
k | 1 | |
o | 1 |
Space Separator
Value | Count | Frequency (%) |
24030 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 209 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Other Punctuation
Value | Count | Frequency (%) |
, | 2 |
Control
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 89090 | |
Common | 27103 | 23.3% |
Latin | 31 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 9114 | 10.2% |
구 | 7396 | 8.3% |
시 | 5119 | 5.7% |
경 | 3714 | 4.2% |
기 | 3177 | 3.6% |
서 | 2975 | 3.3% |
남 | 2356 | 2.6% |
산 | 2355 | 2.6% |
울 | 2331 | 2.6% |
천 | 1685 | 1.9% |
Other values (324) | 48868 |
Common
Value | Count | Frequency (%) |
24030 | ||
1 | 691 | 2.5% |
3 | 618 | 2.3% |
2 | 507 | 1.9% |
4 | 245 | 0.9% |
- | 209 | 0.8% |
5 | 186 | 0.7% |
7 | 176 | 0.6% |
6 | 116 | 0.4% |
9 | 108 | 0.4% |
Other values (6) | 217 | 0.8% |
Latin
Value | Count | Frequency (%) |
A | 7 | |
B | 4 | |
C | 3 | |
L | 3 | |
M | 3 | |
E | 2 | 6.5% |
T | 2 | 6.5% |
b | 1 | 3.2% |
k | 1 | 3.2% |
R | 1 | 3.2% |
Other values (4) | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 89090 | |
ASCII | 27134 | 23.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
24030 | ||
1 | 691 | 2.5% |
3 | 618 | 2.3% |
2 | 507 | 1.9% |
4 | 245 | 0.9% |
- | 209 | 0.8% |
5 | 186 | 0.7% |
7 | 176 | 0.6% |
6 | 116 | 0.4% |
9 | 108 | 0.4% |
Other values (20) | 248 | 0.9% |
Hangul
Value | Count | Frequency (%) |
동 | 9114 | 10.2% |
구 | 7396 | 8.3% |
시 | 5119 | 5.7% |
경 | 3714 | 4.2% |
기 | 3177 | 3.6% |
서 | 2975 | 3.3% |
남 | 2356 | 2.6% |
산 | 2355 | 2.6% |
울 | 2331 | 2.6% |
천 | 1685 | 1.9% |
Other values (324) | 48868 |
LCTRUM_ADRES2
Text
MISSING
 
Distinct | 1894 |
---|---|
Distinct (%) | 19.8% |
Missing | 443 |
Missing (%) | 4.4% |
Memory size | 156.2 KiB |
Length
Max length | 40 |
---|---|
Median length | 34 |
Mean length | 10.532385 |
Min length | 1 |
Characters and Unicode
Total characters | 100658 |
---|---|
Distinct characters | 395 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 5 ? |
Unique
Unique | 1278 ? |
---|---|
Unique (%) | 13.4% |
Sample
1st row | 830-51 |
---|---|
2nd row | 853-10(5층) |
3rd row | 53번지 강남빌딩 201호 |
4th row | 화곡동 |
5th row | 392-4 칠공빌딩 3층 |
Value | Count | Frequency (%) |
4층 | 872 | 5.1% |
2층 | 575 | 3.3% |
3층 | 559 | 3.2% |
7층 | 238 | 1.4% |
694-9(401호 | 216 | 1.3% |
41-5(동탄성심플라자 | 190 | 1.1% |
54-54 | 163 | 0.9% |
중앙빌딩(5층 | 161 | 0.9% |
201호 | 157 | 0.9% |
137(피카디리플러스 | 153 | 0.9% |
Other values (2348) | 13982 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 8252 | 8.2% |
7981 | 7.9% | |
- | 6435 | 6.4% |
4 | 5909 | 5.9% |
3 | 5668 | 5.6% |
2 | 5594 | 5.6% |
5 | 4837 | 4.8% |
0 | 4151 | 4.1% |
층 | 3927 | 3.9% |
( | 3858 | 3.8% |
Other values (385) | 44046 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 45133 | |
Other Letter | 32182 | |
Space Separator | 7981 | 7.9% |
Dash Punctuation | 6435 | 6.4% |
Open Punctuation | 3859 | 3.8% |
Close Punctuation | 3853 | 3.8% |
Uppercase Letter | 476 | 0.5% |
Other Punctuation | 458 | 0.5% |
Math Symbol | 231 | 0.2% |
Lowercase Letter | 43 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
층 | 3927 | 12.2% |
지 | 1978 | 6.1% |
번 | 1577 | 4.9% |
리 | 1391 | 4.3% |
호 | 1283 | 4.0% |
빌 | 1117 | 3.5% |
딩 | 1080 | 3.4% |
동 | 713 | 2.2% |
자 | 626 | 1.9% |
라 | 607 | 1.9% |
Other values (327) | 17883 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 98 | |
B | 97 | |
L | 83 | |
M | 74 | |
J | 59 | |
C | 10 | 2.1% |
G | 8 | 1.7% |
D | 7 | 1.5% |
K | 6 | 1.3% |
R | 5 | 1.1% |
Other values (11) | 29 | 6.1% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 29 | |
b | 3 | 7.0% |
a | 3 | 7.0% |
r | 1 | 2.3% |
e | 1 | 2.3% |
h | 1 | 2.3% |
p | 1 | 2.3% |
u | 1 | 2.3% |
y | 1 | 2.3% |
l | 1 | 2.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 8252 | |
4 | 5909 | |
3 | 5668 | |
2 | 5594 | |
5 | 4837 | |
0 | 4151 | |
7 | 3477 | |
6 | 2624 | 5.8% |
9 | 2561 | 5.7% |
8 | 2060 | 4.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 432 | |
/ | 18 | 3.9% |
. | 6 | 1.3% |
: | 1 | 0.2% |
\ | 1 | 0.2% |
Math Symbol
Value | Count | Frequency (%) |
> | 103 | |
∼ | 74 | |
~ | 53 | |
= | 1 | 0.4% |
Open Punctuation
Value | Count | Frequency (%) |
( | 3858 | |
[ | 1 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 3852 | |
] | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
7981 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6435 |
Control
Value | Count | Frequency (%) |
7 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 67957 | |
Hangul | 32181 | |
Latin | 519 | 0.5% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
층 | 3927 | 12.2% |
지 | 1978 | 6.1% |
번 | 1577 | 4.9% |
리 | 1391 | 4.3% |
호 | 1283 | 4.0% |
빌 | 1117 | 3.5% |
딩 | 1080 | 3.4% |
동 | 713 | 2.2% |
자 | 626 | 1.9% |
라 | 607 | 1.9% |
Other values (326) | 17882 |
Latin
Value | Count | Frequency (%) |
A | 98 | |
B | 97 | |
L | 83 | |
M | 74 | |
J | 59 | |
m | 29 | 5.6% |
C | 10 | 1.9% |
G | 8 | 1.5% |
D | 7 | 1.3% |
K | 6 | 1.2% |
Other values (22) | 48 |
Common
Value | Count | Frequency (%) |
1 | 8252 | |
7981 | ||
- | 6435 | |
4 | 5909 | |
3 | 5668 | |
2 | 5594 | |
5 | 4837 | |
0 | 4151 | 6.1% |
( | 3858 | 5.7% |
) | 3852 | 5.7% |
Other values (16) | 11420 |
Han
Value | Count | Frequency (%) |
內 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 68402 | |
Hangul | 32180 | |
Math Operators | 74 | 0.1% |
CJK | 1 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 8252 | |
7981 | ||
- | 6435 | |
4 | 5909 | |
3 | 5668 | |
2 | 5594 | |
5 | 4837 | |
0 | 4151 | 6.1% |
( | 3858 | 5.6% |
) | 3852 | 5.6% |
Other values (47) | 11865 |
Hangul
Value | Count | Frequency (%) |
층 | 3927 | 12.2% |
지 | 1978 | 6.1% |
번 | 1577 | 4.9% |
리 | 1391 | 4.3% |
호 | 1283 | 4.0% |
빌 | 1117 | 3.5% |
딩 | 1080 | 3.4% |
동 | 713 | 2.2% |
자 | 626 | 1.9% |
라 | 607 | 1.9% |
Other values (325) | 17881 |
Math Operators
Value | Count | Frequency (%) |
∼ | 74 |
CJK
Value | Count | Frequency (%) |
內 | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㅐ | 1 |
MNG_LABOFFICE
Real number (ℝ)
MISSING
 
Distinct | 56 |
---|---|
Distinct (%) | 0.6% |
Missing | 558 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4353.9503 |
Minimum | 2000 |
---|---|
Maximum | 9000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2000 |
---|---|
5-th percentile | 2020 |
Q1 | 3000 |
median | 5010 |
Q3 | 5170 |
95-th percentile | 7110 |
Maximum | 9000 |
Range | 7000 |
Interquartile range (IQR) | 2170 |
Descriptive statistics
Standard deviation | 1692.3078 |
---|---|
Coefficient of variation (CV) | 0.38868331 |
Kurtosis | -1.1770569 |
Mean | 4353.9503 |
Median Absolute Deviation (MAD) | 1210 |
Skewness | -0.046636261 |
Sum | 41109999 |
Variance | 2863905.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5110 | 1239 | 12.4% |
2060 | 591 | 5.9% |
7000 | 471 | 4.7% |
6000 | 446 | 4.5% |
2040 | 417 | 4.2% |
5160 | 395 | 4.0% |
5000 | 361 | 3.6% |
7210 | 310 | 3.1% |
2020 | 297 | 3.0% |
5180 | 285 | 2.9% |
Other values (46) | 4630 | |
(Missing) | 558 | 5.6% |
Value | Count | Frequency (%) |
2000 | 209 | 2.1% |
2010 | 126 | 1.3% |
2011 | 86 | 0.9% |
2020 | 297 | |
2030 | 30 | 0.3% |
2040 | 417 | |
2050 | 231 | 2.3% |
2060 | 591 | |
2070 | 115 | 1.1% |
2110 | 24 | 0.2% |
Value | Count | Frequency (%) |
9000 | 5 | 0.1% |
7220 | 119 | 1.2% |
7210 | 310 | |
7120 | 8 | 0.1% |
7110 | 139 | 1.4% |
7001 | 1 | < 0.1% |
7000 | 471 | |
6310 | 70 | 0.7% |
6220 | 192 | |
6210 | 33 | 0.3% |
MNG_AGENT
Real number (ℝ)
MISSING
 
Distinct | 29 |
---|---|
Distinct (%) | 0.3% |
Missing | 558 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38.972887 |
Minimum | 11 |
---|---|
Maximum | 999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 11 |
Q1 | 21 |
median | 24 |
Q3 | 61 |
95-th percentile | 92 |
Maximum | 999 |
Range | 988 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 28.954283 |
---|---|
Coefficient of variation (CV) | 0.742934 |
Kurtosis | 126.71527 |
Mean | 38.972887 |
Median Absolute Deviation (MAD) | 13 |
Skewness | 4.489476 |
Sum | 367982 |
Variance | 838.3505 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 1527 | |
22 | 1454 | |
21 | 606 | 6.1% |
81 | 595 | 5.9% |
41 | 511 | 5.1% |
61 | 501 | 5.0% |
12 | 462 | 4.6% |
24 | 442 | 4.4% |
43 | 398 | 4.0% |
92 | 395 | 4.0% |
Other values (19) | 2551 | |
(Missing) | 558 | 5.6% |
Value | Count | Frequency (%) |
11 | 1527 | |
12 | 462 | 4.6% |
21 | 606 | 6.1% |
22 | 1454 | |
23 | 350 | 3.5% |
24 | 442 | 4.4% |
25 | 210 | 2.1% |
26 | 393 | 3.9% |
31 | 98 | 1.0% |
32 | 91 | 0.9% |
Value | Count | Frequency (%) |
999 | 1 | < 0.1% |
99 | 5 | 0.1% |
93 | 79 | 0.8% |
92 | 395 | |
91 | 109 | 1.1% |
85 | 46 | 0.5% |
84 | 94 | 0.9% |
83 | 157 | 1.6% |
82 | 366 | |
81 | 595 |
PROCESS_STTUS
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
True | |
---|---|
False | 21 |
Value | Count | Frequency (%) |
True | 9979 | |
False | 21 | 0.2% |
DEL_YN
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
False |
---|
Value | Count | Frequency (%) |
False | 10000 |
MISSING
 
Distinct | 7163 |
---|---|
Distinct (%) | 75.2% |
Missing | 473 |
Missing (%) | 4.7% |
Memory size | 156.2 KiB |
Minimum | 2009-08-17 12:43:00 |
---|---|
Maximum | 2015-02-26 16:40:53 |
MISSING
 
Distinct | 2958 |
---|---|
Distinct (%) | > 99.9% |
Missing | 7041 |
Missing (%) | 70.4% |
Memory size | 156.2 KiB |
Minimum | 2012-10-31 17:36:42 |
---|---|
Maximum | 2015-03-27 17:53:20 |
FILE_STRE_COURS1
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
STRE_FILE_NM1
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
ORIGNL_FILE_NM1
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
FILE_STRE_COURS2
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
STRE_FILE_NM2
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
ORIGNL_FILE_NM2
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
FILE_STRE_COURS3
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
STRE_FILE_NM3
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
ORIGNL_FILE_NM3
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
FILE_STRE_COURS4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
STRE_FILE_NM4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
ORIGNL_FILE_NM4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
PYMNT_DCSN_DE
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
RCOGN_NMPR
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
EDC_YEAR | UNIQ_ID | EDC_DT | EDC_TIME_S | EDC_TIME_E | EDC_SBJECT | EDC_CNT | LCTRUM_AR | EDC_PLACE | EDC_TYPE | LCTRUM_ZIP | LCTRUM_ADRES1 | LCTRUM_ADRES2 | MNG_LABOFFICE | MNG_AGENT | PROCESS_STTUS | DEL_YN | FRST_REGISTER_PNTTM | LAST_UPDUSR_PNTTM | FILE_STRE_COURS1 | STRE_FILE_NM1 | ORIGNL_FILE_NM1 | FILE_STRE_COURS2 | STRE_FILE_NM2 | ORIGNL_FILE_NM2 | FILE_STRE_COURS3 | STRE_FILE_NM3 | ORIGNL_FILE_NM3 | FILE_STRE_COURS4 | STRE_FILE_NM4 | ORIGNL_FILE_NM4 | PYMNT_DCSN_DE | RCOGN_NMPR | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
23728 | 2013 | 009_20130806211 | 20130806 | 800 | 1200 | 건설안전및근로자건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 부산 동구 범일2동 | 830-51 | 3010 | 81 | Y | N | 2013-08-02 12:03:31 | 2013-08-02 15:25:47 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
46544 | 2014 | 048_20140619146 | 20140619 | 1400 | 1800 | 건설안전 및 보건교육 | 50 | 40 | 강의실1 | 10 | <NA> | 인천 계양구 작전동 | 853-10(5층) | 5010 | 21 | Y | N | 2014-06-16 16:41:18 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
20646 | 2013 | 007_20130708241 | 20130708 | 900 | 1300 | 건설안전 및 보건위생 | 50 | 40 | KB건설안전연구원 | 10 | <NA> | 경기 오산시 외삼미동 | 53번지 강남빌딩 201호 | 5170 | 22 | Y | N | 2013-06-19 10:26:28 | 2013-07-05 18:11:55 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
30279 | 2013 | 084_20131220260 | 20131220 | 730 | 1130 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강서구 재건축 공사 현장 | 20 | <NA> | 서울 강서구 | 화곡동 | 2040 | 11 | Y | N | 2013-11-26 10:41:59 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
55850 | 2014 | 084_20141023860 | 20141023 | 800 | 1200 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 경기 수원시 장안구 영화동 | 392-4 칠공빌딩 3층 | 5110 | 22 | Y | N | 2014-09-26 09:44:47 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
23348 | 2013 | 053_20130827857 | 20130827 | 1300 | 1700 | 건설안전및보건교육 | 50 | 40 | 강의실1 | 10 | <NA> | 부산 사상구 덕포동 | 395-11(2층) | 3020 | 81 | Y | N | 2013-07-29 14:31:05 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
50035 | 2014 | 009_20140822978 | 20140822 | 800 | 1200 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 부산 부산진구 범천동 | 886-41 | 3000 | 81 | Y | N | 2014-07-22 11:45:14 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
15233 | 2013 | 007_20130327617 | 20130327 | 1200 | 1600 | 건설안전 및 보건위생 | 40 | 30 | 삼성건설 대청댐 비상여수로 현장 | 20 | <NA> | 대전 대덕구 미호동 | 643-2번지 | 7000 | 41 | Y | N | 2013-03-22 10:00:22 | 2013-03-26 16:41:29 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
13534 | 2013 | 045_20130225781 | 20130225 | 800 | 1200 | 건설업 기초 안전 보건 교육 | 20 | 10 | 김해 진영 중흥 S-클래스 신축공사 현장 | 20 | <NA> | 경남 김해시 진영읍 | 진영리 1705 | 3130 | 82 | Y | N | 2013-02-21 14:04:29 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
823 | 2012 | 16495 | 20120612 | 1330 | 1730 | 건설안전 및 근로자 건강관리 | 50 | 40 | 경주감포 국도건설 공사현장 삼성물산 교육장 | 10 | 780822 | 경북 경주시 외동읍 신계리 | 1045번지 | 5130 | 23 | Y | N | 2012-06-11 11:20:25 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
EDC_YEAR | UNIQ_ID | EDC_DT | EDC_TIME_S | EDC_TIME_E | EDC_SBJECT | EDC_CNT | LCTRUM_AR | EDC_PLACE | EDC_TYPE | LCTRUM_ZIP | LCTRUM_ADRES1 | LCTRUM_ADRES2 | MNG_LABOFFICE | MNG_AGENT | PROCESS_STTUS | DEL_YN | FRST_REGISTER_PNTTM | LAST_UPDUSR_PNTTM | FILE_STRE_COURS1 | STRE_FILE_NM1 | ORIGNL_FILE_NM1 | FILE_STRE_COURS2 | STRE_FILE_NM2 | ORIGNL_FILE_NM2 | FILE_STRE_COURS3 | STRE_FILE_NM3 | ORIGNL_FILE_NM3 | FILE_STRE_COURS4 | STRE_FILE_NM4 | ORIGNL_FILE_NM4 | PYMNT_DCSN_DE | RCOGN_NMPR | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
49382 | 2014 | 002_20140710363 | 20140710 | 800 | 1200 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 경기 안양시 동안구 관양동 | 1490-44 3층 | 5130 | 23 | Y | N | 2014-07-09 17:27:03 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
52224 | 2014 | 012_20140820977 | 20140820 | 900 | 1300 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 서울 구로구 구로동 | 803-4(구로구 도림로 11) | 2060 | 11 | Y | N | 2014-08-14 13:46:38 | 2014-08-19 09:43:09 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
26380 | 2013 | 038_20130926315 | 20130926 | 1300 | 1700 | 건설안전 및 보건위생 | 50 | 40 | 강의실1(세종교육장) | 10 | <NA> | 충남 연기군 조치원읍 남리 | 359(목화빌딩 4층) | 7000 | 41 | Y | N | 2013-09-25 15:01:19 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2136 | 2012 | 33418 | 20121030 | 900 | 1300 | 건설공사 안전 및 근로자 건강관리 | 50 | <NA> | 별관 1강의실 | 10 | <NA> | 인천광역시 남동구 소래로 688 | <NA> | 5010 | 21 | Y | N | 2012-09-13 17:54:03 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
42891 | 2014 | 054_20140520656 | 20140520 | 1330 | 1730 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실2 | 10 | <NA> | 경기 화성시 석우동 | 41-5(동탄성심플라자 4층) | 5110 | 22 | Y | N | 2014-04-29 16:37:53 | 2014-05-15 16:44:16 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
14857 | 2013 | 031_20130425280 | 20130425 | 900 | 1300 | 건설공사 안전 및 근로자 건강관리 | 50 | 40 | 강의실4 | 10 | <NA> | 인천 남동구 만수1동 | 건설기술교육원 | 5000 | 21 | Y | N | 2013-03-14 11:35:28 | 2013-04-23 13:20:21 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
23795 | 2013 | 062_20130807986 | 20130807 | 1300 | 1700 | 건설안전및근로자건강관리 | 50 | 40 | 강의실2 | 10 | <NA> | 서울 동작구 대방동 | 341-2(5층) | 2060 | 11 | Y | N | 2013-08-05 09:44:06 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
27381 | 2013 | 003_20131010961 | 20131010 | 1330 | 1730 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 서울 구로구 구로동 | 33-1 2층 | 2060 | 11 | Y | N | 2013-10-08 08:43:52 | 2013-10-10 11:19:12 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
47104 | 2014 | 023_20140701285 | 20140701 | 1300 | 1700 | 건설안전 및 근로자 건강관리 | 50 | 40 | 태안화력발전소 | 20 | <NA> | 충남 태안군 원북면 방갈리 | 831-3 | 7220 | 43 | Y | N | 2014-06-23 15:39:14 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
50170 | 2014 | 089_20140730629 | 20140730 | 830 | 1230 | 건설안전 및 근로자 건강관리 | 50 | 40 | 강의실1 | 10 | <NA> | 대구 달서구 본동 | 751 | 4000 | 92 | Y | N | 2014-07-25 09:15:03 | NaT | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |