Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 197 |
Duplicate rows (%) | 2.0% |
Total size in memory | 556.6 KiB |
Average record size in memory | 57.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 1 |
DateTime | 2 |
Categorical | 2 |
Dataset
Description | 서울특별시 광진구 대형생활폐기물 수거실적처리 데이터(수거실적처리전표번호,품목 일련번호,최초신청시간,최초신청자,최종작업시간,최종작업자) |
---|---|
Author | 서울특별시 광진구 |
URL | https://www.data.go.kr/data/15069870/fileData.do |
Dataset has 197 (2.0%) duplicate rows | Duplicates |
최초신청자 is highly imbalanced (58.2%) | Imbalance |
최종작업자 is highly imbalanced (75.5%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 22:40:28.712789 |
---|---|
Analysis finished | 2023-12-12 22:40:29.350282 |
Duration | 0.64 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
수거실적처리전표번호
Real number (ℝ)
Distinct | 4752 |
---|---|
Distinct (%) | 47.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.02073 × 1011 |
Minimum | 2.0201 × 1011 |
---|---|
Maximum | 2.022021 × 1011 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0201 × 1011 |
---|---|
5-th percentile | 2.0201 × 1011 |
Q1 | 2.02011 × 1011 |
median | 2.02102 × 1011 |
Q3 | 2.0210726 × 1011 |
95-th percentile | 2.0211227 × 1011 |
Maximum | 2.022021 × 1011 |
Range | 1.9210025 × 108 |
Interquartile range (IQR) | 96260252 |
Descriptive statistics
Standard deviation | 53244981 |
---|---|
Coefficient of variation (CV) | 0.00026349379 |
Kurtosis | -0.72172815 |
Mean | 2.02073 × 1011 |
Median Absolute Deviation (MAD) | 9055286.5 |
Skewness | 0.13114652 |
Sum | 2.02073 × 1015 |
Variance | 2.835028 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
202011000000 | 1814 | 18.1% |
202010000000 | 1097 | 11.0% |
202012000000 | 987 | 9.9% |
202101000000 | 951 | 9.5% |
202102000000 | 334 | 3.3% |
202103080190 | 4 | < 0.1% |
202110010131 | 3 | < 0.1% |
202106180110 | 2 | < 0.1% |
202105240122 | 2 | < 0.1% |
202108140116 | 2 | < 0.1% |
Other values (4742) | 4804 |
Value | Count | Frequency (%) |
202010000000 | 1097 | |
202011000000 | 1814 | |
202012000000 | 987 | |
202101000000 | 951 | |
202102000000 | 334 | 3.3% |
202102090133 | 1 | < 0.1% |
202102090149 | 1 | < 0.1% |
202102090173 | 1 | < 0.1% |
202102090176 | 1 | < 0.1% |
202102090268 | 1 | < 0.1% |
Value | Count | Frequency (%) |
202202100252 | 1 | |
202202100124 | 1 | |
202202090652 | 1 | |
202202090633 | 1 | |
202202090607 | 1 | |
202202090563 | 1 | |
202202090532 | 1 | |
202202090510 | 1 | |
202202090457 | 1 | |
202202090438 | 1 |
품목 일련번호
Text
Distinct | 429 |
---|---|
Distinct (%) | 4.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
2-20-20 | 616 | 6.2% |
03-30-49 | 515 | 5.1% |
4-40-18 | 411 | 4.1% |
03-30-65 | 240 | 2.4% |
1017001 | 228 | 2.3% |
2-20-12 | 216 | 2.2% |
1035002 | 202 | 2.0% |
1009001 | 163 | 1.6% |
1017003 | 137 | 1.4% |
3015001 | 128 | 1.3% |
Other values (419) | 7144 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 23275 | |
- | 10366 | |
1 | 9383 | |
2 | 8920 | 12.5% |
3 | 7461 | 10.5% |
4 | 2813 | 3.9% |
5 | 2501 | 3.5% |
6 | 2089 | 2.9% |
7 | 1833 | 2.6% |
9 | 1813 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 60871 | |
Dash Punctuation | 10366 | 14.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 23275 | |
1 | 9383 | |
2 | 8920 | 14.7% |
3 | 7461 | 12.3% |
4 | 2813 | 4.6% |
5 | 2501 | 4.1% |
6 | 2089 | 3.4% |
7 | 1833 | 3.0% |
9 | 1813 | 3.0% |
8 | 783 | 1.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10366 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 71237 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 23275 | |
- | 10366 | |
1 | 9383 | |
2 | 8920 | 12.5% |
3 | 7461 | 10.5% |
4 | 2813 | 3.9% |
5 | 2501 | 3.5% |
6 | 2089 | 2.9% |
7 | 1833 | 2.6% |
9 | 1813 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 71237 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 23275 | |
- | 10366 | |
1 | 9383 | |
2 | 8920 | 12.5% |
3 | 7461 | 10.5% |
4 | 2813 | 3.9% |
5 | 2501 | 3.5% |
6 | 2089 | 2.9% |
7 | 1833 | 2.6% |
9 | 1813 | 2.5% |
최초신청시간
Date
Distinct | 8165 |
---|---|
Distinct (%) | 81.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2020-10-01 14:39:00 |
---|---|
Maximum | 2022-02-10 12:52:00 |
최초신청자
Categorical
IMBALANCE
 
Distinct | 17 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
guest | |
---|---|
dong14 | 206 |
dong03 | 202 |
dong04 | 190 |
dong07 | 181 |
Other values (12) |
Length
Max length | 6 |
---|---|
Median length | 5 |
Mean length | 5.2363 |
Min length | 5 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | dong13 |
---|---|
2nd row | guest |
3rd row | guest |
4th row | guest |
5th row | guest |
Common Values
Value | Count | Frequency (%) |
guest | 7636 | |
dong14 | 206 | 2.1% |
dong03 | 202 | 2.0% |
dong04 | 190 | 1.9% |
dong07 | 181 | 1.8% |
dong10 | 165 | 1.7% |
dong09 | 165 | 1.7% |
dong15 | 165 | 1.7% |
dong02 | 162 | 1.6% |
dong12 | 155 | 1.6% |
Other values (7) | 773 | 7.7% |
Length
Value | Count | Frequency (%) |
guest | 7636 | |
dong14 | 206 | 2.1% |
dong03 | 202 | 2.0% |
dong04 | 190 | 1.9% |
dong07 | 181 | 1.8% |
dong10 | 165 | 1.7% |
dong09 | 165 | 1.7% |
dong15 | 165 | 1.7% |
dong02 | 162 | 1.6% |
dong12 | 155 | 1.6% |
Other values (7) | 773 | 7.7% |
최종작업시간
Date
Distinct | 8008 |
---|---|
Distinct (%) | 80.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2020-10-02 11:04:00 |
---|---|
Maximum | 2022-02-23 23:45:00 |
최종작업자
Categorical
IMBALANCE
 
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
pda01 | |
---|---|
guest | 783 |
env01 | 132 |
dong04 | 66 |
dong07 | 56 |
Other values (13) | 449 |
Length
Max length | 6 |
---|---|
Median length | 5 |
Mean length | 5.0571 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | pda01 |
---|---|
2nd row | pda01 |
3rd row | pda01 |
4th row | pda01 |
5th row | pda01 |
Common Values
Value | Count | Frequency (%) |
pda01 | 8514 | |
guest | 783 | 7.8% |
env01 | 132 | 1.3% |
dong04 | 66 | 0.7% |
dong07 | 56 | 0.6% |
dong03 | 49 | 0.5% |
dong02 | 45 | 0.4% |
dong05 | 42 | 0.4% |
dong09 | 40 | 0.4% |
dong14 | 39 | 0.4% |
Other values (8) | 234 | 2.3% |
Length
Value | Count | Frequency (%) |
pda01 | 8514 | |
guest | 783 | 7.8% |
env01 | 132 | 1.3% |
dong04 | 66 | 0.7% |
dong07 | 56 | 0.6% |
dong03 | 49 | 0.5% |
dong02 | 45 | 0.4% |
dong05 | 42 | 0.4% |
dong09 | 40 | 0.4% |
dong14 | 39 | 0.4% |
Other values (8) | 234 | 2.3% |
수거실적처리전표번호 | 최초신청자 | 최종작업자 | |
---|---|---|---|
수거실적처리전표번호 | 1.000 | 0.440 | 0.292 |
최초신청자 | 0.440 | 1.000 | 0.758 |
최종작업자 | 0.292 | 0.758 | 1.000 |
최초신청자 | 최종작업자 | |
---|---|---|
최초신청자 | 1.000 | 0.346 |
최종작업자 | 0.346 | 1.000 |
수거실적처리전표번호 | 최초신청자 | 최종작업자 | |
---|---|---|---|
수거실적처리전표번호 | 1.000 | 0.278 | 0.175 |
최초신청자 | 0.278 | 1.000 | 0.346 |
최종작업자 | 0.175 | 0.346 | 1.000 |
수거실적처리전표번호 | 품목 일련번호 | 최초신청시간 | 최초신청자 | 최종작업시간 | 최종작업자 | |
---|---|---|---|---|---|---|
83780 | 202110280426 | 3015001 | 2021-10-28 16:00 | dong13 | 2021-10-30 07:30 | pda01 |
13481 | 202011000000 | 2-20-16 | 2020-11-09 15:33 | guest | 2020-11-13 12:25 | pda01 |
77091 | 202108300204 | 1035006 | 2021-08-30 10:12 | guest | 2021-09-01 12:50 | pda01 |
31480 | 202012000000 | 2-20-20 | 2020-12-11 16:07 | guest | 2020-12-14 09:15 | pda01 |
5206 | 202010000000 | 2-20-6 | 2020-10-16 10:23 | guest | 2020-10-17 06:05 | pda01 |
10887 | 202011000000 | 03-30-49 | 2020-11-02 13:51 | guest | 2020-11-03 05:35 | pda01 |
4774 | 202010000000 | 03-30-53 | 2020-10-15 12:04 | guest | 2020-10-15 12:09 | pda01 |
30555 | 202012000000 | 2-20-32 | 2020-12-08 16:20 | guest | 2020-12-11 09:52 | pda01 |
2946 | 202010000000 | 03-30-49 | 2020-10-12 10:42 | guest | 2020-10-12 16:14 | pda01 |
61559 | 202105190276 | 1035006 | 2021-05-19 20:21 | guest | 2021-05-21 08:06 | pda01 |
수거실적처리전표번호 | 품목 일련번호 | 최초신청시간 | 최초신청자 | 최종작업시간 | 최종작업자 | |
---|---|---|---|---|---|---|
55839 | 202103270050 | 1019001 | 2021-03-27 11:28 | guest | 2021-03-29 14:24 | pda01 |
171 | 202010000000 | 2-20-62 | 2020-10-05 08:59 | guest | 2020-10-13 12:45 | pda01 |
39056 | 202101000000 | 1-20-14 | 2021-01-06 13:49 | guest | 2021-01-07 08:53 | pda01 |
59678 | 202104190224 | 1003001 | 2021-04-19 10:46 | dong13 | 2021-04-20 09:17 | pda01 |
37105 | 202012000000 | 03-30-53 | 2020-12-30 11:51 | guest | 2021-01-04 09:58 | pda01 |
81639 | 202110040088 | 6003003 | 2021-10-04 11:05 | guest | 2021-10-06 13:04 | pda01 |
70343 | 202107130003 | 7041001 | 2021-07-13 00:13 | guest | 2021-07-14 13:14 | pda01 |
50728 | 202102160428 | 6045003 | 2021-02-16 15:54 | dong04 | 2021-02-18 12:56 | pda01 |
63110 | 202105140069 | 1017002 | 2021-05-14 09:42 | dong09 | 2021-05-14 12:58 | pda01 |
838 | 202010000000 | 03-30-49 | 2020-10-05 15:09 | guest | 2020-10-06 10:33 | pda01 |
Most frequently occurring
수거실적처리전표번호 | 품목 일련번호 | 최초신청시간 | 최초신청자 | 최종작업시간 | 최종작업자 | # duplicates | |
---|---|---|---|---|---|---|---|
82 | 202011000000 | 2-20-20 | 2020-11-14 00:23 | guest | 2020-11-14 00:23 | pda01 | 384 |
98 | 202011000000 | 4-40-18 | 2020-11-14 00:23 | guest | 2020-11-14 00:23 | pda01 | 238 |
102 | 202011000000 | 4-40-18 | 2020-11-14 00:24 | guest | 2020-11-14 00:24 | pda01 | 146 |
11 | 202010000000 | 03-30-49 | 2020-10-12 10:42 | guest | 2020-10-12 16:14 | pda01 | 6 |
156 | 202101000000 | 03-30-49 | 2021-01-07 14:05 | guest | 2021-01-08 06:11 | pda01 | 5 |
14 | 202010000000 | 03-30-49 | 2020-10-20 17:45 | guest | 2020-10-23 10:41 | pda01 | 4 |
29 | 202010000000 | 2-20-16 | 2020-10-05 12:24 | guest | 2020-10-05 13:50 | pda01 | 4 |
80 | 202011000000 | 2-20-17 | 2020-11-19 10:04 | guest | 2020-11-20 10:32 | pda01 | 4 |
109 | 202012000000 | 03-30-49 | 2020-12-11 16:21 | guest | 2020-12-14 09:17 | pda01 | 4 |
154 | 202101000000 | 03-30-49 | 2021-01-06 13:23 | guest | 2021-01-07 10:53 | pda01 | 4 |