Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 31 |
Missing cells | 22 |
Missing cells (%) | 14.2% |
Duplicate rows | 1 |
Duplicate rows (%) | 3.2% |
Total size in memory | 1.4 KiB |
Average record size in memory | 46.3 B |
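The missing-cell percentage can be checked by hand: the 22 missing cells equal the sum of the per-column counts listed in the Missing alerts (7 + 7 + 8), and the denominator is every cell in the 5 × 31 table. A minimal Python sketch of that arithmetic, with variable names chosen here for illustration:

```python
# Figures copied from the "Dataset statistics" table and the "Missing" alerts.
n_variables = 5
n_observations = 31
missing_per_column = {"품목": 7, "발주수량": 7, "입고수량": 8}

total_cells = n_variables * n_observations        # every cell in the table
missing_cells = sum(missing_per_column.values())  # cells flagged as missing
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.1f}%)")
```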
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
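Description | Data on the receipt status of bulky-waste stickers in Michuhol-gu, Incheon Metropolitan City. It provides the fields 입고일자 (receipt date), 품명 (item name), 입고수량 (received quantity), and 기준일 (reference date). |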
---|---|
URL | https://www.data.go.kr/data/15090441/fileData.do |
Dataset has 1 (3.2%) duplicate rows | Duplicates |
발주일자 (order date) is highly overall correlated with 업체명 (company name) | High correlation |
업체명 (company name) is highly overall correlated with 발주일자 (order date) | High correlation |
발주수량 (order quantity) is highly overall correlated with 입고수량 (received quantity) | High correlation |
입고수량 (received quantity) is highly overall correlated with 발주수량 (order quantity) | High correlation |
품목 (item) has 7 (22.6%) missing values | Missing |
발주수량 (order quantity) has 7 (22.6%) missing values | Missing |
입고수량 (received quantity) has 8 (25.8%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 02:50:50.296153 |
---|---|
Analysis finished | 2023-12-12 02:50:51.366139 |
Duration | 1.07 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
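The reported duration follows directly from the two timestamps. A small sketch using only the Python standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-12-12 02:50:50.296153")
finished = datetime.fromisoformat("2023-12-12 02:50:51.366139")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```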
발주일자
Categorical
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 25.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 8.6451613 |
Min length | 4 |
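The mean length of 8.6451613 and the minimum length of 4 are consistent with the seven missing entries being measured as the 4-character string `<NA>`. That reading is an inference from the numbers, not something the report states. A quick check, using the counts from the Common Values table for this variable:

```python
from statistics import mean, median

# Counts per value for 발주일자; "<NA>" is treated as a literal string (assumption).
value_counts = {
    "<NA>": 7,
    "2023-01-05": 5,
    "2023-02-13": 5,
    "2023-02-23": 5,
    "2023-03-08": 4,
    "2023-02-09": 2,
    "2023-04-18": 2,
    "2023-04-20": 1,
}

# One length per observation, repeated according to its count.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

print(max(lengths), median(lengths), round(mean(lengths), 7), min(lengths))
```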
Unique
Unique | 1 |
---|---|
Unique (%) | 3.2% |
Sample
1st row | 2023-01-05 |
---|---|
2nd row | 2023-01-05 |
3rd row | 2023-01-05 |
4th row | 2023-01-05 |
5th row | 2023-01-05 |
Common Values
Value | Count | Frequency (%) |
<NA> | 7 | 22.6% |
2023-01-05 | 5 | 16.1% |
2023-02-13 | 5 | 16.1% |
2023-02-23 | 5 | 16.1% |
2023-03-08 | 4 | 12.9% |
2023-02-09 | 2 | 6.5% |
2023-04-18 | 2 | 6.5% |
2023-04-20 | 1 | 3.2% |
업체명
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 16.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
Length
Max length | 9 |
---|---|
Median length | 4 |
Mean length | 4.8064516 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 영광산업 |
---|---|
2nd row | 영광산업 |
3rd row | 영광산업 |
4th row | 영광산업 |
5th row | 영광산업 |
Common Values
Value | Count | Frequency (%) |
영광산업 | 12 | 38.7% |
<NA> | 7 | 22.6% |
에덴복지재단 | 5 | 16.1% |
성광디자인 | 5 | 16.1% |
서구구립장애인재활 | 2 | 6.5% |
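Each Frequency (%) figure appears to be the count divided by all 31 observations, with `<NA>` counted as an ordinary category. A short sketch under that assumption:

```python
n_observations = 31

# Counts copied from the Common Values table for 업체명.
counts = {
    "영광산업": 12,
    "<NA>": 7,
    "에덴복지재단": 5,
    "성광디자인": 5,
    "서구구립장애인재활": 2,
}

# The counts should account for every row in the dataset.
assert sum(counts.values()) == n_observations

frequency_pct = {name: round(100 * count / n_observations, 1) for name, count in counts.items()}
for name, pct in frequency_pct.items():
    print(f"{name}: {pct}%")
```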
품목
Text
MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 75.0% |
Missing | 7 |
Missing (%) | 22.6% |
Memory size | 380.0 B |
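Distinct (%) appears to use the non-missing count as its denominator, which is why 18 distinct values among 31 rows gives 75.0% here. The sketch below checks that reading against the Distinct and Missing figures reported for all five variables; the formula is inferred from the numbers rather than taken from the tool's documentation.

```python
n_rows = 31

# (distinct, missing) pairs copied from each variable's overview table.
columns = {
    "발주일자": (8, 0),
    "업체명": (5, 0),
    "품목": (18, 7),
    "발주수량": (22, 7),
    "입고수량": (22, 8),
}

# Assumed formula: distinct values divided by non-missing observations.
distinct_pct = {
    name: round(100 * distinct / (n_rows - missing), 1)
    for name, (distinct, missing) in columns.items()
}
print(distinct_pct)
```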
Value | Count | Frequency (%) |
일반용 | 8 | 15.1% |
스티커 | 5 | 9.4% |
음식물 | 5 | 9.4% |
원권 | 5 | 9.4% |
10l | 5 | 9.4% |
재사용 | 4 | 7.5% |
20l | 4 | 7.5% |
5l | 3 | 5.7% |
사업계용 | 2 | 3.8% |
10000 | 2 | 3.8% |
Other values (10) | 10 | 18.9% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 42 | 21.1% |
0 | 29 | 14.6% |
L | 19 | 9.5% |
용 | 14 | 7.0% |
1 | 9 | 4.5% |
일 | 8 | 4.0% |
반 | 8 | 4.0% |
5 | 6 | 3.0% |
사 | 6 | 3.0% |
물 | 5 | 2.5% |
Other values (14) | 53 | 26.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 84 | 42.2% |
Decimal Number | 54 | 27.1% |
Space Separator | 42 | 21.1% |
Uppercase Letter | 19 | 9.5% |
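The category names in this table correspond to Unicode general categories. As an illustration, Python's standard `unicodedata` module assigns the characters of one item value from the sample rows, `일반용 5L`, to the same four categories. The mapping from short codes to long names below is written out by hand.

```python
import unicodedata
from collections import Counter

# Short Unicode general-category codes and the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Lu": "Uppercase Letter",
}

sample = "일반용 5L"  # one 품목 value taken from the sample rows
category_counts = Counter(
    CATEGORY_NAMES[unicodedata.category(ch)] for ch in sample
)
print(dict(category_counts))
```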
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
용 | 14 | 16.7% |
일 | 8 | 9.5% |
반 | 8 | 9.5% |
사 | 6 | 7.1% |
물 | 5 | 6.0% |
식 | 5 | 6.0% |
음 | 5 | 6.0% |
스 | 5 | 6.0% |
티 | 5 | 6.0% |
권 | 5 | 6.0% |
Other values (5) | 18 | 21.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 29 | 53.7% |
1 | 9 | 16.7% |
5 | 6 | 11.1% |
2 | 5 | 9.3% |
3 | 3 | 5.6% |
6 | 1 | 1.9% |
7 | 1 | 1.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 42 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 19 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 96 | 48.2% |
Hangul | 84 | 42.2% |
Latin | 19 | 9.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
용 | 14 | 16.7% |
일 | 8 | 9.5% |
반 | 8 | 9.5% |
사 | 6 | 7.1% |
물 | 5 | 6.0% |
식 | 5 | 6.0% |
음 | 5 | 6.0% |
스 | 5 | 6.0% |
티 | 5 | 6.0% |
권 | 5 | 6.0% |
Other values (5) | 18 | 21.4% |
Common
Value | Count | Frequency (%) |
(space) | 42 | 43.8% |
0 | 29 | 30.2% |
1 | 9 | 9.4% |
5 | 6 | 6.2% |
2 | 5 | 5.2% |
3 | 3 | 3.1% |
6 | 1 | 1.0% |
7 | 1 | 1.0% |
Latin
Value | Count | Frequency (%) |
L | 19 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 115 | 57.8% |
Hangul | 84 | 42.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 42 | 36.5% |
0 | 29 | 25.2% |
L | 19 | 16.5% |
1 | 9 | 7.8% |
5 | 6 | 5.2% |
2 | 5 | 4.3% |
3 | 3 | 2.6% |
6 | 1 | 0.9% |
7 | 1 | 0.9% |
Hangul
Value | Count | Frequency (%) |
용 | 14 | 16.7% |
일 | 8 | 9.5% |
반 | 8 | 9.5% |
사 | 6 | 7.1% |
물 | 5 | 6.0% |
식 | 5 | 6.0% |
음 | 5 | 6.0% |
스 | 5 | 6.0% |
티 | 5 | 6.0% |
권 | 5 | 6.0% |
Other values (5) | 18 | 21.4% |
발주수량
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 22 |
---|---|
Distinct (%) | 91.7% |
Missing | 7 |
Missing (%) | 22.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 677370.83 |
Minimum | 1000 |
---|---|
Maximum | 3400000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 16400 |
Q1 | 63000 |
median | 350000 |
Q3 | 875000 |
95-th percentile | 2040000 |
Maximum | 3400000 |
Range | 3399000 |
Interquartile range (IQR) | 812000 |
Descriptive statistics
Standard deviation | 845315.3 |
---|---|
Coefficient of variation (CV) | 1.2479358 |
Kurtosis | 3.5340875 |
Mean | 677370.83 |
Median Absolute Deviation (MAD) | 295500 |
Skewness | 1.8406026 |
Sum | 16256900 |
Variance | 7.1455796 × 10^11 |
Monotonicity | Not monotonic |
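Several entries in the quantile and descriptive tables for 발주수량 are derived from the others: the range and IQR are differences, the mean is the sum over the 24 non-missing observations, the CV is the standard deviation divided by the mean, and the variance is the standard deviation squared. A brief sketch that recomputes them from the reported inputs (variable names are illustrative, and small differences come from rounding in the report):

```python
# Inputs copied from the 발주수량 tables.
minimum, maximum = 1_000, 3_400_000
q1, q3 = 63_000, 875_000
total = 16_256_900
n_present = 31 - 7          # observations minus missing values
std = 845_315.3

value_range = maximum - minimum   # reported as 3399000
iqr = q3 - q1                     # reported as 812000
mean_value = total / n_present    # reported as 677370.83
cv = std / mean_value             # reported as 1.2479358
variance = std ** 2               # reported as 7.1455796 × 10^11

print(value_range, iqr, round(mean_value, 2), round(cv, 4), f"{variance:.4e}")
```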
Value | Count | Frequency (%) |
1600000 | 2 | 6.5% |
300000 | 2 | 6.5% |
174000 | 1 | 3.2% |
429000 | 1 | 3.2% |
1000 | 1 | 3.2% |
219900 | 1 | 3.2% |
30000 | 1 | 3.2% |
14000 | 1 | 3.2% |
50000 | 1 | 3.2% |
60000 | 1 | 3.2% |
Other values (12) | 12 | 38.7% |
(Missing) | 7 | 22.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1000 | 1 | 3.2% |
14000 | 1 | 3.2% |
30000 | 1 | 3.2% |
50000 | 1 | 3.2% |
59000 | 1 | 3.2% |
60000 | 1 | 3.2% |
64000 | 1 | 3.2% |
156000 | 1 | 3.2% |
174000 | 1 | 3.2% |
219900 | 1 | 3.2% |
Maximum 10 values
Value | Count | Frequency (%) |
3400000 | 1 | 3.2% |
2100000 | 1 | 3.2% |
1700000 | 1 | 3.2% |
1600000 | 2 | 6.5% |
1100000 | 1 | 3.2% |
800000 | 1 | 3.2% |
650000 | 1 | 3.2% |
550000 | 1 | 3.2% |
500000 | 1 | 3.2% |
429000 | 1 | 3.2% |
입고수량
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 22 |
---|---|
Distinct (%) | 95.7% |
Missing | 8 |
Missing (%) | 25.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 340969.57 |
Minimum | 1000 |
---|---|
Maximum | 1332000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 14000 |
Q1 | 59500 |
median | 167000 |
Q3 | 442000 |
95-th percentile | 1231900 |
Maximum | 1332000 |
Range | 1331000 |
Interquartile range (IQR) | 382500 |
Descriptive statistics
Standard deviation | 418552.71 |
---|---|
Coefficient of variation (CV) | 1.2275369 |
Kurtosis | 0.91532006 |
Mean | 340969.57 |
Median Absolute Deviation (MAD) | 133600 |
Skewness | 1.4684036 |
Sum | 7842300 |
Variance | 1.7518637 × 10^11 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14000 | 2 | 6.5% |
205000 | 1 | 3.2% |
1000 | 1 | 3.2% |
20700 | 1 | 3.2% |
58000 | 1 | 3.2% |
60000 | 1 | 3.2% |
64000 | 1 | 3.2% |
145000 | 1 | 3.2% |
75000 | 1 | 3.2% |
495000 | 1 | 3.2% |
Other values (12) | 12 | 38.7% |
(Missing) | 8 | 25.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1000 | 1 | 3.2% |
14000 | 2 | 6.5% |
20700 | 1 | 3.2% |
58000 | 1 | 3.2% |
59000 | 1 | 3.2% |
60000 | 1 | 3.2% |
64000 | 1 | 3.2% |
75000 | 1 | 3.2% |
76000 | 1 | 3.2% |
145000 | 1 | 3.2% |
Maximum 10 values
Value | Count | Frequency (%) |
1332000 | 1 | 3.2% |
1245000 | 1 | 3.2% |
1114000 | 1 | 3.2% |
984000 | 1 | 3.2% |
500000 | 1 | 3.2% |
495000 | 1 | 3.2% |
389000 | 1 | 3.2% |
300600 | 1 | 3.2% |
300000 | 1 | 3.2% |
224000 | 1 | 3.2% |
Correlations
Variable | 발주일자 | 업체명 | 품목 | 발주수량 | 입고수량 |
---|---|---|---|---|---|
발주일자 | 1.000 | 1.000 | 0.527 | 0.543 | 0.000 |
업체명 | 1.000 | 1.000 | 0.672 | 0.623 | 0.384 |
품목 | 0.527 | 0.672 | 1.000 | 0.000 | 0.000 |
발주수량 | 0.543 | 0.623 | 0.000 | 1.000 | 0.908 |
입고수량 | 0.000 | 0.384 | 0.000 | 0.908 | 1.000 |
Variable | 발주일자 | 업체명 |
---|---|---|
발주일자 | 1.000 | 0.922 |
업체명 | 0.922 | 1.000 |
Variable | 발주수량 | 입고수량 | 발주일자 | 업체명 |
---|---|---|---|---|
발주수량 | 1.000 | 0.900 | 0.230 | 0.457 |
입고수량 | 0.900 | 1.000 | 0.000 | 0.273 |
발주일자 | 0.230 | 0.000 | 1.000 | 0.922 |
업체명 | 0.457 | 0.273 | 0.922 | 1.000 |
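As a reading aid, the sketch below loads the first (five-variable) correlation matrix, confirms that it is symmetric, and lists the pairs at or above a cutoff. The cutoff of 0.9 is a hypothetical value chosen for illustration, not the profiler's configured threshold. With it, the two pairs returned are the same two pairs named in the High correlation alerts.

```python
labels = ["발주일자", "업체명", "품목", "발주수량", "입고수량"]
matrix = [
    [1.000, 1.000, 0.527, 0.543, 0.000],
    [1.000, 1.000, 0.672, 0.623, 0.384],
    [0.527, 0.672, 1.000, 0.000, 0.000],
    [0.543, 0.623, 0.000, 1.000, 0.908],
    [0.000, 0.384, 0.000, 0.908, 1.000],
]
n = len(labels)

# A correlation matrix should equal its own transpose.
is_symmetric = all(matrix[i][j] == matrix[j][i] for i in range(n) for j in range(n))

CUTOFF = 0.9  # hypothetical cutoff, for illustration only
strong_pairs = [
    (labels[i], labels[j], matrix[i][j])
    for i in range(n)
    for j in range(i + 1, n)   # upper triangle only, so each pair appears once
    if matrix[i][j] >= CUTOFF
]
print(is_symmetric, strong_pairs)
```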
First rows
Index | 발주일자 | 업체명 | 품목 | 발주수량 | 입고수량 |
---|---|---|---|---|---|
0 | 2023-01-05 | 영광산업 | 일반용 5L | 300000 | 167000 |
1 | 2023-01-05 | 영광산업 | 일반용 10L | 59000 | 59000 |
2 | 2023-01-05 | 영광산업 | 일반용 20L | 156000 | 76000 |
3 | 2023-01-05 | 영광산업 | 재사용 10L | 174000 | 14000 |
4 | 2023-01-05 | 영광산업 | 재사용 20L | 429000 | 389000 |
5 | 2023-02-09 | 서구구립장애인재활 | 일반용 50L | 800000 | 300600 |
6 | 2023-02-09 | 서구구립장애인재활 | 일반용 75L | 400000 | 224000 |
7 | 2023-02-13 | 에덴복지재단 | 일반용 5L | 300000 | 300000 |
8 | 2023-02-13 | 에덴복지재단 | 일반용 10L | 1600000 | 1114000 |
9 | 2023-02-13 | 에덴복지재단 | 일반용 20L | 1600000 | 984000 |
Last rows
Index | 발주일자 | 업체명 | 품목 | 발주수량 | 입고수량 |
---|---|---|---|---|---|
21 | 2023-04-18 | 영광산업 | 사업계용 30L | 30000 | <NA> |
22 | 2023-04-18 | 영광산업 | 사업계용 60L | 219900 | 20700 |
23 | 2023-04-20 | 성광디자인 | 스티커 10000 원권 | 1000 | 1000 |
24 | <NA> | <NA> | <NA> | <NA> | <NA> |
25 | <NA> | <NA> | <NA> | <NA> | <NA> |
26 | <NA> | <NA> | <NA> | <NA> | <NA> |
27 | <NA> | <NA> | <NA> | <NA> | <NA> |
28 | <NA> | <NA> | <NA> | <NA> | <NA> |
29 | <NA> | <NA> | <NA> | <NA> | <NA> |
30 | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
Index | 발주일자 | 업체명 | 품목 | 발주수량 | 입고수량 | # duplicates |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 7 |
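The figure of 1 duplicate row (3.2%) alongside a `# duplicates` value of 7 is consistent with counting distinct rows that repeat, rather than every repeated occurrence. That interpretation is an assumption based on the numbers shown. A sketch using the rows indexed 21 to 30 from the sample tables, with `None` standing in for `<NA>`:

```python
from collections import Counter

NA = None  # stand-in for the report's <NA> marker

# Rows 21-30 as listed in the sample table (the only rows that repeat).
rows = [
    ("2023-04-18", "영광산업", "사업계용 30L", 30000, NA),
    ("2023-04-18", "영광산업", "사업계용 60L", 219900, 20700),
    ("2023-04-20", "성광디자인", "스티커 10000 원권", 1000, 1000),
] + [(NA,) * 5 for _ in range(7)]

occurrences = Counter(rows)
# Keep only rows that occur more than once, with their occurrence counts.
duplicated = {row: count for row, count in occurrences.items() if count > 1}

n_observations = 31  # from the Dataset statistics table
duplicate_pct = round(100 * len(duplicated) / n_observations, 1)
print(len(duplicated), list(duplicated.values()), duplicate_pct)
```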