Overview

Dataset statistics

Number of variables6
Number of observations42
Missing cells15
Missing cells (%)6.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory53.1 B

Variable types

Numeric1
Categorical1
DateTime3
Boolean1

Dataset

Description전기전자제품및자동차의재활용시스템 내 부과징수 추가관리 정보를 제공(전자납부 번호, 업체 조사 순번, 업체 조사 일자, 휴폐업 일자, 말소 여부, 말소 일자)
Author환경부
URLhttps://www.data.go.kr/data/15092521/fileData.do

Alerts

업체 조사 순번 is highly imbalanced (52.8%)Imbalance
휴폐업 일자 has 15 (35.7%) missing valuesMissing

Reproduction

Analysis started2024-04-17 10:54:39.788956
Analysis finished2024-04-17 10:54:40.141126
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

전자납부 번호
Real number (ℝ)

Distinct35
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3427118 × 1017
Minimum1.341508 × 1017
Maximum1.3499162 × 1017
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size510.0 B
2024-04-17T19:54:40.197237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.341508 × 1017
5-th percentile1.341508 × 1017
Q11.341509 × 1017
median1.3415121 × 1017
Q31.3415149 × 1017
95-th percentile1.3499142 × 1017
Maximum1.3499162 × 1017
Range8.40817 × 1014
Interquartile range (IQR)5.9100151 × 1011

Descriptive statistics

Standard deviation2.9762606 × 1014
Coefficient of variation (CV)0.0022166041
Kurtosis2.6062971
Mean1.3427118 × 1017
Median Absolute Deviation (MAD)3.0300396 × 1011
Skewness2.1176315
Sum5.6393897 × 1018
Variance8.858127 × 1028
MonotonicityIncreasing
2024-04-17T19:54:40.300204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
134991416104172101 3
 
7.1%
134151314101472001 2
 
4.8%
134151314101551001 2
 
4.8%
134151314102843001 2
 
4.8%
134150900101250001 2
 
4.8%
134151213101710001 2
 
4.8%
134151516103749002 1
 
2.4%
134151415101984001 1
 
2.4%
134151415103654001 1
 
2.4%
134151416105181001 1
 
2.4%
Other values (25) 25
59.5%
ValueCountFrequency (%)
134150800100854001 1
2.4%
134150800101089001 1
2.4%
134150800101427001 1
2.4%
134150800101936001 1
2.4%
134150800101991001 1
2.4%
134150900100881001 1
2.4%
134150900100976001 1
2.4%
134150900101250001 2
4.8%
134150900101943001 1
2.4%
134150900102595001 1
2.4%
ValueCountFrequency (%)
134991617104752003 1
 
2.4%
134991617104752002 1
 
2.4%
134991416104172101 3
7.1%
134991415104796102 1
 
2.4%
134151617102947002 1
 
2.4%
134151516106279006 1
 
2.4%
134151516105674003 1
 
2.4%
134151516105674002 1
 
2.4%
134151516103749002 1
 
2.4%
134151416105181001 1
 
2.4%

업체 조사 순번
Categorical

IMBALANCE 

Distinct3
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size468.0 B
1
35 
2
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)2.4%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 35
83.3%
2 6
 
14.3%
3 1
 
2.4%

Length

2024-04-17T19:54:40.397207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:54:40.478204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 35
83.3%
2 6
 
14.3%
3 1
 
2.4%
Distinct27
Distinct (%)64.3%
Missing0
Missing (%)0.0%
Memory size468.0 B
Minimum2010-02-24 00:00:00
Maximum2018-02-01 00:00:00
2024-04-17T19:54:40.557719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:54:40.646895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)

휴폐업 일자
Date

MISSING 

Distinct23
Distinct (%)85.2%
Missing15
Missing (%)35.7%
Memory size468.0 B
Minimum2009-12-28 00:00:00
Maximum2017-07-21 00:00:00
2024-04-17T19:54:40.733567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:54:40.827244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
Distinct2
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size174.0 B
False
33 
True
ValueCountFrequency (%)
False 33
78.6%
True 9
 
21.4%
2024-04-17T19:54:40.911005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct9
Distinct (%)21.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
Minimum2009-12-28 00:00:00
Maximum2099-01-01 00:00:00
2024-04-17T19:54:40.985693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:54:41.065259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)

Interactions

2024-04-17T19:54:39.953769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T19:54:41.127927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전자납부 번호업체 조사 순번업체 조사 일자휴폐업 일자말소 여부말소 일자
전자납부 번호1.0000.1971.0001.0000.0000.000
업체 조사 순번0.1971.0000.9180.0000.0000.000
업체 조사 일자1.0000.9181.0000.9460.3730.000
휴폐업 일자1.0000.0000.9461.0001.0000.000
말소 여부0.0000.0000.3731.0001.0000.797
말소 일자0.0000.0000.0000.0000.7971.000
2024-04-17T19:54:41.217401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체 조사 순번말소 여부
업체 조사 순번1.0000.000
말소 여부0.0001.000
2024-04-17T19:54:41.282253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전자납부 번호업체 조사 순번말소 여부
전자납부 번호1.0000.3210.000
업체 조사 순번0.3211.0000.000
말소 여부0.0000.0001.000

Missing values

2024-04-17T19:54:40.033864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T19:54:40.109870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

전자납부 번호업체 조사 순번업체 조사 일자휴폐업 일자말소 여부말소 일자
013415080010085400112010-02-242009-12-28Y2009-12-28
113415080010108900112017-06-302010-04-06N2099-01-01
213415080010142700112017-07-182010-11-30Y2099-01-01
313415080010193600112017-07-10<NA>N2099-01-01
413415080010199100112017-07-052009-12-31N2099-01-01
513415090010088100112017-12-062011-02-28Y2015-12-03
613415090010097600112017-06-092013-12-31Y2014-08-25
713415090010125000112010-08-20<NA>N2099-01-01
813415090010125000122017-07-172011-09-30Y2099-01-01
913415090010194300112017-07-052009-12-31N2010-06-14
전자납부 번호업체 조사 순번업체 조사 일자휴폐업 일자말소 여부말소 일자
3213415151610567400212016-10-19<NA>N2099-01-01
3313415151610567400312017-05-19<NA>N2099-01-01
3413415151610627900612016-12-312016-12-31N2099-01-01
3513415161710294700212017-12-06<NA>N2099-01-01
3613499141510479610212017-11-302015-06-30Y2015-12-01
3713499141610417210112018-02-01<NA>N2099-01-01
3813499141610417210122017-06-21<NA>N2099-01-01
3913499141610417210132017-12-11<NA>N2099-01-01
4013499161710475200212017-11-13<NA>N2099-01-01
4113499161710475200312018-01-25<NA>N2099-01-01