Overview

Dataset statistics

Number of variables: 3
Number of observations: 42
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 4
Duplicate rows (%): 9.5%
Total size in memory: 1.2 KiB
Average record size in memory: 29.1 B

Variable types

DateTime: 1
Categorical: 2

Dataset

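Description: Information used to issue waste disposal charge assessment notices, based on the waste disposal charge declarations filed in the Waste Disposal Charge System (폐기물처분부담금시스템).
Author: Korea Environment Corporation (한국환경공단)
URL: https://www.data.go.kr/data/15092753/fileData.do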

Alerts

Dataset has 4 (9.5%) duplicate rows [Duplicates]
진행상태 is highly imbalanced (72.4%) [Imbalance]
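
The report does not say how the imbalance figure is derived. A minimal sketch, assuming the score is one minus the Shannon entropy of the class counts divided by the maximum possible entropy for that number of classes: with the counts listed for 진행상태 (40 and 2), this assumption gives the reported 72.4%. The function name `imbalance_score` is hypothetical and is not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    Assumed definition, not confirmed by the report: 0 means perfectly
    balanced classes, values near 1 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class has no balance to measure; treated as 0 here
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Counts for 진행상태 taken from the report: value 1 occurs 40 times, value 7 twice.
score = imbalance_score([40, 2])
print(f"{score:.1%}")  # prints 72.4%
```

Under this definition a 50/50 split scores 0, so a value of 72.4% reflects how strongly the column is dominated by the value 1.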

Reproduction

Analysis started: 2023-12-12 22:33:11.825595
Analysis finished: 2023-12-12 22:33:12.066364
Duration: 0.24 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

등록일시 (registration date and time)
DateTime

Distinct: 23
Distinct (%): 54.8%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2022-01-06 14:38:35
Maximum: 2022-08-08 14:34:36
Histogram with fixed size bins (bins=23)

순번 (sequence number)
Categorical

Distinct: 4
Distinct (%): 9.5%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
1       25      59.5%
2       9       21.4%
3       4       9.5%
4       4       9.5%
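
The frequency column is each count divided by the 42 observations. A short sketch that recomputes it from the counts listed for 순번; the helper name `frequency_table` is hypothetical, and rounding to one decimal place is an assumption chosen to match the report's display.

```python
from collections import Counter

# Value counts for 순번 as listed in the report.
counts = Counter({"1": 25, "2": 9, "3": 4, "4": 4})


def frequency_table(counter):
    """Return (value, count, percent) tuples, most frequent value first."""
    n = sum(counter.values())
    return [
        (value, count, round(100 * count / n, 1))
        for value, count in counter.most_common()
    ]


for value, count, pct in frequency_table(counts):
    print(f"{value}\t{count}\t{pct}%")
```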

Length

Histogram of lengths of the category

진행상태 (progress status)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
1       40      95.2%
7       2       4.8%

Length

Histogram of lengths of the category

Correlations

            등록일시   순번     진행상태
등록일시    1.000      0.000    1.000
순번        0.000      1.000    0.000
진행상태    1.000      0.000    1.000

            순번     진행상태
순번        1.000    0.000
진행상태    0.000    1.000

            순번     진행상태
순번        1.000    0.000
진행상태    0.000    1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

      등록일시               순번   진행상태
0     2022-01-06 14:38:35    1      1
1     2022-01-06 14:39:02    1      1
2     2022-01-06 14:39:47    1      1
3     2022-01-06 15:54:25    1      1
4     2022-01-06 15:56:27    1      1
5     2022-01-06 15:57:32    1      1
6     2022-01-06 15:58:44    1      1
7     2022-01-06 16:00:09    1      1
8     2022-01-06 16:00:37    1      1
9     2022-01-06 16:01:09    1      1

Last rows

      등록일시               순번   진행상태
32    2022-04-19 14:18:27    2      1
33    2022-04-19 14:18:27    2      1
34    2022-04-19 15:45:00    1      1
35    2022-04-19 15:57:18    1      1
36    2022-04-19 15:57:18    2      1
37    2022-04-19 15:57:18    1      1
38    2022-04-19 15:57:18    2      1
39    2022-04-19 16:06:01    1      1
40    2022-05-09 14:56:29    1      1
41    2022-08-08 14:34:36    1      1

Duplicate rows

Most frequently occurring

      등록일시               순번   진행상태   # duplicates
0     2022-04-19 14:18:27    1      1          2
1     2022-04-19 14:18:27    2      1          2
2     2022-04-19 15:57:18    1      1          2
3     2022-04-19 15:57:18    2      1          2
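
Duplicate groups like these can be recovered by counting identical records. The sketch below runs on the ten rows with index 32 to 41 shown in the Sample section, since the full dataset is not part of this report. It finds three of the four groups; the rows of the remaining group (2022-04-19 14:18:27, 순번 1) do not appear among the displayed rows. The helper name `duplicate_groups` is hypothetical.

```python
from collections import Counter

# Rows 32-41 of the sample: (등록일시, 순번, 진행상태).
rows = [
    ("2022-04-19 14:18:27", "2", "1"),
    ("2022-04-19 14:18:27", "2", "1"),
    ("2022-04-19 15:45:00", "1", "1"),
    ("2022-04-19 15:57:18", "1", "1"),
    ("2022-04-19 15:57:18", "2", "1"),
    ("2022-04-19 15:57:18", "1", "1"),
    ("2022-04-19 15:57:18", "2", "1"),
    ("2022-04-19 16:06:01", "1", "1"),
    ("2022-05-09 14:56:29", "1", "1"),
    ("2022-08-08 14:34:36", "1", "1"),
]


def duplicate_groups(records):
    """Map each record that occurs more than once to its occurrence count."""
    counts = Counter(records)
    return {record: n for record, n in counts.items() if n > 1}


groups = duplicate_groups(rows)
for record, n in groups.items():
    print(record, n)
```

Whether such rows are genuine repeat submissions or an artifact of the export cannot be determined from the report alone, so they are better inspected than dropped automatically.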