Overview

Dataset statistics

Number of variables: 5
Number of observations: 1603
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 127
Duplicate rows (%): 7.9%
Total size in memory: 64.3 KiB
Average record size in memory: 41.1 B
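
The percentage and per-record figures above follow from the raw counts. The short check below recomputes them; it assumes 1 KiB = 1024 bytes, and the variable names are illustrative rather than taken from the profiling tool.

```python
# Figures copied from the dataset statistics above.
n_rows = 1603
n_duplicate_rows = 127
total_size_kib = 64.3

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = n_duplicate_rows / n_rows * 100

# Average record size in bytes (assumes 1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```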

Variable types

Categorical: 3
Text: 1
Boolean: 1

Dataset

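Description: Provides company branch management information for waste electrical and electronic products within the Electrical and Electronic Products and Automobile Recycling System. Columns: 의무이행 년도 (obligation fulfilment year), 업체명 (company name), 지점 명 (branch name), 본사 여부 (head office flag), 등록일 (registration date).
Author: Ministry of Environment (환경부)
URL: https://www.data.go.kr/data/15092445/fileData.do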

Alerts

의무이행 년도 has constant value "2024" (Constant)
본사 여부 has constant value "False" (Constant)
등록일 has constant value "2024-01-01" (Constant)
Dataset has 127 (7.9%) duplicate rows (Duplicates)
업체명 is highly imbalanced (89.5%) (Imbalance)
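
The report does not say how the imbalance figure is derived. One definition that reproduces 89.5% from the 업체명 counts listed in this report is one minus the normalised Shannon entropy of the class frequencies. The sketch below is written under that assumption; `imbalance_score` is an illustrative name, and the breakdown of the seven unlisted values is inferred, as noted in the comments.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class frequencies and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0  # assumed convention: a single class has no imbalance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# 업체명 counts: the ten values listed under Common Values, plus the 7 "other" values.
# Splitting those 7 values (8 rows) into one value seen twice and six seen once
# is inferred from "Unique: 6"; it is not read directly from the report.
counts = [1524, 33, 12, 9, 4, 4, 3, 2, 2, 2] + [2] + [1] * 6

score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score near 100% means one category dominates; a score of 0% would mean all 17 categories are equally frequent.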

Reproduction

Analysis started: 2024-04-06 08:36:14.038713
Analysis finished: 2024-04-06 08:36:14.682363
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

의무이행 년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.7 KiB

Value | Count
2024 | 1603

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024
2nd row: 2024
3rd row: 2024
4th row: 2024
5th row: 2024

Common Values

Value | Count | Frequency (%)
2024 | 1603 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024 | 1603 | 100.0%

업체명
Categorical

IMBALANCE 

Distinct: 17
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.7 KiB

Value | Count
(주)에스와이에스리테일 | 1524
롯데쇼핑(주) | 33
현대백화점 | 12
한화갤러리아(주) | 9
에이케이에스앤디(주)AK분당점 | 4
Other values (12) | 21

Length

Max length: 16
Median length: 12
Mean length: 11.782283
Min length: 2

Unique

Unique: 6
Unique (%): 0.4%

Sample

1st row: (주)대구백화점
2nd row: (주)디지털리치
3rd row: (주)디지털리치
4th row: (주)에스와이에스리테일
5th row: (주)에스와이에스리테일

Common Values

Value | Count | Frequency (%)
(주)에스와이에스리테일 | 1524 | 95.1%
롯데쇼핑(주) | 33 | 2.1%
현대백화점 | 12 | 0.7%
한화갤러리아(주) | 9 | 0.6%
에이케이에스앤디(주)AK분당점 | 4 | 0.2%
한무쇼핑 | 4 | 0.2%
(주)인투홈 | 3 | 0.2%
현진전자(주)광주지점 | 2 | 0.1%
(주)디지털리치 | 2 | 0.1%
주식회사 태현유통 산격지점 | 2 | 0.1%
Other values (7) | 8 | 0.5%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
주)에스와이에스리테일 | 1524 | 94.8%
롯데쇼핑(주 | 33 | 2.1%
현대백화점 | 12 | 0.7%
한화갤러리아(주 | 9 | 0.6%
에이케이에스앤디(주)ak분당점 | 4 | 0.2%
한무쇼핑 | 4 | 0.2%
주)인투홈 | 3 | 0.2%
태현유통 | 2 | 0.1%
삼성디지털프라자중리점 | 2 | 0.1%
산격지점 | 2 | 0.1%
Other values (9) | 12 | 0.7%
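
The tokens in the table just above differ from the raw 업체명 values in a consistent way: they are lower-cased ("AK" becomes "ak"), split on whitespace, and lose punctuation only at the token edges ("(주)에스와이에스리테일" becomes "주)에스와이에스리테일"). The sketch below reproduces that behaviour under those assumptions. `tokenize` is a hypothetical helper, not the profiling tool's own function.

```python
import string
from collections import Counter

def tokenize(value):
    """Lower-case, split on whitespace, and strip punctuation at token edges."""
    tokens = (t.strip(string.punctuation) for t in value.lower().split())
    return [t for t in tokens if t]  # drop tokens that were punctuation only

# Three 업체명 values taken from the Common Values table.
names = ["(주)에스와이에스리테일", "롯데쇼핑(주)", "주식회사 태현유통 산격지점"]

word_counts = Counter(token for name in names for token in tokenize(name))
for token, count in word_counts.most_common():
    print(token, count)
```

Because only the edges of each token are stripped, the inner parentheses of a value such as "에이케이에스앤디(주)AK분당점" survive tokenisation.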

지점 명
Text

Distinct: 206
Distinct (%): 12.9%
Missing: 0
Missing (%): 0.0%
Memory size: 12.7 KiB

Length

Max length: 24
Median length: 23
Mean length: 17.16781
Min length: 2

Characters and Unicode

Total characters: 27520
Distinct characters: 205
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
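
As an illustration of the sentence above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value of the text variable. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative, and the profiling tool may compute its figures differently.

```python
import unicodedata
from collections import Counter

# Two-letter codes returned by unicodedata.category, mapped to the names the report uses.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(text):
    """Count characters in `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

sample = "(주) 에스와이에스리테일 G7스퀘어점"  # a sample value from this report
for name, count in category_counts(sample).most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the character statistics for this variable.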

Unique

Unique: 79
Unique (%): 4.9%

Sample

1st row: (주)대구백화점대백프라자
2nd row: 주식회사 디지털리치 삼성디지털프라자 통진지점
3rd row: 주식회사 디지털리치 김포한강점
4th row: (주) 에스와이에스리테일 G7스퀘어점
5th row: (주) 에스와이에스리테일 G7스퀘어점

Words

Value | Count | Frequency (%)
(not shown) | 948 | 22.5%
에스와이에스리테일 | 936 | 22.2%
주)에스와이에스리테일 | 576 | 13.7%
메가마트점 | 36 | 0.9%
롯데백화점 | 30 | 0.7%
울산 | 24 | 0.6%
중동점 | 14 | 0.3%
부평점 | 13 | 0.3%
광복점 | 13 | 0.3%
일산점 | 13 | 0.3%
Other values (202) | 1610 | 38.2%

Most occurring characters

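Value | Count | Frequency (%)
(not shown) | 3068 | 11.1%
(not shown) | 3058 | 11.1%
(space) | 2610 | 9.5%
(not shown) | 1721 | 6.3%
(not shown) | 1580 | 5.7%
(not shown) | 1577 | 5.7%
(not shown) | 1550 | 5.6%
( | 1544 | 5.6%
) | 1544 | 5.6%
(not shown) | 1537 | 5.6%
Other values (195) | 7731 | 28.1%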

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 21778 | 79.1%
Space Separator | 2610 | 9.5%
Open Punctuation | 1544 | 5.6%
Close Punctuation | 1544 | 5.6%
Decimal Number | 24 | 0.1%
Uppercase Letter | 20 | 0.1%

Most frequent character per category

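Other Letter

Value | Count | Frequency (%)
(not shown) | 3068 | 14.1%
(not shown) | 3058 | 14.0%
(not shown) | 1721 | 7.9%
(not shown) | 1580 | 7.3%
(not shown) | 1577 | 7.2%
(not shown) | 1550 | 7.1%
(not shown) | 1537 | 7.1%
(not shown) | 1525 | 7.0%
(not shown) | 1524 | 7.0%
(not shown) | 211 | 1.0%
Other values (187) | 4427 | 20.3%

Uppercase Letter

Value | Count | Frequency (%)
G | 12 | 60.0%
K | 4 | 20.0%
A | 4 | 20.0%

Decimal Number

Value | Count | Frequency (%)
7 | 12 | 50.0%
2 | 12 | 50.0%

Space Separator

Value | Count | Frequency (%)
(space) | 2610 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 1544 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 1544 | 100.0%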

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 21778 | 79.1%
Common | 5722 | 20.8%
Latin | 20 | 0.1%

Most frequent character per script

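Hangul

Value | Count | Frequency (%)
(not shown) | 3068 | 14.1%
(not shown) | 3058 | 14.0%
(not shown) | 1721 | 7.9%
(not shown) | 1580 | 7.3%
(not shown) | 1577 | 7.2%
(not shown) | 1550 | 7.1%
(not shown) | 1537 | 7.1%
(not shown) | 1525 | 7.0%
(not shown) | 1524 | 7.0%
(not shown) | 211 | 1.0%
Other values (187) | 4427 | 20.3%

Common

Value | Count | Frequency (%)
(space) | 2610 | 45.6%
( | 1544 | 27.0%
) | 1544 | 27.0%
7 | 12 | 0.2%
2 | 12 | 0.2%

Latin

Value | Count | Frequency (%)
G | 12 | 60.0%
K | 4 | 20.0%
A | 4 | 20.0%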

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 21778 | 79.1%
ASCII | 5742 | 20.9%

Most frequent character per block

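Hangul

Value | Count | Frequency (%)
(not shown) | 3068 | 14.1%
(not shown) | 3058 | 14.0%
(not shown) | 1721 | 7.9%
(not shown) | 1580 | 7.3%
(not shown) | 1577 | 7.2%
(not shown) | 1550 | 7.1%
(not shown) | 1537 | 7.1%
(not shown) | 1525 | 7.0%
(not shown) | 1524 | 7.0%
(not shown) | 211 | 1.0%
Other values (187) | 4427 | 20.3%

ASCII

Value | Count | Frequency (%)
(space) | 2610 | 45.5%
( | 1544 | 26.9%
) | 1544 | 26.9%
G | 12 | 0.2%
7 | 12 | 0.2%
2 | 12 | 0.2%
K | 4 | 0.1%
A | 4 | 0.1%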

본사 여부
Boolean

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count | Frequency (%)
False | 1603 | 100.0%

등록일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.7 KiB

Value | Count
2024-01-01 | 1603

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-01-01
2nd row: 2024-01-01
3rd row: 2024-01-01
4th row: 2024-01-01
5th row: 2024-01-01

Common Values

Value | Count | Frequency (%)
2024-01-01 | 1603 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024-01-01 | 1603 | 100.0%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 의무이행 년도 | 업체명 | 지점 명 | 본사 여부 | 등록일
0 | 2024 | (주)대구백화점 | (주)대구백화점대백프라자 | N | 2024-01-01
1 | 2024 | (주)디지털리치 | 주식회사 디지털리치 삼성디지털프라자 통진지점 | N | 2024-01-01
2 | 2024 | (주)디지털리치 | 주식회사 디지털리치 김포한강점 | N | 2024-01-01
3 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01
4 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01
5 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01
6 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01
7 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01
8 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01
9 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01

Last rows

Index | 의무이행 년도 | 업체명 | 지점 명 | 본사 여부 | 등록일
1593 | 2024 | 현대백화점 | 디큐브시티 | N | 2024-01-01
1594 | 2024 | 현대백화점 | 천호점 | N | 2024-01-01
1595 | 2024 | 현대백화점 | 부산점 | N | 2024-01-01
1596 | 2024 | 현대백화점 | 신촌점 | N | 2024-01-01
1597 | 2024 | 현대백화점 | 중동점 | N | 2024-01-01
1598 | 2024 | 현대백화점 | 본점 | N | 2024-01-01
1599 | 2024 | 현대백화점 | 판교점 | N | 2024-01-01
1600 | 2024 | 현대백화점 | 미아점 | N | 2024-01-01
1601 | 2024 | 현진전자(주)광주지점 | 현진전자(주)초월점 | N | 2024-01-01
1602 | 2024 | 현진전자(주)광주지점 | 현진전자(주) | N | 2024-01-01

Duplicate rows

Most frequently occurring

Index | 의무이행 년도 | 업체명 | 지점 명 | 본사 여부 | 등록일 | # duplicates
0 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스 리테일 영주점 | N | 2024-01-01 | 12
1 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 G7스퀘어점 | N | 2024-01-01 | 12
2 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 경산점 | N | 2024-01-01 | 12
3 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 경주점 | N | 2024-01-01 | 12
4 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 고현점 | N | 2024-01-01 | 12
5 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 공주점 | N | 2024-01-01 | 12
6 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 광복점 | N | 2024-01-01 | 12
7 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 광산점 | N | 2024-01-01 | 12
8 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 광양점 | N | 2024-01-01 | 12
9 | 2024 | (주)에스와이에스리테일 | (주) 에스와이에스리테일 광주수완하나로점 | N | 2024-01-01 | 12
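
The figures in this report suggest that the duplicate count of 127 refers to distinct row patterns that occur more than once, not to surplus copies: the text variable has 206 distinct values, of which 79 occur exactly once, and 206 - 79 = 127. This is an inference from the numbers, not something the report states. The sketch below shows the difference between the two ways of counting. The rows are hypothetical and `duplicate_groups` is an illustrative helper.

```python
from collections import Counter

def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# Hypothetical rows in the report's column order; not taken from the dataset.
rows = [
    ("2024", "Company A", "Branch 1", "N", "2024-01-01"),
    ("2024", "Company A", "Branch 1", "N", "2024-01-01"),
    ("2024", "Company A", "Branch 1", "N", "2024-01-01"),
    ("2024", "Company A", "Branch 2", "N", "2024-01-01"),
    ("2024", "Company B", "Branch 3", "N", "2024-01-01"),
    ("2024", "Company B", "Branch 3", "N", "2024-01-01"),
]

groups = duplicate_groups(rows)
n_groups = len(groups)                           # distinct duplicated row patterns
n_surplus = sum(n - 1 for n in groups.values())  # copies beyond the first of each pattern
print(n_groups, n_surplus)
```

Here two row patterns are duplicated, while removing duplicates would drop three rows, so the two counts should not be used interchangeably.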