Overview

Dataset statistics

Number of variables: 5
Number of observations: 143
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 12
Duplicate rows (%): 8.4%
Total size in memory: 5.9 KiB
Average record size in memory: 41.9 B
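
The duplicate-row figure appears consistent with counting distinct rows that occur more than once (12 of 143 is about 8.4%), not every repeated copy. The sketch below illustrates that reading only; `duplicate_summary` is a hypothetical helper and the rows are illustrative, not the profiler's own code or the full dataset.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (distinct rows seen more than once, that count as a % of all rows)."""
    counts = Counter(tuple(row) for row in rows)
    n_dup_groups = sum(1 for c in counts.values() if c > 1)
    pct = 100.0 * n_dup_groups / len(rows) if rows else 0.0
    return n_dup_groups, pct

# Illustrative rows only: (company, sequence, status code, registration date).
rows = [
    ("롯데하이마트", 1, "ISF", "2017-06-01"),
    ("롯데하이마트", 1, "ISF", "2017-06-01"),
    ("롯데하이마트", 2, "ISF", "2019-08-02"),
    ("한국엡손(주)", 1, "ISF", "2017-03-14"),
]
n_groups, pct = duplicate_summary(rows)
print(n_groups, f"{pct:.1f}%")
```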

Variable types

Text: 1
Categorical: 2
Boolean: 1
DateTime: 1

Dataset

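Description: Provides certificate information for companies registered in the recycling system for electrical and electronic products and automobiles (company name, certificate sequence number, certificate status code, previously used certificate type, registration date)
Author: Ministry of Environment (환경부)
URL: https://www.data.go.kr/data/15092561/fileData.do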

Alerts

Constant: 이전 사용 인증서 종류 (previously used certificate type) has constant value ""
Duplicates: Dataset has 12 (8.4%) duplicate rows

Reproduction

Analysis started: 2024-04-17 13:02:43.919935
Analysis finished: 2024-04-17 13:02:44.311542
Duration: 0.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업체명 (company name)
Text

Distinct: 69
Distinct (%): 48.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 27
Median length: 17
Mean length: 7.8811189
Min length: 5
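
As a rough illustration of how such length statistics can be derived, the sketch below measures string lengths in code points with the standard library. `length_stats` is a hypothetical helper, and the input is just the five sample rows shown in this report, so the results differ from the full-column figures above. The reported mean does match total characters divided by observations (1127 / 143).

```python
import statistics

def length_stats(values):
    """Return max, median, mean and min of the string lengths (in code points)."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": statistics.median(lengths),
        "mean": statistics.mean(lengths),
        "min": min(lengths),
    }

# The five sample rows of the company-name column.
sample = [
    "이순환거버넌스(E-Cycle Governance)",
    "한국엡손(주)",
    "금오자동차해체산업",
    "신도자동차해체재활용산업",
    "주식회사 에이루트",
]
stats = length_stats(sample)
print(stats)
```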

Characters and Unicode

Total characters: 1127
Distinct characters: 183
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
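
A minimal sketch of this kind of analysis, using Python's standard `unicodedata` module to tally general categories per character. `category_counts` is a hypothetical helper, not part of the profiler. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, `Zs` Space Separator.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Tally Unicode general categories (e.g. 'Lo', 'Ps') over all characters."""
    return Counter(unicodedata.category(ch) for text in values for ch in text)

# Two company names taken from the sample rows.
counts = category_counts(["한국엡손(주)", "주식회사 에이루트"])
for category, count in counts.most_common():
    print(category, count)
```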

Unique

Unique: 57
Unique (%): 39.9%

Sample

1st row: 이순환거버넌스(E-Cycle Governance)
2nd row: 한국엡손(주)
3rd row: 금오자동차해체산업
4th row: 신도자동차해체재활용산업
5th row: 주식회사 에이루트
Value | Count | Frequency (%)
롯데하이마트 | 56 | 32.9%
주식회사 | 16 | 9.4%
캐논코리아(주 | 9 | 5.3%
한국환경공단 | 3 | 1.8%
본사11 | 3 | 1.8%
지멘스 | 2 | 1.2%
위더스컴퓨터(주 | 2 | 1.2%
케이티 | 2 | 1.2%
헬스케어(주 | 2 | 1.2%
주)웅진 | 2 | 1.2%
Other values (68) | 73 | 42.9%
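
Tokens such as `캐논코리아(주` and `주)웅진` in the word table are consistent with splitting each value on whitespace and then stripping ASCII punctuation from the ends of each token. The sketch below reproduces that behaviour under this assumption; `word_counts` is a hypothetical helper and is not taken from the profiler's source.

```python
import string
from collections import Counter

def word_counts(values):
    """Split on whitespace, strip ASCII punctuation from token edges, count tokens."""
    tokens = []
    for text in values:
        for raw in text.split():
            token = raw.strip(string.punctuation)
            if token:  # skip tokens that were punctuation only
                tokens.append(token)
    return Counter(tokens)

counts = word_counts(["캐논코리아(주)", "(주)웅진", "주식회사 에이루트"])
print(counts.most_common())
```

Only leading and trailing punctuation is removed, which is why the inner parenthesis survives in both tokens.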

Most occurring characters

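Value | Count | Frequency (%)
(glyph lost) | 75 | 6.7%
(glyph lost) | 62 | 5.5%
(glyph lost) | 58 | 5.1%
(glyph lost) | 56 | 5.0%
(glyph lost) | 56 | 5.0%
(glyph lost) | 56 | 5.0%
(glyph lost) | 56 | 5.0%
( | 42 | 3.7%
) | 42 | 3.7%
(glyph lost) | 30 | 2.7%
Other values (173) | 594 | 52.7%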

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 990 | 87.8%
Open Punctuation | 42 | 3.7%
Close Punctuation | 42 | 3.7%
Space Separator | 27 | 2.4%
Lowercase Letter | 13 | 1.2%
Decimal Number | 6 | 0.5%
Uppercase Letter | 6 | 0.5%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph lost) | 75 | 7.6%
(glyph lost) | 62 | 6.3%
(glyph lost) | 58 | 5.9%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 30 | 3.0%
(glyph lost) | 23 | 2.3%
(glyph lost) | 23 | 2.3%
Other values (153) | 495 | 50.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 3 | 23.1%
c | 2 | 15.4%
n | 2 | 15.4%
o | 1 | 7.7%
v | 1 | 7.7%
r | 1 | 7.7%
a | 1 | 7.7%
l | 1 | 7.7%
y | 1 | 7.7%

Uppercase Letter
Value | Count | Frequency (%)
G | 1 | 16.7%
T | 1 | 16.7%
I | 1 | 16.7%
H | 1 | 16.7%
C | 1 | 16.7%
E | 1 | 16.7%

Open Punctuation
Value | Count | Frequency (%)
( | 42 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 42 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 27 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 6 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 990 | 87.8%
Common | 118 | 10.5%
Latin | 19 | 1.7%

Most frequent character per script

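Hangul
Value | Count | Frequency (%)
(glyph lost) | 75 | 7.6%
(glyph lost) | 62 | 6.3%
(glyph lost) | 58 | 5.9%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 30 | 3.0%
(glyph lost) | 23 | 2.3%
(glyph lost) | 23 | 2.3%
Other values (153) | 495 | 50.0%

Latin
Value | Count | Frequency (%)
e | 3 | 15.8%
c | 2 | 10.5%
n | 2 | 10.5%
o | 1 | 5.3%
G | 1 | 5.3%
v | 1 | 5.3%
r | 1 | 5.3%
a | 1 | 5.3%
T | 1 | 5.3%
I | 1 | 5.3%
Other values (5) | 5 | 26.3%

Common
Value | Count | Frequency (%)
( | 42 | 35.6%
) | 42 | 35.6%
(space) | 27 | 22.9%
1 | 6 | 5.1%
- | 1 | 0.8%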

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 990 | 87.8%
ASCII | 137 | 12.2%

Most frequent character per block

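Hangul
Value | Count | Frequency (%)
(glyph lost) | 75 | 7.6%
(glyph lost) | 62 | 6.3%
(glyph lost) | 58 | 5.9%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 56 | 5.7%
(glyph lost) | 30 | 3.0%
(glyph lost) | 23 | 2.3%
(glyph lost) | 23 | 2.3%
Other values (153) | 495 | 50.0%

ASCII
Value | Count | Frequency (%)
( | 42 | 30.7%
) | 42 | 30.7%
(space) | 27 | 19.7%
1 | 6 | 4.4%
e | 3 | 2.2%
c | 2 | 1.5%
n | 2 | 1.5%
o | 1 | 0.7%
G | 1 | 0.7%
v | 1 | 0.7%
Other values (10) | 10 | 7.3%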

인증서 순번 (certificate sequence number)
Categorical

Distinct: 4
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
1: 97
2: 17
3: 15
4: 14

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 97 | 67.8%
2 | 17 | 11.9%
3 | 15 | 10.5%
4 | 14 | 9.8%

Length

Histogram of lengths of the category

인증서 상태 코드 (certificate status code)
Categorical

Distinct: 4
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
ISF: 82
RTN: 23
APF: 21
ISA: 17

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ISF
2nd row: ISF
3rd row: RTN
4th row: RTN
5th row: RTN

Common Values

Value | Count | Frequency (%)
ISF | 82 | 57.3%
RTN | 23 | 16.1%
APF | 21 | 14.7%
ISA | 17 | 11.9%

Length

Histogram of lengths of the category

이전 사용 인증서 종류 (previously used certificate type)
Boolean

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 275.0 B

Value | Count | Frequency (%)
False | 143 | 100.0%

등록일 (registration date)
DateTime

Distinct: 74
Distinct (%): 51.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
Minimum: 2016-03-08 00:00:00
Maximum: 2023-03-15 00:00:00

Histogram with fixed size bins (bins=50)
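
A minimal sketch of fixed-size binning over a date range, assuming 50 equal-width bins between the reported minimum and maximum registration dates. `fixed_bin_edges` and `bin_index` are hypothetical helpers written for illustration; the plotting library's own binning may treat edge cases differently.

```python
from datetime import datetime

def fixed_bin_edges(start, end, bins=50):
    """Return bins + 1 equally spaced datetime edges from start to end."""
    step = (end - start) / bins
    return [start + i * step for i in range(bins)] + [end]

def bin_index(value, start, end, bins=50):
    """Index of the bin holding value; the maximum is placed in the last bin."""
    if not start <= value <= end:
        raise ValueError("value outside histogram range")
    width = (end - start) / bins
    return min(bins - 1, int((value - start) / width))

start = datetime(2016, 3, 8)   # reported minimum
end = datetime(2023, 3, 15)    # reported maximum
edges = fixed_bin_edges(start, end)
print(len(edges), edges[1] - edges[0])
print(bin_index(datetime(2017, 6, 1), start, end))
```

Each bin spans the full range divided by 50, so a single bin covers a little over 51 days here.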

Correlations

Variable | 업체명 | 인증서 순번 | 인증서 상태 코드 | 등록일
업체명 | 1.000 | 0.000 | 0.926 | 1.000
인증서 순번 | 0.000 | 1.000 | 0.832 | 1.000
인증서 상태 코드 | 0.926 | 0.832 | 1.000 | 1.000
등록일 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 인증서 순번 | 인증서 상태 코드
인증서 순번 | 1.000 | 0.480
인증서 상태 코드 | 0.480 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

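Index | 업체명 | 인증서 순번 | 인증서 상태 코드 | 이전 사용 인증서 종류 | 등록일
0 | 이순환거버넌스(E-Cycle Governance) | 1 | ISF | N | 2016-04-12
1 | 한국엡손(주) | 1 | ISF | N | 2017-03-14
2 | 금오자동차해체산업 | 1 | RTN | N | 2022-07-25
3 | 신도자동차해체재활용산업 | 1 | RTN | N | 2020-06-10
4 | 주식회사 에이루트 | 1 | RTN | N | 2018-01-10
5 | 영월자동차해체산업 | 1 | ISF | N | 2016-03-08
6 | 유파트폐차장 | 1 | ISF | N | 2021-10-12
7 | (주)영광자동차해체재활용산업 | 1 | APF | N | 2017-01-09
8 | 강원도 횡성군 | 1 | RTN | N | 2017-04-19
9 | 캐논코리아(주) | 1 | ISF | N | 2017-06-14

Index | 업체명 | 인증서 순번 | 인증서 상태 코드 | 이전 사용 인증서 종류 | 등록일
133 | 성신폐차장 | 1 | RTN | N | 2020-09-29
134 | (주)디바인바이오 | 1 | ISF | N | 2020-10-13
135 | 주식회사 포엠컴퍼니 | 1 | RTN | N | 2020-12-09
136 | 홍익테크(HIT) | 1 | RTN | N | 2021-04-11
137 | 주식회사 골드자동차해체재활용산업 | 1 | APF | N | 2021-05-18
138 | 주식회사 아이닉 | 1 | RTN | N | 2021-04-27
139 | 주식회사 라임커머스 | 1 | ISA | N | 2021-09-06
140 | 제이비크린 | 1 | RTN | N | 2021-11-04
141 | 주식회사 에스알씨 | 1 | APF | N | 2021-11-11
142 | 주식회사 제이디테크 | 1 | RTN | N | 2022-01-05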

Duplicate rows

Most frequently occurring

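Index | 업체명 | 인증서 순번 | 인증서 상태 코드 | 이전 사용 인증서 종류 | 등록일 | # duplicates
2 | 롯데하이마트 | 1 | ISF | N | 2017-06-01 | 14
3 | 롯데하이마트 | 2 | ISF | N | 2019-08-02 | 14
4 | 롯데하이마트 | 3 | ISF | N | 2021-04-06 | 14
5 | 롯데하이마트 | 4 | APF | N | 2022-03-21 | 14
11 | 캐논코리아(주) | 1 | ISF | N | 2017-06-14 | 9
0 | (주)대교통상 | 1 | ISA | N | 2019-12-17 | 2
1 | (주)웅진 | 1 | ISF | N | 2019-10-28 | 2
6 | 오디오월드 | 1 | RTN | N | 2018-12-12 | 2
7 | 위더스컴퓨터(주) | 1 | ISF | N | 2017-11-22 | 2
8 | 이누스(주)예산지점 | 1 | APF | N | 2020-06-24 | 2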