Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows520
Duplicate rows (%)5.2%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text1
Categorical1
DateTime1

Dataset

Description전기전자제품및자동차의재활용시스템 내 관리자 업무 관리정보를 제공(의무이행 년도, 업체명, 실적(RESU-재활용,RTRV-회수), 등록일)
Author환경부
URLhttps://www.data.go.kr/data/15092450/fileData.do

Alerts

Dataset has 520 (5.2%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-06 08:18:57.610063
Analysis finished2024-04-06 08:18:59.298871
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

의무이행 년도
Real number (ℝ)

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2021.1367
Minimum2011
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T17:18:59.410087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2011
5-th percentile2017
Q12020
median2021
Q32023
95-th percentile2024
Maximum2024
Range13
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.3192566
Coefficient of variation (CV)0.0011475011
Kurtosis0.45834547
Mean2021.1367
Median Absolute Deviation (MAD)2
Skewness-0.90844002
Sum20211367
Variance5.378951
MonotonicityNot monotonic
2024-04-06T17:18:59.662036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2022 1762
17.6%
2021 1682
16.8%
2023 1674
16.7%
2024 1555
15.6%
2020 1480
14.8%
2017 471
 
4.7%
2019 452
 
4.5%
2018 439
 
4.4%
2016 196
 
2.0%
2015 187
 
1.9%
Other values (4) 102
 
1.0%
ValueCountFrequency (%)
2011 1
 
< 0.1%
2012 1
 
< 0.1%
2013 2
 
< 0.1%
2014 98
 
1.0%
2015 187
 
1.9%
2016 196
 
2.0%
2017 471
 
4.7%
2018 439
 
4.4%
2019 452
 
4.5%
2020 1480
14.8%
ValueCountFrequency (%)
2024 1555
15.6%
2023 1674
16.7%
2022 1762
17.6%
2021 1682
16.8%
2020 1480
14.8%
2019 452
 
4.5%
2018 439
 
4.4%
2017 471
 
4.7%
2016 196
 
2.0%
2015 187
 
1.9%
Distinct2239
Distinct (%)22.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T17:19:00.131892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length21
Mean length8.3851
Min length2

Characters and Unicode

Total characters83851
Distinct characters614
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique322 ?
Unique (%)3.2%

Sample

1st row다이슨코리아 유한회사
2nd row롯데하이마트
3rd row(주)티엠어플라이언스
4th row주식회사 파트너라인
5th row미라클1019
ValueCountFrequency (%)
주식회사 1280
 
10.8%
롯데하이마트 129
 
1.1%
주)에스와이에스리테일 105
 
0.9%
97
 
0.8%
유한회사 63
 
0.5%
한샘 36
 
0.3%
lg전자 35
 
0.3%
코리아 33
 
0.3%
평택공장 30
 
0.3%
로얄앤컴퍼니(주 29
 
0.2%
Other values (2290) 9985
84.5%
2024-04-06T17:19:00.943801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7894
 
9.4%
) 6274
 
7.5%
( 6273
 
7.5%
3369
 
4.0%
3075
 
3.7%
2064
 
2.5%
1848
 
2.2%
1821
 
2.2%
1817
 
2.2%
1695
 
2.0%
Other values (604) 47721
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68044
81.1%
Close Punctuation 6274
 
7.5%
Open Punctuation 6273
 
7.5%
Space Separator 1848
 
2.2%
Uppercase Letter 783
 
0.9%
Lowercase Letter 428
 
0.5%
Other Punctuation 97
 
0.1%
Decimal Number 54
 
0.1%
Connector Punctuation 29
 
< 0.1%
Dash Punctuation 21
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7894
 
11.6%
3369
 
5.0%
3075
 
4.5%
2064
 
3.0%
1821
 
2.7%
1817
 
2.7%
1695
 
2.5%
1543
 
2.3%
1454
 
2.1%
1389
 
2.0%
Other values (548) 41923
61.6%
Uppercase Letter
ValueCountFrequency (%)
L 102
13.0%
S 67
 
8.6%
T 63
 
8.0%
G 58
 
7.4%
E 51
 
6.5%
K 50
 
6.4%
I 47
 
6.0%
C 46
 
5.9%
A 44
 
5.6%
B 43
 
5.5%
Other values (13) 212
27.1%
Lowercase Letter
ValueCountFrequency (%)
e 58
13.6%
o 55
12.9%
s 44
10.3%
m 35
8.2%
y 31
 
7.2%
c 31
 
7.2%
a 28
 
6.5%
n 25
 
5.8%
r 24
 
5.6%
d 16
 
3.7%
Other values (9) 81
18.9%
Decimal Number
ValueCountFrequency (%)
1 26
48.1%
0 10
 
18.5%
2 10
 
18.5%
9 5
 
9.3%
6 3
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 40
41.2%
, 38
39.2%
& 18
18.6%
; 1
 
1.0%
Close Punctuation
ValueCountFrequency (%)
) 6274
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6273
100.0%
Space Separator
ValueCountFrequency (%)
1848
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68044
81.1%
Common 14596
 
17.4%
Latin 1211
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7894
 
11.6%
3369
 
5.0%
3075
 
4.5%
2064
 
3.0%
1821
 
2.7%
1817
 
2.7%
1695
 
2.5%
1543
 
2.3%
1454
 
2.1%
1389
 
2.0%
Other values (548) 41923
61.6%
Latin
ValueCountFrequency (%)
L 102
 
8.4%
S 67
 
5.5%
T 63
 
5.2%
G 58
 
4.8%
e 58
 
4.8%
o 55
 
4.5%
E 51
 
4.2%
K 50
 
4.1%
I 47
 
3.9%
C 46
 
3.8%
Other values (32) 614
50.7%
Common
ValueCountFrequency (%)
) 6274
43.0%
( 6273
43.0%
1848
 
12.7%
. 40
 
0.3%
, 38
 
0.3%
_ 29
 
0.2%
1 26
 
0.2%
- 21
 
0.1%
& 18
 
0.1%
0 10
 
0.1%
Other values (4) 19
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68044
81.1%
ASCII 15807
 
18.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7894
 
11.6%
3369
 
5.0%
3075
 
4.5%
2064
 
3.0%
1821
 
2.7%
1817
 
2.7%
1695
 
2.5%
1543
 
2.3%
1454
 
2.1%
1389
 
2.0%
Other values (548) 41923
61.6%
ASCII
ValueCountFrequency (%)
) 6274
39.7%
( 6273
39.7%
1848
 
11.7%
L 102
 
0.6%
S 67
 
0.4%
T 63
 
0.4%
G 58
 
0.4%
e 58
 
0.4%
o 55
 
0.3%
E 51
 
0.3%
Other values (46) 958
 
6.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
RUSE
7462 
RTRV
2538 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRUSE
2nd rowRUSE
3rd rowRUSE
4th rowRTRV
5th rowRUSE

Common Values

ValueCountFrequency (%)
RUSE 7462
74.6%
RTRV 2538
 
25.4%

Length

2024-04-06T17:19:01.262983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:19:01.430472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ruse 7462
74.6%
rtrv 2538
 
25.4%
Distinct558
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2016-12-16 00:00:00
Maximum2024-01-18 00:00:00
2024-04-06T17:19:01.639514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:19:01.926958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-04-06T17:18:58.623420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:19:02.115939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의무이행 년도실적(RESU-재활용_RTRV-회수)
의무이행 년도1.0000.162
실적(RESU-재활용_RTRV-회수)0.1621.000
2024-04-06T17:19:02.292183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의무이행 년도실적(RESU-재활용_RTRV-회수)
의무이행 년도1.0000.124
실적(RESU-재활용_RTRV-회수)0.1241.000

Missing values

2024-04-06T17:18:58.970439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:18:59.214884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

의무이행 년도업체명실적(RESU-재활용_RTRV-회수)등록일
90292020다이슨코리아 유한회사RUSE2020-03-10
104142018롯데하이마트RUSE2018-01-08
90672020(주)티엠어플라이언스RUSE2020-03-10
37532020주식회사 파트너라인RTRV2022-05-23
50332022미라클1019RUSE2022-01-20
20262023신광전자(주)RTRV2023-01-16
34642023롯데하이마트RUSE2023-01-16
11342024(주)신동RUSE2024-01-11
2362024주식회사 휴롬엘에스RUSE2024-01-11
61932021(주)동보올리브백화점RTRV2021-02-09
의무이행 년도업체명실적(RESU-재활용_RTRV-회수)등록일
362024(주)카스모아이티RUSE2024-01-11
51122022(주)아이엔에스엔터프라이즈RUSE2022-01-20
5882024(주)사라반도체RUSE2024-01-11
111872016(주)건평정보통신RUSE2017-01-23
104322018롯데하이마트RTRV2018-01-08
83412020(주)이엠텍아이엔씨RTRV2020-03-10
722024(주)이너트론RUSE2024-01-11
50852022(주)중산물산RUSE2022-01-20
75712021(주)바울글로벌RUSE2021-01-18
72612021(주)태크노마트RUSE2021-01-18

Duplicate rows

Most frequently occurring

의무이행 년도업체명실적(RESU-재활용_RTRV-회수)등록일# duplicates
2212021롯데하이마트RTRV2021-12-0814
3102022롯데하이마트RUSE2022-01-2014
4102023롯데하이마트RUSE2023-01-1614
2222021롯데하이마트RUSE2021-01-1813
3092022롯데하이마트RTRV2022-02-0712
952020(주)에스와이에스리테일RTRV2020-03-1211
1372020롯데하이마트RTRV2020-03-1211
1842021(주)에스와이에스리테일RUSE2021-01-1811
2702022(주)에스와이에스리테일RTRV2022-02-0711
3662023(주)에스와이에스리테일RUSE2023-01-1611