Overview

Dataset statistics

Number of variables5
Number of observations348
Missing cells48
Missing cells (%)2.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.4 KiB
Average record size in memory42.4 B

Variable types

Categorical1
Text2
Numeric1
DateTime1

Dataset

Description전기전자제품및자동차의재활용시스템 내 폐전기전자제품의 재활용의무이행계획 정보를 제공(의무이행 년도,업체명,도로명주소,우편번호,승인일자 등)
Author환경부
URLhttps://www.data.go.kr/data/15092364/fileData.do

Alerts

의무이행 년도 has constant value ""Constant
승인일 has 48 (13.8%) missing valuesMissing

Reproduction

Analysis started2024-04-21 13:05:01.146164
Analysis finished2024-04-21 13:05:02.511869
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

의무이행 년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2022
348 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 348
100.0%

Length

2024-04-21T22:05:02.937654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T22:05:03.241536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 348
100.0%
Distinct330
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-04-21T22:05:04.252225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length18
Mean length8.75
Min length2

Characters and Unicode

Total characters3045
Distinct characters343
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique319 ?
Unique (%)91.7%

Sample

1st row(유)홍석
2nd row(주) 이마트
3rd row(주) 지멘스
4th row(주) 한성피앤아이
5th row(주)ABB코리아
ValueCountFrequency (%)
주식회사 75
 
16.9%
한국전자제품자원순환공제조합 8
 
1.8%
4
 
0.9%
코웨이(주 3
 
0.7%
유한책임회사 3
 
0.7%
벨류텍 2
 
0.5%
주)이지넷유비쿼터스 2
 
0.5%
호시자키한국 2
 
0.5%
주)이음전산 2
 
0.5%
코리아 2
 
0.5%
Other values (335) 340
76.7%
2024-04-21T22:05:05.465549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
282
 
9.3%
( 197
 
6.5%
) 197
 
6.5%
121
 
4.0%
107
 
3.5%
100
 
3.3%
98
 
3.2%
95
 
3.1%
89
 
2.9%
65
 
2.1%
Other values (333) 1694
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2489
81.7%
Open Punctuation 197
 
6.5%
Close Punctuation 197
 
6.5%
Space Separator 98
 
3.2%
Lowercase Letter 36
 
1.2%
Uppercase Letter 22
 
0.7%
Decimal Number 2
 
0.1%
Connector Punctuation 2
 
0.1%
Dash Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
282
 
11.3%
121
 
4.9%
107
 
4.3%
100
 
4.0%
95
 
3.8%
89
 
3.6%
65
 
2.6%
61
 
2.5%
58
 
2.3%
50
 
2.0%
Other values (297) 1461
58.7%
Lowercase Letter
ValueCountFrequency (%)
e 6
16.7%
o 4
11.1%
r 4
11.1%
a 3
8.3%
n 3
8.3%
k 2
 
5.6%
p 2
 
5.6%
l 2
 
5.6%
c 2
 
5.6%
i 2
 
5.6%
Other values (6) 6
16.7%
Uppercase Letter
ValueCountFrequency (%)
B 4
18.2%
L 3
13.6%
O 2
9.1%
C 2
9.1%
A 2
9.1%
G 2
9.1%
E 1
 
4.5%
Y 1
 
4.5%
J 1
 
4.5%
K 1
 
4.5%
Other values (3) 3
13.6%
Open Punctuation
ValueCountFrequency (%)
( 197
100.0%
Close Punctuation
ValueCountFrequency (%)
) 197
100.0%
Space Separator
ValueCountFrequency (%)
98
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2489
81.7%
Common 498
 
16.4%
Latin 58
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
282
 
11.3%
121
 
4.9%
107
 
4.3%
100
 
4.0%
95
 
3.8%
89
 
3.6%
65
 
2.6%
61
 
2.5%
58
 
2.3%
50
 
2.0%
Other values (297) 1461
58.7%
Latin
ValueCountFrequency (%)
e 6
 
10.3%
o 4
 
6.9%
B 4
 
6.9%
r 4
 
6.9%
L 3
 
5.2%
a 3
 
5.2%
n 3
 
5.2%
O 2
 
3.4%
k 2
 
3.4%
p 2
 
3.4%
Other values (19) 25
43.1%
Common
ValueCountFrequency (%)
( 197
39.6%
) 197
39.6%
98
19.7%
1 2
 
0.4%
_ 2
 
0.4%
- 1
 
0.2%
& 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2489
81.7%
ASCII 556
 
18.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
282
 
11.3%
121
 
4.9%
107
 
4.3%
100
 
4.0%
95
 
3.8%
89
 
3.6%
65
 
2.6%
61
 
2.5%
58
 
2.3%
50
 
2.0%
Other values (297) 1461
58.7%
ASCII
ValueCountFrequency (%)
( 197
35.4%
) 197
35.4%
98
17.6%
e 6
 
1.1%
o 4
 
0.7%
B 4
 
0.7%
r 4
 
0.7%
L 3
 
0.5%
a 3
 
0.5%
n 3
 
0.5%
Other values (26) 37
 
6.7%
Distinct315
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-04-21T22:05:06.548119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length32
Mean length23.58046
Min length10

Characters and Unicode

Total characters8206
Distinct characters296
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique295 ?
Unique (%)84.8%

Sample

1st row전라북도 익산시 오산면 오산로 166
2nd row서울특별시 성동구 뚝섬로 377 (성수동2가)
3rd row서울특별시 서대문구 충정로3가
4th row대구광역시 달서구 대천동
5th row충청남도 천안시 서북구 3공단4로 49 (성성동)
ValueCountFrequency (%)
경기도 124
 
7.1%
서울특별시 104
 
6.0%
서울 28
 
1.6%
경기 25
 
1.4%
강남구 25
 
1.4%
성남시 21
 
1.2%
금천구 21
 
1.2%
가산동 20
 
1.1%
강서구 18
 
1.0%
안양시 17
 
1.0%
Other values (737) 1342
76.9%
2024-04-21T22:05:07.879750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1466
 
17.9%
320
 
3.9%
316
 
3.9%
312
 
3.8%
300
 
3.7%
( 251
 
3.1%
) 251
 
3.1%
1 233
 
2.8%
204
 
2.5%
169
 
2.1%
Other values (286) 4384
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5096
62.1%
Space Separator 1466
 
17.9%
Decimal Number 1086
 
13.2%
Open Punctuation 251
 
3.1%
Close Punctuation 251
 
3.1%
Dash Punctuation 38
 
0.5%
Other Punctuation 14
 
0.2%
Uppercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
320
 
6.3%
316
 
6.2%
312
 
6.1%
300
 
5.9%
204
 
4.0%
169
 
3.3%
167
 
3.3%
163
 
3.2%
138
 
2.7%
136
 
2.7%
Other values (266) 2871
56.3%
Decimal Number
ValueCountFrequency (%)
1 233
21.5%
2 157
14.5%
4 113
10.4%
3 106
9.8%
5 105
9.7%
7 88
 
8.1%
6 86
 
7.9%
8 71
 
6.5%
0 65
 
6.0%
9 62
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
K 1
25.0%
T 1
25.0%
R 1
25.0%
S 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 13
92.9%
. 1
 
7.1%
Space Separator
ValueCountFrequency (%)
1466
100.0%
Open Punctuation
ValueCountFrequency (%)
( 251
100.0%
Close Punctuation
ValueCountFrequency (%)
) 251
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5096
62.1%
Common 3106
37.9%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
320
 
6.3%
316
 
6.2%
312
 
6.1%
300
 
5.9%
204
 
4.0%
169
 
3.3%
167
 
3.3%
163
 
3.2%
138
 
2.7%
136
 
2.7%
Other values (266) 2871
56.3%
Common
ValueCountFrequency (%)
1466
47.2%
( 251
 
8.1%
) 251
 
8.1%
1 233
 
7.5%
2 157
 
5.1%
4 113
 
3.6%
3 106
 
3.4%
5 105
 
3.4%
7 88
 
2.8%
6 86
 
2.8%
Other values (6) 250
 
8.0%
Latin
ValueCountFrequency (%)
K 1
25.0%
T 1
25.0%
R 1
25.0%
S 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5096
62.1%
ASCII 3110
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1466
47.1%
( 251
 
8.1%
) 251
 
8.1%
1 233
 
7.5%
2 157
 
5.0%
4 113
 
3.6%
3 106
 
3.4%
5 105
 
3.4%
7 88
 
2.8%
6 86
 
2.8%
Other values (10) 254
 
8.2%
Hangul
ValueCountFrequency (%)
320
 
6.3%
316
 
6.2%
312
 
6.1%
300
 
5.9%
204
 
4.0%
169
 
3.3%
167
 
3.3%
163
 
3.2%
138
 
2.7%
136
 
2.7%
Other values (266) 2871
56.3%

우편번호
Real number (ℝ)

Distinct287
Distinct (%)82.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52187.822
Minimum1853
Maximum704801
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2024-04-21T22:05:08.118864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1853
5-th percentile4511
Q18500.75
median14055.5
Q331446.75
95-th percentile422419.85
Maximum704801
Range702948
Interquartile range (IQR)22946

Descriptive statistics

Standard deviation110227.4
Coefficient of variation (CV)2.1121288
Kurtosis10.543234
Mean52187.822
Median Absolute Deviation (MAD)6799.5
Skewness3.3040698
Sum18161362
Variance1.215008 × 1010
MonotonicityNot monotonic
2024-04-21T22:05:08.371452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16226 9
 
2.6%
10048 6
 
1.7%
12930 4
 
1.1%
14059 4
 
1.1%
8505 3
 
0.9%
8510 3
 
0.9%
4370 3
 
0.9%
4366 3
 
0.9%
4511 3
 
0.9%
32508 3
 
0.9%
Other values (277) 307
88.2%
ValueCountFrequency (%)
1853 1
 
0.3%
2859 1
 
0.3%
3449 1
 
0.3%
4023 1
 
0.3%
4146 1
 
0.3%
4344 1
 
0.3%
4366 3
0.9%
4367 1
 
0.3%
4368 2
0.6%
4370 3
0.9%
ValueCountFrequency (%)
704801 1
0.3%
506501 1
0.3%
472868 1
0.3%
472841 1
0.3%
465250 1
0.3%
464865 1
0.3%
463827 1
0.3%
462726 1
0.3%
462120 1
0.3%
449854 1
0.3%

승인일
Date

MISSING 

Distinct85
Distinct (%)28.3%
Missing48
Missing (%)13.8%
Memory size2.8 KiB
Minimum2022-01-05 00:00:00
Maximum2024-01-03 00:00:00
2024-04-21T22:05:08.742686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T22:05:09.167399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-04-21T22:05:01.728754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T22:05:09.428021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호승인일
우편번호1.0000.000
승인일0.0001.000

Missing values

2024-04-21T22:05:02.075796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T22:05:02.388056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

의무이행 년도업체명도로명주소우편번호승인일
02022(유)홍석전라북도 익산시 오산면 오산로 16654670<NA>
12022(주) 이마트서울특별시 성동구 뚝섬로 377 (성수동2가)47812022-02-08
22022(주) 지멘스서울특별시 서대문구 충정로3가1200132022-02-08
32022(주) 한성피앤아이대구광역시 달서구 대천동7048012022-02-23
42022(주)ABB코리아충청남도 천안시 서북구 3공단4로 49 (성성동)310932022-02-09
52022(주)거성디지털서울 광진구 광나루로56길 85 (구의동)51162022-03-29
62022(주)고고런경기도 용인시 기흥구 동백죽전대로 444 (중동)170062022-01-13
72022(주)교원프라퍼티서울특별시 중구 을지로 51 (을지로2가)45392022-02-10
82022(주)그린쿨텍경기도 하남시 초광산단로 106 (광암동)129892022-02-03
92022(주)글로벌대구광역시 남구 현충로6길 25-18(대명동, 삼삼빌)424532022-02-23
의무이행 년도업체명도로명주소우편번호승인일
3382022한국환경공단 본사11경기도 김포시 장릉로 56 (풍무동, 김포길훈아파트)101102022-06-22
3392022한국후지필름(주)서울특별시 금천구 가산디지털1로 222 (가산동)85022022-02-08
3402022헬스에어테크놀로지코리아(주)서울특별시 강남구 선릉로158길 7 (청담동)60142022-04-22
3412022현대렌탈케어서울특별시 강동구 올림픽로1348772022-02-11
3422022호시자키한국 주식회사서울 강서구 강서로56가길 55 (등촌동)75832023-02-28
3432022호시자키한국 주식회사서울특별시 강서구 강서로 468 (등촌동)75732022-04-12
3442022효성티앤에스(주)구미공장경상북도 구미시 옥계2공단로 179-15 (구포동)394162022-02-23
3452022후지필름일렉트로닉이미징코리아 주식회사서울특별시 강남구 선릉로 838 (청담동)60142022-02-10
3462022흥신금속공업(주)인천광역시 남동구 청능대로4058182022-02-11
3472022히타치하이테크코리아 주식회사경기 성남시 분당구 정자일로 155 (정자동)135572022-01-26