Overview

Dataset statistics

Number of variables: 5
Number of observations: 1030
Missing cells: 10
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 43.4 KiB
Average record size in memory: 43.1 B
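
The percentage and per-record figures above follow from the raw counts. The sketch below is a plausibility check only: it recomputes them from the rounded values shown in this table, so the last digit can differ from what the profiler derived from exact byte counts.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 1030
missing_cells = 10
total_size_kib = 43.4  # already rounded in the report

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```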

Variable types

Categorical: 1
Text: 2
Numeric: 2

Dataset

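Description: Provides shipment and import performance data for electrical and electronic products from the recycling system for electrical/electronic products and automobiles. Columns: 실적년도 (performance year), 업체 명 (company name), 업체 도로명 주소 (company road-name address), 우편번호 (postal code), 연간 매출 금액 (annual sales amount).
Author: 환경부 (Ministry of Environment)
URL: https://www.data.go.kr/data/15092525/fileData.do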

Alerts

실적년도 (performance year) has constant value "2022" [Constant]
연간 매출 금액 (annual sales amount) is highly skewed (γ1 = 29.82857763) [Skewed]
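
The skewness alert is driven by the γ1 statistic. The report does not state which estimator it uses; the sketch below assumes the bias-adjusted Fisher-Pearson coefficient, which is what pandas `Series.skew` computes. The function name and the toy data are hypothetical and are not taken from the dataset.

```python
import math


def adjusted_skewness(values):
    """Bias-adjusted Fisher-Pearson sample skewness (often written G1)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three observations")
    mean = sum(values) / n
    # Second and third central moments (population form).
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor.
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)


# Toy data: one large value among small ones gives a long right tail.
toy_values = [1, 1, 1, 10]
print(adjusted_skewness(toy_values))
```

A positive result indicates a long right tail. A value near 30, as reported for 연간 매출 금액, means a handful of very large values dominate the distribution.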

Reproduction

Analysis started: 2024-04-06 08:56:45.944692
Analysis finished: 2024-04-06 08:56:48.029368
Duration: 2.08 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

실적년도 (performance year)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB
Value counts: 2022 (1030)

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 1030 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


업체 명 (company name)
Text

Distinct: 1029
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Length

Max length: 22
Median length: 19
Mean length: 8.3223301
Min length: 2

Characters and Unicode

Total characters: 8572
Distinct characters: 482
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1028
Unique (%): 99.8%

Sample

1st row: (유) 워터스코리아
2nd row: (유)그룹세브코리아
3rd row: (주) DX
4th row: (주) 교원프라퍼티
5th row: (주) 대연시스템
Value | Count | Frequency (%)
주식회사 | 206 | 15.9%
유한회사 | 18 | 1.4%
(value missing in source) | 14 | 1.1%
코리아 | 3 | 0.2%
아산공장 | 2 | 0.2%
쿠쿠홈시스 | 2 | 0.2%
아이엠 | 2 | 0.2%
유한책임회사 | 2 | 0.2%
오엠에이곰 | 1 | 0.1%
위더스컴퓨터(주 | 1 | 0.1%
Other values (1048) | 1048 | 80.7%

Most occurring characters

ValueCountFrequency (%)
845
 
9.9%
) 585
 
6.8%
( 584
 
6.8%
331
 
3.9%
310
 
3.6%
286
 
3.3%
276
 
3.2%
271
 
3.2%
259
 
3.0%
231
 
2.7%
Other values (472) 4594
53.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7042 | 82.2%
Close Punctuation | 585 | 6.8%
Open Punctuation | 584 | 6.8%
Space Separator | 271 | 3.2%
Uppercase Letter | 48 | 0.6%
Lowercase Letter | 22 | 0.3%
Other Punctuation | 9 | 0.1%
Decimal Number | 4 | < 0.1%
Connector Punctuation | 3 | < 0.1%
Dash Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
845
 
12.0%
331
 
4.7%
310
 
4.4%
286
 
4.1%
276
 
3.9%
259
 
3.7%
231
 
3.3%
203
 
2.9%
199
 
2.8%
145
 
2.1%
Other values (429) 3957
56.2%
Uppercase Letter
ValueCountFrequency (%)
L 6
12.5%
S 4
 
8.3%
G 3
 
6.2%
N 3
 
6.2%
A 3
 
6.2%
T 3
 
6.2%
O 3
 
6.2%
I 3
 
6.2%
B 3
 
6.2%
C 3
 
6.2%
Other values (9) 14
29.2%
Lowercase Letter
ValueCountFrequency (%)
e 3
13.6%
m 3
13.6%
o 2
9.1%
u 2
9.1%
s 2
9.1%
a 2
9.1%
l 2
9.1%
y 1
 
4.5%
g 1
 
4.5%
n 1
 
4.5%
Other values (3) 3
13.6%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 1
25.0%
6 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 7
77.8%
, 2
 
22.2%
Close Punctuation
ValueCountFrequency (%)
) 585
100.0%
Open Punctuation
ValueCountFrequency (%)
( 584
100.0%
Space Separator
ValueCountFrequency (%)
271
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7044 | 82.2%
Common | 1458 | 17.0%
Latin | 70 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
845
 
12.0%
331
 
4.7%
310
 
4.4%
286
 
4.1%
276
 
3.9%
259
 
3.7%
231
 
3.3%
203
 
2.9%
199
 
2.8%
145
 
2.1%
Other values (430) 3959
56.2%
Latin
ValueCountFrequency (%)
L 6
 
8.6%
S 4
 
5.7%
G 3
 
4.3%
N 3
 
4.3%
A 3
 
4.3%
T 3
 
4.3%
O 3
 
4.3%
I 3
 
4.3%
e 3
 
4.3%
B 3
 
4.3%
Other values (22) 36
51.4%
Common
ValueCountFrequency (%)
) 585
40.1%
( 584
40.1%
271
18.6%
. 7
 
0.5%
_ 3
 
0.2%
- 2
 
0.1%
, 2
 
0.1%
2 2
 
0.1%
1 1
 
0.1%
6 1
 
0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7042 | 82.2%
ASCII | 1528 | 17.8%
None | 2 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
845
 
12.0%
331
 
4.7%
310
 
4.4%
286
 
4.1%
276
 
3.9%
259
 
3.7%
231
 
3.3%
203
 
2.9%
199
 
2.8%
145
 
2.1%
Other values (429) 3957
56.2%
ASCII
ValueCountFrequency (%)
) 585
38.3%
( 584
38.2%
271
17.7%
. 7
 
0.5%
L 6
 
0.4%
S 4
 
0.3%
_ 3
 
0.2%
G 3
 
0.2%
N 3
 
0.2%
A 3
 
0.2%
Other values (32) 59
 
3.9%
None
ValueCountFrequency (%)
2
100.0%

업체 도로명 주소 (company road-name address)
Text

Distinct: 958
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Memory size: 8.2 KiB

Length

Max length: 44
Median length: 39
Mean length: 23.520388
Min length: 10

Characters and Unicode

Total characters: 24226
Distinct characters: 390
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 891
Unique (%): 86.5%

Sample

1st row: 서울특별시 영등포구 여의공원로 101 (여의도동)
2nd row: 서울특별시 종로구 종로1길 50 (중학동)
3rd row: 경기도 시흥시 배미골길 24 (목감동)
4th row: 서울 중구 을지로 51 (을지로2가)
5th row: 서울 성동구 광나루로8길 10 (성수동2가)
Value | Count | Frequency (%)
서울특별시 | 309 | 6.0%
경기도 | 308 | 6.0%
경기 | 104 | 2.0%
서울 | 94 | 1.8%
강남구 | 85 | 1.7%
성남시 | 51 | 1.0%
금천구 | 46 | 0.9%
가산동 | 42 | 0.8%
강서구 | 41 | 0.8%
인천광역시 | 39 | 0.8%
Other values (1775) | 4028 | 78.3%

Most occurring characters

ValueCountFrequency (%)
4345
 
17.9%
967
 
4.0%
905
 
3.7%
890
 
3.7%
794
 
3.3%
) 729
 
3.0%
( 729
 
3.0%
1 683
 
2.8%
608
 
2.5%
2 518
 
2.1%
Other values (380) 13058
53.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 14922 | 61.6%
Space Separator | 4345 | 17.9%
Decimal Number | 3316 | 13.7%
Close Punctuation | 729 | 3.0%
Open Punctuation | 729 | 3.0%
Dash Punctuation | 138 | 0.6%
Other Punctuation | 39 | 0.2%
Uppercase Letter | 8 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
967
 
6.5%
905
 
6.1%
890
 
6.0%
794
 
5.3%
608
 
4.1%
459
 
3.1%
446
 
3.0%
430
 
2.9%
425
 
2.8%
411
 
2.8%
Other values (357) 8587
57.5%
Decimal Number
ValueCountFrequency (%)
1 683
20.6%
2 518
15.6%
3 352
10.6%
5 310
9.3%
4 304
9.2%
6 256
 
7.7%
7 243
 
7.3%
8 239
 
7.2%
9 207
 
6.2%
0 204
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
K 2
25.0%
T 1
12.5%
W 1
12.5%
E 1
12.5%
I 1
12.5%
V 1
12.5%
S 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 38
97.4%
. 1
 
2.6%
Space Separator
ValueCountFrequency (%)
4345
100.0%
Close Punctuation
ValueCountFrequency (%)
) 729
100.0%
Open Punctuation
ValueCountFrequency (%)
( 729
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 138
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 14922 | 61.6%
Common | 9296 | 38.4%
Latin | 8 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
967
 
6.5%
905
 
6.1%
890
 
6.0%
794
 
5.3%
608
 
4.1%
459
 
3.1%
446
 
3.0%
430
 
2.9%
425
 
2.8%
411
 
2.8%
Other values (357) 8587
57.5%
Common
ValueCountFrequency (%)
4345
46.7%
) 729
 
7.8%
( 729
 
7.8%
1 683
 
7.3%
2 518
 
5.6%
3 352
 
3.8%
5 310
 
3.3%
4 304
 
3.3%
6 256
 
2.8%
7 243
 
2.6%
Other values (6) 827
 
8.9%
Latin
ValueCountFrequency (%)
K 2
25.0%
T 1
12.5%
W 1
12.5%
E 1
12.5%
I 1
12.5%
V 1
12.5%
S 1
12.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 14922 | 61.6%
ASCII | 9304 | 38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4345
46.7%
) 729
 
7.8%
( 729
 
7.8%
1 683
 
7.3%
2 518
 
5.6%
3 352
 
3.8%
5 310
 
3.3%
4 304
 
3.3%
6 256
 
2.8%
7 243
 
2.6%
Other values (13) 835
 
9.0%
Hangul
ValueCountFrequency (%)
967
 
6.5%
905
 
6.1%
890
 
6.0%
794
 
5.3%
608
 
4.1%
459
 
3.1%
446
 
3.0%
430
 
2.9%
425
 
2.8%
411
 
2.8%
Other values (357) 8587
57.5%

우편번호 (postal code)
Real number (ℝ)

Distinct: 785
Distinct (%): 76.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 53584.991
Minimum: 1722
Maximum: 642315
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.2 KiB

Quantile statistics

Minimum: 1722
5-th percentile: 4379
Q1: 7418
Median: 13403
Q3: 30804.75
95-th percentile: 414968.55
Maximum: 642315
Range: 640593
Interquartile range (IQR): 23386.75

Descriptive statistics

Standard deviation: 114209.28
Coefficient of variation (CV): 2.131367
Kurtosis: 8.4592075
Mean: 53584.991
Median Absolute Deviation (MAD): 6834
Skewness: 3.0756227
Sum: 55192541
Variance: 1.304376 × 10^10
Monotonicity: Not monotonic
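
Several of the 우편번호 statistics are derived from others in the same tables: the range from the extremes, the IQR from the quartiles, the CV from the standard deviation and mean, and the variance from the standard deviation. The sketch below recomputes them from the rounded figures printed in the report, so agreement is expected only to the displayed precision.

```python
# Values copied from the quantile and descriptive statistics tables.
minimum, maximum = 1722, 642315
q1, q3 = 7418, 30804.75
mean, std = 53584.991, 114209.28

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(f"range: {value_range}")
print(f"IQR: {iqr}")
print(f"CV: {cv:.6f}")
print(f"variance: {variance:.6e}")
```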
Histogram with fixed size bins (bins=50)
Common Values

Value | Count | Frequency (%)
10048 | 8 | 0.8%
10049 | 8 | 0.8%
5836 | 6 | 0.6%
8501 | 5 | 0.5%
13494 | 5 | 0.5%
18469 | 5 | 0.5%
4370 | 5 | 0.5%
14067 | 4 | 0.4%
8592 | 4 | 0.4%
10594 | 4 | 0.4%
Other values (775) | 976 | 94.8%

Minimum 10 values

Value | Count | Frequency (%)
1722 | 1 | 0.1%
2509 | 1 | 0.1%
2584 | 1 | 0.1%
2633 | 1 | 0.1%
2642 | 1 | 0.1%
2787 | 1 | 0.1%
2811 | 1 | 0.1%
2859 | 1 | 0.1%
3007 | 1 | 0.1%
3104 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
642315 | 1 | 0.1%
626210 | 1 | 0.1%
621844 | 1 | 0.1%
621842 | 1 | 0.1%
540813 | 1 | 0.1%
487914 | 1 | 0.1%
487911 | 1 | 0.1%
483080 | 1 | 0.1%
482150 | 1 | 0.1%
480819 | 1 | 0.1%
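
Because 우편번호 was profiled as a number, four-digit values such as 1722 are most likely five-digit Korean postal codes whose leading zero was dropped, and the six-digit values look like codes in the older six-digit format. The sketch below shows one way to turn the numbers back into code strings under that assumption. The function name and the six-digit cutoff are hypothetical choices, not something the report specifies.

```python
def format_postal_code(value):
    """Return a postal code string, restoring a dropped leading zero.

    Assumes values below 100000 are five-digit codes and values with
    six digits are old-format codes that should be left unchanged.
    """
    code = int(value)
    if code < 0 or code > 999_999:
        raise ValueError(f"not a plausible postal code: {value!r}")
    if code >= 100_000:
        return str(code)
    return f"{code:05d}"


for raw in (1722, 7241, 13494, 642315):
    print(raw, "->", format_postal_code(raw))
```

Treating the column as text in this way would also make the mean, skewness and correlation figures for 우편번호 moot, since arithmetic on postal codes has no natural interpretation.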

연간 매출 금액 (annual sales amount)
Real number (ℝ)

SKEWED

Distinct: 1006
Distinct (%): 98.6%
Missing: 10
Missing (%): 1.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.2501778 × 10^11
Minimum: 1
Maximum: 3.0223 × 10^14
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1.9932684 × 10^8
Q1: 8.2832825 × 10^8
Median: 2.6861542 × 10^9
Q3: 1.1977856 × 10^10
95-th percentile: 1.414019 × 10^11
Maximum: 3.0223 × 10^14
Range: 3.0223 × 10^14
Interquartile range (IQR): 1.1149528 × 10^10

Descriptive statistics

Standard deviation: 9.7100862 × 10^12
Coefficient of variation (CV): 22.846306
Kurtosis: 920.22662
Mean: 4.2501778 × 10^11
Median Absolute Deviation (MAD): 2.269722 × 10^9
Skewness: 29.828578
Sum: 4.3351814 × 10^14
Variance: 9.4285774 × 10^25
Monotonicity: Not monotonic
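
The reported mean of 연간 매출 금액 is consistent with dividing the sum by the 1020 non-missing values rather than by all 1030 rows. The sketch below checks this with the rounded figures from the report; the row and missing counts are the ones listed for this variable.

```python
# Values copied from the report (rounded as displayed).
n_observations = 1030
n_missing = 10
total = 4.3351814e14          # "Sum"
reported_mean = 4.2501778e11  # "Mean"

n_valid = n_observations - n_missing
mean_over_valid = total / n_valid        # missing rows excluded
mean_over_all = total / n_observations   # missing rows counted

print(f"mean over {n_valid} valid rows: {mean_over_valid:.7e}")
print(f"mean over all {n_observations} rows: {mean_over_all:.7e}")
```

Only the first figure agrees with the reported mean. The median (2.6861542 × 10^9) is more than two orders of magnitude smaller than that mean, which is another sign of the heavy right tail.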
Histogram with fixed size bins (bins=50)
Common Values

Value | Count | Frequency (%)
2000000000 | 4 | 0.4%
1000000000 | 3 | 0.3%
60000000 | 2 | 0.2%
1300000000 | 2 | 0.2%
480000000 | 2 | 0.2%
16035885042 | 2 | 0.2%
4300000000 | 2 | 0.2%
1400000000 | 2 | 0.2%
600000000 | 2 | 0.2%
6000000000 | 2 | 0.2%
Other values (996) | 997 | 96.8%
(Missing) | 10 | 1.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2000 | 1 | 0.1%
551792 | 1 | 0.1%
1000000 | 1 | 0.1%
1200000 | 1 | 0.1%
3000000 | 1 | 0.1%
4847825 | 1 | 0.1%
5031756 | 1 | 0.1%
10504083 | 1 | 0.1%
16513835 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
302230000000000 | 1 | 0.1%
64714800000000 | 1 | 0.1%
26151800000000 | 1 | 0.1%
4561210000000 | 1 | 0.1%
2847120000000 | 1 | 0.1%
1556910000000 | 1 | 0.1%
1425520000000 | 1 | 0.1%
1077320000000 | 1 | 0.1%
1028900000000 | 1 | 0.1%
1002300000000 | 1 | 0.1%

Interactions


Correlations

Variable | 우편번호 | 연간 매출 금액
우편번호 | 1.000 | 0.000
연간 매출 금액 | 0.000 | 1.000

Variable | 우편번호 | 연간 매출 금액
우편번호 | 1.000 | 0.022
연간 매출 금액 | 0.022 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 실적년도 | 업체 명 | 업체 도로명 주소 | 우편번호 | 연간 매출 금액
0 | 2022 | (유) 워터스코리아 | 서울특별시 영등포구 여의공원로 101 (여의도동) | 7241 | 1042929653
1 | 2022 | (유)그룹세브코리아 | 서울특별시 종로구 종로1길 50 (중학동) | 3142 | 49055207098
2 | 2022 | (주) DX | 경기도 시흥시 배미골길 24 (목감동) | 14985 | 4300000000
3 | 2022 | (주) 교원프라퍼티 | 서울 중구 을지로 51 (을지로2가) | 4539 | 214606000000
4 | 2022 | (주) 대연시스템 | 서울 성동구 광나루로8길 10 (성수동2가) | 4799 | 4496000000
5 | 2022 | (주) 세인홈시스 | 대전광역시 동구 하소로 66 (하소동) | 34712 | 30005972474
6 | 2022 | (주) 아이디스 | 대전광역시 유성구 테크노3로 8-10 (관평동) | 34012 | 189866000000
7 | 2022 | (주) 에스에너지 | 경기 성남시 분당구 판교역로241번길 20 (삼평동) | 13494 | 94670806097
8 | 2022 | (주) 이마트 | 서울특별시 성동구 뚝섬로 377 (성수동2가) | 4781 | 16035885042
9 | 2022 | (주) 이피씨 | 경기도 수원시 영통구 삼성로 274 (원천동) | 16522 | 521257990

Last rows

Index | 실적년도 | 업체 명 | 업체 도로명 주소 | 우편번호 | 연간 매출 금액
1020 | 2022 | 현일 LAB-MATE | 서울특별시 마포구 성미산로 | 121846 | 241774236
1021 | 2022 | 호시자키한국 주식회사 | 서울 강서구 강서로56가길 55 (등촌동) | 7583 | 448924707
1022 | 2022 | 홍진테크주식회사 | 경기도 시흥시 윗대야2길 10 (대야동) | 14900 | 30879305014
1023 | 2022 | 효성ITX | 서울특별시 영등포구 선유동2로 57 (양평동4가) | 7212 | 20034135095
1024 | 2022 | 후지일렉트릭(주) | 서울특별시 서초구 사임당로 151 (서초동) | 6624 | 1478847291
1025 | 2022 | 후지전기코리아 | 서울특별시 영등포구 여의나루로 67 (여의도동) | 7327 | 600000000
1026 | 2022 | 후지필름일렉트로닉이미징코리아 주식회사 | 서울특별시 강남구 선릉로 838 (청담동) | 6014 | 713479600
1027 | 2022 | 휘슬러코리아(주) | 서울특별시 강남구 강남대로 556 (논현동) | 6044 | 1305260080
1028 | 2022 | 흥신금속공업(주) | 인천광역시 남동구 청능대로 | 405818 | 2200000000
1029 | 2022 | 히타치하이테크코리아 주식회사 | 경기 성남시 분당구 정자일로 155 (정자동) | 13557 | 1042175796