Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 478.5 KiB
Average record size in memory: 49.0 B

Variable types

Numeric: 1
Text: 3
DateTime: 1

Dataset

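Description: Provides vehicle hazardous-substance compliance declaration information from the recycling system for electrical and electronic products and automobiles. Columns: 준수선언번호 (compliance declaration number), 준수선언업체 (declaring company), 제품명 (product name), 최초수입업체명 (first importer name), 승인일자 (approval date).
URL: https://www.data.go.kr/data/15092535/fileData.do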

Alerts

준수선언번호 has unique values (Unique)
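
The Unique alert amounts to a distinct-count check: no value occurs more than once. A minimal sketch using only the standard library follows; the helper name `is_unique` is hypothetical (not part of ydata-profiling), and the values are a handful of 준수선언번호 entries taken from this report's sample rows.

```python
def is_unique(values):
    """Return True when no value occurs more than once."""
    values = list(values)
    return len(set(values)) == len(values)

# A few 준수선언번호 values from the report's sample rows.
declaration_numbers = [111109, 114096, 108214, 33925, 122465]
print(is_unique(declaration_numbers))
```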

Reproduction

Analysis started: 2023-12-12 14:49:56.793940
Analysis finished: 2023-12-12 14:49:57.698744
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
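
A report of this kind could be regenerated roughly as sketched below. This is an illustration under stated assumptions, not the exact procedure used here: the function name `build_report`, the CSV path argument, and the output file name are hypothetical, and the source file may need an explicit `encoding=` argument if it is not UTF-8.

```python
def build_report(csv_path, output_path="report.html"):
    """Profile a CSV export of the dataset and write an HTML report."""
    # Imported lazily so the sketch can be read without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the data is a CSV with an 승인일자 column holding dates.
    df = pd.read_csv(csv_path, parse_dates=["승인일자"])

    report = ProfileReport(
        df,
        title="Profiling report",
        # Dataset metadata shown in the report's Overview section.
        dataset={"url": "https://www.data.go.kr/data/15092535/fileData.do"},
    )
    report.to_file(output_path)
    return report
```

The imports sit inside the function only so that defining it has no side effects; in a real script they would normally go at the top of the file.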

Variables

준수선언번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 96752.626
Minimum: 11305
Maximum: 123069
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 11305
5-th percentile: 33478.95
Q1: 84951.25
Median: 109483.5
Q3: 116708.25
95-th percentile: 121586.05
Maximum: 123069
Range: 111764
Interquartile range (IQR): 31757
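
The fractional percentiles above (33478.95, for example) suggest interpolation between neighbouring order statistics. The sketch below shows linear interpolation, which is the default in pandas; that the report used exactly this method is an assumption, and the toy data is made up for illustration.

```python
def quantile(values, q):
    """Quantile with linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)      # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])

toy = [10, 20, 30, 40, 50, 60, 70, 80]
q1, q3 = quantile(toy, 0.25), quantile(toy, 0.75)
print(q1, q3, q3 - q1)  # Q1, Q3, and the interquartile range
```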

Descriptive statistics

Standard deviation: 29240.447
Coefficient of variation (CV): 0.30221864
Kurtosis: 0.45821727
Mean: 96752.626
Median Absolute Deviation (MAD): 8906.5
Skewness: -1.3471008
Sum: 9.6752626 × 10^8
Variance: 8.5500373 × 10^8
Monotonicity: Not monotonic
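
Several of these figures follow from others in the report, which gives a quick consistency check. The sketch below recomputes the coefficient of variation, range, IQR, sum, and variance from the reported mean, standard deviation, extremes, and quartiles. Small differences in the last digit are expected because the inputs are already rounded.

```python
import math

# Values as reported for 준수선언번호.
mean, std = 96752.626, 29240.447
minimum, maximum = 11305, 123069
q1, q3 = 84951.25, 116708.25
n = 10000

cv = std / mean                  # coefficient of variation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range
total = mean * n                 # sum of all values
variance = std ** 2              # variance

print(cv, value_range, iqr, total, variance)
```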
Histogram with fixed size bins (bins=50)
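
With fixed-size bins, the span between the minimum and maximum is divided into equal-width intervals. A minimal sketch of the bin edges for this variable, assuming the 50 bins run from the reported minimum (11305) to the reported maximum (123069); the helper name `fixed_width_edges` is hypothetical.

```python
def fixed_width_edges(minimum, maximum, bins):
    """Return bins + 1 equally spaced edges from minimum to maximum."""
    width = (maximum - minimum) / bins
    return [minimum + i * width for i in range(bins + 1)]

edges = fixed_width_edges(11305, 123069, 50)
print(len(edges), edges[1] - edges[0])  # number of edges and the bin width
```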
Value | Count | Frequency (%)
111109 | 1 | < 0.1%
119164 | 1 | < 0.1%
108323 | 1 | < 0.1%
45405 | 1 | < 0.1%
115338 | 1 | < 0.1%
104760 | 1 | < 0.1%
108242 | 1 | < 0.1%
33974 | 1 | < 0.1%
115893 | 1 | < 0.1%
33474 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Value | Count | Frequency (%)
11305 | 1 | < 0.1%
11306 | 1 | < 0.1%
11307 | 1 | < 0.1%
11308 | 1 | < 0.1%
11309 | 1 | < 0.1%
11310 | 1 | < 0.1%
11311 | 1 | < 0.1%
11312 | 1 | < 0.1%
11314 | 1 | < 0.1%
11433 | 1 | < 0.1%

Value | Count | Frequency (%)
123069 | 1 | < 0.1%
123063 | 1 | < 0.1%
123057 | 1 | < 0.1%
123047 | 1 | < 0.1%
123046 | 1 | < 0.1%
123034 | 1 | < 0.1%
123033 | 1 | < 0.1%
123032 | 1 | < 0.1%
123031 | 1 | < 0.1%
123028 | 1 | < 0.1%

준수선언업체
Text

Distinct: 107
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 17
Median length: 16
Mean length: 8.6996
Min length: 2

Characters and Unicode

Total characters: 86996
Distinct characters: 194
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
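
As an illustration of such a property, the general category of each character can be read with Python's standard `unicodedata` module. The sketch below counts categories in one sample value from this column; the helper name `category_counts` is hypothetical. The two-letter codes map to the names used in this report, for example `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation. Script and block properties are not exposed by `unicodedata`, so they are not covered here.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Ps') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("비엠더블유코리아(주)")
print(dict(counts))
```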

Unique

Unique: 30
Unique (%): 0.3%

Sample

1st row: 한국닛산
2nd row: 포드세일즈서비스코리아(유)
3rd row: 비엠더블유코리아(주)
4th row: 현대자동차
5th row: 기아자동차
Value | Count | Frequency (%)
비엠더블유코리아(주 | 2533 | 23.1%
현대자동차 | 1908 | 17.4%
기아자동차 | 1533 | 14.0%
주식회사 | 911 | 8.3%
메르세데스벤츠코리아(주 | 771 | 7.0%
포르쉐코리아(주 | 576 | 5.3%
폭스바겐그룹코리아 | 334 | 3.0%
포드세일즈서비스코리아(유 | 251 | 2.3%
주)볼보자동차코리아 | 228 | 2.1%
에프엠케이 | 189 | 1.7%
Other values (103) | 1721 | 15.7%

Most occurring characters

ValueCountFrequency (%)
6899
 
7.9%
5657
 
6.5%
5384
 
6.2%
5243
 
6.0%
) 4886
 
5.6%
( 4886
 
5.6%
3827
 
4.4%
3816
 
4.4%
3815
 
4.4%
2856
 
3.3%
Other values (184) 39727
45.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 75837 | 87.2%
Close Punctuation | 4886 | 5.6%
Open Punctuation | 4886 | 5.6%
Space Separator | 985 | 1.1%
Uppercase Letter | 396 | 0.5%
Decimal Number | 5 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6899
 
9.1%
5657
 
7.5%
5384
 
7.1%
5243
 
6.9%
3827
 
5.0%
3816
 
5.0%
3815
 
5.0%
2856
 
3.8%
2854
 
3.8%
2809
 
3.7%
Other values (160) 32677
43.1%
Uppercase Letter
ValueCountFrequency (%)
K 151
38.1%
G 141
35.6%
R 18
 
4.5%
O 18
 
4.5%
J 9
 
2.3%
E 8
 
2.0%
T 7
 
1.8%
M 7
 
1.8%
V 6
 
1.5%
S 6
 
1.5%
Other values (9) 25
 
6.3%
Close Punctuation
ValueCountFrequency (%)
) 4886
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4886
100.0%
Space Separator
ValueCountFrequency (%)
985
100.0%
Decimal Number
ValueCountFrequency (%)
4 5
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 75837 | 87.2%
Common | 10763 | 12.4%
Latin | 396 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6899
 
9.1%
5657
 
7.5%
5384
 
7.1%
5243
 
6.9%
3827
 
5.0%
3816
 
5.0%
3815
 
5.0%
2856
 
3.8%
2854
 
3.8%
2809
 
3.7%
Other values (160) 32677
43.1%
Latin
ValueCountFrequency (%)
K 151
38.1%
G 141
35.6%
R 18
 
4.5%
O 18
 
4.5%
J 9
 
2.3%
E 8
 
2.0%
T 7
 
1.8%
M 7
 
1.8%
V 6
 
1.5%
S 6
 
1.5%
Other values (9) 25
 
6.3%
Common
ValueCountFrequency (%)
) 4886
45.4%
( 4886
45.4%
985
 
9.2%
4 5
 
< 0.1%
& 1
 
< 0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 75837 | 87.2%
ASCII | 11159 | 12.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6899
 
9.1%
5657
 
7.5%
5384
 
7.1%
5243
 
6.9%
3827
 
5.0%
3816
 
5.0%
3815
 
5.0%
2856
 
3.8%
2854
 
3.8%
2809
 
3.7%
Other values (160) 32677
43.1%
ASCII
ValueCountFrequency (%)
) 4886
43.8%
( 4886
43.8%
985
 
8.8%
K 151
 
1.4%
G 141
 
1.3%
R 18
 
0.2%
O 18
 
0.2%
J 9
 
0.1%
E 8
 
0.1%
T 7
 
0.1%
Other values (14) 50
 
0.4%
제품명
Text

Distinct: 2326
Distinct (%): 23.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 50
Median length: 42
Mean length: 12.5073
Min length: 2

Characters and Unicode

Total characters: 125073
Distinct characters: 343
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1030
Unique (%): 10.3%

Sample

1st row: INFINITI QX50 2.0
2nd row: Explorer 3.0
3rd row: MINI Cooper SD
4th row: 그랜드스타랙스
5th row: K9
Value | Count | Frequency (%)
bmw | 1913 | 8.1%
mini | 473 | 2.0%
cooper | 470 | 2.0%
xdrive | 467 | 2.0%
4matic | 432 | 1.8%
coupe | 411 | 1.7%
s | 341 | 1.4%
쏘렌토 | 326 | 1.4%
d | 239 | 1.0%
볼보 | 228 | 1.0%
Other values (1514) | 18221 | 77.5%

Most occurring characters

ValueCountFrequency (%)
13625
 
10.9%
e 5038
 
4.0%
M 4278
 
3.4%
0 4008
 
3.2%
r 3466
 
2.8%
o 3323
 
2.7%
i 3183
 
2.5%
B 2729
 
2.2%
C 2557
 
2.0%
S 2524
 
2.0%
Other values (333) 80342
64.2%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 37729 | 30.2%
Lowercase Letter | 33213 | 26.6%
Other Letter | 20934 | 16.7%
Decimal Number | 15140 | 12.1%
Space Separator | 13625 | 10.9%
Open Punctuation | 1297 | 1.0%
Close Punctuation | 1292 | 1.0%
Dash Punctuation | 797 | 0.6%
Other Punctuation | 600 | 0.5%
Letter Number | 386 | 0.3%
Other values (2) | 60 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1328
 
6.3%
1035
 
4.9%
878
 
4.2%
679
 
3.2%
586
 
2.8%
572
 
2.7%
552
 
2.6%
527
 
2.5%
522
 
2.5%
501
 
2.4%
Other values (257) 13754
65.7%
Uppercase Letter
ValueCountFrequency (%)
M 4278
 
11.3%
B 2729
 
7.2%
C 2557
 
6.8%
S 2524
 
6.7%
I 2485
 
6.6%
A 2472
 
6.6%
D 2463
 
6.5%
W 2429
 
6.4%
T 2106
 
5.6%
E 1721
 
4.6%
Other values (16) 11965
31.7%
Lowercase Letter
ValueCountFrequency (%)
e 5038
15.2%
r 3466
10.4%
o 3323
 
10.0%
i 3183
 
9.6%
a 1855
 
5.6%
d 1761
 
5.3%
n 1688
 
5.1%
t 1636
 
4.9%
u 1428
 
4.3%
c 1365
 
4.1%
Other values (15) 8470
25.5%
Decimal Number
ValueCountFrequency (%)
0 4008
26.5%
5 2157
14.2%
4 1844
12.2%
3 1638
10.8%
2 1474
 
9.7%
1 1031
 
6.8%
8 896
 
5.9%
6 866
 
5.7%
7 719
 
4.7%
9 507
 
3.3%
Other Punctuation
ValueCountFrequency (%)
. 568
94.7%
, 22
 
3.7%
/ 3
 
0.5%
? 2
 
0.3%
& 2
 
0.3%
; 2
 
0.3%
# 1
 
0.2%
Letter Number
ValueCountFrequency (%)
276
71.5%
110
 
28.5%
Space Separator
ValueCountFrequency (%)
13625
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1297
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1292
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 797
100.0%
Math Symbol
ValueCountFrequency (%)
+ 57
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 71328 | 57.0%
Common | 32811 | 26.2%
Hangul | 20934 | 16.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1328
 
6.3%
1035
 
4.9%
878
 
4.2%
679
 
3.2%
586
 
2.8%
572
 
2.7%
552
 
2.6%
527
 
2.5%
522
 
2.5%
501
 
2.4%
Other values (257) 13754
65.7%
Latin
ValueCountFrequency (%)
e 5038
 
7.1%
M 4278
 
6.0%
r 3466
 
4.9%
o 3323
 
4.7%
i 3183
 
4.5%
B 2729
 
3.8%
C 2557
 
3.6%
S 2524
 
3.5%
I 2485
 
3.5%
A 2472
 
3.5%
Other values (43) 39273
55.1%
Common
ValueCountFrequency (%)
13625
41.5%
0 4008
 
12.2%
5 2157
 
6.6%
4 1844
 
5.6%
3 1638
 
5.0%
2 1474
 
4.5%
( 1297
 
4.0%
) 1292
 
3.9%
1 1031
 
3.1%
8 896
 
2.7%
Other values (13) 3549
 
10.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 103753 | 83.0%
Hangul | 20934 | 16.7%
Number Forms | 386 | 0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13625
 
13.1%
e 5038
 
4.9%
M 4278
 
4.1%
0 4008
 
3.9%
r 3466
 
3.3%
o 3323
 
3.2%
i 3183
 
3.1%
B 2729
 
2.6%
C 2557
 
2.5%
S 2524
 
2.4%
Other values (64) 59022
56.9%
Hangul
ValueCountFrequency (%)
1328
 
6.3%
1035
 
4.9%
878
 
4.2%
679
 
3.2%
586
 
2.8%
572
 
2.7%
552
 
2.6%
527
 
2.5%
522
 
2.5%
501
 
2.4%
Other values (257) 13754
65.7%
Number Forms
ValueCountFrequency (%)
276
71.5%
110
 
28.5%
최초수입업체명
Text

Distinct: 192
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 17
Median length: 15
Mean length: 8.4427
Min length: 1

Characters and Unicode

Total characters: 84427
Distinct characters: 217
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 51
Unique (%): 0.5%

Sample

1st row: 한국닛산
2nd row: 포드세일즈서비스코리아(유)
3rd row: 비엠더블유코리아(주)
4th row: 현대자동차㈜
5th row: 기아㈜
Value | Count | Frequency (%)
현대자동차㈜ | 1889 | 17.1%
비엠더블유코리아(주 | 1862 | 16.8%
기아㈜ | 758 | 6.9%
기아자동차㈜ | 683 | 6.2%
비엠더블유코리아㈜ | 670 | 6.1%
코리아 | 405 | 3.7%
메르세데스-벤츠 | 404 | 3.7%
주식회사 | 382 | 3.5%
포르쉐코리아㈜ | 305 | 2.8%
아우디폭스바겐코리아 | 289 | 2.6%
Other values (168) | 3411 | 30.8%

Most occurring characters

ValueCountFrequency (%)
6847
 
8.1%
4987
 
5.9%
4987
 
5.9%
4797
 
5.7%
3593
 
4.3%
( 3370
 
4.0%
) 3369
 
4.0%
3190
 
3.8%
3185
 
3.8%
3184
 
3.8%
Other values (207) 42918
50.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 70892 | 84.0%
Other Symbol | 4797 | 5.7%
Open Punctuation | 3370 | 4.0%
Close Punctuation | 3369 | 4.0%
Space Separator | 1173 | 1.4%
Dash Punctuation | 465 | 0.6%
Uppercase Letter | 272 | 0.3%
Lowercase Letter | 42 | < 0.1%
Control | 20 | < 0.1%
Other Punctuation | 15 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6847
 
9.7%
4987
 
7.0%
4987
 
7.0%
3593
 
5.1%
3190
 
4.5%
3185
 
4.5%
3184
 
4.5%
2873
 
4.1%
2861
 
4.0%
2789
 
3.9%
Other values (172) 32396
45.7%
Uppercase Letter
ValueCountFrequency (%)
O 37
13.6%
E 36
13.2%
R 23
 
8.5%
T 21
 
7.7%
K 20
 
7.4%
F 17
 
6.2%
A 15
 
5.5%
G 15
 
5.5%
U 15
 
5.5%
P 14
 
5.1%
Other values (10) 59
21.7%
Lowercase Letter
ValueCountFrequency (%)
o 11
26.2%
r 11
26.2%
e 9
21.4%
a 9
21.4%
d 2
 
4.8%
Other Punctuation
ValueCountFrequency (%)
/ 14
93.3%
& 1
 
6.7%
Decimal Number
ValueCountFrequency (%)
4 8
66.7%
0 4
33.3%
Other Symbol
ValueCountFrequency (%)
4797
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3370
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3369
100.0%
Space Separator
ValueCountFrequency (%)
1173
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 465
100.0%
Control
ValueCountFrequency (%)
20
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 75689 | 89.7%
Common | 8424 | 10.0%
Latin | 314 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6847
 
9.0%
4987
 
6.6%
4987
 
6.6%
4797
 
6.3%
3593
 
4.7%
3190
 
4.2%
3185
 
4.2%
3184
 
4.2%
2873
 
3.8%
2861
 
3.8%
Other values (173) 35185
46.5%
Latin
ValueCountFrequency (%)
O 37
 
11.8%
E 36
 
11.5%
R 23
 
7.3%
T 21
 
6.7%
K 20
 
6.4%
F 17
 
5.4%
A 15
 
4.8%
G 15
 
4.8%
U 15
 
4.8%
P 14
 
4.5%
Other values (15) 101
32.2%
Common
ValueCountFrequency (%)
( 3370
40.0%
) 3369
40.0%
1173
 
13.9%
- 465
 
5.5%
20
 
0.2%
/ 14
 
0.2%
4 8
 
0.1%
0 4
 
< 0.1%
& 1
 
< 0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 70892 | 84.0%
ASCII | 8738 | 10.3%
None | 4797 | 5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6847
 
9.7%
4987
 
7.0%
4987
 
7.0%
3593
 
5.1%
3190
 
4.5%
3185
 
4.5%
3184
 
4.5%
2873
 
4.1%
2861
 
4.0%
2789
 
3.9%
Other values (172) 32396
45.7%
None
ValueCountFrequency (%)
4797
100.0%
ASCII
ValueCountFrequency (%)
( 3370
38.6%
) 3369
38.6%
1173
 
13.4%
- 465
 
5.3%
O 37
 
0.4%
E 36
 
0.4%
R 23
 
0.3%
T 21
 
0.2%
K 20
 
0.2%
20
 
0.2%
Other values (24) 204
 
2.3%
승인일자
DateTime

Distinct: 784
Distinct (%): 7.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2009-06-02 00:00:00
Maximum: 2022-12-29 00:00:00
Histogram with fixed size bins (bins=50)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
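
The quantity behind both nullity plots is a per-column count of missing cells. A minimal standard-library sketch follows, assuming rows are represented as dictionaries and a missing cell as `None`; the helper name `missing_per_column` is hypothetical, and the two rows are taken from this report's sample. For this dataset every count is zero, which matches the 0 missing cells reported in the overview.

```python
def missing_per_column(rows, columns):
    """Count cells that are None (treated as missing) for each column."""
    return {
        col: sum(1 for row in rows if row.get(col) is None)
        for col in columns
    }

columns = ["준수선언번호", "준수선언업체", "제품명"]
rows = [
    {"준수선언번호": 111109, "준수선언업체": "한국닛산", "제품명": "INFINITI QX50 2.0"},
    {"준수선언번호": 122465, "준수선언업체": "기아자동차", "제품명": "K9"},
]
print(missing_per_column(rows, columns))
```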

Sample

index | 준수선언번호 | 준수선언업체 | 제품명 | 최초수입업체명 | 승인일자
5552 | 111109 | 한국닛산 | INFINITI QX50 2.0 | 한국닛산 | 2020-01-14
4371 | 114096 | 포드세일즈서비스코리아(유) | Explorer 3.0 | 포드세일즈서비스코리아(유) | 2020-12-23
6830 | 108214 | 비엠더블유코리아(주) | MINI Cooper SD | 비엠더블유코리아(주) | 2018-07-30
11538 | 33925 | 현대자동차 | 그랜드스타랙스 | 현대자동차㈜ | 2011-02-15
260 | 122465 | 기아자동차 | K9 | 기아㈜ | 2022-11-14
928 | 121163 | 비엠더블유코리아(주) | BMW 740i | 비엠더블유코리아(주) | 2022-09-07
1408 | 119956 | 기아자동차 | 쏘렌토 | 기아㈜ | 2022-07-04
515 | 121701 | 현대자동차 | 싼타페(SANTAFE) | 현대자동차㈜ | 2022-11-10
6896 | 108046 | 기아자동차 | K9 | 기아자동차㈜ | 2018-05-21
8556 | 99925 | (주)알브이모터스코리아 | 포드F150 | (주)알브이모터스코리아 | 2015-10-23

index | 준수선언번호 | 준수선언업체 | 제품명 | 최초수입업체명 | 승인일자
3716 | 115535 | 한불모터스(주) | Peugeot 508SW 1.5 BlueHDi | 한불모터스(주) | 2021-08-06
11013 | 44195 | 비엠더블유코리아(주) | BMW X1 xDrive20d | 비엠더블유코리아㈜ | 2011-09-19
11672 | 33594 | 기아자동차 | 봉고Ⅲ 내장차 | 기아자동차㈜ | 2011-02-16
1546 | 119818 | 기아자동차 | 쏘렌토 하이브리드 | 기아㈜ | 2022-07-04
11262 | 36950 | 포르쉐코리아(주) | 박스터 S | 스투트가르트스포츠카㈜ | 2011-03-31
6379109181오디오월드FORD F150FORD오디오월드2018-12-27
11808 | 31300 | 르노코리아자동차 주식회사 | SM7LE | 르노삼성자동차㈜ | 2011-01-21
2517 | 117934 | 기아자동차 | 카니발 | 기아㈜ | 2022-03-22
8021 | 105240 | 한국닛산 | INFINITI QX80 | 한국 닛산 | 2016-11-11
12072 | 27548 | 원모터스코리아 | BMW750LI | 원모터스코리아 | 2010-12-03