Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory322.3 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text2

Dataset

Description국가기술표준원이 운영하고 있는 제품안전정보포털(센터)에서 제공되는 제품안전 인증제품 파생모델 정보를 공유합니다.
URLhttps://www.data.go.kr/data/15040700/fileData.do

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:15:20.527181
Analysis finished2023-12-12 21:15:21.087395
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17328.863
Minimum1
Maximum34591
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:15:21.164376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1749.85
Q18527.75
median17393
Q326062.25
95-th percentile32840.2
Maximum34591
Range34590
Interquartile range (IQR)17534.5

Descriptive statistics

Standard deviation10016.617
Coefficient of variation (CV)0.57803081
Kurtosis-1.2137588
Mean17328.863
Median Absolute Deviation (MAD)8755
Skewness-0.005827692
Sum1.7328863 × 108
Variance1.0033261 × 108
MonotonicityNot monotonic
2023-12-13T06:15:21.286562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3726 1
 
< 0.1%
15403 1
 
< 0.1%
22260 1
 
< 0.1%
3154 1
 
< 0.1%
31754 1
 
< 0.1%
27714 1
 
< 0.1%
19158 1
 
< 0.1%
31262 1
 
< 0.1%
31678 1
 
< 0.1%
27040 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
17 1
< 0.1%
22 1
< 0.1%
24 1
< 0.1%
28 1
< 0.1%
31 1
< 0.1%
35 1
< 0.1%
ValueCountFrequency (%)
34591 1
< 0.1%
34590 1
< 0.1%
34583 1
< 0.1%
34582 1
< 0.1%
34581 1
< 0.1%
34570 1
< 0.1%
34568 1
< 0.1%
34566 1
< 0.1%
34565 1
< 0.1%
34564 1
< 0.1%
Distinct1792
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:15:21.464821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length28
Mean length8.6292
Min length1

Characters and Unicode

Total characters86292
Distinct characters166
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique711 ?
Unique (%)7.1%

Sample

1st rowW&TAD1806B050H
2nd rowGQ18190AK
3rd rowGM981506007F
4th rowRXMXT22A
5th rowGM981506007F
ValueCountFrequency (%)
f873mt95 230
 
2.3%
ad 143
 
1.4%
tsl2001 141
 
1.4%
xhb 134
 
1.3%
aa 129
 
1.3%
hr02 111
 
1.1%
ago450a 89
 
0.9%
mpx310vx20.1uf 83
 
0.8%
빌라트2등 79
 
0.8%
tclean3200 77
 
0.8%
Other values (1782) 8784
87.8%
2023-12-13T06:15:21.751713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 13034
 
15.1%
1 6768
 
7.8%
2 5869
 
6.8%
5 3989
 
4.6%
A 3465
 
4.0%
S 3330
 
3.9%
3 3005
 
3.5%
D 2487
 
2.9%
4 2445
 
2.8%
C 2256
 
2.6%
Other values (156) 39644
45.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 41670
48.3%
Uppercase Letter 40373
46.8%
Other Letter 1793
 
2.1%
Lowercase Letter 1226
 
1.4%
Other Punctuation 658
 
0.8%
Close Punctuation 200
 
0.2%
Open Punctuation 200
 
0.2%
Other Symbol 107
 
0.1%
Math Symbol 38
 
< 0.1%
Connector Punctuation 25
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
7.4%
133
 
7.4%
121
 
6.7%
86
 
4.8%
84
 
4.7%
82
 
4.6%
79
 
4.4%
76
 
4.2%
74
 
4.1%
48
 
2.7%
Other values (81) 877
48.9%
Uppercase Letter
ValueCountFrequency (%)
A 3465
 
8.6%
S 3330
 
8.2%
D 2487
 
6.2%
C 2256
 
5.6%
B 2154
 
5.3%
H 2076
 
5.1%
K 2035
 
5.0%
T 1979
 
4.9%
F 1888
 
4.7%
R 1842
 
4.6%
Other values (16) 16861
41.8%
Lowercase Letter
ValueCountFrequency (%)
u 267
21.8%
l 121
9.9%
a 113
9.2%
e 113
9.2%
n 109
8.9%
i 70
 
5.7%
o 50
 
4.1%
h 49
 
4.0%
r 49
 
4.0%
t 43
 
3.5%
Other values (13) 242
19.7%
Decimal Number
ValueCountFrequency (%)
0 13034
31.3%
1 6768
16.2%
2 5869
14.1%
5 3989
 
9.6%
3 3005
 
7.2%
4 2445
 
5.9%
6 2172
 
5.2%
8 1841
 
4.4%
7 1468
 
3.5%
9 1079
 
2.6%
Other Punctuation
ValueCountFrequency (%)
. 346
52.6%
/ 129
 
19.6%
, 91
 
13.8%
* 42
 
6.4%
& 28
 
4.3%
· 12
 
1.8%
: 10
 
1.5%
Other Symbol
ValueCountFrequency (%)
61
57.0%
29
27.1%
17
 
15.9%
Math Symbol
ValueCountFrequency (%)
× 31
81.6%
+ 7
 
18.4%
Close Punctuation
ValueCountFrequency (%)
) 200
100.0%
Open Punctuation
ValueCountFrequency (%)
( 200
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 25
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 42898
49.7%
Latin 41601
48.2%
Hangul 1793
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
7.4%
133
 
7.4%
121
 
6.7%
86
 
4.8%
84
 
4.7%
82
 
4.6%
79
 
4.4%
76
 
4.2%
74
 
4.1%
48
 
2.7%
Other values (81) 877
48.9%
Latin
ValueCountFrequency (%)
A 3465
 
8.3%
S 3330
 
8.0%
D 2487
 
6.0%
C 2256
 
5.4%
B 2154
 
5.2%
H 2076
 
5.0%
K 2035
 
4.9%
T 1979
 
4.8%
F 1888
 
4.5%
R 1842
 
4.4%
Other values (40) 18089
43.5%
Common
ValueCountFrequency (%)
0 13034
30.4%
1 6768
15.8%
2 5869
13.7%
5 3989
 
9.3%
3 3005
 
7.0%
4 2445
 
5.7%
6 2172
 
5.1%
8 1841
 
4.3%
7 1468
 
3.4%
9 1079
 
2.5%
Other values (15) 1228
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 84347
97.7%
Hangul 1793
 
2.1%
CJK Compat 78
 
0.1%
None 43
 
< 0.1%
Letterlike Symbols 29
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 13034
 
15.5%
1 6768
 
8.0%
2 5869
 
7.0%
5 3989
 
4.7%
A 3465
 
4.1%
S 3330
 
3.9%
3 3005
 
3.6%
D 2487
 
2.9%
4 2445
 
2.9%
C 2256
 
2.7%
Other values (59) 37699
44.7%
Hangul
ValueCountFrequency (%)
133
 
7.4%
133
 
7.4%
121
 
6.7%
86
 
4.8%
84
 
4.7%
82
 
4.6%
79
 
4.4%
76
 
4.2%
74
 
4.1%
48
 
2.7%
Other values (81) 877
48.9%
CJK Compat
ValueCountFrequency (%)
61
78.2%
17
 
21.8%
None
ValueCountFrequency (%)
× 31
72.1%
· 12
 
27.9%
Letterlike Symbols
ValueCountFrequency (%)
29
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Distinct9615
Distinct (%)96.2%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T06:15:22.010597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length255
Median length32
Mean length10.008301
Min length2

Characters and Unicode

Total characters100073
Distinct characters265
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9326 ?
Unique (%)93.3%

Sample

1st rowW&TAD1806A072050H
2nd rowGQ18180085AK
3rd rowGM98120600F
4th rowNW06
5th rowGM981107007FE
ValueCountFrequency (%)
hpofficejetpro9 22
 
0.2%
sq07lajwaj(벽걸이실내기 7
 
0.1%
6㎟ 7
 
0.1%
fq27adau(실외기 5
 
0.1%
gpe024p120d 5
 
0.1%
4㎟ 5
 
0.1%
tsl2 5
 
0.1%
jns14 4
 
< 0.1%
lx050l 4
 
< 0.1%
sq07lajwaz(벽걸이실내기 4
 
< 0.1%
Other values (9601) 9931
99.3%
2023-12-13T06:15:22.428783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 13938
 
13.9%
1 7433
 
7.4%
2 6549
 
6.5%
5 5337
 
5.3%
S 3997
 
4.0%
A 3711
 
3.7%
3 3497
 
3.5%
D 3191
 
3.2%
4 2766
 
2.8%
H 2591
 
2.6%
Other values (255) 47063
47.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 48070
48.0%
Uppercase Letter 45295
45.3%
Other Punctuation 2804
 
2.8%
Lowercase Letter 1724
 
1.7%
Other Letter 1286
 
1.3%
Connector Punctuation 274
 
0.3%
Close Punctuation 218
 
0.2%
Open Punctuation 212
 
0.2%
Other Symbol 156
 
0.2%
Math Symbol 31
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
3.9%
45
 
3.5%
45
 
3.5%
41
 
3.2%
40
 
3.1%
37
 
2.9%
36
 
2.8%
36
 
2.8%
36
 
2.8%
35
 
2.7%
Other values (176) 885
68.8%
Uppercase Letter
ValueCountFrequency (%)
S 3997
 
8.8%
A 3711
 
8.2%
D 3191
 
7.0%
H 2591
 
5.7%
C 2557
 
5.6%
B 2470
 
5.5%
F 2286
 
5.0%
T 2045
 
4.5%
P 1998
 
4.4%
K 1893
 
4.2%
Other values (16) 18556
41.0%
Lowercase Letter
ValueCountFrequency (%)
p 293
17.0%
u 218
12.6%
e 139
 
8.1%
o 136
 
7.9%
i 134
 
7.8%
r 116
 
6.7%
t 86
 
5.0%
a 78
 
4.5%
n 58
 
3.4%
h 51
 
3.0%
Other values (16) 415
24.1%
Decimal Number
ValueCountFrequency (%)
0 13938
29.0%
1 7433
15.5%
2 6549
13.6%
5 5337
 
11.1%
3 3497
 
7.3%
4 2766
 
5.8%
6 2538
 
5.3%
8 2387
 
5.0%
7 2162
 
4.5%
9 1463
 
3.0%
Other Punctuation
ValueCountFrequency (%)
/ 1397
49.8%
# 1021
36.4%
. 337
 
12.0%
& 28
 
1.0%
· 12
 
0.4%
: 5
 
0.2%
* 4
 
0.1%
Other Symbol
ValueCountFrequency (%)
71
45.5%
56
35.9%
29
18.6%
Math Symbol
ValueCountFrequency (%)
× 17
54.8%
+ 14
45.2%
Connector Punctuation
ValueCountFrequency (%)
_ 274
100.0%
Close Punctuation
ValueCountFrequency (%)
) 218
100.0%
Open Punctuation
ValueCountFrequency (%)
( 212
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 51766
51.7%
Latin 47021
47.0%
Hangul 1286
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
3.9%
45
 
3.5%
45
 
3.5%
41
 
3.2%
40
 
3.1%
37
 
2.9%
36
 
2.8%
36
 
2.8%
36
 
2.8%
35
 
2.7%
Other values (176) 885
68.8%
Latin
ValueCountFrequency (%)
S 3997
 
8.5%
A 3711
 
7.9%
D 3191
 
6.8%
H 2591
 
5.5%
C 2557
 
5.4%
B 2470
 
5.3%
F 2286
 
4.9%
T 2045
 
4.3%
P 1998
 
4.2%
K 1893
 
4.0%
Other values (43) 20282
43.1%
Common
ValueCountFrequency (%)
0 13938
26.9%
1 7433
14.4%
2 6549
12.7%
5 5337
 
10.3%
3 3497
 
6.8%
4 2766
 
5.3%
6 2538
 
4.9%
8 2387
 
4.6%
7 2162
 
4.2%
9 1463
 
2.8%
Other values (16) 3696
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 98600
98.5%
Hangul 1286
 
1.3%
CJK Compat 127
 
0.1%
Letterlike Symbols 29
 
< 0.1%
None 29
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 13938
 
14.1%
1 7433
 
7.5%
2 6549
 
6.6%
5 5337
 
5.4%
S 3997
 
4.1%
A 3711
 
3.8%
3 3497
 
3.5%
D 3191
 
3.2%
4 2766
 
2.8%
H 2591
 
2.6%
Other values (63) 45590
46.2%
CJK Compat
ValueCountFrequency (%)
71
55.9%
56
44.1%
Hangul
ValueCountFrequency (%)
50
 
3.9%
45
 
3.5%
45
 
3.5%
41
 
3.2%
40
 
3.1%
37
 
2.9%
36
 
2.8%
36
 
2.8%
36
 
2.8%
35
 
2.7%
Other values (176) 885
68.8%
Letterlike Symbols
ValueCountFrequency (%)
29
100.0%
None
ValueCountFrequency (%)
× 17
58.6%
· 12
41.4%
Number Forms
ValueCountFrequency (%)
2
100.0%

Interactions

2023-12-13T06:15:20.836542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T06:15:20.946499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:15:21.040606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번모델명파생모델명
37253726W&TAD1806B050HW&TAD1806A072050H
3182531826GQ18190AKGQ18180085AK
2221422215GM981506007FGM98120600F
1932519326RXMXT22ANW06
2220422205GM981506007FGM981107007FE
2214922150GM982404007DGM982254007D
77627763DYS836200200WDYS836170235W
1620816209DLP2DLP20311
2683126832빌라트2등VFTP2E
2059720598748974b6
순번모델명파생모델명
1649216493BTM502SJD02SBC800FQ10
3002630027AB110AB1102
79567957ECXd700.166ECXe1050.
1919019191QH09BXCVSI2000
2849928500GRD24FWCHLM613SMM75
1467314674BK15U1511MEBBKB9605D
2796527966SYSS15SYNJP15
3244732448S0F500BDC1A021S0F500BDC11021
1717817179M870SMM451M870GBM451
37093710W&TAD1806B050HW&TAD1806A090030H