Overview

Dataset statistics

Number of variables: 10
Number of observations: 2013
Missing cells: 3762
Missing cells (%): 18.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 159.4 KiB
Average record size in memory: 81.1 B
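
The missing-cell share can be re-derived from the per-column missing counts reported further down. The sketch below is only a consistency check on the published figures. Attributing the count of 13 to 구분3 is an inference from the column order of the sample table, not something the report states next to that number.

```python
# Figures copied from this report; the dict keys are the column names as
# they appear in the sample table (the 구분3 attribution is an assumption).
n_variables = 10
n_observations = 2013
missing_per_column = {"구분3": 13, "파생모델명": 1761, "비고": 1988}

total_cells = n_variables * n_observations
missing_cells = sum(missing_per_column.values())
missing_share = missing_cells / total_cells

print(missing_cells)            # 3762
print(f"{missing_share:.1%}")   # 18.7%
```

Both results agree with the "Missing cells" lines above, which suggests that only these three columns contain missing values.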

Variable types

Numeric: 1
Text: 6
Categorical: 3

Dataset

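Description: Elevator safety component certification is a certification scheme intended to ensure the safety of elevators so that users can ride them safely. A safety certification body tests the elevators and evaluates the production system, including manufacturing and inspection facilities.
Author: 한국승강기안전공단 (Korea Elevator Safety Agency)
URL: https://www.data.go.kr/data/15039130/fileData.do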

Alerts

구분1 is highly overall correlated with 구분2 [High correlation]
구분2 is highly overall correlated with 구분1 [High correlation]
구분1 is highly imbalanced (55.0%) [Imbalance]
파생모델명 has 1761 (87.5%) missing values [Missing]
비고 has 1988 (98.8%) missing values [Missing]
인증번호 has unique values [Unique]
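
The 55.0% imbalance figure for 구분1 is consistent with a score defined as one minus the normalized Shannon entropy of the class frequencies. That definition is an assumption here, checked only against this one number, and `imbalance_score` is a hypothetical helper rather than the profiler's own function. The class counts are taken from the 구분1 section of this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class
    frequencies and k is the number of classes. A uniform column scores 0;
    the score approaches 1 as a single class dominates."""
    k = len(counts)
    if k < 2:
        # Not defined for a single class; 0.0 is returned by convention here.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return 1.0 - entropy / math.log(k)

# 엘리베이터, 에스컬레이터, 휠체어리프트 counts from the 구분1 section.
gubun1_counts = [1640, 367, 6]
score = imbalance_score(gubun1_counts)
print(f"{score:.1%}")  # 55.0%
```

A perfectly balanced column would score 0%, so 55.0% reflects that 81.5% of rows fall into a single class.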

Reproduction

Analysis started: 2024-03-23 05:51:53.049258
Analysis finished: 2024-03-23 05:51:54.662551
Duration: 1.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

Distinct: 2012
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1302.1207
Minimum: 1
Maximum: 2443
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 121.2
Q1: 704
Median: 1326
Q3: 1938
95-th percentile: 2342.4
Maximum: 2443
Range: 2442
Interquartile range (IQR): 1234

Descriptive statistics

Standard deviation: 713.74175
Coefficient of variation (CV): 0.54813793
Kurtosis: -1.1669974
Mean: 1302.1207
Median Absolute Deviation (MAD): 617
Skewness: -0.16557898
Sum: 2621169
Variance: 509427.29
Monotonicity: Increasing
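
Several of the 연번 statistics are simple functions of one another. The sketch below recomputes them from the published (rounded) values as a sanity check. It does not reproduce how the profiler itself computes them, and small differences in the last digits come from rounding of the inputs.

```python
# Published values for 연번, copied from this report.
mean = 1302.1207
std = 713.74175
total = 2621169
n_observations = 2013
q1, q3 = 704, 1938
minimum, maximum = 1, 2443

cv = std / mean                        # coefficient of variation
variance = std ** 2                    # variance is the squared standard deviation
iqr = q3 - q1                          # interquartile range
value_range = maximum - minimum        # range
mean_from_sum = total / n_observations # mean recomputed from sum and row count

print(f"CV       = {cv:.6f}")
print(f"Variance = {variance:.2f}")
print(f"IQR      = {iqr}")             # 1234
print(f"Range    = {value_range}")     # 2442
print(f"Mean     = {mean_from_sum:.4f}")
```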
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
84 | 2 | 0.1%
1 | 1 | < 0.1%
1755 | 1 | < 0.1%
1768 | 1 | < 0.1%
1767 | 1 | < 0.1%
1766 | 1 | < 0.1%
1765 | 1 | < 0.1%
1764 | 1 | < 0.1%
1763 | 1 | < 0.1%
1762 | 1 | < 0.1%
Other values (2002) | 2002 | 99.5%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
12 | 1 | < 0.1%
13 | 1 | < 0.1%
Maximum 10 values
Value | Count | Frequency (%)
2443 | 1 | < 0.1%
2442 | 1 | < 0.1%
2441 | 1 | < 0.1%
2440 | 1 | < 0.1%
2439 | 1 | < 0.1%
2438 | 1 | < 0.1%
2437 | 1 | < 0.1%
2436 | 1 | < 0.1%
2435 | 1 | < 0.1%
2434 | 1 | < 0.1%

인증번호
Text

UNIQUE 

Distinct: 2013
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length17
Median length16
Mean length15.977149
Min length11

Characters and Unicode

Total characters32162
Distinct characters25
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2013 ?
Unique (%)100.0%

Sample

1st rowAAD55-R011-21003
2nd rowAAD55-R011-21005
3rd rowAAD55-R011-21001
4th rowAAD52-R004-21001
5th rowAAA54-R003-20002
ValueCountFrequency (%)
aad55-r011-21003 1
 
< 0.1%
aad55-r002-20010 1
 
< 0.1%
aad53-t001-20002 1
 
< 0.1%
aad14-r005-20003 1
 
< 0.1%
aad14-r005-20002 1
 
< 0.1%
aad14-r004-20003 1
 
< 0.1%
aad14-r005-20001 1
 
< 0.1%
aad53-r006-20002 1
 
< 0.1%
aad53-r006-20001 1
 
< 0.1%
aab10-h012-20004 1
 
< 0.1%
Other values (2003) 2003
99.5%

Most occurring characters

ValueCountFrequency (%)
0 9219
28.7%
A 5613
17.5%
- 4013
12.5%
1 3702
11.5%
2 3135
 
9.7%
3 1472
 
4.6%
5 777
 
2.4%
R 738
 
2.3%
D 588
 
1.8%
4 588
 
1.8%
Other values (15) 2317
 
7.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 20121 | 62.6%
Uppercase Letter | 8026 | 25.0%
Dash Punctuation | 4013 | 12.5%
Control | 2 | < 0.1%
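
The category names above correspond to Unicode general categories (Decimal Number is `Nd`, Uppercase Letter is `Lu`, Dash Punctuation is `Pd`, Control is `Cc`). A comparable tally can be sketched with Python's standard `unicodedata` module. This illustrates the idea on the five sample values of 인증번호 and is not necessarily the code path the profiler uses. `category_counts` and `has_control_char` are hypothetical helper names.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over every character of every value."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts

def has_control_char(value):
    """Flag values containing control characters (category Cc), e.g. stray newlines."""
    return any(unicodedata.category(ch) == "Cc" for ch in value)

# The five sample rows shown for 인증번호 in this report.
samples = [
    "AAD55-R011-21003",
    "AAD55-R011-21005",
    "AAD55-R011-21001",
    "AAD52-R004-21001",
    "AAA54-R003-20002",
]
counts = category_counts(samples)
print(counts["Nd"], counts["Lu"], counts["Pd"])  # 50 20 10
print(has_control_char("AAD55-R011-21003\n"))    # True
```

The two Control characters reported for this column are worth locating with a check like `has_control_char`, since they are unlikely to belong in a certification number.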

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 5613
69.9%
R 738
 
9.2%
D 588
 
7.3%
H 352
 
4.4%
J 190
 
2.4%
B 152
 
1.9%
T 133
 
1.7%
M 123
 
1.5%
K 111
 
1.4%
C 11
 
0.1%
Other values (3) 15
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 9219
45.8%
1 3702
18.4%
2 3135
 
15.6%
3 1472
 
7.3%
5 777
 
3.9%
4 588
 
2.9%
9 442
 
2.2%
7 346
 
1.7%
6 245
 
1.2%
8 195
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 4013
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24136
75.0%
Latin 8026
 
25.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 5613
69.9%
R 738
 
9.2%
D 588
 
7.3%
H 352
 
4.4%
J 190
 
2.4%
B 152
 
1.9%
T 133
 
1.7%
M 123
 
1.5%
K 111
 
1.4%
C 11
 
0.1%
Other values (3) 15
 
0.2%
Common
ValueCountFrequency (%)
0 9219
38.2%
- 4013
16.6%
1 3702
15.3%
2 3135
 
13.0%
3 1472
 
6.1%
5 777
 
3.2%
4 588
 
2.4%
9 442
 
1.8%
7 346
 
1.4%
6 245
 
1.0%
Other values (2) 197
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32162
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9219
28.7%
A 5613
17.5%
- 4013
12.5%
1 3702
11.5%
2 3135
 
9.7%
3 1472
 
4.6%
5 777
 
2.4%
R 738
 
2.3%
D 588
 
1.8%
4 588
 
1.8%
Other values (15) 2317
 
7.2%

변경
Categorical

Distinct: 12
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB
<NA>: 994
A: 562
B: 209
C: 82
D: 50
Other values (7): 116

Length

Max length4
Median length1
Mean length2.4818679
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowC
2nd rowC
3rd rowC
4th rowB
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 994 | 49.4%
A | 562 | 27.9%
B | 209 | 10.4%
C | 82 | 4.1%
D | 50 | 2.5%
E | 32 | 1.6%
F | 28 | 1.4%
G | 24 | 1.2%
H | 18 | 0.9%
I | 11 | 0.5%
Other values (2) | 3 | 0.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
na 994
49.4%
a 562
27.9%
b 210
 
10.4%
c 82
 
4.1%
d 50
 
2.5%
e 32
 
1.6%
f 28
 
1.4%
g 24
 
1.2%
h 18
 
0.9%
i 11
 
0.5%

구분1
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB
엘리베이터: 1640
에스컬레이터: 367
휠체어리프트: 6

Length

Max length6
Median length5
Mean length5.1852956
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row에스컬레이터
2nd row에스컬레이터
3rd row에스컬레이터
4th row에스컬레이터
5th row에스컬레이터

Common Values

Value | Count | Frequency (%)
엘리베이터 | 1640 | 81.5%
에스컬레이터 | 367 | 18.2%
휠체어리프트 | 6 | 0.3%

Length

Histogram of lengths of the category


구분2
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB
출입문조립체: 391
문열림출발방지장치: 245
구동기: 239
상승과속방지장치: 186
제어반: 170
Other values (12): 782

Length

Max length9
Median length7
Mean length5.6249379
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row디딤판체인
2nd row디딤판체인
3rd row디딤판체인
4th row구동기
5th row디딤판

Common Values

Value | Count | Frequency (%)
출입문조립체 | 391 | 19.4%
문열림출발방지장치 | 245 | 12.2%
구동기 | 239 | 11.9%
상승과속방지장치 | 186 | 9.2%
제어반 | 170 | 8.4%
디딤판체인 | 116 | 5.8%
매다는장치 | 90 | 4.5%
디딤판 | 84 | 4.2%
완충기 | 80 | 4.0%
출입문잠금장치 | 80 | 4.0%
Other values (7) | 332 | 16.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
출입문조립체 391
19.4%
문열림출발방지장치 245
12.2%
구동기 239
11.9%
상승과속방지장치 186
9.2%
제어반 170
8.4%
디딤판체인 116
 
5.8%
매다는장치 90
 
4.5%
디딤판 84
 
4.2%
출입문잠금장치 80
 
4.0%
완충기 80
 
4.0%
Other values (7) 332
16.5%

구분3
Text

Distinct: 52
Distinct (%): 2.6%
Missing: 13
Missing (%): 0.6%
Memory size: 15.9 KiB

Length

Max length16
Median length14
Mean length5.771
Min length2

Characters and Unicode

Total characters11542
Distinct characters97
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.4%

Sample

1st row일반형
2nd row일반형
3rd row일반형
4th row기어드방식
5th row팔레트형
ValueCountFrequency (%)
승강장문 358
16.9%
이중브레이크형 173
 
8.2%
기어리스방식 172
 
8.1%
이중브레이크형(제동요소 138
 
6.5%
일반형 115
 
5.4%
플랫케이블 76
 
3.6%
로프 72
 
3.4%
저항제동방식 68
 
3.2%
스텝형 67
 
3.2%
작동형 66
 
3.1%
Other values (45) 817
38.5%

Most occurring characters

ValueCountFrequency (%)
807
 
7.0%
657
 
5.7%
648
 
5.6%
568
 
4.9%
472
 
4.1%
416
 
3.6%
415
 
3.6%
411
 
3.6%
410
 
3.6%
408
 
3.5%
Other values (87) 6330
54.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10772
93.3%
Open Punctuation 297
 
2.6%
Close Punctuation 297
 
2.6%
Space Separator 122
 
1.1%
Uppercase Letter 52
 
0.5%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
807
 
7.5%
657
 
6.1%
648
 
6.0%
568
 
5.3%
472
 
4.4%
416
 
3.9%
415
 
3.9%
411
 
3.8%
410
 
3.8%
408
 
3.8%
Other values (81) 5560
51.6%
Uppercase Letter
ValueCountFrequency (%)
A 42
80.8%
B 10
 
19.2%
Open Punctuation
ValueCountFrequency (%)
( 297
100.0%
Close Punctuation
ValueCountFrequency (%)
) 297
100.0%
Space Separator
ValueCountFrequency (%)
122
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10772
93.3%
Common 718
 
6.2%
Latin 52
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
807
 
7.5%
657
 
6.1%
648
 
6.0%
568
 
5.3%
472
 
4.4%
416
 
3.9%
415
 
3.9%
411
 
3.8%
410
 
3.8%
408
 
3.8%
Other values (81) 5560
51.6%
Common
ValueCountFrequency (%)
( 297
41.4%
) 297
41.4%
122
17.0%
/ 2
 
0.3%
Latin
ValueCountFrequency (%)
A 42
80.8%
B 10
 
19.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10772
93.3%
ASCII 770
 
6.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
807
 
7.5%
657
 
6.1%
648
 
6.0%
568
 
5.3%
472
 
4.4%
416
 
3.9%
415
 
3.9%
411
 
3.8%
410
 
3.8%
408
 
3.8%
Other values (81) 5560
51.6%
ASCII
ValueCountFrequency (%)
( 297
38.6%
) 297
38.6%
122
15.8%
A 42
 
5.5%
B 10
 
1.3%
/ 2
 
0.3%

기본모델명
Text

Distinct: 1833
Distinct (%): 91.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length53
Median length39
Mean length12.879781
Min length2

Characters and Unicode

Total characters25927
Distinct characters123
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1681 ?
Unique (%)83.5%

Sample

1st rowST135F2
2nd rowST135F13
3rd rowPT135F2
4th rowEC-W1_7.5/9KW
5th row1000P
ValueCountFrequency (%)
chain 28
 
1.0%
1.2t 27
 
1.0%
evvf-h 24
 
0.9%
22
 
0.8%
x 20
 
0.7%
300v 18
 
0.7%
1.5t 17
 
0.6%
evvf-l 17
 
0.6%
2pco 13
 
0.5%
l2n-co(sts 12
 
0.4%
Other values (1961) 2514
92.7%

Most occurring characters

ValueCountFrequency (%)
0 2331
 
9.0%
- 2074
 
8.0%
1 1505
 
5.8%
S 1392
 
5.4%
2 1278
 
4.9%
C 885
 
3.4%
5 822
 
3.2%
704
 
2.7%
D 692
 
2.7%
T 681
 
2.6%
Other values (113) 13563
52.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 11910
45.9%
Decimal Number 8162
31.5%
Dash Punctuation 2075
 
8.0%
Lowercase Letter 752
 
2.9%
Space Separator 704
 
2.7%
Other Punctuation 634
 
2.4%
Close Punctuation 521
 
2.0%
Open Punctuation 521
 
2.0%
Math Symbol 211
 
0.8%
Other Letter 207
 
0.8%
Other values (6) 230
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
19.8%
36
17.4%
11
 
5.3%
10
 
4.8%
10
 
4.8%
10
 
4.8%
8
 
3.9%
8
 
3.9%
7
 
3.4%
7
 
3.4%
Other values (23) 59
28.5%
Uppercase Letter
ValueCountFrequency (%)
S 1392
 
11.7%
C 885
 
7.4%
D 692
 
5.8%
T 681
 
5.7%
M 648
 
5.4%
E 613
 
5.1%
L 602
 
5.1%
B 597
 
5.0%
H 579
 
4.9%
A 556
 
4.7%
Other values (18) 4665
39.2%
Lowercase Letter
ValueCountFrequency (%)
x 125
16.6%
t 119
15.8%
k 75
10.0%
o 74
9.8%
i 43
 
5.7%
a 35
 
4.7%
e 33
 
4.4%
n 33
 
4.4%
s 30
 
4.0%
r 28
 
3.7%
Other values (15) 157
20.9%
Decimal Number
ValueCountFrequency (%)
0 2331
28.6%
1 1505
18.4%
2 1278
15.7%
5 822
 
10.1%
3 590
 
7.2%
4 497
 
6.1%
6 355
 
4.3%
7 330
 
4.0%
8 262
 
3.2%
9 192
 
2.4%
Other Punctuation
ValueCountFrequency (%)
. 492
77.6%
/ 63
 
9.9%
, 31
 
4.9%
* 22
 
3.5%
" 15
 
2.4%
: 8
 
1.3%
3
 
0.5%
Other Symbol
ValueCountFrequency (%)
41
85.4%
4
 
8.3%
2
 
4.2%
© 1
 
2.1%
Math Symbol
ValueCountFrequency (%)
+ 126
59.7%
× 68
32.2%
~ 17
 
8.1%
Dash Punctuation
ValueCountFrequency (%)
- 2074
> 99.9%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 520
99.8%
] 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 520
99.8%
[ 1
 
0.2%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
704
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 172
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13054
50.3%
Latin 12664
48.8%
Hangul 207
 
0.8%
Greek 2
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 1392
 
11.0%
C 885
 
7.0%
D 692
 
5.5%
T 681
 
5.4%
M 648
 
5.1%
E 613
 
4.8%
L 602
 
4.8%
B 597
 
4.7%
H 579
 
4.6%
A 556
 
4.4%
Other values (43) 5419
42.8%
Common
ValueCountFrequency (%)
0 2331
17.9%
- 2074
15.9%
1 1505
11.5%
2 1278
9.8%
5 822
 
6.3%
704
 
5.4%
3 590
 
4.5%
) 520
 
4.0%
( 520
 
4.0%
4 497
 
3.8%
Other values (25) 2213
17.0%
Hangul
ValueCountFrequency (%)
41
19.8%
36
17.4%
11
 
5.3%
10
 
4.8%
10
 
4.8%
10
 
4.8%
8
 
3.9%
8
 
3.9%
7
 
3.4%
7
 
3.4%
Other values (23) 59
28.5%
Greek
ValueCountFrequency (%)
Φ 1
50.0%
Ω 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25587
98.7%
Hangul 207
 
0.8%
None 74
 
0.3%
CJK Compat 47
 
0.2%
Punctuation 8
 
< 0.1%
Number Forms 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2331
 
9.1%
- 2074
 
8.1%
1 1505
 
5.9%
S 1392
 
5.4%
2 1278
 
5.0%
C 885
 
3.5%
5 822
 
3.2%
704
 
2.8%
D 692
 
2.7%
T 681
 
2.7%
Other values (66) 13223
51.7%
None
ValueCountFrequency (%)
× 68
91.9%
ø 2
 
2.7%
© 1
 
1.4%
Φ 1
 
1.4%
² 1
 
1.4%
Ω 1
 
1.4%
Hangul
ValueCountFrequency (%)
41
19.8%
36
17.4%
11
 
5.3%
10
 
4.8%
10
 
4.8%
10
 
4.8%
8
 
3.9%
8
 
3.9%
7
 
3.4%
7
 
3.4%
Other values (23) 59
28.5%
CJK Compat
ValueCountFrequency (%)
41
87.2%
4
 
8.5%
2
 
4.3%
Punctuation
ValueCountFrequency (%)
4
50.0%
3
37.5%
1
 
12.5%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%

파생모델명
Text

MISSING 

Distinct: 246
Distinct (%): 97.6%
Missing: 1761
Missing (%): 87.5%
Memory size: 15.9 KiB
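
A distinct share of 97.6% alongside 87.5% missing only makes sense if the distinct percentage is taken over non-missing rows rather than over all 2013 observations. The sketch below checks that reading against the figures published for 파생모델명 and 비고. It is an inference from the numbers, and `distinct_share` is a hypothetical helper.

```python
N_OBSERVATIONS = 2013

def distinct_share(distinct, missing, n_observations=N_OBSERVATIONS):
    """Share of distinct values among the rows that actually hold a value."""
    present = n_observations - missing
    return distinct / present

# (distinct, missing) pairs copied from this report.
columns = {"파생모델명": (246, 1761), "비고": (14, 1988)}
for name, (distinct, missing) in columns.items():
    present = N_OBSERVATIONS - missing
    print(name, present, f"{distinct_share(distinct, missing):.1%}")
# 파생모델명 252 97.6%
# 비고 25 56.0%
```

Under this reading, 파생모델명 holds 252 actual values and 비고 only 25, which matters when judging how much weight their statistics can carry.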

Length

Max length140
Median length62
Mean length19.329365
Min length2

Characters and Unicode

Total characters4871
Distinct characters98
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique240 ?
Unique (%)95.2%

Sample

1st rowLYW125
2nd rowGTW9S
3rd rowRSR1500/8010(DAF270M,DAF270L)-1.RSR1500/8010(DAF270M,DAF270L)-2
4th rowFJ125
5th rowSA GED 10A
ValueCountFrequency (%)
evvf-h 12
 
2.8%
300v 11
 
2.5%
x 10
 
2.3%
9
 
2.1%
evvf-l 7
 
1.6%
sa 6
 
1.4%
0.75㎟ 5
 
1.1%
ged 5
 
1.1%
l2n-co(sts 4
 
0.9%
evvf-h-hs 4
 
0.9%
Other values (333) 363
83.3%

Most occurring characters

ValueCountFrequency (%)
- 452
 
9.3%
0 398
 
8.2%
S 280
 
5.7%
2 245
 
5.0%
1 245
 
5.0%
5 211
 
4.3%
185
 
3.8%
B 133
 
2.7%
E 124
 
2.5%
3 121
 
2.5%
Other values (88) 2477
50.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 2003
41.1%
Decimal Number 1573
32.3%
Dash Punctuation 452
 
9.3%
Other Punctuation 219
 
4.5%
Space Separator 185
 
3.8%
Lowercase Letter 116
 
2.4%
Open Punctuation 102
 
2.1%
Close Punctuation 102
 
2.1%
Other Letter 45
 
0.9%
Connector Punctuation 31
 
0.6%
Other values (2) 43
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
13.3%
5
 
11.1%
3
 
6.7%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
Other values (18) 18
40.0%
Uppercase Letter
ValueCountFrequency (%)
S 280
 
14.0%
B 133
 
6.6%
E 124
 
6.2%
A 115
 
5.7%
C 112
 
5.6%
D 107
 
5.3%
M 103
 
5.1%
L 100
 
5.0%
G 98
 
4.9%
R 92
 
4.6%
Other values (16) 739
36.9%
Lowercase Letter
ValueCountFrequency (%)
x 38
32.8%
t 10
 
8.6%
i 9
 
7.8%
n 8
 
6.9%
e 7
 
6.0%
k 6
 
5.2%
p 5
 
4.3%
m 5
 
4.3%
b 4
 
3.4%
s 4
 
3.4%
Other values (8) 20
17.2%
Decimal Number
ValueCountFrequency (%)
0 398
25.3%
2 245
15.6%
1 245
15.6%
5 211
13.4%
3 121
 
7.7%
4 121
 
7.7%
7 81
 
5.1%
6 62
 
3.9%
8 49
 
3.1%
9 40
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 99
45.2%
. 90
41.1%
/ 24
 
11.0%
: 4
 
1.8%
* 2
 
0.9%
Math Symbol
ValueCountFrequency (%)
+ 19
73.1%
~ 4
 
15.4%
× 3
 
11.5%
Other Symbol
ValueCountFrequency (%)
15
88.2%
© 1
 
5.9%
1
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 452
100.0%
Space Separator
ValueCountFrequency (%)
185
100.0%
Open Punctuation
ValueCountFrequency (%)
( 102
100.0%
Close Punctuation
ValueCountFrequency (%)
) 102
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2707
55.6%
Latin 2119
43.5%
Hangul 45
 
0.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 280
 
13.2%
B 133
 
6.3%
E 124
 
5.9%
A 115
 
5.4%
C 112
 
5.3%
D 107
 
5.0%
M 103
 
4.9%
L 100
 
4.7%
G 98
 
4.6%
R 92
 
4.3%
Other values (34) 855
40.3%
Hangul
ValueCountFrequency (%)
6
 
13.3%
5
 
11.1%
3
 
6.7%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
Other values (18) 18
40.0%
Common
ValueCountFrequency (%)
- 452
16.7%
0 398
14.7%
2 245
9.1%
1 245
9.1%
5 211
7.8%
185
 
6.8%
3 121
 
4.5%
4 121
 
4.5%
( 102
 
3.8%
) 102
 
3.8%
Other values (16) 525
19.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4806
98.7%
Hangul 45
 
0.9%
CJK Compat 16
 
0.3%
None 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 452
 
9.4%
0 398
 
8.3%
S 280
 
5.8%
2 245
 
5.1%
1 245
 
5.1%
5 211
 
4.4%
185
 
3.8%
B 133
 
2.8%
E 124
 
2.6%
3 121
 
2.5%
Other values (56) 2412
50.2%
CJK Compat
ValueCountFrequency (%)
15
93.8%
1
 
6.2%
Hangul
ValueCountFrequency (%)
6
 
13.3%
5
 
11.1%
3
 
6.7%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
Other values (18) 18
40.0%
None
ValueCountFrequency (%)
× 3
75.0%
© 1
 
25.0%

제조수입업자명(대리인)
Text

Distinct: 117
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length17
Median length13
Mean length8.5494287
Min length3

Characters and Unicode

Total characters17210
Distinct characters162
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)0.7%

Sample

1st row오티스엘리베이터(유)
2nd row오티스엘리베이터(유)
3rd row오티스엘리베이터(유)
4th row오티스엘리베이터(유)
5th row㈜쉰들러엘리베이터
ValueCountFrequency (%)
티케이엘리베이터코리아㈜ 248
 
11.7%
현대엘리베이터㈜ 246
 
11.6%
오티스엘리베이터(유 196
 
9.2%
한국미쓰비시엘리베이터㈜ 150
 
7.1%
㈜쉰들러엘리베이터 85
 
4.0%
주식회사 75
 
3.5%
후지테크코리아㈜ 61
 
2.9%
㈜비티알수성 58
 
2.7%
㈜나우테크 41
 
1.9%
디앤아이솔루션㈜ 37
 
1.7%
Other values (111) 930
43.7%

Most occurring characters

ValueCountFrequency (%)
1642
 
9.5%
1527
 
8.9%
1509
 
8.8%
1129
 
6.6%
1082
 
6.3%
1057
 
6.1%
624
 
3.6%
436
 
2.5%
408
 
2.4%
373
 
2.2%
Other values (152) 7423
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15053
87.5%
Other Symbol 1642
 
9.5%
Close Punctuation 197
 
1.1%
Open Punctuation 197
 
1.1%
Space Separator 115
 
0.7%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1527
 
10.1%
1509
 
10.0%
1129
 
7.5%
1082
 
7.2%
1057
 
7.0%
624
 
4.1%
436
 
2.9%
408
 
2.7%
373
 
2.5%
329
 
2.2%
Other values (145) 6579
43.7%
Uppercase Letter
ValueCountFrequency (%)
E 2
33.3%
S 2
33.3%
L 2
33.3%
Other Symbol
ValueCountFrequency (%)
1642
100.0%
Close Punctuation
ValueCountFrequency (%)
) 197
100.0%
Open Punctuation
ValueCountFrequency (%)
( 197
100.0%
Space Separator
ValueCountFrequency (%)
115
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16695
97.0%
Common 509
 
3.0%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1642
 
9.8%
1527
 
9.1%
1509
 
9.0%
1129
 
6.8%
1082
 
6.5%
1057
 
6.3%
624
 
3.7%
436
 
2.6%
408
 
2.4%
373
 
2.2%
Other values (146) 6908
41.4%
Common
ValueCountFrequency (%)
) 197
38.7%
( 197
38.7%
115
22.6%
Latin
ValueCountFrequency (%)
E 2
33.3%
S 2
33.3%
L 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15053
87.5%
None 1642
 
9.5%
ASCII 515
 
3.0%

Most frequent character per block

None
ValueCountFrequency (%)
1642
100.0%
Hangul
ValueCountFrequency (%)
1527
 
10.1%
1509
 
10.0%
1129
 
7.5%
1082
 
7.2%
1057
 
7.0%
624
 
4.1%
436
 
2.9%
408
 
2.7%
373
 
2.5%
329
 
2.2%
Other values (145) 6579
43.7%
ASCII
ValueCountFrequency (%)
) 197
38.3%
( 197
38.3%
115
22.3%
E 2
 
0.4%
S 2
 
0.4%
L 2
 
0.4%

비고
Text

MISSING 

Distinct: 14
Distinct (%): 56.0%
Missing: 1988
Missing (%): 98.8%
Memory size: 15.9 KiB

Length

Max length49
Median length38
Mean length20.84
Min length4

Characters and Unicode

Total characters521
Distinct characters43
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)48.0%

Sample

1st row동일부품
2nd row동일부품
3rd row동일부품
4th row유효만료일 조정
5th row유효만료일 조정
ValueCountFrequency (%)
동일부품 11
17.7%
유효만료일 10
16.1%
6
 
9.7%
2023-12-17 3
 
4.8%
2024-08-16 3
 
4.8%
재신청 3
 
4.8%
3
 
4.8%
2023-12-31 2
 
3.2%
조정(a20230831-102 2
 
3.2%
조정 2
 
3.2%
Other values (17) 17
27.4%

Most occurring characters

ValueCountFrequency (%)
2 78
15.0%
0 56
 
10.7%
1 53
 
10.2%
- 46
 
8.8%
37
 
7.1%
3 23
 
4.4%
21
 
4.0%
A 12
 
2.3%
11
 
2.1%
( 11
 
2.1%
Other values (33) 173
33.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 247
47.4%
Other Letter 139
26.7%
Dash Punctuation 46
 
8.8%
Space Separator 37
 
7.1%
Uppercase Letter 17
 
3.3%
Open Punctuation 11
 
2.1%
Close Punctuation 11
 
2.1%
Other Punctuation 7
 
1.3%
Math Symbol 6
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
15.1%
11
7.9%
11
7.9%
11
7.9%
10
 
7.2%
10
 
7.2%
10
 
7.2%
10
 
7.2%
9
 
6.5%
9
 
6.5%
Other values (12) 27
19.4%
Decimal Number
ValueCountFrequency (%)
2 78
31.6%
0 56
22.7%
1 53
21.5%
3 23
 
9.3%
4 9
 
3.6%
8 9
 
3.6%
7 8
 
3.2%
5 6
 
2.4%
6 3
 
1.2%
9 2
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
A 12
70.6%
D 2
 
11.8%
G 1
 
5.9%
E 1
 
5.9%
S 1
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 46
100.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Math Symbol
ValueCountFrequency (%)
> 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 365
70.1%
Hangul 139
 
26.7%
Latin 17
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
15.1%
11
7.9%
11
7.9%
11
7.9%
10
 
7.2%
10
 
7.2%
10
 
7.2%
10
 
7.2%
9
 
6.5%
9
 
6.5%
Other values (12) 27
19.4%
Common
ValueCountFrequency (%)
2 78
21.4%
0 56
15.3%
1 53
14.5%
- 46
12.6%
37
10.1%
3 23
 
6.3%
( 11
 
3.0%
) 11
 
3.0%
4 9
 
2.5%
8 9
 
2.5%
Other values (6) 32
8.8%
Latin
ValueCountFrequency (%)
A 12
70.6%
D 2
 
11.8%
G 1
 
5.9%
E 1
 
5.9%
S 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 382
73.3%
Hangul 139
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 78
20.4%
0 56
14.7%
1 53
13.9%
- 46
12.0%
37
9.7%
3 23
 
6.0%
A 12
 
3.1%
( 11
 
2.9%
) 11
 
2.9%
4 9
 
2.4%
Other values (11) 46
12.0%
Hangul
ValueCountFrequency (%)
21
15.1%
11
7.9%
11
7.9%
11
7.9%
10
 
7.2%
10
 
7.2%
10
 
7.2%
10
 
7.2%
9
 
6.5%
9
 
6.5%
Other values (12) 27
19.4%

Interactions


Correlations

Variable | 연번 | 변경 | 구분1 | 구분2 | 구분3 | 비고
연번 | 1.000 | 0.418 | 0.354 | 0.654 | 0.731 | 0.927
변경 | 0.418 | 1.000 | 0.139 | 0.518 | 0.601 | 0.994
구분1 | 0.354 | 0.139 | 1.000 | 0.811 | 0.931 | 1.000
구분2 | 0.654 | 0.518 | 0.811 | 1.000 | 0.999 | 0.912
구분3 | 0.731 | 0.601 | 0.931 | 0.999 | 1.000 | 0.906
비고 | 0.927 | 0.994 | 1.000 | 0.912 | 0.906 | 1.000

Variable | 변경 | 구분1 | 구분2
변경 | 1.000 | 0.081 | 0.221
구분1 | 0.081 | 1.000 | 0.643
구분2 | 0.221 | 0.643 | 1.000

Variable | 연번 | 변경 | 구분1 | 구분2
연번 | 1.000 | 0.192 | 0.227 | 0.322
변경 | 0.192 | 1.000 | 0.081 | 0.221
구분1 | 0.227 | 0.081 | 1.000 | 0.643
구분2 | 0.322 | 0.221 | 0.643 | 1.000
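
The report does not label which association measure each matrix uses. For pairs of categorical columns such as 구분1 and 구분2, Cramér's V is one common choice, sketched below in its plain textbook form without bias correction. Whether any matrix above was computed this way is an assumption, so the sketch is not expected to reproduce the exact coefficients. `cramers_v` is a hypothetical helper.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        # A constant column carries no association information.
        return 0.0
    return math.sqrt(chi2 / (n * k))

# Toy data: the second column is fully determined by the first.
kind = ["엘리베이터", "엘리베이터", "에스컬레이터", "에스컬레이터"]
part = ["구동기", "구동기", "디딤판", "디딤판"]
print(cramers_v(kind, part))  # 1.0
```

A high value between 구분1 and 구분2 is expected from the data itself, since part types such as 디딤판 appear only under 에스컬레이터 in the sample rows.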

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
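
Nullity correlation can be sketched as the Pearson correlation between two presence indicators (1 where a value exists, 0 where it is missing). The helper below is a minimal pure-Python illustration on toy data, not the plotting library's implementation. `nullity_correlation` is a hypothetical name, and `None` stands in for a missing value.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    Returns NaN when either column is fully present or fully missing,
    because its indicator then has zero variance."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")
    return cov / math.sqrt(var_a * var_b)

# Toy columns: the first two are missing on the same rows, the third on the others.
derived = ["LYW125", None, "GTW9S", None]
remark = ["동일부품", None, "재신청", None]
other = [None, "x", None, "y"]
print(nullity_correlation(derived, remark))  # 1.0
print(nullity_correlation(derived, other))   # -1.0
```

A value near 1 means two columns tend to be filled in on the same rows, and a value near -1 means one tends to be present exactly where the other is missing.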

Sample

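First rows
Row | 연번 | 인증번호 | 변경 | 구분1 | 구분2 | 구분3 | 기본모델명 | 파생모델명 | 제조수입업자명(대리인) | 비고
0 | 1 | AAD55-R011-21003 | C | 에스컬레이터 | 디딤판체인 | 일반형 | ST135F2 | <NA> | 오티스엘리베이터(유) | <NA>
1 | 2 | AAD55-R011-21005 | C | 에스컬레이터 | 디딤판체인 | 일반형 | ST135F13 | <NA> | 오티스엘리베이터(유) | <NA>
2 | 3 | AAD55-R011-21001 | C | 에스컬레이터 | 디딤판체인 | 일반형 | PT135F2 | <NA> | 오티스엘리베이터(유) | <NA>
3 | 5 | AAD52-R004-21001 | B | 에스컬레이터 | 구동기 | 기어드방식 | EC-W1_7.5/9KW | <NA> | 오티스엘리베이터(유) | <NA>
4 | 6 | AAA54-R003-20002 | <NA> | 에스컬레이터 | 디딤판 | 팔레트형 | 1000P | <NA> | ㈜쉰들러엘리베이터 | <NA>
5 | 7 | AAA54-R003-21003 | <NA> | 에스컬레이터 | 디딤판 | 스텝형 | 1000S | <NA> | ㈜쉰들러엘리베이터 | <NA>
6 | 8 | AAA54-R003-20003 | <NA> | 에스컬레이터 | 디딤판 | 팔레트형 | 1200P | <NA> | ㈜쉰들러엘리베이터 | <NA>
7 | 9 | AAA55-R001-20004 | A | 에스컬레이터 | 디딤판체인 | 일반형 | 1705 7250 (160kN) | <NA> | 티케이엘리베이터코리아㈜ | <NA>
8 | 12 | AAA54-R003-21001 | <NA> | 에스컬레이터 | 디딤판 | 스텝형 | 600S | <NA> | ㈜쉰들러엘리베이터 | <NA>
9 | 13 | AAA55-R001-20001 | A | 에스컬레이터 | 디딤판체인 | 일반형 | 7005 9300 (110kN) | <NA> | 티케이엘리베이터코리아㈜ | <NA>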

Last rows
Row | 연번 | 인증번호 | 변경 | 구분1 | 구분2 | 구분3 | 기본모델명 | 파생모델명 | 제조수입업자명(대리인) | 비고
2003 | 2434 | AAA54-R006-24002 | <NA> | 에스컬레이터 | 디딤판 | 스텝형 | LR1000K-4 | <NA> | ㈜대륜엘리스 | <NA>
2004 | 2435 | AAA54-R006-24003 | <NA> | 에스컬레이터 | 디딤판 | 스텝형 | LR800K-4 | <NA> | ㈜대륜엘리스 | <NA>
2005 | 2436 | AAA54-R006-24004 | <NA> | 에스컬레이터 | 디딤판 | 스텝형 | LR600K-4 | <NA> | ㈜대륜엘리스 | <NA>
2006 | 2437 | AAA51-R008-24001 | <NA> | 에스컬레이터 | 과속역행방지장치 | 폴래칫휠방식 | HS200-Y-3B | <NA> | 주식회사 한선엘리베이터 | <NA>
2007 | 2438 | AAA55-H001-24001 | <NA> | 에스컬레이터 | 디딤판체인 | 일반형 | HC19TT CHAIN | <NA> | ㈜대동모빌리티 | <NA>
2008 | 2439 | AAA55-H001-24002 | <NA> | 에스컬레이터 | 디딤판체인 | 일반형 | HC19TT CHAIN | <NA> | ㈜대동모빌리티 | <NA>
2009 | 2440 | AAA03-R011-24003 | <NA> | 엘리베이터 | 구동기 | 기어리스방식 | GETM2.6T | <NA> | 한국미쓰비시엘리베이터㈜ | <NA>
2010 | 2441 | AAA01-R024-24002 | <NA> | 엘리베이터 | 문열림출발방지장치 | 이중브레이크형(제동요소) | BLB(WYT-H1T) | <NA> | ㈜해성티피씨 | <NA>
2011 | 2442 | AAA03-R020-24002 | <NA> | 엘리베이터 | 구동기 | 기어리스방식 | WYT-H1T | <NA> | ㈜해성티피씨 | <NA>
2012 | 2443 | AAA07-R020-24002 | <NA> | 엘리베이터 | 상승과속방지장치 | 이중브레이크형 | BLB(WYT-H1T) | <NA> | ㈜해성티피씨 | <NA>