Overview

Dataset statistics

Number of variables4
Number of observations2465
Missing cells14
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory79.6 KiB
Average record size in memory33.1 B

Variable types

Numeric1
Text3

Dataset

Description한국동서발전의 발전설비용어 정보를 제공합니다. 발전설비용어는 번호, 약어, 원어, 한글풀이의 항목을 나타냅니다.
Author한국동서발전(주)
URLhttps://www.data.go.kr/data/15087680/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:21:23.441267
Analysis finished2023-12-12 13:21:24.309357
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct2465
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1233
Minimum1
Maximum2465
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2023-12-12T22:21:24.384816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile124.2
Q1617
median1233
Q31849
95-th percentile2341.8
Maximum2465
Range2464
Interquartile range (IQR)1232

Descriptive statistics

Standard deviation711.72853
Coefficient of variation (CV)0.5772332
Kurtosis-1.2
Mean1233
Median Absolute Deviation (MAD)616
Skewness0
Sum3039345
Variance506557.5
MonotonicityStrictly increasing
2023-12-12T22:21:24.555303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1647 1
 
< 0.1%
1640 1
 
< 0.1%
1641 1
 
< 0.1%
1642 1
 
< 0.1%
1643 1
 
< 0.1%
1644 1
 
< 0.1%
1645 1
 
< 0.1%
1646 1
 
< 0.1%
1648 1
 
< 0.1%
Other values (2455) 2455
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2465 1
< 0.1%
2464 1
< 0.1%
2463 1
< 0.1%
2462 1
< 0.1%
2461 1
< 0.1%
2460 1
< 0.1%
2459 1
< 0.1%
2458 1
< 0.1%
2457 1
< 0.1%
2456 1
< 0.1%

약어
Text

Distinct1895
Distinct (%)76.9%
Missing2
Missing (%)0.1%
Memory size19.4 KiB
2023-12-12T22:21:24.949075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length3.0235485
Min length1

Characters and Unicode

Total characters7447
Distinct characters71
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1564 ?
Unique (%)63.5%

Sample

1st rowA
2nd rowA
3rd rowA
4th rowA
5th rowA
ValueCountFrequency (%)
pc 11
 
0.4%
n 8
 
0.3%
s 8
 
0.3%
cr 7
 
0.3%
pi 7
 
0.3%
cc 7
 
0.3%
tc 7
 
0.3%
ms 7
 
0.3%
fc 7
 
0.3%
a 7
 
0.3%
Other values (1860) 2408
96.9%
2023-12-12T22:21:25.574050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 742
 
10.0%
S 714
 
9.6%
P 518
 
7.0%
T 513
 
6.9%
A 408
 
5.5%
R 372
 
5.0%
D 359
 
4.8%
M 358
 
4.8%
F 337
 
4.5%
B 309
 
4.1%
Other values (61) 2817
37.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 6991
93.9%
Lowercase Letter 256
 
3.4%
Other Punctuation 98
 
1.3%
Decimal Number 47
 
0.6%
Dash Punctuation 24
 
0.3%
Space Separator 21
 
0.3%
Close Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Other Letter 2
 
< 0.1%
Other Number 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 742
 
10.6%
S 714
 
10.2%
P 518
 
7.4%
T 513
 
7.3%
A 408
 
5.8%
R 372
 
5.3%
D 359
 
5.1%
M 358
 
5.1%
F 337
 
4.8%
B 309
 
4.4%
Other values (16) 2361
33.8%
Lowercase Letter
ValueCountFrequency (%)
i 26
 
10.2%
e 25
 
9.8%
a 22
 
8.6%
t 20
 
7.8%
n 19
 
7.4%
o 18
 
7.0%
r 14
 
5.5%
s 13
 
5.1%
u 11
 
4.3%
p 11
 
4.3%
Other values (14) 77
30.1%
Decimal Number
ValueCountFrequency (%)
1 17
36.2%
2 15
31.9%
0 7
14.9%
3 2
 
4.3%
6 2
 
4.3%
5 2
 
4.3%
4 1
 
2.1%
8 1
 
2.1%
Other Punctuation
ValueCountFrequency (%)
/ 79
80.6%
& 12
 
12.2%
, 4
 
4.1%
% 2
 
2.0%
. 1
 
1.0%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7245
97.3%
Common 200
 
2.7%
Hangul 2
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 742
 
10.2%
S 714
 
9.9%
P 518
 
7.1%
T 513
 
7.1%
A 408
 
5.6%
R 372
 
5.1%
D 359
 
5.0%
M 358
 
4.9%
F 337
 
4.7%
B 309
 
4.3%
Other values (39) 2615
36.1%
Common
ValueCountFrequency (%)
/ 79
39.5%
- 24
 
12.0%
21
 
10.5%
1 17
 
8.5%
2 15
 
7.5%
& 12
 
6.0%
0 7
 
3.5%
, 4
 
2.0%
) 3
 
1.5%
( 3
 
1.5%
Other values (10) 15
 
7.5%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7441
99.9%
Letterlike Symbols 2
 
< 0.1%
Hangul 2
 
< 0.1%
None 1
 
< 0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 742
 
10.0%
S 714
 
9.6%
P 518
 
7.0%
T 513
 
6.9%
A 408
 
5.5%
R 372
 
5.0%
D 359
 
4.8%
M 358
 
4.8%
F 337
 
4.5%
B 309
 
4.2%
Other values (56) 2811
37.8%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
² 1
100.0%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%

원어
Text

Distinct2367
Distinct (%)96.3%
Missing6
Missing (%)0.2%
Memory size19.4 KiB
2023-12-12T22:21:26.018849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length45
Mean length19.978447
Min length3

Characters and Unicode

Total characters49127
Distinct characters80
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2281 ?
Unique (%)92.8%

Sample

1st rowAlarm
2nd rowAmperemeter,Ampere
3rd rowAnalog
4th rowLogic Steady Signal For Auto (Sqc)
5th rowOutput(Apc)
ValueCountFrequency (%)
control 124
 
1.8%
system 120
 
1.8%
valve 78
 
1.2%
water 77
 
1.1%
pump 68
 
1.0%
relay 67
 
1.0%
air 62
 
0.9%
switch 59
 
0.9%
power 57
 
0.8%
55
 
0.8%
Other values (1915) 6014
88.7%
2023-12-12T22:21:26.700456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 5000
 
10.2%
4322
 
8.8%
r 3541
 
7.2%
t 3465
 
7.1%
i 3246
 
6.6%
a 3067
 
6.2%
n 3059
 
6.2%
o 3045
 
6.2%
l 2120
 
4.3%
u 1478
 
3.0%
Other values (70) 16784
34.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 37504
76.3%
Uppercase Letter 6860
 
14.0%
Space Separator 4322
 
8.8%
Other Punctuation 124
 
0.3%
Dash Punctuation 103
 
0.2%
Close Punctuation 93
 
0.2%
Open Punctuation 93
 
0.2%
Other Letter 15
 
< 0.1%
Decimal Number 6
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 5000
13.3%
r 3541
9.4%
t 3465
9.2%
i 3246
8.7%
a 3067
 
8.2%
n 3059
 
8.2%
o 3045
 
8.1%
l 2120
 
5.7%
u 1478
 
3.9%
s 1461
 
3.9%
Other values (16) 8022
21.4%
Uppercase Letter
ValueCountFrequency (%)
C 807
 
11.8%
S 779
 
11.4%
P 534
 
7.8%
T 471
 
6.9%
A 417
 
6.1%
D 357
 
5.2%
M 348
 
5.1%
F 337
 
4.9%
R 336
 
4.9%
O 319
 
4.7%
Other values (16) 2155
31.4%
Other Letter
ValueCountFrequency (%)
2
13.3%
2
13.3%
2
13.3%
2
13.3%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
Other Punctuation
ValueCountFrequency (%)
/ 40
32.3%
& 39
31.5%
, 31
25.0%
. 11
 
8.9%
' 2
 
1.6%
# 1
 
0.8%
Decimal Number
ValueCountFrequency (%)
2 4
66.7%
6 1
 
16.7%
1 1
 
16.7%
Math Symbol
ValueCountFrequency (%)
3
50.0%
+ 2
33.3%
= 1
 
16.7%
Space Separator
ValueCountFrequency (%)
4322
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 103
100.0%
Close Punctuation
ValueCountFrequency (%)
) 93
100.0%
Open Punctuation
ValueCountFrequency (%)
( 93
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 44364
90.3%
Common 4748
 
9.7%
Hangul 15
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 5000
 
11.3%
r 3541
 
8.0%
t 3465
 
7.8%
i 3246
 
7.3%
a 3067
 
6.9%
n 3059
 
6.9%
o 3045
 
6.9%
l 2120
 
4.8%
u 1478
 
3.3%
s 1461
 
3.3%
Other values (42) 14882
33.5%
Common
ValueCountFrequency (%)
4322
91.0%
- 103
 
2.2%
) 93
 
2.0%
( 93
 
2.0%
/ 40
 
0.8%
& 39
 
0.8%
, 31
 
0.7%
. 11
 
0.2%
2 4
 
0.1%
3
 
0.1%
Other values (7) 9
 
0.2%
Hangul
ValueCountFrequency (%)
2
13.3%
2
13.3%
2
13.3%
2
13.3%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49109
> 99.9%
Hangul 15
 
< 0.1%
Arrows 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 5000
 
10.2%
4322
 
8.8%
r 3541
 
7.2%
t 3465
 
7.1%
i 3246
 
6.6%
a 3067
 
6.2%
n 3059
 
6.2%
o 3045
 
6.2%
l 2120
 
4.3%
u 1478
 
3.0%
Other values (58) 16766
34.1%
Arrows
ValueCountFrequency (%)
3
100.0%
Hangul
ValueCountFrequency (%)
2
13.3%
2
13.3%
2
13.3%
2
13.3%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
1
6.7%
Distinct2364
Distinct (%)96.1%
Missing6
Missing (%)0.2%
Memory size19.4 KiB
2023-12-12T22:21:27.052765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length21
Mean length7.2326149
Min length1

Characters and Unicode

Total characters17785
Distinct characters640
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2281 ?
Unique (%)92.8%

Sample

1st row경보
2nd row전류계, 전류단위
3rd rowDIGITAL 아날로그
4th row자동준비신호
5th row출력
ValueCountFrequency (%)
40
 
0.8%
보일러 33
 
0.7%
스위치 32
 
0.7%
계전기 31
 
0.7%
터빈 30
 
0.6%
장치 30
 
0.6%
제어 29
 
0.6%
시스템 29
 
0.6%
주파수 28
 
0.6%
자동 27
 
0.6%
Other values (2785) 4410
93.5%
2023-12-12T22:21:27.593988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2260
 
12.7%
791
 
4.4%
572
 
3.2%
270
 
1.5%
265
 
1.5%
245
 
1.4%
244
 
1.4%
230
 
1.3%
204
 
1.1%
204
 
1.1%
Other values (630) 12500
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14600
82.1%
Space Separator 2260
 
12.7%
Uppercase Letter 301
 
1.7%
Other Punctuation 175
 
1.0%
Lowercase Letter 99
 
0.6%
Decimal Number 96
 
0.5%
Open Punctuation 90
 
0.5%
Close Punctuation 89
 
0.5%
Math Symbol 45
 
0.3%
Dash Punctuation 19
 
0.1%
Other values (4) 11
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
791
 
5.4%
572
 
3.9%
270
 
1.8%
265
 
1.8%
245
 
1.7%
244
 
1.7%
230
 
1.6%
204
 
1.4%
204
 
1.4%
200
 
1.4%
Other values (558) 11375
77.9%
Uppercase Letter
ValueCountFrequency (%)
C 29
 
9.6%
T 21
 
7.0%
P 19
 
6.3%
M 19
 
6.3%
S 19
 
6.3%
A 17
 
5.6%
L 17
 
5.6%
B 16
 
5.3%
V 15
 
5.0%
E 15
 
5.0%
Other values (13) 114
37.9%
Lowercase Letter
ValueCountFrequency (%)
e 13
13.1%
o 11
11.1%
n 10
10.1%
l 8
 
8.1%
t 7
 
7.1%
r 7
 
7.1%
a 6
 
6.1%
p 5
 
5.1%
s 4
 
4.0%
g 4
 
4.0%
Other values (11) 24
24.2%
Decimal Number
ValueCountFrequency (%)
0 41
42.7%
1 28
29.2%
2 15
 
15.6%
4 3
 
3.1%
3 3
 
3.1%
9 3
 
3.1%
5 2
 
2.1%
8 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 128
73.1%
/ 40
 
22.9%
. 5
 
2.9%
% 1
 
0.6%
& 1
 
0.6%
Other Symbol
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
Math Symbol
ValueCountFrequency (%)
= 36
80.0%
5
 
11.1%
+ 4
 
8.9%
Space Separator
ValueCountFrequency (%)
2260
100.0%
Open Punctuation
ValueCountFrequency (%)
( 90
100.0%
Close Punctuation
ValueCountFrequency (%)
) 89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Other Number
ValueCountFrequency (%)
² 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14539
81.7%
Common 2784
 
15.7%
Latin 401
 
2.3%
Han 61
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
791
 
5.4%
572
 
3.9%
270
 
1.9%
265
 
1.8%
245
 
1.7%
244
 
1.7%
230
 
1.6%
204
 
1.4%
204
 
1.4%
200
 
1.4%
Other values (507) 11314
77.8%
Han
ValueCountFrequency (%)
4
 
6.6%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.6%
1
 
1.6%
Other values (41) 41
67.2%
Latin
ValueCountFrequency (%)
C 29
 
7.2%
T 21
 
5.2%
P 19
 
4.7%
M 19
 
4.7%
S 19
 
4.7%
A 17
 
4.2%
L 17
 
4.2%
B 16
 
4.0%
V 15
 
3.7%
E 15
 
3.7%
Other values (35) 214
53.4%
Common
ValueCountFrequency (%)
2260
81.2%
, 128
 
4.6%
( 90
 
3.2%
) 89
 
3.2%
0 41
 
1.5%
/ 40
 
1.4%
= 36
 
1.3%
1 28
 
1.0%
- 19
 
0.7%
2 15
 
0.5%
Other values (17) 38
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14538
81.7%
ASCII 3170
 
17.8%
CJK 59
 
0.3%
CJK Compat 6
 
< 0.1%
Arrows 5
 
< 0.1%
None 2
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Number Forms 1
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2260
71.3%
, 128
 
4.0%
( 90
 
2.8%
) 89
 
2.8%
0 41
 
1.3%
/ 40
 
1.3%
= 36
 
1.1%
C 29
 
0.9%
1 28
 
0.9%
T 21
 
0.7%
Other values (54) 408
 
12.9%
Hangul
ValueCountFrequency (%)
791
 
5.4%
572
 
3.9%
270
 
1.9%
265
 
1.8%
245
 
1.7%
244
 
1.7%
230
 
1.6%
204
 
1.4%
204
 
1.4%
200
 
1.4%
Other values (506) 11313
77.8%
Arrows
ValueCountFrequency (%)
5
100.0%
CJK
ValueCountFrequency (%)
4
 
6.8%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
Other values (39) 39
66.1%
None
ValueCountFrequency (%)
² 2
100.0%
CJK Compat
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Number Forms
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2023-12-12T22:21:23.935637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T22:21:24.064039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:21:24.155872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:21:24.258321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호약어원어한글풀이
01AAlarm경보
12AAmperemeter,Ampere전류계, 전류단위
23AAnalogDIGITAL 아날로그
34ALogic Steady Signal For Auto (Sqc)자동준비신호
45AOutput(Apc)출력
56AAnalysis출력
67A C/FActivated Carbon filter활성탄 여과기
78A/BAirwaybill항공화물 수취증
89A/BB개 입력중 A개 이상이면 만족<NA>
910A/D CAnalog/Digital ConverterA.D변환기
번호약어원어한글풀이
24552456ZBBZero Base Budget예산편성방법
24562457ZCTZero Current Transformer영상변류기
24572458ZDZero Defect Movement무결점 운동
24582459dGOUpper Limitation상한치
24592460dGULower Limitation하한치
24602461dWSValve Added To WsWS 첨가값
24612462nNano10억분의 1배
24622463pFPico Farad용량 단위
24632464t1Switch On Delay (Sqc)한시동작
24642465t2Switch Of Delay (Sqc)한시복귀