Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 2465 |
Missing cells | 14 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 79.6 KiB |
Average record size in memory | 33.1 B |
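The derived figures in the table above follow from the raw counts by simple arithmetic. The sketch below recomputes them from numbers quoted elsewhere in this report (the per-column missing counts come from the variable sections). The variable names are illustrative and are not part of ydata-profiling.

```python
# Figures copied from the tables in this report.
n_rows = 2465
n_cols = 4
missing_per_column = {"번호": 0, "약어": 2, "원어": 6, "한글풀이": 6}
total_size_kib = 79.6

# Missing cells are summed over columns; the percentage is relative to all cells.
missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / (n_rows * n_cols)

# Average record size: total memory (KiB converted to bytes) divided by row count.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(missing_cells)
print(f"{missing_pct:.1f}%")
print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place, these values agree with the "Missing cells", "Missing cells (%)" and "Average record size in memory" rows.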
Variable types
Numeric | 1 |
---|---|
Text | 3 |
Dataset
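Description | Provides the power generation equipment glossary of Korea East-West Power (한국동서발전). Each glossary entry has four fields: 번호 (number), 약어 (abbreviation), 원어 (original term), and 한글풀이 (Korean explanation). |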
---|---|
Author | 한국동서발전(주) |
URL | https://www.data.go.kr/data/15087680/fileData.do |
번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 13:21:23.441267 |
---|---|
Analysis finished | 2023-12-12 13:21:24.309357 |
Duration | 0.87 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 2465 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1233 |
Minimum | 1 |
---|---|
Maximum | 2465 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 21.8 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 124.2 |
Q1 | 617 |
median | 1233 |
Q3 | 1849 |
95-th percentile | 2341.8 |
Maximum | 2465 |
Range | 2464 |
Interquartile range (IQR) | 1232 |
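The quantile figures above are consistent with linear interpolation between neighbouring order statistics. The sketch below is a minimal check, assuming that 번호 holds exactly the integers 1 to 2465; this is inferred from the minimum, maximum, distinct count and monotonicity in this report rather than stated outright. The `percentile` helper is hypothetical and is not the profiler's own implementation.

```python
import math

def percentile(sorted_values, q):
    """Linear-interpolation percentile for q in [0, 1] on pre-sorted data."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

# Assumption: 번호 is the sequence 1, 2, ..., 2465.
values = list(range(1, 2466))

q05, q25, q50, q75, q95 = (percentile(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95))
value_range = values[-1] - values[0]
iqr = q75 - q25

print(round(q05, 1), q25, q50, q75, round(q95, 1), value_range, iqr)
```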
Descriptive statistics
Standard deviation | 711.72853 |
---|---|
Coefficient of variation (CV) | 0.5772332 |
Kurtosis | -1.2 |
Mean | 1233 |
Median Absolute Deviation (MAD) | 616 |
Skewness | 0 |
Sum | 3039345 |
Variance | 506557.5 |
Monotonicity | Strictly increasing |
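Most of the descriptive statistics above can be reproduced with the Python standard library. The sketch assumes 번호 is the sequence 1 to 2465 (inferred from this report, not confirmed by the source) and uses the sample variance with an n - 1 denominator, which is the convention that matches the reported value. Skewness and kurtosis are not covered here.

```python
import statistics

# Assumption: 번호 is the sequence 1, 2, ..., 2465.
values = list(range(1, 2466))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std_dev, 5), round(cv, 7), mad)
```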
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
1647 | 1 | < 0.1% |
1640 | 1 | < 0.1% |
1641 | 1 | < 0.1% |
1642 | 1 | < 0.1% |
1643 | 1 | < 0.1% |
1644 | 1 | < 0.1% |
1645 | 1 | < 0.1% |
1646 | 1 | < 0.1% |
1648 | 1 | < 0.1% |
Other values (2455) | 2455 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
2465 | 1 | |
2464 | 1 | |
2463 | 1 | |
2462 | 1 | |
2461 | 1 | |
2460 | 1 | |
2459 | 1 | |
2458 | 1 | |
2457 | 1 | |
2456 | 1 |
약어
Text
Distinct | 1895 |
---|---|
Distinct (%) | 76.9% |
Missing | 2 |
Missing (%) | 0.1% |
Memory size | 19.4 KiB |
Value | Count | Frequency (%) |
pc | 11 | 0.4% |
n | 8 | 0.3% |
s | 8 | 0.3% |
cr | 7 | 0.3% |
pi | 7 | 0.3% |
cc | 7 | 0.3% |
tc | 7 | 0.3% |
ms | 7 | 0.3% |
fc | 7 | 0.3% |
a | 7 | 0.3% |
Other values (1860) | 2408 |
Most occurring characters
Value | Count | Frequency (%) |
C | 742 | 10.0% |
S | 714 | 9.6% |
P | 518 | 7.0% |
T | 513 | 6.9% |
A | 408 | 5.5% |
R | 372 | 5.0% |
D | 359 | 4.8% |
M | 358 | 4.8% |
F | 337 | 4.5% |
B | 309 | 4.1% |
Other values (61) | 2817 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 6991 | |
Lowercase Letter | 256 | 3.4% |
Other Punctuation | 98 | 1.3% |
Decimal Number | 47 | 0.6% |
Dash Punctuation | 24 | 0.3% |
Space Separator | 21 | 0.3% |
Close Punctuation | 3 | < 0.1% |
Open Punctuation | 3 | < 0.1% |
Other Letter | 2 | < 0.1% |
Other Number | 1 | < 0.1% |
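The category names in the table above are Unicode general categories. As a rough illustration of how such a tally can be produced, the sketch below classifies each character with `unicodedata.category` and maps the two-letter code to the long name used in this report. The `category_counts` helper and the `CATEGORY_NAMES` table are hypothetical, and the sample strings are a handful of 약어 values taken from the sample rows in this report, so the counts will not match the full-column totals.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes mapped to the names shown in the report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "No": "Other Number",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "So": "Other Symbol",
}

def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# A few 약어 values from the sample rows of this report.
sample = ["A C/F", "A/D C", "dGO", "pF", "t1"]
counts = category_counts(sample)

for name, count in counts.most_common():
    print(name, count)
```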
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 742 | 10.6% |
S | 714 | 10.2% |
P | 518 | 7.4% |
T | 513 | 7.3% |
A | 408 | 5.8% |
R | 372 | 5.3% |
D | 359 | 5.1% |
M | 358 | 5.1% |
F | 337 | 4.8% |
B | 309 | 4.4% |
Other values (16) | 2361 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 26 | 10.2% |
e | 25 | 9.8% |
a | 22 | 8.6% |
t | 20 | 7.8% |
n | 19 | 7.4% |
o | 18 | 7.0% |
r | 14 | 5.5% |
s | 13 | 5.1% |
u | 11 | 4.3% |
p | 11 | 4.3% |
Other values (14) | 77 |
Decimal Number
Value | Count | Frequency (%) |
1 | 17 | |
2 | 15 | |
0 | 7 | |
3 | 2 | 4.3% |
6 | 2 | 4.3% |
5 | 2 | 4.3% |
4 | 1 | 2.1% |
8 | 1 | 2.1% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 79 | |
& | 12 | 12.2% |
, | 4 | 4.1% |
% | 2 | 2.0% |
. | 1 | 1.0% |
Other Letter
Value | Count | Frequency (%) |
접 | 1 | |
점 | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 24 |
Space Separator
Value | Count | Frequency (%) |
(space) | 21 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3 |
Other Number
Value | Count | Frequency (%) |
² | 1 |
Other Symbol
Value | Count | Frequency (%) |
㎥ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 7245 | |
Common | 200 | 2.7% |
Hangul | 2 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 742 | 10.2% |
S | 714 | 9.9% |
P | 518 | 7.1% |
T | 513 | 7.1% |
A | 408 | 5.6% |
R | 372 | 5.1% |
D | 359 | 5.0% |
M | 358 | 4.9% |
F | 337 | 4.7% |
B | 309 | 4.3% |
Other values (39) | 2615 |
Common
Value | Count | Frequency (%) |
/ | 79 | |
- | 24 | 12.0% |
(space) | 21 | 10.5% |
1 | 17 | 8.5% |
2 | 15 | 7.5% |
& | 12 | 6.0% |
0 | 7 | 3.5% |
, | 4 | 2.0% |
) | 3 | 1.5% |
( | 3 | 1.5% |
Other values (10) | 15 | 7.5% |
Hangul
Value | Count | Frequency (%) |
접 | 1 | |
점 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7441 | |
Letterlike Symbols | 2 | < 0.1% |
Hangul | 2 | < 0.1% |
None | 1 | < 0.1% |
CJK Compat | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 742 | 10.0% |
S | 714 | 9.6% |
P | 518 | 7.0% |
T | 513 | 6.9% |
A | 408 | 5.5% |
R | 372 | 5.0% |
D | 359 | 4.8% |
M | 358 | 4.8% |
F | 337 | 4.5% |
B | 309 | 4.2% |
Other values (56) | 2811 |
Letterlike Symbols
Value | Count | Frequency (%) |
ℓ | 2 |
None
Value | Count | Frequency (%) |
² | 1 |
Hangul
Value | Count | Frequency (%) |
접 | 1 | |
점 | 1 |
CJK Compat
Value | Count | Frequency (%) |
㎥ | 1 |
원어
Text
Distinct | 2367 |
---|---|
Distinct (%) | 96.3% |
Missing | 6 |
Missing (%) | 0.2% |
Memory size | 19.4 KiB |
Length
Max length | 65 |
---|---|
Median length | 45 |
Mean length | 19.978447 |
Min length | 3 |
Characters and Unicode
Total characters | 49127 |
---|---|
Distinct characters | 80 |
Distinct categories | 11 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 2281 |
---|---|
Unique (%) | 92.8% |
Sample
1st row | Alarm |
---|---|
2nd row | Amperemeter,Ampere |
3rd row | Analog |
4th row | Logic Steady Signal For Auto (Sqc) |
5th row | Output(Apc) |
Value | Count | Frequency (%) |
control | 124 | 1.8% |
system | 120 | 1.8% |
valve | 78 | 1.2% |
water | 77 | 1.1% |
pump | 68 | 1.0% |
relay | 67 | 1.0% |
air | 62 | 0.9% |
switch | 59 | 0.9% |
power | 57 | 0.8% |
55 | 0.8% | |
Other values (1915) | 6014 |
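The table above counts lowercased words rather than whole cell values. A minimal sketch of that kind of tally is shown below, assuming whitespace tokenization followed by stripping of surrounding punctuation. The exact tokenization used by the profiler may differ, and the `word_counts` helper is hypothetical. The sample strings are 원어 values from the sample rows of this report.

```python
import string
from collections import Counter

def word_counts(texts):
    """Count lowercased, punctuation-stripped whitespace tokens."""
    counts = Counter()
    for text in texts:
        for token in text.lower().split():
            word = token.strip(string.punctuation)
            if word:  # skip tokens that were punctuation only
                counts[word] += 1
    return counts

# A few 원어 values from the sample rows of this report.
sample = [
    "Zero Base Budget",
    "Zero Current Transformer",
    "Zero Defect Movement",
    "Switch On Delay (Sqc)",
    "Switch Of Delay (Sqc)",
]
counts = word_counts(sample)

for word, count in counts.most_common(4):
    print(word, count)
```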
Most occurring characters
Value | Count | Frequency (%) |
e | 5000 | 10.2% |
(space) | 4322 | 8.8% |
r | 3541 | 7.2% |
t | 3465 | 7.1% |
i | 3246 | 6.6% |
a | 3067 | 6.2% |
n | 3059 | 6.2% |
o | 3045 | 6.2% |
l | 2120 | 4.3% |
u | 1478 | 3.0% |
Other values (70) | 16784 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 37504 | |
Uppercase Letter | 6860 | 14.0% |
Space Separator | 4322 | 8.8% |
Other Punctuation | 124 | 0.3% |
Dash Punctuation | 103 | 0.2% |
Close Punctuation | 93 | 0.2% |
Open Punctuation | 93 | 0.2% |
Other Letter | 15 | < 0.1% |
Decimal Number | 6 | < 0.1% |
Math Symbol | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 5000 | |
r | 3541 | |
t | 3465 | |
i | 3246 | |
a | 3067 | 8.2% |
n | 3059 | 8.2% |
o | 3045 | 8.1% |
l | 2120 | 5.7% |
u | 1478 | 3.9% |
s | 1461 | 3.9% |
Other values (16) | 8022 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 807 | 11.8% |
S | 779 | 11.4% |
P | 534 | 7.8% |
T | 471 | 6.9% |
A | 417 | 6.1% |
D | 357 | 5.2% |
M | 348 | 5.1% |
F | 337 | 4.9% |
R | 336 | 4.9% |
O | 319 | 4.7% |
Other values (16) | 2155 |
Other Letter
Value | Count | Frequency (%) |
는 | 2 | |
또 | 2 | |
이 | 2 | |
개 | 2 | |
족 | 1 | |
만 | 1 | |
면 | 1 | |
상 | 1 | |
중 | 1 | |
입 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 40 | |
& | 39 | |
, | 31 | |
. | 11 | 8.9% |
' | 2 | 1.6% |
# | 1 | 0.8% |
Decimal Number
Value | Count | Frequency (%) |
2 | 4 | |
6 | 1 | 16.7% |
1 | 1 | 16.7% |
Math Symbol
Value | Count | Frequency (%) |
↔ | 3 | |
+ | 2 | |
= | 1 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
(space) | 4322 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 103 |
Close Punctuation
Value | Count | Frequency (%) |
) | 93 |
Open Punctuation
Value | Count | Frequency (%) |
( | 93 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 44364 | |
Common | 4748 | 9.7% |
Hangul | 15 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 5000 | 11.3% |
r | 3541 | 8.0% |
t | 3465 | 7.8% |
i | 3246 | 7.3% |
a | 3067 | 6.9% |
n | 3059 | 6.9% |
o | 3045 | 6.9% |
l | 2120 | 4.8% |
u | 1478 | 3.3% |
s | 1461 | 3.3% |
Other values (42) | 14882 |
Common
Value | Count | Frequency (%) |
(space) | 4322 | |
- | 103 | 2.2% |
) | 93 | 2.0% |
( | 93 | 2.0% |
/ | 40 | 0.8% |
& | 39 | 0.8% |
, | 31 | 0.7% |
. | 11 | 0.2% |
2 | 4 | 0.1% |
↔ | 3 | 0.1% |
Other values (7) | 9 | 0.2% |
Hangul
Value | Count | Frequency (%) |
는 | 2 | |
또 | 2 | |
이 | 2 | |
개 | 2 | |
족 | 1 | |
만 | 1 | |
면 | 1 | |
상 | 1 | |
중 | 1 | |
입 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49109 | |
Hangul | 15 | < 0.1% |
Arrows | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 5000 | 10.2% |
(space) | 4322 | 8.8% |
r | 3541 | 7.2% |
t | 3465 | 7.1% |
i | 3246 | 6.6% |
a | 3067 | 6.2% |
n | 3059 | 6.2% |
o | 3045 | 6.2% |
l | 2120 | 4.3% |
u | 1478 | 3.0% |
Other values (58) | 16766 |
Arrows
Value | Count | Frequency (%) |
↔ | 3 |
Hangul
Value | Count | Frequency (%) |
는 | 2 | |
또 | 2 | |
이 | 2 | |
개 | 2 | |
족 | 1 | |
만 | 1 | |
면 | 1 | |
상 | 1 | |
중 | 1 | |
입 | 1 |
한글풀이
Text
Distinct | 2364 |
---|---|
Distinct (%) | 96.1% |
Missing | 6 |
Missing (%) | 0.2% |
Memory size | 19.4 KiB |
Value | Count | Frequency (%) |
40 | 0.8% | |
보일러 | 33 | 0.7% |
스위치 | 32 | 0.7% |
계전기 | 31 | 0.7% |
터빈 | 30 | 0.6% |
장치 | 30 | 0.6% |
제어 | 29 | 0.6% |
시스템 | 29 | 0.6% |
주파수 | 28 | 0.6% |
자동 | 27 | 0.6% |
Other values (2785) | 4410 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 2260 | 12.7% |
기 | 791 | 4.4% |
전 | 572 | 3.2% |
수 | 270 | 1.5% |
제 | 265 | 1.5% |
계 | 245 | 1.4% |
시 | 244 | 1.4% |
스 | 230 | 1.3% |
치 | 204 | 1.1% |
동 | 204 | 1.1% |
Other values (630) | 12500 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 14600 | |
Space Separator | 2260 | 12.7% |
Uppercase Letter | 301 | 1.7% |
Other Punctuation | 175 | 1.0% |
Lowercase Letter | 99 | 0.6% |
Decimal Number | 96 | 0.5% |
Open Punctuation | 90 | 0.5% |
Close Punctuation | 89 | 0.5% |
Math Symbol | 45 | 0.3% |
Dash Punctuation | 19 | 0.1% |
Other values (4) | 11 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 791 | 5.4% |
전 | 572 | 3.9% |
수 | 270 | 1.8% |
제 | 265 | 1.8% |
계 | 245 | 1.7% |
시 | 244 | 1.7% |
스 | 230 | 1.6% |
치 | 204 | 1.4% |
동 | 204 | 1.4% |
장 | 200 | 1.4% |
Other values (558) | 11375 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 29 | 9.6% |
T | 21 | 7.0% |
P | 19 | 6.3% |
M | 19 | 6.3% |
S | 19 | 6.3% |
A | 17 | 5.6% |
L | 17 | 5.6% |
B | 16 | 5.3% |
V | 15 | 5.0% |
E | 15 | 5.0% |
Other values (13) | 114 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 13 | |
o | 11 | |
n | 10 | |
l | 8 | 8.1% |
t | 7 | 7.1% |
r | 7 | 7.1% |
a | 6 | 6.1% |
p | 5 | 5.1% |
s | 4 | 4.0% |
g | 4 | 4.0% |
Other values (11) | 24 |
Decimal Number
Value | Count | Frequency (%) |
0 | 41 | |
1 | 28 | |
2 | 15 | 15.6% |
4 | 3 | 3.1% |
3 | 3 | 3.1% |
9 | 3 | 3.1% |
5 | 2 | 2.1% |
8 | 1 | 1.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 128 | |
/ | 40 | 22.9% |
. | 5 | 2.9% |
% | 1 | 0.6% |
& | 1 | 0.6% |
Other Symbol
Value | Count | Frequency (%) |
㎥ | 2 | |
㎾ | 2 | |
㎏ | 1 | |
㎘ | 1 | |
℃ | 1 |
Math Symbol
Value | Count | Frequency (%) |
= | 36 | |
↔ | 5 | 11.1% |
+ | 4 | 8.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 2260 |
Open Punctuation
Value | Count | Frequency (%) |
( | 90 |
Close Punctuation
Value | Count | Frequency (%) |
) | 89 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19 |
Other Number
Value | Count | Frequency (%) |
² | 2 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 14539 | |
Common | 2784 | 15.7% |
Latin | 401 | 2.3% |
Han | 61 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 791 | 5.4% |
전 | 572 | 3.9% |
수 | 270 | 1.9% |
제 | 265 | 1.8% |
계 | 245 | 1.7% |
시 | 244 | 1.7% |
스 | 230 | 1.6% |
치 | 204 | 1.4% |
동 | 204 | 1.4% |
장 | 200 | 1.4% |
Other values (507) | 11314 |
Han
Value | Count | Frequency (%) |
水 | 4 | 6.6% |
開 | 2 | 3.3% |
分 | 2 | 3.3% |
主 | 2 | 3.3% |
防 | 2 | 3.3% |
相 | 2 | 3.3% |
炭 | 2 | 3.3% |
差 | 2 | 3.3% |
膜 | 1 | 1.6% |
頭 | 1 | 1.6% |
Other values (41) | 41 |
Latin
Value | Count | Frequency (%) |
C | 29 | 7.2% |
T | 21 | 5.2% |
P | 19 | 4.7% |
M | 19 | 4.7% |
S | 19 | 4.7% |
A | 17 | 4.2% |
L | 17 | 4.2% |
B | 16 | 4.0% |
V | 15 | 3.7% |
E | 15 | 3.7% |
Other values (35) | 214 |
Common
Value | Count | Frequency (%) |
(space) | 2260 | |
, | 128 | 4.6% |
( | 90 | 3.2% |
) | 89 | 3.2% |
0 | 41 | 1.5% |
/ | 40 | 1.4% |
= | 36 | 1.3% |
1 | 28 | 1.0% |
- | 19 | 0.7% |
2 | 15 | 0.5% |
Other values (17) | 38 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 14538 | |
ASCII | 3170 | 17.8% |
CJK | 59 | 0.3% |
CJK Compat | 6 | < 0.1% |
Arrows | 5 | < 0.1% |
None | 2 | < 0.1% |
CJK Compat Ideographs | 2 | < 0.1% |
Number Forms | 1 | < 0.1% |
Letterlike Symbols | 1 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 2260 | |
, | 128 | 4.0% |
( | 90 | 2.8% |
) | 89 | 2.8% |
0 | 41 | 1.3% |
/ | 40 | 1.3% |
= | 36 | 1.1% |
C | 29 | 0.9% |
1 | 28 | 0.9% |
T | 21 | 0.7% |
Other values (54) | 408 | 12.9% |
Hangul
Value | Count | Frequency (%) |
기 | 791 | 5.4% |
전 | 572 | 3.9% |
수 | 270 | 1.9% |
제 | 265 | 1.8% |
계 | 245 | 1.7% |
시 | 244 | 1.7% |
스 | 230 | 1.6% |
치 | 204 | 1.4% |
동 | 204 | 1.4% |
장 | 200 | 1.4% |
Other values (506) | 11313 |
Arrows
Value | Count | Frequency (%) |
↔ | 5 |
CJK
Value | Count | Frequency (%) |
水 | 4 | 6.8% |
開 | 2 | 3.4% |
分 | 2 | 3.4% |
主 | 2 | 3.4% |
防 | 2 | 3.4% |
相 | 2 | 3.4% |
炭 | 2 | 3.4% |
差 | 2 | 3.4% |
膜 | 1 | 1.7% |
頭 | 1 | 1.7% |
Other values (39) | 39 |
None
Value | Count | Frequency (%) |
² | 2 |
CJK Compat
Value | Count | Frequency (%) |
㎥ | 2 | |
㎾ | 2 | |
㎏ | 1 | |
㎘ | 1 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
Letterlike Symbols
Value | Count | Frequency (%) |
℃ | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㅍ | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
曆 | 1 | |
不 | 1 |
First rows
(index) | 번호 | 약어 | 원어 | 한글풀이 |
---|---|---|---|---|
0 | 1 | A | Alarm | 경보 |
1 | 2 | A | Amperemeter,Ampere | 전류계, 전류단위 |
2 | 3 | A | Analog | DIGITAL 아날로그 |
3 | 4 | A | Logic Steady Signal For Auto (Sqc) | 자동준비신호 |
4 | 5 | A | Output(Apc) | 출력 |
5 | 6 | A | Analysis | 출력 |
6 | 7 | A C/F | Activated Carbon filter | 활성탄 여과기 |
7 | 8 | A/B | Airwaybill | 항공화물 수취증 |
8 | 9 | A/B | B개 입력중 A개 이상이면 만족 | <NA> |
9 | 10 | A/D C | Analog/Digital Converter | A.D변환기 |
Last rows
(index) | 번호 | 약어 | 원어 | 한글풀이 |
---|---|---|---|---|
2455 | 2456 | ZBB | Zero Base Budget | 예산편성방법 |
2456 | 2457 | ZCT | Zero Current Transformer | 영상변류기 |
2457 | 2458 | ZD | Zero Defect Movement | 무결점 운동 |
2458 | 2459 | dGO | Upper Limitation | 상한치 |
2459 | 2460 | dGU | Lower Limitation | 하한치 |
2460 | 2461 | dWS | Valve Added To Ws | WS 첨가값 |
2461 | 2462 | n | Nano | 10억분의 1배 |
2462 | 2463 | pF | Pico Farad | 용량 단위 |
2463 | 2464 | t1 | Switch On Delay (Sqc) | 한시동작 |
2464 | 2465 | t2 | Switch Of Delay (Sqc) | 한시복귀 |