Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 3316 |
Missing cells | 8 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 107.0 KiB |
Average record size in memory | 33.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 3 |
Dataset
Description | 한국수력원자력 원자력기술정보시스템의 중수로관련 약어집 데이터입니다. 중수로관련 약자와 약자풀이를 제공합니다. |
---|---|
URL | https://www.data.go.kr/data/15117245/fileData.do |
문서번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 01:59:47.315937 |
---|---|
Analysis finished | 2023-12-12 01:59:48.170142 |
Duration | 0.85 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
문서번호
Real number (ℝ)
UNIQUE
 
Distinct | 3316 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1658.5 |
Minimum | 1 |
---|---|
Maximum | 3316 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 29.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 166.75 |
Q1 | 829.75 |
median | 1658.5 |
Q3 | 2487.25 |
95-th percentile | 3150.25 |
Maximum | 3316 |
Range | 3315 |
Interquartile range (IQR) | 1657.5 |
Descriptive statistics
Standard deviation | 957.39107 |
---|---|
Coefficient of variation (CV) | 0.57726323 |
Kurtosis | -1.2 |
Mean | 1658.5 |
Median Absolute Deviation (MAD) | 829 |
Skewness | 0 |
Sum | 5499586 |
Variance | 916597.67 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2217 | 1 | < 0.1% |
2207 | 1 | < 0.1% |
2208 | 1 | < 0.1% |
2209 | 1 | < 0.1% |
2210 | 1 | < 0.1% |
2211 | 1 | < 0.1% |
2212 | 1 | < 0.1% |
2213 | 1 | < 0.1% |
2214 | 1 | < 0.1% |
Other values (3306) | 3306 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
3316 | 1 | |
3315 | 1 | |
3314 | 1 | |
3313 | 1 | |
3312 | 1 | |
3311 | 1 | |
3310 | 1 | |
3309 | 1 | |
3308 | 1 | |
3307 | 1 |
영문용어
Text
Distinct | 3280 |
---|---|
Distinct (%) | 99.0% |
Missing | 3 |
Missing (%) | 0.1% |
Memory size | 26.0 KiB |
Length
Max length | 136 |
---|---|
Median length | 70 |
Mean length | 26.971929 |
Min length | 2 |
Characters and Unicode
Total characters | 89358 |
---|---|
Distinct characters | 99 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 3 ? |
Unique
Unique | 3247 ? |
---|---|
Unique (%) | 98.0% |
Sample
1st row | Atomics International |
---|---|
2nd row | Analog Input |
3rd row | Authorized Inspection Agency |
4th row | American Institute of Architects |
5th row | Accelerator Information Center |
Value | Count | Frequency (%) |
reactor | 310 | 2.7% |
system | 303 | 2.6% |
of | 204 | 1.8% |
nuclear | 183 | 1.6% |
and | 142 | 1.2% |
test | 134 | 1.2% |
power | 113 | 1.0% |
energy | 110 | 1.0% |
water | 108 | 0.9% |
control | 104 | 0.9% |
Other values (2660) | 9747 |
Most occurring characters
Value | Count | Frequency (%) |
e | 9232 | 10.3% |
8180 | 9.2% | |
t | 6578 | 7.4% |
a | 6171 | 6.9% |
r | 5962 | 6.7% |
i | 5868 | 6.6% |
n | 5845 | 6.5% |
o | 5632 | 6.3% |
s | 3504 | 3.9% |
l | 3355 | 3.8% |
Other values (89) | 29031 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 70090 | |
Uppercase Letter | 10359 | 11.6% |
Space Separator | 8180 | 9.2% |
Other Punctuation | 295 | 0.3% |
Dash Punctuation | 237 | 0.3% |
Open Punctuation | 63 | 0.1% |
Close Punctuation | 63 | 0.1% |
Decimal Number | 44 | < 0.1% |
Other Letter | 24 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 9232 | |
t | 6578 | |
a | 6171 | |
r | 5962 | |
i | 5868 | 8.4% |
n | 5845 | 8.3% |
o | 5632 | 8.0% |
s | 3504 | 5.0% |
l | 3355 | 4.8% |
c | 3255 | 4.6% |
Other values (17) | 14688 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 1130 | 10.9% |
C | 1118 | 10.8% |
R | 849 | 8.2% |
P | 802 | 7.7% |
A | 798 | 7.7% |
E | 660 | 6.4% |
T | 551 | 5.3% |
I | 532 | 5.1% |
F | 485 | 4.7% |
D | 473 | 4.6% |
Other values (16) | 2961 |
Other Letter
Value | Count | Frequency (%) |
고 | 2 | 8.3% |
회 | 2 | 8.3% |
유 | 1 | 4.2% |
증 | 1 | 4.2% |
라 | 1 | 4.2% |
테 | 1 | 4.2% |
수 | 1 | 4.2% |
기 | 1 | 4.2% |
압 | 1 | 4.2% |
커 | 1 | 4.2% |
Other values (12) | 12 |
Decimal Number
Value | Count | Frequency (%) |
0 | 9 | |
1 | 7 | |
5 | 6 | |
2 | 6 | |
3 | 4 | |
9 | 4 | |
6 | 3 | 6.8% |
7 | 2 | 4.5% |
4 | 2 | 4.5% |
8 | 1 | 2.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 108 | |
, | 90 | |
' | 46 | |
/ | 24 | 8.1% |
& | 21 | 7.1% |
% | 3 | 1.0% |
# | 2 | 0.7% |
; | 1 | 0.3% |
Space Separator
Value | Count | Frequency (%) |
8180 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 237 |
Open Punctuation
Value | Count | Frequency (%) |
( | 63 |
Close Punctuation
Value | Count | Frequency (%) |
) | 63 |
Math Symbol
Value | Count | Frequency (%) |
= | 2 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 80448 | |
Common | 8885 | 9.9% |
Hangul | 24 | < 0.1% |
Greek | 1 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 9232 | 11.5% |
t | 6578 | 8.2% |
a | 6171 | 7.7% |
r | 5962 | 7.4% |
i | 5868 | 7.3% |
n | 5845 | 7.3% |
o | 5632 | 7.0% |
s | 3504 | 4.4% |
l | 3355 | 4.2% |
c | 3255 | 4.0% |
Other values (42) | 25046 |
Common
Value | Count | Frequency (%) |
8180 | ||
- | 237 | 2.7% |
. | 108 | 1.2% |
, | 90 | 1.0% |
( | 63 | 0.7% |
) | 63 | 0.7% |
' | 46 | 0.5% |
/ | 24 | 0.3% |
& | 21 | 0.2% |
0 | 9 | 0.1% |
Other values (14) | 44 | 0.5% |
Hangul
Value | Count | Frequency (%) |
고 | 2 | 8.3% |
회 | 2 | 8.3% |
유 | 1 | 4.2% |
증 | 1 | 4.2% |
라 | 1 | 4.2% |
테 | 1 | 4.2% |
수 | 1 | 4.2% |
기 | 1 | 4.2% |
압 | 1 | 4.2% |
커 | 1 | 4.2% |
Other values (12) | 12 |
Greek
Value | Count | Frequency (%) |
β | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 89333 | |
Hangul | 24 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 9232 | 10.3% |
8180 | 9.2% | |
t | 6578 | 7.4% |
a | 6171 | 6.9% |
r | 5962 | 6.7% |
i | 5868 | 6.6% |
n | 5845 | 6.5% |
o | 5632 | 6.3% |
s | 3504 | 3.9% |
l | 3355 | 3.8% |
Other values (66) | 29006 |
Hangul
Value | Count | Frequency (%) |
고 | 2 | 8.3% |
회 | 2 | 8.3% |
유 | 1 | 4.2% |
증 | 1 | 4.2% |
라 | 1 | 4.2% |
테 | 1 | 4.2% |
수 | 1 | 4.2% |
기 | 1 | 4.2% |
압 | 1 | 4.2% |
커 | 1 | 4.2% |
Other values (12) | 12 |
None
Value | Count | Frequency (%) |
β | 1 |
한글용어
Text
Distinct | 3143 |
---|---|
Distinct (%) | 94.8% |
Missing | 2 |
Missing (%) | 0.1% |
Memory size | 26.0 KiB |
Length
Max length | 63 |
---|---|
Median length | 41 |
Mean length | 10.713941 |
Min length | 1 |
Characters and Unicode
Total characters | 35506 |
---|---|
Distinct characters | 618 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 3049 ? |
---|---|
Unique (%) | 92.0% |
Sample
1st row | 회사명(미국) |
---|---|
2nd row | 아날로그 입력신호 |
3rd row | 공인 검사 기관 |
4th row | 미국 건축 기구 |
5th row | 가속기 정보 센터(미국) |
Value | Count | Frequency (%) |
계통 | 259 | 2.6% |
원자력 | 203 | 2.1% |
원자로 | 151 | 1.5% |
시험 | 112 | 1.1% |
핵연료 | 94 | 1.0% |
단위 | 70 | 0.7% |
및 | 67 | 0.7% |
미국 | 65 | 0.7% |
안전 | 65 | 0.7% |
시설 | 62 | 0.6% |
Other values (3121) | 8693 |
Most occurring characters
Value | Count | Frequency (%) |
6535 | 18.4% | |
( | 855 | 2.4% |
) | 855 | 2.4% |
기 | 790 | 2.2% |
자 | 686 | 1.9% |
원 | 598 | 1.7% |
계 | 561 | 1.6% |
국 | 535 | 1.5% |
전 | 491 | 1.4% |
사 | 458 | 1.3% |
Other values (608) | 23142 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 26008 | |
Space Separator | 6535 | 18.4% |
Open Punctuation | 870 | 2.5% |
Close Punctuation | 870 | 2.5% |
Uppercase Letter | 528 | 1.5% |
Lowercase Letter | 276 | 0.8% |
Other Punctuation | 221 | 0.6% |
Decimal Number | 166 | 0.5% |
Dash Punctuation | 21 | 0.1% |
Math Symbol | 11 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 790 | 3.0% |
자 | 686 | 2.6% |
원 | 598 | 2.3% |
계 | 561 | 2.2% |
국 | 535 | 2.1% |
전 | 491 | 1.9% |
사 | 458 | 1.8% |
로 | 433 | 1.7% |
력 | 403 | 1.5% |
수 | 403 | 1.5% |
Other values (533) | 20650 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 59 | |
E | 56 | |
N | 47 | 8.9% |
I | 44 | 8.3% |
C | 42 | 8.0% |
L | 33 | 6.2% |
S | 33 | 6.2% |
R | 32 | 6.1% |
O | 32 | 6.1% |
D | 24 | 4.5% |
Other values (16) | 126 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 33 | |
a | 33 | |
o | 28 | |
n | 27 | 9.8% |
l | 19 | 6.9% |
r | 18 | 6.5% |
t | 14 | 5.1% |
s | 13 | 4.7% |
h | 11 | 4.0% |
u | 11 | 4.0% |
Other values (15) | 69 |
Decimal Number
Value | Count | Frequency (%) |
1 | 65 | |
0 | 31 | |
2 | 25 | 15.1% |
3 | 14 | 8.4% |
9 | 9 | 5.4% |
5 | 6 | 3.6% |
6 | 6 | 3.6% |
8 | 5 | 3.0% |
4 | 3 | 1.8% |
7 | 2 | 1.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 151 | |
· | 40 | 18.1% |
/ | 21 | 9.5% |
. | 6 | 2.7% |
% | 2 | 0.9% |
: | 1 | 0.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 855 | |
[ | 15 | 1.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 855 | |
] | 15 | 1.7% |
Math Symbol
Value | Count | Frequency (%) |
= | 9 | |
+ | 2 | 18.2% |
Space Separator
Value | Count | Frequency (%) |
6535 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 21 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 26008 | |
Common | 8694 | 24.5% |
Latin | 804 | 2.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 790 | 3.0% |
자 | 686 | 2.6% |
원 | 598 | 2.3% |
계 | 561 | 2.2% |
국 | 535 | 2.1% |
전 | 491 | 1.9% |
사 | 458 | 1.8% |
로 | 433 | 1.7% |
력 | 403 | 1.5% |
수 | 403 | 1.5% |
Other values (533) | 20650 |
Latin
Value | Count | Frequency (%) |
A | 59 | 7.3% |
E | 56 | 7.0% |
N | 47 | 5.8% |
I | 44 | 5.5% |
C | 42 | 5.2% |
e | 33 | 4.1% |
L | 33 | 4.1% |
S | 33 | 4.1% |
a | 33 | 4.1% |
R | 32 | 4.0% |
Other values (41) | 392 |
Common
Value | Count | Frequency (%) |
6535 | ||
( | 855 | 9.8% |
) | 855 | 9.8% |
, | 151 | 1.7% |
1 | 65 | 0.7% |
· | 40 | 0.5% |
0 | 31 | 0.4% |
2 | 25 | 0.3% |
/ | 21 | 0.2% |
- | 21 | 0.2% |
Other values (14) | 95 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 26008 | |
ASCII | 9458 | 26.6% |
None | 40 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6535 | ||
( | 855 | 9.0% |
) | 855 | 9.0% |
, | 151 | 1.6% |
1 | 65 | 0.7% |
A | 59 | 0.6% |
E | 56 | 0.6% |
N | 47 | 0.5% |
I | 44 | 0.5% |
C | 42 | 0.4% |
Other values (64) | 749 | 7.9% |
Hangul
Value | Count | Frequency (%) |
기 | 790 | 3.0% |
자 | 686 | 2.6% |
원 | 598 | 2.3% |
계 | 561 | 2.2% |
국 | 535 | 2.1% |
전 | 491 | 1.9% |
사 | 458 | 1.8% |
로 | 433 | 1.7% |
력 | 403 | 1.5% |
수 | 403 | 1.5% |
Other values (533) | 20650 |
None
Value | Count | Frequency (%) |
· | 40 |
약어
Text
Distinct | 2778 |
---|---|
Distinct (%) | 83.9% |
Missing | 3 |
Missing (%) | 0.1% |
Memory size | 26.0 KiB |
Value | Count | Frequency (%) |
24 | 0.7% | |
d | 14 | 0.4% |
t | 13 | 0.4% |
k | 12 | 0.4% |
m | 12 | 0.4% |
f | 11 | 0.3% |
c | 11 | 0.3% |
n | 10 | 0.3% |
l | 10 | 0.3% |
r | 10 | 0.3% |
Other values (2700) | 3260 |
Most occurring characters
Value | Count | Frequency (%) |
S | 1060 | 9.4% |
C | 1058 | 9.4% |
R | 865 | 7.7% |
A | 797 | 7.1% |
P | 772 | 6.9% |
E | 694 | 6.2% |
T | 584 | 5.2% |
I | 542 | 4.8% |
F | 489 | 4.4% |
D | 480 | 4.3% |
Other values (66) | 3891 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 10513 | |
Lowercase Letter | 408 | 3.6% |
Other Punctuation | 133 | 1.2% |
Space Separator | 74 | 0.7% |
Dash Punctuation | 32 | 0.3% |
Decimal Number | 32 | 0.3% |
Open Punctuation | 18 | 0.2% |
Close Punctuation | 17 | 0.2% |
Other Symbol | 2 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 1060 | 10.1% |
C | 1058 | 10.1% |
R | 865 | 8.2% |
A | 797 | 7.6% |
P | 772 | 7.3% |
E | 694 | 6.6% |
T | 584 | 5.6% |
I | 542 | 5.2% |
F | 489 | 4.7% |
D | 480 | 4.6% |
Other values (18) | 3172 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 40 | 9.8% |
m | 33 | 8.1% |
a | 27 | 6.6% |
r | 25 | 6.1% |
t | 25 | 6.1% |
d | 23 | 5.6% |
k | 22 | 5.4% |
g | 20 | 4.9% |
l | 20 | 4.9% |
p | 20 | 4.9% |
Other values (17) | 153 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 62 | |
& | 32 | |
, | 20 | 15.0% |
. | 14 | 10.5% |
· | 3 | 2.3% |
# | 2 | 1.5% |
Decimal Number
Value | Count | Frequency (%) |
2 | 9 | |
5 | 7 | |
1 | 6 | |
0 | 5 | |
3 | 4 | |
8 | 1 | 3.1% |
Other Symbol
Value | Count | Frequency (%) |
℉ | 1 | |
℃ | 1 |
Math Symbol
Value | Count | Frequency (%) |
∞ | 1 | |
+ | 1 |
Space Separator
Value | Count | Frequency (%) |
74 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 32 |
Open Punctuation
Value | Count | Frequency (%) |
( | 18 |
Close Punctuation
Value | Count | Frequency (%) |
) | 17 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 10916 | |
Common | 311 | 2.8% |
Greek | 5 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 1060 | 9.7% |
C | 1058 | 9.7% |
R | 865 | 7.9% |
A | 797 | 7.3% |
P | 772 | 7.1% |
E | 694 | 6.4% |
T | 584 | 5.3% |
I | 542 | 5.0% |
F | 489 | 4.5% |
D | 480 | 4.4% |
Other values (41) | 3575 |
Common
Value | Count | Frequency (%) |
74 | ||
/ | 62 | |
& | 32 | |
- | 32 | |
, | 20 | 6.4% |
( | 18 | 5.8% |
) | 17 | 5.5% |
. | 14 | 4.5% |
2 | 9 | 2.9% |
5 | 7 | 2.3% |
Other values (11) | 26 | 8.4% |
Greek
Value | Count | Frequency (%) |
σ | 2 | |
Δ | 1 | |
ρ | 1 | |
Ω | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 11221 | |
None | 8 | 0.1% |
Letterlike Symbols | 2 | < 0.1% |
Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 1060 | 9.4% |
C | 1058 | 9.4% |
R | 865 | 7.7% |
A | 797 | 7.1% |
P | 772 | 6.9% |
E | 694 | 6.2% |
T | 584 | 5.2% |
I | 542 | 4.8% |
F | 489 | 4.4% |
D | 480 | 4.3% |
Other values (58) | 3880 |
None
Value | Count | Frequency (%) |
· | 3 | |
σ | 2 | |
Δ | 1 | 12.5% |
ρ | 1 | 12.5% |
Ω | 1 | 12.5% |
Letterlike Symbols
Value | Count | Frequency (%) |
℉ | 1 | |
℃ | 1 |
Math Operators
Value | Count | Frequency (%) |
∞ | 1 |
문서번호 | 영문용어 | 한글용어 | 약어 | |
---|---|---|---|---|
0 | 1 | Atomics International | 회사명(미국) | AI |
1 | 2 | Analog Input | 아날로그 입력신호 | AI |
2 | 3 | Authorized Inspection Agency | 공인 검사 기관 | AIA |
3 | 4 | American Institute of Architects | 미국 건축 기구 | AIA |
4 | 5 | Accelerator Information Center | 가속기 정보 센터(미국) | AIC |
5 | 6 | American Institute of Chemical Engineers | 미국 화학 공학 협회 | AICE, AICHE |
6 | 7 | Atomic Industrial Forum, Inc, | 미국 원자력 산업 회의 | AIF |
7 | 8 | Auxiliary Inerting Gas Subsystem | 보조 불활성 가스부계통 | AIGS |
8 | 9 | Auxiliary Intermediate Heat Exchanger | 보조 중간열 교환기 | AIHX |
9 | 10 | American Institute of Physics | 미국 물리 학회 | AIP |
문서번호 | 영문용어 | 한글용어 | 약어 | |
---|---|---|---|---|
3306 | 3307 | Union Carbide Corporation | 회사명(미국) | UCC |
3307 | 3308 | University of California, Lawrence Radiation Laboratory | 캘리포니아 공학 로렌스 방사선 연구소(미국) | UCLR |
3308 | 3309 | University of California Radiation Laboratory | 캘리포니아 대학 방사선 연구소(미국) | UCRL |
3309 | 3310 | Union of Concerned Scientists | 반핵 과학자 단체명 | UCS |
3310 | 3311 | Uranium Enrichment Associates | 우라늄 농축 협회(미국) | UEA |
3311 | 3312 | University of Florida Teaching React | or 플로리다 대학 교육용 원자로 | UFTR |
3312 | 3313 | Ultra High Frequency | 초 고주파 | UHF |
3313 | 3314 | Upper High Injection | 상부덮개 주입 | UHI |
3314 | 3315 | Earth Leakage Circuit Breaker | 누전차단기 | ELB |
3315 | 3316 | No Fuse Breaker | 배선용 차단기, MCCB | NFB |