Overview

Dataset statistics

Number of variables3
Number of observations99
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory25.3 B

Variable types

Categorical1
Text2

Dataset

Description영문 기술용어를 한글 기술용어로 번역하여 축적한 사전인 영한 특허기술용어 번역사전 정보를 제공합니다. (KIPRISPlus 서비스)
Author특허청
URLhttps://www.data.go.kr/data/15053823/fileData.do

Alerts

EK has constant value ""Constant

Reproduction

Analysis started2023-12-12 09:00:26.102814
Analysis finished2023-12-12 09:00:27.351476
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

EK
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
EK
99 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEK
2nd rowEK
3rd rowEK
4th rowEK
5th rowEK

Common Values

ValueCountFrequency (%)
EK 99
100.0%

Length

2023-12-12T18:00:27.427990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:00:27.534333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ek 99
100.0%
Distinct98
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-12T18:00:27.801801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length43
Mean length33.212121
Min length12

Characters and Unicode

Total characters3288
Distinct characters51
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)98.0%

Sample

1st rowAID,sub-intrusion_detection_system
2nd rowAMC,adaptive_mobile_computing
3rd rowANC,active_noise_control
4th rowAPII,asia-pacific_information_infrastructure_base
5th rowAPT,asia-pacific_telecommunity
ValueCountFrequency (%)
apt,asia-pacific_telecommunity 2
 
2.0%
insc,international_nuclear_societies_council 1
 
1.0%
mc,multi-code_signal 1
 
1.0%
mbo,minimum_bounding_octangle 1
 
1.0%
mbe,molecular_beam_epitaxial 1
 
1.0%
mac,modified_a_conduction 1
 
1.0%
lpc,lupinseed_protein_concentrate 1
 
1.0%
lp,length_of_patch 1
 
1.0%
lfn,local_fixation_node 1
 
1.0%
ldm,lagrangian_diffusion_model 1
 
1.0%
Other values (88) 88
88.9%
2023-12-12T18:00:28.336401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 287
 
8.7%
i 268
 
8.2%
_ 228
 
6.9%
a 223
 
6.8%
n 222
 
6.8%
t 218
 
6.6%
o 196
 
6.0%
r 161
 
4.9%
c 148
 
4.5%
s 139
 
4.2%
Other values (41) 1198
36.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2605
79.2%
Uppercase Letter 332
 
10.1%
Connector Punctuation 228
 
6.9%
Other Punctuation 101
 
3.1%
Dash Punctuation 22
 
0.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 287
11.0%
i 268
10.3%
a 223
 
8.6%
n 222
 
8.5%
t 218
 
8.4%
o 196
 
7.5%
r 161
 
6.2%
c 148
 
5.7%
s 139
 
5.3%
l 112
 
4.3%
Other values (15) 631
24.2%
Uppercase Letter
ValueCountFrequency (%)
S 51
15.4%
C 37
11.1%
I 35
10.5%
M 29
 
8.7%
A 24
 
7.2%
D 24
 
7.2%
P 19
 
5.7%
F 13
 
3.9%
R 13
 
3.9%
L 12
 
3.6%
Other values (12) 75
22.6%
Other Punctuation
ValueCountFrequency (%)
, 99
98.0%
. 2
 
2.0%
Connector Punctuation
ValueCountFrequency (%)
_ 228
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2937
89.3%
Common 351
 
10.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 287
 
9.8%
i 268
 
9.1%
a 223
 
7.6%
n 222
 
7.6%
t 218
 
7.4%
o 196
 
6.7%
r 161
 
5.5%
c 148
 
5.0%
s 139
 
4.7%
l 112
 
3.8%
Other values (37) 963
32.8%
Common
ValueCountFrequency (%)
_ 228
65.0%
, 99
28.2%
- 22
 
6.3%
. 2
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3288
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 287
 
8.7%
i 268
 
8.2%
_ 228
 
6.9%
a 223
 
6.8%
n 222
 
6.8%
t 218
 
6.6%
o 196
 
6.0%
r 161
 
4.9%
c 148
 
4.5%
s 139
 
4.2%
Other values (41) 1198
36.4%
Distinct98
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-12T18:00:28.632318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length9.7979798
Min length3

Characters and Unicode

Total characters970
Distinct characters270
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)98.0%

Sample

1st row하부_침입_탐지_시스템
2nd row적응형_이동_컴퓨팅
3rd row활동_소음_제어
4th row태평양_초고속_정보_통신_기반
5th row아태_전기통신협의체
ValueCountFrequency (%)
분무_연소_합성 2
 
2.0%
국제_지리학_연맹_총회 1
 
1.0%
수정_a_도통 1
 
1.0%
루우핀콩_단백질_농축물 1
 
1.0%
패치의_세로폭 1
 
1.0%
지역_고정_노드 1
 
1.0%
라그랑지안_확산_모델 1
 
1.0%
린_건설_학회 1
 
1.0%
지역_근거리망 1
 
1.0%
한글_영상 1
 
1.0%
Other values (88) 88
88.9%
2023-12-12T18:00:29.139352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 200
 
20.6%
20
 
2.1%
19
 
2.0%
14
 
1.4%
13
 
1.3%
13
 
1.3%
12
 
1.2%
12
 
1.2%
12
 
1.2%
11
 
1.1%
Other values (260) 644
66.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 762
78.6%
Connector Punctuation 200
 
20.6%
Uppercase Letter 6
 
0.6%
Other Punctuation 1
 
0.1%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
2.6%
19
 
2.5%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.6%
12
 
1.6%
12
 
1.6%
11
 
1.4%
10
 
1.3%
Other values (252) 626
82.2%
Uppercase Letter
ValueCountFrequency (%)
D 2
33.3%
E 1
16.7%
I 1
16.7%
S 1
16.7%
M 1
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 200
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 762
78.6%
Common 201
 
20.7%
Latin 7
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
2.6%
19
 
2.5%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.6%
12
 
1.6%
12
 
1.6%
11
 
1.4%
10
 
1.3%
Other values (252) 626
82.2%
Latin
ValueCountFrequency (%)
D 2
28.6%
E 1
14.3%
a 1
14.3%
I 1
14.3%
S 1
14.3%
M 1
14.3%
Common
ValueCountFrequency (%)
_ 200
99.5%
/ 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 761
78.5%
ASCII 208
 
21.4%
Compat Jamo 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 200
96.2%
D 2
 
1.0%
/ 1
 
0.5%
E 1
 
0.5%
a 1
 
0.5%
I 1
 
0.5%
S 1
 
0.5%
M 1
 
0.5%
Hangul
ValueCountFrequency (%)
20
 
2.6%
19
 
2.5%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.6%
12
 
1.6%
12
 
1.6%
11
 
1.4%
10
 
1.3%
Other values (251) 625
82.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-12T18:00:29.252832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
AFAQ,association_of_france_assurance_quality프랑스_품질_보증_협회
AFAQ,association_of_france_assurance_quality1.0000.999
프랑스_품질_보증_협회0.9991.000

Missing values

2023-12-12T18:00:27.210657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:00:27.310088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

EKAFAQ,association_of_france_assurance_quality프랑스_품질_보증_협회
0EKAID,sub-intrusion_detection_system하부_침입_탐지_시스템
1EKAMC,adaptive_mobile_computing적응형_이동_컴퓨팅
2EKANC,active_noise_control활동_소음_제어
3EKAPII,asia-pacific_information_infrastructure_base태평양_초고속_정보_통신_기반
4EKAPT,asia-pacific_telecommunity아태_전기통신협의체
5EKAPT,asia-pacific_telecommunity아/태지역_전기통신연합체
6EKASDU,application_service_data_unit응용_서비스_데이터_단위
7EKASS,_adaptive_streaming_service적응형_스트리밍_서비스
8EKASS,average_signal_strength평균_신호_세기
9EKASS,average_soft-output_syndrome평균_연출력_신드롬
EKAFAQ,association_of_france_assurance_quality프랑스_품질_보증_협회
89EKSGDFB,sampled_grating_distributed_feedback추출_격자_분포_궤환
90EKSIMS,school_information_management_system종합_정보_관리_시스템
91EKSMW,soil_movement_wall토유벽
92EKSN,sign_number서명_번호
93EKSRL,search_range_limit탐색_영역_제한
94EKSSC,sequence-switch_coding순서_변환_코딩
95EKSVM,support_vector_machine지지_벡터_기계
96EKSaccharmoyces_cerevisiae,건조효모
97EKSamsung_electronics_co.,ltd._automation_research_center삼성전자_자동화연구소_따
98EKSeoul_city_digital_elevation_model,Seoul_city_DEM서울시_DEM