Overview

Dataset statistics

Number of variables3
Number of observations199
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.8 KiB
Average record size in memory24.7 B

Variable types

Text2
DateTime1

Dataset

Description한국전력기술(주)에서 보유한 논문발표현황에 대한 정보로 논문 제목, 등록일자, 발표기관에 대한 데이터입니다.
URLhttps://www.data.go.kr/data/15074293/fileData.do

Alerts

논문제목 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:26:51.490275
Analysis finished2023-12-12 18:26:52.196480
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

논문제목
Text

UNIQUE 

Distinct199
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T03:26:52.452228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length45
Mean length32.246231
Min length13

Characters and Unicode

Total characters6417
Distinct characters409
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique199 ?
Unique (%)100.0%

Sample

1st row자가용 전기설비의 변압기 검사 통계 기반 교체 주기 분석
2nd row이동편의성을 고려한 건축시설의 물리적방호 체계 강화방법 제안
3rd rowFAVOR 전산코드를 이용한 원자로용기의 확률론적 파괴역학 평가
4th row국내 복합화력발전의 대기오염방지시설
5th row대형 진동대실험을 이용한 다자유도구조물의 관성 상호작용 효과 평가
ValueCountFrequency (%)
39
 
2.6%
위한 31
 
2.1%
원전 28
 
1.9%
고찰 26
 
1.8%
대한 20
 
1.3%
분석 19
 
1.3%
평가 17
 
1.1%
설계 15
 
1.0%
이용한 15
 
1.0%
연구 14
 
0.9%
Other values (909) 1258
84.9%
2023-12-13T03:26:52.935191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1284
 
20.0%
120
 
1.9%
116
 
1.8%
111
 
1.7%
92
 
1.4%
90
 
1.4%
78
 
1.2%
73
 
1.1%
69
 
1.1%
62
 
1.0%
Other values (399) 4322
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4314
67.2%
Space Separator 1284
 
20.0%
Uppercase Letter 403
 
6.3%
Lowercase Letter 197
 
3.1%
Decimal Number 144
 
2.2%
Other Punctuation 33
 
0.5%
Dash Punctuation 22
 
0.3%
Open Punctuation 9
 
0.1%
Close Punctuation 9
 
0.1%
Letter Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
2.8%
116
 
2.7%
111
 
2.6%
92
 
2.1%
90
 
2.1%
78
 
1.8%
73
 
1.7%
69
 
1.6%
62
 
1.4%
62
 
1.4%
Other values (334) 3441
79.8%
Lowercase Letter
ValueCountFrequency (%)
e 31
15.7%
a 22
11.2%
o 18
9.1%
r 16
 
8.1%
t 15
 
7.6%
n 12
 
6.1%
l 12
 
6.1%
c 11
 
5.6%
i 9
 
4.6%
u 9
 
4.6%
Other values (12) 42
21.3%
Uppercase Letter
ValueCountFrequency (%)
P 56
13.9%
A 46
11.4%
C 41
10.2%
S 38
9.4%
R 35
8.7%
E 32
 
7.9%
M 24
 
6.0%
T 19
 
4.7%
N 17
 
4.2%
I 15
 
3.7%
Other values (11) 80
19.9%
Decimal Number
ValueCountFrequency (%)
0 60
41.7%
1 29
20.1%
4 14
 
9.7%
3 11
 
7.6%
2 10
 
6.9%
6 8
 
5.6%
9 6
 
4.2%
5 3
 
2.1%
8 2
 
1.4%
7 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
, 10
30.3%
/ 9
27.3%
. 7
21.2%
& 3
 
9.1%
· 2
 
6.1%
: 1
 
3.0%
% 1
 
3.0%
Space Separator
ValueCountFrequency (%)
1284
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4314
67.2%
Common 1501
 
23.4%
Latin 602
 
9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
2.8%
116
 
2.7%
111
 
2.6%
92
 
2.1%
90
 
2.1%
78
 
1.8%
73
 
1.7%
69
 
1.6%
62
 
1.4%
62
 
1.4%
Other values (334) 3441
79.8%
Latin
ValueCountFrequency (%)
P 56
 
9.3%
A 46
 
7.6%
C 41
 
6.8%
S 38
 
6.3%
R 35
 
5.8%
E 32
 
5.3%
e 31
 
5.1%
M 24
 
4.0%
a 22
 
3.7%
T 19
 
3.2%
Other values (34) 258
42.9%
Common
ValueCountFrequency (%)
1284
85.5%
0 60
 
4.0%
1 29
 
1.9%
- 22
 
1.5%
4 14
 
0.9%
3 11
 
0.7%
, 10
 
0.7%
2 10
 
0.7%
( 9
 
0.6%
) 9
 
0.6%
Other values (11) 43
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4314
67.2%
ASCII 2099
32.7%
None 2
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1284
61.2%
0 60
 
2.9%
P 56
 
2.7%
A 46
 
2.2%
C 41
 
2.0%
S 38
 
1.8%
R 35
 
1.7%
E 32
 
1.5%
e 31
 
1.5%
1 29
 
1.4%
Other values (53) 447
 
21.3%
Hangul
ValueCountFrequency (%)
120
 
2.8%
116
 
2.7%
111
 
2.6%
92
 
2.1%
90
 
2.1%
78
 
1.8%
73
 
1.7%
69
 
1.6%
62
 
1.4%
62
 
1.4%
Other values (334) 3441
79.8%
None
ValueCountFrequency (%)
· 2
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Distinct37
Distinct (%)18.6%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2022-01-10 00:00:00
Maximum2022-12-31 00:00:00
2023-12-13T03:26:53.059261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:26:53.187871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
Distinct81
Distinct (%)40.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T03:26:53.468624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length5
Mean length11.417085
Min length4

Characters and Unicode

Total characters2272
Distinct characters150
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)35.7%

Sample

1st row전기학회논문지
2nd row대한건축학회논문집
3rd row대한기계학회 논문집 A
4th row공업화학전망
5th row한국지반공학회 논문집
ValueCountFrequency (%)
전력기술지 105
27.9%
2022 22
 
5.8%
2022년도 15
 
4.0%
한국원자력학회 14
 
3.7%
논문집 10
 
2.7%
연차학술대회 9
 
2.4%
춘계학술대회 9
 
2.4%
2022년 8
 
2.1%
kpvp 7
 
1.9%
한국부식방식학회 7
 
1.9%
Other values (107) 171
45.4%
2023-12-13T03:26:53.882834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
178
 
7.8%
152
 
6.7%
141
 
6.2%
2 141
 
6.2%
134
 
5.9%
126
 
5.5%
120
 
5.3%
119
 
5.2%
116
 
5.1%
55
 
2.4%
Other values (140) 990
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1552
68.3%
Lowercase Letter 199
 
8.8%
Decimal Number 198
 
8.7%
Space Separator 178
 
7.8%
Uppercase Letter 115
 
5.1%
Close Punctuation 12
 
0.5%
Open Punctuation 12
 
0.5%
Other Punctuation 5
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
152
 
9.8%
141
 
9.1%
134
 
8.6%
126
 
8.1%
120
 
7.7%
119
 
7.7%
116
 
7.5%
55
 
3.5%
46
 
3.0%
46
 
3.0%
Other values (91) 497
32.0%
Lowercase Letter
ValueCountFrequency (%)
e 27
13.6%
n 24
12.1%
o 21
10.6%
r 17
8.5%
c 15
7.5%
u 14
 
7.0%
i 13
 
6.5%
a 12
 
6.0%
l 12
 
6.0%
s 9
 
4.5%
Other values (10) 35
17.6%
Uppercase Letter
ValueCountFrequency (%)
P 31
27.0%
K 15
13.0%
V 14
12.2%
N 12
 
10.4%
C 11
 
9.6%
S 9
 
7.8%
I 6
 
5.2%
M 4
 
3.5%
A 4
 
3.5%
T 3
 
2.6%
Other values (4) 6
 
5.2%
Decimal Number
ValueCountFrequency (%)
2 141
71.2%
0 46
 
23.2%
5 3
 
1.5%
1 3
 
1.5%
4 3
 
1.5%
3 1
 
0.5%
6 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
& 2
40.0%
1
20.0%
· 1
20.0%
, 1
20.0%
Space Separator
ValueCountFrequency (%)
178
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1552
68.3%
Common 406
 
17.9%
Latin 314
 
13.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
152
 
9.8%
141
 
9.1%
134
 
8.6%
126
 
8.1%
120
 
7.7%
119
 
7.7%
116
 
7.5%
55
 
3.5%
46
 
3.0%
46
 
3.0%
Other values (91) 497
32.0%
Latin
ValueCountFrequency (%)
P 31
 
9.9%
e 27
 
8.6%
n 24
 
7.6%
o 21
 
6.7%
r 17
 
5.4%
K 15
 
4.8%
c 15
 
4.8%
V 14
 
4.5%
u 14
 
4.5%
i 13
 
4.1%
Other values (24) 123
39.2%
Common
ValueCountFrequency (%)
178
43.8%
2 141
34.7%
0 46
 
11.3%
) 12
 
3.0%
( 12
 
3.0%
5 3
 
0.7%
1 3
 
0.7%
4 3
 
0.7%
& 2
 
0.5%
1
 
0.2%
Other values (5) 5
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1552
68.3%
ASCII 718
31.6%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
178
24.8%
2 141
19.6%
0 46
 
6.4%
P 31
 
4.3%
e 27
 
3.8%
n 24
 
3.3%
o 21
 
2.9%
r 17
 
2.4%
K 15
 
2.1%
c 15
 
2.1%
Other values (37) 203
28.3%
Hangul
ValueCountFrequency (%)
152
 
9.8%
141
 
9.1%
134
 
8.6%
126
 
8.1%
120
 
7.7%
119
 
7.7%
116
 
7.5%
55
 
3.5%
46
 
3.0%
46
 
3.0%
Other values (91) 497
32.0%
None
ValueCountFrequency (%)
1
50.0%
· 1
50.0%

Correlations

2023-12-13T03:26:53.996101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록일자발표기관
등록일자1.0000.996
발표기관0.9961.000

Missing values

2023-12-13T03:26:52.106565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:26:52.172455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

논문제목등록일자발표기관
0자가용 전기설비의 변압기 검사 통계 기반 교체 주기 분석2022-01-10전기학회논문지
1이동편의성을 고려한 건축시설의 물리적방호 체계 강화방법 제안2022-02-01대한건축학회논문집
2FAVOR 전산코드를 이용한 원자로용기의 확률론적 파괴역학 평가2022-02-01대한기계학회 논문집 A
3국내 복합화력발전의 대기오염방지시설2022-02-28공업화학전망
4대형 진동대실험을 이용한 다자유도구조물의 관성 상호작용 효과 평가2022-02-28한국지반공학회 논문집
5대구경 배관의 운전중 보수기술 적용부에 대한 부식 모델링2022-03-31전력기술지
6Barakah 원전 24개월 장주기 운전 타당성 평가2022-03-31전력기술지
7딥러닝 기반 스마트 공정계획 수립 기술의 활용성 연구2022-03-31전력기술지
8신형경수로1400 원전 장기교류전원 완전상실 사고시 원자로냉각재계통 역류냉각 분석2022-03-31전력기술지
9구매상용 소프트웨어의 대체사용과 수락시험2022-03-31전력기술지
논문제목등록일자발표기관
189BNPP 피로감시시스템(NuFMS) 과도상태 자동계수 모듈 검증2022-12-31전력기술지
190Temperature Decay 10D Rule 유효성 확인2022-12-31전력기술지
191AC/DC 통합 전력조류해석 및 전산프로그램간 결과 비교2022-12-31전력기술지
192ALARA 분석·평가 프로그램 및 3D-BIM 기반 실감·몰입형 피폭선량 예측진단 통합시스템 기술개발 소개2022-12-31전력기술지
193원자력발전소 대형기기(증기발생기) 교체2022-12-31전력기술지
194APR1400노형의 노외 노심용융물 퍼짐현상 계산2022-12-31전력기술지
195사용후핵연료 관리시나리오별 중간저장시설 설계 방안 개발2022-12-31전력기술지
196관통부 내 편심 배관의 열전달 해석2022-12-31전력기술지
197ASCE 7-10과 EN1991-1-4의 풍속압 비교 분석2022-12-31전력기술지
198복합화력 설계강우량 고찰2022-12-31전력기술지