Overview

Dataset statistics

Number of variables11
Number of observations3214
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory282.6 KiB
Average record size in memory90.0 B

Variable types

Numeric1
Text5
Categorical3
DateTime2

Dataset

Description한국과학기술원 사료 정보(초록,저자,서지정보,자료유형,발행일, 등)에 대한 정보 입니다. 해당 데이터가 보유한 컬럼은 다음과 같습니다. 컬럼명: 저자,서지정보,자료유형,발행일,아이템 ID,키워드,출판사,입력일,자료명,갱신일,통합자원식별자
URLhttps://www.data.go.kr/data/3051377/fileData.do

Alerts

자료유형 has constant value ""Constant
발행일 has constant value ""Constant
데이터기준일 has constant value ""Constant
아이템(ID) has unique valuesUnique
통합지원식별자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:39:15.885331
Analysis finished2023-12-12 19:39:17.235795
Duration1.35 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이템(ID)
Real number (ℝ)

UNIQUE 

Distinct3214
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean302980.01
Minimum290777
Maximum308806
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size28.4 KiB
2023-12-13T04:39:17.302570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum290777
5-th percentile299020.65
Q1301025.25
median302946.5
Q3305390.75
95-th percentile306457.35
Maximum308806
Range18029
Interquartile range (IQR)4365.5

Descriptive statistics

Standard deviation2568.6777
Coefficient of variation (CV)0.0084780434
Kurtosis-0.086929552
Mean302980.01
Median Absolute Deviation (MAD)2163
Skewness-0.24701044
Sum9.7377775 × 108
Variance6598104.9
MonotonicityNot monotonic
2023-12-13T04:39:17.749100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
301262 1
 
< 0.1%
300377 1
 
< 0.1%
306425 1
 
< 0.1%
300379 1
 
< 0.1%
305814 1
 
< 0.1%
300380 1
 
< 0.1%
300381 1
 
< 0.1%
300383 1
 
< 0.1%
300384 1
 
< 0.1%
300391 1
 
< 0.1%
Other values (3204) 3204
99.7%
ValueCountFrequency (%)
290777 1
< 0.1%
293237 1
< 0.1%
293497 1
< 0.1%
293498 1
< 0.1%
293499 1
< 0.1%
293920 1
< 0.1%
293921 1
< 0.1%
293922 1
< 0.1%
293923 1
< 0.1%
293924 1
< 0.1%
ValueCountFrequency (%)
308806 1
< 0.1%
308805 1
< 0.1%
308804 1
< 0.1%
308803 1
< 0.1%
308795 1
< 0.1%
308789 1
< 0.1%
308788 1
< 0.1%
308787 1
< 0.1%
308786 1
< 0.1%
308785 1
< 0.1%

저자
Text

Distinct2791
Distinct (%)86.8%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2023-12-13T04:39:18.064099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length342
Median length192
Mean length40.75949
Min length3

Characters and Unicode

Total characters131001
Distinct characters299
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2507 ?
Unique (%)78.0%

Sample

1st rowLee, Donggun; Kim, Taesu; Suk, Hyeon-Jeong
2nd rowKim, Sungjoong; Hong, Kyungwoo; Bang, Hyochoong
3rd rowHan, Seung-Ho; Choi, Ho-Jin
4th rowKim, Beomyoung; Lee, Janghyeon; Lee, Sihaeng; Kim, Doyeon; Kim, Junmo
5th rowJoo, Hyo-Jun; Kim, Youngmin; Burt, Daniel; Jung, Yongduck; Zhang, Lin; Chen, Melvina; Parluhutan, Samuel Jior; Kang, Dong-Ho; Lee, Chulwon; Assali, Simone; Ikonic, Zoran; Moutanabbir, Oussama; Cho, Yong-Hoon; Tan, Chuan Seng; Nam, Donguk
ValueCountFrequency (%)
kim 1358
 
6.4%
lee 908
 
4.2%
park 449
 
2.1%
choi 245
 
1.1%
cho 206
 
1.0%
shin 175
 
0.8%
yoon 166
 
0.8%
kang 157
 
0.7%
oh 135
 
0.6%
jeong 132
 
0.6%
Other values (5039) 17441
81.6%
2023-12-13T04:39:18.662965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18167
 
13.9%
n 10208
 
7.8%
; 9206
 
7.0%
o 8749
 
6.7%
, 7503
 
5.7%
e 6383
 
4.9%
a 5172
 
3.9%
u 4879
 
3.7%
i 4766
 
3.6%
g 4764
 
3.6%
Other values (289) 51204
39.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 61059
46.6%
Uppercase Letter 19078
 
14.6%
Space Separator 18167
 
13.9%
Other Punctuation 16827
 
12.8%
Other Letter 14595
 
11.1%
Dash Punctuation 1273
 
1.0%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1000
 
6.9%
964
 
6.6%
456
 
3.1%
456
 
3.1%
422
 
2.9%
386
 
2.6%
358
 
2.5%
332
 
2.3%
316
 
2.2%
312
 
2.1%
Other values (227) 9593
65.7%
Lowercase Letter
ValueCountFrequency (%)
n 10208
16.7%
o 8749
14.3%
e 6383
10.5%
a 5172
8.5%
u 4879
8.0%
i 4766
7.8%
g 4764
7.8%
h 3216
 
5.3%
y 2401
 
3.9%
m 2383
 
3.9%
Other values (17) 8138
13.3%
Uppercase Letter
ValueCountFrequency (%)
K 2435
12.8%
J 2266
11.9%
S 2191
11.5%
H 1924
10.1%
Y 1389
 
7.3%
L 1200
 
6.3%
C 1010
 
5.3%
M 842
 
4.4%
P 645
 
3.4%
D 605
 
3.2%
Other values (17) 4571
24.0%
Other Punctuation
ValueCountFrequency (%)
; 9206
54.7%
, 7503
44.6%
. 117
 
0.7%
& 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
18167
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1273
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 80137
61.2%
Common 36269
27.7%
Hangul 14595
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1000
 
6.9%
964
 
6.6%
456
 
3.1%
456
 
3.1%
422
 
2.9%
386
 
2.6%
358
 
2.5%
332
 
2.3%
316
 
2.2%
312
 
2.1%
Other values (227) 9593
65.7%
Latin
ValueCountFrequency (%)
n 10208
 
12.7%
o 8749
 
10.9%
e 6383
 
8.0%
a 5172
 
6.5%
u 4879
 
6.1%
i 4766
 
5.9%
g 4764
 
5.9%
h 3216
 
4.0%
K 2435
 
3.0%
y 2401
 
3.0%
Other values (44) 27164
33.9%
Common
ValueCountFrequency (%)
18167
50.1%
; 9206
25.4%
, 7503
20.7%
- 1273
 
3.5%
. 117
 
0.3%
( 1
 
< 0.1%
) 1
 
< 0.1%
& 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 116404
88.9%
Hangul 14595
 
11.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18167
15.6%
n 10208
 
8.8%
; 9206
 
7.9%
o 8749
 
7.5%
, 7503
 
6.4%
e 6383
 
5.5%
a 5172
 
4.4%
u 4879
 
4.2%
i 4766
 
4.1%
g 4764
 
4.1%
Other values (50) 36607
31.4%
Hangul
ValueCountFrequency (%)
1000
 
6.9%
964
 
6.6%
456
 
3.1%
456
 
3.1%
422
 
2.9%
386
 
2.6%
358
 
2.5%
332
 
2.3%
316
 
2.2%
312
 
2.1%
Other values (227) 9593
65.7%
None
ValueCountFrequency (%)
Ø 1
50.0%
ß 1
50.0%
Distinct1470
Distinct (%)45.7%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2023-12-13T04:39:19.083040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length165
Median length120
Mean length46.884879
Min length6

Characters and Unicode

Total characters150688
Distinct characters303
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1023 ?
Unique (%)31.8%

Sample

1st rowIS and T International Symposium on Electronic Imaging: 27th Color Imaging: Displaying, Processing, Hardcopy, and Applications, COLOR 2022
2nd rowAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022
3rd rowIEEE International Conference on Big Data and Smart Computing (BigComp), pp.391 - 394
4th row22nd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp.3421 - 3430
5th rowConference on Silicon Photonics XVII Part of SPIE Photonics West OPTO Conference
ValueCountFrequency (%)
2022 2154
 
9.8%
on 1019
 
4.6%
conference 978
 
4.4%
international 716
 
3.2%
and 642
 
2.9%
563
 
2.6%
the 471
 
2.1%
427
 
1.9%
학술대회 400
 
1.8%
2022년도 281
 
1.3%
Other values (2628) 14406
65.3%
2023-12-13T04:39:19.721913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18850
 
12.5%
n 10378
 
6.9%
2 9178
 
6.1%
e 8701
 
5.8%
o 7032
 
4.7%
t 5862
 
3.9%
i 5739
 
3.8%
a 5224
 
3.5%
r 4579
 
3.0%
c 3375
 
2.2%
Other values (293) 71770
47.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 71102
47.2%
Other Letter 21333
 
14.2%
Space Separator 18850
 
12.5%
Uppercase Letter 18456
 
12.2%
Decimal Number 16710
 
11.1%
Other Punctuation 2435
 
1.6%
Dash Punctuation 844
 
0.6%
Close Punctuation 473
 
0.3%
Open Punctuation 473
 
0.3%
Math Symbol 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2931
 
13.7%
2717
 
12.7%
1387
 
6.5%
1324
 
6.2%
1240
 
5.8%
1167
 
5.5%
946
 
4.4%
508
 
2.4%
505
 
2.4%
435
 
2.0%
Other values (218) 8173
38.3%
Lowercase Letter
ValueCountFrequency (%)
n 10378
14.6%
e 8701
12.2%
o 7032
9.9%
t 5862
8.2%
i 5739
 
8.1%
a 5224
 
7.3%
r 4579
 
6.4%
c 3375
 
4.7%
s 3100
 
4.4%
l 2548
 
3.6%
Other values (16) 14564
20.5%
Uppercase Letter
ValueCountFrequency (%)
C 2967
16.1%
I 2375
12.9%
E 2079
11.3%
S 1878
10.2%
A 1623
8.8%
M 1108
 
6.0%
T 893
 
4.8%
P 792
 
4.3%
R 743
 
4.0%
N 728
 
3.9%
Other values (16) 3270
17.7%
Decimal Number
ValueCountFrequency (%)
2 9178
54.9%
0 3150
 
18.9%
1 1053
 
6.3%
3 753
 
4.5%
4 531
 
3.2%
7 479
 
2.9%
6 409
 
2.4%
9 403
 
2.4%
5 394
 
2.4%
8 360
 
2.2%
Other Punctuation
ValueCountFrequency (%)
, 1424
58.5%
. 548
 
22.5%
; 136
 
5.6%
& 136
 
5.6%
/ 114
 
4.7%
: 40
 
1.6%
· 35
 
1.4%
@ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
18850
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 844
100.0%
Close Punctuation
ValueCountFrequency (%)
) 473
100.0%
Open Punctuation
ValueCountFrequency (%)
( 473
100.0%
Math Symbol
ValueCountFrequency (%)
+ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 89558
59.4%
Common 39797
26.4%
Hangul 21328
 
14.2%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2931
 
13.7%
2717
 
12.7%
1387
 
6.5%
1324
 
6.2%
1240
 
5.8%
1167
 
5.5%
946
 
4.4%
508
 
2.4%
505
 
2.4%
435
 
2.0%
Other values (213) 8168
38.3%
Latin
ValueCountFrequency (%)
n 10378
 
11.6%
e 8701
 
9.7%
o 7032
 
7.9%
t 5862
 
6.5%
i 5739
 
6.4%
a 5224
 
5.8%
r 4579
 
5.1%
c 3375
 
3.8%
s 3100
 
3.5%
C 2967
 
3.3%
Other values (42) 32601
36.4%
Common
ValueCountFrequency (%)
18850
47.4%
2 9178
23.1%
0 3150
 
7.9%
, 1424
 
3.6%
1 1053
 
2.6%
- 844
 
2.1%
3 753
 
1.9%
. 548
 
1.4%
4 531
 
1.3%
7 479
 
1.2%
Other values (13) 2987
 
7.5%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 129320
85.8%
Hangul 21328
 
14.2%
None 35
 
< 0.1%
CJK 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18850
 
14.6%
n 10378
 
8.0%
2 9178
 
7.1%
e 8701
 
6.7%
o 7032
 
5.4%
t 5862
 
4.5%
i 5739
 
4.4%
a 5224
 
4.0%
r 4579
 
3.5%
c 3375
 
2.6%
Other values (64) 50402
39.0%
Hangul
ValueCountFrequency (%)
2931
 
13.7%
2717
 
12.7%
1387
 
6.5%
1324
 
6.2%
1240
 
5.8%
1167
 
5.5%
946
 
4.4%
508
 
2.4%
505
 
2.4%
435
 
2.0%
Other values (213) 8168
38.3%
None
ValueCountFrequency (%)
· 35
100.0%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

자료유형
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
Conference
3214 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowConference
2nd rowConference
3rd rowConference
4th rowConference
5th rowConference

Common Values

ValueCountFrequency (%)
Conference 3214
100.0%

Length

2023-12-13T04:39:19.912837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:39:20.055634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
conference 3214
100.0%

발행일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2022
3214 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 3214
100.0%

Length

2023-12-13T04:39:20.198359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:39:20.345438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 3214
100.0%
Distinct673
Distinct (%)20.9%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2023-12-13T04:39:20.670576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length96
Mean length21.688239
Min length3

Characters and Unicode

Total characters69706
Distinct characters260
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique326 ?
Unique (%)10.1%

Sample

1st rowSociety for Imaging Science and Technology
2nd rowAmerican Institute of Aeronautics and Astronautics Inc, AIAA
3rd rowIEEE
4th rowIEEE COMPUTER SOC
5th rowSPIE-INT SOC OPTICAL ENGINEERING
ValueCountFrequency (%)
of 491
 
5.4%
society 470
 
5.1%
and 386
 
4.2%
institute 277
 
3.0%
the 263
 
2.9%
ieee 257
 
2.8%
for 221
 
2.4%
engineers 194
 
2.1%
inc 188
 
2.0%
association 188
 
2.0%
Other values (849) 6237
68.0%
2023-12-13T04:39:21.222266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5960
 
8.6%
e 4703
 
6.7%
n 4445
 
6.4%
i 4182
 
6.0%
o 3879
 
5.6%
t 3605
 
5.2%
a 3083
 
4.4%
c 2875
 
4.1%
r 2699
 
3.9%
s 2380
 
3.4%
Other values (250) 31895
45.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 40379
57.9%
Other Letter 11285
 
16.2%
Uppercase Letter 10843
 
15.6%
Space Separator 5960
 
8.6%
Other Punctuation 465
 
0.7%
Close Punctuation 297
 
0.4%
Open Punctuation 297
 
0.4%
Decimal Number 96
 
0.1%
Dash Punctuation 84
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1581
 
14.0%
1440
 
12.8%
1382
 
12.2%
1164
 
10.3%
512
 
4.5%
274
 
2.4%
233
 
2.1%
231
 
2.0%
186
 
1.6%
185
 
1.6%
Other values (182) 4097
36.3%
Uppercase Letter
ValueCountFrequency (%)
E 1826
16.8%
I 1588
14.6%
S 1224
11.3%
C 972
9.0%
A 895
8.3%
M 612
 
5.6%
T 519
 
4.8%
N 478
 
4.4%
R 458
 
4.2%
P 386
 
3.6%
Other values (16) 1885
17.4%
Lowercase Letter
ValueCountFrequency (%)
e 4703
11.6%
n 4445
11.0%
i 4182
10.4%
o 3879
9.6%
t 3605
8.9%
a 3083
7.6%
c 2875
7.1%
r 2699
 
6.7%
s 2380
 
5.9%
l 1561
 
3.9%
Other values (15) 6967
17.3%
Decimal Number
ValueCountFrequency (%)
2 51
53.1%
0 20
 
20.8%
1 14
 
14.6%
9 5
 
5.2%
3 3
 
3.1%
8 2
 
2.1%
5 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 225
48.4%
. 147
31.6%
& 33
 
7.1%
; 33
 
7.1%
/ 21
 
4.5%
· 6
 
1.3%
Space Separator
ValueCountFrequency (%)
5960
100.0%
Close Punctuation
ValueCountFrequency (%)
) 297
100.0%
Open Punctuation
ValueCountFrequency (%)
( 297
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 51222
73.5%
Hangul 11281
 
16.2%
Common 7199
 
10.3%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1581
 
14.0%
1440
 
12.8%
1382
 
12.3%
1164
 
10.3%
512
 
4.5%
274
 
2.4%
233
 
2.1%
231
 
2.0%
186
 
1.6%
185
 
1.6%
Other values (178) 4093
36.3%
Latin
ValueCountFrequency (%)
e 4703
 
9.2%
n 4445
 
8.7%
i 4182
 
8.2%
o 3879
 
7.6%
t 3605
 
7.0%
a 3083
 
6.0%
c 2875
 
5.6%
r 2699
 
5.3%
s 2380
 
4.6%
E 1826
 
3.6%
Other values (41) 17545
34.3%
Common
ValueCountFrequency (%)
5960
82.8%
) 297
 
4.1%
( 297
 
4.1%
, 225
 
3.1%
. 147
 
2.0%
- 84
 
1.2%
2 51
 
0.7%
& 33
 
0.5%
; 33
 
0.5%
/ 21
 
0.3%
Other values (7) 51
 
0.7%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58415
83.8%
Hangul 11281
 
16.2%
None 6
 
< 0.1%
CJK 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5960
 
10.2%
e 4703
 
8.1%
n 4445
 
7.6%
i 4182
 
7.2%
o 3879
 
6.6%
t 3605
 
6.2%
a 3083
 
5.3%
c 2875
 
4.9%
r 2699
 
4.6%
s 2380
 
4.1%
Other values (57) 20604
35.3%
Hangul
ValueCountFrequency (%)
1581
 
14.0%
1440
 
12.8%
1382
 
12.3%
1164
 
10.3%
512
 
4.5%
274
 
2.4%
233
 
2.1%
231
 
2.0%
186
 
1.6%
185
 
1.6%
Other values (178) 4093
36.3%
None
ValueCountFrequency (%)
· 6
100.0%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct123
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
Minimum2021-11-17 00:00:00
Maximum2023-06-14 00:00:00
2023-12-13T04:39:21.420215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:39:21.609501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct3118
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2023-12-13T04:39:22.025791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length246
Median length143
Mean length82.43435
Min length12

Characters and Unicode

Total characters264944
Distinct characters673
Distinct categories17 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3038 ?
Unique (%)94.5%

Sample

1st rowPokemon Color Adjustments for Augmented Reality Contents
2nd rowVision-based Map-referenced Navigation using Terrain Classification of Aerial Images
3rd rowChecklist for Validating Trustworthy AI
4th rowTricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection
5th row1D Photonic Crystal GeSn-on-Insulator Nanobeam Laser
ValueCountFrequency (%)
of 1435
 
4.1%
for 1042
 
3.0%
and 710
 
2.0%
in 547
 
1.6%
with 431
 
1.2%
a 405
 
1.1%
the 379
 
1.1%
on 327
 
0.9%
using 311
 
0.9%
by 212
 
0.6%
Other values (9751) 29466
83.6%
2023-12-13T04:39:22.643252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32076
 
12.1%
e 21726
 
8.2%
i 18348
 
6.9%
o 16333
 
6.2%
n 16055
 
6.1%
t 15257
 
5.8%
a 15138
 
5.7%
r 13962
 
5.3%
s 10057
 
3.8%
l 9676
 
3.7%
Other values (663) 96316
36.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 191007
72.1%
Space Separator 32076
 
12.1%
Uppercase Letter 23175
 
8.7%
Other Letter 13630
 
5.1%
Dash Punctuation 2979
 
1.1%
Decimal Number 968
 
0.4%
Other Punctuation 820
 
0.3%
Open Punctuation 113
 
< 0.1%
Close Punctuation 113
 
< 0.1%
Final Punctuation 28
 
< 0.1%
Other values (7) 35
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
408
 
3.0%
356
 
2.6%
333
 
2.4%
321
 
2.4%
228
 
1.7%
228
 
1.7%
207
 
1.5%
207
 
1.5%
160
 
1.2%
159
 
1.2%
Other values (560) 11023
80.9%
Lowercase Letter
ValueCountFrequency (%)
e 21726
11.4%
i 18348
 
9.6%
o 16333
 
8.6%
n 16055
 
8.4%
t 15257
 
8.0%
a 15138
 
7.9%
r 13962
 
7.3%
s 10057
 
5.3%
l 9676
 
5.1%
c 8321
 
4.4%
Other values (20) 46134
24.2%
Uppercase Letter
ValueCountFrequency (%)
S 2407
 
10.4%
C 1924
 
8.3%
A 1895
 
8.2%
M 1561
 
6.7%
P 1560
 
6.7%
D 1516
 
6.5%
E 1377
 
5.9%
R 1204
 
5.2%
T 1152
 
5.0%
I 1064
 
4.6%
Other values (20) 7515
32.4%
Other Punctuation
ValueCountFrequency (%)
: 389
47.4%
, 140
 
17.1%
/ 127
 
15.5%
. 92
 
11.2%
? 27
 
3.3%
' 20
 
2.4%
% 10
 
1.2%
& 5
 
0.6%
@ 4
 
0.5%
3
 
0.4%
Other values (2) 3
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 253
26.1%
3 201
20.8%
1 115
11.9%
0 101
 
10.4%
4 85
 
8.8%
6 63
 
6.5%
5 48
 
5.0%
8 44
 
4.5%
9 36
 
3.7%
7 22
 
2.3%
Math Symbol
ValueCountFrequency (%)
+ 13
72.2%
× 2
 
11.1%
> 2
 
11.1%
| 1
 
5.6%
Open Punctuation
ValueCountFrequency (%)
( 111
98.2%
[ 2
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 111
98.2%
] 2
 
1.8%
Final Punctuation
ValueCountFrequency (%)
24
85.7%
4
 
14.3%
Modifier Symbol
ValueCountFrequency (%)
´ 2
66.7%
^ 1
33.3%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
32076
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2979
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Other Symbol
ValueCountFrequency (%)
° 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 214167
80.8%
Common 37130
 
14.0%
Hangul 13630
 
5.1%
Greek 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
408
 
3.0%
356
 
2.6%
333
 
2.4%
321
 
2.4%
228
 
1.7%
228
 
1.7%
207
 
1.5%
207
 
1.5%
160
 
1.2%
159
 
1.2%
Other values (560) 11023
80.9%
Latin
ValueCountFrequency (%)
e 21726
 
10.1%
i 18348
 
8.6%
o 16333
 
7.6%
n 16055
 
7.5%
t 15257
 
7.1%
a 15138
 
7.1%
r 13962
 
6.5%
s 10057
 
4.7%
l 9676
 
4.5%
c 8321
 
3.9%
Other values (44) 69294
32.4%
Common
ValueCountFrequency (%)
32076
86.4%
- 2979
 
8.0%
: 389
 
1.0%
2 253
 
0.7%
3 201
 
0.5%
, 140
 
0.4%
/ 127
 
0.3%
1 115
 
0.3%
( 111
 
0.3%
) 111
 
0.3%
Other values (31) 628
 
1.7%
Greek
ValueCountFrequency (%)
μ 5
29.4%
β 4
23.5%
π 2
 
11.8%
Ω 2
 
11.8%
δ 1
 
5.9%
Γ 1
 
5.9%
Δ 1
 
5.9%
Σ 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 251251
94.8%
Hangul 13629
 
5.1%
Punctuation 35
 
< 0.1%
None 26
 
< 0.1%
Number Forms 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32076
 
12.8%
e 21726
 
8.6%
i 18348
 
7.3%
o 16333
 
6.5%
n 16055
 
6.4%
t 15257
 
6.1%
a 15138
 
6.0%
r 13962
 
5.6%
s 10057
 
4.0%
l 9676
 
3.9%
Other values (74) 82623
32.9%
Hangul
ValueCountFrequency (%)
408
 
3.0%
356
 
2.6%
333
 
2.4%
321
 
2.4%
228
 
1.7%
228
 
1.7%
207
 
1.5%
207
 
1.5%
160
 
1.2%
159
 
1.2%
Other values (559) 11022
80.9%
Punctuation
ValueCountFrequency (%)
24
68.6%
4
 
11.4%
4
 
11.4%
3
 
8.6%
None
ValueCountFrequency (%)
μ 5
19.2%
β 4
15.4%
° 3
11.5%
π 2
 
7.7%
× 2
 
7.7%
´ 2
 
7.7%
Ω 2
 
7.7%
1
 
3.8%
1
 
3.8%
δ 1
 
3.8%
Other values (3) 3
11.5%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct147
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
Minimum2022-02-24 00:00:00
Maximum2023-06-14 00:00:00
2023-12-13T04:39:22.817897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:39:23.007418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct3214
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2023-12-13T04:39:23.335205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length46
Mean length46
Min length46

Characters and Unicode

Total characters147844
Distinct characters27
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3214 ?
Unique (%)100.0%

Sample

1st rowhttps://koasas.kaist.ac.kr/handle/10203/299740
2nd rowhttps://koasas.kaist.ac.kr/handle/10203/299812
3rd rowhttps://koasas.kaist.ac.kr/handle/10203/298311
4th rowhttps://koasas.kaist.ac.kr/handle/10203/298275
5th rowhttps://koasas.kaist.ac.kr/handle/10203/298327
ValueCountFrequency (%)
https://koasas.kaist.ac.kr/handle/10203/299740 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/298867 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/305183 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/305638 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/302208 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/304898 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/298855 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/304287 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/298856 1
 
< 0.1%
https://koasas.kaist.ac.kr/handle/10203/298857 1
 
< 0.1%
Other values (3204) 3204
99.7%
2023-12-13T04:39:23.808493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16070
 
10.9%
a 16070
 
10.9%
s 12856
 
8.7%
0 9824
 
6.6%
k 9642
 
6.5%
. 9642
 
6.5%
t 9642
 
6.5%
3 6529
 
4.4%
h 6428
 
4.3%
2 5476
 
3.7%
Other values (17) 45665
30.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 83564
56.5%
Decimal Number 35354
23.9%
Other Punctuation 28926
 
19.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 16070
19.2%
s 12856
15.4%
k 9642
11.5%
t 9642
11.5%
h 6428
 
7.7%
d 3214
 
3.8%
e 3214
 
3.8%
l 3214
 
3.8%
n 3214
 
3.8%
r 3214
 
3.8%
Other values (4) 12856
15.4%
Decimal Number
ValueCountFrequency (%)
0 9824
27.8%
3 6529
18.5%
2 5476
15.5%
1 4728
13.4%
9 2509
 
7.1%
4 1539
 
4.4%
8 1441
 
4.1%
6 1219
 
3.4%
7 1110
 
3.1%
5 979
 
2.8%
Other Punctuation
ValueCountFrequency (%)
/ 16070
55.6%
. 9642
33.3%
: 3214
 
11.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 83564
56.5%
Common 64280
43.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 16070
19.2%
s 12856
15.4%
k 9642
11.5%
t 9642
11.5%
h 6428
 
7.7%
d 3214
 
3.8%
e 3214
 
3.8%
l 3214
 
3.8%
n 3214
 
3.8%
r 3214
 
3.8%
Other values (4) 12856
15.4%
Common
ValueCountFrequency (%)
/ 16070
25.0%
0 9824
15.3%
. 9642
15.0%
3 6529
10.2%
2 5476
 
8.5%
1 4728
 
7.4%
: 3214
 
5.0%
9 2509
 
3.9%
4 1539
 
2.4%
8 1441
 
2.2%
Other values (3) 3308
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 147844
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16070
 
10.9%
a 16070
 
10.9%
s 12856
 
8.7%
0 9824
 
6.6%
k 9642
 
6.5%
. 9642
 
6.5%
t 9642
 
6.5%
3 6529
 
4.4%
h 6428
 
4.3%
2 5476
 
3.7%
Other values (17) 45665
30.9%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size25.2 KiB
2023-06-21
3214 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-21
2nd row2023-06-21
3rd row2023-06-21
4th row2023-06-21
5th row2023-06-21

Common Values

ValueCountFrequency (%)
2023-06-21 3214
100.0%

Length

2023-12-13T04:39:24.012511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:39:24.151974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-21 3214
100.0%

Interactions

2023-12-13T04:39:16.870332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T04:39:17.019631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:39:17.174248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이템(ID)저자서지정보자료유형발행일출판사입력일자료명갱신일통합지원식별자데이터기준일
0301262Lee, Donggun; Kim, Taesu; Suk, Hyeon-JeongIS and T International Symposium on Electronic Imaging: 27th Color Imaging: Displaying, Processing, Hardcopy, and Applications, COLOR 2022Conference2022Society for Imaging Science and Technology2022-11-16Pokemon Color Adjustments for Augmented Reality Contents2022-11-16https://koasas.kaist.ac.kr/handle/10203/2997402023-06-21
1301334Kim, Sungjoong; Hong, Kyungwoo; Bang, HyochoongAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022Conference2022American Institute of Aeronautics and Astronautics Inc, AIAA2022-11-17Vision-based Map-referenced Navigation using Terrain Classification of Aerial Images2022-11-17https://koasas.kaist.ac.kr/handle/10203/2998122023-06-21
2299835Han, Seung-Ho; Choi, Ho-JinIEEE International Conference on Big Data and Smart Computing (BigComp), pp.391 - 394Conference2022IEEE2022-09-05Checklist for Validating Trustworthy AI2022-11-27https://koasas.kaist.ac.kr/handle/10203/2983112023-06-21
3299799Kim, Beomyoung; Lee, Janghyeon; Lee, Sihaeng; Kim, Doyeon; Kim, Junmo22nd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp.3421 - 3430Conference2022IEEE COMPUTER SOC2022-09-02TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection2022-09-02https://koasas.kaist.ac.kr/handle/10203/2982752023-06-21
4299851Joo, Hyo-Jun; Kim, Youngmin; Burt, Daniel; Jung, Yongduck; Zhang, Lin; Chen, Melvina; Parluhutan, Samuel Jior; Kang, Dong-Ho; Lee, Chulwon; Assali, Simone; Ikonic, Zoran; Moutanabbir, Oussama; Cho, Yong-Hoon; Tan, Chuan Seng; Nam, DongukConference on Silicon Photonics XVII Part of SPIE Photonics West OPTO ConferenceConference2022SPIE-INT SOC OPTICAL ENGINEERING2022-09-051D Photonic Crystal GeSn-on-Insulator Nanobeam Laser2022-09-05https://koasas.kaist.ac.kr/handle/10203/2983272023-06-21
5301290Han, Juyeop; Tahk, Min-Jea; Choi, Han-LimAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022Conference2022American Institute of Aeronautics and Astronautics Inc, AIAA2022-11-16Pseudospectral method-based safe motion planning for quadrotors in a cluttered environment2022-11-16https://koasas.kaist.ac.kr/handle/10203/2997682023-06-21
6299844Yoon, Jinhyeong; Kim, Jaeyong; Kim, Junhyeong; Yoon, Hyeonho; Park, Hyo-Hoon; Kurt, HamzaConference on Silicon Photonics XVII Part of SPIE Photonics West OPTO ConferenceConference2022SPIE-INT SOC OPTICAL ENGINEERING2022-09-05Inverse design of high-performance grating structure for out-of-plane radiation of waveguide mode2022-09-05https://koasas.kaist.ac.kr/handle/10203/2983202023-06-21
7297861Choi, Tae Sung; Lee, Dong Kun; Jung, Yu Chae; Choi, Ho-JinThe 36th International Conference on Information Networking (ICOIN), pp.250 - 253Conference2022Korean Institute of Information Scientists and Engineers2022-04-28Multivariate Time-series Anomaly Detection using SeqVAE-CNN Hybrid Model2022-09-01https://koasas.kaist.ac.kr/handle/10203/2963372023-06-21
8301260Kim, Taesu; Park, Hyeonju; Suk, Hyeon-JeongIS and T International Symposium on Electronic Imaging: Human Vision and Electronic Imagings, HVEI 2022Conference2022Society for Imaging Science and Technology2022-11-16A Method Proposal for Evaluating Color Tolerance in Viewing Multiple White Points Focusing on the Vehicle Instrument Panels2022-11-16https://koasas.kaist.ac.kr/handle/10203/2997382023-06-21
9299856Byun, Junyoung; Go, Hyojun; Kim, Changick22nd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp.3809 - 3818Conference2022IEEE COMPUTER SOC2022-09-05Geometrically Adaptive Dictionary Attack on Face Recognition2022-09-05https://koasas.kaist.ac.kr/handle/10203/2983322023-06-21
아이템(ID)저자서지정보자료유형발행일출판사입력일자료명갱신일통합지원식별자데이터기준일
3204306338Son, Keeyoung; Kim, Joungho; Lho, Daehwan; Kim, Keunwoo; Choi, Seonguk; Kim, Haeyeon; Park, Hyunwook; Sim, BooGyo; Shin, TaeInIEEE Electrical Design of Advanced Packaging and Systems, EDAPS 2022Conference2022IEEE2023-01-31Power Distribution Network Impedance Analysis considering Thermal Distribution2023-02-09https://koasas.kaist.ac.kr/handle/10203/3048112023-06-21
3205306422Bae, Ji Eun; Calmano, Thomas; Krnkel, Christian; Rotermund, FabianOptica Laser Congress (Advanced Solid State Lasers Conference)Conference2022Optica (formerly OSA)2023-01-31Beam-Splitter-Type Waveguide Laser for Controllable Single- and Dual-Channel Q-Switching2023-01-31https://koasas.kaist.ac.kr/handle/10203/3048952023-06-21
3206305547Kwon, Soonjae; Park, Sung Hyuk; Lee, Gene; Lee, DongwonInternational Conference on Information Systems 2022Conference2022Association for Information Systems2023-01-05Learning Faces to Predict Matching Probability in an Online Matching Platform2023-01-05https://koasas.kaist.ac.kr/handle/10203/3040202023-06-21
3207305321김재훈2022 한국분자세포생물학회 에피유전체학분과 심포지엄Conference2022한국분자세포생물학회2022-12-27Transcription regulation by histone globular domain modifications2022-12-27https://koasas.kaist.ac.kr/handle/10203/3037942023-06-21
3208306435Nagarathinam, David; Cho, Yeunwoo9th International and 49th National conference of Fluid Mechanics and Fluid Power, FMFP-2022Conference2022National Society for Fluid Mechanics and Fluid Power2023-02-01Air lubrication on a flat plate in a steady water stream2023-02-01https://koasas.kaist.ac.kr/handle/10203/3049082023-06-21
3209306452김일두; 홍승범; 신병하3rd KAIST Emerging Materials e-SymposiumConference2022한국과학기술원 신소재공학과2023-02-01Emerging Energy Materials2023-02-01https://koasas.kaist.ac.kr/handle/10203/3049252023-06-21
3210308185이승재오믹스 기술 활용 연구 심포지엄Conference2022오믹스 기술 활용 연구 심포지엄2023-05-10Molecular genetic approaches for healthy longevity using Caenorhaboditis elegans2023-05-10https://koasas.kaist.ac.kr/handle/10203/3066592023-06-21
3211305576Kim, Hyun UkKorea Advanced Institute of Science and Technology-National Taiwan University 2nd ChemE WorkshopConference2022National Taiwan University2023-01-05Computational platform technologies for medicine and biotechnology2023-01-05https://koasas.kaist.ac.kr/handle/10203/3040492023-06-21
3212306059임우빈; 김우재; 윤성의한국소프트웨어종합학술대회Conference2022한국정보과학회2023-01-17Scenario Generation by Action Scene-Graph Prediction2023-01-17https://koasas.kaist.ac.kr/handle/10203/3045322023-06-21
3213305357Im, Bo-HaeThe 10th(2022) NCTS-POSTECH-PMI Joint Workshop on Number TheoryConference2022National Taiwa University2022-12-29Zagier-Hoffman's conjectures in positive characterisitc2022-12-29https://koasas.kaist.ac.kr/handle/10203/3038302023-06-21