Overview

Dataset statistics

Number of variables: 1
Number of observations: 4993
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 39.1 KiB
Average record size in memory: 8.0 B
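As a rough consistency check, the average record size should equal the total memory footprint divided by the number of observations. The sketch below redoes that arithmetic with the figures reported above. The variable names are illustrative, and treating 1 KiB as 1024 bytes is an assumption (the usual binary convention), not something the report states.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 39.1
n_observations = 4993

# Assumption: KiB is the binary unit (1 KiB = 1024 bytes).
total_size_bytes = total_size_kib * 1024

# Average bytes per record, rounded to one decimal place like the report.
avg_record_size = round(total_size_bytes / n_observations, 1)
print(f"{avg_record_size} B")
```

The result agrees with the reported 8.0 B, which would be consistent with one 8-byte reference per row in a single-column table.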

Variable types

Text: 1

Dataset

Description: Provides information (IDs) on the participants who responded to public surveys conducted on the 한국사학진흥재단 (Korea Advancing Schools Foundation) website.
Author: 한국사학진흥재단
URL: https://www.data.go.kr/data/15067219/fileData.do

Alerts

메시지아이디 has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 23:22:50.017341
Analysis finished: 2023-12-11 23:22:50.230284
Duration: 0.21 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

메시지아이디
Text

UNIQUE 

Distinct: 4993
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 39.1 KiB

Length

Max length: 12
Median length: 11
Mean length: 7.7810935
Min length: 3
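The length statistics above are plain summary statistics over the character length of each value. A minimal sketch follows, using only the five sample rows listed in this report's Sample section rather than the full column, so its numbers will not match the figures above. The names `values` and `length_stats` are illustrative.

```python
from statistics import mean, median

# Illustrative subset: the five sample rows shown in this report,
# not the full 4993-row column.
values = ["YMHan1201", "dsthih", "elee2060", "amls01", "hjfreea"]

# Character length of each ID.
lengths = [len(v) for v in values]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```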

Characters and Unicode

Total characters: 38851
Distinct characters: 58
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
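One way to see these character properties in practice is Python's standard `unicodedata` module, which exposes the general category of each code point. The sketch below is not how the report was produced. It counts categories over the five sample rows from this report plus one hypothetical value, `a_b`, added only to show a Connector Punctuation character. The standard library has no script or block lookup, so only categories are covered.

```python
import unicodedata
from collections import Counter

# Illustrative subset: sample rows from this report, plus one hypothetical
# value ("a_b") containing an underscore.
values = ["YMHan1201", "dsthih", "elee2060", "amls01", "hjfreea", "a_b"]

# unicodedata.category returns two-letter general category codes:
# "Ll" lowercase letter, "Lu" uppercase letter, "Nd" decimal number,
# "Pc" connector punctuation.
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)
print(category_counts)
```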

Unique

Unique: 4993
Unique (%): 100.0%
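Distinct and Unique are related but not identical measures. As commonly defined in profiling tools, Distinct counts the different values present, while Unique counts the values that occur exactly once. The two coincide in this column because no ID repeats. The sketch below illustrates the difference on hypothetical values that are not taken from the dataset.

```python
from collections import Counter

# Hypothetical values (not from the dataset), with one repeated ID.
values = ["id_a", "id_b", "id_b", "id_c"]

counts = Counter(values)

# Distinct: how many different values appear at all.
n_distinct = len(counts)

# Unique: how many values appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)
```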

Sample

1st row: YMHan1201
2nd row: dsthih
3rd row: elee2060
4th row: amls01
5th row: hjfreea

Value                Count  Frequency (%)
andrei                   2  < 0.1%
csrcha                   2  < 0.1%
fortress17               1  < 0.1%
snshin33                 1  < 0.1%
brang1123                1  < 0.1%
csleere                  1  < 0.1%
penderah                 1  < 0.1%
jhkim3273                1  < 0.1%
try2000                  1  < 0.1%
yhl05050                 1  < 0.1%
Other values (4981)   4981  99.8%

Most occurring characters

Value              Count  Frequency (%)
n                   2136  5.5%
o                   2105  5.4%
a                   2101  5.4%
s                   2071  5.3%
e                   1974  5.1%
0                   1937  5.0%
1                   1799  4.6%
i                   1693  4.4%
k                   1627  4.2%
h                   1553  4.0%
Other values (48)  19855  51.1%
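A character frequency table of this kind can be approximated by concatenating all values and counting characters. The sketch below does this for the five sample rows shown in this report, so its counts are far smaller than the figures above. `values`, `char_counts`, and `total` are illustrative names.

```python
from collections import Counter

# Illustrative subset: the five sample rows shown in this report.
values = ["YMHan1201", "dsthih", "elee2060", "amls01", "hjfreea"]

# Count every character across all values (case-sensitive).
char_counts = Counter("".join(values))
total = sum(char_counts.values())

# Print the three most frequent characters with their share of the total.
for ch, count in char_counts.most_common(3):
    print(f"{ch}  {count}  {count / total:.1%}")
```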

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       27918  71.9%
Decimal Number         10856  27.9%
Uppercase Letter          76  0.2%
Connector Punctuation      1  < 0.1%

Most frequent character per category

Lowercase Letter
Value              Count  Frequency (%)
n                   2136  7.7%
o                   2105  7.5%
a                   2101  7.5%
s                   2071  7.4%
e                   1974  7.1%
i                   1693  6.1%
k                   1627  5.8%
h                   1553  5.6%
j                   1177  4.2%
m                   1175  4.2%
Other values (16)  10306  36.9%

Uppercase Letter
Value              Count  Frequency (%)
H                      9  11.8%
A                      7  9.2%
J                      5  6.6%
Y                      5  6.6%
C                      4  5.3%
M                      4  5.3%
I                      4  5.3%
K                      4  5.3%
S                      4  5.3%
T                      4  5.3%
Other values (11)     26  34.2%

Decimal Number
Value              Count  Frequency (%)
0                   1937  17.8%
1                   1799  16.6%
2                   1406  13.0%
7                   1009  9.3%
3                    926  8.5%
9                    850  7.8%
8                    790  7.3%
5                    788  7.3%
4                    707  6.5%
6                    644  5.9%

Connector Punctuation
Value              Count  Frequency (%)
_                      1  100.0%

Most occurring scripts

Value              Count  Frequency (%)
Latin              27994  72.1%
Common             10857  27.9%

Most frequent character per script

Latin
Value              Count  Frequency (%)
n                   2136  7.6%
o                   2105  7.5%
a                   2101  7.5%
s                   2071  7.4%
e                   1974  7.1%
i                   1693  6.0%
k                   1627  5.8%
h                   1553  5.5%
j                   1177  4.2%
m                   1175  4.2%
Other values (37)  10382  37.1%

Common
Value              Count  Frequency (%)
0                   1937  17.8%
1                   1799  16.6%
2                   1406  13.0%
7                   1009  9.3%
3                    926  8.5%
9                    850  7.8%
8                    790  7.3%
5                    788  7.3%
4                    707  6.5%
6                    644  5.9%

Most occurring blocks

Value              Count  Frequency (%)
ASCII              38851  100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
n                   2136  5.5%
o                   2105  5.4%
a                   2101  5.4%
s                   2071  5.3%
e                   1974  5.1%
0                   1937  5.0%
1                   1799  4.6%
i                   1693  4.4%
k                   1627  4.2%
h                   1553  4.0%
Other values (48)  19855  51.1%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

      메시지아이디
0     YMHan1201
1     dsthih
2     elee2060
3     amls01
4     hjfreea
5     romane
6     longli22
7     a0075a
8     jhs123123
9     kmj2047

Last rows

      메시지아이디
4983  ksl8787
4984  bjh2413
4985  simongsp
4986  tm0880
4987  dlcktodi
4988  nagnehoon
4989  kuis2007
4990  todana
4991  yhpark00
4992  sumi2223