Overview

Dataset statistics

Number of variables3
Number of observations3814
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory89.5 KiB
Average record size in memory24.0 B

Variable types

Text2
DateTime1

Dataset

Description사용자 검색어 정보에 대한 데이터로 검색어, 검색횟수, 업데이트일 등의 항목을 제공합니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15070753/fileData.do

Alerts

검색어 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:58:10.855174
Analysis finished2023-12-12 05:58:11.357873
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

검색어
Text

UNIQUE 

Distinct3814
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size29.9 KiB
2023-12-12T14:58:11.640874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length24
Mean length4.9737808
Min length1

Characters and Unicode

Total characters18970
Distinct characters498
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3814 ?
Unique (%)100.0%

Sample

1st row0
2nd row1
3rd rowC
4th rowK
5th rowk
ValueCountFrequency (%)
학습자 42
 
0.9%
온라인 42
 
0.9%
자격증 41
 
0.9%
학점 41
 
0.9%
신청 41
 
0.9%
학점인정 35
 
0.8%
증명서 34
 
0.8%
등록 27
 
0.6%
전공 25
 
0.6%
신청서 24
 
0.5%
Other values (3264) 4099
92.1%
2023-12-12T14:58:12.134821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1177
 
6.2%
698
 
3.7%
605
 
3.2%
441
 
2.3%
394
 
2.1%
375
 
2.0%
374
 
2.0%
346
 
1.8%
338
 
1.8%
295
 
1.6%
Other values (488) 13927
73.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17723
93.4%
Space Separator 698
 
3.7%
Lowercase Letter 251
 
1.3%
Decimal Number 186
 
1.0%
Uppercase Letter 96
 
0.5%
Other Punctuation 12
 
0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1177
 
6.6%
605
 
3.4%
441
 
2.5%
394
 
2.2%
375
 
2.1%
374
 
2.1%
346
 
2.0%
338
 
1.9%
295
 
1.7%
290
 
1.6%
Other values (428) 13088
73.8%
Lowercase Letter
ValueCountFrequency (%)
r 22
 
8.8%
c 19
 
7.6%
s 19
 
7.6%
d 16
 
6.4%
k 16
 
6.4%
m 16
 
6.4%
t 16
 
6.4%
o 14
 
5.6%
b 13
 
5.2%
a 13
 
5.2%
Other values (15) 87
34.7%
Uppercase Letter
ValueCountFrequency (%)
S 13
13.5%
C 11
11.5%
D 8
 
8.3%
T 7
 
7.3%
B 7
 
7.3%
K 6
 
6.2%
A 6
 
6.2%
I 5
 
5.2%
E 5
 
5.2%
H 5
 
5.2%
Other values (9) 23
24.0%
Decimal Number
ValueCountFrequency (%)
2 53
28.5%
3 39
21.0%
1 33
17.7%
4 14
 
7.5%
0 13
 
7.0%
8 12
 
6.5%
5 11
 
5.9%
6 6
 
3.2%
7 3
 
1.6%
9 2
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 8
66.7%
/ 2
 
16.7%
: 1
 
8.3%
1
 
8.3%
Space Separator
ValueCountFrequency (%)
698
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17723
93.4%
Common 900
 
4.7%
Latin 347
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1177
 
6.6%
605
 
3.4%
441
 
2.5%
394
 
2.2%
375
 
2.1%
374
 
2.1%
346
 
2.0%
338
 
1.9%
295
 
1.7%
290
 
1.6%
Other values (428) 13088
73.8%
Latin
ValueCountFrequency (%)
r 22
 
6.3%
c 19
 
5.5%
s 19
 
5.5%
d 16
 
4.6%
k 16
 
4.6%
m 16
 
4.6%
t 16
 
4.6%
o 14
 
4.0%
b 13
 
3.7%
S 13
 
3.7%
Other values (34) 183
52.7%
Common
ValueCountFrequency (%)
698
77.6%
2 53
 
5.9%
3 39
 
4.3%
1 33
 
3.7%
4 14
 
1.6%
0 13
 
1.4%
8 12
 
1.3%
5 11
 
1.2%
. 8
 
0.9%
6 6
 
0.7%
Other values (6) 13
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17719
93.4%
ASCII 1246
 
6.6%
Compat Jamo 4
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1177
 
6.6%
605
 
3.4%
441
 
2.5%
394
 
2.2%
375
 
2.1%
374
 
2.1%
346
 
2.0%
338
 
1.9%
295
 
1.7%
290
 
1.6%
Other values (426) 13084
73.8%
ASCII
ValueCountFrequency (%)
698
56.0%
2 53
 
4.3%
3 39
 
3.1%
1 33
 
2.6%
r 22
 
1.8%
c 19
 
1.5%
s 19
 
1.5%
d 16
 
1.3%
k 16
 
1.3%
m 16
 
1.3%
Other values (49) 315
25.3%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
2
50.0%
None
ValueCountFrequency (%)
1
100.0%
Distinct901
Distinct (%)23.6%
Missing0
Missing (%)0.0%
Memory size29.9 KiB
2023-12-12T14:58:12.548397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length2.7574725
Min length2

Characters and Unicode

Total characters10517
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique488 ?
Unique (%)12.8%

Sample

1st row668
2nd row137
3rd row55
4th row54
5th row51
ValueCountFrequency (%)
51 63
 
1.7%
54 60
 
1.6%
53 54
 
1.4%
60 53
 
1.4%
56 46
 
1.2%
61 46
 
1.2%
55 44
 
1.2%
52 44
 
1.2%
71 42
 
1.1%
57 41
 
1.1%
Other values (891) 3321
87.1%
2023-12-12T14:58:13.148868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1962
18.7%
5 1120
10.6%
6 1089
10.4%
2 1082
10.3%
7 943
9.0%
3 934
8.9%
8 785
7.5%
4 782
 
7.4%
9 781
 
7.4%
0 750
 
7.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10228
97.3%
Other Punctuation 289
 
2.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1962
19.2%
5 1120
11.0%
6 1089
10.6%
2 1082
10.6%
7 943
9.2%
3 934
9.1%
8 785
7.7%
4 782
 
7.6%
9 781
 
7.6%
0 750
 
7.3%
Other Punctuation
ValueCountFrequency (%)
, 289
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10517
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1962
18.7%
5 1120
10.6%
6 1089
10.4%
2 1082
10.3%
7 943
9.0%
3 934
8.9%
8 785
7.5%
4 782
 
7.4%
9 781
 
7.4%
0 750
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10517
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1962
18.7%
5 1120
10.6%
6 1089
10.4%
2 1082
10.3%
7 943
9.0%
3 934
8.9%
8 785
7.5%
4 782
 
7.4%
9 781
 
7.4%
0 750
 
7.1%
Distinct199
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size29.9 KiB
Minimum2019-01-12 00:00:00
Maximum2021-11-02 00:00:00
2023-12-12T14:58:13.300176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:58:13.426404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-12T14:58:11.226258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:58:11.324460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

검색어검색횟수업데이트일
006682021-04-11
111372021-10-22
2C552021-08-02
3K542021-09-30
4k512021-08-04
518892021-10-26
63D1472021-10-21
73d1632021-10-11
848572021-10-31
9A11002021-10-06
검색어검색횟수업데이트일
3804사회복지사 자격증 발급 신청서1012021-09-24
3805온라인 학습자등록 및 학점인정562021-10-27
3806중복과목 및 대체과목 처리기준2442021-10-18
3807한국열린사이버대학교평생교육원572021-02-18
380803232323232323333333333333333333333552019-08-09
3809교육훈련기관용 학점인정 신청 시스템792021-10-08
3810온라인 학습자등록 및 학점인정 등 각종신청1142021-10-25
3811온라인학습자 등록 및 학점인정 등 각종신청782021-10-18
3812온라인 학습자등록 및 학점인정 등 각종 신청972021-10-09
3813온라인 학습자 등록 및 학점인정 등 각종 신청552021-07-29