Overview

Dataset statistics

Number of variables: 3
Number of observations: 150
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.8 KiB
Average record size in memory: 25.9 B

Variable types

Numeric: 1
Categorical: 1
Text: 1

Dataset

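Description: Hashtag search records of users on the 디클 platform. Each record provides a number (번호), a search category (분류), and the searched hashtag (해시태그).
Author: 한국양성평등교육진흥원 (Korean Institute for Gender Equality Promotion and Education)
URL: https://www.data.go.kr/data/15126163/fileData.do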

Alerts

번호 is highly overall correlated with 분류 [High correlation]
분류 is highly overall correlated with 번호 [High correlation]
번호 has unique values [Unique]

Reproduction

Analysis started: 2024-04-29 23:13:01.044445
Analysis finished: 2024-04-29 23:13:02.843155
Duration: 1.8 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
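
A report like this one can be regenerated with a few lines of Python. The sketch below is not the script that produced this report: the function name `build_profile`, the output path, the report title, and the `cp949` encoding are all assumptions made for illustration. Only `pandas.read_csv`, `ProfileReport`, and `ProfileReport.to_file` are real library calls.

```python
from pathlib import Path


def build_profile(csv_path: str, out_path: str = "report.html", encoding: str = "cp949") -> Path:
    """Profile a CSV file and write an HTML report (illustrative sketch only)."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the download is a CSV with the columns 번호, 분류, 해시태그,
    # and the encoding is a guess that may need to be changed (e.g. "utf-8").
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is hypothetical; the original report's title is not shown.
    report = ProfileReport(df, title="Hashtag search records")
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_profile("hashtags.csv")` with a local copy of the file (the file name is hypothetical) would write `report.html` next to the script.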

Variables

번호
Real number (ℝ)

Alerts: HIGH CORRELATION, UNIQUE

Distinct: 150
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 75.5
Minimum: 1
Maximum: 150
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 KiB

Quantile statistics

Minimum: 1
5th percentile: 8.45
Q1: 38.25
Median: 75.5
Q3: 112.75
95th percentile: 142.55
Maximum: 150
Range: 149
Interquartile range (IQR): 74.5

Descriptive statistics

Standard deviation: 43.445368
Coefficient of variation (CV): 0.57543534
Kurtosis: -1.2
Mean: 75.5
Median Absolute Deviation (MAD): 37.5
Skewness: 0
Sum: 11325
Variance: 1887.5
Monotonicity: Strictly increasing
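
Because 번호 has 150 distinct, strictly increasing values between 1 and 150, it is presumably the sequence 1..150, and the statistics above can be checked without the original file. The sketch below makes that assumption explicit and uses only the Python standard library. The helper `quantile` is a hypothetical name for plain linear interpolation between ranks, which reproduces the reported percentiles here.

```python
import statistics

# Assumption: 번호 is the sequence 1..150, as the report's minimum, maximum,
# distinct count, and strict monotonicity suggest.
values = list(range(1, 151))


def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
q1, q3 = quantile(values, 0.25), quantile(values, 0.75)
p5, p95 = quantile(values, 0.05), quantile(values, 0.95)

print(mean, variance, round(std, 6), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad, sum(values))
```

The variance matches the report only with the n - 1 denominator, which suggests the tool reports sample rather than population variance.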
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
1                   1      0.7%
96                  1      0.7%
98                  1      0.7%
99                  1      0.7%
100                 1      0.7%
101                 1      0.7%
102                 1      0.7%
103                 1      0.7%
104                 1      0.7%
105                 1      0.7%
Other values (140)  140    93.3%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.7%
2      1      0.7%
3      1      0.7%
4      1      0.7%
5      1      0.7%
6      1      0.7%
7      1      0.7%
8      1      0.7%
9      1      0.7%
10     1      0.7%

Maximum 10 values

Value  Count  Frequency (%)
150    1      0.7%
149    1      0.7%
148    1      0.7%
147    1      0.7%
146    1      0.7%
145    1      0.7%
144    1      0.7%
143    1      0.7%
142    1      0.7%
141    1      0.7%

분류
Categorical

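Alerts: HIGH CORRELATION

Distinct: 5
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value  Count
교사·양육자 (teachers and caregivers)  31
클립영상 (video clips)  31
7-12세 (ages 7-12)  30
16-18세 (ages 16-18)  30
13-15세 (ages 13-15)  28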

Length

Max length: 6
Median length: 6
Mean length: 5.3866667
Min length: 4
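
These length figures follow directly from the five category labels and their counts. A minimal sketch, assuming length is measured in Unicode code points (which is what Python's `len` returns for a string):

```python
from statistics import mean, median

# Category counts as listed in the report for 분류.
counts = {"교사·양육자": 31, "클립영상": 31, "7-12세": 30, "16-18세": 30, "13-15세": 28}

# One length per observation, measured in Unicode code points.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

max_len, min_len = max(lengths), min(lengths)
median_len = median(lengths)
mean_len = mean(lengths)

print(max_len, median_len, round(mean_len, 7), min_len)
```

The total is 808 characters over 150 rows, which gives the reported mean of 5.3866667.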

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 7-12세 (ages 7-12)
2nd row: 7-12세 (ages 7-12)
3rd row: 7-12세 (ages 7-12)
4th row: 7-12세 (ages 7-12)
5th row: 7-12세 (ages 7-12)

Common Values

Value  Count  Frequency (%)
교사·양육자 (teachers and caregivers)  31  20.7%
클립영상 (video clips)  31  20.7%
7-12세 (ages 7-12)  30  20.0%
16-18세 (ages 16-18)  30  20.0%
13-15세 (ages 13-15)  28  18.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the category counts listed under Common Values]

해시태그
Text

Distinct: 68
Distinct (%): 45.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 12
Median length: 11
Mean length: 4.78
Min length: 2
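
The mean length agrees with the character total reported for this variable (717 characters over 150 values), but the median length of 11 does not fit the other figures. The arithmetic below, a sketch using only numbers from this report, shows why: with 150 values, a median of 11 forces at least 75 values to have length 11 or more, which already needs more characters than the column contains. The reported median may therefore reflect how the tool computes it rather than the raw value lengths; recomputing it from the source column would settle the question.

```python
n_values = 150        # observations
total_chars = 717     # "Total characters" reported for 해시태그
min_len = 2           # reported minimum length
median_len = 11       # reported median length

mean_len = total_chars / n_values

# With an even count, the median averages the 75th and 76th sorted lengths,
# so the 76th value and everything above it (75 values) is >= the median.
upper_half = n_values // 2
lower_half = n_values - upper_half

# Smallest possible character total if the reported median were correct.
lower_bound = upper_half * median_len + lower_half * min_len

consistent = lower_bound <= total_chars
print(round(mean_len, 2), lower_bound, consistent)
```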

Characters and Unicode

Total characters: 717
Distinct characters: 172
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
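
As a rough illustration of these character properties, the sketch below classifies the characters of three hashtags taken from this report's sample rows. It uses `unicodedata.category` from the Python standard library, which returns short codes (`Lo` for Other Letter, `Ll` for Lowercase Letter, `Zs` for Space Separator). The helper `block_of` is hypothetical and covers only the two blocks that appear in this report; the standard library has no script or block lookup, so this is not how the profiling tool derives them.

```python
import unicodedata
from collections import Counter

# A few hashtags taken from the report's sample rows.
hashtags = ["메타버스", "n번방사건", "온라인 그루밍"]


def block_of(ch):
    """Rough block lookup covering only ASCII and Hangul Syllables."""
    code_point = ord(ch)
    if code_point < 0x80:
        return "ASCII"
    if 0xAC00 <= code_point <= 0xD7AF:
        return "Hangul Syllables"
    return "Other"


characters = [ch for tag in hashtags for ch in tag]
categories = Counter(unicodedata.category(ch) for ch in characters)
blocks = Counter(block_of(ch) for ch in characters)

print(categories)
print(blocks)
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates the tables in this section.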

Unique

Unique: 29
Unique (%): 19.3%

Sample

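1st row: 메타버스 (metaverse)
2nd row: 개인정보보호 (personal information protection)
3rd row: 온라인그루밍 (online grooming)
4th row: 온라인채팅 (online chat)
5th row: 프로파일러권일용 (profiler Kwon Il-yong)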
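
Words

Value  Count  Frequency (%)
메타버스 (metaverse)  5  3.3%
온라인그루밍 (online grooming)  5  3.3%
불법촬영 (illegal filming)  5  3.3%
웹드라마 (web drama)  4  2.6%
온라인스토킹 (online stalking)  4  2.6%
이달의콘텐츠 (content of the month)  4  2.6%
다큐멘터리 (documentary)  4  2.6%
고민상담 (advice and counseling)  4  2.6%
개인정보보호 (personal information protection)  4  2.6%
피해지원기관 (victim support organizations)  4  2.6%
Other values (59)  109  71.7%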
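
The counts in the table above sum to 152 rather than 150 (15 + 28 + 109), and the percentages match that denominator (5/152 ≈ 3.3%, 4/152 ≈ 2.6%). This would be consistent with each value being split on whitespace before counting, since the character statistics report exactly two space characters in the column. The sketch below illustrates that reading on the 20 values shown in this report's sample rows; whitespace splitting is an assumption about the tool's behaviour, not something the report states.

```python
from collections import Counter

# The 20 values shown in the report's sample rows (first 10 and last 10).
values = [
    "메타버스", "개인정보보호", "온라인그루밍", "온라인채팅", "프로파일러권일용",
    "게임", "이달의콘텐츠", "애니메이션", "오픈채팅", "불법촬영",
    "n번방사건", "디지털성범죄", "사이버언어폭력", "정책제안", "토론",
    "다큐멘터리", "디지털네이티브", "실험", "사이버 성적괴롭힘", "온라인 그루밍",
]

# Assumption: tokens are produced by splitting each value on whitespace.
tokens = [token for value in values for token in value.split()]
word_counts = Counter(tokens)

# Share of each word relative to all tokens, not relative to rows.
shares = {word: count / len(tokens) for word, count in word_counts.items()}

print(len(values), len(tokens))
```

Under this reading, 온라인 그루밍 contributes the two tokens 온라인 and 그루밍 and is counted separately from the unspaced 온라인그루밍.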

Most occurring characters

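Value               Count  Frequency (%)
(not captured)      24     3.3%
(not captured)      21     2.9%
(not captured)      20     2.8%
(not captured)      19     2.6%
(not captured)      17     2.4%
(not captured)      15     2.1%
(not captured)      14     2.0%
(not captured)      12     1.7%
(not captured)      12     1.7%
(not captured)      12     1.7%
Other values (162)  551    76.8%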

Most occurring categories

Value             Count  Frequency (%)
Other Letter      702    97.9%
Lowercase Letter  6      0.8%
Uppercase Letter  4      0.6%
Decimal Number    3      0.4%
Space Separator   2      0.3%

Most frequent character per category

Other Letter

Value               Count  Frequency (%)
(not captured)      24     3.4%
(not captured)      21     3.0%
(not captured)      20     2.8%
(not captured)      19     2.7%
(not captured)      17     2.4%
(not captured)      15     2.1%
(not captured)      14     2.0%
(not captured)      12     1.7%
(not captured)      12     1.7%
(not captured)      12     1.7%
Other values (157)  536    76.4%

Uppercase Letter

Value  Count  Frequency (%)
Q      2      50.0%
A      2      50.0%

Lowercase Letter

Value  Count  Frequency (%)
n      6      100.0%

Decimal Number

Value  Count  Frequency (%)
2      3      100.0%

Space Separator

Value    Count  Frequency (%)
(space)  2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  702    97.9%
Latin   10     1.4%
Common  5      0.7%

Most frequent character per script

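Hangul

Value               Count  Frequency (%)
(not captured)      24     3.4%
(not captured)      21     3.0%
(not captured)      20     2.8%
(not captured)      19     2.7%
(not captured)      17     2.4%
(not captured)      15     2.1%
(not captured)      14     2.0%
(not captured)      12     1.7%
(not captured)      12     1.7%
(not captured)      12     1.7%
Other values (157)  536    76.4%

Latin

Value  Count  Frequency (%)
n      6      60.0%
Q      2      20.0%
A      2      20.0%

Common

Value    Count  Frequency (%)
2        3      60.0%
(space)  2      40.0%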

Most occurring blocks

Value   Count  Frequency (%)
Hangul  702    97.9%
ASCII   15     2.1%

Most frequent character per block

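Hangul

Value               Count  Frequency (%)
(not captured)      24     3.4%
(not captured)      21     3.0%
(not captured)      20     2.8%
(not captured)      19     2.7%
(not captured)      17     2.4%
(not captured)      15     2.1%
(not captured)      14     2.0%
(not captured)      12     1.7%
(not captured)      12     1.7%
(not captured)      12     1.7%
Other values (157)  536    76.4%

ASCII

Value    Count  Frequency (%)
n        6      40.0%
2        3      20.0%
(space)  2      13.3%
Q        2      13.3%
A        2      13.3%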
13.3%

Interactions


Correlations

          번호     분류     해시태그
번호        1.000  0.999  0.000
분류        0.999  1.000  0.000
해시태그      0.000  0.000  1.000

          번호     분류
번호        1.000  0.943
분류        0.943  1.000
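
A likely reason for the strong association between 번호 and 분류 is that the file is sorted by category, so the row number alone almost determines the category. The sketch below illustrates this with the correlation ratio (eta), which is a swapped-in measure chosen for simplicity and not necessarily the one ydata-profiling used for either matrix above. It assumes the rows form five contiguous blocks in the order listed in the code; only the first block (7-12세) and the last block (클립영상) are confirmed by the sample rows.

```python
from statistics import mean

# Assumption: rows are grouped by category in this order. Block sizes come
# from the report; only the first and last blocks are confirmed by the sample.
block_sizes = [("7-12세", 30), ("13-15세", 28), ("16-18세", 30),
               ("교사·양육자", 31), ("클립영상", 31)]

numbers, labels = [], []
for label, size in block_sizes:
    for _ in range(size):
        numbers.append(len(numbers) + 1)  # 번호 runs 1..150
        labels.append(label)


def correlation_ratio(values, groups):
    """Eta: square root of between-group over total sum of squares."""
    overall = mean(values)
    by_group = {}
    for value, group in zip(values, groups):
        by_group.setdefault(group, []).append(value)
    between = sum(len(vs) * (mean(vs) - overall) ** 2 for vs in by_group.values())
    total = sum((value - overall) ** 2 for value in values)
    return (between / total) ** 0.5


eta = correlation_ratio(numbers, labels)
print(round(eta, 3))
```

If this reading is right, the alert reflects the ordering of the file rather than any relationship between the record number and search behaviour.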

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

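First rows

(index)  번호  분류  해시태그
0  1  7-12세  메타버스 (metaverse)
1  2  7-12세  개인정보보호 (personal information protection)
2  3  7-12세  온라인그루밍 (online grooming)
3  4  7-12세  온라인채팅 (online chat)
4  5  7-12세  프로파일러권일용 (profiler Kwon Il-yong)
5  6  7-12세  게임 (game)
6  7  7-12세  이달의콘텐츠 (content of the month)
7  8  7-12세  애니메이션 (animation)
8  9  7-12세  오픈채팅 (open chat)
9  10  7-12세  불법촬영 (illegal filming)

Last rows

(index)  번호  분류  해시태그
140  141  클립영상  n번방사건 (Nth Room case)
141  142  클립영상  디지털성범죄 (digital sex crime)
142  143  클립영상  사이버언어폭력 (cyber verbal abuse)
143  144  클립영상  정책제안 (policy proposal)
144  145  클립영상  토론 (debate)
145  146  클립영상  다큐멘터리 (documentary)
146  147  클립영상  디지털네이티브 (digital native)
147  148  클립영상  실험 (experiment)
148  149  클립영상  사이버 성적괴롭힘 (cyber sexual harassment)
149  150  클립영상  온라인 그루밍 (online grooming)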