Overview

Dataset statistics

Number of variables4
Number of observations648
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.6 KiB
Average record size in memory34.2 B

Variable types

Numeric2
Text1
DateTime1

Dataset

Description부산진구신문에 대한 해시태그 검색 정보(등록일자, 검색횟수, 검색해시태그, 해시태그 고유번호)를 제공합니다.
URLhttps://www.data.go.kr/data/15120152/fileData.do

Alerts

해시태그 고유번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:40:06.286025
Analysis finished2023-12-12 05:40:07.530452
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해시태그 고유번호
Real number (ℝ)

UNIQUE 

Distinct648
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean332.29938
Minimum1
Maximum656
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.8 KiB
2023-12-12T14:40:07.615096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile41.35
Q1170.75
median332.5
Q3494.25
95-th percentile623.65
Maximum656
Range655
Interquartile range (IQR)323.5

Descriptive statistics

Standard deviation187.54851
Coefficient of variation (CV)0.56439621
Kurtosis-1.1918085
Mean332.29938
Median Absolute Deviation (MAD)162
Skewness-0.0061535255
Sum215330
Variance35174.445
MonotonicityStrictly increasing
2023-12-12T14:40:07.777020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
455 1
 
0.2%
437 1
 
0.2%
438 1
 
0.2%
439 1
 
0.2%
440 1
 
0.2%
441 1
 
0.2%
442 1
 
0.2%
443 1
 
0.2%
444 1
 
0.2%
Other values (638) 638
98.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
656 1
0.2%
655 1
0.2%
654 1
0.2%
653 1
0.2%
652 1
0.2%
651 1
0.2%
650 1
0.2%
649 1
0.2%
648 1
0.2%
647 1
0.2%
Distinct548
Distinct (%)84.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2023-12-12T14:40:08.160162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length4.566358
Min length1

Characters and Unicode

Total characters2959
Distinct characters434
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique476 ?
Unique (%)73.5%

Sample

1st row전포카페거리
2nd row전리단길
3rd row맛집
4th row공연
5th row여행
ValueCountFrequency (%)
부산진구 13
 
1.8%
당감종합사회복지관 7
 
0.9%
범천 6
 
0.8%
규제 6
 
0.8%
서면 6
 
0.8%
꿈드림 5
 
0.7%
걷기 5
 
0.7%
brt 4
 
0.5%
임대인 4
 
0.5%
가야2동 4
 
0.5%
Other values (562) 682
91.9%
2023-12-12T14:40:08.694161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
112
 
3.8%
58
 
2.0%
52
 
1.8%
51
 
1.7%
51
 
1.7%
51
 
1.7%
49
 
1.7%
42
 
1.4%
36
 
1.2%
36
 
1.2%
Other values (424) 2421
81.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2671
90.3%
Space Separator 112
 
3.8%
Decimal Number 85
 
2.9%
Uppercase Letter 39
 
1.3%
Lowercase Letter 35
 
1.2%
Other Punctuation 11
 
0.4%
Close Punctuation 3
 
0.1%
Other Symbol 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
 
2.2%
52
 
1.9%
51
 
1.9%
51
 
1.9%
51
 
1.9%
49
 
1.8%
42
 
1.6%
36
 
1.3%
36
 
1.3%
35
 
1.3%
Other values (375) 2210
82.7%
Lowercase Letter
ValueCountFrequency (%)
t 5
14.3%
l 4
11.4%
s 4
11.4%
r 3
8.6%
e 3
8.6%
d 3
8.6%
n 2
 
5.7%
a 2
 
5.7%
h 2
 
5.7%
w 1
 
2.9%
Other values (6) 6
17.1%
Uppercase Letter
ValueCountFrequency (%)
T 5
12.8%
R 5
12.8%
B 4
10.3%
K 4
10.3%
L 3
7.7%
A 3
7.7%
F 2
 
5.1%
Q 2
 
5.1%
S 2
 
5.1%
N 2
 
5.1%
Other values (6) 7
17.9%
Decimal Number
ValueCountFrequency (%)
2 21
24.7%
0 16
18.8%
1 15
17.6%
3 11
12.9%
5 9
10.6%
4 6
 
7.1%
6 2
 
2.4%
9 2
 
2.4%
8 2
 
2.4%
7 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
# 10
90.9%
. 1
 
9.1%
Space Separator
ValueCountFrequency (%)
112
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2669
90.2%
Common 214
 
7.2%
Latin 74
 
2.5%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
58
 
2.2%
52
 
1.9%
51
 
1.9%
51
 
1.9%
51
 
1.9%
49
 
1.8%
42
 
1.6%
36
 
1.3%
36
 
1.3%
35
 
1.3%
Other values (373) 2208
82.7%
Latin
ValueCountFrequency (%)
t 5
 
6.8%
T 5
 
6.8%
R 5
 
6.8%
l 4
 
5.4%
B 4
 
5.4%
K 4
 
5.4%
s 4
 
5.4%
L 3
 
4.1%
r 3
 
4.1%
A 3
 
4.1%
Other values (22) 34
45.9%
Common
ValueCountFrequency (%)
112
52.3%
2 21
 
9.8%
0 16
 
7.5%
1 15
 
7.0%
3 11
 
5.1%
# 10
 
4.7%
5 9
 
4.2%
4 6
 
2.8%
) 3
 
1.4%
6 2
 
0.9%
Other values (7) 9
 
4.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2655
89.7%
ASCII 287
 
9.7%
Compat Jamo 14
 
0.5%
CJK 2
 
0.1%
Specials 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
112
39.0%
2 21
 
7.3%
0 16
 
5.6%
1 15
 
5.2%
3 11
 
3.8%
# 10
 
3.5%
5 9
 
3.1%
4 6
 
2.1%
t 5
 
1.7%
T 5
 
1.7%
Other values (38) 77
26.8%
Hangul
ValueCountFrequency (%)
58
 
2.2%
52
 
2.0%
51
 
1.9%
51
 
1.9%
51
 
1.9%
49
 
1.8%
42
 
1.6%
36
 
1.4%
36
 
1.4%
35
 
1.3%
Other values (367) 2194
82.6%
Compat Jamo
ValueCountFrequency (%)
3
21.4%
3
21.4%
3
21.4%
2
14.3%
2
14.3%
1
 
7.1%
Specials
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

검색횟수
Real number (ℝ)

Distinct19
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1296296
Minimum1
Maximum47
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.8 KiB
2023-12-12T14:40:08.851427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile6
Maximum47
Range46
Interquartile range (IQR)1

Descriptive statistics

Standard deviation3.0748295
Coefficient of variation (CV)1.443833
Kurtosis84.160243
Mean2.1296296
Median Absolute Deviation (MAD)0
Skewness7.5772786
Sum1380
Variance9.4545767
MonotonicityNot monotonic
2023-12-12T14:40:08.999126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1 410
63.3%
2 105
 
16.2%
3 48
 
7.4%
5 26
 
4.0%
4 25
 
3.9%
6 10
 
1.5%
7 7
 
1.1%
8 3
 
0.5%
11 3
 
0.5%
20 2
 
0.3%
Other values (9) 9
 
1.4%
ValueCountFrequency (%)
1 410
63.3%
2 105
 
16.2%
3 48
 
7.4%
4 25
 
3.9%
5 26
 
4.0%
6 10
 
1.5%
7 7
 
1.1%
8 3
 
0.5%
9 1
 
0.2%
10 1
 
0.2%
ValueCountFrequency (%)
47 1
 
0.2%
25 1
 
0.2%
23 1
 
0.2%
21 1
 
0.2%
20 2
0.3%
19 1
 
0.2%
14 1
 
0.2%
12 1
 
0.2%
11 3
0.5%
10 1
 
0.2%
Distinct320
Distinct (%)49.4%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
Minimum2019-10-08 00:00:00
Maximum2023-08-24 00:00:00
2023-12-12T14:40:09.152132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:09.307945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T14:40:07.002132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:06.592421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:07.211293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:40:06.746823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:40:09.399466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그 고유번호검색횟수
해시태그 고유번호1.0000.063
검색횟수0.0631.000
2023-12-12T14:40:09.517780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그 고유번호검색횟수
해시태그 고유번호1.0000.014
검색횟수0.0141.000

Missing values

2023-12-12T14:40:07.365785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:40:07.493132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

해시태그 고유번호검색해시태그검색횟수등록일자
01전포카페거리22019-10-08
12전리단길22019-10-08
23맛집12019-10-08
34공연62019-10-08
45여행32019-10-08
56관광지22019-10-08
67복지32019-10-08
78ㅜㄹ아ㅣㅗㅜㅎㄹ아ㅣㅗㅜㅎㄹ아ㅣ12019-10-08
89nsdjflhnd12019-10-08
910해시태그12019-10-10
해시태그 고유번호검색해시태그검색횟수등록일자
638647카페12023-07-31
639648창업12023-07-31
640649주차단속체험단12023-08-02
641650평생102023-08-03
642651공모32023-08-12
643652공모모집12023-08-12
644653test12023-08-18
645654부산진구12023-08-18
646655진구12023-08-24
647656노인12023-08-24