Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 1 |
Missing cells (%) | < 0.1% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 419.9 KiB |
Average record size in memory | 43.0 B |
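The derived figures in this overview follow from the raw counts by simple arithmetic. The sketch below is only an illustration: `fmt_percent` is a hypothetical helper that mimics how the report appears to print very small shares as "< 0.1%", and the input numbers are copied from the table above.

```python
def fmt_percent(fraction):
    """Format a fraction as a percentage; tiny non-zero shares become '< 0.1%'.

    Hypothetical helper that imitates the report's display style.
    """
    if 0 < fraction < 0.001:
        return "< 0.1%"
    return f"{fraction * 100:.1f}%"


# Values copied from the "Dataset statistics" table
n_variables = 4
n_observations = 10_000
missing_cells = 1
duplicate_rows = 1
total_size_kib = 419.9

# Missing cells are measured against all cells (rows x columns)
missing_pct = fmt_percent(missing_cells / (n_variables * n_observations))

# Duplicate rows are measured against the number of rows
duplicate_pct = fmt_percent(duplicate_rows / n_observations)

# Average record size in bytes: total KiB * 1024 / number of rows
avg_record_bytes = total_size_kib * 1024 / n_observations

print(missing_pct, duplicate_pct, round(avg_record_bytes, 1))
```

Rounded to one decimal place, the average record size agrees with the 43.0 B shown in the table.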
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 주식회사 여기어때컴퍼니 |
URL | https://www.bigdata-finance.kr/dataset/datasetView.do?datastId=SET0400002 |
기준년월 has constant value "202108" | Constant |
대상기준년월 has constant value "202108" | Constant |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
서비스이용횟수 is highly skewed (γ1 = 27.77856616) | Skewed |
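Alerts like the ones above can be derived from per-column summaries with a few simple rules. The following is a minimal sketch, not the profiling tool's actual code: `alerts_for_column` is a hypothetical function, and the skewness threshold of 20 is an assumption chosen for illustration.

```python
def alerts_for_column(name, values, skewness=None, skew_threshold=20.0):
    """Return alert messages for one column.

    `skew_threshold` is an assumed cut-off, not a value taken from the report.
    """
    alerts = []
    present = [v for v in values if v is not None]

    # A column with exactly one distinct non-missing value is constant
    if len(set(present)) == 1:
        alerts.append(f'{name} has constant value "{present[0]}"')

    # A column whose skewness magnitude exceeds the threshold is flagged
    if skewness is not None and abs(skewness) > skew_threshold:
        alerts.append(f"{name} is highly skewed (γ1 = {skewness})")

    return alerts


print(alerts_for_column("기준년월", ["202108"] * 5))
print(alerts_for_column("서비스이용횟수", [1, 2, 3], skewness=27.77856616))
```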
Reproduction
Analysis started | 2023-12-10 13:13:00.255076 |
---|---|
Analysis finished | 2023-12-10 13:13:02.245773 |
Duration | 1.99 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
기준년월 (reference year-month)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202108 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202108 |
---|---|
2nd row | 202108 |
3rd row | 202108 |
4th row | 202108 |
5th row | 202108 |
Common Values
Value | Count | Frequency (%) |
202108 | 10000 | 100.0% |
검색키워드명 (search keyword name)
Text
Distinct | 9994 |
---|---|
Distinct (%) | 99.9% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
맛집 | 289 | 2.1% |
카페 | 136 | 1.0% |
곳 | 99 | 0.7% |
베스트 | 96 | 0.7% |
제주 | 63 | 0.5% |
파스타 | 47 | 0.3% |
부산 | 43 | 0.3% |
스시 | 41 | 0.3% |
홍대 | 38 | 0.3% |
강남 | 37 | 0.3% |
Other values (9411) | 12755 | 93.5% |
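A word table of this kind can be approximated by splitting each keyword on whitespace and counting the pieces. This is a sketch under that assumption; the profiling tool may normalize or tokenize differently, and `word_frequencies` is a hypothetical helper. The sample keywords are taken from the sample rows of this report.

```python
from collections import Counter


def word_frequencies(keywords, top=10):
    """Count whitespace-separated words and return (word, count, percent) rows."""
    words = Counter()
    for keyword in keywords:
        if keyword is None:  # skip missing values
            continue
        words.update(keyword.split())
    total = sum(words.values())
    return [
        (word, count, round(100 * count / total, 1))
        for word, count in words.most_common(top)
    ]


sample = ["잠실 치킨", "광화문 짬뽕", "사당역 횟집", "을지로 술집", None, "잠실 맛집"]
for row in word_frequencies(sample, top=3):
    print(row)
```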
Most occurring characters
Value | Count | Frequency (%) |
(space) | 3856 | 7.5% |
이 | 984 | 1.9% |
스 | 974 | 1.9% |
집 | 748 | 1.5% |
동 | 710 | 1.4% |
리 | 676 | 1.3% |
대 | 576 | 1.1% |
수 | 544 | 1.1% |
카 | 467 | 0.9% |
시 | 456 | 0.9% |
Other values (1199) | 41381 | 80.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 43119 | 83.9% |
Space Separator | 3856 | 7.5% |
Lowercase Letter | 3742 | 7.3% |
Uppercase Letter | 545 | 1.1% |
Other Punctuation | 79 | 0.2% |
Dash Punctuation | 14 | < 0.1% |
Open Punctuation | 8 | < 0.1% |
Close Punctuation | 8 | < 0.1% |
Final Punctuation | 1 | < 0.1% |
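The category names in this table correspond to Unicode general categories (for example `Lo` for "Other Letter" and `Zs` for "Space Separator"). A minimal sketch of such a count using Python's standard `unicodedata` module follows; `category_counts` and the `CATEGORY_NAMES` mapping are illustrative names, and the mapping covers only the categories listed above.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes mapped to the labels used in the table
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pf": "Final Punctuation",
}


def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


print(category_counts(["ofr seoul", "잠실 치킨", "(A-1)"]))
```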
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 984 | 2.3% |
스 | 974 | 2.3% |
집 | 748 | 1.7% |
동 | 710 | 1.6% |
리 | 676 | 1.6% |
대 | 576 | 1.3% |
수 | 544 | 1.3% |
카 | 467 | 1.1% |
시 | 456 | 1.1% |
맛 | 445 | 1.0% |
Other values (1133) | 36539 | 84.7% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 438 | 11.7% |
e | 373 | 10.0% |
o | 339 | 9.1% |
n | 305 | 8.2% |
i | 243 | 6.5% |
s | 227 | 6.1% |
t | 209 | 5.6% |
r | 194 | 5.2% |
l | 178 | 4.8% |
u | 176 | 4.7% |
Other values (16) | 1060 | 28.3% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 59 | 10.8% |
B | 47 | 8.6% |
C | 37 | 6.8% |
G | 36 | 6.6% |
M | 35 | 6.4% |
P | 33 | 6.1% |
T | 30 | 5.5% |
O | 28 | 5.1% |
A | 26 | 4.8% |
N | 23 | 4.2% |
Other values (15) | 191 | 35.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 24 | 30.4% |
/ | 18 | 22.8% |
? | 15 | 19.0% |
& | 9 | 11.4% |
' | 6 | 7.6% |
, | 3 | 3.8% |
\ | 2 | 2.5% |
· | 2 | 2.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 | 62.5% |
[ | 3 | 37.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 | 62.5% |
] | 3 | 37.5% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3856 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14 | 100.0% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 43097 | 83.9% |
Latin | 4287 | 8.3% |
Common | 3966 | 7.7% |
Han | 14 | < 0.1% |
Hiragana | 8 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 984 | 2.3% |
스 | 974 | 2.3% |
집 | 748 | 1.7% |
동 | 710 | 1.6% |
리 | 676 | 1.6% |
대 | 576 | 1.3% |
수 | 544 | 1.3% |
카 | 467 | 1.1% |
시 | 456 | 1.1% |
맛 | 445 | 1.0% |
Other values (1111) | 36517 | 84.7% |
Latin
Value | Count | Frequency (%) |
a | 438 | 10.2% |
e | 373 | 8.7% |
o | 339 | 7.9% |
n | 305 | 7.1% |
i | 243 | 5.7% |
s | 227 | 5.3% |
t | 209 | 4.9% |
r | 194 | 4.5% |
l | 178 | 4.2% |
u | 176 | 4.1% |
Other values (41) | 1605 | 37.4% |
Common
Value | Count | Frequency (%) |
(space) | 3856 | 97.2% |
. | 24 | 0.6% |
/ | 18 | 0.5% |
? | 15 | 0.4% |
- | 14 | 0.4% |
& | 9 | 0.2% |
' | 6 | 0.2% |
( | 5 | 0.1% |
) | 5 | 0.1% |
, | 3 | 0.1% |
Other values (5) | 11 | 0.3% |
Han
Value | Count | Frequency (%) |
小 | 1 | 7.1% |
料 | 1 | 7.1% |
屋 | 1 | 7.1% |
石 | 1 | 7.1% |
姐 | 1 | 7.1% |
油 | 1 | 7.1% |
理 | 1 | 7.1% |
益 | 1 | 7.1% |
善 | 1 | 7.1% |
洞 | 1 | 7.1% |
Other values (4) | 4 | 28.6% |
Hiragana
Value | Count | Frequency (%) |
そ | 1 | 12.5% |
ば | 1 | 12.5% |
い | 1 | 12.5% |
ん | 1 | 12.5% |
ま | 1 | 12.5% |
ざ | 1 | 12.5% |
し | 1 | 12.5% |
す | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 42849 | 83.4% |
ASCII | 8250 | 16.1% |
Compat Jamo | 248 | 0.5% |
CJK | 14 | < 0.1% |
Hiragana | 8 | < 0.1% |
None | 2 | < 0.1% |
Punctuation | 1 | < 0.1% |
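Block membership depends only on a character's code point. The sketch below assigns characters to the blocks named in this table using the standard Unicode ranges; `block_of` and the `BLOCKS` table are illustrative, the short labels are copied from the report, and any code point outside the listed ranges returns `None` here, which is an assumption about how the "None" row arises.

```python
# (first code point, last code point, label used in the table)
BLOCKS = [
    (0x0000, 0x007F, "ASCII"),        # Basic Latin
    (0x2000, 0x206F, "Punctuation"),  # General Punctuation
    (0x3040, 0x309F, "Hiragana"),
    (0x3130, 0x318F, "Compat Jamo"),  # Hangul Compatibility Jamo
    (0x4E00, 0x9FFF, "CJK"),          # CJK Unified Ideographs
    (0xAC00, 0xD7AF, "Hangul"),       # Hangul Syllables
]


def block_of(ch):
    """Return the block label for a single character, or None if unlisted."""
    code_point = ord(ch)
    for first, last, label in BLOCKS:
        if first <= code_point <= last:
            return label
    return None


for ch in ["a", "맛", "ㅇ", "そ", "小", "\u2019", "\u00b7"]:
    print(repr(ch), block_of(ch))
```

Under these ranges the middle dot `·` (U+00B7) falls outside every listed block, which is consistent with it appearing in the "None" row.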
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 3856 | 46.7% |
a | 438 | 5.3% |
e | 373 | 4.5% |
o | 339 | 4.1% |
n | 305 | 3.7% |
i | 243 | 2.9% |
s | 227 | 2.8% |
t | 209 | 2.5% |
r | 194 | 2.4% |
l | 178 | 2.2% |
Other values (54) | 1888 | 22.9% |
Hangul
Value | Count | Frequency (%) |
이 | 984 | 2.3% |
스 | 974 | 2.3% |
집 | 748 | 1.7% |
동 | 710 | 1.7% |
리 | 676 | 1.6% |
대 | 576 | 1.3% |
수 | 544 | 1.3% |
카 | 467 | 1.1% |
시 | 456 | 1.1% |
맛 | 445 | 1.0% |
Other values (1081) | 36269 | 84.6% |
Compat Jamo
Value | Count | Frequency (%) |
ㅇ | 26 | 10.5% |
ㄱ | 22 | 8.9% |
ㄹ | 19 | 7.7% |
ㄴ | 19 | 7.7% |
ㅂ | 16 | 6.5% |
ㅎ | 15 | 6.0% |
ㆍ | 15 | 6.0% |
ㄷ | 12 | 4.8% |
ㅣ | 11 | 4.4% |
ㅗ | 11 | 4.4% |
Other values (20) | 82 | 33.1% |
None
Value | Count | Frequency (%) |
· | 2 | 100.0% |
CJK
Value | Count | Frequency (%) |
小 | 1 | 7.1% |
料 | 1 | 7.1% |
屋 | 1 | 7.1% |
石 | 1 | 7.1% |
姐 | 1 | 7.1% |
油 | 1 | 7.1% |
理 | 1 | 7.1% |
益 | 1 | 7.1% |
善 | 1 | 7.1% |
洞 | 1 | 7.1% |
Other values (4) | 4 | 28.6% |
Hiragana
Value | Count | Frequency (%) |
そ | 1 | 12.5% |
ば | 1 | 12.5% |
い | 1 | 12.5% |
ん | 1 | 12.5% |
ま | 1 | 12.5% |
ざ | 1 | 12.5% |
し | 1 | 12.5% |
す | 1 | 12.5% |
Punctuation
Value | Count | Frequency (%) |
’ | 1 | 100.0% |
서비스이용횟수 (service usage count)
Real number (ℝ)
SKEWED
 
Distinct | 184 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.3946 |
Minimum | 1 |
---|---|
Maximum | 3544 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 23 |
Maximum | 3544 |
Range | 3543 |
Interquartile range (IQR) | 3 |
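The range and IQR rows are differences of other rows in this table: 3544 − 1 = 3543 and 4 − 1 = 3. As a sketch of how such a summary might be computed, the example below uses Python's `statistics.quantiles` with linear interpolation between order statistics; the choice of interpolation method is an assumption, and `quantile_summary` is a hypothetical helper run on toy data rather than on the dataset.

```python
import statistics


def quantile_summary(values):
    """Return min, percentiles, quartiles, max, range and IQR for a list of numbers."""
    data = sorted(values)
    # method="inclusive" interpolates linearly, treating the data as the whole population
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    twentieths = statistics.quantiles(data, n=20, method="inclusive")
    return {
        "min": data[0],
        "p5": twentieths[0],    # 5th percentile
        "q1": q1,
        "median": median,
        "q3": q3,
        "p95": twentieths[-1],  # 95th percentile
        "max": data[-1],
        "range": data[-1] - data[0],
        "iqr": q3 - q1,
    }


# Toy data: the integers 1..21
print(quantile_summary(range(1, 22)))
```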
Descriptive statistics
Standard deviation | 75.148305 |
---|---|
Coefficient of variation (CV) | 7.9990958 |
Kurtosis | 995.28409 |
Mean | 9.3946 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 27.778566 |
Sum | 93946 |
Variance | 5647.2678 |
Monotonicity | Not monotonic |
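Several of these rows are tied together: the variance is the square of the standard deviation (75.148305² ≈ 5647.2678) and the coefficient of variation is the standard deviation divided by the mean (75.148305 / 9.3946 ≈ 7.9991). The sketch below shows one common way to compute the full set; it assumes sample-adjusted estimators (n − 1 variance, bias-corrected skewness and excess kurtosis), which may not be exactly what produced the report. `descriptive_stats` is a hypothetical helper.

```python
import math
import statistics


def descriptive_stats(values):
    """Summary statistics for a list of numbers (needs at least 4 values)."""
    n = len(values)
    mean = statistics.fmean(values)
    variance = statistics.variance(values)  # sample variance (n - 1 denominator)
    std = math.sqrt(variance)
    median = statistics.median(values)
    # Median absolute deviation from the median
    mad = statistics.median([abs(v - median) for v in values])

    # Central moments with an n denominator
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n

    # Bias-corrected sample skewness and excess kurtosis (assumed estimators)
    g1 = m3 / m2 ** 1.5
    skewness = g1 * math.sqrt(n * (n - 1)) / (n - 2)
    g2 = m4 / m2 ** 2 - 3
    kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

    return {
        "mean": mean,
        "std": std,
        "variance": variance,
        "cv": std / mean,
        "mad": mad,
        "skewness": skewness,
        "kurtosis": kurtosis,
        "sum": sum(values),
    }


# Toy data with a long right tail, loosely imitating the shape of this variable
print(descriptive_stats([1, 1, 1, 2, 2, 3, 4, 50]))
```

A long right tail, as in the toy data, gives a positive skewness; a symmetric sample gives a value near zero.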
Value | Count | Frequency (%) |
1 | 4483 | 44.8% |
2 | 2011 | 20.1% |
3 | 911 | 9.1% |
4 | 501 | 5.0% |
5 | 330 | 3.3% |
6 | 234 | 2.3% |
7 | 171 | 1.7% |
8 | 146 | 1.5% |
9 | 107 | 1.1% |
10 | 89 | 0.9% |
Other values (174) | 1017 | 10.2% |
Value | Count | Frequency (%) |
1 | 4483 | 44.8% |
2 | 2011 | 20.1% |
3 | 911 | 9.1% |
4 | 501 | 5.0% |
5 | 330 | 3.3% |
6 | 234 | 2.3% |
7 | 171 | 1.7% |
8 | 146 | 1.5% |
9 | 107 | 1.1% |
10 | 89 | 0.9% |
Value | Count | Frequency (%) |
3544 | 1 | < 0.1% |
2913 | 1 | < 0.1% |
2611 | 1 | < 0.1% |
2099 | 1 | < 0.1% |
1753 | 1 | < 0.1% |
1319 | 1 | < 0.1% |
1282 | 1 | < 0.1% |
1140 | 1 | < 0.1% |
1095 | 1 | < 0.1% |
1061 | 1 | < 0.1% |
대상기준년월 (target reference year-month)
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202108 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202108 |
---|---|
2nd row | 202108 |
3rd row | 202108 |
4th row | 202108 |
5th row | 202108 |
Common Values
Value | Count | Frequency (%) |
202108 | 10000 | 100.0% |
(index) | 기준년월 | 검색키워드명 | 서비스이용횟수 | 대상기준년월 |
---|---|---|---|---|
47459 | 202108 | 공단해장국 | 2 | 202108 |
7542 | 202108 | 한일식당 | 14 | 202108 |
25300 | 202108 | 잠실 치킨 | 4 | 202108 |
54682 | 202108 | 산드레 | 2 | 202108 |
56714 | 202108 | 리스팬케이크 | 1 | 202108 |
18646 | 202108 | 광화문 짬뽕 | 5 | 202108 |
26663 | 202108 | 사당역 횟집 | 3 | 202108 |
34336 | 202108 | 빨간어묵 | 3 | 202108 |
41860 | 202108 | 애월해녀의집 | 2 | 202108 |
38389 | 202108 | 씰국수 | 2 | 202108 |
(index) | 기준년월 | 검색키워드명 | 서비스이용횟수 | 대상기준년월 |
---|---|---|---|---|
98603 | 202108 | 엄마의 일품 김치찜 | 1 | 202108 |
53690 | 202108 | 신용 | 2 | 202108 |
90770 | 202108 | ofr seoul | 1 | 202108 |
42179 | 202108 | 호호식당 익선 | 2 | 202108 |
70788 | 202108 | 사상역 막창 | 1 | 202108 |
6194 | 202108 | 을지로 술집 | 18 | 202108 |
91142 | 202108 | 한강진 | 1 | 202108 |
23074 | 202108 | 장흥읍 | 4 | 202108 |
49992 | 202108 | 네기 다이닝 | 2 | 202108 |
87188 | 202108 | 대구콘서트하우스 | 1 | 202108 |
Most frequently occurring
(index) | 기준년월 | 검색키워드명 | 서비스이용횟수 | 대상기준년월 | # duplicates |
---|---|---|---|---|---|
0 | 202108 | 닭 | 1 | 202108 | 2 |
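A table of the most frequently occurring duplicate rows can be built by counting identical rows and keeping those seen more than once. This is a minimal sketch on toy rows; `most_frequent_duplicates` is a hypothetical helper, and the profiling tool may treat missing values or column subsets differently.

```python
from collections import Counter


def most_frequent_duplicates(rows, top=10):
    """Return (row, occurrences) pairs for rows that appear more than once."""
    counts = Counter(rows)
    # most_common() orders by count, highest first
    duplicated = [(row, n) for row, n in counts.most_common() if n > 1]
    return duplicated[:top]


# Toy rows: (기준년월, 검색키워드명, 서비스이용횟수, 대상기준년월)
rows = [
    ("202108", "닭", 1, "202108"),
    ("202108", "한일식당", 14, "202108"),
    ("202108", "닭", 1, "202108"),
    ("202108", "잠실 치킨", 4, "202108"),
]
print(most_frequent_duplicates(rows))
```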