Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.4 KiB |
Average record size in memory | 35.3 B |
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Dataset
Description | Sample |
---|---|
Author | 레드테이블 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=6411d895-95d9-4dab-84cf-b76724ce9424 |
Reproduction
Analysis started | 2023-12-10 09:44:03.678465 |
---|---|
Analysis finished | 2023-12-10 09:44:05.277637 |
Duration | 1.6 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
sake_id
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16060.44 |
Minimum | 15609 |
---|---|
Maximum | 29021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 15609 |
---|---|
5-th percentile | 15614.95 |
Q1 | 15635.75 |
median | 15661.5 |
Q3 | 15686.25 |
95-th percentile | 15706.05 |
Maximum | 29021 |
Range | 13412 |
Interquartile range (IQR) | 50.5 |
Descriptive statistics
Standard deviation | 2290.7638 |
---|---|
Coefficient of variation (CV) | 0.14263394 |
Kurtosis | 29.88781 |
Mean | 16060.44 |
Median Absolute Deviation (MAD) | 25.5 |
Skewness | 5.5932971 |
Sum | 1606044 |
Variance | 5247599 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15609 | 1 | 1.0% |
15673 | 1 | 1.0% |
15683 | 1 | 1.0% |
15682 | 1 | 1.0% |
15681 | 1 | 1.0% |
15680 | 1 | 1.0% |
15679 | 1 | 1.0% |
15678 | 1 | 1.0% |
15677 | 1 | 1.0% |
15676 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
15609 | 1 | |
15611 | 1 | |
15612 | 1 | |
15613 | 1 | |
15614 | 1 | |
15615 | 1 | |
15617 | 1 | |
15618 | 1 | |
15619 | 1 | |
15620 | 1 |
Value | Count | Frequency (%) |
29021 | 1 | |
29020 | 1 | |
29019 | 1 | |
15708 | 1 | |
15707 | 1 | |
15706 | 1 | |
15705 | 1 | |
15704 | 1 | |
15703 | 1 | |
15702 | 1 |
sake_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 24 |
---|---|
Median length | 19 |
Mean length | 11.77 |
Min length | 6 |
Characters and Unicode
Total characters | 1177 |
---|---|
Distinct characters | 304 |
Distinct categories | 9 ? |
Distinct scripts | 5 ? |
Distinct blocks | 5 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 十四代 (じゅうよんだい) |
---|---|
2nd row | 神聖 純米大吟醸 山田錦氷温囲い 1.8L |
3rd row | 而今 (じこん) |
4th row | No.6 (ナンバーシックス) |
5th row | 花邑 (はなむら) |
Value | Count | Frequency (%) |
純米大吟醸 | 3 | 1.4% |
神聖 | 3 | 1.4% |
720ml | 2 | 0.9% |
山田錦 | 2 | 0.9% |
十四代 | 1 | 0.5% |
(みやかんばい) | 1 | 0.5% |
(うごのつき) | 1 | 0.5% |
羽根屋 | 1 | 0.5% |
(はねや) | 1 | 0.5% |
春霞 | 1 | 0.5% |
Other values (195) | 195 |
Most occurring characters
Value | Count | Frequency (%) |
111 | 9.4% | |
) | 102 | 8.7% |
( | 102 | 8.7% |
ん | 35 | 3.0% |
う | 31 | 2.6% |
い | 27 | 2.3% |
の | 26 | 2.2% |
し | 23 | 2.0% |
か | 22 | 1.9% |
ま | 20 | 1.7% |
Other values (294) | 678 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 838 | |
Space Separator | 111 | 9.4% |
Close Punctuation | 102 | 8.7% |
Open Punctuation | 102 | 8.7% |
Decimal Number | 9 | 0.8% |
Lowercase Letter | 5 | 0.4% |
Uppercase Letter | 5 | 0.4% |
Modifier Letter | 3 | 0.3% |
Other Punctuation | 2 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
ん | 35 | 4.2% |
う | 31 | 3.7% |
い | 27 | 3.2% |
の | 26 | 3.1% |
し | 23 | 2.7% |
か | 22 | 2.6% |
ま | 20 | 2.4% |
き | 19 | 2.3% |
く | 17 | 2.0% |
じ | 14 | 1.7% |
Other values (275) | 604 |
Decimal Number
Value | Count | Frequency (%) |
2 | 2 | |
0 | 2 | |
7 | 2 | |
1 | 1 | |
8 | 1 | |
6 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 2 | |
D | 1 | |
A | 1 | |
L | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 2 | |
l | 2 | |
o | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 | |
. | 1 |
Space Separator
Value | Count | Frequency (%) |
111 |
Close Punctuation
Value | Count | Frequency (%) |
) | 102 |
Open Punctuation
Value | Count | Frequency (%) |
( | 102 |
Modifier Letter
Value | Count | Frequency (%) |
ー | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hiragana | 527 | |
Common | 329 | |
Han | 287 | |
Katakana | 24 | 2.0% |
Latin | 10 | 0.8% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
田 | 8 | 2.8% |
山 | 7 | 2.4% |
美 | 6 | 2.1% |
錦 | 4 | 1.4% |
醸 | 4 | 1.4% |
人 | 4 | 1.4% |
大 | 4 | 1.4% |
吟 | 3 | 1.0% |
楽 | 3 | 1.0% |
屋 | 3 | 1.0% |
Other values (192) | 241 |
Hiragana
Value | Count | Frequency (%) |
ん | 35 | 6.6% |
う | 31 | 5.9% |
い | 27 | 5.1% |
の | 26 | 4.9% |
し | 23 | 4.4% |
か | 22 | 4.2% |
ま | 20 | 3.8% |
き | 19 | 3.6% |
く | 17 | 3.2% |
じ | 14 | 2.7% |
Other values (58) | 293 |
Katakana
Value | Count | Frequency (%) |
ス | 3 | |
エ | 2 | 8.3% |
フ | 2 | 8.3% |
ペ | 2 | 8.3% |
ル | 2 | 8.3% |
ィ | 2 | 8.3% |
ガ | 2 | 8.3% |
ソ | 2 | 8.3% |
ロ | 1 | 4.2% |
ン | 1 | 4.2% |
Other values (5) | 5 |
Common
Value | Count | Frequency (%) |
111 | ||
) | 102 | |
( | 102 | |
ー | 3 | 0.9% |
2 | 2 | 0.6% |
0 | 2 | 0.6% |
7 | 2 | 0.6% |
1 | 1 | 0.3% |
8 | 1 | 0.3% |
. | 1 | 0.3% |
Other values (2) | 2 | 0.6% |
Latin
Value | Count | Frequency (%) |
m | 2 | |
l | 2 | |
N | 2 | |
D | 1 | |
A | 1 | |
L | 1 | |
o | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hiragana | 527 | |
CJK | 287 | |
None | 218 | |
ASCII | 118 | 10.0% |
Katakana | 27 | 2.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
111 | ||
N | 2 | 1.7% |
D | 1 | 0.8% |
A | 1 | 0.8% |
o | 1 | 0.8% |
. | 1 | 0.8% |
6 | 1 | 0.8% |
None
Value | Count | Frequency (%) |
) | 102 | |
( | 102 | |
2 | 2 | 0.9% |
0 | 2 | 0.9% |
7 | 2 | 0.9% |
m | 2 | 0.9% |
l | 2 | 0.9% |
1 | 1 | 0.5% |
8 | 1 | 0.5% |
. | 1 | 0.5% |
Hiragana
Value | Count | Frequency (%) |
ん | 35 | 6.6% |
う | 31 | 5.9% |
い | 27 | 5.1% |
の | 26 | 4.9% |
し | 23 | 4.4% |
か | 22 | 4.2% |
ま | 20 | 3.8% |
き | 19 | 3.6% |
く | 17 | 3.2% |
じ | 14 | 2.7% |
Other values (58) | 293 |
CJK
Value | Count | Frequency (%) |
田 | 8 | 2.8% |
山 | 7 | 2.4% |
美 | 6 | 2.1% |
錦 | 4 | 1.4% |
醸 | 4 | 1.4% |
人 | 4 | 1.4% |
大 | 4 | 1.4% |
吟 | 3 | 1.0% |
楽 | 3 | 1.0% |
屋 | 3 | 1.0% |
Other values (192) | 241 |
Katakana
Value | Count | Frequency (%) |
ス | 3 | |
ー | 3 | |
エ | 2 | 7.4% |
フ | 2 | 7.4% |
ペ | 2 | 7.4% |
ル | 2 | 7.4% |
ィ | 2 | 7.4% |
ガ | 2 | 7.4% |
ソ | 2 | 7.4% |
ロ | 1 | 3.7% |
Other values (6) | 6 |
sake_region_nm
Text
Distinct | 92 |
---|---|
Distinct (%) | 92.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
97 | ||
秋田 | 12 | 4.1% |
福島 | 7 | 2.4% |
愛知 | 6 | 2.0% |
山形 | 6 | 2.0% |
山口 | 5 | 1.7% |
宮城 | 5 | 1.7% |
新政酒造 | 4 | 1.4% |
三重 | 4 | 1.4% |
長野 | 4 | 1.4% |
Other values (117) | 145 |
Most occurring characters
Value | Count | Frequency (%) |
195 | ||
| | 97 | 10.2% |
造 | 86 | 9.0% |
酒 | 79 | 8.3% |
山 | 22 | 2.3% |
田 | 17 | 1.8% |
本 | 14 | 1.5% |
店 | 14 | 1.5% |
秋 | 13 | 1.4% |
島 | 13 | 1.4% |
Other values (188) | 402 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 659 | |
Space Separator | 195 | 20.5% |
Math Symbol | 97 | 10.2% |
Modifier Letter | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
造 | 86 | 13.1% |
酒 | 79 | 12.0% |
山 | 22 | 3.3% |
田 | 17 | 2.6% |
本 | 14 | 2.1% |
店 | 14 | 2.1% |
秋 | 13 | 2.0% |
島 | 13 | 2.0% |
新 | 12 | 1.8% |
醸 | 11 | 1.7% |
Other values (185) | 378 |
Space Separator
Value | Count | Frequency (%) |
195 |
Math Symbol
Value | Count | Frequency (%) |
| | 97 |
Modifier Letter
Value | Count | Frequency (%) |
ー | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Han | 647 | |
Common | 293 | |
Hiragana | 8 | 0.8% |
Katakana | 4 | 0.4% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
造 | 86 | 13.3% |
酒 | 79 | 12.2% |
山 | 22 | 3.4% |
田 | 17 | 2.6% |
本 | 14 | 2.2% |
店 | 14 | 2.2% |
秋 | 13 | 2.0% |
島 | 13 | 2.0% |
新 | 12 | 1.9% |
醸 | 11 | 1.7% |
Other values (177) | 366 |
Hiragana
Value | Count | Frequency (%) |
の | 4 | |
ん | 2 | |
き | 1 | 12.5% |
せ | 1 | 12.5% |
Katakana
Value | Count | Frequency (%) |
リ | 1 | |
ナ | 1 | |
イ | 1 | |
ワ | 1 |
Common
Value | Count | Frequency (%) |
195 | ||
| | 97 | |
ー | 1 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
CJK | 647 | |
ASCII | 292 | |
Hiragana | 8 | 0.8% |
Katakana | 5 | 0.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
195 | ||
| | 97 |
CJK
Value | Count | Frequency (%) |
造 | 86 | 13.3% |
酒 | 79 | 12.2% |
山 | 22 | 3.4% |
田 | 17 | 2.6% |
本 | 14 | 2.2% |
店 | 14 | 2.2% |
秋 | 13 | 2.0% |
島 | 13 | 2.0% |
新 | 12 | 1.9% |
醸 | 11 | 1.7% |
Other values (177) | 366 |
Hiragana
Value | Count | Frequency (%) |
の | 4 | |
ん | 2 | |
き | 1 | 12.5% |
せ | 1 | 12.5% |
Katakana
Value | Count | Frequency (%) |
ー | 1 | |
リ | 1 | |
ナ | 1 | |
イ | 1 | |
ワ | 1 |
sake_pc
Real number (ℝ)
ZEROS
 
Distinct | 73 |
---|---|
Distinct (%) | 73.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24343.11 |
Minimum | 0 |
---|---|
Maximum | 328000 |
Zeros | 2 |
Zeros (%) | 2.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3424.2 |
Q1 | 5971.75 |
median | 10800 |
Q3 | 21950 |
95-th percentile | 108000 |
Maximum | 328000 |
Range | 328000 |
Interquartile range (IQR) | 15978.25 |
Descriptive statistics
Standard deviation | 42797.3 |
---|---|
Coefficient of variation (CV) | 1.7580868 |
Kurtosis | 27.044851 |
Mean | 24343.11 |
Median Absolute Deviation (MAD) | 5515.5 |
Skewness | 4.6351477 |
Sum | 2434311 |
Variance | 1.8316089 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10800 | 9 | 9.0% |
32400 | 5 | 5.0% |
12960 | 3 | 3.0% |
8640 | 3 | 3.0% |
50900 | 2 | 2.0% |
13845 | 2 | 2.0% |
11000 | 2 | 2.0% |
21600 | 2 | 2.0% |
20000 | 2 | 2.0% |
0 | 2 | 2.0% |
Other values (63) | 68 |
Value | Count | Frequency (%) |
0 | 2 | |
2354 | 1 | |
2970 | 1 | |
3200 | 1 | |
3436 | 1 | |
3500 | 1 | |
3672 | 1 | |
3780 | 1 | |
3800 | 1 | |
3888 | 2 |
Value | Count | Frequency (%) |
328000 | 1 | |
171951 | 1 | |
148000 | 1 | |
120960 | 1 | |
108000 | 2 | |
76380 | 1 | |
75600 | 1 | |
64800 | 1 | |
54000 | 1 | |
50900 | 2 |
sake_id | sake_nm | sake_region_nm | sake_pc | |
---|---|---|---|---|
sake_id | 1.000 | 1.000 | 1.000 | 0.626 |
sake_nm | 1.000 | 1.000 | 1.000 | 1.000 |
sake_region_nm | 1.000 | 1.000 | 1.000 | 0.403 |
sake_pc | 0.626 | 1.000 | 0.403 | 1.000 |
sake_id | sake_pc | |
---|---|---|
sake_id | 1.000 | -0.177 |
sake_pc | -0.177 | 1.000 |
sake_id | sake_nm | sake_region_nm | sake_pc | |
---|---|---|---|---|
0 | 15609 | 十四代 (じゅうよんだい) | 山形 | 高木酒造 | 328000 |
1 | 29019 | 神聖 純米大吟醸 山田錦氷温囲い 1.8L | 株式会社山本本家 | 50900 |
2 | 15611 | 而今 (じこん) | 三重 | 木屋正酒造 | 108000 |
3 | 15612 | No.6 (ナンバーシックス) | 秋田 | 新政酒造 | 17280 |
4 | 15613 | 花邑 (はなむら) | 秋田 | 両関酒造 | 32400 |
5 | 15614 | 川中島 幻舞 (かわなかじま げんぶ) | 長野 | 酒千蔵野 | 3888 |
6 | 15615 | 信州亀齢 (きれい) | 長野 | 岡崎酒造 | 3436 |
7 | 29020 | 神聖 純米大吟醸 山田錦 氷温囲い 720ml | 株式会社山本本家 | 30540 |
8 | 15617 | 陽乃鳥 (ひのとり) | 秋田 | 新政酒造 | 16200 |
9 | 15618 | 鳳凰美田 (ほうおうびでん) | 栃木 | 小林酒造 | 27000 |
sake_id | sake_nm | sake_region_nm | sake_pc | |
---|---|---|---|---|
90 | 15699 | 勝山 (かつやま) | 宮城 | 仙台伊澤家 勝山酒造 | 32400 |
91 | 15700 | 流輝 (るか) | 群馬 | 松屋酒造 | 2970 |
92 | 15701 | 手取川 (てどりがわ) | 石川 | 吉田酒造店 | 12790 |
93 | 15702 | 遊穂 (ゆうほ) | 石川 | 御祖酒造 | 3500 |
94 | 15703 | 田光 (たびか) | 三重 | 早川酒造 | 8802 |
95 | 15704 | 仙介 (せんすけ) | 兵庫 | 泉酒造 | 11016 |
96 | 15705 | 五橋 (ごきょう) | 山口 | 酒井酒造 | 11880 |
97 | 15706 | 豊盃 (ほうはい) | 青森 | 三浦酒造店 | 35999 |
98 | 15707 | 花の香 (はなのか) | 熊本 | 花の香酒造 | 5400 |
99 | 15708 | にいだしぜんしゅ (にいだしぜんしゅ) | 福島 | 仁井田本家 | 3780 |