Overview

Dataset statistics

Number of variables4
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory35.3 B

Variable types

Numeric2
Text1
Categorical1

Alerts

korean_liquor_id is highly overall correlated with korean_liquor_catHigh correlation
korean_liquor_cat is highly overall correlated with korean_liquor_idHigh correlation
korean_liquor_id has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:15:00.992211
Analysis finished2023-12-10 10:15:02.767563
Duration1.78 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

korean_liquor_id
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21263.11
Minimum20736
Maximum36092
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:15:02.907912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20736
5-th percentile20741.95
Q120762.75
median20788.5
Q320815.25
95-th percentile20927.05
Maximum36092
Range15356
Interquartile range (IQR)52.5

Descriptive statistics

Standard deviation2621.4777
Coefficient of variation (CV)0.12328759
Kurtosis29.864565
Mean21263.11
Median Absolute Deviation (MAD)26.5
Skewness5.5901565
Sum2126311
Variance6872145.3
MonotonicityNot monotonic
2023-12-10T19:15:03.206653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20736 1
 
1.0%
20802 1
 
1.0%
20812 1
 
1.0%
20811 1
 
1.0%
20810 1
 
1.0%
20809 1
 
1.0%
20808 1
 
1.0%
20807 1
 
1.0%
20806 1
 
1.0%
20805 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
20736 1
1.0%
20738 1
1.0%
20739 1
1.0%
20740 1
1.0%
20741 1
1.0%
20742 1
1.0%
20744 1
1.0%
20745 1
1.0%
20746 1
1.0%
20747 1
1.0%
ValueCountFrequency (%)
36092 1
1.0%
36091 1
1.0%
36090 1
1.0%
20929 1
1.0%
20928 1
1.0%
20927 1
1.0%
20926 1
1.0%
20925 1
1.0%
20924 1
1.0%
20923 1
1.0%
Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:15:03.810616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length37
Mean length24.33
Min length4

Characters and Unicode

Total characters2433
Distinct characters282
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)92.0%

Sample

1st row한국 전통주 한산소곡주 호암제조소 750ml
2nd row조은술 세종 괴산 찰옥수수주 6도 750ml 옥수수 막걸리 전통주
3rd row한비 전통주 오가피술 750ml(Acl 35%)
4th row21년산 로얄 안동소주 명절 전통주 선물세트
5th row복순도가 손 막걸리 6.5도 935ml
ValueCountFrequency (%)
전통주 16
 
3.1%
안동소주 12
 
2.3%
750ml 11
 
2.1%
배상면주가 10
 
1.9%
x 10
 
1.9%
선물세트 9
 
1.7%
막걸리 8
 
1.5%
7
 
1.4%
느린마을 7
 
1.4%
생막걸리 6
 
1.2%
Other values (287) 422
81.5%
2023-12-10T19:15:04.629278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
423
 
17.4%
104
 
4.3%
0 92
 
3.8%
5 54
 
2.2%
1 53
 
2.2%
l 50
 
2.1%
3 50
 
2.1%
m 49
 
2.0%
48
 
2.0%
43
 
1.8%
Other values (272) 1467
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1288
52.9%
Space Separator 423
 
17.4%
Decimal Number 404
 
16.6%
Lowercase Letter 124
 
5.1%
Uppercase Letter 73
 
3.0%
Other Punctuation 42
 
1.7%
Open Punctuation 36
 
1.5%
Close Punctuation 36
 
1.5%
Dash Punctuation 3
 
0.1%
Other Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
8.1%
48
 
3.7%
43
 
3.3%
40
 
3.1%
31
 
2.4%
31
 
2.4%
31
 
2.4%
30
 
2.3%
29
 
2.3%
23
 
1.8%
Other values (215) 878
68.2%
Uppercase Letter
ValueCountFrequency (%)
L 13
17.8%
H 6
 
8.2%
X 5
 
6.8%
F 5
 
6.8%
J 5
 
6.8%
N 5
 
6.8%
W 4
 
5.5%
T 4
 
5.5%
E 3
 
4.1%
R 3
 
4.1%
Other values (12) 20
27.4%
Lowercase Letter
ValueCountFrequency (%)
l 50
40.3%
m 49
39.5%
x 15
 
12.1%
e 2
 
1.6%
n 1
 
0.8%
i 1
 
0.8%
w 1
 
0.8%
c 1
 
0.8%
g 1
 
0.8%
1
 
0.8%
Other values (2) 2
 
1.6%
Decimal Number
ValueCountFrequency (%)
0 92
22.8%
5 54
13.4%
1 53
13.1%
3 50
12.4%
6 33
 
8.2%
2 32
 
7.9%
7 32
 
7.9%
4 26
 
6.4%
8 21
 
5.2%
9 11
 
2.7%
Other Punctuation
ValueCountFrequency (%)
% 15
35.7%
. 13
31.0%
/ 10
23.8%
, 3
 
7.1%
& 1
 
2.4%
Open Punctuation
ValueCountFrequency (%)
( 27
75.0%
[ 9
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 27
75.0%
] 9
 
25.0%
Space Separator
ValueCountFrequency (%)
423
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1288
52.9%
Common 949
39.0%
Latin 196
 
8.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
8.1%
48
 
3.7%
43
 
3.3%
40
 
3.1%
31
 
2.4%
31
 
2.4%
31
 
2.4%
30
 
2.3%
29
 
2.3%
23
 
1.8%
Other values (215) 878
68.2%
Latin
ValueCountFrequency (%)
l 50
25.5%
m 49
25.0%
x 15
 
7.7%
L 13
 
6.6%
H 6
 
3.1%
X 5
 
2.6%
F 5
 
2.6%
J 5
 
2.6%
N 5
 
2.6%
W 4
 
2.0%
Other values (23) 39
19.9%
Common
ValueCountFrequency (%)
423
44.6%
0 92
 
9.7%
5 54
 
5.7%
1 53
 
5.6%
3 50
 
5.3%
6 33
 
3.5%
2 32
 
3.4%
7 32
 
3.4%
( 27
 
2.8%
) 27
 
2.8%
Other values (14) 126
 
13.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1288
52.9%
ASCII 1141
46.9%
CJK Compat 3
 
0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
423
37.1%
0 92
 
8.1%
5 54
 
4.7%
1 53
 
4.6%
l 50
 
4.4%
3 50
 
4.4%
m 49
 
4.3%
6 33
 
2.9%
2 32
 
2.8%
7 32
 
2.8%
Other values (45) 273
23.9%
Hangul
ValueCountFrequency (%)
104
 
8.1%
48
 
3.7%
43
 
3.3%
40
 
3.1%
31
 
2.4%
31
 
2.4%
31
 
2.4%
30
 
2.3%
29
 
2.3%
23
 
1.8%
Other values (215) 878
68.2%
CJK Compat
ValueCountFrequency (%)
3
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

korean_liquor_cat
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
탁주
37 
소주
18 
와인
10 
약주
리큐르주
Other values (7)
18 

Length

Max length9
Median length2
Mean length2.7
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row약주
2nd row기타주류(살균주)
3rd row리큐르주
4th row소주
5th row탁주

Common Values

ValueCountFrequency (%)
탁주 37
37.0%
소주 18
18.0%
와인 10
 
10.0%
약주 9
 
9.0%
리큐르주 8
 
8.0%
<NA> 4
 
4.0%
일반증류주 4
 
4.0%
전통주선물세트 3
 
3.0%
기타주류 3
 
3.0%
복분자주 2
 
2.0%
Other values (2) 2
 
2.0%

Length

2023-12-10T19:15:04.884678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
탁주 37
37.0%
소주 18
18.0%
와인 10
 
10.0%
약주 9
 
9.0%
리큐르주 8
 
8.0%
na 4
 
4.0%
일반증류주 4
 
4.0%
전통주선물세트 3
 
3.0%
기타주류 3
 
3.0%
복분자주 2
 
2.0%
Other values (2) 2
 
2.0%

korean_liquor_pc
Real number (ℝ)

Distinct84
Distinct (%)84.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26735.4
Minimum1470
Maximum132000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:15:05.169509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1470
5-th percentile1584.5
Q111752.5
median19340
Q332850
95-th percentile67250
Maximum132000
Range130530
Interquartile range (IQR)21097.5

Descriptive statistics

Standard deviation26881.897
Coefficient of variation (CV)1.0054795
Kurtosis6.1202754
Mean26735.4
Median Absolute Deviation (MAD)11500
Skewness2.3138504
Sum2673540
Variance7.226364 × 108
MonotonicityNot monotonic
2023-12-10T19:15:05.497537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1470 3
 
3.0%
36000 3
 
3.0%
12350 3
 
3.0%
2600 2
 
2.0%
1480 2
 
2.0%
26000 2
 
2.0%
20000 2
 
2.0%
18810 2
 
2.0%
125000 2
 
2.0%
27000 2
 
2.0%
Other values (74) 77
77.0%
ValueCountFrequency (%)
1470 3
3.0%
1480 2
2.0%
1590 1
 
1.0%
1980 1
 
1.0%
2000 1
 
1.0%
2280 1
 
1.0%
2500 1
 
1.0%
2550 1
 
1.0%
2580 1
 
1.0%
2600 2
2.0%
ValueCountFrequency (%)
132000 1
1.0%
125000 2
2.0%
120000 1
1.0%
110000 1
1.0%
65000 1
1.0%
61750 1
1.0%
60000 1
1.0%
57900 1
1.0%
56910 1
1.0%
49500 1
1.0%

Interactions

2023-12-10T19:15:01.982062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:15:01.580737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:15:02.255167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:15:01.805102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:15:05.707252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
korean_liquor_idkorean_liquor_nmkorean_liquor_catkorean_liquor_pc
korean_liquor_id1.0001.0000.8590.069
korean_liquor_nm1.0001.0001.0000.994
korean_liquor_cat0.8591.0001.0000.469
korean_liquor_pc0.0690.9940.4691.000
2023-12-10T19:15:05.885731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
korean_liquor_idkorean_liquor_pckorean_liquor_cat
korean_liquor_id1.000-0.0480.820
korean_liquor_pc-0.0481.0000.258
korean_liquor_cat0.8200.2581.000

Missing values

2023-12-10T19:15:02.495850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:15:02.682565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

korean_liquor_idkorean_liquor_nmkorean_liquor_catkorean_liquor_pc
020736한국 전통주 한산소곡주 호암제조소 750ml약주12000
136090조은술 세종 괴산 찰옥수수주 6도 750ml 옥수수 막걸리 전통주기타주류(살균주)2000
220738한비 전통주 오가피술 750ml(Acl 35%)리큐르주49500
32073921년산 로얄 안동소주 명절 전통주 선물세트소주132000
420740복순도가 손 막걸리 6.5도 935ml탁주36000
520741금이산농원 복숭아와인 375ml 12%와인16000
620742배상면주가 무아스파탐 느린마을 생막걸리 6도 1L탁주12450
736091제주샘주 오메기술 미니어쳐 80ml 13도 제주 전통주살균약주4900
820744배상면주가 고창LB 빙탄복 370ml 6입 저온숙성 탄산 복분자주복분자주17740
920745명인 안동소주 호리병 45도 800ml소주30000
korean_liquor_idkorean_liquor_nmkorean_liquor_catkorean_liquor_pc
9020920문배술 40도 700ml소주33900
9120921공주 알밤왕밤주 전통주 1000mlx5병기타주류11870
9220922한산소곡주 18도 1.8L약주27670
9320923[ 배꽃 필 무렵 ] 20ml x 8개입 / 14도 / 이화주탁주26000
9420924[충북제천] 용두산조은술 참조은증류식소주 20.5% 360ml X 1병 명품증류주일반증류주2500
9520925제주 감귤 신례명주 미니어처 50도 100ml일반증류주7600
9620926문경 오미자 생막걸리 500ml X 10병 6.5도탁주20000
9720927배상면주가 빙탄복 선물세트 스파클링 와인 7도 370ml 6병입와인22800
9820928문배술 호리병 40도 400ml소주25900
9920929세인트하우스 딸기와인 12도 500ml 스위트 1병와인19000