Dataset statistics
Number of variables | 16 |
---|---|
Number of observations | 85 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.3 KiB |
Average record size in memory | 135.6 B |
Variable types
Categorical | 7 |
---|---|
Numeric | 5 |
Text | 4 |
Dataset
Description | Sample |
---|---|
Author | 한국문화정보원 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=f816ab66-88f8-40d3-8a9f-1f93e5105b03 |
FILE_NAME has constant value "" | Constant |
base_ymd has constant value "" | Constant |
sale_ratio is highly overall correlated with sales_amt | High correlation |
sales_amt is highly overall correlated with sale_ratio and 1 other fields | High correlation |
sample_cnt is highly overall correlated with to_watch and 1 other fields | High correlation |
to_watch is highly overall correlated with sample_cnt | High correlation |
sa_2 is highly overall correlated with sa_3 | High correlation |
sa_3 is highly overall correlated with sales_amt and 2 other fields | High correlation |
sa_1 is highly imbalanced (76.9%) | Imbalance |
sa_2 is highly imbalanced (70.2%) | Imbalance |
sale_ratio has 8 (9.4%) zeros | Zeros |
sales_amt has 8 (9.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 10:05:53.584148 |
---|---|
Analysis finished | 2023-12-10 10:06:00.441993 |
Duration | 6.86 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
region
Categorical
Distinct | 17 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
인천 | 5 |
---|---|
울산 | 5 |
경기 | 5 |
제주 | 5 |
서울 | 5 |
Other values (12) |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 인천 |
---|---|
2nd row | 울산 |
3rd row | 경기 |
4th row | 제주 |
5th row | 인천 |
Common Values
Value | Count | Frequency (%) |
인천 | 5 | 5.9% |
울산 | 5 | 5.9% |
경기 | 5 | 5.9% |
제주 | 5 | 5.9% |
서울 | 5 | 5.9% |
부산 | 5 | 5.9% |
대구 | 5 | 5.9% |
광주 | 5 | 5.9% |
강원 | 5 | 5.9% |
전북 | 5 | 5.9% |
Other values (7) | 35 |
Length
Value | Count | Frequency (%) |
인천 | 5 | 5.9% |
전북 | 5 | 5.9% |
전남 | 5 | 5.9% |
충남 | 5 | 5.9% |
충북 | 5 | 5.9% |
세종 | 5 | 5.9% |
대전 | 5 | 5.9% |
경북 | 5 | 5.9% |
강원 | 5 | 5.9% |
울산 | 5 | 5.9% |
Other values (7) | 35 |
gubun
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 5.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
서양음악 | |
---|---|
전통예술(국악) | |
연극 | |
뮤지컬 | |
무용 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 3.8 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서양음악 |
---|---|
2nd row | 서양음악 |
3rd row | 서양음악 |
4th row | 서양음악 |
5th row | 전통예술(국악) |
Common Values
Value | Count | Frequency (%) |
서양음악 | 17 | |
전통예술(국악) | 17 | |
연극 | 17 | |
뮤지컬 | 17 | |
무용 | 17 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서양음악 | 17 | |
전통예술(국악 | 17 | |
연극 | 17 | |
뮤지컬 | 17 | |
무용 | 17 |
sale_ratio
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 71 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.390588 |
Minimum | 0 |
---|---|
Maximum | 87.3 |
Zeros | 8 |
Zeros (%) | 9.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 897.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3.2 |
median | 9 |
Q3 | 20.65 |
95-th percentile | 77.16 |
Maximum | 87.3 |
Range | 87.3 |
Interquartile range (IQR) | 17.45 |
Descriptive statistics
Standard deviation | 22.592176 |
---|---|
Coefficient of variation (CV) | 1.2991036 |
Kurtosis | 2.5812399 |
Mean | 17.390588 |
Median Absolute Deviation (MAD) | 7.5 |
Skewness | 1.8419689 |
Sum | 1478.2 |
Variance | 510.4064 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 8 | 9.4% |
3.6 | 3 | 3.5% |
5.9 | 2 | 2.4% |
4.0 | 2 | 2.4% |
0.3 | 2 | 2.4% |
2.3 | 2 | 2.4% |
9.0 | 2 | 2.4% |
62.0 | 1 | 1.2% |
52.7 | 1 | 1.2% |
35.7 | 1 | 1.2% |
Other values (61) | 61 |
Value | Count | Frequency (%) |
0.0 | 8 | |
0.3 | 2 | 2.4% |
0.4 | 1 | 1.2% |
0.5 | 1 | 1.2% |
0.8 | 1 | 1.2% |
0.9 | 1 | 1.2% |
1.0 | 1 | 1.2% |
1.6 | 1 | 1.2% |
2.0 | 1 | 1.2% |
2.3 | 2 | 2.4% |
Value | Count | Frequency (%) |
87.3 | 1 | |
86.6 | 1 | |
83.3 | 1 | |
80.0 | 1 | |
79.6 | 1 | |
67.4 | 1 | |
62.0 | 1 | |
58.8 | 1 | |
52.7 | 1 | |
51.8 | 1 |
sales_amt
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 77 |
---|---|
Distinct (%) | 90.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 96670872 |
Minimum | 0 |
---|---|
Maximum | 3.2548327 × 109 |
Zeros | 8 |
Zeros (%) | 9.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 897.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 731883 |
median | 5099175 |
Q3 | 13585880 |
95-th percentile | 2.3499064 × 108 |
Maximum | 3.2548327 × 109 |
Range | 3.2548327 × 109 |
Interquartile range (IQR) | 12853997 |
Descriptive statistics
Standard deviation | 4.2889182 × 108 |
---|---|
Coefficient of variation (CV) | 4.4366189 |
Kurtosis | 38.765368 |
Mean | 96670872 |
Median Absolute Deviation (MAD) | 4774601 |
Skewness | 5.9866239 |
Sum | 8.2170242 × 109 |
Variance | 1.8394819 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 8 | 9.4% |
452373 | 2 | 2.4% |
6547531 | 1 | 1.2% |
1013672 | 1 | 1.2% |
4441840 | 1 | 1.2% |
6478025 | 1 | 1.2% |
3093571 | 1 | 1.2% |
2630356 | 1 | 1.2% |
1382541 | 1 | 1.2% |
68838507 | 1 | 1.2% |
Other values (67) | 67 |
Value | Count | Frequency (%) |
0 | 8 | |
69220 | 1 | 1.2% |
92152 | 1 | 1.2% |
184586 | 1 | 1.2% |
222092 | 1 | 1.2% |
296698 | 1 | 1.2% |
324574 | 1 | 1.2% |
334037 | 1 | 1.2% |
352546 | 1 | 1.2% |
360320 | 1 | 1.2% |
Value | Count | Frequency (%) |
3254832724 | 1 | |
1775363304 | 1 | |
1449880032 | 1 | |
532608991 | 1 | |
266304496 | 1 | |
109735192 | 1 | |
95605013 | 1 | |
68838507 | 1 | |
59098578 | 1 | |
49359652 | 1 |
sample_cnt
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 53 |
---|---|
Distinct (%) | 62.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.082353 |
Minimum | 2 |
---|---|
Maximum | 278 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 897.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 3 |
Q1 | 14 |
median | 28 |
Q3 | 54 |
95-th percentile | 156 |
Maximum | 278 |
Range | 276 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 56.795836 |
---|---|
Coefficient of variation (CV) | 1.2063084 |
Kurtosis | 6.8127509 |
Mean | 47.082353 |
Median Absolute Deviation (MAD) | 18 |
Skewness | 2.5524673 |
Sum | 4002 |
Variance | 3225.7669 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 4 | 4.7% |
22 | 4 | 4.7% |
11 | 4 | 4.7% |
8 | 3 | 3.5% |
14 | 3 | 3.5% |
27 | 3 | 3.5% |
39 | 3 | 3.5% |
29 | 3 | 3.5% |
32 | 3 | 3.5% |
3 | 2 | 2.4% |
Other values (43) | 53 |
Value | Count | Frequency (%) |
2 | 4 | |
3 | 2 | |
5 | 1 | 1.2% |
6 | 2 | |
7 | 2 | |
8 | 3 | |
9 | 1 | 1.2% |
11 | 4 | |
13 | 2 | |
14 | 3 |
Value | Count | Frequency (%) |
278 | 1 | |
258 | 1 | |
243 | 1 | |
239 | 1 | |
159 | 1 | |
144 | 1 | |
135 | 1 | |
109 | 2 | |
107 | 1 | |
92 | 1 |
sa_1
Categorical
IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 10.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
- | |
---|---|
2.2 | 1 |
0.5 | 1 |
9.1 | 1 |
2.3 | 1 |
Other values (4) | 4 |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.1882353 |
Min length | 1 |
Unique
Unique | 8 ? |
---|---|
Unique (%) | 9.4% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 77 | |
2.2 | 1 | 1.2% |
0.5 | 1 | 1.2% |
9.1 | 1 | 1.2% |
2.3 | 1 | 1.2% |
2.4 | 1 | 1.2% |
1.6 | 1 | 1.2% |
3.9 | 1 | 1.2% |
4.7 | 1 | 1.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
77 | ||
2.2 | 1 | 1.2% |
0.5 | 1 | 1.2% |
9.1 | 1 | 1.2% |
2.3 | 1 | 1.2% |
2.4 | 1 | 1.2% |
1.6 | 1 | 1.2% |
3.9 | 1 | 1.2% |
4.7 | 1 | 1.2% |
sa_2
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 12 |
---|---|
Distinct (%) | 14.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
- | |
---|---|
1.7 | 2 |
1.6 | 1 |
5.1 | 1 |
2.9 | 1 |
Other values (7) | 7 |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.2588235 |
Min length | 1 |
Unique
Unique | 10 ? |
---|---|
Unique (%) | 11.8% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | 1.6 |
4th row | 5.1 |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 73 | |
1.7 | 2 | 2.4% |
1.6 | 1 | 1.2% |
5.1 | 1 | 1.2% |
2.9 | 1 | 1.2% |
8.1 | 1 | 1.2% |
1.2 | 1 | 1.2% |
2.4 | 1 | 1.2% |
0.4 | 1 | 1.2% |
0.5 | 1 | 1.2% |
Other values (2) | 2 | 2.4% |
Length
Value | Count | Frequency (%) |
73 | ||
1.7 | 2 | 2.4% |
1.6 | 1 | 1.2% |
5.1 | 1 | 1.2% |
2.9 | 1 | 1.2% |
8.1 | 1 | 1.2% |
1.2 | 1 | 1.2% |
2.4 | 1 | 1.2% |
0.4 | 1 | 1.2% |
0.5 | 1 | 1.2% |
Other values (2) | 2 | 2.4% |
sa_3
Categorical
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 32.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
- | |
---|---|
0.7 | 3 |
1 | 2 |
0.5 | 2 |
2.8 | 2 |
Other values (23) |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.7058824 |
Min length | 1 |
Unique
Unique | 22 ? |
---|---|
Unique (%) | 25.9% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | 1.3 |
4th row | 6.5 |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 52 | |
0.7 | 3 | 3.5% |
1 | 2 | 2.4% |
0.5 | 2 | 2.4% |
2.8 | 2 | 2.4% |
0.3 | 2 | 2.4% |
8.4 | 1 | 1.2% |
6.5 | 1 | 1.2% |
2.5 | 1 | 1.2% |
2.9 | 1 | 1.2% |
Other values (18) | 18 | 21.2% |
Length
Value | Count | Frequency (%) |
52 | ||
0.7 | 3 | 3.5% |
1 | 2 | 2.4% |
0.5 | 2 | 2.4% |
2.8 | 2 | 2.4% |
0.3 | 2 | 2.4% |
1.3 | 1 | 1.2% |
0.2 | 1 | 1.2% |
4 | 1 | 1.2% |
4.7 | 1 | 1.2% |
Other values (18) | 18 | 21.2% |
sa_4
Text
Distinct | 56 |
---|---|
Distinct (%) | 65.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
Value | Count | Frequency (%) |
18 | 21.2% | |
9.7 | 5 | 5.9% |
7.2 | 2 | 2.4% |
6 | 2 | 2.4% |
11.3 | 2 | 2.4% |
16.2 | 2 | 2.4% |
5.2 | 2 | 2.4% |
5.5 | 2 | 2.4% |
11.9 | 2 | 2.4% |
13.5 | 2 | 2.4% |
Other values (46) | 46 |
Most occurring characters
Value | Count | Frequency (%) |
. | 62 | |
1 | 35 | |
2 | 26 | |
- | 18 | 7.5% |
6 | 15 | 6.2% |
5 | 15 | 6.2% |
9 | 14 | 5.8% |
3 | 14 | 5.8% |
4 | 14 | 5.8% |
7 | 11 | 4.6% |
Other values (2) | 17 | 7.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 161 | |
Other Punctuation | 62 | 25.7% |
Dash Punctuation | 18 | 7.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 35 | |
2 | 26 | |
6 | 15 | |
5 | 15 | |
9 | 14 | 8.7% |
3 | 14 | 8.7% |
4 | 14 | 8.7% |
7 | 11 | 6.8% |
8 | 11 | 6.8% |
0 | 6 | 3.7% |
Other Punctuation
Value | Count | Frequency (%) |
. | 62 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 18 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 241 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 62 | |
1 | 35 | |
2 | 26 | |
- | 18 | 7.5% |
6 | 15 | 6.2% |
5 | 15 | 6.2% |
9 | 14 | 5.8% |
3 | 14 | 5.8% |
4 | 14 | 5.8% |
7 | 11 | 4.6% |
Other values (2) | 17 | 7.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 241 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 62 | |
1 | 35 | |
2 | 26 | |
- | 18 | 7.5% |
6 | 15 | 6.2% |
5 | 15 | 6.2% |
9 | 14 | 5.8% |
3 | 14 | 5.8% |
4 | 14 | 5.8% |
7 | 11 | 4.6% |
Other values (2) | 17 | 7.1% |
sa_5
Text
Distinct | 78 |
---|---|
Distinct (%) | 91.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
Value | Count | Frequency (%) |
30.1 | 2 | 2.4% |
14.1 | 2 | 2.4% |
26.4 | 2 | 2.4% |
38.3 | 2 | 2.4% |
19.8 | 2 | 2.4% |
2 | 2.4% | |
41.1 | 2 | 2.4% |
29.3 | 1 | 1.2% |
41.5 | 1 | 1.2% |
52.6 | 1 | 1.2% |
Other values (68) | 68 |
Most occurring characters
Value | Count | Frequency (%) |
. | 76 | |
1 | 38 | |
3 | 36 | |
4 | 33 | |
2 | 31 | |
7 | 28 | 8.9% |
8 | 21 | 6.7% |
6 | 17 | 5.4% |
9 | 15 | 4.8% |
5 | 15 | 4.8% |
Other values (2) | 4 | 1.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 236 | |
Other Punctuation | 76 | 24.2% |
Dash Punctuation | 2 | 0.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 38 | |
3 | 36 | |
4 | 33 | |
2 | 31 | |
7 | 28 | |
8 | 21 | |
6 | 17 | |
9 | 15 | 6.4% |
5 | 15 | 6.4% |
0 | 2 | 0.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 76 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 314 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 76 | |
1 | 38 | |
3 | 36 | |
4 | 33 | |
2 | 31 | |
7 | 28 | 8.9% |
8 | 21 | 6.7% |
6 | 17 | 5.4% |
9 | 15 | 4.8% |
5 | 15 | 4.8% |
Other values (2) | 4 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 314 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 76 | |
1 | 38 | |
3 | 36 | |
4 | 33 | |
2 | 31 | |
7 | 28 | 8.9% |
8 | 21 | 6.7% |
6 | 17 | 5.4% |
9 | 15 | 4.8% |
5 | 15 | 4.8% |
Other values (2) | 4 | 1.3% |
sa_6
Text
Distinct | 77 |
---|---|
Distinct (%) | 90.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
Value | Count | Frequency (%) |
46.1 | 3 | 3.5% |
33.9 | 2 | 2.4% |
31.9 | 2 | 2.4% |
27.1 | 2 | 2.4% |
32.4 | 2 | 2.4% |
27.8 | 2 | 2.4% |
33 | 2 | 2.4% |
45.6 | 1 | 1.2% |
18.4 | 1 | 1.2% |
82.3 | 1 | 1.2% |
Other values (67) | 67 |
Most occurring characters
Value | Count | Frequency (%) |
. | 72 | |
3 | 49 | |
4 | 37 | |
2 | 30 | |
6 | 27 | 8.7% |
1 | 27 | 8.7% |
9 | 20 | 6.4% |
8 | 17 | 5.5% |
7 | 15 | 4.8% |
5 | 10 | 3.2% |
Other values (2) | 7 | 2.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 238 | |
Other Punctuation | 72 | 23.2% |
Dash Punctuation | 1 | 0.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 49 | |
4 | 37 | |
2 | 30 | |
6 | 27 | |
1 | 27 | |
9 | 20 | |
8 | 17 | 7.1% |
7 | 15 | 6.3% |
5 | 10 | 4.2% |
0 | 6 | 2.5% |
Other Punctuation
Value | Count | Frequency (%) |
. | 72 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 311 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 72 | |
3 | 49 | |
4 | 37 | |
2 | 30 | |
6 | 27 | 8.7% |
1 | 27 | 8.7% |
9 | 20 | 6.4% |
8 | 17 | 5.5% |
7 | 15 | 4.8% |
5 | 10 | 3.2% |
Other values (2) | 7 | 2.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 311 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 72 | |
3 | 49 | |
4 | 37 | |
2 | 30 | |
6 | 27 | 8.7% |
1 | 27 | 8.7% |
9 | 20 | 6.4% |
8 | 17 | 5.5% |
7 | 15 | 4.8% |
5 | 10 | 3.2% |
Other values (2) | 7 | 2.3% |
sa_7
Text
Distinct | 71 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
Value | Count | Frequency (%) |
9 | 10.6% | |
7 | 2 | 2.4% |
3.9 | 2 | 2.4% |
36.9 | 2 | 2.4% |
19 | 2 | 2.4% |
8.7 | 2 | 2.4% |
24.9 | 2 | 2.4% |
29 | 1 | 1.2% |
42.4 | 1 | 1.2% |
15.3 | 1 | 1.2% |
Other values (61) | 61 |
Most occurring characters
Value | Count | Frequency (%) |
. | 62 | |
1 | 36 | |
2 | 29 | |
3 | 28 | |
4 | 20 | 7.5% |
7 | 20 | 7.5% |
9 | 18 | 6.7% |
8 | 15 | 5.6% |
5 | 14 | 5.2% |
6 | 11 | 4.1% |
Other values (2) | 14 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 196 | |
Other Punctuation | 62 | 23.2% |
Dash Punctuation | 9 | 3.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 36 | |
2 | 29 | |
3 | 28 | |
4 | 20 | |
7 | 20 | |
9 | 18 | |
8 | 15 | |
5 | 14 | 7.1% |
6 | 11 | 5.6% |
0 | 5 | 2.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 62 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 267 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 62 | |
1 | 36 | |
2 | 29 | |
3 | 28 | |
4 | 20 | 7.5% |
7 | 20 | 7.5% |
9 | 18 | 6.7% |
8 | 15 | 5.6% |
5 | 14 | 5.2% |
6 | 11 | 4.1% |
Other values (2) | 14 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 267 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 62 | |
1 | 36 | |
2 | 29 | |
3 | 28 | |
4 | 20 | 7.5% |
7 | 20 | 7.5% |
9 | 18 | 6.7% |
8 | 15 | 5.6% |
5 | 14 | 5.2% |
6 | 11 | 4.1% |
Other values (2) | 14 | 5.2% |
avr
Real number (ℝ)
Distinct | 62 |
---|---|
Distinct (%) | 72.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.6170588 |
Minimum | 4.86 |
---|---|
Maximum | 6.56 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 897.0 B |
Quantile statistics
Minimum | 4.86 |
---|---|
5-th percentile | 5.092 |
Q1 | 5.32 |
median | 5.59 |
Q3 | 5.84 |
95-th percentile | 6.278 |
Maximum | 6.56 |
Range | 1.7 |
Interquartile range (IQR) | 0.52 |
Descriptive statistics
Standard deviation | 0.3590723 |
---|---|
Coefficient of variation (CV) | 0.063925322 |
Kurtosis | -0.28845408 |
Mean | 5.6170588 |
Median Absolute Deviation (MAD) | 0.27 |
Skewness | 0.39249085 |
Sum | 477.45 |
Variance | 0.12893291 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5.61 | 4 | 4.7% |
5.57 | 3 | 3.5% |
5.72 | 3 | 3.5% |
5.31 | 3 | 3.5% |
5.7 | 2 | 2.4% |
5.3 | 2 | 2.4% |
5.25 | 2 | 2.4% |
6.0 | 2 | 2.4% |
5.59 | 2 | 2.4% |
5.67 | 2 | 2.4% |
Other values (52) | 60 |
Value | Count | Frequency (%) |
4.86 | 1 | |
4.97 | 1 | |
5.07 | 1 | |
5.08 | 2 | |
5.14 | 1 | |
5.16 | 1 | |
5.17 | 1 | |
5.19 | 1 | |
5.2 | 1 | |
5.21 | 1 |
Value | Count | Frequency (%) |
6.56 | 1 | |
6.39 | 1 | |
6.31 | 2 | |
6.29 | 1 | |
6.23 | 1 | |
6.22 | 1 | |
6.18 | 1 | |
6.11 | 1 | |
6.1 | 1 | |
6.09 | 1 |
to_watch
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 75 |
---|---|
Distinct (%) | 88.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.856471 |
Minimum | 1.1 |
---|---|
Maximum | 29.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 897.0 B |
Quantile statistics
Minimum | 1.1 |
---|---|
5-th percentile | 1.84 |
Q1 | 5.6 |
median | 9.4 |
Q3 | 17.2 |
95-th percentile | 26.56 |
Maximum | 29.9 |
Range | 28.8 |
Interquartile range (IQR) | 11.6 |
Descriptive statistics
Standard deviation | 7.911105 |
---|---|
Coefficient of variation (CV) | 0.66723946 |
Kurtosis | -0.74018093 |
Mean | 11.856471 |
Median Absolute Deviation (MAD) | 5.8 |
Skewness | 0.56217858 |
Sum | 1007.8 |
Variance | 62.585583 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4.3 | 3 | 3.5% |
5.6 | 3 | 3.5% |
7.6 | 3 | 3.5% |
4.7 | 2 | 2.4% |
22.8 | 2 | 2.4% |
16.1 | 2 | 2.4% |
6.6 | 2 | 2.4% |
9.4 | 1 | 1.2% |
16.8 | 1 | 1.2% |
2.1 | 1 | 1.2% |
Other values (65) | 65 |
Value | Count | Frequency (%) |
1.1 | 1 | |
1.2 | 1 | |
1.5 | 1 | |
1.7 | 1 | |
1.8 | 1 | |
2.0 | 1 | |
2.1 | 1 | |
2.2 | 1 | |
2.3 | 1 | |
2.6 | 1 |
Value | Count | Frequency (%) |
29.9 | 1 | |
29.3 | 1 | |
28.0 | 1 | |
27.4 | 1 | |
26.7 | 1 | |
26.0 | 1 | |
25.2 | 1 | |
24.0 | 1 | |
23.8 | 1 | |
22.9 | 1 |
FILE_NAME
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 |
---|
Length
Max length | 34 |
---|---|
Median length | 34 |
Mean length | 34 |
Min length | 34 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 |
---|---|
2nd row | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 |
3rd row | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 |
4th row | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 |
5th row | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 |
Common Values
Value | Count | Frequency (%) |
KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 85 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kc_602_play_type_cust_exp_map_2019 | 85 |
base_ymd
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 812.0 B |
20200221 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200221 |
---|---|
2nd row | 20200221 |
3rd row | 20200221 |
4th row | 20200221 |
5th row | 20200221 |
Common Values
Value | Count | Frequency (%) |
20200221 | 85 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20200221 | 85 |
region | gubun | sale_ratio | sales_amt | sample_cnt | sa_1 | sa_2 | sa_3 | sa_4 | sa_5 | sa_6 | sa_7 | avr | to_watch | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
region | 1.000 | 0.000 | 0.012 | 0.206 | 0.447 | 0.211 | 0.087 | 0.653 | 0.707 | 0.723 | 0.678 | 0.677 | 0.472 | 0.188 |
gubun | 0.000 | 1.000 | 0.656 | 0.033 | 0.381 | 0.068 | 0.000 | 0.052 | 0.528 | 0.499 | 0.637 | 0.850 | 0.293 | 0.686 |
sale_ratio | 0.012 | 0.656 | 1.000 | 0.272 | 0.524 | 0.000 | 0.688 | 0.157 | 0.821 | 0.752 | 0.847 | 0.978 | 0.326 | 0.274 |
sales_amt | 0.206 | 0.033 | 0.272 | 1.000 | 0.669 | 0.616 | 0.587 | 0.976 | 0.957 | 0.000 | 1.000 | 1.000 | 0.000 | 0.672 |
sample_cnt | 0.447 | 0.381 | 0.524 | 0.669 | 1.000 | 0.248 | 0.743 | 0.952 | 0.912 | 0.000 | 0.817 | 0.978 | 0.000 | 0.688 |
sa_1 | 0.211 | 0.068 | 0.000 | 0.616 | 0.248 | 1.000 | 0.000 | 0.000 | 0.886 | 0.893 | 0.000 | 0.972 | 0.000 | 0.142 |
sa_2 | 0.087 | 0.000 | 0.688 | 0.587 | 0.743 | 0.000 | 1.000 | 0.956 | 0.883 | 0.000 | 0.778 | 0.000 | 0.311 | 0.364 |
sa_3 | 0.653 | 0.052 | 0.157 | 0.976 | 0.952 | 0.000 | 0.956 | 1.000 | 0.985 | 0.000 | 0.877 | 0.990 | 0.390 | 0.755 |
sa_4 | 0.707 | 0.528 | 0.821 | 0.957 | 0.912 | 0.886 | 0.883 | 0.985 | 1.000 | 0.821 | 0.933 | 0.996 | 0.485 | 0.865 |
sa_5 | 0.723 | 0.499 | 0.752 | 0.000 | 0.000 | 0.893 | 0.000 | 0.000 | 0.821 | 1.000 | 0.934 | 0.936 | 0.812 | 0.915 |
sa_6 | 0.678 | 0.637 | 0.847 | 1.000 | 0.817 | 0.000 | 0.778 | 0.877 | 0.933 | 0.934 | 1.000 | 0.921 | 0.212 | 0.823 |
sa_7 | 0.677 | 0.850 | 0.978 | 1.000 | 0.978 | 0.972 | 0.000 | 0.990 | 0.996 | 0.936 | 0.921 | 1.000 | 0.851 | 0.926 |
avr | 0.472 | 0.293 | 0.326 | 0.000 | 0.000 | 0.000 | 0.311 | 0.390 | 0.485 | 0.812 | 0.212 | 0.851 | 1.000 | 0.284 |
to_watch | 0.188 | 0.686 | 0.274 | 0.672 | 0.688 | 0.142 | 0.364 | 0.755 | 0.865 | 0.915 | 0.823 | 0.926 | 0.284 | 1.000 |
sa_2 | region | gubun | sa_3 | sa_1 | |
---|---|---|---|---|---|
sa_2 | 1.000 | 0.000 | 0.000 | 0.645 | 0.000 |
region | 0.000 | 1.000 | 0.000 | 0.212 | 0.064 |
gubun | 0.000 | 0.000 | 1.000 | 0.000 | 0.016 |
sa_3 | 0.645 | 0.212 | 0.000 | 1.000 | 0.000 |
sa_1 | 0.000 | 0.064 | 0.016 | 0.000 | 1.000 |
sale_ratio | sales_amt | sample_cnt | avr | to_watch | region | gubun | sa_1 | sa_2 | sa_3 | |
---|---|---|---|---|---|---|---|---|---|---|
sale_ratio | 1.000 | 0.792 | 0.245 | 0.284 | 0.327 | 0.000 | 0.441 | 0.000 | 0.368 | 0.000 |
sales_amt | 0.792 | 1.000 | 0.323 | 0.190 | 0.230 | 0.086 | 0.000 | 0.404 | 0.356 | 0.761 |
sample_cnt | 0.245 | 0.323 | 1.000 | -0.160 | 0.829 | 0.186 | 0.238 | 0.118 | 0.414 | 0.656 |
avr | 0.284 | 0.190 | -0.160 | 1.000 | 0.029 | 0.253 | 0.114 | 0.000 | 0.119 | 0.097 |
to_watch | 0.327 | 0.230 | 0.829 | 0.029 | 1.000 | 0.050 | 0.340 | 0.052 | 0.154 | 0.333 |
region | 0.000 | 0.086 | 0.186 | 0.253 | 0.050 | 1.000 | 0.000 | 0.064 | 0.000 | 0.212 |
gubun | 0.441 | 0.000 | 0.238 | 0.114 | 0.340 | 0.000 | 1.000 | 0.016 | 0.000 | 0.000 |
sa_1 | 0.000 | 0.404 | 0.118 | 0.000 | 0.052 | 0.064 | 0.016 | 1.000 | 0.000 | 0.000 |
sa_2 | 0.368 | 0.356 | 0.414 | 0.119 | 0.154 | 0.000 | 0.000 | 0.000 | 1.000 | 0.645 |
sa_3 | 0.000 | 0.761 | 0.656 | 0.097 | 0.333 | 0.212 | 0.000 | 0.000 | 0.645 | 1.000 |
region | gubun | sale_ratio | sales_amt | sample_cnt | sa_1 | sa_2 | sa_3 | sa_4 | sa_5 | sa_6 | sa_7 | avr | to_watch | FILE_NAME | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 인천 | 서양음악 | 8.15 | 6547531 | 39 | - | - | - | 7.2 | 39.3 | 24.4 | 29 | 5.75 | 11.9 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
1 | 울산 | 서양음악 | 4.65 | 3065875 | 11 | - | - | - | - | 38.8 | 61.2 | - | 5.61 | 4.3 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
2 | 경기 | 서양음악 | 11.65 | 49359652 | 66 | - | 1.6 | 1.3 | 17.3 | 41.6 | 36.4 | 1.9 | 5.16 | 7.7 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
3 | 제주 | 서양음악 | 5.75 | 731883 | 32 | - | 5.1 | 6.5 | 11.9 | 27.8 | 23.8 | 24.9 | 5.33 | 15.1 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
4 | 인천 | 전통예술(국악) | 1.0 | 401689 | 88 | - | - | - | 20.1 | 42.3 | 30.6 | 7 | 5.24 | 18.8 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
5 | 울산 | 전통예술(국악) | 0.0 | 0 | 17 | - | - | - | 5.5 | 13.2 | 63.1 | 18.3 | 5.94 | 8.4 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
6 | 경기 | 전통예술(국악) | 3.3 | 6990852 | 144 | - | - | 2.5 | 14.5 | 49.3 | 27.8 | 5.8 | 5.2 | 12.7 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
7 | 제주 | 전통예술(국악) | 0.0 | 0 | 36 | 2.2 | - | 2.9 | 33.9 | 17.5 | 19.4 | 24.1 | 5.19 | 17.0 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
8 | 인천 | 연극 | 21.9 | 8796989 | 159 | - | - | 0.5 | 14.4 | 41.4 | 41.1 | 2.6 | 5.31 | 29.9 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
9 | 울산 | 연극 | 9.5 | 3131808 | 22 | - | - | - | 0 | 13.8 | 49 | 37.2 | 6.23 | 16.1 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
region | gubun | sale_ratio | sales_amt | sample_cnt | sa_1 | sa_2 | sa_3 | sa_4 | sa_5 | sa_6 | sa_7 | avr | to_watch | FILE_NAME | base_ymd | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
75 | 충북 | 뮤지컬 | 86.6 | 19981478 | 40 | - | - | - | 5.2 | 37.7 | 39.7 | 17.3 | 5.69 | 14.3 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
76 | 충남 | 뮤지컬 | 45.1 | 11823732 | 47 | - | - | 0.7 | 6.1 | 24.9 | 45.6 | 227 | 5.63 | 17.2 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
77 | 전남 | 뮤지컬 | 42.1 | 9276356 | 28 | - | - | - | - | 8.6 | 43.7 | 47.7 | 6.39 | 15.3 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
78 | 경남 | 뮤지컬 | 87.3 | 48471579 | 23 | 4.7 | - | - | - | 56.5 | 34.3 | 4.5 | 5.25 | 6.9 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
79 | 대전 | 무용 | 18.6 | 27535403 | 8 | - | - | - | 13.5 | 58.9 | 8.9 | 18.7 | 5.33 | 3.4 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
80 | 세종 | 무용 | 3.6 | 334037 | 2 | - | - | - | - | 33.6 | 66.4 | - | 5.66 | 1.2 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
81 | 충북 | 무용 | 0.8 | 184586 | 7 | - | - | - | 26.8 | 51.8 | 7.9 | 13.5 | 5.08 | 1.7 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
82 | 충남 | 무용 | 21.0 | 5505507 | 22 | - | 1.7 | - | 11.9 | 43.8 | 38.8 | 3.9 | 5.3 | 4.7 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
83 | 전남 | 무용 | 1.6 | 352546 | 2 | - | - | - | - | 48.7 | 51.3 | - | 5.51 | 1.1 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |
84 | 경남 | 무용 | 0.4 | 222092 | 3 | - | - | - | - | 18.6 | 81.4 | - | 5.81 | 6.6 | KC_602_PLAY_TYPE_CUST_EXP_MAP_2019 | 20200221 |