Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 101 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 1 |
Boolean | 1 |
DateTime | 1 |
Dataset
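Description | Data managed through the Jeju Tourism Organization (제주관광공사) online duty-free shop administration system, linking online duty-free customers' order numbers with product and search-term history |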
---|---|
Author | 공공데이터포털 (Public Data Portal) |
URL | https://www.data.go.kr/data/15118684/fileData.do |
삭제여부 has constant value "" | Constant |
아이디 is highly overall correlated with 검색어이력아이디 | High correlation |
검색어이력아이디 is highly overall correlated with 아이디 | High correlation |
회원번호 is highly overall correlated with 장바구니아이디 | High correlation |
장바구니아이디 is highly overall correlated with 회원번호 | High correlation |
주문서아이디 is highly imbalanced (59.9%) | Imbalance |
아이디 has unique values | Unique |
회원번호 has 23 (22.8%) zeros | Zeros |
장바구니아이디 has 42 (41.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-21 12:22:34.212615 |
---|---|
Analysis finished | 2024-04-21 12:22:38.842780 |
Duration | 4.63 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
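The reported duration can be checked directly from the two timestamps in the Reproduction table. The snippet below is a minimal sketch using only the standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamp layout used in the Reproduction table
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-21 12:22:34.212615", FMT)
finished = datetime.strptime("2024-04-21 12:22:38.842780", FMT)

# Elapsed wall-clock time between start and finish, in seconds
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the Duration row.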
아이디
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 101 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 149.0198 |
Minimum | 1 |
---|---|
Maximum | 200 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 105 |
Q1 | 125 |
median | 150 |
Q3 | 175 |
95-th percentile | 195 |
Maximum | 200 |
Range | 199 |
Interquartile range (IQR) | 50 |
Descriptive statistics
Standard deviation | 32.473676 |
---|---|
Coefficient of variation (CV) | 0.21791517 |
Kurtosis | 2.6951551 |
Mean | 149.0198 |
Median Absolute Deviation (MAD) | 25 |
Skewness | -0.85583623 |
Sum | 15051 |
Variance | 1054.5396 |
Monotonicity | Strictly increasing |
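The statistics above are consistent with a column that contains the value 1 followed by every integer from 101 to 200. That reading is an inference from the distinct count, the extreme-value tables, and the sum, not something the report states. Under that assumption, the following sketch recomputes the main figures with the standard library.

```python
import statistics

# Assumption: 아이디 holds 1 plus the consecutive integers 101..200 (101 rows)
ids = [1] + list(range(101, 201))

n = len(ids)
total = sum(ids)
mean = statistics.mean(ids)
variance = statistics.variance(ids)  # sample variance (n - 1 denominator)
std = statistics.stdev(ids)          # sample standard deviation
median = statistics.median(ids)
# Median absolute deviation: median of |x - median|
mad = statistics.median(abs(x - median) for x in ids)
cv = std / mean                      # coefficient of variation

print(n, total, median, mad)
print(round(mean, 4), round(variance, 4), round(std, 6), round(cv, 6))
```

The single low value 1 sits far below the rest of the range, which accounts for the negative skewness while the quartiles stay evenly spaced.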
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
164 | 1 | 1.0% |
174 | 1 | 1.0% |
173 | 1 | 1.0% |
172 | 1 | 1.0% |
171 | 1 | 1.0% |
170 | 1 | 1.0% |
169 | 1 | 1.0% |
168 | 1 | 1.0% |
167 | 1 | 1.0% |
Other values (91) | 91 | 90.1% |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
101 | 1 | 1.0% |
102 | 1 | 1.0% |
103 | 1 | 1.0% |
104 | 1 | 1.0% |
105 | 1 | 1.0% |
106 | 1 | 1.0% |
107 | 1 | 1.0% |
108 | 1 | 1.0% |
109 | 1 | 1.0% |
Value | Count | Frequency (%) |
200 | 1 | 1.0% |
199 | 1 | 1.0% |
198 | 1 | 1.0% |
197 | 1 | 1.0% |
196 | 1 | 1.0% |
195 | 1 | 1.0% |
194 | 1 | 1.0% |
193 | 1 | 1.0% |
192 | 1 | 1.0% |
191 | 1 | 1.0% |
검색어이력아이디
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 97 |
---|---|
Distinct (%) | 96.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3564574.3 |
Minimum | 3563292 |
---|---|
Maximum | 3566405 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 3563292 |
---|---|
5-th percentile | 3563360 |
Q1 | 3564076 |
median | 3564658 |
Q3 | 3565199 |
95-th percentile | 3565477 |
Maximum | 3566405 |
Range | 3113 |
Interquartile range (IQR) | 1123 |
Descriptive statistics
Standard deviation | 682.93637 |
---|---|
Coefficient of variation (CV) | 0.00019158988 |
Kurtosis | -0.77204129 |
Mean | 3564574.3 |
Median Absolute Deviation (MAD) | 561 |
Skewness | -0.064255132 |
Sum | 3.6002201 × 10⁸ |
Variance | 466402.09 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3563292 | 2 | 2.0% |
3564517 | 2 | 2.0% |
3564248 | 2 | 2.0% |
3564319 | 2 | 2.0% |
3566405 | 1 | 1.0% |
3564929 | 1 | 1.0% |
3565183 | 1 | 1.0% |
3565075 | 1 | 1.0% |
3565058 | 1 | 1.0% |
3564971 | 1 | 1.0% |
Other values (87) | 87 | 86.1% |
Value | Count | Frequency (%) |
3563292 | 2 | 2.0% |
3563331 | 1 | 1.0% |
3563335 | 1 | 1.0% |
3563358 | 1 | 1.0% |
3563360 | 1 | 1.0% |
3563499 | 1 | 1.0% |
3563513 | 1 | 1.0% |
3563533 | 1 | 1.0% |
3563679 | 1 | 1.0% |
3563705 | 1 | 1.0% |
Value | Count | Frequency (%) |
3566405 | 1 | 1.0% |
3565628 | 1 | 1.0% |
3565555 | 1 | 1.0% |
3565553 | 1 | 1.0% |
3565518 | 1 | 1.0% |
3565477 | 1 | 1.0% |
3565467 | 1 | 1.0% |
3565458 | 1 | 1.0% |
3565436 | 1 | 1.0% |
3565415 | 1 | 1.0% |
상품아이디
Real number (ℝ)
Distinct | 18 |
---|---|
Distinct (%) | 17.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.4381188 × 10¹² |
Minimum | 1.11 × 10¹² |
---|---|
Maximum | 5.56 × 10¹² |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1.11 × 10¹² |
---|---|
5-th percentile | 1.11 × 10¹² |
Q1 | 1.12 × 10¹² |
median | 3.42 × 10¹² |
Q3 | 5.41 × 10¹² |
95-th percentile | 5.56 × 10¹² |
Maximum | 5.56 × 10¹² |
Range | 4.45 × 10¹² |
Interquartile range (IQR) | 4.29 × 10¹² |
Descriptive statistics
Standard deviation | 1.9341772 × 10¹² |
---|---|
Coefficient of variation (CV) | 0.56256846 |
Kurtosis | -1.8031674 |
Mean | 3.4381188 × 10¹² |
Median Absolute Deviation (MAD) | 2 × 10¹² |
Skewness | -0.17817672 |
Sum | 3.4725 × 10¹⁴ |
Variance | 3.7410414 × 10²⁴ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1120000000000 | 20 | 19.8% |
5410000000000 | 13 | 12.9% |
1110000000000 | 9 | 8.9% |
5560000000000 | 8 | 7.9% |
5420000000000 | 8 | 7.9% |
5120000000000 | 8 | 7.9% |
3420000000000 | 6 | 5.9% |
3410000000000 | 5 | 5.0% |
5230000000000 | 5 | 5.0% |
3220000000000 | 3 | 3.0% |
Other values (8) | 16 | 15.8% |
Value | Count | Frequency (%) |
1110000000000 | 9 | 8.9% |
1120000000000 | 20 | 19.8% |
1130000000000 | 3 | 3.0% |
1220000000000 | 2 | 2.0% |
1410000000000 | 2 | 2.0% |
1510000000000 | 3 | 3.0% |
3130000000000 | 1 | 1.0% |
3220000000000 | 3 | 3.0% |
3410000000000 | 5 | 5.0% |
3420000000000 | 6 | 5.9% |
Value | Count | Frequency (%) |
5560000000000 | 8 | 7.9% |
5430000000000 | 1 | 1.0% |
5420000000000 | 8 | 7.9% |
5410000000000 | 13 | 12.9% |
5230000000000 | 5 | 5.0% |
5220000000000 | 1 | 1.0% |
5130000000000 | 3 | 3.0% |
5120000000000 | 8 | 7.9% |
3420000000000 | 6 | 5.9% |
3410000000000 | 5 | 5.0% |
회원번호
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 48 |
---|---|
Distinct (%) | 47.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 77425659 |
Minimum | 0 |
---|---|
Maximum | 1.0030097 × 10⁸ |
Zeros | 23 |
Zeros (%) | 22.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1.0000232 × 10⁸ |
median | 1.0029901 × 10⁸ |
Q3 | 1.0030094 × 10⁸ |
95-th percentile | 1.0030097 × 10⁸ |
Maximum | 1.0030097 × 10⁸ |
Range | 1.0030097 × 10⁸ |
Interquartile range (IQR) | 298618 |
Descriptive statistics
Standard deviation | 42253524 |
---|---|
Coefficient of variation (CV) | 0.54573025 |
Kurtosis | -0.26810208 |
Mean | 77425659 |
Median Absolute Deviation (MAD) | 1953 |
Skewness | -1.3181747 |
Sum | 7.8199915 × 10⁹ |
Variance | 1.7853603 × 10¹⁵ |
Monotonicity | Not monotonic |
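The 23 zeros pull the mean of 회원번호 far below the median. If 0 is a placeholder for a missing member number (an assumption; the source does not say what 0 means), the mean over real member numbers can be estimated from the reported sum and counts alone. A rough sketch, limited by the rounding of the reported sum:

```python
# Figures taken from the tables above; the sum is rounded in the report
n_rows = 101
n_zeros = 23
reported_sum = 7.8199915e9

# Mean over all rows, zeros included (what the report shows)
mean_all = reported_sum / n_rows

# Mean over non-zero rows only; zeros contribute nothing to the sum,
# so only the denominator changes
mean_nonzero = reported_sum / (n_rows - n_zeros)

zero_share = n_zeros / n_rows

print(f"mean (all rows):      {mean_all:,.0f}")
print(f"mean (non-zero rows): {mean_nonzero:,.0f}")
print(f"zero share:           {zero_share:.1%}")
```

With zeros excluded, the mean falls between Q1 and the median, much closer to where most values lie.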
Value | Count | Frequency (%) |
0 | 23 | 22.8% |
100299014 | 6 | 5.9% |
100300957 | 4 | 4.0% |
100300946 | 4 | 4.0% |
100300969 | 4 | 4.0% |
100300921 | 3 | 3.0% |
100300935 | 3 | 3.0% |
100300122 | 3 | 3.0% |
100298895 | 3 | 3.0% |
100298711 | 2 | 2.0% |
Other values (38) | 46 | 45.5% |
Value | Count | Frequency (%) |
0 | 23 | 22.8% |
100000274 | 1 | 1.0% |
100002151 | 1 | 1.0% |
100002317 | 1 | 1.0% |
100022199 | 1 | 1.0% |
100066215 | 2 | 2.0% |
100075821 | 1 | 1.0% |
100076598 | 1 | 1.0% |
100104491 | 2 | 2.0% |
100110131 | 1 | 1.0% |
Value | Count | Frequency (%) |
100300969 | 4 | 4.0% |
100300968 | 1 | 1.0% |
100300967 | 2 | 2.0% |
100300965 | 1 | 1.0% |
100300964 | 1 | 1.0% |
100300959 | 1 | 1.0% |
100300957 | 4 | 4.0% |
100300952 | 1 | 1.0% |
100300951 | 1 | 1.0% |
100300950 | 1 | 1.0% |
장바구니아이디
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 39 |
---|---|
Distinct (%) | 38.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 107365.48 |
Minimum | 0 |
---|---|
Maximum | 210893 |
Zeros | 42 |
Zeros (%) | 41.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 138639 |
Q3 | 210813 |
95-th percentile | 210884 |
Maximum | 210893 |
Range | 210893 |
Interquartile range (IQR) | 210813 |
Descriptive statistics
Standard deviation | 102183.12 |
---|---|
Coefficient of variation (CV) | 0.95173165 |
Kurtosis | -1.9929347 |
Mean | 107365.48 |
Median Absolute Deviation (MAD) | 72254 |
Skewness | -0.036102418 |
Sum | 10843913 |
Variance | 1.044139 × 10¹⁰ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 42 | 41.6% |
208075 | 6 | 5.9% |
210865 | 4 | 4.0% |
210833 | 3 | 3.0% |
210893 | 3 | 3.0% |
209685 | 3 | 3.0% |
210813 | 3 | 3.0% |
207898 | 2 | 2.0% |
13946 | 2 | 2.0% |
208567 | 2 | 2.0% |
Other values (29) | 31 | 30.7% |
Value | Count | Frequency (%) |
0 | 42 | 41.6% |
6400 | 1 | 1.0% |
13946 | 2 | 2.0% |
25031 | 2 | 2.0% |
45056 | 1 | 1.0% |
73593 | 1 | 1.0% |
87921 | 1 | 1.0% |
138639 | 1 | 1.0% |
165140 | 1 | 1.0% |
178028 | 1 | 1.0% |
Value | Count | Frequency (%) |
210893 | 3 | 3.0% |
210891 | 1 | 1.0% |
210884 | 2 | 2.0% |
210880 | 1 | 1.0% |
210876 | 1 | 1.0% |
210868 | 1 | 1.0% |
210867 | 1 | 1.0% |
210865 | 4 | 4.0% |
210856 | 1 | 1.0% |
210851 | 1 | 1.0% |
주문서아이디
Categorical
IMBALANCE
 
Distinct | 11 |
---|---|
Distinct (%) | 10.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 936.0 B |
0 | 80 |
---|---|
B2019121721420950655 | 4 |
B2019121723175897966 | 4 |
B2019121720132235867 | 3 |
B2019121721183138325 | 2 |
Other values (6) | 8 |
Length
Max length | 20 |
---|---|
Median length | 1 |
Mean length | 4.950495 |
Min length | 1 |
Unique
Unique | 4 |
---|---|
Unique (%) | 4.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 80 | 79.2% |
B2019121721420950655 | 4 | 4.0% |
B2019121723175897966 | 4 | 4.0% |
B2019121720132235867 | 3 | 3.0% |
B2019121721183138325 | 2 | 2.0% |
B2019121722401503647 | 2 | 2.0% |
B2019121801221486241 | 2 | 2.0% |
B2019121718302962274 | 1 | 1.0% |
B2019121720073909696 | 1 | 1.0% |
B2019121723174380772 | 1 | 1.0% |
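The 59.9% imbalance figure for 주문서아이디 can be reproduced from the value counts with a normalized-entropy score, 1 − H / log₂(k), where H is the Shannon entropy of the class frequencies and k is the number of distinct values. Treat this definition as an assumption about how the profiler computes the score; it is offered here because it matches the reported number. The eleventh count (1) is inferred from the 101-row total, since the table lists only the ten most common values.

```python
import math

# Counts for the 11 distinct values: the ten listed above plus one
# remaining value, whose count of 1 is inferred from the 101-row total
counts = [80, 4, 4, 3, 2, 2, 2, 1, 1, 1, 1]


def imbalance_score(counts):
    """Return 1 - H / log2(k); 0 means balanced, 1 means a single class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

The placeholder value 0 covers 80 of 101 rows, so the entropy is well below the maximum for 11 classes and the score is high.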
Length
Value | Count | Frequency (%) |
0 | 80 | 79.2% |
b2019121721420950655 | 4 | 4.0% |
b2019121723175897966 | 4 | 4.0% |
b2019121720132235867 | 3 | 3.0% |
b2019121721183138325 | 2 | 2.0% |
b2019121722401503647 | 2 | 2.0% |
b2019121801221486241 | 2 | 2.0% |
b2019121718302962274 | 1 | 1.0% |
b2019121720073909696 | 1 | 1.0% |
b2019121723174380772 | 1 | 1.0% |
삭제여부
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 229.0 B |
False |
---|
Value | Count | Frequency (%) |
False | 101 | 100.0% |
등록일자
Date
Distinct | 85 |
---|---|
Distinct (%) | 84.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 936.0 B |
Minimum | 2019-12-17 17:41:00 |
---|---|
Maximum | 2019-12-18 09:44:00 |
 | 아이디 | 검색어이력아이디 | 상품아이디 | 회원번호 | 장바구니아이디 | 주문서아이디 | 등록일자 |
---|---|---|---|---|---|---|---|
아이디 | 1.000 | 0.970 | 0.277 | 0.247 | 0.714 | 0.403 | 0.996 |
검색어이력아이디 | 0.970 | 1.000 | 0.388 | 0.239 | 0.779 | 0.549 | 1.000 |
상품아이디 | 0.277 | 0.388 | 1.000 | 0.179 | 0.288 | 0.245 | 0.847 |
회원번호 | 0.247 | 0.239 | 0.179 | 1.000 | 0.556 | 0.000 | 0.000 |
장바구니아이디 | 0.714 | 0.779 | 0.288 | 0.556 | 1.000 | 0.732 | 0.947 |
주문서아이디 | 0.403 | 0.549 | 0.245 | 0.000 | 0.732 | 1.000 | 0.973 |
등록일자 | 0.996 | 1.000 | 0.847 | 0.000 | 0.947 | 0.973 | 1.000 |
 | 아이디 | 검색어이력아이디 | 상품아이디 | 회원번호 | 장바구니아이디 | 주문서아이디 |
---|---|---|---|---|---|---|
아이디 | 1.000 | 0.919 | 0.199 | 0.054 | 0.071 | 0.212 |
검색어이력아이디 | 0.919 | 1.000 | 0.221 | -0.004 | 0.040 | 0.278 |
상품아이디 | 0.199 | 0.221 | 1.000 | -0.043 | 0.008 | 0.050 |
회원번호 | 0.054 | -0.004 | -0.043 | 1.000 | 0.669 | 0.000 |
장바구니아이디 | 0.071 | 0.040 | 0.008 | 0.669 | 1.000 | 0.440 |
주문서아이디 | 0.212 | 0.278 | 0.050 | 0.000 | 0.440 | 1.000 |
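The two variable pairs flagged as "High correlation" in the alerts are exactly the off-diagonal pairs in the matrix directly above whose absolute coefficient is at least 0.5. The sketch below scans that matrix with a threshold of 0.5; the threshold is an assumption chosen because it reproduces the alerts, and it is not stated anywhere in the report.

```python
# Coefficients copied from the six-variable correlation matrix above
names = ["아이디", "검색어이력아이디", "상품아이디", "회원번호", "장바구니아이디", "주문서아이디"]
matrix = [
    [1.000, 0.919, 0.199, 0.054, 0.071, 0.212],
    [0.919, 1.000, 0.221, -0.004, 0.040, 0.278],
    [0.199, 0.221, 1.000, -0.043, 0.008, 0.050],
    [0.054, -0.004, -0.043, 1.000, 0.669, 0.000],
    [0.071, 0.040, 0.008, 0.669, 1.000, 0.440],
    [0.212, 0.278, 0.050, 0.000, 0.440, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off for a "high correlation" alert

# Visit each unordered pair once (upper triangle, diagonal excluded)
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

for a, b, r in high_pairs:
    print(f"{a} <-> {b}: {r:.3f}")
```

The closest pair below the cut-off is 장바구니아이디 and 주문서아이디 at 0.440, which is not flagged.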
First rows
 | 아이디 | 검색어이력아이디 | 상품아이디 | 회원번호 | 장바구니아이디 | 주문서아이디 | 삭제여부 | 등록일자 |
---|---|---|---|---|---|---|---|---|
0 | 1 | 3566405 | 5560000000000 | 100002317 | 165140 | 0 | N | 2019-12-18 9:44 |
1 | 101 | 3563292 | 1130000000000 | 100230430 | 0 | 0 | N | 2019-12-17 17:41 |
2 | 102 | 3563292 | 1130000000000 | 100230430 | 0 | 0 | N | 2019-12-17 17:41 |
3 | 103 | 3563331 | 1110000000000 | 100300912 | 210805 | 0 | N | 2019-12-17 17:47 |
4 | 104 | 3563335 | 5560000000000 | 100300851 | 210717 | 0 | N | 2019-12-17 17:48 |
5 | 105 | 3563360 | 1120000000000 | 0 | 0 | 0 | N | 2019-12-17 17:59 |
6 | 106 | 3563358 | 1110000000000 | 100300349 | 210766 | 0 | N | 2019-12-17 18:00 |
7 | 107 | 3563499 | 5420000000000 | 100300917 | 210809 | 0 | N | 2019-12-17 18:27 |
8 | 108 | 3563513 | 1110000000000 | 100000274 | 73593 | B2019121718302962274 | N | 2019-12-17 18:30 |
9 | 109 | 3563679 | 1120000000000 | 100300920 | 0 | 0 | N | 2019-12-17 19:15 |
Last rows
 | 아이디 | 검색어이력아이디 | 상품아이디 | 회원번호 | 장바구니아이디 | 주문서아이디 | 삭제여부 | 등록일자 |
---|---|---|---|---|---|---|---|---|
91 | 191 | 3565407 | 5220000000000 | 100299578 | 208895 | 0 | N | 2019-12-18 0:34 |
92 | 192 | 3565415 | 5120000000000 | 0 | 0 | 0 | N | 2019-12-18 0:34 |
93 | 193 | 3565436 | 3410000000000 | 100300969 | 210893 | 0 | N | 2019-12-18 0:49 |
94 | 194 | 3565458 | 3410000000000 | 100300969 | 210893 | 0 | N | 2019-12-18 0:51 |
95 | 195 | 3565467 | 3410000000000 | 100300969 | 210893 | 0 | N | 2019-12-18 0:52 |
96 | 196 | 3565477 | 3410000000000 | 100300969 | 0 | 0 | N | 2019-12-18 0:56 |
97 | 197 | 3565518 | 5420000000000 | 100066215 | 13946 | 0 | N | 2019-12-18 1:06 |
98 | 198 | 3565553 | 5420000000000 | 100066215 | 13946 | 0 | N | 2019-12-18 1:11 |
99 | 199 | 3565555 | 5410000000000 | 100185314 | 208567 | B2019121801221486241 | N | 2019-12-18 1:13 |
100 | 200 | 3565628 | 5130000000000 | 100300197 | 209777 | B2019121801281116268 | N | 2019-12-18 1:28 |