Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 70 |
Missing cells | 93 |
Missing cells (%) | 16.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.9 KiB |
Average record size in memory | 71.9 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 4 |
Text | 1 |
Boolean | 1 |
Dataset
Description | 아임셀러 업체별 카테고리 정보를 제공합니다. 기준연도, 기준월, 업체카테고리, 카테고리 사용여부 등을 제공합니다. |
---|---|
Author | (주)중소기업유통센터 |
URL | https://www.data.go.kr/data/15067139/fileData.do |
기준연도 has constant value "" | Constant |
기준월 has constant value "" | Constant |
업체카테고리번호 is highly overall correlated with 상위업체카테고리번호 and 1 other fields | High correlation |
상위업체카테고리번호 is highly overall correlated with 업체카테고리번호 | High correlation |
기준카테고리번호 is highly overall correlated with 업체카테고리번호 and 1 other fields | High correlation |
사용여부 is highly overall correlated with 기준카테고리번호 | High correlation |
사용여부 is highly imbalanced (89.2%) | Imbalance |
상위업체카테고리번호 has 41 (58.6%) missing values | Missing |
기준카테고리번호 has 52 (74.3%) missing values | Missing |
업체카테고리번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 12:54:42.778028 |
---|---|
Analysis finished | 2023-12-12 12:54:45.316713 |
Duration | 2.54 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기준연도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 692.0 B |
2020 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020 |
---|---|
2nd row | 2020 |
3rd row | 2020 |
4th row | 2020 |
5th row | 2020 |
Common Values
Value | Count | Frequency (%) |
2020 | 70 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020 | 70 |
기준월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 692.0 B |
9 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9 |
---|---|
2nd row | 9 |
3rd row | 9 |
4th row | 9 |
5th row | 9 |
Common Values
Value | Count | Frequency (%) |
9 | 70 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
9 | 70 |
업체번호
Real number (ℝ)
Distinct | 19 |
---|---|
Distinct (%) | 27.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 985.57143 |
Minimum | 152 |
---|---|
Maximum | 1652 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 762.0 B |
Quantile statistics
Minimum | 152 |
---|---|
5-th percentile | 171 |
Q1 | 556.75 |
median | 1234 |
Q3 | 1234 |
95-th percentile | 1459.1 |
Maximum | 1652 |
Range | 1500 |
Interquartile range (IQR) | 677.25 |
Descriptive statistics
Standard deviation | 441.80029 |
---|---|
Coefficient of variation (CV) | 0.44826816 |
Kurtosis | -0.63385659 |
Mean | 985.57143 |
Median Absolute Deviation (MAD) | 55 |
Skewness | -0.93787704 |
Sum | 68990 |
Variance | 195187.49 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1234 | 30 | |
1001 | 6 | 8.6% |
171 | 6 | 8.6% |
1284 | 5 | 7.1% |
1174 | 4 | 5.7% |
152 | 3 | 4.3% |
283 | 2 | 2.9% |
510 | 2 | 2.9% |
1537 | 2 | 2.9% |
442 | 1 | 1.4% |
Other values (9) | 9 | 12.9% |
Value | Count | Frequency (%) |
152 | 3 | |
171 | 6 | |
253 | 1 | 1.4% |
283 | 2 | 2.9% |
368 | 1 | 1.4% |
442 | 1 | 1.4% |
466 | 1 | 1.4% |
479 | 1 | 1.4% |
510 | 2 | 2.9% |
697 | 1 | 1.4% |
Value | Count | Frequency (%) |
1652 | 1 | 1.4% |
1537 | 2 | 2.9% |
1469 | 1 | 1.4% |
1447 | 1 | 1.4% |
1433 | 1 | 1.4% |
1284 | 5 | 7.1% |
1234 | 30 | |
1174 | 4 | 5.7% |
1001 | 6 | 8.6% |
697 | 1 | 1.4% |
업체카테고리번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 70 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.357143 |
Minimum | 1 |
---|---|
Maximum | 80 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 762.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4.45 |
Q1 | 18.25 |
median | 35.5 |
Q3 | 62.75 |
95-th percentile | 76.55 |
Maximum | 80 |
Range | 79 |
Interquartile range (IQR) | 44.5 |
Descriptive statistics
Standard deviation | 24.626506 |
---|---|
Coefficient of variation (CV) | 0.62571885 |
Kurtosis | -1.3657145 |
Mean | 39.357143 |
Median Absolute Deviation (MAD) | 22.5 |
Skewness | 0.13632963 |
Sum | 2755 |
Variance | 606.4648 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1 | 1.4% |
67 | 1 | 1.4% |
38 | 1 | 1.4% |
30 | 1 | 1.4% |
29 | 1 | 1.4% |
28 | 1 | 1.4% |
10 | 1 | 1.4% |
3 | 1 | 1.4% |
63 | 1 | 1.4% |
40 | 1 | 1.4% |
Other values (60) | 60 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
80 | 1 | |
79 | 1 | |
78 | 1 | |
77 | 1 | |
76 | 1 | |
75 | 1 | |
74 | 1 | |
73 | 1 | |
72 | 1 | |
71 | 1 |
상위업체카테고리번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 31.0% |
Missing | 41 |
Missing (%) | 58.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.241379 |
Minimum | 1 |
---|---|
Maximum | 76 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 762.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14 |
Q1 | 14 |
median | 24 |
Q3 | 31 |
95-th percentile | 76 |
Maximum | 76 |
Range | 75 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 20.272527 |
---|---|
Coefficient of variation (CV) | 0.69328219 |
Kurtosis | 1.8282228 |
Mean | 29.241379 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 1.6217943 |
Sum | 848 |
Variance | 410.97537 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14 | 7 | 10.0% |
31 | 6 | 8.6% |
23 | 4 | 5.7% |
24 | 3 | 4.3% |
76 | 3 | 4.3% |
20 | 2 | 2.9% |
28 | 2 | 2.9% |
1 | 1 | 1.4% |
75 | 1 | 1.4% |
(Missing) | 41 |
Value | Count | Frequency (%) |
1 | 1 | 1.4% |
14 | 7 | |
20 | 2 | 2.9% |
23 | 4 | |
24 | 3 | |
28 | 2 | 2.9% |
31 | 6 | |
75 | 1 | 1.4% |
76 | 3 |
Value | Count | Frequency (%) |
76 | 3 | |
75 | 1 | 1.4% |
31 | 6 | |
28 | 2 | 2.9% |
24 | 3 | |
23 | 4 | |
20 | 2 | 2.9% |
14 | 7 | |
1 | 1 | 1.4% |
업체카테고리명
Text
Distinct | 69 |
---|---|
Distinct (%) | 98.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 692.0 B |
Value | Count | Frequency (%) |
오리지날산양유 | 2 | 2.9% |
가방 | 1 | 1.4% |
팜핑/팜파티 | 1 | 1.4% |
방문 | 1 | 1.4% |
의약용품 | 1 | 1.4% |
산양유화장품 | 1 | 1.4% |
미용/건강/의약품 | 1 | 1.4% |
건조대 | 1 | 1.4% |
테스트 | 1 | 1.4% |
장갑 | 1 | 1.4% |
Other values (59) | 59 |
Most occurring characters
Value | Count | Frequency (%) |
양 | 16 | 4.7% |
기 | 12 | 3.5% |
품 | 11 | 3.2% |
산 | 11 | 3.2% |
유 | 10 | 2.9% |
/ | 10 | 2.9% |
스 | 7 | 2.0% |
r | 6 | 1.7% |
e | 6 | 1.7% |
고 | 5 | 1.5% |
Other values (161) | 249 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 290 | |
Lowercase Letter | 31 | 9.0% |
Uppercase Letter | 12 | 3.5% |
Other Punctuation | 10 | 2.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
양 | 16 | 5.5% |
기 | 12 | 4.1% |
품 | 11 | 3.8% |
산 | 11 | 3.8% |
유 | 10 | 3.4% |
스 | 7 | 2.4% |
고 | 5 | 1.7% |
제 | 5 | 1.7% |
용 | 4 | 1.4% |
리 | 4 | 1.4% |
Other values (135) | 205 |
Lowercase Letter
Value | Count | Frequency (%) |
r | 6 | |
e | 6 | |
t | 3 | |
o | 3 | |
i | 2 | 6.5% |
a | 2 | 6.5% |
y | 1 | 3.2% |
h | 1 | 3.2% |
l | 1 | 3.2% |
d | 1 | 3.2% |
Other values (5) | 5 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 2 | |
E | 2 | |
S | 1 | |
M | 1 | |
T | 1 | |
W | 1 | |
K | 1 | |
D | 1 | |
L | 1 | |
P | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 290 | |
Latin | 43 | 12.5% |
Common | 10 | 2.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
양 | 16 | 5.5% |
기 | 12 | 4.1% |
품 | 11 | 3.8% |
산 | 11 | 3.8% |
유 | 10 | 3.4% |
스 | 7 | 2.4% |
고 | 5 | 1.7% |
제 | 5 | 1.7% |
용 | 4 | 1.4% |
리 | 4 | 1.4% |
Other values (135) | 205 |
Latin
Value | Count | Frequency (%) |
r | 6 | 14.0% |
e | 6 | 14.0% |
t | 3 | 7.0% |
o | 3 | 7.0% |
i | 2 | 4.7% |
G | 2 | 4.7% |
E | 2 | 4.7% |
a | 2 | 4.7% |
y | 1 | 2.3% |
S | 1 | 2.3% |
Other values (15) | 15 |
Common
Value | Count | Frequency (%) |
/ | 10 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 290 | |
ASCII | 53 | 15.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
양 | 16 | 5.5% |
기 | 12 | 4.1% |
품 | 11 | 3.8% |
산 | 11 | 3.8% |
유 | 10 | 3.4% |
스 | 7 | 2.4% |
고 | 5 | 1.7% |
제 | 5 | 1.7% |
용 | 4 | 1.4% |
리 | 4 | 1.4% |
Other values (135) | 205 |
ASCII
Value | Count | Frequency (%) |
/ | 10 | |
r | 6 | 11.3% |
e | 6 | 11.3% |
t | 3 | 5.7% |
o | 3 | 5.7% |
i | 2 | 3.8% |
G | 2 | 3.8% |
E | 2 | 3.8% |
a | 2 | 3.8% |
y | 1 | 1.9% |
Other values (16) | 16 |
기준카테고리번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 17 |
---|---|
Distinct (%) | 94.4% |
Missing | 52 |
Missing (%) | 74.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4550.6667 |
Minimum | 36 |
---|---|
Maximum | 8066 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 762.0 B |
Quantile statistics
Minimum | 36 |
---|---|
5-th percentile | 501.8 |
Q1 | 1681 |
median | 4938 |
Q3 | 7735.5 |
95-th percentile | 8026.9 |
Maximum | 8066 |
Range | 8030 |
Interquartile range (IQR) | 6054.5 |
Descriptive statistics
Standard deviation | 3048.7521 |
---|---|
Coefficient of variation (CV) | 0.66995724 |
Kurtosis | -1.8520343 |
Mean | 4550.6667 |
Median Absolute Deviation (MAD) | 3061 |
Skewness | -0.083999633 |
Sum | 81912 |
Variance | 9294889.3 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7997 | 2 | 2.9% |
8001 | 1 | 1.4% |
584 | 1 | 1.4% |
1711 | 1 | 1.4% |
1634 | 1 | 1.4% |
6951 | 1 | 1.4% |
2987 | 1 | 1.4% |
36 | 1 | 1.4% |
6491 | 1 | 1.4% |
3427 | 1 | 1.4% |
Other values (7) | 7 | 10.0% |
(Missing) | 52 |
Value | Count | Frequency (%) |
36 | 1 | |
584 | 1 | |
1370 | 1 | |
1634 | 1 | |
1671 | 1 | |
1711 | 1 | |
2040 | 1 | |
2987 | 1 | |
3427 | 1 | |
6449 | 1 |
Value | Count | Frequency (%) |
8066 | 1 | |
8020 | 1 | |
8001 | 1 | |
7997 | 2 | |
6951 | 1 | |
6491 | 1 | |
6480 | 1 | |
6449 | 1 | |
3427 | 1 | |
2987 | 1 |
사용여부
Boolean
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 202.0 B |
True | |
---|---|
False | 1 |
Value | Count | Frequency (%) |
True | 69 | |
False | 1 | 1.4% |
업체번호 | 업체카테고리번호 | 상위업체카테고리번호 | 업체카테고리명 | 기준카테고리번호 | 사용여부 | |
---|---|---|---|---|---|---|
업체번호 | 1.000 | 0.821 | 1.000 | 1.000 | 0.897 | 0.000 |
업체카테고리번호 | 0.821 | 1.000 | 0.902 | 0.953 | 0.655 | 0.000 |
상위업체카테고리번호 | 1.000 | 0.902 | 1.000 | 1.000 | NaN | 0.000 |
업체카테고리명 | 1.000 | 0.953 | 1.000 | 1.000 | 1.000 | 0.000 |
기준카테고리번호 | 0.897 | 0.655 | NaN | 1.000 | 1.000 | NaN |
사용여부 | 0.000 | 0.000 | 0.000 | 0.000 | NaN | 1.000 |
업체번호 | 업체카테고리번호 | 상위업체카테고리번호 | 기준카테고리번호 | 사용여부 | |
---|---|---|---|---|---|
업체번호 | 1.000 | 0.338 | -0.357 | -0.377 | 0.000 |
업체카테고리번호 | 0.338 | 1.000 | 0.872 | -0.521 | 0.000 |
상위업체카테고리번호 | -0.357 | 0.872 | 1.000 | NaN | 0.000 |
기준카테고리번호 | -0.377 | -0.521 | NaN | 1.000 | 1.000 |
사용여부 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
기준연도 | 기준월 | 업체번호 | 업체카테고리번호 | 상위업체카테고리번호 | 업체카테고리명 | 기준카테고리번호 | 사용여부 | |
---|---|---|---|---|---|---|---|---|
0 | 2020 | 9 | 510 | 1 | <NA> | 장갑 | <NA> | Y |
1 | 2020 | 9 | 152 | 4 | <NA> | 제초기 | 6491 | Y |
2 | 2020 | 9 | 697 | 11 | <NA> | 통기타 | 3427 | Y |
3 | 2020 | 9 | 479 | 13 | <NA> | 전두부 | 8020 | Y |
4 | 2020 | 9 | 1234 | 25 | 24 | 캐릭터상품 | <NA> | Y |
5 | 2020 | 9 | 1234 | 26 | 24 | 의류 | <NA> | Y |
6 | 2020 | 9 | 1234 | 27 | 24 | 양가죽제품 | <NA> | Y |
7 | 2020 | 9 | 1234 | 31 | <NA> | TheGoatWorld | <NA> | Y |
8 | 2020 | 9 | 171 | 62 | <NA> | 네일클리퍼 | <NA> | Y |
9 | 2020 | 9 | 1537 | 64 | <NA> | 방향제 | 1671 | Y |
기준연도 | 기준월 | 업체번호 | 업체카테고리번호 | 상위업체카테고리번호 | 업체카테고리명 | 기준카테고리번호 | 사용여부 | |
---|---|---|---|---|---|---|---|---|
60 | 2020 | 9 | 1284 | 66 | <NA> | 칼라복합기 | <NA> | Y |
61 | 2020 | 9 | 1284 | 68 | <NA> | 칼라프린터 | <NA> | Y |
62 | 2020 | 9 | 1469 | 74 | <NA> | 향초선물세트 | 1711 | Y |
63 | 2020 | 9 | 1001 | 76 | <NA> | 스킨케어 | <NA> | Y |
64 | 2020 | 9 | 1001 | 75 | <NA> | 헤어/바디 | <NA> | Y |
65 | 2020 | 9 | 1001 | 77 | 75 | 바디로션/오일 | <NA> | Y |
66 | 2020 | 9 | 1001 | 79 | 76 | 토너/에멀젼 | <NA> | Y |
67 | 2020 | 9 | 1001 | 80 | 76 | 에센스/크림 | <NA> | Y |
68 | 2020 | 9 | 1447 | 73 | <NA> | 가방 | 584 | Y |
69 | 2020 | 9 | 1001 | 78 | 76 | 입술 | <NA> | Y |