Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 400 |
Missing cells | 400 |
Missing cells (%) | 9.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 36.8 KiB |
Average record size in memory | 94.3 B |
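The missing-cell percentage follows from the counts in the table above: 400 missing cells out of 400 × 11 cells in total. A minimal sketch of that arithmetic, where the helper name `missing_cell_share` is illustrative and not part of ydata-profiling:

```python
def missing_cell_share(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a fraction of all cells in the table."""
    total_cells = n_rows * n_cols
    if total_cells == 0:
        raise ValueError("table has no cells")
    return missing_cells / total_cells


# Figures taken from the dataset statistics table.
share = missing_cell_share(missing_cells=400, n_rows=400, n_cols=11)
print(f"{share:.1%}")  # 9.1%
```

All 400 missing cells come from the single fully empty column 소상공인시스템로그ID, so dropping that column would leave no missing cells.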
Variable types
Categorical | 6 |
---|---|
Unsupported | 1 |
Numeric | 3 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 소상공인연합회 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KFMZEROSTT011 |
소상공인결제분류코드 has constant value "" | Constant |
년월 has constant value "" | Constant |
광역시도코드 has constant value "" | Constant |
광역시도명 has constant value "" | Constant |
소상공인시스템로그일시 has constant value "" | Constant |
결제건수 is highly overall correlated with 합계금액 | High correlation |
합계금액 is highly overall correlated with 결제건수 | High correlation |
표준산업업종상세분류코드 is highly overall correlated with 표준산업업종대분류코드 | High correlation |
표준산업업종대분류코드 is highly overall correlated with 표준산업업종상세분류코드 | High correlation |
소상공인시스템로그ID has 400 (100.0%) missing values | Missing |
표준산업업종상세분류코드 has unique values | Unique |
표준산업업종상세분류명 has unique values | Unique |
소상공인시스템로그ID is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
결제건수 has 229 (57.2%) zeros | Zeros |
합계금액 has 229 (57.2%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 06:55:14.661042 |
---|---|
Analysis finished | 2023-12-10 06:55:16.131513 |
Duration | 1.47 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
소상공인결제분류코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
ZEROP42000 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ZEROP42000 |
---|---|
2nd row | ZEROP42000 |
3rd row | ZEROP42000 |
4th row | ZEROP42000 |
5th row | ZEROP42000 |
Common Values
Value | Count | Frequency (%) |
ZEROP42000 | 400 |
년월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
202008 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202008 |
---|---|
2nd row | 202008 |
3rd row | 202008 |
4th row | 202008 |
5th row | 202008 |
Common Values
Value | Count | Frequency (%) |
202008 | 400 |
소상공인시스템로그ID
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 400 |
---|---|
Missing (%) | 100.0% |
Memory size | 3.6 KiB |
광역시도코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
42 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 42 |
---|---|
2nd row | 42 |
3rd row | 42 |
4th row | 42 |
5th row | 42 |
Common Values
Value | Count | Frequency (%) |
42 | 400 |
광역시도명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
강원도 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 강원도 |
3rd row | 강원도 |
4th row | 강원도 |
5th row | 강원도 |
Common Values
Value | Count | Frequency (%) |
강원도 | 400 |
결제건수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 75 |
---|---|
Distinct (%) | 18.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 131.1975 |
Minimum | 0 |
---|---|
Maximum | 18513 |
Zeros | 229 |
Zeros (%) | 57.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 7 |
95-th percentile | 185.05 |
Maximum | 18513 |
Range | 18513 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 1181.8317 |
---|---|
Coefficient of variation (CV) | 9.0080354 |
Kurtosis | 174.4912 |
Mean | 131.1975 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 12.736001 |
Sum | 52479 |
Variance | 1396726.2 |
Monotonicity | Not monotonic |
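The descriptive statistics above are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A short sketch that rechecks the reported figures, with the inputs copied from the report and the variable names chosen only for illustration:

```python
import math

n = 400              # Number of observations
total = 52479        # Sum
std = 1181.8317      # Standard deviation (as rounded in the report)

mean = total / n     # reported as 131.1975
cv = std / mean      # reported as 9.0080354
variance = std ** 2  # reported as 1396726.2

# The recomputed values agree with the report up to its rounding.
print(math.isclose(mean, 131.1975))
print(math.isclose(cv, 9.0080354, rel_tol=1e-6))
print(math.isclose(variance, 1396726.2, rel_tol=1e-6))
```

A coefficient of variation near 9, together with a median of 0, reflects a distribution dominated by zeros and a few very large values.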
Value | Count | Frequency (%) |
0 | 229 | 57.2% |
1 | 18 | 4.5% |
3 | 16 | 4.0% |
2 | 12 | 3.0% |
4 | 9 | 2.2% |
5 | 9 | 2.2% |
7 | 8 | 2.0% |
8 | 5 | 1.2% |
15 | 4 | 1.0% |
19 | 4 | 1.0% |
Other values (65) | 86 | 21.5% |
Value | Count | Frequency (%) |
0 | 229 | 57.2% |
1 | 18 | 4.5% |
2 | 12 | 3.0% |
3 | 16 | 4.0% |
4 | 9 | 2.2% |
5 | 9 | 2.2% |
6 | 2 | 0.5% |
7 | 8 | 2.0% |
8 | 5 | 1.2% |
9 | 3 | 0.8% |
Value | Count | Frequency (%) |
18513 | 1 | |
10958 | 1 | |
9646 | 1 | |
1328 | 1 | |
1183 | 1 | |
1104 | 1 | |
1028 | 1 | |
1027 | 1 | |
691 | 1 | |
564 | 1 |
합계금액
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 168 |
---|---|
Distinct (%) | 42.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4608035.2 |
Minimum | 0 |
---|---|
Maximum | 4.8997702 × 10^8 |
Zeros | 229 |
Zeros (%) | 57.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 421250 |
95-th percentile | 16582361 |
Maximum | 4.8997702 × 10^8 |
Range | 4.8997702 × 10^8 |
Interquartile range (IQR) | 421250 |
Descriptive statistics
Standard deviation | 30481824 |
---|---|
Coefficient of variation (CV) | 6.6149286 |
Kurtosis | 180.89756 |
Mean | 4608035.2 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 12.600091 |
Sum | 1.8432141 × 10^9 |
Variance | 9.291416 × 10^14 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 229 | 57.2% |
78000 | 2 | 0.5% |
76000 | 2 | 0.5% |
200000 | 2 | 0.5% |
97000 | 2 | 0.5% |
3302110 | 1 | 0.2% |
87130507 | 1 | 0.2% |
20521609 | 1 | 0.2% |
16534655 | 1 | 0.2% |
6556105 | 1 | 0.2% |
Other values (158) | 158 |
Value | Count | Frequency (%) |
0 | 229 | 57.2% |
1 | 1 | 0.2% |
3000 | 1 | 0.2% |
6000 | 1 | 0.2% |
10200 | 1 | 0.2% |
12560 | 1 | 0.2% |
12900 | 1 | 0.2% |
13500 | 1 | 0.2% |
16000 | 1 | 0.2% |
17500 | 1 | 0.2% |
Value | Count | Frequency (%) |
489977022 | 1 | |
285105325 | 1 | |
163523266 | 1 | |
87130507 | 1 | |
63980801 | 1 | |
55675154 | 1 | |
52576826 | 1 | |
50934000 | 1 | |
35788980 | 1 | |
35136000 | 1 |
표준산업업종대분류코드
Categorical
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
C | |
---|---|
G | |
A | |
F | |
H | |
Other values (3) | 8 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A |
---|---|
2nd row | A |
3rd row | A |
4th row | A |
5th row | A |
Common Values
Value | Count | Frequency (%) |
C | 169 | |
G | 161 | |
A | 24 | 6.0% |
F | 21 | 5.2% |
H | 17 | 4.2% |
D | 3 | 0.8% |
E | 3 | 0.8% |
I | 2 | 0.5% |
표준산업업종상세분류코드
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 32794.677 |
Minimum | 1110 |
---|---|
Maximum | 55102 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1110 |
---|---|
5-th percentile | 2039.5 |
Q1 | 17901.75 |
median | 41111.5 |
Q3 | 46693.75 |
95-th percentile | 47992.05 |
Maximum | 55102 |
Range | 53992 |
Interquartile range (IQR) | 28792 |
Descriptive statistics
Standard deviation | 15885.66 |
---|---|
Coefficient of variation (CV) | 0.4843975 |
Kurtosis | -1.1773585 |
Mean | 32794.677 |
Median Absolute Deviation (MAD) | 7947 |
Skewness | -0.53017602 |
Sum | 13117871 |
Variance | 2.5235419 × 10^8 |
Monotonicity | Strictly increasing |
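The range, the interquartile range and the "Strictly increasing" flag can all be rechecked from values shown in this report. The sketch below uses the ten smallest and ten largest codes listed for this variable, so the monotonicity check covers only those 20 values and not the full column. The function name `is_strictly_increasing` is illustrative.

```python
def is_strictly_increasing(values):
    """True when every value is larger than the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))


# Codes copied from the report's extreme-value tables, in ascending order.
lowest = [1110, 1121, 1122, 1123, 1131, 1132, 1140, 1152, 1159, 1212]
highest = [52103, 52109, 52913, 52915, 52919, 52929, 52992, 52999, 55101, 55102]

value_range = 55102 - 1110   # Maximum - Minimum
iqr = 46693.75 - 17901.75    # Q3 - Q1

print(value_range, iqr, is_strictly_increasing(lowest + highest))
# 53992 28792.0 True
```

Because the codes are unique and sorted, the column behaves like an ordered identifier, so its mean, skewness and kurtosis carry little analytical meaning.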
Value | Count | Frequency (%) |
1110 | 1 | 0.2% |
46431 | 1 | 0.2% |
46453 | 1 | 0.2% |
46452 | 1 | 0.2% |
46451 | 1 | 0.2% |
46444 | 1 | 0.2% |
46443 | 1 | 0.2% |
46442 | 1 | 0.2% |
46441 | 1 | 0.2% |
46439 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
1110 | 1 | |
1121 | 1 | |
1122 | 1 | |
1123 | 1 | |
1131 | 1 | |
1132 | 1 | |
1140 | 1 | |
1152 | 1 | |
1159 | 1 | |
1212 | 1 |
Value | Count | Frequency (%) |
55102 | 1 | |
55101 | 1 | |
52999 | 1 | |
52992 | 1 | |
52929 | 1 | |
52919 | 1 | |
52915 | 1 | |
52913 | 1 | |
52109 | 1 | |
52103 | 1 |
표준산업업종상세분류명
Text
UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 28 |
---|---|
Median length | 22 |
Mean length | 13.3325 |
Min length | 3 |
Characters and Unicode
Total characters | 5333 |
---|---|
Distinct characters | 313 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 3 |
Unique
Unique | 400 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 곡물 및 기타 식량작물 재배업 |
---|---|
2nd row | 채소작물 재배업 |
3rd row | 화훼작물 재배업 |
4th row | 종자 및 묘목 생산업 |
5th row | 과실작물 재배업 |
Value | Count | Frequency (%) |
및 | 195 | 12.2% |
제조업 | 140 | 8.7% |
기타 | 90 | 5.6% |
도매업 | 72 | 4.5% |
소매업 | 64 | 4.0% |
외 | 23 | 1.4% |
그 | 23 | 1.4% |
기기 | 12 | 0.7% |
판매업 | 12 | 0.7% |
자동차 | 12 | 0.7% |
Other values (575) | 960 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 1203 | 22.6% |
업 | 417 | 7.8% |
제 | 213 | 4.0% |
및 | 195 | 3.7% |
기 | 180 | 3.4% |
조 | 173 | 3.2% |
매 | 151 | 2.8% |
품 | 136 | 2.6% |
용 | 117 | 2.2% |
타 | 96 | 1.8% |
Other values (303) | 2452 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4125 | |
Space Separator | 1203 | 22.6% |
Close Punctuation | 2 | < 0.1% |
Open Punctuation | 2 | < 0.1% |
Decimal Number | 1 | < 0.1% |
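The category names in this table correspond to Unicode general categories: Other Letter is `Lo`, Space Separator is `Zs`, Open and Close Punctuation are `Ps` and `Pe`, and Decimal Number is `Nd`. As a rough sketch of how such counts can be obtained, the code below uses Python's standard `unicodedata` module on two of the sample rows. This is not necessarily how ydata-profiling computes them, and `category_counts` is an illustrative name.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count characters by Unicode general category (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


# Two sample rows copied from the report.
samples = ["곡물 및 기타 식량작물 재배업", "채소작물 재배업"]
counts = category_counts(samples)
print(counts["Lo"], counts["Zs"])  # 19 5

# The per-category counts in the report add up to the total character count.
report_counts = {"Lo": 4125, "Zs": 1203, "Pe": 2, "Ps": 2, "Nd": 1}
print(sum(report_counts.values()))  # 5333
```

Hangul syllables and the compatibility jamo ㆍ both fall under `Lo`, which is consistent with the 4125 Other Letter characters being split into 4106 Hangul and 19 Compat Jamo in the block table.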
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 417 | 10.1% |
제 | 213 | 5.2% |
및 | 195 | 4.7% |
기 | 180 | 4.4% |
조 | 173 | 4.2% |
매 | 151 | 3.7% |
품 | 136 | 3.3% |
용 | 117 | 2.8% |
타 | 96 | 2.3% |
도 | 84 | 2.0% |
Other values (299) | 2363 |
Space Separator
Value | Count | Frequency (%) |
(space) | 1203 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4125 | |
Common | 1208 | 22.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 417 | 10.1% |
제 | 213 | 5.2% |
및 | 195 | 4.7% |
기 | 180 | 4.4% |
조 | 173 | 4.2% |
매 | 151 | 3.7% |
품 | 136 | 3.3% |
용 | 117 | 2.8% |
타 | 96 | 2.3% |
도 | 84 | 2.0% |
Other values (299) | 2363 |
Common
Value | Count | Frequency (%) |
(space) | 1203 | |
) | 2 | 0.2% |
( | 2 | 0.2% |
1 | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4106 | |
ASCII | 1208 | 22.7% |
Compat Jamo | 19 | 0.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 1203 | |
) | 2 | 0.2% |
( | 2 | 0.2% |
1 | 1 | 0.1% |
Hangul
Value | Count | Frequency (%) |
업 | 417 | 10.2% |
제 | 213 | 5.2% |
및 | 195 | 4.7% |
기 | 180 | 4.4% |
조 | 173 | 4.2% |
매 | 151 | 3.7% |
품 | 136 | 3.3% |
용 | 117 | 2.8% |
타 | 96 | 2.3% |
도 | 84 | 2.0% |
Other values (298) | 2344 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 19 |
소상공인시스템로그일시
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
2020-10-21 12:28:43.0 |
---|
Length
Max length | 21 |
---|---|
Median length | 21 |
Mean length | 21 |
Min length | 21 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-10-21 12:28:43.0 |
---|---|
2nd row | 2020-10-21 12:28:43.0 |
3rd row | 2020-10-21 12:28:43.0 |
4th row | 2020-10-21 12:28:43.0 |
5th row | 2020-10-21 12:28:43.0 |
Common Values
Value | Count | Frequency (%) |
2020-10-21 12:28:43.0 | 400 |
Variable | 결제건수 | 합계금액 | 표준산업업종대분류코드 | 표준산업업종상세분류코드 |
---|---|---|---|---|
결제건수 | 1.000 | 1.000 | 0.000 | 0.000 |
합계금액 | 1.000 | 1.000 | 0.000 | 0.000 |
표준산업업종대분류코드 | 0.000 | 0.000 | 1.000 | 0.910 |
표준산업업종상세분류코드 | 0.000 | 0.000 | 0.910 | 1.000 |
Variable | 결제건수 | 합계금액 | 표준산업업종상세분류코드 | 표준산업업종대분류코드 |
---|---|---|---|---|
결제건수 | 1.000 | 0.986 | 0.280 | 0.000 |
합계금액 | 0.986 | 1.000 | 0.301 | 0.000 |
표준산업업종상세분류코드 | 0.280 | 0.301 | 1.000 | 0.744 |
표준산업업종대분류코드 | 0.000 | 0.000 | 0.744 | 1.000 |
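The report does not state which coefficient each matrix uses. As one assumed possibility for the numeric pair 결제건수 and 합계금액, a rank-based (Spearman) correlation can be sketched with the standard library alone. The data below are only the first ten sample rows of the dataset, so the result is not expected to reproduce the values in the matrices above, and all function names are illustrative.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # 1-based average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# 결제건수 and 합계금액 from the first ten sample rows of the dataset.
counts = [0, 0, 0, 0, 48, 0, 0, 0, 0, 2]
amounts = [0, 0, 0, 0, 2180000, 0, 0, 0, 0, 1600000]
rho = spearman(counts, amounts)
print(rho)
```

Since both columns are zero in the same rows, a large share of the correlation between them comes from the shared zeros, which is worth keeping in mind when reading the high-correlation alerts.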
First rows
Row | 소상공인결제분류코드 | 년월 | 소상공인시스템로그ID | 광역시도코드 | 광역시도명 | 결제건수 | 합계금액 | 표준산업업종대분류코드 | 표준산업업종상세분류코드 | 표준산업업종상세분류명 | 소상공인시스템로그일시 |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1110 | 곡물 및 기타 식량작물 재배업 | 2020-10-21 12:28:43.0 |
1 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1121 | 채소작물 재배업 | 2020-10-21 12:28:43.0 |
2 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1122 | 화훼작물 재배업 | 2020-10-21 12:28:43.0 |
3 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1123 | 종자 및 묘목 생산업 | 2020-10-21 12:28:43.0 |
4 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 48 | 2180000 | A | 1131 | 과실작물 재배업 | 2020-10-21 12:28:43.0 |
5 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1132 | 음료용 및 향신용 작물 재배업 | 2020-10-21 12:28:43.0 |
6 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1140 | 기타 작물 재배업 | 2020-10-21 12:28:43.0 |
7 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1152 | 채소화훼 및 과실작물 시설 재배업 | 2020-10-21 12:28:43.0 |
8 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | A | 1159 | 기타 시설작물 재배업 | 2020-10-21 12:28:43.0 |
9 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 2 | 1600000 | A | 1212 | 육우 사육업 | 2020-10-21 12:28:43.0 |
Last rows
Row | 소상공인결제분류코드 | 년월 | 소상공인시스템로그ID | 광역시도코드 | 광역시도명 | 결제건수 | 합계금액 | 표준산업업종대분류코드 | 표준산업업종상세분류코드 | 표준산업업종상세분류명 | 소상공인시스템로그일시 |
---|---|---|---|---|---|---|---|---|---|---|---|
390 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | H | 52103 | 농산물 창고업 | 2020-10-21 12:28:43.0 |
391 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | H | 52109 | 기타 보관 및 창고업 | 2020-10-21 12:28:43.0 |
392 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | H | 52913 | 물류 터미널 운영업 | 2020-10-21 12:28:43.0 |
393 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 10 | 84800 | H | 52915 | 주차장 운영업 | 2020-10-21 12:28:43.0 |
394 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 4 | 161697 | H | 52919 | 기타 육상 운송지원 서비스업 | 2020-10-21 12:28:43.0 |
395 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | H | 52929 | 기타 수상 운송 지원 서비스업 | 2020-10-21 12:28:43.0 |
396 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | H | 52992 | 화물 운송 중개대리 및 관련 서비스업 | 2020-10-21 12:28:43.0 |
397 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 0 | 0 | H | 52999 | 그 외 기타 분류 안된 운송관련 서비스업 | 2020-10-21 12:28:43.0 |
398 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 92 | 4556700 | I | 55101 | 호텔업 | 2020-10-21 12:28:43.0 |
399 | ZEROP42000 | 202008 | <NA> | 42 | 강원도 | 77 | 16404320 | I | 55102 | 여관업 | 2020-10-21 12:28:43.0 |
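Each sample row pairs a payment count (결제건수) with a total amount (합계금액), so an average amount per payment can be derived per industry code. The sketch below uses a few of the sample rows shown above. The helper `amount_per_payment` is illustrative, and returning `None` for rows with zero payments is an assumed convention, not something the report prescribes.

```python
def amount_per_payment(count, amount):
    """Average amount per payment, or None when there were no payments."""
    return amount / count if count else None


# (표준산업업종상세분류코드, 결제건수, 합계금액) copied from the sample rows.
rows = [
    (1110, 0, 0),
    (1131, 48, 2180000),
    (1212, 2, 1600000),
    (52915, 10, 84800),
    (55101, 92, 4556700),
]

for code, count, amount in rows:
    print(code, amount_per_payment(count, amount))
```

With 57.2% of rows at zero payments, guarding the division matters more here than in a typical dataset.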