Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 10000 |
Missing cells (%) | 12.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 742.2 KiB |
Average record size in memory | 76.0 B |
Variable types
Numeric | 3 |
---|---|
Unsupported | 1 |
Categorical | 3 |
DateTime | 1 |
Dataset
Description | 경상북도 상주시 대금지급 테이블 |
---|---|
Author | 경상북도 상주시 |
URL | https://www.data.go.kr/data/15063670/fileData.do |
REG_DATE is highly overall correlated with SEQ and 2 other fields | High correlation |
MODIFY_DATE is highly overall correlated with SEQ and 2 other fields | High correlation |
SEQ is highly overall correlated with CONTRACT_MNG_NO and 2 other fields | High correlation |
CONTRACT_MNG_NO is highly overall correlated with SEQ and 2 other fields | High correlation |
PAYMENT_KIND is highly imbalanced (53.9%) | Imbalance |
MODIFY_DATE is highly imbalanced (59.5%) | Imbalance |
REG_DATE is highly imbalanced (59.5%) | Imbalance |
CONTRACT_SEQ has 10000 (100.0%) missing values | Missing |
SEQ has unique values | Unique |
CONTRACT_SEQ is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-11 23:10:26.869104 |
---|---|
Analysis finished | 2023-12-11 23:10:28.704119 |
Duration | 1.84 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
SEQ
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33725.102 |
Minimum | 5 |
---|---|
Maximum | 75289 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 3118.9 |
Q1 | 16297 |
median | 33071.5 |
Q3 | 49874.25 |
95-th percentile | 71067.45 |
Maximum | 75289 |
Range | 75284 |
Interquartile range (IQR) | 33577.25 |
Descriptive statistics
Standard deviation | 20382.534 |
---|---|
Coefficient of variation (CV) | 0.60437279 |
Kurtosis | -0.94056154 |
Mean | 33725.102 |
Median Absolute Deviation (MAD) | 16789 |
Skewness | 0.19122796 |
Sum | 3.3725102 × 108 |
Variance | 4.154477 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4178 | 1 | < 0.1% |
19518 | 1 | < 0.1% |
38617 | 1 | < 0.1% |
72422 | 1 | < 0.1% |
27772 | 1 | < 0.1% |
23131 | 1 | < 0.1% |
74639 | 1 | < 0.1% |
4400 | 1 | < 0.1% |
1088 | 1 | < 0.1% |
20552 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
5 | 1 | |
10 | 1 | |
28 | 1 | |
31 | 1 | |
40 | 1 | |
52 | 1 | |
66 | 1 | |
69 | 1 | |
70 | 1 | |
78 | 1 |
Value | Count | Frequency (%) |
75289 | 1 | |
75283 | 1 | |
75277 | 1 | |
75273 | 1 | |
75269 | 1 | |
75266 | 1 | |
75253 | 1 | |
75245 | 1 | |
75226 | 1 | |
75205 | 1 |
CONTRACT_SEQ
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
CONTRACT_MNG_NO
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0129958 × 1011 |
Minimum | 2.004 × 1011 |
---|---|
Maximum | 2.017 × 1011 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.004 × 1011 |
---|---|
5-th percentile | 2.00901 × 1011 |
Q1 | 2.011 × 1011 |
median | 2.013 × 1011 |
Q3 | 2.015 × 1011 |
95-th percentile | 2.017 × 1011 |
Maximum | 2.017 × 1011 |
Range | 1.3 × 109 |
Interquartile range (IQR) | 4 × 108 |
Descriptive statistics
Standard deviation | 2.4328872 × 108 |
---|---|
Coefficient of variation (CV) | 0.0012085903 |
Kurtosis | -0.78300088 |
Mean | 2.0129958 × 1011 |
Median Absolute Deviation (MAD) | 2 × 108 |
Skewness | -0.19278069 |
Sum | 2.0129958 × 1015 |
Variance | 5.9189401 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
201500000000 | 1502 | |
201300000000 | 1448 | |
201400000000 | 1343 | |
201200000000 | 1248 | |
201100000000 | 1123 | |
201700000000 | 844 | |
201000000000 | 790 | |
201600000000 | 757 | |
200901000000 | 548 | 5.5% |
200801000000 | 396 | 4.0% |
Value | Count | Frequency (%) |
200400000000 | 1 | < 0.1% |
200801000000 | 396 | 4.0% |
200901000000 | 548 | 5.5% |
201000000000 | 790 | |
201100000000 | 1123 | |
201200000000 | 1248 | |
201300000000 | 1448 | |
201400000000 | 1343 | |
201500000000 | 1502 | |
201600000000 | 757 |
Value | Count | Frequency (%) |
201700000000 | 844 | |
201600000000 | 757 | |
201500000000 | 1502 | |
201400000000 | 1343 | |
201300000000 | 1448 | |
201200000000 | 1248 | |
201100000000 | 1123 | |
201000000000 | 790 | |
200901000000 | 548 | 5.5% |
200801000000 | 396 | 4.0% |
PAYMENT_KIND
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
준공금 | |
---|---|
기성금 | |
선금 | 364 |
노무비지급금 | 244 |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.0368 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 준공금 |
---|---|
2nd row | 준공금 |
3rd row | 준공금 |
4th row | 준공금 |
5th row | 기성금 |
Common Values
Value | Count | Frequency (%) |
준공금 | 8138 | |
기성금 | 1254 | 12.5% |
선금 | 364 | 3.6% |
노무비지급금 | 244 | 2.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
준공금 | 8138 | |
기성금 | 1254 | 12.5% |
선금 | 364 | 3.6% |
노무비지급금 | 244 | 2.4% |
PAYMENT_PRICE
Real number (ℝ)
Distinct | 7645 |
---|---|
Distinct (%) | 76.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19895799 |
Minimum | 10 |
---|---|
Maximum | 3.6 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 410000 |
Q1 | 2114350 |
median | 6187750 |
Q3 | 13916670 |
95-th percentile | 55190357 |
Maximum | 3.6 × 109 |
Range | 3.6 × 109 |
Interquartile range (IQR) | 11802320 |
Descriptive statistics
Standard deviation | 92797720 |
---|---|
Coefficient of variation (CV) | 4.6641867 |
Kurtosis | 496.848 |
Mean | 19895799 |
Median Absolute Deviation (MAD) | 4710855 |
Skewness | 18.758346 |
Sum | 1.9895799 × 1011 |
Variance | 8.6114169 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18000000 | 20 | 0.2% |
4500000 | 17 | 0.2% |
9500000 | 16 | 0.2% |
19000000 | 15 | 0.1% |
2700000 | 15 | 0.1% |
385000 | 14 | 0.1% |
900000 | 14 | 0.1% |
3000000 | 13 | 0.1% |
440000 | 13 | 0.1% |
1000000 | 13 | 0.1% |
Other values (7635) | 9850 |
Value | Count | Frequency (%) |
10 | 2 | |
5650 | 1 | |
16140 | 1 | |
16160 | 1 | |
17310 | 1 | |
18200 | 1 | |
20000 | 1 | |
20800 | 1 | |
27030 | 1 | |
27900 | 1 |
Value | Count | Frequency (%) |
3600000000 | 1 | |
3017033440 | 1 | |
2527000000 | 1 | |
2063000000 | 1 | |
1899600000 | 1 | |
1800000000 | 1 | |
1575000000 | 1 | |
1540725000 | 1 | |
1532900000 | 1 | |
1490307000 | 1 |
PAYMENT_DATE
Date
Distinct | 2160 |
---|---|
Distinct (%) | 21.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2008-02-26 00:00:00 |
---|---|
Maximum | 2020-01-23 00:00:00 |
MODIFY_DATE
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2020-08-21 3:31 | |
---|---|
2020-08-21 3:32 | 809 |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-08-21 3:31 |
---|---|
2nd row | 2020-08-21 3:31 |
3rd row | 2020-08-21 3:31 |
4th row | 2020-08-21 3:31 |
5th row | 2020-08-21 3:31 |
Common Values
Value | Count | Frequency (%) |
2020-08-21 3:31 | 9191 | |
2020-08-21 3:32 | 809 | 8.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-08-21 | 10000 | |
3:31 | 9191 | |
3:32 | 809 | 4.0% |
REG_DATE
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2020-08-21 3:31 | |
---|---|
2020-08-21 3:32 | 809 |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-08-21 3:31 |
---|---|
2nd row | 2020-08-21 3:31 |
3rd row | 2020-08-21 3:31 |
4th row | 2020-08-21 3:31 |
5th row | 2020-08-21 3:31 |
Common Values
Value | Count | Frequency (%) |
2020-08-21 3:31 | 9191 | |
2020-08-21 3:32 | 809 | 8.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-08-21 | 10000 | |
3:31 | 9191 | |
3:32 | 809 | 4.0% |
SEQ | CONTRACT_MNG_NO | PAYMENT_KIND | PAYMENT_PRICE | MODIFY_DATE | REG_DATE | |
---|---|---|---|---|---|---|
SEQ | 1.000 | 0.922 | 0.167 | 0.039 | 0.938 | 0.938 |
CONTRACT_MNG_NO | 0.922 | 1.000 | 0.155 | 0.035 | 0.630 | 0.630 |
PAYMENT_KIND | 0.167 | 0.155 | 1.000 | 0.201 | 0.097 | 0.097 |
PAYMENT_PRICE | 0.039 | 0.035 | 0.201 | 1.000 | 0.000 | 0.000 |
MODIFY_DATE | 0.938 | 0.630 | 0.097 | 0.000 | 1.000 | 1.000 |
REG_DATE | 0.938 | 0.630 | 0.097 | 0.000 | 1.000 | 1.000 |
REG_DATE | MODIFY_DATE | PAYMENT_KIND | |
---|---|---|---|
REG_DATE | 1.000 | 0.999 | 0.064 |
MODIFY_DATE | 0.999 | 1.000 | 0.064 |
PAYMENT_KIND | 0.064 | 0.064 | 1.000 |
SEQ | CONTRACT_MNG_NO | PAYMENT_PRICE | PAYMENT_KIND | MODIFY_DATE | REG_DATE | |
---|---|---|---|---|---|---|
SEQ | 1.000 | 0.993 | 0.006 | 0.108 | 0.977 | 0.977 |
CONTRACT_MNG_NO | 0.993 | 1.000 | 0.028 | 0.107 | 0.679 | 0.679 |
PAYMENT_PRICE | 0.006 | 0.028 | 1.000 | 0.129 | 0.000 | 0.000 |
PAYMENT_KIND | 0.108 | 0.107 | 0.129 | 1.000 | 0.064 | 0.064 |
MODIFY_DATE | 0.977 | 0.679 | 0.000 | 0.064 | 1.000 | 0.999 |
REG_DATE | 0.977 | 0.679 | 0.000 | 0.064 | 0.999 | 1.000 |
SEQ | CONTRACT_SEQ | CONTRACT_MNG_NO | PAYMENT_KIND | PAYMENT_PRICE | PAYMENT_DATE | MODIFY_DATE | REG_DATE | |
---|---|---|---|---|---|---|---|---|
4299 | 4178 | <NA> | 200901000000 | 준공금 | 5946600 | 2009-05-29 | 2020-08-21 3:31 | 2020-08-21 3:31 |
4457 | 5190 | <NA> | 200901000000 | 준공금 | 13049490 | 2009-12-23 | 2020-08-21 3:31 | 2020-08-21 3:31 |
3261 | 2890 | <NA> | 200901000000 | 준공금 | 2912000 | 2009-03-18 | 2020-08-21 3:31 | 2020-08-21 3:31 |
34057 | 42995 | <NA> | 201400000000 | 준공금 | 9914800 | 2014-03-27 | 2020-08-21 3:31 | 2020-08-21 3:31 |
27681 | 14891 | <NA> | 201100000000 | 기성금 | 11050680 | 2011-08-31 | 2020-08-21 3:31 | 2020-08-21 3:31 |
51850 | 53291 | <NA> | 201500000000 | 준공금 | 1650000 | 2015-07-06 | 2020-08-21 3:31 | 2020-08-21 3:31 |
46070 | 59340 | <NA> | 201600000000 | 선금 | 35257000 | 2016-03-31 | 2020-08-21 3:31 | 2020-08-21 3:31 |
28245 | 20763 | <NA> | 201200000000 | 준공금 | 5139000 | 2012-12-14 | 2020-08-21 3:31 | 2020-08-21 3:31 |
16075 | 28984 | <NA> | 201300000000 | 준공금 | 2068000 | 2013-06-19 | 2020-08-21 3:31 | 2020-08-21 3:31 |
39828 | 32239 | <NA> | 201300000000 | 준공금 | 16996800 | 2013-04-30 | 2020-08-21 3:31 | 2020-08-21 3:31 |
SEQ | CONTRACT_SEQ | CONTRACT_MNG_NO | PAYMENT_KIND | PAYMENT_PRICE | PAYMENT_DATE | MODIFY_DATE | REG_DATE | |
---|---|---|---|---|---|---|---|---|
19580 | 24306 | <NA> | 201200000000 | 준공금 | 2628900 | 2012-03-21 | 2020-08-21 3:31 | 2020-08-21 3:31 |
34500 | 41273 | <NA> | 201400000000 | 준공금 | 29639500 | 2014-07-08 | 2020-08-21 3:31 | 2020-08-21 3:31 |
62083 | 72820 | <NA> | 201700000000 | 준공금 | 8651870 | 2017-12-20 | 2020-08-21 3:32 | 2020-08-21 3:32 |
11590 | 11469 | <NA> | 201100000000 | 준공금 | 17860000 | 2011-03-31 | 2020-08-21 3:31 | 2020-08-21 3:31 |
21948 | 15446 | <NA> | 201100000000 | 준공금 | 351480 | 2011-04-14 | 2020-08-21 3:31 | 2020-08-21 3:31 |
18071 | 29179 | <NA> | 201300000000 | 준공금 | 2370000 | 2013-08-06 | 2020-08-21 3:31 | 2020-08-21 3:31 |
47414 | 57788 | <NA> | 201600000000 | 준공금 | 87600000 | 2016-12-27 | 2020-08-21 3:31 | 2020-08-21 3:31 |
25396 | 20091 | <NA> | 201200000000 | 노무비지급금 | 3960000 | 2012-11-12 | 2020-08-21 3:31 | 2020-08-21 3:31 |
32934 | 40795 | <NA> | 201400000000 | 준공금 | 12017500 | 2015-02-04 | 2020-08-21 3:31 | 2020-08-21 3:31 |
57341 | 46101 | <NA> | 201500000000 | 준공금 | 6620000 | 2015-05-07 | 2020-08-21 3:31 | 2020-08-21 3:31 |