Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 50 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.8 KiB |
Average record size in memory | 37.6 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | Korea Credit Bureau (코리아크레딧뷰로) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=1e1ac460-bf24-11ea-8b67-7b32ce18203a |
BASE_YM has constant value "202106" | Constant |
SIGNGU_CD is highly overall correlated with THIRTY_DAY_ABOVE_ARRRG_NMPR_CO | High correlation |
THIRTY_DAY_ABOVE_ARRRG_NMPR_CO is highly overall correlated with SIGNGU_CD | High correlation |
SIGNGU_CD has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:53:57.605526 |
---|---|
Analysis finished | 2023-12-10 09:53:59.117749 |
Duration | 1.51 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
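A report of this kind could be regenerated along the following lines. This is a minimal sketch, not the original run: the helper name `build_report` and the file names are hypothetical, the referenced config.json is not reproduced here (so default settings are used), and reading `BASE_YM` as text is an assumption made so that the column is profiled as categorical, as it is in this report.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file with ydata-profiling and write an HTML report.

    The imports are deferred so the function can be defined even where
    pandas and ydata-profiling are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: keep BASE_YM (e.g. "202106") as text rather than an integer.
    df = pd.read_csv(csv_path, dtype={"BASE_YM": str})

    # Default configuration; the original analysis used its own config.json.
    profile = ProfileReport(df, title="Profiling report")
    profile.to_file(out_path)
```

A call such as `build_report("sample.csv")` (hypothetical file name) would write `report.html` next to the script.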
BASE_YM
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 532.0 B |
202106 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
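The difference between "Distinct" and "Unique" is easy to miss. The sketch below assumes that "Distinct" counts the different values in a column and "Unique" counts the values that occur exactly once, which is consistent with the figures reported for BASE_YM (1 distinct, 0 unique). The function name `distinct_and_unique` is illustrative only.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, distinct_pct, unique, unique_pct) for a column."""
    counts = Counter(values)
    n = len(values)
    distinct = len(counts)                              # different values
    unique = sum(1 for c in counts.values() if c == 1)  # values seen once
    return distinct, 100 * distinct / n, unique, 100 * unique / n


# BASE_YM as profiled here: 50 observations, all equal to "202106".
base_ym = ["202106"] * 50
print(distinct_and_unique(base_ym))  # (1, 2.0, 0, 0.0)
```

Under the same reading, a column in which one value appears twice and all others once (as THIRTY_DAY_ABOVE_ARRRG_NMPR_CO does, with 1809 occurring twice) has one fewer distinct value than observations.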
Sample
1st row | 202106 |
---|---|
2nd row | 202106 |
3rd row | 202106 |
4th row | 202106 |
5th row | 202106 |
Common Values
Value | Count | Frequency (%) |
202106 | 50 | 100.0% |
SIGNGU_NM
Text
Distinct | 45 |
---|---|
Distinct (%) | 90.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 532.0 B |
Value | Count | Frequency (%) |
중구 | 3 | 5.2% |
북구 | 3 | 5.2% |
남구 | 2 | 3.4% |
동구 | 2 | 3.4% |
고양시 | 2 | 3.4% |
수원시 | 2 | 3.4% |
보령시 | 1 | 1.7% |
인제군 | 1 | 1.7% |
진안군 | 1 | 1.7% |
완도군 | 1 | 1.7% |
Other values (40) | 40 | 69.0% |
Most occurring characters
Value | Count | Frequency (%) |
구 | 27 | 15.6% |
군 | 18 | 10.4% |
시 | 14 | 8.1% |
(space) | 8 | 4.6% |
양 | 7 | 4.0% |
영 | 5 | 2.9% |
중 | 4 | 2.3% |
수 | 4 | 2.3% |
안 | 4 | 2.3% |
성 | 4 | 2.3% |
Other values (48) | 78 | 45.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 165 | 95.4% |
Space Separator | 8 | 4.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 27 | 16.4% |
군 | 18 | 10.9% |
시 | 14 | 8.5% |
양 | 7 | 4.2% |
영 | 5 | 3.0% |
중 | 4 | 2.4% |
수 | 4 | 2.4% |
안 | 4 | 2.4% |
성 | 4 | 2.4% |
정 | 4 | 2.4% |
Other values (47) | 74 | 44.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 8 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 165 | 95.4% |
Common | 8 | 4.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 27 | 16.4% |
군 | 18 | 10.9% |
시 | 14 | 8.5% |
양 | 7 | 4.2% |
영 | 5 | 3.0% |
중 | 4 | 2.4% |
수 | 4 | 2.4% |
안 | 4 | 2.4% |
성 | 4 | 2.4% |
정 | 4 | 2.4% |
Other values (47) | 74 | 44.8% |
Common
Value | Count | Frequency (%) |
(space) | 8 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 165 | 95.4% |
ASCII | 8 | 4.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
구 | 27 | 16.4% |
군 | 18 | 10.9% |
시 | 14 | 8.5% |
양 | 7 | 4.2% |
영 | 5 | 3.0% |
중 | 4 | 2.4% |
수 | 4 | 2.4% |
안 | 4 | 2.4% |
성 | 4 | 2.4% |
정 | 4 | 2.4% |
Other values (47) | 74 | 44.8% |
ASCII
Value | Count | Frequency (%) |
(space) | 8 | 100.0% |
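The word and character breakdowns above can be approximated with the Python standard library. The sketch below is an illustration on five SIGNGU_NM values taken from the sample rows of this report, not the profiler's own implementation. It assumes that words are obtained by splitting on whitespace, which appears consistent with the word frequencies above, and it uses Unicode general categories, where "Lo" is Other Letter and "Zs" is Space Separator.

```python
import unicodedata
from collections import Counter

# A few SIGNGU_NM values taken from the sample rows of this report.
names = ["진안군", "포항시 북구", "북구", "수원시 영통구", "고양시 덕양구"]

# Word counts: split each name on whitespace.
words = Counter(word for name in names for word in name.split())

# Character counts grouped by Unicode general category
# ("Lo" = Other Letter, "Zs" = Space Separator).
categories = Counter(unicodedata.category(ch) for name in names for ch in name)

total_chars = sum(categories.values())
for cat, count in categories.most_common():
    print(f"{cat}: {count} ({100 * count / total_chars:.1f}%)")
print(words.most_common(1))
```

On the full column the same grouping yields the 165 Other Letter and 8 Space Separator characters listed above, 173 characters in total, which is the denominator behind the character percentages.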
SIGNGU_CD
Real number (ℝ)
HIGH CORRELATION
UNIQUE
Distinct | 50 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36366.82 |
Minimum | 11215 |
---|---|
Maximum | 48740 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 582.0 B |
Quantile statistics
Minimum | 11215 |
---|---|
5-th percentile | 11300.5 |
Q1 | 27828.75 |
median | 41284 |
Q3 | 45717.5 |
95-th percentile | 47765.5 |
Maximum | 48740 |
Range | 37525 |
Interquartile range (IQR) | 17888.75 |
Descriptive statistics
Standard deviation | 11522.14 |
---|---|
Coefficient of variation (CV) | 0.31683112 |
Kurtosis | -0.20031943 |
Mean | 36366.82 |
Median Absolute Deviation (MAD) | 6441 |
Skewness | -0.91769709 |
Sum | 1818341 |
Variance | 1.3275972 × 10⁸ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
45720 | 1 | 2.0% |
26200 | 1 | 2.0% |
27110 | 1 | 2.0% |
41131 | 1 | 2.0% |
11215 | 1 | 2.0% |
11260 | 1 | 2.0% |
41115 | 1 | 2.0% |
30110 | 1 | 2.0% |
44180 | 1 | 2.0% |
46780 | 1 | 2.0% |
Other values (40) | 40 | 80.0% |
Minimum 10 values
Value | Count | Frequency (%) |
11215 | 1 | 2.0% |
11230 | 1 | 2.0% |
11260 | 1 | 2.0% |
11350 | 1 | 2.0% |
11500 | 1 | 2.0% |
26110 | 1 | 2.0% |
26200 | 1 | 2.0% |
26290 | 1 | 2.0% |
26320 | 1 | 2.0% |
26410 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
48740 | 1 | 2.0% |
47830 | 1 | 2.0% |
47770 | 1 | 2.0% |
47760 | 1 | 2.0% |
47730 | 1 | 2.0% |
47720 | 1 | 2.0% |
47250 | 1 | 2.0% |
47113 | 1 | 2.0% |
46910 | 1 | 2.0% |
46890 | 1 | 2.0% |
THIRTY_DAY_ABOVE_ARRRG_NMPR_CO
Real number (ℝ)
HIGH CORRELATION
Distinct | 49 |
---|---|
Distinct (%) | 98.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1327.58 |
Minimum | 75 |
---|---|
Maximum | 3354 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 582.0 B |
Quantile statistics
Minimum | 75 |
---|---|
5-th percentile | 173.85 |
Q1 | 365 |
median | 1518 |
Q3 | 2039.75 |
95-th percentile | 2962.65 |
Maximum | 3354 |
Range | 3279 |
Interquartile range (IQR) | 1674.75 |
Descriptive statistics
Standard deviation | 970.42492 |
---|---|
Coefficient of variation (CV) | 0.73097284 |
Kurtosis | -1.1052517 |
Mean | 1327.58 |
Median Absolute Deviation (MAD) | 932.5 |
Skewness | 0.30460822 |
Sum | 66379 |
Variance | 941724.53 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1809 | 2 | 4.0% |
168 | 1 | 2.0% |
1043 | 1 | 2.0% |
2090 | 1 | 2.0% |
610 | 1 | 2.0% |
2128 | 1 | 2.0% |
2884 | 1 | 2.0% |
1653 | 1 | 2.0% |
746 | 1 | 2.0% |
250 | 1 | 2.0% |
Other values (39) | 39 | 78.0% |
Minimum 10 values
Value | Count | Frequency (%) |
75 | 1 | 2.0% |
102 | 1 | 2.0% |
168 | 1 | 2.0% |
181 | 1 | 2.0% |
186 | 1 | 2.0% |
197 | 1 | 2.0% |
206 | 1 | 2.0% |
232 | 1 | 2.0% |
236 | 1 | 2.0% |
250 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
3354 | 1 | 2.0% |
3171 | 1 | 2.0% |
3027 | 1 | 2.0% |
2884 | 1 | 2.0% |
2809 | 1 | 2.0% |
2666 | 1 | 2.0% |
2475 | 1 | 2.0% |
2366 | 1 | 2.0% |
2289 | 1 | 2.0% |
2136 | 1 | 2.0% |
| | SIGNGU_NM | SIGNGU_CD | THIRTY_DAY_ABOVE_ARRRG_NMPR_CO |
---|---|---|---|
SIGNGU_NM | 1.000 | 0.722 | 0.812 |
SIGNGU_CD | 0.722 | 1.000 | 0.525 |
THIRTY_DAY_ABOVE_ARRRG_NMPR_CO | 0.812 | 0.525 | 1.000 |
| | SIGNGU_CD | THIRTY_DAY_ABOVE_ARRRG_NMPR_CO |
---|---|---|
SIGNGU_CD | 1.000 | -0.666 |
THIRTY_DAY_ABOVE_ARRRG_NMPR_CO | -0.666 | 1.000 |
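The text here does not state which coefficient each matrix holds. As one way to sanity-check the negative relationship between the two numeric columns, the sketch below computes Spearman's rank correlation by hand on the ten sample rows with index 0 to 9 listed in this report. It is an illustration only: the helper names are hypothetical, ties are not handled (none occur in these rows), and a result on ten rows is not expected to reproduce a value computed on all 50 observations.

```python
def ranks(values):
    """Rank values from 1 to n (assumes no ties, true for the rows below)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# SIGNGU_CD and THIRTY_DAY_ABOVE_ARRRG_NMPR_CO for sample rows 0 to 9.
signgu_cd = [45720, 47113, 27230, 47760, 47830, 11230, 29200, 47770, 47250, 46110]
thirty_day_co = [168, 2115, 3027, 75, 236, 2136, 3171, 326, 480, 2289]

rho = spearman(signgu_cd, thirty_day_co)
print(f"Spearman rho on 10 sample rows: {rho:.3f}")
```

The sign comes out negative on these rows: the higher district codes in the sample are mostly rural counties (군) with small counts.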
First rows
| | BASE_YM | SIGNGU_NM | SIGNGU_CD | THIRTY_DAY_ABOVE_ARRRG_NMPR_CO |
---|---|---|---|---|
0 | 202106 | 진안군 | 45720 | 168 |
1 | 202106 | 포항시 북구 | 47113 | 2115 |
2 | 202106 | 북구 | 27230 | 3027 |
3 | 202106 | 영양군 | 47760 | 75 |
4 | 202106 | 고령군 | 47830 | 236 |
5 | 202106 | 동대문구 | 11230 | 2136 |
6 | 202106 | 광산구 | 29200 | 3171 |
7 | 202106 | 영덕군 | 47770 | 326 |
8 | 202106 | 상주시 | 47250 | 480 |
9 | 202106 | 목포시 | 46110 | 2289 |
Last rows
| | BASE_YM | SIGNGU_NM | SIGNGU_CD | THIRTY_DAY_ABOVE_ARRRG_NMPR_CO |
---|---|---|---|---|
40 | 202106 | 의성군 | 47730 | 259 |
41 | 202106 | 수원시 영통구 | 41117 | 1401 |
42 | 202106 | 양양군 | 42830 | 186 |
43 | 202106 | 달성군 | 27710 | 1728 |
44 | 202106 | 동구 | 31170 | 1659 |
45 | 202106 | 금정구 | 26410 | 1558 |
46 | 202106 | 인제군 | 42810 | 181 |
47 | 202106 | 중구 | 26110 | 503 |
48 | 202106 | 중구 | 30140 | 1821 |
49 | 202106 | 고양시 덕양구 | 41281 | 2666 |