Overview

Dataset statistics

Number of variables14
Number of observations200
Missing cells164
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.2 KiB
Average record size in memory118.7 B

Variable types

Text5
Numeric6
Categorical3

Alerts

BANK_CD is highly overall correlated with REG_DATE and 2 other fieldsHigh correlation
BANK_NM is highly overall correlated with REG_DATE and 2 other fieldsHigh correlation
REG_DATE is highly overall correlated with BANK_CD and 2 other fieldsHigh correlation
HOUS_ID is highly overall correlated with BLD_CDHigh correlation
BLD_CD is highly overall correlated with HOUS_IDHigh correlation
SHOP_CLSS is highly overall correlated with REG_DATE and 2 other fieldsHigh correlation
SHOP_CLSS is highly imbalanced (75.8%)Imbalance
REG_DATE has 164 (82.0%) missing valuesMissing
BSHOP_CD has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:39:24.342074
Analysis finished2023-12-10 06:39:47.971471
Duration23.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

BSHOP_CD
Text

UNIQUE 

Distinct200
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:39:48.417912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters1200
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)100.0%

Sample

1st rowA00050
2nd rowA00051
3rd rowA00052
4th rowA00053
5th rowA00054
ValueCountFrequency (%)
a00050 1
 
0.5%
a09958 1
 
0.5%
a09969 1
 
0.5%
a09949 1
 
0.5%
a09950 1
 
0.5%
a09951 1
 
0.5%
a09952 1
 
0.5%
a09953 1
 
0.5%
a09954 1
 
0.5%
a09955 1
 
0.5%
Other values (190) 190
95.0%
2023-12-10T15:39:49.181226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 328
27.3%
9 292
24.3%
A 200
16.7%
8 91
 
7.6%
5 51
 
4.2%
6 49
 
4.1%
1 47
 
3.9%
7 47
 
3.9%
4 34
 
2.8%
2 31
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1000
83.3%
Uppercase Letter 200
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 328
32.8%
9 292
29.2%
8 91
 
9.1%
5 51
 
5.1%
6 49
 
4.9%
1 47
 
4.7%
7 47
 
4.7%
4 34
 
3.4%
2 31
 
3.1%
3 30
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
A 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1000
83.3%
Latin 200
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 328
32.8%
9 292
29.2%
8 91
 
9.1%
5 51
 
5.1%
6 49
 
4.9%
1 47
 
4.7%
7 47
 
4.7%
4 34
 
3.4%
2 31
 
3.1%
3 30
 
3.0%
Latin
ValueCountFrequency (%)
A 200
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1200
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 328
27.3%
9 292
24.3%
A 200
16.7%
8 91
 
7.6%
5 51
 
4.2%
6 49
 
4.1%
1 47
 
3.9%
7 47
 
3.9%
4 34
 
2.8%
2 31
 
2.6%
Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:39:49.678994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length7.59
Min length2

Characters and Unicode

Total characters1518
Distinct characters163
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique198 ?
Unique (%)99.0%

Sample

1st row영업부
2nd row오창
3rd row용인
4th row울산
5th row원주
ValueCountFrequency (%)
남광주농협 17
 
5.4%
남동농협 13
 
4.1%
김천농협 11
 
3.5%
김해농협 10
 
3.2%
남대전농협 9
 
2.9%
기흥농협 8
 
2.5%
김제농협 7
 
2.2%
김포축산농협 7
 
2.2%
김해축산농협 6
 
1.9%
낙생농협 6
 
1.9%
Other values (193) 220
70.1%
2023-12-10T15:39:50.307678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
138
 
9.1%
135
 
8.9%
132
 
8.7%
121
 
8.0%
114
 
7.5%
77
 
5.1%
60
 
4.0%
43
 
2.8%
38
 
2.5%
32
 
2.1%
Other values (153) 628
41.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1396
92.0%
Space Separator 114
 
7.5%
Close Punctuation 3
 
0.2%
Open Punctuation 3
 
0.2%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
138
 
9.9%
135
 
9.7%
132
 
9.5%
121
 
8.7%
77
 
5.5%
60
 
4.3%
43
 
3.1%
38
 
2.7%
32
 
2.3%
30
 
2.1%
Other values (148) 590
42.3%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
114
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1396
92.0%
Common 122
 
8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
138
 
9.9%
135
 
9.7%
132
 
9.5%
121
 
8.7%
77
 
5.5%
60
 
4.3%
43
 
3.1%
38
 
2.7%
32
 
2.3%
30
 
2.1%
Other values (148) 590
42.3%
Common
ValueCountFrequency (%)
114
93.4%
) 3
 
2.5%
( 3
 
2.5%
> 1
 
0.8%
< 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1396
92.0%
ASCII 122
 
8.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
138
 
9.9%
135
 
9.7%
132
 
9.5%
121
 
8.7%
77
 
5.5%
60
 
4.3%
43
 
3.1%
38
 
2.7%
32
 
2.3%
30
 
2.1%
Other values (148) 590
42.3%
ASCII
ValueCountFrequency (%)
114
93.4%
) 3
 
2.5%
( 3
 
2.5%
> 1
 
0.8%
< 1
 
0.8%
Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:39:50.860634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length17.81
Min length13

Characters and Unicode

Total characters3562
Distinct characters215
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique198 ?
Unique (%)99.0%

Sample

1st row서울특별시 영등포구 은행로 14
2nd row충청북도 청주시 청원구 오창읍 중심상업로 47
3rd row경기도 용인시 기흥구 동백중앙로 269
4th row울산광역시 남구 봉월로 56
5th row강원도 원주시 건강로 1
ValueCountFrequency (%)
경기도 38
 
4.4%
경상남도 26
 
3.0%
경상북도 26
 
3.0%
김해시 21
 
2.5%
김천시 20
 
2.3%
광주광역시 19
 
2.2%
인천광역시 18
 
2.1%
전라남도 16
 
1.9%
김포시 15
 
1.8%
서울특별시 15
 
1.8%
Other values (410) 641
75.0%
2023-12-10T15:39:51.690985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
655
 
18.4%
192
 
5.4%
179
 
5.0%
137
 
3.8%
1 126
 
3.5%
105
 
2.9%
2 98
 
2.8%
94
 
2.6%
93
 
2.6%
79
 
2.2%
Other values (205) 1804
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2299
64.5%
Space Separator 655
 
18.4%
Decimal Number 595
 
16.7%
Dash Punctuation 13
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
 
8.4%
179
 
7.8%
137
 
6.0%
105
 
4.6%
94
 
4.1%
93
 
4.0%
79
 
3.4%
73
 
3.2%
59
 
2.6%
53
 
2.3%
Other values (193) 1235
53.7%
Decimal Number
ValueCountFrequency (%)
1 126
21.2%
2 98
16.5%
5 60
10.1%
8 52
8.7%
6 50
 
8.4%
4 48
 
8.1%
0 43
 
7.2%
7 40
 
6.7%
9 39
 
6.6%
3 39
 
6.6%
Space Separator
ValueCountFrequency (%)
655
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2299
64.5%
Common 1263
35.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
 
8.4%
179
 
7.8%
137
 
6.0%
105
 
4.6%
94
 
4.1%
93
 
4.0%
79
 
3.4%
73
 
3.2%
59
 
2.6%
53
 
2.3%
Other values (193) 1235
53.7%
Common
ValueCountFrequency (%)
655
51.9%
1 126
 
10.0%
2 98
 
7.8%
5 60
 
4.8%
8 52
 
4.1%
6 50
 
4.0%
4 48
 
3.8%
0 43
 
3.4%
7 40
 
3.2%
9 39
 
3.1%
Other values (2) 52
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2299
64.5%
ASCII 1263
35.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
655
51.9%
1 126
 
10.0%
2 98
 
7.8%
5 60
 
4.8%
8 52
 
4.1%
6 50
 
4.0%
4 48
 
3.8%
0 43
 
3.4%
7 40
 
3.2%
9 39
 
3.1%
Other values (2) 52
 
4.1%
Hangul
ValueCountFrequency (%)
192
 
8.4%
179
 
7.8%
137
 
6.0%
105
 
4.6%
94
 
4.1%
93
 
4.0%
79
 
3.4%
73
 
3.2%
59
 
2.6%
53
 
2.3%
Other values (193) 1235
53.7%

X_AXIS
Real number (ℝ)

Distinct198
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean346801.29
Minimum220249
Maximum522300
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:39:51.914419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum220249
5-th percentile282296.35
Q1288854.75
median318186.5
Q3409948.25
95-th percentile481723.1
Maximum522300
Range302051
Interquartile range (IQR)121093.5

Descriptive statistics

Standard deviation72411.084
Coefficient of variation (CV)0.20879704
Kurtosis-0.43145265
Mean346801.29
Median Absolute Deviation (MAD)32516.5
Skewness0.94840365
Sum69360258
Variance5.2433651 × 109
MonotonicityNot monotonic
2023-12-10T15:39:52.138711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
273086 2
 
1.0%
282605 2
 
1.0%
282797 1
 
0.5%
282309 1
 
0.5%
283131 1
 
0.5%
283676 1
 
0.5%
283839 1
 
0.5%
283487 1
 
0.5%
283263 1
 
0.5%
282543 1
 
0.5%
Other values (188) 188
94.0%
ValueCountFrequency (%)
220249 1
0.5%
259000 1
0.5%
266211 1
0.5%
266922 1
0.5%
273086 2
1.0%
273395 1
0.5%
278746 1
0.5%
281833 1
0.5%
282056 1
0.5%
282309 1
0.5%
ValueCountFrequency (%)
522300 1
0.5%
518626 1
0.5%
511499 1
0.5%
510682 1
0.5%
509904 1
0.5%
503260 1
0.5%
499611 1
0.5%
483462 1
0.5%
482590 1
0.5%
481820 1
0.5%

Y_AXIS
Real number (ℝ)

Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean414410.45
Minimum99522
Maximum627961
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:39:52.341915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum99522
5-th percentile271724.05
Q1292900
median391645
Q3539871.5
95-th percentile562127.65
Maximum627961
Range528439
Interquartile range (IQR)246971.5

Descriptive statistics

Standard deviation118141.78
Coefficient of variation (CV)0.28508398
Kurtosis-1.2293577
Mean414410.45
Median Absolute Deviation (MAD)109849
Skewness-0.026627938
Sum82882089
Variance1.395748 × 1010
MonotonicityNot monotonic
2023-12-10T15:39:52.549748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
287809 2
 
1.0%
267044 1
 
0.5%
275424 1
 
0.5%
374501 1
 
0.5%
270673 1
 
0.5%
273221 1
 
0.5%
271777 1
 
0.5%
271269 1
 
0.5%
271109 1
 
0.5%
269329 1
 
0.5%
Other values (189) 189
94.5%
ValueCountFrequency (%)
99522 1
0.5%
107201 1
0.5%
225128 1
0.5%
260992 1
0.5%
267044 1
0.5%
269329 1
0.5%
269873 1
0.5%
270673 1
0.5%
271109 1
0.5%
271269 1
0.5%
ValueCountFrequency (%)
627961 1
0.5%
626372 1
0.5%
626289 1
0.5%
618281 1
0.5%
585029 1
0.5%
572433 1
0.5%
571112 1
0.5%
569917 1
0.5%
568354 1
0.5%
562159 1
0.5%

BLK_CD
Real number (ℝ)

Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean251615.94
Minimum19839
Maximum516740
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:39:52.777427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19839
5-th percentile39444.5
Q195924.25
median190075.5
Q3425870.75
95-th percentile495431.8
Maximum516740
Range496901
Interquartile range (IQR)329946.5

Descriptive statistics

Standard deviation168480.63
Coefficient of variation (CV)0.66959444
Kurtosis-1.5935925
Mean251615.94
Median Absolute Deviation (MAD)139041.5
Skewness0.18224278
Sum50323188
Variance2.8385724 × 1010
MonotonicityNot monotonic
2023-12-10T15:39:53.095441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
69553 2
 
1.0%
121572 1
 
0.5%
69696 1
 
0.5%
48012 1
 
0.5%
495163 1
 
0.5%
63585 1
 
0.5%
63524 1
 
0.5%
62748 1
 
0.5%
63600 1
 
0.5%
63488 1
 
0.5%
Other values (189) 189
94.5%
ValueCountFrequency (%)
19839 1
0.5%
20164 1
0.5%
23713 1
0.5%
23890 1
0.5%
26735 1
0.5%
26831 1
0.5%
28868 1
0.5%
30223 1
0.5%
35557 1
0.5%
37934 1
0.5%
ValueCountFrequency (%)
516740 1
0.5%
512980 1
0.5%
511602 1
0.5%
510854 1
0.5%
508415 1
0.5%
507890 1
0.5%
507688 1
0.5%
507535 1
0.5%
499534 1
0.5%
497081 1
0.5%

BANK_CD
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
93
136 
2
36 
11
26 
95
 
2

Length

Max length2
Median length2
Mean length1.82
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
93 136
68.0%
2 36
 
18.0%
11 26
 
13.0%
95 2
 
1.0%

Length

2023-12-10T15:39:53.453286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:53.636747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
93 136
68.0%
2 36
 
18.0%
11 26
 
13.0%
95 2
 
1.0%

BANK_NM
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
지역농협
136 
KDB산업은행
36 
NH농협은행
26 
단위수협
 
2

Length

Max length7
Median length4
Mean length4.8
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKDB산업은행
2nd rowKDB산업은행
3rd rowKDB산업은행
4th rowKDB산업은행
5th rowKDB산업은행

Common Values

ValueCountFrequency (%)
지역농협 136
68.0%
KDB산업은행 36
 
18.0%
NH농협은행 26
 
13.0%
단위수협 2
 
1.0%

Length

2023-12-10T15:39:53.841353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:54.021977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지역농협 136
68.0%
kdb산업은행 36
 
18.0%
nh농협은행 26
 
13.0%
단위수협 2
 
1.0%
Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:39:54.371541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length36
Mean length19.22
Min length11

Characters and Unicode

Total characters3844
Distinct characters253
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique198 ?
Unique (%)99.0%

Sample

1st row서울특별시 영등포구 은행로 14
2nd row충청북도 청주시 청원구 오창읍 중심상업로 47
3rd row경기도 용인시 기흥구 동백중앙로 269
4th row울산광역시 남구 봉월로 56
5th row강원도 원주시 건강로 1
ValueCountFrequency (%)
경기도 33
 
3.7%
경상북도 23
 
2.5%
김해시 21
 
2.3%
김천시 20
 
2.2%
경상남도 20
 
2.2%
광주광역시 15
 
1.7%
서울특별시 15
 
1.7%
김포시 15
 
1.7%
서구 13
 
1.4%
남구 13
 
1.4%
Other values (461) 714
79.2%
2023-12-10T15:39:54.972463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
729
 
19.0%
183
 
4.8%
178
 
4.6%
1 137
 
3.6%
119
 
3.1%
2 109
 
2.8%
106
 
2.8%
95
 
2.5%
94
 
2.4%
80
 
2.1%
Other values (243) 2014
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2427
63.1%
Space Separator 729
 
19.0%
Decimal Number 633
 
16.5%
Open Punctuation 18
 
0.5%
Close Punctuation 18
 
0.5%
Dash Punctuation 14
 
0.4%
Uppercase Letter 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
183
 
7.5%
178
 
7.3%
119
 
4.9%
106
 
4.4%
95
 
3.9%
94
 
3.9%
80
 
3.3%
65
 
2.7%
55
 
2.3%
53
 
2.2%
Other values (224) 1399
57.6%
Decimal Number
ValueCountFrequency (%)
1 137
21.6%
2 109
17.2%
5 61
9.6%
8 54
 
8.5%
6 52
 
8.2%
0 50
 
7.9%
4 48
 
7.6%
9 41
 
6.5%
7 41
 
6.5%
3 40
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
T 1
20.0%
P 1
20.0%
A 1
20.0%
H 1
20.0%
L 1
20.0%
Space Separator
ValueCountFrequency (%)
729
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2427
63.1%
Common 1412
36.7%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
183
 
7.5%
178
 
7.3%
119
 
4.9%
106
 
4.4%
95
 
3.9%
94
 
3.9%
80
 
3.3%
65
 
2.7%
55
 
2.3%
53
 
2.2%
Other values (224) 1399
57.6%
Common
ValueCountFrequency (%)
729
51.6%
1 137
 
9.7%
2 109
 
7.7%
5 61
 
4.3%
8 54
 
3.8%
6 52
 
3.7%
0 50
 
3.5%
4 48
 
3.4%
9 41
 
2.9%
7 41
 
2.9%
Other values (4) 90
 
6.4%
Latin
ValueCountFrequency (%)
T 1
20.0%
P 1
20.0%
A 1
20.0%
H 1
20.0%
L 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2427
63.1%
ASCII 1417
36.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
729
51.4%
1 137
 
9.7%
2 109
 
7.7%
5 61
 
4.3%
8 54
 
3.8%
6 52
 
3.7%
0 50
 
3.5%
4 48
 
3.4%
9 41
 
2.9%
7 41
 
2.9%
Other values (9) 95
 
6.7%
Hangul
ValueCountFrequency (%)
183
 
7.5%
178
 
7.3%
119
 
4.9%
106
 
4.4%
95
 
3.9%
94
 
3.9%
80
 
3.3%
65
 
2.7%
55
 
2.3%
53
 
2.2%
Other values (224) 1399
57.6%

REG_DATE
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct30
Distinct (%)83.3%
Missing164
Missing (%)82.0%
Infinite0
Infinite (%)0.0%
Mean19949456
Minimum19540401
Maximum20160302
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:39:55.203883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19540401
5-th percentile19540401
Q119885772
median20010862
Q320110218
95-th percentile20150405
Maximum20160302
Range619901
Interquartile range (IQR)224445.75

Descriptive statistics

Standard deviation193430.12
Coefficient of variation (CV)0.0096960098
Kurtosis-0.0082121285
Mean19949456
Median Absolute Deviation (MAD)109998.5
Skewness-0.96572598
Sum7.1818041 × 108
Variance3.7415211 × 1010
MonotonicityNot monotonic
2023-12-10T15:39:55.521248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
19540401 4
 
2.0%
20150102 2
 
1.0%
19920824 2
 
1.0%
20110214 2
 
1.0%
19911025 1
 
0.5%
20050509 1
 
0.5%
19900909 1
 
0.5%
20050613 1
 
0.5%
19871209 1
 
0.5%
20150630 1
 
0.5%
Other values (20) 20
 
10.0%
(Missing) 164
82.0%
ValueCountFrequency (%)
19540401 4
2.0%
19680201 1
 
0.5%
19730510 1
 
0.5%
19780615 1
 
0.5%
19860418 1
 
0.5%
19871209 1
 
0.5%
19890626 1
 
0.5%
19890707 1
 
0.5%
19891028 1
 
0.5%
19900909 1
 
0.5%
ValueCountFrequency (%)
20160302 1
0.5%
20150630 1
0.5%
20150330 1
0.5%
20150102 2
1.0%
20120906 1
0.5%
20120427 1
0.5%
20120323 1
0.5%
20110228 1
0.5%
20110214 2
1.0%
20100326 1
0.5%

SHOP_CLSS
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
지점
192 
출장소
 
8

Length

Max length3
Median length2
Mean length2.04
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지점
2nd row지점
3rd row지점
4th row지점
5th row지점

Common Values

ValueCountFrequency (%)
지점 192
96.0%
출장소 8
 
4.0%

Length

2023-12-10T15:39:55.888197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:39:56.100797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지점 192
96.0%
출장소 8
 
4.0%

HOUS_ID
Real number (ℝ)

HIGH CORRELATION 

Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.8350862 × 1018
Minimum1.1110125 × 1018
Maximum5.0110256 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:39:56.722013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1110125 × 1018
5-th percentile1.1650108 × 1018
Q12.9155108 × 1018
median4.1570106 × 1018
Q34.7150104 × 1018
95-th percentile4.8250118 × 1018
Maximum5.0110256 × 1018
Range3.9000131 × 1018
Interquartile range (IQR)1.7994996 × 1018

Descriptive statistics

Standard deviation1.0661138 × 1018
Coefficient of variation (CV)0.27798952
Kurtosis0.45133262
Mean3.8350862 × 1018
Median Absolute Deviation (MAD)5.5802497 × 1017
Skewness-1.1413022
Sum-7.7460129 × 1018
Variance1.1365986 × 1036
MonotonicityNot monotonic
2023-12-10T15:39:56.929268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4686037024008220031 2
 
1.0%
4617012800001880011 1
 
0.5%
4686035026008560003 1
 
0.5%
4513014400008050001 1
 
0.5%
4617011100000600010 1
 
0.5%
4617011600003160000 1
 
0.5%
4617012200013900001 1
 
0.5%
4617011500000910000 1
 
0.5%
4617011500000610001 1
 
0.5%
4617010200011000000 1
 
0.5%
Other values (189) 189
94.5%
ValueCountFrequency (%)
1111012500000140000 1
0.5%
1114011800001180000 1
0.5%
1154510100003710037 1
0.5%
1154510200009540004 1
0.5%
1156011000000160003 1
0.5%
1162010200006070073 1
0.5%
1165010200000110001 1
0.5%
1165010300007600000 1
0.5%
1165010600000360014 1
0.5%
1165010600000500006 1
0.5%
ValueCountFrequency (%)
5011025634016340000 1
0.5%
5011012200037840008 1
0.5%
4888038021010170000 1
0.5%
4888037021005560001 1
0.5%
4882040021008190019 1
0.5%
4825032028006380003 1
0.5%
4825032025012750001 1
0.5%
4825013100013340002 1
0.5%
4825012500011610001 1
0.5%
4825012000001610024 1
0.5%

BLD_CD
Real number (ℝ)

HIGH CORRELATION 

Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.8396805 × 1024
Minimum1.1110125 × 1024
Maximum5.0110256 × 1024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:39:57.139777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1110125 × 1024
5-th percentile1.1650108 × 1024
Q12.9155108 × 1024
median4.1570106 × 1024
Q34.7150104 × 1024
95-th percentile4.8250118 × 1024
Maximum5.0110256 × 1024
Range3.9000131 × 1024
Interquartile range (IQR)1.7994996 × 1024

Descriptive statistics

Standard deviation1.0671 × 1024
Coefficient of variation (CV)0.27791376
Kurtosis0.45933568
Mean3.8396805 × 1024
Median Absolute Deviation (MAD)5.5802492 × 1023
Skewness-1.1499002
Sum7.679361 × 1026
Variance1.1387025 × 1048
MonotonicityNot monotonic
2023-12-10T15:39:57.405733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.68603702410822e+24 2
 
1.0%
4.61701280010188e+24 1
 
0.5%
4.68603502610856e+24 1
 
0.5%
4.5130144001080496e+24 1
 
0.5%
4.6170111001006e+24 1
 
0.5%
4.61701160010316e+24 1
 
0.5%
4.6170122001138895e+24 1
 
0.5%
4.61701150010088e+24 1
 
0.5%
4.61701150010061e+24 1
 
0.5%
4.617010200111e+24 1
 
0.5%
Other values (189) 189
94.5%
ValueCountFrequency (%)
1.11101250010014e+24 1
0.5%
1.11401180010118e+24 1
0.5%
1.15451010010371e+24 1
0.5%
1.15451020010954e+24 1
0.5%
1.15601100010016e+24 1
0.5%
1.16201020010607e+24 1
0.5%
1.16501020010011e+24 1
0.5%
1.1650103001076e+24 1
0.5%
1.16501060010036e+24 1
0.5%
1.1650106001005e+24 1
0.5%
ValueCountFrequency (%)
5.01102563401634e+24 1
0.5%
5.01101220013784e+24 1
0.5%
4.88803802111017e+24 1
0.5%
4.88803702110556e+24 1
0.5%
4.88204002110819e+24 1
0.5%
4.82503202810638e+24 1
0.5%
4.8250320251127505e+24 1
0.5%
4.82503102911334e+24 1
0.5%
4.82503102610039e+24 1
0.5%
4.8250120001016096e+24 1
0.5%
Distinct199
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:39:57.894286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length20.815
Min length16

Characters and Unicode

Total characters4163
Distinct characters181
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique198 ?
Unique (%)99.0%

Sample

1st row서울특별시 영등포구 여의도동 16-3번지
2nd row충청북도 청주시 청원구 오창읍 양청리 792-1번지
3rd row경기도 용인시 기흥구 중동 830-3번지
4th row울산광역시 남구 신정동 1156-4번지
5th row강원도 원주시 반곡동 1901-2번지
ValueCountFrequency (%)
경기도 38
 
4.4%
경상북도 26
 
3.0%
경상남도 26
 
3.0%
김해시 21
 
2.5%
김천시 20
 
2.3%
광주광역시 19
 
2.2%
인천광역시 18
 
2.1%
전라남도 16
 
1.9%
김포시 15
 
1.8%
서울특별시 15
 
1.8%
Other values (435) 641
75.0%
2023-12-10T15:39:58.581350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
655
 
15.7%
202
 
4.9%
200
 
4.8%
1 192
 
4.6%
191
 
4.6%
188
 
4.5%
- 148
 
3.6%
137
 
3.3%
102
 
2.5%
93
 
2.2%
Other values (171) 2055
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2567
61.7%
Decimal Number 793
 
19.0%
Space Separator 655
 
15.7%
Dash Punctuation 148
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
202
 
7.9%
200
 
7.8%
191
 
7.4%
188
 
7.3%
137
 
5.3%
102
 
4.0%
93
 
3.6%
90
 
3.5%
73
 
2.8%
67
 
2.6%
Other values (159) 1224
47.7%
Decimal Number
ValueCountFrequency (%)
1 192
24.2%
4 89
11.2%
2 81
10.2%
5 71
 
9.0%
6 70
 
8.8%
8 65
 
8.2%
0 63
 
7.9%
3 60
 
7.6%
7 55
 
6.9%
9 47
 
5.9%
Space Separator
ValueCountFrequency (%)
655
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 148
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2567
61.7%
Common 1596
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
202
 
7.9%
200
 
7.8%
191
 
7.4%
188
 
7.3%
137
 
5.3%
102
 
4.0%
93
 
3.6%
90
 
3.5%
73
 
2.8%
67
 
2.6%
Other values (159) 1224
47.7%
Common
ValueCountFrequency (%)
655
41.0%
1 192
 
12.0%
- 148
 
9.3%
4 89
 
5.6%
2 81
 
5.1%
5 71
 
4.4%
6 70
 
4.4%
8 65
 
4.1%
0 63
 
3.9%
3 60
 
3.8%
Other values (2) 102
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2567
61.7%
ASCII 1596
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
655
41.0%
1 192
 
12.0%
- 148
 
9.3%
4 89
 
5.6%
2 81
 
5.1%
5 71
 
4.4%
6 70
 
4.4%
8 65
 
4.1%
0 63
 
3.9%
3 60
 
3.8%
Other values (2) 102
 
6.4%
Hangul
ValueCountFrequency (%)
202
 
7.9%
200
 
7.8%
191
 
7.4%
188
 
7.3%
137
 
5.3%
102
 
4.0%
93
 
3.6%
90
 
3.5%
73
 
2.8%
67
 
2.6%
Other values (159) 1224
47.7%

Interactions

2023-12-10T15:39:37.602162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:25.617022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:27.947396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:30.688296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:33.199141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:34.691364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:39.433594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:25.798049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.074893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:30.843898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:33.343682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:34.878275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:40.871176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:25.958647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.217865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:30.990726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:33.493553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:35.070654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:42.065538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:26.110762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.351605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:31.125255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:33.634867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:35.253707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:42.434387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:26.251652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.496644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:31.278288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:33.767765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:35.520613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:43.684807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:26.429651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:28.659447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:31.469605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:33.917306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:39:35.782155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:39:58.743393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
X_AXISY_AXISBLK_CDBANK_CDBANK_NMREG_DATESHOP_CLSSHOUS_IDBLD_CD
X_AXIS1.0000.6950.4880.4840.4840.0000.0000.7070.715
Y_AXIS0.6951.0000.5900.0000.0000.0000.0000.6600.643
BLK_CD0.4880.5901.0000.2750.2750.1420.0000.5870.598
BANK_CD0.4840.0000.2751.0001.000NaN0.7230.4380.308
BANK_NM0.4840.0000.2751.0001.000NaN0.7230.4380.308
REG_DATE0.0000.0000.142NaNNaN1.000NaN0.6170.617
SHOP_CLSS0.0000.0000.0000.7230.723NaN1.0000.0000.000
HOUS_ID0.7070.6600.5870.4380.4380.6170.0001.0001.000
BLD_CD0.7150.6430.5980.3080.3080.6170.0001.0001.000
2023-12-10T15:39:58.913896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
BANK_CDSHOP_CLSSBANK_NM
BANK_CD1.0000.5151.000
SHOP_CLSS0.5151.0000.515
BANK_NM1.0000.5151.000
2023-12-10T15:39:59.076341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
X_AXISY_AXISBLK_CDREG_DATEHOUS_IDBLD_CDBANK_CDBANK_NMSHOP_CLSS
X_AXIS1.000-0.1630.1410.0200.4100.4120.3190.3190.000
Y_AXIS-0.1631.0000.1570.218-0.435-0.4380.0000.0000.000
BLK_CD0.1410.1571.0000.1310.0340.0360.1570.1570.000
REG_DATE0.0200.2180.1311.000-0.057-0.0521.0001.0001.000
HOUS_ID0.410-0.4350.034-0.0571.0000.9980.2130.2130.000
BLD_CD0.412-0.4380.036-0.0520.9981.0000.1230.1230.071
BANK_CD0.3190.0000.1571.0000.2130.1231.0001.0000.515
BANK_NM0.3190.0000.1571.0000.2130.1231.0001.0000.515
SHOP_CLSS0.0000.0000.0001.0000.0000.0710.5150.5151.000

Missing values

2023-12-10T15:39:47.477474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:39:47.828521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

BSHOP_CDBSHOP_NMROAD_ADDRX_AXISY_AXISBLK_CDBANK_CDBANK_NMADDRESSREG_DATESHOP_CLSSHOUS_IDBLD_CDHOUS_ADDR
0A00050영업부서울특별시 영등포구 은행로 143048105477592273882KDB산업은행서울특별시 영등포구 은행로 1419680201지점11560110000001600031156011000100160003031606서울특별시 영등포구 여의도동 16-3번지
1A00051오창충청북도 청주시 청원구 오창읍 중심상업로 473491814572524430392KDB산업은행충청북도 청주시 청원구 오창읍 중심상업로 4720160302지점43114253450079200014371025345107920001000001충청북도 청주시 청원구 오창읍 양청리 792-1번지
2A00052용인경기도 용인시 기흥구 동백중앙로 2693250865197584313402KDB산업은행경기도 용인시 기흥구 동백중앙로 26920110214지점41463116000083000034146311600108300003017945경기도 용인시 기흥구 중동 830-3번지
3A00053울산울산광역시 남구 봉월로 565186263272232744252KDB산업은행울산광역시 남구 봉월로 5619730510지점31140104000115600043114010400111560004011993울산광역시 남구 신정동 1156-4번지
4A00054원주강원도 원주시 건강로 13989505250284291472KDB산업은행강원도 원주시 건강로 120060515지점42130117000190100024213011700119010002000001강원도 원주시 반곡동 1901-2번지
5A00055의정부경기도 의정부시 시민로 313154695711122824392KDB산업은행경기도 의정부시 시민로 3120150102지점41150101000048600114115010100104860011020763경기도 의정부시 의정부동 486-11번지
6A00056인천인천광역시 남동구 미래로 422856935398584227242KDB산업은행인천광역시 남동구 미래로 4219540401지점28200101000114000002820010100111400000000001인천광역시 남동구 구월동 1140번지
7A00057일산경기도 고양시 일산동구 중앙로 12012922135619702929312KDB산업은행경기도 고양시 일산동구 중앙로 120120040726지점41285104000089000004128510400108900000002637경기도 고양시 일산동구 장항동 890번지
8A00058잠실서울특별시 송파구 올림픽로 2893208565462031548792KDB산업은행서울특별시 송파구 올림픽로 28919890626지점11710102000000700191171010200100070019000140서울특별시 송파구 신천동 7-19번지
9A00059잠원서울특별시 서초구 강남대로 565313586546010238902KDB산업은행서울특별시 서초구 강남대로 56520120427지점11650106000003600141165010600100360014019363서울특별시 서초구 잠원동 36-14번지
BSHOP_CDBSHOP_NMROAD_ADDRX_AXISY_AXISBLK_CDBANK_CDBANK_NMADDRESSREG_DATESHOP_CLSSHOUS_IDBLD_CDHOUS_ADDR
190A00002강남서울특별시 강남구 영동대로 5083173545457582707512KDB산업은행서울특별시 강남구 영동대로 50819920824지점11680105000016800011168010500101680001016579서울특별시 강남구 삼성동 168-1번지
191A00003경산경상북도 경산시 대학로 734661973596341646592KDB산업은행경상북도 경산시 대학로 7320120906지점47290111000025501204729011100102550099000001경상북도 경산시 정평동 255-120번지
192A00004경주경상북도 경주시 화랑로 125509904361228778242KDB산업은행경상북도 경주시 화랑로 12520150630지점47130104000038600064713010400103860006005652경상북도 경주시 성동동 386-6번지
193A00005광주광주광역시 광산구 무진대로 2612917152854394896392KDB산업은행광주광역시 광산구 무진대로 26119540401지점29200109000158400012920010900115840001041538광주광역시 광산구 우산동 1584-1번지
194A00006구미경상북도 구미시 송정대로 1074317043908193214202KDB산업은행경상북도 구미시 송정대로 10719871209지점47190110000007800004719011000100780000027926경상북도 구미시 송정동 78번지
195A00007군산전라북도 군산시 월명로 2062843893743394605442KDB산업은행전라북도 군산시 월명로 20619891028지점45130141000085400024513014100108540002000001전라북도 군산시 수송동 854-2번지
196A00008금남로광주광역시 동구 금남로 1683010412842491093492KDB산업은행광주광역시 동구 금남로 16820150102지점29110102000006200172911010200100620017003445광주광역시 동구 금남로5가 62-17번지
197A00009금정부산광역시 금정구 중앙대로 1817499611294702834432KDB산업은행부산광역시 금정구 중앙대로 181719900909지점26410107000008400182641010700100840018011702부산광역시 금정구 구서동 84-18번지
198A00010금천서울특별시 금천구 시흥대로 488302834542228201642KDB산업은행서울특별시 금천구 시흥대로 48819911025지점11545102000095400041154510200109540004007611서울특별시 금천구 독산동 954-4번지
199A00011김포경기도 김포시 중봉로 152851695587284101302KDB산업은행경기도 김포시 중봉로 1520050509지점41570105000050600004157010500105060000023794경기도 김포시 감정동 506번지