Overview

Dataset statistics

Number of variables14
Number of observations497
Missing cells117
Missing cells (%)1.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory57.4 KiB
Average record size in memory118.3 B

Variable types

Categorical3
Text4
DateTime1
Numeric6

Dataset

Description경기도 용인시 관내 공동주택 현황입니다. 위치, 단지명, 동수, 층수 등의 데이터를 제공합니다.※ 데이터기준일자 : 2014-02-27
Author경기도 용인시
URLhttps://www.data.go.kr/data/3063525/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
구분 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 구분High correlation
동수 is highly overall correlated with 세대수 and 3 other fieldsHigh correlation
층수 is highly overall correlated with 세대수 and 1 other fieldsHigh correlation
세대수 is highly overall correlated with 동수 and 4 other fieldsHigh correlation
대지면적 is highly overall correlated with 동수 and 3 other fieldsHigh correlation
건축면적 is highly overall correlated with 동수 and 3 other fieldsHigh correlation
연면적 is highly overall correlated with 동수 and 4 other fieldsHigh correlation
구분 is highly imbalanced (71.2%)Imbalance
비고 is highly imbalanced (62.4%)Imbalance
평형(세대수) has 27 (5.4%) missing valuesMissing
대지면적 has 27 (5.4%) missing valuesMissing
관리사무소 전화번호 has 59 (11.9%) missing valuesMissing
연면적 is highly skewed (γ1 = 20.54541685)Skewed
위치 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:32:58.954093
Analysis finished2023-12-12 05:33:04.509170
Duration5.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
분양아파트
472 
임대아파트
 
25

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row분양아파트
2nd row분양아파트
3rd row분양아파트
4th row분양아파트
5th row분양아파트

Common Values

ValueCountFrequency (%)
분양아파트 472
95.0%
임대아파트 25
 
5.0%

Length

2023-12-12T14:33:04.589312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:33:04.702642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분양아파트 472
95.0%
임대아파트 25
 
5.0%

위치
Text

UNIQUE 

Distinct497
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T14:33:04.954005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length34
Mean length12.61167
Min length10

Characters and Unicode

Total characters6268
Distinct characters103
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique497 ?
Unique (%)100.0%

Sample

1st row처인구 고림동 264-8
2nd row처인구 고림동 411-1
3rd row처인구 고림동 748-6 ,748-11
4th row처인구 고림동 794-29
5th row처인구 고림동 995
ValueCountFrequency (%)
기흥구 221
 
14.4%
수지구 186
 
12.1%
처인구 91
 
5.9%
죽전1동 46
 
3.0%
구갈동 27
 
1.8%
상현1동 25
 
1.6%
중동 23
 
1.5%
신갈동 21
 
1.4%
영덕동 21
 
1.4%
동천동 19
 
1.2%
Other values (512) 857
55.8%
2023-12-12T14:33:05.306535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1045
16.7%
535
 
8.5%
519
 
8.3%
1 442
 
7.1%
221
 
3.5%
221
 
3.5%
2 206
 
3.3%
199
 
3.2%
186
 
3.0%
6 180
 
2.9%
Other values (93) 2514
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3243
51.7%
Decimal Number 1836
29.3%
Space Separator 1045
 
16.7%
Dash Punctuation 102
 
1.6%
Close Punctuation 13
 
0.2%
Open Punctuation 13
 
0.2%
Uppercase Letter 12
 
0.2%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
535
16.5%
519
16.0%
221
 
6.8%
221
 
6.8%
199
 
6.1%
186
 
5.7%
91
 
2.8%
91
 
2.8%
71
 
2.2%
65
 
2.0%
Other values (75) 1044
32.2%
Decimal Number
ValueCountFrequency (%)
1 442
24.1%
2 206
11.2%
6 180
9.8%
5 157
 
8.6%
8 155
 
8.4%
3 150
 
8.2%
9 144
 
7.8%
0 142
 
7.7%
4 131
 
7.1%
7 129
 
7.0%
Uppercase Letter
ValueCountFrequency (%)
A 6
50.0%
L 3
25.0%
B 3
25.0%
Space Separator
ValueCountFrequency (%)
1045
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 102
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3240
51.7%
Common 3013
48.1%
Latin 12
 
0.2%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
535
16.5%
519
16.0%
221
 
6.8%
221
 
6.8%
199
 
6.1%
186
 
5.7%
91
 
2.8%
91
 
2.8%
71
 
2.2%
65
 
2.0%
Other values (74) 1041
32.1%
Common
ValueCountFrequency (%)
1045
34.7%
1 442
14.7%
2 206
 
6.8%
6 180
 
6.0%
5 157
 
5.2%
8 155
 
5.1%
3 150
 
5.0%
9 144
 
4.8%
0 142
 
4.7%
4 131
 
4.3%
Other values (5) 261
 
8.7%
Latin
ValueCountFrequency (%)
A 6
50.0%
L 3
25.0%
B 3
25.0%
Han
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3240
51.7%
ASCII 3025
48.3%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1045
34.5%
1 442
14.6%
2 206
 
6.8%
6 180
 
6.0%
5 157
 
5.2%
8 155
 
5.1%
3 150
 
5.0%
9 144
 
4.8%
0 142
 
4.7%
4 131
 
4.3%
Other values (8) 273
 
9.0%
Hangul
ValueCountFrequency (%)
535
16.5%
519
16.0%
221
 
6.8%
221
 
6.8%
199
 
6.1%
186
 
5.7%
91
 
2.8%
91
 
2.8%
71
 
2.2%
65
 
2.0%
Other values (74) 1041
32.1%
CJK
ValueCountFrequency (%)
3
100.0%
Distinct489
Distinct (%)98.8%
Missing2
Missing (%)0.4%
Memory size4.0 KiB
2023-12-12T14:33:05.596606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length11.707071
Min length2

Characters and Unicode

Total characters5795
Distinct characters299
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique483 ?
Unique (%)97.6%

Sample

1st row인정 피렌체 빌리지 1차
2nd row이삭아파트
3rd row용성빌라
4th row예진마을 2차아파트
5th row임원마을 영화1차 아파트
ValueCountFrequency (%)
수지 20
 
1.9%
휴먼시아 16
 
1.5%
상현마을 15
 
1.4%
1단지 14
 
1.3%
2단지 12
 
1.1%
내대지마을 11
 
1.0%
3단지 11
 
1.0%
신봉마을 11
 
1.0%
교동마을 10
 
0.9%
도담마을 10
 
0.9%
Other values (542) 937
87.8%
2023-12-12T14:33:06.147037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
577
 
10.0%
357
 
6.2%
348
 
6.0%
269
 
4.6%
240
 
4.1%
225
 
3.9%
200
 
3.5%
125
 
2.2%
102
 
1.8%
99
 
1.7%
Other values (289) 3253
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4810
83.0%
Space Separator 577
 
10.0%
Decimal Number 279
 
4.8%
Close Punctuation 42
 
0.7%
Open Punctuation 42
 
0.7%
Uppercase Letter 29
 
0.5%
Dash Punctuation 11
 
0.2%
Other Punctuation 4
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
357
 
7.4%
348
 
7.2%
269
 
5.6%
240
 
5.0%
225
 
4.7%
200
 
4.2%
125
 
2.6%
102
 
2.1%
99
 
2.1%
91
 
1.9%
Other values (266) 2754
57.3%
Decimal Number
ValueCountFrequency (%)
1 76
27.2%
2 69
24.7%
3 40
14.3%
0 22
 
7.9%
4 22
 
7.9%
5 18
 
6.5%
6 13
 
4.7%
7 8
 
2.9%
8 6
 
2.2%
9 5
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
G 8
27.6%
A 8
27.6%
L 8
27.6%
C 2
 
6.9%
B 2
 
6.9%
J 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
, 1
 
25.0%
Space Separator
ValueCountFrequency (%)
577
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4810
83.0%
Common 955
 
16.5%
Latin 30
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
357
 
7.4%
348
 
7.2%
269
 
5.6%
240
 
5.0%
225
 
4.7%
200
 
4.2%
125
 
2.6%
102
 
2.1%
99
 
2.1%
91
 
1.9%
Other values (266) 2754
57.3%
Common
ValueCountFrequency (%)
577
60.4%
1 76
 
8.0%
2 69
 
7.2%
) 42
 
4.4%
( 42
 
4.4%
3 40
 
4.2%
0 22
 
2.3%
4 22
 
2.3%
5 18
 
1.9%
6 13
 
1.4%
Other values (6) 34
 
3.6%
Latin
ValueCountFrequency (%)
G 8
26.7%
A 8
26.7%
L 8
26.7%
C 2
 
6.7%
B 2
 
6.7%
J 1
 
3.3%
1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4810
83.0%
ASCII 984
 
17.0%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
577
58.6%
1 76
 
7.7%
2 69
 
7.0%
) 42
 
4.3%
( 42
 
4.3%
3 40
 
4.1%
0 22
 
2.2%
4 22
 
2.2%
5 18
 
1.8%
6 13
 
1.3%
Other values (12) 63
 
6.4%
Hangul
ValueCountFrequency (%)
357
 
7.4%
348
 
7.2%
269
 
5.6%
240
 
5.0%
225
 
4.7%
200
 
4.2%
125
 
2.6%
102
 
2.1%
99
 
2.1%
91
 
1.9%
Other values (266) 2754
57.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct400
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
Minimum1984-11-09 00:00:00
Maximum2014-02-27 00:00:00
2023-12-12T14:33:06.296713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:06.463296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

동수
Real number (ℝ)

HIGH CORRELATION 

Distinct29
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.7867203
Minimum1
Maximum36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T14:33:06.622191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q39
95-th percentile15
Maximum36
Range35
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.8885548
Coefficient of variation (CV)0.72031182
Kurtosis6.2393876
Mean6.7867203
Median Absolute Deviation (MAD)3
Skewness1.842973
Sum3373
Variance23.897968
MonotonicityNot monotonic
2023-12-12T14:33:06.778394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 50
10.1%
3 49
9.9%
7 48
9.7%
5 47
9.5%
8 46
9.3%
6 46
9.3%
4 43
8.7%
2 35
7.0%
9 27
 
5.4%
10 23
 
4.6%
Other values (19) 83
16.7%
ValueCountFrequency (%)
1 50
10.1%
2 35
7.0%
3 49
9.9%
4 43
8.7%
5 47
9.5%
6 46
9.3%
7 48
9.7%
8 46
9.3%
9 27
5.4%
10 23
4.6%
ValueCountFrequency (%)
36 1
0.2%
35 1
0.2%
30 1
0.2%
27 1
0.2%
26 1
0.2%
24 1
0.2%
23 1
0.2%
22 1
0.2%
21 1
0.2%
20 1
0.2%

층수
Real number (ℝ)

HIGH CORRELATION 

Distinct30
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.953722
Minimum3
Maximum40
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T14:33:06.933063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile4
Q115
median19
Q320
95-th percentile25
Maximum40
Range37
Interquartile range (IQR)5

Descriptive statistics

Standard deviation5.9996532
Coefficient of variation (CV)0.35388412
Kurtosis0.77911216
Mean16.953722
Median Absolute Deviation (MAD)3
Skewness-0.71096466
Sum8426
Variance35.995838
MonotonicityNot monotonic
2023-12-12T14:33:07.134779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
20 142
28.6%
15 81
16.3%
18 42
 
8.5%
19 34
 
6.8%
25 27
 
5.4%
4 18
 
3.6%
5 17
 
3.4%
3 16
 
3.2%
17 12
 
2.4%
16 12
 
2.4%
Other values (20) 96
19.3%
ValueCountFrequency (%)
3 16
3.2%
4 18
3.6%
5 17
3.4%
6 10
2.0%
7 3
 
0.6%
8 1
 
0.2%
9 1
 
0.2%
10 7
 
1.4%
11 3
 
0.6%
12 6
 
1.2%
ValueCountFrequency (%)
40 1
 
0.2%
36 1
 
0.2%
30 3
 
0.6%
29 1
 
0.2%
28 1
 
0.2%
27 2
 
0.4%
26 1
 
0.2%
25 27
5.4%
24 10
 
2.0%
23 12
2.4%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct356
Distinct (%)71.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean449.68209
Minimum21
Maximum1998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T14:33:07.326720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum21
5-th percentile45
Q1196
median389
Q3592
95-th percentile1119.2
Maximum1998
Range1977
Interquartile range (IQR)396

Descriptive statistics

Standard deviation348.50673
Coefficient of variation (CV)0.77500691
Kurtosis3.0130937
Mean449.68209
Median Absolute Deviation (MAD)195
Skewness1.4767395
Sum223492
Variance121456.94
MonotonicityNot monotonic
2023-12-12T14:33:07.512295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
24 7
 
1.4%
72 4
 
0.8%
70 4
 
0.8%
50 4
 
0.8%
432 4
 
0.8%
159 3
 
0.6%
367 3
 
0.6%
296 3
 
0.6%
404 3
 
0.6%
336 3
 
0.6%
Other values (346) 459
92.4%
ValueCountFrequency (%)
21 3
0.6%
24 7
1.4%
25 1
 
0.2%
27 2
 
0.4%
30 1
 
0.2%
32 1
 
0.2%
36 3
0.6%
37 1
 
0.2%
39 1
 
0.2%
40 2
 
0.4%
ValueCountFrequency (%)
1998 1
0.2%
1990 1
0.2%
1902 1
0.2%
1828 1
0.2%
1744 1
0.2%
1701 1
0.2%
1626 1
0.2%
1620 1
0.2%
1596 1
0.2%
1576 1
0.2%

평형(세대수)
Text

MISSING 

Distinct467
Distinct (%)99.4%
Missing27
Missing (%)5.4%
Memory size4.0 KiB
2023-12-12T14:33:07.767765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length137
Median length75
Mean length19.593617
Min length6

Characters and Unicode

Total characters9209
Distinct characters22
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique464 ?
Unique (%)98.7%

Sample

1st row24(639)
2nd row24(120),32(180)
3rd row54.54㎡(30),58.02㎡(5),64.44㎡(10)
4th row59㎡(4),60㎡(4),62㎡(4),63㎡(15),66㎡(1),68㎡(2),73㎡(6),75㎡(3)
5th row35(356),53(60)
ValueCountFrequency (%)
25(59 2
 
0.4%
34(87),26(62 2
 
0.4%
33(342 2
 
0.4%
24(1239 1
 
0.2%
49(35),32(96),24(138 1
 
0.2%
51(190),44(120 1
 
0.2%
41(377),51(302),61(216 1
 
0.2%
24(639 1
 
0.2%
67(224 1
 
0.2%
36(152),51(80)57(80 1
 
0.2%
Other values (480) 480
97.4%
2023-12-12T14:33:08.213754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1172
12.7%
) 1172
12.7%
2 829
9.0%
3 812
8.8%
1 779
8.5%
4 771
8.4%
, 646
7.0%
5 536
 
5.8%
6 473
 
5.1%
8 438
 
4.8%
Other values (12) 1581
17.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5743
62.4%
Open Punctuation 1172
 
12.7%
Close Punctuation 1172
 
12.7%
Other Punctuation 808
 
8.8%
Other Symbol 274
 
3.0%
Space Separator 25
 
0.3%
Other Letter 12
 
0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 829
14.4%
3 812
14.1%
1 779
13.6%
4 771
13.4%
5 536
9.3%
6 473
8.2%
8 438
7.6%
0 397
6.9%
7 356
6.2%
9 352
6.1%
Other Letter
ValueCountFrequency (%)
4
33.3%
4
33.3%
2
16.7%
2
16.7%
Other Punctuation
ValueCountFrequency (%)
, 646
80.0%
. 161
 
19.9%
* 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1172
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1172
100.0%
Other Symbol
ValueCountFrequency (%)
274
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9197
99.9%
Hangul 12
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
( 1172
12.7%
) 1172
12.7%
2 829
9.0%
3 812
8.8%
1 779
8.5%
4 771
8.4%
, 646
7.0%
5 536
 
5.8%
6 473
 
5.1%
8 438
 
4.8%
Other values (8) 1569
17.1%
Hangul
ValueCountFrequency (%)
4
33.3%
4
33.3%
2
16.7%
2
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8923
96.9%
CJK Compat 274
 
3.0%
Hangul 12
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1172
13.1%
) 1172
13.1%
2 829
9.3%
3 812
9.1%
1 779
8.7%
4 771
8.6%
, 646
7.2%
5 536
6.0%
6 473
 
5.3%
8 438
 
4.9%
Other values (7) 1295
14.5%
CJK Compat
ValueCountFrequency (%)
274
100.0%
Hangul
ValueCountFrequency (%)
4
33.3%
4
33.3%
2
16.7%
2
16.7%

대지면적
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct469
Distinct (%)99.8%
Missing27
Missing (%)5.4%
Infinite0
Infinite (%)0.0%
Mean29013.055
Minimum699
Maximum172102
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T14:33:08.391290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum699
5-th percentile3370.55
Q112748
median23295.5
Q335300.75
95-th percentile76779.2
Maximum172102
Range171403
Interquartile range (IQR)22552.75

Descriptive statistics

Standard deviation24451.484
Coefficient of variation (CV)0.8427752
Kurtosis6.0857397
Mean29013.055
Median Absolute Deviation (MAD)11074.5
Skewness2.0863048
Sum13636136
Variance5.9787505 × 108
MonotonicityNot monotonic
2023-12-12T14:33:08.563649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1653 2
 
0.4%
35980 1
 
0.2%
89251 1
 
0.2%
5792 1
 
0.2%
114997 1
 
0.2%
93718 1
 
0.2%
31143 1
 
0.2%
54725 1
 
0.2%
57753 1
 
0.2%
34083 1
 
0.2%
Other values (459) 459
92.4%
(Missing) 27
 
5.4%
ValueCountFrequency (%)
699 1
0.2%
960 1
0.2%
1124 1
0.2%
1653 2
0.4%
1662 1
0.2%
1774 1
0.2%
1830 1
0.2%
2024 1
0.2%
2107 1
0.2%
2179 1
0.2%
ValueCountFrequency (%)
172102 1
0.2%
144041 1
0.2%
142740 1
0.2%
128356 1
0.2%
128176 1
0.2%
126411 1
0.2%
117693 1
0.2%
114997 1
0.2%
112448 1
0.2%
102749 1
0.2%

건축면적
Real number (ℝ)

HIGH CORRELATION 

Distinct482
Distinct (%)97.4%
Missing2
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean4865.3535
Minimum247
Maximum62890
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T14:33:08.734407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum247
5-th percentile699.1
Q12248.5
median3870
Q35898.5
95-th percentile12715.4
Maximum62890
Range62643
Interquartile range (IQR)3650

Descriptive statistics

Standard deviation4826.9285
Coefficient of variation (CV)0.99210232
Kurtosis53.331266
Mean4865.3535
Median Absolute Deviation (MAD)1698
Skewness5.500902
Sum2408350
Variance23299239
MonotonicityNot monotonic
2023-12-12T14:33:09.293348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 2
 
0.4%
9449 2
 
0.4%
3288 2
 
0.4%
2628 2
 
0.4%
3568 2
 
0.4%
1756 2
 
0.4%
12826 2
 
0.4%
1994 2
 
0.4%
2445 2
 
0.4%
4791 2
 
0.4%
Other values (472) 475
95.6%
ValueCountFrequency (%)
247 1
0.2%
320 1
0.2%
323 1
0.2%
324 1
0.2%
356 1
0.2%
377 1
0.2%
418 1
0.2%
419 1
0.2%
450 1
0.2%
453 1
0.2%
ValueCountFrequency (%)
62890 1
0.2%
46630 1
0.2%
22484 1
0.2%
21874 1
0.2%
20997 1
0.2%
19266 1
0.2%
19264 1
0.2%
17076 1
0.2%
16600 1
0.2%
16049 1
0.2%

연면적
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct495
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81008.513
Minimum1098
Maximum5938609
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T14:33:09.479047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1098
5-th percentile4401
Q127430
median55418
Q386752
95-th percentile192985.2
Maximum5938609
Range5937511
Interquartile range (IQR)59322

Descriptive statistics

Standard deviation270624.12
Coefficient of variation (CV)3.3406874
Kurtosis444.90733
Mean81008.513
Median Absolute Deviation (MAD)29391
Skewness20.545417
Sum40261231
Variance7.3237416 × 1010
MonotonicityNot monotonic
2023-12-12T14:33:09.628366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
152928 2
 
0.4%
191987 2
 
0.4%
65037 1
 
0.2%
79166 1
 
0.2%
75727 1
 
0.2%
80134 1
 
0.2%
88014 1
 
0.2%
154522 1
 
0.2%
214793 1
 
0.2%
23057 1
 
0.2%
Other values (485) 485
97.6%
ValueCountFrequency (%)
1098 1
0.2%
1253 1
0.2%
1278 1
0.2%
1538 1
0.2%
1557 1
0.2%
1619 1
0.2%
1874 1
0.2%
1918 1
0.2%
1933 1
0.2%
1961 1
0.2%
ValueCountFrequency (%)
5938609 1
0.2%
433522 1
0.2%
368030 1
0.2%
351920 1
0.2%
325231 1
0.2%
323519 1
0.2%
321753 1
0.2%
309630 1
0.2%
291404 1
0.2%
283003 1
0.2%
Distinct421
Distinct (%)96.1%
Missing59
Missing (%)11.9%
Memory size4.0 KiB
2023-12-12T14:33:09.945393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.013699
Min length12

Characters and Unicode

Total characters5262
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique407 ?
Unique (%)92.9%

Sample

1st row031-321-2514
2nd row031-321-4410
3rd row031-322-5990
4th row031-323-4500
5th row031-323-5581
ValueCountFrequency (%)
031-272-7522 3
 
0.7%
031-897-5598 3
 
0.7%
031-323-2450 3
 
0.7%
031-281-4373 2
 
0.5%
031-285-6056 2
 
0.5%
031-287-7445 2
 
0.5%
031-261-9122 2
 
0.5%
031-275-0136 2
 
0.5%
031-283-8182 2
 
0.5%
031-263-0894 2
 
0.5%
Other values (411) 415
94.7%
2023-12-12T14:33:10.409503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 876
16.6%
3 787
15.0%
1 714
13.6%
0 708
13.5%
2 508
9.7%
6 368
7.0%
8 325
 
6.2%
5 268
 
5.1%
9 261
 
5.0%
7 239
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4386
83.4%
Dash Punctuation 876
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 787
17.9%
1 714
16.3%
0 708
16.1%
2 508
11.6%
6 368
8.4%
8 325
7.4%
5 268
 
6.1%
9 261
 
6.0%
7 239
 
5.4%
4 208
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 876
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5262
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 876
16.6%
3 787
15.0%
1 714
13.6%
0 708
13.5%
2 508
9.7%
6 368
7.0%
8 325
 
6.2%
5 268
 
5.1%
9 261
 
5.0%
7 239
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5262
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 876
16.6%
3 787
15.0%
1 714
13.6%
0 708
13.5%
2 508
9.7%
6 368
7.0%
8 325
 
6.2%
5 268
 
5.1%
9 261
 
5.0%
7 239
 
4.5%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
의무관리
376 
비의무관리
93 
국민임대(30년)
 
22
비의무(도시형)
 
3
50년 공공임대
 
1
Other values (2)
 
2

Length

Max length9
Median length4
Mean length4.4527163
Min length4

Unique

Unique3 ?
Unique (%)0.6%

Sample

1st row의무관리
2nd row의무관리
3rd row비의무관리
4th row비의무관리
5th row의무관리

Common Values

ValueCountFrequency (%)
의무관리 376
75.7%
비의무관리 93
 
18.7%
국민임대(30년) 22
 
4.4%
비의무(도시형) 3
 
0.6%
50년 공공임대 1
 
0.2%
10년 공공임대 1
 
0.2%
임대(국민) 1
 
0.2%

Length

2023-12-12T14:33:10.595613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:33:10.721218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의무관리 376
75.4%
비의무관리 93
 
18.6%
국민임대(30년 22
 
4.4%
비의무(도시형 3
 
0.6%
공공임대 2
 
0.4%
50년 1
 
0.2%
10년 1
 
0.2%
임대(국민 1
 
0.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2014-02-27
497 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2014-02-27
2nd row2014-02-27
3rd row2014-02-27
4th row2014-02-27
5th row2014-02-27

Common Values

ValueCountFrequency (%)
2014-02-27 497
100.0%

Length

2023-12-12T14:33:10.878274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:33:10.982089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014-02-27 497
100.0%

Interactions

2023-12-12T14:33:03.350270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:32:59.712749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.434562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:01.094808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.119493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.757039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.445699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:32:59.838665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.539618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:01.220515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.235897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.847640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.540272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:32:59.965043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.654822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:01.330753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.351785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.936796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.662289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.086772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.768258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:01.438633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.454737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.060567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.768839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.206625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.866492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:01.868371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.545704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.162643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.872497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.337510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:00.968784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:01.991042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:02.651492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:33:03.249590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:33:11.046243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분동수층수세대수대지면적건축면적연면적비고
구분1.0000.1330.0860.3070.1340.0000.0001.000
동수0.1331.0000.3550.8990.9090.8260.0000.344
층수0.0860.3551.0000.6790.5900.2340.0000.539
세대수0.3070.8990.6791.0000.8850.7090.0000.560
대지면적0.1340.9090.5900.8851.0000.8160.0000.353
건축면적0.0000.8260.2340.7090.8161.0000.0000.080
연면적0.0000.0000.0000.0000.0000.0001.0000.000
비고1.0000.3440.5390.5600.3530.0800.0001.000
2023-12-12T14:33:11.158423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분비고
구분1.0000.995
비고0.9951.000
2023-12-12T14:33:11.248064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동수층수세대수대지면적건축면적연면적구분비고
동수1.0000.3560.8560.8940.8960.8540.1320.182
층수0.3561.0000.5570.4220.3790.5760.0650.309
세대수0.8560.5571.0000.8910.8570.9000.2330.324
대지면적0.8940.4220.8911.0000.9200.9470.1020.187
건축면적0.8960.3790.8570.9201.0000.9180.0000.049
연면적0.8540.5760.9000.9470.9181.0000.0000.000
구분0.1320.0650.2330.1020.0000.0001.0000.995
비고0.1820.3090.3240.1870.0490.0000.9951.000

Missing values

2023-12-12T14:33:04.046064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:33:04.262030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:33:04.417454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분위치단지명사용검사일동수층수세대수평형(세대수)대지면적건축면적연면적관리사무소 전화번호비고데이터기준일자
0분양아파트처인구 고림동 264-8인정 피렌체 빌리지 1차2000-01-15101563924(639)26904492666395031-321-2514의무관리2014-02-27
1분양아파트처인구 고림동 411-1이삭아파트1998-12-2442030024(120),32(180)12173222933720031-321-4410의무관리2014-02-27
2분양아파트처인구 고림동 748-6 ,748-11<NA>1990-02-23354554.54㎡(30),58.02㎡(5),64.44㎡(10)<NA>5853458<NA>비의무관리2014-02-27
3분양아파트처인구 고림동 794-29용성빌라1986-11-21333959㎡(4),60㎡(4),62㎡(4),63㎡(15),66㎡(1),68㎡(2),73㎡(6),75㎡(3)<NA>7643058<NA>비의무관리2014-02-27
4분양아파트처인구 고림동 995예진마을 2차아파트2001-04-2081541635(356),53(60)27627460863908031-322-5990의무관리2014-02-27
5분양아파트처인구 고림동 996임원마을 영화1차 아파트2002-06-2661851325(162),31(211)33(140)24463408460879031-323-4500의무관리2014-02-27
6분양아파트처인구 고림동 997예진마을 3차아파트2003-07-01101551625(280),33(236)29404422568029031-323-5581의무관리2014-02-27
7분양아파트처인구 고림동 1000금평마을 영화2차아파트2003-12-2361542832(280),38(28),41(120)25835448562015031-332-5319의무관리2014-02-27
8분양아파트처인구 고림동 1001보평마을 삼정그린뷰아파트2004-04-3071536029(80),33(280)20069373645545031-323-0141의무관리2014-02-27
9분양아파트처인구 고림동 1002예원마을 코아루아파트2004-12-0871540824(90),34(318)27164380153249031-323-5221의무관리2014-02-27
구분위치단지명사용검사일동수층수세대수평형(세대수)대지면적건축면적연면적관리사무소 전화번호비고데이터기준일자
487임대아파트기흥구 청덕동 503구성 물푸레마을 휴먼시아 5단지2008-06-03112071217(252),20(318),25(142)26853497359403031-8005-7487국민임대(30년)2014-02-27
488임대아파트기흥구 청덕동 506구성 물푸레마을 휴먼시아 2단지2008-06-0361838985이하(389)15755294231913031-693-6430국민임대(30년)2014-02-27
489임대아파트기흥구 청덕동 513구성 물푸레마을 휴먼시아 1단지2008-07-1681561817(266),20(266),21(86)27635436749693031-8005-8620국민임대(30년)2014-02-27
490임대아파트기흥구 청덕동 569구성 물푸레마을 휴먼시아 9단지2008-09-11132094817(282),20(216),21(336),25(114)42985601081190031-275-2192국민임대(30년)2014-02-27
491임대아파트기흥구 영덕동 1069흥덕마을 12단지 신동아파밀리에2009-10-29142075942(280)46(180),49(146)52(153)658168747175370031-212-794610년 공공임대2014-02-27
492임대아파트수지구 죽전1동 1226내대지마을 주공2단지아파트2005-07-2021513618(136)63451106811068031-892-0280국민임대(30년)2014-02-27
493임대아파트수지구 죽전1동 1233성현마을 주공3단지아파트2005-07-2061538826(388)18183274133546031-891-0138국민임대(30년)2014-02-27
494임대아파트수지구 죽전1동 1279도담마을 주공8단지아파트2005-08-0571564316(177),19(230)25(236)30168422753748031-891-0244국민임대(30년)2014-02-27
495임대아파트수지구 죽전1동 1367내대지마을 주공아파트(임대)2006-04-1731322517(113),20(112)12069153517645031-889-5405임대(국민)2014-02-27
496임대아파트수지구 상현1동 1135(광교A30BL) -수지구 상현동 광교마을 90휴먼시아아파트(광교)2011-11-169251117<NA>48508702886832031-265-6083국민임대(30년)2014-02-27