Overview

Dataset statistics

Number of variables7
Number of observations2859
Missing cells0
Missing cells (%)0.0%
Duplicate rows9
Duplicate rows (%)0.3%
Total size in memory162.1 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Text2
Categorical2
DateTime1

Dataset

Description대구광역시 달서구 다가구(원룸 등) 및 오피스텔에 대한 정보가 담겨져 있음. (건물위치, 건물명, 주택유형구분 등)
URLhttps://www.data.go.kr/data/15083651/fileData.do

Alerts

관리부서 has constant value ""Constant
기준일자 has constant value ""Constant
Dataset has 9 (0.3%) duplicate rowsDuplicates
주택유형구분 is highly imbalanced (80.2%)Imbalance

Reproduction

Analysis started2023-12-12 06:22:30.254091
Analysis finished2023-12-12 06:22:31.309842
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위도
Real number (ℝ)

Distinct2842
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.839743
Minimum35.795295
Maximum35.86237
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.3 KiB
2023-12-12T15:22:31.404868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.795295
5-th percentile35.810778
Q135.825754
median35.847092
Q335.854701
95-th percentile35.858711
Maximum35.86237
Range0.0670752
Interquartile range (IQR)0.02894697

Descriptive statistics

Standard deviation0.01727279
Coefficient of variation (CV)0.00048194514
Kurtosis-1.1045649
Mean35.839743
Median Absolute Deviation (MAD)0.01057398
Skewness-0.56380801
Sum102465.83
Variance0.00029834927
MonotonicityNot monotonic
2023-12-12T15:22:31.564953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.81414772 4
 
0.1%
35.85237557 2
 
0.1%
35.8082206 2
 
0.1%
35.85786245 2
 
0.1%
35.84742143 2
 
0.1%
35.80834461 2
 
0.1%
35.84266783 2
 
0.1%
35.84415123 2
 
0.1%
35.84751293 2
 
0.1%
35.84335415 2
 
0.1%
Other values (2832) 2837
99.2%
ValueCountFrequency (%)
35.79529475 1
< 0.1%
35.7953767 1
< 0.1%
35.79567408 1
< 0.1%
35.79580085 1
< 0.1%
35.79582701 1
< 0.1%
35.79601721 1
< 0.1%
35.79617881 1
< 0.1%
35.79635006 1
< 0.1%
35.79635075 1
< 0.1%
35.79650392 1
< 0.1%
ValueCountFrequency (%)
35.86236995 1
< 0.1%
35.86224035 1
< 0.1%
35.8618726 1
< 0.1%
35.86183268 1
< 0.1%
35.86182437 1
< 0.1%
35.86178888 1
< 0.1%
35.86178038 1
< 0.1%
35.86171778 1
< 0.1%
35.86164593 1
< 0.1%
35.86161632 1
< 0.1%

경도
Real number (ℝ)

Distinct2840
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.5333
Minimum128.47579
Maximum128.57371
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.3 KiB
2023-12-12T15:22:31.726215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.47579
5-th percentile128.49283
Q1128.52191
median128.53777
Q3128.55035
95-th percentile128.56332
Maximum128.57371
Range0.0979276
Interquartile range (IQR)0.02844535

Descriptive statistics

Standard deviation0.022881447
Coefficient of variation (CV)0.0001780196
Kurtosis-0.36571344
Mean128.5333
Median Absolute Deviation (MAD)0.013252
Skewness-0.716305
Sum367476.7
Variance0.0005235606
MonotonicityNot monotonic
2023-12-12T15:22:31.910871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.5189692 4
 
0.1%
128.4898478 2
 
0.1%
128.5418486 2
 
0.1%
128.5057673 2
 
0.1%
128.5118268 2
 
0.1%
128.5536263 2
 
0.1%
128.4994467 2
 
0.1%
128.5544732 2
 
0.1%
128.4976865 2
 
0.1%
128.5123366 2
 
0.1%
Other values (2830) 2837
99.2%
ValueCountFrequency (%)
128.475786 1
< 0.1%
128.4763038 1
< 0.1%
128.4764707 1
< 0.1%
128.4766956 1
< 0.1%
128.476768 1
< 0.1%
128.476778 1
< 0.1%
128.4768245 1
< 0.1%
128.4768395 1
< 0.1%
128.4769436 1
< 0.1%
128.4769814 1
< 0.1%
ValueCountFrequency (%)
128.5737136 1
< 0.1%
128.5735008 1
< 0.1%
128.573494 1
< 0.1%
128.57347 1
< 0.1%
128.5734581 1
< 0.1%
128.5734564 1
< 0.1%
128.5734407 1
< 0.1%
128.5733649 1
< 0.1%
128.5732246 1
< 0.1%
128.5731269 1
< 0.1%
Distinct2842
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size22.5 KiB
2023-12-12T15:22:32.172787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length19.710738
Min length16

Characters and Unicode

Total characters56353
Distinct characters53
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2827 ?
Unique (%)98.9%

Sample

1st row대구광역시 달서구 성당동 54-19
2nd row대구광역시 달서구 성당동 55-3
3rd row대구광역시 달서구 성당동 55-13
4th row대구광역시 달서구 성당동 56-21
5th row대구광역시 달서구 성당동 384-6
ValueCountFrequency (%)
대구광역시 2859
25.0%
달서구 2859
25.0%
송현동 504
 
4.4%
신당동 334
 
2.9%
두류동 317
 
2.8%
상인동 305
 
2.7%
감삼동 301
 
2.6%
진천동 202
 
1.8%
성당동 170
 
1.5%
이곡동 141
 
1.2%
Other values (2787) 3448
30.1%
2023-12-12T15:22:32.638140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8581
15.2%
5718
 
10.1%
1 3186
 
5.7%
2926
 
5.2%
2859
 
5.1%
2859
 
5.1%
2859
 
5.1%
2859
 
5.1%
2859
 
5.1%
2859
 
5.1%
Other values (43) 18788
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31409
55.7%
Decimal Number 13507
24.0%
Space Separator 8581
 
15.2%
Dash Punctuation 2856
 
5.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5718
18.2%
2926
9.3%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
504
 
1.6%
504
 
1.6%
Other values (31) 4603
14.7%
Decimal Number
ValueCountFrequency (%)
1 3186
23.6%
2 1507
11.2%
0 1254
 
9.3%
3 1230
 
9.1%
4 1172
 
8.7%
7 1089
 
8.1%
8 1073
 
7.9%
6 1015
 
7.5%
5 1012
 
7.5%
9 969
 
7.2%
Space Separator
ValueCountFrequency (%)
8581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2856
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31409
55.7%
Common 24944
44.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5718
18.2%
2926
9.3%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
504
 
1.6%
504
 
1.6%
Other values (31) 4603
14.7%
Common
ValueCountFrequency (%)
8581
34.4%
1 3186
 
12.8%
- 2856
 
11.4%
2 1507
 
6.0%
0 1254
 
5.0%
3 1230
 
4.9%
4 1172
 
4.7%
7 1089
 
4.4%
8 1073
 
4.3%
6 1015
 
4.1%
Other values (2) 1981
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31409
55.7%
ASCII 24944
44.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8581
34.4%
1 3186
 
12.8%
- 2856
 
11.4%
2 1507
 
6.0%
0 1254
 
5.0%
3 1230
 
4.9%
4 1172
 
4.7%
7 1089
 
4.4%
8 1073
 
4.3%
6 1015
 
4.1%
Other values (2) 1981
 
7.9%
Hangul
ValueCountFrequency (%)
5718
18.2%
2926
9.3%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
2859
9.1%
504
 
1.6%
504
 
1.6%
Other values (31) 4603
14.7%
Distinct1707
Distinct (%)59.7%
Missing0
Missing (%)0.0%
Memory size22.5 KiB
2023-12-12T15:22:33.011813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length4.3109479
Min length2

Characters and Unicode

Total characters12325
Distinct characters513
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1437 ?
Unique (%)50.3%

Sample

1st row양지빌
2nd row미정빌라
3rd row(무명)
4th row(무명)
5th row(무명)
ValueCountFrequency (%)
무명 680
 
23.1%
단독주택 17
 
0.6%
리치빌 12
 
0.4%
해피하우스 11
 
0.4%
로즈빌 8
 
0.3%
베네치아 8
 
0.3%
행복빌 7
 
0.2%
다온빌 7
 
0.2%
아이파크 7
 
0.2%
에이스빌 7
 
0.2%
Other values (1736) 2175
74.0%
2023-12-12T15:22:33.564511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
956
 
7.8%
731
 
5.9%
( 695
 
5.6%
) 695
 
5.6%
689
 
5.6%
511
 
4.1%
350
 
2.8%
320
 
2.6%
274
 
2.2%
251
 
2.0%
Other values (503) 6853
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10498
85.2%
Open Punctuation 695
 
5.6%
Close Punctuation 695
 
5.6%
Uppercase Letter 181
 
1.5%
Decimal Number 120
 
1.0%
Space Separator 83
 
0.7%
Lowercase Letter 28
 
0.2%
Dash Punctuation 16
 
0.1%
Other Punctuation 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
956
 
9.1%
731
 
7.0%
689
 
6.6%
511
 
4.9%
350
 
3.3%
320
 
3.0%
274
 
2.6%
251
 
2.4%
212
 
2.0%
188
 
1.8%
Other values (451) 6016
57.3%
Uppercase Letter
ValueCountFrequency (%)
I 36
19.9%
S 22
12.2%
A 16
 
8.8%
B 14
 
7.7%
E 10
 
5.5%
H 10
 
5.5%
L 9
 
5.0%
J 9
 
5.0%
T 8
 
4.4%
K 7
 
3.9%
Other values (12) 40
22.1%
Lowercase Letter
ValueCountFrequency (%)
e 6
21.4%
s 4
14.3%
o 3
10.7%
b 2
 
7.1%
t 2
 
7.1%
k 2
 
7.1%
h 2
 
7.1%
c 1
 
3.6%
u 1
 
3.6%
i 1
 
3.6%
Other values (4) 4
14.3%
Decimal Number
ValueCountFrequency (%)
1 29
24.2%
2 25
20.8%
3 14
11.7%
0 11
 
9.2%
4 9
 
7.5%
8 8
 
6.7%
5 8
 
6.7%
6 7
 
5.8%
9 6
 
5.0%
7 3
 
2.5%
Other Punctuation
ValueCountFrequency (%)
. 5
55.6%
& 4
44.4%
Open Punctuation
ValueCountFrequency (%)
( 695
100.0%
Close Punctuation
ValueCountFrequency (%)
) 695
100.0%
Space Separator
ValueCountFrequency (%)
83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10498
85.2%
Common 1618
 
13.1%
Latin 209
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
956
 
9.1%
731
 
7.0%
689
 
6.6%
511
 
4.9%
350
 
3.3%
320
 
3.0%
274
 
2.6%
251
 
2.4%
212
 
2.0%
188
 
1.8%
Other values (451) 6016
57.3%
Latin
ValueCountFrequency (%)
I 36
17.2%
S 22
 
10.5%
A 16
 
7.7%
B 14
 
6.7%
E 10
 
4.8%
H 10
 
4.8%
L 9
 
4.3%
J 9
 
4.3%
T 8
 
3.8%
K 7
 
3.3%
Other values (26) 68
32.5%
Common
ValueCountFrequency (%)
( 695
43.0%
) 695
43.0%
83
 
5.1%
1 29
 
1.8%
2 25
 
1.5%
- 16
 
1.0%
3 14
 
0.9%
0 11
 
0.7%
4 9
 
0.6%
8 8
 
0.5%
Other values (6) 33
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10498
85.2%
ASCII 1827
 
14.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
956
 
9.1%
731
 
7.0%
689
 
6.6%
511
 
4.9%
350
 
3.3%
320
 
3.0%
274
 
2.6%
251
 
2.4%
212
 
2.0%
188
 
1.8%
Other values (451) 6016
57.3%
ASCII
ValueCountFrequency (%)
( 695
38.0%
) 695
38.0%
83
 
4.5%
I 36
 
2.0%
1 29
 
1.6%
2 25
 
1.4%
S 22
 
1.2%
- 16
 
0.9%
A 16
 
0.9%
3 14
 
0.8%
Other values (42) 196
 
10.7%

주택유형구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.5 KiB
다가구주택
2771 
오피스텔
 
88

Length

Max length5
Median length5
Mean length4.96922
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row다가구주택
2nd row다가구주택
3rd row다가구주택
4th row다가구주택
5th row다가구주택

Common Values

ValueCountFrequency (%)
다가구주택 2771
96.9%
오피스텔 88
 
3.1%

Length

2023-12-12T15:22:33.741003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:33.838861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
다가구주택 2771
96.9%
오피스텔 88
 
3.1%

관리부서
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.5 KiB
건축과
2859 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건축과
2nd row건축과
3rd row건축과
4th row건축과
5th row건축과

Common Values

ValueCountFrequency (%)
건축과 2859
100.0%

Length

2023-12-12T15:22:33.957322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:34.077614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건축과 2859
100.0%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size22.5 KiB
Minimum2022-12-31 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T15:22:34.157899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:34.270928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T15:22:30.929186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:30.688374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:31.028001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:30.789368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:22:34.374177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도주택유형구분
위도1.0000.7950.344
경도0.7951.0000.210
주택유형구분0.3440.2101.000
2023-12-12T15:22:34.509451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도주택유형구분
위도1.000-0.0960.264
경도-0.0961.0000.161
주택유형구분0.2640.1611.000

Missing values

2023-12-12T15:22:31.139977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:22:31.247013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위도경도건물위치건물명주택유형구분관리부서기준일자
035.85052128.571661대구광역시 달서구 성당동 54-19양지빌다가구주택건축과2022-12-31
135.850818128.571519대구광역시 달서구 성당동 55-3미정빌라다가구주택건축과2022-12-31
235.850925128.571775대구광역시 달서구 성당동 55-13(무명)다가구주택건축과2022-12-31
335.851253128.57098대구광역시 달서구 성당동 56-21(무명)다가구주택건축과2022-12-31
435.840343128.554596대구광역시 달서구 성당동 384-6(무명)다가구주택건축과2022-12-31
535.839919128.554792대구광역시 달서구 성당동 385-1(무명)다가구주택건축과2022-12-31
635.849416128.568719대구광역시 달서구 성당동 81-65남진빌다가구주택건축과2022-12-31
735.848633128.568481대구광역시 달서구 성당동 81-89위드다가구주택건축과2022-12-31
835.837804128.554125대구광역시 달서구 성당동 275-0썬하우스다가구주택건축과2022-12-31
935.838625128.553942대구광역시 달서구 성당동 348-0(무명)다가구주택건축과2022-12-31
위도경도건물위치건물명주택유형구분관리부서기준일자
284935.843864128.489075대구광역시 달서구 호산동 716-5(무명)오피스텔건축과2022-12-31
285035.859684128.564519대구광역시 달서구 두류동 100-8대경오피스텔오피스텔건축과2022-12-31
285135.842775128.486434대구광역시 달서구 호산동 708-15꿈에그린오피스텔오피스텔건축과2022-12-31
285235.842464128.487527대구광역시 달서구 호산동 709-8아이파크오피스텔건축과2022-12-31
285335.842668128.532904대구광역시 달서구 장기동 552-4코지하우스오피스텔건축과2022-12-31
285435.844226128.488867대구광역시 달서구 호산동 716-31노블오피스텔오피스텔건축과2022-12-31
285535.844224128.489155대구광역시 달서구 호산동 716-32플로라오피스텔오피스텔건축과2022-12-31
285635.843365128.490024대구광역시 달서구 호산동 715-7태림오피스텔오피스텔건축과2022-12-31
285735.842658128.533026대구광역시 달서구 장기동 552-5로뎀빌오피스텔건축과2022-12-31
285835.835849128.555893대구광역시 달서구 송현동 1033-4 외1필지루지움오피스텔건축과2022-12-31

Duplicate rows

Most frequently occurring

위도경도건물위치건물명주택유형구분관리부서기준일자# duplicates
235.814148128.518969대구광역시 달서구 유천동 120-1진천역 화성파크 리젠시 2단지오피스텔건축과2022-12-313
035.808345128.512337대구광역시 달서구 대곡동 37-0(무명)다가구주택건축과2022-12-312
135.812898128.520333대구광역시 달서구 진천동 670-0(무명)다가구주택건축과2022-12-312
335.843354128.554473대구광역시 달서구 성당동 645-2(무명)다가구주택건축과2022-12-312
435.844151128.553626대구광역시 달서구 성당동 541-2(무명)다가구주택건축과2022-12-312
535.847421128.546121대구광역시 달서구 성당동 695-140(무명)다가구주택건축과2022-12-312
635.847513128.541849대구광역시 달서구 감삼동 188-22(무명)다가구주택건축과2022-12-312
735.848423128.515341대구광역시 달서구 장기동 180-26(무명)다가구주택건축과2022-12-312
835.857862128.497686대구광역시 달서구 신당동 156-0(무명)다가구주택건축과2022-12-312