Overview

Dataset statistics

Number of variables11
Number of observations921
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory83.8 KiB
Average record size in memory93.1 B

Variable types

Numeric3
Categorical7
DateTime1

Dataset

Description대전광역시 유성구 빈집현황 에 대한 데이터로 주택구분, 시군구코드, 법정동코드, 법정동이름 등의 항목을 제공합니다.
Author대전광역시 유성구
URLhttps://www.data.go.kr/data/15111165/fileData.do

Alerts

시도코드 has constant value ""Constant
시도이름 has constant value ""Constant
시군구코드 has constant value ""Constant
시군구이름 has constant value ""Constant
기준일자 has constant value ""Constant
행정동이름 is highly overall correlated with 행정동코드 and 2 other fieldsHigh correlation
법정동이름 is highly overall correlated with 번호 and 4 other fieldsHigh correlation
번호 is highly overall correlated with 행정동코드 and 2 other fieldsHigh correlation
행정동코드 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
법정동코드 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
주택구분 is highly overall correlated with 법정동이름High correlation
행정동이름 is highly imbalanced (58.0%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:06:26.107442
Analysis finished2023-12-12 02:06:28.248074
Duration2.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct921
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean461
Minimum1
Maximum921
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB
2023-12-12T11:06:28.366107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile47
Q1231
median461
Q3691
95-th percentile875
Maximum921
Range920
Interquartile range (IQR)460

Descriptive statistics

Standard deviation266.0141
Coefficient of variation (CV)0.57703709
Kurtosis-1.2
Mean461
Median Absolute Deviation (MAD)230
Skewness0
Sum424581
Variance70763.5
MonotonicityNot monotonic
2023-12-12T11:06:28.568535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
605 1
 
0.1%
607 1
 
0.1%
608 1
 
0.1%
609 1
 
0.1%
610 1
 
0.1%
611 1
 
0.1%
612 1
 
0.1%
613 1
 
0.1%
614 1
 
0.1%
Other values (911) 911
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
921 1
0.1%
920 1
0.1%
919 1
0.1%
918 1
0.1%
917 1
0.1%
916 1
0.1%
915 1
0.1%
914 1
0.1%
913 1
0.1%
912 1
0.1%

주택구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
공동주택(다세대/연립)
678 
단독주택(다가구)
168 
그 외 주택
 
66
아파트
 
9

Length

Max length12
Median length12
Mean length10.934853
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단독주택(다가구)
2nd row공동주택(다세대/연립)
3rd row공동주택(다세대/연립)
4th row그 외 주택
5th row단독주택(다가구)

Common Values

ValueCountFrequency (%)
공동주택(다세대/연립) 678
73.6%
단독주택(다가구) 168
 
18.2%
그 외 주택 66
 
7.2%
아파트 9
 
1.0%

Length

2023-12-12T11:06:28.758295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:06:28.884439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공동주택(다세대/연립 678
64.4%
단독주택(다가구 168
 
16.0%
66
 
6.3%
66
 
6.3%
주택 66
 
6.3%
아파트 9
 
0.9%

시도코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
3000000000
921 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3000000000
2nd row3000000000
3rd row3000000000
4th row3000000000
5th row3000000000

Common Values

ValueCountFrequency (%)
3000000000 921
100.0%

Length

2023-12-12T11:06:29.022169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:06:29.139734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3000000000 921
100.0%

시도이름
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
대전광역시
921 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전광역시
2nd row대전광역시
3rd row대전광역시
4th row대전광역시
5th row대전광역시

Common Values

ValueCountFrequency (%)
대전광역시 921
100.0%

Length

2023-12-12T11:06:29.245867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:06:29.354973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대전광역시 921
100.0%

시군구코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
3020000000
921 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3020000000
2nd row3020000000
3rd row3020000000
4th row3020000000
5th row3020000000

Common Values

ValueCountFrequency (%)
3020000000 921
100.0%

Length

2023-12-12T11:06:29.455681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:06:29.544023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3020000000 921
100.0%

시군구이름
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
유성구
921 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유성구
2nd row유성구
3rd row유성구
4th row유성구
5th row유성구

Common Values

ValueCountFrequency (%)
유성구 921
100.0%

Length

2023-12-12T11:06:29.633402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:06:29.731331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유성구 921
100.0%

행정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0200525 × 109
Minimum3.020052 × 109
Maximum3.02006 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB
2023-12-12T11:06:29.831457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.020052 × 109
5-th percentile3.020052 × 109
Q13.020052 × 109
median3.020052 × 109
Q33.0200527 × 109
95-th percentile3.020055 × 109
Maximum3.02006 × 109
Range8000
Interquartile range (IQR)700

Descriptive statistics

Standard deviation1204.2593
Coefficient of variation (CV)3.9875442 × 10-7
Kurtosis15.517512
Mean3.0200525 × 109
Median Absolute Deviation (MAD)0
Skewness3.7529486
Sum2.7814683 × 1012
Variance1450240.4
MonotonicityNot monotonic
2023-12-12T11:06:29.965443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
3020052000 672
73.0%
3020052700 111
 
12.1%
3020053000 52
 
5.6%
3020054000 21
 
2.3%
3020055000 21
 
2.3%
3020058000 17
 
1.8%
3020054700 8
 
0.9%
3020057000 6
 
0.7%
3020052600 5
 
0.5%
3020060000 5
 
0.5%
Other values (2) 3
 
0.3%
ValueCountFrequency (%)
3020052000 672
73.0%
3020052600 5
 
0.5%
3020052700 111
 
12.1%
3020053000 52
 
5.6%
3020054000 21
 
2.3%
3020054600 1
 
0.1%
3020054700 8
 
0.9%
3020054800 2
 
0.2%
3020055000 21
 
2.3%
3020057000 6
 
0.7%
ValueCountFrequency (%)
3020060000 5
 
0.5%
3020058000 17
 
1.8%
3020057000 6
 
0.7%
3020055000 21
 
2.3%
3020054800 2
 
0.2%
3020054700 8
 
0.9%
3020054600 1
 
0.1%
3020054000 21
 
2.3%
3020053000 52
5.6%
3020052700 111
12.1%

행정동이름
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct12
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
진잠동
672 
상대동
111 
온천1동
 
52
온천2동
 
21
신성동
 
21
Other values (7)
 
44

Length

Max length4
Median length3
Mean length3.0912052
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row진잠동
2nd row진잠동
3rd row진잠동
4th row진잠동
5th row진잠동

Common Values

ValueCountFrequency (%)
진잠동 672
73.0%
상대동 111
 
12.1%
온천1동 52
 
5.6%
온천2동 21
 
2.3%
신성동 21
 
2.3%
구즉동 17
 
1.8%
노은2동 8
 
0.9%
전민동 6
 
0.7%
학하동 5
 
0.5%
관평동 5
 
0.5%
Other values (2) 3
 
0.3%

Length

2023-12-12T11:06:30.108834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
진잠동 672
73.0%
상대동 111
 
12.1%
온천1동 52
 
5.6%
온천2동 21
 
2.3%
신성동 21
 
2.3%
구즉동 17
 
1.8%
노은2동 8
 
0.9%
전민동 6
 
0.7%
학하동 5
 
0.5%
관평동 5
 
0.5%
Other values (2) 3
 
0.3%

법정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct39
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0200108 × 109
Minimum3.0200101 × 109
Maximum3.0200153 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.2 KiB
2023-12-12T11:06:30.252228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.0200101 × 109
5-th percentile3.0200103 × 109
Q13.0200104 × 109
median3.0200104 × 109
Q33.0200112 × 109
95-th percentile3.0200127 × 109
Maximum3.0200153 × 109
Range5200
Interquartile range (IQR)800

Descriptive statistics

Standard deviation940.41823
Coefficient of variation (CV)3.1139565 × 10-7
Kurtosis7.4892652
Mean3.0200108 × 109
Median Absolute Deviation (MAD)100
Skewness2.6498936
Sum2.78143 × 1012
Variance884386.44
MonotonicityNot monotonic
2023-12-12T11:06:30.403221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
3020010400 457
49.6%
3020010300 158
 
17.2%
3020011600 110
 
11.9%
3020011200 45
 
4.9%
3020011700 15
 
1.6%
3020010900 11
 
1.2%
3020010100 10
 
1.1%
3020010800 10
 
1.1%
3020010700 9
 
1.0%
3020011000 9
 
1.0%
Other values (29) 87
 
9.4%
ValueCountFrequency (%)
3020010100 10
 
1.1%
3020010200 8
 
0.9%
3020010300 158
 
17.2%
3020010400 457
49.6%
3020010500 3
 
0.3%
3020010600 2
 
0.2%
3020010700 9
 
1.0%
3020010800 10
 
1.1%
3020010900 11
 
1.2%
3020011000 9
 
1.0%
ValueCountFrequency (%)
3020015300 2
 
0.2%
3020015200 1
 
0.1%
3020015000 5
0.5%
3020014900 5
0.5%
3020014700 1
 
0.1%
3020014600 3
0.3%
3020014500 3
0.3%
3020014400 1
 
0.1%
3020014300 1
 
0.1%
3020014200 4
0.4%

법정동이름
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
용계동
457 
대정동
158 
복용동
110 
구암동
 
45
장대동
 
15
Other values (34)
136 

Length

Max length3
Median length3
Mean length2.9652552
Min length2

Unique

Unique9 ?
Unique (%)1.0%

Sample

1st row원내동
2nd row원내동
3rd row원내동
4th row원내동
5th row원내동

Common Values

ValueCountFrequency (%)
용계동 457
49.6%
대정동 158
 
17.2%
복용동 110
 
11.9%
구암동 45
 
4.9%
장대동 15
 
1.6%
송정동 11
 
1.2%
원내동 10
 
1.1%
세동 10
 
1.1%
방동 9
 
1.0%
성북동 9
 
1.0%
Other values (29) 87
 
9.4%

Length

2023-12-12T11:06:30.627719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
용계동 457
49.6%
대정동 158
 
17.2%
복용동 110
 
11.9%
구암동 45
 
4.9%
장대동 15
 
1.6%
송정동 11
 
1.2%
원내동 10
 
1.1%
세동 10
 
1.1%
성북동 9
 
1.0%
방동 9
 
1.0%
Other values (29) 87
 
9.4%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.3 KiB
Minimum2021-10-01 00:00:00
Maximum2021-10-01 00:00:00
2023-12-12T11:06:30.770056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:31.151386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T11:06:27.474908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:26.554610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:27.012388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:27.611243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:26.676830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:27.151002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:27.752662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:26.861124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:06:27.319526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:06:31.227703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호주택구분행정동코드행정동이름법정동코드법정동이름
번호1.0000.4910.8170.7990.8820.917
주택구분0.4911.0000.5190.6320.6550.809
행정동코드0.8170.5191.0001.0000.8981.000
행정동이름0.7990.6321.0001.0000.9241.000
법정동코드0.8820.6550.8980.9241.0001.000
법정동이름0.9170.8091.0001.0001.0001.000
2023-12-12T11:06:31.341888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동이름주택구분법정동이름
행정동이름1.0000.3370.985
주택구분0.3371.0000.548
법정동이름0.9850.5481.000
2023-12-12T11:06:31.461190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호행정동코드법정동코드주택구분행정동이름법정동이름
번호1.0000.7700.9330.3140.4970.617
행정동코드0.7701.0000.8250.3080.9970.982
법정동코드0.9330.8251.0000.4490.7340.984
주택구분0.3140.3080.4491.0000.3370.548
행정동이름0.4970.9970.7340.3371.0000.985
법정동이름0.6170.9820.9840.5480.9851.000

Missing values

2023-12-12T11:06:27.933897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:06:28.159669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호주택구분시도코드시도이름시군구코드시군구이름행정동코드행정동이름법정동코드법정동이름기준일자
01단독주택(다가구)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
12공동주택(다세대/연립)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
23공동주택(다세대/연립)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
34그 외 주택3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
45단독주택(다가구)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
56단독주택(다가구)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
67공동주택(다세대/연립)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
78공동주택(다세대/연립)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
89단독주택(다가구)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
910단독주택(다가구)3000000000대전광역시3020000000유성구3020052000진잠동3020010100원내동2021-10-01
번호주택구분시도코드시도이름시군구코드시군구이름행정동코드행정동이름법정동코드법정동이름기준일자
911912단독주택(다가구)3000000000대전광역시3020000000유성구3020058000구즉동3020014900대동2021-10-01
912913단독주택(다가구)3000000000대전광역시3020000000유성구3020058000구즉동3020014900대동2021-10-01
913914단독주택(다가구)3000000000대전광역시3020000000유성구3020058000구즉동3020015000금탄동2021-10-01
914915단독주택(다가구)3000000000대전광역시3020000000유성구3020058000구즉동3020015000금탄동2021-10-01
915916그 외 주택3000000000대전광역시3020000000유성구3020058000구즉동3020015000금탄동2021-10-01
916917그 외 주택3000000000대전광역시3020000000유성구3020058000구즉동3020015000금탄동2021-10-01
917918그 외 주택3000000000대전광역시3020000000유성구3020058000구즉동3020015000금탄동2021-10-01
918919그 외 주택3000000000대전광역시3020000000유성구3020058000구즉동3020015200둔곡동2021-10-01
919920단독주택(다가구)3000000000대전광역시3020000000유성구3020058000구즉동3020015300구룡동2021-10-01
920921그 외 주택3000000000대전광역시3020000000유성구3020058000구즉동3020015300구룡동2021-10-01