Overview

Dataset statistics

Number of variables6
Number of observations154
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.9 KiB
Average record size in memory52.9 B

Variable types

Numeric4
Text2

Dataset

Description대전광역시 동구 공동주택 현황에 대한 데이터로, 공동주택명, 주소, 층수, 동수, 세대수등의 데이터를 제공하고있습니다.
URLhttps://www.data.go.kr/data/15013553/fileData.do

Alerts

연번 is highly overall correlated with 층수 and 2 other fieldsHigh correlation
층수 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
동수 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
세대수 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:38:10.339794
Analysis finished2023-12-12 23:38:12.201055
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct154
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean77.5
Minimum1
Maximum154
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T08:38:12.263389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.65
Q139.25
median77.5
Q3115.75
95-th percentile146.35
Maximum154
Range153
Interquartile range (IQR)76.5

Descriptive statistics

Standard deviation44.600075
Coefficient of variation (CV)0.57548484
Kurtosis-1.2
Mean77.5
Median Absolute Deviation (MAD)38.5
Skewness0
Sum11935
Variance1989.1667
MonotonicityStrictly increasing
2023-12-13T08:38:12.402041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
107 1
 
0.6%
100 1
 
0.6%
101 1
 
0.6%
102 1
 
0.6%
103 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
108 1
 
0.6%
Other values (144) 144
93.5%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
154 1
0.6%
153 1
0.6%
152 1
0.6%
151 1
0.6%
150 1
0.6%
149 1
0.6%
148 1
0.6%
147 1
0.6%
146 1
0.6%
145 1
0.6%
Distinct150
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T08:38:12.714762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length6.8831169
Min length2

Characters and Unicode

Total characters1060
Distinct characters202
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique147 ?
Unique (%)95.5%

Sample

1st row현대아파트
2nd row용전서민아파트
3rd row삼성아파트A동
4th row삼성아파트B동
5th row가양글로벌아파트
ValueCountFrequency (%)
주공 6
 
3.5%
대호아파트 3
 
1.7%
평화아파트 2
 
1.2%
평화맨션 2
 
1.2%
주공아파트 2
 
1.2%
올인하우징 2
 
1.2%
인동누리보듬아파트 2
 
1.2%
용운 1
 
0.6%
현대아파트 1
 
0.6%
용운2고층 1
 
0.6%
Other values (150) 150
87.2%
2023-12-13T08:38:13.529144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
 
8.7%
91
 
8.6%
84
 
7.9%
29
 
2.7%
18
 
1.7%
1 18
 
1.7%
18
 
1.7%
18
 
1.7%
17
 
1.6%
2 17
 
1.6%
Other values (192) 658
62.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 957
90.3%
Decimal Number 55
 
5.2%
Space Separator 18
 
1.7%
Open Punctuation 9
 
0.8%
Close Punctuation 9
 
0.8%
Other Punctuation 4
 
0.4%
Uppercase Letter 4
 
0.4%
Lowercase Letter 2
 
0.2%
Math Symbol 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
92
 
9.6%
91
 
9.5%
84
 
8.8%
29
 
3.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (173) 560
58.5%
Decimal Number
ValueCountFrequency (%)
1 18
32.7%
2 17
30.9%
3 8
14.5%
5 4
 
7.3%
4 4
 
7.3%
6 2
 
3.6%
9 1
 
1.8%
8 1
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
K 1
25.0%
S 1
25.0%
B 1
25.0%
A 1
25.0%
Space Separator
ValueCountFrequency (%)
18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 957
90.3%
Common 97
 
9.2%
Latin 6
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
92
 
9.6%
91
 
9.5%
84
 
8.8%
29
 
3.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (173) 560
58.5%
Common
ValueCountFrequency (%)
18
18.6%
1 18
18.6%
2 17
17.5%
( 9
9.3%
) 9
9.3%
3 8
8.2%
5 4
 
4.1%
4 4
 
4.1%
, 4
 
4.1%
6 2
 
2.1%
Other values (4) 4
 
4.1%
Latin
ValueCountFrequency (%)
e 2
33.3%
K 1
16.7%
S 1
16.7%
B 1
16.7%
A 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 957
90.3%
ASCII 103
 
9.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
92
 
9.6%
91
 
9.5%
84
 
8.8%
29
 
3.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (173) 560
58.5%
ASCII
ValueCountFrequency (%)
18
17.5%
1 18
17.5%
2 17
16.5%
( 9
8.7%
) 9
8.7%
3 8
7.8%
5 4
 
3.9%
4 4
 
3.9%
, 4
 
3.9%
6 2
 
1.9%
Other values (9) 10
9.7%

위치
Text

Distinct153
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T08:38:14.010485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length23.714286
Min length19

Characters and Unicode

Total characters3652
Distinct characters87
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)98.7%

Sample

1st row대전광역시 동구 계족로443번길 24(용전동)
2nd row대전광역시 동구 계족로 485-35(용전동)
3rd row대전광역시 동구 선화로 183(삼성동)
4th row대전광역시 동구 대전천동로 612(삼성동)
5th row대전광역시 동구 동서대로1778번길 122(가양동)
ValueCountFrequency (%)
대전광역시 154
24.8%
동구 154
24.8%
대전로 11
 
1.8%
은어송로 5
 
0.8%
계족로 5
 
0.8%
우암로 4
 
0.6%
대흥로202번길 4
 
0.6%
동서대로 4
 
0.6%
대전천동로 4
 
0.6%
산내로 3
 
0.5%
Other values (237) 274
44.1%
2023-12-13T08:38:14.539169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
469
 
12.8%
352
 
9.6%
225
 
6.2%
207
 
5.7%
158
 
4.3%
154
 
4.2%
154
 
4.2%
154
 
4.2%
) 153
 
4.2%
( 153
 
4.2%
Other values (77) 1473
40.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2213
60.6%
Decimal Number 641
 
17.6%
Space Separator 469
 
12.8%
Close Punctuation 153
 
4.2%
Open Punctuation 153
 
4.2%
Dash Punctuation 23
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
352
15.9%
225
10.2%
207
9.4%
158
 
7.1%
154
 
7.0%
154
 
7.0%
154
 
7.0%
152
 
6.9%
92
 
4.2%
90
 
4.1%
Other values (63) 475
21.5%
Decimal Number
ValueCountFrequency (%)
1 114
17.8%
2 87
13.6%
5 71
11.1%
7 60
9.4%
3 59
9.2%
4 58
9.0%
0 56
8.7%
6 54
8.4%
8 45
 
7.0%
9 37
 
5.8%
Space Separator
ValueCountFrequency (%)
469
100.0%
Close Punctuation
ValueCountFrequency (%)
) 153
100.0%
Open Punctuation
ValueCountFrequency (%)
( 153
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2213
60.6%
Common 1439
39.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
352
15.9%
225
10.2%
207
9.4%
158
 
7.1%
154
 
7.0%
154
 
7.0%
154
 
7.0%
152
 
6.9%
92
 
4.2%
90
 
4.1%
Other values (63) 475
21.5%
Common
ValueCountFrequency (%)
469
32.6%
) 153
 
10.6%
( 153
 
10.6%
1 114
 
7.9%
2 87
 
6.0%
5 71
 
4.9%
7 60
 
4.2%
3 59
 
4.1%
4 58
 
4.0%
0 56
 
3.9%
Other values (4) 159
 
11.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2213
60.6%
ASCII 1439
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
469
32.6%
) 153
 
10.6%
( 153
 
10.6%
1 114
 
7.9%
2 87
 
6.0%
5 71
 
4.9%
7 60
 
4.2%
3 59
 
4.1%
4 58
 
4.0%
0 56
 
3.9%
Other values (4) 159
 
11.0%
Hangul
ValueCountFrequency (%)
352
15.9%
225
10.2%
207
9.4%
158
 
7.1%
154
 
7.0%
154
 
7.0%
154
 
7.0%
152
 
6.9%
92
 
4.2%
90
 
4.1%
Other values (63) 475
21.5%

층수
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)16.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.746753
Minimum3
Maximum34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T08:38:14.682458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile5
Q15
median10
Q315
95-th percentile27.7
Maximum34
Range31
Interquartile range (IQR)10

Descriptive statistics

Standard deviation7.7918691
Coefficient of variation (CV)0.61128265
Kurtosis-0.14932338
Mean12.746753
Median Absolute Deviation (MAD)5
Skewness0.86843696
Sum1963
Variance60.713225
MonotonicityNot monotonic
2023-12-13T08:38:14.821838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
5 34
22.1%
15 28
18.2%
9 16
10.4%
7 11
 
7.1%
25 8
 
5.2%
8 5
 
3.2%
20 5
 
3.2%
18 5
 
3.2%
4 4
 
2.6%
10 4
 
2.6%
Other values (16) 34
22.1%
ValueCountFrequency (%)
3 2
 
1.3%
4 4
 
2.6%
5 34
22.1%
6 2
 
1.3%
7 11
 
7.1%
8 5
 
3.2%
9 16
10.4%
10 4
 
2.6%
11 3
 
1.9%
12 3
 
1.9%
ValueCountFrequency (%)
34 1
 
0.6%
33 1
 
0.6%
32 1
 
0.6%
31 3
 
1.9%
29 2
 
1.3%
27 3
 
1.9%
25 8
5.2%
24 3
 
1.9%
23 3
 
1.9%
21 1
 
0.6%

동수
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)11.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5519481
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T08:38:14.957608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q37
95-th percentile15
Maximum20
Range19
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.9309509
Coefficient of variation (CV)1.0832617
Kurtosis0.48994499
Mean4.5519481
Median Absolute Deviation (MAD)1
Skewness1.3007899
Sum701
Variance24.314277
MonotonicityNot monotonic
2023-12-13T08:38:15.086001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1 70
45.5%
2 18
 
11.7%
3 9
 
5.8%
5 8
 
5.2%
9 6
 
3.9%
4 5
 
3.2%
16 5
 
3.2%
11 5
 
3.2%
13 4
 
2.6%
12 4
 
2.6%
Other values (8) 20
 
13.0%
ValueCountFrequency (%)
1 70
45.5%
2 18
 
11.7%
3 9
 
5.8%
4 5
 
3.2%
5 8
 
5.2%
6 3
 
1.9%
7 4
 
2.6%
8 2
 
1.3%
9 6
 
3.9%
10 3
 
1.9%
ValueCountFrequency (%)
20 1
 
0.6%
19 1
 
0.6%
16 5
3.2%
15 4
2.6%
14 2
 
1.3%
13 4
2.6%
12 4
2.6%
11 5
3.2%
10 3
1.9%
9 6
3.9%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct120
Distinct (%)77.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean356.01948
Minimum6
Maximum2414
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T08:38:15.243964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile16
Q141.5
median117.5
Q3615.5
95-th percentile1242.4
Maximum2414
Range2408
Interquartile range (IQR)574

Descriptive statistics

Standard deviation460.64335
Coefficient of variation (CV)1.2938712
Kurtosis4.7159418
Mean356.01948
Median Absolute Deviation (MAD)99
Skewness1.9995839
Sum54827
Variance212192.29
MonotonicityNot monotonic
2023-12-13T08:38:15.449506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40 5
 
3.2%
48 4
 
2.6%
19 4
 
2.6%
45 3
 
1.9%
110 3
 
1.9%
16 3
 
1.9%
38 3
 
1.9%
18 3
 
1.9%
60 2
 
1.3%
152 2
 
1.3%
Other values (110) 122
79.2%
ValueCountFrequency (%)
6 1
 
0.6%
7 2
1.3%
10 1
 
0.6%
12 1
 
0.6%
13 1
 
0.6%
16 3
1.9%
18 3
1.9%
19 4
2.6%
21 2
1.3%
22 1
 
0.6%
ValueCountFrequency (%)
2414 1
0.6%
2267 1
0.6%
2146 1
0.6%
1588 1
0.6%
1436 1
0.6%
1350 1
0.6%
1345 1
0.6%
1245 1
0.6%
1241 1
0.6%
1115 1
0.6%

Interactions

2023-12-13T08:38:11.621706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:10.572267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:10.892127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.201898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.775933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:10.660908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:10.974296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.291647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.869210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:10.738233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.043119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.380463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.950516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:10.818955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.131491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:38:11.465774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:38:15.593586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번층수동수세대수
연번1.0000.7940.5730.531
층수0.7941.0000.6060.614
동수0.5730.6061.0000.887
세대수0.5310.6140.8871.000
2023-12-13T08:38:15.683414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번층수동수세대수
연번1.0000.7140.6280.697
층수0.7141.0000.5800.745
동수0.6280.5801.0000.867
세대수0.6970.7450.8671.000

Missing values

2023-12-13T08:38:12.060863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:38:12.166829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번공동주택명위치층수동수세대수
01현대아파트대전광역시 동구 계족로443번길 24(용전동)4148
12용전서민아파트대전광역시 동구 계족로 485-35(용전동)42112
23삼성아파트A동대전광역시 동구 선화로 183(삼성동)5145
34삼성아파트B동대전광역시 동구 대전천동로 612(삼성동)5140
45가양글로벌아파트대전광역시 동구 동서대로1778번길 122(가양동)5140
56빌라맨션대전광역시 동구 대전천동로 618(삼성동)122120
67백조아파트대전광역시 동구 흥룡로71번길 109(가양동)4132
78대동연립대전광역시 동구 동대전로104번길 29-2(대동)3343
89가양시장아파트대전광역시 동구 매봉로 18(가양동)5148
910한산아파트대전광역시 동구 흥룡로 40(가양동)5145
연번공동주택명위치층수동수세대수
144145대전산내 (주공)대전광역시 동구 산내로 1352-25(낭월동)611624
145146대전삼성1(삼성타운1) (주공)대전광역시 동구 우암로 133(삼성동)235427
146147대전판암3 (주공)대전광역시 동구 옥천로 152-9(판암동)153768
147148대전판암4 (주공)대전광역시 동구 동부로 56-7(판암동)15152414
148149새들뫼휴먼시아2단지대전광역시 동구 계족로 135(대동)205375
149150석천들주공아파트 (주공)대전광역시 동구 산내로1257번길 40(낭월동)18151072
150151은어송마을4단지대전광역시 동구 은어송로 116(가오동)146307
151152인동누리보듬아파트 (도시공사)대전광역시 동구 계족로51번길 38(인동)74244
152153인동참좋은대전광역시 동구 대전로 706(인동)151280
153154천동휴먼시아1단지 (주공)대전광역시 동구 대전로542번길 78(천동)239946