Overview

Dataset statistics

Number of variables: 6
Number of observations: 246
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.9 KiB
Average record size in memory: 49.5 B
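
The average record size follows from the total size and the number of observations. A minimal check, assuming the reported 11.9 KiB uses 1 KiB = 1024 bytes (the variable names are illustrative, not taken from the tool):

```python
# Values copied from the dataset statistics above.
total_size_kib = 11.9
n_observations = 246

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 49.5 B shown in the report.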

Variable types

Text: 3
Numeric: 1
DateTime: 1
Categorical: 1

Dataset

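Description: Data on the status of multi-unit housing located in Yuseong-gu, Daejeon Metropolitan City. It provides the apartment name, lot-number address, road-name address, number of households, and related fields.
URL: https://www.data.go.kr/data/15013653/fileData.do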

Alerts

소재지지번주소 (lot-number address) has unique values (Unique)
소재지도로명주소 (road-name address) has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 22:49:26.182955
Analysis finished: 2023-12-11 22:49:26.727446
Duration: 0.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
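
A report of this kind can be regenerated roughly as follows. This is a sketch, not the exact procedure used here: it assumes `pandas` and `ydata-profiling` are installed, and the function name `build_report`, the file paths, and the `cp949` encoding are assumptions rather than details taken from the report.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even if the libraries are missing.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded; pass encoding="utf-8" otherwise.
    # 사용승인일 is parsed as a date so it is profiled as a DateTime variable.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["사용승인일"])

    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
```

A call such as `build_report("apartments.csv", "report.html")` (hypothetical paths) would write the HTML report next to the script.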

Variables

공동주택명 (multi-unit housing name)
Text

Distinct: 245
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 22
Median length: 15
Mean length: 8.199187
Min length: 2

Characters and Unicode

Total characters: 2017
Distinct characters: 276
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
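
As an illustration of these character properties, the sketch below counts general categories with Python's standard `unicodedata` module. The helper name `category_counts` is illustrative, and the two strings are sample values from this report; the tool's own implementation may differ.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Sample values taken from the report.
name_counts = category_counts("연구원 현대아파트")
address_counts = category_counts("대전광역시 유성구 도룡동 392-2")

# Lo = Other Letter, Zs = Space Separator,
# Nd = Decimal Number, Pd = Dash Punctuation
print(dict(name_counts))
print(dict(address_counts))
```

Hangul syllables fall under `Lo` (Other Letter), which is why that category dominates the tables for the text variables.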

Unique

Unique: 244
Unique (%): 99.2%
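
The gap between 245 distinct and 244 unique values is consistent with a single name occurring twice among the 246 rows (244 + 2 = 246). The sketch below shows the distinction, assuming "distinct" counts different values and "unique" counts values that occur exactly once; the helper name and toy data are hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (count of distinct values, count of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Hypothetical toy column: "B" appears twice, so it is distinct but not unique.
toy = ["A", "B", "B", "C"]
print(distinct_and_unique(toy))  # (3, 2)
```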

Sample

1st row: 원자력연료사원아파트
2nd row: 우성아파트
3rd row: 연구원 현대아파트
4th row: 과기원교수아파트
5th row: 삼정하이츠아파트

Value | Count | Frequency (%)
대덕테크노밸리 | 12 | 3.4%
열매마을 | 11 | 3.2%
1단지 | 10 | 2.9%
2단지 | 9 | 2.6%
반석마을 | 8 | 2.3%
송림마을 | 6 | 1.7%
4단지 | 5 | 1.4%
3단지 | 4 | 1.1%
6단지 | 4 | 1.1%
5단지 | 4 | 1.1%
Other values (243) | 275 | 79.0%

Most occurring characters

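Value | Count | Frequency (%)
(space) | 298 | 14.8%
(not preserved) | 84 | 4.2%
(not preserved) | 68 | 3.4%
(not preserved) | 60 | 3.0%
(not preserved) | 59 | 2.9%
(not preserved) | 51 | 2.5%
(not preserved) | 48 | 2.4%
(not preserved) | 45 | 2.2%
(not preserved) | 39 | 1.9%
(not preserved) | 38 | 1.9%
Other values (266) | 1227 | 60.8%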

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1562 | 77.4%
Space Separator | 298 | 14.8%
Decimal Number | 118 | 5.9%
Uppercase Letter | 13 | 0.6%
Close Punctuation | 8 | 0.4%
Open Punctuation | 8 | 0.4%
Letter Number | 4 | 0.2%
Dash Punctuation | 2 | 0.1%
Other Punctuation | 2 | 0.1%
Lowercase Letter | 2 | 0.1%

Most frequent character per category

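Other Letter
Value | Count | Frequency (%)
(not preserved) | 84 | 5.4%
(not preserved) | 68 | 4.4%
(not preserved) | 60 | 3.8%
(not preserved) | 59 | 3.8%
(not preserved) | 51 | 3.3%
(not preserved) | 48 | 3.1%
(not preserved) | 45 | 2.9%
(not preserved) | 39 | 2.5%
(not preserved) | 38 | 2.4%
(not preserved) | 34 | 2.2%
Other values (235) | 1036 | 66.3%

Uppercase Letter
Value | Count | Frequency (%)
L | 2 | 15.4%
B | 1 | 7.7%
H | 1 | 7.7%
W | 1 | 7.7%
E | 1 | 7.7%
I | 1 | 7.7%
V | 1 | 7.7%
K | 1 | 7.7%
S | 1 | 7.7%
J | 1 | 7.7%
Other values (2) | 2 | 15.4%

Decimal Number
Value | Count | Frequency (%)
1 | 32 | 27.1%
2 | 29 | 24.6%
3 | 17 | 14.4%
4 | 11 | 9.3%
6 | 7 | 5.9%
5 | 7 | 5.9%
8 | 5 | 4.2%
7 | 4 | 3.4%
0 | 3 | 2.5%
9 | 3 | 2.5%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 2 | 50.0%
(not preserved) | 2 | 50.0%

Lowercase Letter
Value | Count | Frequency (%)
s | 1 | 50.0%
k | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 298 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 8 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 8 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 100.0%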

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1562 | 77.4%
Common | 436 | 21.6%
Latin | 19 | 0.9%

Most frequent character per script

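Hangul
Value | Count | Frequency (%)
(not preserved) | 84 | 5.4%
(not preserved) | 68 | 4.4%
(not preserved) | 60 | 3.8%
(not preserved) | 59 | 3.8%
(not preserved) | 51 | 3.3%
(not preserved) | 48 | 3.1%
(not preserved) | 45 | 2.9%
(not preserved) | 39 | 2.5%
(not preserved) | 38 | 2.4%
(not preserved) | 34 | 2.2%
Other values (235) | 1036 | 66.3%

Latin
Value | Count | Frequency (%)
(not preserved) | 2 | 10.5%
(not preserved) | 2 | 10.5%
L | 2 | 10.5%
B | 1 | 5.3%
H | 1 | 5.3%
W | 1 | 5.3%
E | 1 | 5.3%
I | 1 | 5.3%
V | 1 | 5.3%
K | 1 | 5.3%
Other values (6) | 6 | 31.6%

Common
Value | Count | Frequency (%)
(space) | 298 | 68.3%
1 | 32 | 7.3%
2 | 29 | 6.7%
3 | 17 | 3.9%
4 | 11 | 2.5%
) | 8 | 1.8%
( | 8 | 1.8%
6 | 7 | 1.6%
5 | 7 | 1.6%
8 | 5 | 1.1%
Other values (5) | 14 | 3.2%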

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1562 | 77.4%
ASCII | 451 | 22.4%
Number Forms | 4 | 0.2%

Most frequent character per block

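ASCII
Value | Count | Frequency (%)
(space) | 298 | 66.1%
1 | 32 | 7.1%
2 | 29 | 6.4%
3 | 17 | 3.8%
4 | 11 | 2.4%
) | 8 | 1.8%
( | 8 | 1.8%
6 | 7 | 1.6%
5 | 7 | 1.6%
8 | 5 | 1.1%
Other values (19) | 29 | 6.4%

Hangul
Value | Count | Frequency (%)
(not preserved) | 84 | 5.4%
(not preserved) | 68 | 4.4%
(not preserved) | 60 | 3.8%
(not preserved) | 59 | 3.8%
(not preserved) | 51 | 3.3%
(not preserved) | 48 | 3.1%
(not preserved) | 45 | 2.9%
(not preserved) | 39 | 2.5%
(not preserved) | 38 | 2.4%
(not preserved) | 34 | 2.2%
Other values (235) | 1036 | 66.3%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 2 | 50.0%
(not preserved) | 2 | 50.0%

소재지지번주소 (lot-number address)
Text

Distinct: 246
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB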

Length

Max length: 25
Median length: 24
Mean length: 18.398374
Min length: 15

Characters and Unicode

Total characters: 4526
Distinct characters: 65
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 246
Unique (%): 100.0%

Sample

1st row: 대전광역시 유성구 도룡동 392-2
2nd row: 대전광역시 유성구 도룡동 383-3
3rd row: 대전광역시 유성구 도룡동 431-6
4th row: 대전광역시 유성구 도룡동 383-2
5th row: 대전광역시 유성구 구암동 600-2

Value | Count | Frequency (%)
대전광역시 | 246 | 24.9%
유성구 | 246 | 24.9%
봉명동 | 73 | 7.4%
지족동 | 30 | 3.0%
도룡동 | 16 | 1.6%
장대동 | 10 | 1.0%
관평동 | 10 | 1.0%
용산동 | 9 | 0.9%
상대동 | 9 | 0.9%
궁동 | 8 | 0.8%
Other values (268) | 330 | 33.4%

Most occurring characters

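Value | Count | Frequency (%)
(space) | 743 | 16.4%
(not preserved) | 267 | 5.9%
(not preserved) | 254 | 5.6%
(not preserved) | 250 | 5.5%
(not preserved) | 249 | 5.5%
(not preserved) | 246 | 5.4%
(not preserved) | 246 | 5.4%
(not preserved) | 246 | 5.4%
(not preserved) | 246 | 5.4%
(not preserved) | 246 | 5.4%
Other values (55) | 1533 | 33.9%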

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2722 | 60.1%
Decimal Number | 914 | 20.2%
Space Separator | 743 | 16.4%
Dash Punctuation | 141 | 3.1%
Other Punctuation | 6 | 0.1%

Most frequent character per category

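Other Letter
Value | Count | Frequency (%)
(not preserved) | 267 | 9.8%
(not preserved) | 254 | 9.3%
(not preserved) | 250 | 9.2%
(not preserved) | 249 | 9.1%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 78 | 2.9%
Other values (42) | 394 | 14.5%

Decimal Number
Value | Count | Frequency (%)
6 | 143 | 15.6%
1 | 117 | 12.8%
4 | 110 | 12.0%
2 | 102 | 11.2%
5 | 96 | 10.5%
3 | 94 | 10.3%
8 | 75 | 8.2%
0 | 63 | 6.9%
9 | 63 | 6.9%
7 | 51 | 5.6%

Space Separator
Value | Count | Frequency (%)
(space) | 743 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 141 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 6 | 100.0%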

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2722 | 60.1%
Common | 1804 | 39.9%

Most frequent character per script

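Hangul
Value | Count | Frequency (%)
(not preserved) | 267 | 9.8%
(not preserved) | 254 | 9.3%
(not preserved) | 250 | 9.2%
(not preserved) | 249 | 9.1%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 78 | 2.9%
Other values (42) | 394 | 14.5%

Common
Value | Count | Frequency (%)
(space) | 743 | 41.2%
6 | 143 | 7.9%
- | 141 | 7.8%
1 | 117 | 6.5%
4 | 110 | 6.1%
2 | 102 | 5.7%
5 | 96 | 5.3%
3 | 94 | 5.2%
8 | 75 | 4.2%
0 | 63 | 3.5%
Other values (3) | 120 | 6.7%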

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2722 | 60.1%
ASCII | 1804 | 39.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 743 | 41.2%
6 | 143 | 7.9%
- | 141 | 7.8%
1 | 117 | 6.5%
4 | 110 | 6.1%
2 | 102 | 5.7%
5 | 96 | 5.3%
3 | 94 | 5.2%
8 | 75 | 4.2%
0 | 63 | 3.5%
Other values (3) | 120 | 6.7%

Hangul
Value | Count | Frequency (%)
(not preserved) | 267 | 9.8%
(not preserved) | 254 | 9.3%
(not preserved) | 250 | 9.2%
(not preserved) | 249 | 9.1%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 246 | 9.0%
(not preserved) | 78 | 2.9%
Other values (42) | 394 | 14.5%

소재지도로명주소 (road-name address)
Text

Distinct: 246
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 45
Median length: 40
Mean length: 35.345528
Min length: 1

Characters and Unicode

Total characters: 8695
Distinct characters: 271
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 246
Unique (%): 100.0%

Sample

1st row: 대전광역시 유성구 대덕대로577번길 51, (도룡동, 한전원전사택)
2nd row: 대전광역시 유성구 문지로 22, (도룡동, 우성아파트)
3rd row: 대전광역시 유성구 대덕대로541번길 68, (도룡동, 대덕연구원현대아파트)
4th row: 대전광역시 유성구 문지로 14, (도룡동, 카이스트교수아파트)
5th row: 대전광역시 유성구 유성대로668번길 29, (구암동, 삼정하이츠)

Value | Count | Frequency (%)
대전광역시 | 245 | 16.6%
유성구 | 245 | 16.6%
봉명동 | 74 | 5.0%
지족동 | 31 | 2.1%
온천북로33번길 | 21 | 1.4%
문화원로 | 17 | 1.2%
도룡동 | 16 | 1.1%
장대동 | 10 | 0.7%
관평동 | 10 | 0.7%
배울2로 | 10 | 0.7%
Other values (535) | 799 | 54.1%

Most occurring characters

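Value | Count | Frequency (%)
(space) | 1244 | 14.3%
, | 482 | 5.5%
(not preserved) | 359 | 4.1%
(not preserved) | 282 | 3.2%
(not preserved) | 278 | 3.2%
(not preserved) | 268 | 3.1%
(not preserved) | 267 | 3.1%
(not preserved) | 267 | 3.1%
(not preserved) | 258 | 3.0%
(not preserved) | 251 | 2.9%
Other values (261) | 4739 | 54.5%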

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5434 | 62.5%
Space Separator | 1244 | 14.3%
Decimal Number | 1000 | 11.5%
Other Punctuation | 482 | 5.5%
Close Punctuation | 245 | 2.8%
Open Punctuation | 245 | 2.8%
Dash Punctuation | 42 | 0.5%
Uppercase Letter | 3 | < 0.1%

Most frequent character per category

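Other Letter
Value | Count | Frequency (%)
(not preserved) | 359 | 6.6%
(not preserved) | 282 | 5.2%
(not preserved) | 278 | 5.1%
(not preserved) | 268 | 4.9%
(not preserved) | 267 | 4.9%
(not preserved) | 267 | 4.9%
(not preserved) | 258 | 4.7%
(not preserved) | 251 | 4.6%
(not preserved) | 246 | 4.5%
(not preserved) | 245 | 4.5%
Other values (243) | 2713 | 49.9%

Decimal Number
Value | Count | Frequency (%)
1 | 203 | 20.3%
3 | 170 | 17.0%
2 | 158 | 15.8%
6 | 84 | 8.4%
4 | 81 | 8.1%
5 | 72 | 7.2%
0 | 65 | 6.5%
9 | 62 | 6.2%
7 | 58 | 5.8%
8 | 47 | 4.7%

Uppercase Letter
Value | Count | Frequency (%)
K | 1 | 33.3%
S | 1 | 33.3%
E | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 1244 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 482 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 245 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 245 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 42 | 100.0%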

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5434 | 62.5%
Common | 3258 | 37.5%
Latin | 3 | < 0.1%

Most frequent character per script

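Hangul
Value | Count | Frequency (%)
(not preserved) | 359 | 6.6%
(not preserved) | 282 | 5.2%
(not preserved) | 278 | 5.1%
(not preserved) | 268 | 4.9%
(not preserved) | 267 | 4.9%
(not preserved) | 267 | 4.9%
(not preserved) | 258 | 4.7%
(not preserved) | 251 | 4.6%
(not preserved) | 246 | 4.5%
(not preserved) | 245 | 4.5%
Other values (243) | 2713 | 49.9%

Common
Value | Count | Frequency (%)
(space) | 1244 | 38.2%
, | 482 | 14.8%
) | 245 | 7.5%
( | 245 | 7.5%
1 | 203 | 6.2%
3 | 170 | 5.2%
2 | 158 | 4.8%
6 | 84 | 2.6%
4 | 81 | 2.5%
5 | 72 | 2.2%
Other values (5) | 274 | 8.4%

Latin
Value | Count | Frequency (%)
K | 1 | 33.3%
S | 1 | 33.3%
E | 1 | 33.3%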

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5434 | 62.5%
ASCII | 3261 | 37.5%

Most frequent character per block

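ASCII
Value | Count | Frequency (%)
(space) | 1244 | 38.1%
, | 482 | 14.8%
) | 245 | 7.5%
( | 245 | 7.5%
1 | 203 | 6.2%
3 | 170 | 5.2%
2 | 158 | 4.8%
6 | 84 | 2.6%
4 | 81 | 2.5%
5 | 72 | 2.2%
Other values (8) | 277 | 8.5%

Hangul
Value | Count | Frequency (%)
(not preserved) | 359 | 6.6%
(not preserved) | 282 | 5.2%
(not preserved) | 278 | 5.1%
(not preserved) | 268 | 4.9%
(not preserved) | 267 | 4.9%
(not preserved) | 267 | 4.9%
(not preserved) | 258 | 4.7%
(not preserved) | 251 | 4.6%
(not preserved) | 246 | 4.5%
(not preserved) | 245 | 4.5%
Other values (243) | 2713 | 49.9%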

세대수 (number of households)
Real number (ℝ)

Distinct: 187
Distinct (%): 76.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 434.64634
Minimum: 20
Maximum: 3958
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 20
5-th percentile: 25.5
Q1: 85
Median: 252.5
Q3: 658.75
95-th percentile: 1213.5
Maximum: 3958
Range: 3938
Interquartile range (IQR): 573.75
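
The range and IQR are differences of the quantiles listed above. The sketch below recomputes them and adds a small quantile helper that uses linear interpolation between neighbouring sorted values; that the report was produced with this interpolation method is an assumption, and `quantile_linear` and the toy data are hypothetical.

```python
def quantile_linear(sorted_values, q):
    """Quantile q (0..1) of pre-sorted data using linear interpolation."""
    if not sorted_values:
        raise ValueError("need at least one value")
    pos = q * (len(sorted_values) - 1)   # fractional index into the data
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Derived figures, using the quantiles reported above.
q1, q3 = 85, 658.75
minimum, maximum = 20, 3958
iqr = q3 - q1                    # 573.75
value_range = maximum - minimum  # 3938

# Hypothetical toy data to exercise the helper.
toy = [20, 21, 22, 24]
toy_median = quantile_linear(toy, 0.5)  # midway between 21 and 22
```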

Descriptive statistics

Standard deviation: 489.16251
Coefficient of variation (CV): 1.1254265
Kurtosis: 14.041119
Mean: 434.64634
Median Absolute Deviation (MAD): 219.5
Skewness: 2.8203265
Sum: 106923
Variance: 239279.96
Monotonicity: Not monotonic
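
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A quick consistency check on the reported (rounded) values, with illustrative variable names:

```python
# Values copied from the statistics above.
n = 246          # number of observations
total = 106923   # Sum
mean = 434.64634
std = 489.16251

mean_from_sum = total / n   # should match Mean
variance_from_std = std ** 2  # should match Variance (239279.96)
cv = std / mean             # should match CV (1.1254265)

print(round(mean_from_sum, 5), round(variance_from_std, 2), round(cv, 7))
```

Small differences in the last digit are expected because the inputs are already rounded.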
Histogram with fixed size bins (bins=50)
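
For readers unfamiliar with fixed-size bins, the sketch below splits the interval from the minimum to the maximum into equal-width bins and counts the values in each. It is a generic illustration: the helper name and toy values are hypothetical, and whether the tool places bin edges exactly this way is an assumption.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equal-width bins spanning [min, max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Degenerate case: all values identical, so use the first bin only.
        return [len(values)] + [0] * (bins - 1)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum sits on the right edge; clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


# Hypothetical toy values spanning the reported range of 세대수 (20 to 3958).
toy = [20, 28, 100, 3958]
counts = fixed_width_histogram(toy, bins=50)
bin_width = (3958 - 20) / 50  # width of each bin for that range
```

With the reported range of 3938 and 50 bins, each bin covers about 78.76 households.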

Common Values

Value | Count | Frequency (%)
28 | 6 | 2.4%
299 | 5 | 2.0%
42 | 5 | 2.0%
20 | 4 | 1.6%
150 | 4 | 1.6%
24 | 4 | 1.6%
112 | 4 | 1.6%
76 | 3 | 1.2%
80 | 3 | 1.2%
54 | 3 | 1.2%
Other values (177) | 205 | 83.3%

Minimum 10 values

Value | Count | Frequency (%)
20 | 4 | 1.6%
21 | 2 | 0.8%
22 | 1 | 0.4%
24 | 4 | 1.6%
25 | 2 | 0.8%
27 | 1 | 0.4%
28 | 6 | 2.4%
29 | 2 | 0.8%
30 | 1 | 0.4%
33 | 2 | 0.8%

Maximum 10 values

Value | Count | Frequency (%)
3958 | 1 | 0.4%
3144 | 1 | 0.4%
1966 | 1 | 0.4%
1830 | 1 | 0.4%
1828 | 1 | 0.4%
1668 | 1 | 0.4%
1647 | 1 | 0.4%
1460 | 1 | 0.4%
1320 | 1 | 0.4%
1306 | 1 | 0.4%

사용승인일 (approval-for-use date)
DateTime

Distinct: 219
Distinct (%): 89.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
Minimum: 1986-04-16 00:00:00
Maximum: 2023-03-31 00:00:00

Histogram with fixed size bins (bins=50)

공동주택구분 (multi-unit housing type)
Categorical

Distinct: 4
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Value | Count
아파트 | 140
도시형생활주택 | 72
주상복합 | 32
연립주택 | 2

Length

Max length: 7
Median length: 3
Mean length: 4.3089431
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 아파트
2nd row: 아파트
3rd row: 아파트
4th row: 아파트
5th row: 아파트

Common Values

Value | Count | Frequency (%)
아파트 | 140 | 56.9%
도시형생활주택 | 72 | 29.3%
주상복합 | 32 | 13.0%
연립주택 | 2 | 0.8%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: bar plot of the common values of 공동주택구분; image not preserved]

Interactions

[Figure: interaction plot; image not preserved]

Correlations

 | 세대수 | 공동주택구분
세대수 | 1.000 | 0.539
공동주택구분 | 0.539 | 1.000

 | 세대수 | 공동주택구분
세대수 | 1.000 | 0.399
공동주택구분 | 0.399 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

# | 공동주택명 | 소재지지번주소 | 소재지도로명주소 | 세대수 | 사용승인일 | 공동주택구분
0 | 원자력연료사원아파트 | 대전광역시 유성구 도룡동 392-2 | 대전광역시 유성구 대덕대로577번길 51, (도룡동, 한전원전사택) | 81 | 1986-04-16 | 아파트
1 | 우성아파트 | 대전광역시 유성구 도룡동 383-3 | 대전광역시 유성구 문지로 22, (도룡동, 우성아파트) | 55 | 1989-06-13 | 아파트
2 | 연구원 현대아파트 | 대전광역시 유성구 도룡동 431-6 | 대전광역시 유성구 대덕대로541번길 68, (도룡동, 대덕연구원현대아파트) | 150 | 1989-10-12 | 아파트
3 | 과기원교수아파트 | 대전광역시 유성구 도룡동 383-2 | 대전광역시 유성구 문지로 14, (도룡동, 카이스트교수아파트) | 76 | 1990-09-18 | 아파트
4 | 삼정하이츠아파트 | 대전광역시 유성구 구암동 600-2 | 대전광역시 유성구 유성대로668번길 29, (구암동, 삼정하이츠) | 150 | 1991-12-17 | 아파트
5 | 한울아파트 | 대전광역시 유성구 신성동 160-1 | 대전광역시 유성구 가정로 43, (신성동, 한울아파트) | 814 | 1992-12-19 | 아파트
6 | 한빛아파트 | 대전광역시 유성구 어은동 99 | 대전광역시 유성구 어은로 57, (어은동, 한빛아파트) | 3144 | 1992-12-19 | 아파트
7 | 럭키하나아파트 | 대전광역시 유성구 신성동 153 | 대전광역시 유성구 가정로 63, (신성동, 럭키하나아파트) | 720 | 1993-01-15 | 아파트
8 | 대림두레아파트 | 대전광역시 유성구 신성동 152-1 | 대전광역시 유성구 가정로 65, (신성동, 대림두레아파트) | 840 | 1993-03-27 | 아파트
9 | 우성햇살아파트 | 대전광역시 유성구 구암동 609-2 | 대전광역시 유성구 계룡로60번길 86, (구암동, 우성햇살아파트) | 136 | 1993-04-24 | 아파트

Last rows

# | 공동주택명 | 소재지지번주소 | 소재지도로명주소 | 세대수 | 사용승인일 | 공동주택구분
236 | 대전아이파크시티 2단지 | 대전광역시 유성구 상대동 580 | 대전광역시 유성구 상대복용로29번길 51, (상대동, 대전아이파크시티2단지) | 1306 | 2021-10-29 | 아파트
237 | 리버스토리 | 대전광역시 유성구 궁동 495-1 | 대전광역시 유성구 대학로76번길 65, (궁동, 리버스토리) | 76 | 2022-02-10 | 도시형생활주택
238 | 대광로제비앙 | 대전광역시 유성구 봉산동 1023 | 대전광역시 유성구 와룡로 206, (봉산동) | 816 | 2022-03-30 | 아파트
239 | 레자미탐앤탐 | 대전광역시 유성구 봉명동 621-2 | 대전광역시 유성구 문화원로 94, (봉명동, 레자미탐앤탐) | 156 | 2022-05-16 | 도시형생활주택
240 | 학하리슈빌포레 | 대전광역시 유성구 학하동 787 | 대전광역시 유성구 학하남로90번길 76, (학하동, 학하리슈빌포레) | 634 | 2022-08-05 | 아파트
241 | 서한이다음1단지 | 대전광역시 유성구 둔곡동 430 | 대전광역시 유성구 과학성장로 77, (둔곡동, 서한이다음1단지) | 816 | 2022-09-30 | 아파트
242 | 서한이다음2단지 | 대전광역시 유성구 둔곡동 432 | 대전광역시 유성구 과학성장로 33, (둔곡동, 서한이다음2단지) | 685 | 2022-09-30 | 아파트
243 | 둔곡우미린 | 대전광역시 유성구 둔곡동 431 | 대전광역시 유성구 과학성장로 80, (둔곡동, 우미린) | 760 | 2022-09-26 | 아파트
244 | 호반써밋유성그랜드파크1단지 | 대전광역시 유성구 용산동 371-2번지 일원 | 대전광역시 유성구 용성로 20, (용산동, 호반써밋그랜드파크1단지) | 1059 | 2023-03-31 | 아파트
245 | 호반써밋유성그랜드파크3단지 | 대전광역시 유성구 용산동 390번지 일원 | 대전광역시 유성구 용산1로 83, (용산동, 호반써밋그랜드파크3단지) | 688 | 2023-03-31 | 아파트