Overview

Dataset statistics

Number of variables: 6
Number of observations: 123
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.0 KiB
Average record size in memory: 50.1 B

Variable types

Text: 2
Categorical: 2
Numeric: 1
DateTime: 1

Dataset

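Description: Released in response to a public data request on the status of one-room (studio) units and officetels in Donghae-si, Gangwon Special Self-Governing Province. For multi-household houses and officetels in Donghae-si, it provides the lot address (대지위치주소), road-name address (도로명주소), main use code name (주용도코드명), other use details (기타용도내용), number of households (세대수), and use approval date (사용승인일).
Author: Donghae-si, Gangwon Special Self-Governing Province (강원특별자치도 동해시)
URL: https://www.data.go.kr/data/15127165/fileData.do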

Alerts

주용도코드명 is highly overall correlated with 기타용도내용 (High correlation)
기타용도내용 is highly overall correlated with 주용도코드명 (High correlation)
주용도코드명 is highly imbalanced (91.4%) (Imbalance)
대지위치주소 has unique values (Unique)
도로명주소 has unique values (Unique)
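
The 91.4% imbalance reported for 주용도코드명 is consistent with a score of one minus the normalized Shannon entropy of the class counts. The report does not state its formula, so that definition is an assumption here, and the function name `imbalance_score` is purely illustrative. A minimal sketch using the class counts listed for this variable (121, 1, 1):

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts.

    0.0 means perfectly balanced classes; values close to 1.0 mean that a
    single class dominates. This definition is an assumption, not a formula
    quoted from the report.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(n_classes)


# Class counts of 주용도코드명 as listed in the report.
counts = [121, 1, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under this assumed definition the three counts give a score that rounds to the 91.4% shown in the alert.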

Reproduction

Analysis started: 2024-03-16 04:12:11.158916
Analysis finished: 2024-03-16 04:12:12.014903
Duration: 0.86 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대지위치주소 (lot address)
Text

UNIQUE 

Distinct: 123
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 26
Median length: 25
Mean length: 24.520325
Min length: 20

Characters and Unicode

Total characters: 3016
Distinct characters: 45
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
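
As a rough illustration of these character properties, the sketch below classifies every character of the first sample value of 대지위치주소 by its Unicode general category using Python's standard `unicodedata` module. The category codes map to the names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, and `Pd` is Dash Punctuation. This is not necessarily how the profiling tool computes its counts.

```python
import unicodedata
from collections import Counter

# First sample value of 대지위치주소, copied from the report.
address = "강원특별자치도 동해시 천곡동 1012"

# Map each character to its two-letter Unicode general category and tally them.
category_counts = Counter(unicodedata.category(ch) for ch in address)

print(len(address))
print(dict(category_counts))
```

This value has 20 characters, which matches the minimum length reported for the column: 13 Hangul letters, 4 digits, and 3 spaces.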

Unique

Unique: 123
Unique (%): 100.0%

Sample

1st row: 강원특별자치도 동해시 천곡동 1012
2nd row: 강원특별자치도 동해시 묵호진동 0002-0002
3rd row: 강원특별자치도 동해시 천곡동 1086-0006
4th row: 강원특별자치도 동해시 송정동 0851-0008
5th row: 강원특별자치도 동해시 동회동 0370-0003
Value | Count | Frequency (%)
강원특별자치도 | 123 | 25.0%
동해시 | 123 | 25.0%
천곡동 | 42 | 8.5%
효가동 | 18 | 3.7%
구미동 | 17 | 3.5%
지흥동 | 17 | 3.5%
망상동 | 8 | 1.6%
송정동 | 7 | 1.4%
어달동 | 4 | 0.8%
발한동 | 3 | 0.6%
Other values (127) | 130 | 26.4%
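
The word counts above appear to come from splitting each address on whitespace: 123 addresses with four tokens each give 492 tokens, and 123 / 492 is the 25.0% shown for 강원특별자치도. A minimal sketch of that counting, applied only to the five sample rows listed earlier (the whitespace tokenization is an assumption about the tool's behavior):

```python
from collections import Counter

# The five sample values of 대지위치주소 shown in the report.
sample_rows = [
    "강원특별자치도 동해시 천곡동 1012",
    "강원특별자치도 동해시 묵호진동 0002-0002",
    "강원특별자치도 동해시 천곡동 1086-0006",
    "강원특별자치도 동해시 송정동 0851-0008",
    "강원특별자치도 동해시 동회동 0370-0003",
]

# Split every row on whitespace and count each resulting token.
word_counts = Counter(word for row in sample_rows for word in row.split())
total_words = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(word, count, f"{count / total_words:.1%}")
```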

Most occurring characters

ValueCountFrequency (%)
0 453
15.0%
369
 
12.2%
248
 
8.2%
123
 
4.1%
123
 
4.1%
123
 
4.1%
123
 
4.1%
123
 
4.1%
123
 
4.1%
123
 
4.1%
Other values (35) 1085
36.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1600 | 53.1%
Decimal Number | 936 | 31.0%
Space Separator | 369 | 12.2%
Dash Punctuation | 111 | 3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
248
15.5%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
Other values (23) 245
15.3%
Decimal Number
ValueCountFrequency (%)
0 453
48.4%
1 108
 
11.5%
6 60
 
6.4%
4 59
 
6.3%
2 55
 
5.9%
8 51
 
5.4%
3 47
 
5.0%
9 44
 
4.7%
5 36
 
3.8%
7 23
 
2.5%
Space Separator
ValueCountFrequency (%)
369
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 111
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1600
53.1%
Common 1416
46.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
248
15.5%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
Other values (23) 245
15.3%
Common
ValueCountFrequency (%)
0 453
32.0%
369
26.1%
- 111
 
7.8%
1 108
 
7.6%
6 60
 
4.2%
4 59
 
4.2%
2 55
 
3.9%
8 51
 
3.6%
3 47
 
3.3%
9 44
 
3.1%
Other values (2) 59
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1600
53.1%
ASCII 1416
46.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 453
32.0%
369
26.1%
- 111
 
7.8%
1 108
 
7.6%
6 60
 
4.2%
4 59
 
4.2%
2 55
 
3.9%
8 51
 
3.6%
3 47
 
3.3%
9 44
 
3.1%
Other values (2) 59
 
4.2%
Hangul
ValueCountFrequency (%)
248
15.5%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
123
7.7%
Other values (23) 245
15.3%

도로명주소 (road-name address)
Text

UNIQUE 

Distinct: 123
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 23
Median length: 22
Mean length: 19.804878
Min length: 18

Characters and Unicode

Total characters: 2436
Distinct characters: 69
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 123
Unique (%): 100.0%

Sample

1st row: 강원특별자치도 동해시 동굴4길 6
2nd row: 강원특별자치도 동해시 일출로 151
3rd row: 강원특별자치도 동해시 항골1길 13
4th row: 강원특별자치도 동해시 동해항1길 27-2
5th row: 강원특별자치도 동해시 효자로 606-1
Value | Count | Frequency (%)
강원특별자치도 | 123 | 25.0%
동해시 | 123 | 25.0%
지양길 | 18 | 3.7%
효가로 | 7 | 1.4%
일출로 | 6 | 1.2%
동굴1길 | 5 | 1.0%
항골1길 | 5 | 1.0%
구미택지3길 | 5 | 1.0%
효가2길 | 5 | 1.0%
10 | 4 | 0.8%
Other values (151) | 191 | 38.8%

Most occurring characters

ValueCountFrequency (%)
369
 
15.1%
138
 
5.7%
129
 
5.3%
125
 
5.1%
124
 
5.1%
124
 
5.1%
123
 
5.0%
123
 
5.0%
123
 
5.0%
123
 
5.0%
Other values (59) 935
38.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1639 | 67.3%
Space Separator | 369 | 15.1%
Decimal Number | 367 | 15.1%
Dash Punctuation | 61 | 2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
138
 
8.4%
129
 
7.9%
125
 
7.6%
124
 
7.6%
124
 
7.6%
123
 
7.5%
123
 
7.5%
123
 
7.5%
123
 
7.5%
123
 
7.5%
Other values (47) 384
23.4%
Decimal Number
ValueCountFrequency (%)
1 84
22.9%
2 61
16.6%
3 45
12.3%
4 35
9.5%
5 31
 
8.4%
7 31
 
8.4%
8 23
 
6.3%
6 22
 
6.0%
0 21
 
5.7%
9 14
 
3.8%
Space Separator
ValueCountFrequency (%)
369
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 61
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1639
67.3%
Common 797
32.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
138
 
8.4%
129
 
7.9%
125
 
7.6%
124
 
7.6%
124
 
7.6%
123
 
7.5%
123
 
7.5%
123
 
7.5%
123
 
7.5%
123
 
7.5%
Other values (47) 384
23.4%
Common
ValueCountFrequency (%)
369
46.3%
1 84
 
10.5%
- 61
 
7.7%
2 61
 
7.7%
3 45
 
5.6%
4 35
 
4.4%
5 31
 
3.9%
7 31
 
3.9%
8 23
 
2.9%
6 22
 
2.8%
Other values (2) 35
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1639
67.3%
ASCII 797
32.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
369
46.3%
1 84
 
10.5%
- 61
 
7.7%
2 61
 
7.7%
3 45
 
5.6%
4 35
 
4.4%
5 31
 
3.9%
7 31
 
3.9%
8 23
 
2.9%
6 22
 
2.8%
Other values (2) 35
 
4.4%
Hangul
ValueCountFrequency (%)
138
 
8.4%
129
 
7.9%
125
 
7.6%
124
 
7.6%
124
 
7.6%
123
 
7.5%
123
 
7.5%
123
 
7.5%
123
 
7.5%
123
 
7.5%
Other values (47) 384
23.4%

주용도코드명 (main use code name)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
단독주택: 121
공동주택: 1
제2종근린생활시설: 1

Length

Max length: 9
Median length: 4
Mean length: 4.0406504
Min length: 4

Unique

Unique: 2
Unique (%): 1.6%

Sample

1st row: 공동주택
2nd row: 제2종근린생활시설
3rd row: 단독주택
4th row: 단독주택
5th row: 단독주택

Common Values

Value | Count | Frequency (%)
단독주택 | 121 | 98.4%
공동주택 | 1 | 0.8%
제2종근린생활시설 | 1 | 0.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
단독주택 | 121 | 98.4%
공동주택 | 1 | 0.8%
제2종근린생활시설 | 1 | 0.8%

기타용도내용 (other use details)
Categorical

HIGH CORRELATION 

Distinct: 36
Distinct (%): 29.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
다가구주택: 43
단독주택(다가구주택): 18
단독주택(다가구): 14
근린생활시설, 다가구주택: 9
다가구주택, 근린생활시설: 3
Other values (31): 36

Length

Max length: 26
Median length: 21
Mean length: 9.4471545
Min length: 5

Unique

Unique: 27
Unique (%): 22.0%

Sample

1st row: 근린생활시설,아파트,업무시설
2nd row: 근린생활시설, 업무시설, 연립주택, 교육연구시설
3rd row: 다세대주택
4th row: 다가구주택
5th row: 근린생활시설, 주택(다가구주택)

Common Values

Value | Count | Frequency (%)
다가구주택 | 43 | 35.0%
단독주택(다가구주택) | 18 | 14.6%
단독주택(다가구) | 14 | 11.4%
근린생활시설, 다가구주택 | 9 | 7.3%
다가구주택, 근린생활시설 | 3 | 2.4%
다세대주택 | 3 | 2.4%
단독주택(다가구), 근린생활시설 | 2 | 1.6%
제2종근린생활시설,다가구주택 | 2 | 1.6%
근린생활시설,다가구주택 | 2 | 1.6%
다가구용단독주택 | 1 | 0.8%
Other values (26) | 26 | 21.1%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
다가구주택 | 58 | 37.9%
근린생활시설 | 20 | 13.1%
단독주택(다가구주택 | 18 | 11.8%
단독주택(다가구 | 16 | 10.5%
다세대주택 | 4 | 2.6%
주택 | 3 | 2.0%
다가구 | 2 | 1.3%
근린생활시설,다가구주택 | 2 | 1.3%
제2종근린생활시설,다가구주택 | 2 | 1.3%
다가구및근린생활시설 | 1 | 0.7%
Other values (27) | 27 | 17.6%

세대수 (number of households)
Real number (ℝ)

Distinct: 15
Distinct (%): 12.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.4065041
Minimum: 5
Maximum: 19
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 5
5-th percentile: 5
Q1: 6
Median: 7
Q3: 12
95-th percentile: 18
Maximum: 19
Range: 14
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 4.3904468
Coefficient of variation (CV): 0.46674585
Kurtosis: -0.6391762
Mean: 9.4065041
Median Absolute Deviation (MAD): 2
Skewness: 0.82936166
Sum: 1157
Variance: 19.276023
Monotonicity: Not monotonic
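
Several of these figures can be cross-checked from the value counts reported for 세대수. The sketch below rebuilds the 123 observations from those counts and recomputes the mean, median, quartiles, sample variance, standard deviation, CV, and MAD with Python's standard `statistics` module. It assumes linear-interpolation quantiles (`method="inclusive"`) and a sample (n - 1) variance, which is what the reported numbers are consistent with. Skewness and kurtosis are left out.

```python
import statistics

# Value -> count for 세대수, assembled from the report's frequency tables.
frequency = {5: 17, 6: 39, 7: 7, 8: 3, 9: 5, 10: 11, 11: 5, 12: 6,
             13: 4, 14: 4, 15: 5, 16: 3, 17: 4, 18: 6, 19: 4}

# Expand the counts back into one sorted list of observations.
values = sorted(v for v, n in frequency.items() for _ in range(n))

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation

# Quartiles with linear interpolation between the two closest ranks.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(len(values), sum(values))
print(round(mean, 7), median, q1, q3, q3 - q1)
print(round(variance, 6), round(std_dev, 7), round(cv, 8), mad)
```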
Histogram with fixed size bins (bins=15)
Common values

Value | Count | Frequency (%)
6 | 39 | 31.7%
5 | 17 | 13.8%
10 | 11 | 8.9%
7 | 7 | 5.7%
18 | 6 | 4.9%
12 | 6 | 4.9%
11 | 5 | 4.1%
9 | 5 | 4.1%
15 | 5 | 4.1%
14 | 4 | 3.3%
Other values (5) | 18 | 14.6%

Minimum 10 values

Value | Count | Frequency (%)
5 | 17 | 13.8%
6 | 39 | 31.7%
7 | 7 | 5.7%
8 | 3 | 2.4%
9 | 5 | 4.1%
10 | 11 | 8.9%
11 | 5 | 4.1%
12 | 6 | 4.9%
13 | 4 | 3.3%
14 | 4 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
19 | 4 | 3.3%
18 | 6 | 4.9%
17 | 4 | 3.3%
16 | 3 | 2.4%
15 | 5 | 4.1%
14 | 4 | 3.3%
13 | 4 | 3.3%
12 | 6 | 4.9%
11 | 5 | 4.1%
10 | 11 | 8.9%

사용승인일자 (use approval date)
DateTime

Distinct: 117
Distinct (%): 95.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
Minimum: 1977-11-04 00:00:00
Maximum: 2022-12-01 00:00:00
Histogram with fixed size bins (bins=50)

Interactions


Correlations

 | 주용도코드명 | 기타용도내용 | 세대수
주용도코드명 | 1.000 | 1.000 | 0.470
기타용도내용 | 1.000 | 1.000 | 0.689
세대수 | 0.470 | 0.689 | 1.000

 | 주용도코드명 | 기타용도내용
주용도코드명 | 1.000 | 0.851
기타용도내용 | 0.851 | 1.000

 | 세대수 | 주용도코드명 | 기타용도내용
세대수 | 1.000 | 0.310 | 0.234
주용도코드명 | 0.310 | 1.000 | 0.851
기타용도내용 | 0.234 | 0.851 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 대지위치주소 | 도로명주소 | 주용도코드명 | 기타용도내용 | 세대수 | 사용승인일자
0 | 강원특별자치도 동해시 천곡동 1012 | 강원특별자치도 동해시 동굴4길 6 | 공동주택 | 근린생활시설,아파트,업무시설 | 14 | 1997-03-12
1 | 강원특별자치도 동해시 묵호진동 0002-0002 | 강원특별자치도 동해시 일출로 151 | 제2종근린생활시설 | 근린생활시설, 업무시설, 연립주택, 교육연구시설 | 18 | 1998-06-27
2 | 강원특별자치도 동해시 천곡동 1086-0006 | 강원특별자치도 동해시 항골1길 13 | 단독주택 | 다세대주택 | 6 | 1989-12-02
3 | 강원특별자치도 동해시 송정동 0851-0008 | 강원특별자치도 동해시 동해항1길 27-2 | 단독주택 | 다가구주택 | 6 | 2002-10-29
4 | 강원특별자치도 동해시 동회동 0370-0003 | 강원특별자치도 동해시 효자로 606-1 | 단독주택 | 근린생활시설, 주택(다가구주택) | 18 | 1994-12-29
5 | 강원특별자치도 동해시 천곡동 0376-0005 | 강원특별자치도 동해시 한섬로 28-5 | 단독주택 | 다가구주택 | 10 | 2004-01-08
6 | 강원특별자치도 동해시 천곡동 0967-0021 | 강원특별자치도 동해시 감추2길 17 | 단독주택 | 단독주택(다가구주택) | 11 | 2017-01-19
7 | 강원특별자치도 동해시 지흥동 0139-0003 | 강원특별자치도 동해시 지양길 80-4 | 단독주택 | 다가구주택 | 6 | 1996-09-21
8 | 강원특별자치도 동해시 지흥동 0088-0002 | 강원특별자치도 동해시 지양길 55 | 단독주택 | 다가구주택(19가구) | 19 | 1996-02-07
9 | 강원특별자치도 동해시 지흥동 0138-0004 | 강원특별자치도 동해시 지양길 80-1 | 단독주택 | 근린생활시설, 다가구주택 | 6 | 1996-04-26

Last rows

Index | 대지위치주소 | 도로명주소 | 주용도코드명 | 기타용도내용 | 세대수 | 사용승인일자
113 | 강원특별자치도 동해시 천곡동 0805-0005 | 강원특별자치도 동해시 천곡1길 19 | 단독주택 | 근린생활시설,다가구주택 | 9 | 2003-12-05
114 | 강원특별자치도 동해시 효가동 0397-0002 | 강원특별자치도 동해시 효가로 24-2 | 단독주택 | 단독주택(다가구) | 6 | 1991-12-03
115 | 강원특별자치도 동해시 효가동 0244-0012 | 강원특별자치도 동해시 효가로 6-4 | 단독주택 | 다가구주택 | 6 | 1999-09-08
116 | 강원특별자치도 동해시 구미동 0581-0006 | 강원특별자치도 동해시 구미택지1길 86 | 단독주택 | 단독주택(다가구), 근린생활시설 | 5 | 1994-08-08
117 | 강원특별자치도 동해시 송정동 0663-0002 | 강원특별자치도 동해시 솔밭2길 7-2 | 단독주택 | 근린생활및다가구 | 16 | 2002-08-07
118 | 강원특별자치도 동해시 송정동 0848-0020 | 강원특별자치도 동해시 송정시장길 6-1 | 단독주택 | 단독주택(다가구주택) | 6 | 2003-06-11
119 | 강원특별자치도 동해시 송정동 0615-0015 | 강원특별자치도 동해시 솔밭1길 7-1 | 단독주택 | 단독(다가구)주택 | 6 | 1998-07-06
120 | 강원특별자치도 동해시 구미동 0596-0007 | 강원특별자치도 동해시 구미택지4길 7 | 단독주택 | 다가구주택 | 17 | 1998-01-13
121 | 강원특별자치도 동해시 효가동 0162-0041 | 강원특별자치도 동해시 효가2길 7-5 | 단독주택 | 단독주택(다가구주택) | 6 | 2003-08-18
122 | 강원특별자치도 동해시 지흥동 0088 | 강원특별자치도 동해시 지양길 53 | 단독주택 | 근린생활시설, 다가구주택(14가구) | 14 | 1996-03-19