Overview

Dataset statistics

Number of variables8
Number of observations110
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.9%
Total size in memory7.4 KiB
Average record size in memory69.2 B

Variable types

Categorical5
Text1
Numeric2

Dataset

Description횡성군 다가구주택 현황으로 횡성 관내의 다가구 주택의 주소 주용도 세부용도 지상층수 지하층수 연면적 가구수등에 대한 자료임.
Author강원특별자치도 횡성군
URLhttps://www.data.go.kr/data/15127258/fileData.do

Alerts

시군구명 has constant value ""Constant
Dataset has 1 (0.9%) duplicate rowsDuplicates
연면적 is highly overall correlated with 지상층수High correlation
주용도 is highly overall correlated with 세부용도High correlation
세부용도 is highly overall correlated with 주용도High correlation
지상층수 is highly overall correlated with 연면적High correlation
주용도 is highly imbalanced (88.8%)Imbalance

Reproduction

Analysis started2024-03-23 05:55:31.375568
Analysis finished2024-03-23 05:55:36.062027
Duration4.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1012.0 B
횡성군
110 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row횡성군
2nd row횡성군
3rd row횡성군
4th row횡성군
5th row횡성군

Common Values

ValueCountFrequency (%)
횡성군 110
100.0%

Length

2024-03-23T05:55:36.590788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T05:55:36.942838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
횡성군 110
100.0%

주소
Text

Distinct102
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2024-03-23T05:55:37.866949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length24.8
Min length21

Characters and Unicode

Total characters2728
Distinct characters78
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)86.4%

Sample

1st row강원특별자치도 횡성군 청일면 봉명리 246-2
2nd row강원특별자치도 횡성군 횡성읍 북천리 115-44
3rd row강원특별자치도 횡성군 서원면 석화리 351-8
4th row강원특별자치도 횡성군 횡성읍 읍하리 519-3
5th row강원특별자치도 횡성군 횡성읍 읍상리 32-42
ValueCountFrequency (%)
강원특별자치도 110
20.0%
횡성군 110
20.0%
횡성읍 54
 
9.8%
읍하리 27
 
4.9%
둔내면 25
 
4.5%
두원리 14
 
2.5%
읍상리 12
 
2.2%
북천리 7
 
1.3%
우천면 6
 
1.1%
청일면 6
 
1.1%
Other values (138) 179
32.5%
2024-03-23T05:55:39.990103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
440
 
16.1%
164
 
6.0%
164
 
6.0%
131
 
4.8%
116
 
4.3%
114
 
4.2%
110
 
4.0%
110
 
4.0%
110
 
4.0%
110
 
4.0%
Other values (68) 1159
42.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1766
64.7%
Space Separator 440
 
16.1%
Decimal Number 437
 
16.0%
Dash Punctuation 85
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
 
9.3%
164
 
9.3%
131
 
7.4%
116
 
6.6%
114
 
6.5%
110
 
6.2%
110
 
6.2%
110
 
6.2%
110
 
6.2%
110
 
6.2%
Other values (56) 527
29.8%
Decimal Number
ValueCountFrequency (%)
1 86
19.7%
5 62
14.2%
3 60
13.7%
6 53
12.1%
4 48
11.0%
2 43
9.8%
7 28
 
6.4%
8 22
 
5.0%
0 20
 
4.6%
9 15
 
3.4%
Space Separator
ValueCountFrequency (%)
440
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 85
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1766
64.7%
Common 962
35.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
 
9.3%
164
 
9.3%
131
 
7.4%
116
 
6.6%
114
 
6.5%
110
 
6.2%
110
 
6.2%
110
 
6.2%
110
 
6.2%
110
 
6.2%
Other values (56) 527
29.8%
Common
ValueCountFrequency (%)
440
45.7%
1 86
 
8.9%
- 85
 
8.8%
5 62
 
6.4%
3 60
 
6.2%
6 53
 
5.5%
4 48
 
5.0%
2 43
 
4.5%
7 28
 
2.9%
8 22
 
2.3%
Other values (2) 35
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1766
64.7%
ASCII 962
35.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
440
45.7%
1 86
 
8.9%
- 85
 
8.8%
5 62
 
6.4%
3 60
 
6.2%
6 53
 
5.5%
4 48
 
5.0%
2 43
 
4.5%
7 28
 
2.9%
8 22
 
2.3%
Other values (2) 35
 
3.6%
Hangul
ValueCountFrequency (%)
164
 
9.3%
164
 
9.3%
131
 
7.4%
116
 
6.6%
114
 
6.5%
110
 
6.2%
110
 
6.2%
110
 
6.2%
110
 
6.2%
110
 
6.2%
Other values (56) 527
29.8%

주용도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size1012.0 B
단독주택
107 
교육연구시설
 
1
제2종근린생활시설
 
1
제1종근린생활시설
 
1

Length

Max length9
Median length4
Mean length4.1090909
Min length4

Unique

Unique3 ?
Unique (%)2.7%

Sample

1st row단독주택
2nd row단독주택
3rd row단독주택
4th row단독주택
5th row단독주택

Common Values

ValueCountFrequency (%)
단독주택 107
97.3%
교육연구시설 1
 
0.9%
제2종근린생활시설 1
 
0.9%
제1종근린생활시설 1
 
0.9%

Length

2024-03-23T05:55:40.695381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T05:55:41.139576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단독주택 107
97.3%
교육연구시설 1
 
0.9%
제2종근린생활시설 1
 
0.9%
제1종근린생활시설 1
 
0.9%

세부용도
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Memory size1012.0 B
단독주택(다가구주택)
19 
다가구주택
15 
단독주택(다가구)
14 
근린생활시설, 다가구주택
다가구주택, 근린생활시설
 
5
Other values (37)
51 

Length

Max length38
Median length26
Mean length11.218182
Min length3

Unique

Unique26 ?
Unique (%)23.6%

Sample

1st row단독주택(다가구)
2nd row주택(다가구)
3rd row단독주택(다가구주택)
4th row단독주택,다가구
5th row다가구주택, 근린생활시설(사무실)

Common Values

ValueCountFrequency (%)
단독주택(다가구주택) 19
17.3%
다가구주택 15
 
13.6%
단독주택(다가구) 14
 
12.7%
근린생활시설, 다가구주택 6
 
5.5%
다가구주택, 근린생활시설 5
 
4.5%
다가구주택(2가구) 4
 
3.6%
단독주택(3가구) 3
 
2.7%
단독주택(다가구용) 2
 
1.8%
제1종근린생활시설, 단독주택(다가구) 2
 
1.8%
제2종근린생활시설, 단독주택(다가구) 2
 
1.8%
Other values (32) 38
34.5%

Length

2024-03-23T05:55:41.911091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
다가구주택 28
20.7%
단독주택(다가구 20
14.8%
단독주택(다가구주택 19
14.1%
근린생활시설 14
 
10.4%
다가구주택(2가구 4
 
3.0%
제2종근린생활시설 4
 
3.0%
단독주택(3가구 3
 
2.2%
3
 
2.2%
주택(2가구 2
 
1.5%
단독주택(2가구 2
 
1.5%
Other values (30) 36
26.7%

지상층수
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2
57 
3
20 
1
19 
4
14 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 57
51.8%
3 20
 
18.2%
1 19
 
17.3%
4 14
 
12.7%

Length

2024-03-23T05:55:42.460364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T05:55:42.877268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 57
51.8%
3 20
 
18.2%
1 19
 
17.3%
4 14
 
12.7%

지하층수
Categorical

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1012.0 B
0
94 
1
16 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 94
85.5%
1 16
 
14.5%

Length

2024-03-23T05:55:43.374436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T05:55:43.748705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 94
85.5%
1 16
 
14.5%

연면적
Real number (ℝ)

HIGH CORRELATION 

Distinct107
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean280.04891
Minimum28.44
Maximum728.23
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-23T05:55:44.318541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum28.44
5-th percentile84.557
Q1166.9125
median229.55
Q3353.855
95-th percentile627.8715
Maximum728.23
Range699.79
Interquartile range (IQR)186.9425

Descriptive statistics

Standard deviation160.11053
Coefficient of variation (CV)0.57172347
Kurtosis0.11311965
Mean280.04891
Median Absolute Deviation (MAD)90.425
Skewness0.91620466
Sum30805.38
Variance25635.383
MonotonicityNot monotonic
2024-03-23T05:55:44.917594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
135.2 3
 
2.7%
154.66 2
 
1.8%
226.21 1
 
0.9%
375.28 1
 
0.9%
365.21 1
 
0.9%
285.91 1
 
0.9%
492.89 1
 
0.9%
222.85 1
 
0.9%
340.66 1
 
0.9%
148.55 1
 
0.9%
Other values (97) 97
88.2%
ValueCountFrequency (%)
28.44 1
0.9%
49.5 1
0.9%
53.51 1
0.9%
66.0 1
0.9%
70.16 1
0.9%
73.46 1
0.9%
98.12 1
0.9%
109.27 1
0.9%
111.84 1
0.9%
112.02 1
0.9%
ValueCountFrequency (%)
728.23 1
0.9%
658.39 1
0.9%
657.9 1
0.9%
642.56 1
0.9%
633.23 1
0.9%
629.82 1
0.9%
625.49 1
0.9%
590.0 1
0.9%
568.69 1
0.9%
563.63 1
0.9%

가구수
Real number (ℝ)

Distinct15
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.2454545
Minimum2
Maximum19
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-23T05:55:45.445981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q12
median3
Q34
95-th percentile12.55
Maximum19
Range17
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.6402096
Coefficient of variation (CV)0.85743695
Kurtosis5.7901531
Mean4.2454545
Median Absolute Deviation (MAD)1
Skewness2.4291454
Sum467
Variance13.251126
MonotonicityNot monotonic
2024-03-23T05:55:45.951723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
2 44
40.0%
3 24
21.8%
4 17
 
15.5%
5 5
 
4.5%
7 3
 
2.7%
8 3
 
2.7%
6 3
 
2.7%
12 2
 
1.8%
13 2
 
1.8%
17 2
 
1.8%
Other values (5) 5
 
4.5%
ValueCountFrequency (%)
2 44
40.0%
3 24
21.8%
4 17
 
15.5%
5 5
 
4.5%
6 3
 
2.7%
7 3
 
2.7%
8 3
 
2.7%
9 1
 
0.9%
10 1
 
0.9%
11 1
 
0.9%
ValueCountFrequency (%)
19 1
 
0.9%
18 1
 
0.9%
17 2
1.8%
13 2
1.8%
12 2
1.8%
11 1
 
0.9%
10 1
 
0.9%
9 1
 
0.9%
8 3
2.7%
7 3
2.7%

Interactions

2024-03-23T05:55:33.224335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:55:32.077883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:55:33.668622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:55:32.855221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T05:55:46.317995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주용도세부용도지상층수지하층수연면적가구수
주용도1.0000.9680.2650.0000.5470.000
세부용도0.9681.0000.5860.4490.6810.651
지상층수0.2650.5861.0000.0000.7930.557
지하층수0.0000.4490.0001.0000.0000.000
연면적0.5470.6810.7930.0001.0000.673
가구수0.0000.6510.5570.0000.6731.000
2024-03-23T05:55:46.718757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지상층수주용도세부용도지하층수
지상층수1.0000.1050.2630.000
주용도0.1051.0000.6840.000
세부용도0.2630.6841.0000.279
지하층수0.0000.0000.2791.000
2024-03-23T05:55:47.116590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연면적가구수주용도세부용도지상층수지하층수
연면적1.0000.3960.3480.2480.5970.000
가구수0.3961.0000.0000.2370.3820.000
주용도0.3480.0001.0000.6840.1050.000
세부용도0.2480.2370.6841.0000.2630.279
지상층수0.5970.3820.1050.2631.0000.000
지하층수0.0000.0000.0000.2790.0001.000

Missing values

2024-03-23T05:55:34.242878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T05:55:35.327267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구명주소주용도세부용도지상층수지하층수연면적가구수
0횡성군강원특별자치도 횡성군 청일면 봉명리 246-2단독주택단독주택(다가구)20226.214
1횡성군강원특별자치도 횡성군 횡성읍 북천리 115-44단독주택주택(다가구)21290.522
2횡성군강원특별자치도 횡성군 서원면 석화리 351-8단독주택단독주택(다가구주택)20212.082
3횡성군강원특별자치도 횡성군 횡성읍 읍하리 519-3단독주택단독주택,다가구20184.292
4횡성군강원특별자치도 횡성군 횡성읍 읍상리 32-42단독주택다가구주택, 근린생활시설(사무실)20208.53
5횡성군강원특별자치도 횡성군 횡성읍 읍하리 495-1단독주택다가구주택21252.713
6횡성군강원특별자치도 횡성군 횡성읍 북천리 115-137단독주택근린생활시설, 주택(2가구)31437.5152
7횡성군강원특별자치도 횡성군 횡성읍 읍상리 584-9단독주택다가구주택,제1종근린생활시설40388.237
8횡성군강원특별자치도 횡성군 둔내면 자포곡리 208단독주택단독주택(다가구)1073.462
9횡성군강원특별자치도 횡성군 횡성읍 읍상리 665-7단독주택다가구주택40633.2312
시군구명주소주용도세부용도지상층수지하층수연면적가구수
100횡성군강원특별자치도 횡성군 강림면 부곡리 355-3단독주택다가구주택20273.583
101횡성군강원특별자치도 횡성군 둔내면 자포곡리 345-17단독주택다가구주택(2가구)20514.672
102횡성군강원특별자치도 횡성군 횡성읍 교항리 13단독주택단독주택(다가구,제1종근린생활시설(소매점),제2종근린생활시설(사무소)40493.929
103횡성군강원특별자치도 횡성군 횡성읍 읍상리 586-5단독주택다가구주택(2가구)20210.962
104횡성군강원특별자치도 횡성군 둔내면 삽교리 734-1단독주택단독주택(4가구용)10111.844
105횡성군강원특별자치도 횡성군 강림면 부곡리 321단독주택다가구주택30340.653
106횡성군강원특별자치도 횡성군 횡성읍 읍하리 572-9단독주택단독주택(다가구주택)40658.3917
107횡성군강원특별자치도 횡성군 갑천면 삼거리 101-1단독주택단독주택(다가구주택)30135.24
108횡성군강원특별자치도 횡성군 우천면 백달리 315단독주택다가구주택20563.6312
109횡성군강원특별자치도 횡성군 횡성읍 읍하리 507-7단독주택제1종근린생활시설, 단독주택(다가구)30343.525

Duplicate rows

Most frequently occurring

시군구명주소주용도세부용도지상층수지하층수연면적가구수# duplicates
0횡성군강원특별자치도 횡성군 둔내면 두원리 666-3단독주택단독주택(다가구2가구)20154.6622