Overview

Dataset statistics

Number of variables6
Number of observations152
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.7%
Total size in memory7.6 KiB
Average record size in memory50.9 B

Variable types

Categorical2
Text2
Numeric2

Dataset

Description전라북도 정읍시 소재 공동주택 현황(공동주택유형, 공동주택명, 소재지도로명주소, 동수, 세대수) 등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15048109/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (0.7%) duplicate rowsDuplicates
동수 is highly overall correlated with 세대수High correlation
세대수 is highly overall correlated with 동수High correlation

Reproduction

Analysis started2023-12-12 17:27:27.761950
Analysis finished2023-12-12 17:27:28.592933
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
아파트
73 
다세대
50 
연립주택
29 

Length

Max length4
Median length3
Mean length3.1907895
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row아파트
2nd row아파트
3rd row아파트
4th row아파트
5th row아파트

Common Values

ValueCountFrequency (%)
아파트 73
48.0%
다세대 50
32.9%
연립주택 29
 
19.1%

Length

2023-12-13T02:27:28.667908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:27:28.788981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아파트 73
48.0%
다세대 50
32.9%
연립주택 29
 
19.1%
Distinct140
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T02:27:29.125110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length13
Mean length6.4144737
Min length2

Characters and Unicode

Total characters975
Distinct characters175
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)88.8%

Sample

1st row시기 주공
2nd row명진 로얄
3rd row삼화맨션
4th row시기 현대
5th row성원 1차(사원아파트)
ValueCountFrequency (%)
상동 11
 
4.4%
다세대 9
 
3.6%
2차 9
 
3.6%
1차 8
 
3.2%
주공 6
 
2.4%
삼성쉐르빌 6
 
2.4%
a동 5
 
2.0%
b동 5
 
2.0%
3차 5
 
2.0%
수성 5
 
2.0%
Other values (138) 179
72.2%
2023-12-13T02:27:29.637429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98
 
10.1%
38
 
3.9%
34
 
3.5%
32
 
3.3%
28
 
2.9%
28
 
2.9%
1 27
 
2.8%
23
 
2.4%
21
 
2.2%
20
 
2.1%
Other values (165) 626
64.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 747
76.6%
Space Separator 98
 
10.1%
Decimal Number 73
 
7.5%
Uppercase Letter 22
 
2.3%
Close Punctuation 12
 
1.2%
Open Punctuation 12
 
1.2%
Other Punctuation 5
 
0.5%
Math Symbol 4
 
0.4%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
5.1%
34
 
4.6%
32
 
4.3%
28
 
3.7%
28
 
3.7%
23
 
3.1%
21
 
2.8%
20
 
2.7%
17
 
2.3%
15
 
2.0%
Other values (142) 491
65.7%
Decimal Number
ValueCountFrequency (%)
1 27
37.0%
2 19
26.0%
3 10
 
13.7%
0 10
 
13.7%
4 2
 
2.7%
5 2
 
2.7%
7 1
 
1.4%
6 1
 
1.4%
8 1
 
1.4%
Uppercase Letter
ValueCountFrequency (%)
A 6
27.3%
B 6
27.3%
K 3
13.6%
J 3
13.6%
C 2
 
9.1%
S 1
 
4.5%
L 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
& 4
80.0%
. 1
 
20.0%
Space Separator
ValueCountFrequency (%)
98
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 747
76.6%
Common 206
 
21.1%
Latin 22
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
5.1%
34
 
4.6%
32
 
4.3%
28
 
3.7%
28
 
3.7%
23
 
3.1%
21
 
2.8%
20
 
2.7%
17
 
2.3%
15
 
2.0%
Other values (142) 491
65.7%
Common
ValueCountFrequency (%)
98
47.6%
1 27
 
13.1%
2 19
 
9.2%
) 12
 
5.8%
( 12
 
5.8%
3 10
 
4.9%
0 10
 
4.9%
& 4
 
1.9%
~ 4
 
1.9%
- 2
 
1.0%
Other values (6) 8
 
3.9%
Latin
ValueCountFrequency (%)
A 6
27.3%
B 6
27.3%
K 3
13.6%
J 3
13.6%
C 2
 
9.1%
S 1
 
4.5%
L 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 747
76.6%
ASCII 228
 
23.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
98
43.0%
1 27
 
11.8%
2 19
 
8.3%
) 12
 
5.3%
( 12
 
5.3%
3 10
 
4.4%
0 10
 
4.4%
A 6
 
2.6%
B 6
 
2.6%
& 4
 
1.8%
Other values (13) 24
 
10.5%
Hangul
ValueCountFrequency (%)
38
 
5.1%
34
 
4.6%
32
 
4.3%
28
 
3.7%
28
 
3.7%
23
 
3.1%
21
 
2.8%
20
 
2.7%
17
 
2.3%
15
 
2.0%
Other values (142) 491
65.7%
Distinct140
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T02:27:30.181098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length17.460526
Min length15

Characters and Unicode

Total characters2654
Distinct characters86
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)84.2%

Sample

1st row전라북도 정읍시 천변로 350
2nd row전라북도 정읍시 충정로 50-1
3rd row전라북도 정읍시 천변로 128
4th row전라북도 정읍시 초산로 3-1
5th row전라북도 정읍시 정읍북로 631-26
ValueCountFrequency (%)
전라북도 152
24.8%
정읍시 152
24.8%
학산로 10
 
1.6%
충정로 9
 
1.5%
천변로 8
 
1.3%
상동중앙로 5
 
0.8%
수성4로 5
 
0.8%
상신경2길 4
 
0.7%
연지1길 4
 
0.7%
연지6길 4
 
0.7%
Other values (195) 260
42.4%
2023-12-13T02:27:31.246321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
462
17.4%
169
 
6.4%
160
 
6.0%
156
 
5.9%
155
 
5.8%
154
 
5.8%
152
 
5.7%
152
 
5.7%
1 132
 
5.0%
- 83
 
3.1%
Other values (76) 879
33.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1592
60.0%
Decimal Number 515
 
19.4%
Space Separator 462
 
17.4%
Dash Punctuation 83
 
3.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
169
10.6%
160
10.1%
156
9.8%
155
9.7%
154
9.7%
152
9.5%
152
9.5%
77
 
4.8%
74
 
4.6%
28
 
1.8%
Other values (63) 315
19.8%
Decimal Number
ValueCountFrequency (%)
1 132
25.6%
3 66
12.8%
2 61
11.8%
4 50
 
9.7%
5 47
 
9.1%
6 47
 
9.1%
9 34
 
6.6%
7 31
 
6.0%
8 26
 
5.0%
0 21
 
4.1%
Space Separator
ValueCountFrequency (%)
462
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1592
60.0%
Common 1062
40.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
169
10.6%
160
10.1%
156
9.8%
155
9.7%
154
9.7%
152
9.5%
152
9.5%
77
 
4.8%
74
 
4.6%
28
 
1.8%
Other values (63) 315
19.8%
Common
ValueCountFrequency (%)
462
43.5%
1 132
 
12.4%
- 83
 
7.8%
3 66
 
6.2%
2 61
 
5.7%
4 50
 
4.7%
5 47
 
4.4%
6 47
 
4.4%
9 34
 
3.2%
7 31
 
2.9%
Other values (3) 49
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1592
60.0%
ASCII 1062
40.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
462
43.5%
1 132
 
12.4%
- 83
 
7.8%
3 66
 
6.2%
2 61
 
5.7%
4 50
 
4.7%
5 47
 
4.4%
6 47
 
4.4%
9 34
 
3.2%
7 31
 
2.9%
Other values (3) 49
 
4.6%
Hangul
ValueCountFrequency (%)
169
10.6%
160
10.1%
156
9.8%
155
9.7%
154
9.7%
152
9.5%
152
9.5%
77
 
4.8%
74
 
4.6%
28
 
1.8%
Other values (63) 315
19.8%

동수
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3289474
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T02:27:31.434852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile7
Maximum12
Range11
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.0676516
Coefficient of variation (CV)0.88780519
Kurtosis4.6983113
Mean2.3289474
Median Absolute Deviation (MAD)0
Skewness2.078272
Sum354
Variance4.275183
MonotonicityNot monotonic
2023-12-13T02:27:31.597420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 81
53.3%
2 25
 
16.4%
3 15
 
9.9%
4 13
 
8.6%
6 5
 
3.3%
5 4
 
2.6%
7 4
 
2.6%
9 2
 
1.3%
12 1
 
0.7%
10 1
 
0.7%
ValueCountFrequency (%)
1 81
53.3%
2 25
 
16.4%
3 15
 
9.9%
4 13
 
8.6%
5 4
 
2.6%
6 5
 
3.3%
7 4
 
2.6%
8 1
 
0.7%
9 2
 
1.3%
10 1
 
0.7%
ValueCountFrequency (%)
12 1
 
0.7%
10 1
 
0.7%
9 2
 
1.3%
8 1
 
0.7%
7 4
 
2.6%
6 5
 
3.3%
5 4
 
2.6%
4 13
8.6%
3 15
9.9%
2 25
16.4%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct77
Distinct (%)50.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean140.79605
Minimum2
Maximum1277
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T02:27:31.769396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4
Q19
median20
Q3187.25
95-th percentile545.1
Maximum1277
Range1275
Interquartile range (IQR)178.25

Descriptive statistics

Standard deviation230.86412
Coefficient of variation (CV)1.6397059
Kurtosis7.5634775
Mean140.79605
Median Absolute Deviation (MAD)16
Skewness2.5092728
Sum21401
Variance53298.243
MonotonicityNot monotonic
2023-12-13T02:27:32.001324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 16
 
10.5%
16 12
 
7.9%
6 8
 
5.3%
18 8
 
5.3%
8 6
 
3.9%
9 5
 
3.3%
20 4
 
2.6%
19 4
 
2.6%
12 4
 
2.6%
15 4
 
2.6%
Other values (67) 81
53.3%
ValueCountFrequency (%)
2 2
 
1.3%
3 1
 
0.7%
4 16
10.5%
5 2
 
1.3%
6 8
5.3%
7 1
 
0.7%
8 6
 
3.9%
9 5
 
3.3%
11 1
 
0.7%
12 4
 
2.6%
ValueCountFrequency (%)
1277 1
0.7%
1260 1
0.7%
925 1
0.7%
835 1
0.7%
821 1
0.7%
654 1
0.7%
600 1
0.7%
577 1
0.7%
519 1
0.7%
495 1
0.7%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-06-26
152 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-26
2nd row2023-06-26
3rd row2023-06-26
4th row2023-06-26
5th row2023-06-26

Common Values

ValueCountFrequency (%)
2023-06-26 152
100.0%

Length

2023-12-13T02:27:32.185435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:27:32.318782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-26 152
100.0%

Interactions

2023-12-13T02:27:28.204676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:27:28.008419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:27:28.307091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:27:28.105186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:27:32.393106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공동주택유형동수세대수
공동주택유형1.0000.3970.733
동수0.3971.0000.821
세대수0.7330.8211.000
2023-12-13T02:27:32.488332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동수세대수공동주택유형
동수1.0000.7810.253
세대수0.7811.0000.429
공동주택유형0.2530.4291.000

Missing values

2023-12-13T02:27:28.436119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:27:28.546499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공동주택유형공동주택명소재지도로명주소동수세대수데이터기준일자
0아파트시기 주공전라북도 정읍시 천변로 35094102023-06-26
1아파트명진 로얄전라북도 정읍시 충정로 50-12902023-06-26
2아파트삼화맨션전라북도 정읍시 천변로 1281482023-06-26
3아파트시기 현대전라북도 정읍시 초산로 3-143722023-06-26
4아파트성원 1차(사원아파트)전라북도 정읍시 정읍북로 631-2642002023-06-26
5아파트백조전라북도 정읍시 초산1길 5-121402023-06-26
6아파트상동 현대 1차전라북도 정읍시 하신경9길 14-2154672023-06-26
7아파트목련 1차전라북도 정읍시 충정로 531-321002023-06-26
8아파트목련 2차전라북도 정읍시 충정로 531-342462023-06-26
9아파트신흥 장미 1차전라북도 정읍시 용흥1길 15-192952023-06-26
공동주택유형공동주택명소재지도로명주소동수세대수데이터기준일자
142다세대수성 삼성쉐르빌 가동전라북도 정읍시 수성5로 45-10182023-06-26
143다세대이지스 타워팰리스(다세대)전라북도 정읍시 수성로 142162023-06-26
144다세대상동 삼성쉐르빌 2차(101~103동)전라북도 정읍시 상신경2길 253242023-06-26
145다세대상동 삼성쉐르빌 2차(104~105동)전라북도 정읍시 상신경2길 25-12162023-06-26
146다세대상동 삼성쉐르빌 2차(106~107동)전라북도 정읍시 상신경2길 25-22162023-06-26
147다세대상동 삼성쉐르빌 2차(108~110동)전라북도 정읍시 상신경2길 25-33242023-06-26
148다세대우솔타운전라북도 정읍시 수성6로 66182023-06-26
149다세대우솔타운전라북도 정읍시 중앙1길 113-32162023-06-26
150다세대덕산 타워펠리스전라북도 정읍시 우암로 47-5192023-06-26
151다세대공평동 324-5 공동주택전라북도 정읍시 충정로 513162023-06-26

Duplicate rows

Most frequently occurring

공동주택유형공동주택명소재지도로명주소동수세대수데이터기준일자# duplicates
0다세대은하연립전라북도 정읍시 문학길 6-7142023-06-262