Overview

Dataset statistics

Number of variables: 8
Number of observations: 41
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.7 KiB
Average record size in memory: 68.2 B
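
As a rough illustration of how counts like these can be derived, the sketch below computes them for a small in-memory table using only the Python standard library. The helper name `overview`, the rule for what counts as a missing cell, and the reduced three-column rows are illustrative assumptions, not part of the report or of ydata-profiling.

```python
def overview(rows):
    """Return basic dataset statistics for a list of dict rows."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    # Assumption: a cell is "missing" when it is None or an empty string.
    missing = sum(1 for r in rows for v in r.values() if v is None or v == "")
    # A row is a duplicate when an identical row appeared earlier.
    seen, duplicates = set(), 0
    for r in rows:
        key = tuple(r.items())
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)
    total_cells = n_obs * n_vars
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_pct": 100 * missing / total_cells if total_cells else 0.0,
        "duplicate_rows": duplicates,
    }

# Two rows from the report's sample, reduced to three columns for brevity.
rows = [
    {"사업명": "경북도청신도시건설사업(1단계)", "지번": "1118", "필지수": "1"},
    {"사업명": "경북도청신도시건설사업(1단계)", "지번": "1117-1", "필지수": "1"},
]
stats = overview(rows)
print(stats)
```

Because 지번 differs in every row, no row can be an exact duplicate, which is consistent with the zero duplicate rows reported here.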

Variable types

Categorical: 6
Text: 2

Dataset

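Description: Unsold-lot status by project district, published by the Gyeongsangbuk-do Development Corporation (경상북도개발공사). The data include the project name (사업명), location (소재지), lot number (지번), and related fields. As of this year, the number of lots (필지수) is additionally released.
Author: 경상북도개발공사
URL: https://www.data.go.kr/data/15044476/fileData.do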

Alerts

필지수 has constant value "1" (Constant)
사업명 is highly overall correlated with 소재지 and 3 other fields (High correlation)
소재지 is highly overall correlated with 사업명 and 3 other fields (High correlation)
중분류 is highly overall correlated with 사업명 and 3 other fields (High correlation)
대분류 is highly overall correlated with 사업명 and 3 other fields (High correlation)
소분류 is highly overall correlated with 사업명 and 3 other fields (High correlation)
사업명 is highly imbalanced (51.2%) (Imbalance)
소재지 is highly imbalanced (53.4%) (Imbalance)
대분류 is highly imbalanced (50.2%) (Imbalance)
중분류 is highly imbalanced (51.2%) (Imbalance)
소분류 is highly imbalanced (53.0%) (Imbalance)
지번 has unique values (Unique)
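
The imbalance percentages above are consistent with one common definition of class imbalance: one minus the Shannon entropy of the category frequencies, normalized by the maximum entropy log2(k) for k categories. The sketch below applies that definition to the 사업명 counts (32, 7, 1, 1) and the 대분류 counts (31, 3, 2, 2, 1, 1, 1) listed in this report. Whether the profiling tool computes its score exactly this way is an assumption, and the function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 for a uniform split, near 1 for one dominant class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen here for a single class (assumption)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

project_counts = [32, 7, 1, 1]            # 사업명 value counts from this report
category_counts = [31, 3, 2, 2, 1, 1, 1]  # 대분류 value counts from this report

print(f"{imbalance_score(project_counts):.1%}")   # 51.2%
print(f"{imbalance_score(category_counts):.1%}")  # 50.2%
```

Both results match the alert values for 사업명 and 대분류, which supports (but does not prove) the assumed definition.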

Reproduction

Analysis started: 2023-12-12 18:55:23.045523
Analysis finished: 2023-12-12 18:55:24.107434
Duration: 1.06 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
경산1-1일반산업단지 조성사업: 32
경북도청신도시건설사업(1단계): 7
포항초곡지구 도시개발사업: 1
구미구평2지구 택지개발사업: 1

Length

Max length: 16
Median length: 16
Mean length: 15.878049
Min length: 13

Unique

Unique: 2
Unique (%): 4.9%

Sample

1st row: 경북도청신도시건설사업(1단계)
2nd row: 경북도청신도시건설사업(1단계)
3rd row: 경북도청신도시건설사업(1단계)
4th row: 경북도청신도시건설사업(1단계)
5th row: 경북도청신도시건설사업(1단계)
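
The distinct, unique, and length figures for 사업명 can be reproduced from its value counts. The sketch below is a minimal standard-library version: "distinct" is taken to mean the number of different values, "unique" the number of values that occur exactly once, and length the number of Unicode code points per value. These readings are assumptions, although they agree with the numbers reported for this variable.

```python
from collections import Counter
from statistics import mean, median

# Value counts for 사업명 as listed in this report.
value_counts = {
    "경산1-1일반산업단지 조성사업": 32,
    "경북도청신도시건설사업(1단계)": 7,
    "포항초곡지구 도시개발사업": 1,
    "구미구평2지구 택지개발사업": 1,
}
# Expand the counts back into a column of 41 values.
column = [value for value, n in value_counts.items() for _ in range(n)]

counts = Counter(column)
distinct = len(counts)                              # number of different values
unique = sum(1 for n in counts.values() if n == 1)  # values seen exactly once
lengths = [len(v) for v in column]                  # length in code points

print(distinct, unique)
print(max(lengths), median(lengths), round(mean(lengths), 6), min(lengths))
```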

Common Values

Value | Count | Frequency (%)
경산1-1일반산업단지 조성사업 | 32 | 78.0%
경북도청신도시건설사업(1단계) | 7 | 17.1%
포항초곡지구 도시개발사업 | 1 | 2.4%
구미구평2지구 택지개발사업 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경산1-1일반산업단지 | 32 | 42.7%
조성사업 | 32 | 42.7%
경북도청신도시건설사업(1단계 | 7 | 9.3%
포항초곡지구 | 1 | 1.3%
도시개발사업 | 1 | 1.3%
구미구평2지구 | 1 | 1.3%
택지개발사업 | 1 | 1.3%
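
The table above counts whitespace-separated words rather than whole values, which is why the 41 rows yield 75 tokens. A minimal sketch of such a count follows. Stripping leading and trailing ASCII punctuation from each token is an assumption chosen because it reproduces the "경북도청신도시건설사업(1단계" entry, whose closing parenthesis is missing; the tool's actual tokenizer may differ.

```python
import string
from collections import Counter

# Value counts for 사업명 as listed in this report.
value_counts = {
    "경산1-1일반산업단지 조성사업": 32,
    "경북도청신도시건설사업(1단계)": 7,
    "포항초곡지구 도시개발사업": 1,
    "구미구평2지구 택지개발사업": 1,
}

words = Counter()
for value, n in value_counts.items():
    for token in value.split():
        # Assumption: drop punctuation at the token edges only.
        words[token.strip(string.punctuation)] += n

total = sum(words.values())
for word, n in words.most_common():
    print(f"{word} | {n} | {n / total:.1%}")
```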

소재지
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
경산시 진량읍 신상리: 32
예천군 호명면 산합리: 2
안동시 풍천면 가곡리: 2
안동시 풍천면 갈전리: 2
안동시 풍천면 도양리: 1
Other values (2): 2

Length

Max length: 14
Median length: 11
Mean length: 10.97561
Min length: 7

Unique

Unique: 3
Unique (%): 7.3%

Sample

1st row: 예천군 호명면 산합리
2nd row: 예천군 호명면 산합리
3rd row: 안동시 풍천면 도양리
4th row: 안동시 풍천면 가곡리
5th row: 안동시 풍천면 갈전리

Common Values

Value | Count | Frequency (%)
경산시 진량읍 신상리 | 32 | 78.0%
예천군 호명면 산합리 | 2 | 4.9%
안동시 풍천면 가곡리 | 2 | 4.9%
안동시 풍천면 갈전리 | 2 | 4.9%
안동시 풍천면 도양리 | 1 | 2.4%
포항시 북구 흥해읍 초곡리 | 1 | 2.4%
구미시 구평동 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경산시 | 32 | 26.0%
신상리 | 32 | 26.0%
진량읍 | 32 | 26.0%
안동시 | 5 | 4.1%
풍천면 | 5 | 4.1%
갈전리 | 2 | 1.6%
가곡리 | 2 | 1.6%
산합리 | 2 | 1.6%
호명면 | 2 | 1.6%
예천군 | 2 | 1.6%
Other values (7) | 7 | 5.7%

지번
Text

UNIQUE 

Distinct: 41
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B

Length

Max length: 8
Median length: 7
Mean length: 6.4146341
Min length: 4

Characters and Unicode

Total characters: 263
Distinct characters: 20
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
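
As a small illustration of these character properties, the sketch below tallies Unicode general categories for a few 지번 values that appear in this report, using Python's standard `unicodedata` module. The choice of sample values is arbitrary, and the module exposes general categories and character names but not scripts or blocks, so only the category breakdown is shown.

```python
import unicodedata
from collections import Counter

# A few 지번 values that appear in this report.
values = ["1118", "1117-1", "지원1-⑤-6", "산업1-①-3"]

categories = Counter()
for value in values:
    for ch in value:
        # category() returns a two-letter code, e.g. "Nd" (Decimal Number),
        # "Pd" (Dash Punctuation), "Lo" (Other Letter), "No" (Other Number).
        categories[unicodedata.category(ch)] += 1

print(dict(categories))
print(unicodedata.name("⑤"))  # CIRCLED DIGIT FIVE
```

The four codes that occur here (Nd, Pd, Lo, No) correspond to the four categories the report lists for this variable.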

Unique

Unique: 41
Unique (%): 100.0%

Sample

1st row: 1118
2nd row: 1117-1
3rd row: 1419
4th row: 1284
5th row: 1298

Value | Count | Frequency (%)
1118 | 1 | 2.4%
지원1-⑤-6 | 1 | 2.4%
지원1-⑤-8 | 1 | 2.4%
지원1-⑤-9 | 1 | 2.4%
지원2-①-1 | 1 | 2.4%
지원2-①-2 | 1 | 2.4%
지원2-②-2 | 1 | 2.4%
지원2-②-3 | 1 | 2.4%
지원2-③-1 | 1 | 2.4%
지원2-③-2 | 1 | 2.4%
Other values (31) | 31 | 75.6%

Most occurring characters

Value | Count | Frequency (%)
- | 65 | 24.7%
1 | 38 | 14.4%
지 | 31 | 11.8%
원 | 31 | 11.8%
2 | 25 | 9.5%
④ | 14 | 5.3%
8 | 8 | 3.0%
4 | 7 | 2.7%
[circled digit] | 6 | 2.3%
3 | 6 | 2.3%
Other values (10) | 32 | 12.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 102 | 38.8%
Dash Punctuation | 65 | 24.7%
Other Letter | 64 | 24.3%
Other Number | 32 | 12.2%

Most frequent character per category

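Decimal Number
Value | Count | Frequency (%)
1 | 38 | 37.3%
2 | 25 | 24.5%
8 | 8 | 7.8%
4 | 7 | 6.9%
3 | 6 | 5.9%
6 | 4 | 3.9%
5 | 4 | 3.9%
9 | 4 | 3.9%
7 | 4 | 3.9%
0 | 2 | 2.0%

Other Number
Value | Count | Frequency (%)
④ | 14 | 43.8%
[circled digit] | 6 | 18.8%
[circled digit] | 5 | 15.6%
[circled digit] | 4 | 12.5%
[circled digit] | 3 | 9.4%

Other Letter
Value | Count | Frequency (%)
지 | 31 | 48.4%
원 | 31 | 48.4%
산 | 1 | 1.6%
업 | 1 | 1.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 65 | 100.0%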

Most occurring scripts

Value | Count | Frequency (%)
Common | 199 | 75.7%
Hangul | 64 | 24.3%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 65 | 32.7%
1 | 38 | 19.1%
2 | 25 | 12.6%
④ | 14 | 7.0%
8 | 8 | 4.0%
4 | 7 | 3.5%
[circled digit] | 6 | 3.0%
3 | 6 | 3.0%
[circled digit] | 5 | 2.5%
[circled digit] | 4 | 2.0%
Other values (6) | 21 | 10.6%

Hangul
Value | Count | Frequency (%)
지 | 31 | 48.4%
원 | 31 | 48.4%
산 | 1 | 1.6%
업 | 1 | 1.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 167 | 63.5%
Hangul | 64 | 24.3%
Enclosed Alphanum | 32 | 12.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 65 | 38.9%
1 | 38 | 22.8%
2 | 25 | 15.0%
8 | 8 | 4.8%
4 | 7 | 4.2%
3 | 6 | 3.6%
6 | 4 | 2.4%
5 | 4 | 2.4%
9 | 4 | 2.4%
7 | 4 | 2.4%

Hangul
Value | Count | Frequency (%)
지 | 31 | 48.4%
원 | 31 | 48.4%
산 | 1 | 1.6%
업 | 1 | 1.6%

Enclosed Alphanum
Value | Count | Frequency (%)
④ | 14 | 43.8%
[circled digit] | 6 | 18.8%
[circled digit] | 5 | 15.6%
[circled digit] | 4 | 12.5%
[circled digit] | 3 | 9.4%

대분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
지원용지: 31
공동주택건설용지: 3
도시기반시설용지: 2
공공시설용지: 2
근린생활시설용지: 1
Other values (2): 2

Length

Max length: 8
Median length: 4
Mean length: 4.7317073
Min length: 4

Unique

Unique: 3
Unique (%): 7.3%

Sample

1st row: 공동주택건설용지
2nd row: 공동주택건설용지
3rd row: 근린생활시설용지
4th row: 도시기반시설용지
5th row: 상업시설용지

Common Values

Value | Count | Frequency (%)
지원용지 | 31 | 75.6%
공동주택건설용지 | 3 | 7.3%
도시기반시설용지 | 2 | 4.9%
공공시설용지 | 2 | 4.9%
근린생활시설용지 | 1 | 2.4%
상업시설용지 | 1 | 2.4%
산업용지 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
지원용지 | 31 | 75.6%
공동주택건설용지 | 3 | 7.3%
도시기반시설용지 | 2 | 4.9%
공공시설용지 | 2 | 4.9%
근린생활시설용지 | 1 | 2.4%
상업시설용지 | 1 | 2.4%
산업용지 | 1 | 2.4%

중분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 19.5%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
지원시설용지: 31
임대주택건설: 2
도시기반시설: 2
교육연구시설: 2
근린생활시설: 1
Other values (3): 3

Length

Max length: 6
Median length: 6
Mean length: 5.902439
Min length: 4

Unique

Unique: 4
Unique (%): 9.8%

Sample

1st row: 임대주택건설
2nd row: 임대주택건설
3rd row: 근린생활시설
4th row: 도시기반시설
5th row: 상업시설

Common Values

Value | Count | Frequency (%)
지원시설용지 | 31 | 75.6%
임대주택건설 | 2 | 4.9%
도시기반시설 | 2 | 4.9%
교육연구시설 | 2 | 4.9%
근린생활시설 | 1 | 2.4%
상업시설 | 1 | 2.4%
공동주택 | 1 | 2.4%
산업시설용지 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
지원시설용지 | 31 | 75.6%
임대주택건설 | 2 | 4.9%
도시기반시설 | 2 | 4.9%
교육연구시설 | 2 | 4.9%
근린생활시설 | 1 | 2.4%
상업시설 | 1 | 2.4%
공동주택 | 1 | 2.4%
산업시설용지 | 1 | 2.4%

소분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 10
Distinct (%): 24.4%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
지원시설용지: 31
공동주택(임대): 2
근린생활시설: 1
주유소: 1
상업시설용지: 1
Other values (5): 5

Length

Max length: 8
Median length: 6
Mean length: 5.8536585
Min length: 3

Unique

Unique: 8
Unique (%): 19.5%

Sample

1st row: 공동주택(임대)
2nd row: 공동주택(임대)
3rd row: 근린생활시설
4th row: 주유소
5th row: 상업시설용지

Common Values

Value | Count | Frequency (%)
지원시설용지 | 31 | 75.6%
공동주택(임대) | 2 | 4.9%
근린생활시설 | 1 | 2.4%
주유소 | 1 | 2.4%
상업시설용지 | 1 | 2.4%
주차장 | 1 | 2.4%
유치원 | 1 | 2.4%
공동주택 | 1 | 2.4%
초,중학교용지 | 1 | 2.4%
산업시설용지 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
지원시설용지 | 31 | 75.6%
공동주택(임대 | 2 | 4.9%
근린생활시설 | 1 | 2.4%
주유소 | 1 | 2.4%
상업시설용지 | 1 | 2.4%
주차장 | 1 | 2.4%
유치원 | 1 | 2.4%
공동주택 | 1 | 2.4%
초,중학교용지 | 1 | 2.4%
산업시설용지 | 1 | 2.4%

필지수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B
1: 41

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 41 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 41 | 100.0%

면적(제곱미터)
Text

Distinct: 40
Distinct (%): 97.6%
Missing: 0
Missing (%): 0.0%
Memory size: 460.0 B

Length

Max length: 9
Median length: 5
Mean length: 5.6829268
Min length: 3

Characters and Unicode

Total characters: 233
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 39
Unique (%): 95.1%

Sample

1st row: 23,535.00
2nd row: 15,370.10
3rd row: 2,702.10
4th row: 1,212.80
5th row: 2,564.10

Value | Count | Frequency (%)
420.9 | 2 | 4.9%
15,370.10 | 1 | 2.4%
421.1 | 1 | 2.4%
466 | 1 | 2.4%
471.7 | 1 | 2.4%
477.2 | 1 | 2.4%
528.2 | 1 | 2.4%
521.4 | 1 | 2.4%
763.9 | 1 | 2.4%
783.4 | 1 | 2.4%
Other values (30) | 30 | 73.2%

Most occurring characters

Value | Count | Frequency (%)
. | 38 | 16.3%
4 | 28 | 12.0%
2 | 27 | 11.6%
7 | 23 | 9.9%
0 | 22 | 9.4%
1 | 22 | 9.4%
5 | 19 | 8.2%
3 | 15 | 6.4%
8 | 11 | 4.7%
, | 10 | 4.3%
Other values (2) | 18 | 7.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 185 | 79.4%
Other Punctuation | 48 | 20.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
4 | 28 | 15.1%
2 | 27 | 14.6%
7 | 23 | 12.4%
0 | 22 | 11.9%
1 | 22 | 11.9%
5 | 19 | 10.3%
3 | 15 | 8.1%
8 | 11 | 5.9%
9 | 9 | 4.9%
6 | 9 | 4.9%

Other Punctuation
Value | Count | Frequency (%)
. | 38 | 79.2%
, | 10 | 20.8%

Most occurring scripts

Value | Count | Frequency (%)
Common | 233 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
. | 38 | 16.3%
4 | 28 | 12.0%
2 | 27 | 11.6%
7 | 23 | 9.9%
0 | 22 | 9.4%
1 | 22 | 9.4%
5 | 19 | 8.2%
3 | 15 | 6.4%
8 | 11 | 4.7%
, | 10 | 4.3%
Other values (2) | 18 | 7.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 233 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
. | 38 | 16.3%
4 | 28 | 12.0%
2 | 27 | 11.6%
7 | 23 | 9.9%
0 | 22 | 9.4%
1 | 22 | 9.4%
5 | 19 | 8.2%
3 | 15 | 6.4%
8 | 11 | 4.7%
, | 10 | 4.3%
Other values (2) | 18 | 7.7%

Correlations

Variable | 사업명 | 소재지 | 지번 | 대분류 | 중분류 | 소분류 | 면적(제곱미터)
사업명 | 1.000 | 1.000 | 1.000 | 0.796 | 0.992 | 1.000 | 1.000
소재지 | 1.000 | 1.000 | 1.000 | 0.974 | 0.947 | 1.000 | 1.000
지번 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대분류 | 0.796 | 0.974 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
중분류 | 0.992 | 0.947 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소분류 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
면적(제곱미터) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 사업명 | 소재지 | 중분류 | 대분류 | 소분류
사업명 | 1.000 | 0.959 | 0.828 | 0.662 | 0.915
소재지 | 0.959 | 1.000 | 0.852 | 0.748 | 0.955
중분류 | 0.828 | 0.852 | 1.000 | 0.985 | 0.969
대분류 | 0.662 | 0.748 | 0.985 | 1.000 | 0.955
소분류 | 0.915 | 0.955 | 0.969 | 0.955 | 1.000

Variable | 사업명 | 소재지 | 대분류 | 중분류 | 소분류
사업명 | 1.000 | 0.959 | 0.662 | 0.828 | 0.915
소재지 | 0.959 | 1.000 | 0.748 | 0.852 | 0.955
대분류 | 0.662 | 0.748 | 1.000 | 0.985 | 0.955
중분류 | 0.828 | 0.852 | 0.985 | 1.000 | 0.969
소분류 | 0.915 | 0.955 | 0.955 | 0.969 | 1.000
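
The report text does not name the association measure behind these matrices. For categorical pairs, a commonly used one is Cramér's V, so the sketch below shows its plain (uncorrected) form on hypothetical toy data; treating it as the measure used here is an assumption, and the function name `cramers_v` is illustrative.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for x, rx in row_totals.items():
        for y, cy in col_totals.items():
            expected = rx * cy / n
            chi2 += (observed[(x, y)] - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Toy data (hypothetical): x determines y in the first pair, not in the second.
print(cramers_v(["a", "a", "b", "b"], ["p", "p", "q", "q"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"]))  # 0.0
```

With this uncorrected form, a column whose values are all different (such as 지번) scores 1 against any other non-constant column, so values of 1.000 involving such a column say little about real dependence in a 41-row table.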

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 사업명 | 소재지 | 지번 | 대분류 | 중분류 | 소분류 | 필지수 | 면적(제곱미터)
0 | 경북도청신도시건설사업(1단계) | 예천군 호명면 산합리 | 1118 | 공동주택건설용지 | 임대주택건설 | 공동주택(임대) | 1 | 23,535.00
1 | 경북도청신도시건설사업(1단계) | 예천군 호명면 산합리 | 1117-1 | 공동주택건설용지 | 임대주택건설 | 공동주택(임대) | 1 | 15,370.10
2 | 경북도청신도시건설사업(1단계) | 안동시 풍천면 도양리 | 1419 | 근린생활시설용지 | 근린생활시설 | 근린생활시설 | 1 | 2,702.10
3 | 경북도청신도시건설사업(1단계) | 안동시 풍천면 가곡리 | 1284 | 도시기반시설용지 | 도시기반시설 | 주유소 | 1 | 1,212.80
4 | 경북도청신도시건설사업(1단계) | 안동시 풍천면 갈전리 | 1298 | 상업시설용지 | 상업시설 | 상업시설용지 | 1 | 2,564.10
5 | 경북도청신도시건설사업(1단계) | 안동시 풍천면 갈전리 | 1518 | 도시기반시설용지 | 도시기반시설 | 주차장 | 1 | 1,346.20
6 | 경북도청신도시건설사업(1단계) | 안동시 풍천면 가곡리 | 1286 | 공공시설용지 | 교육연구시설 | 유치원 | 1 | 2,107.90
7 | 포항초곡지구 도시개발사업 | 포항시 북구 흥해읍 초곡리 | 1765 | 공동주택건설용지 | 공동주택 | 공동주택 | 1 | 21,412.70
8 | 구미구평2지구 택지개발사업 | 구미시 구평동 | 1084 | 공공시설용지 | 교육연구시설 | 초,중학교용지 | 1 | 13,212.00
9 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 산업1-①-3 | 산업용지 | 산업시설용지 | 산업시설용지 | 1 | 8,944.30

Last rows

Index | 사업명 | 소재지 | 지번 | 대분류 | 중분류 | 소분류 | 필지수 | 면적(제곱미터)
31 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-③-3 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 420.9
32 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-2 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 476
33 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-3 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 474.4
34 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-4 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 475.3
35 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-5 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 475.7
36 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-6 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 474.7
37 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-7 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 473.5
38 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-8 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 472.9
39 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-9 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 473.8
40 | 경산1-1일반산업단지 조성사업 | 경산시 진량읍 신상리 | 지원2-④-10 | 지원용지 | 지원시설용지 | 지원시설용지 | 1 | 632.1
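
The 면적(제곱미터) values are stored as strings, some with thousands separators (for example 23,535.00), which is a plausible reason the column is profiled as text rather than as a number. A minimal sketch of converting such strings to floats before numeric analysis follows; the helper name `parse_area` is hypothetical, and the inputs are copied from the rows above.

```python
def parse_area(text):
    """Convert an area string such as "23,535.00" to a float (square meters)."""
    return float(text.replace(",", ""))

# Values copied from the 면적(제곱미터) column in this report.
samples = ["23,535.00", "15,370.10", "2,702.10", "420.9", "476"]
areas = [parse_area(s) for s in samples]
print(areas)
```

After conversion, the column can be summarized with numeric statistics (minimum, maximum, mean) instead of character counts.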