Overview

Dataset statistics

Number of variables18
Number of observations212
Missing cells85
Missing cells (%)2.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory30.6 KiB
Average record size in memory147.6 B

Variable types

Categorical7
Text4
Numeric3
DateTime4

Dataset

Description경남도 내 산업단지 현황(유형, 시도, 시군, 단지명, 조성상태, 지정면적, 사업기간, 분양상태, 유치업종 등)입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3084288

Alerts

시도 has constant value ""Constant
시군 is highly overall correlated with 관리기관High correlation
관리기관 is highly overall correlated with 시군High correlation
공정율(퍼센트) is highly overall correlated with 조성상태 and 1 other fieldsHigh correlation
조성상태 is highly overall correlated with 공정율(퍼센트)High correlation
분양상태 is highly overall correlated with 공정율(퍼센트)High correlation
사업시행자 has 4 (1.9%) missing valuesMissing
착공일 has 12 (5.7%) missing valuesMissing
준공인가일 has 68 (32.1%) missing valuesMissing
단지명 has unique valuesUnique
지정면적 has unique valuesUnique
공정율(퍼센트) has 17 (8.0%) zerosZeros
산업용지평균분양가 has 68 (32.1%) zerosZeros

Reproduction

Analysis started2023-12-11 00:29:31.562677
Analysis finished2023-12-11 00:29:33.918769
Duration2.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

유형
Categorical

Distinct4
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
일반
119 
농공
81 
국가
 
11
도시첨단
 
1

Length

Max length4
Median length2
Mean length2.009434
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row국가
2nd row국가
3rd row국가
4th row국가
5th row국가

Common Values

ValueCountFrequency (%)
일반 119
56.1%
농공 81
38.2%
국가 11
 
5.2%
도시첨단 1
 
0.5%

Length

2023-12-11T09:29:34.002964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:29:34.127392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 119
56.1%
농공 81
38.2%
국가 11
 
5.2%
도시첨단 1
 
0.5%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
경남
212 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경남
2nd row경남
3rd row경남
4th row경남
5th row경남

Common Values

ValueCountFrequency (%)
경남 212
100.0%

Length

2023-12-11T09:29:34.251023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:29:34.360052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경남 212
100.0%

시군
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
김해시
29 
함안군
25 
사천시
21 
창원시
20 
진주시
13 
Other values (13)
104 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row창원시
2nd row통영시
3rd row거제시
4th row거제시
5th row창원시

Common Values

ValueCountFrequency (%)
김해시 29
13.7%
함안군 25
11.8%
사천시 21
9.9%
창원시 20
9.4%
진주시 13
 
6.1%
밀양시 13
 
6.1%
양산시 13
 
6.1%
고성군 12
 
5.7%
창녕군 11
 
5.2%
거창군 9
 
4.2%
Other values (8) 46
21.7%

Length

2023-12-11T09:29:34.470866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
김해시 29
13.7%
함안군 25
11.8%
사천시 21
9.9%
창원시 20
9.4%
진주시 13
 
6.1%
밀양시 13
 
6.1%
양산시 13
 
6.1%
고성군 12
 
5.7%
창녕군 11
 
5.2%
함양군 9
 
4.2%
Other values (8) 46
21.7%

단지명
Text

UNIQUE 

Distinct212
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-11T09:29:34.676119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length8.7735849
Min length6

Characters and Unicode

Total characters1860
Distinct characters193
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique212 ?
Unique (%)100.0%

Sample

1st row녹산지구(주거단지)
2nd row안정국가산업단지
3rd row옥포국가산업단지
4th row죽도국가산업단지
5th row진해국가산업단지
ValueCountFrequency (%)
녹산지구(주거단지 1
 
0.5%
봉림농공단지 1
 
0.5%
대곡농공단지 1
 
0.5%
병동농공단지 1
 
0.5%
진북농공단지 1
 
0.5%
부북특별농공단지 1
 
0.5%
상남특별농공단지 1
 
0.5%
초동특별농공단지 1
 
0.5%
하남농공단지 1
 
0.5%
곤양농공단지 1
 
0.5%
Other values (202) 202
95.3%
2023-12-11T09:29:35.074729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
224
 
12.0%
211
 
11.3%
153
 
8.2%
130
 
7.0%
114
 
6.1%
114
 
6.1%
85
 
4.6%
81
 
4.4%
21
 
1.1%
18
 
1.0%
Other values (183) 709
38.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1773
95.3%
Close Punctuation 20
 
1.1%
Open Punctuation 20
 
1.1%
Decimal Number 18
 
1.0%
Lowercase Letter 13
 
0.7%
Uppercase Letter 9
 
0.5%
Other Punctuation 6
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
224
 
12.6%
211
 
11.9%
153
 
8.6%
130
 
7.3%
114
 
6.4%
114
 
6.4%
85
 
4.8%
81
 
4.6%
21
 
1.2%
18
 
1.0%
Other values (160) 622
35.1%
Lowercase Letter
ValueCountFrequency (%)
o 5
38.5%
e 2
 
15.4%
n 2
 
15.4%
l 1
 
7.7%
t 1
 
7.7%
d 1
 
7.7%
c 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
M 2
22.2%
R 2
22.2%
O 1
11.1%
A 1
11.1%
G 1
11.1%
Z 1
11.1%
E 1
11.1%
Decimal Number
ValueCountFrequency (%)
2 14
77.8%
1 3
 
16.7%
3 1
 
5.6%
Close Punctuation
ValueCountFrequency (%)
) 12
60.0%
] 8
40.0%
Open Punctuation
ValueCountFrequency (%)
( 12
60.0%
[ 8
40.0%
Other Punctuation
ValueCountFrequency (%)
: 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1773
95.3%
Common 65
 
3.5%
Latin 22
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
224
 
12.6%
211
 
11.9%
153
 
8.6%
130
 
7.3%
114
 
6.4%
114
 
6.4%
85
 
4.8%
81
 
4.6%
21
 
1.2%
18
 
1.0%
Other values (160) 622
35.1%
Latin
ValueCountFrequency (%)
o 5
22.7%
M 2
 
9.1%
e 2
 
9.1%
n 2
 
9.1%
R 2
 
9.1%
O 1
 
4.5%
A 1
 
4.5%
l 1
 
4.5%
t 1
 
4.5%
d 1
 
4.5%
Other values (4) 4
18.2%
Common
ValueCountFrequency (%)
2 14
21.5%
) 12
18.5%
( 12
18.5%
] 8
12.3%
[ 8
12.3%
: 6
9.2%
1 3
 
4.6%
- 1
 
1.5%
3 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1773
95.3%
ASCII 87
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
224
 
12.6%
211
 
11.9%
153
 
8.6%
130
 
7.3%
114
 
6.4%
114
 
6.4%
85
 
4.8%
81
 
4.6%
21
 
1.2%
18
 
1.0%
Other values (160) 622
35.1%
ASCII
ValueCountFrequency (%)
2 14
16.1%
) 12
13.8%
( 12
13.8%
] 8
9.2%
[ 8
9.2%
: 6
 
6.9%
o 5
 
5.7%
1 3
 
3.4%
M 2
 
2.3%
e 2
 
2.3%
Other values (13) 15
17.2%

조성상태
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
완료
144 
조성중
56 
준비중
 
6
보상중
 
6

Length

Max length3
Median length2
Mean length2.3207547
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row완료
2nd row조성중
3rd row조성중
4th row조성중
5th row조성중

Common Values

ValueCountFrequency (%)
완료 144
67.9%
조성중 56
 
26.4%
준비중 6
 
2.8%
보상중 6
 
2.8%

Length

2023-12-11T09:29:35.238465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:29:35.341024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
완료 144
67.9%
조성중 56
 
26.4%
준비중 6
 
2.8%
보상중 6
 
2.8%

지정면적
Real number (ℝ)

UNIQUE 

Distinct212
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean674948.79
Minimum37180
Maximum35870341
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-11T09:29:35.461172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37180
5-th percentile63968.5
Q1111534
median193708.5
Q3482144
95-th percentile2053098.3
Maximum35870341
Range35833161
Interquartile range (IQR)370610

Descriptive statistics

Standard deviation2573215.5
Coefficient of variation (CV)3.8124603
Kurtosis167.80829
Mean674948.79
Median Absolute Deviation (MAD)102725.5
Skewness12.354128
Sum1.4308914 × 108
Variance6.6214378 × 1012
MonotonicityNot monotonic
2023-12-11T09:29:35.594182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1672153 1
 
0.5%
405147 1
 
0.5%
133170 1
 
0.5%
160941 1
 
0.5%
83620 1
 
0.5%
326788 1
 
0.5%
179159 1
 
0.5%
84092 1
 
0.5%
567762 1
 
0.5%
104391 1
 
0.5%
Other values (202) 202
95.3%
ValueCountFrequency (%)
37180 1
0.5%
40479 1
0.5%
42354 1
0.5%
44340 1
0.5%
44524 1
0.5%
49166 1
0.5%
52139 1
0.5%
53530 1
0.5%
58590 1
0.5%
61511 1
0.5%
ValueCountFrequency (%)
35870341 1
0.5%
5986182 1
0.5%
5612938 1
0.5%
4184814 1
0.5%
3863734 1
0.5%
3269235 1
0.5%
3051845 1
0.5%
2942000 1
0.5%
2803758 1
0.5%
2545259 1
0.5%
Distinct186
Distinct (%)87.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Minimum1974-04-01 00:00:00
Maximum2022-12-29 00:00:00
2023-12-11T09:29:35.721737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:35.888852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct134
Distinct (%)63.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-11T09:29:36.210394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters1908
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)42.5%

Sample

1st row1990~1999
2nd row1997~2024
3rd row1974~2024
4th row1977~2025
5th row1983~2023
ValueCountFrequency (%)
1990~1991 6
 
2.8%
2017~2024 6
 
2.8%
2010~2023 6
 
2.8%
2008~2014 4
 
1.9%
1988~1989 4
 
1.9%
2012~2024 4
 
1.9%
2015~2023 4
 
1.9%
2013~2023 3
 
1.4%
2014~2024 3
 
1.4%
2014~2019 3
 
1.4%
Other values (124) 169
79.7%
2023-12-11T09:29:36.641346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 454
23.8%
2 435
22.8%
1 291
15.3%
9 218
11.4%
~ 212
11.1%
8 70
 
3.7%
4 60
 
3.1%
3 58
 
3.0%
7 50
 
2.6%
5 35
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1696
88.9%
Math Symbol 212
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 454
26.8%
2 435
25.6%
1 291
17.2%
9 218
12.9%
8 70
 
4.1%
4 60
 
3.5%
3 58
 
3.4%
7 50
 
2.9%
5 35
 
2.1%
6 25
 
1.5%
Math Symbol
ValueCountFrequency (%)
~ 212
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1908
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 454
23.8%
2 435
22.8%
1 291
15.3%
9 218
11.4%
~ 212
11.1%
8 70
 
3.7%
4 60
 
3.1%
3 58
 
3.0%
7 50
 
2.6%
5 35
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1908
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 454
23.8%
2 435
22.8%
1 291
15.3%
9 218
11.4%
~ 212
11.1%
8 70
 
3.7%
4 60
 
3.1%
3 58
 
3.0%
7 50
 
2.6%
5 35
 
1.8%

사업시행자
Text

MISSING 

Distinct137
Distinct (%)65.9%
Missing4
Missing (%)1.9%
Memory size1.8 KiB
2023-12-11T09:29:36.898782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length121
Median length75
Mean length11.225962
Min length4

Characters and Unicode

Total characters2335
Distinct characters227
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique118 ?
Unique (%)56.7%

Sample

1st row한국토지주택공사
2nd row(주)대우건설, 한국가스공사, 성동조선해양(주), (주)가야중공업, (주)동양조선, 성동철강(주)
3rd row대우조선해양(주)
4th row삼성중공업(주)
5th row(주)오리엔탈정공, (주)케이조선
ValueCountFrequency (%)
14
 
4.4%
경남_사천시 9
 
2.8%
경남개발공사 9
 
2.8%
경남_김해시 8
 
2.5%
경남_진주시 8
 
2.5%
한국토지주택공사 8
 
2.5%
주식회사 7
 
2.2%
경남_창원시 7
 
2.2%
경남_거창군 6
 
1.9%
경남_의령군 5
 
1.6%
Other values (206) 237
74.5%
2023-12-11T09:29:37.280613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
184
 
7.9%
) 150
 
6.4%
( 150
 
6.4%
112
 
4.8%
89
 
3.8%
87
 
3.7%
_ 73
 
3.1%
, 70
 
3.0%
67
 
2.9%
48
 
2.1%
Other values (217) 1305
55.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1736
74.3%
Close Punctuation 152
 
6.5%
Open Punctuation 152
 
6.5%
Space Separator 112
 
4.8%
Connector Punctuation 73
 
3.1%
Other Punctuation 73
 
3.1%
Decimal Number 23
 
1.0%
Uppercase Letter 13
 
0.6%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
 
10.6%
89
 
5.1%
87
 
5.0%
67
 
3.9%
48
 
2.8%
44
 
2.5%
44
 
2.5%
39
 
2.2%
37
 
2.1%
37
 
2.1%
Other values (189) 1060
61.1%
Decimal Number
ValueCountFrequency (%)
3 9
39.1%
1 3
 
13.0%
2 3
 
13.0%
4 2
 
8.7%
8 2
 
8.7%
7 1
 
4.3%
0 1
 
4.3%
6 1
 
4.3%
5 1
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
E 3
23.1%
P 2
15.4%
S 2
15.4%
T 1
 
7.7%
N 1
 
7.7%
H 1
 
7.7%
K 1
 
7.7%
R 1
 
7.7%
G 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 70
95.9%
: 2
 
2.7%
& 1
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 150
98.7%
] 2
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 150
98.7%
[ 2
 
1.3%
Space Separator
ValueCountFrequency (%)
112
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 73
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1737
74.4%
Common 585
 
25.1%
Latin 13
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
 
10.6%
89
 
5.1%
87
 
5.0%
67
 
3.9%
48
 
2.8%
44
 
2.5%
44
 
2.5%
39
 
2.2%
37
 
2.1%
37
 
2.1%
Other values (190) 1061
61.1%
Common
ValueCountFrequency (%)
) 150
25.6%
( 150
25.6%
112
19.1%
_ 73
12.5%
, 70
12.0%
3 9
 
1.5%
1 3
 
0.5%
2 3
 
0.5%
] 2
 
0.3%
4 2
 
0.3%
Other values (8) 11
 
1.9%
Latin
ValueCountFrequency (%)
E 3
23.1%
P 2
15.4%
S 2
15.4%
T 1
 
7.7%
N 1
 
7.7%
H 1
 
7.7%
K 1
 
7.7%
R 1
 
7.7%
G 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1736
74.3%
ASCII 598
 
25.6%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
184
 
10.6%
89
 
5.1%
87
 
5.0%
67
 
3.9%
48
 
2.8%
44
 
2.5%
44
 
2.5%
39
 
2.2%
37
 
2.1%
37
 
2.1%
Other values (189) 1060
61.1%
ASCII
ValueCountFrequency (%)
) 150
25.1%
( 150
25.1%
112
18.7%
_ 73
12.2%
, 70
11.7%
3 9
 
1.5%
1 3
 
0.5%
2 3
 
0.5%
E 3
 
0.5%
] 2
 
0.3%
Other values (17) 23
 
3.8%
None
ValueCountFrequency (%)
1
100.0%

관리기관
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
경남_김해시
26 
경남_함안군
24 
경남_사천시
15 
경남_창원시
14 
경남_양산시
13 
Other values (25)
120 

Length

Max length25
Median length6
Mean length6.5613208
Min length4

Unique

Unique9 ?
Unique (%)4.2%

Sample

1st row한국산업단지공단
2nd row한국산업단지공단
3rd row경상남도
4th row경상남도
5th row한국산업단지공단

Common Values

ValueCountFrequency (%)
경남_김해시 26
 
12.3%
경남_함안군 24
 
11.3%
경남_사천시 15
 
7.1%
경남_창원시 14
 
6.6%
경남_양산시 13
 
6.1%
경남_고성군 12
 
5.7%
경남_진주시 12
 
5.7%
경남_창녕군 11
 
5.2%
경남_밀양시 11
 
5.2%
한국산업단지공단 9
 
4.2%
Other values (20) 65
30.7%

Length

2023-12-11T09:29:37.412686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경남_김해시 27
12.4%
경남_함안군 24
 
11.1%
경남_사천시 17
 
7.8%
경남_창원시 14
 
6.5%
경남_양산시 13
 
6.0%
경남_고성군 12
 
5.5%
경남_진주시 12
 
5.5%
경남_밀양시 12
 
5.5%
경남_창녕군 11
 
5.1%
한국산업단지공단 11
 
5.1%
Other values (19) 64
29.5%

분양상태
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
완료
161 
분양중
34 
분양계획
17 

Length

Max length4
Median length2
Mean length2.3207547
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row완료
2nd row완료
3rd row완료
4th row완료
5th row완료

Common Values

ValueCountFrequency (%)
완료 161
75.9%
분양중 34
 
16.0%
분양계획 17
 
8.0%

Length

2023-12-11T09:29:37.558572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:29:37.690440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
완료 161
75.9%
분양중 34
 
16.0%
분양계획 17
 
8.0%

공정율(퍼센트)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct38
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.783019
Minimum0
Maximum100
Zeros17
Zeros (%)8.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-11T09:29:37.794996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q190
median100
Q3100
95-th percentile100
Maximum100
Range100
Interquartile range (IQR)10

Descriptive statistics

Standard deviation32.65285
Coefficient of variation (CV)0.38973112
Kurtosis1.8862669
Mean83.783019
Median Absolute Deviation (MAD)0
Skewness-1.8767325
Sum17762
Variance1066.2086
MonotonicityNot monotonic
2023-12-11T09:29:37.915470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
100 148
69.8%
0 17
 
8.0%
5 3
 
1.4%
90 3
 
1.4%
98 2
 
0.9%
62 2
 
0.9%
83 2
 
0.9%
95 2
 
0.9%
99 2
 
0.9%
15 2
 
0.9%
Other values (28) 29
 
13.7%
ValueCountFrequency (%)
0 17
8.0%
4 1
 
0.5%
5 3
 
1.4%
10 1
 
0.5%
15 2
 
0.9%
20 1
 
0.5%
23 1
 
0.5%
26 1
 
0.5%
27 1
 
0.5%
30 1
 
0.5%
ValueCountFrequency (%)
100 148
69.8%
99 2
 
0.9%
98 2
 
0.9%
96 1
 
0.5%
95 2
 
0.9%
94 1
 
0.5%
92 1
 
0.5%
91 1
 
0.5%
90 3
 
1.4%
88 1
 
0.5%
Distinct192
Distinct (%)90.6%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Minimum1974-07-14 00:00:00
Maximum2022-12-29 00:00:00
2023-12-11T09:29:38.038122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:38.178872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

착공일
Date

MISSING 

Distinct193
Distinct (%)96.5%
Missing12
Missing (%)5.7%
Memory size1.8 KiB
Minimum1974-02-04 00:00:00
Maximum2022-06-20 00:00:00
2023-12-11T09:29:38.296880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:38.419769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

준공인가일
Date

MISSING 

Distinct136
Distinct (%)94.4%
Missing68
Missing (%)32.1%
Memory size1.8 KiB
Minimum1981-09-30 00:00:00
Maximum2023-01-11 00:00:00
2023-12-11T09:29:38.555704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:38.677021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

산업용지평균분양가
Real number (ℝ)

ZEROS 

Distinct143
Distinct (%)67.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean143831.89
Minimum0
Maximum920725
Zeros68
Zeros (%)32.1%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-11T09:29:38.798619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median52970
Q3208610
95-th percentile536410
Maximum920725
Range920725
Interquartile range (IQR)208610

Descriptive statistics

Standard deviation195290.26
Coefficient of variation (CV)1.3577675
Kurtosis2.7523995
Mean143831.89
Median Absolute Deviation (MAD)52970
Skewness1.741979
Sum30492361
Variance3.8138286 × 1010
MonotonicityNot monotonic
2023-12-11T09:29:38.927266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 68
32.1%
257000 2
 
0.9%
123670 2
 
0.9%
26057 1
 
0.5%
31760 1
 
0.5%
17240 1
 
0.5%
42400 1
 
0.5%
45900 1
 
0.5%
44740 1
 
0.5%
41626 1
 
0.5%
Other values (133) 133
62.7%
ValueCountFrequency (%)
0 68
32.1%
104 1
 
0.5%
7250 1
 
0.5%
9261 1
 
0.5%
11287 1
 
0.5%
11792 1
 
0.5%
13410 1
 
0.5%
14210 1
 
0.5%
15000 1
 
0.5%
15200 1
 
0.5%
ValueCountFrequency (%)
920725 1
0.5%
870598 1
0.5%
842671 1
0.5%
768350 1
0.5%
721073 1
0.5%
719259 1
0.5%
650889 1
0.5%
625058 1
0.5%
584480 1
0.5%
550740 1
0.5%
Distinct205
Distinct (%)97.2%
Missing1
Missing (%)0.5%
Memory size1.8 KiB
2023-12-11T09:29:39.130625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length280
Median length140
Mean length89.85782
Min length5

Characters and Unicode

Total characters18960
Distinct characters279
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)95.3%

Sample

1st row전기 가스 및 증기업, 조립금속, 기타 운송 장비, 재생용가공원료생산업, 기초무기화합물 제조업
2nd row신조선, 해상플랜크, 특수선박, 수리조선연관 기계공업
3rd row중형조선소, 조선기자재 공장(1차금속,금속가공,전기장비,기타기계 및 장비, 기타 운송장비)
4th row중형조선소, 조선기자재 공장
5th row원유비축시설
ValueCountFrequency (%)
417
 
15.5%
제조업 185
 
6.9%
기타 78
 
2.9%
장비 59
 
2.2%
금속가공제품 54
 
2.0%
기계 53
 
2.0%
자동차 48
 
1.8%
트레일러 42
 
1.6%
한함 41
 
1.5%
운송장비 38
 
1.4%
Other values (674) 1682
62.4%
2023-12-11T09:29:39.451215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2507
 
13.2%
, 1318
 
7.0%
875
 
4.6%
831
 
4.4%
828
 
4.4%
653
 
3.4%
564
 
3.0%
494
 
2.6%
446
 
2.4%
376
 
2.0%
Other values (269) 10068
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12951
68.3%
Space Separator 2507
 
13.2%
Other Punctuation 1625
 
8.6%
Decimal Number 783
 
4.1%
Open Punctuation 373
 
2.0%
Close Punctuation 373
 
2.0%
Uppercase Letter 290
 
1.5%
Connector Punctuation 53
 
0.3%
Math Symbol 4
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
875
 
6.8%
831
 
6.4%
828
 
6.4%
653
 
5.0%
564
 
4.4%
494
 
3.8%
446
 
3.4%
376
 
2.9%
338
 
2.6%
280
 
2.2%
Other values (234) 7266
56.1%
Decimal Number
ValueCountFrequency (%)
1 218
27.8%
2 169
21.6%
3 106
13.5%
5 81
 
10.3%
4 44
 
5.6%
6 39
 
5.0%
0 39
 
5.0%
8 38
 
4.9%
9 37
 
4.7%
7 12
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
C 218
75.2%
D 32
 
11.0%
L 18
 
6.2%
H 16
 
5.5%
M 2
 
0.7%
J 2
 
0.7%
G 1
 
0.3%
E 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 1318
81.1%
· 244
 
15.0%
* 24
 
1.5%
. 20
 
1.2%
: 15
 
0.9%
; 2
 
0.1%
2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 359
96.2%
[ 13
 
3.5%
{ 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 359
96.2%
] 12
 
3.2%
} 2
 
0.5%
Space Separator
ValueCountFrequency (%)
2507
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 53
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12951
68.3%
Common 5719
30.2%
Latin 290
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
875
 
6.8%
831
 
6.4%
828
 
6.4%
653
 
5.0%
564
 
4.4%
494
 
3.8%
446
 
3.4%
376
 
2.9%
338
 
2.6%
280
 
2.2%
Other values (234) 7266
56.1%
Common
ValueCountFrequency (%)
2507
43.8%
, 1318
23.0%
( 359
 
6.3%
) 359
 
6.3%
· 244
 
4.3%
1 218
 
3.8%
2 169
 
3.0%
3 106
 
1.9%
5 81
 
1.4%
_ 53
 
0.9%
Other values (17) 305
 
5.3%
Latin
ValueCountFrequency (%)
C 218
75.2%
D 32
 
11.0%
L 18
 
6.2%
H 16
 
5.5%
M 2
 
0.7%
J 2
 
0.7%
G 1
 
0.3%
E 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12951
68.3%
ASCII 5763
30.4%
None 244
 
1.3%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2507
43.5%
, 1318
22.9%
( 359
 
6.2%
) 359
 
6.2%
C 218
 
3.8%
1 218
 
3.8%
2 169
 
2.9%
3 106
 
1.8%
5 81
 
1.4%
_ 53
 
0.9%
Other values (23) 375
 
6.5%
Hangul
ValueCountFrequency (%)
875
 
6.8%
831
 
6.4%
828
 
6.4%
653
 
5.0%
564
 
4.4%
494
 
3.8%
446
 
3.4%
376
 
2.9%
338
 
2.6%
280
 
2.2%
Other values (234) 7266
56.1%
None
ValueCountFrequency (%)
· 244
100.0%
Punctuation
ValueCountFrequency (%)
2
100.0%

사업주체
Categorical

Distinct5
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
민간
112 
시군구
69 
공사
18 
민/관합작
12 
시도
 
1

Length

Max length5
Median length2
Mean length2.495283
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row공사
2nd row민간
3rd row민간
4th row민간
5th row민간

Common Values

ValueCountFrequency (%)
민간 112
52.8%
시군구 69
32.5%
공사 18
 
8.5%
민/관합작 12
 
5.7%
시도 1
 
0.5%

Length

2023-12-11T09:29:39.583991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:29:39.684990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간 112
52.8%
시군구 69
32.5%
공사 18
 
8.5%
민/관합작 12
 
5.7%
시도 1
 
0.5%

Interactions

2023-12-11T09:29:32.915530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:32.416001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:32.661331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:33.001063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:32.500734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:32.753638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:33.116193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:32.584874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:32.835707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:29:39.771747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형시군조성상태지정면적관리기관분양상태공정율(퍼센트)산업용지평균분양가사업주체
유형1.0000.4160.6420.3680.7780.2840.5200.3680.455
시군0.4161.0000.0880.4240.9990.3030.0000.0000.361
조성상태0.6420.0881.0000.1480.1010.4200.7570.0870.300
지정면적0.3680.4240.1481.0000.5640.2190.5120.0000.234
관리기관0.7780.9990.1010.5641.0000.4390.3780.6170.686
분양상태0.2840.3030.4200.2190.4391.0000.7980.5630.242
공정율(퍼센트)0.5200.0000.7570.5120.3780.7981.0000.3810.384
산업용지평균분양가0.3680.0000.0870.0000.6170.5630.3811.0000.214
사업주체0.4550.3610.3000.2340.6860.2420.3840.2141.000
2023-12-11T09:29:40.156765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군분양상태관리기관사업주체조성상태유형
시군1.0000.1390.9440.1860.0420.230
분양상태0.1391.0000.2110.1860.4110.272
관리기관0.9440.2111.0000.3480.0420.493
사업주체0.1860.1860.3481.0000.2480.385
조성상태0.0420.4110.0420.2481.0000.298
유형0.2300.2720.4930.3850.2981.000
2023-12-11T09:29:40.264253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정면적공정율(퍼센트)산업용지평균분양가유형시군조성상태관리기관분양상태사업주체
지정면적1.000-0.3200.0860.3570.2080.1400.2960.0650.180
공정율(퍼센트)-0.3201.0000.3150.3550.0000.5870.1410.5080.218
산업용지평균분양가0.0860.3151.0000.2240.0000.0490.2300.4000.088
유형0.3570.3550.2241.0000.2300.2980.4930.2720.385
시군0.2080.0000.0000.2301.0000.0420.9440.1390.186
조성상태0.1400.5870.0490.2980.0421.0000.0420.4110.248
관리기관0.2960.1410.2300.4930.9440.0421.0000.2110.348
분양상태0.0650.5080.4000.2720.1390.4110.2111.0000.186
사업주체0.1800.2180.0880.3850.1860.2480.3480.1861.000

Missing values

2023-12-11T09:29:33.233041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:29:33.440234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T09:29:33.845670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

유형시도시군단지명조성상태지정면적지정일자사업기간(년)사업시행자관리기관분양상태공정율(퍼센트)실시승인일착공일준공인가일산업용지평균분양가유치업종사업주체
0국가경남창원시녹산지구(주거단지)완료16721531989-10-201990~1999한국토지주택공사한국산업단지공단완료1001990-05-161990-07-231999-08-310<NA>공사
1국가경남통영시안정국가산업단지조성중38637341974-04-011997~2024(주)대우건설, 한국가스공사, 성동조선해양(주), (주)가야중공업, (주)동양조선, 성동철강(주)한국산업단지공단완료1001999-01-071999-01-07<NA>121000전기 가스 및 증기업, 조립금속, 기타 운송 장비, 재생용가공원료생산업, 기초무기화합물 제조업민간
2국가경남거제시옥포국가산업단지조성중59861821974-04-011974~2024대우조선해양(주)경상남도완료981974-07-141974-07-14<NA>0신조선, 해상플랜크, 특수선박, 수리조선연관 기계공업민간
3국가경남거제시죽도국가산업단지조성중41848141974-04-011977~2025삼성중공업(주)경상남도완료881974-07-201974-07-20<NA>0중형조선소, 조선기자재 공장(1차금속,금속가공,전기장비,기타기계 및 장비, 기타 운송장비)민간
4국가경남창원시진해국가산업단지조성중32692351982-08-021983~2023(주)오리엔탈정공, (주)케이조선한국산업단지공단완료521984-01-051983-10-22<NA>257000중형조선소, 조선기자재 공장민간
5국가경남거제시지세포자원비축단지완료29420001974-09-201981~2012한국석유공사한국석유공사완료1001989-11-021981-09-012012-12-27148191원유비축시설공사
6국가경남창원시창원국가산업단지[재생사업지구:부분]조성중358703411974-04-011974~2023한국수자원공사, 경남_창원시, 한국전기연구소, 실수요자한국산업단지공단분양중811974-12-111974-02-04<NA>185000기계공업 및 관련공업인 소재형공업, 산업용기계공업, 전기기기 및 수송기계(다만 차룡단지는 도시기능의 활성화를 위한 공업을 유치할 수 있음)민/관합작
7국가경남사천시경남항공국가산업단지조성중16549102017-05-022017~2024한국토지주택공사한국산업단지공단분양중622017-05-022018-12-26<NA>0금속가공제품,전기장비,기타기계및장비,자동차및트레일러,기타운송장비,화학물질및화학제품,고무제품및플라스틱,비금속광물,1차금속 제조업공사
8국가경남진주시경남항공(진주지구)조성중8348702017-05-022017~2024한국토지주택공사한국산업단지공단분양중682017-05-022018-12-26<NA>516134금속가공제품,전기장비,기타기계및장비,자동차및트레일러,기타운송장비,화학물질및화학제품,고무제품및플라스틱,비금속광물,1차금속 제조업공사
9국가경남사천시경남항공(사천지구)조성중8200402017-05-022017~2024한국토지주택공사한국산업단지공단분양중562017-05-022018-12-26<NA>413147금속가공제품,전기장비,기타기계및장비,자동차및트레일러,기타운송장비,화학물질및화학제품,고무제품및플라스틱,비금속광물,1차금속 제조업공사
유형시도시군단지명조성상태지정면적지정일자사업기간(년)사업시행자관리기관분양상태공정율(퍼센트)실시승인일착공일준공인가일산업용지평균분양가유치업종사업주체
202농공경남하동군금성조선농공단지조성중1453352009-12-072009~2023비엔금성개발(주)경남_하동군완료1002009-12-072013-10-18<NA>0기타 운송장비제조업민간
203농공경남함안군칠원운서농공단지완료945312010-04-152010~2012운서농공단지(주)경남_함안군완료1002010-04-152010-09-012012-05-100목재 및 나무제품 제조업, 전기 장비 제조업, 기타 기계및 장비 제조업, 자동차 및 트레일러 제조업, 기타 운송장비 제조업민간
204농공경남밀양시미전농공단지완료1660932010-09-092010~2014미전지구농공단지협동화(주)경남_밀양시완료1002010-09-092011-04-202014-12-240섬유제품제조업, 화학제품제조업, 고무 및 플라스틱제조업, 금속가공제품제조업, 기타기계 및 장비제조업, 자동차 및 트레일러제조업, 기타 운송장비 제조업, 창고 및 운송관련서비스업, 태양력발전업, 부동산임대업(*입주자격 고시문 참조)민간
205농공경남함안군대산장암농공단지완료1447262010-07-272010~2014(주)비앤더블유경남_함안군분양중1002010-07-272012-07-272014-07-10273340식료품제조업(C10),목재 및 나무제품(C16),화학물질 및 화학제품(C20),비금속광물제품(C23),1차금속제조업(C24),금속가공제품(C25),전자제품·컴퓨터·영상·음향 및 통신장비제조업(C26),기타기계 및 장비제조업(C29), 자동차 및 트레일러(C30),기타 운송장비 제조업(C31), 창고 및 운송관련 서비스업(H52), 전기·가스·증기 및 공기조절 공급업 중 태양력발전업, 부동산임대업(L6811)(*입주자격 고시문 참조)민간
206농공경남밀양시대미농공단지완료645702010-12-232010~2016(주)아이스푸드경남_밀양시완료1002010-12-232011-06-302017-06-130금속가공제품, 음료 제조업, 창고 및 운송관련서비스업, 태양력발전업(산업시설구역 지붕 또는 옥상에만 설치가능)민간
207농공경남함양군함양중방농공단지완료993512010-09-082010~2013(주)서동경남_함양군완료1002010-09-082010-11-022013-04-100기타기계및 장비 제조업(C29),비금속 광물제품 제조업(C23), 식료품 제조업(C10), 섬유제품 제조업(C13), 창고 및 운송관련 서비스업(H52),태양력 발전업(D35114),부동산임대업(L6811)민간
208농공경남고성군제일농공단지완료445242011-04-122011~2015제일리버스(주)경남_고성군완료1002011-04-122015-04-012015-12-180식료품 제조업민간
209농공경남거창군승강기전문농공단지완료3081512011-08-192009~2017(주)산양종합개발경남_거창군완료1002011-08-192014-07-302017-07-19120877전자부품·컴퓨터·영상·음향 및 통신장비, 전기장비, 기타 기계 및 장비 제조업, 인쇄 및 기록매체복제업, 전기·가스·증기 및 공기조절 공급업 중 태양력에너지발전업, 연구개발업, 교육서비스업민간
210농공경남산청군화현농공단지완료852172013-04-042012~2018(주)케이와이텍 외 2개사경남_산청군분양중1002013-04-042016-04-112018-03-30158000식료품 제조업(C10), 섬유제품 제조업(C13), 화학물질 및 화학제품 제조업(C20), 태양력 발전업(D35114), 창고 및 운송관련 서비스업(H52), 부동산임대업(L6811)민간
211농공경남함양군인산죽염항노화지역특화농공단지조성중2107222017-07-212017~2024(주)인산가경남_함양군완료912017-07-212019-12-17<NA>0식료품제조업, 화학물질 및 화학제품제조업 중 가공 및 정제염 제조업, 창고 및 운송관련 서비스업민간