Overview

Dataset statistics

Number of variables: 6
Number of observations: 44
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 53.0 B

Variable types

Numeric: 2
Categorical: 3
Text: 1

Dataset

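Description: Location, lot number, area, supply use, and related details for industrial complex land developed by Jeonnam Development Corporation (전남개발공사) that remained unsold inventory as of August 2023.
URL: https://www.data.go.kr/data/15032508/fileData.do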

Alerts

소재지 is highly overall correlated with 연번 and 2 other fields (High correlation)
공급용도 is highly overall correlated with 연번 and 1 other fields (High correlation)
구분 is highly overall correlated with 연번 and 1 other fields (High correlation)
연번 is highly overall correlated with 구분 and 2 other fields (High correlation)
구분 is highly imbalanced (56.1%) (Imbalance)
연번 has unique values (Unique)
지번 has unique values (Unique)
면적_제곱미터 has unique values (Unique)
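
The 56.1% imbalance figure for 구분 can be reproduced from its two class counts (40 and 4, as listed under the variable's Common Values). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value, log2 of the number of classes. That formula is an assumption that happens to match the reported number, and the function name `imbalance_score` is illustrative only.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max for a list of class counts.

    0.0 means perfectly balanced classes, 1.0 means a single class.
    Assumed formula; not taken from the report itself.
    """
    if len(counts) < 2:
        return 0.0  # imbalance is undefined for fewer than two classes
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))
    return 1 - entropy / max_entropy


# Class counts for 구분: 장흥바이오식품산업단지 = 40, 대불산업단지 = 4
score = imbalance_score([40, 4])
print(f"{score:.1%}")  # 56.1%
```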

Reproduction

Analysis started: 2023-12-12 23:32:03.854568
Analysis finished: 2023-12-12 23:32:04.600541
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

Alerts: High correlation, Unique

Distinct: 44
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.5
Minimum: 1
Maximum: 44
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 528.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.15
Q1: 11.75
Median: 22.5
Q3: 33.25
95-th percentile: 41.85
Maximum: 44
Range: 43
Interquartile range (IQR): 21.5
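
The fractional percentiles above (3.15, 11.75, 33.25, 41.85) are what linear interpolation between neighbouring order statistics produces. The sketch below assumes 연번 holds the integers 1 through 44, which is consistent with the reported distinct count, minimum, and maximum but is not stated explicitly; the helper name `quantile_linear` is illustrative.

```python
def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between adjacent order statistics."""
    pos = q * (len(sorted_values) - 1)   # fractional index into the sorted data
    lower = int(pos)                     # index of the lower neighbour
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 45))  # assumed contents of 연번: 1, 2, ..., 44

p05 = quantile_linear(values, 0.05)
q1 = quantile_linear(values, 0.25)
median = quantile_linear(values, 0.50)
q3 = quantile_linear(values, 0.75)
p95 = quantile_linear(values, 0.95)
iqr = q3 - q1

print(round(p05, 2), q1, median, q3, round(p95, 2), iqr)
```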

Descriptive statistics

Standard deviation: 12.845233
Coefficient of variation (CV): 0.57089923
Kurtosis: -1.2
Mean: 22.5
Median Absolute Deviation (MAD): 11
Skewness: 0
Sum: 990
Variance: 165
Monotonicity: Strictly increasing
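
Most of these descriptive statistics follow directly from the values themselves. As a rough check, the sketch below recomputes them with the standard library, again assuming 연번 is the sequence 1 through 44 (an assumption consistent with Sum 990 and the strictly increasing monotonicity). The variance and standard deviation use the sample (n - 1) denominator, which is what matches the reported 165.

```python
import statistics

values = list(range(1, 45))  # assumed contents of 연번: 1, 2, ..., 44

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # sample standard deviation
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of |x - median|
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```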
Histogram with fixed size bins (bins=44)
Common values
Value | Count | Frequency (%)
1 | 1 | 2.3%
24 | 1 | 2.3%
26 | 1 | 2.3%
27 | 1 | 2.3%
28 | 1 | 2.3%
29 | 1 | 2.3%
30 | 1 | 2.3%
31 | 1 | 2.3%
32 | 1 | 2.3%
33 | 1 | 2.3%
Other values (34) | 34 | 77.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.3%
2 | 1 | 2.3%
3 | 1 | 2.3%
4 | 1 | 2.3%
5 | 1 | 2.3%
6 | 1 | 2.3%
7 | 1 | 2.3%
8 | 1 | 2.3%
9 | 1 | 2.3%
10 | 1 | 2.3%

Maximum 10 values
Value | Count | Frequency (%)
44 | 1 | 2.3%
43 | 1 | 2.3%
42 | 1 | 2.3%
41 | 1 | 2.3%
40 | 1 | 2.3%
39 | 1 | 2.3%
38 | 1 | 2.3%
37 | 1 | 2.3%
36 | 1 | 2.3%
35 | 1 | 2.3%

구분
Categorical

Alerts: High correlation, Imbalance

Distinct: 2
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B
Category counts: 장흥바이오식품산업단지 40, 대불산업단지 4

Length

Max length: 11
Median length: 11
Mean length: 10.545455
Min length: 6
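
The length statistics are character counts of the category labels, weighted by how often each label occurs. A minimal sketch, using the two category counts reported for 구분 (40 and 4); the variable names are illustrative.

```python
import statistics

# Category label -> number of rows, as reported for 구분
counts = {"장흥바이오식품산업단지": 40, "대불산업단지": 4}

# One length entry per row, so frequent labels weigh more
lengths = [len(label) for label, n in counts.items() for _ in range(n)]

max_length = max(lengths)
min_length = min(lengths)
median_length = statistics.median(lengths)
mean_length = statistics.mean(lengths)

print(max_length, median_length, round(mean_length, 6), min_length)
```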

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대불산업단지
2nd row: 대불산업단지
3rd row: 대불산업단지
4th row: 대불산업단지
5th row: 장흥바이오식품산업단지

Common Values

Value | Count | Frequency (%)
장흥바이오식품산업단지 | 40 | 90.9%
대불산업단지 | 4 | 9.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
장흥바이오식품산업단지 | 40 | 90.9%
대불산업단지 | 4 | 9.1%

소재지
Categorical

Alerts: High correlation

Distinct: 4
Distinct (%): 9.1%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B
Category counts: 전라남도 장흥군 장흥읍 해당리 19, 전라남도 장흥군 장흥읍 향양리 12, 전라남도 장흥군 장흥읍 삼산리 9, 전라남도 영암군 삼호읍 용앙리 4

Length

Max length: 16
Median length: 16
Mean length: 16
Min length: 16

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전라남도 영암군 삼호읍 용앙리
2nd row: 전라남도 영암군 삼호읍 용앙리
3rd row: 전라남도 영암군 삼호읍 용앙리
4th row: 전라남도 영암군 삼호읍 용앙리
5th row: 전라남도 장흥군 장흥읍 해당리

Common Values

Value | Count | Frequency (%)
전라남도 장흥군 장흥읍 해당리 | 19 | 43.2%
전라남도 장흥군 장흥읍 향양리 | 12 | 27.3%
전라남도 장흥군 장흥읍 삼산리 | 9 | 20.5%
전라남도 영암군 삼호읍 용앙리 | 4 | 9.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
전라남도 | 44 | 25.0%
장흥군 | 40 | 22.7%
장흥읍 | 40 | 22.7%
해당리 | 19 | 10.8%
향양리 | 12 | 6.8%
삼산리 | 9 | 5.1%
영암군 | 4 | 2.3%
삼호읍 | 4 | 2.3%
용앙리 | 4 | 2.3%
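
The counts in the table above are per word rather than per full address: each 소재지 value splits into four whitespace-separated tokens, so 44 rows give 176 tokens and 전라남도 accounts for 44 / 176 = 25.0%. A small sketch that rebuilds the table from the four address counts reported under Common Values, assuming plain whitespace splitting:

```python
from collections import Counter

# Full address -> number of rows, as reported for 소재지
address_counts = {
    "전라남도 장흥군 장흥읍 해당리": 19,
    "전라남도 장흥군 장흥읍 향양리": 12,
    "전라남도 장흥군 장흥읍 삼산리": 9,
    "전라남도 영암군 삼호읍 용앙리": 4,
}

# Count each whitespace-separated token once per row it appears in
word_counts = Counter()
for address, rows in address_counts.items():
    for word in address.split():
        word_counts[word] += rows

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(word, count, f"{count / total_words:.1%}")
```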

지번
Text

Alerts: Unique

Distinct: 44
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 7
Median length: 5
Mean length: 5.2727273
Min length: 3

Characters and Unicode

Total characters: 232
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
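
As an illustration of those character properties, the sketch below tallies the Unicode general category of every character in the five sample 지번 values shown in this section, using Python's standard `unicodedata` module. It covers only those five values, not the full column, so the totals differ from the 232 characters reported above.

```python
import unicodedata
from collections import Counter

# The five sample 지번 values listed in this section
samples = ["1702-4", "1702-6", "1703-9", "1703-11", "648-5"]

# General category per character: "Nd" = Decimal Number, "Pd" = Dash Punctuation
categories = Counter(unicodedata.category(ch) for s in samples for ch in s)

# All characters fall inside the ASCII block
all_ascii = all(s.isascii() for s in samples)

print(dict(categories), all_ascii)
```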

Unique

Unique: 44
Unique (%): 100.0%

Sample

1st row: 1702-4
2nd row: 1702-6
3rd row: 1703-9
4th row: 1703-11
5th row: 648-5

Value | Count | Frequency (%)
1702-4 | 1 | 2.3%
1702-6 | 1 | 2.3%
754-4 | 1 | 2.3%
671-2 | 1 | 2.3%
671-3 | 1 | 2.3%
672-1 | 1 | 2.3%
672-2 | 1 | 2.3%
751 | 1 | 2.3%
752-7 | 1 | 2.3%
752-2 | 1 | 2.3%
Other values (34) | 34 | 77.3%

Most occurring characters

Value | Count | Frequency (%)
- | 41 | 17.7%
6 | 37 | 15.9%
7 | 28 | 12.1%
1 | 27 | 11.6%
0 | 20 | 8.6%
2 | 19 | 8.2%
5 | 17 | 7.3%
9 | 14 | 6.0%
3 | 13 | 5.6%
4 | 12 | 5.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 191 | 82.3%
Dash Punctuation | 41 | 17.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
6 | 37 | 19.4%
7 | 28 | 14.7%
1 | 27 | 14.1%
0 | 20 | 10.5%
2 | 19 | 9.9%
5 | 17 | 8.9%
9 | 14 | 7.3%
3 | 13 | 6.8%
4 | 12 | 6.3%
8 | 4 | 2.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 41 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 232 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 41 | 17.7%
6 | 37 | 15.9%
7 | 28 | 12.1%
1 | 27 | 11.6%
0 | 20 | 8.6%
2 | 19 | 8.2%
5 | 17 | 7.3%
9 | 14 | 6.0%
3 | 13 | 5.6%
4 | 12 | 5.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 232 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 41 | 17.7%
6 | 37 | 15.9%
7 | 28 | 12.1%
1 | 27 | 11.6%
0 | 20 | 8.6%
2 | 19 | 8.2%
5 | 17 | 7.3%
9 | 14 | 6.0%
3 | 13 | 5.6%
4 | 12 | 5.2%

면적_제곱미터
Real number (ℝ)

Alerts: Unique

Distinct: 44
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12171.643
Minimum: 1187.7
Maximum: 46102
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 528.0 B

Quantile statistics

Minimum: 1187.7
5-th percentile: 1271.675
Q1: 4585.25
Median: 8281.15
Q3: 15002.95
95-th percentile: 42109.61
Maximum: 46102
Range: 44914.3
Interquartile range (IQR): 10417.7

Descriptive statistics

Standard deviation: 12005.771
Coefficient of variation (CV): 0.98637222
Kurtosis: 1.9425716
Mean: 12171.643
Median Absolute Deviation (MAD): 5693.65
Skewness: 1.6201763
Sum: 535552.3
Variance: 1.4413853 × 10^8
Monotonicity: Not monotonic
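
Several of the figures for 면적_제곱미터 are derived from one another, so they can be cross-checked using only the numbers printed in this section. The sketch below is a consistency check under the usual definitions (mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum); the tolerances are arbitrary choices that allow for the rounding in the reported values.

```python
import math

# Values copied from the report for 면적_제곱미터
n = 44
total = 535552.3
mean = 12171.643
std = 12005.771
cv = 0.98637222
variance = 1.4413853e8
q1, q3, iqr = 4585.25, 15002.95, 10417.7
minimum, maximum, value_range = 1187.7, 46102.0, 44914.3

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "iqr = q3 - q1": math.isclose(q3 - q1, iqr, rel_tol=1e-9),
    "range = max - min": math.isclose(maximum - minimum, value_range, rel_tol=1e-9),
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```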
Histogram with fixed size bins (bins=44)
Common values
Value | Count | Frequency (%)
11970.0 | 1 | 2.3%
17893.7 | 1 | 2.3%
12007.8 | 1 | 2.3%
17076.0 | 1 | 2.3%
8602.7 | 1 | 2.3%
43141.4 | 1 | 2.3%
7409.5 | 1 | 2.3%
15512.8 | 1 | 2.3%
31976.9 | 1 | 2.3%
16465.8 | 1 | 2.3%
Other values (34) | 34 | 77.3%

Minimum 10 values
Value | Count | Frequency (%)
1187.7 | 1 | 2.3%
1193.1 | 1 | 2.3%
1271.6 | 1 | 2.3%
1272.1 | 1 | 2.3%
1460.8 | 1 | 2.3%
1494.5 | 1 | 2.3%
1494.8 | 1 | 2.3%
2178.7 | 1 | 2.3%
2593.6 | 1 | 2.3%
3306.0 | 1 | 2.3%

Maximum 10 values
Value | Count | Frequency (%)
46102.0 | 1 | 2.3%
44014.0 | 1 | 2.3%
43141.4 | 1 | 2.3%
36262.8 | 1 | 2.3%
31976.9 | 1 | 2.3%
29758.3 | 1 | 2.3%
21945.0 | 1 | 2.3%
17893.7 | 1 | 2.3%
17076.0 | 1 | 2.3%
16465.8 | 1 | 2.3%

공급용도
Categorical

Alerts: High correlation

Distinct: 2
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B
Category counts: 산업시설용지 32, 지원시설용지 12

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 산업시설용지
2nd row: 산업시설용지
3rd row: 산업시설용지
4th row: 산업시설용지
5th row: 산업시설용지

Common Values

Value | Count | Frequency (%)
산업시설용지 | 32 | 72.7%
지원시설용지 | 12 | 27.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
산업시설용지 | 32 | 72.7%
지원시설용지 | 12 | 27.3%

Interactions

[Figures: four interaction plots]

Correlations

Variable | 연번 | 구분 | 소재지 | 지번 | 면적_제곱미터 | 공급용도
연번 | 1.000 | 0.970 | 0.956 | 1.000 | 0.479 | 0.996
구분 | 0.970 | 1.000 | 1.000 | 1.000 | 0.663 | 0.000
소재지 | 0.956 | 1.000 | 1.000 | 1.000 | 0.803 | 1.000
지번 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
면적_제곱미터 | 0.479 | 0.663 | 0.803 | 1.000 | 1.000 | 0.597
공급용도 | 0.996 | 0.000 | 1.000 | 1.000 | 0.597 | 1.000

Variable | 소재지 | 공급용도 | 구분
소재지 | 1.000 | 0.976 | 0.976
공급용도 | 0.976 | 1.000 | 0.000
구분 | 0.976 | 0.000 | 1.000

Variable | 연번 | 면적_제곱미터 | 구분 | 소재지 | 공급용도
연번 | 1.000 | -0.254 | 0.764 | 0.817 | 0.849
면적_제곱미터 | -0.254 | 1.000 | 0.468 | 0.452 | 0.441
구분 | 0.764 | 0.468 | 1.000 | 0.976 | 0.000
소재지 | 0.817 | 0.452 | 0.976 | 1.000 | 0.976
공급용도 | 0.849 | 0.441 | 0.000 | 0.976 | 1.000
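
The value 0.976 between 소재지 and 구분 is consistent with a bias-corrected Cramér's V for two perfectly associated categorical variables on 44 rows. The sketch below is not taken from the report: it assumes that measure, and it assumes a contingency table inferred from the category counts and sample rows (all 4 용앙리 rows belong to 대불산업단지, and the 19 + 12 + 9 remaining rows belong to 장흥바이오식품산업단지). The function name `cramers_v_corrected` is illustrative, and no continuity correction is applied.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table (list of rows)."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-square statistic, without continuity correction
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(table), len(table[0])
    phi2 = chi2 / n
    # Bias correction shrinks phi^2 and the effective table dimensions
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Rows: 구분 (대불산업단지, 장흥바이오식품산업단지)
# Columns: 소재지 (용앙리, 해당리, 향양리, 삼산리); inferred, not given in the report
table = [
    [4, 0, 0, 0],
    [0, 19, 12, 9],
]
v = cramers_v_corrected(table)
print(round(v, 3))  # 0.976
```

Without the bias correction the same table would give exactly 1.0, which is why a perfect association shows up as 0.976 here.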

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows
Index | 연번 | 구분 | 소재지 | 지번 | 면적_제곱미터 | 공급용도
0 | 1 | 대불산업단지 | 전라남도 영암군 삼호읍 용앙리 | 1702-4 | 11970.0 | 산업시설용지
1 | 2 | 대불산업단지 | 전라남도 영암군 삼호읍 용앙리 | 1702-6 | 14833.0 | 산업시설용지
2 | 3 | 대불산업단지 | 전라남도 영암군 삼호읍 용앙리 | 1703-9 | 44014.0 | 산업시설용지
3 | 4 | 대불산업단지 | 전라남도 영암군 삼호읍 용앙리 | 1703-11 | 46102.0 | 산업시설용지
4 | 5 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 해당리 | 648-5 | 8910.9 | 산업시설용지
5 | 6 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 해당리 | 648-6 | 9913.0 | 산업시설용지
6 | 7 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 해당리 | 649-15 | 6922.5 | 산업시설용지
7 | 8 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 해당리 | 649-3 | 2593.6 | 산업시설용지
8 | 9 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 해당리 | 649-18 | 6611.6 | 산업시설용지
9 | 10 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 해당리 | 649-6 | 4580.6 | 산업시설용지

Last 10 rows
Index | 연번 | 구분 | 소재지 | 지번 | 면적_제곱미터 | 공급용도
34 | 35 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 755-2 | 8939.7 | 지원시설용지
35 | 36 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 753-1 | 36262.8 | 지원시설용지
36 | 37 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 757 | 29758.3 | 지원시설용지
37 | 38 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1027-2 | 1460.8 | 지원시설용지
38 | 39 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1029-2 | 1494.8 | 지원시설용지
39 | 40 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1029-5 | 1494.5 | 지원시설용지
40 | 41 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1030-2 | 1272.1 | 지원시설용지
41 | 42 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1030-3 | 1193.1 | 지원시설용지
42 | 43 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1030-4 | 1187.7 | 지원시설용지
43 | 44 | 장흥바이오식품산업단지 | 전라남도 장흥군 장흥읍 향양리 | 1030-5 | 1271.6 | 지원시설용지