Overview

Dataset statistics

Number of variables4
Number of observations41
Missing cells1
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory36.2 B

Variable types

Categorical2
Numeric1
Text1

Dataset

Description2023년도 기준 현전라남도 내 산업단지 현황으로 한국산업단지공단 제공 자료입니다.(국가산단, 일반산단, 산단명 정보 제공)
Author전라남도
URLhttps://www.data.go.kr/data/15065036/fileData.do

Alerts

시군구 코드 is highly overall correlated with 시군구High correlation
시군구 is highly overall correlated with 시군구 코드High correlation
시군구 코드 has 1 (2.4%) missing valuesMissing
산업단지 이름 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:55:31.957796
Analysis finished2023-12-11 23:55:32.418058
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct2
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size460.0 B
일반산단
34 
국가산단

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반산단
2nd row일반산단
3rd row일반산단
4th row일반산단
5th row일반산단

Common Values

ValueCountFrequency (%)
일반산단 34
82.9%
국가산단 7
 
17.1%

Length

2023-12-12T08:55:32.506561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:55:32.643256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반산단 34
82.9%
국가산단 7
 
17.1%

시군구
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)41.5%
Missing0
Missing (%)0.0%
Memory size460.0 B
여수시
광양시
나주시
영암군
목포시
Other values (12)
17 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique7 ?
Unique (%)17.1%

Sample

1st row해남군
2nd row목포시
3rd row여수시
4th row여수시
5th row순천시

Common Values

ValueCountFrequency (%)
여수시 7
17.1%
광양시 6
14.6%
나주시 5
12.2%
영암군 3
7.3%
목포시 3
7.3%
화순군 2
 
4.9%
함평군 2
 
4.9%
강진군 2
 
4.9%
고흥군 2
 
4.9%
순천시 2
 
4.9%
Other values (7) 7
17.1%

Length

2023-12-12T08:55:32.778246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
여수시 7
17.1%
광양시 6
14.6%
나주시 5
12.2%
영암군 3
7.3%
목포시 3
7.3%
강진군 2
 
4.9%
순천시 2
 
4.9%
고흥군 2
 
4.9%
함평군 2
 
4.9%
화순군 2
 
4.9%
Other values (7) 7
17.1%

시군구 코드
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct16
Distinct (%)40.0%
Missing1
Missing (%)2.4%
Infinite0
Infinite (%)0.0%
Mean46442.5
Minimum46110
Maximum46900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size501.0 B
2023-12-12T08:55:32.915854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum46110
5-th percentile46110
Q146145
median46230
Q346810
95-th percentile46870.5
Maximum46900
Range790
Interquartile range (IQR)665

Descriptive statistics

Standard deviation331.2428
Coefficient of variation (CV)0.0071323207
Kurtosis-1.9273542
Mean46442.5
Median Absolute Deviation (MAD)110
Skewness0.30975075
Sum1857700
Variance109721.79
MonotonicityNot monotonic
2023-12-12T08:55:33.070759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
46130 7
17.1%
46230 6
14.6%
46170 5
12.2%
46830 3
7.3%
46110 3
7.3%
46150 2
 
4.9%
46770 2
 
4.9%
46790 2
 
4.9%
46810 2
 
4.9%
46860 2
 
4.9%
Other values (6) 6
14.6%
ValueCountFrequency (%)
46110 3
7.3%
46130 7
17.1%
46150 2
 
4.9%
46170 5
12.2%
46230 6
14.6%
46710 1
 
2.4%
46770 2
 
4.9%
46790 2
 
4.9%
46800 1
 
2.4%
46810 2
 
4.9%
ValueCountFrequency (%)
46900 1
 
2.4%
46880 1
 
2.4%
46870 1
 
2.4%
46860 2
4.9%
46830 3
7.3%
46820 1
 
2.4%
46810 2
4.9%
46800 1
 
2.4%
46790 2
4.9%
46770 2
4.9%

산업단지 이름
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T08:55:33.310097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length12.243902
Min length9

Characters and Unicode

Total characters502
Distinct characters102
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st row해남 화원조선 일반산업단지
2nd row목포 대양 일반산업단지
3rd row여수 율촌 제2일반산업단지
4th row여수 율촌 제3일반산업단지
5th row순천 해룡 산업단지
ValueCountFrequency (%)
일반산업단지 28
24.6%
여수 7
 
6.1%
국가산업단지 7
 
6.1%
광양 6
 
5.3%
나주 5
 
4.4%
영암 3
 
2.6%
목포 3
 
2.6%
율촌 3
 
2.6%
화순 2
 
1.8%
고흥 2
 
1.8%
Other values (47) 48
42.1%
2023-12-12T08:55:33.726633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
73
14.5%
41
 
8.2%
41
 
8.2%
41
 
8.2%
40
 
8.0%
33
 
6.6%
32
 
6.4%
8
 
1.6%
7
 
1.4%
7
 
1.4%
Other values (92) 179
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 424
84.5%
Space Separator 73
 
14.5%
Decimal Number 5
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
9.7%
41
 
9.7%
41
 
9.7%
40
 
9.4%
33
 
7.8%
32
 
7.5%
8
 
1.9%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (88) 167
39.4%
Decimal Number
ValueCountFrequency (%)
2 3
60.0%
1 1
 
20.0%
3 1
 
20.0%
Space Separator
ValueCountFrequency (%)
73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 424
84.5%
Common 78
 
15.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
9.7%
41
 
9.7%
41
 
9.7%
40
 
9.4%
33
 
7.8%
32
 
7.5%
8
 
1.9%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (88) 167
39.4%
Common
ValueCountFrequency (%)
73
93.6%
2 3
 
3.8%
1 1
 
1.3%
3 1
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 424
84.5%
ASCII 78
 
15.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
73
93.6%
2 3
 
3.8%
1 1
 
1.3%
3 1
 
1.3%
Hangul
ValueCountFrequency (%)
41
 
9.7%
41
 
9.7%
41
 
9.7%
40
 
9.4%
33
 
7.8%
32
 
7.5%
8
 
1.9%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (88) 167
39.4%

Interactions

2023-12-12T08:55:32.160684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:55:33.819982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시군구시군구 코드산업단지 이름
구분1.0000.0000.0001.000
시군구0.0001.0001.0001.000
시군구 코드0.0001.0001.0001.000
산업단지 이름1.0001.0001.0001.000
2023-12-12T08:55:33.910786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시군구
구분1.0000.000
시군구0.0001.000
2023-12-12T08:55:33.989710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구 코드구분시군구
시군구 코드1.0000.0000.828
구분0.0001.0000.000
시군구0.8280.0001.000

Missing values

2023-12-12T08:55:32.286348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:55:32.380429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분시군구시군구 코드산업단지 이름
0일반산단해남군46820해남 화원조선 일반산업단지
1일반산단목포시46110목포 대양 일반산업단지
2일반산단여수시46130여수 율촌 제2일반산업단지
3일반산단여수시46130여수 율촌 제3일반산업단지
4일반산단순천시46150순천 해룡 산업단지
5일반산단나주시46170나주 혁신산업단지
6일반산단나주시46170나주 일반산업단지
7일반산단광양시46230광양 신금 일반산업단지
8일반산단광양시46230광양 익신 일반산업단지
9일반산단광양시46230광양 황금 일반산업단지
구분시군구시군구 코드산업단지 이름
31국가산단여수시46130여수 삼일비축 국가산업단지
32일반산단여수시46130여수 오천 일반산업단지
33일반산단순천시46150순천 일반산업단지
34일반산단나주시46170나주 문평 일반산업단지
35국가산단고흥군46770고흥 우주발사테 국가산업단지
36국가산단나주시46170나주 에너지 국가산업단지
37일반산단강진군46810강진2 일반산업단지
38일반산단광양시46230광양 염포 일반산업단지
39일반산단화순군46790화순 생물의약2 일반산업단지
40일반산단무안군<NA>무안 항공특화 일반산업단지