Overview

Dataset statistics

Number of variables9
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory81.3 B

Variable types

Categorical3
Text2
Numeric3
DateTime1

Dataset

Description제주특별자치도 서귀포시에서 추진하고 있는 각종 지역투자사업과 관련한 데이터로 구분(국책/민자), 사업명, 위치, 규모, 사업기간, 총사업비 등의 정보를 제공합니다.
Author제주특별자치도 서귀포시
URLhttps://www.data.go.kr/data/15034170/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
규모(천 제곱미터) is highly overall correlated with 총사업비(억원) and 1 other fieldsHigh correlation
총사업비(억원) is highly overall correlated with 규모(천 제곱미터) and 1 other fieldsHigh correlation
구분 is highly overall correlated with 규모(천 제곱미터) and 1 other fieldsHigh correlation
사업명 has unique valuesUnique
규모(천 제곱미터) has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:51:52.327904
Analysis finished2023-12-12 00:51:54.275366
Duration1.95 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
민자사업
18 
국책사업

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국책사업
2nd row국책사업
3rd row국책사업
4th row국책사업
5th row국책사업

Common Values

ValueCountFrequency (%)
민자사업 18
72.0%
국책사업 7
 
28.0%

Length

2023-12-12T09:51:54.373065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:51:54.526966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민자사업 18
72.0%
국책사업 7
 
28.0%

사업명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T09:51:54.787800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length9.4
Min length6

Characters and Unicode

Total characters235
Distinct characters113
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row제주영어교육도시
2nd row서귀포 혁신도시
3rd row제주헬스케어타운
4th row신화역사 공원
5th row예래휴양형주거단지
ValueCountFrequency (%)
관광지 5
 
9.8%
관광단지 3
 
5.9%
리조트 2
 
3.9%
골프리조트 2
 
3.9%
제주영어교육도시 1
 
2.0%
밸리 1
 
2.0%
수망관광지구 1
 
2.0%
남원1ㆍ2차 1
 
2.0%
백통신원 1
 
2.0%
우리들메디컬 1
 
2.0%
Other values (33) 33
64.7%
2023-12-12T09:51:55.198635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
11.1%
12
 
5.1%
11
 
4.7%
11
 
4.7%
9
 
3.8%
6
 
2.6%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (103) 140
59.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 203
86.4%
Space Separator 26
 
11.1%
Decimal Number 2
 
0.9%
Uppercase Letter 2
 
0.9%
Open Punctuation 1
 
0.4%
Close Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.9%
11
 
5.4%
11
 
5.4%
9
 
4.4%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
Other values (96) 129
63.5%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
N 1
50.0%
H 1
50.0%
Space Separator
ValueCountFrequency (%)
26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 203
86.4%
Common 30
 
12.8%
Latin 2
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.9%
11
 
5.4%
11
 
5.4%
9
 
4.4%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
Other values (96) 129
63.5%
Common
ValueCountFrequency (%)
26
86.7%
( 1
 
3.3%
2 1
 
3.3%
1 1
 
3.3%
) 1
 
3.3%
Latin
ValueCountFrequency (%)
N 1
50.0%
H 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 202
86.0%
ASCII 32
 
13.6%
Compat Jamo 1
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
26
81.2%
( 1
 
3.1%
2 1
 
3.1%
1 1
 
3.1%
N 1
 
3.1%
H 1
 
3.1%
) 1
 
3.1%
Hangul
ValueCountFrequency (%)
12
 
5.9%
11
 
5.4%
11
 
5.4%
9
 
4.5%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
Other values (95) 128
63.4%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

위치
Text

Distinct24
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T09:51:55.473016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length7
Mean length6.2
Min length3

Characters and Unicode

Total characters155
Distinct characters50
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)92.0%

Sample

1st row대정읍 구억리
2nd row서호동
3rd row동홍동+토평동
4th row안덕면 서광리
5th row예래동
ValueCountFrequency (%)
안덕면 5
 
11.9%
표선면 5
 
11.9%
남원읍 3
 
7.1%
대정읍 2
 
4.8%
성산읍 2
 
4.8%
상천리 2
 
4.8%
세화리 1
 
2.4%
서호동 1
 
2.4%
사계리 1
 
2.4%
상창리 1
 
2.4%
Other values (19) 19
45.2%
2023-12-12T09:51:55.972439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
11.0%
17
 
11.0%
12
 
7.7%
10
 
6.5%
8
 
5.2%
6
 
3.9%
6
 
3.9%
5
 
3.2%
5
 
3.2%
5
 
3.2%
Other values (40) 64
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 135
87.1%
Space Separator 17
 
11.0%
Math Symbol 3
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
12.6%
12
 
8.9%
10
 
7.4%
8
 
5.9%
6
 
4.4%
6
 
4.4%
5
 
3.7%
5
 
3.7%
5
 
3.7%
4
 
3.0%
Other values (38) 57
42.2%
Space Separator
ValueCountFrequency (%)
17
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 135
87.1%
Common 20
 
12.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
12.6%
12
 
8.9%
10
 
7.4%
8
 
5.9%
6
 
4.4%
6
 
4.4%
5
 
3.7%
5
 
3.7%
5
 
3.7%
4
 
3.0%
Other values (38) 57
42.2%
Common
ValueCountFrequency (%)
17
85.0%
+ 3
 
15.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 135
87.1%
ASCII 20
 
12.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
85.0%
+ 3
 
15.0%
Hangul
ValueCountFrequency (%)
17
 
12.6%
12
 
8.9%
10
 
7.4%
8
 
5.9%
6
 
4.4%
6
 
4.4%
5
 
3.7%
5
 
3.7%
5
 
3.7%
4
 
3.0%
Other values (38) 57
42.2%

규모(천 제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1080.52
Minimum30
Maximum3985
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T09:51:56.145088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile99.2
Q1164
median549
Q31272
95-th percentile3746
Maximum3985
Range3955
Interquartile range (IQR)1108

Descriptive statistics

Standard deviation1251.733
Coefficient of variation (CV)1.1584543
Kurtosis0.6765676
Mean1080.52
Median Absolute Deviation (MAD)417
Skewness1.3947772
Sum27013
Variance1566835.6
MonotonicityNot monotonic
2023-12-12T09:51:56.290941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
3792 1
 
4.0%
1135 1
 
4.0%
192 1
 
4.0%
254 1
 
4.0%
1114 1
 
4.0%
146 1
 
4.0%
132 1
 
4.0%
376 1
 
4.0%
108 1
 
4.0%
1272 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
30 1
4.0%
97 1
4.0%
108 1
4.0%
132 1
4.0%
146 1
4.0%
156 1
4.0%
164 1
4.0%
192 1
4.0%
254 1
4.0%
335 1
4.0%
ValueCountFrequency (%)
3985 1
4.0%
3792 1
4.0%
3562 1
4.0%
3000 1
4.0%
2394 1
4.0%
1539 1
4.0%
1272 1
4.0%
1135 1
4.0%
1114 1
4.0%
747 1
4.0%

사업시작년도
Real number (ℝ)

Distinct15
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2004.84
Minimum1978
Maximum2015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T09:51:56.466369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1978
5-th percentile1996.2
Q12001
median2006
Q32008
95-th percentile2014.6
Maximum2015
Range37
Interquartile range (IQR)7

Descriptive statistics

Standard deviation7.6685505
Coefficient of variation (CV)0.0038250187
Kurtosis5.3146152
Mean2004.84
Median Absolute Deviation (MAD)3
Skewness-1.7617781
Sum50121
Variance58.806667
MonotonicityNot monotonic
2023-12-12T09:51:56.627268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
2008 3
12.0%
2007 3
12.0%
2005 3
12.0%
2006 2
 
8.0%
2015 2
 
8.0%
2000 2
 
8.0%
2012 2
 
8.0%
1978 1
 
4.0%
2003 1
 
4.0%
1998 1
 
4.0%
Other values (5) 5
20.0%
ValueCountFrequency (%)
1978 1
 
4.0%
1996 1
 
4.0%
1997 1
 
4.0%
1998 1
 
4.0%
2000 2
8.0%
2001 1
 
4.0%
2003 1
 
4.0%
2005 3
12.0%
2006 2
8.0%
2007 3
12.0%
ValueCountFrequency (%)
2015 2
8.0%
2013 1
 
4.0%
2012 2
8.0%
2009 1
 
4.0%
2008 3
12.0%
2007 3
12.0%
2006 2
8.0%
2005 3
12.0%
2003 1
 
4.0%
2001 1
 
4.0%
Distinct5
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2018
2016
2017
2015
2020

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2015
3rd row2018
4th row2018
5th row2017

Common Values

ValueCountFrequency (%)
2018 8
32.0%
2016 7
28.0%
2017 5
20.0%
2015 3
 
12.0%
2020 2
 
8.0%

Length

2023-12-12T09:51:56.788951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:51:56.925683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 8
32.0%
2016 7
28.0%
2017 5
20.0%
2015 3
 
12.0%
2020 2
 
8.0%

총사업비(억원)
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5981.4
Minimum242
Maximum29111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T09:51:57.058549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum242
5-th percentile436.8
Q11667
median2736
Q34327
95-th percentile24129.8
Maximum29111
Range28869
Interquartile range (IQR)2660

Descriptive statistics

Standard deviation8013.9603
Coefficient of variation (CV)1.3398135
Kurtosis2.7869914
Mean5981.4
Median Absolute Deviation (MAD)1591
Skewness1.9297896
Sum149535
Variance64223559
MonotonicityNot monotonic
2023-12-12T09:51:57.192896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
2350 2
 
8.0%
2746 1
 
4.0%
599 1
 
4.0%
4327 1
 
4.0%
400 1
 
4.0%
1020 1
 
4.0%
1120 1
 
4.0%
584 1
 
4.0%
1978 1
 
4.0%
2380 1
 
4.0%
Other values (14) 14
56.0%
ValueCountFrequency (%)
242 1
4.0%
400 1
4.0%
584 1
4.0%
599 1
4.0%
1020 1
4.0%
1120 1
4.0%
1667 1
4.0%
1978 1
4.0%
2225 1
4.0%
2350 2
8.0%
ValueCountFrequency (%)
29111 1
4.0%
25000 1
4.0%
20649 1
4.0%
15214 1
4.0%
10936 1
4.0%
8775 1
4.0%
4327 1
4.0%
3870 1
4.0%
3257 1
4.0%
3060 1
4.0%

비고
Categorical

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
관광개발
21 
없음

Length

Max length4
Median length4
Mean length3.68
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row없음
3rd row관광개발
4th row관광개발
5th row관광개발

Common Values

ValueCountFrequency (%)
관광개발 21
84.0%
없음 4
 
16.0%

Length

2023-12-12T09:51:57.354865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:51:57.470881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
관광개발 21
84.0%
없음 4
 
16.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
Minimum2023-10-18 00:00:00
Maximum2023-10-18 00:00:00
2023-12-12T09:51:57.570802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:57.676848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T09:51:53.634609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:52.824203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:53.225634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:53.749007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:52.950108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:53.383329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:53.866458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:53.082797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:51:53.523438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:51:57.747863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분사업명위치규모(천 제곱미터)사업시작년도사업종료년도총사업비(억원)비고
구분1.0001.0001.0000.8070.5490.0000.8870.418
사업명1.0001.0001.0001.0001.0001.0001.0001.000
위치1.0001.0001.0001.0000.8970.7931.0001.000
규모(천 제곱미터)0.8071.0001.0001.0000.0000.3290.8940.000
사업시작년도0.5491.0000.8970.0001.0000.3350.2100.000
사업종료년도0.0001.0000.7930.3290.3351.0000.0000.153
총사업비(억원)0.8871.0001.0000.8940.2100.0001.0000.000
비고0.4181.0001.0000.0000.0000.1530.0001.000
2023-12-12T09:51:57.854896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분사업종료년도비고
구분1.0000.0000.272
사업종료년도0.0001.0000.154
비고0.2720.1541.000
2023-12-12T09:51:58.204082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
규모(천 제곱미터)사업시작년도총사업비(억원)구분사업종료년도비고
규모(천 제곱미터)1.000-0.1140.6780.5330.1530.000
사업시작년도-0.1141.000-0.0100.4020.1490.000
총사업비(억원)0.678-0.0101.0000.6140.0000.000
구분0.5330.4020.6141.0000.0000.272
사업종료년도0.1530.1490.0000.0001.0000.154
비고0.0000.0000.0000.2720.1541.000

Missing values

2023-12-12T09:51:54.047904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:51:54.208677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분사업명위치규모(천 제곱미터)사업시작년도사업종료년도총사업비(억원)비고데이터기준일자
0국책사업제주영어교육도시대정읍 구억리3792200820162746없음2023-10-18
1국책사업서귀포 혁신도시서호동1135200720152939없음2023-10-18
2국책사업제주헬스케어타운동홍동+토평동15392008201815214관광개발2023-10-18
3국책사업신화역사 공원안덕면 서광리39852006201820649관광개발2023-10-18
4국책사업예래휴양형주거단지예래동7412005201725000관광개발2023-10-18
5국책사업중문 관광단지색달동+중문동+대포동35621978201829111관광개발2023-10-18
6국책사업민군복합형 관광미항강정동6702007201710936없음2023-10-18
7민자사업성산포해양 관광단지성산읍 고성리747200320173870관광개발2023-10-18
8민자사업미천굴 관광지성산읍 삼달리9719982016242관광개발2023-10-18
9민자사업팜파스 종합휴양 관광단지표선면 성읍리3000200820188775관광개발2023-10-18
구분사업명위치규모(천 제곱미터)사업시작년도사업종료년도총사업비(억원)비고데이터기준일자
15민자사업남원1ㆍ2차 관광지남원읍 남원리164199620161667관광개발2023-10-18
16민자사업백통신원 리조트남원읍 위미리549201220162350관광개발2023-10-18
17민자사업우리들메디컬 골프리조트상효동1272200520162350관광개발2023-10-18
18민자사업삼매봉 밸리 유원지 개발호근동108200720172380관광개발2023-10-18
19민자사업제주롯데 리조트색달동376200520181978관광개발2023-10-18
20민자사업한라힐링파크안덕면 상천리13220092015584관광개발2023-10-18
21민자사업핀크스 비오토피아 휴양리조트안덕면 상천리146201220181120관광개발2023-10-18
22민자사업테디밸리 골프리조트안덕면 상창리1114200620151020관광개발2023-10-18
23민자사업용머리 관광지안덕면 사계리25420002020400관광개발2023-10-18
24민자사업뉴오션타운(송악산)대정읍 상모리192201520184327관광개발2023-10-18