Overview

Dataset statistics

Number of variables8
Number of observations76
Missing cells75
Missing cells (%)12.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.0 KiB
Average record size in memory67.7 B

Variable types

Numeric2
Text3
Categorical3

Dataset

Description개발이익환수에 관한 법률(약칭 개발이익환수법) 중 개발부담금 대상 사업의 종류 및 대상사업명, 근거법률, 인가등 받은날, 준공받은날의 기준 정보
URLhttps://www.data.go.kr/data/15063429/fileData.do

Alerts

비고 has constant value ""Constant
변경일시 has constant value ""Constant
대상사업종류 is highly overall correlated with 준공등받은날High correlation
인가등받은날 is highly overall correlated with 준공등받은날High correlation
준공등받은날 is highly overall correlated with 대상사업종류 and 1 other fieldsHigh correlation
근거법률 has 1 (1.3%) missing valuesMissing
비고 has 74 (97.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 08:33:39.625297
Analysis finished2023-12-12 08:33:40.841080
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대상사업종류
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)14.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.0657895
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size816.0 B
2023-12-12T17:33:40.915291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q310
95-th percentile10
Maximum11
Range10
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.4960379
Coefficient of variation (CV)0.57635331
Kurtosis-1.6598649
Mean6.0657895
Median Absolute Deviation (MAD)3
Skewness-0.0044685424
Sum461
Variance12.222281
MonotonicityNot monotonic
2023-12-12T17:33:41.060663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
10 27
35.5%
2 9
 
11.8%
4 9
 
11.8%
3 8
 
10.5%
1 7
 
9.2%
5 7
 
9.2%
8 4
 
5.3%
6 2
 
2.6%
7 1
 
1.3%
9 1
 
1.3%
ValueCountFrequency (%)
1 7
 
9.2%
2 9
 
11.8%
3 8
 
10.5%
4 9
 
11.8%
5 7
 
9.2%
6 2
 
2.6%
7 1
 
1.3%
8 4
 
5.3%
9 1
 
1.3%
10 27
35.5%
ValueCountFrequency (%)
11 1
 
1.3%
10 27
35.5%
9 1
 
1.3%
8 4
 
5.3%
7 1
 
1.3%
6 2
 
2.6%
5 7
 
9.2%
4 9
 
11.8%
3 8
 
10.5%
2 9
 
11.8%

대상사업구분
Real number (ℝ)

Distinct27
Distinct (%)35.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.5789474
Minimum1
Maximum27
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size816.0 B
2023-12-12T17:33:41.212349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12.75
median5
Q39
95-th percentile23.25
Maximum27
Range26
Interquartile range (IQR)6.25

Descriptive statistics

Standard deviation6.9823838
Coefficient of variation (CV)0.92128676
Kurtosis0.88103031
Mean7.5789474
Median Absolute Deviation (MAD)3
Skewness1.355079
Sum576
Variance48.753684
MonotonicityNot monotonic
2023-12-12T17:33:41.353226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
1 11
14.5%
2 8
10.5%
3 7
9.2%
4 7
9.2%
5 6
 
7.9%
6 6
 
7.9%
7 6
 
7.9%
8 4
 
5.3%
9 3
 
3.9%
23 1
 
1.3%
Other values (17) 17
22.4%
ValueCountFrequency (%)
1 11
14.5%
2 8
10.5%
3 7
9.2%
4 7
9.2%
5 6
7.9%
6 6
7.9%
7 6
7.9%
8 4
 
5.3%
9 3
 
3.9%
10 1
 
1.3%
ValueCountFrequency (%)
27 1
1.3%
26 1
1.3%
25 1
1.3%
24 1
1.3%
23 1
1.3%
22 1
1.3%
21 1
1.3%
20 1
1.3%
19 1
1.3%
18 1
1.3%
Distinct75
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-12T17:33:41.604891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length35
Mean length13.552632
Min length4

Characters and Unicode

Total characters1030
Distinct characters124
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)97.4%

Sample

1st row택지개발사업
2nd row대지조성사업 및 주택건설사업-폐지
3rd row아파트지구개발사업-폐지
4th row산업단지안의 주택지조성사업
5th row일단의 주택지조성사업-폐지
ValueCountFrequency (%)
8
 
5.3%
또는 5
 
3.3%
위한 5
 
3.3%
여객자동차터미널사업 4
 
2.6%
개발사업-폐지 3
 
2.0%
사실상 3
 
2.0%
공부상의 3
 
2.0%
지목변경이 3
 
2.0%
수반되는 3
 
2.0%
사업 3
 
2.0%
Other values (98) 112
73.7%
2023-12-12T17:33:42.065069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
90
 
8.7%
85
 
8.3%
78
 
7.6%
76
 
7.4%
- 31
 
3.0%
31
 
3.0%
29
 
2.8%
25
 
2.4%
25
 
2.4%
19
 
1.8%
Other values (114) 541
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 909
88.3%
Space Separator 76
 
7.4%
Dash Punctuation 31
 
3.0%
Open Punctuation 7
 
0.7%
Close Punctuation 7
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
9.9%
85
 
9.4%
78
 
8.6%
31
 
3.4%
29
 
3.2%
25
 
2.8%
25
 
2.8%
19
 
2.1%
19
 
2.1%
17
 
1.9%
Other values (110) 491
54.0%
Space Separator
ValueCountFrequency (%)
76
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 909
88.3%
Common 121
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
9.9%
85
 
9.4%
78
 
8.6%
31
 
3.4%
29
 
3.2%
25
 
2.8%
25
 
2.8%
19
 
2.1%
19
 
2.1%
17
 
1.9%
Other values (110) 491
54.0%
Common
ValueCountFrequency (%)
76
62.8%
- 31
25.6%
( 7
 
5.8%
) 7
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 909
88.3%
ASCII 121
 
11.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
90
 
9.9%
85
 
9.4%
78
 
8.6%
31
 
3.4%
29
 
3.2%
25
 
2.8%
25
 
2.8%
19
 
2.1%
19
 
2.1%
17
 
1.9%
Other values (110) 491
54.0%
ASCII
ValueCountFrequency (%)
76
62.8%
- 31
25.6%
( 7
 
5.8%
) 7
 
5.8%

근거법률
Text

MISSING 

Distinct42
Distinct (%)56.0%
Missing1
Missing (%)1.3%
Memory size740.0 B
2023-12-12T17:33:42.380785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length23
Mean length12.32
Min length2

Characters and Unicode

Total characters924
Distinct characters101
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)33.3%

Sample

1st row택지개발촉진법
2nd row주택법
3rd row주택건설촉진법
4th row산업입지 및 개발에관한법률
5th row도시계획법
ValueCountFrequency (%)
28
 
12.2%
관한 26
 
11.4%
법률 21
 
9.2%
이용에 14
 
6.1%
국토계획 7
 
3.1%
산업입지 6
 
2.6%
개발에관한법률 6
 
2.6%
계획 5
 
2.2%
특별법 5
 
2.2%
기타 4
 
1.7%
Other values (65) 107
46.7%
2023-12-12T17:33:42.833324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
154
 
16.7%
71
 
7.7%
42
 
4.5%
41
 
4.4%
40
 
4.3%
31
 
3.4%
29
 
3.1%
29
 
3.1%
18
 
1.9%
16
 
1.7%
Other values (91) 453
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 770
83.3%
Space Separator 154
 
16.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
9.2%
42
 
5.5%
41
 
5.3%
40
 
5.2%
31
 
4.0%
29
 
3.8%
29
 
3.8%
18
 
2.3%
16
 
2.1%
15
 
1.9%
Other values (90) 438
56.9%
Space Separator
ValueCountFrequency (%)
154
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 770
83.3%
Common 154
 
16.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
9.2%
42
 
5.5%
41
 
5.3%
40
 
5.2%
31
 
4.0%
29
 
3.8%
29
 
3.8%
18
 
2.3%
16
 
2.1%
15
 
1.9%
Other values (90) 438
56.9%
Common
ValueCountFrequency (%)
154
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 770
83.3%
ASCII 154
 
16.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
154
100.0%
Hangul
ValueCountFrequency (%)
71
 
9.2%
42
 
5.5%
41
 
5.3%
40
 
5.2%
31
 
4.0%
29
 
3.8%
29
 
3.8%
18
 
2.3%
16
 
2.1%
15
 
1.9%
Other values (90) 438
56.9%

인가등받은날
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Memory size740.0 B
실시계획승인일
13 
실시계획인가일
12 
사업계획승인일
행위허가일
사업시행인가일
 
3
Other values (24)
35 

Length

Max length29
Median length7
Mean length7.9736842
Min length4

Unique

Unique15 ?
Unique (%)19.7%

Sample

1st row택지개발지구지정일
2nd row사업계획승인일
3rd row사업시행인가일
4th row실시계획승인일
5th row실시계획승인일

Common Values

ValueCountFrequency (%)
실시계획승인일 13
17.1%
실시계획인가일 12
15.8%
사업계획승인일 7
 
9.2%
행위허가일 6
 
7.9%
사업시행인가일 3
 
3.9%
조성계획승인일 3
 
3.9%
공사시행인가일 3
 
3.9%
개발행위 농지전용 산지전용 또는 초지전용허가(신고)일 2
 
2.6%
공장설립승인일 2
 
2.6%
굴착허가일 2
 
2.6%
Other values (19) 23
30.3%

Length

2023-12-12T17:33:43.016616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실시계획승인일 13
 
13.4%
실시계획인가일 12
 
12.4%
사업계획승인일 7
 
7.2%
행위허가일 7
 
7.2%
승인일 5
 
5.2%
또는 4
 
4.1%
사업시행인가일 3
 
3.1%
조성계획승인일 3
 
3.1%
공사시행인가일 3
 
3.1%
실시계획 2
 
2.1%
Other values (28) 38
39.2%

준공등받은날
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)27.6%
Missing0
Missing (%)0.0%
Memory size740.0 B
준공검사일
22 
준공인가일
12 
제9조제3항에 따른 날
건축물사용승인일
제9조제3항에 따른 신고일
Other values (16)
24 

Length

Max length35
Median length31.5
Mean length9.7894737
Min length4

Unique

Unique11 ?
Unique (%)14.5%

Sample

1st row준공검사일
2nd row사용검사일
3rd row준공검사일
4th row준공인가일
5th row준공검사일

Common Values

ValueCountFrequency (%)
준공검사일 22
28.9%
준공인가일 12
15.8%
제9조제3항에 따른 날 7
 
9.2%
건축물사용승인일 6
 
7.9%
제9조제3항에 따른 신고일 5
 
6.6%
제9조제3항의 신고일 또는 시설물의 사용승인일 4
 
5.3%
사용검사일 3
 
3.9%
준공검사일 건축물(시설물)사용승인일 또는 제9조제3항에 따른 날 2
 
2.6%
9조제3항에 따른 날 2
 
2.6%
준공검사일 또는 제9조제3항에 따른 날 2
 
2.6%
Other values (11) 11
14.5%

Length

2023-12-12T17:33:43.192836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
준공검사일 26
18.2%
따른 18
12.6%
제9조제3항에 16
11.2%
13
9.1%
준공인가일 12
8.4%
신고일 11
7.7%
또는 8
 
5.6%
건축물사용승인일 6
 
4.2%
시설물의 4
 
2.8%
사용승인일 4
 
2.8%
Other values (18) 25
17.5%

비고
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing74
Missing (%)97.4%
Memory size740.0 B
2023-12-12T17:33:43.289934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters2
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowD
2nd rowD
ValueCountFrequency (%)
d 2
100.0%
2023-12-12T17:33:43.486878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
D 2
100.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 2
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
D 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
D 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
D 2
100.0%

변경일시
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-06-02 00:00:00
76 

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-02 00:00:00
2nd row2023-06-02 00:00:00
3rd row2023-06-02 00:00:00
4th row2023-06-02 00:00:00
5th row2023-06-02 00:00:00

Common Values

ValueCountFrequency (%)
2023-06-02 00:00:00 76
100.0%

Length

2023-12-12T17:33:43.618514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:33:43.740776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-02 76
50.0%
00:00:00 76
50.0%

Interactions

2023-12-12T17:33:40.288863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:33:40.105588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:33:40.366627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:33:40.194044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:33:43.825430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상사업종류대상사업구분대상사업명근거법률인가등받은날준공등받은날
대상사업종류1.0000.0001.0000.9560.8050.899
대상사업구분0.0001.0000.9830.8890.8240.566
대상사업명1.0000.9831.0000.9870.9940.000
근거법률0.9560.8890.9871.0000.9800.942
인가등받은날0.8050.8240.9940.9801.0000.966
준공등받은날0.8990.5660.0000.9420.9661.000
2023-12-12T17:33:43.944431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
준공등받은날인가등받은날
준공등받은날1.0000.642
인가등받은날0.6421.000
2023-12-12T17:33:44.049391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상사업종류대상사업구분인가등받은날준공등받은날
대상사업종류1.0000.3850.4880.550
대상사업구분0.3851.0000.4500.247
인가등받은날0.4880.4501.0000.642
준공등받은날0.5500.2470.6421.000

Missing values

2023-12-12T17:33:40.474869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:33:40.633704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T17:33:40.772833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

대상사업종류대상사업구분대상사업명근거법률인가등받은날준공등받은날비고변경일시
011택지개발사업택지개발촉진법택지개발지구지정일준공검사일<NA>2023-06-02 00:00:00
112대지조성사업 및 주택건설사업-폐지주택법사업계획승인일사용검사일<NA>2023-06-02 00:00:00
213아파트지구개발사업-폐지주택건설촉진법사업시행인가일준공검사일D2023-06-02 00:00:00
314산업단지안의 주택지조성사업산업입지 및 개발에관한법률실시계획승인일준공인가일<NA>2023-06-02 00:00:00
415일단의 주택지조성사업-폐지도시계획법실시계획승인일준공검사일D2023-06-02 00:00:00
521국가산업단지개발사업산업입지 및 개발에관한법률실시계획승인일준공인가일<NA>2023-06-02 00:00:00
622지방산업단지개발사업-폐지산업입지 및 개발에관한법률실시계획승인일준공인가일<NA>2023-06-02 00:00:00
723농공단지개발사업산업입지 및 개발에관한법률실시계획승인일준공인가일<NA>2023-06-02 00:00:00
824협동화산업단지조성사업중소기업진흥에 관한 법률실시계획승인일준공인가일<NA>2023-06-02 00:00:00
925산업단지외의 지역에서의 공장용지조성 및 공장설립을 위한 부지조성사업-폐지산업집적활성화 공장설립에 관한 법률공장설립승인일공장설립완료신고일<NA>2023-06-02 00:00:00
대상사업종류대상사업구분대상사업명근거법률인가등받은날준공등받은날비고변경일시
6657여객자동차터미널사업여객자동차 운수사업법공사시행인가일시설확인일<NA>2023-06-02 00:00:00
6782경륜장설치사업경륜경정법설치허가일제9조제3항에 따른 날<NA>2023-06-02 00:00:00
6883경정장설치사업경륜경정법설치허가일제9조제3항에 따른 날<NA>2023-06-02 00:00:00
6984체육시설업을위한부지조성사업체육시설의 설치 이용에 관한 법률실시계획인가일준공검사일<NA>2023-06-02 00:00:00
701023산지전용허가(신고)산지관리법행위허가일제9조제3항의 신고일 또는 시설물의 사용승인일<NA>2023-06-02 00:00:00
711024창고시설의 설치로 사실상 또는 공부상의 지목변경이 수반되는 사업건축법건축허가(신고)일건축물사용승인일<NA>2023-06-02 00:00:00
721025창고시설의 설치를 위한 용지조성사업국토계획 및 이용에 관한 법률행위허가일 또는 실시계획 인가일준공검사일<NA>2023-06-02 00:00:00
731026공장용지조성사업중소기업창업 지원법사업계획승인일건축물사용승인일<NA>2023-06-02 00:00:00
741027사실상 또는 공부상의 지목변경이 수반되는 사업기타개발행위 농지전용 산지전용 또는 초지전용허가(신고)일준공검사일 건축물(시설물)사용승인일 또는 제9조제3항에 따른 날<NA>2023-06-02 00:00:00
75109주택을 건축하기 위한 용도로 토지를 개발하는사업 등(국토교통부령으로정하는사업)기타개발행위 농지전용 산지전용 또는 초지전용허가(신고)일준공검사일 건축물(시설물)사용승인일 또는 제9조제3항에 따른 날<NA>2023-06-02 00:00:00