Overview

Dataset statistics

Number of variables10
Number of observations161
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.0 KiB
Average record size in memory82.8 B

Variable types

Numeric2
DateTime1
Categorical6
Text1

Dataset

Description광주광역시 동구 개발행위허가정보 데이터입니다.연번, 허가일, 개발행위구분, 동명, 지번, 지목, 용도지역, 허가목적, 허가면적, 데이터기준일자로 구성되어있습니다.
Author광주광역시 동구
URLhttps://www.data.go.kr/data/15112708/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
허가면적(제곱미터) is highly overall correlated with 지목 and 2 other fieldsHigh correlation
개발행위구분 is highly overall correlated with 지목High correlation
지목 is highly overall correlated with 허가면적(제곱미터) and 2 other fieldsHigh correlation
용도지역 is highly overall correlated with 허가면적(제곱미터)High correlation
허가목적(용도) is highly overall correlated with 허가면적(제곱미터) and 1 other fieldsHigh correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-16 04:14:04.669919
Analysis finished2024-03-16 04:14:06.027419
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct161
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81
Minimum1
Maximum161
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-03-16T13:14:06.144946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9
Q141
median81
Q3121
95-th percentile153
Maximum161
Range160
Interquartile range (IQR)80

Descriptive statistics

Standard deviation46.620811
Coefficient of variation (CV)0.57556557
Kurtosis-1.2
Mean81
Median Absolute Deviation (MAD)40
Skewness0
Sum13041
Variance2173.5
MonotonicityStrictly increasing
2024-03-16T13:14:06.709814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
122 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
Other values (151) 151
93.8%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
161 1
0.6%
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%
154 1
0.6%
153 1
0.6%
152 1
0.6%
Distinct116
Distinct (%)72.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum2020-01-17 00:00:00
Maximum2023-12-28 00:00:00
2024-03-16T13:14:06.973171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:14:07.164362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

개발행위구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
공작물설치
94 
토지형질변경
66 
형질변경
 
1

Length

Max length6
Median length5
Mean length5.4037267
Min length4

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row토지형질변경
2nd row토지형질변경
3rd row토지형질변경
4th row공작물설치
5th row토지형질변경

Common Values

ValueCountFrequency (%)
공작물설치 94
58.4%
토지형질변경 66
41.0%
형질변경 1
 
0.6%

Length

2024-03-16T13:14:07.407768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T13:14:07.657043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공작물설치 94
58.4%
토지형질변경 66
41.0%
형질변경 1
 
0.6%

동명
Categorical

Distinct22
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
지산동
23 
소태동
21 
운림동
19 
산수동
18 
학동
14 
Other values (17)
66 

Length

Max length8
Median length3
Mean length2.9627329
Min length2

Unique

Unique6 ?
Unique (%)3.7%

Sample

1st row지산동
2nd row학동
3rd row용연동
4th row운림동
5th row용산동

Common Values

ValueCountFrequency (%)
지산동 23
14.3%
소태동 21
13.0%
운림동 19
11.8%
산수동 18
11.2%
학동 14
8.7%
내남동 12
7.5%
계림동 10
6.2%
용산동 9
 
5.6%
동명동 8
 
5.0%
용연동 6
 
3.7%
Other values (12) 21
13.0%

Length

2024-03-16T13:14:07.834938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지산동 23
14.2%
소태동 22
13.6%
운림동 19
11.7%
산수동 18
11.1%
학동 14
8.6%
내남동 12
7.4%
계림동 10
6.2%
용산동 10
6.2%
동명동 8
 
4.9%
용연동 6
 
3.7%
Other values (11) 20
12.3%

지번
Text

Distinct141
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-16T13:14:08.401811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length5.8136646
Min length1

Characters and Unicode

Total characters936
Distinct characters25
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)79.5%

Sample

1st row310-17 외 1
2nd row221-3외1
3rd row277외3
4th row469
5th row385-10외3
ValueCountFrequency (%)
22
 
9.5%
1 21
 
9.1%
320외 8
 
3.5%
3 5
 
2.2%
2 4
 
1.7%
453-1 3
 
1.3%
101-4 2
 
0.9%
01월 2
 
0.9%
외1 2
 
0.9%
외2 2
 
0.9%
Other values (145) 160
69.3%
2024-03-16T13:14:10.182599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 138
14.7%
- 111
11.9%
2 101
10.8%
3 86
9.2%
70
7.5%
5 68
7.3%
7 61
6.5%
0 56
6.0%
55
 
5.9%
4 53
 
5.7%
Other values (15) 137
14.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 671
71.7%
Dash Punctuation 111
 
11.9%
Other Letter 75
 
8.0%
Space Separator 70
 
7.5%
Lowercase Letter 4
 
0.4%
Uppercase Letter 2
 
0.2%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 138
20.6%
2 101
15.1%
3 86
12.8%
5 68
10.1%
7 61
9.1%
0 56
8.3%
4 53
 
7.9%
6 44
 
6.6%
8 41
 
6.1%
9 23
 
3.4%
Other Letter
ValueCountFrequency (%)
55
73.3%
10
 
13.3%
5
 
6.7%
5
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
u 1
25.0%
g 1
25.0%
a 1
25.0%
n 1
25.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
J 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 111
100.0%
Space Separator
ValueCountFrequency (%)
70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 855
91.3%
Hangul 75
 
8.0%
Latin 6
 
0.6%

Most frequent character per script

Common
ValueCountFrequency (%)
1 138
16.1%
- 111
13.0%
2 101
11.8%
3 86
10.1%
70
8.2%
5 68
8.0%
7 61
7.1%
0 56
6.5%
4 53
 
6.2%
6 44
 
5.1%
Other values (5) 67
7.8%
Latin
ValueCountFrequency (%)
A 1
16.7%
u 1
16.7%
g 1
16.7%
J 1
16.7%
a 1
16.7%
n 1
16.7%
Hangul
ValueCountFrequency (%)
55
73.3%
10
 
13.3%
5
 
6.7%
5
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 861
92.0%
Hangul 75
 
8.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 138
16.0%
- 111
12.9%
2 101
11.7%
3 86
10.0%
70
8.1%
5 68
7.9%
7 61
7.1%
0 56
6.5%
4 53
 
6.2%
6 44
 
5.1%
Other values (11) 73
8.5%
Hangul
ValueCountFrequency (%)
55
73.3%
10
 
13.3%
5
 
6.7%
5
 
6.7%

지목
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
86 
19 
전,답
12 
10 
 
6
Other values (21)
28 

Length

Max length9
Median length1
Mean length1.6335404
Min length1

Unique

Unique16 ?
Unique (%)9.9%

Sample

1st row대,임
2nd row
3rd row전,답
4th row
5th row전,대

Common Values

ValueCountFrequency (%)
86
53.4%
19
 
11.8%
전,답 12
 
7.5%
10
 
6.2%
6
 
3.7%
전,대 4
 
2.5%
주차장 2
 
1.2%
대,도 2
 
1.2%
대,전 2
 
1.2%
대,임,종 2
 
1.2%
Other values (16) 16
 
9.9%

Length

2024-03-16T13:14:10.508348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
87
52.7%
20
 
12.1%
전,답 12
 
7.3%
11
 
6.7%
8
 
4.8%
전,대 4
 
2.4%
대,임,종 2
 
1.2%
2
 
1.2%
임,대 2
 
1.2%
대,전 2
 
1.2%
Other values (13) 15
 
9.1%

용도지역
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
제1종일반주거
25 
제2종일반주거
18 
제2종일반주거지역
13 
자연녹지지역
11 
준주거지역
11 
Other values (24)
83 

Length

Max length15
Median length10
Mean length6.7142857
Min length3

Unique

Unique8 ?
Unique (%)5.0%

Sample

1st row제2종일반주거
2nd row제1종일반주거
3rd row제1종일반주거
4th row자연녹지
5th row보전녹지

Common Values

ValueCountFrequency (%)
제1종일반주거 25
15.5%
제2종일반주거 18
11.2%
제2종일반주거지역 13
 
8.1%
자연녹지지역 11
 
6.8%
준주거지역 11
 
6.8%
보전,자연녹지 10
 
6.2%
중심상업지역 10
 
6.2%
제1종일반주거지역 9
 
5.6%
제2종일반 8
 
5.0%
보전녹지 7
 
4.3%
Other values (19) 39
24.2%

Length

2024-03-16T13:14:10.849208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제1종일반주거 27
16.5%
제2종일반주거 18
11.0%
제2종일반주거지역 13
 
7.9%
자연녹지지역 11
 
6.7%
준주거지역 11
 
6.7%
보전,자연녹지 10
 
6.1%
중심상업지역 10
 
6.1%
제1종일반주거지역 9
 
5.5%
제2종일반 8
 
4.9%
보전녹지 7
 
4.3%
Other values (18) 40
24.4%

허가목적(용도)
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
태양광
85 
단독주택
18 
근린생활시설
 
8
연립주택
 
3
사방사업
 
2
Other values (41)
45 

Length

Max length34
Median length3
Mean length4.9813665
Min length3

Unique

Unique37 ?
Unique (%)23.0%

Sample

1st row단독주택
2nd row근린생활시설
3rd row단독주택
4th row태양광발전설비
5th row어린이집

Common Values

ValueCountFrequency (%)
태양광 85
52.8%
단독주택 18
 
11.2%
근린생활시설 8
 
5.0%
연립주택 3
 
1.9%
사방사업 2
 
1.2%
창고시설 2
 
1.2%
임시주차장 2
 
1.2%
문화 및 집회시설 2
 
1.2%
공동주택(연립주택) 2
 
1.2%
근린생활시설, 문화 및 집회시설 1
 
0.6%
Other values (36) 36
22.4%

Length

2024-03-16T13:14:11.203701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
태양광 85
46.7%
단독주택 19
 
10.4%
근린생활시설 11
 
6.0%
5
 
2.7%
연립주택 3
 
1.6%
설치 3
 
1.6%
임시주차장 3
 
1.6%
문화 3
 
1.6%
집회시설 3
 
1.6%
제2종근생(사무소 2
 
1.1%
Other values (41) 45
24.7%

허가면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct128
Distinct (%)79.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean835.86062
Minimum6
Maximum27332
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-03-16T13:14:11.486415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile26.24
Q177.25
median188
Q3711
95-th percentile3565
Maximum27332
Range27326
Interquartile range (IQR)633.75

Descriptive statistics

Standard deviation2387.4102
Coefficient of variation (CV)2.85623
Kurtosis96.100079
Mean835.86062
Median Absolute Deviation (MAD)153
Skewness8.9476138
Sum134573.56
Variance5699727.3
MonotonicityNot monotonic
2024-03-16T13:14:11.690760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
44.0 8
 
5.0%
26.0 6
 
3.7%
30.0 4
 
2.5%
2198.0 4
 
2.5%
44.98 4
 
2.5%
96.0 3
 
1.9%
60.0 3
 
1.9%
523.0 2
 
1.2%
1376.0 2
 
1.2%
3826.0 2
 
1.2%
Other values (118) 123
76.4%
ValueCountFrequency (%)
6.0 1
 
0.6%
24.0 1
 
0.6%
26.0 6
3.7%
26.24 2
 
1.2%
29.99 1
 
0.6%
30.0 4
2.5%
35.0 1
 
0.6%
36.0 1
 
0.6%
44.0 8
5.0%
44.98 4
2.5%
ValueCountFrequency (%)
27332.0 1
0.6%
7732.0 1
0.6%
5834.0 1
0.6%
4677.0 1
0.6%
3826.0 2
1.2%
3770.0 1
0.6%
3632.0 1
0.6%
3565.0 1
0.6%
3445.0 1
0.6%
2809.0 1
0.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-08
161 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03-08
2nd row2024-03-08
3rd row2024-03-08
4th row2024-03-08
5th row2024-03-08

Common Values

ValueCountFrequency (%)
2024-03-08 161
100.0%

Length

2024-03-16T13:14:11.900443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T13:14:12.080706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03-08 161
100.0%

Interactions

2024-03-16T13:14:05.496489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:14:05.233623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:14:05.613564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:14:05.371430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T13:14:12.193103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번개발행위구분동명지목용도지역허가목적(용도)허가면적(제곱미터)
연번1.0000.6250.6380.6130.8340.7550.000
개발행위구분0.6251.0000.5280.7550.6780.7800.196
동명0.6380.5281.0000.3760.8240.0000.075
지목0.6130.7550.3761.0000.9140.9840.956
용도지역0.8340.6780.8240.9141.0000.9440.873
허가목적(용도)0.7550.7800.0000.9840.9441.0001.000
허가면적(제곱미터)0.0000.1960.0750.9560.8731.0001.000
2024-03-16T13:14:12.371139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도지역개발행위구분허가목적(용도)지목동명
용도지역1.0000.4140.4720.4740.346
개발행위구분0.4141.0000.4710.5030.304
허가목적(용도)0.4720.4711.0000.6840.000
지목0.4740.5030.6841.0000.100
동명0.3460.3040.0000.1001.000
2024-03-16T13:14:12.588739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번허가면적(제곱미터)개발행위구분동명지목용도지역허가목적(용도)
연번1.000-0.3570.4460.2650.2120.4220.302
허가면적(제곱미터)-0.3571.0000.1860.0240.7820.6100.856
개발행위구분0.4460.1861.0000.3040.5030.4140.471
동명0.2650.0240.3041.0000.1000.3460.000
지목0.2120.7820.5030.1001.0000.4740.684
용도지역0.4220.6100.4140.3460.4741.0000.472
허가목적(용도)0.3020.8560.4710.0000.6840.4721.000

Missing values

2024-03-16T13:14:05.725860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:14:05.923508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번허가일개발행위구분동명지번지목용도지역허가목적(용도)허가면적(제곱미터)데이터기준일자
012020-01-17토지형질변경지산동310-17 외 1대,임제2종일반주거단독주택248.02024-03-08
122020-01-17토지형질변경학동221-3외1제1종일반주거근린생활시설308.02024-03-08
232020-01-21토지형질변경용연동277외3전,답제1종일반주거단독주택728.02024-03-08
342020-01-30공작물설치운림동469자연녹지태양광발전설비526.532024-03-08
452020-03-12토지형질변경용산동385-10외3전,대보전녹지어린이집1816.02024-03-08
562020-03-16토지형질변경지산동산3외30전,대,임,도,수제2.3종일반주거공동주택27332.02024-03-08
672020-03-18토지형질변경학동산157-1외3임,대, 종자연녹지연립주택3770.02024-03-08
782020-03-18토지형질변경용연동278외1전, 답제1종일반주거단독주택, 근린생활시설490.02024-03-08
892020-03-25공작물설치학동165-31외2일반상업태양광495.02024-03-08
9102020-05-13토지형질변경학동256-7외3전,답제3종일반주거다세대주택(2동)692.02024-03-08
연번허가일개발행위구분동명지번지목용도지역허가목적(용도)허가면적(제곱미터)데이터기준일자
1511522023-11-08공작물설치동명동Jan-80제2종일반주거지역태양광74.982024-03-08
1521532023-11-08공작물설치산수동542-53준주거지역태양광44.982024-03-08
1531542023-11-23공작물설치학동12월 26일준주거지역태양광170.582024-03-08
1541552023-11-23공작물설치계림동1008준주거지역태양광179.02024-03-08
1551562023-11-23공작물설치운림동산134 외 1자연녹지지역통신공용기지국112.082024-03-08
1561572023-11-27공작물설치내남동857-12제1종일반주거지역태양광185.982024-03-08
1571582023-12-22공작물설치금남로5가183-2중심상업지역태양광308.472024-03-08
1581592023-12-22공작물설치황금동57중심상업지역태양광174.02024-03-08
1591602023-12-28공작물설치계림동1817제3종일반주거지역태양광92.882024-03-08
1601612023-12-28공작물설치지산동703-60제2종일반주거지역태양광66.292024-03-08