Overview

Dataset statistics

Number of variables6
Number of observations1220
Missing cells0
Missing cells (%)0.0%
Duplicate rows14
Duplicate rows (%)1.1%
Total size in memory58.5 KiB
Average record size in memory49.1 B

Variable types

DateTime1
Text3
Numeric1
Categorical1

Dataset

Description경상남도 합천군 개발행위 허가(허가일자, 지번주소, 지목, 면적, 용도지역, 허가목적)에 대한 정보를 제공하고 있습니다.
Author경상남도 합천군
URLhttps://www.data.go.kr/data/15035697/fileData.do

Alerts

Dataset has 14 (1.1%) duplicate rowsDuplicates
용도지역 is highly imbalanced (50.4%)Imbalance

Reproduction

Analysis started2023-12-12 20:31:28.231429
Analysis finished2023-12-12 20:31:29.112987
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

Distinct339
Distinct (%)27.8%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
Minimum2018-05-01 00:00:00
Maximum2020-12-30 00:00:00
2023-12-13T05:31:29.205873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:31:29.381197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1050
Distinct (%)86.1%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
2023-12-13T05:31:29.697414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length103
Median length75
Mean length23.682787
Min length17

Characters and Unicode

Total characters28893
Distinct characters164
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique936 ?
Unique (%)76.7%

Sample

1st row경상남도 합천군 쌍백면 평지리45
2nd row경상남도 합천군 야로면 하림리280
3rd row경상남도 합천군 가야면 죽전리720-3,720-4
4th row경상남도 합천군 율곡면 갑산리 306-1
5th row경상남도 합천군 쌍백면 장전리454-2
ValueCountFrequency (%)
경상남도 1220
22.2%
합천군 1220
22.2%
가야면 140
 
2.5%
대병면 94
 
1.7%
율곡면 92
 
1.7%
용주면 87
 
1.6%
합천읍 85
 
1.5%
봉산면 85
 
1.5%
쌍백면 76
 
1.4%
묘산면 73
 
1.3%
Other values (1459) 2326
42.3%
2023-12-13T05:31:30.240213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4280
 
14.8%
1367
 
4.7%
1326
 
4.6%
1262
 
4.4%
1247
 
4.3%
1 1246
 
4.3%
1224
 
4.2%
1220
 
4.2%
1220
 
4.2%
1220
 
4.2%
Other values (154) 13281
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16069
55.6%
Decimal Number 6783
23.5%
Space Separator 4280
 
14.8%
Dash Punctuation 930
 
3.2%
Other Punctuation 787
 
2.7%
Open Punctuation 18
 
0.1%
Close Punctuation 18
 
0.1%
Uppercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1367
 
8.5%
1326
 
8.3%
1262
 
7.9%
1247
 
7.8%
1224
 
7.6%
1220
 
7.6%
1220
 
7.6%
1220
 
7.6%
1135
 
7.1%
522
 
3.2%
Other values (131) 4326
26.9%
Decimal Number
ValueCountFrequency (%)
1 1246
18.4%
2 869
12.8%
3 725
10.7%
6 648
9.6%
4 647
9.5%
5 622
9.2%
7 592
8.7%
9 503
7.4%
8 484
 
7.1%
0 447
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
G 1
12.5%
H 1
12.5%
A 1
12.5%
F 1
12.5%
E 1
12.5%
D 1
12.5%
C 1
12.5%
B 1
12.5%
Space Separator
ValueCountFrequency (%)
4280
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 930
100.0%
Other Punctuation
ValueCountFrequency (%)
, 787
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16069
55.6%
Common 12816
44.4%
Latin 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1367
 
8.5%
1326
 
8.3%
1262
 
7.9%
1247
 
7.8%
1224
 
7.6%
1220
 
7.6%
1220
 
7.6%
1220
 
7.6%
1135
 
7.1%
522
 
3.2%
Other values (131) 4326
26.9%
Common
ValueCountFrequency (%)
4280
33.4%
1 1246
 
9.7%
- 930
 
7.3%
2 869
 
6.8%
, 787
 
6.1%
3 725
 
5.7%
6 648
 
5.1%
4 647
 
5.0%
5 622
 
4.9%
7 592
 
4.6%
Other values (5) 1470
 
11.5%
Latin
ValueCountFrequency (%)
G 1
12.5%
H 1
12.5%
A 1
12.5%
F 1
12.5%
E 1
12.5%
D 1
12.5%
C 1
12.5%
B 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16068
55.6%
ASCII 12824
44.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4280
33.4%
1 1246
 
9.7%
- 930
 
7.3%
2 869
 
6.8%
, 787
 
6.1%
3 725
 
5.7%
6 648
 
5.1%
4 647
 
5.0%
5 622
 
4.9%
7 592
 
4.6%
Other values (13) 1478
 
11.5%
Hangul
ValueCountFrequency (%)
1367
 
8.5%
1326
 
8.3%
1262
 
7.9%
1247
 
7.8%
1224
 
7.6%
1220
 
7.6%
1220
 
7.6%
1220
 
7.6%
1135
 
7.1%
522
 
3.2%
Other values (130) 4325
26.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

지목
Text

Distinct69
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
2023-12-13T05:31:30.495634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length1
Mean length1.3344262
Min length1

Characters and Unicode

Total characters1628
Distinct characters19
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)2.8%

Sample

1st row
2nd row
3rd row
4th row
5th row
ValueCountFrequency (%)
565
46.3%
191
 
15.7%
190
 
15.6%
28
 
2.3%
26
 
2.1%
21
 
1.7%
21
 
1.7%
전+답 19
 
1.6%
전+임 15
 
1.2%
답+전 11
 
0.9%
Other values (59) 133
 
10.9%
2023-12-13T05:31:30.946521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
662
40.7%
277
17.0%
256
 
15.7%
+ 204
 
12.5%
49
 
3.0%
48
 
2.9%
47
 
2.9%
31
 
1.9%
13
 
0.8%
8
 
0.5%
Other values (9) 33
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1424
87.5%
Math Symbol 204
 
12.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
662
46.5%
277
19.5%
256
 
18.0%
49
 
3.4%
48
 
3.4%
47
 
3.3%
31
 
2.2%
13
 
0.9%
8
 
0.6%
8
 
0.6%
Other values (8) 25
 
1.8%
Math Symbol
ValueCountFrequency (%)
+ 204
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1424
87.5%
Common 204
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
662
46.5%
277
19.5%
256
 
18.0%
49
 
3.4%
48
 
3.4%
47
 
3.3%
31
 
2.2%
13
 
0.9%
8
 
0.6%
8
 
0.6%
Other values (8) 25
 
1.8%
Common
ValueCountFrequency (%)
+ 204
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1424
87.5%
ASCII 204
 
12.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
662
46.5%
277
19.5%
256
 
18.0%
49
 
3.4%
48
 
3.4%
47
 
3.3%
31
 
2.2%
13
 
0.9%
8
 
0.6%
8
 
0.6%
Other values (8) 25
 
1.8%
ASCII
ValueCountFrequency (%)
+ 204
100.0%

면적
Real number (ℝ)

Distinct895
Distinct (%)73.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1593.9918
Minimum12
Maximum29711
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.9 KiB
2023-12-13T05:31:31.129628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile140
Q1477.5
median832
Q31724
95-th percentile5411.15
Maximum29711
Range29699
Interquartile range (IQR)1246.5

Descriptive statistics

Standard deviation2221.9551
Coefficient of variation (CV)1.3939564
Kurtosis40.16269
Mean1593.9918
Median Absolute Deviation (MAD)464.25
Skewness4.7689866
Sum1944670
Variance4937084.5
MonotonicityNot monotonic
2023-12-13T05:31:31.340667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
660.0 17
 
1.4%
330.0 10
 
0.8%
658.0 9
 
0.7%
600.0 8
 
0.7%
990.0 7
 
0.6%
450.0 7
 
0.6%
200.0 6
 
0.5%
99.0 6
 
0.5%
650.0 6
 
0.5%
655.0 6
 
0.5%
Other values (885) 1138
93.3%
ValueCountFrequency (%)
12.0 1
0.1%
13.0 1
0.1%
14.0 1
0.1%
15.0 1
0.1%
17.0 1
0.1%
26.0 1
0.1%
32.0 1
0.1%
33.0 1
0.1%
36.0 1
0.1%
44.0 1
0.1%
ValueCountFrequency (%)
29711.0 1
0.1%
27152.0 1
0.1%
15021.0 1
0.1%
14000.0 2
0.2%
13284.0 1
0.1%
12654.4 1
0.1%
11106.3 1
0.1%
10825.0 1
0.1%
10574.0 1
0.1%
10355.0 1
0.1%

용도지역
Categorical

IMBALANCE 

Distinct33
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
계획관리지역
550 
농림지역
243 
보전관리지역
154 
생산관리지역
132 
자연녹지지역
 
33
Other values (28)
108 

Length

Max length18
Median length6
Mean length5.8754098
Min length1

Unique

Unique12 ?
Unique (%)1.0%

Sample

1st row계획관리지역
2nd row보전관리지역
3rd row생산관리지역
4th row생산관리지역
5th row계획관리지역

Common Values

ValueCountFrequency (%)
계획관리지역 550
45.1%
농림지역 243
19.9%
보전관리지역 154
 
12.6%
생산관리지역 132
 
10.8%
자연녹지지역 33
 
2.7%
제1종일반주거지역 32
 
2.6%
생산녹지지역 9
 
0.7%
제2종일반주거지역 8
 
0.7%
계회관리지역 8
 
0.7%
농림지역+보전관리지역 6
 
0.5%
Other values (23) 45
 
3.7%

Length

2023-12-13T05:31:31.503562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
계획관리지역 550
45.1%
농림지역 243
19.9%
보전관리지역 154
 
12.6%
생산관리지역 132
 
10.8%
자연녹지지역 33
 
2.7%
제1종일반주거지역 32
 
2.6%
생산녹지지역 9
 
0.7%
제2종일반주거지역 8
 
0.7%
계회관리지역 8
 
0.7%
농림지역+보전관리지역 6
 
0.5%
Other values (23) 45
 
3.7%
Distinct253
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
2023-12-13T05:31:31.748091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length28
Mean length10.94918
Min length4

Characters and Unicode

Total characters13358
Distinct characters222
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique168 ?
Unique (%)13.8%

Sample

1st row단독주택건립
2nd row단독주택건립
3rd row단독주택건립및진입도로개설
4th row단독주택건립
5th row단독주택건립
ValueCountFrequency (%)
단독주택건립 240
19.7%
발전시설(태양광)조성 145
 
11.9%
동식물관련시설(우사)건립 88
 
7.2%
창고시설(농업용)건립 86
 
7.0%
발전시설(태양광)조성-지붕위 54
 
4.4%
태양광발전시설설치(지붕위 49
 
4.0%
태양광발전시설설치 49
 
4.0%
단독주택(농업용)건립 44
 
3.6%
태양광발전시설조성 35
 
2.9%
창고시설건립 14
 
1.1%
Other values (242) 416
34.1%
2023-12-13T05:31:32.105579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
955
 
7.1%
729
 
5.5%
714
 
5.3%
( 706
 
5.3%
) 706
 
5.3%
696
 
5.2%
374
 
2.8%
370
 
2.8%
368
 
2.8%
368
 
2.8%
Other values (212) 7372
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11692
87.5%
Open Punctuation 706
 
5.3%
Close Punctuation 706
 
5.3%
Dash Punctuation 144
 
1.1%
Decimal Number 76
 
0.6%
Other Punctuation 21
 
0.2%
Uppercase Letter 9
 
0.1%
Connector Punctuation 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
955
 
8.2%
729
 
6.2%
714
 
6.1%
696
 
6.0%
374
 
3.2%
370
 
3.2%
368
 
3.1%
368
 
3.1%
367
 
3.1%
366
 
3.1%
Other values (195) 6385
54.6%
Decimal Number
ValueCountFrequency (%)
2 50
65.8%
1 22
28.9%
9 1
 
1.3%
3 1
 
1.3%
5 1
 
1.3%
4 1
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
P 4
44.4%
B 2
22.2%
G 1
 
11.1%
C 1
 
11.1%
L 1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 706
100.0%
Close Punctuation
ValueCountFrequency (%)
) 706
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 144
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11692
87.5%
Common 1657
 
12.4%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
955
 
8.2%
729
 
6.2%
714
 
6.1%
696
 
6.0%
374
 
3.2%
370
 
3.2%
368
 
3.1%
368
 
3.1%
367
 
3.1%
366
 
3.1%
Other values (195) 6385
54.6%
Common
ValueCountFrequency (%)
( 706
42.6%
) 706
42.6%
- 144
 
8.7%
2 50
 
3.0%
1 22
 
1.3%
, 21
 
1.3%
_ 3
 
0.2%
9 1
 
0.1%
3 1
 
0.1%
5 1
 
0.1%
Other values (2) 2
 
0.1%
Latin
ValueCountFrequency (%)
P 4
44.4%
B 2
22.2%
G 1
 
11.1%
C 1
 
11.1%
L 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11692
87.5%
ASCII 1665
 
12.5%
Arrows 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
955
 
8.2%
729
 
6.2%
714
 
6.1%
696
 
6.0%
374
 
3.2%
370
 
3.2%
368
 
3.1%
368
 
3.1%
367
 
3.1%
366
 
3.1%
Other values (195) 6385
54.6%
ASCII
ValueCountFrequency (%)
( 706
42.4%
) 706
42.4%
- 144
 
8.6%
2 50
 
3.0%
1 22
 
1.3%
, 21
 
1.3%
P 4
 
0.2%
_ 3
 
0.2%
B 2
 
0.1%
G 1
 
0.1%
Other values (6) 6
 
0.4%
Arrows
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T05:31:28.718614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:31:32.189623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지목면적용도지역
지목1.0000.5300.784
면적0.5301.0000.000
용도지역0.7840.0001.000
2023-12-13T05:31:32.265003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적용도지역
면적1.0000.000
용도지역0.0001.000

Missing values

2023-12-13T05:31:28.907043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:31:29.054469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일자지번주소지목면적용도지역허가목적
02018-05-01경상남도 합천군 쌍백면 평지리45601.0계획관리지역단독주택건립
12018-05-01경상남도 합천군 야로면 하림리280651.0보전관리지역단독주택건립
22018-05-01경상남도 합천군 가야면 죽전리720-3,720-4800.0생산관리지역단독주택건립및진입도로개설
32018-05-02경상남도 합천군 율곡면 갑산리 306-1837.1생산관리지역단독주택건립
42018-05-02경상남도 합천군 쌍백면 장전리454-2655.0계획관리지역단독주택건립
52018-05-02경상남도 합천군 초계면 택리135752539.4농림지역동식물관련시설(우사)건립
62018-05-03경상남도 합천군 율곡면 낙민리91, 92, 93 ,98, 1517-17답+전+구4900.0계획관리지역주기장설치(중장비주차장)
72018-05-03경상남도 합천군 쌍백면 장전리214-31570.0생산관리지역동식물관련시설(우사)건립
82018-05-03경상남도 합천군 야로면 하림리252-4524.0계획관리지역단독주택(농업용)건립
92018-05-03경상남도 합천군 가야면 치인리531-1554.0계획관리지역단독주택건립
일자지번주소지목면적용도지역허가목적
12102020-12-20경상남도 합천군 덕곡면 율지리284-2200.0농림지역제2종근린생활시설(농산물가공장)건립
12112020-12-20경상남도 합천군 쌍책면 상포리469-5496.0보전관리지역제1종근린생활시설(소매점)건립
12122020-12-23경상남도 합천군 삼가면 외토리929704.0계획관리지역단독주택건립
12132020-12-26경상남도 합천군 묘산면 도옥리31-19496.0농림지역단독주택(농업용)건립
12142020-12-26경상남도 합천군 쌍백면 평구리5741008.0계획관리지역하수관로공사현장사무실설치
12152020-12-27경상남도 합천군 야로면 묵촌리657155.0계획관리지역제1종근린생활시설(마을공동작업장)건립
12162020-12-27경상남도 합천군 묘산면 안성리405-3555.0농림지역창고시설(퇴비사)건립
12172020-12-27경상남도 합천군 삼가면 용흥리822513.0생산관리지역단독주택건립
12182020-12-30경상남도 합천군 봉산면 봉계리621, 623, 1037-61747.0계획관리지역단독주택건립
12192020-12-30경상남도 합천군 묘산면 안성리406-2490.0농림지역창고시설(퇴비사)건립

Duplicate rows

Most frequently occurring

일자지번주소지목면적용도지역허가목적# duplicates
12020-01-01경상남도 합천군 율곡면 임북리555-1, 555-4목+잡426.4생산관리지역태양광발전시설설치(지붕위)4
72020-06-19경상남도 합천군 용주면 성산리132, 133-2, 134450.0계획관리지역태양광발전시설설치(지붕위)4
02018-08-22경상남도 합천군 합천읍 금양리320540.0계획관리지역태양광발전시설설이(지붕위)2
22020-01-06경상남도 합천군 율곡면 본천리468381.0보전관리지역제2종근린생활시설(제실)및단독주택증축건립2
32020-01-29경상남도 합천군 야로면 묵촌리01월 17일521.0계획관리지역발전시설(태양광)조성-지붕위-2
42020-04-11경상남도 합천군 야로면 하빈리705-1, 704-1, 702-22998.0생산녹지지역공장(농산물가공장)건립2
52020-06-16경상남도 합천군 적중면 상부리582600.0농림지역발전시설(태양광)조성-지붕위-2
62020-06-19경상남도 합천군 용주면 성산리132, 133-2, 134375.0계획관리지역태양광발전시설설치(지붕위)2
82020-06-21경상남도 합천군 가회면 장대리958-1560.0농림지역제2종근린생활시설(농산물제조업소)건립2
92020-08-02경상남도 합천군 초계면 관평리76-164710.0농림지역동식물관련시설(우사)건립2