Overview

Dataset statistics

Number of variables: 7
Number of observations: 377
Missing cells: 344
Missing cells (%): 13.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.1 KiB
Average record size in memory: 57.4 B
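
The derived figures above follow from the raw counts. A minimal sketch, assuming that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is total memory divided by observations; the variable names are illustrative only:

```python
# Raw counts copied from the dataset statistics above.
n_variables = 7
n_observations = 377
missing_cells = 344
total_size_kib = 21.1  # already rounded in the report

# Missing cells (%) = missing cells / (variables * observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes. This can differ slightly from the
# reported 57.4 B because the total size is rounded to one decimal.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

All 344 missing cells come from a single column (준공일), so the dataset-level share of about 13% corresponds to a column-level share of 91.2%.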

Variable types

Text: 3
Numeric: 1
DateTime: 3

Dataset

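Description: Status of permitted sites in Chungju City where a development activity permit has been obtained following a development activity permit (consultation) application and the project is under way. Columns: 허가번호 (permit number), 소재지 (location), 허가면적 (permitted area), 용도 (use), 허가일 (permit date), 준공일 (completion date), 데이터 기준일 (data reference date).
Author: Chungju City, Chungcheongbuk-do (충청북도 충주시)
URL: https://www.data.go.kr/data/15125428/fileData.do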

Alerts

데이터 기준일 has constant value "" (Constant)
준공일 has 344 (91.2%) missing values (Missing)
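
Both alerts can be reproduced with simple per-column checks. The sketch below is an illustration, not the profiling tool's implementation: `column_alerts` and `MISSING_THRESHOLD` are hypothetical names, the threshold value is an assumption, and the toy columns only mimic the counts reported here.

```python
from typing import Optional

# Hypothetical threshold: flag a column when more than half its cells are missing.
MISSING_THRESHOLD = 0.5


def column_alerts(name: str, values: list[Optional[str]]) -> list[str]:
    """Return alert strings for one column; None marks a missing cell."""
    alerts = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    if values and n_missing / len(values) > MISSING_THRESHOLD:
        share = 100 * n_missing / len(values)
        alerts.append(f"{name} has {n_missing} ({share:.1f}%) missing values")
    if present and len(set(present)) == 1:
        alerts.append(f'{name} has constant value "{present[0]}"')
    return alerts


# Toy columns shaped like the report: 344 of 377 completion dates missing,
# and one reference date repeated in every row.
completion = [None] * 344 + ["2023-02-27", "2023-11-21"] * 16 + ["2023-04-19"]
reference = ["2023-11-30"] * 377

print(column_alerts("준공일", completion))
print(column_alerts("데이터 기준일", reference))
```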

Reproduction

Analysis started: 2023-12-16 15:06:12.282227
Analysis finished: 2023-12-16 15:06:14.763092
Duration: 2.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

허가번호 (permit number)
Text

Distinct: 357
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 10
Median length: 8
Mean length: 7.7480106
Min length: 6

Characters and Unicode

Total characters: 2921
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
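
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using only the five sample values listed in this report (not the full column). The two-letter codes `Nd` and `Pd` correspond to the "Decimal Number" and "Dash Punctuation" categories reported for this variable.

```python
import unicodedata
from collections import Counter

# The five sample permit numbers shown in this report.
samples = ["2023-003", "2023-008", "2023-017", "2023-018", "2023-004"]

# Count the Unicode general category of every character in every value.
categories = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

print(categories)
```

The standard library exposes general categories but not scripts or blocks, so this sketch covers only the category breakdown.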

Unique

Unique: 337
Unique (%): 89.4%
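
"Distinct" and "Unique" measure different things: distinct counts the different values, while unique counts the values that occur exactly once. For this variable, 357 distinct minus 337 unique leaves 20 values that occur more than once, which is consistent with each of them occurring exactly twice (337 + 2 × 20 = 377). A small sketch with toy values (not the real column):

```python
from collections import Counter

# Toy permit numbers: two values are repeated, three occur once.
values = [
    "2023-003", "2023-74", "2023-74",
    "2023-007", "2023-007", "2023-008", "2023-017",
]

counts = Counter(values)
distinct = len(counts)                              # number of different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
repeated = distinct - unique                        # values seen more than once

print(distinct, unique, repeated)
```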

Sample

1st row: 2023-003
2nd row: 2023-008
3rd row: 2023-017
4th row: 2023-018
5th row: 2023-004

Value               Count  Frequency (%)
2023-74             2      0.5%
2023-007            2      0.5%
2023-75             2      0.5%
2023-73             2      0.5%
2023-72             2      0.5%
2023-71             2      0.5%
2023-054            2      0.5%
2023-010            2      0.5%
2023-65             2      0.5%
2023-60             2      0.5%
Other values (347)  357    94.7%

Most occurring characters

Value  Count  Frequency (%)
2      931    31.9%
0      508    17.4%
3      503    17.2%
-      378    12.9%
1      179    6.1%
4      74     2.5%
6      74     2.5%
7      73     2.5%
5      72     2.5%
9      65     2.2%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    2543   87.1%
Dash Punctuation  378    12.9%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
2      931    36.6%
0      508    20.0%
3      503    19.8%
1      179    7.0%
4      74     2.9%
6      74     2.9%
7      73     2.9%
5      72     2.8%
9      65     2.6%
8      64     2.5%

Dash Punctuation

Value  Count  Frequency (%)
-      378    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  2921   100.0%

Most frequent character per script

Common

Value  Count  Frequency (%)
2      931    31.9%
0      508    17.4%
3      503    17.2%
-      378    12.9%
1      179    6.1%
4      74     2.5%
6      74     2.5%
7      73     2.5%
5      72     2.5%
9      65     2.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  2921   100.0%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
2      931    31.9%
0      508    17.4%
3      503    17.2%
-      378    12.9%
1      179    6.1%
4      74     2.5%
6      74     2.5%
7      73     2.5%
5      72     2.5%
9      65     2.2%

소재지 (location)
Text

Distinct: 353
Distinct (%): 93.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 36
Median length: 32
Mean length: 25.103448
Min length: 17

Characters and Unicode

Total characters: 9464
Distinct characters: 162
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 338
Unique (%): 89.7%

Sample

1st row: 충청북도 충주시 엄정면 가춘리 208번지 외1필지
2nd row: 충청북도 충주시 앙성면 본평리 26-7
3rd row: 충청북도 충주시 노은면 신효리 산102
4th row: 충청북도 충주시 앙성면 돈산리 169번지 외 2필지
5th row: 충청북도 충주시 금가면 원포리 631

Value               Count  Frequency (%)
충청북도            377    17.9%
충주시              377    17.9%
(unrecovered)       151    7.2%
1필지               73     3.5%
엄정면              37     1.8%
2필지               33     1.6%
신니면              32     1.5%
주덕읍              27     1.3%
앙성면              26     1.2%
동량면              25     1.2%
Other values (488)  944    44.9%

Most occurring characters

Value               Count  Frequency (%)
(space)             1725   18.2%
(unrecovered)       754    8.0%
(unrecovered)       520    5.5%
(unrecovered)       414    4.4%
(unrecovered)       387    4.1%
(unrecovered)       378    4.0%
(unrecovered)       377    4.0%
(unrecovered)       377    4.0%
(unrecovered)       358    3.8%
1                   349    3.7%
Other values (152)  3825   40.4%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       5945   62.8%
Space Separator    1725   18.2%
Decimal Number     1509   15.9%
Dash Punctuation   261    2.8%
Open Punctuation   7      0.1%
Close Punctuation  7      0.1%
Math Symbol        5      0.1%
Other Punctuation  5      0.1%

Most frequent character per category

Other Letter

Value               Count  Frequency (%)
(unrecovered)       754    12.7%
(unrecovered)       520    8.7%
(unrecovered)       414    7.0%
(unrecovered)       387    6.5%
(unrecovered)       378    6.4%
(unrecovered)       377    6.3%
(unrecovered)       377    6.3%
(unrecovered)       358    6.0%
(unrecovered)       280    4.7%
(unrecovered)       255    4.3%
Other values (136)  1845   31.0%

Decimal Number

Value  Count  Frequency (%)
1      349    23.1%
2      240    15.9%
4      155    10.3%
3      153    10.1%
5      128    8.5%
6      113    7.5%
8      109    7.2%
7      103    6.8%
9      80     5.3%
0      79     5.2%

Space Separator

Value    Count  Frequency (%)
(space)  1725   100.0%

Dash Punctuation

Value  Count  Frequency (%)
-      261    100.0%

Open Punctuation

Value  Count  Frequency (%)
(      7      100.0%

Close Punctuation

Value  Count  Frequency (%)
)      7      100.0%

Math Symbol

Value  Count  Frequency (%)
←      5      100.0%

Other Punctuation

Value  Count  Frequency (%)
,      5      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  5945   62.8%
Common  3519   37.2%

Most frequent character per script

Hangul

Value               Count  Frequency (%)
(unrecovered)       754    12.7%
(unrecovered)       520    8.7%
(unrecovered)       414    7.0%
(unrecovered)       387    6.5%
(unrecovered)       378    6.4%
(unrecovered)       377    6.3%
(unrecovered)       377    6.3%
(unrecovered)       358    6.0%
(unrecovered)       280    4.7%
(unrecovered)       255    4.3%
Other values (136)  1845   31.0%

Common

Value             Count  Frequency (%)
(space)           1725   49.0%
1                 349    9.9%
-                 261    7.4%
2                 240    6.8%
4                 155    4.4%
3                 153    4.3%
5                 128    3.6%
6                 113    3.2%
8                 109    3.1%
7                 103    2.9%
Other values (6)  183    5.2%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  5945   62.8%
ASCII   3514   37.1%
Arrows  5      0.1%

Most frequent character per block

ASCII

Value             Count  Frequency (%)
(space)           1725   49.1%
1                 349    9.9%
-                 261    7.4%
2                 240    6.8%
4                 155    4.4%
3                 153    4.4%
5                 128    3.6%
6                 113    3.2%
8                 109    3.1%
7                 103    2.9%
Other values (5)  178    5.1%

Hangul

Value               Count  Frequency (%)
(unrecovered)       754    12.7%
(unrecovered)       520    8.7%
(unrecovered)       414    7.0%
(unrecovered)       387    6.5%
(unrecovered)       378    6.4%
(unrecovered)       377    6.3%
(unrecovered)       377    6.3%
(unrecovered)       358    6.0%
(unrecovered)       280    4.7%
(unrecovered)       255    4.3%
Other values (136)  1845   31.0%

Arrows

Value  Count  Frequency (%)
←      5      100.0%

허가면적 (permitted area)
Real number (ℝ)

Distinct: 297
Distinct (%): 78.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1973.7276
Minimum: 0
Maximum: 55539
Zeros: 1
Zeros (%): 0.3%
Negative: 0
Negative (%): 0.0%
Memory size: 3.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 99
Q1: 463
Median: 703
Q3: 1193
95-th percentile: 5827.4
Maximum: 55539
Range: 55539
Interquartile range (IQR): 730
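
The range and IQR above are simple differences of the listed quantiles. The sketch below recomputes them and, as an added illustration only, derives Tukey's 1.5 × IQR fences; the fences are an assumption of this example and are not part of the report.

```python
# Quantile values copied from the 허가면적 statistics above.
minimum, q1, median, q3, maximum = 0, 463, 703, 1193, 55539

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range

# Tukey's fences (illustrative; not reported by the profiling tool).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, lower_fence, upper_fence)
```

Under that rule, the 95-th percentile (5827.4) already lies above the upper fence, which matches the strong right skew reported for this variable.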

Descriptive statistics

Standard deviation: 5480.1971
Coefficient of variation (CV): 2.7765722
Kurtosis: 57.773882
Mean: 1973.7276
Median Absolute Deviation (MAD): 294
Skewness: 7.1792898
Sum: 744095.3
Variance: 30032560
Monotonicity: Not monotonic
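
Several of these figures are tied together by standard identities: CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. A short consistency check using only the rounded values printed in this report (so small rounding differences are expected):

```python
# Values copied from the 허가면적 statistics in this report.
n_observations = 377
mean = 1973.7276
std = 5480.1971

cv = std / mean               # coefficient of variation
variance = std ** 2           # variance
total = mean * n_observations # sum of all values

print(f"CV: {cv:.4f}")
print(f"variance: {variance:.0f}")
print(f"sum: {total:.1f}")
```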
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
660.0               13     3.4%
99.0                11     2.9%
990.0               7      1.9%
330.0               6      1.6%
659.0               5      1.3%
995.0               5      1.3%
1950.0              3      0.8%
998.0               3      0.8%
694.0               3      0.8%
826.0               3      0.8%
Other values (287)  318    84.4%

Minimum 10 values

Value  Count  Frequency (%)
0.0    1      0.3%
67.0   1      0.3%
71.0   1      0.3%
74.0   1      0.3%
82.0   2      0.5%
90.0   1      0.3%
92.0   1      0.3%
93.0   1      0.3%
95.0   1      0.3%
99.0   11     2.9%

Maximum 10 values

Value    Count  Frequency (%)
55539.0  1      0.3%
52843.0  1      0.3%
41262.9  1      0.3%
38752.0  1      0.3%
29758.7  1      0.3%
28313.7  1      0.3%
14340.0  1      0.3%
14000.0  1      0.3%
13430.0  1      0.3%
8959.0   1      0.3%

용도 (use)
Text

Distinct: 191
Distinct (%): 50.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 35
Median length: 31
Mean length: 18.30504
Min length: 4

Characters and Unicode

Total characters: 6901
Distinct characters: 173
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 153
Unique (%): 40.6%

Sample

1st row: 제2종근린생활시설(사무소)신축 부지 및 진입로조성
2nd row: 단독(일반)주택 신축 및 진입로 부지조성
3rd row: 묘지관련시설(가족묘지) 부지조성
4th row: 공동주택(기숙사)신축 부지조성
5th row: 단독(일반)주택 및 제1종근린생활시설(소매점)신축 부지 및 진입

Value               Count  Frequency (%)
부지조성            230    18.0%
신축                198    15.5%
단독주택            108    8.4%
(unrecovered)       97     7.6%
진입로              53     4.1%
단독(일반)주택      50     3.9%
조성                48     3.7%
신축부지            36     2.8%
제2종               32     2.5%
부지                27     2.1%
Other values (189)  402    31.4%

Most occurring characters

Value               Count  Frequency (%)
(space)             904    13.1%
(unrecovered)       374    5.4%
(unrecovered)       339    4.9%
(unrecovered)       329    4.8%
(unrecovered)       328    4.8%
(unrecovered)       266    3.9%
(unrecovered)       254    3.7%
)                   238    3.4%
(                   237    3.4%
(unrecovered)       196    2.8%
Other values (163)  3436   49.8%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       5380   78.0%
Space Separator    904    13.1%
Close Punctuation  238    3.4%
Open Punctuation   238    3.4%
Decimal Number     101    1.5%
Other Punctuation  37     0.5%
Dash Punctuation   3      < 0.1%

Most frequent character per category

Other Letter

Value               Count  Frequency (%)
(unrecovered)       374    7.0%
(unrecovered)       339    6.3%
(unrecovered)       329    6.1%
(unrecovered)       328    6.1%
(unrecovered)       266    4.9%
(unrecovered)       254    4.7%
(unrecovered)       196    3.6%
(unrecovered)       178    3.3%
(unrecovered)       175    3.3%
(unrecovered)       173    3.2%
Other values (153)  2768   51.4%

Decimal Number

Value  Count  Frequency (%)
2      57     56.4%
1      40     39.6%
3      2      2.0%
4      2      2.0%

Open Punctuation

Value  Count  Frequency (%)
(      237    99.6%
[      1      0.4%

Space Separator

Value    Count  Frequency (%)
(space)  904    100.0%

Close Punctuation

Value  Count  Frequency (%)
)      238    100.0%

Other Punctuation

Value  Count  Frequency (%)
,      37     100.0%

Dash Punctuation

Value  Count  Frequency (%)
-      3      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  5380   78.0%
Common  1521   22.0%

Most frequent character per script

Hangul

Value               Count  Frequency (%)
(unrecovered)       374    7.0%
(unrecovered)       339    6.3%
(unrecovered)       329    6.1%
(unrecovered)       328    6.1%
(unrecovered)       266    4.9%
(unrecovered)       254    4.7%
(unrecovered)       196    3.6%
(unrecovered)       178    3.3%
(unrecovered)       175    3.3%
(unrecovered)       173    3.2%
Other values (153)  2768   51.4%

Common

Value    Count  Frequency (%)
(space)  904    59.4%
)        238    15.6%
(        237    15.6%
2        57     3.7%
1        40     2.6%
,        37     2.4%
-        3      0.2%
3        2      0.1%
4        2      0.1%
[        1      0.1%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  5380   78.0%
ASCII   1521   22.0%

Most frequent character per block

ASCII

Value    Count  Frequency (%)
(space)  904    59.4%
)        238    15.6%
(        237    15.6%
2        57     3.7%
1        40     2.6%
,        37     2.4%
-        3      0.2%
3        2      0.1%
4        2      0.1%
[        1      0.1%

Hangul

Value               Count  Frequency (%)
(unrecovered)       374    7.0%
(unrecovered)       339    6.3%
(unrecovered)       329    6.1%
(unrecovered)       328    6.1%
(unrecovered)       266    4.9%
(unrecovered)       254    4.7%
(unrecovered)       196    3.6%
(unrecovered)       178    3.3%
(unrecovered)       175    3.3%
(unrecovered)       173    3.2%
Other values (153)  2768   51.4%

허가일 (permit date)
Date

Distinct: 168
Distinct (%): 44.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB
Minimum: 2023-01-02 00:00:00
Maximum: 2023-11-23 00:00:00
Histogram with fixed size bins (bins=50)

준공일 (completion date)
Date

MISSING 

Distinct: 29
Distinct (%): 87.9%
Missing: 344
Missing (%): 91.2%
Memory size: 3.1 KiB
Minimum: 2023-02-27 00:00:00
Maximum: 2023-11-21 00:00:00
Histogram with fixed size bins (bins=29)

데이터 기준일 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB
Minimum: 2023-11-30 00:00:00
Maximum: 2023-11-30 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

          허가면적  준공일
허가면적  1.000     0.000
준공일    0.000     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
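
Both views are built from the same presence/absence information. A minimal sketch, using two columns from the first five rows of the sample in this report, with `None` standing in for `<NA>`; the variable names are illustrative:

```python
# (허가번호, 준공일) pairs from the first five sample rows;
# None stands for the <NA> shown in the report.
rows = [
    ("2023-003", None),
    ("2023-008", None),
    ("2023-017", "2023-04-19"),
    ("2023-018", None),
    ("2023-004", None),
]
columns = ["허가번호", "준공일"]

# Nullity matrix: True where a cell is present, False where it is missing.
matrix = [[cell is not None for cell in row] for row in rows]

# Per-column count of present cells (the by-column summary).
present = {col: sum(row[i] for row in matrix) for i, col in enumerate(columns)}

print(matrix)
print(present)
```

In the full dataset only 준공일 has missing cells, so every other column would be fully populated in both views.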

Sample

First rows

# | 허가번호 | 소재지 | 허가면적 | 용도 | 허가일 | 준공일 | 데이터 기준일
0 | 2023-003 | 충청북도 충주시 엄정면 가춘리 208번지 외1필지 | 990.0 | 제2종근린생활시설(사무소)신축 부지 및 진입로조성 | 2023-01-02 | <NA> | 2023-11-30
1 | 2023-008 | 충청북도 충주시 앙성면 본평리 26-7 | 659.0 | 단독(일반)주택 신축 및 진입로 부지조성 | 2023-01-02 | <NA> | 2023-11-30
2 | 2023-017 | 충청북도 충주시 노은면 신효리 산102 | 99.0 | 묘지관련시설(가족묘지) 부지조성 | 2023-01-04 | 2023-04-19 | 2023-11-30
3 | 2023-018 | 충청북도 충주시 앙성면 돈산리 169번지 외 2필지 | 1950.0 | 공동주택(기숙사)신축 부지조성 | 2023-01-04 | <NA> | 2023-11-30
4 | 2023-004 | 충청북도 충주시 금가면 원포리 631 | 549.0 | 단독(일반)주택 및 제1종근린생활시설(소매점)신축 부지 및 진입 | 2023-01-06 | <NA> | 2023-11-30
5 | 2023-016 | 충청북도 충주시 주덕읍 화곡리 995번지 외 7필지 | 28313.7 | 육상골재채취(토석채취) | 2023-01-06 | <NA> | 2023-11-30
6 | 2023-002 | 충청북도 충주시 동량면 하천리 227번지 | 998.0 | 단독주택 신축 부지조성 | 2023-01-09 | <NA> | 2023-11-30
7 | 2023-009 | 충청북도 충주시 주덕읍 당우리 산2-104번지 외 1필지 | 3151.0 | 제2종근린생활시설(사무소)신축 부지조성 | 2023-01-10 | <NA> | 2023-11-30
8 | 2023-109 | 충청북도 충주시 앙성면 조천리 334-1번지 | 1040.0 | 단독(일반)주택 신축 부지조성 | 2023-01-12 | <NA> | 2023-11-30
9 | 2023-001 | 충청북도 충주시 신니면 화석리 203-1번지 외 3필지←2필지 | 5758.0 | 자재 야적장 부지조성 | 2023-01-13 | <NA> | 2023-11-30

Last rows

# | 허가번호 | 소재지 | 허가면적 | 용도 | 허가일 | 준공일 | 데이터 기준일
367 | 2023-331 | 충청북도 충주시 금가면 문산리 574번지 외 2필지 | 3007.1 | 공장증설 부지조성 | 2023-11-16 | <NA> | 2023-11-30
368 | 2023-73 | 충청북도 충주시 앙성면 지당리 270-8번지 외 2필지 | 122.0 | 단독주택 진입로 조성 | 2023-11-17 | <NA> | 2023-11-30
369 | 2023-332 | 충청북도 충주시 교현동 107-1번지 외 2필지 | 716.0 | 제1,2종 근생(소매점, 일반음식점) 증축 부지조성 | 2023-11-20 | <NA> | 2023-11-30
370 | 2023-336 | 충청북도 충주시 산척면 송강리 765-50번지 | 360.0 | 농업용창고 신축 부지조성 | 2023-11-20 | <NA> | 2023-11-30
371 | 2023-74 | 충청북도 충주시 안림동 134-137번지 | 161.0 | 진입로 조성 | 2023-11-21 | <NA> | 2023-11-30
372 | 2023-330 | 충청북도 충주시 중앙탑면 가흥리 894-3번지 외 2필지 | 7723.0 | 제1종 근생(소매점),제2종 근생(사무소,수리점) 신축 부지조성 | 2023-11-21 | <NA> | 2023-11-30
373 | 2023-333 | 충청북도 충주시 용산동 1941-2번지 외 1필지 | 199.0 | 제1,2종근생(소매점,미용원,일반음식점)및단독주택 신축부지조성 | 2023-11-21 | <NA> | 2023-11-30
374 | 2023-315 | 충청북도 충주시 용산동 1817번지 | 1091.0 | 단독주택 신축 및 진입로 부지 조성 | 2023-11-22 | <NA> | 2023-11-30
375 | 2023-334 | 충청북도 충주시 금가면 오석리 512-3번지 | 686.0 | 단독주택 신축 부지조성 | 2023-11-22 | <NA> | 2023-11-30
376 | 2023-75 | 충청북도 충주시 용탄동 886-3번지 | 74.0 | 공작물 설치(건물위 태양광발전시설) | 2023-11-23 | <NA> | 2023-11-30