Overview

Dataset statistics

Number of variables3
Number of observations234
Missing cells0
Missing cells (%)0.0%
Duplicate rows11
Duplicate rows (%)4.7%
Total size in memory5.6 KiB
Average record size in memory24.6 B

Variable types

DateTime1
Text1
Categorical1

Dataset

Description한국자산관리공사_국유재산 가설건축물 신청현황("가설건축물신청일자","가설건축물용도","대부용도") 데이터 제공
Author한국자산관리공사
URLhttps://www.data.go.kr/data/15074512/fileData.do

Alerts

Dataset has 11 (4.7%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 00:15:18.344257
Analysis finished2023-12-12 00:15:18.565857
Duration0.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct138
Distinct (%)59.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2019-01-01 00:00:00
Maximum2019-12-30 00:00:00
2023-12-12T09:15:18.617007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:15:18.716243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct146
Distinct (%)62.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T09:15:18.926023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length18
Mean length6.3547009
Min length1

Characters and Unicode

Total characters1487
Distinct characters176
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)52.1%

Sample

1st row시설물
2nd row철거비
3rd row게이트볼장
4th row가설전람회장(모델하우스)
5th row굴 박신장
ValueCountFrequency (%)
창고 41
 
11.3%
사무실 26
 
7.2%
굴박신장 18
 
5.0%
18
 
5.0%
임시사무실 11
 
3.0%
비닐하우스 9
 
2.5%
보관 9
 
2.5%
8
 
2.2%
농막 7
 
1.9%
농업용 6
 
1.7%
Other values (160) 210
57.9%
2023-12-12T09:15:19.262292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
129
 
8.7%
69
 
4.6%
65
 
4.4%
52
 
3.5%
51
 
3.4%
48
 
3.2%
47
 
3.2%
46
 
3.1%
45
 
3.0%
37
 
2.5%
Other values (166) 898
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1309
88.0%
Space Separator 129
 
8.7%
Other Punctuation 15
 
1.0%
Close Punctuation 14
 
0.9%
Open Punctuation 14
 
0.9%
Uppercase Letter 4
 
0.3%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
5.3%
65
 
5.0%
52
 
4.0%
51
 
3.9%
48
 
3.7%
47
 
3.6%
46
 
3.5%
45
 
3.4%
37
 
2.8%
36
 
2.8%
Other values (156) 813
62.1%
Other Punctuation
ValueCountFrequency (%)
, 12
80.0%
· 2
 
13.3%
/ 1
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
T 1
25.0%
V 1
25.0%
Space Separator
ValueCountFrequency (%)
129
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1309
88.0%
Common 174
 
11.7%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
5.3%
65
 
5.0%
52
 
4.0%
51
 
3.9%
48
 
3.7%
47
 
3.6%
46
 
3.5%
45
 
3.4%
37
 
2.8%
36
 
2.8%
Other values (156) 813
62.1%
Common
ValueCountFrequency (%)
129
74.1%
) 14
 
8.0%
( 14
 
8.0%
, 12
 
6.9%
2 2
 
1.1%
· 2
 
1.1%
/ 1
 
0.6%
Latin
ValueCountFrequency (%)
C 2
50.0%
T 1
25.0%
V 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1308
88.0%
ASCII 176
 
11.8%
None 2
 
0.1%
Compat Jamo 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
129
73.3%
) 14
 
8.0%
( 14
 
8.0%
, 12
 
6.8%
2 2
 
1.1%
C 2
 
1.1%
T 1
 
0.6%
V 1
 
0.6%
/ 1
 
0.6%
Hangul
ValueCountFrequency (%)
69
 
5.3%
65
 
5.0%
52
 
4.0%
51
 
3.9%
48
 
3.7%
47
 
3.6%
46
 
3.5%
45
 
3.4%
37
 
2.8%
36
 
2.8%
Other values (155) 812
62.1%
None
ValueCountFrequency (%)
· 2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

대부용도
Categorical

Distinct23
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
잡종지
60 
54 
대지
29 
공장용지
19 
창고용지
11 
Other values (18)
61 

Length

Max length5
Median length4
Mean length2.542735
Min length1

Unique

Unique9 ?
Unique (%)3.8%

Sample

1st row
2nd row과수원
3rd row체육용지
4th row대지
5th row잡종지

Common Values

ValueCountFrequency (%)
잡종지 60
25.6%
54
23.1%
대지 29
12.4%
공장용지 19
 
8.1%
창고용지 11
 
4.7%
9
 
3.8%
주택부지 9
 
3.8%
주차장용지 8
 
3.4%
도로 7
 
3.0%
토지 7
 
3.0%
Other values (13) 21
 
9.0%

Length

2023-12-12T09:15:19.396886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
잡종지 60
25.6%
54
23.1%
대지 29
12.4%
공장용지 19
 
8.1%
창고용지 11
 
4.7%
9
 
3.8%
주택부지 9
 
3.8%
주차장용지 8
 
3.4%
도로 7
 
3.0%
토지 7
 
3.0%
Other values (13) 21
 
9.0%

Missing values

2023-12-12T09:15:18.477853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:15:18.542485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

가설건축물신청일자가설건축물용도대부용도
02019-01-01시설물
12019-01-01철거비과수원
22019-01-10게이트볼장체육용지
32019-01-11가설전람회장(모델하우스)대지
42019-01-16굴 박신장잡종지
52019-01-25가설건축물(양어장, 관리사) 2동제방
62019-01-25화물적재시 비막이잡종지
72019-01-28컨테이너박스잡종지
82019-01-30농수산물직거래장대지
92019-02-01창고상가부지
가설건축물신청일자가설건축물용도대부용도
2242019-12-20임시휴게실공장용지
2252019-12-20임시사무실공장용지
2262019-12-20임시사무실공장용지
2272019-12-23농막
2282019-12-23농업용 자재창고
2292019-12-24비닐하우스
2302019-12-26주택부지
2312019-12-26농막
2322019-12-27컨테이너(사무실)양어장
2332019-12-30토사물 유입 방지 등을 위한 옹벽토지

Duplicate rows

Most frequently occurring

가설건축물신청일자가설건축물용도대부용도# duplicates
82019-11-18굴박신장창고용지8
72019-11-18굴박신장잡종지5
102019-12-20임시사무실공장용지4
02019-03-07창고잡종지3
12019-04-11사무실 및 창고토지2
22019-06-20농자재 보관용 가설창고2
32019-07-01사무실대지2
42019-07-11비닐하우스2
52019-07-18창고잡종지2
62019-09-17지하수관정설치2