Overview

Dataset statistics

Number of variables4
Number of observations2078
Missing cells106
Missing cells (%)1.3%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory65.1 KiB
Average record size in memory32.1 B

Variable types

Categorical1
Text1
DateTime2

Dataset

Description서울특별시 강동구의 건축허가현황입니다.건축구분, 대지위치, 착공예정일, 사용승인일의 내용을 제공합니다.
Author서울특별시 강동구
URLhttps://www.data.go.kr/data/15127638/fileData.do

Alerts

건축구분 has constant value ""Constant
Dataset has 1 (< 0.1%) duplicate rowsDuplicates
사용승인일 has 105 (5.1%) missing valuesMissing

Reproduction

Analysis started2024-04-21 02:38:46.754084
Analysis finished2024-04-21 02:38:47.421386
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

건축구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size16.4 KiB
신축
2078 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신축
2nd row신축
3rd row신축
4th row신축
5th row신축

Common Values

ValueCountFrequency (%)
신축 2078
100.0%

Length

2024-04-21T11:38:47.474515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:38:47.557222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신축 2078
100.0%
Distinct2077
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size16.4 KiB
2024-04-21T11:38:47.697632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length39
Mean length21.005775
Min length15

Characters and Unicode

Total characters43650
Distinct characters74
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2076 ?
Unique (%)99.9%

Sample

1st row서울특별시 강동구 고덕동 649
2nd row서울특별시 강동구 강일동 667-19
3rd row서울특별시 강동구 천호동 191-25 외1필지
4th row서울특별시 강동구 암사동 250-1
5th row서울특별시 강동구 강일동 667-70
ValueCountFrequency (%)
강동구 2079
23.0%
서울특별시 2078
23.0%
천호동 693
 
7.7%
외1필지 472
 
5.2%
성내동 390
 
4.3%
암사동 336
 
3.7%
고덕동 183
 
2.0%
길동 167
 
1.8%
둔촌동 130
 
1.4%
외2필지 89
 
1.0%
Other values (2053) 2419
26.8%
2024-04-21T11:38:47.977817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6960
15.9%
4158
 
9.5%
2164
 
5.0%
2112
 
4.8%
2086
 
4.8%
2086
 
4.8%
2085
 
4.8%
2078
 
4.8%
2078
 
4.8%
1 2068
 
4.7%
Other values (64) 15775
36.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25119
57.5%
Decimal Number 9620
 
22.0%
Space Separator 6960
 
15.9%
Dash Punctuation 1931
 
4.4%
Uppercase Letter 15
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4158
16.6%
2164
8.6%
2112
8.4%
2086
8.3%
2086
8.3%
2085
8.3%
2078
8.3%
2078
8.3%
693
 
2.8%
693
 
2.8%
Other values (47) 4886
19.5%
Decimal Number
ValueCountFrequency (%)
1 2068
21.5%
2 1362
14.2%
3 1350
14.0%
4 1185
12.3%
5 845
8.8%
7 640
 
6.7%
6 604
 
6.3%
9 539
 
5.6%
0 524
 
5.4%
8 503
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
B 8
53.3%
L 7
46.7%
Space Separator
ValueCountFrequency (%)
6960
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1931
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25119
57.5%
Common 18516
42.4%
Latin 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4158
16.6%
2164
8.6%
2112
8.4%
2086
8.3%
2086
8.3%
2085
8.3%
2078
8.3%
2078
8.3%
693
 
2.8%
693
 
2.8%
Other values (47) 4886
19.5%
Common
ValueCountFrequency (%)
6960
37.6%
1 2068
 
11.2%
- 1931
 
10.4%
2 1362
 
7.4%
3 1350
 
7.3%
4 1185
 
6.4%
5 845
 
4.6%
7 640
 
3.5%
6 604
 
3.3%
9 539
 
2.9%
Other values (5) 1032
 
5.6%
Latin
ValueCountFrequency (%)
B 8
53.3%
L 7
46.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25119
57.5%
ASCII 18531
42.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6960
37.6%
1 2068
 
11.2%
- 1931
 
10.4%
2 1362
 
7.3%
3 1350
 
7.3%
4 1185
 
6.4%
5 845
 
4.6%
7 640
 
3.5%
6 604
 
3.3%
9 539
 
2.9%
Other values (7) 1047
 
5.6%
Hangul
ValueCountFrequency (%)
4158
16.6%
2164
8.6%
2112
8.4%
2086
8.3%
2086
8.3%
2085
8.3%
2078
8.3%
2078
8.3%
693
 
2.8%
693
 
2.8%
Other values (47) 4886
19.5%
Distinct1283
Distinct (%)61.8%
Missing1
Missing (%)< 0.1%
Memory size16.4 KiB
Minimum2015-01-17 00:00:00
Maximum2024-04-01 00:00:00
2024-04-21T11:38:48.095048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:38:48.225493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사용승인일
Date

MISSING 

Distinct1208
Distinct (%)61.2%
Missing105
Missing (%)5.1%
Memory size16.4 KiB
Minimum2015-04-16 00:00:00
Maximum2024-03-27 00:00:00
2024-04-21T11:38:48.353035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:38:48.464162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2024-04-21T11:38:47.222177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:38:47.302033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T11:38:47.381743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

건축구분대지위치착공예정일사용승인일
0신축서울특별시 강동구 고덕동 6492024-04-01<NA>
1신축서울특별시 강동구 강일동 667-192024-03-13<NA>
2신축서울특별시 강동구 천호동 191-25 외1필지2024-03-07<NA>
3신축서울특별시 강동구 암사동 250-12024-02-26<NA>
4신축서울특별시 강동구 강일동 667-702024-02-19<NA>
5신축서울특별시 강동구 강일동 667-712024-01-29<NA>
6신축서울특별시 강동구 강일동 667-402024-02-13<NA>
7신축서울특별시 강동구 강일동 667-212024-02-19<NA>
8신축서울특별시 강동구 길동 341-21 외1필지2024-01-03<NA>
9신축서울특별시 강동구 강일동 667-372023-11-28<NA>
건축구분대지위치착공예정일사용승인일
2068신축서울특별시 강동구 성내동 135-3 외2필지2015-03-252015-07-13
2069신축서울특별시 강동구 길동 132-22015-03-112015-07-31
2070신축서울특별시 강동구 천호동 110-3 외2필지2015-03-052015-07-01
2071신축서울특별시 강동구 천호동 132-12015-01-202015-04-16
2072신축서울특별시 강동구 성내동 409-52015-06-152015-11-10
2073신축서울특별시 강동구 천호동 294-112015-02-252015-06-10
2074신축서울특별시 강동구 강일동 7022015-04-142016-01-12
2075신축서울특별시 강동구 성내동 521-12015-03-162015-08-10
2076신축서울특별시 강동구 성내동 521-22015-03-162015-08-06
2077신축서울특별시 강동구 성내동 409-162015-04-022015-08-07

Duplicate rows

Most frequently occurring

건축구분대지위치착공예정일사용승인일# duplicates
0신축서울특별시 강동구 둔촌동 170-12023-07-01<NA>2