Overview

Dataset statistics

Number of variables: 7
Number of observations: 38
Missing cells: 18
Missing cells (%): 6.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 61.5 B
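
The missing-cell percentage above can be reproduced from the other figures in the table. The following is a minimal sketch, assuming the percentage is the share of missing cells among all cells (observations times variables); the function name is hypothetical.

```python
def missing_cell_rate(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells among all cells (rows x columns)."""
    return n_missing / (n_rows * n_cols)

# Figures taken from the dataset statistics above.
rate = missing_cell_rate(n_missing=18, n_rows=38, n_cols=7)
print(f"{rate:.1%}")  # 6.8%
```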

Variable types

Numeric: 2
Text: 1
Categorical: 2
DateTime: 2

Dataset

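Description: Information on private construction sites in Geumcheon-gu, Seoul, covering location, size, main use, construction start date, construction end date, and data reference date.
Author: Geumcheon-gu, Seoul (서울특별시 금천구)
URL: https://www.data.go.kr/data/15108273/fileData.do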

Alerts

데이터 기준일자 has constant value "2023-11-22" (Constant)
연번 is highly overall correlated with 규모(제곱미터) (High correlation)
규모(제곱미터) is highly overall correlated with 연번 (High correlation)
공사 종료일 has 18 (47.4%) missing values (Missing)
연번 has unique values (Unique)
위치 has unique values (Unique)
규모(제곱미터) has unique values (Unique)
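
To make the alert labels above concrete, here is a rough sketch of how Missing, Constant, and Unique flags could be derived for a single column. The rules and the function name `column_alerts` are illustrative assumptions, not the definitions used by ydata-profiling, and the example values are a small hand-picked subset rather than the full dataset.

```python
from typing import Optional, Sequence

def column_alerts(values: Sequence[Optional[object]]) -> list[str]:
    """Return simple alert labels for one column (None marks a missing cell)."""
    present = [v for v in values if v is not None]
    distinct = set(present)
    alerts = []
    if len(present) < len(values):
        alerts.append("Missing")       # at least one cell is empty
    if present and len(distinct) == 1:
        alerts.append("Constant")      # every present value is identical
    if len(present) > 1 and len(distinct) == len(present):
        alerts.append("Unique")        # no present value repeats
    return alerts

# Illustrative values only (assumed subset, not the real 38 rows).
print(column_alerts([1, 2, 3, 4]))                                    # ['Unique']
print(column_alerts(["2023-11-22"] * 4))                              # ['Constant']
print(column_alerts(["2023-11-30", "2024-02-28", "2024-02-28", None]))  # ['Missing']
```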

Reproduction

Analysis started: 2023-12-12 04:36:26.505612
Analysis finished: 2023-12-12 04:36:27.218722
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 38
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.5
Minimum: 1
Maximum: 38
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 474.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.85
Q1: 10.25
Median: 19.5
Q3: 28.75
95-th percentile: 36.15
Maximum: 38
Range: 37
Interquartile range (IQR): 18.5
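
Because 연번 takes the values 1 through 38 exactly once, the quantiles above can be checked by hand. The sketch below assumes percentiles are obtained by linear interpolation between neighbouring order statistics, which is consistent with the figures shown; the report itself does not state the method, and `percentile` is a hypothetical helper.

```python
def percentile(sorted_values, q):
    """q-th percentile (0-100) by linear interpolation between neighbours."""
    pos = (len(sorted_values) - 1) * q / 100   # fractional index into the data
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)

serial = list(range(1, 39))  # 연번: 1, 2, ..., 38

print(round(percentile(serial, 5), 2))    # 2.85
print(round(percentile(serial, 25), 2))   # 10.25
print(round(percentile(serial, 75), 2))   # 28.75
print(round(percentile(serial, 95), 2))   # 36.15
print(round(percentile(serial, 75) - percentile(serial, 25), 2))  # 18.5
```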

Descriptive statistics

Standard deviation: 11.113055
Coefficient of variation (CV): 0.56990028
Kurtosis: -1.2
Mean: 19.5
Median Absolute Deviation (MAD): 9.5
Skewness: 0
Sum: 741
Variance: 123.5
Monotonicity: Strictly increasing
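
Several of the descriptive statistics above follow directly from the values 1 through 38. The sketch below uses Python's standard `statistics` module and assumes the sample (n - 1) variance, which matches the reported 123.5. Skewness and kurtosis are left out because the standard library has no functions for them.

```python
import statistics

serial = list(range(1, 39))  # 연번: 1, 2, ..., 38

mean = statistics.mean(serial)
variance = statistics.variance(serial)   # sample variance, n - 1 denominator
std = statistics.stdev(serial)
cv = std / mean                          # coefficient of variation
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)  # median absolute deviation

print(sum(serial))     # 741
print(mean)            # 19.5
print(variance)        # 123.5
print(round(std, 6))   # 11.113055
print(round(cv, 8))    # 0.56990028
print(mad)             # 9.5
```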
Histogram with fixed size bins (bins=38)
Common values
Value | Count | Frequency (%)
1 | 1 | 2.6%
30 | 1 | 2.6%
23 | 1 | 2.6%
24 | 1 | 2.6%
25 | 1 | 2.6%
26 | 1 | 2.6%
27 | 1 | 2.6%
28 | 1 | 2.6%
29 | 1 | 2.6%
31 | 1 | 2.6%
Other values (28) | 28 | 73.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.6%
2 | 1 | 2.6%
3 | 1 | 2.6%
4 | 1 | 2.6%
5 | 1 | 2.6%
6 | 1 | 2.6%
7 | 1 | 2.6%
8 | 1 | 2.6%
9 | 1 | 2.6%
10 | 1 | 2.6%

Maximum 10 values
Value | Count | Frequency (%)
38 | 1 | 2.6%
37 | 1 | 2.6%
36 | 1 | 2.6%
35 | 1 | 2.6%
34 | 1 | 2.6%
33 | 1 | 2.6%
32 | 1 | 2.6%
31 | 1 | 2.6%
30 | 1 | 2.6%
29 | 1 | 2.6%

위치
Text

UNIQUE 

Distinct: 38
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 31
Median length: 26
Mean length: 21.973684
Min length: 18

Characters and Unicode

Total characters: 835
Distinct characters: 28
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
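
As an illustration of the character properties mentioned above, the sketch below counts Unicode general categories for one address from the Sample rows using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, and `Pd` is Dash Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

address = "서울특별시 금천구 독산1동 146-5 외1"  # 1st sample row of 위치
counts = category_counts(address)

print(counts["Lo"])  # 12 Hangul syllables (Other Letter)
print(counts["Nd"])  # 6 digits (Decimal Number)
print(counts["Zs"])  # 4 spaces (Space Separator)
print(counts["Pd"])  # 1 hyphen (Dash Punctuation)
```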

Unique

Unique: 38
Unique (%): 100.0%

Sample

1st row: 서울특별시 금천구 독산1동 146-5 외1
2nd row: 서울특별시 금천구 가산동 238-104
3rd row: 서울특별시 금천구 독산1동 148-30
4th row: 서울특별시 금천구 가산동 235-49
5th row: 서울특별시 금천구 독산1동 1008-11 외2
Value | Count | Frequency (%)
서울특별시 | 38 | 23.6%
금천구 | 38 | 23.6%
가산동 | 14 | 8.7%
독산1동 | 11 | 6.8%
외1 | 4 | 2.5%
외1필지 | 3 | 1.9%
독산3동 | 3 | 1.9%
시흥1동 | 3 | 1.9%
시흥5동 | 3 | 1.9%
외2 | 2 | 1.2%
Other values (41) | 42 | 26.1%

Most occurring characters

ValueCountFrequency (%)
153
18.3%
1 51
 
6.1%
46
 
5.5%
38
 
4.6%
- 38
 
4.6%
38
 
4.6%
38
 
4.6%
38
 
4.6%
38
 
4.6%
38
 
4.6%
Other values (18) 319
38.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 433 | 51.9%
Decimal Number | 211 | 25.3%
Space Separator | 153 | 18.3%
Dash Punctuation | 38 | 4.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
10.6%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
30
6.9%
Other values (6) 53
12.2%
Decimal Number
ValueCountFrequency (%)
1 51
24.2%
3 28
13.3%
4 22
10.4%
0 21
10.0%
2 18
 
8.5%
5 17
 
8.1%
9 15
 
7.1%
8 14
 
6.6%
6 13
 
6.2%
7 12
 
5.7%
Space Separator
ValueCountFrequency (%)
153
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 433 | 51.9%
Common | 402 | 48.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
10.6%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
30
6.9%
Other values (6) 53
12.2%
Common
ValueCountFrequency (%)
153
38.1%
1 51
 
12.7%
- 38
 
9.5%
3 28
 
7.0%
4 22
 
5.5%
0 21
 
5.2%
2 18
 
4.5%
5 17
 
4.2%
9 15
 
3.7%
8 14
 
3.5%
Other values (2) 25
 
6.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 433 | 51.9%
ASCII | 402 | 48.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
153
38.1%
1 51
 
12.7%
- 38
 
9.5%
3 28
 
7.0%
4 22
 
5.5%
0 21
 
5.2%
2 18
 
4.5%
5 17
 
4.2%
9 15
 
3.7%
8 14
 
3.5%
Other values (2) 25
 
6.2%
Hangul
ValueCountFrequency (%)
46
10.6%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
38
8.8%
30
6.9%
Other values (6) 53
12.2%

규모(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 38
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15622.561
Minimum: 195.59
Maximum: 91713.04
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 474.0 B

Quantile statistics

Minimum: 195.59
5-th percentile: 291.48
Q1: 548.475
Median: 3785.965
Q3: 22491.275
95-th percentile: 65104.251
Maximum: 91713.04
Range: 91517.45
Interquartile range (IQR): 21942.8

Descriptive statistics

Standard deviation: 23964.655
Coefficient of variation (CV): 1.5339774
Kurtosis: 2.4509132
Mean: 15622.561
Median Absolute Deviation (MAD): 3347.265
Skewness: 1.785753
Sum: 593657.31
Variance: 5.7430467 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=38)
Common values
Value | Count | Frequency (%)
2809.05 | 1 | 2.6%
334.95 | 1 | 2.6%
546.21 | 1 | 2.6%
1996.73 | 1 | 2.6%
1521.13 | 1 | 2.6%
195.59 | 1 | 2.6%
1371.8 | 1 | 2.6%
3572.9 | 1 | 2.6%
976.7 | 1 | 2.6%
585.81 | 1 | 2.6%
Other values (28) | 28 | 73.7%

Minimum 10 values
Value | Count | Frequency (%)
195.59 | 1 | 2.6%
260.2 | 1 | 2.6%
297.0 | 1 | 2.6%
334.95 | 1 | 2.6%
396.64 | 1 | 2.6%
420.82 | 1 | 2.6%
456.58 | 1 | 2.6%
473.02 | 1 | 2.6%
515.9 | 1 | 2.6%
546.21 | 1 | 2.6%

Maximum 10 values
Value | Count | Frequency (%)
91713.04 | 1 | 2.6%
76595.52 | 1 | 2.6%
63076.38 | 1 | 2.6%
61611.85 | 1 | 2.6%
42373.0 | 1 | 2.6%
41310.37 | 1 | 2.6%
36518.35 | 1 | 2.6%
36497.4 | 1 | 2.6%
30515.0 | 1 | 2.6%
24483.24 | 1 | 2.6%

주용도
Categorical

Distinct: 6
Distinct (%): 15.8%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 9
Median length: 4
Mean length: 4.3157895
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 업무시설
2nd row: 업무시설
3rd row: 업무시설
4th row: 업무시설
5th row: 업무시설

Common Values

Value | Count | Frequency (%)
업무시설 | 13 | 34.2%
공동주택 | 8 | 21.1%
공장 | 6 | 15.8%
단독주택 | 5 | 13.2%
제2종근린생활시설 | 4 | 10.5%
방송통신시설 | 2 | 5.3%

Length

Histogram of lengths of the category


착공일
Date

Distinct: 34
Distinct (%): 89.5%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B
Minimum: 2020-11-25 00:00:00
Maximum: 2022-10-19 00:00:00
Histogram with fixed size bins (bins=34)

공사 종료일
Date

MISSING 

Distinct: 18
Distinct (%): 90.0%
Missing: 18
Missing (%): 47.4%
Memory size: 436.0 B
Minimum: 2023-11-30 00:00:00
Maximum: 2224-08-31 00:00:00
Histogram with fixed size bins (bins=18)
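
The reported maximum of 2224-08-31 lies about two centuries after the data reference date (2023-11-22). This may point to a data-entry error in the source, although the report itself does not say so. A minimal sketch of a plausibility check follows; the function name and the 10-year threshold are arbitrary assumptions, and the listed dates are a small illustrative subset of values appearing in this report.

```python
from datetime import date

def implausible_end_dates(end_dates, reference, max_years_ahead=10):
    """Return dates whose year exceeds reference.year + max_years_ahead.

    None entries (missing end dates) are skipped.
    """
    limit_year = reference.year + max_years_ahead
    return [d for d in end_dates if d is not None and d.year > limit_year]

reference = date(2023, 11, 22)  # 데이터 기준일자 from the report
end_dates = [date(2023, 11, 30), date(2024, 2, 28), None, date(2224, 8, 31)]

flagged = implausible_end_dates(end_dates, reference)
print(flagged)  # [datetime.date(2224, 8, 31)]
```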

데이터 기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-11-22
2nd row: 2023-11-22
3rd row: 2023-11-22
4th row: 2023-11-22
5th row: 2023-11-22

Common Values

Value | Count | Frequency (%)
2023-11-22 | 38 | 100.0%

Length

Histogram of lengths of the category


Interactions


Correlations

Variable | 연번 | 위치 | 규모(제곱미터) | 주용도 | 착공일 | 공사 종료일
연번 | 1.000 | 1.000 | 0.489 | 0.562 | 0.767 | 0.983
위치 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
규모(제곱미터) | 0.489 | 1.000 | 1.000 | 0.511 | 0.000 | 0.000
주용도 | 0.562 | 1.000 | 0.511 | 1.000 | 0.768 | 0.840
착공일 | 0.767 | 1.000 | 0.000 | 0.768 | 1.000 | 1.000
공사 종료일 | 0.983 | 1.000 | 0.000 | 0.840 | 1.000 | 1.000

Variable | 연번 | 규모(제곱미터) | 주용도
연번 | 1.000 | -0.710 | 0.305
규모(제곱미터) | -0.710 | 1.000 | 0.306
주용도 | 0.305 | 0.306 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 위치 | 규모(제곱미터) | 주용도 | 착공일 | 공사 종료일 | 데이터 기준일자
0 | 1 | 서울특별시 금천구 독산1동 146-5 외1 | 2809.05 | 업무시설 | 2022-06-08 | 2023-11-30 | 2023-11-22
1 | 2 | 서울특별시 금천구 가산동 238-104 | 4921.76 | 업무시설 | 2021-12-28 | 2023-12-01 | 2023-11-22
2 | 3 | 서울특별시 금천구 독산1동 148-30 | 4510.52 | 업무시설 | 2022-07-08 | 2023-12-08 | 2023-11-22
3 | 4 | 서울특별시 금천구 가산동 235-49 | 5272.84 | 업무시설 | 2022-07-06 | 2023-12-09 | 2023-11-22
4 | 5 | 서울특별시 금천구 독산1동 1008-11 외2 | 7113.1 | 업무시설 | 2021-12-03 | 2023-12-10 | 2023-11-22
5 | 6 | 서울특별시 금천구 가산동 326-4 외1 | 36497.4 | 업무시설 | 2021-08-24 | 2023-12-19 | 2023-11-22
6 | 7 | 서울특별시 금천구 시흥1동 987-9 외2 | 5465.89 | 업무시설 | 2022-03-03 | 2024-02-20 | 2023-11-22
7 | 8 | 서울특별시 금천구 가산동 451-1 | 91713.04 | 공장 | 2020-11-25 | 2024-02-28 | 2023-11-22
8 | 9 | 서울특별시 금천구 가산동 452-1 | 61611.85 | 공장 | 2020-11-25 | 2024-02-28 | 2023-11-22
9 | 10 | 서울특별시 금천구 가산동 459-6 | 30515.0 | 공장 | 2022-03-10 | 2024-02-29 | 2023-11-22

Last rows
Index | 연번 | 위치 | 규모(제곱미터) | 주용도 | 착공일 | 공사 종료일 | 데이터 기준일자
28 | 29 | 서울특별시 금천구 독산1동 1006-143 외1필지 | 976.7 | 공동주택 | 2022-08-31 | <NA> | 2023-11-22
29 | 30 | 서울특별시 금천구 독산2동 1055-7 | 334.95 | 단독주택 | 2022-07-20 | <NA> | 2023-11-22
30 | 31 | 서울특별시 금천구 독산3동 234-17 | 585.81 | 공동주택 | 2022-10-11 | <NA> | 2023-11-22
31 | 32 | 서울특별시 금천구 독산3동 889-1 | 297.0 | 제2종근린생활시설 | 2022-09-30 | <NA> | 2023-11-22
32 | 33 | 서울특별시 금천구 시흥1동 840-34 외1필지 | 555.27 | 공동주택 | 2022-09-16 | <NA> | 2023-11-22
33 | 34 | 서울특별시 금천구 시흥4동 790-45 | 260.2 | 단독주택 | 2022-10-13 | <NA> | 2023-11-22
34 | 35 | 서울특별시 금천구 시흥4동 789-48 | 515.9 | 공동주택 | 2022-10-19 | <NA> | 2023-11-22
35 | 36 | 서울특별시 금천구 시흥5동 837-33 | 473.02 | 단독주택 | 2022-10-18 | <NA> | 2023-11-22
36 | 37 | 서울특별시 금천구 시흥5동 910-16 | 456.58 | 공동주택 | 2022-09-05 | <NA> | 2023-11-22
37 | 38 | 서울특별시 금천구 시흥5동 917-2 외1필지 | 396.64 | 단독주택 | 2022-07-27 | <NA> | 2023-11-22