Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells9
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Numeric2
Categorical3
Text1
DateTime2

Dataset

Description서울시설공단에서 관리(감독) 중인 서울시 조경공사에 대해 년도, 공사유형, 공사명, 총공사비, 착공일, 준공일, 발주처 정보를 제공합니다.
Author서울시설공단
URLhttps://www.data.go.kr/data/15069122/fileData.do

Alerts

년도 has constant value ""Constant
착공일 has 4 (4.0%) missing valuesMissing
준공일 has 4 (4.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:46:22.093588
Analysis finished2023-12-12 06:46:23.477436
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T15:46:23.555711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-12T15:46:23.706882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 100
100.0%

Length

2023-12-12T15:46:23.863302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:46:23.960825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 100
100.0%

공사유형
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
공원조성
36 
공원정비
30 
녹지조성
21 
등산로조성
10 
기타
 
3

Length

Max length5
Median length4
Mean length4.04
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등산로조성
2nd row등산로조성
3rd row등산로조성
4th row공원정비
5th row공원조성

Common Values

ValueCountFrequency (%)
공원조성 36
36.0%
공원정비 30
30.0%
녹지조성 21
21.0%
등산로조성 10
 
10.0%
기타 3
 
3.0%

Length

2023-12-12T15:46:24.056635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:46:24.159645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공원조성 36
36.0%
공원정비 30
30.0%
녹지조성 21
21.0%
등산로조성 10
 
10.0%
기타 3
 
3.0%
Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-12T15:46:24.385345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length22.5
Mean length18
Min length9

Characters and Unicode

Total characters1800
Distinct characters225
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)92.0%

Sample

1st row오동근린공원 산책로 정비사업
2nd row근교산 무장애숲길 조성사업
3rd row아차산 무장애숲길 조성사업
4th row아차산 등산로 정비사업
5th row더불어숲 시설개선사업
ValueCountFrequency (%)
조성사업 26
 
7.3%
2020년 20
 
5.6%
정비사업 17
 
4.8%
11
 
3.1%
조성공사 8
 
2.3%
2019년 7
 
2.0%
정비공사 7
 
2.0%
무장애숲길 4
 
1.1%
아차산 4
 
1.1%
등산로 4
 
1.1%
Other values (182) 247
69.6%
2023-12-12T15:46:24.828547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
255
 
14.2%
101
 
5.6%
76
 
4.2%
65
 
3.6%
56
 
3.1%
2 52
 
2.9%
49
 
2.7%
0 47
 
2.6%
46
 
2.6%
44
 
2.4%
Other values (215) 1009
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1367
75.9%
Space Separator 255
 
14.2%
Decimal Number 122
 
6.8%
Close Punctuation 26
 
1.4%
Open Punctuation 26
 
1.4%
Other Punctuation 3
 
0.2%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
7.4%
76
 
5.6%
65
 
4.8%
56
 
4.1%
49
 
3.6%
46
 
3.4%
44
 
3.2%
39
 
2.9%
32
 
2.3%
27
 
2.0%
Other values (202) 832
60.9%
Decimal Number
ValueCountFrequency (%)
2 52
42.6%
0 47
38.5%
1 8
 
6.6%
9 8
 
6.6%
3 4
 
3.3%
4 2
 
1.6%
6 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
· 1
33.3%
Space Separator
ValueCountFrequency (%)
255
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1367
75.9%
Common 433
 
24.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
7.4%
76
 
5.6%
65
 
4.8%
56
 
4.1%
49
 
3.6%
46
 
3.4%
44
 
3.2%
39
 
2.9%
32
 
2.3%
27
 
2.0%
Other values (202) 832
60.9%
Common
ValueCountFrequency (%)
255
58.9%
2 52
 
12.0%
0 47
 
10.9%
) 26
 
6.0%
( 26
 
6.0%
1 8
 
1.8%
9 8
 
1.8%
3 4
 
0.9%
4 2
 
0.5%
, 2
 
0.5%
Other values (3) 3
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1367
75.9%
ASCII 432
 
24.0%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
255
59.0%
2 52
 
12.0%
0 47
 
10.9%
) 26
 
6.0%
( 26
 
6.0%
1 8
 
1.9%
9 8
 
1.9%
3 4
 
0.9%
4 2
 
0.5%
, 2
 
0.5%
Other values (2) 2
 
0.5%
Hangul
ValueCountFrequency (%)
101
 
7.4%
76
 
5.6%
65
 
4.8%
56
 
4.1%
49
 
3.6%
46
 
3.4%
44
 
3.2%
39
 
2.9%
32
 
2.3%
27
 
2.0%
Other values (202) 832
60.9%
None
ValueCountFrequency (%)
· 1
100.0%

총공사비(백만원)
Real number (ℝ)

Distinct93
Distinct (%)93.9%
Missing1
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean748.88889
Minimum132
Maximum3939
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T15:46:25.017718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum132
5-th percentile193.4
Q1299
median504
Q3793.5
95-th percentile2722
Maximum3939
Range3807
Interquartile range (IQR)494.5

Descriptive statistics

Standard deviation760.33286
Coefficient of variation (CV)1.0152813
Kurtosis6.5674006
Mean748.88889
Median Absolute Deviation (MAD)223
Skewness2.5172709
Sum74140
Variance578106.06
MonotonicityNot monotonic
2023-12-12T15:46:25.168702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
360 2
 
2.0%
2722 2
 
2.0%
298 2
 
2.0%
188 2
 
2.0%
3939 2
 
2.0%
518 2
 
2.0%
576 1
 
1.0%
257 1
 
1.0%
512 1
 
1.0%
844 1
 
1.0%
Other values (83) 83
83.0%
ValueCountFrequency (%)
132 1
1.0%
134 1
1.0%
177 1
1.0%
188 2
2.0%
194 1
1.0%
206 1
1.0%
222 1
1.0%
230 1
1.0%
235 1
1.0%
241 1
1.0%
ValueCountFrequency (%)
3939 2
2.0%
3081 1
1.0%
2745 1
1.0%
2722 2
2.0%
2271 1
1.0%
2069 1
1.0%
1887 1
1.0%
1813 1
1.0%
1577 1
1.0%
1543 1
1.0%

착공일
Date

MISSING 

Distinct71
Distinct (%)74.0%
Missing4
Missing (%)4.0%
Memory size932.0 B
Minimum2018-04-23 00:00:00
Maximum2020-12-18 00:00:00
2023-12-12T15:46:25.307556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:25.435692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

준공일
Date

MISSING 

Distinct69
Distinct (%)71.9%
Missing4
Missing (%)4.0%
Memory size932.0 B
Minimum2020-01-19 00:00:00
Maximum2022-05-31 00:00:00
2023-12-12T15:46:25.578440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:25.768807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

발주처
Categorical

Distinct11
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
공원녹지과
56 
푸른도시과
15 
생태환경과
10 
시설과
공원시설과
 
3
Other values (6)

Length

Max length5
Median length5
Mean length4.82
Min length3

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st row공원녹지과
2nd row공원녹지과
3rd row공원녹지과
4th row공원녹지과
5th row푸른도시과

Common Values

ValueCountFrequency (%)
공원녹지과 56
56.0%
푸른도시과 15
 
15.0%
생태환경과 10
 
10.0%
시설과 8
 
8.0%
공원시설과 3
 
3.0%
기반시설과 2
 
2.0%
도로시설과 2
 
2.0%
치수과 1
 
1.0%
도시경관과 1
 
1.0%
교통행정과 1
 
1.0%

Length

2023-12-12T15:46:25.904291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
공원녹지과 56
56.0%
푸른도시과 15
 
15.0%
생태환경과 10
 
10.0%
시설과 8
 
8.0%
공원시설과 3
 
3.0%
기반시설과 2
 
2.0%
도로시설과 2
 
2.0%
치수과 1
 
1.0%
도시경관과 1
 
1.0%
교통행정과 1
 
1.0%

Interactions

2023-12-12T15:46:22.845645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:22.614044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:22.960938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:46:22.711427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:46:25.987764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번공사유형공사명총공사비(백만원)착공일준공일발주처
연번1.0000.7030.9890.4470.8700.9520.532
공사유형0.7031.0001.0000.6810.9000.8960.701
공사명0.9891.0001.0000.7451.0001.0001.000
총공사비(백만원)0.4470.6810.7451.0000.8810.8770.721
착공일0.8700.9001.0000.8811.0000.9910.905
준공일0.9520.8961.0000.8770.9911.0000.908
발주처0.5320.7011.0000.7210.9050.9081.000
2023-12-12T15:46:26.091154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주처공사유형
발주처1.0000.465
공사유형0.4651.000
2023-12-12T15:46:26.186789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번총공사비(백만원)공사유형발주처
연번1.000-0.1130.3560.256
총공사비(백만원)-0.1131.0000.4710.431
공사유형0.3560.4711.0000.465
발주처0.2560.4310.4651.000

Missing values

2023-12-12T15:46:23.125868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:46:23.264895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:46:23.424015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번년도공사유형공사명총공사비(백만원)착공일준공일발주처
012020등산로조성오동근린공원 산책로 정비사업1013<NA><NA>공원녹지과
122020등산로조성근교산 무장애숲길 조성사업4222020-11-052021-04-03공원녹지과
232020등산로조성아차산 무장애숲길 조성사업5042020-11-062021-04-04공원녹지과
342020공원정비아차산 등산로 정비사업3082020-11-062021-04-04공원녹지과
452020공원조성더불어숲 시설개선사업9212020-07-132021-06-30푸른도시과
562020등산로조성영축산 순환산책로 조성사업(정상~광운대역구간)13682020-07-232021-01-18푸른도시과
672020공원조성북악산 도시자연공원(성북지구) 조성사업8152020-10-202021-04-17공원녹지과
782020등산로조성봉화산근린공원 무장애숲길 조성사업10272020-12-182021-06-15공원녹지과
892020공원조성뚝섬 및 망원한강공원 자연(형)호안 복원사업27452020-04-012021-11-30생태환경과
9102020공원조성옥수역하부 한강변 주민휴식공간 조성공사22712020-06-052021-12-31공원시설과
연번년도공사유형공사명총공사비(백만원)착공일준공일발주처
90912020등산로조성2020년 계남근린공원 무장애숲길 조성사업3482020-09-162020-12-14공원녹지과
91922020공원조성길동생태공원 무장애친화공원 조성공사3602020-07-012020-12-20시설과
92932020공원정비시민의숲 시설물 정비공사2982020-04-222020-08-10시설과
93942020공원정비2020년 한강공원 어린이놀이터 조성 및 보수 정비공사2802020-06-012020-09-27녹지관리과
94952020공원조성2020년 뚝섬 이용숲 조성사업8222020-04-062020-07-04생태환경과
95962020공원조성2020년 뚝섬 생태숲 조성사업7872020-04-062020-06-30생태환경과
96972020공원조성2020년 난지 이용숲 조성사업7992020-04-062020-06-26생태환경과
97982020공원조성2020년 난지 완충숲 조성사업7612020-04-062020-06-26생태환경과
98992020공원조성광나루한강공원 자연(형)호안 복원사업27222019-10-182020-12-28생태환경과
991002020공원조성광나루한강공원 자연(형)호안 복원사업27222019-10-182020-12-28생태환경과