Overview

Dataset statistics

Number of variables5
Number of observations2629
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory108.0 KiB
Average record size in memory42.1 B

Variable types

Numeric2
Text1
Categorical1
DateTime1

Dataset

Description서울교통공사의 계약체결 현황 정보입니다. 해당 정보는 연번, 계약번호, 계약건명, 계약소속부서명, 계약일자로 구성되어 있습니다. 2023년10월 기준입니다.
Author서울교통공사
URLhttps://www.data.go.kr/data/15052323/fileData.do

Alerts

연번 is highly overall correlated with 계약번호High correlation
계약번호 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
계약번호 has unique valuesUnique

Reproduction

Analysis started2023-12-16 15:48:14.533177
Analysis finished2023-12-16 15:48:18.341690
Duration3.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2629
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1315
Minimum1
Maximum2629
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.2 KiB
2023-12-16T15:48:18.833381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile132.4
Q1658
median1315
Q31972
95-th percentile2497.6
Maximum2629
Range2628
Interquartile range (IQR)1314

Descriptive statistics

Standard deviation759.07125
Coefficient of variation (CV)0.5772405
Kurtosis-1.2
Mean1315
Median Absolute Deviation (MAD)657
Skewness0
Sum3457135
Variance576189.17
MonotonicityStrictly increasing
2023-12-16T15:48:19.584685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1748 1
 
< 0.1%
1750 1
 
< 0.1%
1751 1
 
< 0.1%
1752 1
 
< 0.1%
1753 1
 
< 0.1%
1754 1
 
< 0.1%
1755 1
 
< 0.1%
1756 1
 
< 0.1%
1757 1
 
< 0.1%
Other values (2619) 2619
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2629 1
< 0.1%
2628 1
< 0.1%
2627 1
< 0.1%
2626 1
< 0.1%
2625 1
< 0.1%
2624 1
< 0.1%
2623 1
< 0.1%
2622 1
< 0.1%
2621 1
< 0.1%
2620 1
< 0.1%

계약번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2629
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5072254 × 109
Minimum4.5001052 × 109
Maximum4.6001002 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.2 KiB
2023-12-16T15:48:20.478489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.5001052 × 109
5-th percentile4.5001069 × 109
Q14.5001094 × 109
median4.5001144 × 109
Q34.5001178 × 109
95-th percentile4.6001001 × 109
Maximum4.6001002 × 109
Range99995036
Interquartile range (IQR)8374

Descriptive statistics

Standard deviation25705623
Coefficient of variation (CV)0.0057032033
Kurtosis9.1550833
Mean4.5072254 × 109
Median Absolute Deviation (MAD)4045
Skewness3.3388799
Sum1.1849496 × 1013
Variance6.6077904 × 1014
MonotonicityNot monotonic
2023-12-16T15:48:21.459906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4500105252 1
 
< 0.1%
4500116192 1
 
< 0.1%
4500116128 1
 
< 0.1%
4500116106 1
 
< 0.1%
4500116145 1
 
< 0.1%
4600100197 1
 
< 0.1%
4500116229 1
 
< 0.1%
4500116195 1
 
< 0.1%
4500116178 1
 
< 0.1%
4500116286 1
 
< 0.1%
Other values (2619) 2619
99.6%
ValueCountFrequency (%)
4500105201 1
< 0.1%
4500105240 1
< 0.1%
4500105241 1
< 0.1%
4500105252 1
< 0.1%
4500105262 1
< 0.1%
4500105264 1
< 0.1%
4500105271 1
< 0.1%
4500105281 1
< 0.1%
4500105289 1
< 0.1%
4500105299 1
< 0.1%
ValueCountFrequency (%)
4600100237 1
< 0.1%
4600100236 1
< 0.1%
4600100235 1
< 0.1%
4600100234 1
< 0.1%
4600100233 1
< 0.1%
4600100232 1
< 0.1%
4600100231 1
< 0.1%
4600100230 1
< 0.1%
4600100229 1
< 0.1%
4600100228 1
< 0.1%
Distinct2574
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size20.7 KiB
2023-12-16T15:48:22.671264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length44
Mean length26.391404
Min length8

Characters and Unicode

Total characters69383
Distinct characters573
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2520 ?
Unique (%)95.9%

Sample

1st row지하철 통합관제센터 신축공사 감독권한대행 등 건설사업관리용역
2nd row2022년도 5~8호선 소방설비 법정점검 용역
3rd row5호선 거여 외 2역 승강편의시설 설치공사(47공구)
4th row지하철 3호선 독립문역 등 3개역 환기시스템 개량공사 TAB 기술용역
5th row지하철 3호선 녹번역 등 5개역 환기시스템 개량공사 TAB 기술용역
ValueCountFrequency (%)
474
 
3.3%
구매 359
 
2.5%
265
 
1.8%
2023년 256
 
1.8%
용역 253
 
1.8%
2022년 242
 
1.7%
설치 242
 
1.7%
제조구매 200
 
1.4%
제작구매 198
 
1.4%
전동차 187
 
1.3%
Other values (3335) 11687
81.4%
2023-12-16T15:48:25.424898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11742
 
16.9%
2 2181
 
3.1%
1623
 
2.3%
1602
 
2.3%
1556
 
2.2%
1536
 
2.2%
1502
 
2.2%
1415
 
2.0%
1252
 
1.8%
1217
 
1.8%
Other values (563) 43757
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47861
69.0%
Space Separator 11742
 
16.9%
Decimal Number 5802
 
8.4%
Uppercase Letter 1321
 
1.9%
Close Punctuation 1014
 
1.5%
Open Punctuation 1014
 
1.5%
Other Punctuation 265
 
0.4%
Math Symbol 264
 
0.4%
Dash Punctuation 92
 
0.1%
Connector Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1623
 
3.4%
1602
 
3.3%
1556
 
3.3%
1536
 
3.2%
1502
 
3.1%
1415
 
3.0%
1252
 
2.6%
1217
 
2.5%
1072
 
2.2%
1059
 
2.2%
Other values (507) 34027
71.1%
Uppercase Letter
ValueCountFrequency (%)
S 142
10.7%
C 134
 
10.1%
E 104
 
7.9%
P 103
 
7.8%
T 103
 
7.8%
V 100
 
7.6%
A 94
 
7.1%
L 74
 
5.6%
D 73
 
5.5%
B 56
 
4.2%
Other values (15) 338
25.6%
Decimal Number
ValueCountFrequency (%)
2 2181
37.6%
0 724
 
12.5%
3 717
 
12.4%
1 677
 
11.7%
5 415
 
7.2%
4 361
 
6.2%
7 201
 
3.5%
8 197
 
3.4%
9 166
 
2.9%
6 163
 
2.8%
Other Punctuation
ValueCountFrequency (%)
, 193
72.8%
/ 45
 
17.0%
. 12
 
4.5%
· 11
 
4.2%
# 2
 
0.8%
: 2
 
0.8%
Math Symbol
ValueCountFrequency (%)
~ 252
95.5%
+ 9
 
3.4%
= 2
 
0.8%
1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 924
91.1%
] 88
 
8.7%
2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 923
91.0%
[ 89
 
8.8%
2
 
0.2%
Connector Punctuation
ValueCountFrequency (%)
_ 4
80.0%
_ 1
 
20.0%
Space Separator
ValueCountFrequency (%)
11742
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47861
69.0%
Common 20201
29.1%
Latin 1321
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1623
 
3.4%
1602
 
3.3%
1556
 
3.3%
1536
 
3.2%
1502
 
3.1%
1415
 
3.0%
1252
 
2.6%
1217
 
2.5%
1072
 
2.2%
1059
 
2.2%
Other values (507) 34027
71.1%
Common
ValueCountFrequency (%)
11742
58.1%
2 2181
 
10.8%
) 924
 
4.6%
( 923
 
4.6%
0 724
 
3.6%
3 717
 
3.5%
1 677
 
3.4%
5 415
 
2.1%
4 361
 
1.8%
~ 252
 
1.2%
Other values (21) 1285
 
6.4%
Latin
ValueCountFrequency (%)
S 142
10.7%
C 134
 
10.1%
E 104
 
7.9%
P 103
 
7.8%
T 103
 
7.8%
V 100
 
7.6%
A 94
 
7.1%
L 74
 
5.6%
D 73
 
5.5%
B 56
 
4.2%
Other values (15) 338
25.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47860
69.0%
ASCII 21502
31.0%
None 16
 
< 0.1%
CJK Compat 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11742
54.6%
2 2181
 
10.1%
) 924
 
4.3%
( 923
 
4.3%
0 724
 
3.4%
3 717
 
3.3%
1 677
 
3.1%
5 415
 
1.9%
4 361
 
1.7%
~ 252
 
1.2%
Other values (40) 2586
 
12.0%
Hangul
ValueCountFrequency (%)
1623
 
3.4%
1602
 
3.3%
1556
 
3.3%
1536
 
3.2%
1502
 
3.1%
1415
 
3.0%
1252
 
2.6%
1217
 
2.5%
1072
 
2.2%
1059
 
2.2%
Other values (506) 34026
71.1%
None
ValueCountFrequency (%)
· 11
68.8%
2
 
12.5%
2
 
12.5%
_ 1
 
6.2%
CJK Compat
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size20.7 KiB
계약처-계약팀
1242 
계약처-계약제도팀
1148 
계약처-계약처
216 
9호선운영부문
 
23

Length

Max length9
Median length7
Mean length7.8733359
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row계약처-계약팀
2nd row계약처-계약팀
3rd row계약처-계약팀
4th row계약처-계약팀
5th row계약처-계약팀

Common Values

ValueCountFrequency (%)
계약처-계약팀 1242
47.2%
계약처-계약제도팀 1148
43.7%
계약처-계약처 216
 
8.2%
9호선운영부문 23
 
0.9%

Length

2023-12-16T15:48:26.081061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:48:26.678029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
계약처-계약팀 1242
47.2%
계약처-계약제도팀 1148
43.7%
계약처-계약처 216
 
8.2%
9호선운영부문 23
 
0.9%
Distinct433
Distinct (%)16.5%
Missing0
Missing (%)0.0%
Memory size20.7 KiB
Minimum2022-01-03 00:00:00
Maximum2023-10-31 00:00:00
2023-12-16T15:48:27.350202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:48:28.065675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-16T15:48:16.222337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:48:15.522877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:48:16.663156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:48:15.824067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T15:48:28.630762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번계약번호계약소속부서명
연번1.0000.2310.207
계약번호0.2311.0000.139
계약소속부서명0.2070.1391.000
2023-12-16T15:48:29.014262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번계약번호계약소속부서명
연번1.0000.8350.125
계약번호0.8351.0000.092
계약소속부서명0.1250.0921.000

Missing values

2023-12-16T15:48:17.627234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:48:18.081335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번계약번호계약건명계약소속부서명계약일자
014500105252지하철 통합관제센터 신축공사 감독권한대행 등 건설사업관리용역계약처-계약팀2022-01-03
1246001000412022년도 5~8호선 소방설비 법정점검 용역계약처-계약팀2022-01-05
2345001052015호선 거여 외 2역 승강편의시설 설치공사(47공구)계약처-계약팀2022-01-06
344500105241지하철 3호선 독립문역 등 3개역 환기시스템 개량공사 TAB 기술용역계약처-계약팀2022-01-10
454500105240지하철 3호선 녹번역 등 5개역 환기시스템 개량공사 TAB 기술용역계약처-계약팀2022-01-10
5646001000427호선(남부구간) 에스컬레이터 유지관리 용역계약처-계약팀2022-01-10
674500105262구산역 승강편의시설 설치공사 소규모 지하안전영향평가 재협의 용역계약처-계약팀2022-01-13
784500105264지하철 승강편의시설 설치 토목건축공사(C-4 공구) 사후지하안전영향 조사 용역계약처-계약팀2022-01-14
894500105281지하철 3호선 독립문역 등 3개역 환기시스템 개량 소방공사 감리용역계약처-계약팀2022-01-17
9104500105271레일탐상차2호 차축 교체 수선계약처-계약제도팀2022-01-17
연번계약번호계약건명계약소속부서명계약일자
2619262045001205842023년 5, 6호선 승무분야 근무환경 개선공사(건축)계약처-계약팀2023-10-30
262026214500120648전자분야 하남구간 RF 단말기 구매계약처-계약제도팀2023-10-30
262126224500120583추락방지용 안전난간 제조구매계약처-계약제도팀2023-10-30
262226234500120645대용량 공기청정기 금속필터 제작구매계약처-계약제도팀2023-10-30
2623262445001206622023년 친환경(축전지형) 궤도검측차 제작 구매계약처-계약제도팀2023-10-30
2624262545001205922023년 5호선 오목교역 공용공간 조명시설 개선 전기공사계약처-계약팀2023-10-31
2625262645001205932023년 서울(1)역 등 4역 시설물 보수 전기공사계약처-계약팀2023-10-31
2626262745001205902023년 수서차량기지 검수고 근무환경개선 전기공사계약처-계약팀2023-10-31
2627262845001206092023년 정기세무조사 지원 세무대리 용역계약처-계약처2023-10-31
2628262945001206112023년 1~4호선 전기모터카 1년검수 용역계약처-계약처2023-10-31