Overview

Dataset statistics

Number of variables: 7
Number of observations: 464
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 26.4 KiB
Average record size in memory: 58.3 B

Variable types

Numeric: 2
Categorical: 4
Text: 1

Alerts

연번 is highly overall correlated with 발주계획번호 and 1 other field (High correlation)
발주계획번호 is highly overall correlated with 연번 (High correlation)
구분 is highly overall correlated with 담당부서 (High correlation)
담당부서 is highly overall correlated with 연번 and 1 other field (High correlation)
담당부서 is highly imbalanced (62.8%) (Imbalance)
연번 has unique values (Unique)
발주계획번호 has unique values (Unique)

Reproduction

Analysis started: 2024-04-17 19:13:34.193218
Analysis finished: 2024-04-17 19:13:34.987872
Duration: 0.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
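
The reported duration follows directly from the two timestamps. A minimal sketch of that arithmetic, using only the values printed above (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the "Analysis started" / "Analysis finished" fields.
started = datetime.fromisoformat("2024-04-17 19:13:34.193218")
finished = datetime.fromisoformat("2024-04-17 19:13:34.987872")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

Rounded to two decimals, this agrees with the Duration field.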

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 464
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 232.5
Minimum: 1
Maximum: 464
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 24.15
Q1: 116.75
Median: 232.5
Q3: 348.25
95-th percentile: 440.85
Maximum: 464
Range: 463
Interquartile range (IQR): 231.5

Descriptive statistics

Standard deviation: 134.08952
Coefficient of variation (CV): 0.57672913
Kurtosis: -1.2
Mean: 232.5
Median Absolute Deviation (MAD): 116
Skewness: 0
Sum: 107880
Variance: 17980
Monotonicity: Strictly increasing
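
Because 연번 is unique, strictly increasing, and spans 1 to 464, its statistics can be reproduced from the integer sequence alone. The sketch below assumes the column is exactly 1..464 and that the percentiles use linear interpolation between order statistics (the `inclusive` method of Python's `statistics.quantiles`); the helper name `summarize` is hypothetical.

```python
import statistics

# Assumption: 연번 is exactly the integers 1..464 (unique, strictly increasing).
values = list(range(1, 465))

def summarize(xs):
    """Recompute the report's summary statistics with the standard library."""
    # Cut points at 5%, 10%, ..., 95% using linear interpolation.
    q = statistics.quantiles(xs, n=20, method="inclusive")
    med = statistics.median(xs)
    std = statistics.stdev(xs)          # sample standard deviation (n - 1)
    mean = statistics.mean(xs)
    return {
        "sum": sum(xs),
        "mean": mean,
        "variance": statistics.variance(xs),
        "std": std,
        "cv": std / mean,
        "p5": q[0], "q1": q[4], "q3": q[14], "p95": q[18],
        "mad": statistics.median(abs(x - med) for x in xs),
    }

summary = summarize(values)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Under those assumptions the recomputed values line up with the quantile and descriptive statistics listed for this variable.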
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.2%
307 | 1 | 0.2%
319 | 1 | 0.2%
318 | 1 | 0.2%
317 | 1 | 0.2%
316 | 1 | 0.2%
315 | 1 | 0.2%
314 | 1 | 0.2%
313 | 1 | 0.2%
312 | 1 | 0.2%
Other values (454) | 454 | 97.8%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
464 | 1 | 0.2%
463 | 1 | 0.2%
462 | 1 | 0.2%
461 | 1 | 0.2%
460 | 1 | 0.2%
459 | 1 | 0.2%
458 | 1 | 0.2%
457 | 1 | 0.2%
456 | 1 | 0.2%
455 | 1 | 0.2%

발주계획번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 464
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0208064 × 10^12
Minimum: 2.020013 × 10^12
Maximum: 2.021052 × 10^12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.2 KiB

Quantile statistics

Minimum: 2.020013 × 10^12
5-th percentile: 2.0201212 × 10^12
Q1: 2.0210105 × 10^12
Median: 2.0210113 × 10^12
Q3: 2.0210321 × 10^12
95-th percentile: 2.0210506 × 10^12
Maximum: 2.021052 × 10^12
Range: 1.039 × 10^9
Interquartile range (IQR): 21550034

Descriptive statistics

Standard deviation: 3.9092246 × 10^8
Coefficient of variation (CV): 0.00019344875
Kurtosis: -0.47832835
Mean: 2.0208064 × 10^12
Median Absolute Deviation (MAD): 20499908
Skewness: -1.2282814
Sum: 9.3765416 × 10^14
Variance: 1.5282037 × 10^17
Monotonicity: Strictly decreasing
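
The powers of ten in these figures are easy to misread, so a quick consistency check helps: the variance should be the square of the standard deviation, the CV should be the standard deviation over the mean, the sum should be the mean times the 464 observations, and the range should be the largest listed value minus the smallest. The sketch below uses only numbers printed in this report; the dictionary names are illustrative.

```python
import math

n = 464                      # number of observations
mean = 2.0208064e12          # reported mean
std = 3.9092246e8            # reported standard deviation
minimum = 2020013000064      # smallest value in the extreme-value listing
maximum = 2021052000032      # largest value in the extreme-value listing

# Quantities derived from the figures above.
derived = {
    "variance": std ** 2,
    "cv": std / mean,
    "sum": mean * n,
    "range": maximum - minimum,
}
# Quantities as printed in the report.
reported = {
    "variance": 1.5282037e17,
    "cv": 0.00019344875,
    "sum": 9.3765416e14,
    "range": 1.039e9,
}

# The reported figures are rounded, so compare with a relative tolerance.
agreement = {
    name: math.isclose(derived[name], reported[name], rel_tol=1e-3)
    for name in derived
}
for name in derived:
    print(f"{name}: derived={derived[name]:.7g} reported={reported[name]:.7g} ok={agreement[name]}")
```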
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
2021052000032 | 1 | 0.2%
2021010700051 | 1 | 0.2%
2021010700037 | 1 | 0.2%
2021010700038 | 1 | 0.2%
2021010700039 | 1 | 0.2%
2021010700040 | 1 | 0.2%
2021010700041 | 1 | 0.2%
2021010700042 | 1 | 0.2%
2021010700043 | 1 | 0.2%
2021010700045 | 1 | 0.2%
Other values (454) | 454 | 97.8%

Minimum 10 values
Value | Count | Frequency (%)
2020013000064 | 1 | 0.2%
2020013100072 | 1 | 0.2%
2020020300010 | 1 | 0.2%
2020020400035 | 1 | 0.2%
2020020400040 | 1 | 0.2%
2020020700061 | 1 | 0.2%
2020020700064 | 1 | 0.2%
2020020700065 | 1 | 0.2%
2020021000168 | 1 | 0.2%
2020040300011 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
2021052000032 | 1 | 0.2%
2021052000031 | 1 | 0.2%
2021052000030 | 1 | 0.2%
2021052000029 | 1 | 0.2%
2021052000028 | 1 | 0.2%
2021052000027 | 1 | 0.2%
2021051800026 | 1 | 0.2%
2021051800024 | 1 | 0.2%
2021051700023 | 1 | 0.2%
2021051700022 | 1 | 0.2%

구분
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top categories: 내자 (317), 일반용역 (52), 공사 (41), 기술용역 (38), 외자 (14)

Length

Max length4
Median length2
Mean length2.387931
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row내자
2nd row내자
3rd row내자
4th row내자
5th row내자

Common Values

Value | Count | Frequency (%)
내자 | 317 | 68.3%
일반용역 | 52 | 11.2%
공사 | 41 | 8.8%
기술용역 | 38 | 8.2%
외자 | 14 | 3.0%
임대 | 2 | 0.4%
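
The frequency column and the mean category length reported for 구분 both follow from the six counts in this table. A small sketch, assuming lengths are measured in characters (Unicode code points, which is what Python's `len` returns for these precomposed Hangul strings):

```python
# Counts copied from the Common Values table for 구분.
counts = {"내자": 317, "일반용역": 52, "공사": 41, "기술용역": 38, "외자": 14, "임대": 2}

total = sum(counts.values())

# Share of each category, as a percentage rounded to one decimal.
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

# Count-weighted mean of the category string lengths.
mean_length = sum(len(value) * count for value, count in counts.items()) / total

print(total)
print(shares)
print(round(mean_length, 6))
```

The total matches the 464 observations, and the weighted mean length agrees with the "Mean length" figure in the Length block.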

Length

Histogram of lengths of the category


발주계획명
Text

Distinct: 424
Distinct (%): 91.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length46
Median length37
Mean length22.064655
Min length6

Characters and Unicode

Total characters10238
Distinct characters436
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique390 ?
Unique (%)84.1%

Sample

1st row인천공항 Landside 제설제 구매사업
2nd row인천공항 4단계 팬코일유닛(FCU) 구매사업
3rd row4단계 가로등주 및 조명타워 구매사업
4th row인천공항 4단계 시스템에어컨(EHP) 구매사업
5th row4단계 가로등주 및 조명타워 구매사업
ValueCountFrequency (%)
구매 114
 
5.7%
4단계 75
 
3.8%
52
 
2.6%
인천공항 51
 
2.6%
21년 43
 
2.2%
구매사업 36
 
1.8%
부대건물 36
 
1.8%
용역 34
 
1.7%
2021년 31
 
1.6%
인천국제공항 25
 
1.3%
Other values (795) 1501
75.1%

Most occurring characters

ValueCountFrequency (%)
1534
 
15.0%
299
 
2.9%
282
 
2.8%
232
 
2.3%
230
 
2.2%
2 201
 
2.0%
196
 
1.9%
169
 
1.7%
155
 
1.5%
155
 
1.5%
Other values (426) 6785
66.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7128
69.6%
Space Separator 1534
 
15.0%
Uppercase Letter 542
 
5.3%
Decimal Number 496
 
4.8%
Lowercase Letter 182
 
1.8%
Open Punctuation 128
 
1.3%
Close Punctuation 127
 
1.2%
Other Punctuation 49
 
0.5%
Letter Number 24
 
0.2%
Modifier Symbol 15
 
0.1%
Other values (2) 13
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
299
 
4.2%
282
 
4.0%
232
 
3.3%
230
 
3.2%
196
 
2.7%
169
 
2.4%
155
 
2.2%
155
 
2.2%
154
 
2.2%
128
 
1.8%
Other values (355) 5128
71.9%
Uppercase Letter
ValueCountFrequency (%)
S 84
15.5%
T 66
12.2%
A 56
10.3%
P 38
 
7.0%
C 37
 
6.8%
I 37
 
6.8%
B 35
 
6.5%
D 29
 
5.4%
H 27
 
5.0%
E 20
 
3.7%
Other values (14) 113
20.8%
Lowercase Letter
ValueCountFrequency (%)
i 37
20.3%
e 32
17.6%
r 24
13.2%
d 21
11.5%
s 19
10.4%
t 9
 
4.9%
c 7
 
3.8%
n 6
 
3.3%
a 6
 
3.3%
o 6
 
3.3%
Other values (12) 15
8.2%
Decimal Number
ValueCountFrequency (%)
2 201
40.5%
1 133
26.8%
4 80
 
16.1%
0 62
 
12.5%
3 11
 
2.2%
8 3
 
0.6%
6 3
 
0.6%
5 2
 
0.4%
9 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
' 15
30.6%
/ 14
28.6%
, 14
28.6%
· 3
 
6.1%
" 2
 
4.1%
& 1
 
2.0%
Letter Number
ValueCountFrequency (%)
8
33.3%
7
29.2%
5
20.8%
4
16.7%
Space Separator
ValueCountFrequency (%)
1534
100.0%
Open Punctuation
ValueCountFrequency (%)
( 128
100.0%
Close Punctuation
ValueCountFrequency (%)
) 127
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7128
69.6%
Common 2362
 
23.1%
Latin 748
 
7.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
299
 
4.2%
282
 
4.0%
232
 
3.3%
230
 
3.2%
196
 
2.7%
169
 
2.4%
155
 
2.2%
155
 
2.2%
154
 
2.2%
128
 
1.8%
Other values (355) 5128
71.9%
Latin
ValueCountFrequency (%)
S 84
 
11.2%
T 66
 
8.8%
A 56
 
7.5%
P 38
 
5.1%
C 37
 
4.9%
I 37
 
4.9%
i 37
 
4.9%
B 35
 
4.7%
e 32
 
4.3%
D 29
 
3.9%
Other values (40) 297
39.7%
Common
ValueCountFrequency (%)
1534
64.9%
2 201
 
8.5%
1 133
 
5.6%
( 128
 
5.4%
) 127
 
5.4%
4 80
 
3.4%
0 62
 
2.6%
` 15
 
0.6%
' 15
 
0.6%
/ 14
 
0.6%
Other values (11) 53
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7128
69.6%
ASCII 3083
30.1%
Number Forms 24
 
0.2%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1534
49.8%
2 201
 
6.5%
1 133
 
4.3%
( 128
 
4.2%
) 127
 
4.1%
S 84
 
2.7%
4 80
 
2.6%
T 66
 
2.1%
0 62
 
2.0%
A 56
 
1.8%
Other values (56) 612
 
19.9%
Hangul
ValueCountFrequency (%)
299
 
4.2%
282
 
4.0%
232
 
3.3%
230
 
3.2%
196
 
2.7%
169
 
2.4%
155
 
2.2%
155
 
2.2%
154
 
2.2%
128
 
1.8%
Other values (355) 5128
71.9%
Number Forms
ValueCountFrequency (%)
8
33.3%
7
29.2%
5
20.8%
4
16.7%
None
ValueCountFrequency (%)
· 3
100.0%

조달방식
Categorical

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top categories: 자체조달(자체전자조달시스템) (365), 중앙조달 (99)

Length

Max length15
Median length15
Mean length12.653017
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자체조달(자체전자조달시스템)
2nd row중앙조달
3rd row중앙조달
4th row중앙조달
5th row중앙조달

Common Values

Value | Count | Frequency (%)
자체조달(자체전자조달시스템) | 365 | 78.7%
중앙조달 | 99 | 21.3%

Length

Histogram of lengths of the category


담당부서
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 42
Distinct (%): 9.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top categories: <NA> (347), 통신보안팀 (18), 스마트오피스팀 (9), 터미널건설팀 (9), 운항통신팀 (9), Other values (37) (72)

Length

Max length8
Median length4
Mean length4.3857759
Min length3

Unique

Unique18 ?
Unique (%)3.9%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 347 | 74.8%
통신보안팀 | 18 | 3.9%
스마트오피스팀 | 9 | 1.9%
터미널건설팀 | 9 | 1.9%
운항통신팀 | 9 | 1.9%
공항레이더팀 | 8 | 1.7%
지상레이더팀 | 5 | 1.1%
기계설비팀 | 4 | 0.9%
통합정보팀 | 3 | 0.6%
수하물운영팀 | 3 | 0.6%
Other values (32) | 49 | 10.6%
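
The 62.8% imbalance alert for 담당부서 can be approximated from these counts. The sketch below rests on two assumptions that are not stated in the report. First, it takes the imbalance score to be one minus the Shannon entropy of the category shares divided by the maximum possible entropy, log2 of the number of categories. Second, it infers the 32 unlisted counts: with 42 distinct values, 18 of them occurring once, and every unlisted count at most 3, the remaining 14 values must total 31, which only works as three 3s and eleven 2s. The function name `imbalance_score` is hypothetical.

```python
import math

# Counts listed in the Common Values table (the "<NA>" string counts as a category).
listed = [347, 18, 9, 9, 9, 8, 5, 4, 3, 3]

# Inferred tail, NOT printed in the report: 3 values with count 3,
# 11 values with count 2, and 18 values that occur exactly once.
tail = [3] * 3 + [2] * 11 + [1] * 18
counts = listed + tail

def imbalance_score(category_counts):
    """Assumed definition: 1 - (Shannon entropy / log2(number of categories))."""
    total = sum(category_counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in category_counts)
    return 1 - entropy / math.log2(len(category_counts))

score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score near 1 means a single category dominates; here the `<NA>` placeholder alone accounts for about three quarters of the rows.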

Length

Histogram of lengths of the category
ValueCountFrequency (%)
na 347
74.8%
통신보안팀 18
 
3.9%
스마트오피스팀 9
 
1.9%
터미널건설팀 9
 
1.9%
운항통신팀 9
 
1.9%
공항레이더팀 8
 
1.7%
지상레이더팀 5
 
1.1%
기계설비팀 4
 
0.9%
항행시설팀 3
 
0.6%
재산관리팀 3
 
0.6%
Other values (32) 49
 
10.6%

발주예정시기
Categorical

Distinct: 12
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top categories: 2021-04 (95), 2021-03 (94), 2021-02 (76), 2021-05 (69), 2021-01 (54), Other values (7) (76)

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-06
2nd row2021-05
3rd row2021-05
4th row2021-05
5th row2021-05

Common Values

Value | Count | Frequency (%)
2021-04 | 95 | 20.5%
2021-03 | 94 | 20.3%
2021-02 | 76 | 16.4%
2021-05 | 69 | 14.9%
2021-01 | 54 | 11.6%
2021-06 | 29 | 6.2%
2021-07 | 16 | 3.4%
2021-09 | 14 | 3.0%
2021-12 | 5 | 1.1%
2021-08 | 4 | 0.9%
Other values (2) | 8 | 1.7%

Length

Histogram of lengths of the category

Interactions

[Figures: interaction plots]

Correlations

 | 연번 | 발주계획번호 | 구분 | 조달방식 | 담당부서 | 발주예정시기
연번 | 1.000 | 0.835 | 0.217 | 0.179 | 0.919 | 0.607
발주계획번호 | 0.835 | 1.000 | 0.289 | 0.053 | 0.770 | 0.506
구분 | 0.217 | 0.289 | 1.000 | 0.440 | 0.949 | 0.331
조달방식 | 0.179 | 0.053 | 0.440 | 1.000 | 0.444 | 0.000
담당부서 | 0.919 | 0.770 | 0.949 | 0.444 | 1.000 | 0.000
발주예정시기 | 0.607 | 0.506 | 0.331 | 0.000 | 0.000 | 1.000

 | 담당부서 | 발주예정시기 | 구분 | 조달방식
담당부서 | 1.000 | 0.000 | 0.640 | 0.299
발주예정시기 | 0.000 | 1.000 | 0.135 | 0.000
구분 | 0.640 | 0.135 | 1.000 | 0.316
조달방식 | 0.299 | 0.000 | 0.316 | 1.000

 | 연번 | 발주계획번호 | 구분 | 조달방식 | 담당부서 | 발주예정시기
연번 | 1.000 | -1.000 | 0.115 | 0.136 | 0.530 | 0.307
발주계획번호 | -1.000 | 1.000 | 0.114 | 0.091 | 0.442 | 0.268
구분 | 0.115 | 0.114 | 1.000 | 0.316 | 0.640 | 0.135
조달방식 | 0.136 | 0.091 | 0.316 | 1.000 | 0.299 | 0.000
담당부서 | 0.530 | 0.442 | 0.640 | 0.299 | 1.000 | 0.000
발주예정시기 | 0.307 | 0.268 | 0.135 | 0.000 | 0.000 | 1.000
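
The -1.000 between 연번 and 발주계획번호 in the last matrix is what a rank correlation gives when one column strictly increases while the other strictly decreases. The sketch below assumes that cell is a Spearman coefficient (the report text does not name the method) and illustrates it on the ten rows shown at the top of the Sample section; the helper functions are hypothetical and handle only tie-free data.

```python
import math

def ranks(xs):
    """Rank values from 1..n; no tie handling, which is fine for distinct values."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    result = [0] * len(xs)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result

def pearson(a, b):
    """Plain Pearson correlation of two equal-length sequences."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def spearman(a, b):
    """Spearman correlation is the Pearson correlation of the ranks."""
    return pearson(ranks(a), ranks(b))

# First ten rows of the sample: 연번 and 발주계획번호.
serial = list(range(1, 11))
plan_no = [
    2021052000032, 2021052000031, 2021052000030, 2021052000029, 2021052000028,
    2021052000027, 2021051800026, 2021051800024, 2021051700023, 2021051700022,
]

rho = spearman(serial, plan_no)
print(rho)
```

Since both columns are reported as strictly monotonic over all 464 rows, the same reasoning extends to the full dataset.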

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 연번 | 발주계획번호 | 구분 | 발주계획명 | 조달방식 | 담당부서 | 발주예정시기
0 | 1 | 2021052000032 | 내자 | 인천공항 Landside 제설제 구매사업 | 자체조달(자체전자조달시스템) | <NA> | 2021-06
1 | 2 | 2021052000031 | 내자 | 인천공항 4단계 팬코일유닛(FCU) 구매사업 | 중앙조달 | <NA> | 2021-05
2 | 3 | 2021052000030 | 내자 | 4단계 가로등주 및 조명타워 구매사업 | 중앙조달 | <NA> | 2021-05
3 | 4 | 2021052000029 | 내자 | 인천공항 4단계 시스템에어컨(EHP) 구매사업 | 중앙조달 | <NA> | 2021-05
4 | 5 | 2021052000028 | 내자 | 4단계 가로등주 및 조명타워 구매사업 | 중앙조달 | <NA> | 2021-05
5 | 6 | 2021052000027 | 내자 | 4단계 T2 및 부대건물 LED실내조명등 제조구매 사업(Ⅰ) | 자체조달(자체전자조달시스템) | 전기설비팀 | 2021-06
6 | 7 | 2021051800026 | 내자 | 엔드포인트 보안 및 보안시스템 개선사업 | 자체조달(자체전자조달시스템) | <NA> | 2021-07
7 | 8 | 2021051800024 | 내자 | 4단계 부대건물 시설공사(PKG1) 경계석 구매 | 중앙조달 | <NA> | 2021-05
8 | 9 | 2021051700023 | 내자 | 제2여객터미널 확장공사 가스소화기 단가구매 | 자체조달(자체전자조달시스템) | <NA> | 2021-06
9 | 10 | 2021051700022 | 내자 | 제2여객터미널 확장공사 분말소화기 단가구매 | 자체조달(자체전자조달시스템) | <NA> | 2021-06

Last rows

(index) | 연번 | 발주계획번호 | 구분 | 발주계획명 | 조달방식 | 담당부서 | 발주예정시기
454 | 455 | 2020040300011 | 기술용역 | 4단계 T1남측 연결도로공사 건설사업관리용역 | 자체조달(자체전자조달시스템) | <NA> | 2021-05
455 | 456 | 2020021000168 | 내자 | 인천공항 온라인 채용관 운영 및 홍보 콘텐츠 제작 용역 | 자체조달(자체전자조달시스템) | <NA> | 2021-05
456 | 457 | 2020020700065 | 내자 | 4단계 경비보안시스템 구축사업 | 자체조달(자체전자조달시스템) | 통신보안팀 | 2021-04
457 | 458 | 2020020700064 | 기술용역 | 4단계 정보통신 관로선로 및 외곽보안시설 감리용역 | 자체조달(자체전자조달시스템) | 통신보안팀 | 2021-02
458 | 459 | 2020020700061 | 내자 | 4단계 5G기반 모바일업무시스템 구축사업 | 자체조달(자체전자조달시스템) | 통신보안팀 | 2021-05
459 | 460 | 2020020400040 | 기술용역 | 통합정보시스템 고도화 기본설계 용역 | 자체조달(자체전자조달시스템) | 통합정보팀 | 2021-02
460 | 461 | 2020020400035 | 공사 | 부대건물(1단계) 전기실 개선공사 | 자체조달(자체전자조달시스템) | 전력계통팀 | 2021-03
461 | 462 | 2020020300010 | 외자 | 인천국제공항 4단계 위탁수하물 보안검색장비 구매설치사업 | 자체조달(자체전자조달시스템) | 수하물설비팀 | 2021-04
462 | 463 | 2020013100072 | 공사 | T1남측 연결도로공사(4-10공구) | 자체조달(자체전자조달시스템) | <NA> | 2021-06
463 | 464 | 2020013000064 | 일반용역 | 2022년 루트회의 전시부스 대행용역 | 자체조달(자체전자조달시스템) | <NA> | 2021-12