Overview

Dataset statistics

Number of variables: 11
Number of observations: 2403
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 209.0 KiB
Average record size in memory: 89.1 B

Variable types

Categorical: 4
Text: 5
DateTime: 1
Numeric: 1

Dataset

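Description: Contract status information for construction, service, and goods contracts at Korea East-West Power (한국동서발전). Each record lists items such as the contract number, contract name, contract amount, and procurement type.
URL: https://www.data.go.kr/data/15065323/fileData.do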

Alerts

조달유형 is highly imbalanced (80.2%) [Imbalance]
계약금액(VAT 포함) is highly skewed (γ1 = 26.96191162) [Skewed]
계약번호 has unique values [Unique]
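
The 80.2% imbalance figure for 조달유형 can be reproduced from the class counts reported later in this report (전자입찰 = 2329, 수기입찰 = 74). The sketch below assumes the score is defined as one minus the normalized Shannon entropy of the class frequencies. That definition is an assumption about how the profiler computes the alert, and the function name `imbalance_score` is illustrative only.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k).

    H is the Shannon entropy (in bits) of the class frequencies and k is the
    number of classes. 0 means perfectly balanced, 1 means a single class.
    This definition is assumed, not taken from the report itself.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts reported for 조달유형: 전자입찰 = 2329, 수기입찰 = 74
score = imbalance_score([2329, 74])
print(f"{score:.1%}")
```

Under that assumption the result agrees with the alert to one decimal place, which suggests the alert is driven by the 96.9% share of 전자입찰.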

Reproduction

Analysis started: 2023-12-12 06:03:34.526708
Analysis finished: 2023-12-12 06:03:36.422520
Duration: 1.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

구분
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size18.9 KiB
구매
1432 
공사
544 
용역
427 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구매
2nd row구매
3rd row공사
4th row구매
5th row구매

Common Values

ValueCountFrequency (%)
구매 1432
59.6%
공사 544
 
22.6%
용역 427
 
17.8%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
구매 1432
59.6%
공사 544
 
22.6%
용역 427
 
17.8%

계약번호
Text

UNIQUE 

Distinct2403
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size18.9 KiB

Length

Max length11
Median length10
Mean length10.404078
Min length10
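
The length statistics, together with the sample values and letter counts reported for this column (R: 1432, C: 971), suggest two layouts: `R` followed by 9 digits, or `C` followed by 10 digits. The pattern below is a hypothetical validation rule inferred from those figures. It is not a documented format of the source system.

```python
import re

# Hypothetical pattern inferred from the report: "R" + 9 digits or "C" + 10 digits.
CONTRACT_NO = re.compile(r"R\d{9}|C\d{10}")


def is_valid_contract_no(value: str) -> bool:
    """Return True if the whole string matches the inferred layout."""
    return CONTRACT_NO.fullmatch(value) is not None


# The five sample rows shown for 계약번호 in this report.
samples = ["R081901167", "R081901170", "C0081910494", "R082000005", "R082000001"]
all_valid = all(is_valid_contract_no(s) for s in samples)

# Cross-check: 1432 ten-character values and 971 eleven-character values
# give the reported total character count and mean length.
total_chars = 1432 * 10 + 971 * 11
mean_length = total_chars / 2403
print(all_valid, total_chars, round(mean_length, 6))
```

If the inferred layout is right, the total of 25001 characters and the mean length of 10.404078 both follow directly from the two prefix counts.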

Characters and Unicode

Total characters25001
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
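
As a minimal sketch of how such per-category counts can be derived, Python's standard `unicodedata` module returns the general category code of each character. The helper name `category_counts` is illustrative. The report's labels correspond to the two-letter codes (`Nd` = Decimal Number, `Lu` = Uppercase Letter, `Lo` = Other Letter, `Zs` = Space Separator).

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample value of 계약번호: one uppercase letter and nine digits.
counts = category_counts("R081901167")
print(dict(counts))

# Hangul syllables fall under "Lo" (Other Letter).
hangul_category = unicodedata.category("구")
print(hangul_category)
```

Script and block statistics are not covered by `unicodedata`. They would need a separate lookup table or a third-party package.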

Unique

Unique2403 ?
Unique (%)100.0%

Sample

1st rowR081901167
2nd rowR081901170
3rd rowC0081910494
4th rowR082000005
5th rowR082000001
ValueCountFrequency (%)
r081901167 1
 
< 0.1%
c0082110350 1
 
< 0.1%
c0082120351 1
 
< 0.1%
r082100588 1
 
< 0.1%
r082100586 1
 
< 0.1%
r082100590 1
 
< 0.1%
c0082110353 1
 
< 0.1%
r082100594 1
 
< 0.1%
r082100592 1
 
< 0.1%
r082100589 1
 
< 0.1%
Other values (2393) 2393
99.6%

Most occurring characters

ValueCountFrequency (%)
0 9084
36.3%
2 4418
17.7%
8 2884
 
11.5%
1 2289
 
9.2%
R 1432
 
5.7%
C 971
 
3.9%
3 879
 
3.5%
4 778
 
3.1%
5 668
 
2.7%
6 597
 
2.4%
Other values (2) 1001
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22598
90.4%
Uppercase Letter 2403
 
9.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9084
40.2%
2 4418
19.6%
8 2884
 
12.8%
1 2289
 
10.1%
3 879
 
3.9%
4 778
 
3.4%
5 668
 
3.0%
6 597
 
2.6%
7 513
 
2.3%
9 488
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
R 1432
59.6%
C 971
40.4%

Most occurring scripts

ValueCountFrequency (%)
Common 22598
90.4%
Latin 2403
 
9.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9084
40.2%
2 4418
19.6%
8 2884
 
12.8%
1 2289
 
10.1%
3 879
 
3.9%
4 778
 
3.4%
5 668
 
3.0%
6 597
 
2.6%
7 513
 
2.3%
9 488
 
2.2%
Latin
ValueCountFrequency (%)
R 1432
59.6%
C 971
40.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25001
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9084
36.3%
2 4418
17.7%
8 2884
 
11.5%
1 2289
 
9.2%
R 1432
 
5.7%
C 971
 
3.9%
3 879
 
3.5%
4 778
 
3.1%
5 668
 
2.7%
6 597
 
2.4%
Other values (2) 1001
 
4.0%

계약명
Text

Distinct2394
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size18.9 KiB

Length

Max length72
Median length51
Mean length29.655431
Min length7

Characters and Unicode

Total characters71262
Distinct characters580
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2385 ?
Unique (%)99.3%

Sample

1st row[동해] 바이오매스 혼소 컨베이어 벨트 구매
2nd row[동해] 회처리설비 Bed/Fly Ash배관(C.I.A)구매 단가계약
3rd row당진화력본부 설비용 항온항습기 및 냉방기 정비공사
4th row[동해] 1,2호기 비상전원용 축전지 구매
5th row2020년 수처리용 화공약품(암모니아수 외 11종) 구매
ValueCountFrequency (%)
구매 730
 
5.4%
당진 581
 
4.3%
249
 
1.8%
233
 
1.7%
동해 232
 
1.7%
용역 186
 
1.4%
131
 
1.0%
일산 130
 
1.0%
구매(설치포함 117
 
0.9%
당진발전본부 112
 
0.8%
Other values (3722) 10925
80.2%

Most occurring characters

ValueCountFrequency (%)
11231
 
15.8%
1804
 
2.5%
[ 1411
 
2.0%
] 1410
 
2.0%
2 1340
 
1.9%
1253
 
1.8%
1165
 
1.6%
1150
 
1.6%
1147
 
1.6%
1094
 
1.5%
Other values (570) 48257
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42585
59.8%
Space Separator 11231
 
15.8%
Decimal Number 5035
 
7.1%
Lowercase Letter 3723
 
5.2%
Uppercase Letter 2784
 
3.9%
Open Punctuation 2338
 
3.3%
Close Punctuation 2337
 
3.3%
Other Punctuation 837
 
1.2%
Math Symbol 341
 
0.5%
Dash Punctuation 38
 
0.1%
Other values (5) 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1804
 
4.2%
1253
 
2.9%
1165
 
2.7%
1150
 
2.7%
1147
 
2.7%
1094
 
2.6%
1053
 
2.5%
1037
 
2.4%
978
 
2.3%
815
 
1.9%
Other values (479) 31089
73.0%
Lowercase Letter
ValueCountFrequency (%)
e 524
14.1%
r 357
9.6%
a 317
 
8.5%
o 297
 
8.0%
t 294
 
7.9%
n 279
 
7.5%
l 278
 
7.5%
i 274
 
7.4%
s 131
 
3.5%
u 121
 
3.3%
Other values (15) 851
22.9%
Uppercase Letter
ValueCountFrequency (%)
C 428
15.4%
S 362
13.0%
T 212
 
7.6%
P 199
 
7.1%
A 191
 
6.9%
E 158
 
5.7%
R 149
 
5.4%
H 137
 
4.9%
G 122
 
4.4%
D 111
 
4.0%
Other values (15) 715
25.7%
Other Punctuation
ValueCountFrequency (%)
, 485
57.9%
# 189
 
22.6%
· 39
 
4.7%
/ 26
 
3.1%
& 26
 
3.1%
% 25
 
3.0%
. 19
 
2.3%
; 12
 
1.4%
: 10
 
1.2%
' 4
 
0.5%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 1340
26.6%
1 952
18.9%
0 792
15.7%
4 397
 
7.9%
5 339
 
6.7%
3 320
 
6.4%
8 299
 
5.9%
6 234
 
4.6%
7 182
 
3.6%
9 180
 
3.6%
Open Punctuation
ValueCountFrequency (%)
[ 1411
60.4%
( 923
39.5%
3
 
0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
] 1410
60.3%
) 923
39.5%
3
 
0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 339
99.4%
1
 
0.3%
+ 1
 
0.3%
Other Symbol
ValueCountFrequency (%)
2
50.0%
2
50.0%
Space Separator
ValueCountFrequency (%)
11231
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42584
59.8%
Common 22164
31.1%
Latin 6511
 
9.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1804
 
4.2%
1253
 
2.9%
1165
 
2.7%
1150
 
2.7%
1147
 
2.7%
1094
 
2.6%
1053
 
2.5%
1037
 
2.4%
978
 
2.3%
815
 
1.9%
Other values (478) 31088
73.0%
Latin
ValueCountFrequency (%)
e 524
 
8.0%
C 428
 
6.6%
S 362
 
5.6%
r 357
 
5.5%
a 317
 
4.9%
o 297
 
4.6%
t 294
 
4.5%
n 279
 
4.3%
l 278
 
4.3%
i 274
 
4.2%
Other values (41) 3101
47.6%
Common
ValueCountFrequency (%)
11231
50.7%
[ 1411
 
6.4%
] 1410
 
6.4%
2 1340
 
6.0%
1 952
 
4.3%
) 923
 
4.2%
( 923
 
4.2%
0 792
 
3.6%
, 485
 
2.2%
4 397
 
1.8%
Other values (29) 2300
 
10.4%
Han
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42581
59.8%
ASCII 28620
40.2%
None 49
 
0.1%
Number Forms 4
 
< 0.1%
CJK 3
 
< 0.1%
CJK Compat 2
 
< 0.1%
Math Operators 1
 
< 0.1%
Punctuation 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11231
39.2%
[ 1411
 
4.9%
] 1410
 
4.9%
2 1340
 
4.7%
1 952
 
3.3%
) 923
 
3.2%
( 923
 
3.2%
0 792
 
2.8%
e 524
 
1.8%
, 485
 
1.7%
Other values (71) 8629
30.2%
Hangul
ValueCountFrequency (%)
1804
 
4.2%
1253
 
2.9%
1165
 
2.7%
1150
 
2.7%
1147
 
2.7%
1094
 
2.6%
1053
 
2.5%
1037
 
2.4%
978
 
2.3%
815
 
1.9%
Other values (476) 31085
73.0%
None
ValueCountFrequency (%)
· 39
79.6%
3
 
6.1%
3
 
6.1%
2
 
4.1%
1
 
2.0%
1
 
2.0%
Number Forms
ValueCountFrequency (%)
4
100.0%
CJK Compat
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
2
66.7%
1
33.3%
Math Operators
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

계약일자
DateTime

Distinct694
Distinct (%)28.9%
Missing0
Missing (%)0.0%
Memory size18.9 KiB
Minimum2020-01-01 00:00:00
Maximum2022-12-30 00:00:00
Histogram with fixed size bins (bins=50)

계약금액(VAT 포함)
Real number (ℝ)

SKEWED 

Distinct: 2392
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.9121676 × 10⁸
Minimum: 223
Maximum: 3.5167 × 10¹¹
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 21.2 KiB

Quantile statistics

Minimum: 223
5-th percentile: 17804724
Q1: 45103197
Median: 95402670
Q3: 2.3130063 × 10⁸
95-th percentile: 9.8618366 × 10⁸
Maximum: 3.5167 × 10¹¹
Range: 3.5167 × 10¹¹
Interquartile range (IQR): 1.8619743 × 10⁸

Descriptive statistics

Standard deviation: 1.0310968 × 10¹⁰
Coefficient of variation (CV): 13.031786
Kurtosis: 804.64428
Mean: 7.9121676 × 10⁸
Median Absolute Deviation (MAD): 65655920
Skewness: 26.961912
Sum: 1.9012939 × 10¹²
Variance: 1.0631606 × 10²⁰
Monotonicity: Not monotonic
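
Several of the figures above are derived from others, so they can be cross-checked with plain arithmetic. The sketch below uses only the rounded values printed in this report, so small rounding differences are expected. The variable names are illustrative.

```python
import math

# Values as printed in this report (already rounded by the profiler).
n = 2403                 # number of observations
mean = 7.9121676e8
std = 1.0310968e10
q1 = 45103197
q3 = 2.3130063e8

cv = std / mean          # coefficient of variation
iqr = q3 - q1            # interquartile range
total = mean * n         # sum of all contract amounts
variance = std ** 2

print(f"CV       = {cv:.4f}")
print(f"IQR      = {iqr:.4e}")
print(f"Sum      = {total:.4e}")
print(f"Variance = {variance:.4e}")
```

A standard deviation about 13 times the mean, combined with a median far below the mean, is consistent with the skewness alert: a handful of very large contracts dominate the total.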
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
220000000 2
 
0.1%
308000000 2
 
0.1%
9044000 2
 
0.1%
550000000 2
 
0.1%
44000000 2
 
0.1%
126500000 2
 
0.1%
181500000 2
 
0.1%
51000000 2
 
0.1%
93500000 2
 
0.1%
23199000 2
 
0.1%
Other values (2382) 2383
99.2%
ValueCountFrequency (%)
223 1
< 0.1%
1279 1
< 0.1%
1326 1
< 0.1%
1493 1
< 0.1%
53800 1
< 0.1%
79790 1
< 0.1%
115137 1
< 0.1%
117000 1
< 0.1%
173892 1
< 0.1%
215877 1
< 0.1%
ValueCountFrequency (%)
351670000000 1
< 0.1%
275880000000 1
< 0.1%
167189000000 1
< 0.1%
104602000000 1
< 0.1%
100700000000 1
< 0.1%
43670000000 1
< 0.1%
34669837107 1
< 0.1%
28083000000 1
< 0.1%
27742193855 1
< 0.1%
27115000000 1
< 0.1%

예정가격(VAT 포함)
Text

Distinct2401
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size18.9 KiB

Length

Max length11
Median length10
Mean length8.5784436
Min length3

Characters and Unicode

Total characters20614
Distinct characters15
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2399 ?
Unique (%)99.8%

Sample

1st row33568269
2nd row43792121
3rd row385757467
4th row596086905
5th row98789970
ValueCountFrequency (%)
210000000 2
 
0.1%
777000000 2
 
0.1%
393547828 1
 
< 0.1%
37761084 1
 
< 0.1%
593366400 1
 
< 0.1%
97066954 1
 
< 0.1%
33568269 1
 
< 0.1%
40594071 1
 
< 0.1%
43587660 1
 
< 0.1%
38446009 1
 
< 0.1%
Other values (2391) 2391
99.5%

Most occurring characters

ValueCountFrequency (%)
0 2625
12.7%
1 2481
12.0%
2 2184
10.6%
3 1988
9.6%
6 1927
9.3%
4 1918
9.3%
5 1915
9.3%
7 1872
9.1%
9 1855
9.0%
8 1831
8.9%
Other values (5) 18
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20596
99.9%
Other Punctuation 5
 
< 0.1%
Uppercase Letter 5
 
< 0.1%
Math Symbol 5
 
< 0.1%
Space Separator 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2625
12.7%
1 2481
12.0%
2 2184
10.6%
3 1988
9.7%
6 1927
9.4%
4 1918
9.3%
5 1915
9.3%
7 1872
9.1%
9 1855
9.0%
8 1831
8.9%
Other Punctuation
ValueCountFrequency (%)
. 5
100.0%
Uppercase Letter
ValueCountFrequency (%)
E 5
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
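
The category breakdown above shows a few non-digit characters (`.`, `E`, `+`, space, `-`), which is a plausible reason this price column was typed as Text rather than numeric. The report does not show the offending values, so the inputs below are hypothetical. The sketch assumes some values use scientific notation, some carry stray spaces, and a lone `-` marks a missing price.

```python
from decimal import Decimal, InvalidOperation


def parse_amount(raw):
    """Parse an amount string into an int, or return None if it is not a number.

    Assumptions (not confirmed by the report): surrounding spaces are noise,
    a lone "-" means "no value", and scientific notation such as "1.5E+11"
    is a valid amount. Fractional parts, if any, are truncated.
    """
    text = raw.strip()
    if text in ("", "-"):
        return None
    try:
        return int(Decimal(text))
    except InvalidOperation:
        return None


# Hypothetical inputs illustrating each case.
print(parse_amount("33568269"))
print(parse_amount(" 1.5E+11 "))
print(parse_amount("-"))
```

`Decimal` is used instead of `float` so that large amounts written in scientific notation convert to exact integers.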

Most occurring scripts

ValueCountFrequency (%)
Common 20609
> 99.9%
Latin 5
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2625
12.7%
1 2481
12.0%
2 2184
10.6%
3 1988
9.6%
6 1927
9.4%
4 1918
9.3%
5 1915
9.3%
7 1872
9.1%
9 1855
9.0%
8 1831
8.9%
Other values (4) 13
 
0.1%
Latin
ValueCountFrequency (%)
E 5
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20614
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2625
12.7%
1 2481
12.0%
2 2184
10.6%
3 1988
9.6%
6 1927
9.3%
4 1918
9.3%
5 1915
9.3%
7 1872
9.1%
9 1855
9.0%
8 1831
8.9%
Other values (5) 18
 
0.1%

계약방법
Categorical

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size18.9 KiB
제한경쟁
1514 
일반경쟁
502 
제한경쟁(중소기업간)
386 
지명경쟁
 
1

Length

Max length11
Median length4
Mean length5.1244278
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row제한경쟁(중소기업간)
2nd row제한경쟁(중소기업간)
3rd row제한경쟁
4th row일반경쟁
5th row제한경쟁(중소기업간)

Common Values

ValueCountFrequency (%)
제한경쟁 1514
63.0%
일반경쟁 502
 
20.9%
제한경쟁(중소기업간) 386
 
16.1%
지명경쟁 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
제한경쟁 1514
63.0%
일반경쟁 502
 
20.9%
제한경쟁(중소기업간) 386
 
16.1%
지명경쟁 1
 
< 0.1%

조달유형
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size18.9 KiB
전자입찰
2329 
수기입찰
 
74

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전자입찰
2nd row전자입찰
3rd row전자입찰
4th row전자입찰
5th row전자입찰

Common Values

ValueCountFrequency (%)
전자입찰 2329
96.9%
수기입찰 74
 
3.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
전자입찰 2329
96.9%
수기입찰 74
 
3.1%

계약업체
Text

Distinct1631
Distinct (%)67.9%
Missing0
Missing (%)0.0%
Memory size18.9 KiB

Length

Max length38
Median length24
Mean length8.0079068
Min length1

Characters and Unicode

Total characters19243
Distinct characters541
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1283 ?
Unique (%)53.4%

Sample

1st row드림스카이 주식회사
2nd row동명중공업주식회사
3rd row주식회사 성호이엔지
4th row주식회사허브정보통신
5th row주식회사 예인컴퍼니
ValueCountFrequency (%)
주식회사 918
 
26.7%
27
 
0.8%
미림상사 19
 
0.6%
아라 18
 
0.5%
모간산업주식회사 18
 
0.5%
바보스 17
 
0.5%
한결통상 17
 
0.5%
하늘기업 16
 
0.5%
보람 14
 
0.4%
주)행복한원덕 12
 
0.3%
Other values (1670) 2368
68.8%

Most occurring characters

ValueCountFrequency (%)
1632
 
8.5%
1301
 
6.8%
1135
 
5.9%
1078
 
5.6%
1046
 
5.4%
640
 
3.3%
) 555
 
2.9%
( 555
 
2.9%
412
 
2.1%
324
 
1.7%
Other values (531) 10565
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16761
87.1%
Space Separator 1046
 
5.4%
Close Punctuation 559
 
2.9%
Open Punctuation 559
 
2.9%
Uppercase Letter 193
 
1.0%
Lowercase Letter 77
 
0.4%
Other Punctuation 44
 
0.2%
Decimal Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1632
 
9.7%
1301
 
7.8%
1135
 
6.8%
1078
 
6.4%
640
 
3.8%
412
 
2.5%
324
 
1.9%
276
 
1.6%
255
 
1.5%
247
 
1.5%
Other values (475) 9461
56.4%
Uppercase Letter
ValueCountFrequency (%)
S 26
13.5%
C 17
 
8.8%
E 15
 
7.8%
J 15
 
7.8%
N 13
 
6.7%
H 11
 
5.7%
T 11
 
5.7%
R 11
 
5.7%
L 10
 
5.2%
I 9
 
4.7%
Other values (13) 55
28.5%
Lowercase Letter
ValueCountFrequency (%)
o 10
13.0%
t 9
11.7%
d 7
 
9.1%
e 6
 
7.8%
r 5
 
6.5%
u 4
 
5.2%
n 4
 
5.2%
c 4
 
5.2%
a 4
 
5.2%
i 4
 
5.2%
Other values (10) 20
26.0%
Close Punctuation
ValueCountFrequency (%)
) 555
99.3%
3
 
0.5%
] 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 555
99.3%
3
 
0.5%
[ 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 35
79.5%
& 7
 
15.9%
2
 
4.5%
Decimal Number
ValueCountFrequency (%)
3 2
50.0%
5 1
25.0%
6 1
25.0%
Space Separator
ValueCountFrequency (%)
1046
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16742
87.0%
Common 2212
 
11.5%
Latin 270
 
1.4%
Han 19
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1632
 
9.7%
1301
 
7.8%
1135
 
6.8%
1078
 
6.4%
640
 
3.8%
412
 
2.5%
324
 
1.9%
276
 
1.6%
255
 
1.5%
247
 
1.5%
Other values (461) 9442
56.4%
Latin
ValueCountFrequency (%)
S 26
 
9.6%
C 17
 
6.3%
E 15
 
5.6%
J 15
 
5.6%
N 13
 
4.8%
H 11
 
4.1%
T 11
 
4.1%
R 11
 
4.1%
L 10
 
3.7%
o 10
 
3.7%
Other values (33) 131
48.5%
Han
ValueCountFrequency (%)
3
15.8%
2
10.5%
2
10.5%
2
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (4) 4
21.1%
Common
ValueCountFrequency (%)
1046
47.3%
) 555
25.1%
( 555
25.1%
. 35
 
1.6%
& 7
 
0.3%
3
 
0.1%
3
 
0.1%
2
 
0.1%
3 2
 
0.1%
5 1
 
< 0.1%
Other values (3) 3
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16742
87.0%
ASCII 2474
 
12.9%
CJK 19
 
0.1%
None 8
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1632
 
9.7%
1301
 
7.8%
1135
 
6.8%
1078
 
6.4%
640
 
3.8%
412
 
2.5%
324
 
1.9%
276
 
1.6%
255
 
1.5%
247
 
1.5%
Other values (461) 9442
56.4%
ASCII
ValueCountFrequency (%)
1046
42.3%
) 555
22.4%
( 555
22.4%
. 35
 
1.4%
S 26
 
1.1%
C 17
 
0.7%
E 15
 
0.6%
J 15
 
0.6%
N 13
 
0.5%
H 11
 
0.4%
Other values (43) 186
 
7.5%
CJK
ValueCountFrequency (%)
3
15.8%
2
10.5%
2
10.5%
2
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (4) 4
21.1%
None
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%

소재지
Text

Distinct1651
Distinct (%)68.7%
Missing0
Missing (%)0.0%
Memory size18.9 KiB

Length

Max length68
Median length54
Mean length31.119434
Min length17

Characters and Unicode

Total characters74780
Distinct characters533
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1313 ?
Unique (%)54.6%

Sample

1st row강원도 동해시 공단1로, 177-0 (구호동)
2nd row경상북도 고령군 다산면 다산산단로, 223-19
3rd row충청남도 보령시 대청로, 401-0 (화산동)
4th row부산광역시 사상구 괘감로 131, 삼주오피스텔 1동 802호, (감전동)
5th row경기도 고양시 덕양구 충장로, 140, 4층 401호(행신동, 썬프라자)
ValueCountFrequency (%)
충청남도 541
 
3.8%
경기도 459
 
3.2%
당진시 353
 
2.5%
서울특별시 319
 
2.2%
울산광역시 251
 
1.8%
강원도 167
 
1.2%
144
 
1.0%
전라남도 136
 
1.0%
남구 121
 
0.8%
인천광역시 109
 
0.8%
Other values (4293) 11653
81.8%

Most occurring characters

ValueCountFrequency (%)
11876
 
15.9%
, 3587
 
4.8%
1 2836
 
3.8%
2399
 
3.2%
0 2194
 
2.9%
2154
 
2.9%
1977
 
2.6%
) 1738
 
2.3%
( 1731
 
2.3%
2 1718
 
2.3%
Other values (523) 42570
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41225
55.1%
Decimal Number 13045
 
17.4%
Space Separator 11876
 
15.9%
Other Punctuation 3613
 
4.8%
Close Punctuation 1738
 
2.3%
Open Punctuation 1731
 
2.3%
Dash Punctuation 1372
 
1.8%
Uppercase Letter 175
 
0.2%
Math Symbol 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2399
 
5.8%
2154
 
5.2%
1977
 
4.8%
1673
 
4.1%
1309
 
3.2%
1245
 
3.0%
1060
 
2.6%
994
 
2.4%
863
 
2.1%
730
 
1.8%
Other values (483) 26821
65.1%
Uppercase Letter
ValueCountFrequency (%)
B 41
23.4%
A 25
14.3%
C 19
10.9%
T 17
9.7%
I 12
 
6.9%
K 11
 
6.3%
S 10
 
5.7%
R 8
 
4.6%
M 5
 
2.9%
D 5
 
2.9%
Other values (10) 22
12.6%
Decimal Number
ValueCountFrequency (%)
1 2836
21.7%
0 2194
16.8%
2 1718
13.2%
3 1414
10.8%
4 993
 
7.6%
5 933
 
7.2%
6 858
 
6.6%
7 752
 
5.8%
8 722
 
5.5%
9 625
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 3587
99.3%
. 23
 
0.6%
2
 
0.1%
& 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
11876
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1738
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1731
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1372
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41224
55.1%
Common 33378
44.6%
Latin 177
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2399
 
5.8%
2154
 
5.2%
1977
 
4.8%
1673
 
4.1%
1309
 
3.2%
1245
 
3.0%
1060
 
2.6%
994
 
2.4%
863
 
2.1%
730
 
1.8%
Other values (482) 26820
65.1%
Latin
ValueCountFrequency (%)
B 41
23.2%
A 25
14.1%
C 19
10.7%
T 17
9.6%
I 12
 
6.8%
K 11
 
6.2%
S 10
 
5.6%
R 8
 
4.5%
M 5
 
2.8%
D 5
 
2.8%
Other values (11) 24
13.6%
Common
ValueCountFrequency (%)
11876
35.6%
, 3587
 
10.7%
1 2836
 
8.5%
0 2194
 
6.6%
) 1738
 
5.2%
( 1731
 
5.2%
2 1718
 
5.1%
3 1414
 
4.2%
- 1372
 
4.1%
4 993
 
3.0%
Other values (9) 3919
 
11.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41224
55.1%
ASCII 33553
44.9%
None 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11876
35.4%
, 3587
 
10.7%
1 2836
 
8.5%
0 2194
 
6.5%
) 1738
 
5.2%
( 1731
 
5.2%
2 1718
 
5.1%
3 1414
 
4.2%
- 1372
 
4.1%
4 993
 
3.0%
Other values (29) 4094
 
12.2%
Hangul
ValueCountFrequency (%)
2399
 
5.8%
2154
 
5.2%
1977
 
4.8%
1673
 
4.1%
1309
 
3.2%
1245
 
3.0%
1060
 
2.6%
994
 
2.4%
863
 
2.1%
730
 
1.8%
Other values (482) 26820
65.1%
None
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

담당사업소
Categorical

Distinct16
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size18.9 KiB
당진발전본부
695 
당진화력
391 
조달처
330 
울산발전본부
208 
울산화력
159 
Other values (11)
620 

Length

Max length11
Median length9
Mean length5.1115273
Min length3

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row동해바이오화력
2nd row동해바이오화력
3rd row조달처
4th row동해바이오화력
5th row울산화력

Common Values

ValueCountFrequency (%)
당진발전본부 695
28.9%
당진화력 391
16.3%
조달처 330
13.7%
울산발전본부 208
 
8.7%
울산화력 159
 
6.6%
동해바이오화력 147
 
6.1%
상생조달처 124
 
5.2%
동해발전본부 99
 
4.1%
일산화력 77
 
3.2%
일산발전본부 53
 
2.2%
Other values (6) 120
 
5.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
당진발전본부 695
28.9%
당진화력 391
16.3%
조달처 330
13.7%
울산발전본부 208
 
8.7%
울산화력 159
 
6.6%
동해바이오화력 147
 
6.1%
상생조달처 124
 
5.2%
동해발전본부 99
 
4.1%
일산화력 77
 
3.2%
일산발전본부 53
 
2.2%
Other values (6) 120
 
5.0%

Interactions


Correlations

Variable | 구분 | 계약금액(VAT 포함) | 계약방법 | 조달유형 | 담당사업소
구분 | 1.000 | 0.000 | 0.324 | 0.234 | 0.522
계약금액(VAT 포함) | 0.000 | 1.000 | 0.000 | 0.000 | 0.033
계약방법 | 0.324 | 0.000 | 1.000 | 0.121 | 0.348
조달유형 | 0.234 | 0.000 | 0.121 | 1.000 | 0.403
담당사업소 | 0.522 | 0.033 | 0.348 | 0.403 | 1.000

Variable | 담당사업소 | 조달유형 | 구분 | 계약방법
담당사업소 | 1.000 | 0.367 | 0.279 | 0.204
조달유형 | 0.367 | 1.000 | 0.382 | 0.080
구분 | 0.279 | 0.382 | 1.000 | 0.313
계약방법 | 0.204 | 0.080 | 0.313 | 1.000

Variable | 계약금액(VAT 포함) | 구분 | 계약방법 | 조달유형 | 담당사업소
계약금액(VAT 포함) | 1.000 | 0.000 | 0.000 | 0.000 | 0.015
구분 | 0.000 | 1.000 | 0.313 | 0.382 | 0.279
계약방법 | 0.000 | 0.313 | 1.000 | 0.080 | 0.204
조달유형 | 0.000 | 0.382 | 0.080 | 1.000 | 0.367
담당사업소 | 0.015 | 0.279 | 0.204 | 0.367 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 구분 | 계약번호 | 계약명 | 계약일자 | 계약금액(VAT 포함) | 예정가격(VAT 포함) | 계약방법 | 조달유형 | 계약업체 | 소재지 | 담당사업소
0 | 구매 | R081901167 | [동해] 바이오매스 혼소 컨베이어 벨트 구매 | 2020-01-01 | 29625904 | 33568269 | 제한경쟁(중소기업간) | 전자입찰 | 드림스카이 주식회사 | 강원도 동해시 공단1로, 177-0 (구호동) | 동해바이오화력
1 | 구매 | R081901170 | [동해] 회처리설비 Bed/Fly Ash배관(C.I.A)구매 단가계약 | 2020-01-02 | 38588770 | 43792121 | 제한경쟁(중소기업간) | 전자입찰 | 동명중공업주식회사 | 경상북도 고령군 다산면 다산산단로, 223-19 | 동해바이오화력
2 | 공사 | C0081910494 | 당진화력본부 설비용 항온항습기 및 냉방기 정비공사 | 2020-01-02 | 337538500 | 385757467 | 제한경쟁 | 전자입찰 | 주식회사 성호이엔지 | 충청남도 보령시 대청로, 401-0 (화산동) | 조달처
3 | 구매 | R082000005 | [동해] 1,2호기 비상전원용 축전지 구매 | 2020-01-03 | 479823982 | 596086905 | 일반경쟁 | 전자입찰 | 주식회사허브정보통신 | 부산광역시 사상구 괘감로 131, 삼주오피스텔 1동 802호, (감전동) | 동해바이오화력
4 | 구매 | R082000001 | 2020년 수처리용 화공약품(암모니아수 외 11종) 구매 | 2020-01-06 | 93922111 | 98789970 | 제한경쟁(중소기업간) | 전자입찰 | 주식회사 예인컴퍼니 | 경기도 고양시 덕양구 충장로, 140, 4층 401호(행신동, 썬프라자) | 울산화력
5 | 구매 | R082000013 | [당진화력]밸브 외 135품목 연간단가계약 | 2020-01-06 | 550391000 | 683739920 | 일반경쟁 | 전자입찰 | 제이오파인드 | 경기도 오산시 경기대로25번길, 16 (갈곶동, 동부아파트)104동 803호 | 당진화력
6 | 구매 | R082000008 | [당진화력]철사 외 48품목 연간단가계약 | 2020-01-06 | 221989000 | 275775731 | 일반경쟁 | 전자입찰 | 두리 [dulu] | 경기도 평택시 포승읍 서동대로, 597 | 당진화력
7 | 구매 | R082000009 | [당진화력]초음파 카메라 1종 | 2020-01-06 | 48025047 | 54498798 | 일반경쟁 | 전자입찰 | 에스디상사 | 경기도 남양주시 오남읍 진건오남로759번길, 70 (남양주 오남 푸르지오) | 당진화력
8 | 구매 | R082000010 | [당진화력]적외선 영상 온도계 69set | 2020-01-06 | 51472116 | 58491820 | 제한경쟁 | 전자입찰 | 웅진 | 전라북도 전주시 완산구 장승배기로 261, () | 당진화력
9 | 용역 | C0082020002 | [동해] P2G 연계 태양광 발전 모니터링 및 측정 용역 | 2020-01-06 | 110499610 | 125929530 | 일반경쟁 | 전자입찰 | 주식회사 노벨 | 전라남도 무안군 망운면 운해로, 1205-0 | 동해바이오화력

Index | 구분 | 계약번호 | 계약명 | 계약일자 | 계약금액(VAT 포함) | 예정가격(VAT 포함) | 계약방법 | 조달유형 | 계약업체 | 소재지 | 담당사업소
2393 | 공사 | C0082210359 | [당진 9호기] SLP #C,D 배관 보온공사(기계) | 2022-12-16 | 60612200 | 69074141 | 일반경쟁 | 전자입찰 | (주)동부플랜트 | 전라남도 광양시 직동1길, 132 | 당진발전본부
2394 | 용역 | C0082220362 | 춘천에너지㈜ 주식가치평가 용역(협상에 의한 계약) | 2022-12-19 | 44000000 | 51055000 | 제한경쟁 | 전자입찰 | 삼도회계법인 | 서울특별시 서초구 사평대로, 361, 3층(반포동, 청원빌딩) | 조달처
2395 | 공사 | C0082210363 | 당진발전본부 2~4호기 Long Soot Blower Lance Tube 교체공사 | 2022-12-20 | 154710754 | 174913947 | 제한경쟁 | 전자입찰 | 주식회사 케이티엠 | 충청남도 보령시 대천방조제로, 43-0 (대천동) | 당진발전본부
2396 | 용역 | C0082220361 | 에너지신사업 태양광 발전설비 유지관리 용역 | 2022-12-20 | 148405000 | 168204566 | 일반경쟁 | 전자입찰 | (주)한라전기안전관리 | 울산광역시 남구 신정로17번길, 21-1 (달동) | 조달처
2397 | 용역 | C0082220364 | 2022년 소셜미디어 콘텐츠 기획·제작·관리·운영 용역(협상에 의한 계약) | 2022-12-22 | 275000000 | 291783000 | 일반경쟁 | 전자입찰 | 주식회사 디앤씨컴퍼니 | 대전광역시 동구 선화로, 187,3층 (삼성동) | 조달처
2398 | 용역 | C0082220365 | 2023년도 울산발전본부 출퇴근버스 운행 용역 | 2022-12-22 | 346636650 | 393547828 | 일반경쟁 | 전자입찰 | 주식회사 현대관광 | 울산광역시 중구 염포로 62, 2층, (반구동) | 울산발전본부
2399 | 용역 | C0082220366 | [당진본부] 2023년도 수질오염물질 자가측정 용역 | 2022-12-26 | 154526152 | 175671182 | 제한경쟁 | 전자입찰 | 주식회사 신화환경연구원 | 경기도 안양시 동안구 엘에스로, 142(호계동, 금정역 SKV1센터) 716호,717호,718호,719호,720호,721호 | 당진발전본부
2400 | 공사 | C0082210367 | 울산발전본부 재난안전대응센터 건설공사 | 2022-12-28 | 5391748716 | 6219186133 | 제한경쟁 | 전자입찰 | 남송종합건설(주) | 울산광역시 울주군 서생면 덕골재길, 19-21 | 조달처
2401 | 용역 | C0082220375 | 2022년도 신입사원 입문공통교육 및 Mind-set 교육 용역(협상에 의한 계약) | 2022-12-30 | 67500000 | 75697160 | 제한경쟁 | 전자입찰 | 한국생산성본부 | 서울특별시 종로구 새문안로5가길 32, (적선동) | 조달처
2402 | 용역 | C0082220368 | 재난안전대응센터 신축공사 소방감리용역 | 2022-12-30 | 42293330 | 47560897 | 제한경쟁 | 전자입찰 | 주식회사 보명엔지니어링 | 울산광역시 울주군 청량읍 상남길, 6-9 | 울산발전본부