Overview

Dataset statistics

Number of variables5
Number of observations154
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.4 KiB
Average record size in memory42.9 B

Variable types

Numeric2
Text3

Dataset

Description2023년 한국석유공사에서 체결한 수의계약 업체 정보로, 수의계약 일련번호, 업체명, 주소 등이 기재되어 있음.
Author한국석유공사
URLhttps://www.data.go.kr/data/15100089/fileData.do

Alerts

수의계약일련번호 has unique valuesUnique

Reproduction

Analysis started2024-04-21 02:07:00.070625
Analysis finished2024-04-21 02:07:02.562908
Duration2.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

수의계약일련번호
Real number (ℝ)

UNIQUE 

Distinct154
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0230625 × 109
Minimum2.02301 × 109
Maximum2.0231202 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-04-21T11:07:02.660214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.02301 × 109
5-th percentile2.0230101 × 109
Q12.0230301 × 109
median2.02307 × 109
Q32.0230975 × 109
95-th percentile2.02312 × 109
Maximum2.0231202 × 109
Range110200
Interquartile range (IQR)67482.5

Descriptive statistics

Standard deviation36264.852
Coefficient of variation (CV)1.792572 × 10-5
Kurtosis-1.3418207
Mean2.0230625 × 109
Median Absolute Deviation (MAD)30100
Skewness-0.031957442
Sum3.1155163 × 1011
Variance1.3151395 × 109
MonotonicityStrictly increasing
2024-04-21T11:07:02.787917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2023010010 1
 
0.6%
2023090070 1
 
0.6%
2023080070 1
 
0.6%
2023090010 1
 
0.6%
2023090020 1
 
0.6%
2023090030 1
 
0.6%
2023090040 1
 
0.6%
2023090050 1
 
0.6%
2023090060 1
 
0.6%
2023090080 1
 
0.6%
Other values (144) 144
93.5%
ValueCountFrequency (%)
2023010010 1
0.6%
2023010020 1
0.6%
2023010030 1
0.6%
2023010040 1
0.6%
2023010050 1
0.6%
2023010060 1
0.6%
2023010070 1
0.6%
2023010080 1
0.6%
2023010090 1
0.6%
2023010100 1
0.6%
ValueCountFrequency (%)
2023120210 1
0.6%
2023120140 1
0.6%
2023120130 1
0.6%
2023120090 1
0.6%
2023120070 1
0.6%
2023120060 1
0.6%
2023120050 1
0.6%
2023120040 1
0.6%
2023120030 1
0.6%
2023120020 1
0.6%

사업자등록번호
Real number (ℝ)

Distinct125
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3338077 × 109
Minimum1.0181163 × 109
Maximum8.9376005 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-04-21T11:07:02.905425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0181163 × 109
5-th percentile1.07464 × 109
Q11.3134997 × 109
median3.0584022 × 109
Q34.9788011 × 109
95-th percentile7.2169278 × 109
Maximum8.9376005 × 109
Range7.9194842 × 109
Interquartile range (IQR)3.6653013 × 109

Descriptive statistics

Standard deviation2.1269768 × 109
Coefficient of variation (CV)0.63800223
Kurtosis-0.41293829
Mean3.3338077 × 109
Median Absolute Deviation (MAD)1.7602211 × 109
Skewness0.78532226
Sum5.1340639 × 1011
Variance4.5240301 × 1018
MonotonicityNot monotonic
2024-04-21T11:07:03.069045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6068510657 3
 
1.9%
4178123613 3
 
1.9%
1068119621 3
 
1.9%
1268164718 3
 
1.9%
1358201164 3
 
1.9%
2438200132 2
 
1.3%
6011607879 2
 
1.3%
2068601936 2
 
1.3%
1198141769 2
 
1.3%
1132769995 2
 
1.3%
Other values (115) 129
83.8%
ValueCountFrequency (%)
1018116293 1
 
0.6%
1058190922 1
 
0.6%
1058632572 1
 
0.6%
1058711399 1
 
0.6%
1058739548 1
 
0.6%
1068119621 3
1.9%
1078151022 1
 
0.6%
1108144925 1
 
0.6%
1128200217 1
 
0.6%
1132769995 2
1.3%
ValueCountFrequency (%)
8937600464 1
0.6%
8788600237 1
0.6%
8548600075 1
0.6%
8328801817 1
0.6%
8228701958 1
0.6%
8168100840 1
0.6%
7621200829 1
0.6%
7548102176 1
0.6%
7038603094 2
1.3%
6705100820 1
0.6%
Distinct125
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-21T11:07:03.326752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length8.5
Min length4

Characters and Unicode

Total characters1309
Distinct characters203
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)65.6%

Sample

1st row모아시스 주식회사
2nd row주식회사 퓨쳐누리
3rd row인스피언(주)
4th row모아이엔지
5th row주식회사우경정보기술
ValueCountFrequency (%)
주식회사 64
29.2%
주)이화테크 3
 
1.4%
오티스엘리베이터(유 3
 
1.4%
용인시산림조합 3
 
1.4%
삼일회계법인 3
 
1.4%
한특건설 3
 
1.4%
한길플랜트 2
 
0.9%
태성산업기계 2
 
0.9%
엘나인 2
 
0.9%
사단법인한마음장애인복지회 2
 
0.9%
Other values (117) 132
60.3%
2024-04-21T11:07:03.729532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
114
 
8.7%
81
 
6.2%
76
 
5.8%
73
 
5.6%
65
 
5.0%
) 43
 
3.3%
( 43
 
3.3%
40
 
3.1%
33
 
2.5%
27
 
2.1%
Other values (193) 714
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1154
88.2%
Space Separator 65
 
5.0%
Close Punctuation 43
 
3.3%
Open Punctuation 43
 
3.3%
Decimal Number 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
114
 
9.9%
81
 
7.0%
76
 
6.6%
73
 
6.3%
40
 
3.5%
33
 
2.9%
27
 
2.3%
22
 
1.9%
20
 
1.7%
19
 
1.6%
Other values (188) 649
56.2%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%
Space Separator
ValueCountFrequency (%)
65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1154
88.2%
Common 155
 
11.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
114
 
9.9%
81
 
7.0%
76
 
6.6%
73
 
6.3%
40
 
3.5%
33
 
2.9%
27
 
2.3%
22
 
1.9%
20
 
1.7%
19
 
1.6%
Other values (188) 649
56.2%
Common
ValueCountFrequency (%)
65
41.9%
) 43
27.7%
( 43
27.7%
1 2
 
1.3%
2 2
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1154
88.2%
ASCII 155
 
11.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
114
 
9.9%
81
 
7.0%
76
 
6.6%
73
 
6.3%
40
 
3.5%
33
 
2.9%
27
 
2.3%
22
 
1.9%
20
 
1.7%
19
 
1.6%
Other values (188) 649
56.2%
ASCII
ValueCountFrequency (%)
65
41.9%
) 43
27.7%
( 43
27.7%
1 2
 
1.3%
2 2
 
1.3%
Distinct125
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-21T11:07:04.037825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.025974
Min length2

Characters and Unicode

Total characters466
Distinct characters111
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)65.6%

Sample

1st row김종수
2nd row추정호
3rd row최정규
4th row최인수
5th row박윤하
ValueCountFrequency (%)
윤석택 3
 
1.9%
조익서 3
 
1.9%
이대영 3
 
1.9%
윤훈수 3
 
1.9%
김민영 3
 
1.9%
김재일 2
 
1.3%
문성원 2
 
1.3%
이상관 2
 
1.3%
안영주 2
 
1.3%
권태원 2
 
1.3%
Other values (118) 132
84.1%
2024-04-21T11:07:04.483850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
6.9%
26
 
5.6%
19
 
4.1%
14
 
3.0%
13
 
2.8%
12
 
2.6%
11
 
2.4%
10
 
2.1%
10
 
2.1%
9
 
1.9%
Other values (101) 310
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 463
99.4%
Space Separator 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
6.9%
26
 
5.6%
19
 
4.1%
14
 
3.0%
13
 
2.8%
12
 
2.6%
11
 
2.4%
10
 
2.2%
10
 
2.2%
9
 
1.9%
Other values (100) 307
66.3%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 463
99.4%
Common 3
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
6.9%
26
 
5.6%
19
 
4.1%
14
 
3.0%
13
 
2.8%
12
 
2.6%
11
 
2.4%
10
 
2.2%
10
 
2.2%
9
 
1.9%
Other values (100) 307
66.3%
Common
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 463
99.4%
ASCII 3
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
6.9%
26
 
5.6%
19
 
4.1%
14
 
3.0%
13
 
2.8%
12
 
2.6%
11
 
2.4%
10
 
2.2%
10
 
2.2%
9
 
1.9%
Other values (100) 307
66.3%
ASCII
ValueCountFrequency (%)
3
100.0%

주소
Text

Distinct125
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-21T11:07:04.689547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length43
Mean length30.168831
Min length17

Characters and Unicode

Total characters4646
Distinct characters270
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)65.6%

Sample

1st row서울특별시 서초구 바우뫼로 220, 일광빌딩 4층(양재동)
2nd row서울특별시 영등포구 경인로71길 70, 벽산디지털밸리 1102호(문래동5가)
3rd row서울특별시 금천구 벚꽃로278, 1309호(가산동, 에스제이테크노빌)
4th row경기도 화성시 우정읍 버들로859-0
5th row대구광역시 수성구 알파시티1로31길24-5 (대흥동)
ValueCountFrequency (%)
서울특별시 50
 
6.5%
경기도 32
 
4.2%
충청남도 17
 
2.2%
전라남도 14
 
1.8%
여수시 12
 
1.6%
구로구 10
 
1.3%
부산광역시 10
 
1.3%
용인시 9
 
1.2%
서산시 9
 
1.2%
울산광역시 9
 
1.2%
Other values (410) 593
77.5%
2024-04-21T11:07:05.030165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
613
 
13.2%
1 202
 
4.3%
159
 
3.4%
152
 
3.3%
151
 
3.3%
136
 
2.9%
) 125
 
2.7%
( 125
 
2.7%
0 124
 
2.7%
2 117
 
2.5%
Other values (260) 2742
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2799
60.2%
Decimal Number 829
 
17.8%
Space Separator 613
 
13.2%
Close Punctuation 125
 
2.7%
Open Punctuation 125
 
2.7%
Other Punctuation 74
 
1.6%
Dash Punctuation 65
 
1.4%
Uppercase Letter 13
 
0.3%
Math Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
159
 
5.7%
152
 
5.4%
151
 
5.4%
136
 
4.9%
90
 
3.2%
85
 
3.0%
76
 
2.7%
70
 
2.5%
69
 
2.5%
65
 
2.3%
Other values (233) 1746
62.4%
Decimal Number
ValueCountFrequency (%)
1 202
24.4%
0 124
15.0%
2 117
14.1%
3 78
 
9.4%
5 64
 
7.7%
7 61
 
7.4%
6 61
 
7.4%
4 55
 
6.6%
9 38
 
4.6%
8 29
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
B 2
15.4%
I 2
15.4%
T 2
15.4%
M 1
7.7%
J 1
7.7%
A 1
7.7%
U 1
7.7%
E 1
7.7%
R 1
7.7%
W 1
7.7%
Other Punctuation
ValueCountFrequency (%)
, 73
98.6%
. 1
 
1.4%
Space Separator
ValueCountFrequency (%)
613
100.0%
Close Punctuation
ValueCountFrequency (%)
) 125
100.0%
Open Punctuation
ValueCountFrequency (%)
( 125
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 65
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2799
60.2%
Common 1834
39.5%
Latin 13
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
159
 
5.7%
152
 
5.4%
151
 
5.4%
136
 
4.9%
90
 
3.2%
85
 
3.0%
76
 
2.7%
70
 
2.5%
69
 
2.5%
65
 
2.3%
Other values (233) 1746
62.4%
Common
ValueCountFrequency (%)
613
33.4%
1 202
 
11.0%
) 125
 
6.8%
( 125
 
6.8%
0 124
 
6.8%
2 117
 
6.4%
3 78
 
4.3%
, 73
 
4.0%
- 65
 
3.5%
5 64
 
3.5%
Other values (7) 248
13.5%
Latin
ValueCountFrequency (%)
B 2
15.4%
I 2
15.4%
T 2
15.4%
M 1
7.7%
J 1
7.7%
A 1
7.7%
U 1
7.7%
E 1
7.7%
R 1
7.7%
W 1
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2799
60.2%
ASCII 1847
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
613
33.2%
1 202
 
10.9%
) 125
 
6.8%
( 125
 
6.8%
0 124
 
6.7%
2 117
 
6.3%
3 78
 
4.2%
, 73
 
4.0%
- 65
 
3.5%
5 64
 
3.5%
Other values (17) 261
14.1%
Hangul
ValueCountFrequency (%)
159
 
5.7%
152
 
5.4%
151
 
5.4%
136
 
4.9%
90
 
3.2%
85
 
3.0%
76
 
2.7%
70
 
2.5%
69
 
2.5%
65
 
2.3%
Other values (233) 1746
62.4%

Interactions

2024-04-21T11:07:02.139100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:07:01.827800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:07:02.236882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:07:01.980937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T11:07:05.112052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수의계약일련번호사업자등록번호
수의계약일련번호1.0000.000
사업자등록번호0.0001.000
2024-04-21T11:07:05.187134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수의계약일련번호사업자등록번호
수의계약일련번호1.0000.128
사업자등록번호0.1281.000

Missing values

2024-04-21T11:07:02.377092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:07:02.497618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

수의계약일련번호사업자등록번호업체명대표자명주소
020230100102118647496모아시스 주식회사김종수서울특별시 서초구 바우뫼로 220, 일광빌딩 4층(양재동)
120230100201058711399주식회사 퓨쳐누리추정호서울특별시 영등포구 경인로71길 70, 벽산디지털밸리 1102호(문래동5가)
220230100301208741140인스피언(주)최정규서울특별시 금천구 벚꽃로278, 1309호(가산동, 에스제이테크노빌)
320230100402242860646모아이엔지최인수경기도 화성시 우정읍 버들로859-0
420230100505028198620주식회사우경정보기술박윤하대구광역시 수성구 알파시티1로31길24-5 (대흥동)
520230100602298110559이지소프트주식회사황택현서울특별시 구로구 디지털로 272, 한신IT타워 1116호(구로동)
620230100705138118285주식회사 이지닉스전종대경기도 용인시 수지구 신수로767-0 (동천동) 분당수지U타워 A동 1309~1311호
720230100801108144925주식회사 날리지큐브김학훈서울특별시 서초구 서초중앙로14 (서초동) 진로빌딩 15층
820230100902148612318나라아이넷(주)김찬훈대전광역시 유성구 테크노10로33 (탑립동,나라지식센터)
920230101002158169556네이버시스템 주식회사임병조서울특별시 송파구 중대로 135, 아이티벤처타워 동관 16층 1601(가락동)
수의계약일련번호사업자등록번호업체명대표자명주소
14420231200208228701958주식회사 이노플러스이태진부산광역시 남구 자성로152 (문현동), 1321호
14520231200302638101574제트정보기술 주식회사주은태서울특별시 강서구 양천로401-0 (가양동) 강서한강자이타워 B동 407호
14620231200403718701178주식회사 유성기술정동철충청남도 천안시 서북구 직산읍 직산로283-0 ,2동
14720231200507548102176주식회사 디케이컴퍼니박주원전라남도 영광군 영광읍 옥당로47
14820231200604178123613주식회사 한특건설윤석택전라남도 여수시 봉계6길 39(봉계동)
14920231200701200757034퓨네이처박우원서울특별시 강남구 테헤란로151-0 (역삼동,역삼하이츠빌딩 1914)
15020231200906210158096디자인예닮윤근원경기도 용인시 처인구 백옥대로1374, 304호 (유방동, 유림해피랜드빌딩)
15120231201301138199899(주)한콘트롤스심재용서울특별시 구로구 디지털로30길31-0 (구로동)
15220231201407038603094주식회사 투앤투디자인임형석경기도 안양시 동안구 관평로79번길11 (평촌동)
15320231202103118127984주식회사 드림개발이민희충청남도 당진시 순성면 남부로1088-0