Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory322.3 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text2

Dataset

Description방위사업청 및 각 군이 국내에서 조달하는 입찰참여업체정보를 제공합니다.업체명, 대표자명 정보를 포함하고 있습니다.
Author방위사업청
URLhttps://www.data.go.kr/data/15050921/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 18:10:11.338509
Analysis finished2024-03-14 18:10:13.127165
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50343.866
Minimum6
Maximum99990
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T03:10:13.269954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile5002.45
Q125039
median50451.5
Q376090
95-th percentile94725.55
Maximum99990
Range99984
Interquartile range (IQR)51051

Descriptive statistics

Standard deviation29044.303
Coefficient of variation (CV)0.57691841
Kurtosis-1.2193442
Mean50343.866
Median Absolute Deviation (MAD)25517
Skewness-0.026719504
Sum5.0343866 × 108
Variance8.4357156 × 108
MonotonicityNot monotonic
2024-03-15T03:10:13.552189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
43916 1
 
< 0.1%
58002 1
 
< 0.1%
63273 1
 
< 0.1%
98651 1
 
< 0.1%
84597 1
 
< 0.1%
413 1
 
< 0.1%
54340 1
 
< 0.1%
63015 1
 
< 0.1%
35136 1
 
< 0.1%
86620 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
6 1
< 0.1%
24 1
< 0.1%
32 1
< 0.1%
50 1
< 0.1%
76 1
< 0.1%
84 1
< 0.1%
90 1
< 0.1%
117 1
< 0.1%
130 1
< 0.1%
131 1
< 0.1%
ValueCountFrequency (%)
99990 1
< 0.1%
99988 1
< 0.1%
99974 1
< 0.1%
99957 1
< 0.1%
99948 1
< 0.1%
99944 1
< 0.1%
99935 1
< 0.1%
99933 1
< 0.1%
99919 1
< 0.1%
99908 1
< 0.1%
Distinct9844
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:10:14.283585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length34
Mean length8.5461
Min length2

Characters and Unicode

Total characters85461
Distinct characters787
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9695 ?
Unique (%)97.0%

Sample

1st row주식회사 천지
2nd row주식회사 발안수목원
3rd row주식회사 효성크린
4th row주식회사 케이티에스
5th row에이치에스 주식회사
ValueCountFrequency (%)
주식회사 4200
28.3%
유한회사 168
 
1.1%
105
 
0.7%
건축사사무소 64
 
0.4%
합자회사 36
 
0.2%
사단법인 14
 
0.1%
株式會社 11
 
0.1%
산학협력단 10
 
0.1%
사무소 9
 
0.1%
재단법인 8
 
0.1%
Other values (9867) 10237
68.9%
2024-03-15T03:10:15.529154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7903
 
9.2%
6196
 
7.3%
5259
 
6.2%
5026
 
5.9%
4897
 
5.7%
) 2982
 
3.5%
( 2979
 
3.5%
2758
 
3.2%
2258
 
2.6%
1999
 
2.3%
Other values (777) 43204
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 73728
86.3%
Space Separator 4897
 
5.7%
Close Punctuation 2990
 
3.5%
Open Punctuation 2986
 
3.5%
Uppercase Letter 556
 
0.7%
Lowercase Letter 157
 
0.2%
Other Punctuation 89
 
0.1%
Decimal Number 48
 
0.1%
Dash Punctuation 8
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7903
 
10.7%
6196
 
8.4%
5259
 
7.1%
5026
 
6.8%
2758
 
3.7%
2258
 
3.1%
1999
 
2.7%
1222
 
1.7%
961
 
1.3%
930
 
1.3%
Other values (706) 39216
53.2%
Uppercase Letter
ValueCountFrequency (%)
E 55
 
9.9%
N 49
 
8.8%
C 46
 
8.3%
S 41
 
7.4%
T 36
 
6.5%
G 34
 
6.1%
A 33
 
5.9%
K 31
 
5.6%
M 27
 
4.9%
O 25
 
4.5%
Other values (16) 179
32.2%
Lowercase Letter
ValueCountFrequency (%)
o 26
16.6%
i 15
9.6%
e 14
 
8.9%
n 13
 
8.3%
t 12
 
7.6%
c 10
 
6.4%
a 9
 
5.7%
d 9
 
5.7%
m 7
 
4.5%
r 7
 
4.5%
Other values (13) 35
22.3%
Decimal Number
ValueCountFrequency (%)
1 17
35.4%
2 14
29.2%
3 5
 
10.4%
0 5
 
10.4%
5 3
 
6.2%
4 2
 
4.2%
9 1
 
2.1%
6 1
 
2.1%
Other Punctuation
ValueCountFrequency (%)
. 54
60.7%
& 14
 
15.7%
, 14
 
15.7%
· 2
 
2.2%
2
 
2.2%
2
 
2.2%
/ 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 2982
99.7%
8
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 2979
99.8%
7
 
0.2%
Space Separator
ValueCountFrequency (%)
4897
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73615
86.1%
Common 11018
 
12.9%
Latin 713
 
0.8%
Han 115
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7903
 
10.7%
6196
 
8.4%
5259
 
7.1%
5026
 
6.8%
2758
 
3.7%
2258
 
3.1%
1999
 
2.7%
1222
 
1.7%
961
 
1.3%
930
 
1.3%
Other values (663) 39103
53.1%
Latin
ValueCountFrequency (%)
E 55
 
7.7%
N 49
 
6.9%
C 46
 
6.5%
S 41
 
5.8%
T 36
 
5.0%
G 34
 
4.8%
A 33
 
4.6%
K 31
 
4.3%
M 27
 
3.8%
o 26
 
3.6%
Other values (39) 335
47.0%
Han
ValueCountFrequency (%)
14
 
12.2%
13
 
11.3%
13
 
11.3%
13
 
11.3%
6
 
5.2%
5
 
4.3%
3
 
2.6%
3
 
2.6%
3
 
2.6%
2
 
1.7%
Other values (34) 40
34.8%
Common
ValueCountFrequency (%)
4897
44.4%
) 2982
27.1%
( 2979
27.0%
. 54
 
0.5%
1 17
 
0.2%
& 14
 
0.1%
2 14
 
0.1%
, 14
 
0.1%
8
 
0.1%
- 8
 
0.1%
Other values (11) 31
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73613
86.1%
ASCII 11710
 
13.7%
CJK 111
 
0.1%
None 23
 
< 0.1%
CJK Compat Ideographs 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7903
 
10.7%
6196
 
8.4%
5259
 
7.1%
5026
 
6.8%
2758
 
3.7%
2258
 
3.1%
1999
 
2.7%
1222
 
1.7%
961
 
1.3%
930
 
1.3%
Other values (662) 39101
53.1%
ASCII
ValueCountFrequency (%)
4897
41.8%
) 2982
25.5%
( 2979
25.4%
E 55
 
0.5%
. 54
 
0.5%
N 49
 
0.4%
C 46
 
0.4%
S 41
 
0.4%
T 36
 
0.3%
G 34
 
0.3%
Other values (55) 537
 
4.6%
CJK
ValueCountFrequency (%)
14
 
12.6%
13
 
11.7%
13
 
11.7%
13
 
11.7%
6
 
5.4%
5
 
4.5%
3
 
2.7%
3
 
2.7%
3
 
2.7%
2
 
1.8%
Other values (30) 36
32.4%
None
ValueCountFrequency (%)
8
34.8%
7
30.4%
· 2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
CJK Compat Ideographs
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct8563
Distinct (%)85.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:10:16.923803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length3
Mean length3.0066
Min length2

Characters and Unicode

Total characters30066
Distinct characters322
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7582 ?
Unique (%)75.8%

Sample

1st row이진미
2nd row이관용
3rd row권효숙
4th row김기만
5th row김현주
ValueCountFrequency (%)
김미숙 13
 
0.1%
김현주 10
 
0.1%
김경희 9
 
0.1%
이정호 9
 
0.1%
김태호 8
 
0.1%
김은영 7
 
0.1%
김선희 7
 
0.1%
이상훈 7
 
0.1%
김태형 7
 
0.1%
김영태 7
 
0.1%
Other values (8592) 9968
99.2%
2024-03-15T03:10:18.712143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2097
 
7.0%
1581
 
5.3%
1057
 
3.5%
879
 
2.9%
797
 
2.7%
588
 
2.0%
555
 
1.8%
525
 
1.7%
511
 
1.7%
510
 
1.7%
Other values (312) 20966
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29904
99.5%
Uppercase Letter 93
 
0.3%
Space Separator 58
 
0.2%
Lowercase Letter 4
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2097
 
7.0%
1581
 
5.3%
1057
 
3.5%
879
 
2.9%
797
 
2.7%
588
 
2.0%
555
 
1.9%
525
 
1.8%
511
 
1.7%
510
 
1.7%
Other values (283) 20804
69.6%
Uppercase Letter
ValueCountFrequency (%)
N 12
12.9%
A 10
10.8%
E 9
 
9.7%
K 7
 
7.5%
R 6
 
6.5%
S 6
 
6.5%
M 6
 
6.5%
U 5
 
5.4%
O 5
 
5.4%
Y 4
 
4.3%
Other values (11) 23
24.7%
Lowercase Letter
ValueCountFrequency (%)
o 1
25.0%
n 1
25.0%
e 1
25.0%
h 1
25.0%
Space Separator
ValueCountFrequency (%)
58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29904
99.5%
Latin 97
 
0.3%
Common 65
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2097
 
7.0%
1581
 
5.3%
1057
 
3.5%
879
 
2.9%
797
 
2.7%
588
 
2.0%
555
 
1.9%
525
 
1.8%
511
 
1.7%
510
 
1.7%
Other values (283) 20804
69.6%
Latin
ValueCountFrequency (%)
N 12
12.4%
A 10
 
10.3%
E 9
 
9.3%
K 7
 
7.2%
R 6
 
6.2%
S 6
 
6.2%
M 6
 
6.2%
U 5
 
5.2%
O 5
 
5.2%
Y 4
 
4.1%
Other values (15) 27
27.8%
Common
ValueCountFrequency (%)
58
89.2%
) 3
 
4.6%
( 3
 
4.6%
, 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29904
99.5%
ASCII 162
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2097
 
7.0%
1581
 
5.3%
1057
 
3.5%
879
 
2.9%
797
 
2.7%
588
 
2.0%
555
 
1.9%
525
 
1.8%
511
 
1.7%
510
 
1.7%
Other values (283) 20804
69.6%
ASCII
ValueCountFrequency (%)
58
35.8%
N 12
 
7.4%
A 10
 
6.2%
E 9
 
5.6%
K 7
 
4.3%
R 6
 
3.7%
S 6
 
3.7%
M 6
 
3.7%
U 5
 
3.1%
O 5
 
3.1%
Other values (19) 38
23.5%

Interactions

2024-03-15T03:10:12.678353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-15T03:10:12.931561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T03:10:13.063958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업체명대표자명
4391543916주식회사 천지이진미
5551155512주식회사 발안수목원이관용
7694476945주식회사 효성크린권효숙
7147071471주식회사 케이티에스김기만
65926593에이치에스 주식회사김현주
8758587586거상HK환경김경호
3717037171주식회사 창조전력박명석
3630836309대협이엔지 주식회사정복용
9423294233위드유헬스케어유희주
66466647명진종합개발(주)최원호
번호업체명대표자명
3537535376세리정보기술 주식회사이금모
4310443105(주)월드여행사최정화
1233012331유한회사 용수종합건설우정욱
3903939040팔방미인 주식회사강갑례
5567755678주식회사 유정종합건설이영호
5866058661주식회사 대양종합건설장순애
4991749918주식회사 창광건설이미미
38063807성정종합건설(주)정문희
46014602삼보종합지하개발(주)김희균
1914719148범한건설(주)홍정민