Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 18865
Missing cells (%): 18.9%
Duplicate rows: 777
Duplicate rows (%): 7.8%
Total size in memory: 869.1 KiB
Average record size in memory: 89.0 B
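
The headline percentages can be re-derived from counts that appear elsewhere in this report. The sketch below is a consistency check, not part of the profiling tool. The per-variable missing counts are copied from the Variables section; attributing the counts of 3 and 42 to 재고번호 and 계약업체 is an assumption based on the column order in the Sample table.

```python
# Consistency check on the Overview figures (all numbers copied from this report).
N_ROWS, N_COLS = 10_000, 10

# Missing counts per variable; the names attached to the counts 3 and 42 are
# assumed from the column order shown in the Sample table.
missing_per_variable = {
    "재고번호": 3,
    "품명": 9,
    "도면번호": 1301,
    "도면부품번호": 1301,
    "규격번호": 8805,
    "계약업체": 42,
    "최종수정일": 7404,
}

missing_cells = sum(missing_per_variable.values())
missing_pct = 100 * missing_cells / (N_ROWS * N_COLS)  # share of all cells
duplicate_pct = 100 * 777 / N_ROWS                     # share of all rows
avg_record_bytes = 869.1 * 1024 / N_ROWS               # KiB -> bytes per row

print(f"missing cells: {missing_cells} ({missing_pct:.3f}%)")
print(f"duplicate rows: {duplicate_pct:.2f}%")
print(f"average record size: {avg_record_bytes:.2f} B")
```

The per-variable counts add up to the reported 18865 missing cells, and the derived percentages agree with the rounded figures above.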

Variable types

Categorical: 1
Text: 7
Numeric: 1
DateTime: 1

Dataset

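Description: Provides a list of the items localized (produced domestically) in each weapons development program of the Defense Acquisition Program Administration (방위사업청). Information on the private contractors that developed each program is also provided.
Author: Public Data Portal (공공데이터포털)
URL: https://www.data.go.kr/data/15119899/fileData.do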

Alerts

Dataset has 777 (7.8%) duplicate rows [Duplicates]
도면번호 has 1301 (13.0%) missing values [Missing]
도면부품번호 has 1301 (13.0%) missing values [Missing]
규격번호 has 8805 (88.0%) missing values [Missing]
최종수정일 has 7404 (74.0%) missing values [Missing]

Reproduction

Analysis started: 2024-04-21 01:32:51.058103
Analysis finished: 2024-04-21 01:32:54.191076
Duration: 3.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업명
Categorical

Distinct: 27
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
K9A1 자주포 성능개량 | 1060
K9 자주포 | 1053
K1전차 성능개량 | 1025
K55 자주포 성능개량 | 855
K56 탄약운반장갑차 | 829
Other values (22) | 5178

Length

Max length: 18
Median length: 12
Mean length: 9.1158
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: K9A1 자주포 성능개량
2nd row: K21 보병전투차량
3rd row: 120mm 자주박격포
4th row: K-2 전차
5th row: 소형전술차량

Common Values

Value | Count | Frequency (%)
K9A1 자주포 성능개량 | 1060 | 10.6%
K9 자주포 | 1053 | 10.5%
K1전차 성능개량 | 1025 | 10.2%
K55 자주포 성능개량 | 855 | 8.6%
K56 탄약운반장갑차 | 829 | 8.3%
K1A1전차 성능개량 | 772 | 7.7%
K10 탄약운반장갑차 | 743 | 7.4%
경구난차량 | 714 | 7.1%
보병탑승차량 | 526 | 5.3%
K21 보병전투차량 | 445 | 4.5%
Other values (17) | 1978 | 19.8%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
성능개량 | 3712 | 18.8%
자주포 | 3129 | 15.9%
탄약운반장갑차 | 1572 | 8.0%
k9a1 | 1060 | 5.4%
k9 | 1053 | 5.3%
k1전차 | 1025 | 5.2%
k55 | 855 | 4.3%
k56 | 829 | 4.2%
k1a1전차 | 772 | 3.9%
k10 | 743 | 3.8%
Other values (33) | 4988 | 25.3%
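
The second frequency table for 사업명 appears to count lowercased, whitespace-separated tokens rather than whole category values. That tokenization rule is an assumption inferred from the figures, not something the report states. A minimal sketch, using the seven most common 사업명 values and their counts from the Common Values table:

```python
from collections import Counter

# (value, count) pairs copied from the Common Values table for 사업명.
common_values = [
    ("K9A1 자주포 성능개량", 1060),
    ("K9 자주포", 1053),
    ("K1전차 성능개량", 1025),
    ("K55 자주포 성능개량", 855),
    ("K56 탄약운반장갑차", 829),
    ("K1A1전차 성능개량", 772),
    ("K10 탄약운반장갑차", 743),
]

# Assumed rule: lowercase each value, split on whitespace, weight by row count.
token_counts = Counter()
for value, count in common_values:
    for token in value.lower().split():
        token_counts[token] += count

for token, count in token_counts.most_common(5):
    print(token, count)
```

With only these seven values, the totals for 성능개량 (3712) and 탄약운반장갑차 (1572) already match the report. The total for 자주포 comes out lower than the reported 3129 because the less frequent values are not included here.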

부품관리번호
Text

Distinct: 6324
Distinct (%): 63.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Characters and Unicode

Total characters: 90000
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
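
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for the five sample values listed under Sample for this variable. It covers categories only, since scripts and blocks are not available from the standard library.

```python
import unicodedata
from collections import Counter

# The five sample values shown for this variable in the report.
samples = ["M51273627", "M00002580", "M00060994", "M00043147", "M51365366"]

# Tally the Unicode general category of every character
# ("Lu" = Uppercase Letter, "Nd" = Decimal Number).
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for category, count in category_counts.most_common():
    print(category, count)
```

Each value contributes one uppercase letter and eight digits, which is consistent with the 10000 to 80000 split reported for the full column.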

Unique

Unique: 4500
Unique (%): 45.0%

Sample

1st row: M51273627
2nd row: M00002580
3rd row: M00060994
4th row: M00043147
5th row: M51365366

Value | Count | Frequency (%)
m00005708 | 51 | 0.5%
m51111665 | 47 | 0.5%
m00043011 | 42 | 0.4%
m51092638 | 41 | 0.4%
m51092651 | 40 | 0.4%
m00027371 | 38 | 0.4%
m51345522 | 35 | 0.4%
m00027370 | 35 | 0.4%
m51896297 | 30 | 0.3%
m51043184 | 28 | 0.3%
Other values (6314) | 9613 | 96.1%

Most occurring characters

ValueCountFrequency (%)
0 21965
24.4%
M 10000
11.1%
1 9838
10.9%
5 8992
10.0%
3 6380
 
7.1%
2 6282
 
7.0%
7 6070
 
6.7%
4 5457
 
6.1%
6 5413
 
6.0%
8 4867
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
88.9%
Uppercase Letter 10000
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 21965
27.5%
1 9838
12.3%
5 8992
11.2%
3 6380
 
8.0%
2 6282
 
7.9%
7 6070
 
7.6%
4 5457
 
6.8%
6 5413
 
6.8%
8 4867
 
6.1%
9 4736
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
M 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 80000
88.9%
Latin 10000
 
11.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 21965
27.5%
1 9838
12.3%
5 8992
11.2%
3 6380
 
8.0%
2 6282
 
7.9%
7 6070
 
7.6%
4 5457
 
6.8%
6 5413
 
6.8%
8 4867
 
6.1%
9 4736
 
5.9%
Latin
ValueCountFrequency (%)
M 10000
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 21965
24.4%
M 10000
11.1%
1 9838
10.9%
5 8992
10.0%
3 6380
 
7.1%
2 6282
 
7.0%
7 6070
 
6.7%
4 5457
 
6.1%
6 5413
 
6.0%
8 4867
 
5.4%

재고번호
Text

Distinct: 6321
Distinct (%): 63.2%
Missing: 3
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 9
Mean length: 8.3937181
Min length: 5

Characters and Unicode

Total characters: 83912
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4497
Unique (%): 45.0%

Sample

1st row: 375059854
2nd row: 375238244
3rd row: 375034386
4th row: 375166534
5th row: 375235546

Value | Count | Frequency (%)
8094058 | 51 | 0.5%
375089704 | 47 | 0.5%
375013686 | 42 | 0.4%
371302032 | 41 | 0.4%
375053566 | 40 | 0.4%
375173635 | 38 | 0.4%
375173634 | 35 | 0.4%
375013680 | 35 | 0.4%
4518982 | 30 | 0.3%
8775972 | 28 | 0.3%
Other values (6311) | 9610 | 96.1%

Most occurring characters

ValueCountFrequency (%)
3 12678
15.1%
7 11922
14.2%
5 11705
13.9%
0 10327
12.3%
1 8798
10.5%
2 6123
7.3%
8 5908
7.0%
6 5605
6.7%
9 5525
6.6%
4 4892
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 83483
99.5%
Uppercase Letter 429
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 12678
15.2%
7 11922
14.3%
5 11705
14.0%
0 10327
12.4%
1 8798
10.5%
2 6123
7.3%
8 5908
7.1%
6 5605
6.7%
9 5525
6.6%
4 4892
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 429
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 83483
99.5%
Latin 429
 
0.5%

Most frequent character per script

Common
ValueCountFrequency (%)
3 12678
15.2%
7 11922
14.3%
5 11705
14.0%
0 10327
12.4%
1 8798
10.5%
2 6123
7.3%
8 5908
7.1%
6 5605
6.7%
9 5525
6.6%
4 4892
 
5.9%
Latin
ValueCountFrequency (%)
A 429
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 83912
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 12678
15.1%
7 11922
14.2%
5 11705
13.9%
0 10327
12.3%
1 8798
10.5%
2 6123
7.3%
8 5908
7.0%
6 5605
6.7%
9 5525
6.6%
4 4892
 
5.8%

군급분류
Real number (ℝ)

Distinct: 213
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4891.345
Minimum: 0
Maximum: 9999
Zeros: 3
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2510
Q1: 4720
Median: 5310
Q3: 5340
95-th percentile: 6220
Maximum: 9999
Range: 9999
Interquartile range (IQR): 620

Descriptive statistics

Standard deviation: 1545.5094
Coefficient of variation (CV): 0.31596819
Kurtosis: 2.0852143
Mean: 4891.345
Median Absolute Deviation (MAD): 190
Skewness: 0.049294812
Sum: 48913450
Variance: 2388599.4
Monotonicity: Not monotonic
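
Several of these figures are functions of one another, which gives a quick way to check the report. A small sketch using only numbers copied from this section; the sample size of 10000 comes from the Overview, and small differences are expected because the inputs are rounded.

```python
# Values copied from the report for 군급분류.
n = 10_000
mean = 4891.345
std = 1545.5094
q1, q3 = 4720, 5340
minimum, maximum = 0, 9999

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
total = mean * n                 # sum of all observations

print(f"CV={cv:.8f} variance={variance:.1f} IQR={iqr} range={value_range} sum={total:.0f}")
```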
Histogram with fixed size bins (bins=50)
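
A fixed-size-bin histogram splits the observed range into equally wide intervals and counts the values that fall in each. The function below is a minimal pure-Python sketch of that idea. `fixed_width_histogram` is a hypothetical helper, not the routine the profiling tool uses, and the convention that the last bin includes the maximum is an assumption.

```python
def fixed_width_histogram(values, bins):
    """Count values in `bins` equally wide intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    if lo == hi:
        # Degenerate case: every value lands in a single bin.
        return [len(values)] + [0] * (bins - 1)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum would index one past the end, so clamp it into the last bin.
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    return counts

# Illustrative values only (a few class codes that appear in this report).
example = [0, 1005, 2540, 4720, 5310, 5310, 5340, 9999]
print(fixed_width_histogram(example, bins=5))
```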
Value | Count | Frequency (%)
5310 | 1248 | 12.5%
5340 | 1158 | 11.6%
5365 | 431 | 4.3%
5305 | 411 | 4.1%
5315 | 404 | 4.0%
5330 | 403 | 4.0%
4730 | 353 | 3.5%
2590 | 309 | 3.1%
4720 | 252 | 2.5%
2540 | 248 | 2.5%
Other values (203) | 4783 | 47.8%

Value | Count | Frequency (%)
0 | 3 | < 0.1%
1005 | 67 | 0.7%
1010 | 8 | 0.1%
1015 | 86 | 0.9%
1025 | 57 | 0.6%
1030 | 3 | < 0.1%
1035 | 2 | < 0.1%
1040 | 12 | 0.1%
1055 | 1 | < 0.1%
1080 | 8 | 0.1%

Value | Count | Frequency (%)
9999 | 166 | 1.7%
9905 | 32 | 0.3%
9540 | 1 | < 0.1%
9535 | 16 | 0.2%
9520 | 1 | < 0.1%
9515 | 23 | 0.2%
9510 | 1 | < 0.1%
9505 | 6 | 0.1%
9390 | 3 | < 0.1%
9340 | 3 | < 0.1%

품명
Text

Distinct: 2039
Distinct (%): 20.4%
Missing: 9
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 31
Median length: 20
Mean length: 8.2937644
Min length: 1

Characters and Unicode

Total characters: 82863
Distinct characters: 564
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1082
Unique (%): 10.8%

Sample

1st row: 체인 조립체,단일구간용
2nd row: 덮개,입구용
3rd row: 판,수선용
4th row: 솔,소제용,포용
5th row: 전지,축전식

Value | Count | Frequency (%)
와셔,평면형 | 623 | 4.6%
머리형 | 351 | 2.6%
조립체 | 337 | 2.5%
나사,캡식,6각 | 310 | 2.3%
와셔,잠금식 | 245 | 1.8%
케이블 | 228 | 1.7%
 | 194 | 1.4%
브래킷,설치용 | 163 | 1.2%
개스킷 | 162 | 1.2%
 | 158 | 1.2%
Other values (2379) | 10644 | 79.3%

Most occurring characters

ValueCountFrequency (%)
, 10641
 
12.8%
4912
 
5.9%
3667
 
4.4%
3424
 
4.1%
2076
 
2.5%
1511
 
1.8%
1470
 
1.8%
1419
 
1.7%
1406
 
1.7%
1252
 
1.5%
Other values (554) 51085
61.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68105
82.2%
Other Punctuation 10658
 
12.9%
Space Separator 3424
 
4.1%
Decimal Number 338
 
0.4%
Dash Punctuation 159
 
0.2%
Uppercase Letter 125
 
0.2%
Close Punctuation 27
 
< 0.1%
Open Punctuation 27
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4912
 
7.2%
3667
 
5.4%
2076
 
3.0%
1511
 
2.2%
1470
 
2.2%
1419
 
2.1%
1406
 
2.1%
1252
 
1.8%
1133
 
1.7%
1062
 
1.6%
Other values (519) 48197
70.8%
Uppercase Letter
ValueCountFrequency (%)
T 34
27.2%
U 32
25.6%
S 13
 
10.4%
B 8
 
6.4%
K 4
 
3.2%
P 4
 
3.2%
C 4
 
3.2%
A 4
 
3.2%
M 3
 
2.4%
G 3
 
2.4%
Other values (9) 16
12.8%
Decimal Number
ValueCountFrequency (%)
6 312
92.3%
5 9
 
2.7%
1 7
 
2.1%
8 5
 
1.5%
3 2
 
0.6%
0 2
 
0.6%
7 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 10641
99.8%
. 11
 
0.1%
/ 4
 
< 0.1%
# 1
 
< 0.1%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
3424
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 159
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68105
82.2%
Common 14633
 
17.7%
Latin 125
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4912
 
7.2%
3667
 
5.4%
2076
 
3.0%
1511
 
2.2%
1470
 
2.2%
1419
 
2.1%
1406
 
2.1%
1252
 
1.8%
1133
 
1.7%
1062
 
1.6%
Other values (519) 48197
70.8%
Latin
ValueCountFrequency (%)
T 34
27.2%
U 32
25.6%
S 13
 
10.4%
B 8
 
6.4%
K 4
 
3.2%
P 4
 
3.2%
C 4
 
3.2%
A 4
 
3.2%
M 3
 
2.4%
G 3
 
2.4%
Other values (9) 16
12.8%
Common
ValueCountFrequency (%)
, 10641
72.7%
3424
 
23.4%
6 312
 
2.1%
- 159
 
1.1%
) 27
 
0.2%
( 27
 
0.2%
. 11
 
0.1%
5 9
 
0.1%
1 7
 
< 0.1%
8 5
 
< 0.1%
Other values (6) 11
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68105
82.2%
ASCII 14757
 
17.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 10641
72.1%
3424
 
23.2%
6 312
 
2.1%
- 159
 
1.1%
T 34
 
0.2%
U 32
 
0.2%
) 27
 
0.2%
( 27
 
0.2%
S 13
 
0.1%
. 11
 
0.1%
Other values (24) 77
 
0.5%
Hangul
ValueCountFrequency (%)
4912
 
7.2%
3667
 
5.4%
2076
 
3.0%
1511
 
2.2%
1470
 
2.2%
1419
 
2.1%
1406
 
2.1%
1252
 
1.8%
1133
 
1.7%
1062
 
1.6%
Other values (519) 48197
70.8%
None
ValueCountFrequency (%)
1
100.0%

도면번호
Text

MISSING 

Distinct: 5402
Distinct (%): 62.1%
Missing: 1301
Missing (%): 13.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 8
Mean length: 7.9942522
Min length: 4

Characters and Unicode

Total characters: 69542
Distinct characters: 28
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3826
Unique (%): 44.0%

Sample

1st row: 60341097
2nd row: 60821510
3rd row: 60604746
4th row: 61500332
5th row: 61481561

Value | Count | Frequency (%)
60799301 | 175 | 2.0%
60341540 | 100 | 1.1%
ms27183 | 67 | 0.8%
10910174 | 55 | 0.6%
60695302 | 33 | 0.4%
8690458 | 30 | 0.3%
ms51967 | 29 | 0.3%
60695503 | 28 | 0.3%
12273242 | 17 | 0.2%
20101127 | 16 | 0.2%
Other values (5396) | 8156 | 93.7%

Most occurring characters

ValueCountFrequency (%)
0 14439
20.8%
6 10411
15.0%
3 6961
10.0%
1 6618
9.5%
7 5643
 
8.1%
2 5608
 
8.1%
5 5343
 
7.7%
4 5278
 
7.6%
9 3951
 
5.7%
8 3574
 
5.1%
Other values (18) 1716
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 67826
97.5%
Uppercase Letter 1677
 
2.4%
Dash Punctuation 32
 
< 0.1%
Space Separator 7
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 445
26.5%
M 443
26.4%
Q 395
23.6%
A 332
19.8%
R 16
 
1.0%
D 11
 
0.7%
N 11
 
0.7%
C 8
 
0.5%
K 3
 
0.2%
G 3
 
0.2%
Other values (6) 10
 
0.6%
Decimal Number
ValueCountFrequency (%)
0 14439
21.3%
6 10411
15.3%
3 6961
10.3%
1 6618
9.8%
7 5643
 
8.3%
2 5608
 
8.3%
5 5343
 
7.9%
4 5278
 
7.8%
9 3951
 
5.8%
8 3574
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 67865
97.6%
Latin 1677
 
2.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 445
26.5%
M 443
26.4%
Q 395
23.6%
A 332
19.8%
R 16
 
1.0%
D 11
 
0.7%
N 11
 
0.7%
C 8
 
0.5%
K 3
 
0.2%
G 3
 
0.2%
Other values (6) 10
 
0.6%
Common
ValueCountFrequency (%)
0 14439
21.3%
6 10411
15.3%
3 6961
10.3%
1 6618
9.8%
7 5643
 
8.3%
2 5608
 
8.3%
5 5343
 
7.9%
4 5278
 
7.8%
9 3951
 
5.8%
8 3574
 
5.3%
Other values (2) 39
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 69542
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14439
20.8%
6 10411
15.0%
3 6961
10.0%
1 6618
9.5%
7 5643
 
8.1%
2 5608
 
8.1%
5 5343
 
7.7%
4 5278
 
7.6%
9 3951
 
5.7%
8 3574
 
5.1%
Other values (18) 1716
 
2.5%

도면부품번호
Text

MISSING 

Distinct: 5847
Distinct (%): 67.2%
Missing: 1301
Missing (%): 13.0%
Memory size: 156.2 KiB

Length

Max length: 27
Median length: 8
Mean length: 8.3867111
Min length: 2

Characters and Unicode

Total characters: 72956
Distinct characters: 36
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4226
Unique (%): 48.6%

Sample

1st row: 60341097
2nd row: 60821510
3rd row: 60604746-3
4th row: 61500332
5th row: 61481561

Value | Count | Frequency (%)
60799301 | 42 | 0.5%
60341540-4 | 38 | 0.4%
60341540-3 | 35 | 0.4%
60799310 | 35 | 0.4%
10910174-3 | 28 | 0.3%
60799302 | 24 | 0.3%
ms27183-13 | 20 | 0.2%
ms27183-12 | 20 | 0.2%
60799319 | 20 | 0.2%
20101127 | 16 | 0.2%
Other values (5844) | 8429 | 96.8%

Most occurring characters

ValueCountFrequency (%)
0 14216
19.5%
6 10523
14.4%
3 7285
10.0%
1 7093
9.7%
2 6080
8.3%
7 5755
7.9%
5 5510
 
7.6%
4 5446
 
7.5%
9 4048
 
5.5%
8 3708
 
5.1%
Other values (26) 3292
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69664
95.5%
Uppercase Letter 1828
 
2.5%
Dash Punctuation 1451
 
2.0%
Space Separator 8
 
< 0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 480
26.3%
M 467
25.5%
Q 390
21.3%
A 320
17.5%
C 54
 
3.0%
N 26
 
1.4%
R 18
 
1.0%
B 17
 
0.9%
D 10
 
0.5%
G 7
 
0.4%
Other values (12) 39
 
2.1%
Decimal Number
ValueCountFrequency (%)
0 14216
20.4%
6 10523
15.1%
3 7285
10.5%
1 7093
10.2%
2 6080
8.7%
7 5755
8.3%
5 5510
 
7.9%
4 5446
 
7.8%
9 4048
 
5.8%
8 3708
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
/ 2
40.0%
Dash Punctuation
ValueCountFrequency (%)
- 1451
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 71128
97.5%
Latin 1828
 
2.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 480
26.3%
M 467
25.5%
Q 390
21.3%
A 320
17.5%
C 54
 
3.0%
N 26
 
1.4%
R 18
 
1.0%
B 17
 
0.9%
D 10
 
0.5%
G 7
 
0.4%
Other values (12) 39
 
2.1%
Common
ValueCountFrequency (%)
0 14216
20.0%
6 10523
14.8%
3 7285
10.2%
1 7093
10.0%
2 6080
8.5%
7 5755
8.1%
5 5510
 
7.7%
4 5446
 
7.7%
9 4048
 
5.7%
8 3708
 
5.2%
Other values (4) 1464
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 72956
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14216
19.5%
6 10523
14.4%
3 7285
10.0%
1 7093
9.7%
2 6080
8.3%
7 5755
7.9%
5 5510
 
7.6%
4 5446
 
7.5%
9 4048
 
5.5%
8 3708
 
5.1%
Other values (26) 3292
 
4.5%

규격번호
Text

MISSING 

Distinct: 186
Distinct (%): 15.6%
Missing: 8805
Missing (%): 88.0%
Memory size: 156.2 KiB

Length

Max length: 23
Median length: 20
Mean length: 8.939749
Min length: 5

Characters and Unicode

Total characters: 10683
Distinct characters: 36
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 82
Unique (%): 6.9%

Sample

1st row: MIL-C-26482
2nd row: MS51967
3rd row: MS51967
4th row: MS17828
5th row: MS9048

Value | Count | Frequency (%)
ks | 238 | 14.1%
b | 212 | 12.6%
ansi-b18.2.1 | 182 | 10.8%
ms51831 | 81 | 4.8%
ms27183 | 79 | 4.7%
ansi-b1821 | 73 | 4.3%
1326 | 65 | 3.9%
1324 | 53 | 3.1%
ms51967 | 41 | 2.4%
1023 | 41 | 2.4%
Other values (185) | 620 | 36.8%

Most occurring characters

ValueCountFrequency (%)
1 1382
12.9%
S 1137
 
10.6%
2 896
 
8.4%
8 719
 
6.7%
3 706
 
6.6%
M 685
 
6.4%
5 496
 
4.6%
490
 
4.6%
B 474
 
4.4%
- 461
 
4.3%
Other values (26) 3237
30.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5608
52.5%
Uppercase Letter 3741
35.0%
Space Separator 490
 
4.6%
Dash Punctuation 461
 
4.3%
Other Punctuation 383
 
3.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 1137
30.4%
M 685
18.3%
B 474
12.7%
I 327
 
8.7%
A 323
 
8.6%
N 293
 
7.8%
K 241
 
6.4%
L 91
 
2.4%
C 47
 
1.3%
W 26
 
0.7%
Other values (12) 97
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 1382
24.6%
2 896
16.0%
8 719
12.8%
3 706
12.6%
5 496
 
8.8%
6 345
 
6.2%
0 337
 
6.0%
7 281
 
5.0%
4 237
 
4.2%
9 209
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 367
95.8%
/ 16
 
4.2%
Space Separator
ValueCountFrequency (%)
490
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 461
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6942
65.0%
Latin 3741
35.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 1137
30.4%
M 685
18.3%
B 474
12.7%
I 327
 
8.7%
A 323
 
8.6%
N 293
 
7.8%
K 241
 
6.4%
L 91
 
2.4%
C 47
 
1.3%
W 26
 
0.7%
Other values (12) 97
 
2.6%
Common
ValueCountFrequency (%)
1 1382
19.9%
2 896
12.9%
8 719
10.4%
3 706
10.2%
5 496
 
7.1%
490
 
7.1%
- 461
 
6.6%
. 367
 
5.3%
6 345
 
5.0%
0 337
 
4.9%
Other values (4) 743
10.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10683
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1382
12.9%
S 1137
 
10.6%
2 896
 
8.4%
8 719
 
6.7%
3 706
 
6.6%
M 685
 
6.4%
5 496
 
4.6%
490
 
4.6%
B 474
 
4.4%
- 461
 
4.3%
Other values (26) 3237
30.3%

계약업체
Text

Distinct: 311
Distinct (%): 3.1%
Missing: 42
Missing (%): 0.4%
Memory size: 156.2 KiB

Length

Max length: 42
Median length: 30
Mean length: 7.9734887
Min length: 2

Characters and Unicode

Total characters: 79400
Distinct characters: 276
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 88
Unique (%): 0.9%

Sample

1st row: 한화에어로스페이스 주식회사
2nd row: 삼호정밀
3rd row: 덕인산업
4th row: 현대로템 주식회사
5th row: 기아자동차(주)

Value | Count | Frequency (%)
주식회사 | 3617 | 26.0%
현대로템 | 963 | 6.9%
태창정밀 | 929 | 6.7%
한화에어로스페이스 | 903 | 6.5%
덕인산업 | 781 | 5.6%
한화디펜스주식회사 | 383 | 2.7%
연합정밀(주 | 372 | 2.7%
한화디펜스 | 360 | 2.6%
두방산업주식회사 | 263 | 1.9%
주)경도 | 250 | 1.8%
Other values (347) | 5115 | 36.7%

Most occurring characters

ValueCountFrequency (%)
6863
 
8.6%
5003
 
6.3%
4794
 
6.0%
4750
 
6.0%
3992
 
5.0%
3611
 
4.5%
( 2158
 
2.7%
) 2158
 
2.7%
2109
 
2.7%
1999
 
2.5%
Other values (266) 41963
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69082
87.0%
Space Separator 3992
 
5.0%
Open Punctuation 2158
 
2.7%
Close Punctuation 2158
 
2.7%
Uppercase Letter 1690
 
2.1%
Other Punctuation 210
 
0.3%
Lowercase Letter 108
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6863
 
9.9%
5003
 
7.2%
4794
 
6.9%
4750
 
6.9%
3611
 
5.2%
2109
 
3.1%
1999
 
2.9%
1960
 
2.8%
1871
 
2.7%
1858
 
2.7%
Other values (220) 34264
49.6%
Uppercase Letter
ValueCountFrequency (%)
C 196
 
11.6%
A 141
 
8.3%
I 126
 
7.5%
N 124
 
7.3%
S 114
 
6.7%
H 112
 
6.6%
E 110
 
6.5%
R 100
 
5.9%
O 84
 
5.0%
T 82
 
4.9%
Other values (14) 501
29.6%
Lowercase Letter
ValueCountFrequency (%)
r 23
21.3%
e 19
17.6%
b 14
13.0%
p 10
9.3%
u 7
 
6.5%
i 7
 
6.5%
n 6
 
5.6%
o 6
 
5.6%
s 5
 
4.6%
t 5
 
4.6%
Other values (5) 6
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 135
64.3%
& 68
32.4%
, 7
 
3.3%
Space Separator
ValueCountFrequency (%)
3992
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2158
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2158
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69082
87.0%
Common 8520
 
10.7%
Latin 1798
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6863
 
9.9%
5003
 
7.2%
4794
 
6.9%
4750
 
6.9%
3611
 
5.2%
2109
 
3.1%
1999
 
2.9%
1960
 
2.8%
1871
 
2.7%
1858
 
2.7%
Other values (220) 34264
49.6%
Latin
ValueCountFrequency (%)
C 196
 
10.9%
A 141
 
7.8%
I 126
 
7.0%
N 124
 
6.9%
S 114
 
6.3%
H 112
 
6.2%
E 110
 
6.1%
R 100
 
5.6%
O 84
 
4.7%
T 82
 
4.6%
Other values (29) 609
33.9%
Common
ValueCountFrequency (%)
3992
46.9%
( 2158
25.3%
) 2158
25.3%
. 135
 
1.6%
& 68
 
0.8%
, 7
 
0.1%
- 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69082
87.0%
ASCII 10318
 
13.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6863
 
9.9%
5003
 
7.2%
4794
 
6.9%
4750
 
6.9%
3611
 
5.2%
2109
 
3.1%
1999
 
2.9%
1960
 
2.8%
1871
 
2.7%
1858
 
2.7%
Other values (220) 34264
49.6%
ASCII
ValueCountFrequency (%)
3992
38.7%
( 2158
20.9%
) 2158
20.9%
C 196
 
1.9%
A 141
 
1.4%
. 135
 
1.3%
I 126
 
1.2%
N 124
 
1.2%
S 114
 
1.1%
H 112
 
1.1%
Other values (36) 1062
 
10.3%

최종수정일
Date

MISSING 

Distinct: 2
Distinct (%): 0.1%
Missing: 7404
Missing (%): 74.0%
Memory size: 156.2 KiB
Minimum: 2021-06-14 00:00:00
Maximum: 2021-07-08 00:00:00
Histogram with fixed size bins (bins=2)

Interactions


Correlations

Variable | 사업명 | 군급분류 | 최종수정일
사업명 | 1.000 | 0.473 | 0.077
군급분류 | 0.473 | 1.000 | 0.000
최종수정일 | 0.077 | 0.000 | 1.000

Variable | 군급분류 | 사업명
군급분류 | 1.000 | 0.191
사업명 | 0.191 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
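
Nullity correlation can be illustrated by converting each column to a 0/1 missing indicator and correlating the indicators. The sketch below uses the ten rows of the second table in the Sample section, restricted to 도면번호, 도면부품번호 and 규격번호. `pearson` is a small hypothetical helper written out here, and using Pearson correlation on the indicators is an assumption about how the heatmap is computed.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equally long numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)

# (도면번호, 도면부품번호, 규격번호) for ten sample rows; None marks <NA>.
rows = [
    ("MS27183", "MS27183-13", None),
    (None, None, "MS51854-10SS"),
    ("60635112", "60635112", None),
    (None, None, "MS27183"),
    ("40006427", "40006427", None),
    ("A60045964", "A60045964", None),
    ("60344610", "60344610", None),
    ("60605023", "60605023", None),
    ("8724783", "8724785", None),
    ("60820153", "60820153", None),
]

# 1 where the cell is missing, 0 where it is present.
drawing, part, spec = ([int(r[i] is None) for r in rows] for i in range(3))

print(round(pearson(drawing, part), 3))
print(round(pearson(drawing, spec), 3))
```

In these rows 도면번호 and 도면부품번호 are missing together, while 규격번호 is present exactly when they are absent, so the two coefficients come out close to 1 and -1. The full dataset need not show such a clean pattern.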

Sample

Index | 사업명 | 부품관리번호 | 재고번호 | 군급분류 | 품명 | 도면번호 | 도면부품번호 | 규격번호 | 계약업체 | 최종수정일
31037 | K9A1 자주포 성능개량 | M51273627 | 375059854 | 4010 | 체인 조립체,단일구간용 | 60341097 | 60341097 | <NA> | 한화에어로스페이스 주식회사 | <NA>
11508 | K21 보병전투차량 | M00002580 | 375238244 | 5340 | 덮개,입구용 | 60821510 | 60821510 | <NA> | 삼호정밀 | <NA>
8859 | 120mm 자주박격포 | M00060994 | 375034386 | 5340 | 판,수선용 | 60604746 | 60604746-3 | <NA> | 덕인산업 | 2021-06-14
323 | K-2 전차 | M00043147 | 375166534 | 1015 | 솔,소제용,포용 | 61500332 | 61500332 | <NA> | 현대로템 주식회사 | <NA>
16515 | 소형전술차량 | M51365366 | 375235546 | 6140 | 전지,축전식 | 61481561 | 61481561 | <NA> | 기아자동차(주) | 2021-06-14
20538 | K9 자주포 | M00027371 | 375173635 | 5310 | 와셔,잠금식 | 60341540 | 60341540-4 | <NA> | 한화에어로스페이스 주식회사 | <NA>
18510 | K9 자주포 | M51939792 | 375159041 | 3020 | 풀리,홈통식 | Q25028120 | Q25028120 | <NA> | 한화디펜스 주식회사 | <NA>
2350 | K1A1전차 성능개량 | M51317910 | 375011655 | 2540 | 판조립체,의자지지용 | 60703215 | 60703215 | <NA> | 현대로템 주식회사 | <NA>
14306 | 보병탑승차량 | M00015895 | 371600827 | 5310 | 와셔,잠금식 | 60501654 | 60501480 | <NA> | 태창정밀 | <NA>
16411 | K9 자주포 | M00028126 | 14375606 | 3120 | 베어링,플레인형,로드 엔드형 | 60359727 | 60359727 | <NA> | 한화에어로스페이스 주식회사 | <NA>

Index | 사업명 | 부품관리번호 | 재고번호 | 군급분류 | 품명 | 도면번호 | 도면부품번호 | 규격번호 | 계약업체 | 최종수정일
18520 | K9 자주포 | M00005711 | 877493 | 5310 | 와셔,평면형 | MS27183 | MS27183-13 | <NA> | 태창정밀 | <NA>
3375 | 장애물개척전차 | M51896608 | 10827857 | 4730 | T형관,튜브용 | <NA> | <NA> | MS51854-10SS | 현대정밀가스켓 | <NA>
12680 | 경구난차량 | M00003087 | 7818119 | 2540 | 브래킷 조립체 | 60635112 | 60635112 | <NA> | 덕인산업 | 2021-06-14
22846 | K10 탄약운반장갑차 | M00005708 | 8094058 | 5310 | 와셔,평면형 | <NA> | <NA> | MS27183 | 태창정밀 | <NA>
24599 | K56 탄약운반장갑차 | M51124737 | 375023631 | 5855 | 장치대 조립체,관찰기용 | 40006427 | 40006427 | <NA> | 한화디펜스 주식회사 | <NA>
11765 | 경구난차량 | M00014164 | 375208472 | 5930 | 덮개,전기 스위치용 | A60045964 | A60045964 | <NA> | 한화디펜스 주식회사 | <NA>
30473 | K9A1 자주포 성능개량 | M00027472 | 375060177 | 5340 | 브래킷,설치용 | 60344610 | 60344610 | <NA> | 평화정밀 | 2021-06-14
14977 | 보병탑승차량 | M00061002 | 6821651 | 5340 | U자형 연결기,로드엔드용 | 60605023 | 60605023 | <NA> | 덕인산업 | 2021-06-14
25839 | K55 자주포 성능개량 | M00066085 | 5351994 | 4720 | 호스,공기통용 | 8724783 | 8724785 | <NA> | 한화에어로스페이스 주식회사 | <NA>
11922 | 경구난차량 | M51354109 | 375187300 | 5340 | 볼트,잠금용 | 60820153 | 60820153 | <NA> | 우창기계 주식회사 | <NA>

Duplicate rows

Most frequently occurring

Index | 사업명 | 부품관리번호 | 재고번호 | 군급분류 | 품명 | 도면번호 | 도면부품번호 | 규격번호 | 계약업체 | 최종수정일 | # duplicates
155 | K1전차 성능개량 | M00043011 | 375013686 | 5310 | 와셔,평면형 | 60799301 | 60799301 | <NA> | 태창정밀 | <NA> | 18
337 | K56 탄약운반장갑차 | M51111665 | 375089704 | 5310 | 와셔,평면형 | <NA> | <NA> | KS B 1326 | 에프투텔레콤(주) | <NA> | 18
107 | K1A1전차 성능개량 | M00043011 | 375013686 | 5310 | 와셔,평면형 | 60799301 | 60799301 | <NA> | 태창정밀 | <NA> | 17
311 | K56 탄약운반장갑차 | M00027371 | 375173635 | 5310 | 와셔,잠금식 | 60341540 | 60341540-4 | <NA> | 한화에어로스페이스 주식회사 | <NA> | 15
199 | K1전차 성능개량 | M51345522 | 375013680 | 5310 | 와셔,평면형 | 60799301 | 60799310 | <NA> | 태창정밀 | <NA> | 14
238 | K21 보병전투차량 | M51092638 | 371302032 | 5306 | 나사,십자홈식 | <NA> | <NA> | KS B 1023 | 아시아자동차공업(주) | <NA> | 14
608 | 경구난차량 | M51092638 | 371302032 | 5306 | 나사,십자홈식 | <NA> | <NA> | KS B 1023 | 아시아자동차공업(주) | <NA> | 14
615 | 경구난차량 | M51285520 | 37A229978 | 5310 | 너트 | <NA> | <NA> | KS B 1012 | <NA> | 2021-06-14 | 14
50 | K10 탄약운반장갑차 | M00027371 | 375173635 | 5310 | 와셔,잠금식 | 60341540 | 60341540-4 | <NA> | 한화에어로스페이스 주식회사 | <NA> | 13
310 | K56 탄약운반장갑차 | M00027370 | 375173634 | 5310 | 와셔,잠금식 | 60341540 | 60341540-3 | <NA> | 한화에어로스페이스 주식회사 | <NA> | 13
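
Duplicate rows can be found by counting how often each complete row occurs. A minimal standard-library sketch follows. The three-field rows are abbreviated, illustrative records rather than the full ten-column rows, and how the report derives its headline count of 777 is not stated here.

```python
from collections import Counter

# Illustrative rows as (사업명, 부품관리번호, 품명); real rows have ten fields.
rows = [
    ("K1전차 성능개량", "M00043011", "와셔,평면형"),
    ("K1전차 성능개량", "M00043011", "와셔,평면형"),
    ("K1A1전차 성능개량", "M00043011", "와셔,평면형"),
    ("K1전차 성능개량", "M00043011", "와셔,평면형"),
    ("경구난차량", "M51285520", "너트"),
    ("경구난차량", "M51285520", "너트"),
]

row_counts = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicates = [(row, n) for row, n in row_counts.most_common() if n > 1]

for row, n in duplicates:
    print(n, row)
```

Rows are compared as whole tuples, so two records that differ in any single field, such as 사업명 here, are counted separately.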