Overview

Dataset statistics

Number of variables5
Number of observations88
Missing cells79
Missing cells (%)18.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory42.5 B

Variable types

Numeric1
Text2
Categorical1
DateTime1

Dataset

Description- 방산업체명과 지정일자 목록을 제공함- 순번, 업체명, 지정일자, 비고 항목을 제공하며, 지정일자는 최초 지정일자 기준임
Author방위사업청
URLhttps://www.data.go.kr/data/15081929/fileData.do

Alerts

비고 has 79 (89.8%) missing valuesMissing
순번 has unique valuesUnique
업체명 has unique valuesUnique

Reproduction

Analysis started2024-04-13 12:18:28.140957
Analysis finished2024-04-13 12:18:31.108291
Duration2.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct88
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.5
Minimum1
Maximum88
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size920.0 B
2024-04-13T21:18:31.253057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.35
Q122.75
median44.5
Q366.25
95-th percentile83.65
Maximum88
Range87
Interquartile range (IQR)43.5

Descriptive statistics

Standard deviation25.547342
Coefficient of variation (CV)0.57409757
Kurtosis-1.2
Mean44.5
Median Absolute Deviation (MAD)22
Skewness0
Sum3916
Variance652.66667
MonotonicityStrictly increasing
2024-04-13T21:18:31.622583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
46 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
59 1
 
1.1%
Other values (78) 78
88.6%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%
80 1
1.1%
79 1
1.1%

업체명
Text

UNIQUE 

Distinct88
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size832.0 B
2024-04-13T21:18:32.626930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13.5
Mean length5.2045455
Min length2

Characters and Unicode

Total characters458
Distinct characters158
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)100.0%

Sample

1st row강남
2nd row경주전장
3rd row광림
4th row극동통신
5th row금호타이어
ValueCountFrequency (%)
강남 1
 
1.1%
경주전장 1
 
1.1%
풍산 1
 
1.1%
포스코특수강 1
 
1.1%
평화산업 1
 
1.1%
퍼스텍 1
 
1.1%
티에스택 1
 
1.1%
크로시스 1
 
1.1%
코리아일렉콤(구,한림에스티 1
 
1.1%
케이에스피 1
 
1.1%
Other values (78) 78
88.6%
2024-04-13T21:18:34.024026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
4.1%
17
 
3.7%
16
 
3.5%
12
 
2.6%
12
 
2.6%
11
 
2.4%
11
 
2.4%
10
 
2.2%
9
 
2.0%
8
 
1.7%
Other values (148) 333
72.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 428
93.4%
Uppercase Letter 18
 
3.9%
Other Punctuation 5
 
1.1%
Open Punctuation 3
 
0.7%
Close Punctuation 3
 
0.7%
Space Separator 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
4.4%
17
 
4.0%
16
 
3.7%
12
 
2.8%
12
 
2.8%
11
 
2.6%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
Other values (134) 303
70.8%
Uppercase Letter
ValueCountFrequency (%)
S 6
33.3%
T 4
22.2%
X 2
 
11.1%
L 2
 
11.1%
N 1
 
5.6%
G 1
 
5.6%
I 1
 
5.6%
F 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 2
40.0%
& 2
40.0%
. 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 428
93.4%
Latin 18
 
3.9%
Common 12
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
4.4%
17
 
4.0%
16
 
3.7%
12
 
2.8%
12
 
2.8%
11
 
2.6%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
Other values (134) 303
70.8%
Latin
ValueCountFrequency (%)
S 6
33.3%
T 4
22.2%
X 2
 
11.1%
L 2
 
11.1%
N 1
 
5.6%
G 1
 
5.6%
I 1
 
5.6%
F 1
 
5.6%
Common
ValueCountFrequency (%)
( 3
25.0%
) 3
25.0%
, 2
16.7%
& 2
16.7%
1
 
8.3%
. 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 428
93.4%
ASCII 30
 
6.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
4.4%
17
 
4.0%
16
 
3.7%
12
 
2.8%
12
 
2.8%
11
 
2.6%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
Other values (134) 303
70.8%
ASCII
ValueCountFrequency (%)
S 6
20.0%
T 4
13.3%
( 3
10.0%
) 3
10.0%
, 2
 
6.7%
X 2
 
6.7%
L 2
 
6.7%
& 2
 
6.7%
N 1
 
3.3%
G 1
 
3.3%
Other values (4) 4
13.3%

분야
Categorical

Distinct10
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Memory size832.0 B
통신전자
14 
기동
13 
화력
13 
기타
13 
함정
Other values (5)
26 

Length

Max length4
Median length2
Mean length2.5454545
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row함정
2nd row항공
3rd row기동
4th row유도
5th row항공

Common Values

ValueCountFrequency (%)
통신전자 14
15.9%
기동 13
14.8%
화력 13
14.8%
기타 13
14.8%
함정 9
10.2%
항공유도 9
10.2%
탄약 8
9.1%
항공 4
 
4.5%
유도 3
 
3.4%
화생방 2
 
2.3%

Length

2024-04-13T21:18:34.470406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:18:34.871452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
통신전자 14
15.9%
기동 13
14.8%
화력 13
14.8%
기타 13
14.8%
함정 9
10.2%
항공유도 9
10.2%
탄약 8
9.1%
항공 4
 
4.5%
유도 3
 
3.4%
화생방 2
 
2.3%
Distinct61
Distinct (%)69.3%
Missing0
Missing (%)0.0%
Memory size832.0 B
Minimum1972-05-01 00:00:00
Maximum2007-09-01 00:00:00
2024-04-13T21:18:35.298356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:18:35.715784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

비고
Text

MISSING 

Distinct9
Distinct (%)100.0%
Missing79
Missing (%)89.8%
Memory size832.0 B
2024-04-13T21:18:36.375933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length11.555556
Min length5

Characters and Unicode

Total characters104
Distinct characters57
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)100.0%

Sample

1st row사명변경(모트롤, 엠엔씨솔루션)
2nd row사명변경(두산에너빌리티)
3rd row비츠로밀텍
4th row사명변경(현대위아)
5th row사명변경(유아이헬리콥터)
ValueCountFrequency (%)
사명변경(모트롤 1
9.1%
엠엔씨솔루션 1
9.1%
사명변경(두산에너빌리티 1
9.1%
비츠로밀텍 1
9.1%
사명변경(현대위아 1
9.1%
사명변경(유아이헬리콥터 1
9.1%
사명변경(hj중공업 1
9.1%
한화에어로스페이스 1
9.1%
한화디펜스합병 1
9.1%
사명변경(ls엠트론 1
9.1%
2024-04-13T21:18:37.434226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
5.8%
) 6
 
5.8%
6
 
5.8%
6
 
5.8%
( 6
 
5.8%
6
 
5.8%
3
 
2.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (47) 59
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 81
77.9%
Uppercase Letter 7
 
6.7%
Close Punctuation 6
 
5.8%
Open Punctuation 6
 
5.8%
Space Separator 2
 
1.9%
Other Punctuation 2
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
7.4%
6
 
7.4%
6
 
7.4%
6
 
7.4%
3
 
3.7%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (37) 44
54.3%
Uppercase Letter
ValueCountFrequency (%)
S 2
28.6%
N 1
14.3%
H 1
14.3%
J 1
14.3%
L 1
14.3%
T 1
14.3%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 81
77.9%
Common 16
 
15.4%
Latin 7
 
6.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
7.4%
6
 
7.4%
6
 
7.4%
6
 
7.4%
3
 
3.7%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (37) 44
54.3%
Latin
ValueCountFrequency (%)
S 2
28.6%
N 1
14.3%
H 1
14.3%
J 1
14.3%
L 1
14.3%
T 1
14.3%
Common
ValueCountFrequency (%)
) 6
37.5%
( 6
37.5%
2
 
12.5%
, 2
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 81
77.9%
ASCII 23
 
22.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
 
7.4%
6
 
7.4%
6
 
7.4%
6
 
7.4%
3
 
3.7%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (37) 44
54.3%
ASCII
ValueCountFrequency (%)
) 6
26.1%
( 6
26.1%
2
 
8.7%
, 2
 
8.7%
S 2
 
8.7%
N 1
 
4.3%
H 1
 
4.3%
J 1
 
4.3%
L 1
 
4.3%
T 1
 
4.3%

Interactions

2024-04-13T21:18:30.611806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-13T21:18:37.692733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번업체명분야지정일자비고
순번1.0001.0000.4600.4511.000
업체명1.0001.0001.0001.0001.000
분야0.4601.0001.0000.0001.000
지정일자0.4511.0000.0001.0001.000
비고1.0001.0001.0001.0001.000
2024-04-13T21:18:37.957473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번분야
순번1.0000.152
분야0.1521.000

Missing values

2024-04-13T21:18:30.855002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-13T21:18:31.036133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번업체명분야지정일자비고
01강남함정1975-10-01<NA>
12경주전장항공2004-03-01<NA>
23광림기동1988-01-01<NA>
34극동통신유도2002-12-01<NA>
45금호타이어항공1975-10-01<NA>
56기아기동1973-04-01<NA>
67다윈프릭션항공유도2003-05-01<NA>
78다이모스기동1997-10-01<NA>
89단암시스템즈항공유도2007-02-01<NA>
910대동기어화력1978-12-01<NA>
순번업체명분야지정일자비고
7879현대로템기동1978-12-01<NA>
7980현대제이콤통신전자2001-07-01<NA>
8081현대중공업함정1976-03-01<NA>
8182휴니드테크놀러지스통신전자1973-04-01<NA>
8283LIG넥스원항공유도1976-06-01<NA>
8384LS전선기동1984-05-01사명변경(LS엠트론)
8485S&T대우화력1982-04-01<NA>
8586S&T중공업화력1973-04-01SNT중공업
8687STX엔진기동1977-07-01<NA>
8788STX조선함정2007-09-01<NA>