Overview

Dataset statistics

Number of variables7
Number of observations890
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory49.7 KiB
Average record size in memory57.1 B

Variable types

Text1
DateTime2
Numeric1
Boolean2
Categorical1

Dataset

Description한국기계연구원의 연구관리 분야에서 사후_미발생품상세관리 테이블 정보( 발생품명, 발생일자, 취득금액, 통합여부, 관리대상여부 등을 관리)
URLhttps://www.data.go.kr/data/15078099/fileData.do

Alerts

통합여부 has constant value ""Constant
관리대상여부 has constant value ""Constant
보관장소 has constant value ""Constant
작성일 has constant value ""Constant

Reproduction

Analysis started2023-12-12 17:19:09.547728
Analysis finished2023-12-12 17:19:10.137836
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct817
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
2023-12-13T02:19:10.367831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length44
Mean length20.44382
Min length1

Characters and Unicode

Total characters18195
Distinct characters493
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique774 ?
Unique (%)87.0%

Sample

1st rowZT Module-004-J00 (DeepCeave Module)
2nd rowBottle wash multi-lingual외 48건
3rd row100nm 급 선폭의 라인패턴 Si 마스터
4th row300mm 니켈 원통 금형
5th rowSU-8 TF 6001
ValueCountFrequency (%)
141
 
3.8%
77
 
2.1%
모듈 47
 
1.3%
1건 39
 
1.0%
레이저 34
 
0.9%
ded 33
 
0.9%
제작 27
 
0.7%
pbf 26
 
0.7%
2건 26
 
0.7%
3d프린팅 24
 
0.6%
Other values (1752) 3251
87.3%
2023-12-13T02:19:10.914249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2844
 
15.6%
e 505
 
2.8%
r 333
 
1.8%
o 326
 
1.8%
t 326
 
1.8%
a 325
 
1.8%
i 318
 
1.7%
0 278
 
1.5%
D 277
 
1.5%
n 273
 
1.5%
Other values (483) 12390
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6783
37.3%
Lowercase Letter 4106
22.6%
Space Separator 2845
15.6%
Uppercase Letter 2446
 
13.4%
Decimal Number 1251
 
6.9%
Other Punctuation 209
 
1.1%
Open Punctuation 206
 
1.1%
Close Punctuation 205
 
1.1%
Dash Punctuation 116
 
0.6%
Connector Punctuation 17
 
0.1%
Other values (2) 11
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
255
 
3.8%
227
 
3.3%
212
 
3.1%
165
 
2.4%
162
 
2.4%
161
 
2.4%
100
 
1.5%
99
 
1.5%
98
 
1.4%
96
 
1.4%
Other values (391) 5208
76.8%
Lowercase Letter
ValueCountFrequency (%)
e 505
12.3%
r 333
 
8.1%
o 326
 
7.9%
t 326
 
7.9%
a 325
 
7.9%
i 318
 
7.7%
n 273
 
6.6%
m 264
 
6.4%
l 253
 
6.2%
s 214
 
5.2%
Other values (18) 969
23.6%
Uppercase Letter
ValueCountFrequency (%)
D 277
 
11.3%
S 207
 
8.5%
P 190
 
7.8%
C 166
 
6.8%
E 162
 
6.6%
M 141
 
5.8%
A 128
 
5.2%
T 123
 
5.0%
L 123
 
5.0%
F 123
 
5.0%
Other values (17) 806
33.0%
Other Punctuation
ValueCountFrequency (%)
, 69
33.0%
/ 54
25.8%
. 42
20.1%
& 11
 
5.3%
" 11
 
5.3%
* 7
 
3.3%
# 7
 
3.3%
: 4
 
1.9%
% 2
 
1.0%
1
 
0.5%
Decimal Number
ValueCountFrequency (%)
0 278
22.2%
1 246
19.7%
2 179
14.3%
3 166
13.3%
5 114
9.1%
4 98
 
7.8%
6 73
 
5.8%
8 47
 
3.8%
7 34
 
2.7%
9 16
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 164
79.6%
[ 33
 
16.0%
5
 
2.4%
4
 
1.9%
Close Punctuation
ValueCountFrequency (%)
) 163
79.5%
] 33
 
16.1%
5
 
2.4%
4
 
2.0%
Math Symbol
ValueCountFrequency (%)
+ 8
80.0%
~ 1
 
10.0%
= 1
 
10.0%
Space Separator
ValueCountFrequency (%)
2844
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 17
100.0%
Other Symbol
ValueCountFrequency (%)
° 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6783
37.3%
Latin 6551
36.0%
Common 4861
26.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
255
 
3.8%
227
 
3.3%
212
 
3.1%
165
 
2.4%
162
 
2.4%
161
 
2.4%
100
 
1.5%
99
 
1.5%
98
 
1.4%
96
 
1.4%
Other values (391) 5208
76.8%
Latin
ValueCountFrequency (%)
e 505
 
7.7%
r 333
 
5.1%
o 326
 
5.0%
t 326
 
5.0%
a 325
 
5.0%
i 318
 
4.9%
D 277
 
4.2%
n 273
 
4.2%
m 264
 
4.0%
l 253
 
3.9%
Other values (44) 3351
51.2%
Common
ValueCountFrequency (%)
2844
58.5%
0 278
 
5.7%
1 246
 
5.1%
2 179
 
3.7%
3 166
 
3.4%
( 164
 
3.4%
) 163
 
3.4%
- 116
 
2.4%
5 114
 
2.3%
4 98
 
2.0%
Other values (28) 493
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11381
62.6%
Hangul 6783
37.3%
None 30
 
0.2%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2844
25.0%
e 505
 
4.4%
r 333
 
2.9%
o 326
 
2.9%
t 326
 
2.9%
a 325
 
2.9%
i 318
 
2.8%
0 278
 
2.4%
D 277
 
2.4%
n 273
 
2.4%
Other values (72) 5576
49.0%
Hangul
ValueCountFrequency (%)
255
 
3.8%
227
 
3.3%
212
 
3.1%
165
 
2.4%
162
 
2.4%
161
 
2.4%
100
 
1.5%
99
 
1.5%
98
 
1.4%
96
 
1.4%
Other values (391) 5208
76.8%
None
ValueCountFrequency (%)
Ø 8
26.7%
5
16.7%
5
16.7%
4
13.3%
4
13.3%
° 1
 
3.3%
  1
 
3.3%
1
 
3.3%
ø 1
 
3.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct441
Distinct (%)49.6%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
Minimum2017-02-16 00:00:00
Maximum2022-11-22 00:00:00
2023-12-13T02:19:11.075608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:11.238814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

취득금액
Real number (ℝ)

Distinct644
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5730791.4
Minimum1001000
Maximum1.8095 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.0 KiB
2023-12-13T02:19:11.385453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1001000
5-th percentile1148670
Q11878640
median2860000
Q36812250
95-th percentile17961075
Maximum1.8095 × 108
Range1.79949 × 108
Interquartile range (IQR)4933610

Descriptive statistics

Standard deviation9986719.2
Coefficient of variation (CV)1.7426422
Kurtosis130.9735
Mean5730791.4
Median Absolute Deviation (MAD)1347274.5
Skewness9.4854852
Sum5.1004044 × 109
Variance9.9734561 × 1013
MonotonicityNot monotonic
2023-12-13T02:19:11.521106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8910000 16
 
1.8%
2200000 15
 
1.7%
2970000 11
 
1.2%
2860000 10
 
1.1%
1100000 10
 
1.1%
2750000 9
 
1.0%
1650000 8
 
0.9%
7920000 7
 
0.8%
2090000 7
 
0.8%
1430000 7
 
0.8%
Other values (634) 790
88.8%
ValueCountFrequency (%)
1001000 1
0.1%
1012000 1
0.1%
1013600 1
0.1%
1013750 1
0.1%
1023000 2
0.2%
1025200 1
0.1%
1034000 1
0.1%
1039500 1
0.1%
1042800 1
0.1%
1045000 1
0.1%
ValueCountFrequency (%)
180950000 1
0.1%
97900000 1
0.1%
91300000 1
0.1%
88000000 1
0.1%
83000000 1
0.1%
69300000 1
0.1%
55000000 1
0.1%
46200000 1
0.1%
21450000 1
0.1%
20900000 1
0.1%

통합여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1022.0 B
False
890 
ValueCountFrequency (%)
False 890
100.0%
2023-12-13T02:19:11.711554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

관리대상여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1022.0 B
False
890 
ValueCountFrequency (%)
False 890
100.0%
2023-12-13T02:19:11.800377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

보관장소
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
동일장소
890 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동일장소
2nd row동일장소
3rd row동일장소
4th row동일장소
5th row동일장소

Common Values

ValueCountFrequency (%)
동일장소 890
100.0%

Length

2023-12-13T02:19:11.910283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:19:12.018764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동일장소 890
100.0%

작성일
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
Minimum2023-07-28 00:00:00
Maximum2023-07-28 00:00:00
2023-12-13T02:19:12.105417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:12.194585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T02:19:09.839213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T02:19:09.970865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:19:10.089871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발생품명발생일자취득금액통합여부관리대상여부보관장소작성일
0ZT Module-004-J00 (DeepCeave Module)2021-12-208910000NN동일장소2023-07-28
1Bottle wash multi-lingual외 48건2021-12-101837340NN동일장소2023-07-28
2100nm 급 선폭의 라인패턴 Si 마스터2019-08-085193100NN동일장소2023-07-28
3300mm 니켈 원통 금형2019-01-1117325000NN동일장소2023-07-28
4SU-8 TF 60012019-08-301276000NN동일장소2023-07-28
5OrmoDev2019-08-301221000NN동일장소2023-07-28
6Temp Sensor외 11건2019-08-012990900NN동일장소2023-07-28
7폭 300mm급 나노패턴 금속스탬프 제작2019-08-128910000NN동일장소2023-07-28
8rotation stage2019-06-202271600NN동일장소2023-07-28
9fs 레이저용 시그널 타워2019-07-241222100NN동일장소2023-07-28
발생품명발생일자취득금액통합여부관리대상여부보관장소작성일
880탈기막 장치 모듈 및 하우징2021-09-1510806000NN동일장소2023-07-28
881탈기막 장치2021-08-104510000NN동일장소2023-07-28
882미터링 펌프 외 1건2021-12-152456300NN동일장소2023-07-28
883UNION CROSS외 25건2021-12-283608000NN동일장소2023-07-28
884UNION CROSS외 25건2021-12-281332000NN동일장소2023-07-28
885공정 시뮬레이션 소프트웨어 업그레이드2019-05-286151200NN동일장소2023-07-28
886파우더피딩량 조절파트2019-11-271106712NN동일장소2023-07-28
887SB-10(U-JIN) 외 3종(이준희)2019-11-131984400NN동일장소2023-07-28
888DED 3D 프린팅용 SKD11 테스트 시편2019-10-161650000NN동일장소2023-07-28
889적층조건 테스트용 프로세싱 헤드2019-07-035907000NN동일장소2023-07-28