Overview

Dataset statistics

Number of variables19
Number of observations148
Missing cells1000
Missing cells (%)35.6%
Duplicate rows4
Duplicate rows (%)2.7%
Total size in memory22.1 KiB
Average record size in memory152.9 B

Variable types

Categorical1
Unsupported18

Dataset

Description표본은 산업분류상 제조업인 종업원 5인 이상 300인 이하의 중소업체를 산업중분류 및 종업원규모별로 층화한 후, 층화단순 임의추출법(Stratified Simple Random Sampling)에 의하여 추출, 구성하였습니다. 본 조사는 3, 6, 9, 12월 1일에서 15일 사이에 정기적으로 실시되고 있습니다.
URLhttps://www.data.go.kr/data/15002001/fileData.do

Alerts

Dataset has 4 (2.7%) duplicate rowsDuplicates
Unnamed: 1 has 3 (2.0%) missing valuesMissing
Unnamed: 2 has 22 (14.9%) missing valuesMissing
Unnamed: 3 has 55 (37.2%) missing valuesMissing
Unnamed: 4 has 58 (39.2%) missing valuesMissing
Unnamed: 5 has 56 (37.8%) missing valuesMissing
Unnamed: 6 has 58 (39.2%) missing valuesMissing
Unnamed: 7 has 95 (64.2%) missing valuesMissing
Unnamed: 8 has 97 (65.5%) missing valuesMissing
Unnamed: 9 has 42 (28.4%) missing valuesMissing
Unnamed: 10 has 44 (29.7%) missing valuesMissing
Unnamed: 11 has 43 (29.1%) missing valuesMissing
Unnamed: 12 has 44 (29.7%) missing valuesMissing
Unnamed: 13 has 62 (41.9%) missing valuesMissing
Unnamed: 14 has 65 (43.9%) missing valuesMissing
Unnamed: 15 has 63 (42.6%) missing valuesMissing
Unnamed: 16 has 65 (43.9%) missing valuesMissing
Unnamed: 17 has 63 (42.6%) missing valuesMissing
Unnamed: 18 has 65 (43.9%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 02:00:44.102723
Analysis finished2023-12-12 02:00:45.207614
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct43
Distinct (%)29.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
3/4
35 
4/4
35 
2/4
34 
<NA>
'87. 1/4
 
1
Other values (38)
38 

Length

Max length9
Median length3
Mean length4.3108108
Min length3

Unique

Unique39 ?
Unique (%)26.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row'80. 1/4

Common Values

ValueCountFrequency (%)
3/4 35
23.6%
4/4 35
23.6%
2/4 34
23.0%
<NA> 5
 
3.4%
'87. 1/4 1
 
0.7%
'88. 1/4 1
 
0.7%
'81. 1/4 1
 
0.7%
'82. 1/4 1
 
0.7%
'83. 1/4 1
 
0.7%
'84. 1/4 1
 
0.7%
Other values (33) 33
22.3%

Length

2023-12-12T11:00:45.298015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3/4 36
19.4%
1/4 36
19.4%
2/4 36
19.4%
4/4 35
18.8%
na 5
 
2.7%
15 3
 
1.6%
12 1
 
0.5%
11 1
 
0.5%
99 1
 
0.5%
09 1
 
0.5%
Other values (31) 31
16.7%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)2.0%
Memory size1.3 KiB

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing22
Missing (%)14.9%
Memory size1.3 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing55
Missing (%)37.2%
Memory size1.3 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing58
Missing (%)39.2%
Memory size1.3 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing56
Missing (%)37.8%
Memory size1.3 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing58
Missing (%)39.2%
Memory size1.3 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing95
Missing (%)64.2%
Memory size1.3 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing97
Missing (%)65.5%
Memory size1.3 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing42
Missing (%)28.4%
Memory size1.3 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing44
Missing (%)29.7%
Memory size1.3 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing43
Missing (%)29.1%
Memory size1.3 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing44
Missing (%)29.7%
Memory size1.3 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing62
Missing (%)41.9%
Memory size1.3 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)43.9%
Memory size1.3 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing63
Missing (%)42.6%
Memory size1.3 KiB

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)43.9%
Memory size1.3 KiB

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing63
Missing (%)42.6%
Memory size1.3 KiB

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)43.9%
Memory size1.3 KiB

Missing values

2023-12-12T11:00:44.294359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:00:44.645769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T11:00:44.946037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

전반적인 경기 BSI 추이Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18
0<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1<NA>중소제조업NaN규모별NaNNaNNaNNaNNaN산업별NaNNaNNaN형태별NaNNaNNaNNaNNaN
2<NA>NaNNaN중기업NaN소기업NaN영세소기업NaN중공업NaN경공업NaN가공조립NaN기초소재NaN생활관련NaN
3<NA>실적전망실적전망실적전망실적전망실적전망실적전망실적전망실적전망실적전망
4'80. 1/487NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
52/486NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
63/483NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
74/495NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
8'81. 1/499NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
92/4100NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
전반적인 경기 BSI 추이Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18
1383/485969010783938287849985918210090958391
1394/492106971139010485989210790104911079510890103
140'14. 1/4819186998089768681928191829379898292
1412/4931149912091112881099511589111931139811887110
1423/4849984107849780948410283928310386998292
1434/4841068611483105811008410683108821058610584109
14415. 1/4748880937187648271898088699176848189
14515. 2/4841088811582104821028410986107821088711187107
14615. 3/4NaN92NaN100NaN87NaN82NaN94NaN88NaN93NaN96NaN86
147<NA>중소제조업NaN중기업NaN소기업NaN영세소기업NaN중공업NaN경공업NaN조립가공NaN기초소재NaN생활관련NaN

Duplicate rows

Most frequently occurring

전반적인 경기 BSI 추이# duplicates
13/435
24/435
02/434
3<NA>5