Overview

Dataset statistics

Number of variables7
Number of observations2990
Missing cells450
Missing cells (%)2.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory169.5 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical2
Text3

Dataset

Description지역 내 우수기업 발굴, 소개를 통한 지역기업의 인식개선 및 기업-청년 간 취업을 연계 지원하는 희망이음 프로젝트에 참여한 전국 18개 시도의 지역 소재 중소, 중견기업 정보
URLhttps://www.data.go.kr/data/15104284/fileData.do

Alerts

번호 is highly overall correlated with 해당년도High correlation
해당년도 is highly overall correlated with 번호High correlation
기업규모 is highly imbalanced (67.0%)Imbalance
해당년도 has 119 (4.0%) missing valuesMissing
업종 has 331 (11.1%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:41:05.967424
Analysis finished2023-12-12 02:41:07.678846
Duration1.71 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2990
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1495.5
Minimum1
Maximum2990
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.4 KiB
2023-12-12T11:41:07.766746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile150.45
Q1748.25
median1495.5
Q32242.75
95-th percentile2840.55
Maximum2990
Range2989
Interquartile range (IQR)1494.5

Descriptive statistics

Standard deviation863.28298
Coefficient of variation (CV)0.57725375
Kurtosis-1.2
Mean1495.5
Median Absolute Deviation (MAD)747.5
Skewness0
Sum4471545
Variance745257.5
MonotonicityStrictly increasing
2023-12-12T11:41:07.927677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1998 1
 
< 0.1%
1989 1
 
< 0.1%
1990 1
 
< 0.1%
1991 1
 
< 0.1%
1992 1
 
< 0.1%
1993 1
 
< 0.1%
1994 1
 
< 0.1%
1995 1
 
< 0.1%
1996 1
 
< 0.1%
Other values (2980) 2980
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2990 1
< 0.1%
2989 1
< 0.1%
2988 1
< 0.1%
2987 1
< 0.1%
2986 1
< 0.1%
2985 1
< 0.1%
2984 1
< 0.1%
2983 1
< 0.1%
2982 1
< 0.1%
2981 1
< 0.1%

해당년도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct6
Distinct (%)0.2%
Missing119
Missing (%)4.0%
Infinite0
Infinite (%)0.0%
Mean2018.4145
Minimum2016
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.4 KiB
2023-12-12T11:41:08.055403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2016
5-th percentile2016
Q12017
median2018
Q32020
95-th percentile2021
Maximum2021
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7787712
Coefficient of variation (CV)0.00088127151
Kurtosis-1.3316312
Mean2018.4145
Median Absolute Deviation (MAD)2
Skewness0.12464359
Sum5794868
Variance3.1640269
MonotonicityDecreasing
2023-12-12T11:41:08.178712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2021 555
18.6%
2016 549
18.4%
2018 530
17.7%
2017 498
16.7%
2020 382
12.8%
2019 357
11.9%
(Missing) 119
 
4.0%
ValueCountFrequency (%)
2016 549
18.4%
2017 498
16.7%
2018 530
17.7%
2019 357
11.9%
2020 382
12.8%
2021 555
18.6%
ValueCountFrequency (%)
2021 555
18.6%
2020 382
12.8%
2019 357
11.9%
2018 530
17.7%
2017 498
16.7%
2016 549
18.4%

지역
Categorical

Distinct17
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size23.5 KiB
강원
410 
전북
254 
경기
254 
충남
233 
광주
218 
Other values (12)
1621 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원
2nd row강원
3rd row강원
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
강원 410
13.7%
전북 254
 
8.5%
경기 254
 
8.5%
충남 233
 
7.8%
광주 218
 
7.3%
대전 217
 
7.3%
대구 176
 
5.9%
인천 176
 
5.9%
충북 174
 
5.8%
경남 158
 
5.3%
Other values (7) 720
24.1%

Length

2023-12-12T11:41:08.611188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강원 410
13.7%
전북 254
 
8.5%
경기 254
 
8.5%
충남 233
 
7.8%
광주 218
 
7.3%
대전 217
 
7.3%
대구 176
 
5.9%
인천 176
 
5.9%
충북 174
 
5.8%
경남 158
 
5.3%
Other values (7) 720
24.1%
Distinct2239
Distinct (%)74.9%
Missing0
Missing (%)0.0%
Memory size23.5 KiB
2023-12-12T11:41:08.924095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length6.2478261
Min length2

Characters and Unicode

Total characters18681
Distinct characters636
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1741 ?
Unique (%)58.2%

Sample

1st row스태프칩팩코리아
2nd row무상엠에스마트
3rd row유바이오로직스
4th row단정바이오
5th row휴젤
ValueCountFrequency (%)
주식회사 29
 
0.9%
14
 
0.4%
농업회사법인 9
 
0.3%
㈜바이오니아 8
 
0.3%
㈜아시아 8
 
0.3%
7
 
0.2%
㈜알에프세미 7
 
0.2%
상신브레이크㈜ 6
 
0.2%
인컴즈 6
 
0.2%
㈜엘앤에프 6
 
0.2%
Other values (2312) 3083
96.9%
2023-12-12T11:41:09.359537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1460
 
7.8%
833
 
4.5%
659
 
3.5%
628
 
3.4%
) 582
 
3.1%
( 580
 
3.1%
382
 
2.0%
299
 
1.6%
233
 
1.2%
204
 
1.1%
Other values (626) 12821
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15590
83.5%
Other Symbol 1460
 
7.8%
Close Punctuation 582
 
3.1%
Open Punctuation 580
 
3.1%
Uppercase Letter 227
 
1.2%
Space Separator 193
 
1.0%
Lowercase Letter 18
 
0.1%
Decimal Number 16
 
0.1%
Other Punctuation 12
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
833
 
5.3%
659
 
4.2%
628
 
4.0%
382
 
2.5%
299
 
1.9%
233
 
1.5%
204
 
1.3%
203
 
1.3%
196
 
1.3%
194
 
1.2%
Other values (576) 11759
75.4%
Uppercase Letter
ValueCountFrequency (%)
C 22
 
9.7%
S 22
 
9.7%
B 18
 
7.9%
K 18
 
7.9%
T 17
 
7.5%
D 15
 
6.6%
N 13
 
5.7%
E 12
 
5.3%
P 11
 
4.8%
A 11
 
4.8%
Other values (13) 68
30.0%
Lowercase Letter
ValueCountFrequency (%)
a 4
22.2%
r 2
11.1%
n 2
11.1%
k 2
11.1%
i 1
 
5.6%
e 1
 
5.6%
t 1
 
5.6%
d 1
 
5.6%
g 1
 
5.6%
o 1
 
5.6%
Other values (2) 2
11.1%
Decimal Number
ValueCountFrequency (%)
2 6
37.5%
3 5
31.2%
1 2
 
12.5%
4 1
 
6.2%
5 1
 
6.2%
6 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
& 8
66.7%
. 3
 
25.0%
, 1
 
8.3%
Other Symbol
ValueCountFrequency (%)
1460
100.0%
Close Punctuation
ValueCountFrequency (%)
) 582
100.0%
Open Punctuation
ValueCountFrequency (%)
( 580
100.0%
Space Separator
ValueCountFrequency (%)
193
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17050
91.3%
Common 1386
 
7.4%
Latin 245
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1460
 
8.6%
833
 
4.9%
659
 
3.9%
628
 
3.7%
382
 
2.2%
299
 
1.8%
233
 
1.4%
204
 
1.2%
203
 
1.2%
196
 
1.1%
Other values (577) 11953
70.1%
Latin
ValueCountFrequency (%)
C 22
 
9.0%
S 22
 
9.0%
B 18
 
7.3%
K 18
 
7.3%
T 17
 
6.9%
D 15
 
6.1%
N 13
 
5.3%
E 12
 
4.9%
P 11
 
4.5%
A 11
 
4.5%
Other values (25) 86
35.1%
Common
ValueCountFrequency (%)
) 582
42.0%
( 580
41.8%
193
 
13.9%
& 8
 
0.6%
2 6
 
0.4%
3 5
 
0.4%
. 3
 
0.2%
- 2
 
0.1%
1 2
 
0.1%
4 1
 
0.1%
Other values (4) 4
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15590
83.5%
ASCII 1631
 
8.7%
None 1460
 
7.8%

Most frequent character per block

None
ValueCountFrequency (%)
1460
100.0%
Hangul
ValueCountFrequency (%)
833
 
5.3%
659
 
4.2%
628
 
4.0%
382
 
2.5%
299
 
1.9%
233
 
1.5%
204
 
1.3%
203
 
1.3%
196
 
1.3%
194
 
1.2%
Other values (576) 11759
75.4%
ASCII
ValueCountFrequency (%)
) 582
35.7%
( 580
35.6%
193
 
11.8%
C 22
 
1.3%
S 22
 
1.3%
B 18
 
1.1%
K 18
 
1.1%
T 17
 
1.0%
D 15
 
0.9%
N 13
 
0.8%
Other values (39) 151
 
9.3%

주소
Text

Distinct2142
Distinct (%)71.6%
Missing0
Missing (%)0.0%
Memory size23.5 KiB
2023-12-12T11:41:09.682599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length53
Mean length26.173244
Min length5

Characters and Unicode

Total characters78258
Distinct characters513
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1555 ?
Unique (%)52.0%

Sample

1st row인천시 중구 자유무역로 191
2nd row강원도 춘천시 충열로 79번길 37
3rd row강원도 춘천시 동산면 원무동길 125
4th row강원도 원주시 호저면 호매곡1길 85
5th row강원도 춘천시 신북읍 신북로 61-20 (율문리)
ValueCountFrequency (%)
경기도 244
 
1.5%
강원도 237
 
1.4%
전북 203
 
1.2%
광주 199
 
1.2%
충남 180
 
1.1%
경북 178
 
1.1%
대구 176
 
1.1%
춘천시 167
 
1.0%
대전 145
 
0.9%
유성구 136
 
0.8%
Other values (4335) 14874
88.9%
2023-12-12T11:41:10.192824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13767
 
17.6%
1 2873
 
3.7%
2333
 
3.0%
2 2116
 
2.7%
2066
 
2.6%
1915
 
2.4%
3 1858
 
2.4%
1808
 
2.3%
( 1779
 
2.3%
) 1772
 
2.3%
Other values (503) 45971
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43829
56.0%
Decimal Number 15128
 
19.3%
Space Separator 13767
 
17.6%
Open Punctuation 1871
 
2.4%
Close Punctuation 1866
 
2.4%
Other Punctuation 982
 
1.3%
Dash Punctuation 606
 
0.8%
Uppercase Letter 196
 
0.3%
Math Symbol 6
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2333
 
5.3%
2066
 
4.7%
1915
 
4.4%
1808
 
4.1%
1313
 
3.0%
1166
 
2.7%
1106
 
2.5%
1003
 
2.3%
928
 
2.1%
928
 
2.1%
Other values (453) 29263
66.8%
Uppercase Letter
ValueCountFrequency (%)
B 28
14.3%
A 27
13.8%
I 22
11.2%
P 14
 
7.1%
L 14
 
7.1%
E 14
 
7.1%
T 12
 
6.1%
X 12
 
6.1%
C 10
 
5.1%
S 8
 
4.1%
Other values (11) 35
17.9%
Decimal Number
ValueCountFrequency (%)
1 2873
19.0%
2 2116
14.0%
3 1858
12.3%
0 1552
10.3%
4 1472
9.7%
5 1440
9.5%
6 1250
8.3%
7 883
 
5.8%
8 860
 
5.7%
9 824
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 898
91.4%
? 65
 
6.6%
: 7
 
0.7%
. 6
 
0.6%
/ 4
 
0.4%
& 2
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 4
66.7%
< 1
 
16.7%
> 1
 
16.7%
Lowercase Letter
ValueCountFrequency (%)
a 2
50.0%
t 1
25.0%
i 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 1779
95.1%
[ 92
 
4.9%
Close Punctuation
ValueCountFrequency (%)
) 1772
95.0%
] 94
 
5.0%
Space Separator
ValueCountFrequency (%)
13767
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 606
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43832
56.0%
Common 34226
43.7%
Latin 200
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2333
 
5.3%
2066
 
4.7%
1915
 
4.4%
1808
 
4.1%
1313
 
3.0%
1166
 
2.7%
1106
 
2.5%
1003
 
2.3%
928
 
2.1%
928
 
2.1%
Other values (454) 29266
66.8%
Common
ValueCountFrequency (%)
13767
40.2%
1 2873
 
8.4%
2 2116
 
6.2%
3 1858
 
5.4%
( 1779
 
5.2%
) 1772
 
5.2%
0 1552
 
4.5%
4 1472
 
4.3%
5 1440
 
4.2%
6 1250
 
3.7%
Other values (15) 4347
 
12.7%
Latin
ValueCountFrequency (%)
B 28
14.0%
A 27
13.5%
I 22
11.0%
P 14
 
7.0%
L 14
 
7.0%
E 14
 
7.0%
T 12
 
6.0%
X 12
 
6.0%
C 10
 
5.0%
S 8
 
4.0%
Other values (14) 39
19.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43827
56.0%
ASCII 34426
44.0%
None 3
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13767
40.0%
1 2873
 
8.3%
2 2116
 
6.1%
3 1858
 
5.4%
( 1779
 
5.2%
) 1772
 
5.1%
0 1552
 
4.5%
4 1472
 
4.3%
5 1440
 
4.2%
6 1250
 
3.6%
Other values (39) 4547
 
13.2%
Hangul
ValueCountFrequency (%)
2333
 
5.3%
2066
 
4.7%
1915
 
4.4%
1808
 
4.1%
1313
 
3.0%
1166
 
2.7%
1106
 
2.5%
1003
 
2.3%
928
 
2.1%
928
 
2.1%
Other values (451) 29261
66.8%
None
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

기업규모
Categorical

IMBALANCE 

Distinct11
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size23.5 KiB
중소기업
2317 
중견기업
392 
기타
 
155
대기업
 
65
<NA>
 
53
Other values (6)
 
8

Length

Max length11
Median length4
Mean length3.8839465
Min length2

Unique

Unique5 ?
Unique (%)0.2%

Sample

1st row중견기업
2nd row중견기업
3rd row중견기업
4th row중소기업
5th row중견기업

Common Values

ValueCountFrequency (%)
중소기업 2317
77.5%
중견기업 392
 
13.1%
기타 155
 
5.2%
대기업 65
 
2.2%
<NA> 53
 
1.8%
기타(소상공인) 3
 
0.1%
기타(공공기관) 1
 
< 0.1%
기타(비영리재단법인) 1
 
< 0.1%
공기업 1
 
< 0.1%
기타(한시성중소기업) 1
 
< 0.1%

Length

2023-12-12T11:41:10.321397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중소기업 2317
77.5%
중견기업 392
 
13.1%
기타 155
 
5.2%
대기업 65
 
2.2%
na 53
 
1.8%
기타(소상공인 3
 
0.1%
기타(공공기관 1
 
< 0.1%
기타(비영리재단법인 1
 
< 0.1%
공기업 1
 
< 0.1%
기타(한시성중소기업 1
 
< 0.1%

업종
Text

MISSING 

Distinct59
Distinct (%)2.2%
Missing331
Missing (%)11.1%
Memory size23.5 KiB
2023-12-12T11:41:10.488023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length2
Mean length3.6032343
Min length1

Characters and Unicode

Total characters9581
Distinct characters92
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.5%

Sample

1st row제조
2nd row도매·소매
3rd row제조
4th row제조
5th row제조
ValueCountFrequency (%)
제조 1763
66.2%
정보통신 241
 
9.0%
제조업 112
 
4.2%
전문.과학및기술서비스 73
 
2.7%
전문·과학및기술서비스 66
 
2.5%
도매·소매 49
 
1.8%
건설 32
 
1.2%
도매.소매 29
 
1.1%
숙박·음식점 22
 
0.8%
21
 
0.8%
Other values (52) 255
 
9.6%
2023-12-12T11:41:10.815654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1900
19.8%
1875
19.6%
311
 
3.2%
· 299
 
3.1%
293
 
3.1%
285
 
3.0%
. 273
 
2.8%
263
 
2.7%
261
 
2.7%
256
 
2.7%
Other values (82) 3565
37.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8973
93.7%
Other Punctuation 583
 
6.1%
Dash Punctuation 21
 
0.2%
Space Separator 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1900
21.2%
1875
20.9%
311
 
3.5%
293
 
3.3%
285
 
3.2%
263
 
2.9%
261
 
2.9%
256
 
2.9%
251
 
2.8%
251
 
2.8%
Other values (77) 3027
33.7%
Other Punctuation
ValueCountFrequency (%)
· 299
51.3%
. 273
46.8%
, 11
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8973
93.7%
Common 608
 
6.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1900
21.2%
1875
20.9%
311
 
3.5%
293
 
3.3%
285
 
3.2%
263
 
2.9%
261
 
2.9%
256
 
2.9%
251
 
2.8%
251
 
2.8%
Other values (77) 3027
33.7%
Common
ValueCountFrequency (%)
· 299
49.2%
. 273
44.9%
- 21
 
3.5%
, 11
 
1.8%
4
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8973
93.7%
ASCII 309
 
3.2%
None 299
 
3.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1900
21.2%
1875
20.9%
311
 
3.5%
293
 
3.3%
285
 
3.2%
263
 
2.9%
261
 
2.9%
256
 
2.9%
251
 
2.8%
251
 
2.8%
Other values (77) 3027
33.7%
None
ValueCountFrequency (%)
· 299
100.0%
ASCII
ValueCountFrequency (%)
. 273
88.3%
- 21
 
6.8%
, 11
 
3.6%
4
 
1.3%

Interactions

2023-12-12T11:41:07.092171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:41:06.896856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:41:07.216061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:41:06.989933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:41:10.900028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호해당년도지역기업규모업종
번호1.0000.9360.5370.2600.608
해당년도0.9361.0000.3860.1790.616
지역0.5370.3861.0000.2310.651
기업규모0.2600.1790.2311.0000.643
업종0.6080.6160.6510.6431.000
2023-12-12T11:41:10.985364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역기업규모
지역1.0000.091
기업규모0.0911.000
2023-12-12T11:41:11.059776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호해당년도지역기업규모
번호1.000-0.9850.2410.082
해당년도-0.9851.0000.1840.090
지역0.2410.1841.0000.091
기업규모0.0820.0900.0911.000

Missing values

2023-12-12T11:41:07.367353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:41:07.503788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T11:41:07.622973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호해당년도지역기업명주소기업규모업종
012021강원스태프칩팩코리아인천시 중구 자유무역로 191중견기업제조
122021강원무상엠에스마트강원도 춘천시 충열로 79번길 37중견기업도매·소매
232021강원유바이오로직스강원도 춘천시 동산면 원무동길 125중견기업제조
342021강원단정바이오강원도 원주시 호저면 호매곡1길 85중소기업제조
452021강원휴젤강원도 춘천시 신북읍 신북로 61-20 (율문리)중견기업제조
562021강원제테마강원도 원주시 지정면 조엄로 321중견기업제조
672021강원유비니스테라퓨틱스인천시 연수구 갯벌로 12 301호 319호중소기업전문·과학및기술서비스
782021강원나무를 심는 사람들인천시 남동구 남동대로 370번길 122 102동 1802호중소기업전문·과학및기술서비스
892021경기㈜알톤스포츠경기도 성남시 분당구 판교로 256번길 258층중소기업제조
9102021경기영진산업㈜경기도 포천시 군내면 포천로 909번길 49중소기업제조
번호해당년도지역기업명주소기업규모업종
29802981<NA>제주씨앤피제주 제주시 동고산로 47중소기업<NA>
29812982<NA>경북(주)화신경북 영천시 도남공단길 94-2 (봉동)중견기업<NA>
29822983<NA>울산(주)삼기산업울산 북구 효죽3길 22 (효문동)중소기업<NA>
29832984<NA>울산(주)삼미정공울산 북구 매곡산업3길 9 (매곡동)중소기업<NA>
29842985<NA>충남삼진정공(주)충남 천안시 동남구 성남면 대흥1길 68중견기업<NA>
29852986<NA>충남(주)경신전선충남 천안시 서북구 입장면 연곡길567중견기업<NA>
29862987<NA>인천인천창조경제혁신센터인천광역시 연수구 갯벌로 12 (송도동)기타협회·단체·기타개인서비스
29872988<NA>충남(주)세정충남 아산시 둔포면 아산호로 840번길 65-20중견기업<NA>
29882989<NA>제주(유)디앤디드림제주 제주시 첨단로 213-3, 제주첨단과학기술단지 스마트빌딩 321호 (영평동)중소기업<NA>
29892990<NA>충남㈜티엠씨충남 천안시 서북구 입장면 연곡길 443 (가산리)중견기업<NA>