Overview

Dataset statistics

Number of variables: 9
Number of observations: 42
Missing cells: 2
Missing cells (%): 0.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.1 KiB
Average record size in memory: 76.1 B
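
The missing-cell percentage can be checked by hand: 2 missing cells out of 42 rows x 9 columns. The sketch below is a minimal illustration, not the profiler's own code; the helper name `missing_cell_share` is hypothetical. It also reproduces the per-column figure reported for 사업자등록번호 in the Alerts section (2 of 42 values).

```python
def missing_cell_share(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Return the fraction of missing cells in an n_rows x n_cols table."""
    return n_missing / (n_rows * n_cols)


# Whole dataset: 2 missing cells in a 42 x 9 table.
overall = missing_cell_share(2, 42, 9)
# Single column (사업자등록번호): 2 missing values among 42 rows.
column = missing_cell_share(2, 42, 1)

print(f"{overall:.1%}")
print(f"{column:.1%}")
```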

Variable types

Numeric: 1
Text: 2
DateTime: 4
Categorical: 2

Dataset

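Description: Incheon Metropolitan City bizok blacklisted companies (company name, restriction reason, support-ineligibility period, registration date, etc.)
Author: Incheon Metropolitan City (인천광역시)
URL: https://www.incheon.go.kr/data/DATA010201/view?docId=15049267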

Alerts

등록 ID is highly overall correlated with 번호 and 1 other field [High correlation]
제한 사유 is highly overall correlated with 등록 ID [High correlation]
번호 is highly overall correlated with 등록 ID [High correlation]
제한 사유 is highly imbalanced (72.3%) [Imbalance]
등록 ID is highly imbalanced (62.9%) [Imbalance]
사업자등록번호 has 2 (4.8%) missing values [Missing]
번호 has unique values [Unique]
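
The two imbalance percentages are consistent with a score of one minus the normalized Shannon entropy of the value counts. The sketch below assumes that definition (it is not taken from the report itself) and uses the value counts listed under each variable; the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 means balanced, 1 means a single class."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


reason_counts = [38, 1, 1, 1, 1]  # 제한 사유 value counts from the report
id_counts = [39, 3]               # 등록 ID value counts from the report

reason_score = imbalance_score(reason_counts)
id_score = imbalance_score(id_counts)
print(f"{reason_score:.1%}", f"{id_score:.1%}")
```

Under this assumption the scores round to the 72.3% and 62.9% shown in the alerts.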

Reproduction

Analysis started: 2024-01-28 11:56:37.503988
Analysis finished: 2024-01-28 11:56:38.118927
Duration: 0.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 42
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.5
Minimum: 1
Maximum: 42
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.05
Q1: 11.25
Median: 21.5
Q3: 31.75
95-th percentile: 39.95
Maximum: 42
Range: 41
Interquartile range (IQR): 20.5
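
Because 번호 is simply the sequence 1 to 42, the quantiles above can be reproduced with linear interpolation between neighbouring order statistics. This is a minimal sketch under that assumption; the helper `percentile` is hypothetical and is not the profiler's implementation.

```python
def percentile(sorted_values, q):
    """Linear-interpolation percentile (q in [0, 100]) of an ascending list."""
    position = (len(sorted_values) - 1) * q / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


values = list(range(1, 43))  # 번호 runs from 1 to 42

p5 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(round(p5, 2), q1, median, q3, round(p95, 2), iqr)
```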

Descriptive statistics

Standard deviation: 12.267844
Coefficient of variation (CV): 0.5705974
Kurtosis: -1.2
Mean: 21.5
Median Absolute Deviation (MAD): 10.5
Skewness: 0
Sum: 903
Variance: 150.5
Monotonicity: Strictly increasing
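
Several of these figures follow directly from the values 1 to 42. The sketch below recomputes them with Python's standard library, assuming the variance is the sample variance (divisor n - 1) and the CV is the standard deviation divided by the mean; both assumptions agree with the reported numbers.

```python
import statistics

values = list(range(1, 43))  # 번호 runs from 1 to 42

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divisor n - 1
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the absolute distances to the median.
mad = statistics.median([abs(v - median) for v in values])

print(total, mean, variance, round(std, 6), round(cv, 7), mad)
```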
Histogram with fixed size bins (bins=42)
Value | Count | Frequency (%)
1 | 1 | 2.4%
33 | 1 | 2.4%
25 | 1 | 2.4%
26 | 1 | 2.4%
27 | 1 | 2.4%
28 | 1 | 2.4%
29 | 1 | 2.4%
30 | 1 | 2.4%
31 | 1 | 2.4%
32 | 1 | 2.4%
Other values (32) | 32 | 76.2%

Value | Count | Frequency (%)
1 | 1 | 2.4%
2 | 1 | 2.4%
3 | 1 | 2.4%
4 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
7 | 1 | 2.4%
8 | 1 | 2.4%
9 | 1 | 2.4%
10 | 1 | 2.4%

Value | Count | Frequency (%)
42 | 1 | 2.4%
41 | 1 | 2.4%
40 | 1 | 2.4%
39 | 1 | 2.4%
38 | 1 | 2.4%
37 | 1 | 2.4%
36 | 1 | 2.4%
35 | 1 | 2.4%
34 | 1 | 2.4%
33 | 1 | 2.4%

기업명
Text

Distinct: 41
Distinct (%): 97.6%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Length

Max length: 17
Median length: 11
Mean length: 8.1428571
Min length: 4

Characters and Unicode

Total characters: 342
Distinct characters: 137
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 40
Unique (%): 95.2%

Sample

1st row: 극동전자정밀(주)
2nd row: 주식회사 울트라브이
3rd row: 주식회사에이치에스글로벌
4th row: 인성엔프라㈜
5th row: 주식회사 해내음식품
ValueCountFrequency (%)
주식회사 6
 
11.8%
2
 
3.9%
대화연료펌프 2
 
3.9%
대림시스템 1
 
2.0%
유통사업소 1
 
2.0%
주)토스코하이본 1
 
2.0%
주)피케이엘앤에스 1
 
2.0%
그린비코스메틱 1
 
2.0%
주)썬쿡 1
 
2.0%
주)제일위생 1
 
2.0%
Other values (34) 34
66.7%

Most occurring characters

ValueCountFrequency (%)
31
 
9.1%
) 23
 
6.7%
( 22
 
6.4%
13
 
3.8%
10
 
2.9%
9
 
2.6%
9
 
2.6%
8
 
2.3%
8
 
2.3%
8
 
2.3%
Other values (127) 201
58.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 269 | 78.7%
Close Punctuation | 23 | 6.7%
Open Punctuation | 22 | 6.4%
Uppercase Letter | 16 | 4.7%
Space Separator | 9 | 2.6%
Dash Punctuation | 1 | 0.3%
Other Punctuation | 1 | 0.3%
Other Symbol | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
11.5%
13
 
4.8%
10
 
3.7%
9
 
3.3%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
4
 
1.5%
Other values (111) 166
61.7%
Uppercase Letter
ValueCountFrequency (%)
E 3
18.8%
M 2
12.5%
H 2
12.5%
I 2
12.5%
T 2
12.5%
S 1
 
6.2%
D 1
 
6.2%
R 1
 
6.2%
C 1
 
6.2%
N 1
 
6.2%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 270 | 78.9%
Common | 56 | 16.4%
Latin | 16 | 4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
11.5%
13
 
4.8%
10
 
3.7%
9
 
3.3%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
4
 
1.5%
Other values (112) 167
61.9%
Latin
ValueCountFrequency (%)
E 3
18.8%
M 2
12.5%
H 2
12.5%
I 2
12.5%
T 2
12.5%
S 1
 
6.2%
D 1
 
6.2%
R 1
 
6.2%
C 1
 
6.2%
N 1
 
6.2%
Common
ValueCountFrequency (%)
) 23
41.1%
( 22
39.3%
9
 
16.1%
- 1
 
1.8%
& 1
 
1.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 269 | 78.7%
ASCII | 72 | 21.1%
None | 1 | 0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
 
11.5%
13
 
4.8%
10
 
3.7%
9
 
3.3%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
4
 
1.5%
Other values (111) 166
61.7%
ASCII
ValueCountFrequency (%)
) 23
31.9%
( 22
30.6%
9
 
12.5%
E 3
 
4.2%
M 2
 
2.8%
H 2
 
2.8%
I 2
 
2.8%
T 2
 
2.8%
S 1
 
1.4%
D 1
 
1.4%
Other values (5) 5
 
6.9%
None
ValueCountFrequency (%)
1
100.0%

사업자등록번호
Text

MISSING 

Distinct: 39
Distinct (%): 97.5%
Missing: 2
Missing (%): 4.8%
Memory size: 468.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 480
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
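
As a rough illustration of how such category counts arise, the sketch below tallies Unicode general categories with the standard `unicodedata` module over the five sample values listed for this variable. It is an assumption-level sketch, not the profiler's code; `category_counts` is a hypothetical helper. `Nd` is Decimal Number and `Pd` is Dash Punctuation, the two categories reported for this column.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for 사업자등록번호 in this report.
samples = [
    "137-81-20082",
    "211-88-73299",
    "121-86-31968",
    "137-81-22547",
    "551-86-00161",
]


def category_counts(values):
    """Count Unicode general categories (e.g. 'Nd', 'Pd') over all characters."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(samples)
print(counts["Nd"], counts["Pd"])
```

Each value has 10 digits and 2 hyphens, which matches the report's totals for the 40 non-missing values: 400 Decimal Number and 80 Dash Punctuation characters out of 480.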

Unique

Unique: 38
Unique (%): 95.0%

Sample

1st row: 137-81-20082
2nd row: 211-88-73299
3rd row: 121-86-31968
4th row: 137-81-22547
5th row: 551-86-00161
Value | Count | Frequency (%)
139-81-08221 | 2 | 5.0%
137-81-20082 | 1 | 2.5%
131-86-62416 | 1 | 2.5%
131-86-55161 | 1 | 2.5%
134-06-54631 | 1 | 2.5%
122-86-43583 | 1 | 2.5%
137-81-32031 | 1 | 2.5%
105-86-20421 | 1 | 2.5%
377-87-00371 | 1 | 2.5%
137-82-04997 | 1 | 2.5%
Other values (29) | 29 | 72.5%

Most occurring characters

ValueCountFrequency (%)
1 94
19.6%
- 80
16.7%
8 52
10.8%
2 51
10.6%
3 44
9.2%
6 37
 
7.7%
0 32
 
6.7%
7 28
 
5.8%
9 27
 
5.6%
5 20
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 400
83.3%
Dash Punctuation 80
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 94
23.5%
8 52
13.0%
2 51
12.8%
3 44
11.0%
6 37
 
9.2%
0 32
 
8.0%
7 28
 
7.0%
9 27
 
6.8%
5 20
 
5.0%
4 15
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 480
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 94
19.6%
- 80
16.7%
8 52
10.8%
2 51
10.6%
3 44
9.2%
6 37
 
7.7%
0 32
 
6.7%
7 28
 
5.8%
9 27
 
5.6%
5 20
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 480
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 94
19.6%
- 80
16.7%
8 52
10.8%
2 51
10.6%
3 44
9.2%
6 37
 
7.7%
0 32
 
6.7%
7 28
 
5.8%
9 27
 
5.6%
5 20
 
4.2%

지원 불가 시작일
DateTime

Distinct: 6
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2017-08-22 00:00:00
Maximum: 2018-12-20 00:00:00
Histogram with fixed size bins (bins=6)

지원 불가 종료일
DateTime

Distinct: 3
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2018-08-21 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=3)

제한 사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 11.9%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
선정포기: 38
당사 내부사정으로 인한 지원 포기: 1
사유 : 업체 내부사정으로 포기: 1
사유 : 업체내부사정: 1
회사사정으로 참가 포기: 1

Length

Max length: 18
Median length: 4
Mean length: 5
Min length: 4

Unique

Unique: 4
Unique (%): 9.5%

Sample

1st row: 선정포기
2nd row: 선정포기
3rd row: 선정포기
4th row: 선정포기
5th row: 선정포기

Common Values

Value | Count | Frequency (%)
선정포기 | 38 | 90.5%
당사 내부사정으로 인한 지원 포기 | 1 | 2.4%
사유 : 업체 내부사정으로 포기 | 1 | 2.4%
사유 : 업체내부사정 | 1 | 2.4%
회사사정으로 참가 포기 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
선정포기 38
70.4%
포기 3
 
5.6%
내부사정으로 2
 
3.7%
사유 2
 
3.7%
2
 
3.7%
당사 1
 
1.9%
인한 1
 
1.9%
지원 1
 
1.9%
업체 1
 
1.9%
업체내부사정 1
 
1.9%
Other values (2) 2
 
3.7%

등록 ID
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
swayi79: 39
incheon02: 3

Length

Max length: 9
Median length: 7
Mean length: 7.1428571
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: swayi79
2nd row: swayi79
3rd row: swayi79
4th row: swayi79
5th row: swayi79

Common Values

Value | Count | Frequency (%)
swayi79 | 39 | 92.9%
incheon02 | 3 | 7.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
swayi79 | 39 | 92.9%
incheon02 | 3 | 7.1%

등록일
DateTime

Distinct: 40
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2017-10-13 13:18:00
Maximum: 2018-12-20 10:25:00
Histogram with fixed size bins (bins=40)

수정일
DateTime

Distinct: 40
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2017-10-13 13:18:00
Maximum: 2018-12-20 10:25:00
Histogram with fixed size bins (bins=40)

Interactions


Correlations

Variable | 번호 | 기업명 | 사업자등록번호 | 지원 불가 시작일 | 지원 불가 종료일 | 제한 사유 | 등록 ID | 등록일 | 수정일
번호 | 1.000 | 0.940 | 0.932 | 0.762 | 0.000 | 0.000 | 0.840 | 1.000 | 1.000
기업명 | 0.940 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.984 | 0.984
사업자등록번호 | 0.932 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.983 | 0.983
지원 불가 시작일 | 0.762 | 1.000 | 1.000 | 1.000 | 1.000 | 0.902 | 1.000 | 1.000 | 1.000
지원 불가 종료일 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.332 | 1.000 | 1.000
제한 사유 | 0.000 | 1.000 | 1.000 | 0.902 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
등록 ID | 0.840 | 1.000 | 1.000 | 1.000 | 0.332 | 1.000 | 1.000 | 1.000 | 1.000
등록일 | 1.000 | 0.984 | 0.983 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
수정일 | 1.000 | 0.984 | 0.983 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 등록 ID | 제한 사유
등록 ID | 1.000 | 0.962
제한 사유 | 0.962 | 1.000
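
The 0.962 between 등록 ID and 제한 사유 is consistent with a bias-corrected Cramér's V, although the report text does not name the method, so treat that as an assumption. The sketch below computes it in plain Python from the contingency table implied by the value counts and sample rows in this report (38 선정포기 rows plus one other reason under swayi79, three distinct reasons under incheon02). The function name `cramers_v_corrected` is hypothetical.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Pearson chi-squared statistic against the independence expectation.
    chi2 = sum(
        (obs - r * c / n) ** 2 / (r * c / n)
        for row, r in zip(table, row_totals)
        for obs, c in zip(row, col_totals)
    )
    rows, cols = len(table), len(table[0])
    # Bias correction of phi^2 and of the table dimensions.
    phi2 = max(0.0, chi2 / n - (rows - 1) * (cols - 1) / (n - 1))
    rows_corr = rows - (rows - 1) ** 2 / (n - 1)
    cols_corr = cols - (cols - 1) ** 2 / (n - 1)
    return math.sqrt(phi2 / min(rows_corr - 1, cols_corr - 1))


# Rows: 등록 ID (swayi79, incheon02); columns: the five 제한 사유 values.
table = [
    [38, 1, 0, 0, 0],  # swayi79
    [0, 0, 1, 1, 1],   # incheon02
]
v = cramers_v_corrected(table)
print(round(v, 3))
```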

Variable | 번호 | 제한 사유 | 등록 ID
번호 | 1.000 | 0.000 | 0.599
제한 사유 | 0.000 | 1.000 | 0.962
등록 ID | 0.599 | 0.962 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 번호 | 기업명 | 사업자등록번호 | 지원 불가 시작일 | 지원 불가 종료일 | 제한 사유 | 등록 ID | 등록일 | 수정일
0 | 1 | 극동전자정밀(주) | 137-81-20082 | 2018-12-20 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-20 10:25 | 2018-12-20 10:25
1 | 2 | 주식회사 울트라브이 | 211-88-73299 | 2018-12-20 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-20 09:32 | 2018-12-20 09:32
2 | 3 | 주식회사에이치에스글로벌 | 121-86-31968 | 2018-12-20 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-20 09:30 | 2018-12-20 09:30
3 | 4 | 인성엔프라㈜ | 137-81-22547 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 17:46 | 2018-12-19 17:46
4 | 5 | 주식회사 해내음식품 | 551-86-00161 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 17:28 | 2018-12-19 17:28
5 | 6 | 이레포스 | 121-14-48841 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 17:25 | 2018-12-19 17:25
6 | 7 | (주)제이앤씨글로벌 | 121-86-07605 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 16:27 | 2018-12-19 16:27
7 | 8 | (주)에코매스코리아 | 122-81-99253 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 16:17 | 2018-12-19 16:17
8 | 9 | (주)앤제화 | 137-81-13693 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 10:50 | 2018-12-19 10:50
9 | 10 | 경서기계산업 | 137-09-63289 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 10:49 | 2018-12-19 10:49

(index) | 번호 | 기업명 | 사업자등록번호 | 지원 불가 시작일 | 지원 불가 종료일 | 제한 사유 | 등록 ID | 등록일 | 수정일
32 | 33 | 웰빙헬스팜 | 123-32-69701 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 13:48 | 2018-12-17 13:48
33 | 34 | (주)미래과학교육원 | 122-86-11233 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 13:43 | 2018-12-17 13:43
34 | 35 | 호재식품 | 122-29-82981 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 13:42 | 2018-12-17 13:42
35 | 36 | 주식회사에코란트 | 131-86-51184 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 13:32 | 2018-12-17 13:32
36 | 37 | 대신DEMISTER | 139-12-61477 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 13:17 | 2018-12-17 13:17
37 | 38 | (주) 대화연료펌프 | 139-81-08221 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 13:15 | 2018-12-17 13:15
38 | 39 | (주)태영금속 | 131-86-00671 | 2018-08-23 | 2021-12-31 | 당사 내부사정으로 인한 지원 포기 | swayi79 | 2018-09-28 09:16 | 2018-09-28 09:16
39 | 40 | 경인북부수산업협동조합 유통사업소 | 137-82-04997 | 2018-05-16 | 2018-12-31 | 사유 : 업체 내부사정으로 포기 | incheon02 | 2018-05-16 17:22 | 2018-05-16 17:22
40 | 41 | 주식회사 대림시스템 | 131-86-33717 | 2018-05-16 | 2018-12-31 | 사유 : 업체내부사정 | incheon02 | 2018-05-16 17:20 | 2018-05-16 17:20
41 | 42 | 주식회사 새벽 | <NA> | 2017-08-22 | 2018-08-21 | 회사사정으로 참가 포기 | incheon02 | 2017-10-13 13:18 | 2017-10-13 13:18