Overview

Dataset statistics

Number of variables: 9
Number of observations: 42
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.1 KiB
Average record size in memory: 76.1 B

Variable types

Numeric: 1
Text: 2
DateTime: 4
Categorical: 2

Dataset

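Description: Status of companies blacklisted in Incheon Metropolitan City's bizok system (company name, restriction reason, support-ineligibility period, registration date, and so on).
Author: Incheon Metropolitan City (인천광역시)
URL: https://www.incheon.go.kr/data/DATA010201/view?docId=15049267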

Alerts

제한 사유 is highly overall correlated with 등록아이디 (High correlation)
등록아이디 is highly overall correlated with 번호 and 1 other field (High correlation)
번호 is highly overall correlated with 등록아이디 (High correlation)
등록아이디 is highly imbalanced (62.9%) (Imbalance)
번호 has unique values (Unique)
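
The 62.9% imbalance figure for 등록아이디 can be reproduced from the class counts reported later in this report (swayi79: 39, incheon02: 3). The sketch below assumes the score is defined as one minus the normalized Shannon entropy of the class distribution; that definition is an assumption chosen because it matches the reported value, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (base 2)
    of the class distribution and k is the number of classes.
    0 means perfectly balanced classes; 1 means a single class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts for 등록아이디 as listed in this report: swayi79 = 39, incheon02 = 3.
score = imbalance_score([39, 3])
print(f"{score:.1%}")
```

A balanced two-class column would score 0 under this definition, so a value near 0.63 reflects how strongly one registrant ID dominates.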

Reproduction

Analysis started: 2024-01-28 11:56:42.349126
Analysis finished: 2024-01-28 11:56:42.863905
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 42
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.5
Minimum: 1
Maximum: 42
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 510.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.05
Q1: 11.25
Median: 21.5
Q3: 31.75
95-th percentile: 39.95
Maximum: 42
Range: 41
Interquartile range (IQR): 20.5

Descriptive statistics

Standard deviation: 12.267844
Coefficient of variation (CV): 0.5705974
Kurtosis: -1.2
Mean: 21.5
Median Absolute Deviation (MAD): 10.5
Skewness: 0
Sum: 903
Variance: 150.5
Monotonicity: Strictly increasing
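
Because 번호 is unique, strictly increasing, and runs from 1 to 42, the figures above can be checked by hand. The following is a minimal sketch using only the Python standard library. It assumes the report uses the sample (n - 1) variance and linearly interpolated quartiles, which is what the reported numbers are consistent with.

```python
import statistics

# 번호 takes each integer from 1 to 42 exactly once.
values = list(range(1, 43))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

# Quartiles with linear interpolation between order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 6), round(cv, 7))
print(q1, median, q3, iqr, mad)
```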
Histogram with fixed size bins (bins=42)
Common values
Value | Count | Frequency (%)
1 | 1 | 2.4%
33 | 1 | 2.4%
25 | 1 | 2.4%
26 | 1 | 2.4%
27 | 1 | 2.4%
28 | 1 | 2.4%
29 | 1 | 2.4%
30 | 1 | 2.4%
31 | 1 | 2.4%
32 | 1 | 2.4%
Other values (32) | 32 | 76.2%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.4%
2 | 1 | 2.4%
3 | 1 | 2.4%
4 | 1 | 2.4%
5 | 1 | 2.4%
6 | 1 | 2.4%
7 | 1 | 2.4%
8 | 1 | 2.4%
9 | 1 | 2.4%
10 | 1 | 2.4%

Maximum 10 values
Value | Count | Frequency (%)
42 | 1 | 2.4%
41 | 1 | 2.4%
40 | 1 | 2.4%
39 | 1 | 2.4%
38 | 1 | 2.4%
37 | 1 | 2.4%
36 | 1 | 2.4%
35 | 1 | 2.4%
34 | 1 | 2.4%
33 | 1 | 2.4%

기업명
Text

Distinct: 41
Distinct (%): 97.6%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Length

Max length: 17
Median length: 11
Mean length: 8.1904762
Min length: 4

Characters and Unicode

Total characters: 344
Distinct characters: 136
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
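
As a minimal sketch of how such per-character properties can be obtained, the example below counts Unicode general categories for the first sample value of this column using Python's standard `unicodedata` module. The helper name `category_counts` and the `CATEGORY_NAMES` lookup are illustrative, and the lookup covers only the seven categories this report lists for the column.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this column.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(text):
    """Count characters of `text` by Unicode general-category code."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "극동전자정밀(주)"  # first sample row of the column
counts = category_counts(sample)
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the totals for this column.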

Unique

Unique: 40
Unique (%): 95.2%

Sample

1st row: 극동전자정밀(주)
2nd row: 주식회사 울트라브이
3rd row: 주식회사에이치에스글로벌
4th row: 인성엔프라(주)
5th row: 주식회사 해내음식품
ValueCountFrequency (%)
주식회사 6
 
11.8%
2
 
3.9%
대화연료펌프 2
 
3.9%
대림시스템 1
 
2.0%
유통사업소 1
 
2.0%
주)토스코하이본 1
 
2.0%
주)피케이엘앤에스 1
 
2.0%
그린비코스메틱 1
 
2.0%
주)썬쿡 1
 
2.0%
주)제일위생 1
 
2.0%
Other values (34) 34
66.7%

Most occurring characters

ValueCountFrequency (%)
32
 
9.3%
) 24
 
7.0%
( 23
 
6.7%
13
 
3.8%
10
 
2.9%
9
 
2.6%
9
 
2.6%
8
 
2.3%
8
 
2.3%
8
 
2.3%
Other values (126) 200
58.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 270 | 78.5%
Close Punctuation | 24 | 7.0%
Open Punctuation | 23 | 6.7%
Uppercase Letter | 16 | 4.7%
Space Separator | 9 | 2.6%
Dash Punctuation | 1 | 0.3%
Other Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
11.9%
13
 
4.8%
10
 
3.7%
9
 
3.3%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
4
 
1.5%
Other values (111) 166
61.5%
Uppercase Letter
ValueCountFrequency (%)
E 3
18.8%
M 2
12.5%
H 2
12.5%
I 2
12.5%
T 2
12.5%
R 1
 
6.2%
S 1
 
6.2%
D 1
 
6.2%
C 1
 
6.2%
N 1
 
6.2%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 270 | 78.5%
Common | 58 | 16.9%
Latin | 16 | 4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
11.9%
13
 
4.8%
10
 
3.7%
9
 
3.3%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
4
 
1.5%
Other values (111) 166
61.5%
Latin
ValueCountFrequency (%)
E 3
18.8%
M 2
12.5%
H 2
12.5%
I 2
12.5%
T 2
12.5%
R 1
 
6.2%
S 1
 
6.2%
D 1
 
6.2%
C 1
 
6.2%
N 1
 
6.2%
Common
ValueCountFrequency (%)
) 24
41.4%
( 23
39.7%
9
 
15.5%
- 1
 
1.7%
& 1
 
1.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 270 | 78.5%
ASCII | 74 | 21.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
11.9%
13
 
4.8%
10
 
3.7%
9
 
3.3%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
4
 
1.5%
Other values (111) 166
61.5%
ASCII
ValueCountFrequency (%)
) 24
32.4%
( 23
31.1%
9
 
12.2%
E 3
 
4.1%
M 2
 
2.7%
H 2
 
2.7%
I 2
 
2.7%
T 2
 
2.7%
R 1
 
1.4%
S 1
 
1.4%
Other values (5) 5
 
6.8%

사업자등록번호
Text

Distinct: 41
Distinct (%): 97.6%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 504
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 40
Unique (%): 95.2%

Sample

1st row: 137-81-20082
2nd row: 211-88-73299
3rd row: 121-86-31968
4th row: 137-81-22547
5th row: 551-86-00161
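
The length and character statistics for this column (every value 12 characters long, 11 distinct characters, 84 dashes across 42 rows) suggest a fixed `NNN-NN-NNNNN` layout. The sketch below checks the five sample rows against that layout with a regular expression. The pattern is inferred from this report's samples and is not taken from an official specification.

```python
import re

# Assumed layout inferred from the samples: 3 digits, 2 digits, 5 digits.
PATTERN = re.compile(r"\d{3}-\d{2}-\d{5}")

samples = [
    "137-81-20082",
    "211-88-73299",
    "121-86-31968",
    "137-81-22547",
    "551-86-00161",
]

all_match = all(PATTERN.fullmatch(s) is not None for s in samples)
lengths = {len(s) for s in samples}
dashes = sum(s.count("-") for s in samples)
print(all_match, lengths, dashes)
```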
Value | Count | Frequency (%)
139-81-08221 | 2 | 4.8%
137-81-20082 | 1 | 2.4%
131-86-55161 | 1 | 2.4%
134-06-54631 | 1 | 2.4%
122-86-43583 | 1 | 2.4%
137-81-32031 | 1 | 2.4%
105-86-20421 | 1 | 2.4%
377-87-00371 | 1 | 2.4%
683-21-00116 | 1 | 2.4%
206-87-00220 | 1 | 2.4%
Other values (31) | 31 | 73.8%

Most occurring characters

Value | Count | Frequency (%)
1 | 101 | 20.0%
- | 84 | 16.7%
8 | 55 | 10.9%
2 | 53 | 10.5%
3 | 46 | 9.1%
6 | 40 | 7.9%
0 | 33 | 6.5%
7 | 29 | 5.8%
9 | 27 | 5.4%
5 | 20 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 420 | 83.3%
Dash Punctuation | 84 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 101 | 24.0%
8 | 55 | 13.1%
2 | 53 | 12.6%
3 | 46 | 11.0%
6 | 40 | 9.5%
0 | 33 | 7.9%
7 | 29 | 6.9%
9 | 27 | 6.4%
5 | 20 | 4.8%
4 | 16 | 3.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 84 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 504 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 101 | 20.0%
- | 84 | 16.7%
8 | 55 | 10.9%
2 | 53 | 10.5%
3 | 46 | 9.1%
6 | 40 | 7.9%
0 | 33 | 6.5%
7 | 29 | 5.8%
9 | 27 | 5.4%
5 | 20 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 504 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 101 | 20.0%
- | 84 | 16.7%
8 | 55 | 10.9%
2 | 53 | 10.5%
3 | 46 | 9.1%
6 | 40 | 7.9%
0 | 33 | 6.5%
7 | 29 | 5.8%
9 | 27 | 5.4%
5 | 20 | 4.0%

지원 불가 시작일
DateTime

Distinct: 6
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2017-08-22 00:00:00
Maximum: 2018-12-20 00:00:00
Histogram with fixed size bins (bins=6)

지원 불가 종료일
DateTime

Distinct: 3
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2018-08-21 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=3)

제한 사유
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Value | Count
선정포기 | 21
선정포기 | 17
당사 내부사정으로 인한 지원 포기 | 1
사유 : 업체 내부사정으로 포기 | 1
사유 : 업체내부사정 | 1

Length

Max length: 18
Median length: 17.5
Mean length: 5.5
Min length: 4

Unique

Unique: 4
Unique (%): 9.5%

Sample

1st row: 선정포기
2nd row: 선정포기
3rd row: 선정포기
4th row: 선정포기
5th row: 선정포기

Common Values

Value | Count | Frequency (%)
선정포기 | 21 | 50.0%
선정포기 | 17 | 40.5%
당사 내부사정으로 인한 지원 포기 | 1 | 2.4%
사유 : 업체 내부사정으로 포기 | 1 | 2.4%
사유 : 업체내부사정 | 1 | 2.4%
회사사정으로 참가 포기 | 1 | 2.4%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
선정포기 38
70.4%
포기 3
 
5.6%
내부사정으로 2
 
3.7%
사유 2
 
3.7%
2
 
3.7%
당사 1
 
1.9%
인한 1
 
1.9%
지원 1
 
1.9%
업체 1
 
1.9%
업체내부사정 1
 
1.9%
Other values (2) 2
 
3.7%

등록아이디
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B

Value | Count
swayi79 | 39
incheon02 | 3

Length

Max length: 9
Median length: 7
Mean length: 7.1428571
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: swayi79
2nd row: swayi79
3rd row: swayi79
4th row: swayi79
5th row: swayi79

Common Values

Value | Count | Frequency (%)
swayi79 | 39 | 92.9%
incheon02 | 3 | 7.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
swayi79 | 39 | 92.9%
incheon02 | 3 | 7.1%

등록일
DateTime

Distinct: 6
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2017-10-13 00:00:00
Maximum: 2018-12-20 00:00:00
Histogram with fixed size bins (bins=6)

수정일
DateTime

Distinct: 6
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 468.0 B
Minimum: 2017-10-13 00:00:00
Maximum: 2018-12-20 00:00:00
Histogram with fixed size bins (bins=6)

Interactions


Correlations

 | 번호 | 기업명 | 사업자등록번호 | 지원 불가 시작일 | 지원 불가 종료일 | 제한 사유 | 등록아이디 | 등록일 | 수정일
번호 | 1.000 | 0.940 | 0.940 | 0.762 | 0.000 | 0.432 | 0.840 | 0.762 | 0.762
기업명 | 0.940 | 1.000 | 1.000 | 1.000 | 1.000 | 0.980 | 1.000 | 1.000 | 1.000
사업자등록번호 | 0.940 | 1.000 | 1.000 | 1.000 | 1.000 | 0.980 | 1.000 | 1.000 | 1.000
지원 불가 시작일 | 0.762 | 1.000 | 1.000 | 1.000 | 1.000 | 0.970 | 1.000 | 1.000 | 1.000
지원 불가 종료일 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.332 | 1.000 | 1.000
제한 사유 | 0.432 | 0.980 | 0.980 | 0.970 | 1.000 | 1.000 | 1.000 | 0.970 | 0.970
등록아이디 | 0.840 | 1.000 | 1.000 | 1.000 | 0.332 | 1.000 | 1.000 | 1.000 | 1.000
등록일 | 0.762 | 1.000 | 1.000 | 1.000 | 1.000 | 0.970 | 1.000 | 1.000 | 1.000
수정일 | 0.762 | 1.000 | 1.000 | 1.000 | 1.000 | 0.970 | 1.000 | 1.000 | 1.000

 | 제한 사유 | 등록아이디
제한 사유 | 1.000 | 0.949
등록아이디 | 0.949 | 1.000

 | 번호 | 제한 사유 | 등록아이디
번호 | 1.000 | 0.218 | 0.599
제한 사유 | 0.218 | 1.000 | 0.949
등록아이디 | 0.599 | 0.949 | 1.000
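
The 0.949 association between 제한 사유 and 등록아이디 is consistent with a bias-corrected Cramér's V. The sketch below recomputes it from a contingency table assembled from this report's own figures: the Common Values counts for 제한 사유, the 39/3 split of 등록아이디, and the last sample rows, which show the three incheon02 records carrying three distinct reasons. Treating that statistic as the one behind the reported value is an assumption, and the function name `cramers_v_corrected` is hypothetical.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    r, k = len(table), len(table[0])
    # Small-sample bias correction of phi squared and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

# Rows: 등록아이디 (swayi79, incheon02); columns: the six 제한 사유 values.
table = [
    [21, 17, 1, 0, 0, 0],  # swayi79: 39 records
    [0, 0, 0, 1, 1, 1],    # incheon02: 3 records
]
v = cramers_v_corrected(table)
print(round(v, 3))
```

Without the correction the statistic would be exactly 1, because each 제한 사유 value occurs under only one 등록아이디; the correction lowers it for a table this small.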

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
 | 번호 | 기업명 | 사업자등록번호 | 지원 불가 시작일 | 지원 불가 종료일 | 제한 사유 | 등록아이디 | 등록일 | 수정일
0 | 1 | 극동전자정밀(주) | 137-81-20082 | 2018-12-20 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-20 | 2018-12-20
1 | 2 | 주식회사 울트라브이 | 211-88-73299 | 2018-12-20 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-20 | 2018-12-20
2 | 3 | 주식회사에이치에스글로벌 | 121-86-31968 | 2018-12-20 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-20 | 2018-12-20
3 | 4 | 인성엔프라(주) | 137-81-22547 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19
4 | 5 | 주식회사 해내음식품 | 551-86-00161 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19
5 | 6 | 이레포스 | 121-14-48841 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19
6 | 7 | (주)제이앤씨글로벌 | 121-86-07605 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19
7 | 8 | (주)에코매스코리아 | 122-81-99253 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19
8 | 9 | (주)앤제화 | 137-81-13693 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19
9 | 10 | 경서기계산업 | 137-09-63289 | 2018-12-19 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-19 | 2018-12-19

Last rows
 | 번호 | 기업명 | 사업자등록번호 | 지원 불가 시작일 | 지원 불가 종료일 | 제한 사유 | 등록아이디 | 등록일 | 수정일
32 | 33 | 웰빙헬스팜 | 123-32-69701 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 | 2018-12-17
33 | 34 | (주)미래과학교육원 | 122-86-11233 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 | 2018-12-17
34 | 35 | 호재식품 | 122-29-82981 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 | 2018-12-17
35 | 36 | 주식회사에코란트 | 131-86-51184 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 | 2018-12-17
36 | 37 | 대신DEMISTER | 139-12-61477 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 | 2018-12-17
37 | 38 | (주) 대화연료펌프 | 139-81-08221 | 2018-12-17 | 2018-12-31 | 선정포기 | swayi79 | 2018-12-17 | 2018-12-17
38 | 39 | (주)태영금속 | 131-86-00671 | 2018-08-23 | 2021-12-31 | 당사 내부사정으로 인한 지원 포기 | swayi79 | 2018-09-28 | 2018-09-28
39 | 40 | 경인북부수산업협동조합 유통사업소 | 137-82-04997 | 2018-05-16 | 2018-12-31 | 사유 : 업체 내부사정으로 포기 | incheon02 | 2018-05-16 | 2018-05-16
40 | 41 | 주식회사 대림시스템 | 131-86-33717 | 2018-05-16 | 2018-12-31 | 사유 : 업체내부사정 | incheon02 | 2018-05-16 | 2018-05-16
41 | 42 | 주식회사 새벽 | 121-86-10361 | 2017-08-22 | 2018-08-21 | 회사사정으로 참가 포기 | incheon02 | 2017-10-13 | 2017-10-13