Overview

Dataset statistics

Number of variables: 3
Number of observations: 55
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 27.4 B

Variable types

Numeric: 1
Text: 1
Categorical: 1

Dataset

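Description: Yearly status of new participation by external institutions that support enrollment in 내일채움공제 (Naeil Chaeum Gongje), a program that encourages long-term retention and asset building for core personnel employed at small and medium-sized enterprises.
URL: https://www.data.go.kr/data/15092109/fileData.do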

Alerts

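참여년도 (participation year) is highly overall correlated with 기관유형 (institution type) [High correlation]
기관유형 (institution type) is highly overall correlated with 참여년도 (participation year) [High correlation]
기관유형 (institution type) is highly imbalanced (69.5%) [Imbalance]
기관명 (institution name) has unique values [Unique]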
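
The 69.5% imbalance figure for 기관유형 is numerically consistent with one minus the normalized Shannon entropy of the class counts (52 and 3). The sketch below assumes that definition; the function name `imbalance_score` and the handling of the single-class case are illustrative choices, not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        # Assumption of this sketch: a single class has no imbalance to report.
        return 0.0
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(len(probs))


# Class counts for 기관유형 as listed in the report: 비금융권 = 52, 금융권 = 3.
score = imbalance_score([52, 3])
print(f"{score:.1%}")
```

A score near 0 means the classes are evenly balanced and a score near 1 means almost all rows fall in one class.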

Reproduction

Analysis started: 2023-12-12 11:53:07.334948
Analysis finished: 2023-12-12 11:53:07.743577
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

참여년도 (participation year)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 10.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2020.5091
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 627.0 B

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2020.5
Median: 2021
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 0.5
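
The quantile figures above can be reproduced from the value counts reported for 참여년도. The sketch below assumes linear interpolation between the two nearest order statistics, which matches the reported Q1 of 2020.5; the helper name `percentile` is illustrative.

```python
from collections import Counter

# Value counts for 참여년도 as listed in the report.
year_counts = {2017: 6, 2018: 2, 2019: 2, 2020: 4, 2021: 30, 2022: 11}
years = sorted(Counter(year_counts).elements())  # 55 sorted observations


def percentile(sorted_values, p):
    """Linear-interpolation percentile for p in [0, 1] on pre-sorted data."""
    pos = (len(sorted_values) - 1) * p
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * frac


q1 = percentile(years, 0.25)
median = percentile(years, 0.50)
q3 = percentile(years, 0.75)
iqr = q3 - q1
print(q1, median, q3, iqr)
```

Q1 falls halfway between the 14th and 15th sorted observations (2020 and 2021), which is why it is not a whole year.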

Descriptive statistics

Standard deviation: 1.5137974
Coefficient of variation (CV): 0.00074921582
Kurtosis: 0.98661551
Mean: 2020.5091
Median Absolute Deviation (MAD): 0
Skewness: -1.4306388
Sum: 111128
Variance: 2.2915825
Monotonicity: Not monotonic
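
As a cross-check, most of the descriptive statistics above can be recomputed from the value counts alone. The sketch below uses only the Python standard library and assumes the sample (n - 1) variance and the adjusted Fisher-Pearson skewness estimator, which agree numerically with the reported values; kurtosis is left out.

```python
import math
import statistics
from collections import Counter

# Value counts for 참여년도 as listed in the report.
year_counts = {2017: 6, 2018: 2, 2019: 2, 2020: 4, 2021: 30, 2022: 11}
years = sorted(Counter(year_counts).elements())

n = len(years)
total = sum(years)
mean = statistics.fmean(years)
variance = statistics.variance(years)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                        # coefficient of variation
median = statistics.median(years)
mad = statistics.median([abs(y - median) for y in years])

# Adjusted Fisher-Pearson skewness (assumed estimator, see lead-in).
m2 = sum((y - mean) ** 2 for y in years) / n
m3 = sum((y - mean) ** 3 for y in years) / n
skewness = (m3 / m2 ** 1.5) * math.sqrt(n * (n - 1)) / (n - 2)

print(total, round(mean, 4), round(variance, 7), round(std, 7), mad, round(skewness, 4))
```

The MAD is 0 because more than half of the observations (30 of 55) equal the median year, 2021.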
Histogram with fixed size bins (bins=6)
Value  Count  Frequency (%)
2021   30     54.5%
2022   11     20.0%
2017   6      10.9%
2020   4      7.3%
2018   2      3.6%
2019   2      3.6%

기관명 (institution name)
Text

UNIQUE 

Distinct: 55
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 572.0 B

Length

Max length: 20
Median length: 13
Mean length: 8.4363636
Min length: 3

Characters and Unicode

Total characters: 464
Distinct characters: 130
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
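
To illustrate how such character properties are read, the sketch below classifies the characters of three institution names taken from the Sample section using the standard-library `unicodedata` module. It covers only these three names, not the full column, so its counts are a small subset of the totals in this report.

```python
import unicodedata
from collections import Counter

# Three 기관명 values copied from the Sample section of the report.
names = [
    "(사)중소기업융합경기연합회",
    "㈜베스트인 광주지사",
    "경북IT융합산업기술원",
]

# General category of every character, e.g. Lo = Other Letter, Lu = Uppercase Letter,
# So = Other Symbol, Ps / Pe = Open / Close Punctuation, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for name in names for ch in name)

for category, count in categories.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter (Lo), while the enclosed symbol ㈜ is an Other Symbol (So), which is why it is counted separately from the letters.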

Unique

Unique: 55
Unique (%): 100.0%

Sample

1st row: 중소기업은행
2nd row: 신한은행
3rd row: 우리은행
4th row: (사)중소기업기술혁신협회 대구경북지회
5th row: (사)중소기업융합경기연합회
Value  Count  Frequency (%)
㈜베스트인  2  3.4%
중소기업은행  1  1.7%
여성중앙회종로여성인력개발센터  1  1.7%
한국직업지도진흥원강남지부  1  1.7%
잡모아  1  1.7%
잡모아부천지점  1  1.7%
잡모아천호지점  1  1.7%
전북경영자총협회  1  1.7%
제니엘  1  1.7%
제천단양상공회의소  1  1.7%
Other values (47)  47  81.0%

Most occurring characters

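Value  Count  Frequency (%)
[missing]  28  6.0%
[missing]  15  3.2%
[missing]  15  3.2%
[missing]  15  3.2%
[missing]  14  3.0%
[missing]  13  2.8%
[missing]  12  2.6%
[missing]  11  2.4%
[missing]  10  2.2%
[missing]  10  2.2%
Other values (120)  321  69.2%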

Most occurring categories

Value  Count  Frequency (%)
Other Letter       450  97.0%
Other Symbol       3    0.6%
Open Punctuation   3    0.6%
Close Punctuation  3    0.6%
Space Separator    3    0.6%
Uppercase Letter   2    0.4%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
[missing]  28  6.2%
[missing]  15  3.3%
[missing]  15  3.3%
[missing]  15  3.3%
[missing]  14  3.1%
[missing]  13  2.9%
[missing]  12  2.7%
[missing]  11  2.4%
[missing]  10  2.2%
[missing]  10  2.2%
Other values (114)  307  68.2%

Uppercase Letter
Value  Count  Frequency (%)
T  1  50.0%
I  1  50.0%

Other Symbol
Value  Count  Frequency (%)
㈜  3  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  3  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  3  100.0%

Space Separator
Value  Count  Frequency (%)
(space)  3  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  453  97.6%
Common  9    1.9%
Latin   2    0.4%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
[missing]  28  6.2%
[missing]  15  3.3%
[missing]  15  3.3%
[missing]  15  3.3%
[missing]  14  3.1%
[missing]  13  2.9%
[missing]  12  2.6%
[missing]  11  2.4%
[missing]  10  2.2%
[missing]  10  2.2%
Other values (115)  310  68.4%

Common
Value  Count  Frequency (%)
(  3  33.3%
)  3  33.3%
(space)  3  33.3%

Latin
Value  Count  Frequency (%)
T  1  50.0%
I  1  50.0%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  450  97.0%
ASCII   11   2.4%
None    3    0.6%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
[missing]  28  6.2%
[missing]  15  3.3%
[missing]  15  3.3%
[missing]  15  3.3%
[missing]  14  3.1%
[missing]  13  2.9%
[missing]  12  2.7%
[missing]  11  2.4%
[missing]  10  2.2%
[missing]  10  2.2%
Other values (114)  307  68.2%

None
Value  Count  Frequency (%)
㈜  3  100.0%

ASCII
Value  Count  Frequency (%)
(  3  27.3%
)  3  27.3%
(space)  3  27.3%
T  1  9.1%
I  1  9.1%

기관유형 (institution type)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Memory size: 572.0 B
비금융권 (non-financial sector): 52
금융권 (financial sector): 3

Length

Max length: 4
Median length: 4
Mean length: 3.9454545
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 금융권
2nd row: 금융권
3rd row: 금융권
4th row: 비금융권
5th row: 비금융권

Common Values

Value  Count  Frequency (%)
비금융권 (non-financial sector)  52  94.5%
금융권 (financial sector)  3  5.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
비금융권  52  94.5%
금융권  3  5.5%

Interactions


Correlations

          참여년도  기관명  기관유형
참여년도  1.000  1.000  0.764
기관명  1.000  1.000  1.000
기관유형  0.764  1.000  1.000

          참여년도  기관유형
참여년도  1.000  0.863
기관유형  0.863  1.000
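
The 0.863 in the two-variable matrix is numerically consistent with a bias-corrected Cramér's V between 참여년도 and 기관유형. The sketch below is a check under that assumption, not a statement of how the report was computed. The contingency table is derived from the report itself: the value counts per year, plus the fact that the only three 금융권 rows are sample rows 0 to 2 (years 2018, 2019, 2018). The function name `cramers_v_corrected` is illustrative.

```python
import math

# Rows: 참여년도 2017..2022. Columns: (금융권, 비금융권) counts, derived as described above.
table = [
    (0, 6),   # 2017
    (2, 0),   # 2018
    (1, 1),   # 2019
    (0, 4),   # 2020
    (0, 30),  # 2021
    (0, 11),  # 2022
]


def cramers_v_corrected(rows):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(r) for r in rows)
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]

    # Pearson chi-squared statistic.
    chi2 = 0.0
    for i, row in enumerate(rows):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    # Bias correction for small samples.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


v = cramers_v_corrected(table)
print(round(v, 3))
```

The association is strong because all three 금융권 institutions joined in 2018 or 2019, years in which only one 비금융권 institution joined.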

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

    참여년도  기관명  기관유형
0   2018  중소기업은행  금융권
1   2019  신한은행  금융권
2   2018  우리은행  금융권
3   2022  (사)중소기업기술혁신협회 대구경북지회  비금융권
4   2022  (사)중소기업융합경기연합회  비금융권
5   2022  (사)한국문화산업협회  비금융권
6   2022  ㈜누구나잡  비금융권
7   2022  ㈜베스트인 광주지사  비금융권
8   2022  ㈜베스트인 전북지사  비금융권
9   2022  경북IT융합산업기술원  비금융권

    참여년도  기관명  기관유형
45  2020  리부트코리아  비금융권
46  2020  블루안메타  비금융권
47  2020  한국서비스진흥협회  비금융권
48  2019  글로벌최고경영자클럽  비금융권
49  2017  벤처기업협회  비금융권
50  2017  중소기업융합중앙회  비금융권
51  2017  한국경영혁신중소기업협회  비금융권
52  2017  한국여성경제인협회  비금융권
53  2017  중소기업기술혁신협회  비금융권
54  2017  한국경영기술지도사회  비금융권