Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 478.5 KiB
Average record size in memory: 49.0 B

Variable types

Numeric: 1
Categorical: 3
Text: 1

Dataset

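Description: Results of a survey on intent to enroll in linked coverage among workers eligible for the youth-linked Naeil Chaeum Mutual Aid (청년연계형 내일채움공제), a program managed by the Korea SMEs and Startups Agency (중소벤처기업진흥공단) to support long-term tenure and asset building for employees of small, medium-sized, and venture businesses.
URL: https://www.data.go.kr/data/15102327/fileData.do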

Alerts

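순번 (serial number) is highly overall correlated with 청년내일채움공제 가입년도 (Youth Naeil Chaeum Mutual Aid enrollment year) [High correlation]
청년내일채움공제 가입년도 (Youth Naeil Chaeum Mutual Aid enrollment year) is highly overall correlated with 순번 (serial number) [High correlation]
내일채움공제 연계가입 유무 (whether enrolled in linked Naeil Chaeum Mutual Aid) is highly imbalanced (85.2%) [Imbalance]
순번 (serial number) has unique values [Unique]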
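
The 85.2% in the imbalance alert can be checked against the class counts reported later for 내일채움공제 연계가입 유무 (9788 and 212). The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalized by the maximum entropy for that number of classes. That definition is an assumption here, not something the report states, although it does reproduce the reported figure. The function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the base-2 Shannon entropy of the class shares.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    Assumed definition, chosen because it matches the figure in the alert.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1.0 - entropy / math.log2(k)


# Class counts taken from the report: 미가입 = 9788, 가입 = 212
score = imbalance_score([9788, 212])
print(f"{score:.1%}")  # 85.2%
```

A perfectly balanced two-class column would score 0.0 under this definition, so 85.2% indicates that one class (미가입, 97.9% of rows) carries almost all of the observations.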

Reproduction

Analysis started: 2023-12-12 02:04:55.990472
Analysis finished: 2023-12-12 02:04:56.735535
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
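
As a quick consistency check, the reported duration follows from the two timestamps. This is a minimal sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of the report
started = datetime.fromisoformat("2023-12-12 02:04:55.990472")
finished = datetime.fromisoformat("2023-12-12 02:04:56.735535")

# Elapsed wall-clock time in seconds
duration = (finished - started).total_seconds()
print(round(duration, 2))  # 0.75
```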

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 47767.245
Minimum: 6
Maximum: 95290
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 6
5-th percentile: 5142.65
Q1: 24282.5
Median: 48029.5
Q3: 71278.75
95-th percentile: 90274.35
Maximum: 95290
Range: 95284
Interquartile range (IQR): 46996.25

Descriptive statistics

Standard deviation: 27261.486
Coefficient of variation (CV): 0.57071507
Kurtosis: -1.1910502
Mean: 47767.245
Median Absolute Deviation (MAD): 23515
Skewness: -0.007478173
Sum: 4.7767245 × 10^8
Variance: 7.4318864 × 10^8
Monotonicity: Not monotonic
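
Several of the statistics for 순번 are derived from others in the same block, so they can be cross-checked with simple arithmetic. The sketch below uses only the values printed in the report and the standard definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × n). Small differences in the last digits are expected because the report rounds its inputs.

```python
import math

# Values copied from the report for 순번
minimum, maximum = 6, 95290
q1, q3 = 24282.5, 71278.75
mean, std, n = 47767.245, 27261.486, 10000

value_range = maximum - minimum  # reported: 95284
iqr = q3 - q1                    # reported: 46996.25
cv = std / mean                  # reported: 0.57071507
variance = std ** 2              # reported: 7.4318864e8
total = mean * n                 # reported: 4.7767245e8

print(value_range, iqr)
print(f"{cv:.6f}", f"{variance:.5e}", f"{total:.7e}")

# Relative tolerances absorb the rounding of the printed inputs
assert math.isclose(cv, 0.57071507, rel_tol=1e-6)
assert math.isclose(variance, 7.4318864e8, rel_tol=1e-6)
assert math.isclose(total, 4.7767245e8, rel_tol=1e-9)
```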
Histogram with fixed size bins (bins=50)

Common Values

Value                 Count   Frequency (%)
90376                 1       < 0.1%
25011                 1       < 0.1%
72095                 1       < 0.1%
60680                 1       < 0.1%
14193                 1       < 0.1%
66574                 1       < 0.1%
61841                 1       < 0.1%
93470                 1       < 0.1%
94415                 1       < 0.1%
45964                 1       < 0.1%
Other values (9990)   9990    99.9%

Minimum 10 values

Value   Count   Frequency (%)
6       1       < 0.1%
25      1       < 0.1%
39      1       < 0.1%
49      1       < 0.1%
66      1       < 0.1%
72      1       < 0.1%
85      1       < 0.1%
87      1       < 0.1%
89      1       < 0.1%
96      1       < 0.1%

Maximum 10 values

Value   Count   Frequency (%)
95290   1       < 0.1%
95266   1       < 0.1%
95265   1       < 0.1%
95264   1       < 0.1%
95263   1       < 0.1%
95257   1       < 0.1%
95251   1       < 0.1%
95221   1       < 0.1%
95217   1       < 0.1%
95214   1       < 0.1%

청년내일채움공제 가입년도
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2018년
2nd row: 2017년
3rd row: 2018년
4th row: 2018년
5th row: 2017년

Common Values

Value    Count   Frequency (%)
2018년   7297    73.0%
2017년   2578    25.8%
2016년   125     1.2%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value    Count   Frequency (%)
2018년   7297    73.0%
2017년   2578    25.8%
2016년   125     1.2%

가입자 (enrollee)
Text

Distinct: 111
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 40000
Distinct characters: 113
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
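
To illustrate how such character properties can be read, the sketch below classifies the characters of the five sample values shown in this section by Unicode general category, using Python's standard `unicodedata` module. It covers only the "categories" breakdown. Script and block lookups are not available in the standard library and are left out. The mapping of category codes to long names is limited to the four categories that appear in this report.

```python
import unicodedata
from collections import Counter

# The five masked sample values listed for this variable
samples = ["최o o", "배o o", "나o o", "김o o", "정o o"]

# Long names for the general-category codes that occur in this report
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lo": "Other Letter",
    "Zs": "Space Separator",
}

# Count every character of every sample by its general category code
category_counts = Counter(
    unicodedata.category(ch) for name in samples for ch in name
)

total = sum(category_counts.values())
for code, count in category_counts.most_common():
    print(f"{CATEGORY_NAMES[code]:<17} {count:>3} {count / total:.1%}")
```

On these five samples the shares are 50% Lowercase Letter (the masking character `o`), 25% Space Separator, and 25% Other Letter (the Hangul family name), which is the same pattern the full column shows.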

Unique

Unique: 19
Unique (%): 0.2%

Sample

1st row: 최o o
2nd row: 배o o
3rd row: 나o o
4th row: 김o o
5th row: 정o o

Words

Value                Count   Frequency (%)
o                    10000   50.0%
김o                  2150    10.8%
이o                  1523    7.6%
박o                  873     4.4%
정o                  489     2.4%
최o                  445     2.2%
조o                  296     1.5%
강o                  271     1.4%
윤o                  206     1.0%
임o                  202     1.0%
Other values (102)   3545    17.7%

Most occurring characters

Value                Count   Frequency (%)
o                    20000   50.0%
(space)              10000   25.0%
김                   2150    5.4%
이                   1523    3.8%
박                   873     2.2%
정                   489     1.2%
최                   445     1.1%
조                   296     0.7%
강                   271     0.7%
윤                   206     0.5%
Other values (103)   3747    9.4%

Most occurring categories

Value              Count   Frequency (%)
Lowercase Letter   20000   50.0%
Space Separator    10000   25.0%
Other Letter       9991    25.0%
Uppercase Letter   9       < 0.1%

Most frequent character per category

Other Letter

Value               Count   Frequency (%)
김                  2150    21.5%
이                  1523    15.2%
박                  873     8.7%
정                  489     4.9%
최                  445     4.5%
조                  296     3.0%
강                  271     2.7%
윤                  206     2.1%
임                  202     2.0%
[?]                 186     1.9%
Other values (94)   3350    33.5%

Uppercase Letter

Value   Count   Frequency (%)
J       2       22.2%
L       2       22.2%
S       1       11.1%
Q       1       11.1%
W       1       11.1%
H       1       11.1%
P       1       11.1%

Lowercase Letter

Value   Count   Frequency (%)
o       20000   100.0%

Space Separator

Value     Count   Frequency (%)
(space)   10000   100.0%

Most occurring scripts

Value    Count   Frequency (%)
Latin    20009   50.0%
Common   10000   25.0%
Hangul   9991    25.0%

Most frequent character per script

Hangul

Value               Count   Frequency (%)
김                  2150    21.5%
이                  1523    15.2%
박                  873     8.7%
정                  489     4.9%
최                  445     4.5%
조                  296     3.0%
강                  271     2.7%
윤                  206     2.1%
임                  202     2.0%
[?]                 186     1.9%
Other values (94)   3350    33.5%

Latin

Value   Count   Frequency (%)
o       20000   > 99.9%
J       2       < 0.1%
L       2       < 0.1%
S       1       < 0.1%
Q       1       < 0.1%
W       1       < 0.1%
H       1       < 0.1%
P       1       < 0.1%

Common

Value     Count   Frequency (%)
(space)   10000   100.0%

Most occurring blocks

Value    Count   Frequency (%)
ASCII    30009   75.0%
Hangul   9991    25.0%

Most frequent character per block

ASCII

Value     Count   Frequency (%)
o         20000   66.6%
(space)   10000   33.3%
J         2       < 0.1%
L         2       < 0.1%
S         1       < 0.1%
Q         1       < 0.1%
W         1       < 0.1%
H         1       < 0.1%
P         1       < 0.1%

Hangul

Value               Count   Frequency (%)
김                  2150    21.5%
이                  1523    15.2%
박                  873     8.7%
정                  489     4.9%
최                  445     4.5%
조                  296     3.0%
강                  271     2.7%
윤                  206     2.1%
임                  202     2.0%
[?]                 186     1.9%
Other values (94)   3350    33.5%

내일채움공제 연계가입 의사 설문조사 결과 (survey result on intent to enroll in linked Naeil Chaeum Mutual Aid)
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 6
Median length: 4
Mean length: 4.1865
Min length: 3
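
The length statistics above follow directly from the category counts listed for this variable. As a minimal sketch, the snippet below recomputes them as count-weighted statistics over the character length of each answer. It assumes length means the number of Unicode code points, spaces included, which is consistent with the reported values.

```python
import statistics

# Category counts copied from the report for this variable
counts = {
    "보통이다": 4113,      # 4 characters
    "그렇다": 2202,        # 3 characters
    "매우 그렇다": 1922,   # 6 characters, including the space
    "아니다": 1101,        # 3 characters
    "매우 아니다": 662,    # 6 characters, including the space
}

# Expand to one length per observation so the median is straightforward
lengths = [len(value) for value, count in counts.items() for _ in range(count)]

mean_length = sum(lengths) / len(lengths)
median_length = statistics.median(lengths)

print(min(lengths), max(lengths), median_length, mean_length)
```

The weighted sum of lengths is 41865 over 10000 rows, which gives the reported mean of 4.1865.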

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 매우 그렇다
2nd row: 그렇다
3rd row: 매우 그렇다
4th row: 보통이다
5th row: 그렇다

Common Values

Value                             Count   Frequency (%)
보통이다 (neutral)                4113    41.1%
그렇다 (agree)                    2202    22.0%
매우 그렇다 (strongly agree)      1922    19.2%
아니다 (disagree)                 1101    11.0%
매우 아니다 (strongly disagree)   662     6.6%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value      Count   Frequency (%)
그렇다     4124    32.8%
보통이다   4113    32.7%
매우       2584    20.5%
아니다     1763    14.0%
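
The word counts differ from the category counts because multi-word answers such as 매우 그렇다 contribute one count to each of their words. The sketch below reproduces the word table from the category counts, assuming plain whitespace tokenization. That tokenization is an assumption, not something the report states, but it matches the reported numbers.

```python
from collections import Counter

# Category counts copied from the Common Values table of this variable
counts = {
    "보통이다": 4113,
    "그렇다": 2202,
    "매우 그렇다": 1922,
    "아니다": 1101,
    "매우 아니다": 662,
}

# Split each answer on whitespace and credit every token with the answer's count
words = Counter()
for value, count in counts.items():
    for token in value.split():
        words[token] += count

total_words = sum(words.values())
for token, count in words.most_common():
    print(token, count, f"{count / total_words:.1%}")
```

For example, 그렇다 receives 2202 + 1922 = 4124 and 매우 receives 1922 + 662 = 2584, out of 12584 tokens in total.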

내일채움공제 연계가입 유무 (whether enrolled in linked Naeil Chaeum Mutual Aid)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9788
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 미가입
2nd row: 미가입
3rd row: 미가입
4th row: 미가입
5th row: 미가입

Common Values

Value                   Count   Frequency (%)
미가입 (not enrolled)   9788    97.9%
가입 (enrolled)         212     2.1%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value    Count   Frequency (%)
미가입   9788    97.9%
가입     212     2.1%

Interactions


Correlations

Variable                                       (1)     (2)     (3)     (4)
(1) 순번                                       1.000   0.807   0.409   0.000
(2) 청년내일채움공제 가입년도                  0.807   1.000   0.201   0.003
(3) 내일채움공제 연계가입 의사 설문조사 결과   0.409   0.201   1.000   0.058
(4) 내일채움공제 연계가입 유무                 0.000   0.003   0.058   1.000

Variable                                       (1)     (2)     (3)
(1) 내일채움공제 연계가입 유무                 1.000   0.071   0.004
(2) 내일채움공제 연계가입 의사 설문조사 결과   0.071   1.000   0.154
(3) 청년내일채움공제 가입년도                  0.004   0.154   1.000

Variable                                       (1)     (2)     (3)     (4)
(1) 순번                                       1.000   0.701   0.182   0.000
(2) 청년내일채움공제 가입년도                  0.701   1.000   0.154   0.004
(3) 내일채움공제 연계가입 의사 설문조사 결과   0.182   0.154   1.000   0.071
(4) 내일채움공제 연계가입 유무                 0.000   0.004   0.071   1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index   순번    청년내일채움공제 가입년도   가입자   내일채움공제 연계가입 의사 설문조사 결과   내일채움공제 연계가입 유무
90375   90376   2018년                      최o o    매우 그렇다                                미가입
17289   17290   2017년                      배o o    그렇다                                     미가입
91548   91549   2018년                      나o o    매우 그렇다                                미가입
47145   47146   2018년                      김o o    보통이다                                   미가입
3535    3536    2017년                      정o o    그렇다                                     미가입
28462   28463   2018년                      유o o    그렇다                                     미가입
63956   63957   2018년                      김o o    보통이다                                   미가입
91788   91789   2018년                      임o o    그렇다                                     미가입
66771   66772   2018년                      원o o    매우 아니다                                미가입
78720   78721   2018년                      박o o    보통이다                                   미가입

Last rows

Index   순번    청년내일채움공제 가입년도   가입자   내일채움공제 연계가입 의사 설문조사 결과   내일채움공제 연계가입 유무
89125   89126   2018년                      박o o    보통이다                                   미가입
20838   20839   2017년                      오o o    그렇다                                     미가입
23707   23708   2017년                      김o o    보통이다                                   미가입
44336   44337   2018년                      백o o    그렇다                                     미가입
88832   88833   2018년                      나o o    매우 그렇다                                미가입
16612   16613   2017년                      이o o    보통이다                                   미가입
90942   90943   2018년                      박o o    아니다                                     미가입
86304   86305   2018년                      손o o    보통이다                                   미가입
45995   45996   2018년                      김o o    그렇다                                     미가입
12487   12488   2017년                      최o o    보통이다                                   미가입