Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory52.3 B

Variable types

Categorical5
Numeric1

Alerts

stats_year has constant value ""Constant
qestn_cn is highly overall correlated with answer_value and 2 other fieldsHigh correlation
qestn_id is highly overall correlated with answer_value and 2 other fieldsHigh correlation
answer_value is highly overall correlated with qestn_id and 2 other fieldsHigh correlation
answer_cn is highly overall correlated with answer_value and 2 other fieldsHigh correlation

Reproduction

Analysis started2023-12-10 10:02:00.760764
Analysis finished2023-12-10 10:02:01.963389
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

stats_year
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 100
100.0%

Length

2023-12-10T19:02:02.137520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:02.347385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 100
100.0%

seq_no
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
3
32 
2
29 
1
27 
4
3530
 
3

Length

Max length4
Median length1
Mean length1.09
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3530
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
3 32
32.0%
2 29
29.0%
1 27
27.0%
4 9
 
9.0%
3530 3
 
3.0%

Length

2023-12-10T19:02:02.602355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:03.064595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 32
32.0%
2 29
29.0%
1 27
27.0%
4 9
 
9.0%
3530 3
 
3.0%

qestn_id
Categorical

HIGH CORRELATION 

Distinct34
Distinct (%)34.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A5
 
4
Q45_N9
 
4
Q45_N7
 
4
Q26
 
4
Q29
 
4
Other values (29)
80 

Length

Max length7
Median length6
Mean length4.9
Min length2

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st rowA5
2nd rowQ45_N7
3rd rowQ17
4th rowQ18
5th rowQ26

Common Values

ValueCountFrequency (%)
A5 4
 
4.0%
Q45_N9 4
 
4.0%
Q45_N7 4
 
4.0%
Q26 4
 
4.0%
Q29 4
 
4.0%
Q30 4
 
4.0%
Q45_N8 4
 
4.0%
Q33 4
 
4.0%
Q18 3
 
3.0%
Q45_N14 3
 
3.0%
Other values (24) 62
62.0%

Length

2023-12-10T19:02:03.314477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
a5 4
 
4.0%
q30 4
 
4.0%
q33 4
 
4.0%
q45_n8 4
 
4.0%
q45_n9 4
 
4.0%
q29 4
 
4.0%
q26 4
 
4.0%
q45_n7 4
 
4.0%
sq2 3
 
3.0%
q45_n16 3
 
3.0%
Other values (24) 62
62.0%

qestn_cn
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)33.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
sido(17개시도)
 
4
문45.구직활동을하지않은이유-(7)학교(정규교육기관)에다니고있어서
 
4
문33.귀하는우리나라청년취업경쟁이현재와비교하여2025년에어떻게변할것이라고생각하십니까?
 
4
문45.구직활동을하지않은이유-(8)입시학원에다니고있어서
 
4
문45.구직활동을하지않은이유-(9)학원에다니지않고진학준비중이어서
 
4
Other values (28)
80 

Length

Max length60
Median length42.5
Mean length32.43
Min length11

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st rowsido(17개시도)
2nd row문45.구직활동을하지않은이유-(7)학교(정규교육기관)에다니고있어서
3rd row문17.대학시절졸업에필요한요건을이수하고도졸업을늦춘경험이있습니까?
4th row문18.대학시절졸업을늦추기위해졸업에필요한요건을이수하지않은경험이있습니까?
5th row문26.귀하는지난주에주로취업준비를하셨습니까?

Common Values

ValueCountFrequency (%)
sido(17개시도) 4
 
4.0%
문45.구직활동을하지않은이유-(7)학교(정규교육기관)에다니고있어서 4
 
4.0%
문33.귀하는우리나라청년취업경쟁이현재와비교하여2025년에어떻게변할것이라고생각하십니까? 4
 
4.0%
문45.구직활동을하지않은이유-(8)입시학원에다니고있어서 4
 
4.0%
문45.구직활동을하지않은이유-(9)학원에다니지않고진학준비중이어서 4
 
4.0%
문29.귀하는현재까지공무원또는공단(공사),교원임용시험,국가전문자격시험준비경험이있습니까? 4
 
4.0%
문26.귀하는지난주에주로취업준비를하셨습니까? 4
 
4.0%
문30.귀하는향후직업훈련을받을계획이있습니까? 4
 
4.0%
문45.구직활동을하지않은이유-(1)일자리가없을것같아서 3
 
3.0%
문45.구직활동을하지않은이유-(12)군입대대기중이어서 3
 
3.0%
Other values (23) 62
62.0%

Length

2023-12-10T19:02:03.669310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
sido(17개시도 4
 
4.0%
문33.귀하는우리나라청년취업경쟁이현재와비교하여2025년에어떻게변할것이라고생각하십니까 4
 
4.0%
문45.구직활동을하지않은이유-(8)입시학원에다니고있어서 4
 
4.0%
문45.구직활동을하지않은이유-(9)학원에다니지않고진학준비중이어서 4
 
4.0%
문29.귀하는현재까지공무원또는공단(공사),교원임용시험,국가전문자격시험준비경험이있습니까 4
 
4.0%
문26.귀하는지난주에주로취업준비를하셨습니까 4
 
4.0%
문30.귀하는향후직업훈련을받을계획이있습니까 4
 
4.0%
문45.구직활동을하지않은이유-(7)학교(정규교육기관)에다니고있어서 4
 
4.0%
문45.구직활동을하지않은이유-(4)시간적여유를즐기기위해서 3
 
3.0%
문45.구직활동을하지않은이유-(17)기타 3
 
3.0%
Other values (23) 62
62.0%

answer_value
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.29
Minimum0
Maximum35
Zeros1
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:02:04.026853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median2
Q32
95-th percentile5.1
Maximum35
Range35
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6.15998
Coefficient of variation (CV)1.8723343
Kurtosis20.774864
Mean3.29
Median Absolute Deviation (MAD)0
Skewness4.6763246
Sum329
Variance37.945354
MonotonicityNot monotonic
2023-12-10T19:02:04.274781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2 71
71.0%
1 12
 
12.0%
3 9
 
9.0%
35 2
 
2.0%
31 2
 
2.0%
4 1
 
1.0%
5 1
 
1.0%
0 1
 
1.0%
7 1
 
1.0%
ValueCountFrequency (%)
0 1
 
1.0%
1 12
 
12.0%
2 71
71.0%
3 9
 
9.0%
4 1
 
1.0%
5 1
 
1.0%
7 1
 
1.0%
31 2
 
2.0%
35 2
 
2.0%
ValueCountFrequency (%)
35 2
 
2.0%
31 2
 
2.0%
7 1
 
1.0%
5 1
 
1.0%
4 1
 
1.0%
3 9
 
9.0%
2 71
71.0%
1 12
 
12.0%
0 1
 
1.0%

answer_cn
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
아니오
62 
없다
 
4
경쟁이심해질것이다
 
3
전라북도
 
2
Other values (16)
21 

Length

Max length27
Median length3
Mean length4.16
Min length1

Unique

Unique11 ?
Unique (%)11.0%

Sample

1st row전라북도
2nd row아니오
3rd row아니오
4th row아니오
5th row아니오

Common Values

ValueCountFrequency (%)
아니오 62
62.0%
8
 
8.0%
없다 4
 
4.0%
경쟁이심해질것이다 3
 
3.0%
전라북도 2
 
2.0%
<NA> 2
 
2.0%
여자 2
 
2.0%
경기도 2
 
2.0%
일을하지않았음 2
 
2.0%
창업을생각해본적이없다 2
 
2.0%
Other values (11) 11
 
11.0%

Length

2023-12-10T19:02:04.618566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
아니오 62
62.0%
8
 
8.0%
없다 4
 
4.0%
경쟁이심해질것이다 3
 
3.0%
전라북도 2
 
2.0%
na 2
 
2.0%
여자 2
 
2.0%
경기도 2
 
2.0%
일을하지않았음 2
 
2.0%
창업을생각해본적이없다 2
 
2.0%
Other values (11) 11
 
11.0%

Interactions

2023-12-10T19:02:01.288298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:02:04.790410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
seq_noqestn_idqestn_cnanswer_valueanswer_cn
seq_no1.0000.0000.0000.3060.000
qestn_id0.0001.0001.0000.7790.944
qestn_cn0.0001.0001.0000.7830.944
answer_value0.3060.7790.7831.0001.000
answer_cn0.0000.9440.9441.0001.000
2023-12-10T19:02:05.042329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
qestn_cnqestn_idseq_noanswer_cn
qestn_cn1.0000.9930.0000.543
qestn_id0.9931.0000.0000.543
seq_no0.0000.0001.0000.000
answer_cn0.5430.5430.0001.000
2023-12-10T19:02:05.231180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
answer_valueseq_noqestn_idqestn_cnanswer_cn
answer_value1.0000.0000.5150.5250.911
seq_no0.0001.0000.0000.0000.000
qestn_id0.5150.0001.0000.9930.543
qestn_cn0.5250.0000.9931.0000.543
answer_cn0.9110.0000.5430.5431.000

Missing values

2023-12-10T19:02:01.524078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:02:01.832021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

stats_yearseq_noqestn_idqestn_cnanswer_valueanswer_cn
020201A5sido(17개시도)35전라북도
120203530Q45_N7문45.구직활동을하지않은이유-(7)학교(정규교육기관)에다니고있어서2아니오
220201Q17문17.대학시절졸업에필요한요건을이수하고도졸업을늦춘경험이있습니까?2아니오
320201Q18문18.대학시절졸업을늦추기위해졸업에필요한요건을이수하지않은경험이있습니까?2아니오
420201Q26문26.귀하는지난주에주로취업준비를하셨습니까?2아니오
520201Q29문29.귀하는현재까지공무원또는공단(공사),교원임용시험,국가전문자격시험준비경험이있습니까?2없다
620201Q30문30.귀하는향후직업훈련을받을계획이있습니까?1
720203530Q45_N8문45.구직활동을하지않은이유-(8)입시학원에다니고있어서2아니오
820201Q33문33.귀하는우리나라청년취업경쟁이현재와비교하여2025년에어떻게변할것이라고생각하십니까?2경쟁이유지될것이다
920201Q39문39.귀하는중소기업에취업하실의향이있습니까?1
stats_yearseq_noqestn_idqestn_cnanswer_valueanswer_cn
9020203Q46_N5문46.귀하가학교를다니지않고취업도하지않은상태가얼마나지속되었습니까?응답해주십시오-1년~2년미만0<NA>
9120204A5sido(17개시도)31경기도
9220204SQ2SQ2.귀하의성별은무엇입니까?2여자
9320204Q17문17.대학시절졸업에필요한요건을이수하고도졸업을늦춘경험이있습니까?2아니오
9420204Q18문18.대학시절졸업을늦추기위해졸업에필요한요건을이수하지않은경험이있습니까?2아니오
9520204Q26문26.귀하는지난주에주로취업준비를하셨습니까?2아니오
9620204Q29문29.귀하는현재까지공무원또는공단(공사),교원임용시험,국가전문자격시험준비경험이있습니까?2없다
9720204Q30문30.귀하는향후직업훈련을받을계획이있습니까?2아니오
9820204Q31문31.귀하가가장선호하는일자리유형은무엇입니까?7직무가적성에맞는회사
9920204Q33문33.귀하는우리나라청년취업경쟁이현재와비교하여2025년에어떻게변할것이라고생각하십니까?3경쟁이심해질것이다