Overview

Dataset statistics

Number of variables: 5
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 45.4 B

Variable types

Numeric: 1
Categorical: 1
Boolean: 1
DateTime: 2

Dataset

Description: Sample data
Author: 경기도일자리재단 (Gyeonggi-do Job Foundation)
URL: https://www.bigdata-region.kr/#/dataset/b3251bf8-0b0a-4c6b-9a91-09c50ec5ab97

Alerts

4대보험가입구분명 has constant value "4대 사회보험 가입자" (Constant)
고용보험가입여부 has constant value "True" (Constant)
청년시리즈신청번호 has unique values (Unique)
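
The two alert kinds listed here can be illustrated with a small sketch. It is not ydata-profiling's own implementation: the helper name `classify_column` is hypothetical, and the input is limited to the 20 rows that the report's Sample section shows (rows 0-9 and 20-29), since the remaining 10 rows are not reproduced in the report.

```python
# Values copied from the 20 rows shown in the report's Sample section
# (rows 0-9 and 20-29); the middle 10 rows are not listed in the report.
application_numbers = [43, 44, 45, 47, 50, 51, 54, 55, 56, 57,
                       79, 81, 85, 86, 87, 89, 91, 92, 94, 95]
insurance_category = ["4대 사회보험 가입자"] * 20
employment_insured = ["Y"] * 20


def classify_column(values):
    """Hypothetical helper: label a column 'Constant', 'Unique', or None."""
    distinct = len(set(values))
    if distinct == 1:
        return "Constant"      # every row holds the same value
    if distinct == len(values):
        return "Unique"        # no value is repeated
    return None


alerts = {
    "청년시리즈신청번호": classify_column(application_numbers),
    "4대보험가입구분명": classify_column(insurance_category),
    "고용보험가입여부": classify_column(employment_insured),
}
print(alerts)
```

On this subset the labels agree with the alerts in the report; a check on the full 30 rows would be needed to confirm them independently.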

Reproduction

Analysis started: 2023-12-10 14:09:24.484809
Analysis finished: 2023-12-10 14:09:25.071568
Duration: 0.59 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
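
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is only a sketch using Python's standard `datetime` module; the variable names are illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction section of the report.
started = datetime.strptime("2023-12-10 14:09:24.484809", FMT)
finished = datetime.strptime("2023-12-10 14:09:25.071568", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

Rounded to two decimals, the difference matches the 0.59 seconds shown in the report.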

Variables

청년시리즈신청번호 (Youth Series application number)
Real number (ℝ)

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68.966667
Minimum: 43
Maximum: 95
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 43
5-th percentile: 44.45
Q1: 55.25
Median: 68.5
Q3: 84
95-th percentile: 93.1
Maximum: 95
Range: 52
Interquartile range (IQR): 28.75
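
Several of these quantiles can be recomputed from the ten smallest and ten largest values that the report lists for this variable. The sketch below assumes linear interpolation between order statistics (the default in pandas and NumPy); that the report was produced this way is an assumption, and the function name `quantile` is illustrative. The median cannot be checked this way because it depends on middle values that the report does not list.

```python
import math

# Ten smallest and ten largest values, as listed in the report.
smallest = [43, 44, 45, 47, 50, 51, 54, 55, 56, 57]
largest = [79, 81, 85, 86, 87, 89, 91, 92, 94, 95]
n = 30  # number of observations stated in the report

# Map sorted position -> value for the positions that are known (0-9 and 20-29).
known = {i: v for i, v in enumerate(smallest)}
known.update({n - len(largest) + i: v for i, v in enumerate(largest)})


def quantile(q):
    """Linear-interpolation quantile; raises KeyError if an unlisted value is needed."""
    pos = q * (n - 1)
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return known[lo] + frac * (known[hi] - known[lo])


q1, q3 = quantile(0.25), quantile(0.75)
iqr = q3 - q1                                # interquartile range
value_range = quantile(1.0) - quantile(0.0)  # maximum minus minimum

print(round(quantile(0.05), 2), q1, q3, round(quantile(0.95), 2), iqr, value_range)
```

Under that assumption the 5th percentile, Q1, Q3, 95th percentile, IQR, and range all agree with the figures above.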

Descriptive statistics

Standard deviation: 16.740377
Coefficient of variation (CV): 0.24273142
Kurtosis: -1.3782704
Mean: 68.966667
Median Absolute Deviation (MAD): 14
Skewness: 0.013054939
Sum: 2069
Variance: 280.24023
Monotonicity: Strictly increasing
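
Three of these figures follow from the others by simple arithmetic: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A minimal check, using only numbers printed in the report (inputs are rounded, so the last digits can differ slightly):

```python
n = 30             # number of observations
total = 2069       # reported sum
std = 16.740377    # reported standard deviation

mean = total / n        # arithmetic mean
cv = std / mean         # coefficient of variation
variance = std ** 2     # variance as the squared standard deviation

print(round(mean, 6), round(cv, 6), round(variance, 3))
```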
[Figure: Histogram with fixed size bins (bins=30)]

Common values

Value              Count  Frequency (%)
43                 1      3.3%
75                 1      3.3%
95                 1      3.3%
94                 1      3.3%
92                 1      3.3%
91                 1      3.3%
89                 1      3.3%
87                 1      3.3%
86                 1      3.3%
85                 1      3.3%
Other values (20)  20     66.7%

Minimum 10 values

Value  Count  Frequency (%)
43     1      3.3%
44     1      3.3%
45     1      3.3%
47     1      3.3%
50     1      3.3%
51     1      3.3%
54     1      3.3%
55     1      3.3%
56     1      3.3%
57     1      3.3%

Maximum 10 values

Value  Count  Frequency (%)
95     1      3.3%
94     1      3.3%
92     1      3.3%
91     1      3.3%
89     1      3.3%
87     1      3.3%
86     1      3.3%
85     1      3.3%
81     1      3.3%
79     1      3.3%

4대보험가입구분명 (four major insurances enrollment category name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

4대 사회보험 가입자 (enrolled in the four major social insurances): 30

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4대 사회보험 가입자
2nd row: 4대 사회보험 가입자
3rd row: 4대 사회보험 가입자
4th row: 4대 사회보험 가입자
5th row: 4대 사회보험 가입자

Common Values

Value                Count  Frequency (%)
4대 사회보험 가입자  30     100.0%

Length

[Figure: Histogram of lengths of the category]

[Figure: Common Values (Plot)]

Words

Value     Count  Frequency (%)
4대       30     33.3%
사회보험  30     33.3%
가입자    30     33.3%
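
The word-level counts above can be reproduced by splitting each category value on whitespace and counting the resulting tokens. The sketch below assumes whitespace tokenization and that the percentage is taken over the total number of words (90), which is consistent with the 33.3% figures; the variable names are illustrative.

```python
from collections import Counter

# The column holds the same value in all 30 rows, per the report.
values = ["4대 사회보험 가입자"] * 30

# Split every value on whitespace and count each word.
word_counts = Counter(word for value in values for word in value.split())

# Share of each word relative to the total number of words.
total_words = sum(word_counts.values())
shares = {word: round(100 * count / total_words, 1)
          for word, count in word_counts.items()}

print(word_counts)
print(shares)
```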

고용보험가입여부 (employment insurance enrollment status)
Boolean

CONSTANT 

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 162.0 B

Value  Count  Frequency (%)
True   30     100.0%

고용보험가입일자 (employment insurance enrollment date)
DateTime

Distinct: 28
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2013-01-01 00:00:00
Maximum: 2017-09-11 00:00:00

[Figure: Histogram with fixed size bins (bins=28)]

데이터기준일자 (data reference date)
DateTime

Distinct: 9
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2018-01-22 00:00:00
Maximum: 2018-01-31 00:00:00

[Figure: Histogram with fixed size bins (bins=9)]
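
The minimum and maximum of the two date columns can be cross-checked against the rows shown in the report's Sample section. The sketch below uses only those 20 rows (rows 0-9 and 20-29), so its distinct counts are for that subset and are not expected to match the 28 and 9 reported for all 30 rows. Variable names are illustrative.

```python
from datetime import date

# (고용보험가입일자, 데이터기준일자) pairs from the 20 rows shown in the Sample section.
sample = [
    ("2016-12-20", "2018-01-26"), ("2013-08-19", "2018-01-25"),
    ("2015-01-05", "2018-01-23"), ("2014-12-18", "2018-01-27"),
    ("2017-08-07", "2018-01-22"), ("2014-08-21", "2018-01-29"),
    ("2014-08-26", "2018-01-27"), ("2017-09-11", "2018-01-22"),
    ("2017-02-01", "2018-01-26"), ("2016-07-04", "2018-01-22"),
    ("2016-02-15", "2018-01-26"), ("2017-05-29", "2018-01-22"),
    ("2013-01-01", "2018-01-22"), ("2017-08-16", "2018-01-31"),
    ("2015-11-02", "2018-01-22"), ("2016-10-10", "2018-01-22"),
    ("2017-02-20", "2018-01-29"), ("2016-11-03", "2018-01-22"),
    ("2017-06-01", "2018-01-26"), ("2015-04-21", "2018-01-22"),
]

# Parse the ISO-formatted strings into date objects.
enrolled = [date.fromisoformat(a) for a, _ in sample]
reference = [date.fromisoformat(b) for _, b in sample]

# (minimum, maximum, distinct count) per column, for the 20-row subset only.
summary = {
    "고용보험가입일자": (min(enrolled), max(enrolled), len(set(enrolled))),
    "데이터기준일자": (min(reference), max(reference), len(set(reference))),
}
for name, stats in summary.items():
    print(name, *stats)
```

The subset already contains the reported minimum and maximum of both columns.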

Interactions


Correlations

                    청년시리즈신청번호  고용보험가입일자  데이터기준일자
청년시리즈신청번호  1.000               0.000             0.000
고용보험가입일자    0.000               1.000             0.921
데이터기준일자      0.000               0.921             1.000

Missing values

[Figure: A simple visualization of nullity by column.]

[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

    청년시리즈신청번호  4대보험가입구분명    고용보험가입여부  고용보험가입일자  데이터기준일자
0   43                  4대 사회보험 가입자  Y                 2016-12-20        2018-01-26
1   44                  4대 사회보험 가입자  Y                 2013-08-19        2018-01-25
2   45                  4대 사회보험 가입자  Y                 2015-01-05        2018-01-23
3   47                  4대 사회보험 가입자  Y                 2014-12-18        2018-01-27
4   50                  4대 사회보험 가입자  Y                 2017-08-07        2018-01-22
5   51                  4대 사회보험 가입자  Y                 2014-08-21        2018-01-29
6   54                  4대 사회보험 가입자  Y                 2014-08-26        2018-01-27
7   55                  4대 사회보험 가입자  Y                 2017-09-11        2018-01-22
8   56                  4대 사회보험 가입자  Y                 2017-02-01        2018-01-26
9   57                  4대 사회보험 가입자  Y                 2016-07-04        2018-01-22

Last rows

    청년시리즈신청번호  4대보험가입구분명    고용보험가입여부  고용보험가입일자  데이터기준일자
20  79                  4대 사회보험 가입자  Y                 2016-02-15        2018-01-26
21  81                  4대 사회보험 가입자  Y                 2017-05-29        2018-01-22
22  85                  4대 사회보험 가입자  Y                 2013-01-01        2018-01-22
23  86                  4대 사회보험 가입자  Y                 2017-08-16        2018-01-31
24  87                  4대 사회보험 가입자  Y                 2015-11-02        2018-01-22
25  89                  4대 사회보험 가입자  Y                 2016-10-10        2018-01-22
26  91                  4대 사회보험 가입자  Y                 2017-02-20        2018-01-29
27  92                  4대 사회보험 가입자  Y                 2016-11-03        2018-01-22
28  94                  4대 사회보험 가입자  Y                 2017-06-01        2018-01-26
29  95                  4대 사회보험 가입자  Y                 2015-04-21        2018-01-22