Overview

Dataset statistics

Number of variables: 4
Number of observations: 30
Missing cells: 30
Missing cells (%): 25.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 38.4 B
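
The missing-cell percentage follows from the counts above: 4 variables times 30 observations gives 120 cells, of which 30 are missing. A minimal Python check of that arithmetic, where the variable names are illustrative and not taken from the report:

```python
# Figures copied from the Dataset statistics above.
n_variables = 4
n_observations = 30
missing_cells = 30

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.1f}%)")
```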

Variable types

Numeric: 1
Boolean: 1
Unsupported: 1
Categorical: 1

Dataset

Description: Sample data
Author: 경기도일자리재단
URL: https://www.bigdata-region.kr/#/dataset/754d03f7-1a59-4b53-a4e6-04a5e0640ab8

Alerts

육아기근로시간단축자여부 has constant value "" (Constant)
육아기근로시간단축전근로시수 has 30 (100.0%) missing values (Missing)
청년시리즈신청번호 has unique values (Unique)
육아기근로시간단축전근로시수 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2023-12-10 13:50:59.993636
Analysis finished: 2023-12-10 13:51:00.656976
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

청년시리즈신청번호
Real number (ℝ)

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68.966667
Minimum: 43
Maximum: 95
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 43
5th percentile: 44.45
Q1: 55.25
Median: 68.5
Q3: 84
95th percentile: 93.1
Maximum: 95
Range: 52
Interquartile range (IQR): 28.75
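
The range and interquartile range are derived from the other quantiles in this list: range is maximum minus minimum, and IQR is Q3 minus Q1. A short sketch that reproduces both from the reported values (the variable names are assumptions made for illustration):

```python
# Quantiles copied from the report for 청년시리즈신청번호.
minimum = 43
q1 = 55.25
q3 = 84
maximum = 95

value_range = maximum - minimum  # spread of the whole column
iqr = q3 - q1                    # spread of the middle 50% of values

print(f"Range: {value_range}, IQR: {iqr}")
```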

Descriptive statistics

Standard deviation: 16.740377
Coefficient of variation (CV): 0.24273142
Kurtosis: -1.3782704
Mean: 68.966667
Median Absolute Deviation (MAD): 14
Skewness: 0.013054939
Sum: 2069
Variance: 280.24023
Monotonicity: Strictly increasing
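
Several of these figures are related: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from the reported values only. Because the report rounds its inputs, the results agree up to rounding rather than exactly.

```python
import math

# Values copied from the report; n is the number of observations.
n = 30
total = 2069
std = 16.740377

mean = total / n      # reported as 68.966667
cv = std / mean       # reported as 0.24273142
variance = std ** 2   # reported as 280.24023

print(f"mean={mean:.6f} cv={cv:.8f} variance={variance:.5f}")

# Compare against the reported figures, allowing for rounding of the inputs.
consistent = (
    math.isclose(mean, 68.966667, abs_tol=1e-6)
    and math.isclose(cv, 0.24273142, abs_tol=1e-6)
    and math.isclose(variance, 280.24023, abs_tol=1e-3)
)
```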
Histogram with fixed size bins (bins=30)

Common values

Value              Count  Frequency (%)
43                 1      3.3%
75                 1      3.3%
95                 1      3.3%
94                 1      3.3%
92                 1      3.3%
91                 1      3.3%
89                 1      3.3%
87                 1      3.3%
86                 1      3.3%
85                 1      3.3%
Other values (20)  20     66.7%

Minimum 10 values

Value  Count  Frequency (%)
43     1      3.3%
44     1      3.3%
45     1      3.3%
47     1      3.3%
50     1      3.3%
51     1      3.3%
54     1      3.3%
55     1      3.3%
56     1      3.3%
57     1      3.3%

Maximum 10 values

Value  Count  Frequency (%)
95     1      3.3%
94     1      3.3%
92     1      3.3%
91     1      3.3%
89     1      3.3%
87     1      3.3%
86     1      3.3%
85     1      3.3%
81     1      3.3%
79     1      3.3%

육아기근로시간단축자여부
Boolean

Distinct: 1
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 162.0 B

Value  Count  Frequency (%)
False  30     100.0%

육아기근로시간단축전근로시수
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 30
Missing (%): 100.0%
Memory size: 402.0 B

데이터기준일자
Categorical

Distinct: 9
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 3
Unique (%): 10.0%

Sample

1st row: 2018-01-26
2nd row: 2018-01-25
3rd row: 2018-01-23
4th row: 2018-01-27
5th row: 2018-01-22

Common Values

Value       Count  Frequency (%)
2018-01-22  15     50.0%
2018-01-26  4      13.3%
2018-01-23  2      6.7%
2018-01-27  2      6.7%
2018-01-29  2      6.7%
2018-01-31  2      6.7%
2018-01-25  1      3.3%
2018-01-30  1      3.3%
2018-01-24  1      3.3%
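
The Distinct, Unique, and Frequency (%) figures for this variable can all be derived from the value counts: Distinct is the number of different values, Unique is the number of values that occur exactly once, and each frequency is the count divided by the 30 observations. A minimal sketch using the counts listed above (the dictionary and variable names are illustrative assumptions):

```python
# Value counts copied from the Common Values table.
counts = {
    "2018-01-22": 15,
    "2018-01-26": 4,
    "2018-01-23": 2,
    "2018-01-27": 2,
    "2018-01-29": 2,
    "2018-01-31": 2,
    "2018-01-25": 1,
    "2018-01-30": 1,
    "2018-01-24": 1,
}

n = sum(counts.values())                            # total observations
distinct = len(counts)                              # different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

# Frequency of each value as a percentage, rounded to one decimal place.
frequency_pct = {value: round(100 * c / n, 1) for value, c in counts.items()}

print(f"n={n} distinct={distinct} unique={unique}")
for value, pct in frequency_pct.items():
    print(f"{value}: {pct}%")
```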

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot of the common values; the counts are the same as in the Common Values table above.]

Interactions


Correlations

                    청년시리즈신청번호  데이터기준일자
청년시리즈신청번호  1.000               0.000
데이터기준일자      0.000               1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    청년시리즈신청번호  육아기근로시간단축자여부  육아기근로시간단축전근로시수  데이터기준일자
0   43  n  <NA>  2018-01-26
1   44  n  <NA>  2018-01-25
2   45  n  <NA>  2018-01-23
3   47  n  <NA>  2018-01-27
4   50  n  <NA>  2018-01-22
5   51  n  <NA>  2018-01-29
6   54  n  <NA>  2018-01-27
7   55  n  <NA>  2018-01-22
8   56  n  <NA>  2018-01-26
9   57  n  <NA>  2018-01-22

Last rows

    청년시리즈신청번호  육아기근로시간단축자여부  육아기근로시간단축전근로시수  데이터기준일자
20  79  n  <NA>  2018-01-26
21  81  n  <NA>  2018-01-22
22  85  n  <NA>  2018-01-22
23  86  n  <NA>  2018-01-31
24  87  n  <NA>  2018-01-22
25  89  n  <NA>  2018-01-22
26  91  n  <NA>  2018-01-29
27  92  n  <NA>  2018-01-26
28  94  n  <NA>  2018-01-26
29  95  n  <NA>  2018-01-22
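
The report describes 청년시리즈신청번호 as strictly increasing. The sketch below checks that property on the 20 values visible in the sample (rows 0 to 9 and 20 to 29). Rows 10 to 19 are not shown in this report, so this is only a partial check, and the helper `is_strictly_increasing` is a hypothetical name introduced for illustration.

```python
def is_strictly_increasing(values):
    """Return True when every value is greater than the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))


# 청년시리즈신청번호 values from the first and last sample rows.
first_rows = [43, 44, 45, 47, 50, 51, 54, 55, 56, 57]
last_rows = [79, 81, 85, 86, 87, 89, 91, 92, 94, 95]
visible = first_rows + last_rows

print(is_strictly_increasing(visible))
```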