Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells30
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory57.4 B

Variable types

Numeric4
Unsupported1
Categorical1

Dataset

Description샘플 데이터
Author경기도일자리재단
URLhttps://www.bigdata-region.kr/#/dataset/373ccc90-2dc5-47ab-9eca-697556517d42

Alerts

1개월전건강보험요금 is highly overall correlated with 2개월전건강보험요금 and 1 other fieldsHigh correlation
2개월전건강보험요금 is highly overall correlated with 1개월전건강보험요금 and 1 other fieldsHigh correlation
3달전건강보험요금 is highly overall correlated with 1개월전건강보험요금 and 1 other fieldsHigh correlation
소득정보값 has 30 (100.0%) missing valuesMissing
청년시리즈신청번호 has unique valuesUnique
소득정보값 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 13:44:13.366981
Analysis finished2023-12-10 13:44:17.506067
Duration4.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

청년시리즈신청번호
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65.833333
Minimum42
Maximum91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:44:17.605670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum42
5-th percentile43.45
Q154.25
median63.5
Q377.75
95-th percentile88.1
Maximum91
Range49
Interquartile range (IQR)23.5

Descriptive statistics

Standard deviation15.261137
Coefficient of variation (CV)0.23181474
Kurtosis-1.2765039
Mean65.833333
Median Absolute Deviation (MAD)13
Skewness0.010854192
Sum1975
Variance232.9023
MonotonicityStrictly increasing
2023-12-10T22:44:17.779942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
42 1
 
3.3%
71 1
 
3.3%
91 1
 
3.3%
89 1
 
3.3%
87 1
 
3.3%
86 1
 
3.3%
85 1
 
3.3%
81 1
 
3.3%
79 1
 
3.3%
78 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
42 1
3.3%
43 1
3.3%
44 1
3.3%
45 1
3.3%
47 1
3.3%
50 1
3.3%
51 1
3.3%
54 1
3.3%
55 1
3.3%
56 1
3.3%
ValueCountFrequency (%)
91 1
3.3%
89 1
3.3%
87 1
3.3%
86 1
3.3%
85 1
3.3%
81 1
3.3%
79 1
3.3%
78 1
3.3%
77 1
3.3%
76 1
3.3%

1개월전건강보험요금
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62467
Minimum41310
Maximum75260
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:44:17.973079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41310
5-th percentile49261.5
Q155475
median64115
Q371022.5
95-th percentile73467.5
Maximum75260
Range33950
Interquartile range (IQR)15547.5

Descriptive statistics

Standard deviation9409.2431
Coefficient of variation (CV)0.15062742
Kurtosis-0.91630973
Mean62467
Median Absolute Deviation (MAD)8435
Skewness-0.37785405
Sum1874010
Variance88533856
MonotonicityNot monotonic
2023-12-10T22:44:18.203919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
73440 4
 
13.3%
56090 2
 
6.7%
69140 1
 
3.3%
58590 1
 
3.3%
55270 1
 
3.3%
60170 1
 
3.3%
71650 1
 
3.3%
73490 1
 
3.3%
65390 1
 
3.3%
48960 1
 
3.3%
Other values (16) 16
53.3%
ValueCountFrequency (%)
41310 1
3.3%
48960 1
3.3%
49630 1
3.3%
51000 1
3.3%
51010 1
3.3%
52020 1
3.3%
52720 1
3.3%
55270 1
3.3%
56090 2
6.7%
58590 1
3.3%
ValueCountFrequency (%)
75260 1
 
3.3%
73490 1
 
3.3%
73440 4
13.3%
73120 1
 
3.3%
71650 1
 
3.3%
69140 1
 
3.3%
68540 1
 
3.3%
67320 1
 
3.3%
67060 1
 
3.3%
66950 1
 
3.3%

2개월전건강보험요금
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62467
Minimum41310
Maximum75260
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:44:18.449648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41310
5-th percentile49261.5
Q155475
median64115
Q371022.5
95-th percentile73467.5
Maximum75260
Range33950
Interquartile range (IQR)15547.5

Descriptive statistics

Standard deviation9409.2431
Coefficient of variation (CV)0.15062742
Kurtosis-0.91630973
Mean62467
Median Absolute Deviation (MAD)8435
Skewness-0.37785405
Sum1874010
Variance88533856
MonotonicityNot monotonic
2023-12-10T22:44:18.792970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
73440 4
 
13.3%
56090 2
 
6.7%
69140 1
 
3.3%
58590 1
 
3.3%
55270 1
 
3.3%
60170 1
 
3.3%
71650 1
 
3.3%
73490 1
 
3.3%
65390 1
 
3.3%
48960 1
 
3.3%
Other values (16) 16
53.3%
ValueCountFrequency (%)
41310 1
3.3%
48960 1
3.3%
49630 1
3.3%
51000 1
3.3%
51010 1
3.3%
52020 1
3.3%
52720 1
3.3%
55270 1
3.3%
56090 2
6.7%
58590 1
3.3%
ValueCountFrequency (%)
75260 1
 
3.3%
73490 1
 
3.3%
73440 4
13.3%
73120 1
 
3.3%
71650 1
 
3.3%
69140 1
 
3.3%
68540 1
 
3.3%
67320 1
 
3.3%
67060 1
 
3.3%
66950 1
 
3.3%

3달전건강보험요금
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62467
Minimum41310
Maximum75260
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:44:19.021537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum41310
5-th percentile49261.5
Q155475
median64115
Q371022.5
95-th percentile73467.5
Maximum75260
Range33950
Interquartile range (IQR)15547.5

Descriptive statistics

Standard deviation9409.2431
Coefficient of variation (CV)0.15062742
Kurtosis-0.91630973
Mean62467
Median Absolute Deviation (MAD)8435
Skewness-0.37785405
Sum1874010
Variance88533856
MonotonicityNot monotonic
2023-12-10T22:44:19.255781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
73440 4
 
13.3%
56090 2
 
6.7%
69140 1
 
3.3%
58590 1
 
3.3%
55270 1
 
3.3%
60170 1
 
3.3%
71650 1
 
3.3%
73490 1
 
3.3%
65390 1
 
3.3%
48960 1
 
3.3%
Other values (16) 16
53.3%
ValueCountFrequency (%)
41310 1
3.3%
48960 1
3.3%
49630 1
3.3%
51000 1
3.3%
51010 1
3.3%
52020 1
3.3%
52720 1
3.3%
55270 1
3.3%
56090 2
6.7%
58590 1
3.3%
ValueCountFrequency (%)
75260 1
 
3.3%
73490 1
 
3.3%
73440 4
13.3%
73120 1
 
3.3%
71650 1
 
3.3%
69140 1
 
3.3%
68540 1
 
3.3%
67320 1
 
3.3%
67060 1
 
3.3%
66950 1
 
3.3%

소득정보값
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing30
Missing (%)100.0%
Memory size402.0 B
Distinct11
Distinct (%)36.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2018-01-22
13 
2018-01-26
2018-02-05
2018-01-23
2018-01-27
Other values (6)

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st row2018-02-05
2nd row2018-01-26
3rd row2018-01-25
4th row2018-01-23
5th row2018-01-27

Common Values

ValueCountFrequency (%)
2018-01-22 13
43.3%
2018-01-26 3
 
10.0%
2018-02-05 2
 
6.7%
2018-01-23 2
 
6.7%
2018-01-27 2
 
6.7%
2018-01-29 2
 
6.7%
2018-01-31 2
 
6.7%
2018-01-25 1
 
3.3%
2018-02-01 1
 
3.3%
2018-01-30 1
 
3.3%

Length

2023-12-10T22:44:19.644098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2018-01-22 13
43.3%
2018-01-26 3
 
10.0%
2018-02-05 2
 
6.7%
2018-01-23 2
 
6.7%
2018-01-27 2
 
6.7%
2018-01-29 2
 
6.7%
2018-01-31 2
 
6.7%
2018-01-25 1
 
3.3%
2018-02-01 1
 
3.3%
2018-01-30 1
 
3.3%

Interactions

2023-12-10T22:44:16.301226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:13.736788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:14.748468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:15.575028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:16.763147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:13.985681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:14.994278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:15.762831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:16.906897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:14.247400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:15.253015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:15.902517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:17.068370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:14.507777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:15.434565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:44:16.090400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:44:19.803819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
청년시리즈신청번호1개월전건강보험요금2개월전건강보험요금3달전건강보험요금데이터기준일자
청년시리즈신청번호1.0000.3650.3650.3650.404
1개월전건강보험요금0.3651.0001.0001.0000.684
2개월전건강보험요금0.3651.0001.0001.0000.684
3달전건강보험요금0.3651.0001.0001.0000.684
데이터기준일자0.4040.6840.6840.6841.000
2023-12-10T22:44:19.965623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
청년시리즈신청번호1개월전건강보험요금2개월전건강보험요금3달전건강보험요금데이터기준일자
청년시리즈신청번호1.000-0.131-0.131-0.1310.127
1개월전건강보험요금-0.1311.0001.0001.0000.362
2개월전건강보험요금-0.1311.0001.0001.0000.362
3달전건강보험요금-0.1311.0001.0001.0000.362
데이터기준일자0.1270.3620.3620.3621.000

Missing values

2023-12-10T22:44:17.270409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:44:17.438016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

청년시리즈신청번호1개월전건강보험요금2개월전건강보험요금3달전건강보험요금소득정보값데이터기준일자
042691406914069140<NA>2018-02-05
143520205202052020<NA>2018-01-26
244670606706067060<NA>2018-01-25
345666606666066660<NA>2018-01-23
447734407344073440<NA>2018-01-27
550510005100051000<NA>2018-01-22
651731207312073120<NA>2018-01-29
754752607526075260<NA>2018-01-27
855685406854068540<NA>2018-01-22
956673206732067320<NA>2018-01-26
청년시리즈신청번호1개월전건강보험요금2개월전건강보험요금3달전건강보험요금소득정보값데이터기준일자
2076628406284062840<NA>2018-01-30
2177489604896048960<NA>2018-01-24
2278653906539065390<NA>2018-01-22
2379734907349073490<NA>2018-01-26
2481560905609056090<NA>2018-01-22
2585716507165071650<NA>2018-01-22
2686601706017060170<NA>2018-01-31
2787560905609056090<NA>2018-01-22
2889552705527055270<NA>2018-01-22
2991734407344073440<NA>2018-01-29