Overview

Dataset statistics

Number of variables3
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory882.0 B
Average record size in memory29.4 B

Variable types

Numeric1
Text1
Categorical1

Dataset

Description샘플 데이터
Author경기도일자리재단
URLhttps://www.bigdata-region.kr/#/dataset/b71817cf-04a4-4fd8-afbe-589ed55ec7dc

Alerts

청년시리즈신청번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:18:33.751090
Analysis finished2023-12-10 14:18:34.133310
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

청년시리즈신청번호
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.966667
Minimum43
Maximum95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:18:34.259725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum43
5-th percentile44.45
Q155.25
median68.5
Q384
95-th percentile93.1
Maximum95
Range52
Interquartile range (IQR)28.75

Descriptive statistics

Standard deviation16.740377
Coefficient of variation (CV)0.24273142
Kurtosis-1.3782704
Mean68.966667
Median Absolute Deviation (MAD)14
Skewness0.013054939
Sum2069
Variance280.24023
MonotonicityStrictly increasing
2023-12-10T23:18:34.495110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
43 1
 
3.3%
75 1
 
3.3%
95 1
 
3.3%
94 1
 
3.3%
92 1
 
3.3%
91 1
 
3.3%
89 1
 
3.3%
87 1
 
3.3%
86 1
 
3.3%
85 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
43 1
3.3%
44 1
3.3%
45 1
3.3%
47 1
3.3%
50 1
3.3%
51 1
3.3%
54 1
3.3%
55 1
3.3%
56 1
3.3%
57 1
3.3%
ValueCountFrequency (%)
95 1
3.3%
94 1
3.3%
92 1
3.3%
91 1
3.3%
89 1
3.3%
87 1
3.3%
86 1
3.3%
85 1
3.3%
81 1
3.3%
79 1
3.3%
Distinct16
Distinct (%)53.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:18:34.673018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length7.0666667
Min length7

Characters and Unicode

Total characters212
Distinct characters28
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)23.3%

Sample

1st row경기도 포천시
2nd row경기도 화성시
3rd row경기도 안성시
4th row경기도 화성시
5th row경기도 성남시
ValueCountFrequency (%)
경기도 30
50.0%
화성시 5
 
8.3%
성남시 3
 
5.0%
안양시 3
 
5.0%
수원시 2
 
3.3%
안산시 2
 
3.3%
군포시 2
 
3.3%
의정부시 2
 
3.3%
부천시 2
 
3.3%
포천시 2
 
3.3%
Other values (7) 7
 
11.7%
2023-12-10T23:18:35.100486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
14.6%
30
14.2%
30
14.2%
30
14.2%
30
14.2%
9
 
4.2%
6
 
2.8%
5
 
2.4%
5
 
2.4%
4
 
1.9%
Other values (18) 32
15.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 182
85.8%
Space Separator 30
 
14.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
17.0%
30
16.5%
30
16.5%
30
16.5%
9
 
4.9%
6
 
3.3%
5
 
2.7%
5
 
2.7%
4
 
2.2%
4
 
2.2%
Other values (17) 28
15.4%
Space Separator
ValueCountFrequency (%)
30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 182
85.8%
Common 30
 
14.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
17.0%
30
16.5%
30
16.5%
30
16.5%
9
 
4.9%
6
 
3.3%
5
 
2.7%
5
 
2.7%
4
 
2.2%
4
 
2.2%
Other values (17) 28
15.4%
Common
ValueCountFrequency (%)
30
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 182
85.8%
ASCII 30
 
14.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
17.0%
30
16.5%
30
16.5%
30
16.5%
9
 
4.9%
6
 
3.3%
5
 
2.7%
5
 
2.7%
4
 
2.2%
4
 
2.2%
Other values (17) 28
15.4%
ASCII
ValueCountFrequency (%)
30
100.0%
Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2018-01-22
15 
2018-01-26
2018-01-23
2018-01-27
2018-01-29
Other values (4)

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique3 ?
Unique (%)10.0%

Sample

1st row2018-01-26
2nd row2018-01-25
3rd row2018-01-23
4th row2018-01-27
5th row2018-01-22

Common Values

ValueCountFrequency (%)
2018-01-22 15
50.0%
2018-01-26 4
 
13.3%
2018-01-23 2
 
6.7%
2018-01-27 2
 
6.7%
2018-01-29 2
 
6.7%
2018-01-31 2
 
6.7%
2018-01-25 1
 
3.3%
2018-01-30 1
 
3.3%
2018-01-24 1
 
3.3%

Length

2023-12-10T23:18:35.256486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:18:35.406405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018-01-22 15
50.0%
2018-01-26 4
 
13.3%
2018-01-23 2
 
6.7%
2018-01-27 2
 
6.7%
2018-01-29 2
 
6.7%
2018-01-31 2
 
6.7%
2018-01-25 1
 
3.3%
2018-01-30 1
 
3.3%
2018-01-24 1
 
3.3%

Interactions

2023-12-10T23:18:33.871582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:18:35.516496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
청년시리즈신청번호시군주소데이터기준일자
청년시리즈신청번호1.0000.0000.000
시군주소0.0001.0000.642
데이터기준일자0.0000.6421.000
2023-12-10T23:18:35.609116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
청년시리즈신청번호데이터기준일자
청년시리즈신청번호1.0000.000
데이터기준일자0.0001.000

Missing values

2023-12-10T23:18:33.990826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:18:34.088301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

청년시리즈신청번호시군주소데이터기준일자
043경기도 포천시2018-01-26
144경기도 화성시2018-01-25
245경기도 안성시2018-01-23
347경기도 화성시2018-01-27
450경기도 성남시2018-01-22
551경기도 부천시2018-01-29
654경기도 의정부시2018-01-27
755경기도 수원시2018-01-22
856경기도 포천시2018-01-26
957경기도 광주시2018-01-22
청년시리즈신청번호시군주소데이터기준일자
2079경기도 이천시2018-01-26
2181경기도 안양시2018-01-22
2285경기도 안양시2018-01-22
2386경기도 성남시2018-01-31
2487경기도 안산시2018-01-22
2589경기도 수원시2018-01-22
2691경기도 군포시2018-01-29
2792경기도 성남시2018-01-22
2894경기도 부천시2018-01-26
2995경기도 고양시2018-01-22