Overview

Dataset statistics

Number of variables18
Number of observations30
Missing cells14
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory152.4 B

Variable types

Numeric3
Categorical4
Boolean7
DateTime3
Text1

Dataset

Description샘플 데이터
Author경기도일자리재단
URLhttps://www.bigdata-region.kr/#/dataset/054f4576-142d-4e80-ac5b-d423495e4372

Alerts

청년통장명 has constant value ""Constant
청년통장순번 has constant value ""Constant
지번주소 has constant value ""Constant
답변키값 has constant value ""Constant
질문1답변구분명 has constant value ""Constant
질문2답변구분명 has constant value ""Constant
질문3답변구분명 has constant value ""Constant
질문4답변구분명 has constant value ""Constant
질문5답변구분명 has constant value ""Constant
질문6답변구분명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
우편번호 is highly overall correlated with 성별코드High correlation
생일일자 is highly overall correlated with 성별코드High correlation
성별코드 is highly overall correlated with 우편번호 and 1 other fieldsHigh correlation
우편번호 has 7 (23.3%) missing valuesMissing
생일일자 has 7 (23.3%) missing valuesMissing
동의정보번호 has unique valuesUnique
수정일시 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:50:18.955262
Analysis finished2023-12-10 13:50:22.397982
Duration3.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

동의정보번호
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:50:22.491052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q18.25
median15.5
Q322.75
95-th percentile28.55
Maximum30
Range29
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation8.8034084
Coefficient of variation (CV)0.56796183
Kurtosis-1.2
Mean15.5
Median Absolute Deviation (MAD)7.5
Skewness0
Sum465
Variance77.5
MonotonicityStrictly increasing
2023-12-10T22:50:22.701901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 1
 
3.3%
17 1
 
3.3%
30 1
 
3.3%
29 1
 
3.3%
28 1
 
3.3%
27 1
 
3.3%
26 1
 
3.3%
25 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1 1
3.3%
2 1
3.3%
3 1
3.3%
4 1
3.3%
5 1
3.3%
6 1
3.3%
7 1
3.3%
8 1
3.3%
9 1
3.3%
10 1
3.3%
ValueCountFrequency (%)
30 1
3.3%
29 1
3.3%
28 1
3.3%
27 1
3.3%
26 1
3.3%
25 1
3.3%
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%

청년통장명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
경기도 일하는 청년통장 2018년 상반기 모집
30 

Length

Max length25
Median length25
Mean length25
Min length25

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 일하는 청년통장 2018년 상반기 모집
2nd row경기도 일하는 청년통장 2018년 상반기 모집
3rd row경기도 일하는 청년통장 2018년 상반기 모집
4th row경기도 일하는 청년통장 2018년 상반기 모집
5th row경기도 일하는 청년통장 2018년 상반기 모집

Common Values

ValueCountFrequency (%)
경기도 일하는 청년통장 2018년 상반기 모집 30
100.0%

Length

2023-12-10T22:50:22.925636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:50:23.085146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 30
16.7%
일하는 30
16.7%
청년통장 30
16.7%
2018년 30
16.7%
상반기 30
16.7%
모집 30
16.7%

청년통장순번
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2
30 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 30
100.0%

Length

2023-12-10T22:50:23.260294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:50:23.411175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 30
100.0%

우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct18
Distinct (%)78.3%
Missing7
Missing (%)23.3%
Infinite0
Infinite (%)0.0%
Mean14023.13
Minimum10077
Maximum18147
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:50:23.574423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10077
5-th percentile11371.2
Q111692
median14548
Q315802
95-th percentile17511.7
Maximum18147
Range8070
Interquartile range (IQR)4110

Descriptive statistics

Standard deviation2386.7346
Coefficient of variation (CV)0.17019985
Kurtosis-1.2564338
Mean14023.13
Median Absolute Deviation (MAD)2431
Skewness0.18224814
Sum322532
Variance5696502.2
MonotonicityNot monotonic
2023-12-10T22:50:23.784467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
11692 2
 
6.7%
13208 2
 
6.7%
14907 2
 
6.7%
16987 2
 
6.7%
11625 2
 
6.7%
10077 1
 
3.3%
14548 1
 
3.3%
14994 1
 
3.3%
18147 1
 
3.3%
14635 1
 
3.3%
Other values (8) 8
26.7%
(Missing) 7
23.3%
ValueCountFrequency (%)
10077 1
3.3%
11343 1
3.3%
11625 2
6.7%
11637 1
3.3%
11692 2
6.7%
12117 1
3.3%
12172 1
3.3%
13208 2
6.7%
14548 1
3.3%
14635 1
3.3%
ValueCountFrequency (%)
18147 1
3.3%
17570 1
3.3%
16987 2
6.7%
16897 1
3.3%
16610 1
3.3%
14994 1
3.3%
14947 1
3.3%
14907 2
6.7%
14635 1
3.3%
14548 1
3.3%

지번주소
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
30 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
30
100.0%

Length

2023-12-10T22:50:24.024715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:50:24.209267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
No values found.

생일일자
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct10
Distinct (%)43.5%
Missing7
Missing (%)23.3%
Infinite0
Infinite (%)0.0%
Mean1993.1304
Minimum1985
Maximum1998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T22:50:24.346168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1985
5-th percentile1985.4
Q11990.5
median1994
Q31997
95-th percentile1997.9
Maximum1998
Range13
Interquartile range (IQR)6.5

Descriptive statistics

Standard deviation3.8174759
Coefficient of variation (CV)0.0019153167
Kurtosis-0.18477566
Mean1993.1304
Median Absolute Deviation (MAD)3
Skewness-0.64654501
Sum45842
Variance14.573123
MonotonicityNot monotonic
2023-12-10T22:50:24.545541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1997 5
16.7%
1990 3
10.0%
1994 3
10.0%
1991 2
 
6.7%
1998 2
 
6.7%
1985 2
 
6.7%
1993 2
 
6.7%
1995 2
 
6.7%
1992 1
 
3.3%
1989 1
 
3.3%
(Missing) 7
23.3%
ValueCountFrequency (%)
1985 2
 
6.7%
1989 1
 
3.3%
1990 3
10.0%
1991 2
 
6.7%
1992 1
 
3.3%
1993 2
 
6.7%
1994 3
10.0%
1995 2
 
6.7%
1997 5
16.7%
1998 2
 
6.7%
ValueCountFrequency (%)
1998 2
 
6.7%
1997 5
16.7%
1995 2
 
6.7%
1994 3
10.0%
1993 2
 
6.7%
1992 1
 
3.3%
1991 2
 
6.7%
1990 3
10.0%
1989 1
 
3.3%
1985 2
 
6.7%

성별코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
F
12 
M
11 
<NA>

Length

Max length4
Median length1
Mean length1.7
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowM
3rd rowM
4th rowM
5th rowF

Common Values

ValueCountFrequency (%)
F 12
40.0%
M 11
36.7%
<NA> 7
23.3%

Length

2023-12-10T22:50:24.790573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:50:25.022850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 12
40.0%
m 11
36.7%
na 7
23.3%

답변키값
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.155983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

질문1답변구분명
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.277949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

질문2답변구분명
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.393455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

질문3답변구분명
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.518580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

질문4답변구분명
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.653353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

질문5답변구분명
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.766513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

질문6답변구분명
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-10T22:50:25.879588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2018-03-26 00:01:00
Maximum2018-03-26 00:09:00
2023-12-10T22:50:26.023899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:26.210841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2018-03-26 00:01:00
Maximum2018-03-26 00:09:00
2023-12-10T22:50:26.395813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:26.596264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)

수정일시
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T22:50:26.909132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length20
Mean length19.7
Min length19

Characters and Unicode

Total characters591
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row1_20170922103257002
2nd row2_20180325235428001
3rd row3_20170915123728001
4th row4_20180325235428001
5th row5_20180326000131001
ValueCountFrequency (%)
1_20170922103257002 1
 
3.3%
2_20180325235428001 1
 
3.3%
29_20180325232101001 1
 
3.3%
28_20180326000859001 1
 
3.3%
27_20180325233235001 1
 
3.3%
26_20180212161459001 1
 
3.3%
25_20180313015030001 1
 
3.3%
24_20180326000753001 1
 
3.3%
23_20170818200442001 1
 
3.3%
22_20180326000530001 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T22:50:27.763884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 171
28.9%
1 101
17.1%
2 93
15.7%
3 54
 
9.1%
8 36
 
6.1%
5 31
 
5.2%
_ 30
 
5.1%
6 21
 
3.6%
4 20
 
3.4%
7 19
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 561
94.9%
Connector Punctuation 30
 
5.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 171
30.5%
1 101
18.0%
2 93
16.6%
3 54
 
9.6%
8 36
 
6.4%
5 31
 
5.5%
6 21
 
3.7%
4 20
 
3.6%
7 19
 
3.4%
9 15
 
2.7%
Connector Punctuation
ValueCountFrequency (%)
_ 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 591
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 171
28.9%
1 101
17.1%
2 93
15.7%
3 54
 
9.1%
8 36
 
6.1%
5 31
 
5.2%
_ 30
 
5.1%
6 21
 
3.6%
4 20
 
3.4%
7 19
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 591
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 171
28.9%
1 101
17.1%
2 93
15.7%
3 54
 
9.1%
8 36
 
6.1%
5 31
 
5.2%
_ 30
 
5.1%
6 21
 
3.6%
4 20
 
3.4%
7 19
 
3.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
Minimum2018-03-26 00:00:00
Maximum2018-03-26 00:00:00
2023-12-10T22:50:28.179608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:28.484880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-10T22:50:21.093132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:20.027780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:20.524046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:21.313816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:20.191186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:20.685554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:21.480211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:20.369151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:50:20.914492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:50:28.712372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동의정보번호우편번호생일일자성별코드질문7답변구분명등록일시수정일시
동의정보번호1.0000.0000.5370.2790.9110.9111.000
우편번호0.0001.0000.8120.7630.0000.0001.000
생일일자0.5370.8121.0000.9850.4570.4571.000
성별코드0.2790.7630.9851.0000.5090.5091.000
질문7답변구분명0.9110.0000.4570.5091.0001.0001.000
등록일시0.9110.0000.4570.5091.0001.0001.000
수정일시1.0001.0001.0001.0001.0001.0001.000
2023-12-10T22:50:28.932323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동의정보번호우편번호생일일자성별코드
동의정보번호1.000-0.1370.2110.103
우편번호-0.1371.0000.1760.697
생일일자0.2110.1761.0000.785
성별코드0.1030.6970.7851.000

Missing values

2023-12-10T22:50:21.703764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:50:22.078972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-10T22:50:22.304295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

동의정보번호청년통장명청년통장순번우편번호지번주소생일일자성별코드답변키값질문1답변구분명질문2답변구분명질문3답변구분명질문4답변구분명질문5답변구분명질문6답변구분명질문7답변구분명등록일시수정일시데이터기준일자
01경기도 일하는 청년통장 2018년 상반기 모집2116251991MYYYYYYY2018-03-26 00:012018-03-26 00:011_201709221032570022018-03-26
12경기도 일하는 청년통장 2018년 상반기 모집2169871990MYYYYYYY2018-03-26 00:012018-03-26 00:012_201803252354280012018-03-26
23경기도 일하는 청년통장 2018년 상반기 모집2116371990MYYYYYYY2018-03-26 00:012018-03-26 00:013_201709151237280012018-03-26
34경기도 일하는 청년통장 2018년 상반기 모집2169871990MYYYYYYY2018-03-26 00:022018-03-26 00:024_201803252354280012018-03-26
45경기도 일하는 청년통장 2018년 상반기 모집2146351998FYYYYYYY2018-03-26 00:022018-03-26 00:025_201803260001310012018-03-26
56경기도 일하는 청년통장 2018년 상반기 모집2<NA><NA><NA>YYYYYYY2018-03-26 00:022018-03-26 00:026_201803252346160012018-03-26
67경기도 일하는 청년통장 2018년 상반기 모집2181471997FYYYYYYY2018-03-26 00:022018-03-26 00:027_201803260001590012018-03-26
78경기도 일하는 청년통장 2018년 상반기 모집2145481997FYYYYYYY2018-03-26 00:032018-03-26 00:038_201803260002340012018-03-26
89경기도 일하는 청년통장 2018년 상반기 모집2<NA><NA><NA>YYYYYYY2018-03-26 00:032018-03-26 00:039_201803252321010012018-03-26
910경기도 일하는 청년통장 2018년 상반기 모집2100771985MYYYYYYY2018-03-26 00:042018-03-26 00:0410_201803171341490012018-03-26
동의정보번호청년통장명청년통장순번우편번호지번주소생일일자성별코드답변키값질문1답변구분명질문2답변구분명질문3답변구분명질문4답변구분명질문5답변구분명질문6답변구분명질문7답변구분명등록일시수정일시데이터기준일자
2021경기도 일하는 청년통장 2018년 상반기 모집2116921994FYYYYYYY2018-03-26 00:072018-03-26 00:0721_201803230937530012018-03-26
2122경기도 일하는 청년통장 2018년 상반기 모집2149471997FYYYYYYY2018-03-26 00:072018-03-26 00:0722_201803260005300012018-03-26
2223경기도 일하는 청년통장 2018년 상반기 모집2175701992FYYYYYYY2018-03-26 00:072018-03-26 00:0723_201708182004420012018-03-26
2324경기도 일하는 청년통장 2018년 상반기 모집2132081995MYYYYYYY2018-03-26 00:082018-03-26 00:0824_201803260007530012018-03-26
2425경기도 일하는 청년통장 2018년 상반기 모집2121171998FYYYYYYY2018-03-26 00:082018-03-26 00:0825_201803130150300012018-03-26
2526경기도 일하는 청년통장 2018년 상반기 모집2113431989MYYYYYYY2018-03-26 00:092018-03-26 00:0926_201802121614590012018-03-26
2627경기도 일하는 청년통장 2018년 상반기 모집2<NA><NA><NA>YYYYYYY2018-03-26 00:092018-03-26 00:0927_201803252332350012018-03-26
2728경기도 일하는 청년통장 2018년 상반기 모집2<NA><NA><NA>YYYYYYY2018-03-26 00:092018-03-26 00:0928_201803260008590012018-03-26
2829경기도 일하는 청년통장 2018년 상반기 모집2<NA><NA><NA>YYYYYYY2018-03-26 00:092018-03-26 00:0929_201803252321010012018-03-26
2930경기도 일하는 청년통장 2018년 상반기 모집2132081995MYYYYYYY2018-03-26 00:092018-03-26 00:0930_201803260007530012018-03-26