Overview

Dataset statistics

Number of variables8
Number of observations23
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory70.7 B

Variable types

Numeric1
Text1
DateTime5
Boolean1

Dataset

Description샘플 데이터
Author경기도일자리재단
URLhttps://www.bigdata-region.kr/#/dataset/e2c4c3af-c4a8-485d-9f26-09ddcfe3f991

Alerts

삭제여부 is highly imbalanced (57.4%)Imbalance
청년통장설문조사번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:15:57.354536
Analysis finished2023-12-10 14:15:58.294391
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

청년통장설문조사번호
Real number (ℝ)

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.565217
Minimum1
Maximum32
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size339.0 B
2023-12-10T23:15:58.400202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.1
Q16.5
median12
Q317.5
95-th percentile30.9
Maximum32
Range31
Interquartile range (IQR)11

Descriptive statistics

Standard deviation9.4475879
Coefficient of variation (CV)0.6964568
Kurtosis-0.34664672
Mean13.565217
Median Absolute Deviation (MAD)6
Skewness0.75113598
Sum312
Variance89.256917
MonotonicityStrictly increasing
2023-12-10T23:15:58.621702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
1 1
 
4.3%
2 1
 
4.3%
32 1
 
4.3%
31 1
 
4.3%
30 1
 
4.3%
29 1
 
4.3%
19 1
 
4.3%
18 1
 
4.3%
17 1
 
4.3%
16 1
 
4.3%
Other values (13) 13
56.5%
ValueCountFrequency (%)
1 1
4.3%
2 1
4.3%
3 1
4.3%
4 1
4.3%
5 1
4.3%
6 1
4.3%
7 1
4.3%
8 1
4.3%
9 1
4.3%
10 1
4.3%
ValueCountFrequency (%)
32 1
4.3%
31 1
4.3%
30 1
4.3%
29 1
4.3%
19 1
4.3%
18 1
4.3%
17 1
4.3%
16 1
4.3%
15 1
4.3%
14 1
4.3%

제목
Text

Distinct19
Distinct (%)82.6%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-10T23:15:59.433709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length29
Mean length20.913043
Min length4

Characters and Unicode

Total characters481
Distinct characters52
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)69.6%

Sample

1st row청년통장 테스트
2nd row설문조사 기능 테스트 페이지
3rd row2017년 일하는 청년통장 상반기(3기) 참여자 1차 설문조사
4th row2017년 일하는 청년통장 상반기(3기) 참여자 1차 설문조사
5th row2017년 일하는 청년통장 상반기(3기) 참여자 1차 설문조사
ValueCountFrequency (%)
설문조사 18
16.1%
청년통장 16
14.3%
일하는 13
 
11.6%
경기도 9
 
8.0%
사후 6
 
5.4%
만기지급 3
 
2.7%
5기 3
 
2.7%
2017년 3
 
2.7%
1차 3
 
2.7%
참여자 3
 
2.7%
Other values (23) 35
31.2%
2023-12-10T23:16:00.164809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
19.1%
36
 
7.5%
30
 
6.2%
23
 
4.8%
21
 
4.4%
21
 
4.4%
21
 
4.4%
16
 
3.3%
16
 
3.3%
16
 
3.3%
Other values (42) 189
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 331
68.8%
Space Separator 92
 
19.1%
Decimal Number 48
 
10.0%
Lowercase Letter 4
 
0.8%
Open Punctuation 3
 
0.6%
Close Punctuation 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
10.9%
30
 
9.1%
23
 
6.9%
21
 
6.3%
21
 
6.3%
21
 
6.3%
16
 
4.8%
16
 
4.8%
16
 
4.8%
14
 
4.2%
Other values (27) 117
35.3%
Decimal Number
ValueCountFrequency (%)
1 11
22.9%
2 10
20.8%
0 9
18.8%
3 5
10.4%
5 3
 
6.2%
4 3
 
6.2%
7 3
 
6.2%
9 2
 
4.2%
8 2
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
t 2
50.0%
s 1
25.0%
e 1
25.0%
Space Separator
ValueCountFrequency (%)
92
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 331
68.8%
Common 146
30.4%
Latin 4
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
10.9%
30
 
9.1%
23
 
6.9%
21
 
6.3%
21
 
6.3%
21
 
6.3%
16
 
4.8%
16
 
4.8%
16
 
4.8%
14
 
4.2%
Other values (27) 117
35.3%
Common
ValueCountFrequency (%)
92
63.0%
1 11
 
7.5%
2 10
 
6.8%
0 9
 
6.2%
3 5
 
3.4%
5 3
 
2.1%
4 3
 
2.1%
7 3
 
2.1%
( 3
 
2.1%
) 3
 
2.1%
Other values (2) 4
 
2.7%
Latin
ValueCountFrequency (%)
t 2
50.0%
s 1
25.0%
e 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 331
68.8%
ASCII 150
31.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92
61.3%
1 11
 
7.3%
2 10
 
6.7%
0 9
 
6.0%
3 5
 
3.3%
5 3
 
2.0%
4 3
 
2.0%
7 3
 
2.0%
( 3
 
2.0%
) 3
 
2.0%
Other values (5) 8
 
5.3%
Hangul
ValueCountFrequency (%)
36
 
10.9%
30
 
9.1%
23
 
6.9%
21
 
6.3%
21
 
6.3%
21
 
6.3%
16
 
4.8%
16
 
4.8%
16
 
4.8%
14
 
4.2%
Other values (27) 117
35.3%
Distinct17
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Memory size316.0 B
Minimum2017-07-01 00:00:00
Maximum2021-10-06 00:00:00
2023-12-10T23:16:00.480772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:16:00.843329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
Distinct18
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Memory size316.0 B
Minimum2017-07-01 00:00:00
Maximum2021-11-02 00:00:00
2023-12-10T23:16:01.031065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:16:01.235390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
Distinct20
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
Minimum2017-07-21 15:32:00
Maximum2021-10-07 17:47:00
2023-12-10T23:16:01.443238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:16:01.717582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
Distinct20
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
Minimum2017-08-03 13:43:00
Maximum2021-10-07 17:47:00
2023-12-10T23:16:02.054912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:16:02.287084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)

삭제여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size155.0 B
False
21 
True
 
2
ValueCountFrequency (%)
False 21
91.3%
True 2
 
8.7%
2023-12-10T23:16:02.483513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct17
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Memory size316.0 B
Minimum2017-07-21 00:00:00
Maximum2021-10-07 00:00:00
2023-12-10T23:16:02.675420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:16:02.959069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)

Interactions

2023-12-10T23:15:57.739011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:16:03.183327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
청년통장설문조사번호제목시작일자종료일자등록일시수정일시삭제여부데이터기준일자
청년통장설문조사번호1.0000.9560.8340.8301.0000.9350.1030.834
제목0.9561.0000.9380.9040.9310.8130.0000.938
시작일자0.8340.9381.0001.0001.0001.0000.0001.000
종료일자0.8300.9041.0001.0000.9841.0001.0001.000
등록일시1.0000.9311.0000.9841.0000.9970.0001.000
수정일시0.9350.8131.0001.0000.9971.0001.0001.000
삭제여부0.1030.0000.0001.0000.0001.0001.0000.000
데이터기준일자0.8340.9381.0001.0001.0001.0000.0001.000
2023-12-10T23:16:03.447157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
청년통장설문조사번호삭제여부
청년통장설문조사번호1.0000.000
삭제여부0.0001.000

Missing values

2023-12-10T23:15:57.949177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:15:58.201298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

청년통장설문조사번호제목시작일자종료일자등록일시수정일시삭제여부데이터기준일자
01청년통장 테스트2017-07-012017-07-012017-07-21 15:322017-08-03 13:43N2017-07-21
12설문조사 기능 테스트 페이지2017-08-102017-08-102017-08-10 18:582017-08-10 18:58N2017-08-10
232017년 일하는 청년통장 상반기(3기) 참여자 1차 설문조사2017-09-132017-09-222017-09-13 13:412017-09-13 13:41N2017-09-13
342017년 일하는 청년통장 상반기(3기) 참여자 1차 설문조사2017-09-282017-10-092017-09-28 13:382017-09-28 13:38N2017-09-28
452017년 일하는 청년통장 상반기(3기) 참여자 1차 설문조사2017-10-102017-10-152017-10-10 13:252017-10-10 13:25N2017-10-10
562018년 상반기 청년통장 선정자 설문조사2018-05-272018-10-282018-05-28 09:112018-10-29 14:23N2018-05-28
67경기도 일하는 청년통장 3기 사후 설문조사2018-10-292018-11-112018-10-26 10:582018-11-09 16:23N2018-10-26
78경기도 일하는 청년통장 4기 사후 설문조사2018-10-292018-11-112018-10-26 10:592018-11-09 16:23N2018-10-26
89경기도 일하는 청년통장 5기 사후 설문조사2018-10-292018-11-112018-10-26 10:592018-11-09 16:23N2018-10-26
910경기도 일하는 청년통장 4기 사후 설문조사2018-10-292018-11-092018-10-26 10:592018-10-26 11:00Y2018-10-26
청년통장설문조사번호제목시작일자종료일자등록일시수정일시삭제여부데이터기준일자
1314test2019-05-072019-05-072019-05-20 16:282019-05-20 16:29Y2019-05-20
14152019년 경기도 일하는 청년통장 선발자 사전설문조사2019-08-262019-08-272019-08-01 12:152019-08-26 13:46N2019-08-01
1516청년통장 2기 만기 설문조사2019-12-102019-12-302019-12-09 18:022019-12-30 13:25N2019-12-09
1617경기도 일하는 청년통장 3기 만기지급설문조사2020-07-132020-07-292020-05-28 14:292020-07-24 11:28N2020-05-28
17182020년 경기도 일하는 청년통장 선발자 사전설문조사2020-08-262020-09-302020-08-27 19:002020-08-27 19:00N2020-08-27
18194기 만기지급 설문조사2020-10-292020-12-312020-10-29 11:502020-10-29 11:50N2020-10-29
19295기 만기지급 설문조사2021-05-252021-06-292021-05-26 17:522021-05-26 17:52N2021-05-26
20305기 만기지급 설문조사2021-05-252021-06-292021-05-26 18:022021-05-26 18:02N2021-05-26
21319기 약정체결 설문조사2021-06-072021-07-062021-06-08 16:012021-06-08 16:01N2021-06-08
223210기 약정체결 설문조사2021-10-062021-11-022021-10-07 17:472021-10-07 17:47N2021-10-07