Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 30 |
Missing cells | 2 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.7 KiB |
Average record size in memory | 93.4 B |
Variable types
Text | 2 |
---|---|
Numeric | 1 |
Categorical | 5 |
DateTime | 3 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 경기도일자리재단 |
URL | https://www.bigdata-region.kr/#/dataset/93e07762-c6f5-4fcd-8573-e114073160dc |
회원지번주소 has constant value "" | Constant |
회원유입구분명 has constant value "" | Constant |
회원생일일자 is highly overall correlated with 회원우편번호 and 2 other fields | High correlation |
회원성별코드 is highly overall correlated with 회원생일일자 | High correlation |
회원취업상태명 is highly overall correlated with 회원생일일자 | High correlation |
회원우편번호 is highly overall correlated with 회원생일일자 | High correlation |
회원생일일자 is highly imbalanced (64.7%) | Imbalance |
회원우편번호 has 2 (6.7%) missing values | Missing |
이용자관심범주번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:50:50.008658 |
---|---|
Analysis finished | 2023-12-10 13:50:51.258443 |
Duration | 1.25 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
이용자관심범주번호
Text
UNIQUE
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 26 |
---|---|
Median length | 26 |
Mean length | 26 |
Min length | 26 |
Characters and Unicode
Total characters | 780 |
---|---|
Distinct characters | 14 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 30 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 20170402213531001_CMMN_202 |
---|---|
2nd row | 20170402214034001_CMMN_181 |
3rd row | 20170410084200001_CMMN_183 |
4th row | 20170410084359001_CMMN_192 |
5th row | 20170410084506001_CMMN_181 |
Value | Count | Frequency (%) |
20170402213531001_cmmn_202 | 1 | 3.3% |
20170402214034001_cmmn_181 | 1 | 3.3% |
20170410095147002_cmmn_181 | 1 | 3.3% |
20170410095105001_cmmn_193 | 1 | 3.3% |
20170410094735001_cmmn_192 | 1 | 3.3% |
20170410094735001_cmmn_181 | 1 | 3.3% |
20170410094626001_cmmn_200 | 1 | 3.3% |
20170410094626001_cmmn_198 | 1 | 3.3% |
20170410094626001_cmmn_193 | 1 | 3.3% |
20170410093922001_cmmn_193 | 1 | 3.3% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 197 | |
1 | 139 | |
_ | 60 | 7.7% |
M | 60 | 7.7% |
2 | 55 | 7.1% |
4 | 54 | 6.9% |
7 | 43 | 5.5% |
9 | 40 | 5.1% |
C | 30 | 3.8% |
N | 30 | 3.8% |
Other values (4) | 72 | 9.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 600 | |
Uppercase Letter | 120 | 15.4% |
Connector Punctuation | 60 | 7.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 197 | |
1 | 139 | |
2 | 55 | 9.2% |
4 | 54 | 9.0% |
7 | 43 | 7.2% |
9 | 40 | 6.7% |
8 | 29 | 4.8% |
3 | 18 | 3.0% |
5 | 15 | 2.5% |
6 | 10 | 1.7% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 60 | |
C | 30 | |
N | 30 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 60 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 660 | |
Latin | 120 | 15.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 197 | |
1 | 139 | |
_ | 60 | 9.1% |
2 | 55 | 8.3% |
4 | 54 | 8.2% |
7 | 43 | 6.5% |
9 | 40 | 6.1% |
8 | 29 | 4.4% |
3 | 18 | 2.7% |
5 | 15 | 2.3% |
Latin
Value | Count | Frequency (%) |
M | 60 | |
C | 30 | |
N | 30 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 780 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 197 | |
1 | 139 | |
_ | 60 | 7.7% |
M | 60 | 7.7% |
2 | 55 | 7.1% |
4 | 54 | 6.9% |
7 | 43 | 5.5% |
9 | 40 | 5.1% |
C | 30 | 3.8% |
N | 30 | 3.8% |
Other values (4) | 72 | 9.2% |
회원우편번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 19 |
---|---|
Distinct (%) | 67.9% |
Missing | 2 |
Missing (%) | 6.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15026.25 |
Minimum | 11479 |
---|---|
Maximum | 18132 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 402.0 B |
Quantile statistics
Minimum | 11479 |
---|---|
5-th percentile | 12736 |
Q1 | 13421 |
median | 15222 |
Q3 | 16606 |
95-th percentile | 17245.35 |
Maximum | 18132 |
Range | 6653 |
Interquartile range (IQR) | 3185 |
Descriptive statistics
Standard deviation | 1708.7259 |
---|---|
Coefficient of variation (CV) | 0.11371606 |
Kurtosis | -0.79921135 |
Mean | 15026.25 |
Median Absolute Deviation (MAD) | 1499 |
Skewness | -0.17651668 |
Sum | 420735 |
Variance | 2919744.3 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12736 | 3 | 10.0% |
13421 | 3 | 10.0% |
15251 | 3 | 10.0% |
16606 | 2 | 6.7% |
14568 | 2 | 6.7% |
15222 | 2 | 6.7% |
17051 | 1 | 3.3% |
17350 | 1 | 3.3% |
14614 | 1 | 3.3% |
15041 | 1 | 3.3% |
Other values (9) | 9 | |
(Missing) | 2 | 6.7% |
Value | Count | Frequency (%) |
11479 | 1 | 3.3% |
12736 | 3 | |
12915 | 1 | 3.3% |
13421 | 3 | |
14285 | 1 | 3.3% |
14568 | 2 | |
14614 | 1 | 3.3% |
15041 | 1 | 3.3% |
15222 | 2 | |
15251 | 3 |
Value | Count | Frequency (%) |
18132 | 1 | 3.3% |
17350 | 1 | 3.3% |
17051 | 1 | 3.3% |
16988 | 1 | 3.3% |
16873 | 1 | 3.3% |
16836 | 1 | 3.3% |
16606 | 2 | |
16275 | 1 | 3.3% |
15880 | 1 | 3.3% |
15251 | 3 |
회원지번주소
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
30 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
No values found. |
회원생일일자
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
19** | |
---|---|
<NA> | 2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 19** |
4th row | 19** |
5th row | 19** |
Common Values
Value | Count | Frequency (%) |
19** | 28 | |
<NA> | 2 | 6.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
19 | 28 | |
na | 2 | 6.7% |
회원성별코드
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
F | |
---|---|
M | |
<NA> | 2 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.2 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | F |
4th row | F |
5th row | F |
Common Values
Value | Count | Frequency (%) |
F | 20 | |
M | 8 | 26.7% |
<NA> | 2 | 6.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
f | 20 | |
m | 8 | 26.7% |
na | 2 | 6.7% |
회원유입구분명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
청년통장 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 청년통장 |
---|---|
2nd row | 청년통장 |
3rd row | 청년통장 |
4th row | 청년통장 |
5th row | 청년통장 |
Common Values
Value | Count | Frequency (%) |
청년통장 | 30 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
청년통장 | 30 |
회원취업상태명
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
재직중 | |
---|---|
<NA> | |
취업준비중 | 2 |
기타 | 1 |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.3% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 재직중 |
4th row | 재직중 |
5th row | 재직중 |
Common Values
Value | Count | Frequency (%) |
재직중 | 24 | |
<NA> | 3 | 10.0% |
취업준비중 | 2 | 6.7% |
기타 | 1 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
재직중 | 24 | |
na | 3 | 10.0% |
취업준비중 | 2 | 6.7% |
기타 | 1 | 3.3% |
관심범주명
Text
Distinct | 22 |
---|---|
Distinct (%) | 73.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Value | Count | Frequency (%) |
빅데이터 | 6 | |
행정 | 2 | 6.7% |
반려동물 | 2 | 6.7% |
세무 | 2 | 6.7% |
실내디자인 | 1 | 3.3% |
기후변화 | 1 | 3.3% |
디자인 | 1 | 3.3% |
엑셀 | 1 | 3.3% |
핸드메이드 | 1 | 3.3% |
조리·제빵·바리스타 | 1 | 3.3% |
Other values (12) | 12 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 7 | 6.1% |
빅 | 6 | 5.2% |
터 | 6 | 5.2% |
데 | 6 | 5.2% |
자 | 5 | 4.3% |
디 | 4 | 3.5% |
인 | 4 | 3.5% |
리 | 4 | 3.5% |
행 | 3 | 2.6% |
무 | 3 | 2.6% |
Other values (51) | 67 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 110 | |
Uppercase Letter | 3 | 2.6% |
Other Punctuation | 2 | 1.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 7 | 6.4% |
빅 | 6 | 5.5% |
터 | 6 | 5.5% |
데 | 6 | 5.5% |
자 | 5 | 4.5% |
디 | 4 | 3.6% |
인 | 4 | 3.6% |
리 | 4 | 3.6% |
행 | 3 | 2.7% |
무 | 3 | 2.7% |
Other values (47) | 62 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1 | |
B | 1 | |
C | 1 |
Other Punctuation
Value | Count | Frequency (%) |
· | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 110 | |
Latin | 3 | 2.6% |
Common | 2 | 1.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 7 | 6.4% |
빅 | 6 | 5.5% |
터 | 6 | 5.5% |
데 | 6 | 5.5% |
자 | 5 | 4.5% |
디 | 4 | 3.6% |
인 | 4 | 3.6% |
리 | 4 | 3.6% |
행 | 3 | 2.7% |
무 | 3 | 2.7% |
Other values (47) | 62 |
Latin
Value | Count | Frequency (%) |
P | 1 | |
B | 1 | |
C | 1 |
Common
Value | Count | Frequency (%) |
· | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 110 | |
ASCII | 3 | 2.6% |
None | 2 | 1.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 7 | 6.4% |
빅 | 6 | 5.5% |
터 | 6 | 5.5% |
데 | 6 | 5.5% |
자 | 5 | 4.5% |
디 | 4 | 3.6% |
인 | 4 | 3.6% |
리 | 4 | 3.6% |
행 | 3 | 2.7% |
무 | 3 | 2.7% |
Other values (47) | 62 |
None
Value | Count | Frequency (%) |
· | 2 |
ASCII
Value | Count | Frequency (%) |
P | 1 | |
B | 1 | |
C | 1 |
등록일시
Date
Distinct | 21 |
---|---|
Distinct (%) | 70.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2018-11-06 12:41:00 |
---|---|
Maximum | 2021-07-11 23:24:00 |
수정일시
Date
Distinct | 21 |
---|---|
Distinct (%) | 70.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2018-11-06 12:41:00 |
---|---|
Maximum | 2021-07-11 23:24:00 |
데이터기준일자
Date
Distinct | 18 |
---|---|
Distinct (%) | 60.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 372.0 B |
Minimum | 2018-11-06 00:00:00 |
---|---|
Maximum | 2021-07-11 00:00:00 |
이용자관심범주번호 | 회원우편번호 | 회원성별코드 | 회원취업상태명 | 관심범주명 | 등록일시 | 수정일시 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|
이용자관심범주번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
회원우편번호 | 1.000 | 1.000 | 0.694 | 0.000 | 0.000 | 1.000 | 1.000 | 0.954 |
회원성별코드 | 1.000 | 0.694 | 1.000 | 0.000 | 0.512 | 1.000 | 1.000 | 0.807 |
회원취업상태명 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 |
관심범주명 | 1.000 | 0.000 | 0.512 | 0.000 | 1.000 | 0.000 | 0.000 | 0.478 |
등록일시 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 |
수정일시 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 |
데이터기준일자 | 1.000 | 0.954 | 0.807 | 1.000 | 0.478 | 1.000 | 1.000 | 1.000 |
회원생일일자 | 회원성별코드 | 회원취업상태명 | |
---|---|---|---|
회원생일일자 | 1.000 | 1.000 | 1.000 |
회원성별코드 | 1.000 | 1.000 | 0.000 |
회원취업상태명 | 1.000 | 0.000 | 1.000 |
회원우편번호 | 회원생일일자 | 회원성별코드 | 회원취업상태명 | |
---|---|---|---|---|
회원우편번호 | 1.000 | 1.000 | 0.431 | 0.000 |
회원생일일자 | 1.000 | 1.000 | 1.000 | 1.000 |
회원성별코드 | 0.431 | 1.000 | 1.000 | 0.000 |
회원취업상태명 | 0.000 | 1.000 | 0.000 | 1.000 |
이용자관심범주번호 | 회원우편번호 | 회원지번주소 | 회원생일일자 | 회원성별코드 | 회원유입구분명 | 회원취업상태명 | 관심범주명 | 등록일시 | 수정일시 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 20170402213531001_CMMN_202 | <NA> | <NA> | <NA> | 청년통장 | <NA> | 기후변화 | 2019-01-28 17:10 | 2019-01-28 17:10 | 2019-01-28 | |
1 | 20170402214034001_CMMN_181 | <NA> | <NA> | <NA> | 청년통장 | <NA> | 빅데이터 | 2019-11-07 15:35 | 2019-11-07 15:35 | 2019-11-07 | |
2 | 20170410084200001_CMMN_183 | 15880 | 19** | F | 청년통장 | 재직중 | 여행 | 2019-01-09 11:56 | 2019-01-09 11:56 | 2019-01-09 | |
3 | 20170410084359001_CMMN_192 | 16988 | 19** | F | 청년통장 | 재직중 | 반려동물 | 2019-02-14 14:33 | 2019-02-14 14:33 | 2019-02-14 | |
4 | 20170410084506001_CMMN_181 | 18132 | 19** | F | 청년통장 | 재직중 | 빅데이터 | 2018-12-10 13:13 | 2018-12-10 13:13 | 2018-12-10 | |
5 | 20170410084610001_CMMN_183 | 12736 | 19** | M | 청년통장 | 재직중 | 관광통역 | 2018-12-10 15:07 | 2018-12-10 15:07 | 2018-12-10 | |
6 | 20170410084610001_CMMN_188 | 12736 | 19** | M | 청년통장 | 재직중 | 수출입 | 2018-12-10 15:07 | 2018-12-10 15:07 | 2018-12-10 | |
7 | 20170410084610001_CMMN_189 | 12736 | 19** | M | 청년통장 | 재직중 | 세무 | 2018-12-10 15:07 | 2018-12-10 15:07 | 2018-12-10 | |
8 | 20170410085107001_CMMN_181 | 14285 | 19** | F | 청년통장 | 재직중 | 빅데이터 | 2018-12-07 21:55 | 2018-12-07 21:55 | 2018-12-07 | |
9 | 20170410090219001_CMMN_184 | 13421 | 19** | F | 청년통장 | 재직중 | 건축 | 2018-12-06 10:42 | 2018-12-06 10:42 | 2018-12-06 |
이용자관심범주번호 | 회원우편번호 | 회원지번주소 | 회원생일일자 | 회원성별코드 | 회원유입구분명 | 회원취업상태명 | 관심범주명 | 등록일시 | 수정일시 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|
20 | 20170410093747001_CMMN_197 | 15041 | 19** | M | 청년통장 | 재직중 | PCB | 2019-07-04 10:32 | 2019-07-04 10:32 | 2019-07-04 | |
21 | 20170410093922001_CMMN_193 | 14614 | 19** | M | 청년통장 | 재직중 | 행정 | 2018-12-04 10:11 | 2018-12-04 10:11 | 2018-12-04 | |
22 | 20170410094626001_CMMN_193 | 15251 | 19** | F | 청년통장 | 재직중 | 세무 | 2018-11-06 12:41 | 2018-11-06 12:41 | 2018-11-06 | |
23 | 20170410094626001_CMMN_198 | 15251 | 19** | F | 청년통장 | 재직중 | 조리·제빵·바리스타 | 2018-11-06 12:41 | 2018-11-06 12:41 | 2018-11-06 | |
24 | 20170410094626001_CMMN_200 | 15251 | 19** | F | 청년통장 | 재직중 | 핸드메이드 | 2018-11-06 12:41 | 2018-11-06 12:41 | 2018-11-06 | |
25 | 20170410094735001_CMMN_181 | 14568 | 19** | F | 청년통장 | 재직중 | 빅데이터 | 2018-12-10 17:28 | 2018-12-10 17:28 | 2018-12-10 | |
26 | 20170410094735001_CMMN_192 | 14568 | 19** | F | 청년통장 | 재직중 | 반려동물 | 2018-12-10 17:28 | 2018-12-10 17:28 | 2018-12-10 | |
27 | 20170410095105001_CMMN_193 | 17350 | 19** | F | 청년통장 | 재직중 | 엑셀 | 2019-12-12 17:35 | 2019-12-12 17:35 | 2019-12-12 | |
28 | 20170410095147002_CMMN_181 | 16606 | 19** | M | 청년통장 | 재직중 | 빅데이터 | 2018-12-14 09:47 | 2018-12-14 09:47 | 2018-12-14 | |
29 | 20170410095147002_CMMN_184 | 16606 | 19** | M | 청년통장 | 재직중 | 건설안전기사 | 2018-12-14 09:47 | 2018-12-14 09:47 | 2018-12-14 |