Overview

Dataset statistics

Number of variables5
Number of observations96
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory42.4 B

Variable types

Numeric1
Categorical3
Text1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15241/S/1/datasetView.do

Alerts

사용자코드 has constant value ""Constant

Reproduction

Analysis started2024-05-11 08:55:20.141884
Analysis finished2024-05-11 08:55:21.940974
Duration1.8 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

가입일시
Real number (ℝ)

Distinct6
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202209.5
Minimum202207
Maximum202212
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size996.0 B
2024-05-11T08:55:22.117033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum202207
5-th percentile202207
Q1202208
median202209.5
Q3202211
95-th percentile202212
Maximum202212
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7167902
Coefficient of variation (CV)8.4901558 × 10-6
Kurtosis-1.2720071
Mean202209.5
Median Absolute Deviation (MAD)1.5
Skewness0
Sum19412112
Variance2.9473684
MonotonicityIncreasing
2024-05-11T08:55:22.474129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
202207 16
16.7%
202208 16
16.7%
202209 16
16.7%
202210 16
16.7%
202211 16
16.7%
202212 16
16.7%
ValueCountFrequency (%)
202207 16
16.7%
202208 16
16.7%
202209 16
16.7%
202210 16
16.7%
202211 16
16.7%
202212 16
16.7%
ValueCountFrequency (%)
202212 16
16.7%
202211 16
16.7%
202210 16
16.7%
202209 16
16.7%
202208 16
16.7%
202207 16
16.7%

사용자코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size900.0 B
회원-내국인
96 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row회원-내국인
2nd row회원-내국인
3rd row회원-내국인
4th row회원-내국인
5th row회원-내국인

Common Values

ValueCountFrequency (%)
회원-내국인 96
100.0%

Length

2024-05-11T08:55:22.868488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:55:23.138649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
회원-내국인 96
100.0%

연령대코드
Categorical

Distinct8
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size900.0 B
~10대
12 
20대
12 
30대
12 
40대
12 
50대
12 
Other values (3)
36 

Length

Max length5
Median length3
Mean length3.25
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row~10대
2nd row20대
3rd row30대
4th row40대
5th row50대

Common Values

ValueCountFrequency (%)
~10대 12
12.5%
20대 12
12.5%
30대 12
12.5%
40대 12
12.5%
50대 12
12.5%
60대 12
12.5%
70대이상 12
12.5%
기타 12
12.5%

Length

2024-05-11T08:55:23.708795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:55:24.199816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10대 12
12.5%
20대 12
12.5%
30대 12
12.5%
40대 12
12.5%
50대 12
12.5%
60대 12
12.5%
70대이상 12
12.5%
기타 12
12.5%

성별
Categorical

Distinct2
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size900.0 B
F
48 
M
48 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowF
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
F 48
50.0%
M 48
50.0%

Length

2024-05-11T08:55:24.912785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:55:25.330996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 48
50.0%
m 48
50.0%
Distinct95
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size900.0 B
2024-05-11T08:55:26.055945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.15625
Min length1

Characters and Unicode

Total characters399
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)97.9%

Sample

1st row3,880
2nd row10,470
3rd row4,428
4th row2,965
5th row1,713
ValueCountFrequency (%)
70 2
 
2.1%
32 1
 
1.0%
3,861 1
 
1.0%
58 1
 
1.0%
297 1
 
1.0%
1,253 1
 
1.0%
2,388 1
 
1.0%
3,142 1
 
1.0%
6,031 1
 
1.0%
2,696 1
 
1.0%
Other values (85) 85
88.5%
2024-05-11T08:55:27.496095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 62
15.5%
, 59
14.8%
5 38
9.5%
3 38
9.5%
2 35
8.8%
7 30
7.5%
4 30
7.5%
0 29
7.3%
9 28
7.0%
6 26
6.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 340
85.2%
Other Punctuation 59
 
14.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 62
18.2%
5 38
11.2%
3 38
11.2%
2 35
10.3%
7 30
8.8%
4 30
8.8%
0 29
8.5%
9 28
8.2%
6 26
7.6%
8 24
 
7.1%
Other Punctuation
ValueCountFrequency (%)
, 59
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 399
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 62
15.5%
, 59
14.8%
5 38
9.5%
3 38
9.5%
2 35
8.8%
7 30
7.5%
4 30
7.5%
0 29
7.3%
9 28
7.0%
6 26
6.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 399
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 62
15.5%
, 59
14.8%
5 38
9.5%
3 38
9.5%
2 35
8.8%
7 30
7.5%
4 30
7.5%
0 29
7.3%
9 28
7.0%
6 26
6.5%

Interactions

2024-05-11T08:55:20.611624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T08:55:27.807995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가입일시연령대코드성별가입 수
가입일시1.0000.0000.0001.000
연령대코드0.0001.0000.0000.924
성별0.0000.0001.0000.000
가입 수1.0000.9240.0001.000
2024-05-11T08:55:28.115907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별연령대코드
성별1.0000.000
연령대코드0.0001.000
2024-05-11T08:55:28.390694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가입일시연령대코드성별
가입일시1.0000.0000.000
연령대코드0.0001.0000.000
성별0.0000.0001.000

Missing values

2024-05-11T08:55:21.187758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T08:55:21.820646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

가입일시사용자코드연령대코드성별가입 수
0202207회원-내국인~10대F3,880
1202207회원-내국인20대F10,470
2202207회원-내국인30대F4,428
3202207회원-내국인40대F2,965
4202207회원-내국인50대F1,713
5202207회원-내국인60대F525
6202207회원-내국인70대이상F82
7202207회원-내국인기타F215
8202207회원-내국인~10대M5,291
9202207회원-내국인20대M11,718
가입일시사용자코드연령대코드성별가입 수
86202212회원-내국인70대이상F21
87202212회원-내국인기타F5
88202212회원-내국인~10대M1,593
89202212회원-내국인20대M1,915
90202212회원-내국인30대M1,341
91202212회원-내국인40대M1,075
92202212회원-내국인50대M772
93202212회원-내국인60대M313
94202212회원-내국인70대이상M41
95202212회원-내국인기타M13