Overview

Dataset statistics

Number of variables5
Number of observations2270
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory91.0 KiB
Average record size in memory41.1 B

Variable types

DateTime1
Categorical3
Numeric1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15243/S/1/datasetView.do

Alerts

'사용자코드' has constant value ""Constant
'신규가입자수' is highly overall correlated with '성별'High correlation
'성별' is highly overall correlated with '신규가입자수'High correlation

Reproduction

Analysis started2023-12-11 06:24:38.168697
Analysis finished2023-12-11 06:24:38.631303
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct199
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
Minimum2018-01-01 00:00:00
Maximum2018-07-19 00:00:00
2023-12-11T15:24:38.720809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T15:24:38.889087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

'사용자코드'
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
'회원-내국인'
2270 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row'회원-내국인'
2nd row'회원-내국인'
3rd row'회원-내국인'
4th row'회원-내국인'
5th row'회원-내국인'

Common Values

ValueCountFrequency (%)
'회원-내국인' 2270
100.0%

Length

2023-12-11T15:24:39.041049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T15:24:39.130963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
회원-내국인 2270
100.0%

'성별'
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
'M'
1129 
'F'
1104 
''
 
37

Length

Max length3
Median length3
Mean length2.9837004
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row'F'
2nd row'M'
3rd row'F'
4th row'M'
5th row'F'

Common Values

ValueCountFrequency (%)
'M' 1129
49.7%
'F' 1104
48.6%
'' 37
 
1.6%

Length

2023-12-11T15:24:39.550578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T15:24:39.645154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 1129
49.7%
f 1104
48.6%
37
 
1.6%
Distinct7
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
'20대'
340 
'30대'
340 
'40대'
340 
'50대'
338 
'~10대'
327 
Other values (2)
585 

Length

Max length6
Median length5
Mean length5.2651982
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row'~10대'
2nd row'~10대'
3rd row'20대'
4th row'20대'
5th row'30대'

Common Values

ValueCountFrequency (%)
'20대' 340
15.0%
'30대' 340
15.0%
'40대' 340
15.0%
'50대' 338
14.9%
'~10대' 327
14.4%
'60대' 310
13.7%
'70대~' 275
12.1%

Length

2023-12-11T15:24:39.755006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T15:24:39.878987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20대 340
15.0%
30대 340
15.0%
40대 340
15.0%
50대 338
14.9%
10대 327
14.4%
60대 310
13.7%
70대 275
12.1%

'신규가입자수'
Real number (ℝ)

HIGH CORRELATION 

Distinct445
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean121.8022
Minimum1
Maximum5706
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.1 KiB
2023-12-11T15:24:40.115427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q16
median24.5
Q384.75
95-th percentile520
Maximum5706
Range5705
Interquartile range (IQR)78.75

Descriptive statistics

Standard deviation352.28859
Coefficient of variation (CV)2.8923006
Kurtosis90.485056
Mean121.8022
Median Absolute Deviation (MAD)21.5
Skewness8.1580528
Sum276491
Variance124107.25
MonotonicityNot monotonic
2023-12-11T15:24:40.251173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 134
 
5.9%
2 124
 
5.5%
3 104
 
4.6%
4 82
 
3.6%
5 73
 
3.2%
6 71
 
3.1%
7 53
 
2.3%
8 49
 
2.2%
9 43
 
1.9%
10 43
 
1.9%
Other values (435) 1494
65.8%
ValueCountFrequency (%)
1 134
5.9%
2 124
5.5%
3 104
4.6%
4 82
3.6%
5 73
3.2%
6 71
3.1%
7 53
 
2.3%
8 49
 
2.2%
9 43
 
1.9%
10 43
 
1.9%
ValueCountFrequency (%)
5706 1
< 0.1%
5428 1
< 0.1%
4526 1
< 0.1%
4408 1
< 0.1%
3446 1
< 0.1%
3335 1
< 0.1%
3099 1
< 0.1%
2937 1
< 0.1%
2828 1
< 0.1%
2684 1
< 0.1%

Interactions

2023-12-11T15:24:38.363087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T15:24:40.338459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
'성별''연령대코드''신규가입자수'
'성별'1.0000.3360.850
'연령대코드'0.3361.0000.330
'신규가입자수'0.8500.3301.000
2023-12-11T15:24:40.424384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
'연령대코드''성별'
'연령대코드'1.0000.240
'성별'0.2401.000
2023-12-11T15:24:40.509907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
'신규가입자수''성별''연령대코드'
'신규가입자수'1.0000.5640.180
'성별'0.5641.0000.240
'연령대코드'0.1800.2401.000

Missing values

2023-12-11T15:24:38.490997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T15:24:38.582223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

'대여일자''사용자코드''성별''연령대코드''신규가입자수'
0'2018-01-01''회원-내국인''F''~10대'7
1'2018-01-01''회원-내국인''M''~10대'9
2'2018-01-01''회원-내국인''F''20대'58
3'2018-01-01''회원-내국인''M''20대'81
4'2018-01-01''회원-내국인''F''30대'32
5'2018-01-01''회원-내국인''M''30대'46
6'2018-01-01''회원-내국인''F''40대'19
7'2018-01-01''회원-내국인''M''40대'26
8'2018-01-01''회원-내국인''F''50대'9
9'2018-01-01''회원-내국인''M''50대'8
'대여일자''사용자코드''성별''연령대코드''신규가입자수'
2260'2018-07-10''회원-내국인''''70대~'1347
2261'2018-07-11''회원-내국인''''70대~'1856
2262'2018-07-12''회원-내국인''''70대~'2372
2263'2018-07-13''회원-내국인''''70대~'2539
2264'2018-07-14''회원-내국인''''70대~'3446
2265'2018-07-15''회원-내국인''''70대~'2937
2266'2018-07-16''회원-내국인''''70대~'2161
2267'2018-07-17''회원-내국인''''70대~'2133
2268'2018-07-18''회원-내국인''''70대~'2123
2269'2018-07-19''회원-내국인''''70대~'2194