Overview

Dataset statistics

Number of variables3
Number of observations453
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.2 KiB
Average record size in memory25.3 B

Variable types

Numeric1
DateTime1
Categorical1

Dataset

Description상주시 생활체육공원(국민체육센터) 홈페이지의 회원의 회원번호, 가입일자, 성별에 대한 데이터를 제공합니다.
Author경상북도 상주시
URLhttps://www.data.go.kr/data/15095982/fileData.do

Alerts

회원번호 is highly overall correlated with 성별High correlation
성별 is highly overall correlated with 회원번호High correlation
회원번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:48:23.872286
Analysis finished2023-12-12 23:48:24.170556
Duration0.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

회원번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct453
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean228.34437
Minimum1
Maximum455
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2023-12-13T08:48:24.238510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.6
Q1115
median228
Q3342
95-th percentile432.4
Maximum455
Range454
Interquartile range (IQR)227

Descriptive statistics

Standard deviation131.31087
Coefficient of variation (CV)0.57505631
Kurtosis-1.2017805
Mean228.34437
Median Absolute Deviation (MAD)114
Skewness0.0030901486
Sum103440
Variance17242.545
MonotonicityStrictly increasing
2023-12-13T08:48:24.385815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
286 1
 
0.2%
313 1
 
0.2%
312 1
 
0.2%
311 1
 
0.2%
310 1
 
0.2%
309 1
 
0.2%
308 1
 
0.2%
307 1
 
0.2%
306 1
 
0.2%
Other values (443) 443
97.8%
ValueCountFrequency (%)
1 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
11 1
0.2%
ValueCountFrequency (%)
455 1
0.2%
454 1
0.2%
453 1
0.2%
452 1
0.2%
451 1
0.2%
450 1
0.2%
449 1
0.2%
448 1
0.2%
447 1
0.2%
446 1
0.2%
Distinct407
Distinct (%)89.8%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
Minimum2009-10-01 00:00:00
Maximum2021-12-07 00:00:00
2023-12-13T08:48:24.535763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:48:24.680131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

성별
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
389 
64 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
389
85.9%
64
 
14.1%

Length

2023-12-13T08:48:24.823201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:48:24.909860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
389
85.9%
64
 
14.1%

Interactions

2023-12-13T08:48:23.953788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:48:24.962103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회원번호성별
회원번호1.0000.842
성별0.8421.000
2023-12-13T08:48:25.033081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
회원번호성별
회원번호1.0000.668
성별0.6681.000

Missing values

2023-12-13T08:48:24.059495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:48:24.139468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회원번호가입일자성별
012009-10-01
132009-10-16
242009-10-19
352009-10-19
462009-10-19
572009-10-19
682009-10-20
792009-10-20
8102009-10-20
9112009-10-20
회원번호가입일자성별
4434462021-07-23
4444472021-07-27
4454482021-08-25
4464492021-09-28
4474502021-10-11
4484512021-10-24
4494522021-11-01
4504532021-11-25
4514542021-12-07
4524552021-12-07