Overview

Dataset statistics

Number of variables5
Number of observations34
Missing cells2
Missing cells (%)1.2%
Duplicate rows9
Duplicate rows (%)26.5%
Total size in memory1.6 KiB
Average record size in memory46.9 B

Variable types

Categorical3
Numeric2

Alerts

Dataset has 9 (26.5%) duplicate rowsDuplicates
mber_cd_nm is highly overall correlated with mber_cdHigh correlation
mber_cd is highly overall correlated with mber_cd_nmHigh correlation
cl_cd is highly overall correlated with cl_cd_nmHigh correlation
co is highly overall correlated with cl_cd_nmHigh correlation
cl_cd_nm is highly overall correlated with cl_cd and 1 other fieldsHigh correlation
cl_cd has 2 (5.9%) missing valuesMissing

Reproduction

Analysis started2023-12-10 10:14:47.793457
Analysis finished2023-12-10 10:14:49.123660
Duration1.33 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

mber_cd
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
1
18 
2
16 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 18
52.9%
2 16
47.1%

Length

2023-12-10T19:14:49.338104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:49.518864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 18
52.9%
2 16
47.1%

mber_cd_nm
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
정회원
18 
웹회원
16 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정회원
2nd row정회원
3rd row정회원
4th row정회원
5th row정회원

Common Values

ValueCountFrequency (%)
정회원 18
52.9%
웹회원 16
47.1%

Length

2023-12-10T19:14:49.716860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:14:49.896968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정회원 18
52.9%
웹회원 16
47.1%

cl_cd
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct12
Distinct (%)37.5%
Missing2
Missing (%)5.9%
Infinite0
Infinite (%)0.0%
Mean1810.1875
Minimum1
Maximum9999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T19:14:50.072338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13.75
median6.5
Q312.25
95-th percentile9999
Maximum9999
Range9998
Interquartile range (IQR)8.5

Descriptive statistics

Standard deviation3822.9252
Coefficient of variation (CV)2.1118946
Kurtosis0.95398487
Mean1810.1875
Median Absolute Deviation (MAD)4.5
Skewness1.6953764
Sum57926
Variance14614757
MonotonicityNot monotonic
2023-12-10T19:14:50.263798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 4
11.8%
5 4
11.8%
7 4
11.8%
9999 4
11.8%
2 2
5.9%
3 2
5.9%
6 2
5.9%
12 2
5.9%
13 2
5.9%
4 2
5.9%
Other values (2) 4
11.8%
ValueCountFrequency (%)
1 4
11.8%
2 2
5.9%
3 2
5.9%
4 2
5.9%
5 4
11.8%
6 2
5.9%
7 4
11.8%
11 2
5.9%
12 2
5.9%
13 2
5.9%
ValueCountFrequency (%)
9999 4
11.8%
8888 2
5.9%
13 2
5.9%
12 2
5.9%
11 2
5.9%
7 4
11.8%
6 2
5.9%
5 4
11.8%
4 2
5.9%
3 2
5.9%

cl_cd_nm
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Memory size404.0 B
직원
외부이용자
행사및사업
기타
정회원
Other values (8)
16 

Length

Max length6
Median length4
Mean length4.1176471
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row직원
2nd row정회원
3rd row위원회 위원
4th row외부이용자
5th row자원봉사

Common Values

ValueCountFrequency (%)
직원 4
11.8%
외부이용자 4
11.8%
행사및사업 4
11.8%
기타 4
11.8%
정회원 2
 
5.9%
위원회 위원 2
 
5.9%
자원봉사 2
 
5.9%
교류기관대표 2
 
5.9%
교류기관회원 2
 
5.9%
<NA> 2
 
5.9%
Other values (3) 6
17.6%

Length

2023-12-10T19:14:50.517225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
직원 4
11.1%
외부이용자 4
11.1%
행사및사업 4
11.1%
기타 4
11.1%
정회원 2
 
5.6%
위원회 2
 
5.6%
위원 2
 
5.6%
자원봉사 2
 
5.6%
교류기관대표 2
 
5.6%
교류기관회원 2
 
5.6%
Other values (4) 8
22.2%

co
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)73.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1325.7059
Minimum10
Maximum11550
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-10T19:14:50.741021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile10.65
Q128
median52
Q3238.75
95-th percentile7994.8
Maximum11550
Range11540
Interquartile range (IQR)210.75

Descriptive statistics

Standard deviation3029.0965
Coefficient of variation (CV)2.2848933
Kurtosis5.9940901
Mean1325.7059
Median Absolute Deviation (MAD)41
Skewness2.5785849
Sum45074
Variance9175425.9
MonotonicityNot monotonic
2023-12-10T19:14:50.973157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
28 2
 
5.9%
11 2
 
5.9%
154 2
 
5.9%
52 2
 
5.9%
198 2
 
5.9%
43 2
 
5.9%
51 2
 
5.9%
30 2
 
5.9%
10 2
 
5.9%
19 1
 
2.9%
Other values (15) 15
44.1%
ValueCountFrequency (%)
10 2
5.9%
11 2
5.9%
17 1
2.9%
19 1
2.9%
26 1
2.9%
27 1
2.9%
28 2
5.9%
30 2
5.9%
43 2
5.9%
51 2
5.9%
ValueCountFrequency (%)
11550 1
2.9%
11133 1
2.9%
6305 1
2.9%
6270 1
2.9%
3577 1
2.9%
3573 1
2.9%
386 1
2.9%
382 1
2.9%
239 1
2.9%
238 1
2.9%

Interactions

2023-12-10T19:14:48.397658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:14:48.115365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:14:48.564292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:14:48.252613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:14:51.120445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
mber_cdmber_cd_nmcl_cdcl_cd_nmco
mber_cd1.0000.9950.1060.6060.494
mber_cd_nm0.9951.0000.1060.6060.494
cl_cd0.1060.1061.0001.0000.000
cl_cd_nm0.6060.6061.0001.0001.000
co0.4940.4940.0001.0001.000
2023-12-10T19:14:51.296540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
mber_cd_nmcl_cd_nmmber_cd
mber_cd_nm1.0000.3770.939
cl_cd_nm0.3771.0000.377
mber_cd0.9390.3771.000
2023-12-10T19:14:51.454662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
cl_cdcomber_cdmber_cd_nmcl_cd_nm
cl_cd1.0000.0350.1670.1670.830
co0.0351.0000.3200.3200.830
mber_cd0.1670.3201.0000.9390.377
mber_cd_nm0.1670.3200.9391.0000.377
cl_cd_nm0.8300.8300.3770.3771.000

Missing values

2023-12-10T19:14:48.789506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:14:49.032173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

mber_cdmber_cd_nmcl_cdcl_cd_nmco
01정회원1직원382
11정회원2정회원6270
21정회원3위원회 위원10
31정회원5외부이용자30
41정회원6자원봉사51
51정회원7행사및사업27
61정회원12교류기관대표11
71정회원13교류기관회원238
81정회원9999기타43
92웹회원<NA><NA>3577
mber_cdmber_cd_nmcl_cdcl_cd_nmco
242웹회원9999기타91
252웹회원5외부이용자52
261정회원6자원봉사51
271정회원9999기타43
281정회원5외부이용자30
292웹회원7행사및사업28
301정회원7행사및사업26
312웹회원1직원19
321정회원12교류기관대표11
331정회원3위원회 위원10

Duplicate rows

Most frequently occurring

mber_cdmber_cd_nmcl_cdcl_cd_nmco# duplicates
01정회원3위원회 위원102
11정회원5외부이용자302
21정회원6자원봉사512
31정회원12교류기관대표112
41정회원9999기타432
52웹회원4강좌회원1982
62웹회원5외부이용자522
72웹회원7행사및사업282
82웹회원8888장기연체회원1542