Overview

Dataset statistics

Number of variables7
Number of observations86
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory60.5 B

Variable types

Numeric3
Categorical2
DateTime2

Dataset

Description상주시 장학회 후원 회원에 대한 데이터로 회원번호, 후원기간, 후원금액, 후원일, 후원횟수의 등의 항목을 제공합니다
Author경상북도 상주시
URLhttps://www.data.go.kr/data/15089277/fileData.do

Alerts

후원횟수 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 후원횟수High correlation
구분 is highly imbalanced (78.2%)Imbalance
번호 has unique valuesUnique
금액 has 2 (2.3%) zerosZeros
후원횟수 has 15 (17.4%) zerosZeros

Reproduction

Analysis started2023-12-12 22:15:31.877421
Analysis finished2023-12-12 22:15:33.099425
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.05814
Minimum6
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-13T07:15:33.185642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile10.5
Q128.25
median49.5
Q371.75
95-th percentile91.25
Maximum100
Range94
Interquartile range (IQR)43.5

Descriptive statistics

Standard deviation25.969373
Coefficient of variation (CV)0.51878423
Kurtosis-1.097047
Mean50.05814
Median Absolute Deviation (MAD)22
Skewness0.076516799
Sum4305
Variance674.40834
MonotonicityStrictly decreasing
2023-12-13T07:15:33.372766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 1
 
1.2%
37 1
 
1.2%
29 1
 
1.2%
30 1
 
1.2%
31 1
 
1.2%
32 1
 
1.2%
33 1
 
1.2%
34 1
 
1.2%
35 1
 
1.2%
36 1
 
1.2%
Other values (76) 76
88.4%
ValueCountFrequency (%)
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
12 1
1.2%
13 1
1.2%
14 1
1.2%
15 1
1.2%
16 1
1.2%
ValueCountFrequency (%)
100 1
1.2%
99 1
1.2%
94 1
1.2%
93 1
1.2%
92 1
1.2%
89 1
1.2%
88 1
1.2%
87 1
1.2%
86 1
1.2%
84 1
1.2%

기간
Categorical

Distinct4
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size820.0 B
월별
47 
일시불
36 
연별
 
2
분기별
 
1

Length

Max length3
Median length2
Mean length2.4302326
Min length2

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row일시불
2nd row월별
3rd row일시불
4th row월별
5th row월별

Common Values

ValueCountFrequency (%)
월별 47
54.7%
일시불 36
41.9%
연별 2
 
2.3%
분기별 1
 
1.2%

Length

2023-12-13T07:15:33.493003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:15:33.591101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월별 47
54.7%
일시불 36
41.9%
연별 2
 
2.3%
분기별 1
 
1.2%

금액
Real number (ℝ)

ZEROS 

Distinct14
Distinct (%)16.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean162443.02
Minimum0
Maximum5000000
Zeros2
Zeros (%)2.3%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-13T07:15:33.685167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6250
Q110000
median50000
Q3100000
95-th percentile500000
Maximum5000000
Range5000000
Interquartile range (IQR)90000

Descriptive statistics

Standard deviation584096.13
Coefficient of variation (CV)3.5956985
Kurtosis57.590895
Mean162443.02
Median Absolute Deviation (MAD)40000
Skewness7.2511771
Sum13970100
Variance3.4116829 × 1011
MonotonicityNot monotonic
2023-12-13T07:15:33.782712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
10000 21
24.4%
100000 20
23.3%
50000 20
23.3%
20000 5
 
5.8%
30000 5
 
5.8%
500000 3
 
3.5%
0 2
 
2.3%
300000 2
 
2.3%
200000 2
 
2.3%
5000 2
 
2.3%
Other values (4) 4
 
4.7%
ValueCountFrequency (%)
0 2
 
2.3%
100 1
 
1.2%
5000 2
 
2.3%
10000 21
24.4%
20000 5
 
5.8%
30000 5
 
5.8%
50000 20
23.3%
100000 20
23.3%
200000 2
 
2.3%
300000 2
 
2.3%
ValueCountFrequency (%)
5000000 1
 
1.2%
2000000 1
 
1.2%
1000000 1
 
1.2%
500000 3
 
3.5%
300000 2
 
2.3%
200000 2
 
2.3%
100000 20
23.3%
50000 20
23.3%
30000 5
 
5.8%
20000 5
 
5.8%
Distinct50
Distinct (%)58.1%
Missing0
Missing (%)0.0%
Memory size820.0 B
Minimum2008-07-21 00:00:00
Maximum2020-07-07 00:00:00
2023-12-13T07:15:34.182222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:34.316518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct50
Distinct (%)58.1%
Missing0
Missing (%)0.0%
Memory size820.0 B
Minimum2008-07-21 00:00:00
Maximum2020-07-07 00:00:00
2023-12-13T07:15:34.438376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:34.570353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

후원횟수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5116279
Minimum0
Maximum60
Zeros15
Zeros (%)17.4%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-13T07:15:34.679098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q310
95-th percentile23
Maximum60
Range60
Interquartile range (IQR)9

Descriptive statistics

Standard deviation12.016405
Coefficient of variation (CV)1.8453764
Kurtosis12.341281
Mean6.5116279
Median Absolute Deviation (MAD)1
Skewness3.3609914
Sum560
Variance144.39398
MonotonicityNot monotonic
2023-12-13T07:15:34.766502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 39
45.3%
0 15
 
17.4%
12 12
 
14.0%
10 10
 
11.6%
60 3
 
3.5%
6 1
 
1.2%
36 1
 
1.2%
24 1
 
1.2%
20 1
 
1.2%
4 1
 
1.2%
Other values (2) 2
 
2.3%
ValueCountFrequency (%)
0 15
 
17.4%
1 39
45.3%
2 1
 
1.2%
4 1
 
1.2%
5 1
 
1.2%
6 1
 
1.2%
10 10
 
11.6%
12 12
 
14.0%
20 1
 
1.2%
24 1
 
1.2%
ValueCountFrequency (%)
60 3
 
3.5%
36 1
 
1.2%
24 1
 
1.2%
20 1
 
1.2%
12 12
14.0%
10 10
11.6%
6 1
 
1.2%
5 1
 
1.2%
4 1
 
1.2%
2 1
 
1.2%

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size820.0 B
P
83 
J
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowJ
3rd rowP
4th rowJ
5th rowP

Common Values

ValueCountFrequency (%)
P 83
96.5%
J 3
 
3.5%

Length

2023-12-13T07:15:34.859264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:15:34.953017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
p 83
96.5%
j 3
 
3.5%

Interactions

2023-12-13T07:15:32.635410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.135729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.382502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.712497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.221242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.455232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.794586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.300922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:15:32.535486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:15:35.013329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호기간금액시작일종료일후원횟수구분
번호1.0000.0350.2360.9870.9870.0000.588
기간0.0351.0000.7440.0000.0000.3880.000
금액0.2360.7441.0000.9430.9430.0000.000
시작일0.9870.0000.9431.0001.0000.0001.000
종료일0.9870.0000.9431.0001.0000.0001.000
후원횟수0.0000.3880.0000.0000.0001.0000.508
구분0.5880.0000.0001.0001.0000.5081.000
2023-12-13T07:15:35.117926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분기간
구분1.0000.000
기간0.0001.000
2023-12-13T07:15:35.195198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호금액후원횟수기간구분
번호1.000-0.0210.0610.0000.431
금액-0.0211.000-0.3340.3810.000
후원횟수0.061-0.3341.0000.2990.538
기간0.0000.3810.2991.0000.000
구분0.4310.0000.5380.0001.000

Missing values

2023-12-13T07:15:32.919233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:15:33.049675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호기간금액시작일종료일후원횟수구분
0100일시불1000002020-07-072020-07-070P
199월별1000002019-04-152019-04-150J
294일시불02018-05-042018-05-041P
393월별1000002017-08-092017-08-096J
492월별1000002016-01-222016-01-221P
589월별100002015-10-212015-10-2112P
688월별200002015-07-252015-07-2560P
787월별100002015-06-042015-06-041P
886월별3000002014-03-242014-03-2436J
984월별100002013-01-252013-01-251P
번호기간금액시작일종료일후원횟수구분
7616월별1000002008-11-192008-11-1910P
7715월별500002008-11-182008-11-1810P
7814일시불500002008-11-182008-11-180P
7913일시불1000002008-11-182008-11-180P
8012월별100002008-11-032008-11-0312P
8110월별500002008-10-272008-10-2710P
829연별20000002008-10-272008-10-275P
838월별100002008-10-222008-10-2212P
847일시불2000002008-11-192008-11-190P
856월별100002008-07-212008-07-212P