Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Categorical4
Numeric2

Alerts

manage_code_name is highly overall correlated with base_year and 2 other fieldsHigh correlation
base_month is highly overall correlated with base_year and 1 other fieldsHigh correlation
base_year is highly overall correlated with base_month and 1 other fieldsHigh correlation
class_no_name is highly overall correlated with manage_code_nameHigh correlation
base_year is highly imbalanced (80.6%)Imbalance
base_month is highly imbalanced (80.6%)Imbalance
age_dc has 12 (12.0%) zerosZeros

Reproduction

Analysis started2023-12-10 09:53:54.617802
Analysis finished2023-12-10 09:53:56.200589
Duration1.58 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

base_year
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019
97 
2021
 
3

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2021
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 97
97.0%
2021 3
 
3.0%

Length

2023-12-10T18:53:56.459701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:56.645620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 97
97.0%
2021 3
 
3.0%

base_month
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
97 
6
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row6
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 97
97.0%
6 3
 
3.0%

Length

2023-12-10T18:53:56.811350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:57.037302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 97
97.0%
6 3
 
3.0%

manage_code_name
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
강서기적의도서관
79 
구덕도서관
18 
화명도서관
 
3

Length

Max length8
Median length8
Mean length7.37
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강서기적의도서관
2nd row화명도서관
3rd row강서기적의도서관
4th row강서기적의도서관
5th row강서기적의도서관

Common Values

ValueCountFrequency (%)
강서기적의도서관 79
79.0%
구덕도서관 18
 
18.0%
화명도서관 3
 
3.0%

Length

2023-12-10T18:53:57.236776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:57.431902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강서기적의도서관 79
79.0%
구덕도서관 18
 
18.0%
화명도서관 3
 
3.0%

class_no_name
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
문학
17 
기술과학
16 
총류
10 
역사
예술
Other values (5)
39 

Length

Max length4
Median length2
Mean length2.64
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기술과학
2nd row총류
3rd row기술과학
4th row기술과학
5th row기술과학

Common Values

ValueCountFrequency (%)
문학 17
17.0%
기술과학 16
16.0%
총류 10
10.0%
역사 9
9.0%
예술 9
9.0%
사회과학 8
8.0%
자연과학 8
8.0%
종교 8
8.0%
철학 8
8.0%
언어 7
7.0%

Length

2023-12-10T18:53:57.661124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T18:53:57.953000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
문학 17
17.0%
기술과학 16
16.0%
총류 10
10.0%
역사 9
9.0%
예술 9
9.0%
사회과학 8
8.0%
자연과학 8
8.0%
종교 8
8.0%
철학 8
8.0%
언어 7
7.0%

age_dc
Real number (ℝ)

ZEROS 

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.7
Minimum0
Maximum80
Zeros12
Zeros (%)12.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T18:53:58.253454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q120
median40
Q360
95-th percentile80
Maximum80
Range80
Interquartile range (IQR)40

Descriptive statistics

Standard deviation24.775966
Coefficient of variation (CV)0.65718742
Kurtosis-1.1428181
Mean37.7
Median Absolute Deviation (MAD)20
Skewness0.04983684
Sum3770
Variance613.84848
MonotonicityNot monotonic
2023-12-10T18:53:58.484948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
0 12
12.0%
60 12
12.0%
20 12
12.0%
30 12
12.0%
40 12
12.0%
50 12
12.0%
10 11
11.0%
70 10
10.0%
80 7
7.0%
ValueCountFrequency (%)
0 12
12.0%
10 11
11.0%
20 12
12.0%
30 12
12.0%
40 12
12.0%
50 12
12.0%
60 12
12.0%
70 10
10.0%
80 7
7.0%
ValueCountFrequency (%)
80 7
7.0%
70 10
10.0%
60 12
12.0%
50 12
12.0%
40 12
12.0%
30 12
12.0%
20 12
12.0%
10 11
11.0%
0 12
12.0%

value
Real number (ℝ)

Distinct69
Distinct (%)69.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean161.45
Minimum1
Maximum2251
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T18:53:59.191956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q18.75
median35.5
Q3154.25
95-th percentile733.6
Maximum2251
Range2250
Interquartile range (IQR)145.5

Descriptive statistics

Standard deviation324.88227
Coefficient of variation (CV)2.0122779
Kurtosis19.044091
Mean161.45
Median Absolute Deviation (MAD)33
Skewness3.9108516
Sum16145
Variance105548.49
MonotonicityNot monotonic
2023-12-10T18:53:59.569463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 7
 
7.0%
2 6
 
6.0%
1 4
 
4.0%
21 4
 
4.0%
15 3
 
3.0%
5 3
 
3.0%
13 3
 
3.0%
29 3
 
3.0%
3 2
 
2.0%
45 2
 
2.0%
Other values (59) 63
63.0%
ValueCountFrequency (%)
1 4
4.0%
2 6
6.0%
3 2
 
2.0%
4 7
7.0%
5 3
3.0%
6 1
 
1.0%
7 1
 
1.0%
8 1
 
1.0%
9 1
 
1.0%
11 2
 
2.0%
ValueCountFrequency (%)
2251 1
1.0%
1340 1
1.0%
1047 1
1.0%
1040 1
1.0%
1030 1
1.0%
718 1
1.0%
588 1
1.0%
552 1
1.0%
476 1
1.0%
428 1
1.0%

Interactions

2023-12-10T18:53:55.452224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:53:55.101793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:53:55.745445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T18:53:55.288608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T18:53:59.777278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
base_yearbase_monthmanage_code_nameclass_no_nameage_dcvalue
base_year1.0000.9631.0000.5880.0360.000
base_month0.9631.0001.0000.5880.0360.000
manage_code_name1.0001.0001.0000.6880.0000.124
class_no_name0.5880.5880.6881.0000.0000.133
age_dc0.0360.0360.0000.0001.0000.000
value0.0000.0000.1240.1330.0001.000
2023-12-10T18:54:00.199396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
manage_code_namebase_monthbase_yearclass_no_name
manage_code_name1.0000.9950.9950.522
base_month0.9951.0000.8260.435
base_year0.9950.8261.0000.435
class_no_name0.5220.4350.4351.000
2023-12-10T18:54:00.419139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
age_dcvaluebase_yearbase_monthmanage_code_nameclass_no_name
age_dc1.000-0.3700.0190.0190.0000.000
value-0.3701.0000.0000.0000.0780.059
base_year0.0190.0001.0000.8260.9950.435
base_month0.0190.0000.8261.0000.9950.435
manage_code_name0.0000.0780.9950.9951.0000.522
class_no_name0.0000.0590.4350.4350.5221.000

Missing values

2023-12-10T18:53:55.952111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T18:53:56.127623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

base_yearbase_monthmanage_code_nameclass_no_nameage_dcvalue
020191강서기적의도서관기술과학029
120216화명도서관총류6037
220191강서기적의도서관기술과학2014
320191강서기적의도서관기술과학30114
420191강서기적의도서관기술과학40220
520191강서기적의도서관기술과학5039
620191강서기적의도서관기술과학6021
720216화명도서관총류7026
820191강서기적의도서관기술과학803
920191강서기적의도서관문학01040
base_yearbase_monthmanage_code_nameclass_no_nameage_dcvalue
9020191구덕도서관기술과학804
9120191구덕도서관문학0320
9220191구덕도서관문학10718
9320191구덕도서관문학20337
9420191구덕도서관문학30476
9520191구덕도서관문학401030
9620191구덕도서관문학50588
9720191구덕도서관문학60280
9820191구덕도서관문학70117
9920191구덕도서관문학8021