Overview

Dataset statistics

Number of variables4
Number of observations42
Missing cells0
Missing cells (%)0.0%
Duplicate rows9
Duplicate rows (%)21.4%
Total size in memory1.5 KiB
Average record size in memory37.1 B

Variable types

Categorical3
Numeric1

Dataset

Description한국자산관리공사_국유증권 보유현황("취득년도","취득일자","매각구분","수량") 데이터 제공
Author한국자산관리공사
URLhttps://www.data.go.kr/data/15074514/fileData.do

Alerts

취득년도 has constant value ""Constant
Dataset has 9 (21.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 22:33:25.229097
Analysis finished2023-12-12 22:33:25.565530
Duration0.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

취득년도
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size468.0 B
2019
42 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 42
100.0%

Length

2023-12-13T07:33:25.639044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:33:25.731230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 42
100.0%

취득일자
Categorical

Distinct18
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size468.0 B
04-12
12 
04-19
04-30
04-15
10-16
Other values (13)
14 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique12 ?
Unique (%)28.6%

Sample

1st row01-09
2nd row02-11
3rd row03-27
4th row03-28
5th row04-12

Common Values

ValueCountFrequency (%)
04-12 12
28.6%
04-19 9
21.4%
04-30 3
 
7.1%
04-15 2
 
4.8%
10-16 2
 
4.8%
07-10 2
 
4.8%
07-12 1
 
2.4%
03-27 1
 
2.4%
03-28 1
 
2.4%
06-13 1
 
2.4%
Other values (8) 8
19.0%

Length

2023-12-13T07:33:25.818120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
04-12 12
28.6%
04-19 9
21.4%
04-30 3
 
7.1%
04-15 2
 
4.8%
10-16 2
 
4.8%
07-10 2
 
4.8%
07-04 1
 
2.4%
01-09 1
 
2.4%
08-02 1
 
2.4%
11-05 1
 
2.4%
Other values (8) 8
19.0%

매각구분
Categorical

Distinct2
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size468.0 B
일괄매각
25 
일부매각
17 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일괄매각
2nd row일괄매각
3rd row일괄매각
4th row일괄매각
5th row일부매각

Common Values

ValueCountFrequency (%)
일괄매각 25
59.5%
일부매각 17
40.5%

Length

2023-12-13T07:33:25.916788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:33:26.015980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일괄매각 25
59.5%
일부매각 17
40.5%

수량
Real number (ℝ)

Distinct30
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26510.5
Minimum205
Maximum203939
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size510.0 B
2023-12-13T07:33:26.117981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum205
5-th percentile1065.8
Q13072
median5500
Q317500
95-th percentile159794.3
Maximum203939
Range203734
Interquartile range (IQR)14428

Descriptive statistics

Standard deviation49800.297
Coefficient of variation (CV)1.8785122
Kurtosis5.2965243
Mean26510.5
Median Absolute Deviation (MAD)4126
Skewness2.5000592
Sum1113441
Variance2.4800696 × 109
MonotonicityNot monotonic
2023-12-13T07:33:26.244378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
5000 5
 
11.9%
2000 3
 
7.1%
18000 2
 
4.8%
160646 2
 
4.8%
3072 2
 
4.8%
8786 2
 
4.8%
3140 2
 
4.8%
1366 2
 
4.8%
12316 1
 
2.4%
4020 1
 
2.4%
Other values (20) 20
47.6%
ValueCountFrequency (%)
205 1
 
2.4%
301 1
 
2.4%
1050 1
 
2.4%
1366 2
4.8%
1382 1
 
2.4%
2000 3
7.1%
2735 1
 
2.4%
3072 2
4.8%
3140 2
4.8%
4020 1
 
2.4%
ValueCountFrequency (%)
203939 1
2.4%
160646 2
4.8%
143612 1
2.4%
95430 1
2.4%
60090 1
2.4%
35976 1
2.4%
24600 1
2.4%
24090 1
2.4%
18000 2
4.8%
16000 1
2.4%

Interactions

2023-12-13T07:33:25.329212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:33:26.341252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
취득일자매각구분수량
취득일자1.0000.5470.856
매각구분0.5471.0000.150
수량0.8560.1501.000
2023-12-13T07:33:26.437290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
취득일자매각구분
취득일자1.0000.324
매각구분0.3241.000
2023-12-13T07:33:26.530303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수량취득일자매각구분
수량1.0000.4920.061
취득일자0.4921.0000.324
매각구분0.0610.3241.000

Missing values

2023-12-13T07:33:25.447271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:33:25.525839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

취득년도취득일자매각구분수량
0201901-09일괄매각4229
1201902-11일괄매각205
2201903-27일괄매각1382
3201903-28일괄매각143612
4201904-12일부매각14600
5201904-12일괄매각8786
6201904-12일괄매각8786
7201904-12일부매각2000
8201904-12일부매각6786
9201904-12일괄매각60090
취득년도취득일자매각구분수량
32201907-10일괄매각160646
33201907-10일괄매각160646
34201907-12일괄매각12316
35201908-02일괄매각15000
36201908-14일괄매각301
37201910-16일괄매각3140
38201910-16일괄매각3140
39201910-25일괄매각2735
40201911-05일괄매각203939
41201911-27일괄매각1050

Duplicate rows

Most frequently occurring

취득년도취득일자매각구분수량# duplicates
6201904-19일부매각50003
0201904-12일괄매각87862
1201904-12일부매각50002
2201904-12일부매각180002
3201904-15일괄매각13662
4201904-19일괄매각30722
5201904-19일부매각20002
7201907-10일괄매각1606462
8201910-16일괄매각31402