Overview

Dataset statistics

Number of variables5
Number of observations1175
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory50.6 KiB
Average record size in memory44.1 B

Variable types

Numeric4
Text1

Dataset

Description국민연금의 연도말 기준 국내주식 투자 종목별 평가액, 자산군 내 비중, 지분율 등 투자 현황에 대한 정보 (단위: 억 원, %)
Author국민연금공단
URLhttps://www.data.go.kr/data/3070507/fileData.do

Alerts

번호 is highly overall correlated with 평가액(억 원) and 2 other fieldsHigh correlation
평가액(억 원) is highly overall correlated with 번호 and 2 other fieldsHigh correlation
자산군 내 비중(퍼센트) is highly overall correlated with 번호 and 2 other fieldsHigh correlation
지분율(퍼센트) is highly overall correlated with 번호 and 2 other fieldsHigh correlation
평가액(억 원) is highly skewed (γ1 = 26.14918146)Skewed
자산군 내 비중(퍼센트) is highly skewed (γ1 = 26.13354065)Skewed
번호 has unique valuesUnique
종목명 has unique valuesUnique
평가액(억 원) has 54 (4.6%) zerosZeros
자산군 내 비중(퍼센트) has 685 (58.3%) zerosZeros
지분율(퍼센트) has 24 (2.0%) zerosZeros

Reproduction

Analysis started2023-12-12 09:03:00.282209
Analysis finished2023-12-12 09:03:03.326289
Duration3.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1175
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean588
Minimum1
Maximum1175
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.5 KiB
2023-12-12T18:03:03.432600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile59.7
Q1294.5
median588
Q3881.5
95-th percentile1116.3
Maximum1175
Range1174
Interquartile range (IQR)587

Descriptive statistics

Standard deviation339.33759
Coefficient of variation (CV)0.57710474
Kurtosis-1.2
Mean588
Median Absolute Deviation (MAD)294
Skewness0
Sum690900
Variance115150
MonotonicityStrictly increasing
2023-12-12T18:03:03.627722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
773 1
 
0.1%
789 1
 
0.1%
788 1
 
0.1%
787 1
 
0.1%
786 1
 
0.1%
785 1
 
0.1%
784 1
 
0.1%
783 1
 
0.1%
782 1
 
0.1%
Other values (1165) 1165
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1175 1
0.1%
1174 1
0.1%
1173 1
0.1%
1172 1
0.1%
1171 1
0.1%
1170 1
0.1%
1169 1
0.1%
1168 1
0.1%
1167 1
0.1%
1166 1
0.1%

종목명
Text

UNIQUE 

Distinct1175
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size9.3 KiB
2023-12-12T18:03:03.999144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length4.8042553
Min length2

Characters and Unicode

Total characters5645
Distinct characters455
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1175 ?
Unique (%)100.0%

Sample

1st row삼성전자
2nd rowLG에너지솔루션
3rd row삼성바이오로직스
4th rowSK하이닉스
5th row삼성SDI
ValueCountFrequency (%)
cj 3
 
0.3%
cj제일제당 2
 
0.2%
신세계 2
 
0.2%
ls 2
 
0.2%
삼성전자 1
 
0.1%
씨에스베어링 1
 
0.1%
hdc현대ep 1
 
0.1%
한신공영 1
 
0.1%
일성신약 1
 
0.1%
와이엔텍 1
 
0.1%
Other values (1170) 1170
98.7%
2023-12-12T18:03:04.590146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
274
 
4.9%
225
 
4.0%
104
 
1.8%
96
 
1.7%
92
 
1.6%
82
 
1.5%
80
 
1.4%
67
 
1.2%
66
 
1.2%
65
 
1.2%
Other values (445) 4494
79.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5026
89.0%
Uppercase Letter 515
 
9.1%
Open Punctuation 23
 
0.4%
Close Punctuation 23
 
0.4%
Decimal Number 19
 
0.3%
Other Punctuation 14
 
0.2%
Lowercase Letter 12
 
0.2%
Space Separator 10
 
0.2%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
274
 
5.5%
225
 
4.5%
104
 
2.1%
96
 
1.9%
92
 
1.8%
82
 
1.6%
80
 
1.6%
67
 
1.3%
66
 
1.3%
65
 
1.3%
Other values (402) 3875
77.1%
Uppercase Letter
ValueCountFrequency (%)
S 65
12.6%
K 51
 
9.9%
C 50
 
9.7%
L 40
 
7.8%
G 38
 
7.4%
H 30
 
5.8%
B 28
 
5.4%
D 28
 
5.4%
T 26
 
5.0%
I 23
 
4.5%
Other values (15) 136
26.4%
Lowercase Letter
ValueCountFrequency (%)
i 3
25.0%
l 2
16.7%
n 2
16.7%
s 2
16.7%
o 1
 
8.3%
c 1
 
8.3%
t 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 7
36.8%
2 5
26.3%
3 3
15.8%
4 3
15.8%
9 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
& 13
92.9%
. 1
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5026
89.0%
Latin 527
 
9.3%
Common 92
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
274
 
5.5%
225
 
4.5%
104
 
2.1%
96
 
1.9%
92
 
1.8%
82
 
1.6%
80
 
1.6%
67
 
1.3%
66
 
1.3%
65
 
1.3%
Other values (402) 3875
77.1%
Latin
ValueCountFrequency (%)
S 65
12.3%
K 51
 
9.7%
C 50
 
9.5%
L 40
 
7.6%
G 38
 
7.2%
H 30
 
5.7%
B 28
 
5.3%
D 28
 
5.3%
T 26
 
4.9%
I 23
 
4.4%
Other values (22) 148
28.1%
Common
ValueCountFrequency (%)
( 23
25.0%
) 23
25.0%
& 13
14.1%
10
10.9%
1 7
 
7.6%
2 5
 
5.4%
- 3
 
3.3%
3 3
 
3.3%
4 3
 
3.3%
9 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5026
89.0%
ASCII 619
 
11.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
274
 
5.5%
225
 
4.5%
104
 
2.1%
96
 
1.9%
92
 
1.8%
82
 
1.6%
80
 
1.6%
67
 
1.3%
66
 
1.3%
65
 
1.3%
Other values (402) 3875
77.1%
ASCII
ValueCountFrequency (%)
S 65
 
10.5%
K 51
 
8.2%
C 50
 
8.1%
L 40
 
6.5%
G 38
 
6.1%
H 30
 
4.8%
B 28
 
4.5%
D 28
 
4.5%
T 26
 
4.2%
I 23
 
3.7%
Other values (33) 240
38.8%

평가액(억 원)
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct459
Distinct (%)39.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1055.2187
Minimum0
Maximum248521
Zeros54
Zeros (%)4.6%
Negative0
Negative (%)0.0%
Memory size10.5 KiB
2023-12-12T18:03:04.792138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q16
median31
Q3308.5
95-th percentile3487
Maximum248521
Range248521
Interquartile range (IQR)302.5

Descriptive statistics

Standard deviation7975.4617
Coefficient of variation (CV)7.5581124
Kurtosis793.87048
Mean1055.2187
Median Absolute Deviation (MAD)30
Skewness26.149181
Sum1239882
Variance63607990
MonotonicityDecreasing
2023-12-12T18:03:04.981433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 75
 
6.4%
0 54
 
4.6%
2 48
 
4.1%
4 40
 
3.4%
3 37
 
3.1%
6 35
 
3.0%
5 32
 
2.7%
7 23
 
2.0%
8 20
 
1.7%
9 16
 
1.4%
Other values (449) 795
67.7%
ValueCountFrequency (%)
0 54
4.6%
1 75
6.4%
2 48
4.1%
3 37
3.1%
4 40
3.4%
5 32
2.7%
6 35
3.0%
7 23
 
2.0%
8 20
 
1.7%
9 16
 
1.4%
ValueCountFrequency (%)
248521 1
0.1%
54757 1
0.1%
39620 1
0.1%
39288 1
0.1%
32126 1
0.1%
31578 1
0.1%
24627 1
0.1%
22550 1
0.1%
21328 1
0.1%
17204 1
0.1%

자산군 내 비중(퍼센트)
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct75
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.084748936
Minimum0
Maximum20.04
Zeros685
Zeros (%)58.3%
Negative0
Negative (%)0.0%
Memory size10.5 KiB
2023-12-12T18:03:05.502429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.02
95-th percentile0.28
Maximum20.04
Range20.04
Interquartile range (IQR)0.02

Descriptive statistics

Standard deviation0.64327427
Coefficient of variation (CV)7.5903522
Kurtosis793.16379
Mean0.084748936
Median Absolute Deviation (MAD)0
Skewness26.133541
Sum99.58
Variance0.41380179
MonotonicityDecreasing
2023-12-12T18:03:05.697285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 685
58.3%
0.01 141
 
12.0%
0.02 56
 
4.8%
0.03 41
 
3.5%
0.04 30
 
2.6%
0.05 20
 
1.7%
0.06 20
 
1.7%
0.09 13
 
1.1%
0.07 10
 
0.9%
0.1 10
 
0.9%
Other values (65) 149
 
12.7%
ValueCountFrequency (%)
0.0 685
58.3%
0.01 141
 
12.0%
0.02 56
 
4.8%
0.03 41
 
3.5%
0.04 30
 
2.6%
0.05 20
 
1.7%
0.06 20
 
1.7%
0.07 10
 
0.9%
0.08 7
 
0.6%
0.09 13
 
1.1%
ValueCountFrequency (%)
20.04 1
0.1%
4.42 1
0.1%
3.2 1
0.1%
3.17 1
0.1%
2.59 1
0.1%
2.55 1
0.1%
1.99 1
0.1%
1.82 1
0.1%
1.72 1
0.1%
1.39 1
0.1%

지분율(퍼센트)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct541
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7432851
Minimum0
Maximum13.98
Zeros24
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size10.5 KiB
2023-12-12T18:03:05.872176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.047
Q10.33
median1.16
Q34.77
95-th percentile9.01
Maximum13.98
Range13.98
Interquartile range (IQR)4.44

Descriptive statistics

Standard deviation3.127884
Coefficient of variation (CV)1.1401965
Kurtosis0.57371587
Mean2.7432851
Median Absolute Deviation (MAD)1.04
Skewness1.2199305
Sum3223.36
Variance9.7836585
MonotonicityNot monotonic
2023-12-12T18:03:06.038622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 24
 
2.0%
0.17 17
 
1.4%
0.04 14
 
1.2%
0.15 13
 
1.1%
0.08 11
 
0.9%
0.1 11
 
0.9%
0.12 11
 
0.9%
0.18 11
 
0.9%
0.05 11
 
0.9%
0.25 11
 
0.9%
Other values (531) 1041
88.6%
ValueCountFrequency (%)
0.0 24
2.0%
0.01 6
 
0.5%
0.02 8
 
0.7%
0.03 7
 
0.6%
0.04 14
1.2%
0.05 11
0.9%
0.06 8
 
0.7%
0.07 10
0.9%
0.08 11
0.9%
0.09 10
0.9%
ValueCountFrequency (%)
13.98 1
0.1%
13.6 1
0.1%
13.54 1
0.1%
13.48 1
0.1%
13.26 1
0.1%
13.22 1
0.1%
12.69 1
0.1%
12.56 1
0.1%
12.53 1
0.1%
12.28 1
0.1%

Interactions

2023-12-12T18:03:02.479312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:00.739572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.326308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.861978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:02.630854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:00.878811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.471003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:02.031614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:02.791279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.006289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.603762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:02.176644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:02.943604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.158088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:01.737932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:02.330948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:03:06.146745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호평가액(억 원)자산군 내 비중(퍼센트)지분율(퍼센트)
번호1.0000.1460.1460.824
평가액(억 원)0.1461.0001.0000.171
자산군 내 비중(퍼센트)0.1461.0001.0000.171
지분율(퍼센트)0.8240.1710.1711.000
2023-12-12T18:03:06.263169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호평가액(억 원)자산군 내 비중(퍼센트)지분율(퍼센트)
번호1.000-1.000-0.894-0.931
평가액(억 원)-1.0001.0000.8950.930
자산군 내 비중(퍼센트)-0.8940.8951.0000.830
지분율(퍼센트)-0.9310.9300.8301.000

Missing values

2023-12-12T18:03:03.136283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:03:03.269003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호종목명평가액(억 원)자산군 내 비중(퍼센트)지분율(퍼센트)
01삼성전자24852120.047.53
12LG에너지솔루션547574.425.37
23삼성바이오로직스396203.26.78
34SK하이닉스392883.177.2
45삼성SDI321262.597.9
56LG화학315782.557.46
67NAVER246271.998.46
78현대차225501.826.99
89POSCO홀딩스213281.729.12
910셀트리온172041.397.61
번호종목명평가액(억 원)자산군 내 비중(퍼센트)지분율(퍼센트)
11651166초록뱀헬스케어00.00.03
11661167엘브이엠씨홀딩스00.00.0
11671168LX홀딩스1우00.00.06
11681169인바이오젠00.00.02
11691170신풍00.00.02
11701171인디에프00.00.01
11711172선도전기00.00.01
11721173비케이탑스00.00.02
11731174엠투엔00.00.0
11741175MTRON00.00.0