Overview

Dataset statistics

Number of variables: 3
Number of observations: 249
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.2 KiB
Average record size in memory: 25.5 B

Variable types

Numeric: 1
Categorical: 1
Text: 1

Dataset

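Description: Quarterly updated information on the securities firms that the National Pension Fund trades through, listing the trading brokerages by asset class (domestic and overseas equities, bonds, and short-term funds).
Author: National Pension Service (국민연금공단)
URL: https://www.data.go.kr/data/15005663/fileData.do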

Alerts

번호 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 번호 (High correlation)
번호 has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 08:04:08.042035
Analysis finished: 2024-04-06 08:04:08.813920
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
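
A report of this kind can be regenerated from the source CSV with ydata-profiling. The following is a minimal sketch, not the configuration actually used for this report: the helper name `build_report`, the file paths, the report title, and the default encoding are all assumptions for illustration.

```python
def build_report(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write the result as an HTML report.

    csv_path, html_path and encoding are hypothetical parameters; the
    original report does not state how the data was loaded.
    """
    # Imported lazily so that defining this helper does not require the
    # third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Profiling report")
    profile.to_file(html_path)
    return profile
```

A call such as `build_report("brokerages.csv", "report.html")` (hypothetical file names) would write the HTML report; a CSV downloaded from the URL above may need a different `encoding` value.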

Variables

번호 (row number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 249
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 125
Minimum: 1
Maximum: 249
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 13.4
Q1: 63
Median: 125
Q3: 187
95-th percentile: 236.6
Maximum: 249
Range: 248
Interquartile range (IQR): 124
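
Because 번호 takes the values 1 through 249 exactly once, the quantiles can be checked by hand. The sketch below assumes linear interpolation between neighbouring order statistics (the reported values are consistent with that rule); the helper name `percentile` is illustrative only.

```python
def percentile(sorted_values, q):
    """Percentile with linear interpolation; q is a fraction in [0, 1]."""
    pos = q * (len(sorted_values) - 1)   # fractional index into the sorted data
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


values = list(range(1, 250))             # 번호 runs from 1 to 249

p05 = percentile(values, 0.05)           # about 13.4
q1 = percentile(values, 0.25)            # 63
median = percentile(values, 0.50)        # 125
q3 = percentile(values, 0.75)            # 187
p95 = percentile(values, 0.95)           # about 236.6
iqr = q3 - q1                            # 124
```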

Descriptive statistics

Standard deviation: 72.024301
Coefficient of variation (CV): 0.57619441
Kurtosis: -1.2
Mean: 125
Median Absolute Deviation (MAD): 62
Skewness: 0
Sum: 31125
Variance: 5187.5
Monotonicity: Strictly increasing
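
The descriptive statistics follow directly from the fact that 번호 is the sequence 1 to 249. As a rough check, assuming the report uses the sample (n - 1) variance, the figures can be recomputed with the Python standard library:

```python
import statistics

values = list(range(1, 250))                      # 번호 runs from 1 to 249

total = sum(values)                               # 31125
mean = statistics.mean(values)                    # 125
variance = statistics.variance(values)            # sample variance (n - 1 denominator)
std = statistics.stdev(values)                    # square root of the sample variance
cv = std / mean                                   # coefficient of variation
median = statistics.median(values)                # 125
# Median absolute deviation: median of |x - median(x)|
mad = statistics.median(abs(v - median) for v in values)
```

For the integers 1 to n the sample variance equals n(n + 1)/12, which gives 249 × 250 / 12 = 5187.5, in agreement with the reported value.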
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
1                   1      0.4%
172                 1      0.4%
159                 1      0.4%
160                 1      0.4%
161                 1      0.4%
162                 1      0.4%
163                 1      0.4%
164                 1      0.4%
165                 1      0.4%
166                 1      0.4%
Other values (239)  239    96.0%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.4%
2      1      0.4%
3      1      0.4%
4      1      0.4%
5      1      0.4%
6      1      0.4%
7      1      0.4%
8      1      0.4%
9      1      0.4%
10     1      0.4%

Maximum 10 values

Value  Count  Frequency (%)
249    1      0.4%
248    1      0.4%
247    1      0.4%
246    1      0.4%
245    1      0.4%
244    1      0.4%
243    1      0.4%
242    1      0.4%
241    1      0.4%
240    1      0.4%

구분 (asset category)
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 8
Median length: 4
Mean length: 5.3975904
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내주식
2nd row: 국내주식
3rd row: 국내주식
4th row: 국내주식
5th row: 국내주식

Common Values

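Value                                             Count  Frequency (%)
해외채권 (overseas bonds)                         68     27.3%
단기자금(원화) (short-term funds, KRW)            63     25.3%
국내채권 (domestic bonds)                         44     17.7%
국내주식 (domestic equities)                      42     16.9%
단기자금(외화) (short-term funds, foreign currency)  24     9.6%
해외주식 (overseas equities)                      8      3.2%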
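
The frequency column is each category count divided by the 249 observations. A small sketch of that arithmetic, with the counts copied from the table above and the variable names chosen only for illustration:

```python
# Category counts as listed under Common Values
counts = {
    "해외채권": 68,
    "단기자금(원화)": 63,
    "국내채권": 44,
    "국내주식": 42,
    "단기자금(외화)": 24,
    "해외주식": 8,
}

n_rows = sum(counts.values())   # should equal the number of observations
shares = {name: round(100 * c / n_rows, 1) for name, c in counts.items()}
```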

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the 구분 category counts listed under Common Values.]

거래증권사명 (trading brokerage name)
Text

Distinct: 169
Distinct (%): 67.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 40
Median length: 30
Mean length: 8.3574297
Min length: 3

Characters and Unicode

Total characters: 2081
Distinct characters: 178
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
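
As an illustration of such character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories over four brokerage names taken from the sample rows of this report; the helper name `category_counts` is an assumption, and this is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count Unicode general categories over all characters in texts."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


# Names taken from the sample rows of the report
sample = ["골드만삭스증권", "NH투자증권", "TD Securities", "US Bancorp"]
counts = category_counts(sample)

# General-category codes and the labels used in this report:
#   Lo = Other Letter (Hangul syllables), Lu = Uppercase Letter,
#   Ll = Lowercase Letter, Zs = Space Separator
```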

Unique

Unique: 118
Unique (%): 47.4%

Sample

1st row: 골드만삭스증권
2nd row: 교보증권
3rd row: 노무라금융투자
4th row: 다올투자증권
5th row: 다이와증권

Words

Value               Count  Frequency (%)
bank                16     4.8%
china               5      1.5%
of                  5      1.5%
morgan              4      1.2%
markets             4      1.2%
capital             4      1.2%
nh투자증권          4      1.2%
삼성증권            4      1.2%
한국투자증권        4      1.2%
co                  3      0.9%
Other values (197)  283    84.2%

Most occurring characters

Value               Count  Frequency (%)
[not recovered]     117    5.6%
[not recovered]     115    5.5%
a                   92     4.4%
(space)             87     4.2%
B                   72     3.5%
n                   69     3.3%
e                   62     3.0%
r                   53     2.5%
i                   52     2.5%
[not recovered]     50     2.4%
Other values (168)  1312   63.0%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       953    45.8%
Lowercase Letter   627    30.1%
Uppercase Letter   403    19.4%
Space Separator    87     4.2%
Other Punctuation  6      0.3%
Dash Punctuation   3      0.1%
Open Punctuation   1      < 0.1%
Close Punctuation  1      < 0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
[not recovered]     117    12.3%
[not recovered]     115    12.1%
[not recovered]     50     5.2%
[not recovered]     47     4.9%
[not recovered]     41     4.3%
[not recovered]     41     4.3%
[not recovered]     32     3.4%
[not recovered]     27     2.8%
[not recovered]     23     2.4%
[not recovered]     17     1.8%
Other values (113)  443    46.5%

Lowercase Letter
Value              Count  Frequency (%)
a                  92     14.7%
n                  69     11.0%
e                  62     9.9%
r                  53     8.5%
i                  52     8.3%
t                  49     7.8%
o                  45     7.2%
s                  29     4.6%
l                  25     4.0%
u                  23     3.7%
Other values (14)  128    20.4%

Uppercase Letter
Value              Count  Frequency (%)
B                  72     17.9%
C                  42     10.4%
S                  40     9.9%
N                  33     8.2%
K                  26     6.5%
A                  25     6.2%
I                  24     6.0%
M                  15     3.7%
D                  15     3.7%
E                  14     3.5%
Other values (14)  97     24.1%

Other Punctuation
Value  Count  Frequency (%)
.      3      50.0%
&      2      33.3%
/      1      16.7%

Space Separator
Value    Count  Frequency (%)
(space)  87     100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      3      100.0%

Open Punctuation
Value  Count  Frequency (%)
(      1      100.0%

Close Punctuation
Value  Count  Frequency (%)
)      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   1030   49.5%
Hangul  953    45.8%
Common  98     4.7%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[not recovered]     117    12.3%
[not recovered]     115    12.1%
[not recovered]     50     5.2%
[not recovered]     47     4.9%
[not recovered]     41     4.3%
[not recovered]     41     4.3%
[not recovered]     32     3.4%
[not recovered]     27     2.8%
[not recovered]     23     2.4%
[not recovered]     17     1.8%
Other values (113)  443    46.5%

Latin
Value              Count  Frequency (%)
a                  92     8.9%
B                  72     7.0%
n                  69     6.7%
e                  62     6.0%
r                  53     5.1%
i                  52     5.0%
t                  49     4.8%
o                  45     4.4%
C                  42     4.1%
S                  40     3.9%
Other values (38)  454    44.1%

Common
Value    Count  Frequency (%)
(space)  87     88.8%
-        3      3.1%
.        3      3.1%
&        2      2.0%
(        1      1.0%
/        1      1.0%
)        1      1.0%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   1128   54.2%
Hangul  953    45.8%

Most frequent character per block

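Hangul
Value               Count  Frequency (%)
[not recovered]     117    12.3%
[not recovered]     115    12.1%
[not recovered]     50     5.2%
[not recovered]     47     4.9%
[not recovered]     41     4.3%
[not recovered]     41     4.3%
[not recovered]     32     3.4%
[not recovered]     27     2.8%
[not recovered]     23     2.4%
[not recovered]     17     1.8%
Other values (113)  443    46.5%

ASCII
Value              Count  Frequency (%)
a                  92     8.2%
(space)            87     7.7%
B                  72     6.4%
n                  69     6.1%
e                  62     5.5%
r                  53     4.7%
i                  52     4.6%
t                  49     4.3%
o                  45     4.0%
C                  42     3.7%
Other values (45)  505    44.8%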

Interactions


Correlations

      번호   구분
번호  1.000  0.929
구분  0.929  1.000

      번호   구분
번호  1.000  0.813
구분  0.813  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index)  번호  구분      거래증권사명
0        1     국내주식  골드만삭스증권
1        2     국내주식  교보증권
2        3     국내주식  노무라금융투자
3        4     국내주식  다올투자증권
4        5     국내주식  다이와증권
5        6     국내주식  대신증권
6        7     국내주식  디에스투자증권
7        8     국내주식  리딩투자증권
8        9     국내주식  맥쿼리증권
9        10    국내주식  메리츠증권

Last rows

(index)  번호  구분      거래증권사명
239      240   해외채권  SVENSKA HANDELSBANKEN
240      241   해외채권  TD Securities
241      242   해외채권  Tradition
242      243   해외채권  UBS
243      244   해외채권  UniCredit
244      245   해외채권  US Bancorp
245      246   해외채권  Wells Fargo
246      247   해외채권  Westpac Banking
247      248   해외채권  신한금융투자
248      249   해외채권  NH투자증권