Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 10062 |
Missing cells (%) | 14.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 654.3 KiB |
Average record size in memory | 67.0 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
Text | 3 |
Unsupported | 1 |
Dataset
Description | 한국주택금융공사에서 발행한 데이터 입니다. 순서,자료유형,등록번호,서명,저자,출판사,출판년도 칼럼이 포함되어있으며 관련값이 있습니다. |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/3071759/fileData.do |
순서 is highly overall correlated with 등록번호 | High correlation |
등록번호 is highly overall correlated with 순서 | High correlation |
자료유형 is highly imbalanced (60.8%) | Imbalance |
Unnamed: 6 has 10000 (100.0%) missing values | Missing |
순서 has unique values | Unique |
등록번호 has unique values | Unique |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 18:15:43.294959 |
---|---|
Analysis finished | 2023-12-12 18:15:46.105634 |
Duration | 2.81 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순서
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13582.494 |
Minimum | 2 |
---|---|
Maximum | 27353 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 1377.95 |
Q1 | 6703.5 |
median | 13468.5 |
Q3 | 20226.5 |
95-th percentile | 26080.05 |
Maximum | 27353 |
Range | 27351 |
Interquartile range (IQR) | 13523 |
Descriptive statistics
Standard deviation | 7933.8622 |
---|---|
Coefficient of variation (CV) | 0.5841241 |
Kurtosis | -1.1998084 |
Mean | 13582.494 |
Median Absolute Deviation (MAD) | 6763.5 |
Skewness | 0.028562979 |
Sum | 1.3582494 × 108 |
Variance | 62946170 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6473 | 1 | < 0.1% |
18265 | 1 | < 0.1% |
19115 | 1 | < 0.1% |
17138 | 1 | < 0.1% |
27302 | 1 | < 0.1% |
6127 | 1 | < 0.1% |
13426 | 1 | < 0.1% |
10768 | 1 | < 0.1% |
24508 | 1 | < 0.1% |
25584 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
2 | 1 | |
9 | 1 | |
12 | 1 | |
13 | 1 | |
16 | 1 | |
18 | 1 | |
21 | 1 | |
22 | 1 | |
23 | 1 | |
26 | 1 |
Value | Count | Frequency (%) |
27353 | 1 | |
27352 | 1 | |
27343 | 1 | |
27342 | 1 | |
27341 | 1 | |
27337 | 1 | |
27335 | 1 | |
27328 | 1 | |
27326 | 1 | |
27315 | 1 |
자료유형
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
국내서 | |
---|---|
연구보고서 | |
국외서 | 417 |
(비)일반 | 218 |
학위논문 | 17 |
Length
Max length | 8 |
---|---|
Median length | 3 |
Mean length | 3.3448 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 국내서 |
---|---|
2nd row | 국내서 |
3rd row | 연구보고서 |
4th row | 연구보고서 |
5th row | 국내서 |
Common Values
Value | Count | Frequency (%) |
국내서 | 7852 | |
연구보고서 | 1495 | 14.9% |
국외서 | 417 | 4.2% |
(비)일반 | 218 | 2.2% |
학위논문 | 17 | 0.2% |
(연)국내연간물 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
국내서 | 7852 | |
연구보고서 | 1495 | 14.9% |
국외서 | 417 | 4.2% |
비)일반 | 218 | 2.2% |
학위논문 | 17 | 0.2% |
연)국내연간물 | 1 | < 0.1% |
등록번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14291.106 |
Minimum | 2 |
---|---|
Maximum | 28274 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 1574.95 |
Q1 | 7342.5 |
median | 14281.5 |
Q3 | 21101.5 |
95-th percentile | 26997.05 |
Maximum | 28274 |
Range | 28272 |
Interquartile range (IQR) | 13759 |
Descriptive statistics
Standard deviation | 8144.6942 |
---|---|
Coefficient of variation (CV) | 0.5699135 |
Kurtosis | -1.1843254 |
Mean | 14291.106 |
Median Absolute Deviation (MAD) | 6888 |
Skewness | -0.0069491107 |
Sum | 1.4291106 × 108 |
Variance | 66336044 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7110 | 1 | < 0.1% |
19129 | 1 | < 0.1% |
19987 | 1 | < 0.1% |
17995 | 1 | < 0.1% |
28223 | 1 | < 0.1% |
6713 | 1 | < 0.1% |
14239 | 1 | < 0.1% |
11531 | 1 | < 0.1% |
25416 | 1 | < 0.1% |
26498 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
2 | 1 | |
9 | 1 | |
12 | 1 | |
13 | 1 | |
16 | 1 | |
18 | 1 | |
21 | 1 | |
22 | 1 | |
23 | 1 | |
26 | 1 |
Value | Count | Frequency (%) |
28274 | 1 | |
28273 | 1 | |
28264 | 1 | |
28263 | 1 | |
28262 | 1 | |
28258 | 1 | |
28256 | 1 | |
28249 | 1 | |
28247 | 1 | |
28236 | 1 |
서명
Text
Distinct | 9203 |
---|---|
Distinct (%) | 92.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 226 |
---|---|
Median length | 138 |
Mean length | 24.4648 |
Min length | 1 |
Characters and Unicode
Total characters | 244648 |
---|---|
Distinct characters | 1571 |
Distinct categories | 15 ? |
Distinct scripts | 6 ? |
Distinct blocks | 13 ? |
Unique
Unique | 8783 ? |
---|---|
Unique (%) | 87.8% |
Sample
1st row | 대제국 고구려.1:광개토대제비의 위용 |
---|---|
2nd row | (온쪽이 하예린의)내가 만난 파리 |
3rd row | 맥쿼리 그룹(Macquarie Group)의 성장과정과 전략적 시사점 |
4th row | (2007 통신연수)금융경제.1 |
5th row | (기발한 시골 양반 라 만차의)돈끼호떼.1 |
Value | Count | Frequency (%) |
the | 405 | 0.9% |
of | 380 | 0.8% |
및 | 366 | 0.8% |
위한 | 338 | 0.7% |
연구 | 294 | 0.6% |
and | 286 | 0.6% |
장편소설 | 176 | 0.4% |
167 | 0.4% | |
관한 | 160 | 0.3% |
for | 157 | 0.3% |
Other values (21504) | 44254 |
Most occurring characters
Value | Count | Frequency (%) |
36996 | 15.1% | |
e | 5608 | 2.3% |
n | 4546 | 1.9% |
a | 4402 | 1.8% |
i | 4173 | 1.7% |
o | 4040 | 1.7% |
의 | 3999 | 1.6% |
t | 3851 | 1.6% |
: | 3583 | 1.5% |
r | 3214 | 1.3% |
Other values (1561) | 170236 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 129176 | |
Lowercase Letter | 47747 | 19.5% |
Space Separator | 36996 | 15.1% |
Uppercase Letter | 9652 | 3.9% |
Decimal Number | 7522 | 3.1% |
Other Punctuation | 6692 | 2.7% |
Open Punctuation | 2677 | 1.1% |
Close Punctuation | 2675 | 1.1% |
Math Symbol | 982 | 0.4% |
Dash Punctuation | 417 | 0.2% |
Other values (5) | 112 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 3999 | 3.1% |
한 | 2293 | 1.8% |
이 | 2256 | 1.7% |
기 | 2245 | 1.7% |
사 | 1981 | 1.5% |
는 | 1834 | 1.4% |
제 | 1781 | 1.4% |
가 | 1555 | 1.2% |
리 | 1542 | 1.2% |
지 | 1461 | 1.1% |
Other values (1449) | 108229 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 5608 | |
n | 4546 | |
a | 4402 | 9.2% |
i | 4173 | 8.7% |
o | 4040 | 8.5% |
t | 3851 | 8.1% |
r | 3214 | 6.7% |
s | 2879 | 6.0% |
c | 2040 | 4.3% |
l | 1874 | 3.9% |
Other values (16) | 11120 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 899 | 9.3% |
T | 786 | 8.1% |
E | 762 | 7.9% |
A | 753 | 7.8% |
C | 745 | 7.7% |
P | 569 | 5.9% |
I | 562 | 5.8% |
M | 533 | 5.5% |
D | 492 | 5.1% |
R | 474 | 4.9% |
Other values (16) | 3077 |
Other Punctuation
Value | Count | Frequency (%) |
: | 3583 | |
. | 1262 | 18.9% |
, | 1005 | 15.0% |
· | 198 | 3.0% |
/ | 136 | 2.0% |
& | 104 | 1.6% |
' | 102 | 1.5% |
; | 73 | 1.1% |
! | 72 | 1.1% |
' | 54 | 0.8% |
Other values (10) | 103 | 1.5% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2177 | |
2 | 1593 | |
1 | 1575 | |
3 | 503 | 6.7% |
5 | 364 | 4.8% |
4 | 291 | 3.9% |
9 | 286 | 3.8% |
7 | 276 | 3.7% |
6 | 243 | 3.2% |
8 | 214 | 2.8% |
Math Symbol
Value | Count | Frequency (%) |
= | 905 | |
+ | 40 | 4.1% |
~ | 27 | 2.7% |
> | 4 | 0.4% |
< | 4 | 0.4% |
∼ | 2 | 0.2% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 40 | |
Ⅰ | 27 | |
Ⅲ | 13 | 13.3% |
Ⅳ | 11 | 11.2% |
Ⅴ | 5 | 5.1% |
Ⅵ | 2 | 2.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2643 | |
[ | 30 | 1.1% |
「 | 2 | 0.1% |
『 | 1 | < 0.1% |
〈 | 1 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2641 | |
] | 30 | 1.1% |
」 | 2 | 0.1% |
』 | 1 | < 0.1% |
〉 | 1 | < 0.1% |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 6 | |
★ | 1 | 14.3% |
Modifier Symbol
Value | Count | Frequency (%) |
˚ | 2 | |
˙ | 1 |
Space Separator
Value | Count | Frequency (%) |
36996 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 417 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 2 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 127583 | |
Common | 57969 | |
Latin | 57497 | |
Han | 1577 | 0.6% |
Hiragana | 15 | < 0.1% |
Katakana | 7 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 3999 | 3.1% |
한 | 2293 | 1.8% |
이 | 2256 | 1.8% |
기 | 2245 | 1.8% |
사 | 1981 | 1.6% |
는 | 1834 | 1.4% |
제 | 1781 | 1.4% |
가 | 1555 | 1.2% |
리 | 1542 | 1.2% |
지 | 1461 | 1.1% |
Other values (1094) | 106636 |
Han
Value | Count | Frequency (%) |
法 | 119 | 7.5% |
大 | 50 | 3.2% |
民 | 37 | 2.3% |
國 | 36 | 2.3% |
事 | 35 | 2.2% |
新 | 25 | 1.6% |
論 | 23 | 1.5% |
小 | 22 | 1.4% |
典 | 20 | 1.3% |
說 | 19 | 1.2% |
Other values (329) | 1191 |
Latin
Value | Count | Frequency (%) |
e | 5608 | 9.8% |
n | 4546 | 7.9% |
a | 4402 | 7.7% |
i | 4173 | 7.3% |
o | 4040 | 7.0% |
t | 3851 | 6.7% |
r | 3214 | 5.6% |
s | 2879 | 5.0% |
c | 2040 | 3.5% |
l | 1874 | 3.3% |
Other values (48) | 20870 |
Common
Value | Count | Frequency (%) |
36996 | ||
: | 3583 | 6.2% |
( | 2643 | 4.6% |
) | 2641 | 4.6% |
0 | 2177 | 3.8% |
2 | 1593 | 2.7% |
1 | 1575 | 2.7% |
. | 1262 | 2.2% |
, | 1005 | 1.7% |
= | 905 | 1.6% |
Other values (43) | 3589 | 6.2% |
Hiragana
Value | Count | Frequency (%) |
か | 3 | |
の | 2 | |
た | 2 | |
は | 2 | |
け | 1 | 6.7% |
ら | 1 | 6.7% |
を | 1 | 6.7% |
し | 1 | 6.7% |
て | 1 | 6.7% |
き | 1 | 6.7% |
Katakana
Value | Count | Frequency (%) |
マ | 1 | |
ン | 1 | |
シ | 1 | |
ョ | 1 | |
リ | 1 | |
ッ | 1 | |
ク | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 127548 | |
ASCII | 115046 | |
CJK | 1515 | 0.6% |
None | 317 | 0.1% |
Number Forms | 98 | < 0.1% |
CJK Compat Ideographs | 62 | < 0.1% |
Compat Jamo | 29 | < 0.1% |
Hiragana | 15 | < 0.1% |
Katakana | 7 | < 0.1% |
Punctuation | 5 | < 0.1% |
Other values (3) | 6 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
36996 | ||
e | 5608 | 4.9% |
n | 4546 | 4.0% |
a | 4402 | 3.8% |
i | 4173 | 3.6% |
o | 4040 | 3.5% |
t | 3851 | 3.3% |
: | 3583 | 3.1% |
r | 3214 | 2.8% |
s | 2879 | 2.5% |
Other values (75) | 41754 |
Hangul
Value | Count | Frequency (%) |
의 | 3999 | 3.1% |
한 | 2293 | 1.8% |
이 | 2256 | 1.8% |
기 | 2245 | 1.8% |
사 | 1981 | 1.6% |
는 | 1834 | 1.4% |
제 | 1781 | 1.4% |
가 | 1555 | 1.2% |
리 | 1542 | 1.2% |
지 | 1461 | 1.1% |
Other values (1092) | 106601 |
None
Value | Count | Frequency (%) |
· | 198 | |
' | 54 | 17.0% |
? | 34 | 10.7% |
& | 11 | 3.5% |
㈜ | 6 | 1.9% |
% | 4 | 1.3% |
」 | 2 | 0.6% |
「 | 2 | 0.6% |
、 | 1 | 0.3% |
。 | 1 | 0.3% |
Other values (4) | 4 | 1.3% |
CJK
Value | Count | Frequency (%) |
法 | 119 | 7.9% |
大 | 50 | 3.3% |
民 | 37 | 2.4% |
國 | 36 | 2.4% |
事 | 35 | 2.3% |
新 | 25 | 1.7% |
論 | 23 | 1.5% |
小 | 22 | 1.5% |
典 | 20 | 1.3% |
說 | 19 | 1.3% |
Other values (318) | 1129 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 40 | |
Ⅰ | 27 | |
Ⅲ | 13 | 13.3% |
Ⅳ | 11 | 11.2% |
Ⅴ | 5 | 5.1% |
Ⅵ | 2 | 2.0% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 29 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
宅 | 17 | |
勞 | 13 | |
金 | 13 | |
理 | 8 | |
年 | 4 | 6.5% |
列 | 2 | 3.2% |
例 | 1 | 1.6% |
女 | 1 | 1.6% |
不 | 1 | 1.6% |
李 | 1 | 1.6% |
Hiragana
Value | Count | Frequency (%) |
か | 3 | |
の | 2 | |
た | 2 | |
は | 2 | |
け | 1 | 6.7% |
ら | 1 | 6.7% |
を | 1 | 6.7% |
し | 1 | 6.7% |
て | 1 | 6.7% |
き | 1 | 6.7% |
Math Operators
Value | Count | Frequency (%) |
∼ | 2 |
Modifier Letters
Value | Count | Frequency (%) |
˚ | 2 | |
˙ | 1 |
Punctuation
Value | Count | Frequency (%) |
‘ | 2 | |
’ | 2 | |
… | 1 |
Katakana
Value | Count | Frequency (%) |
マ | 1 | |
ン | 1 | |
シ | 1 | |
ョ | 1 | |
リ | 1 | |
ッ | 1 | |
ク | 1 |
Misc Symbols
Value | Count | Frequency (%) |
★ | 1 |
저자
Text
Distinct | 6318 |
---|---|
Distinct (%) | 63.4% |
Missing | 32 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
대외경제정책연구원 | 216 | 1.6% |
한국금융연수원 | 162 | 1.2% |
한국금융연구원 | 152 | 1.1% |
한국은행 | 101 | 0.7% |
법제처 | 69 | 0.5% |
삼성경제연구소 | 60 | 0.4% |
j | 58 | 0.4% |
한국경제연구원 | 55 | 0.4% |
edited | 54 | 0.4% |
지음 | 46 | 0.3% |
Other values (7531) | 12890 |
Most occurring characters
Value | Count | Frequency (%) |
3898 | 5.5% | |
, | 3785 | 5.3% |
a | 2167 | 3.1% |
e | 2136 | 3.0% |
n | 1752 | 2.5% |
r | 1563 | 2.2% |
i | 1495 | 2.1% |
o | 1435 | 2.0% |
원 | 1291 | 1.8% |
이 | 1279 | 1.8% |
Other values (971) | 50147 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 40055 | |
Lowercase Letter | 19113 | |
Other Punctuation | 4466 | 6.3% |
Space Separator | 3898 | 5.5% |
Uppercase Letter | 3125 | 4.4% |
Close Punctuation | 111 | 0.2% |
Open Punctuation | 97 | 0.1% |
Dash Punctuation | 46 | 0.1% |
Decimal Number | 20 | < 0.1% |
Math Symbol | 16 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
원 | 1291 | 3.2% |
이 | 1279 | 3.2% |
연 | 1201 | 3.0% |
김 | 1097 | 2.7% |
한 | 1083 | 2.7% |
정 | 1006 | 2.5% |
구 | 984 | 2.5% |
국 | 963 | 2.4% |
경 | 734 | 1.8% |
제 | 623 | 1.6% |
Other values (893) | 29794 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 2167 | |
e | 2136 | |
n | 1752 | 9.2% |
r | 1563 | 8.2% |
i | 1495 | 7.8% |
o | 1435 | 7.5% |
l | 1006 | 5.3% |
s | 946 | 4.9% |
t | 871 | 4.6% |
h | 721 | 3.8% |
Other values (17) | 5021 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 272 | 8.7% |
C | 261 | 8.4% |
M | 218 | 7.0% |
B | 210 | 6.7% |
K | 205 | 6.6% |
R | 191 | 6.1% |
J | 188 | 6.0% |
H | 171 | 5.5% |
A | 162 | 5.2% |
D | 153 | 4.9% |
Other values (17) | 1094 |
Other Punctuation
Value | Count | Frequency (%) |
, | 3785 | |
. | 657 | 14.7% |
& | 10 | 0.2% |
' | 5 | 0.1% |
; | 3 | 0.1% |
: | 3 | 0.1% |
· | 2 | < 0.1% |
& | 1 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 7 | |
6 | 6 | |
1 | 3 | |
2 | 2 | 10.0% |
4 | 1 | 5.0% |
5 | 1 | 5.0% |
Math Symbol
Value | Count | Frequency (%) |
= | 8 | |
> | 5 | |
< | 3 | 18.8% |
Close Punctuation
Value | Count | Frequency (%) |
] | 97 | |
) | 14 | 12.6% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 83 | |
( | 14 | 14.4% |
Space Separator
Value | Count | Frequency (%) |
3898 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 46 |
Other Symbol
Value | Count | Frequency (%) |
ⓔ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 39543 | |
Latin | 22238 | |
Common | 8655 | 12.2% |
Han | 493 | 0.7% |
Katakana | 12 | < 0.1% |
Hiragana | 7 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
원 | 1291 | 3.3% |
이 | 1279 | 3.2% |
연 | 1201 | 3.0% |
김 | 1097 | 2.8% |
한 | 1083 | 2.7% |
정 | 1006 | 2.5% |
구 | 984 | 2.5% |
국 | 963 | 2.4% |
경 | 734 | 1.9% |
제 | 623 | 1.6% |
Other values (678) | 29282 |
Han
Value | Count | Frequency (%) |
潤 | 18 | 3.7% |
會 | 15 | 3.0% |
直 | 15 | 3.0% |
郭 | 14 | 2.8% |
協 | 13 | 2.6% |
韓 | 13 | 2.6% |
洙 | 11 | 2.2% |
鄭 | 9 | 1.8% |
李 | 9 | 1.8% |
宅 | 8 | 1.6% |
Other values (187) | 368 |
Latin
Value | Count | Frequency (%) |
a | 2167 | 9.7% |
e | 2136 | 9.6% |
n | 1752 | 7.9% |
r | 1563 | 7.0% |
i | 1495 | 6.7% |
o | 1435 | 6.5% |
l | 1006 | 4.5% |
s | 946 | 4.3% |
t | 871 | 3.9% |
h | 721 | 3.2% |
Other values (44) | 8146 |
Common
Value | Count | Frequency (%) |
3898 | ||
, | 3785 | |
. | 657 | 7.6% |
] | 97 | 1.1% |
[ | 83 | 1.0% |
- | 46 | 0.5% |
( | 14 | 0.2% |
) | 14 | 0.2% |
& | 10 | 0.1% |
= | 8 | 0.1% |
Other values (14) | 43 | 0.5% |
Katakana
Value | Count | Frequency (%) |
ム | 2 | |
キ | 1 | |
ル | 1 | |
ハ | 1 | |
ラ | 1 | |
カ | 1 | |
ミ | 1 | |
コ | 1 | |
ヨ | 1 | |
レ | 1 |
Hiragana
Value | Count | Frequency (%) |
お | 1 | |
も | 1 | |
し | 1 | |
ろ | 1 | |
え | 1 | |
な | 1 | |
か | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 39543 | |
ASCII | 30882 | |
CJK | 467 | 0.7% |
CJK Compat Ideographs | 26 | < 0.1% |
Katakana | 12 | < 0.1% |
None | 10 | < 0.1% |
Hiragana | 7 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3898 | 12.6% | |
, | 3785 | 12.3% |
a | 2167 | 7.0% |
e | 2136 | 6.9% |
n | 1752 | 5.7% |
r | 1563 | 5.1% |
i | 1495 | 4.8% |
o | 1435 | 4.6% |
l | 1006 | 3.3% |
s | 946 | 3.1% |
Other values (62) | 10699 |
Hangul
Value | Count | Frequency (%) |
원 | 1291 | 3.3% |
이 | 1279 | 3.2% |
연 | 1201 | 3.0% |
김 | 1097 | 2.8% |
한 | 1083 | 2.7% |
정 | 1006 | 2.5% |
구 | 984 | 2.5% |
국 | 963 | 2.4% |
경 | 734 | 1.9% |
제 | 623 | 1.6% |
Other values (678) | 29282 |
CJK
Value | Count | Frequency (%) |
潤 | 18 | 3.9% |
會 | 15 | 3.2% |
直 | 15 | 3.2% |
郭 | 14 | 3.0% |
協 | 13 | 2.8% |
韓 | 13 | 2.8% |
洙 | 11 | 2.4% |
鄭 | 9 | 1.9% |
說 | 8 | 1.7% |
朴 | 8 | 1.7% |
Other values (178) | 343 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 9 | |
宅 | 8 | |
金 | 2 | 7.7% |
立 | 2 | 7.7% |
林 | 1 | 3.8% |
隆 | 1 | 3.8% |
梁 | 1 | 3.8% |
綾 | 1 | 3.8% |
寧 | 1 | 3.8% |
None
Value | Count | Frequency (%) |
' | 5 | |
· | 2 | 20.0% |
æ | 1 | 10.0% |
& | 1 | 10.0% |
Ø | 1 | 10.0% |
Katakana
Value | Count | Frequency (%) |
ム | 2 | |
キ | 1 | |
ル | 1 | |
ハ | 1 | |
ラ | 1 | |
カ | 1 | |
ミ | 1 | |
コ | 1 | |
ヨ | 1 | |
レ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓔ | 1 |
Hiragana
Value | Count | Frequency (%) |
お | 1 | |
も | 1 | |
し | 1 | |
ろ | 1 | |
え | 1 | |
な | 1 | |
か | 1 |
출판사
Text
Distinct | 2247 |
---|---|
Distinct (%) | 22.5% |
Missing | 30 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
대외경제정책연구원 | 264 | 2.4% |
한국금융연수원 | 264 | 2.4% |
한국금융연구원 | 261 | 2.4% |
국토연구원 | 181 | 1.7% |
한국개발연구원 | 172 | 1.6% |
한국행정연구원 | 150 | 1.4% |
한국경제연구원 | 147 | 1.4% |
민음사 | 142 | 1.3% |
김영사 | 131 | 1.2% |
한국은행 | 122 | 1.1% |
Other values (2340) | 8979 |
Most occurring characters
Value | Count | Frequency (%) |
원 | 2186 | 3.9% |
연 | 2143 | 3.8% |
한 | 1974 | 3.5% |
국 | 1914 | 3.4% |
사 | 1785 | 3.2% |
구 | 1773 | 3.2% |
스 | 1340 | 2.4% |
경 | 967 | 1.7% |
제 | 927 | 1.6% |
843 | 1.5% | |
Other values (696) | 40355 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 46014 | |
Lowercase Letter | 6369 | 11.3% |
Uppercase Letter | 2276 | 4.0% |
Space Separator | 843 | 1.5% |
Decimal Number | 339 | 0.6% |
Other Punctuation | 160 | 0.3% |
Open Punctuation | 74 | 0.1% |
Close Punctuation | 74 | 0.1% |
Dash Punctuation | 57 | 0.1% |
Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
원 | 2186 | 4.8% |
연 | 2143 | 4.7% |
한 | 1974 | 4.3% |
국 | 1914 | 4.2% |
사 | 1785 | 3.9% |
구 | 1773 | 3.9% |
스 | 1340 | 2.9% |
경 | 967 | 2.1% |
제 | 927 | 2.0% |
정 | 760 | 1.7% |
Other values (617) | 30245 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 704 | |
o | 616 | |
n | 592 | |
i | 541 | 8.5% |
a | 521 | 8.2% |
s | 504 | 7.9% |
r | 477 | 7.5% |
l | 367 | 5.8% |
t | 306 | 4.8% |
c | 273 | 4.3% |
Other values (15) | 1468 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 217 | 9.5% |
B | 211 | 9.3% |
P | 178 | 7.8% |
M | 166 | 7.3% |
O | 140 | 6.2% |
H | 128 | 5.6% |
C | 125 | 5.5% |
I | 117 | 5.1% |
A | 114 | 5.0% |
E | 107 | 4.7% |
Other values (15) | 773 |
Other Punctuation
Value | Count | Frequency (%) |
& | 74 | |
, | 32 | |
. | 21 | 13.1% |
/ | 12 | 7.5% |
; | 4 | 2.5% |
: | 4 | 2.5% |
' | 3 | 1.9% |
' | 3 | 1.9% |
@ | 2 | 1.2% |
· | 2 | 1.2% |
Other values (2) | 3 | 1.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 152 | |
2 | 138 | |
0 | 11 | 3.2% |
3 | 9 | 2.7% |
4 | 9 | 2.7% |
8 | 6 | 1.8% |
9 | 5 | 1.5% |
5 | 4 | 1.2% |
6 | 3 | 0.9% |
7 | 2 | 0.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 64 | |
[ | 10 | 13.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 64 | |
] | 10 | 13.5% |
Space Separator
Value | Count | Frequency (%) |
843 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 57 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 45501 | |
Latin | 8645 | 15.4% |
Common | 1548 | 2.8% |
Han | 513 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
원 | 2186 | 4.8% |
연 | 2143 | 4.7% |
한 | 1974 | 4.3% |
국 | 1914 | 4.2% |
사 | 1785 | 3.9% |
구 | 1773 | 3.9% |
스 | 1340 | 2.9% |
경 | 967 | 2.1% |
제 | 927 | 2.0% |
정 | 760 | 1.7% |
Other values (538) | 29732 |
Han
Value | Count | Frequency (%) |
社 | 79 | 15.4% |
英 | 35 | 6.8% |
法 | 33 | 6.4% |
文 | 32 | 6.2% |
博 | 29 | 5.7% |
韓 | 20 | 3.9% |
會 | 20 | 3.9% |
協 | 12 | 2.3% |
國 | 12 | 2.3% |
住 | 11 | 2.1% |
Other values (69) | 230 |
Latin
Value | Count | Frequency (%) |
e | 704 | 8.1% |
o | 616 | 7.1% |
n | 592 | 6.8% |
i | 541 | 6.3% |
a | 521 | 6.0% |
s | 504 | 5.8% |
r | 477 | 5.5% |
l | 367 | 4.2% |
t | 306 | 3.5% |
c | 273 | 3.2% |
Other values (40) | 3744 |
Common
Value | Count | Frequency (%) |
843 | ||
1 | 152 | 9.8% |
2 | 138 | 8.9% |
& | 74 | 4.8% |
( | 64 | 4.1% |
) | 64 | 4.1% |
- | 57 | 3.7% |
, | 32 | 2.1% |
. | 21 | 1.4% |
/ | 12 | 0.8% |
Other values (19) | 91 | 5.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 45501 | |
ASCII | 10186 | 18.1% |
CJK | 503 | 0.9% |
CJK Compat Ideographs | 10 | < 0.1% |
None | 7 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
원 | 2186 | 4.8% |
연 | 2143 | 4.7% |
한 | 1974 | 4.3% |
국 | 1914 | 4.2% |
사 | 1785 | 3.9% |
구 | 1773 | 3.9% |
스 | 1340 | 2.9% |
경 | 967 | 2.1% |
제 | 927 | 2.0% |
정 | 760 | 1.7% |
Other values (538) | 29732 |
ASCII
Value | Count | Frequency (%) |
843 | 8.3% | |
e | 704 | 6.9% |
o | 616 | 6.0% |
n | 592 | 5.8% |
i | 541 | 5.3% |
a | 521 | 5.1% |
s | 504 | 4.9% |
r | 477 | 4.7% |
l | 367 | 3.6% |
t | 306 | 3.0% |
Other values (66) | 4715 |
CJK
Value | Count | Frequency (%) |
社 | 79 | 15.7% |
英 | 35 | 7.0% |
法 | 33 | 6.6% |
文 | 32 | 6.4% |
博 | 29 | 5.8% |
韓 | 20 | 4.0% |
會 | 20 | 4.0% |
協 | 12 | 2.4% |
國 | 12 | 2.4% |
住 | 11 | 2.2% |
Other values (67) | 220 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
宅 | 8 | |
金 | 2 | 20.0% |
None
Value | Count | Frequency (%) |
' | 3 | |
· | 2 | |
& | 2 |
Unnamed: 6
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
순서 | 자료유형 | 등록번호 | |
---|---|---|---|
순서 | 1.000 | 0.248 | 0.998 |
자료유형 | 0.248 | 1.000 | 0.252 |
등록번호 | 0.998 | 0.252 | 1.000 |
순서 | 등록번호 | 자료유형 | |
---|---|---|---|
순서 | 1.000 | 1.000 | 0.133 |
등록번호 | 1.000 | 1.000 | 0.135 |
자료유형 | 0.133 | 0.135 | 1.000 |
순서 | 자료유형 | 등록번호 | 서명 | 저자 | 출판사 | Unnamed: 6 | |
---|---|---|---|---|---|---|---|
6414 | 6473 | 국내서 | 7110 | 대제국 고구려.1:광개토대제비의 위용 | 유현종 | 굿인포메이션 | <NA> |
26 | 27 | 국내서 | 27 | (온쪽이 하예린의)내가 만난 파리 | 최하예린 | 디자인하우스 | <NA> |
16880 | 17383 | 연구보고서 | 18240 | 맥쿼리 그룹(Macquarie Group)의 성장과정과 전략적 시사점 | 한국금융연구원 | 한국금융연구원 | <NA> |
8964 | 9043 | 연구보고서 | 9789 | (2007 통신연수)금융경제.1 | 한국금융연수원 | 한국금융연수원 | <NA> |
4411 | 4426 | 국내서 | 4836 | (기발한 시골 양반 라 만차의)돈끼호떼.1 | 세르반떼스, 미겔 데 | 창작과 비평사 | <NA> |
12395 | 12672 | 국내서 | 13474 | (2008년도)건물신축단가표 | 한국감정원 | 한국감정원 | <NA> |
24619 | 25831 | 연구보고서 | 26745 | (2016년)하반기 경제전망 | 한국금융연구원 | 한국금융연구원 | <NA> |
20656 | 21277 | 국내서 | 22162 | 아! 아브라함 | 조우철 | 오직말씀 | <NA> |
12885 | 13208 | 국내서 | 14014 | 사회공헌활동백서 | 한국도로공사 | 한국도로공사 | <NA> |
8203 | 8276 | 국내서 | 8978 | (켈러의)경영경제통계학:엑셀의 실전적 활용 | Keller, gerald | Thomson | <NA> |
순서 | 자료유형 | 등록번호 | 서명 | 저자 | 출판사 | Unnamed: 6 | |
---|---|---|---|---|---|---|---|
2877 | 2885 | 국내서 | 3157 | TOEIC 900이나 500이나 미국가면 헤매는 20가지 이유 | 구경서 | 스타일 리더 | <NA> |
1724 | 1729 | 국내서 | 1927 | 변신 이야기.1 | 오비디우스 | 민음사 | <NA> |
4326 | 4341 | 국내서 | 4751 | Business Communication | School, harvard business | Harvard Business School | <NA> |
2571 | 2579 | 국내서 | 2839 | 꿈꾸는 책들의 도시 2 | 뫼르스, 발터 | 들녘 | <NA> |
515 | 516 | 국내서 | 518 | 내부감사 실무전서:기준. 매뉴얼, 체크리스트 | 한국감사협의회 | 한국감사협의회 | <NA> |
19162 | 19748 | 국내서 | 20622 | Romance of the three kingdoms =삼국지 | 나관중 | 다산북스 | <NA> |
1694 | 1699 | 국내서 | 1897 | 밑줄 긋는 남자 | 봉그랑, 카롤린 | 열린책들 | <NA> |
13111 | 13437 | 국내서 | 14250 | 미국 부동산 금융의 대위기:언제 극복할 것인가=(The) Great crisis of real estate & finance of America : When will America overcome | 김일권 | 부연사 | <NA> |
21333 | 22275 | 국내서 | 23167 | (지승호가 묻고 강신주가 답하다)강신주의 맨얼굴의 철학 당당한 인문학 | 강신주 | 시대의창 | <NA> |
5192 | 5217 | 국내서 | 5667 | (한 권으로 끝내는)변액보험 | 김종서 | 미래지식 | <NA> |