Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 556.6 KiB |
Average record size in memory | 57.0 B |
Variable types
Text | 4 |
---|---|
Categorical | 1 |
Numeric | 1 |
Dataset
Description | 방송통신위원회 도서 반출입시스템으로 도서에 대한 요약 제공 |
---|---|
Author | 방송통신위원회 |
URL | https://www.data.go.kr/data/3047123/fileData.do |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
별치 is highly imbalanced (79.0%) | Imbalance |
Reproduction
Analysis started | 2023-12-11 22:47:47.852807 |
---|---|
Analysis finished | 2023-12-11 22:47:49.864966 |
Duration | 2.01 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
청구기호
Text
Distinct | 9996 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 24 |
---|---|
Median length | 23 |
Mean length | 11.7127 |
Min length | 8 |
Characters and Unicode
Total characters | 117127 |
---|---|
Distinct characters | 447 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 9992 ? |
---|---|
Unique (%) | 99.9% |
Sample
1st row | 359.01 이14ㅇ |
---|---|
2nd row | 61B 02-063 |
3rd row | 90Z 94-026 |
4th row | 320.911 사15ㅅ |
5th row | 00A 92-008 |
Value | Count | Frequency (%) |
00a | 524 | 2.2% |
b | 412 | 1.7% |
813.6 | 388 | 1.6% |
61b | 349 | 1.5% |
c.2 | 339 | 1.4% |
v.2 | 319 | 1.4% |
v.1 | 304 | 1.3% |
11a | 301 | 1.3% |
21a | 268 | 1.1% |
37a | 261 | 1.1% |
Other values (6259) | 20140 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 16839 | |
13605 | ||
1 | 11482 | 9.8% |
3 | 9004 | 7.7% |
2 | 8198 | 7.0% |
9 | 6194 | 5.3% |
. | 5557 | 4.7% |
- | 5435 | 4.6% |
6 | 5389 | 4.6% |
8 | 4657 | 4.0% |
Other values (437) | 30767 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 75002 | |
Space Separator | 13605 | 11.6% |
Other Letter | 8867 | 7.6% |
Uppercase Letter | 6606 | 5.6% |
Other Punctuation | 5581 | 4.8% |
Dash Punctuation | 5435 | 4.6% |
Lowercase Letter | 2030 | 1.7% |
Format | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
ㅇ | 714 | 8.1% |
ㅅ | 564 | 6.4% |
김 | 528 | 6.0% |
이 | 485 | 5.5% |
ㄱ | 426 | 4.8% |
ㅎ | 420 | 4.7% |
ㅈ | 389 | 4.4% |
ㅁ | 335 | 3.8% |
ㅂ | 310 | 3.5% |
ㄷ | 275 | 3.1% |
Other values (372) | 4421 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 2712 | |
B | 1446 | |
Z | 999 | 15.1% |
C | 275 | 4.2% |
L | 215 | 3.3% |
J | 176 | 2.7% |
E | 158 | 2.4% |
W | 106 | 1.6% |
D | 91 | 1.4% |
R | 76 | 1.2% |
Other values (16) | 352 | 5.3% |
Lowercase Letter
Value | Count | Frequency (%) |
v | 1317 | |
c | 385 | 19.0% |
m | 66 | 3.3% |
t | 58 | 2.9% |
p | 22 | 1.1% |
a | 20 | 1.0% |
w | 17 | 0.8% |
s | 17 | 0.8% |
n | 16 | 0.8% |
b | 14 | 0.7% |
Other values (13) | 98 | 4.8% |
Decimal Number
Value | Count | Frequency (%) |
0 | 16839 | |
1 | 11482 | |
3 | 9004 | |
2 | 8198 | |
9 | 6194 | 8.3% |
6 | 5389 | 7.2% |
8 | 4657 | 6.2% |
5 | 4487 | 6.0% |
4 | 4406 | 5.9% |
7 | 4346 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 5557 | |
, | 23 | 0.4% |
/ | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
13605 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5435 |
Format
Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 99624 | |
Hangul | 8867 | 7.6% |
Latin | 8636 | 7.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
ㅇ | 714 | 8.1% |
ㅅ | 564 | 6.4% |
김 | 528 | 6.0% |
이 | 485 | 5.5% |
ㄱ | 426 | 4.8% |
ㅎ | 420 | 4.7% |
ㅈ | 389 | 4.4% |
ㅁ | 335 | 3.8% |
ㅂ | 310 | 3.5% |
ㄷ | 275 | 3.1% |
Other values (372) | 4421 |
Latin
Value | Count | Frequency (%) |
A | 2712 | |
B | 1446 | |
v | 1317 | |
Z | 999 | 11.6% |
c | 385 | 4.5% |
C | 275 | 3.2% |
L | 215 | 2.5% |
J | 176 | 2.0% |
E | 158 | 1.8% |
W | 106 | 1.2% |
Other values (39) | 847 | 9.8% |
Common
Value | Count | Frequency (%) |
0 | 16839 | |
13605 | ||
1 | 11482 | |
3 | 9004 | |
2 | 8198 | |
9 | 6194 | 6.2% |
. | 5557 | 5.6% |
- | 5435 | 5.5% |
6 | 5389 | 5.4% |
8 | 4657 | 4.7% |
Other values (6) | 13264 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 108259 | |
Hangul | 4699 | 4.0% |
Compat Jamo | 4168 | 3.6% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 16839 | |
13605 | ||
1 | 11482 | |
3 | 9004 | |
2 | 8198 | 7.6% |
9 | 6194 | 5.7% |
. | 5557 | 5.1% |
- | 5435 | 5.0% |
6 | 5389 | 5.0% |
8 | 4657 | 4.3% |
Other values (54) | 21899 |
Compat Jamo
Value | Count | Frequency (%) |
ㅇ | 714 | |
ㅅ | 564 | |
ㄱ | 426 | |
ㅎ | 420 | |
ㅈ | 389 | |
ㅁ | 335 | |
ㅂ | 310 | |
ㄷ | 275 | 6.6% |
ㄴ | 206 | 4.9% |
ㅌ | 113 | 2.7% |
Other values (9) | 416 |
Hangul
Value | Count | Frequency (%) |
김 | 528 | 11.2% |
이 | 485 | 10.3% |
박 | 185 | 3.9% |
한 | 137 | 2.9% |
최 | 130 | 2.8% |
정 | 128 | 2.7% |
시 | 106 | 2.3% |
아 | 89 | 1.9% |
조 | 81 | 1.7% |
유 | 74 | 1.6% |
Other values (353) | 2756 |
None
Value | Count | Frequency (%) |
| 1 |
서명
Text
Distinct | 8345 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 124 |
---|---|
Median length | 76 |
Mean length | 15.4013 |
Min length | 1 |
Characters and Unicode
Total characters | 154013 |
---|---|
Distinct characters | 1207 |
Distinct categories | 15 ? |
Distinct scripts | 4 ? |
Distinct blocks | 10 ? |
Unique
Unique | 7669 ? |
---|---|
Unique (%) | 76.7% |
Sample
1st row | 아래로부터의 정부개혁 |
---|---|
2nd row | Changes in industrial interdependency betweeb Japan and Korea since 1985 |
3rd row | 환경청정기술개발의 국제적동향파악 및 종합추진전략 방안에 관한 연구 |
4th row | 세계속의 한국경제 |
5th row | 미국 ABC 및 영국 IBA 광고방송 기준 |
Value | Count | Frequency (%) |
연구 | 843 | 2.6% |
및 | 409 | 1.2% |
관한 | 388 | 1.2% |
위한 | 346 | 1.0% |
방안 | 184 | 0.6% |
보고서 | 183 | 0.6% |
방송 | 175 | 0.5% |
미디어 | 162 | 0.5% |
the | 145 | 0.4% |
디지털 | 140 | 0.4% |
Other values (13709) | 30035 |
Most occurring characters
Value | Count | Frequency (%) |
23026 | 15.0% | |
의 | 3317 | 2.2% |
방 | 2959 | 1.9% |
한 | 2046 | 1.3% |
송 | 1871 | 1.2% |
제 | 1864 | 1.2% |
정 | 1726 | 1.1% |
연 | 1659 | 1.1% |
사 | 1633 | 1.1% |
e | 1569 | 1.0% |
Other values (1197) | 112343 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 103138 | |
Space Separator | 23026 | 15.0% |
Lowercase Letter | 14462 | 9.4% |
Decimal Number | 5087 | 3.3% |
Uppercase Letter | 4031 | 2.6% |
Open Punctuation | 1411 | 0.9% |
Close Punctuation | 1411 | 0.9% |
Other Punctuation | 1060 | 0.7% |
Math Symbol | 234 | 0.2% |
Dash Punctuation | 118 | 0.1% |
Other values (5) | 35 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 3317 | 3.2% |
방 | 2959 | 2.9% |
한 | 2046 | 2.0% |
송 | 1871 | 1.8% |
제 | 1864 | 1.8% |
정 | 1726 | 1.7% |
연 | 1659 | 1.6% |
사 | 1633 | 1.6% |
구 | 1545 | 1.5% |
국 | 1390 | 1.3% |
Other values (1101) | 83128 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1569 | |
i | 1435 | |
a | 1331 | |
n | 1329 | |
o | 1295 | 9.0% |
t | 1138 | 7.9% |
s | 916 | 6.3% |
r | 882 | 6.1% |
c | 673 | 4.7% |
d | 585 | 4.0% |
Other values (16) | 3309 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 609 | |
V | 335 | 8.3% |
I | 271 | 6.7% |
C | 257 | 6.4% |
A | 256 | 6.4% |
B | 253 | 6.3% |
S | 239 | 5.9% |
M | 197 | 4.9% |
E | 195 | 4.8% |
K | 162 | 4.0% |
Other values (16) | 1257 |
Other Punctuation
Value | Count | Frequency (%) |
· | 390 | |
, | 309 | |
. | 117 | 11.0% |
? | 70 | 6.6% |
' | 46 | 4.3% |
! | 43 | 4.1% |
& | 34 | 3.2% |
/ | 27 | 2.5% |
: | 8 | 0.8% |
" | 6 | 0.6% |
Other values (6) | 10 | 0.9% |
Decimal Number
Value | Count | Frequency (%) |
0 | 1367 | |
1 | 990 | |
9 | 896 | |
2 | 784 | |
8 | 211 | 4.1% |
7 | 190 | 3.7% |
3 | 190 | 3.7% |
5 | 163 | 3.2% |
6 | 150 | 2.9% |
4 | 146 | 2.9% |
Math Symbol
Value | Count | Frequency (%) |
~ | 216 | |
+ | 10 | 4.3% |
∼ | 5 | 2.1% |
= | 3 | 1.3% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 10 | |
Ⅱ | 10 | |
Ⅲ | 1 | 4.5% |
Ⅳ | 1 | 4.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1409 | |
「 | 2 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1408 | |
」 | 3 | 0.2% |
Space Separator
Value | Count | Frequency (%) |
23026 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 118 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 8 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 2 |
Other Symbol
Value | Count | Frequency (%) |
℃ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 102652 | |
Common | 32360 | 21.0% |
Latin | 18515 | 12.0% |
Han | 486 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 3317 | 3.2% |
방 | 2959 | 2.9% |
한 | 2046 | 2.0% |
송 | 1871 | 1.8% |
제 | 1864 | 1.8% |
정 | 1726 | 1.7% |
연 | 1659 | 1.6% |
사 | 1633 | 1.6% |
구 | 1545 | 1.5% |
국 | 1390 | 1.4% |
Other values (927) | 82642 |
Han
Value | Count | Frequency (%) |
法 | 36 | 7.4% |
國 | 23 | 4.7% |
論 | 17 | 3.5% |
行 | 16 | 3.3% |
新 | 16 | 3.3% |
政 | 15 | 3.1% |
代 | 13 | 2.7% |
義 | 13 | 2.7% |
經 | 12 | 2.5% |
主 | 12 | 2.5% |
Other values (164) | 313 |
Latin
Value | Count | Frequency (%) |
e | 1569 | 8.5% |
i | 1435 | 7.8% |
a | 1331 | 7.2% |
n | 1329 | 7.2% |
o | 1295 | 7.0% |
t | 1138 | 6.1% |
s | 916 | 4.9% |
r | 882 | 4.8% |
c | 673 | 3.6% |
T | 609 | 3.3% |
Other values (46) | 7338 |
Common
Value | Count | Frequency (%) |
23026 | ||
( | 1409 | 4.4% |
) | 1408 | 4.4% |
0 | 1367 | 4.2% |
1 | 990 | 3.1% |
9 | 896 | 2.8% |
2 | 784 | 2.4% |
· | 390 | 1.2% |
, | 309 | 1.0% |
~ | 216 | 0.7% |
Other values (30) | 1565 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 102651 | |
ASCII | 50440 | |
CJK | 476 | 0.3% |
None | 397 | 0.3% |
Number Forms | 22 | < 0.1% |
Punctuation | 10 | < 0.1% |
CJK Compat Ideographs | 10 | < 0.1% |
Math Operators | 5 | < 0.1% |
Letterlike Symbols | 1 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
23026 | ||
e | 1569 | 3.1% |
i | 1435 | 2.8% |
( | 1409 | 2.8% |
) | 1408 | 2.8% |
0 | 1367 | 2.7% |
a | 1331 | 2.6% |
n | 1329 | 2.6% |
o | 1295 | 2.6% |
t | 1138 | 2.3% |
Other values (73) | 15133 |
Hangul
Value | Count | Frequency (%) |
의 | 3317 | 3.2% |
방 | 2959 | 2.9% |
한 | 2046 | 2.0% |
송 | 1871 | 1.8% |
제 | 1864 | 1.8% |
정 | 1726 | 1.7% |
연 | 1659 | 1.6% |
사 | 1633 | 1.6% |
구 | 1545 | 1.5% |
국 | 1390 | 1.4% |
Other values (926) | 82641 |
None
Value | Count | Frequency (%) |
· | 390 | |
」 | 3 | 0.8% |
「 | 2 | 0.5% |
% | 1 | 0.3% |
& | 1 | 0.3% |
CJK
Value | Count | Frequency (%) |
法 | 36 | 7.6% |
國 | 23 | 4.8% |
論 | 17 | 3.6% |
行 | 16 | 3.4% |
新 | 16 | 3.4% |
政 | 15 | 3.2% |
代 | 13 | 2.7% |
義 | 13 | 2.7% |
經 | 12 | 2.5% |
主 | 12 | 2.5% |
Other values (160) | 303 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 10 | |
Ⅱ | 10 | |
Ⅲ | 1 | 4.5% |
Ⅳ | 1 | 4.5% |
Punctuation
Value | Count | Frequency (%) |
’ | 8 | |
… | 2 | 20.0% |
Math Operators
Value | Count | Frequency (%) |
∼ | 5 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
列 | 5 | |
歷 | 2 | 20.0% |
金 | 2 | 20.0% |
略 | 1 | 10.0% |
Letterlike Symbols
Value | Count | Frequency (%) |
℃ | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 |
저자
Text
Distinct | 4687 |
---|---|
Distinct (%) | 46.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
방송위원회 | 526 | 4.1% |
한국언론재단 | 101 | 0.8% |
한국방송학회 | 100 | 0.8% |
한국방송공사 | 97 | 0.8% |
한국언론학회 | 91 | 0.7% |
한국방송광고공사 | 81 | 0.6% |
한국방송개발원 | 61 | 0.5% |
시공사 | 60 | 0.5% |
한국언론연구원 | 59 | 0.5% |
편집부 | 58 | 0.5% |
Other values (5542) | 11454 |
Most occurring characters
Value | Count | Frequency (%) |
2693 | 4.6% | |
회 | 1662 | 2.9% |
원 | 1562 | 2.7% |
송 | 1557 | 2.7% |
방 | 1501 | 2.6% |
국 | 1500 | 2.6% |
한 | 1407 | 2.4% |
, | 1249 | 2.1% |
이 | 1157 | 2.0% |
정 | 916 | 1.6% |
Other values (713) | 43030 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 45896 | |
Lowercase Letter | 6102 | 10.5% |
Space Separator | 2693 | 4.6% |
Uppercase Letter | 1969 | 3.4% |
Other Punctuation | 1386 | 2.4% |
Decimal Number | 89 | 0.2% |
Close Punctuation | 34 | 0.1% |
Open Punctuation | 34 | 0.1% |
Dash Punctuation | 31 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
회 | 1662 | 3.6% |
원 | 1562 | 3.4% |
송 | 1557 | 3.4% |
방 | 1501 | 3.3% |
국 | 1500 | 3.3% |
한 | 1407 | 3.1% |
이 | 1157 | 2.5% |
정 | 916 | 2.0% |
위 | 884 | 1.9% |
김 | 877 | 1.9% |
Other values (647) | 32873 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 712 | |
a | 665 | |
n | 568 | 9.3% |
i | 553 | 9.1% |
o | 474 | 7.8% |
r | 471 | 7.7% |
s | 352 | 5.8% |
t | 311 | 5.1% |
l | 296 | 4.9% |
d | 207 | 3.4% |
Other values (16) | 1493 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 177 | 9.0% |
S | 159 | 8.1% |
T | 158 | 8.0% |
M | 143 | 7.3% |
K | 135 | 6.9% |
C | 121 | 6.1% |
J | 112 | 5.7% |
A | 105 | 5.3% |
N | 98 | 5.0% |
D | 96 | 4.9% |
Other values (15) | 665 |
Decimal Number
Value | Count | Frequency (%) |
2 | 41 | |
1 | 20 | |
0 | 17 | |
3 | 4 | 4.5% |
4 | 4 | 4.5% |
5 | 2 | 2.2% |
6 | 1 | 1.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1249 | |
. | 120 | 8.7% |
· | 9 | 0.6% |
& | 8 | 0.6% |
Space Separator
Value | Count | Frequency (%) |
2693 |
Close Punctuation
Value | Count | Frequency (%) |
) | 34 |
Open Punctuation
Value | Count | Frequency (%) |
( | 34 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 31 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 45896 | |
Latin | 8071 | 13.9% |
Common | 4267 | 7.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
회 | 1662 | 3.6% |
원 | 1562 | 3.4% |
송 | 1557 | 3.4% |
방 | 1501 | 3.3% |
국 | 1500 | 3.3% |
한 | 1407 | 3.1% |
이 | 1157 | 2.5% |
정 | 916 | 2.0% |
위 | 884 | 1.9% |
김 | 877 | 1.9% |
Other values (647) | 32873 |
Latin
Value | Count | Frequency (%) |
e | 712 | 8.8% |
a | 665 | 8.2% |
n | 568 | 7.0% |
i | 553 | 6.9% |
o | 474 | 5.9% |
r | 471 | 5.8% |
s | 352 | 4.4% |
t | 311 | 3.9% |
l | 296 | 3.7% |
d | 207 | 2.6% |
Other values (41) | 3462 |
Common
Value | Count | Frequency (%) |
2693 | ||
, | 1249 | |
. | 120 | 2.8% |
2 | 41 | 1.0% |
) | 34 | 0.8% |
( | 34 | 0.8% |
- | 31 | 0.7% |
1 | 20 | 0.5% |
0 | 17 | 0.4% |
· | 9 | 0.2% |
Other values (5) | 19 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 45896 | |
ASCII | 12329 | 21.2% |
None | 9 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2693 | ||
, | 1249 | 10.1% |
e | 712 | 5.8% |
a | 665 | 5.4% |
n | 568 | 4.6% |
i | 553 | 4.5% |
o | 474 | 3.8% |
r | 471 | 3.8% |
s | 352 | 2.9% |
t | 311 | 2.5% |
Other values (55) | 4281 |
Hangul
Value | Count | Frequency (%) |
회 | 1662 | 3.6% |
원 | 1562 | 3.4% |
송 | 1557 | 3.4% |
방 | 1501 | 3.3% |
국 | 1500 | 3.3% |
한 | 1407 | 3.1% |
이 | 1157 | 2.5% |
정 | 916 | 2.0% |
위 | 884 | 1.9% |
김 | 877 | 1.9% |
Other values (647) | 32873 |
None
Value | Count | Frequency (%) |
· | 9 |
출판사
Text
Distinct | 2118 |
---|---|
Distinct (%) | 21.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 142 |
---|---|
Median length | 75 |
Mean length | 6.9258 |
Min length | 1 |
Characters and Unicode
Total characters | 69258 |
---|---|
Distinct characters | 769 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 6 ? |
Unique
Unique | 1282 ? |
---|---|
Unique (%) | 12.8% |
Sample
1st row | 博英社 |
---|---|
2nd row | Korea Institute for International economic policy |
3rd row | 한국환경기술개발원 |
4th row | 김영사 |
5th row | 방송위원회 |
Value | Count | Frequency (%) |
방송위원회 | 661 | 5.7% |
한국법제연구원 | 263 | 2.3% |
대외경제정책연구원 | 244 | 2.1% |
커뮤니케이션북스 | 218 | 1.9% |
한국언론재단 | 196 | 1.7% |
한국행정연구원 | 195 | 1.7% |
한국방송개발원 | 174 | 1.5% |
한국개발연구원 | 152 | 1.3% |
한국방송공사 | 140 | 1.2% |
방송통신위원회 | 120 | 1.0% |
Other values (2220) | 9293 |
Most occurring characters
Value | Count | Frequency (%) |
원 | 3054 | 4.4% |
한 | 2944 | 4.3% |
국 | 2905 | 4.2% |
방 | 2021 | 2.9% |
송 | 2006 | 2.9% |
회 | 1885 | 2.7% |
연 | 1747 | 2.5% |
1656 | 2.4% | |
구 | 1562 | 2.3% |
사 | 1478 | 2.1% |
Other values (759) | 48000 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 54881 | |
Lowercase Letter | 10022 | 14.5% |
Uppercase Letter | 2109 | 3.0% |
Space Separator | 1656 | 2.4% |
Other Punctuation | 321 | 0.5% |
Decimal Number | 195 | 0.3% |
Dash Punctuation | 27 | < 0.1% |
Open Punctuation | 21 | < 0.1% |
Close Punctuation | 21 | < 0.1% |
Final Punctuation | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
원 | 3054 | 5.6% |
한 | 2944 | 5.4% |
국 | 2905 | 5.3% |
방 | 2021 | 3.7% |
송 | 2006 | 3.7% |
회 | 1885 | 3.4% |
연 | 1747 | 3.2% |
구 | 1562 | 2.8% |
사 | 1478 | 2.7% |
정 | 1140 | 2.1% |
Other values (684) | 34139 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1199 | |
o | 997 | |
i | 920 | |
n | 861 | |
t | 825 | |
a | 811 | |
r | 785 | 7.8% |
s | 678 | 6.8% |
c | 495 | 4.9% |
l | 488 | 4.9% |
Other values (15) | 1963 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 198 | 9.4% |
P | 197 | 9.3% |
I | 191 | 9.1% |
B | 188 | 8.9% |
M | 153 | 7.3% |
S | 150 | 7.1% |
T | 142 | 6.7% |
N | 106 | 5.0% |
R | 106 | 5.0% |
C | 100 | 4.7% |
Other values (15) | 578 |
Other Punctuation
Value | Count | Frequency (%) |
: | 101 | |
& | 62 | |
; | 52 | |
. | 38 | 11.8% |
· | 33 | 10.3% |
& | 18 | 5.6% |
, | 11 | 3.4% |
/ | 4 | 1.2% |
@ | 2 | 0.6% |
Decimal Number
Value | Count | Frequency (%) |
2 | 87 | |
1 | 86 | |
0 | 9 | 4.6% |
4 | 5 | 2.6% |
3 | 4 | 2.1% |
9 | 2 | 1.0% |
5 | 1 | 0.5% |
6 | 1 | 0.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 20 | |
[ | 1 | 4.8% |
Close Punctuation
Value | Count | Frequency (%) |
) | 20 | |
] | 1 | 4.8% |
Space Separator
Value | Count | Frequency (%) |
1656 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 27 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 4 |
Math Symbol
Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 53918 | |
Latin | 12131 | 17.5% |
Common | 2246 | 3.2% |
Han | 963 | 1.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
원 | 3054 | 5.7% |
한 | 2944 | 5.5% |
국 | 2905 | 5.4% |
방 | 2021 | 3.7% |
송 | 2006 | 3.7% |
회 | 1885 | 3.5% |
연 | 1747 | 3.2% |
구 | 1562 | 2.9% |
사 | 1478 | 2.7% |
정 | 1140 | 2.1% |
Other values (508) | 33176 |
Han
Value | Count | Frequency (%) |
社 | 177 | |
文 | 75 | 7.8% |
英 | 65 | 6.7% |
博 | 58 | 6.0% |
法 | 47 | 4.9% |
國 | 25 | 2.6% |
經 | 24 | 2.5% |
韓 | 22 | 2.3% |
新 | 18 | 1.9% |
出 | 17 | 1.8% |
Other values (166) | 435 |
Latin
Value | Count | Frequency (%) |
e | 1199 | 9.9% |
o | 997 | 8.2% |
i | 920 | 7.6% |
n | 861 | 7.1% |
t | 825 | 6.8% |
a | 811 | 6.7% |
r | 785 | 6.5% |
s | 678 | 5.6% |
c | 495 | 4.1% |
l | 488 | 4.0% |
Other values (40) | 4072 |
Common
Value | Count | Frequency (%) |
1656 | ||
: | 101 | 4.5% |
2 | 87 | 3.9% |
1 | 86 | 3.8% |
& | 62 | 2.8% |
; | 52 | 2.3% |
. | 38 | 1.7% |
· | 33 | 1.5% |
- | 27 | 1.2% |
( | 20 | 0.9% |
Other values (15) | 84 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 53918 | |
ASCII | 14322 | 20.7% |
CJK | 960 | 1.4% |
None | 51 | 0.1% |
Punctuation | 4 | < 0.1% |
CJK Compat Ideographs | 3 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
원 | 3054 | 5.7% |
한 | 2944 | 5.5% |
국 | 2905 | 5.4% |
방 | 2021 | 3.7% |
송 | 2006 | 3.7% |
회 | 1885 | 3.5% |
연 | 1747 | 3.2% |
구 | 1562 | 2.9% |
사 | 1478 | 2.7% |
정 | 1140 | 2.1% |
Other values (508) | 33176 |
ASCII
Value | Count | Frequency (%) |
1656 | 11.6% | |
e | 1199 | 8.4% |
o | 997 | 7.0% |
i | 920 | 6.4% |
n | 861 | 6.0% |
t | 825 | 5.8% |
a | 811 | 5.7% |
r | 785 | 5.5% |
s | 678 | 4.7% |
c | 495 | 3.5% |
Other values (62) | 5095 |
CJK
Value | Count | Frequency (%) |
社 | 177 | |
文 | 75 | 7.8% |
英 | 65 | 6.8% |
博 | 58 | 6.0% |
法 | 47 | 4.9% |
國 | 25 | 2.6% |
經 | 24 | 2.5% |
韓 | 22 | 2.3% |
新 | 18 | 1.9% |
出 | 17 | 1.8% |
Other values (163) | 432 |
None
Value | Count | Frequency (%) |
· | 33 | |
& | 18 |
Punctuation
Value | Count | Frequency (%) |
’ | 4 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
女 | 1 | |
金 | 1 | |
率 | 1 |
별치
Categorical
IMBALANCE
 
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
B | 412 |
L | 145 |
Z | 135 |
W | 89 |
Other values (8) | 268 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.6853 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 8951 | |
B | 412 | 4.1% |
L | 145 | 1.5% |
Z | 135 | 1.4% |
W | 89 | 0.9% |
E | 81 | 0.8% |
R | 64 | 0.6% |
X | 40 | 0.4% |
H | 22 | 0.2% |
P | 19 | 0.2% |
Other values (3) | 42 | 0.4% |
Length
Value | Count | Frequency (%) |
na | 8951 | |
b | 412 | 4.1% |
l | 145 | 1.5% |
z | 135 | 1.4% |
w | 89 | 0.9% |
e | 81 | 0.8% |
r | 64 | 0.6% |
x | 40 | 0.4% |
h | 22 | 0.2% |
p | 19 | 0.2% |
Other values (3) | 42 | 0.4% |
출판년도
Real number (ℝ)
Distinct | 59 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2001.3512 |
Minimum | 1956 |
---|---|
Maximum | 2019 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1956 |
---|---|
5-th percentile | 1986 |
Q1 | 1996 |
median | 2002 |
Q3 | 2007 |
95-th percentile | 2015 |
Maximum | 2019 |
Range | 63 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 8.9013809 |
---|---|
Coefficient of variation (CV) | 0.0044476856 |
Kurtosis | 1.1703731 |
Mean | 2001.3512 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -0.70064161 |
Sum | 20013512 |
Variance | 79.234582 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2006 | 578 | 5.8% |
2004 | 564 | 5.6% |
2005 | 555 | 5.5% |
2007 | 552 | 5.5% |
2001 | 525 | 5.2% |
2003 | 475 | 4.8% |
2002 | 468 | 4.7% |
1994 | 408 | 4.1% |
2000 | 370 | 3.7% |
1997 | 350 | 3.5% |
Other values (49) | 5155 |
Value | Count | Frequency (%) |
1956 | 1 | < 0.1% |
1961 | 3 | < 0.1% |
1962 | 13 | |
1963 | 2 | < 0.1% |
1964 | 1 | < 0.1% |
1966 | 4 | < 0.1% |
1967 | 8 | 0.1% |
1968 | 1 | < 0.1% |
1969 | 8 | 0.1% |
1970 | 22 |
Value | Count | Frequency (%) |
2019 | 71 | 0.7% |
2018 | 78 | 0.8% |
2017 | 87 | 0.9% |
2016 | 228 | |
2015 | 142 | |
2014 | 166 | |
2013 | 198 | |
2012 | 189 | |
2011 | 266 | |
2010 | 237 |
별치 | 출판년도 | |
---|---|---|
별치 | 1.000 | 0.533 |
출판년도 | 0.533 | 1.000 |
출판년도 | 별치 | |
---|---|---|
출판년도 | 1.000 | 0.256 |
별치 | 0.256 | 1.000 |
청구기호 | 서명 | 저자 | 출판사 | 별치 | 출판년도 | |
---|---|---|---|---|---|---|
13754 | 359.01 이14ㅇ | 아래로부터의 정부개혁 | 이계식 | 博英社 | <NA> | 1997 |
5426 | 61B 02-063 | Changes in industrial interdependency betweeb Japan and Korea since 1985 | Lee HongBae | Korea Institute for International economic policy | <NA> | 2002 |
5792 | 90Z 94-026 | 환경청정기술개발의 국제적동향파악 및 종합추진전략 방안에 관한 연구 | 신명교 | 한국환경기술개발원 | <NA> | 1994 |
14288 | 320.911 사15ㅅ | 세계속의 한국경제 | 사공일 | 김영사 | <NA> | 1993 |
2108 | 00A 92-008 | 미국 ABC 및 영국 IBA 광고방송 기준 | 방송위원회 | 방송위원회 | <NA> | 1992 |
10758 | 070.4 안44ㅎ | 행동하는 언론, 공공 저널리즘 | 안병길 | 전망 | <NA> | 2005 |
6913 | 68A 05-014 | 디지털 시대 한·일 양국의 저작권 법제와 처리관행에 관한 비교 연구 | 한국방송광고공사 | 한국방송광고공사 | <NA> | 2005 |
15351 | 005.3 엣829ㅍ | 프레젠테이션을 부탁해 | 엣킨슨, 클리프 | 정보문화사 | <NA> | 2009 |
5849 | 36A 96-002 | 방송관계법 개정방향에 관한 공청회 | 국회 제도개선특별위원회 | 국회제도개선특별위원회 | <NA> | 1996 |
5538 | 61B 03-084 | European integration and the Asia-pacific region | 김흥종 | Korea Institute for International economic policy | <NA> | 2003 |
청구기호 | 서명 | 저자 | 출판사 | 별치 | 출판년도 | |
---|---|---|---|---|---|---|
2623 | 21A 00-014 | 지역공동체와 저널리즘 | 한국언론재단 | 한국언론재단 | <NA> | 2000 |
6851 | 67Z 83-001 | TV광고방송량 83상반기 | 한국광보문화연구원 | 한국광보문화연구원 | <NA> | 1983 |
1161 | 21B 04-011 | KI 도입기반 구축 연구 결과 발표 및 토론회 | 한국언론학회 | 한국언론학회 | <NA> | 2004 |
4487 | 35A 03-029 | 규제영향분석 지침서 및 교재개발 | 윤종설 | 한국행정연구원 | <NA> | 2003 |
15729 | 326.14 이59ㅁ | 미디어 소비자 광고의 변화 | 이시훈 | 한경사 | <NA> | 2008 |
6278 | 83C 99-002 | (언론개혁시민연대 토론회)방송개혁, 이제 시작이다 | 언론개혁시민연대 | 언론개혁시민연대 | <NA> | 1999 |
1311 | 00A 06-053 | T-Commerce의 방송산업 파급효과와 정책방안에 관한 연구 | 주정민 | 방송위원회 | <NA> | 2006 |
3798 | 36A 95-003 | (국회)경과보고서 | 국회사무처 | 국회사무처 | <NA> | 1995 |
13355 | 367.564 우94ㅂ | (왕초보 박과장)부동산 경매로 집고 사고 돈도 벌다 | 우형달 | 원앤원북스 | <NA> | 2006 |
4243 | 31A 03-002 | 문화예술인실태조사 | 문화관광부 | 문화관광부: 한국문화관광정책연구원 | <NA> | 2003 |
Most frequently occurring
청구기호 | 서명 | 저자 | 출판사 | 별치 | 출판년도 | # duplicates | |
---|---|---|---|---|---|---|---|
0 | 02A 05-003 | 외국방송 재송신 승인 정책 수립을 위한 전문가 토론회 | 방송위원회 | 방송위원회 | <NA> | 2005 | 2 |