Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.0 KiB |
Average record size in memory | 41.3 B |
Variable types
Categorical | 2 |
---|---|
Text | 2 |
DateTime | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국문화예술위원회 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=2b2c109b-f2a3-4db0-a6d5-2bb21e50adc1 |
ltrtr_se_cd is highly overall correlated with cyber_ltrtr_cd_nm | High correlation |
cyber_ltrtr_cd_nm is highly overall correlated with ltrtr_se_cd | High correlation |
ltrtr_se_cd is highly imbalanced (80.6%) | Imbalance |
cyber_ltrtr_cd_nm is highly imbalanced (80.6%) | Imbalance |
authr_sj has unique values | Unique |
rgs_de has unique values | Unique |
orginl_link_url has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:49:10.601306 |
---|---|
Analysis finished | 2023-12-10 09:49:12.048621 |
Duration | 1.45 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
ltrtr_se_cd
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
PM | |
---|---|
LT | 3 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PM |
---|---|
2nd row | LT |
3rd row | PM |
4th row | PM |
5th row | PM |
Common Values
Value | Count | Frequency (%) |
PM | 97 | |
LT | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
pm | 97 | |
lt | 3 | 3.0% |
cyber_ltrtr_cd_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
시배달 | |
---|---|
문장배달 | 3 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.03 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 시배달 |
---|---|
2nd row | 문장배달 |
3rd row | 시배달 |
4th row | 시배달 |
5th row | 시배달 |
Common Values
Value | Count | Frequency (%) |
시배달 | 97 | |
문장배달 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
시배달 | 97 | |
문장배달 | 3 | 3.0% |
authr_sj
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 44 |
---|---|
Median length | 21 |
Mean length | 13.75 |
Min length | 8 |
Characters and Unicode
Total characters | 1375 |
---|---|
Distinct characters | 320 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 7 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 신대철, 「반딧불 하나 내려보낼까요?」 |
---|---|
2nd row | 김승옥의「무진기행」 |
3rd row | 김영승, 「반성 673」 |
4th row | 김행숙, 「입맞춤-사춘기2」 |
5th row | 박용래, 「상치꽃 아욱꽃」 |
Value | Count | Frequency (%) |
7 | 2.2% | |
」 | 2 | 0.6% |
김행숙 | 2 | 0.6% |
함민복 | 2 | 0.6% |
이장욱 | 2 | 0.6% |
진은영 | 2 | 0.6% |
안도현 | 2 | 0.6% |
김용택 | 2 | 0.6% |
김혜순 | 2 | 0.6% |
이원 | 2 | 0.6% |
Other values (282) | 287 |
Most occurring characters
Value | Count | Frequency (%) |
212 | 15.4% | |
, | 99 | 7.2% |
」 | 98 | 7.1% |
「 | 98 | 7.1% |
이 | 26 | 1.9% |
김 | 21 | 1.5% |
의 | 18 | 1.3% |
는 | 16 | 1.2% |
리 | 11 | 0.8% |
은 | 11 | 0.8% |
Other values (310) | 765 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 843 | |
Space Separator | 212 | 15.4% |
Close Punctuation | 101 | 7.3% |
Open Punctuation | 101 | 7.3% |
Other Punctuation | 100 | 7.3% |
Decimal Number | 14 | 1.0% |
Other Symbol | 1 | 0.1% |
Initial Punctuation | 1 | 0.1% |
Final Punctuation | 1 | 0.1% |
Dash Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 26 | 3.1% |
김 | 21 | 2.5% |
의 | 18 | 2.1% |
는 | 16 | 1.9% |
리 | 11 | 1.3% |
은 | 11 | 1.3% |
나 | 11 | 1.3% |
기 | 10 | 1.2% |
지 | 10 | 1.2% |
장 | 10 | 1.2% |
Other values (289) | 699 |
Decimal Number
Value | Count | Frequency (%) |
2 | 4 | |
4 | 2 | |
1 | 2 | |
9 | 2 | |
0 | 1 | 7.1% |
6 | 1 | 7.1% |
7 | 1 | 7.1% |
3 | 1 | 7.1% |
Close Punctuation
Value | Count | Frequency (%) |
」 | 98 | |
) | 2 | 2.0% |
」 | 1 | 1.0% |
Open Punctuation
Value | Count | Frequency (%) |
「 | 98 | |
( | 2 | 2.0% |
「 | 1 | 1.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 99 | |
? | 1 | 1.0% |
Space Separator
Value | Count | Frequency (%) |
212 |
Other Symbol
Value | Count | Frequency (%) |
◉ | 1 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 1 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 837 | |
Common | 532 | |
Han | 6 | 0.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 26 | 3.1% |
김 | 21 | 2.5% |
의 | 18 | 2.2% |
는 | 16 | 1.9% |
리 | 11 | 1.3% |
은 | 11 | 1.3% |
나 | 11 | 1.3% |
기 | 10 | 1.2% |
지 | 10 | 1.2% |
장 | 10 | 1.2% |
Other values (283) | 693 |
Common
Value | Count | Frequency (%) |
212 | ||
, | 99 | |
」 | 98 | |
「 | 98 | |
2 | 4 | 0.8% |
( | 2 | 0.4% |
4 | 2 | 0.4% |
1 | 2 | 0.4% |
) | 2 | 0.4% |
9 | 2 | 0.4% |
Other values (11) | 11 | 2.1% |
Han
Value | Count | Frequency (%) |
招 | 1 | |
魂 | 1 | |
略 | 1 | |
傳 | 1 | |
法 | 1 | |
詩 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 837 | |
ASCII | 331 | 24.1% |
None | 198 | 14.4% |
CJK | 5 | 0.4% |
Punctuation | 2 | 0.1% |
Geometric Shapes | 1 | 0.1% |
CJK Compat Ideographs | 1 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
212 | ||
, | 99 | |
2 | 4 | 1.2% |
( | 2 | 0.6% |
4 | 2 | 0.6% |
1 | 2 | 0.6% |
) | 2 | 0.6% |
9 | 2 | 0.6% |
0 | 1 | 0.3% |
? | 1 | 0.3% |
Other values (4) | 4 | 1.2% |
None
Value | Count | Frequency (%) |
」 | 98 | |
「 | 98 | |
「 | 1 | 0.5% |
」 | 1 | 0.5% |
Hangul
Value | Count | Frequency (%) |
이 | 26 | 3.1% |
김 | 21 | 2.5% |
의 | 18 | 2.2% |
는 | 16 | 1.9% |
리 | 11 | 1.3% |
은 | 11 | 1.3% |
나 | 11 | 1.3% |
기 | 10 | 1.2% |
지 | 10 | 1.2% |
장 | 10 | 1.2% |
Other values (283) | 693 |
Geometric Shapes
Value | Count | Frequency (%) |
◉ | 1 |
Punctuation
Value | Count | Frequency (%) |
‘ | 1 | |
’ | 1 |
CJK
Value | Count | Frequency (%) |
招 | 1 | |
魂 | 1 | |
傳 | 1 | |
法 | 1 | |
詩 | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
略 | 1 |
rgs_de
Date
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2007-05-31 00:00:00 |
---|---|
Maximum | 2021-08-26 00:00:00 |
orginl_link_url
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 36 |
---|---|
Median length | 31 |
Mean length | 30.77 |
Min length | 30 |
Characters and Unicode
Total characters | 3077 |
---|---|
Distinct characters | 32 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | https://munjang.or.kr/?p=283305 |
---|---|
2nd row | http://munjang.or.kr/archives/141504 |
3rd row | https://munjang.or.kr/?p=283205 |
4th row | https://munjang.or.kr/?p=283131 |
5th row | https://munjang.or.kr/?p=283077 |
Value | Count | Frequency (%) |
https://munjang.or.kr/?p=283305 | 1 | 1.0% |
http://munjang.or.kr/?p=277757 | 1 | 1.0% |
http://munjang.or.kr/?p=276980 | 1 | 1.0% |
http://munjang.or.kr/?p=277086 | 1 | 1.0% |
http://munjang.or.kr/?p=277171 | 1 | 1.0% |
http://munjang.or.kr/?p=277209 | 1 | 1.0% |
http://munjang.or.kr/?p=277243 | 1 | 1.0% |
http://munjang.or.kr/?p=277291 | 1 | 1.0% |
http://munjang.or.kr/?p=277370 | 1 | 1.0% |
http://munjang.or.kr/?p=277464 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 303 | 9.8% |
r | 203 | 6.6% |
n | 200 | 6.5% |
. | 200 | 6.5% |
t | 200 | 6.5% |
p | 197 | 6.4% |
2 | 131 | 4.3% |
7 | 108 | 3.5% |
a | 103 | 3.3% |
h | 103 | 3.3% |
Other values (22) | 1329 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1680 | |
Other Punctuation | 700 | |
Decimal Number | 600 | 19.5% |
Math Symbol | 97 | 3.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
r | 203 | |
n | 200 | |
t | 200 | |
p | 197 | |
a | 103 | 6.1% |
h | 103 | 6.1% |
m | 100 | 6.0% |
u | 100 | 6.0% |
j | 100 | 6.0% |
g | 100 | 6.0% |
Other values (7) | 274 |
Decimal Number
Value | Count | Frequency (%) |
2 | 131 | |
7 | 108 | |
8 | 75 | |
0 | 46 | 7.7% |
4 | 46 | 7.7% |
9 | 44 | 7.3% |
1 | 43 | 7.2% |
3 | 39 | 6.5% |
5 | 36 | 6.0% |
6 | 32 | 5.3% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 303 | |
. | 200 | |
: | 100 | 14.3% |
? | 97 | 13.9% |
Math Symbol
Value | Count | Frequency (%) |
= | 97 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1680 | |
Common | 1397 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
r | 203 | |
n | 200 | |
t | 200 | |
p | 197 | |
a | 103 | 6.1% |
h | 103 | 6.1% |
m | 100 | 6.0% |
u | 100 | 6.0% |
j | 100 | 6.0% |
g | 100 | 6.0% |
Other values (7) | 274 |
Common
Value | Count | Frequency (%) |
/ | 303 | |
. | 200 | |
2 | 131 | |
7 | 108 | 7.7% |
: | 100 | 7.2% |
? | 97 | 6.9% |
= | 97 | 6.9% |
8 | 75 | 5.4% |
0 | 46 | 3.3% |
4 | 46 | 3.3% |
Other values (5) | 194 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3077 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 303 | 9.8% |
r | 203 | 6.6% |
n | 200 | 6.5% |
. | 200 | 6.5% |
t | 200 | 6.5% |
p | 197 | 6.4% |
2 | 131 | 4.3% |
7 | 108 | 3.5% |
a | 103 | 3.3% |
h | 103 | 3.3% |
Other values (22) | 1329 |
ltrtr_se_cd | cyber_ltrtr_cd_nm | authr_sj | rgs_de | orginl_link_url | |
---|---|---|---|---|---|
ltrtr_se_cd | 1.000 | 0.963 | 1.000 | 1.000 | 1.000 |
cyber_ltrtr_cd_nm | 0.963 | 1.000 | 1.000 | 1.000 | 1.000 |
authr_sj | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
rgs_de | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
orginl_link_url | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
cyber_ltrtr_cd_nm | ltrtr_se_cd | |
---|---|---|
cyber_ltrtr_cd_nm | 1.000 | 0.826 |
ltrtr_se_cd | 0.826 | 1.000 |
ltrtr_se_cd | cyber_ltrtr_cd_nm | |
---|---|---|
ltrtr_se_cd | 1.000 | 0.826 |
cyber_ltrtr_cd_nm | 0.826 | 1.000 |
ltrtr_se_cd | cyber_ltrtr_cd_nm | authr_sj | rgs_de | orginl_link_url | |
---|---|---|---|---|---|
0 | PM | 시배달 | 신대철, 「반딧불 하나 내려보낼까요?」 | 2021-08-26 | https://munjang.or.kr/?p=283305 |
1 | LT | 문장배달 | 김승옥의「무진기행」 | 2007-06-14 | http://munjang.or.kr/archives/141504 |
2 | PM | 시배달 | 김영승, 「반성 673」 | 2021-07-29 | https://munjang.or.kr/?p=283205 |
3 | PM | 시배달 | 김행숙, 「입맞춤-사춘기2」 | 2021-07-15 | https://munjang.or.kr/?p=283131 |
4 | PM | 시배달 | 박용래, 「상치꽃 아욱꽃」 | 2021-07-01 | https://munjang.or.kr/?p=283077 |
5 | PM | 시배달 | 황인찬 , 「법원」 | 2020-12-17 | https://munjang.or.kr/?p=282388 |
6 | PM | 시배달 | 신해욱, 「보고 싶은 친구에게」 | 2020-12-03 | https://munjang.or.kr/?p=282259 |
7 | LT | 문장배달 | 김소진의「눈사람 속의 검은 항아리」 | 2007-06-07 | http://munjang.or.kr/archives/141544 |
8 | PM | 시배달 | 김언희, 「트렁크」 | 2020-11-05 | https://munjang.or.kr/?p=282112 |
9 | PM | 시배달 | 이장욱, 「두번째 강물」 | 2020-10-22 | https://munjang.or.kr/?p=281917 |
ltrtr_se_cd | cyber_ltrtr_cd_nm | authr_sj | rgs_de | orginl_link_url | |
---|---|---|---|---|---|
90 | PM | 시배달 | 함민복, 「숨 쉬기도 미안한 4월」 | 2017-04-13 | http://munjang.or.kr/?p=274275 |
91 | PM | 시배달 | 경종호, 「새싹 하나가 나기까지는」 | 2017-03-30 | http://munjang.or.kr/?p=274229 |
92 | PM | 시배달 | 신철규, 「눈물의 중력」 | 2017-03-16 | http://munjang.or.kr/?p=274108 |
93 | PM | 시배달 | 임순덕, 「부아가 나서」 | 2017-03-02 | http://munjang.or.kr/?p=274049 |
94 | PM | 시배달 | 이병률, 「반반」 | 2017-02-16 | http://munjang.or.kr/?p=273919 |
95 | PM | 시배달 | 정동철, 「포릉포릉」 | 2017-02-02 | http://munjang.or.kr/?p=273838 |
96 | PM | 시배달 | 박소란, 「지익」 | 2017-01-19 | http://munjang.or.kr/?p=273723 |
97 | PM | 시배달 | 장석남, 「여행의 메모」 | 2017-01-05 | http://munjang.or.kr/?p=273427 |
98 | PM | 시배달 | 성미정, 무상한 나라의 앨리스 | 2016-12-22 | http://munjang.or.kr/?p=273336 |
99 | PM | 시배달 | 송경동, 「참, 좆같은 풍경」 | 2016-12-08 | http://munjang.or.kr/?p=273280 |