Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 100 |
Missing cells (%) | 20.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.3 KiB |
Average record size in memory | 44.3 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Unsupported | 1 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국문화예술위원회 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=bde7091f-aaca-41cf-98fa-da8a12fa2e63 |
lon_cd is highly overall correlated with lon_cd_nm | High correlation |
lon_cd_nm is highly overall correlated with lon_cd | High correlation |
book_isbn_cn has 100 (100.0%) missing values | Missing |
book_isbn_cn is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-10 10:02:18.618812 |
---|---|
Analysis finished | 2023-12-10 10:02:19.557138 |
Duration | 0.94 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
lon_cd
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1001 | |
---|---|
1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.4 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1001 |
---|---|
2nd row | 1 |
3rd row | 1001 |
4th row | 1001 |
5th row | 1001 |
Common Values
Value | Count | Frequency (%) |
1001 | 80 | |
1 | 20 | 20.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1001 | 80 | |
1 | 20 | 20.0% |
lon_cd_nm
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
관내대출 | |
---|---|
일반대출 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 관내대출 |
---|---|
2nd row | 일반대출 |
3rd row | 관내대출 |
4th row | 관내대출 |
5th row | 관내대출 |
Common Values
Value | Count | Frequency (%) |
관내대출 | 80 | |
일반대출 | 20 | 20.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
관내대출 | 80 | |
일반대출 | 20 | 20.0% |
sj
Text
Distinct | 94 |
---|---|
Distinct (%) | 94.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 71 |
---|---|
Median length | 32.5 |
Mean length | 20.98 |
Min length | 3 |
Characters and Unicode
Total characters | 2098 |
---|---|
Distinct characters | 297 |
Distinct categories | 9 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 88 ? |
---|---|
Unique (%) | 88.0% |
Sample
1st row | 갈매기 |
---|---|
2nd row | (제25회)앙가쥬망전 . 25 |
3rd row | 문예진흥원 2000년도 기획공연 ; 세 자매 |
4th row | 갈매기 [DVD] |
5th row | 벚나무 동산 |
Value | Count | Frequency (%) |
75 | 16.6% | |
dvd | 39 | 8.6% |
2004 | 13 | 2.9% |
연극열전 | 13 | 2.9% |
verdi | 5 | 1.1% |
서울국제공연예술제 | 5 | 1.1% |
the | 4 | 0.9% |
갈매기 | 4 | 0.9% |
기획공연 | 4 | 0.9% |
극단 | 4 | 0.9% |
Other values (219) | 285 |
Most occurring characters
Value | Count | Frequency (%) |
353 | 16.8% | |
D | 86 | 4.1% |
0 | 67 | 3.2% |
; | 50 | 2.4% |
연 | 50 | 2.4% |
e | 50 | 2.4% |
i | 48 | 2.3% |
V | 45 | 2.1% |
a | 44 | 2.1% |
( | 41 | 2.0% |
Other values (287) | 1264 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 739 | |
Lowercase Letter | 389 | |
Space Separator | 353 | |
Uppercase Letter | 206 | 9.8% |
Decimal Number | 159 | 7.6% |
Other Punctuation | 87 | 4.1% |
Open Punctuation | 82 | 3.9% |
Close Punctuation | 82 | 3.9% |
Math Symbol | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
연 | 50 | 6.8% |
극 | 36 | 4.9% |
제 | 28 | 3.8% |
전 | 21 | 2.8% |
서 | 18 | 2.4% |
기 | 18 | 2.4% |
공 | 16 | 2.2% |
열 | 15 | 2.0% |
울 | 15 | 2.0% |
이 | 14 | 1.9% |
Other values (216) | 508 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 50 | |
i | 48 | |
a | 44 | |
o | 31 | 8.0% |
r | 29 | 7.5% |
n | 28 | 7.2% |
t | 23 | 5.9% |
d | 17 | 4.4% |
l | 17 | 4.4% |
s | 14 | 3.6% |
Other values (13) | 88 |
Uppercase Letter
Value | Count | Frequency (%) |
D | 86 | |
V | 45 | |
T | 13 | 6.3% |
R | 8 | 3.9% |
M | 7 | 3.4% |
L | 5 | 2.4% |
C | 5 | 2.4% |
N | 4 | 1.9% |
A | 4 | 1.9% |
S | 4 | 1.9% |
Other values (12) | 25 | 12.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 67 | |
2 | 40 | |
4 | 15 | 9.4% |
1 | 14 | 8.8% |
9 | 8 | 5.0% |
5 | 4 | 2.5% |
6 | 3 | 1.9% |
3 | 3 | 1.9% |
8 | 3 | 1.9% |
7 | 2 | 1.3% |
Other Punctuation
Value | Count | Frequency (%) |
; | 50 | |
: | 22 | |
" | 4 | 4.6% |
, | 3 | 3.4% |
& | 3 | 3.4% |
. | 1 | 1.1% |
' | 1 | 1.1% |
? | 1 | 1.1% |
! | 1 | 1.1% |
/ | 1 | 1.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 41 | |
[ | 41 |
Close Punctuation
Value | Count | Frequency (%) |
] | 41 | |
) | 41 |
Space Separator
Value | Count | Frequency (%) |
353 |
Math Symbol
Value | Count | Frequency (%) |
= | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 764 | |
Hangul | 738 | |
Latin | 595 | |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
연 | 50 | 6.8% |
극 | 36 | 4.9% |
제 | 28 | 3.8% |
전 | 21 | 2.8% |
서 | 18 | 2.4% |
기 | 18 | 2.4% |
공 | 16 | 2.2% |
열 | 15 | 2.0% |
울 | 15 | 2.0% |
이 | 14 | 1.9% |
Other values (215) | 507 |
Latin
Value | Count | Frequency (%) |
D | 86 | |
e | 50 | 8.4% |
i | 48 | 8.1% |
V | 45 | 7.6% |
a | 44 | 7.4% |
o | 31 | 5.2% |
r | 29 | 4.9% |
n | 28 | 4.7% |
t | 23 | 3.9% |
d | 17 | 2.9% |
Other values (35) | 194 |
Common
Value | Count | Frequency (%) |
353 | ||
0 | 67 | 8.8% |
; | 50 | 6.5% |
( | 41 | 5.4% |
[ | 41 | 5.4% |
] | 41 | 5.4% |
) | 41 | 5.4% |
2 | 40 | 5.2% |
: | 22 | 2.9% |
4 | 15 | 2.0% |
Other values (16) | 53 | 6.9% |
Han
Value | Count | Frequency (%) |
爾 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1358 | |
Hangul | 738 | |
CJK | 1 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
353 | ||
D | 86 | 6.3% |
0 | 67 | 4.9% |
; | 50 | 3.7% |
e | 50 | 3.7% |
i | 48 | 3.5% |
V | 45 | 3.3% |
a | 44 | 3.2% |
( | 41 | 3.0% |
[ | 41 | 3.0% |
Other values (60) | 533 |
Hangul
Value | Count | Frequency (%) |
연 | 50 | 6.8% |
극 | 36 | 4.9% |
제 | 28 | 3.8% |
전 | 21 | 2.8% |
서 | 18 | 2.4% |
기 | 18 | 2.4% |
공 | 16 | 2.2% |
열 | 15 | 2.0% |
울 | 15 | 2.0% |
이 | 14 | 1.9% |
Other values (215) | 507 |
CJK
Value | Count | Frequency (%) |
爾 | 1 |
None
Value | Count | Frequency (%) |
ä | 1 |
book_isbn_cn
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 100 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.0 KiB |
co
Real number (ℝ)
Distinct | 92 |
---|---|
Distinct (%) | 92.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 368.9 |
Minimum | 1 |
---|---|
Maximum | 1866 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 210.95 |
Q1 | 256.25 |
median | 314 |
Q3 | 415.5 |
95-th percentile | 730.3 |
Maximum | 1866 |
Range | 1865 |
Interquartile range (IQR) | 159.25 |
Descriptive statistics
Standard deviation | 231.15824 |
---|---|
Coefficient of variation (CV) | 0.62661491 |
Kurtosis | 18.724933 |
Mean | 368.9 |
Median Absolute Deviation (MAD) | 70.5 |
Skewness | 3.5202874 |
Sum | 36890 |
Variance | 53434.131 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | 3.0% |
258 | 2 | 2.0% |
340 | 2 | 2.0% |
289 | 2 | 2.0% |
277 | 2 | 2.0% |
320 | 2 | 2.0% |
210 | 2 | 2.0% |
273 | 1 | 1.0% |
276 | 1 | 1.0% |
212 | 1 | 1.0% |
Other values (82) | 82 |
Value | Count | Frequency (%) |
1 | 3 | |
210 | 2 | |
211 | 1 | 1.0% |
212 | 1 | 1.0% |
221 | 1 | 1.0% |
222 | 1 | 1.0% |
225 | 1 | 1.0% |
226 | 1 | 1.0% |
229 | 1 | 1.0% |
230 | 1 | 1.0% |
Value | Count | Frequency (%) |
1866 | 1 | |
1215 | 1 | |
871 | 1 | |
870 | 1 | |
793 | 1 | |
727 | 1 | |
685 | 1 | |
677 | 1 | |
624 | 1 | |
573 | 1 |
lon_cd | lon_cd_nm | sj | co | |
---|---|---|---|---|
lon_cd | 1.000 | 0.999 | 0.000 | 0.153 |
lon_cd_nm | 0.999 | 1.000 | 0.000 | 0.153 |
sj | 0.000 | 0.000 | 1.000 | 0.997 |
co | 0.153 | 0.153 | 0.997 | 1.000 |
lon_cd | lon_cd_nm | |
---|---|---|
lon_cd | 1.000 | 0.968 |
lon_cd_nm | 0.968 | 1.000 |
co | lon_cd | lon_cd_nm | |
---|---|---|---|
co | 1.000 | 0.158 | 0.158 |
lon_cd | 0.158 | 1.000 | 0.968 |
lon_cd_nm | 0.158 | 0.968 | 1.000 |
lon_cd | lon_cd_nm | sj | book_isbn_cn | co | |
---|---|---|---|---|---|
0 | 1001 | 관내대출 | 갈매기 | <NA> | 1866 |
1 | 1 | 일반대출 | (제25회)앙가쥬망전 . 25 | <NA> | 1 |
2 | 1001 | 관내대출 | 문예진흥원 2000년도 기획공연 ; 세 자매 | <NA> | 1215 |
3 | 1001 | 관내대출 | 갈매기 [DVD] | <NA> | 871 |
4 | 1001 | 관내대출 | 벚나무 동산 | <NA> | 870 |
5 | 1001 | 관내대출 | 리어왕 | <NA> | 793 |
6 | 1001 | 관내대출 | 시련 [VHS] | <NA> | 727 |
7 | 1001 | 관내대출 | 무엇이 될꼬하니 | <NA> | 1 |
8 | 1001 | 관내대출 | (Unplugged Musical) 밑바닥에서 | <NA> | 685 |
9 | 1001 | 관내대출 | 김종욱 찾기 ; 로맨틱 코메디 뮤지컬 | <NA> | 677 |
lon_cd | lon_cd_nm | sj | book_isbn_cn | co | |
---|---|---|---|---|---|
90 | 1001 | 관내대출 | The Heat Is On : The making of Miss Saigon [DVD] = 미스사이공 | <NA> | 230 |
91 | 1 | 일반대출 | Rigoletto [DVD] | <NA> | 229 |
92 | 1001 | 관내대출 | Les Miserables [DVD] | <NA> | 226 |
93 | 1 | 일반대출 | Die Zauberflote [DVD] | <NA> | 225 |
94 | 1 | 일반대출 | Tchaikovsky : Eugene Onegin [DVD] | <NA> | 222 |
95 | 1001 | 관내대출 | 보이첵 ; 몸짓 콘서트 | <NA> | 221 |
96 | 1001 | 관내대출 | 굿모닝? 체홉 ; 혜화동1번지 '98 동인작업시리즈 | <NA> | 212 |
97 | 1001 | 관내대출 | 해무 : (30주년) 극단 연우무대 기념공연 [DVD] | <NA> | 211 |
98 | 1001 | 관내대출 | 십이야 ; 서울남산국악당 기획공연 | <NA> | 210 |
99 | 1001 | 관내대출 | (2004) 연극열전 ; 남자충동 ; (2004) 연극열전 | <NA> | 210 |