Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 3409 |
Missing cells | 3340 |
Missing cells (%) | 19.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 140.0 KiB |
Average record size in memory | 42.0 B |
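The two derived figures in this table can be rechecked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is total memory divided by the row count. The variable names are illustrative only.

```python
# Values copied from the "Dataset statistics" table above.
n_variables = 5
n_observations = 3409
missing_cells = 3340
total_size_kib = 140.0

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes; the total is itself rounded, so this is approximate.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```

Because the reported total size is rounded to one decimal place, the recomputed record size only agrees with the table to the nearest byte.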
Variable types
Numeric | 1 |
---|---|
Categorical | 2 |
Text | 2 |
Dataset
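Description | Information on organizations in the performing arts archive of the National Theater of Korea (국립중앙극장), including organization code (조직코드), organization type (조직유형), organization name (조직명), remarks (비고), and registration date (등록일). |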
---|---|
URL | https://www.data.go.kr/data/15090174/fileData.do |
Reproduction
Analysis started | 2023-12-12 02:42:32.798457 |
---|---|
Analysis finished | 2023-12-12 02:42:34.234548 |
Duration | 1.44 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
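A report of this kind can be regenerated roughly as follows. This is a minimal sketch, not the configuration stored in config.json: the function name `build_report`, the CSV path, the output path, the title, and the UTF-8 encoding are all assumptions. It requires `pandas` and `ydata-profiling` to be installed.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even if the packages are missing.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is UTF-8 encoded; change the encoding if the
    # Korean column names come out garbled.
    df = pd.read_csv(csv_path, encoding="utf-8")

    # Default settings; the original run may have used different options.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
```

Calling `build_report("organizations.csv")` with a locally downloaded copy of the file (the file name here is hypothetical) would write `report.html` to the working directory.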
조직코드
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 3409 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2046.5975 |
Minimum | 1 |
---|---|
Maximum | 5857 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 30.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 172.4 |
Q1 | 854 |
median | 1706 |
Q3 | 2558 |
95-th percentile | 5686.6 |
Maximum | 5857 |
Range | 5856 |
Interquartile range (IQR) | 1704 |
Descriptive statistics
Standard deviation | 1617.1794 |
---|---|
Coefficient of variation (CV) | 0.7901795 |
Kurtosis | 0.36630114 |
Mean | 2046.5975 |
Median Absolute Deviation (MAD) | 852 |
Skewness | 1.1109037 |
Sum | 6976851 |
Variance | 2615269.2 |
Monotonicity | Not monotonic |
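Several of the statistics above are simple functions of the others, which gives a quick consistency check. The sketch below uses only the reported values; the variable names are illustrative.

```python
# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 1, 5857
q1, q3 = 854, 2558
mean, std = 2046.5975, 1617.1794

value_range = maximum - minimum   # range = max - min
iqr = q3 - q1                     # interquartile range = Q3 - Q1
cv = std / mean                   # coefficient of variation = std / mean
variance = std ** 2               # variance = std squared

print(f"Range: {value_range}")
print(f"IQR: {iqr}")
print(f"CV: {cv:.7f}")
print(f"Variance: {variance:.1f}")
```

The recomputed values agree with the table up to the rounding of the reported mean and standard deviation.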
Value | Count | Frequency (%) |
353 | 1 | < 0.1% |
5564 | 1 | < 0.1% |
5566 | 1 | < 0.1% |
5567 | 1 | < 0.1% |
5568 | 1 | < 0.1% |
5569 | 1 | < 0.1% |
5570 | 1 | < 0.1% |
5571 | 1 | < 0.1% |
5572 | 1 | < 0.1% |
5573 | 1 | < 0.1% |
Other values (3399) | 3399 | 99.7% |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
6 | 1 | < 0.1% |
7 | 1 | < 0.1% |
8 | 1 | < 0.1% |
9 | 1 | < 0.1% |
10 | 1 | < 0.1% |
Value | Count | Frequency (%) |
5857 | 1 | < 0.1% |
5856 | 1 | < 0.1% |
5855 | 1 | < 0.1% |
5854 | 1 | < 0.1% |
5853 | 1 | < 0.1% |
5852 | 1 | < 0.1% |
5851 | 1 | < 0.1% |
5850 | 1 | < 0.1% |
5849 | 1 | < 0.1% |
5848 | 1 | < 0.1% |
조직유형
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 26.8 KiB |
1 | 3156 |
---|---|
2 | 252 |
3 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 3156 | 92.6% |
2 | 252 | 7.4% |
3 | 1 | < 0.1% |
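The IMBALANCE alert reflects how unevenly the three categories are distributed. One common way to quantify this is one minus the normalized Shannon entropy of the class shares; whether the alert here was computed with exactly this formula is an assumption. The sketch uses the counts from the table above.

```python
import math

# Category counts copied from the "Common Values" table above.
counts = {"1": 3156, "2": 252, "3": 1}

total = sum(counts.values())
shares = {value: n / total for value, n in counts.items()}

# Shannon entropy in bits; it is 0 when a single class holds every row.
entropy = -sum(p * math.log2(p) for p in shares.values())

# Normalize by the maximum possible entropy for this number of classes,
# then invert so that 0 means balanced and 1 means fully imbalanced.
imbalance = 1 - entropy / math.log2(len(counts))

for value, p in shares.items():
    print(f"{value}: {100 * p:.1f}%")
print(f"Imbalance score: {imbalance:.2f}")
```

A score close to 1 indicates that almost all rows fall into one category, which matches category 1 holding over nine tenths of the rows.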
조직명
Text
Distinct | 3154 |
---|---|
Distinct (%) | 92.5% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 26.8 KiB |
Length
Max length | 50 |
---|---|
Median length | 42 |
Mean length | 11.38615 |
Min length | 2 |
Characters and Unicode
Total characters | 38804 |
---|---|
Distinct characters | 1094 |
Distinct categories | 11 |
Distinct scripts | 4 |
Distinct blocks | 5 |
Unique
Unique | 2901 |
---|---|
Unique (%) | 85.1% |
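"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below illustrates the difference on a small made-up list; the helper name `distinct_and_unique` and the sample values are hypothetical, and missing values are assumed to be excluded from both counts.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing (None) entries."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                               # different values
    unique = sum(1 for c in counts.values() if c == 1)   # values seen once
    return distinct, unique

# Hypothetical sample: one name repeated, two names seen once, one missing.
names = ["극단 A", "극단 A", "Teatro alla Scala", "명품극단", None]
print(distinct_and_unique(names))
```

In this sample there are three distinct names, of which two appear exactly once.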
Sample
1st row | 코리안체임버 오페라단 |
---|---|
2nd row | 명품극단 |
3rd row | Haydn di Bolzano e Trento |
4th row | Teatro alla Scala |
5th row | The Dance Theatre of Harlem |
Value | Count | Frequency (%) |
극단 | 246 | 3.3% |
외 | 187 | 2.5% |
orchestra | 123 | 1.7% |
opera | 81 | 1.1% |
the | 78 | 1.1% |
ballet | 75 | 1.0% |
국립극장 | 53 | 0.7% |
teatro | 45 | 0.6% |
de | 44 | 0.6% |
劇團 | 38 | 0.5% |
Other values (3995) | 6451 | 86.9% |
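The word table above is consistent with lowercasing each name and splitting it on whitespace before counting. The exact tokenization used by the profiler is an assumption here; the sketch shows the idea on the five sample rows, and the helper name `word_counts` is illustrative.

```python
from collections import Counter

def word_counts(names):
    """Count lowercase, whitespace-separated words across all names."""
    counter = Counter()
    for name in names:
        counter.update(name.lower().split())
    return counter

# The five sample rows listed for this variable.
sample = [
    "코리안체임버 오페라단",
    "명품극단",
    "Haydn di Bolzano e Trento",
    "Teatro alla Scala",
    "The Dance Theatre of Harlem",
]

counts = word_counts(sample)
print(counts.most_common(5))
```

Lowercasing is what lets "The" and "the", or "Teatro" and "teatro", fall into the same bucket.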
Most occurring characters
Value | Count | Frequency (%) |
(space) | 4031 | 10.4% |
e | 1323 | 3.4% |
a | 1229 | 3.2% |
r | 1021 | 2.6% |
단 | 957 | 2.5% |
t | 692 | 1.8% |
o | 692 | 1.8% |
l | 673 | 1.7% |
극 | 629 | 1.6% |
n | 612 | 1.6% |
Other values (1084) | 26945 | 69.4% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 21252 | 54.8% |
Lowercase Letter | 9594 | 24.7% |
Space Separator | 4031 | 10.4% |
Uppercase Letter | 2159 | 5.6% |
Other Punctuation | 578 | 1.5% |
Close Punctuation | 507 | 1.3% |
Open Punctuation | 505 | 1.3% |
Decimal Number | 150 | 0.4% |
Dash Punctuation | 16 | < 0.1% |
Math Symbol | 10 | < 0.1% |
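The category names in this table correspond to Unicode general categories, which Python exposes through the standard `unicodedata` module as two-letter codes (`Lo` for Other Letter, `Ll` for Lowercase Letter, `Lu` for Uppercase Letter, `Zs` for Space Separator, and so on). A minimal sketch of the per-character tally, using one of the sample names; the helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Tally the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)

latin_name = category_counts("Teatro alla Scala")
hangul_name = category_counts("명품극단")

print(dict(latin_name))
print(dict(hangul_name))
```

Hangul syllables and Han ideographs have no case, so they are all classified as Other Letter, which is why that category dominates the table.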
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
단 | 957 | 4.5% |
극 | 629 | 3.0% |
국 | 554 | 2.6% |
학 | 451 | 2.1% |
대 | 441 | 2.1% |
악 | 353 | 1.7% |
회 | 319 | 1.5% |
교 | 315 | 1.5% |
무 | 309 | 1.5% |
김 | 294 | 1.4% |
Other values (1008) | 16630 | 78.3% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1323 | 13.8% |
a | 1229 | 12.8% |
r | 1021 | 10.6% |
t | 692 | 7.2% |
o | 692 | 7.2% |
l | 673 | 7.0% |
n | 612 | 6.4% |
i | 602 | 6.3% |
h | 513 | 5.3% |
s | 482 | 5.0% |
Other values (16) | 1755 | 18.3% |
Uppercase Letter
Value | Count | Frequency (%) |
O | 281 | 13.0% |
B | 203 | 9.4% |
T | 192 | 8.9% |
C | 177 | 8.2% |
S | 173 | 8.0% |
M | 109 | 5.0% |
L | 106 | 4.9% |
A | 102 | 4.7% |
R | 96 | 4.4% |
N | 84 | 3.9% |
Other values (16) | 636 | 29.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 37 | 24.7% |
0 | 36 | 24.0% |
2 | 34 | 22.7% |
3 | 10 | 6.7% |
6 | 9 | 6.0% |
4 | 7 | 4.7% |
9 | 5 | 3.3% |
8 | 5 | 3.3% |
5 | 4 | 2.7% |
7 | 3 | 2.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 529 | 91.5% |
. | 34 | 5.9% |
& | 8 | 1.4% |
/ | 4 | 0.7% |
: | 2 | 0.3% |
' | 1 | 0.2% |
Math Symbol
Value | Count | Frequency (%) |
< | 4 | 40.0% |
> | 4 | 40.0% |
+ | 2 | 20.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 4031 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 507 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 505 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16 | 100.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 19570 | 50.4% |
Latin | 11753 | 30.3% |
Common | 5799 | 14.9% |
Han | 1682 | 4.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
단 | 957 | 4.9% |
극 | 629 | 3.2% |
국 | 554 | 2.8% |
학 | 451 | 2.3% |
대 | 441 | 2.3% |
악 | 353 | 1.8% |
회 | 319 | 1.6% |
교 | 315 | 1.6% |
무 | 309 | 1.6% |
김 | 294 | 1.5% |
Other values (589) | 14948 | 76.4% |
Han
Value | Count | Frequency (%) |
劇 | 83 | 4.9% |
團 | 78 | 4.6% |
金 | 68 | 4.0% |
李 | 44 | 2.6% |
樂 | 36 | 2.1% |
會 | 36 | 2.1% |
大 | 32 | 1.9% |
學 | 31 | 1.8% |
子 | 27 | 1.6% |
場 | 26 | 1.5% |
Other values (409) | 1221 | 72.6% |
Latin
Value | Count | Frequency (%) |
e | 1323 | 11.3% |
a | 1229 | 10.5% |
r | 1021 | 8.7% |
t | 692 | 5.9% |
o | 692 | 5.9% |
l | 673 | 5.7% |
n | 612 | 5.2% |
i | 602 | 5.1% |
h | 513 | 4.4% |
s | 482 | 4.1% |
Other values (42) | 3914 | 33.3% |
Common
Value | Count | Frequency (%) |
(space) | 4031 | 69.5% |
, | 529 | 9.1% |
) | 507 | 8.7% |
( | 505 | 8.7% |
1 | 37 | 0.6% |
0 | 36 | 0.6% |
. | 34 | 0.6% |
2 | 34 | 0.6% |
- | 16 | 0.3% |
3 | 10 | 0.2% |
Other values (14) | 60 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 19568 | 50.4% |
ASCII | 17552 | 45.2% |
CJK | 1583 | 4.1% |
CJK Compat Ideographs | 99 | 0.3% |
Compat Jamo | 2 | < 0.1% |
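A Unicode block is a contiguous code point range, so a character's block can be found by range lookup. The sketch below hard-codes only the five blocks named in the table; the helper `block_of` and the longer block labels are illustrative. It also shows that a compatibility ideograph such as U+F9E1 normalizes to the unified form 李 (U+674E) under NFC, which may be worth doing before counting or matching names. Whether the source data should be normalized is a judgment call, not something the report states.

```python
import unicodedata

# Code point ranges for the blocks that appear in the table above.
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin (ASCII)"),
    (0x3130, 0x318F, "Hangul Compatibility Jamo"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
    (0xAC00, 0xD7AF, "Hangul Syllables"),
    (0xF900, 0xFAFF, "CJK Compatibility Ideographs"),
]

def block_of(ch):
    """Return the block name for a single character, or 'Other'."""
    code_point = ord(ch)
    for start, end, name in BLOCKS:
        if start <= code_point <= end:
            return name
    return "Other"

compat_li = "\uf9e1"  # compatibility form of 李
unified_li = unicodedata.normalize("NFC", compat_li)

print(block_of("단"), "|", block_of("劇"), "|", block_of("ㄹ"))
print(block_of(compat_li), "->", block_of(unified_li))
```

After normalization the two visually identical forms compare equal, so they would be counted as one character rather than split across two blocks.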
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 4031 | 23.0% |
e | 1323 | 7.5% |
a | 1229 | 7.0% |
r | 1021 | 5.8% |
t | 692 | 3.9% |
o | 692 | 3.9% |
l | 673 | 3.8% |
n | 612 | 3.5% |
i | 602 | 3.4% |
, | 529 | 3.0% |
Other values (66) | 6148 | 35.0% |
Hangul
Value | Count | Frequency (%) |
단 | 957 | 4.9% |
극 | 629 | 3.2% |
국 | 554 | 2.8% |
학 | 451 | 2.3% |
대 | 441 | 2.3% |
악 | 353 | 1.8% |
회 | 319 | 1.6% |
교 | 315 | 1.6% |
무 | 309 | 1.6% |
김 | 294 | 1.5% |
Other values (588) | 14946 | 76.4% |
CJK
Value | Count | Frequency (%) |
劇 | 83 | 5.2% |
團 | 78 | 4.9% |
金 | 68 | 4.3% |
樂 | 36 | 2.3% |
會 | 36 | 2.3% |
大 | 32 | 2.0% |
學 | 31 | 2.0% |
子 | 27 | 1.7% |
場 | 26 | 1.6% |
國 | 22 | 1.4% |
Other values (385) | 1144 | 72.3% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 44 | 44.4% |
女 | 9 | 9.1% |
林 | 9 | 9.1% |
金 | 4 | 4.0% |
梁 | 4 | 4.0% |
盧 | 3 | 3.0% |
烈 | 3 | 3.0% |
聯 | 2 | 2.0% |
禮 | 2 | 2.0% |
樂 | 2 | 2.0% |
Other values (14) | 17 | 17.2% |
Compat Jamo
Value | Count | Frequency (%) |
ㄹ | 2 | 100.0% |
비고
Text
MISSING
 
Distinct | 45 |
---|---|
Distinct (%) | 64.3% |
Missing | 3339 |
Missing (%) | 97.9% |
Memory size | 26.8 KiB |
Length
Max length | 26 |
---|---|
Median length | 24 |
Mean length | 9.3571429 |
Min length | 2 |
Characters and Unicode
Total characters | 655 |
---|---|
Distinct characters | 148 |
Distinct categories | 8 |
Distinct scripts | 4 |
Distinct blocks | 3 |
Unique
Unique | 39 |
---|---|
Unique (%) | 55.7% |
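For a column that is almost entirely empty, it helps to see which denominator each percentage uses. The reported figures are consistent with the missing percentage being taken over all rows, and the distinct, unique, and mean-length figures being taken over the non-missing rows only. The sketch below rechecks this from the values above; the variable names are illustrative.

```python
# Values copied from the tables for this variable.
n_rows = 3409
missing = 3339
distinct = 45
unique = 39
total_characters = 655

present = n_rows - missing                  # rows that actually hold a remark

missing_pct = 100 * missing / n_rows        # relative to all rows
distinct_pct = 100 * distinct / present     # relative to non-missing rows
unique_pct = 100 * unique / present         # relative to non-missing rows
mean_length = total_characters / present    # characters per non-missing row

print(f"Non-missing rows: {present}")
print(f"Missing: {missing_pct:.1f}%")
print(f"Distinct: {distinct_pct:.1f}%  Unique: {unique_pct:.1f}%")
print(f"Mean length: {mean_length:.7f}")
```

All statistics for this variable therefore describe only the small set of rows that contain a remark.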
Sample
1st row | 무용인 |
---|---|
2nd row | 교수극단 (셰익스피어 아해들) 원어연극 |
3rd row | 한양대 교수 |
4th row | 한국예술종합학교 교수 |
5th row | 고려대 교수 |
Value | Count | Frequency (%) |
소리꾼 | 15 | 10.3% |
교수 | 11 | 7.5% |
참가 | 8 | 5.5% |
2014 | 8 | 5.5% |
국립극장 | 6 | 4.1% |
여우락(樂)페스티벌 | 6 | 4.1% |
국립극장에서 | 4 | 2.7% |
분리됨 | 4 | 2.7% |
고려대 | 3 | 2.1% |
2000년 | 3 | 2.1% |
Other values (68) | 78 | 53.4% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 76 | 11.6% |
리 | 22 | 3.4% |
국 | 20 | 3.1% |
0 | 19 | 2.9% |
소 | 16 | 2.4% |
꾼 | 15 | 2.3% |
극 | 14 | 2.1% |
교 | 14 | 2.1% |
수 | 14 | 2.1% |
장 | 13 | 2.0% |
Other values (138) | 432 | 66.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 460 | 70.2% |
Space Separator | 76 | 11.6% |
Decimal Number | 48 | 7.3% |
Lowercase Letter | 48 | 7.3% |
Close Punctuation | 8 | 1.2% |
Open Punctuation | 7 | 1.1% |
Uppercase Letter | 5 | 0.8% |
Other Punctuation | 3 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
리 | 22 | 4.8% |
국 | 20 | 4.3% |
소 | 16 | 3.5% |
꾼 | 15 | 3.3% |
극 | 14 | 3.0% |
교 | 14 | 3.0% |
수 | 14 | 3.0% |
장 | 13 | 2.8% |
립 | 13 | 2.8% |
가 | 13 | 2.8% |
Other values (110) | 306 | 66.5% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 10 | 20.8% |
m | 6 | 12.5% |
s | 5 | 10.4% |
n | 4 | 8.3% |
a | 4 | 8.3% |
r | 4 | 8.3% |
y | 3 | 6.2% |
t | 2 | 4.2% |
i | 2 | 4.2% |
b | 2 | 4.2% |
Other values (5) | 6 | 12.5% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 1 | 20.0% |
A | 1 | 20.0% |
T | 1 | 20.0% |
D | 1 | 20.0% |
E | 1 | 20.0% |
Decimal Number
Value | Count | Frequency (%) |
0 | 19 | 39.6% |
2 | 12 | 25.0% |
1 | 9 | 18.8% |
4 | 8 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
(space) | 76 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 8 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 7 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 454 | 69.3% |
Common | 142 | 21.7% |
Latin | 53 | 8.1% |
Han | 6 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
리 | 22 | 4.8% |
국 | 20 | 4.4% |
소 | 16 | 3.5% |
꾼 | 15 | 3.3% |
극 | 14 | 3.1% |
교 | 14 | 3.1% |
수 | 14 | 3.1% |
장 | 13 | 2.9% |
립 | 13 | 2.9% |
가 | 13 | 2.9% |
Other values (109) | 300 | 66.1% |
Latin
Value | Count | Frequency (%) |
e | 10 | 18.9% |
m | 6 | 11.3% |
s | 5 | 9.4% |
n | 4 | 7.5% |
a | 4 | 7.5% |
r | 4 | 7.5% |
y | 3 | 5.7% |
t | 2 | 3.8% |
i | 2 | 3.8% |
b | 2 | 3.8% |
Other values (10) | 11 | 20.8% |
Common
Value | Count | Frequency (%) |
(space) | 76 | 53.5% |
0 | 19 | 13.4% |
2 | 12 | 8.5% |
1 | 9 | 6.3% |
) | 8 | 5.6% |
4 | 8 | 5.6% |
( | 7 | 4.9% |
, | 3 | 2.1% |
Han
Value | Count | Frequency (%) |
樂 | 6 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 454 | 69.3% |
ASCII | 195 | 29.8% |
CJK Compat Ideographs | 6 | 0.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 76 | 39.0% |
0 | 19 | 9.7% |
2 | 12 | 6.2% |
e | 10 | 5.1% |
1 | 9 | 4.6% |
) | 8 | 4.1% |
4 | 8 | 4.1% |
( | 7 | 3.6% |
m | 6 | 3.1% |
s | 5 | 2.6% |
Other values (18) | 35 | 17.9% |
Hangul
Value | Count | Frequency (%) |
리 | 22 | 4.8% |
국 | 20 | 4.4% |
소 | 16 | 3.5% |
꾼 | 15 | 3.3% |
극 | 14 | 3.1% |
교 | 14 | 3.1% |
수 | 14 | 3.1% |
장 | 13 | 2.9% |
립 | 13 | 2.9% |
가 | 13 | 2.9% |
Other values (109) | 300 | 66.1% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
樂 | 6 | 100.0% |
등록일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 26.8 KiB |
2013-03-27 | 3409 |
---|---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2013-03-27 |
---|---|
2nd row | 2013-03-27 |
3rd row | 2013-03-27 |
4th row | 2013-03-27 |
5th row | 2013-03-27 |
Common Values
Value | Count | Frequency (%) |
2013-03-27 | 3409 | 100.0% |
| 조직코드 | 조직유형 | 비고 |
---|---|---|---|
조직코드 | 1.000 | 0.774 | 0.978 |
조직유형 | 0.774 | 1.000 | NaN |
비고 | 0.978 | NaN | 1.000 |
| 조직코드 | 조직유형 |
---|---|---|
조직코드 | 1.000 | 0.654 |
조직유형 | 0.654 | 1.000 |
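The report does not say which coefficient each matrix uses. When a categorical column is involved, a common choice is Cramér's V, which is derived from the chi-squared statistic of a contingency table and ranges from 0 (no association) to 1 (perfect association). The sketch below is a plain implementation without bias correction on hypothetical tables; it is an illustration of the measure and is not claimed to reproduce the values above.

```python
import math

def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows.

    Assumes at least two rows and two columns, and that every row and
    column has a non-zero total.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Chi-squared statistic: squared deviation from the independence
    # expectation, scaled by that expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical tables: perfectly associated, then fully independent.
perfect = cramers_v([[10, 0], [0, 10]])
independent = cramers_v([[5, 5], [5, 5]])
print(perfect, independent)
```

A NaN entry, as seen between 조직유형 and 비고, typically means the coefficient could not be computed for that pair, for example because too few rows have both values present.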
| 조직코드 | 조직유형 | 조직명 | 비고 | 등록일 |
---|---|---|---|---|---|
0 | 353 | 1 | 코리안체임버 오페라단 | <NA> | 2013-03-27 |
1 | 354 | 1 | 명품극단 | <NA> | 2013-03-27 |
2 | 355 | 1 | Haydn di Bolzano e Trento | <NA> | 2013-03-27 |
3 | 356 | 1 | Teatro alla Scala | <NA> | 2013-03-27 |
4 | 357 | 1 | The Dance Theatre of Harlem | <NA> | 2013-03-27 |
5 | 358 | 1 | Del Teatro la Fenice | <NA> | 2013-03-27 |
6 | 359 | 1 | Age of Englihtement | <NA> | 2013-03-27 |
7 | 360 | 1 | St. Thomas Boys Choir Leipzig | <NA> | 2013-03-27 |
8 | 361 | 1 | 홍신자 | <NA> | 2013-03-27 |
9 | 362 | 1 | 정은혜 | 무용인 | 2013-03-27 |
| 조직코드 | 조직유형 | 조직명 | 비고 | 등록일 |
---|---|---|---|---|---|
3399 | 2319 | 1 | 사단법인 벽사춤아카데미 | <NA> | 2013-03-27 |
3400 | 2320 | 1 | 극단예맥 | <NA> | 2013-03-27 |
3401 | 2321 | 1 | 그레이스여성성가단 외 | <NA> | 2013-03-27 |
3402 | 2322 | 1 | 우재현 외 | <NA> | 2013-03-27 |
3403 | 2323 | 1 | 한국음악극연구소 | <NA> | 2013-03-27 |
3404 | 2324 | 1 | Labo. C.J.K | <NA> | 2013-03-27 |
3405 | 2325 | 1 | 국립극장 공연사업팀 | <NA> | 2013-03-27 |
3406 | 2326 | 1 | 김준희의 나비6 | <NA> | 2013-03-27 |
3407 | 2327 | 1 | Pacific Northwest Ballet | <NA> | 2013-03-27 |
3408 | 2328 | 1 | The Bavarian State Ballet | <NA> | 2013-03-27 |