Overview

Dataset statistics

Number of variables: 15
Number of observations: 4352
Missing cells: 13351
Missing cells (%): 20.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 539.9 KiB
Average record size in memory: 127.0 B
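
The summary figures above can be cross-checked against one another and against the per-variable missing counts reported in the variable sections of this report. The following is a small sketch of that arithmetic. The variable names are illustrative only, and the six missing counts are copied from the per-variable statistics (제작연도 7, 원동화지면수 181, MOV수량 4184, MP4수량 4195, 총러닝타임 4171, 제작사 613).

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 15
n_observations = 4352
missing_cells = 13351
total_size_kib = 539.9

# Total number of cells and the share of them that are missing.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Per-variable missing counts as reported in the variable sections.
missing_by_column = {
    "제작연도": 7,
    "원동화지면수": 181,
    "MOV수량": 4184,
    "MP4수량": 4195,
    "총러닝타임": 4171,
    "제작사": 613,
}

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing: {missing_pct:.1f}%")
print(f"sum of per-column missing counts: {sum(missing_by_column.values())}")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The per-column counts add up to the reported number of missing cells, so the remaining nine variables contribute none. In particular, the `<NA>` entries shown for 기타자료수량 and 원본자료종류 are counted as category values rather than as missing cells.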

Variable types

Categorical: 6
Numeric: 6
Text: 2
DateTime: 1

Dataset

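Description: List of results from the 2020 Animation Digital Archiving Project (project period: October 13, 2020 to March 31, 2021). Fields provided: 사업공정 (project process), 구분/추천기관 (category / recommending institution), 순번 (sequence number), 작품명 (title), 원동화지 면수 (number of key-animation and in-between drawing sheets), 영상 편수(편) / 씬수 (number of episodes / scenes), MOV file count, MP4 file count, other file count (mpg, avi, etc.), 원본자료 종류 (source material type), 원본자료 상세 (source material detail), 총 러닝타임 (total running time, HH:MM:SS), 제작사 (production company), 소장처 (holding institution). Totals: 5,813 episodes, 7,204 MOV files, 2,214 MP4 files, 182 other files.
Author: 한국영상자료원 (Korean Film Archive)
URL: https://www.data.go.kr/data/15097529/fileData.do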

Alerts

소장처 is highly overall correlated with 제작연도 and 6 other fields (High correlation)
원본자료상세 is highly overall correlated with 제작연도 and 6 other fields (High correlation)
추천기관 is highly overall correlated with 순번 and 6 other fields (High correlation)
사업공정 is highly overall correlated with 제작연도 and 6 other fields (High correlation)
기타자료수량 is highly overall correlated with 순번 and 7 other fields (High correlation)
원본자료종류 is highly overall correlated with 순번 and 6 other fields (High correlation)
순번 is highly overall correlated with 제작연도 and 5 other fields (High correlation)
제작연도 is highly overall correlated with 순번 and 8 other fields (High correlation)
원동화지면수 is highly overall correlated with 사업공정 and 1 other field (High correlation)
편수씬수 is highly overall correlated with MOV수량 and 3 other fields (High correlation)
MOV수량 is highly overall correlated with 순번 and 3 other fields (High correlation)
MP4수량 is highly overall correlated with 순번 and 3 other fields (High correlation)
사업공정 is highly imbalanced (82.2%) (Imbalance)
기타자료수량 is highly imbalanced (99.3%) (Imbalance)
원본자료종류 is highly imbalanced (85.5%) (Imbalance)
원본자료상세 is highly imbalanced (88.9%) (Imbalance)
소장처 is highly imbalanced (66.7%) (Imbalance)
원동화지면수 has 181 (4.2%) missing values (Missing)
MOV수량 has 4184 (96.1%) missing values (Missing)
MP4수량 has 4195 (96.4%) missing values (Missing)
총러닝타임 has 4171 (95.8%) missing values (Missing)
제작사 has 613 (14.1%) missing values (Missing)
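
The report does not say how the imbalance percentages in the alerts above are derived. They appear consistent with one minus the normalized Shannon entropy of the category counts. This is an inference, not something stated in the report, and `imbalance_score` is a hypothetical helper written for this check. A minimal sketch under that assumption, using the category counts listed for 사업공정 (4171, 139, 42) and 기타자료수량 (4348, 3, 1):

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the counts.

    Assumed formula (not confirmed by the report); 0 means perfectly
    balanced classes, values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumption made here: a single class is reported as 0.
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)


# Category counts copied from the variable sections of this report.
process_counts = [4171, 139, 42]   # 사업공정
other_counts = [4348, 3, 1]        # 기타자료수량

print(f"사업공정: {100 * imbalance_score(process_counts):.1f}%")
print(f"기타자료수량: {100 * imbalance_score(other_counts):.1f}%")
```

Under this assumption the sketch reproduces the 82.2% and 99.3% figures in the alerts, which suggests that the score measures how far the class distribution is from uniform rather than the share of the largest class.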

Reproduction

Analysis started: 2023-12-12 17:09:15.575192
Analysis finished: 2023-12-12 17:09:21.147501
Duration: 5.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업공정
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
극장용 애니메이션 영화자료 디지털화
4171 
구작 TV애니메이션(테이프) 디지털화
 
139
파일보존 디지털화
 
42

Length

Max length20
Median length19
Mean length18.935432
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구작 TV애니메이션(테이프) 디지털화
2nd row구작 TV애니메이션(테이프) 디지털화
3rd row구작 TV애니메이션(테이프) 디지털화
4th row구작 TV애니메이션(테이프) 디지털화
5th row구작 TV애니메이션(테이프) 디지털화

Common Values

ValueCountFrequency (%)
극장용 애니메이션 영화자료 디지털화 4171
95.8%
구작 TV애니메이션(테이프) 디지털화 139
 
3.2%
파일보존 디지털화 42
 
1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
디지털화 4352
25.3%
극장용 4171
24.3%
애니메이션 4171
24.3%
영화자료 4171
24.3%
구작 139
 
0.8%
tv애니메이션(테이프 139
 
0.8%
파일보존 42
 
0.2%

추천기관
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
한국영상자료원
2263 
애니메이션박물관
1950 
한국독립애니메이션협회
 
81
한국애니메이션제작자협회
 
50
한국애니메이션산업협회
 
8

Length

Max length12
Median length7
Mean length7.5873162
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국애니메이션제작자협회
2nd row한국애니메이션제작자협회
3rd row한국애니메이션제작자협회
4th row한국애니메이션제작자협회
5th row한국애니메이션제작자협회

Common Values

ValueCountFrequency (%)
한국영상자료원 2263
52.0%
애니메이션박물관 1950
44.8%
한국독립애니메이션협회 81
 
1.9%
한국애니메이션제작자협회 50
 
1.1%
한국애니메이션산업협회 8
 
0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
한국영상자료원 2263
52.0%
애니메이션박물관 1950
44.8%
한국독립애니메이션협회 81
 
1.9%
한국애니메이션제작자협회 50
 
1.1%
한국애니메이션산업협회 8
 
0.2%

순번
Real number (ℝ)

HIGH CORRELATION 

Distinct4171
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2001.6864
Minimum1
Maximum4171
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.4 KiB

Quantile statistics

Minimum1
5-th percentile88.55
Q1907.75
median1995.5
Q33083.25
95-th percentile3953.45
Maximum4171
Range4170
Interquartile range (IQR)2175.5

Descriptive statistics

Standard deviation1246.4821
Coefficient of variation (CV)0.622716
Kurtosis-1.226444
Mean2001.6864
Median Absolute Deviation (MAD)1088
Skewness0.024064326
Sum8711339
Variance1553717.6
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 3
 
0.1%
34 3
 
0.1%
25 3
 
0.1%
26 3
 
0.1%
27 3
 
0.1%
28 3
 
0.1%
29 3
 
0.1%
30 3
 
0.1%
32 3
 
0.1%
31 3
 
0.1%
Other values (4161) 4322
99.3%
ValueCountFrequency (%)
1 3
0.1%
2 3
0.1%
3 3
0.1%
4 3
0.1%
5 3
0.1%
6 3
0.1%
7 3
0.1%
8 3
0.1%
9 3
0.1%
10 3
0.1%
ValueCountFrequency (%)
4171 1
< 0.1%
4170 1
< 0.1%
4169 1
< 0.1%
4168 1
< 0.1%
4167 1
< 0.1%
4166 1
< 0.1%
4165 1
< 0.1%
4164 1
< 0.1%
4163 1
< 0.1%
4162 1
< 0.1%

제작연도
Real number (ℝ)

HIGH CORRELATION 

Distinct36
Distinct (%)0.8%
Missing7
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean2007.5337
Minimum1979
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.4 KiB

Quantile statistics

Minimum1979
5-th percentile2003
Q12003
median2011
Q32011
95-th percentile2014
Maximum2020
Range41
Interquartile range (IQR)8

Descriptive statistics

Standard deviation4.8760254
Coefficient of variation (CV)0.0024288635
Kurtosis-0.38512228
Mean2007.5337
Median Absolute Deviation (MAD)3
Skewness-0.25342472
Sum8722734
Variance23.775624
MonotonicityNot monotonic
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
2003 1954
44.9%
2011 1481
34.0%
2014 614
 
14.1%
2012 134
 
3.1%
1997 24
 
0.6%
2000 15
 
0.3%
1996 13
 
0.3%
1999 13
 
0.3%
2010 11
 
0.3%
2019 8
 
0.2%
Other values (26) 78
 
1.8%
ValueCountFrequency (%)
1979 1
 
< 0.1%
1981 1
 
< 0.1%
1983 2
< 0.1%
1984 1
 
< 0.1%
1985 1
 
< 0.1%
1986 1
 
< 0.1%
1991 1
 
< 0.1%
1992 3
0.1%
1993 1
 
< 0.1%
1994 3
0.1%
ValueCountFrequency (%)
2020 4
 
0.1%
2019 8
 
0.2%
2018 3
 
0.1%
2017 2
 
< 0.1%
2016 6
 
0.1%
2015 2
 
< 0.1%
2014 614
14.1%
2013 1
 
< 0.1%
2012 134
 
3.1%
2011 1481
34.0%

작품명
Text

Distinct190
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size34.1 KiB

Length

Max length41
Median length33
Mean length11.252528
Min length2

Characters and Unicode

Total characters48971
Distinct characters388
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique175 ?
Unique (%)4.0%

Sample

1st row내일은 월드컵
2nd row엄마찾아 삼만리
3rd row별나라 삼총사
4th row타임머신 001
5th row15소년 우주 표류기
ValueCountFrequency (%)
동화 2178
20.5%
오세암 1950
18.4%
소중한날의꿈(원화 1445
13.6%
작화 1026
9.7%
배경 926
8.7%
원화 762
 
7.2%
355
 
3.3%
운수 354
 
3.3%
좋은 354
 
3.3%
봄봄 258
 
2.4%
Other values (302) 993
9.4%

Most occurring characters

ValueCountFrequency (%)
6250
 
12.8%
5442
 
11.1%
( 4207
 
8.6%
) 4207
 
8.6%
2210
 
4.5%
, 2190
 
4.5%
2183
 
4.5%
1953
 
4.0%
1952
 
4.0%
1951
 
4.0%
Other values (378) 16426
33.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31766
64.9%
Space Separator 6250
 
12.8%
Open Punctuation 4207
 
8.6%
Close Punctuation 4207
 
8.6%
Other Punctuation 2238
 
4.6%
Lowercase Letter 145
 
0.3%
Uppercase Letter 105
 
0.2%
Decimal Number 49
 
0.1%
Connector Punctuation 3
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5442
17.1%
2210
 
7.0%
2183
 
6.9%
1953
 
6.1%
1952
 
6.1%
1951
 
6.1%
1835
 
5.8%
1493
 
4.7%
1492
 
4.7%
1481
 
4.7%
Other values (314) 9774
30.8%
Lowercase Letter
ValueCountFrequency (%)
e 17
 
11.7%
i 12
 
8.3%
u 11
 
7.6%
o 10
 
6.9%
a 10
 
6.9%
t 9
 
6.2%
g 8
 
5.5%
s 8
 
5.5%
l 7
 
4.8%
d 7
 
4.8%
Other values (13) 46
31.7%
Uppercase Letter
ValueCountFrequency (%)
S 13
12.4%
B 11
 
10.5%
D 9
 
8.6%
E 8
 
7.6%
A 7
 
6.7%
M 6
 
5.7%
F 5
 
4.8%
R 5
 
4.8%
O 5
 
4.8%
I 4
 
3.8%
Other values (13) 32
30.5%
Other Punctuation
ValueCountFrequency (%)
, 2190
97.9%
: 29
 
1.3%
/ 6
 
0.3%
! 5
 
0.2%
# 3
 
0.1%
. 3
 
0.1%
' 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 21
42.9%
2 16
32.7%
3 5
 
10.2%
0 4
 
8.2%
4 2
 
4.1%
5 1
 
2.0%
Space Separator
ValueCountFrequency (%)
6250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4207
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4207
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31764
64.9%
Common 16955
34.6%
Latin 250
 
0.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5442
17.1%
2210
 
7.0%
2183
 
6.9%
1953
 
6.1%
1952
 
6.1%
1951
 
6.1%
1835
 
5.8%
1493
 
4.7%
1492
 
4.7%
1481
 
4.7%
Other values (312) 9772
30.8%
Latin
ValueCountFrequency (%)
e 17
 
6.8%
S 13
 
5.2%
i 12
 
4.8%
B 11
 
4.4%
u 11
 
4.4%
o 10
 
4.0%
a 10
 
4.0%
D 9
 
3.6%
t 9
 
3.6%
g 8
 
3.2%
Other values (36) 140
56.0%
Common
ValueCountFrequency (%)
6250
36.9%
( 4207
24.8%
) 4207
24.8%
, 2190
 
12.9%
: 29
 
0.2%
1 21
 
0.1%
2 16
 
0.1%
/ 6
 
< 0.1%
! 5
 
< 0.1%
3 5
 
< 0.1%
Other values (8) 19
 
0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31764
64.9%
ASCII 17205
35.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6250
36.3%
( 4207
24.5%
) 4207
24.5%
, 2190
 
12.7%
: 29
 
0.2%
1 21
 
0.1%
e 17
 
0.1%
2 16
 
0.1%
S 13
 
0.1%
i 12
 
0.1%
Other values (54) 243
 
1.4%
Hangul
ValueCountFrequency (%)
5442
17.1%
2210
 
7.0%
2183
 
6.9%
1953
 
6.1%
1952
 
6.1%
1951
 
6.1%
1835
 
5.8%
1493
 
4.7%
1492
 
4.7%
1481
 
4.7%
Other values (312) 9772
30.8%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

원동화지면수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct275
Distinct (%)6.6%
Missing181
Missing (%)4.2%
Infinite0
Infinite (%)0.0%
Mean46.766003
Minimum1
Maximum533
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.4 KiB

Quantile statistics

Minimum1
5-th percentile2
Q15
median28
Q366
95-th percentile158
Maximum533
Range532
Interquartile range (IQR)61

Descriptive statistics

Standard deviation57.923442
Coefficient of variation (CV)1.2385801
Kurtosis11.175474
Mean46.766003
Median Absolute Deviation (MAD)25
Skewness2.6750751
Sum195061
Variance3355.1251
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 376
 
8.6%
3 363
 
8.3%
4 185
 
4.3%
5 139
 
3.2%
6 76
 
1.7%
8 63
 
1.4%
20 52
 
1.2%
28 51
 
1.2%
19 51
 
1.2%
13 50
 
1.1%
Other values (265) 2765
63.5%
(Missing) 181
 
4.2%
ValueCountFrequency (%)
1 49
 
1.1%
2 376
8.6%
3 363
8.3%
4 185
4.3%
5 139
 
3.2%
6 76
 
1.7%
7 46
 
1.1%
8 63
 
1.4%
9 40
 
0.9%
10 34
 
0.8%
ValueCountFrequency (%)
533 1
< 0.1%
528 2
< 0.1%
509 1
< 0.1%
472 1
< 0.1%
453 1
< 0.1%
448 1
< 0.1%
425 1
< 0.1%
404 1
< 0.1%
396 1
< 0.1%
389 1
< 0.1%

편수씬수
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3357077
Minimum1
Maximum104
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile1
Maximum104
Range103
Interquartile range (IQR)0

Descriptive statistics

Standard deviation4.317457
Coefficient of variation (CV)3.2323366
Kurtosis296.32226
Mean1.3357077
Median Absolute Deviation (MAD)0
Skewness16.14462
Sum5813
Variance18.640435
MonotonicityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 4315
99.1%
26 14
 
0.3%
78 6
 
0.1%
39 5
 
0.1%
40 4
 
0.1%
6 2
 
< 0.1%
10 1
 
< 0.1%
14 1
 
< 0.1%
17 1
 
< 0.1%
102 1
 
< 0.1%
Other values (2) 2
 
< 0.1%
ValueCountFrequency (%)
1 4315
99.1%
6 2
 
< 0.1%
10 1
 
< 0.1%
14 1
 
< 0.1%
17 1
 
< 0.1%
26 14
 
0.3%
39 5
 
0.1%
40 4
 
0.1%
52 1
 
< 0.1%
78 6
 
0.1%
ValueCountFrequency (%)
104 1
 
< 0.1%
102 1
 
< 0.1%
78 6
0.1%
52 1
 
< 0.1%
40 4
 
0.1%
39 5
 
0.1%
26 14
0.3%
17 1
 
< 0.1%
14 1
 
< 0.1%
10 1
 
< 0.1%

MOV수량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct32
Distinct (%)19.0%
Missing4184
Missing (%)96.1%
Infinite0
Infinite (%)0.0%
Mean43.113095
Minimum1
Maximum4340
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q325.25
95-th percentile78
Maximum4340
Range4339
Interquartile range (IQR)24.25

Descriptive statistics

Standard deviation336.05431
Coefficient of variation (CV)7.7947154
Kurtosis162.86019
Mean43.113095
Median Absolute Deviation (MAD)0
Skewness12.674962
Sum7243
Variance112932.5
MonotonicityNot monotonic
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
1 101
 
2.3%
26 9
 
0.2%
2 5
 
0.1%
40 5
 
0.1%
54 4
 
0.1%
78 4
 
0.1%
20 3
 
0.1%
6 3
 
0.1%
52 3
 
0.1%
13 3
 
0.1%
Other values (22) 28
 
0.6%
(Missing) 4184
96.1%
ValueCountFrequency (%)
1 101
2.3%
2 5
 
0.1%
3 2
 
< 0.1%
5 1
 
< 0.1%
6 3
 
0.1%
7 2
 
< 0.1%
9 2
 
< 0.1%
10 1
 
< 0.1%
13 3
 
0.1%
15 1
 
< 0.1%
ValueCountFrequency (%)
4340 1
 
< 0.1%
380 1
 
< 0.1%
260 1
 
< 0.1%
157 1
 
< 0.1%
108 1
 
< 0.1%
79 1
 
< 0.1%
78 4
0.1%
57 1
 
< 0.1%
54 4
0.1%
53 1
 
< 0.1%

MP4수량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct27
Distinct (%)17.2%
Missing4195
Missing (%)96.4%
Infinite0
Infinite (%)0.0%
Mean14.101911
Minimum1
Maximum104
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.4 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q326
95-th percentile59.6
Maximum104
Range103
Interquartile range (IQR)25

Descriptive statistics

Standard deviation22.625486
Coefficient of variation (CV)1.604427
Kurtosis2.375067
Mean14.101911
Median Absolute Deviation (MAD)0
Skewness1.756772
Sum2214
Variance511.91262
MonotonicityNot monotonic
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
1 101
 
2.3%
26 11
 
0.3%
78 6
 
0.1%
52 5
 
0.1%
41 3
 
0.1%
6 2
 
< 0.1%
39 2
 
< 0.1%
3 2
 
< 0.1%
54 2
 
< 0.1%
2 2
 
< 0.1%
Other values (17) 21
 
0.5%
(Missing) 4195
96.4%
ValueCountFrequency (%)
1 101
2.3%
2 2
 
< 0.1%
3 2
 
< 0.1%
5 1
 
< 0.1%
6 2
 
< 0.1%
10 1
 
< 0.1%
11 2
 
< 0.1%
14 1
 
< 0.1%
16 1
 
< 0.1%
26 11
 
0.3%
ValueCountFrequency (%)
104 1
 
< 0.1%
79 1
 
< 0.1%
78 6
0.1%
55 1
 
< 0.1%
54 2
 
< 0.1%
52 5
0.1%
50 1
 
< 0.1%
48 1
 
< 0.1%
45 1
 
< 0.1%
43 1
 
< 0.1%

기타자료수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
<NA>
4348 
52
 
3
26
 
1

Length

Max length4
Median length4
Mean length3.9981618
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4348
99.9%
52 3
 
0.1%
26 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4348
99.9%
52 3
 
0.1%
26 1
 
< 0.1%

원본자료종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
<NA>
4171 
비디오테이프
 
122
디지털파일
 
58
기타자료
 
1

Length

Max length6
Median length4
Mean length4.0693934
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row비디오테이프
2nd row비디오테이프
3rd row비디오테이프
4th row비디오테이프
5th row비디오테이프

Common Values

ValueCountFrequency (%)
<NA> 4171
95.8%
비디오테이프 122
 
2.8%
디지털파일 58
 
1.3%
기타자료 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4171
95.8%
비디오테이프 122
 
2.8%
디지털파일 58
 
1.3%
기타자료 1
 
< 0.1%

원본자료상세
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
제작실무자료(원동화지)
4171 
제작실무자료 / TAPE (BETACAM)
 
83
제작실무자료(디지털파일)
 
42
제작실무자료 / TAPE (DIGI-BETA)
 
33
제작실무자료 / 디지털파일
 
16
Other values (3)
 
7

Length

Max length25
Median length12
Mean length12.338465
Min length12

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row제작실무자료 / TAPE (DIGI-BETA)
2nd row제작실무자료 / TAPE (DIGI-BETA)
3rd row제작실무자료 / TAPE (DIGI-BETA)
4th row제작실무자료 / TAPE (DIGI-BETA)
5th row제작실무자료 / TAPE (DIGI-BETA)

Common Values

ValueCountFrequency (%)
제작실무자료(원동화지) 4171
95.8%
제작실무자료 / TAPE (BETACAM) 83
 
1.9%
제작실무자료(디지털파일) 42
 
1.0%
제작실무자료 / TAPE (DIGI-BETA) 33
 
0.8%
제작실무자료 / 디지털파일 16
 
0.4%
제작실무자료 / TAPE (HDCAM) 5
 
0.1%
제작실무자료 / TAPE (HDCAM SR) 1
 
< 0.1%
제작실무자료 / DVD 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
제작실무자료(원동화지 4171
87.8%
제작실무자료 139
 
2.9%
139
 
2.9%
tape 122
 
2.6%
betacam 83
 
1.7%
제작실무자료(디지털파일 42
 
0.9%
digi-beta 33
 
0.7%
디지털파일 16
 
0.3%
hdcam 6
 
0.1%
sr 1
 
< 0.1%

총러닝타임
Date

MISSING 

Distinct172
Distinct (%)95.0%
Missing4171
Missing (%)95.8%
Memory size34.1 KiB
Minimum2023-12-13 00:01:08
Maximum2023-12-13 23:12:51
Histogram with fixed size bins (bins=50)
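
According to the dataset description, 총러닝타임 holds durations in HH:MM:SS form, yet it is typed as Date above, and the reported minimum and maximum both carry the date 2023-12-13. That date looks like a default filled in when the values were parsed, not part of the data. The following is a minimal sketch, assuming the raw values are strings such as 09:44:29, of reading them as durations instead. `parse_running_time` is a hypothetical helper and not part of the original report.

```python
from datetime import timedelta


def parse_running_time(text: str) -> timedelta:
    """Convert an 'HH:MM:SS' string into a duration (hypothetical helper)."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


# Running times copied from sample rows of this report.
samples = ["01:13:00", "09:44:29", "00:07:37"]
durations = [parse_running_time(s) for s in samples]

# Durations can be summed and compared, unlike date-stamped clock times.
total = sum(durations, timedelta())
print(total)
```

Treating the column as a duration keeps statistics such as the minimum (00:01:08) and maximum (23:12:51) meaningful without the attached date.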

제작사
Text

MISSING 

Distinct84
Distinct (%)2.2%
Missing613
Missing (%)14.1%
Memory size34.1 KiB

Length

Max length27
Median length5
Mean length6.3190693
Min length2

Characters and Unicode

Total characters23627
Distinct characters170
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)1.5%

Sample

1st row선우
2nd row선우
3rd row선우
4th row선우
5th row선우
ValueCountFrequency (%)
㈜마고21 1950
37.2%
명상하기 1478
28.2%
연필로 1478
28.2%
㈜연필로명상하기 131
 
2.5%
선우 25
 
0.5%
㈜지앤지엔터테인먼트 13
 
0.2%
ocon 12
 
0.2%
퍼니플럭스 8
 
0.2%
대원미디어 7
 
0.1%
독립애니협회 7
 
0.1%
Other values (89) 134
 
2.6%

Most occurring characters

ValueCountFrequency (%)
2108
 
8.9%
2 1953
 
8.3%
1951
 
8.3%
1950
 
8.3%
1 1950
 
8.3%
1616
 
6.8%
1615
 
6.8%
1611
 
6.8%
1611
 
6.8%
1611
 
6.8%
Other values (160) 5651
23.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15989
67.7%
Decimal Number 3903
 
16.5%
Other Symbol 2108
 
8.9%
Space Separator 1504
 
6.4%
Uppercase Letter 76
 
0.3%
Other Punctuation 21
 
0.1%
Close Punctuation 13
 
0.1%
Open Punctuation 13
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1951
12.2%
1950
12.2%
1616
10.1%
1615
10.1%
1611
10.1%
1611
10.1%
1611
10.1%
1610
10.1%
1609
10.1%
41
 
0.3%
Other values (140) 764
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
O 26
34.2%
N 15
19.7%
C 12
15.8%
A 6
 
7.9%
I 3
 
3.9%
M 3
 
3.9%
L 3
 
3.9%
T 2
 
2.6%
S 2
 
2.6%
B 2
 
2.6%
Other values (2) 2
 
2.6%
Decimal Number
ValueCountFrequency (%)
2 1953
50.0%
1 1950
50.0%
Other Punctuation
ValueCountFrequency (%)
, 16
76.2%
/ 5
 
23.8%
Other Symbol
ValueCountFrequency (%)
2108
100.0%
Space Separator
ValueCountFrequency (%)
1504
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18097
76.6%
Common 5454
 
23.1%
Latin 76
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2108
11.6%
1951
10.8%
1950
10.8%
1616
8.9%
1615
8.9%
1611
8.9%
1611
8.9%
1611
8.9%
1610
8.9%
1609
8.9%
Other values (141) 805
 
4.4%
Latin
ValueCountFrequency (%)
O 26
34.2%
N 15
19.7%
C 12
15.8%
A 6
 
7.9%
I 3
 
3.9%
M 3
 
3.9%
L 3
 
3.9%
T 2
 
2.6%
S 2
 
2.6%
B 2
 
2.6%
Other values (2) 2
 
2.6%
Common
ValueCountFrequency (%)
2 1953
35.8%
1 1950
35.8%
1504
27.6%
, 16
 
0.3%
) 13
 
0.2%
( 13
 
0.2%
/ 5
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15989
67.7%
ASCII 5530
 
23.4%
None 2108
 
8.9%

Most frequent character per block

None
ValueCountFrequency (%)
2108
100.0%
ASCII
ValueCountFrequency (%)
2 1953
35.3%
1 1950
35.3%
1504
27.2%
O 26
 
0.5%
, 16
 
0.3%
N 15
 
0.3%
) 13
 
0.2%
( 13
 
0.2%
C 12
 
0.2%
A 6
 
0.1%
Other values (9) 22
 
0.4%
Hangul
ValueCountFrequency (%)
1951
12.2%
1950
12.2%
1616
10.1%
1615
10.1%
1611
10.1%
1611
10.1%
1611
10.1%
1610
10.1%
1609
10.1%
41
 
0.3%
Other values (140) 764
 
4.8%

소장처
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
한국영상자료원
2263 
애니메이션박물관
1950 
독립애니협회
 
81
엔팝
 
25
㈜지앤지엔터테인먼트
 
8
Other values (8)
 
25

Length

Max length10
Median length7
Mean length7.3915441
Min length2

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row엔팝
2nd row엔팝
3rd row엔팝
4th row엔팝
5th row엔팝

Common Values

ValueCountFrequency (%)
한국영상자료원 2263
52.0%
애니메이션박물관 1950
44.8%
독립애니협회 81
 
1.9%
엔팝 25
 
0.6%
㈜지앤지엔터테인먼트 8
 
0.2%
춘천 7
 
0.2%
영상자료원 6
 
0.1%
대원미디어 5
 
0.1%
㈜아이코닉스 3
 
0.1%
구로 1
 
< 0.1%
Other values (3) 3
 
0.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
한국영상자료원 2263
52.0%
애니메이션박물관 1950
44.8%
독립애니협회 81
 
1.9%
엔팝 25
 
0.6%
㈜지앤지엔터테인먼트 8
 
0.2%
춘천 7
 
0.2%
영상자료원 6
 
0.1%
대원미디어 5
 
0.1%
㈜아이코닉스 3
 
0.1%
구로 1
 
< 0.1%
Other values (3) 3
 
0.1%

Interactions

[Pairwise interaction scatter plots (36 panels); images not reproduced.]

Correlations

variable | 사업공정 | 추천기관 | 순번 | 제작연도 | 원동화지면수 | 편수씬수 | MOV수량 | MP4수량 | 기타자료수량 | 원본자료종류 | 원본자료상세 | 제작사 | 소장처
사업공정 | 1.000 | 0.720 | 0.521 | 0.843 | NaN | 0.455 | 0.000 | 0.460 | NaN | 0.532 | 1.000 | 1.000 | 0.835
추천기관 | 0.720 | 1.000 | 0.855 | 0.964 | 0.010 | 0.642 | 0.159 | 0.746 | NaN | 0.610 | 0.855 | 0.995 | 1.000
순번 | 0.521 | 0.855 | 1.000 | 0.860 | 0.393 | 0.155 | NaN | NaN | NaN | NaN | 0.380 | 0.850 | 0.660
제작연도 | 0.843 | 0.964 | 0.860 | 1.000 | 0.113 | 0.385 | 0.000 | 0.589 | 1.000 | 0.718 | 0.769 | 0.988 | 0.845
원동화지면수 | NaN | 0.010 | 0.393 | 0.113 | 1.000 | NaN | NaN | NaN | NaN | NaN | NaN | 0.066 | 0.010
편수씬수 | 0.455 | 0.642 | 0.155 | 0.385 | NaN | 1.000 | 0.000 | 0.817 | NaN | 0.000 | 0.595 | 0.875 | 0.818
MOV수량 | 0.000 | 0.159 | NaN | 0.000 | NaN | 0.000 | 1.000 | 0.000 | NaN | 0.040 | 0.000 | 1.000 | 0.000
MP4수량 | 0.460 | 0.746 | NaN | 0.589 | NaN | 0.817 | 0.000 | 1.000 | NaN | 0.449 | 0.527 | 0.000 | 0.763
기타자료수량 | NaN | NaN | NaN | 1.000 | NaN | NaN | NaN | NaN | 1.000 | NaN | NaN | 0.000 | NaN
원본자료종류 | 0.532 | 0.610 | NaN | 0.718 | NaN | 0.000 | 0.040 | 0.449 | NaN | 1.000 | 1.000 | 0.710 | 0.939
원본자료상세 | 1.000 | 0.855 | 0.380 | 0.769 | NaN | 0.595 | 0.000 | 0.527 | NaN | 1.000 | 1.000 | 0.968 | 0.926
제작사 | 1.000 | 0.995 | 0.850 | 0.988 | 0.066 | 0.875 | 1.000 | 0.000 | 0.000 | 0.710 | 0.968 | 1.000 | 0.994
소장처 | 0.835 | 1.000 | 0.660 | 0.845 | 0.010 | 0.818 | 0.000 | 0.763 | NaN | 0.939 | 0.926 | 0.994 | 1.000

variable | 소장처 | 원본자료상세 | 추천기관 | 사업공정 | 기타자료수량 | 원본자료종류
소장처 | 1.000 | 0.769 | 0.999 | 0.708 | 1.000 | 0.703
원본자료상세 | 0.769 | 1.000 | 0.747 | 0.999 | 1.000 | 0.989
추천기관 | 0.999 | 0.747 | 1.000 | 0.710 | 1.000 | 0.625
사업공정 | 0.708 | 0.999 | 0.710 | 1.000 | 1.000 | 0.796
기타자료수량 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
원본자료종류 | 0.703 | 0.989 | 0.625 | 0.796 | 1.000 | 1.000

variable | 순번 | 제작연도 | 원동화지면수 | 편수씬수 | MOV수량 | MP4수량 | 사업공정 | 추천기관 | 기타자료수량 | 원본자료종류 | 원본자료상세 | 소장처
순번 | 1.000 | -0.577 | -0.403 | -0.154 | -0.566 | -0.508 | 0.366 | 0.522 | 1.000 | 1.000 | 0.192 | 0.346
제작연도 | -0.577 | 1.000 | 0.200 | -0.043 | 0.764 | 0.689 | 0.756 | 0.739 | 0.707 | 0.568 | 0.512 | 0.559
원동화지면수 | -0.403 | 0.200 | 1.000 | NaN | NaN | NaN | 1.000 | 0.007 | 0.000 | 0.000 | 1.000 | 0.007
편수씬수 | -0.154 | -0.043 | NaN | 1.000 | 0.617 | 0.752 | 0.344 | 0.483 | 1.000 | 0.000 | 0.372 | 0.562
MOV수량 | -0.566 | 0.764 | NaN | 0.617 | 1.000 | 0.926 | 0.000 | 0.104 | NaN | 0.066 | 0.000 | 0.000
MP4수량 | -0.508 | 0.689 | NaN | 0.752 | 0.926 | 1.000 | 0.339 | 0.409 | NaN | 0.314 | 0.315 | 0.443
사업공정 | 0.366 | 0.756 | 1.000 | 0.344 | 0.000 | 0.339 | 1.000 | 0.710 | 1.000 | 0.796 | 0.999 | 0.708
추천기관 | 0.522 | 0.739 | 0.007 | 0.483 | 0.104 | 0.409 | 0.710 | 1.000 | 1.000 | 0.625 | 0.747 | 0.999
기타자료수량 | 1.000 | 0.707 | 0.000 | 1.000 | NaN | NaN | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
원본자료종류 | 1.000 | 0.568 | 0.000 | 0.000 | 0.066 | 0.314 | 0.796 | 0.625 | 1.000 | 1.000 | 0.989 | 0.703
원본자료상세 | 0.192 | 0.512 | 1.000 | 0.372 | 0.000 | 0.315 | 0.999 | 0.747 | 1.000 | 0.989 | 1.000 | 0.769
소장처 | 0.346 | 0.559 | 0.007 | 0.562 | 0.000 | 0.443 | 0.708 | 0.999 | 1.000 | 0.703 | 0.769 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

index | 사업공정 | 추천기관 | 순번 | 제작연도 | 작품명 | 원동화지면수 | 편수씬수 | MOV수량 | MP4수량 | 기타자료수량 | 원본자료종류 | 원본자료상세 | 총러닝타임 | 제작사 | 소장처
0 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 1 | 1996 | 내일은 월드컵 | <NA> | 1 | 1 | 1 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 01:13:00 | 선우 | 엔팝
1 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 2 | 1981 | 엄마찾아 삼만리 | <NA> | 1 | 1 | 1 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 01:23:00 | 선우 | 엔팝
2 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 3 | 1979 | 별나라 삼총사 | <NA> | 1 | 1 | 1 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 01:13:00 | 선우 | 엔팝
3 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 4 | 1996 | 타임머신 001 | <NA> | 1 | 1 | 1 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 01:11:00 | 선우 | 엔팝
4 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 5 | 1996 | 15소년 우주 표류기 | <NA> | 1 | 1 | 1 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 01:18:00 | 선우 | 엔팝
5 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 6 | 1999 | 마일로의 대모험 (한국어) | <NA> | 26 | 27 | 27 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 09:44:29 | 선우 | 엔팝
6 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 7 | 1999 | 마일로의 대모험 (영어) / Milo's Bug Quest | <NA> | 26 | 26 | 26 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 09:34:12 | 선우 | 엔팝
7 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 8 | 2002 | 스페이스 힙합덕 (한국어) | <NA> | 26 | 27 | 55 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 09:38:21 | 선우 | 엔팝
8 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 9 | 2002 | 스페이스 힙합덕 (영어) / Space Hiphop Duck | <NA> | 26 | 6 | 6 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 02:13:50 | 선우 | 엔팝
9 | 구작 TV애니메이션(테이프) 디지털화 | 한국애니메이션제작자협회 | 10 | 2002 | 스페이스 힙합덕 (중국어/자막) | <NA> | 26 | 13 | 52 | <NA> | 비디오테이프 | 제작실무자료 / TAPE (DIGI-BETA) | 11:45:01 | 선우 | 엔팝
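
Last rows

index | 사업공정 | 추천기관 | 순번 | 제작연도 | 작품명 | 원동화지면수 | 편수씬수 | MOV수량 | MP4수량 | 기타자료수량 | 원본자료종류 | 원본자료상세 | 총러닝타임 | 제작사 | 소장처
4342 | 파일보존 디지털화 | 한국영상자료원 | 33 | 2014 | 출동!슈퍼윙스 시즌1 | <NA> | 1 | <NA> | 52 | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 10:57:15 | 퍼니플럭스 | 한국영상자료원
4343 | 파일보존 디지털화 | 한국영상자료원 | 34 | 2017 | 출동!슈퍼윙스 시즌2 | <NA> | 1 | <NA> | 52 | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 10:59:32 | 퍼니플럭스 | 한국영상자료원
4344 | 파일보존 디지털화 | 한국영상자료원 | 35 | 2019 | 출동!슈퍼윙스 시즌3 | <NA> | 1 | <NA> | <NA> | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 08:27:20 | 퍼니플럭스 | 한국영상자료원
4345 | 파일보존 디지털화 | 한국영상자료원 | 36 | 2016 | 엄마까투리 시즌1 | <NA> | 1 | 54 | <NA> | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 04:39:19 | 퍼니플럭스 | 한국영상자료원
4346 | 파일보존 디지털화 | 한국영상자료원 | 37 | 2018 | 엄마까투리 시즌2 | <NA> | 1 | 54 | <NA> | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 04:37:42 | 퍼니플럭스 | 한국영상자료원
4347 | 파일보존 디지털화 | 한국영상자료원 | 38 | 2019 | 엄마까투리 시즌3 | <NA> | 1 | <NA> | 54 | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 04:38:45 | 퍼니플럭스 | 한국영상자료원
4348 | 파일보존 디지털화 | 한국영상자료원 | 39 | 2012 | 꼬마기차추추 | <NA> | 1 | <NA> | 78 | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 09:32:11 | 애니투아트 | 한국영상자료원
4349 | 파일보존 디지털화 | 한국영상자료원 | 40 | 2019 | 길냥이 키츠_슈퍼문 탐험대 | <NA> | 1 | <NA> | 2 | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 00:11:16 | 크리에이티브 섬 | 한국영상자료원
4350 | 파일보존 디지털화 | 한국영상자료원 | 41 | 2019 | 꿈을 요리하는 마법카페 | <NA> | 1 | <NA> | 3 | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 00:07:37 | 크리에이티브 섬 | 한국영상자료원
4351 | 파일보존 디지털화 | 한국영상자료원 | 42 | 2018 | 콩이야학교가자 | <NA> | 1 | 54 | <NA> | <NA> | 디지털파일 | 제작실무자료(디지털파일) | 09:42:29 | 픽스트랜드 | 한국영상자료원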