Overview

Dataset statistics

Number of variables: 7
Number of observations: 40
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 62.3 B

Variable types

Numeric: 2
Categorical: 4
Text: 1

Dataset

Description: Dream Junior (드림주니어) video post attachment file information
Author: National Institute for Lifelong Education (국가평생교육진흥원)
URL: https://www.data.go.kr/data/15072464/fileData.do

Alerts

파일 유형 has constant value "image/jpeg" (Constant)
파일 종류 has constant value "defaultFile" (Constant)
다운로드 수 has constant value "0" (Constant)
게시글 번호 is highly overall correlated with 파일명 (High correlation)
파일 크기 is highly overall correlated with 파일명 (High correlation)
파일명 is highly overall correlated with 게시글 번호 and 1 other field (High correlation)
게시글 번호 has unique values (Unique)
파일 경로 has unique values (Unique)
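
The Constant and Unique alerts follow from simple distinct-value counts. The sketch below illustrates that rule only; it is not ydata-profiling's actual implementation, and the helper name `classify_column` is an assumption made for this example. The values are the first four rows of three columns, copied from the Sample section of this report.

```python
from typing import Hashable, Sequence


def classify_column(values: Sequence[Hashable]) -> str:
    """Label a column by its number of distinct values (hypothetical helper)."""
    distinct = len(set(values))
    if distinct == 1:
        return "constant"  # every row holds the same value
    if distinct == len(values):
        return "unique"    # no value repeats
    return "mixed"


# First four rows of three columns, copied from the Sample section.
columns = {
    "게시글 번호": [1, 2, 3, 4],
    "파일명": ["1화.jpg", "1화.jpg", "2화.jpg", "2화.jpg"],
    "파일 유형": ["image/jpeg"] * 4,
}

labels = {name: classify_column(vals) for name, vals in columns.items()}
print(labels)
```
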

Reproduction

Analysis started: 2023-12-12 01:21:13.735918
Analysis finished: 2023-12-12 01:21:14.503103
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
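
A report of this kind can be regenerated along the following lines. This is a minimal sketch, assuming `pandas` and `ydata-profiling` are installed. The report does not record the input file name or its encoding, so the `cp949` default, the report title, and the helper name `build_report` are all assumptions.

```python
from pathlib import Path


def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> Path:
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so this module loads even without the heavy dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is an assumption
    report = ProfileReport(df, title="Attachment file profile")  # assumed title
    report.to_file(html_path)
    return Path(html_path)


# Example call with placeholder file names (not taken from the report):
# build_report("attachments.csv", "report.html")
```
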

Variables

게시글 번호 (post number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 40
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.5
Minimum: 1
Maximum: 40
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 492.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.95
Q1: 10.75
Median: 20.5
Q3: 30.25
95th percentile: 38.05
Maximum: 40
Range: 39
Interquartile range (IQR): 19.5
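
Because this variable takes each integer from 1 to 40 exactly once, the quantile figures can be rechecked with the standard library alone. The sketch below assumes the report uses linear interpolation between the minimum and maximum, which corresponds to `method="inclusive"` in `statistics.quantiles`. That choice is an assumption about the tool, not something the report states.

```python
from statistics import quantiles

values = list(range(1, 41))  # 게시글 번호: 40 distinct values, 1 to 40

# "inclusive" interpolates linearly between the smallest and largest value.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")
twentieths = quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(q1, q2, q3)        # quartiles
print(p5, p95)           # 5th and 95th percentiles
print(iqr, value_range)  # interquartile range and range
```
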

Descriptive statistics

Standard deviation: 11.690452
Coefficient of variation (CV): 0.57026595
Kurtosis: -1.2
Mean: 20.5
Median Absolute Deviation (MAD): 10
Skewness: 0
Sum: 820
Variance: 136.66667
Monotonicity: Strictly increasing
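
The descriptive statistics can be reproduced for the values 1 to 40 with the standard library. The sketch assumes the report uses the sample variance (n - 1 denominator), which is what `statistics.variance` and `statistics.stdev` compute. The coefficient of variation is the standard deviation divided by the mean, and the MAD is the median of absolute deviations from the median.

```python
from statistics import mean, median, stdev, variance

values = list(range(1, 41))  # 게시글 번호: 40 distinct values, 1 to 40

mu = mean(values)
var = variance(values)  # sample variance, n - 1 denominator
sd = stdev(values)      # square root of the sample variance
cv = sd / mu            # coefficient of variation
med = median(values)
mad = median(abs(v - med) for v in values)  # median absolute deviation

print(sum(values), mu, var, sd, cv, mad)
```
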
Histogram with fixed size bins (bins=40)
Common values
Value | Count | Frequency (%)
1 | 1 | 2.5%
22 | 1 | 2.5%
24 | 1 | 2.5%
25 | 1 | 2.5%
26 | 1 | 2.5%
27 | 1 | 2.5%
28 | 1 | 2.5%
29 | 1 | 2.5%
30 | 1 | 2.5%
31 | 1 | 2.5%
Other values (30) | 30 | 75.0%

Smallest 10 values
Value | Count | Frequency (%)
1 | 1 | 2.5%
2 | 1 | 2.5%
3 | 1 | 2.5%
4 | 1 | 2.5%
5 | 1 | 2.5%
6 | 1 | 2.5%
7 | 1 | 2.5%
8 | 1 | 2.5%
9 | 1 | 2.5%
10 | 1 | 2.5%

Largest 10 values
Value | Count | Frequency (%)
40 | 1 | 2.5%
39 | 1 | 2.5%
38 | 1 | 2.5%
37 | 1 | 2.5%
36 | 1 | 2.5%
35 | 1 | 2.5%
34 | 1 | 2.5%
33 | 1 | 2.5%
32 | 1 | 2.5%
31 | 1 | 2.5%

파일명 (file name)
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 42.5%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B
Categories (count): 02.JPG (6), 01.jpg (4), 04.JPG (2), 3회썸네일.jpg (2), 4회썸네일.jpg (2); other values (12): 24

Length

Max length: 22
Median length: 6
Mean length: 8.6
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1화.jpg
2nd row: 1화.jpg
3rd row: 2화.jpg
4th row: 2화.jpg
5th row: 3회썸네일.jpg

Common Values

Value | Count | Frequency (%)
02.JPG | 6 | 15.0%
01.jpg | 4 | 10.0%
04.JPG | 2 | 5.0%
3회썸네일.jpg | 2 | 5.0%
4회썸네일.jpg | 2 | 5.0%
03.JPG | 2 | 5.0%
131729209572135055.jpg | 2 | 5.0%
3.JPG | 2 | 5.0%
2화.jpg | 2 | 5.0%
3.jpg | 2 | 5.0%
Other values (7) | 14 | 35.0%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
02.jpg | 8 | 20.0%
01.jpg | 4 | 10.0%
04.jpg | 4 | 10.0%
3.jpg | 4 | 10.0%
3회썸네일.jpg | 2 | 5.0%
4회썸네일.jpg | 2 | 5.0%
03.jpg | 2 | 5.0%
131729209572135055.jpg | 2 | 5.0%
2화.jpg | 2 | 5.0%
131782776470513196.jpg | 2 | 5.0%
Other values (4) | 8 | 20.0%

파일 경로 (file path)
Text

UNIQUE 

Distinct: 40
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B

Length

Max length: 79
Median length: 79
Mean length: 79
Min length: 79

Characters and Unicode

Total characters: 3160
Distinct characters: 34
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
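
As an illustration of those character properties, the general category of each character in one path can be counted with Python's `unicodedata` module. The path is the first sample row of this variable. The module exposes general categories (for example `Ll` for Lowercase Letter and `Nd` for Decimal Number) but not scripts or blocks, so only the category breakdown is sketched here.

```python
import unicodedata
from collections import Counter

# First sample row of 파일 경로, copied from this report.
path = "/open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088276.jpg"

# Two-letter general category of every character in the path.
categories = Counter(unicodedata.category(ch) for ch in path)

print(len(path))
print(categories.most_common())
```

Multiplying the per-path counts for decimal digits (18), other punctuation (9), and connector punctuation (2) by 40 rows gives the report totals of 720, 360, and 80. The letter totals differ slightly from such a product because some rows end in `.JPG` rather than `.jpg`.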

Unique

Unique: 40
Unique (%): 100.0%

Sample

1st row: /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088276.jpg
2nd row: /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088292.jpg
3rd row: /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088333.jpg
4th row: /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088349.jpg
5th row: /open_content/upload/leconline/LecOnlineController/2018/05/30/UP_1527665820.jpg
Value | Count | Frequency (%)
open_content/upload/leconline/leconlinecontroller/2018/05/24/up_1527088276.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/05/24/up_1527088292.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/30/up_1535619283.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/07/27/up_1532682133.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/09/up_1533788845.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/09/up_1533790613.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/09/up_1533791096.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/27/up_1535333018.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/27/up_1535333139.jpg | 1 | 2.5%
open_content/upload/leconline/leconlinecontroller/2018/08/30/up_1535618958.jpg | 1 | 2.5%
Other values (30) | 30 | 75.0%

Most occurring characters

Value | Count | Frequency (%)
/ | 320 | 10.1%
n | 320 | 10.1%
e | 280 | 8.9%
o | 240 | 7.6%
l | 240 | 7.6%
0 | 128 | 4.1%
1 | 124 | 3.9%
c | 120 | 3.8%
t | 120 | 3.8%
p | 108 | 3.4%
Other values (24) | 1160 | 36.7%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 1764 | 55.8%
Decimal Number | 720 | 22.8%
Other Punctuation | 360 | 11.4%
Uppercase Letter | 236 | 7.5%
Connector Punctuation | 80 | 2.5%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
n | 320 | 18.1%
e | 280 | 15.9%
o | 240 | 13.6%
l | 240 | 13.6%
c | 120 | 6.8%
t | 120 | 6.8%
p | 108 | 6.1%
i | 80 | 4.5%
r | 80 | 4.5%
d | 40 | 2.3%
Other values (4) | 136 | 7.7%

Decimal Number
Value | Count | Frequency (%)
0 | 128 | 17.8%
1 | 124 | 17.2%
8 | 99 | 13.8%
2 | 95 | 13.2%
5 | 76 | 10.6%
3 | 61 | 8.5%
6 | 42 | 5.8%
9 | 38 | 5.3%
7 | 35 | 4.9%
4 | 22 | 3.1%

Uppercase Letter
Value | Count | Frequency (%)
P | 52 | 22.0%
L | 40 | 16.9%
U | 40 | 16.9%
O | 40 | 16.9%
C | 40 | 16.9%
J | 12 | 5.1%
G | 12 | 5.1%

Other Punctuation
Value | Count | Frequency (%)
/ | 320 | 88.9%
. | 40 | 11.1%

Connector Punctuation
Value | Count | Frequency (%)
_ | 80 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 2000 | 63.3%
Common | 1160 | 36.7%

Most frequent character per script

Latin
Value | Count | Frequency (%)
n | 320 | 16.0%
e | 280 | 14.0%
o | 240 | 12.0%
l | 240 | 12.0%
c | 120 | 6.0%
t | 120 | 6.0%
p | 108 | 5.4%
i | 80 | 4.0%
r | 80 | 4.0%
P | 52 | 2.6%
Other values (11) | 360 | 18.0%

Common
Value | Count | Frequency (%)
/ | 320 | 27.6%
0 | 128 | 11.0%
1 | 124 | 10.7%
8 | 99 | 8.5%
2 | 95 | 8.2%
_ | 80 | 6.9%
5 | 76 | 6.6%
3 | 61 | 5.3%
6 | 42 | 3.6%
. | 40 | 3.4%
Other values (3) | 95 | 8.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3160 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
/ | 320 | 10.1%
n | 320 | 10.1%
e | 280 | 8.9%
o | 240 | 7.6%
l | 240 | 7.6%
0 | 128 | 4.1%
1 | 124 | 3.9%
c | 120 | 3.8%
t | 120 | 3.8%
p | 108 | 3.4%
Other values (24) | 1160 | 36.7%

파일 크기 (file size)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 20
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 215374.2
Minimum: 75671
Maximum: 814779
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 492.0 B

Quantile statistics

Minimum: 75671
5th percentile: 81553.4
Q1: 121659.25
Median: 160154
Q3: 208139.5
95th percentile: 422379.6
Maximum: 814779
Range: 739108
Interquartile range (IQR): 86480.25

Descriptive statistics

Standard deviation: 168089.53
Coefficient of variation (CV): 0.7804534
Kurtosis: 6.7247992
Mean: 215374.2
Median Absolute Deviation (MAD): 39433.5
Skewness: 2.4692419
Sum: 8614968
Variance: 2.8254089 × 10^10
Monotonicity: Not monotonic
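
Several of these figures are derived from one another, which allows a quick consistency check without the raw data. The sketch below uses only numbers copied from this report; the variable names are chosen for the example. Because the reported standard deviation is rounded, the derived variance and CV agree with the report only to about six significant digits.

```python
# Figures copied from the 파일 크기 statistics in this report.
n = 40
total = 8614968
mean = 215374.2
std = 168089.53
minimum, maximum = 75671, 814779
q1, q3 = 121659.25, 208139.5

derived = {
    "mean": total / n,            # sum divided by number of observations
    "range": maximum - minimum,
    "iqr": q3 - q1,               # interquartile range
    "cv": std / mean,             # coefficient of variation
    "variance": std ** 2,         # square of the standard deviation
}

for name, value in derived.items():
    print(f"{name}: {value}")
```
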
Histogram with fixed size bins (bins=20)
Common values
Value | Count | Frequency (%)
160932 | 2 | 5.0%
239899 | 2 | 5.0%
337705 | 2 | 5.0%
151693 | 2 | 5.0%
118843 | 2 | 5.0%
395501 | 2 | 5.0%
195461 | 2 | 5.0%
157338 | 2 | 5.0%
814779 | 2 | 5.0%
83893 | 2 | 5.0%
Other values (10) | 20 | 50.0%

Smallest 10 values
Value | Count | Frequency (%)
75671 | 2 | 5.0%
81863 | 2 | 5.0%
83893 | 2 | 5.0%
87781 | 2 | 5.0%
118843 | 2 | 5.0%
122598 | 2 | 5.0%
151693 | 2 | 5.0%
151764 | 2 | 5.0%
157338 | 2 | 5.0%
159376 | 2 | 5.0%

Largest 10 values
Value | Count | Frequency (%)
814779 | 2 | 5.0%
401727 | 2 | 5.0%
395501 | 2 | 5.0%
337705 | 2 | 5.0%
239899 | 2 | 5.0%
197553 | 2 | 5.0%
195461 | 2 | 5.0%
195189 | 2 | 5.0%
177918 | 2 | 5.0%
160932 | 2 | 5.0%

파일 유형 (file type)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B
Categories (count): image/jpeg (40)

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: image/jpeg
2nd row: image/jpeg
3rd row: image/jpeg
4th row: image/jpeg
5th row: image/jpeg

Common Values

Value | Count | Frequency (%)
image/jpeg | 40 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
image/jpeg | 40 | 100.0%

파일 종류 (file category)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B
Categories (count): defaultFile (40)

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: defaultFile
2nd row: defaultFile
3rd row: defaultFile
4th row: defaultFile
5th row: defaultFile

Common Values

Value | Count | Frequency (%)
defaultFile | 40 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
defaultfile | 40 | 100.0%

다운로드 수 (download count)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 452.0 B
Categories (count): 0 (40)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 40 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 40 | 100.0%

Interactions


Correlations

Variable | 게시글 번호 | 파일명 | 파일 경로 | 파일 크기
게시글 번호 | 1.000 | 0.934 | 1.000 | 0.646
파일명 | 0.934 | 1.000 | 1.000 | 0.961
파일 경로 | 1.000 | 1.000 | 1.000 | 1.000
파일 크기 | 0.646 | 0.961 | 1.000 | 1.000
Variable | 게시글 번호 | 파일 크기 | 파일명
게시글 번호 | 1.000 | 0.306 | 0.637
파일 크기 | 0.306 | 1.000 | 0.712
파일명 | 0.637 | 0.712 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 게시글 번호 | 파일명 | 파일 경로 | 파일 크기 | 파일 유형 | 파일 종류 | 다운로드 수
0 | 1 | 1화.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088276.jpg | 160932 | image/jpeg | defaultFile | 0
1 | 2 | 1화.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088292.jpg | 160932 | image/jpeg | defaultFile | 0
2 | 3 | 2화.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088333.jpg | 159376 | image/jpeg | defaultFile | 0
3 | 4 | 2화.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/24/UP_1527088349.jpg | 159376 | image/jpeg | defaultFile | 0
4 | 5 | 3회썸네일.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/30/UP_1527665820.jpg | 81863 | image/jpeg | defaultFile | 0
5 | 6 | 3회썸네일.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/30/UP_1527665858.jpg | 81863 | image/jpeg | defaultFile | 0
6 | 7 | 4회썸네일.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/30/UP_1527665900.jpg | 75671 | image/jpeg | defaultFile | 0
7 | 8 | 4회썸네일.jpg | /open_content/upload/leconline/LecOnlineController/2018/05/30/UP_1527665999.jpg | 75671 | image/jpeg | defaultFile | 0
8 | 9 | 02.JPG | /open_content/upload/leconline/LecOnlineController/2018/06/14/UP_1528960301.JPG | 177918 | image/jpeg | defaultFile | 0
9 | 10 | 02.JPG | /open_content/upload/leconline/LecOnlineController/2018/06/14/UP_1528960477.JPG | 177918 | image/jpeg | defaultFile | 0

Last rows
Index | 게시글 번호 | 파일명 | 파일 경로 | 파일 크기 | 파일 유형 | 파일 종류 | 다운로드 수
30 | 31 | 17.jpg | /open_content/upload/leconline/LecOnlineController/2018/10/02/UP_1538441587.jpg | 195461 | image/jpeg | defaultFile | 0
31 | 32 | 01.jpg | /open_content/upload/leconline/LecOnlineController/2018/08/30/UP_1535620095.jpg | 395501 | image/jpeg | defaultFile | 0
32 | 33 | 01.jpg | /open_content/upload/leconline/LecOnlineController/2018/08/30/UP_1535620258.jpg | 395501 | image/jpeg | defaultFile | 0
33 | 34 | 04.jpg | /open_content/upload/leconline/LecOnlineController/2018/09/27/UP_1538038270.jpg | 118843 | image/jpeg | defaultFile | 0
34 | 35 | 04.jpg | /open_content/upload/leconline/LecOnlineController/2018/09/27/UP_1538036266.jpg | 118843 | image/jpeg | defaultFile | 0
35 | 36 | 19.jpg | /open_content/upload/leconline/LecOnlineController/2018/10/02/UP_1538440729.jpg | 151693 | image/jpeg | defaultFile | 0
36 | 37 | 19.jpg | /open_content/upload/leconline/LecOnlineController/2018/10/02/UP_1538440853.jpg | 151693 | image/jpeg | defaultFile | 0
37 | 38 | 17.jpg | /open_content/upload/leconline/LecOnlineController/2018/10/02/UP_1538441680.jpg | 195461 | image/jpeg | defaultFile | 0
38 | 39 | 02.jpg | /open_content/upload/leconline/LecOnlineController/2018/10/11/UP_1539238950.jpg | 337705 | image/jpeg | defaultFile | 0
39 | 40 | 02.jpg | /open_content/upload/leconline/LecOnlineController/2018/10/11/UP_1539240607.jpg | 337705 | image/jpeg | defaultFile | 0