Overview

Dataset statistics

Number of variables: 10
Number of observations: 1596
Missing cells: 1035
Missing cells (%): 6.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 126.4 KiB
Average record size in memory: 81.1 B
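The two derived figures can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Raw figures copied from the dataset statistics.
n_variables = 10
n_observations = 1596
missing_cells = 1035
total_size_kib = 126.4

# Share of missing cells over the whole table (rows x columns).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the figures reported here.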

Variable types

Numeric: 1
Text: 2
DateTime: 3
Categorical: 4

Dataset

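Description: Data on free movie screenings held at libraries in Gwangju-si, Gyeonggi-do, providing the title, start date, rating classification, genre, venue, and related fields.
URL: https://www.data.go.kr/data/15036886/fileData.do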

Alerts

데이터기준일자 (data reference date) has constant value "2023-08-10" (Constant)
시작시간 (start time) is highly imbalanced (60.2%) (Imbalance)
수정날 (modification date) has 1034 (64.8%) missing values (Missing)
연번 (serial number) has unique values (Unique)
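The imbalance figure for 시작시간 can be reproduced from its value counts. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by the maximum possible entropy, log2(k). That definition is an assumption that happens to match the reported number, not a statement of how ydata-profiling implements it, and `imbalance_score` is a hypothetical helper name.

```python
import math

# Value counts for 시작시간, copied from its Common Values table.
counts = [1129, 249, 193, 12, 9, 1, 1, 1, 1]

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, 1 when one class holds every row."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

score = imbalance_score(counts)
print(f"{score:.1%}")
```

With nine classes dominated by 15:00 (1129 of 1596 rows), the score comes out at roughly 0.60.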

Reproduction

Analysis started: 2023-12-12 21:04:13.850902
Analysis finished: 2023-12-12 21:04:14.821189
Duration: 0.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
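The reported duration is simply the difference between the two timestamps. A minimal check, assuming both are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 21:04:13.850902", FMT)
finished = datetime.strptime("2023-12-12 21:04:14.821189", FMT)

# Elapsed wall-clock time of the analysis in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```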

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 1596
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 798.5
Minimum: 1
Maximum: 1596
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 80.75
Q1: 399.75
Median: 798.5
Q3: 1197.25
95-th percentile: 1516.25
Maximum: 1596
Range: 1595
Interquartile range (IQR): 797.5

Descriptive statistics

Standard deviation: 460.86983
Coefficient of variation (CV): 0.57716948
Kurtosis: -1.2
Mean: 798.5
Median Absolute Deviation (MAD): 399
Skewness: 0
Sum: 1274406
Variance: 212401
Monotonicity: Strictly increasing
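Because 연번 is the sequence 1 to 1596 with no gaps, most of these statistics can be recomputed with the standard library. The sketch below assumes the sample variance (n - 1 denominator) and linearly interpolated quantiles, which are the conventions that match the reported values; `quantile_linear` is a hypothetical helper.

```python
import statistics

values = list(range(1, 1597))  # 연번 runs from 1 to 1596 without gaps

def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    position = (len(sorted_values) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
q1 = quantile_linear(values, 0.25)
q3 = quantile_linear(values, 0.75)

print(mean, variance, round(std, 5), round(cv, 8), mad, q1, q3, q3 - q1)
```

The population variance (n denominator) would give about 212267.9 instead of 212401, which is how the n - 1 convention can be inferred.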
Histogram with fixed size bins (bins=50)
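A fixed-size-bin histogram splits the range from minimum to maximum into equally wide intervals and counts the values in each. The sketch below is a plain-Python illustration of that idea, not the plotting code used by the report; `fixed_width_histogram` is a hypothetical name.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    low, high = min(values), max(values)
    # Fall back to a width of 1.0 so constant data does not divide by zero.
    width = (high - low) / bins or 1.0
    counts = [0] * bins
    for value in values:
        # The maximum falls on the right edge, so clamp it into the last bin.
        index = min(int((value - low) / width), bins - 1)
        counts[index] += 1
    edges = [low + i * width for i in range(bins + 1)]
    return counts, edges

counts, edges = fixed_width_histogram(range(1, 1597), bins=50)
print(len(counts), sum(counts))
```

For a strictly increasing serial number the bins are almost equally filled, which is why the histogram for 연번 is flat.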
Value | Count | Frequency (%)
1 | 1 | 0.1%
1074 | 1 | 0.1%
1072 | 1 | 0.1%
1071 | 1 | 0.1%
1070 | 1 | 0.1%
1069 | 1 | 0.1%
1068 | 1 | 0.1%
1067 | 1 | 0.1%
1066 | 1 | 0.1%
1065 | 1 | 0.1%
Other values (1586) | 1586 | 99.4%
Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%
Value | Count | Frequency (%)
1596 | 1 | 0.1%
1595 | 1 | 0.1%
1594 | 1 | 0.1%
1593 | 1 | 0.1%
1592 | 1 | 0.1%
1591 | 1 | 0.1%
1590 | 1 | 0.1%
1589 | 1 | 0.1%
1588 | 1 | 0.1%
1587 | 1 | 0.1%

제목
Text

Distinct: 1025
Distinct (%): 64.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Length

Max length: 25
Median length: 19
Mean length: 8.2562657
Min length: 1

Characters and Unicode

Total characters: 13177
Distinct characters: 672
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
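As an illustration of such character properties, the standard library's `unicodedata` module exposes the general category of each code point. The sketch below counts categories over the five sample titles of this variable; `category_counts` is a hypothetical helper, and the NFC normalisation step is an assumption made so that every Hangul syllable is counted as one character.

```python
import unicodedata
from collections import Counter

# The five sample titles of this variable.
titles = ["가부와 메이 이야기", "빅히어로", "슈퍼배드2", "몬스터 대학교", "가디언즈"]

def category_counts(texts):
    """Count Unicode general categories (e.g. Lo, Zs, Nd) over all characters."""
    counts = Counter()
    for text in texts:
        # Normalise to NFC so each Hangul syllable is a single code point.
        for char in unicodedata.normalize("NFC", text):
            counts[unicodedata.category(char)] += 1
    return counts

counts = category_counts(titles)
print(counts.most_common())
```

Here `Lo` corresponds to "Other Letter", `Zs` to "Space Separator" and `Nd` to "Decimal Number" in the tables of this section.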

Unique

Unique: 757
Unique (%): 47.4%

Sample

1st row: 가부와 메이 이야기
2nd row: 빅히어로
3rd row: 슈퍼배드2
4th row: 몬스터 대학교
5th row: 가디언즈
ValueCountFrequency (%)
126
 
3.6%
문화가 28
 
0.8%
몬스터 27
 
0.8%
있는 27
 
0.8%
비밀 27
 
0.8%
찾아서 26
 
0.7%
2 25
 
0.7%
대모험 21
 
0.6%
20
 
0.6%
드래곤 15
 
0.4%
Other values (1492) 3156
90.2%

Most occurring characters

ValueCountFrequency (%)
1929
 
14.6%
386
 
2.9%
332
 
2.5%
303
 
2.3%
: 237
 
1.8%
215
 
1.6%
214
 
1.6%
177
 
1.3%
145
 
1.1%
127
 
1.0%
Other values (662) 9112
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10318
78.3%
Space Separator 1929
 
14.6%
Decimal Number 322
 
2.4%
Other Punctuation 283
 
2.1%
Close Punctuation 111
 
0.8%
Open Punctuation 111
 
0.8%
Lowercase Letter 57
 
0.4%
Uppercase Letter 21
 
0.2%
Math Symbol 18
 
0.1%
Dash Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
386
 
3.7%
332
 
3.2%
303
 
2.9%
215
 
2.1%
214
 
2.1%
177
 
1.7%
145
 
1.4%
127
 
1.2%
123
 
1.2%
118
 
1.1%
Other values (610) 8178
79.3%
Lowercase Letter
ValueCountFrequency (%)
e 9
15.8%
o 7
12.3%
t 7
12.3%
r 5
8.8%
v 5
8.8%
i 5
8.8%
s 4
7.0%
a 4
7.0%
b 3
 
5.3%
l 2
 
3.5%
Other values (5) 6
10.5%
Decimal Number
ValueCountFrequency (%)
2 113
35.1%
1 70
21.7%
3 54
16.8%
0 29
 
9.0%
7 17
 
5.3%
4 14
 
4.3%
9 8
 
2.5%
8 8
 
2.5%
5 7
 
2.2%
6 2
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
E 5
23.8%
M 5
23.8%
L 2
 
9.5%
O 2
 
9.5%
X 2
 
9.5%
R 1
 
4.8%
A 1
 
4.8%
T 1
 
4.8%
N 1
 
4.8%
B 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
: 237
83.7%
. 30
 
10.6%
! 8
 
2.8%
' 2
 
0.7%
% 2
 
0.7%
? 2
 
0.7%
; 2
 
0.7%
Math Symbol
ValueCountFrequency (%)
+ 15
83.3%
= 1
 
5.6%
> 1
 
5.6%
< 1
 
5.6%
Close Punctuation
ValueCountFrequency (%)
] 75
67.6%
) 36
32.4%
Open Punctuation
ValueCountFrequency (%)
[ 75
67.6%
( 36
32.4%
Space Separator
ValueCountFrequency (%)
1929
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10318
78.3%
Common 2781
 
21.1%
Latin 78
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
386
 
3.7%
332
 
3.2%
303
 
2.9%
215
 
2.1%
214
 
2.1%
177
 
1.7%
145
 
1.4%
127
 
1.2%
123
 
1.2%
118
 
1.1%
Other values (610) 8178
79.3%
Common
ValueCountFrequency (%)
1929
69.4%
: 237
 
8.5%
2 113
 
4.1%
] 75
 
2.7%
[ 75
 
2.7%
1 70
 
2.5%
3 54
 
1.9%
( 36
 
1.3%
) 36
 
1.3%
. 30
 
1.1%
Other values (17) 126
 
4.5%
Latin
ValueCountFrequency (%)
e 9
11.5%
o 7
 
9.0%
t 7
 
9.0%
E 5
 
6.4%
r 5
 
6.4%
v 5
 
6.4%
M 5
 
6.4%
i 5
 
6.4%
s 4
 
5.1%
a 4
 
5.1%
Other values (15) 22
28.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10318
78.3%
ASCII 2859
 
21.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1929
67.5%
: 237
 
8.3%
2 113
 
4.0%
] 75
 
2.6%
[ 75
 
2.6%
1 70
 
2.4%
3 54
 
1.9%
( 36
 
1.3%
) 36
 
1.3%
. 30
 
1.0%
Other values (42) 204
 
7.1%
Hangul
ValueCountFrequency (%)
386
 
3.7%
332
 
3.2%
303
 
2.9%
215
 
2.1%
214
 
2.1%
177
 
1.7%
145
 
1.4%
127
 
1.2%
123
 
1.2%
118
 
1.1%
Other values (610) 8178
79.3%

시작날
Date

Distinct: 609
Distinct (%): 38.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB
Minimum: 2007-09-02 00:00:00
Maximum: 2023-09-23 00:00:00
Histogram with fixed size bins (bins=50)

시작시간
Categorical

IMBALANCE 

Distinct: 9
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

15:00: 1129
14:00: 249
11:00: 193
<NA>: 12
10:00: 9
Other values (4): 4

Length

Max length: 5
Median length: 5
Mean length: 4.9918546
Min length: 4

Unique

Unique: 4
Unique (%): 0.3%

Sample

1st row: 15:00
2nd row: 15:00
3rd row: 15:00
4th row: 15:00
5th row: 15:00

Common Values

Value | Count | Frequency (%)
15:00 | 1129 | 70.7%
14:00 | 249 | 15.6%
11:00 | 193 | 12.1%
<NA> | 12 | 0.8%
10:00 | 9 | 0.6%
17:30 | 1 | 0.1%
16:30 | 1 | 0.1%
9:30 | 1 | 0.1%
11;00 | 1 | 0.1%
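The table contains two formatting outliers, `9:30` (no leading zero) and `11;00` (semicolon instead of colon), plus the literal string `<NA>`. A minimal cleaning sketch follows; it assumes that `;` is a typo for `:` and that unparseable values should become `None`. `normalize_start_time` is a hypothetical helper, not part of the dataset or the profiling tool.

```python
import re

# One or two hour digits, a colon or semicolon, then two minute digits.
TIME_PATTERN = re.compile(r"^\s*(\d{1,2})[:;](\d{2})\s*$")

def normalize_start_time(raw):
    """Return the time as zero-padded HH:MM, or None if it cannot be parsed."""
    match = TIME_PATTERN.match(raw)
    if match is None:
        return None
    hour, minute = int(match.group(1)), int(match.group(2))
    if hour > 23 or minute > 59:
        return None
    return f"{hour:02d}:{minute:02d}"

for raw in ["15:00", "9:30", "11;00", "<NA>"]:
    print(raw, "->", normalize_start_time(raw))
```

After such a step `11;00` would merge into the existing `11:00` class.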

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
15:00 | 1129 | 70.7%
14:00 | 249 | 15.6%
11:00 | 193 | 12.1%
na | 12 | 0.8%
10:00 | 9 | 0.6%
17:30 | 1 | 0.1%
16:30 | 1 | 0.1%
9:30 | 1 | 0.1%
11;00 | 1 | 0.1%

분류
Categorical

Distinct: 26
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

전체관람가: 943
전체: 142
전체 관람가: 119
12세 이상: 83
15세 이상: 68
Other values (21): 241

Length

Max length: 10
Median length: 5
Mean length: 5.2249373
Min length: 2

Unique

Unique: 8
Unique (%): 0.5%

Sample

1st row: 전체관람가
2nd row: 전체관람가
3rd row: 전체관람가
4th row: 전체관람가
5th row: 전체관람가

Common Values

Value | Count | Frequency (%)
전체관람가 | 943 | 59.1%
전체 | 142 | 8.9%
전체 관람가 | 119 | 7.5%
12세 이상 | 83 | 5.2%
15세 이상 | 68 | 4.3%
12세 이상 관람가 | 56 | 3.5%
12세이상 | 41 | 2.6%
15세 이상 관람가 | 32 | 2.0%
12세관람가 | 26 | 1.6%
12세 관람가 | 15 | 0.9%
Other values (16) | 71 | 4.4%
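Many of these labels are spelling variants of the same rating (for example 전체관람가, 전체 and 전체 관람가). The sketch below shows one possible way to collapse them. The target scheme (`전체`, `12세`, `15세`) and the helper `normalize_rating` are assumptions made for illustration, and the 16 labels hidden under "Other values" are not covered.

```python
import re
from collections import Counter

# Labels and counts copied from the Common Values table.
observed = {
    "전체관람가": 943, "전체": 142, "전체 관람가": 119,
    "12세 이상": 83, "15세 이상": 68, "12세 이상 관람가": 56,
    "12세이상": 41, "15세 이상 관람가": 32, "12세관람가": 26, "12세 관람가": 15,
}

def normalize_rating(label):
    """Collapse spelling variants into one label per rating (hypothetical scheme)."""
    compact = re.sub(r"\s+", "", label)
    age = re.match(r"(\d+)세", compact)
    if age:
        return f"{age.group(1)}세"
    if compact.startswith("전체"):
        return "전체"
    return compact  # unknown labels are kept, only stripped of whitespace

merged = Counter()
for label, count in observed.items():
    merged[normalize_rating(label)] += count

print(merged.most_common())
```

Applied to the ten listed labels, this reduces them to three classes.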

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
전체관람가 | 943 | 45.5%
전체 | 261 | 12.6%
이상 | 240 | 11.6%
관람가 | 233 | 11.3%
12세 | 167 | 8.1%
15세 | 116 | 5.6%
12세이상 | 42 | 2.0%
12세관람가 | 26 | 1.3%
15세이상 | 19 | 0.9%
15세관람가 | 14 | 0.7%
Other values (7) | 10 | 0.5%

장르
Text

Distinct: 115
Distinct (%): 7.2%
Missing: 1
Missing (%): 0.1%
Memory size: 12.6 KiB

Length

Max length: 19
Median length: 5
Mean length: 4.799373
Min length: 2

Characters and Unicode

Total characters: 7655
Distinct characters: 71
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 68
Unique (%): 4.3%

Sample

1st row: 애니메이션
2nd row: 애니메이션
3rd row: 애니메이션
4th row: 애니메이션
5th row: 애니메이션
Value | Count | Frequency (%)
애니메이션 | 1081 | 67.4%
드라마 | 248 | 15.5%
코미디 | 22 | 1.4%
드라 | 19 | 1.2%
액션 | 15 | 0.9%
판타지 | 11 | 0.7%
코미디+드라마 | 10 | 0.6%
드람 | 10 | 0.6%
다큐멘터리 | 8 | 0.5%
멜로+로맨스 | 7 | 0.4%
Other values (104) | 173 | 10.8%
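Some labels combine several genres with `+` (for example 코미디+드라마). The sketch below splits such labels and counts each genre separately. Treating `+` and `/` as separators is an assumption based on the punctuation seen in this column, `split_genres` is a hypothetical helper, and truncated labels such as 드라 and 드람 are deliberately left out rather than guessed at.

```python
import re
from collections import Counter

# A few labels and counts copied from the table.
observed = {"애니메이션": 1081, "드라마": 248, "코미디": 22, "코미디+드라마": 10, "멜로+로맨스": 7}

def split_genres(label):
    """Split a combined label such as '코미디+드라마' into its parts."""
    return [part.strip() for part in re.split(r"[+/]", label) if part.strip()]

per_genre = Counter()
for label, count in observed.items():
    for genre in split_genres(label):
        per_genre[genre] += count

print(per_genre.most_common())
```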

Most occurring characters

ValueCountFrequency (%)
1165
15.2%
1120
14.6%
1114
14.6%
1113
14.5%
1111
14.5%
342
 
4.5%
330
 
4.3%
305
 
4.0%
+ 191
 
2.5%
85
 
1.1%
Other values (61) 779
10.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7432
97.1%
Math Symbol 191
 
2.5%
Uppercase Letter 16
 
0.2%
Space Separator 9
 
0.1%
Other Punctuation 6
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1165
15.7%
1120
15.1%
1114
15.0%
1113
15.0%
1111
14.9%
342
 
4.6%
330
 
4.4%
305
 
4.1%
85
 
1.1%
82
 
1.1%
Other values (54) 665
8.9%
Uppercase Letter
ValueCountFrequency (%)
S 8
50.0%
F 8
50.0%
Other Punctuation
ValueCountFrequency (%)
/ 5
83.3%
. 1
 
16.7%
Math Symbol
ValueCountFrequency (%)
+ 191
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7432
97.1%
Common 207
 
2.7%
Latin 16
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1165
15.7%
1120
15.1%
1114
15.0%
1113
15.0%
1111
14.9%
342
 
4.6%
330
 
4.4%
305
 
4.1%
85
 
1.1%
82
 
1.1%
Other values (54) 665
8.9%
Common
ValueCountFrequency (%)
+ 191
92.3%
9
 
4.3%
/ 5
 
2.4%
3 1
 
0.5%
. 1
 
0.5%
Latin
ValueCountFrequency (%)
S 8
50.0%
F 8
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7432
97.1%
ASCII 223
 
2.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1165
15.7%
1120
15.1%
1114
15.0%
1113
15.0%
1111
14.9%
342
 
4.6%
330
 
4.4%
305
 
4.1%
85
 
1.1%
82
 
1.1%
Other values (54) 665
8.9%
ASCII
ValueCountFrequency (%)
+ 191
85.7%
9
 
4.0%
S 8
 
3.6%
F 8
 
3.6%
/ 5
 
2.2%
3 1
 
0.4%
. 1
 
0.4%

장소
Categorical

Distinct: 30
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

시청각실: 478
3층 시청각실: 380
지하1층 시청각실: 186
도척작은도서관: 145
초월도서관: 49
Other values (25): 358

Length

Max length: 18
Median length: 14
Mean length: 6.5432331
Min length: 4

Unique

Unique: 11
Unique (%): 0.7%

Sample

1st row: 3층 시청각실
2nd row: 3층 시청각실
3rd row: 3층 시청각실
4th row: 3층 시청각실
5th row: 3층 시청각실

Common Values

Value | Count | Frequency (%)
시청각실 | 478 | 29.9%
3층 시청각실 | 380 | 23.8%
지하1층 시청각실 | 186 | 11.7%
도척작은도서관 | 145 | 9.1%
초월도서관 | 49 | 3.1%
중앙도서관 | 48 | 3.0%
오포도서관 | 46 | 2.9%
곤지암도서관 3층 시청각실 | 46 | 2.9%
곤지암도서관 | 45 | 2.8%
능평도서관 | 43 | 2.7%
Other values (20) | 130 | 8.1%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
시청각실 | 1165 | 47.7%
3층 | 508 | 20.8%
지하1층 | 187 | 7.7%
도척작은도서관 | 145 | 5.9%
곤지암도서관 | 92 | 3.8%
초월도서관 | 91 | 3.7%
오포도서관 | 63 | 2.6%
중앙도서관 | 48 | 2.0%
능평도서관 | 43 | 1.8%
다목적실 | 37 | 1.5%
Other values (18) | 61 | 2.5%

등록날
Date

Distinct: 200
Distinct (%): 12.5%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB
Minimum: 2016-01-05 00:00:00
Maximum: 2023-08-14 00:00:00
Histogram with fixed size bins (bins=50)

수정날
Date

MISSING 

Distinct: 174
Distinct (%): 31.0%
Missing: 1034
Missing (%): 64.8%
Memory size: 12.6 KiB
Minimum: 2016-01-23 00:00:00
Maximum: 2019-05-16 00:00:00
Histogram with fixed size bins (bins=50)
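The two percentages reported for 수정날 use different denominators. The sketch below assumes that the missing share is taken over all rows while the distinct share is taken over non-missing rows only, which is the combination that matches the reported figures.

```python
n_rows = 1596
n_missing = 1034
n_distinct = 174

# Share of rows without a modification date.
missing_pct = 100 * n_missing / n_rows

# Assumption: the distinct share is taken over non-missing rows only.
distinct_pct = 100 * n_distinct / (n_rows - n_missing)

print(f"Missing: {missing_pct:.1f}%  Distinct: {distinct_pct:.1f}%")
```

Dividing 174 by all 1596 rows would give about 10.9% instead, so the reported 31.0% only makes sense relative to the 562 populated rows.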

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

2023-08-10: 1596

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-08-10
2nd row: 2023-08-10
3rd row: 2023-08-10
4th row: 2023-08-10
5th row: 2023-08-10

Common Values

Value | Count | Frequency (%)
2023-08-10 | 1596 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-08-10 | 1596 | 100.0%

Interactions


Correlations

Variable | 연번 | 시작시간 | 분류 | 장소
연번 | 1.000 | 0.154 | 0.505 | 0.779
시작시간 | 0.154 | 1.000 | 0.679 | 0.506
분류 | 0.505 | 0.679 | 1.000 | 0.609
장소 | 0.779 | 0.506 | 0.609 | 1.000

Variable | 분류 | 장소 | 시작시간
분류 | 1.000 | 0.185 | 0.347
장소 | 0.185 | 1.000 | 0.220
시작시간 | 0.347 | 0.220 | 1.000

Variable | 연번 | 시작시간 | 분류 | 장소
연번 | 1.000 | 0.074 | 0.207 | 0.407
시작시간 | 0.074 | 1.000 | 0.347 | 0.220
분류 | 0.207 | 0.347 | 1.000 | 0.185
장소 | 0.407 | 0.220 | 0.185 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
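Nullity correlation can be illustrated as the Pearson correlation between the missingness indicators of two columns. The sketch below uses small made-up columns, not rows from this dataset; `nullity_correlation` is a hypothetical helper, and encoding missing cells as `None` is an assumption.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missingness indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # undefined when a column is fully present or fully missing
    return cov / math.sqrt(var_a * var_b)

# Hypothetical toy columns, with None marking a missing cell.
edited = ["2016-01-23", None, None, "2019-05-16"]
genre = ["애니메이션", None, None, "드라마"]
start = [None, "15:00", "14:00", None]

print(nullity_correlation(edited, genre))
print(nullity_correlation(edited, start))
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one is present where the other is missing, and columns without any missing cells have no defined nullity correlation.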

Sample

(index) | 연번 | 제목 | 시작날 | 시작시간 | 분류 | 장르 | 장소 | 등록날 | 수정날 | 데이터기준일자
0 | 1 | 가부와 메이 이야기 | 2016-02-06 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-05 | <NA> | 2023-08-10
1 | 2 | 빅히어로 | 2016-02-13 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-05 | <NA> | 2023-08-10
2 | 3 | 슈퍼배드2 | 2016-02-20 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-05 | <NA> | 2023-08-10
3 | 4 | 몬스터 대학교 | 2016-02-27 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-05 | <NA> | 2023-08-10
4 | 5 | 가디언즈 | 2016-01-02 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-08 | <NA> | 2023-08-10
5 | 6 | 머털도사 | 2016-01-09 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-08 | <NA> | 2023-08-10
6 | 7 | 눈의여왕 | 2016-01-16 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-08 | <NA> | 2023-08-10
7 | 8 | 하울의 움직이는 성 | 2016-01-23 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-08 | <NA> | 2023-08-10
8 | 9 | 겨울왕국 | 2016-01-30 | 15:00 | 전체관람가 | 애니메이션 | 3층 시청각실 | 2016-01-08 | <NA> | 2023-08-10
9 | 10 | 가부와 메이 이야기 | 2016-02-27 | 15:00 | 전체관람가 | 애니메이션 | 초월도서관 3층 시청각실 | 2016-01-11 | 2016-01-23 | 2023-08-10

(index) | 연번 | 제목 | 시작날 | 시작시간 | 분류 | 장르 | 장소 | 등록날 | 수정날 | 데이터기준일자
1586 | 1587 | 인사이드 아웃(102분) | 2023-07-15 | 15:00 | 전체관람가 | 애니메이션 | 능평도서관 | 2023-06-13 | <NA> | 2023-08-10
1587 | 1588 | 주먹왕 랄프2 : 인터넷 속으로(112분) | 2023-07-08 | 15:00 | 전체관람가 | 애니메이션 | 곤지암도서관 | 2023-06-13 | <NA> | 2023-08-10
1588 | 1589 | 출동! 소방관 샘 - 외계인 대소동 (63분) | 2023-07-01 | 15:00 | 전체관람가 | 애니메이션 | 초월도서관 | 2023-06-13 | <NA> | 2023-08-10
1589 | 1590 | 스노우 몬스터 (92분) | 2023-06-24 | 15:00 | 전체관람가 | 애니메이션 | 오포도서관 | 2023-05-19 | <NA> | 2023-08-10
1590 | 1591 | 배드가이즈 (100분) | 2023-06-17 | 15:00 | 전체관람가 | 애니메이션 | 중앙도서관 | 2023-05-19 | <NA> | 2023-08-10
1591 | 1592 | 주먹왕 랄프1 (108분) | 2023-06-10 | 15:00 | 전체관람가 | 애니메이션 | 곤지암도서관 | 2023-05-19 | <NA> | 2023-08-10
1592 | 1593 | 장수상회(112분) | 2023-06-03 | 15:00 | 전체관람가 | 애니메이션 | 능평도서관 | 2023-05-19 | <NA> | 2023-08-10
1593 | 1594 | 굿 다이노 (101분) | 2023-05-20 | 15:00 | 전체관람가 | 애니메이션 | 초월도서관 | 2023-04-14 | <NA> | 2023-08-10
1594 | 1595 | 엔칸토: 마법의 세계 (109분) | 2023-05-13 | 15:00 | 전체관람가 | 애니메이션 | 오포도서관 | 2023-04-14 | <NA> | 2023-08-10
1595 | 1596 | 씽2게더 (110분) | 2023-05-06 | 15:00 | 전체관람가 | 애니메이션 | 중앙도서관 | 2023-04-14 | <NA> | 2023-08-10