Overview

Dataset statistics

Number of variables5
Number of observations23
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory47.7 B

Variable types

Text2
Categorical1
Numeric2

Dataset

Description충청북도 영화상영관 정보 - 영화관명, 소재지, 전화번호, 상영관수, 좌석수의 정보를 제공합니다. (CGV청주, 롯데시네마, 메가박스 등)
URLhttps://www.data.go.kr/data/15026938/fileData.do

Alerts

상영관수 is highly overall correlated with 좌석수High correlation
좌석수 is highly overall correlated with 상영관수High correlation
영화관명 has unique valuesUnique
소재지 has unique valuesUnique
좌석수 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:26:10.160428
Analysis finished2023-12-12 11:26:11.258428
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

영화관명
Text

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-12T20:26:11.425643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length8.7391304
Min length5

Characters and Unicode

Total characters201
Distinct characters77
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st rowCGV 청주성안길
2nd rowCGV 청주터미널
3rd rowCGV 청주서문점
4th rowCGV 청주지웰시티관
5th rowCGV 청주율량점
ValueCountFrequency (%)
cgv 7
 
16.7%
메가박스 5
 
11.9%
롯데시네마 3
 
7.1%
충주 2
 
4.8%
충주교현점 1
 
2.4%
with 1
 
2.4%
보은영화관 1
 
2.4%
cgv제천 1
 
2.4%
설성시네마 1
 
2.4%
충북혁신도시점 1
 
2.4%
Other values (19) 19
45.2%
2023-12-12T20:26:11.950856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
9.5%
12
 
6.0%
C 8
 
4.0%
V 8
 
4.0%
8
 
4.0%
G 8
 
4.0%
8
 
4.0%
7
 
3.5%
6
 
3.0%
6
 
3.0%
Other values (67) 111
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 153
76.1%
Uppercase Letter 25
 
12.4%
Space Separator 19
 
9.5%
Lowercase Letter 4
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
7.8%
8
 
5.2%
8
 
5.2%
7
 
4.6%
6
 
3.9%
6
 
3.9%
5
 
3.3%
5
 
3.3%
5
 
3.3%
5
 
3.3%
Other values (58) 86
56.2%
Uppercase Letter
ValueCountFrequency (%)
C 8
32.0%
V 8
32.0%
G 8
32.0%
Q 1
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
w 1
25.0%
i 1
25.0%
t 1
25.0%
h 1
25.0%
Space Separator
ValueCountFrequency (%)
19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 153
76.1%
Latin 29
 
14.4%
Common 19
 
9.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
7.8%
8
 
5.2%
8
 
5.2%
7
 
4.6%
6
 
3.9%
6
 
3.9%
5
 
3.3%
5
 
3.3%
5
 
3.3%
5
 
3.3%
Other values (58) 86
56.2%
Latin
ValueCountFrequency (%)
C 8
27.6%
V 8
27.6%
G 8
27.6%
w 1
 
3.4%
i 1
 
3.4%
t 1
 
3.4%
h 1
 
3.4%
Q 1
 
3.4%
Common
ValueCountFrequency (%)
19
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 153
76.1%
ASCII 48
 
23.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19
39.6%
C 8
16.7%
V 8
16.7%
G 8
16.7%
w 1
 
2.1%
i 1
 
2.1%
t 1
 
2.1%
h 1
 
2.1%
Q 1
 
2.1%
Hangul
ValueCountFrequency (%)
12
 
7.8%
8
 
5.2%
8
 
5.2%
7
 
4.6%
6
 
3.9%
6
 
3.9%
5
 
3.3%
5
 
3.3%
5
 
3.3%
5
 
3.3%
Other values (58) 86
56.2%

소재지
Text

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-12T20:26:12.288323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length28
Mean length26.086957
Min length20

Characters and Unicode

Total characters600
Distinct characters108
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row충청북도 청주시 상당구 상당로81번길 33 (북문로1가)
2nd row충청북도 청주시 흥덕구 2순환로 1233 드림플러스
3rd row충청북도 청주시 상당구 상당로81번길 63
4th row충청북도 청주시 흥덕구 대농로 47-1 (복대동)
5th row충청북도 청주시 상당구 충청대로 114 (라마다프라자 청주)
ValueCountFrequency (%)
충청북도 22
 
16.7%
청주시 10
 
7.6%
상당구 4
 
3.0%
충주시 4
 
3.0%
흥덕구 4
 
3.0%
음성군 2
 
1.5%
3 2
 
1.5%
1순환로 2
 
1.5%
4층 2
 
1.5%
제천시 2
 
1.5%
Other values (75) 78
59.1%
2023-12-12T20:26:13.044372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
18.5%
35
 
5.8%
28
 
4.7%
26
 
4.3%
1 23
 
3.8%
23
 
3.8%
18
 
3.0%
17
 
2.8%
15
 
2.5%
3 15
 
2.5%
Other values (98) 289
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 370
61.7%
Space Separator 111
 
18.5%
Decimal Number 89
 
14.8%
Open Punctuation 9
 
1.5%
Close Punctuation 9
 
1.5%
Other Punctuation 7
 
1.2%
Dash Punctuation 4
 
0.7%
Math Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
9.5%
28
 
7.6%
26
 
7.0%
23
 
6.2%
18
 
4.9%
17
 
4.6%
15
 
4.1%
10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (82) 178
48.1%
Decimal Number
ValueCountFrequency (%)
1 23
25.8%
3 15
16.9%
2 13
14.6%
4 9
 
10.1%
8 8
 
9.0%
7 6
 
6.7%
6 5
 
5.6%
0 5
 
5.6%
9 3
 
3.4%
5 2
 
2.2%
Space Separator
ValueCountFrequency (%)
111
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 370
61.7%
Common 230
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
9.5%
28
 
7.6%
26
 
7.0%
23
 
6.2%
18
 
4.9%
17
 
4.6%
15
 
4.1%
10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (82) 178
48.1%
Common
ValueCountFrequency (%)
111
48.3%
1 23
 
10.0%
3 15
 
6.5%
2 13
 
5.7%
( 9
 
3.9%
) 9
 
3.9%
4 9
 
3.9%
8 8
 
3.5%
, 7
 
3.0%
7 6
 
2.6%
Other values (6) 20
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 370
61.7%
ASCII 230
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111
48.3%
1 23
 
10.0%
3 15
 
6.5%
2 13
 
5.7%
( 9
 
3.9%
) 9
 
3.9%
4 9
 
3.9%
8 8
 
3.5%
, 7
 
3.0%
7 6
 
2.6%
Other values (6) 20
 
8.7%
Hangul
ValueCountFrequency (%)
35
 
9.5%
28
 
7.6%
26
 
7.0%
23
 
6.2%
18
 
4.9%
17
 
4.6%
15
 
4.1%
10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (82) 178
48.1%

전화번호
Categorical

Distinct11
Distinct (%)47.8%
Missing0
Missing (%)0.0%
Memory size316.0 B
1544-1122
1544-0070
1544-8855
043-238-0590
043-857-7000
Other values (6)

Length

Max length13
Median length9
Mean length10.173913
Min length9

Unique

Unique8 ?
Unique (%)34.8%

Sample

1st row1544-1122
2nd row1544-1122
3rd row1544-1122
4th row1544-1122
5th row1544-1122

Common Values

ValueCountFrequency (%)
1544-1122 8
34.8%
1544-0070 4
17.4%
1544-8855 3
 
13.0%
043-238-0590 1
 
4.3%
043-857-7000 1
 
4.3%
070-5221-1354 1
 
4.3%
043-760-8706 1
 
4.3%
043-731-7050 1
 
4.3%
043-742-7053 1
 
4.3%
070-7704-7788 1
 
4.3%

Length

2023-12-12T20:26:13.288848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1544-1122 8
34.8%
1544-0070 4
17.4%
1544-8855 3
 
13.0%
043-238-0590 1
 
4.3%
043-857-7000 1
 
4.3%
070-5221-1354 1
 
4.3%
043-760-8706 1
 
4.3%
043-731-7050 1
 
4.3%
043-742-7053 1
 
4.3%
070-7704-7788 1
 
4.3%

상영관수
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)43.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.5652174
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size339.0 B
2023-12-12T20:26:13.510628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13.5
median6
Q38
95-th percentile9.8
Maximum11
Range10
Interquartile range (IQR)4.5

Descriptive statistics

Standard deviation2.8094983
Coefficient of variation (CV)0.50483172
Kurtosis-0.94825121
Mean5.5652174
Median Absolute Deviation (MAD)2
Skewness0.072719909
Sum128
Variance7.8932806
MonotonicityNot monotonic
2023-12-12T20:26:13.676455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
8 6
26.1%
2 4
17.4%
6 3
13.0%
4 3
13.0%
5 2
 
8.7%
10 1
 
4.3%
11 1
 
4.3%
1 1
 
4.3%
7 1
 
4.3%
3 1
 
4.3%
ValueCountFrequency (%)
1 1
 
4.3%
2 4
17.4%
3 1
 
4.3%
4 3
13.0%
5 2
 
8.7%
6 3
13.0%
7 1
 
4.3%
8 6
26.1%
10 1
 
4.3%
11 1
 
4.3%
ValueCountFrequency (%)
11 1
 
4.3%
10 1
 
4.3%
8 6
26.1%
7 1
 
4.3%
6 3
13.0%
5 2
 
8.7%
4 3
13.0%
3 1
 
4.3%
2 4
17.4%
1 1
 
4.3%

좌석수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean748.21739
Minimum17
Maximum1839
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size339.0 B
2023-12-12T20:26:13.865101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17
5-th percentile91.3
Q1303
median676
Q31195.5
95-th percentile1661
Maximum1839
Range1822
Interquartile range (IQR)892.5

Descriptive statistics

Standard deviation549.90801
Coefficient of variation (CV)0.73495753
Kurtosis-0.89590437
Mean748.21739
Median Absolute Deviation (MAD)515
Skewness0.39260719
Sum17209
Variance302398.81
MonotonicityNot monotonic
2023-12-12T20:26:14.563703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
904 1
 
4.3%
676 1
 
4.3%
91 1
 
4.3%
642 1
 
4.3%
94 1
 
4.3%
799 1
 
4.3%
97 1
 
4.3%
372 1
 
4.3%
95 1
 
4.3%
234 1
 
4.3%
Other values (13) 13
56.5%
ValueCountFrequency (%)
17 1
4.3%
91 1
4.3%
94 1
4.3%
95 1
4.3%
97 1
4.3%
234 1
4.3%
372 1
4.3%
380 1
4.3%
463 1
4.3%
578 1
4.3%
ValueCountFrequency (%)
1839 1
4.3%
1675 1
4.3%
1535 1
4.3%
1286 1
4.3%
1282 1
4.3%
1200 1
4.3%
1191 1
4.3%
1013 1
4.3%
904 1
4.3%
799 1
4.3%

Interactions

2023-12-12T20:26:10.750047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:26:10.466296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:26:10.897284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:26:10.603111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:26:14.715742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영화관명소재지전화번호상영관수좌석수
영화관명1.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.000
전화번호1.0001.0001.0000.4400.000
상영관수1.0001.0000.4401.0000.780
좌석수1.0001.0000.0000.7801.000
2023-12-12T20:26:14.924321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상영관수좌석수전화번호
상영관수1.0000.9060.000
좌석수0.9061.0000.000
전화번호0.0000.0001.000

Missing values

2023-12-12T20:26:11.064916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:26:11.202675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

영화관명소재지전화번호상영관수좌석수
0CGV 청주성안길충청북도 청주시 상당구 상당로81번길 33 (북문로1가)1544-11228904
1CGV 청주터미널충청북도 청주시 흥덕구 2순환로 1233 드림플러스1544-11228676
2CGV 청주서문점충청북도 청주시 상당구 상당로81번길 631544-1122101839
3CGV 청주지웰시티관충청북도 청주시 흥덕구 대농로 47-1 (복대동)1544-112281675
4CGV 청주율량점충청북도 청주시 상당구 충청대로 114 (라마다프라자 청주)1544-1122111535
5롯데시네마 서청주관충청북도 청주시 흥덕구 2순환로 1004 (비하동, 롯데쇼핑몰 4층)1544-885561286
6롯데시네마 청주용암충청북도 청주시 상당구 1순환로 12331544-885581282
7메가박스 청주사창충청북도 청주시 서원구 사창동 1순환로 6821544-00704380
8메가박스 오창충청북도 청주시 청원구 오창읍 중심상업1로 8-9, 메가시티 3~4층1544-007081013
9오송자동차극장충청북도 청주시 흥덕구 오송읍 미호천길 295043-238-0590117
영화관명소재지전화번호상영관수좌석수
13CGV 충주교현점충청북도 충주시 국원대로 107 한스 타워 5,6,7,8층1544-11226463
14메가박스 제천충청북도 제천시 의병대로18길 1(남천동)1544-007071200
15오가닉메이커협동조합 괴산극장충청북도 괴산군 칠성면 자연드림길 240, 지원센터043-760-87063234
16향수시네마충북북도 옥천군 옥천읍 문정1길 47043-731-7050295
17메가박스 진천충청북도 진천군 진천읍 중앙북1길 3 (진천터미널)1544-00704372
18영동레인보우영화관충청북도 영동군 영동읍 계산로 2길 24043-742-7053297
19CGV 충북혁신도시점충청북도 음성군 맹동면 대하1길 101544-11225799
20설성시네마충청북도 음성군 음성읍 수정로 37 읍민회관070-7704-7788294
21CGV제천충청북도 제천시 장평천로 27-131544-11226642
22보은영화관 with 시네큐충청북도 보은군 보은읍 뱃들로 68-22070-5117-5819291