Overview

Dataset statistics

Number of variables8
Number of observations229
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.6 KiB
Average record size in memory69.6 B

Variable types

Numeric4
Categorical3
Text1

Dataset

Description해외방송시장조사 보고서 중 OTT관련 이용행태 및 만족도 등에 대한 데이터 중 국가(미국, 호주, 말레이시아, 영국, 터키, 러시아 등 15개국), 연령, 소득, 성별, 고객가치, 직업 등에 따른 OTT서비스 시청여부(예/아니오)비율에 대한 통계데이터
URLhttps://www.data.go.kr/data/15102265/fileData.do

Alerts

국가 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
조사연도 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 예(비율) and 4 other fieldsHigh correlation
사례수(명) is highly overall correlated with 분류High correlation
예(비율) is highly overall correlated with 연번 and 1 other fieldsHigh correlation
아니요(비율) is highly overall correlated with 연번 and 1 other fieldsHigh correlation
분류 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:27:52.001812
Analysis finished2023-12-12 23:27:53.954870
Duration1.95 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct229
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean115
Minimum1
Maximum229
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T08:27:54.344550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.4
Q158
median115
Q3172
95-th percentile217.6
Maximum229
Range228
Interquartile range (IQR)114

Descriptive statistics

Standard deviation66.250786
Coefficient of variation (CV)0.57609379
Kurtosis-1.2
Mean115
Median Absolute Deviation (MAD)57
Skewness0
Sum26335
Variance4389.1667
MonotonicityStrictly increasing
2023-12-13T08:27:54.512434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
145 1
 
0.4%
147 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
Other values (219) 219
95.6%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
229 1
0.4%
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%
220 1
0.4%

조사연도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2019
104 
2020
95 
2018
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2019 104
45.4%
2020 95
41.5%
2018 30
 
13.1%

Length

2023-12-13T08:27:54.649964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:27:54.752607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 104
45.4%
2020 95
41.5%
2018 30
 
13.1%

국가
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
미국
32 
영국
19 
호주
19 
러시아
19 
브라질
19 
Other values (10)
121 

Length

Max length5
Median length4
Mean length2.6419214
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row말레이시아
2nd row말레이시아
3rd row말레이시아
4th row말레이시아
5th row말레이시아

Common Values

ValueCountFrequency (%)
미국 32
14.0%
영국 19
8.3%
호주 19
8.3%
러시아 19
8.3%
브라질 19
8.3%
UAE 19
8.3%
인도 18
7.9%
터키 18
7.9%
캐나다 18
7.9%
헝가리 18
7.9%
Other values (5) 30
13.1%

Length

2023-12-13T08:27:54.870105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미국 32
14.0%
영국 19
8.3%
호주 19
8.3%
러시아 19
8.3%
브라질 19
8.3%
uae 19
8.3%
인도 18
7.9%
터키 18
7.9%
캐나다 18
7.9%
헝가리 18
7.9%
Other values (5) 30
13.1%

분류
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
연령별
80 
직업별
54 
소득별
23 
학력별
22 
성별
20 
Other values (4)
30 

Length

Max length5
Median length3
Mean length3.0174672
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연령별
2nd row연령별
3rd row연령별
4th row연령별
5th row연령별

Common Values

ValueCountFrequency (%)
연령별 80
34.9%
직업별 54
23.6%
소득별 23
 
10.0%
학력별 22
 
9.6%
성별 20
 
8.7%
고객가치별 12
 
5.2%
국가별 9
 
3.9%
인종별 6
 
2.6%
지역별 3
 
1.3%

Length

2023-12-13T08:27:55.013446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:27:55.135553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연령별 80
34.9%
직업별 54
23.6%
소득별 23
 
10.0%
학력별 22
 
9.6%
성별 20
 
8.7%
고객가치별 12
 
5.2%
국가별 9
 
3.9%
인종별 6
 
2.6%
지역별 3
 
1.3%

구분
Text

Distinct55
Distinct (%)24.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T08:27:55.337444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length10
Mean length4.3886463
Min length2

Characters and Unicode

Total characters1005
Distinct characters83
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)13.1%

Sample

1st row전체
2nd row10대
3rd row20대
4th row30대
5th row40대
ValueCountFrequency (%)
이상 35
 
11.7%
10대 15
 
5.0%
40대 15
 
5.0%
50대 15
 
5.0%
20대 15
 
5.0%
이하 15
 
5.0%
30대 15
 
5.0%
전체 14
 
4.7%
평균 10
 
3.3%
남성 10
 
3.3%
Other values (46) 140
46.8%
2023-12-13T08:27:55.682146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 131
 
13.0%
86
 
8.6%
70
 
7.0%
51
 
5.1%
42
 
4.2%
36
 
3.6%
25
 
2.5%
24
 
2.4%
22
 
2.2%
5 21
 
2.1%
Other values (73) 497
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 660
65.7%
Decimal Number 222
 
22.1%
Space Separator 70
 
7.0%
Other Punctuation 40
 
4.0%
Currency Symbol 8
 
0.8%
Math Symbol 3
 
0.3%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
13.0%
51
 
7.7%
42
 
6.4%
36
 
5.5%
25
 
3.8%
24
 
3.6%
22
 
3.3%
20
 
3.0%
20
 
3.0%
17
 
2.6%
Other values (57) 317
48.0%
Decimal Number
ValueCountFrequency (%)
0 131
59.0%
5 21
 
9.5%
1 17
 
7.7%
4 17
 
7.7%
3 17
 
7.7%
2 15
 
6.8%
8 2
 
0.9%
7 2
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 16
40.0%
/ 15
37.5%
· 9
22.5%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%
Space Separator
ValueCountFrequency (%)
70
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 8
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 660
65.7%
Common 343
34.1%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
13.0%
51
 
7.7%
42
 
6.4%
36
 
5.5%
25
 
3.8%
24
 
3.6%
22
 
3.3%
20
 
3.0%
20
 
3.0%
17
 
2.6%
Other values (57) 317
48.0%
Common
ValueCountFrequency (%)
0 131
38.2%
70
20.4%
5 21
 
6.1%
1 17
 
5.0%
4 17
 
5.0%
3 17
 
5.0%
, 16
 
4.7%
/ 15
 
4.4%
2 15
 
4.4%
· 9
 
2.6%
Other values (4) 15
 
4.4%
Latin
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 660
65.7%
ASCII 336
33.4%
None 9
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 131
39.0%
70
20.8%
5 21
 
6.2%
1 17
 
5.1%
4 17
 
5.1%
3 17
 
5.1%
, 16
 
4.8%
/ 15
 
4.5%
2 15
 
4.5%
$ 8
 
2.4%
Other values (5) 9
 
2.7%
Hangul
ValueCountFrequency (%)
86
 
13.0%
51
 
7.7%
42
 
6.4%
36
 
5.5%
25
 
3.8%
24
 
3.6%
22
 
3.3%
20
 
3.0%
20
 
3.0%
17
 
2.6%
Other values (57) 317
48.0%
None
ValueCountFrequency (%)
· 9
100.0%

사례수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct146
Distinct (%)63.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean142.33188
Minimum1
Maximum461
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T08:27:55.854721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile25
Q170
median102
Q3208
95-th percentile400
Maximum461
Range460
Interquartile range (IQR)138

Descriptive statistics

Standard deviation104.63002
Coefficient of variation (CV)0.73511308
Kurtosis0.98087382
Mean142.33188
Median Absolute Deviation (MAD)51
Skewness1.2265056
Sum32594
Variance10947.442
MonotonicityNot monotonic
2023-12-13T08:27:55.984429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
400 5
 
2.2%
16 5
 
2.2%
88 5
 
2.2%
91 5
 
2.2%
131 4
 
1.7%
74 4
 
1.7%
214 4
 
1.7%
93 4
 
1.7%
61 3
 
1.3%
92 3
 
1.3%
Other values (136) 187
81.7%
ValueCountFrequency (%)
1 1
 
0.4%
7 1
 
0.4%
8 1
 
0.4%
16 5
2.2%
17 1
 
0.4%
18 1
 
0.4%
22 1
 
0.4%
25 2
 
0.9%
27 1
 
0.4%
30 1
 
0.4%
ValueCountFrequency (%)
461 1
 
0.4%
439 1
 
0.4%
435 1
 
0.4%
431 1
 
0.4%
428 1
 
0.4%
426 1
 
0.4%
424 1
 
0.4%
419 1
 
0.4%
410 1
 
0.4%
400 5
2.2%

예(비율)
Real number (ℝ)

HIGH CORRELATION 

Distinct181
Distinct (%)79.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.612227
Minimum0
Maximum97.7
Zeros1
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T08:27:56.114954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile42.36
Q157
median70.3
Q385
95-th percentile91.2
Maximum97.7
Range97.7
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.580916
Coefficient of variation (CV)0.23818971
Kurtosis0.28828096
Mean69.612227
Median Absolute Deviation (MAD)14.5
Skewness-0.59235163
Sum15941.2
Variance274.92678
MonotonicityNot monotonic
2023-12-13T08:27:56.266931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
87.3 3
 
1.3%
75.0 3
 
1.3%
86.6 3
 
1.3%
93.8 3
 
1.3%
85.0 3
 
1.3%
65.7 3
 
1.3%
90.9 3
 
1.3%
61.3 3
 
1.3%
90.4 3
 
1.3%
87.8 3
 
1.3%
Other values (171) 199
86.9%
ValueCountFrequency (%)
0.0 1
0.4%
25.0 1
0.4%
30.2 1
0.4%
31.5 1
0.4%
34.2 1
0.4%
36.8 1
0.4%
38.1 2
0.9%
40.0 1
0.4%
40.6 1
0.4%
41.5 1
0.4%
ValueCountFrequency (%)
97.7 1
 
0.4%
94.9 2
0.9%
93.8 3
1.3%
93.4 1
 
0.4%
92.9 1
 
0.4%
92.2 1
 
0.4%
91.7 1
 
0.4%
91.5 1
 
0.4%
91.2 2
0.9%
90.9 3
1.3%

아니요(비율)
Real number (ℝ)

HIGH CORRELATION 

Distinct182
Distinct (%)79.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.389956
Minimum2.3
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T08:27:56.392150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.3
5-th percentile8.8
Q115
median29.7
Q343
95-th percentile57.64
Maximum100
Range97.7
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.580078
Coefficient of variation (CV)0.54557754
Kurtosis0.28835347
Mean30.389956
Median Absolute Deviation (MAD)14.5
Skewness0.59260725
Sum6959.3
Variance274.89898
MonotonicityNot monotonic
2023-12-13T08:27:56.548508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34.3 3
 
1.3%
12.2 3
 
1.3%
9.6 3
 
1.3%
15.0 3
 
1.3%
9.1 3
 
1.3%
13.4 3
 
1.3%
38.7 3
 
1.3%
25.0 3
 
1.3%
12.7 3
 
1.3%
12.0 2
 
0.9%
Other values (172) 200
87.3%
ValueCountFrequency (%)
2.3 1
0.4%
5.1 2
0.9%
6.2 1
0.4%
6.3 2
0.9%
6.6 1
0.4%
7.1 1
0.4%
7.8 1
0.4%
8.3 1
0.4%
8.5 1
0.4%
8.8 2
0.9%
ValueCountFrequency (%)
100.0 1
0.4%
75.0 1
0.4%
69.8 1
0.4%
68.5 1
0.4%
65.8 1
0.4%
63.2 1
0.4%
61.9 2
0.9%
60.0 1
0.4%
59.4 1
0.4%
58.5 1
0.4%

Interactions

2023-12-13T08:27:53.395807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.351956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.657341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.041425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.474983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.419903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.750290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.135427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.569466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.497689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.856241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.214058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.662205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.573458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:52.951140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:27:53.307438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:27:56.638834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번조사연도국가분류구분사례수(명)예(비율)아니요(비율)
연번1.0000.9430.8730.7840.8700.6780.5020.502
조사연도0.9431.0001.0000.7140.6500.5030.7830.783
국가0.8731.0001.0000.3650.0000.0000.6280.628
분류0.7840.7140.3651.0000.9970.7810.4610.461
구분0.8700.6500.0000.9971.0000.9020.8740.874
사례수(명)0.6780.5030.0000.7810.9021.0000.0000.000
예(비율)0.5020.7830.6280.4610.8740.0001.0001.000
아니요(비율)0.5020.7830.6280.4610.8740.0001.0001.000
2023-12-13T08:27:56.747645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국가조사연도분류
국가1.0000.9730.154
조사연도0.9731.0000.415
분류0.1540.4151.000
2023-12-13T08:27:56.838753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사례수(명)예(비율)아니요(비율)조사연도국가분류
연번1.000-0.2160.569-0.5690.9150.5520.506
사례수(명)-0.2161.000-0.0040.0030.3670.0610.508
예(비율)0.569-0.0041.000-1.0000.4820.3090.162
아니요(비율)-0.5690.003-1.0001.0000.4820.3090.162
조사연도0.9150.3670.4820.4821.0000.9730.415
국가0.5520.0610.3090.3090.9731.0000.154
분류0.5060.5080.1620.1620.4150.1541.000

Missing values

2023-12-13T08:27:53.791074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:27:53.909764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번조사연도국가분류구분사례수(명)예(비율)아니요(비율)
012018말레이시아연령별전체40048.052.0
122018말레이시아연령별10대6952.247.8
232018말레이시아연령별20대8854.545.5
342018말레이시아연령별30대8851.148.9
452018말레이시아연령별40대9247.852.2
562018말레이시아연령별50대 이상6330.269.8
672018베트남연령별전체40066.533.5
782018베트남연령별10대5367.932.1
892018베트남연령별20대9960.639.4
9102018베트남연령별30대10872.227.8
연번조사연도국가분류구분사례수(명)예(비율)아니요(비율)
2192202020영국소득별평균 이하18985.214.8
2202212020영국소득별평균 이상12293.46.6
2212222020호주소득별평균 이하17667.632.4
2222232020호주소득별평균 이상10380.619.4
2232242020러시아소득별평균 이하23083.017.0
2242252020러시아소득별평균 이상13093.86.2
2252262020브라질소득별평균 이하19790.49.6
2262272020브라질소득별평균 이상10087.013.0
2272282020UAE소득별평균 이하18489.710.3
2282292020UAE소득별평균 이상14087.112.9