Overview

Dataset statistics

Number of variables9
Number of observations133
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.3 KiB
Average record size in memory79.0 B

Variable types

Numeric5
Categorical4

Dataset

Description해외방송시장조사 보고서 중 OTT관련 이용행태 및 만족도 등에 대한 데이터 중 해외국가별(인도, 싱가포르 등 10개국), 연령, 소득, 성별,고객가치, 직업 등에 따른 선호하는 OTT 시청 유형에 대한 통계데이터
URLhttps://www.data.go.kr/data/15102280/fileData.do

Alerts

조사연도 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
국가 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 조사연도 and 2 other fieldsHigh correlation
동시간대방송되고있는프로그램을더많이시청(비율) is highly overall correlated with 주문형으로더많이시청(비율)High correlation
주문형으로더많이시청(비율) is highly overall correlated with 동시간대방송되고있는프로그램을더많이시청(비율)High correlation
분류 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
구분 is highly overall correlated with 분류High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:06:16.858819
Analysis finished2023-12-12 07:06:20.501372
Duration3.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct133
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67
Minimum1
Maximum133
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T16:06:20.605459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.6
Q134
median67
Q3100
95-th percentile126.4
Maximum133
Range132
Interquartile range (IQR)66

Descriptive statistics

Standard deviation38.53786
Coefficient of variation (CV)0.57519194
Kurtosis-1.2
Mean67
Median Absolute Deviation (MAD)33
Skewness0
Sum8911
Variance1485.1667
MonotonicityStrictly increasing
2023-12-12T16:06:20.792137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
85 1
 
0.8%
99 1
 
0.8%
98 1
 
0.8%
97 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
Other values (123) 123
92.5%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
133 1
0.8%
132 1
0.8%
131 1
0.8%
130 1
0.8%
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%

조사연도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2019
103 
2018
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2019 103
77.4%
2018 30
 
22.6%

Length

2023-12-12T16:06:20.967109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:06:21.130110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 103
77.4%
2018 30
 
22.6%

국가
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
미국
31 
인도
18 
터키
18 
캐나다
18 
헝가리
18 
Other values (5)
30 

Length

Max length5
Median length2
Mean length2.6766917
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row말레이시아
2nd row말레이시아
3rd row말레이시아
4th row말레이시아
5th row말레이시아

Common Values

ValueCountFrequency (%)
미국 31
23.3%
인도 18
13.5%
터키 18
13.5%
캐나다 18
13.5%
헝가리 18
13.5%
말레이시아 6
 
4.5%
베트남 6
 
4.5%
싱가포르 6
 
4.5%
인도네시아 6
 
4.5%
태국 6
 
4.5%

Length

2023-12-12T16:06:21.273598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:06:21.427850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미국 31
23.3%
인도 18
13.5%
터키 18
13.5%
캐나다 18
13.5%
헝가리 18
13.5%
말레이시아 6
 
4.5%
베트남 6
 
4.5%
싱가포르 6
 
4.5%
인도네시아 6
 
4.5%
태국 6
 
4.5%

분류
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
연령별
55 
직업별
19 
소득별
13 
고객가치별
12 
학력별
11 
Other values (4)
23 

Length

Max length5
Median length3
Mean length3.1052632
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연령별
2nd row연령별
3rd row연령별
4th row연령별
5th row연령별

Common Values

ValueCountFrequency (%)
연령별 55
41.4%
직업별 19
 
14.3%
소득별 13
 
9.8%
고객가치별 12
 
9.0%
학력별 11
 
8.3%
성별 10
 
7.5%
인종별 6
 
4.5%
국가별 4
 
3.0%
지역별 3
 
2.3%

Length

2023-12-12T16:06:21.615190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:06:21.757554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연령별 55
41.4%
직업별 19
 
14.3%
소득별 13
 
9.8%
고객가치별 12
 
9.0%
학력별 11
 
8.3%
성별 10
 
7.5%
인종별 6
 
4.5%
국가별 4
 
3.0%
지역별 3
 
2.3%

구분
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
20대
10 
40대
10 
50대 이상
10 
10대
10 
30대
10 
Other values (41)
83 

Length

Max length20
Median length16
Mean length4.6466165
Min length2

Unique

Unique30 ?
Unique (%)22.6%

Sample

1st row전체
2nd row10대
3rd row20대
4th row30대
5th row40대

Common Values

ValueCountFrequency (%)
20대 10
 
7.5%
40대 10
 
7.5%
50대 이상 10
 
7.5%
10대 10
 
7.5%
30대 10
 
7.5%
전체 9
 
6.8%
남성 5
 
3.8%
여성 5
 
3.8%
고졸 이하 5
 
3.8%
사무·전문직 5
 
3.8%
Other values (36) 54
40.6%

Length

2023-12-12T16:06:21.952836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
이상 20
 
11.6%
20대 10
 
5.8%
50대 10
 
5.8%
10대 10
 
5.8%
30대 10
 
5.8%
40대 10
 
5.8%
전체 9
 
5.2%
미만 8
 
4.6%
남성 5
 
2.9%
여성 5
 
2.9%
Other values (37) 76
43.9%

사례수(명)
Real number (ℝ)

Distinct95
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean96.030075
Minimum2
Maximum322
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T16:06:22.125978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile23
Q148
median70
Q3128
95-th percentile251
Maximum322
Range320
Interquartile range (IQR)80

Descriptive statistics

Standard deviation71.646961
Coefficient of variation (CV)0.74608877
Kurtosis1.4388805
Mean96.030075
Median Absolute Deviation (MAD)31
Skewness1.371827
Sum12772
Variance5133.287
MonotonicityNot monotonic
2023-12-12T16:06:22.639506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
58 4
 
3.0%
87 3
 
2.3%
48 3
 
2.3%
39 3
 
2.3%
56 3
 
2.3%
82 3
 
2.3%
60 3
 
2.3%
36 3
 
2.3%
163 2
 
1.5%
52 2
 
1.5%
Other values (85) 104
78.2%
ValueCountFrequency (%)
2 1
0.8%
4 1
0.8%
14 1
0.8%
16 1
0.8%
17 1
0.8%
19 1
0.8%
23 2
1.5%
24 1
0.8%
25 1
0.8%
26 1
0.8%
ValueCountFrequency (%)
322 1
0.8%
321 1
0.8%
319 1
0.8%
293 1
0.8%
281 1
0.8%
267 1
0.8%
266 1
0.8%
241 1
0.8%
224 1
0.8%
223 1
0.8%
Distinct118
Distinct (%)88.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.209774
Minimum0
Maximum73.1
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T16:06:22.817216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile8.86
Q116.2
median25
Q337.5
95-th percentile49.34
Maximum73.1
Range73.1
Interquartile range (IQR)21.3

Descriptive statistics

Standard deviation13.389644
Coefficient of variation (CV)0.49208947
Kurtosis-0.10397081
Mean27.209774
Median Absolute Deviation (MAD)10.2
Skewness0.47285167
Sum3618.9
Variance179.28256
MonotonicityNot monotonic
2023-12-12T16:06:23.015965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25.0 3
 
2.3%
22.2 3
 
2.3%
50.0 2
 
1.5%
35.7 2
 
1.5%
41.0 2
 
1.5%
28.6 2
 
1.5%
14.3 2
 
1.5%
34.0 2
 
1.5%
21.1 2
 
1.5%
19.4 2
 
1.5%
Other values (108) 111
83.5%
ValueCountFrequency (%)
0.0 1
0.8%
1.9 1
0.8%
5.7 1
0.8%
7.1 1
0.8%
7.7 1
0.8%
8.7 1
0.8%
8.8 1
0.8%
8.9 1
0.8%
9.5 1
0.8%
9.8 1
0.8%
ValueCountFrequency (%)
73.1 1
0.8%
58.6 1
0.8%
54.8 1
0.8%
52.1 1
0.8%
50.6 1
0.8%
50.0 2
1.5%
48.9 1
0.8%
48.7 1
0.8%
48.1 1
0.8%
45.8 1
0.8%

비슷함(비율)
Real number (ℝ)

Distinct103
Distinct (%)77.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.803008
Minimum0
Maximum64.3
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T16:06:23.210126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile20.46
Q126.5
median32.3
Q338.5
95-th percentile55.8
Maximum64.3
Range64.3
Interquartile range (IQR)12

Descriptive statistics

Standard deviation11.316139
Coefficient of variation (CV)0.33476721
Kurtosis0.77646011
Mean33.803008
Median Absolute Deviation (MAD)5.9
Skewness0.54618379
Sum4495.8
Variance128.05499
MonotonicityNot monotonic
2023-12-12T16:06:23.391120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.3 5
 
3.8%
25.0 3
 
2.3%
30.6 3
 
2.3%
28.2 3
 
2.3%
25.6 3
 
2.3%
38.6 3
 
2.3%
43.8 2
 
1.5%
30.2 2
 
1.5%
27.7 2
 
1.5%
30.1 2
 
1.5%
Other values (93) 105
78.9%
ValueCountFrequency (%)
0.0 1
0.8%
7.7 1
0.8%
11.7 1
0.8%
13.9 1
0.8%
16.1 1
0.8%
19.0 1
0.8%
20.4 1
0.8%
20.5 1
0.8%
20.9 1
0.8%
21.1 1
0.8%
ValueCountFrequency (%)
64.3 1
0.8%
61.7 1
0.8%
61.0 1
0.8%
60.0 1
0.8%
59.7 2
1.5%
56.1 1
0.8%
55.6 1
0.8%
55.4 1
0.8%
54.5 1
0.8%
54.4 1
0.8%

주문형으로더많이시청(비율)
Real number (ℝ)

HIGH CORRELATION 

Distinct113
Distinct (%)85.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38.996241
Minimum12.9
Maximum75
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T16:06:23.540222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12.9
5-th percentile20.42
Q129.6
median37.9
Q347.8
95-th percentile63.4
Maximum75
Range62.1
Interquartile range (IQR)18.2

Descriptive statistics

Standard deviation12.692287
Coefficient of variation (CV)0.32547463
Kurtosis-0.18469127
Mean38.996241
Median Absolute Deviation (MAD)8.7
Skewness0.50615971
Sum5186.5
Variance161.09415
MonotonicityNot monotonic
2023-12-12T16:06:23.755463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
29.1 3
 
2.3%
30.4 3
 
2.3%
50.0 3
 
2.3%
39.9 2
 
1.5%
20.8 2
 
1.5%
35.9 2
 
1.5%
19.5 2
 
1.5%
28.6 2
 
1.5%
63.4 2
 
1.5%
35.2 2
 
1.5%
Other values (103) 110
82.7%
ValueCountFrequency (%)
12.9 1
0.8%
17.1 1
0.8%
19.2 1
0.8%
19.5 2
1.5%
19.6 1
0.8%
20.0 1
0.8%
20.7 1
0.8%
20.8 2
1.5%
22.2 1
0.8%
22.4 1
0.8%
ValueCountFrequency (%)
75.0 1
0.8%
72.2 1
0.8%
65.0 1
0.8%
64.8 1
0.8%
64.6 1
0.8%
64.2 1
0.8%
63.4 2
1.5%
61.1 1
0.8%
60.9 1
0.8%
59.6 1
0.8%

Interactions

2023-12-12T16:06:19.663193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.329679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.877860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.515428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.106291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.786964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.436088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.986548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.639057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.205496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.901602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.550122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.117014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.757645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.311632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:20.020389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.670567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.274115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.869749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.424932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:20.131840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:17.767884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.388426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:18.990188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:06:19.557915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:06:23.876824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번조사연도국가분류구분사례수(명)동시간대방송되고있는프로그램을더많이시청(비율)비슷함(비율)주문형으로더많이시청(비율)
연번1.0000.9960.9070.8220.7870.5630.5430.3310.266
조사연도0.9961.0001.0000.6080.2580.2430.0000.0000.302
국가0.9071.0001.0000.3110.0000.4200.7160.7640.786
분류0.8220.6080.3111.0000.9940.5730.2030.0690.000
구분0.7870.2580.0000.9941.0000.6650.8130.7520.000
사례수(명)0.5630.2430.4200.5730.6651.0000.0000.0000.000
동시간대방송되고있는프로그램을더많이시청(비율)0.5430.0000.7160.2030.8130.0001.0000.6680.821
비슷함(비율)0.3310.0000.7640.0690.7520.0000.6681.0000.541
주문형으로더많이시청(비율)0.2660.3020.7860.0000.0000.0000.8210.5411.000
2023-12-12T16:06:24.079936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류조사연도구분국가
분류1.0000.5960.7890.144
조사연도0.5961.0000.1600.969
구분0.7890.1601.0000.000
국가0.1440.9690.0001.000
2023-12-12T16:06:24.257479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사례수(명)동시간대방송되고있는프로그램을더많이시청(비율)비슷함(비율)주문형으로더많이시청(비율)조사연도국가분류구분
연번1.0000.0000.409-0.213-0.2930.9160.5110.5610.350
사례수(명)0.0001.000-0.0760.164-0.0500.1780.1230.3100.258
동시간대방송되고있는프로그램을더많이시청(비율)0.409-0.0761.000-0.449-0.5810.0000.2930.0900.366
비슷함(비율)-0.2130.164-0.4491.000-0.3320.0000.3310.0210.307
주문형으로더많이시청(비율)-0.293-0.050-0.581-0.3321.0000.2230.3510.0000.000
조사연도0.9160.1780.0000.0000.2231.0000.9690.5960.160
국가0.5110.1230.2930.3310.3510.9691.0000.1440.000
분류0.5610.3100.0900.0210.0000.5960.1441.0000.789
구분0.3500.2580.3660.3070.0000.1600.0000.7891.000

Missing values

2023-12-12T16:06:20.277003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:06:20.433744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번조사연도국가분류구분사례수(명)동시간대방송되고있는프로그램을더많이시청(비율)비슷함(비율)주문형으로더많이시청(비율)
012018말레이시아연령별전체19219.333.946.9
122018말레이시아연령별10대3616.722.261.1
232018말레이시아연령별20대4820.835.443.8
342018말레이시아연령별30대4517.840.042.2
452018말레이시아연령별40대4415.940.943.2
562018말레이시아연령별50대 이상1931.621.147.4
672018베트남연령별전체26622.222.255.6
782018베트남연령별10대3613.913.972.2
892018베트남연령별20대6023.311.765.0
9102018베트남연령별30대7829.525.644.9
연번조사연도국가분류구분사례수(명)동시간대방송되고있는프로그램을더많이시청(비율)비슷함(비율)주문형으로더많이시청(비율)
1231242019미국직업별서비스업1631.325.043.8
1241252019미국직업별학생5837.919.043.1
1251262019미국직업별주부1428.621.450.0
1261272019미국직업별농림어업/군인450.025.025.0
1271282019미국직업별기타/무직2343.530.426.1
1281292019미국소득별3,000$ 미만5635.716.148.2
1291302019미국소득별3,000$~5,000$ 미만6142.624.632.8
1301312019미국소득별5,000$~7,000$ 미만5540.030.929.1
1311322019미국소득별7,000$ 이상~10,000$ 미만7245.833.320.8
1321332019미국소득별10,000$ 이상8750.629.919.5