Overview

Dataset statistics

Number of variables9
Number of observations133
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.3 KiB
Average record size in memory79.0 B

Variable types

Numeric5
Categorical4

Dataset

Description해외방송시장조사 보고서 중 OTT관련 이용행태 및 만족도 등에 대한 데이터 중 국가별(인도, 싱가포르 등 10개국), 연령, 소득, 성별,고객가치, 직업 등에 따른 방송서비스와 OTT 서비스 이용현황 통계데이터
URLhttps://www.data.go.kr/data/15102279/fileData.do

Alerts

조사연도 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
국가 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 조사연도 and 2 other fieldsHigh correlation
방송서비스(지상파_유료방송)를더많이이용(비율) is highly overall correlated with 온라인동영상서비스(OTT)를더많이이용(비율)High correlation
온라인동영상서비스(OTT)를더많이이용(비율) is highly overall correlated with 방송서비스(지상파_유료방송)를더많이이용(비율)High correlation
분류 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
구분 is highly overall correlated with 분류High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:41:32.446737
Analysis finished2023-12-12 11:41:38.092697
Duration5.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct133
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67
Minimum1
Maximum133
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T20:41:38.222677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.6
Q134
median67
Q3100
95-th percentile126.4
Maximum133
Range132
Interquartile range (IQR)66

Descriptive statistics

Standard deviation38.53786
Coefficient of variation (CV)0.57519194
Kurtosis-1.2
Mean67
Median Absolute Deviation (MAD)33
Skewness0
Sum8911
Variance1485.1667
MonotonicityStrictly increasing
2023-12-12T20:41:38.490649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
85 1
 
0.8%
99 1
 
0.8%
98 1
 
0.8%
97 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
Other values (123) 123
92.5%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
133 1
0.8%
132 1
0.8%
131 1
0.8%
130 1
0.8%
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%

조사연도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2019
103 
2018
30 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2019 103
77.4%
2018 30
 
22.6%

Length

2023-12-12T20:41:38.737813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:38.894206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 103
77.4%
2018 30
 
22.6%

국가
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
미국
31 
인도
18 
터키
18 
캐나다
18 
헝가리
18 
Other values (5)
30 

Length

Max length5
Median length2
Mean length2.6766917
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row말레이시아
2nd row말레이시아
3rd row말레이시아
4th row말레이시아
5th row말레이시아

Common Values

ValueCountFrequency (%)
미국 31
23.3%
인도 18
13.5%
터키 18
13.5%
캐나다 18
13.5%
헝가리 18
13.5%
말레이시아 6
 
4.5%
베트남 6
 
4.5%
싱가포르 6
 
4.5%
인도네시아 6
 
4.5%
태국 6
 
4.5%

Length

2023-12-12T20:41:39.079311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:39.311471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미국 31
23.3%
인도 18
13.5%
터키 18
13.5%
캐나다 18
13.5%
헝가리 18
13.5%
말레이시아 6
 
4.5%
베트남 6
 
4.5%
싱가포르 6
 
4.5%
인도네시아 6
 
4.5%
태국 6
 
4.5%

분류
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
연령별
55 
직업별
19 
소득별
13 
고객가치별
12 
학력별
11 
Other values (4)
23 

Length

Max length5
Median length3
Mean length3.1052632
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연령별
2nd row연령별
3rd row연령별
4th row연령별
5th row연령별

Common Values

ValueCountFrequency (%)
연령별 55
41.4%
직업별 19
 
14.3%
소득별 13
 
9.8%
고객가치별 12
 
9.0%
학력별 11
 
8.3%
성별 10
 
7.5%
인종별 6
 
4.5%
국가별 4
 
3.0%
지역별 3
 
2.3%

Length

2023-12-12T20:41:39.588979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:41:39.824317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연령별 55
41.4%
직업별 19
 
14.3%
소득별 13
 
9.8%
고객가치별 12
 
9.0%
학력별 11
 
8.3%
성별 10
 
7.5%
인종별 6
 
4.5%
국가별 4
 
3.0%
지역별 3
 
2.3%

구분
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
20대
10 
40대
10 
50대 이상
10 
10대
10 
30대
10 
Other values (41)
83 

Length

Max length20
Median length16
Mean length4.6466165
Min length2

Unique

Unique30 ?
Unique (%)22.6%

Sample

1st row전체
2nd row10대
3rd row20대
4th row30대
5th row40대

Common Values

ValueCountFrequency (%)
20대 10
 
7.5%
40대 10
 
7.5%
50대 이상 10
 
7.5%
10대 10
 
7.5%
30대 10
 
7.5%
전체 9
 
6.8%
남성 5
 
3.8%
여성 5
 
3.8%
고졸 이하 5
 
3.8%
사무·전문직 5
 
3.8%
Other values (36) 54
40.6%

Length

2023-12-12T20:41:40.094037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
이상 20
 
11.6%
20대 10
 
5.8%
50대 10
 
5.8%
10대 10
 
5.8%
30대 10
 
5.8%
40대 10
 
5.8%
전체 9
 
5.2%
미만 8
 
4.6%
남성 5
 
2.9%
여성 5
 
2.9%
Other values (37) 76
43.9%

사례수(명)
Real number (ℝ)

Distinct95
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean96.030075
Minimum2
Maximum322
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T20:41:40.347432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile23
Q148
median70
Q3128
95-th percentile251
Maximum322
Range320
Interquartile range (IQR)80

Descriptive statistics

Standard deviation71.646961
Coefficient of variation (CV)0.74608877
Kurtosis1.4388805
Mean96.030075
Median Absolute Deviation (MAD)31
Skewness1.371827
Sum12772
Variance5133.287
MonotonicityNot monotonic
2023-12-12T20:41:40.592745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
58 4
 
3.0%
87 3
 
2.3%
48 3
 
2.3%
39 3
 
2.3%
56 3
 
2.3%
82 3
 
2.3%
60 3
 
2.3%
36 3
 
2.3%
163 2
 
1.5%
52 2
 
1.5%
Other values (85) 104
78.2%
ValueCountFrequency (%)
2 1
0.8%
4 1
0.8%
14 1
0.8%
16 1
0.8%
17 1
0.8%
19 1
0.8%
23 2
1.5%
24 1
0.8%
25 1
0.8%
26 1
0.8%
ValueCountFrequency (%)
322 1
0.8%
321 1
0.8%
319 1
0.8%
293 1
0.8%
281 1
0.8%
267 1
0.8%
266 1
0.8%
241 1
0.8%
224 1
0.8%
223 1
0.8%
Distinct106
Distinct (%)79.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.233835
Minimum0
Maximum100
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T20:41:40.851975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile16.18
Q125.4
median32.3
Q339.5
95-th percentile50.66
Maximum100
Range100
Interquartile range (IQR)14.1

Descriptive statistics

Standard deviation12.552652
Coefficient of variation (CV)0.377707
Kurtosis6.0258357
Mean33.233835
Median Absolute Deviation (MAD)7.2
Skewness1.3915009
Sum4420.1
Variance157.56907
MonotonicityNot monotonic
2023-12-12T20:41:41.112336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25.0 4
 
3.0%
33.3 4
 
3.0%
36.9 3
 
2.3%
36.5 3
 
2.3%
31.0 2
 
1.5%
28.1 2
 
1.5%
44.8 2
 
1.5%
25.6 2
 
1.5%
47.8 2
 
1.5%
37.0 2
 
1.5%
Other values (96) 107
80.5%
ValueCountFrequency (%)
0.0 1
0.8%
6.9 1
0.8%
12.5 1
0.8%
14.3 1
0.8%
15.0 1
0.8%
15.1 1
0.8%
15.4 1
0.8%
16.7 1
0.8%
17.2 1
0.8%
17.4 1
0.8%
ValueCountFrequency (%)
100.0 1
0.8%
71.8 1
0.8%
70.6 1
0.8%
57.7 1
0.8%
55.7 1
0.8%
55.6 1
0.8%
50.9 1
0.8%
50.5 1
0.8%
50.0 1
0.8%
47.8 2
1.5%

비슷함(비율)
Real number (ℝ)

Distinct98
Distinct (%)73.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.826316
Minimum0
Maximum53.6
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T20:41:41.400300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile17
Q125
median29.9
Q334.4
95-th percentile41.7
Maximum53.6
Range53.6
Interquartile range (IQR)9.4

Descriptive statistics

Standard deviation8.1964044
Coefficient of variation (CV)0.27480445
Kurtosis1.5170505
Mean29.826316
Median Absolute Deviation (MAD)4.9
Skewness-0.29983874
Sum3966.9
Variance67.181045
MonotonicityNot monotonic
2023-12-12T20:41:41.713558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.3 6
 
4.5%
34.4 4
 
3.0%
25.0 4
 
3.0%
28.3 3
 
2.3%
37.5 3
 
2.3%
35.5 3
 
2.3%
29.8 3
 
2.3%
27.6 3
 
2.3%
31.8 3
 
2.3%
34.1 2
 
1.5%
Other values (88) 99
74.4%
ValueCountFrequency (%)
0.0 1
0.8%
5.1 1
0.8%
11.8 1
0.8%
13.0 1
0.8%
13.9 1
0.8%
15.6 1
0.8%
16.4 1
0.8%
17.4 1
0.8%
17.9 2
1.5%
18.5 1
0.8%
ValueCountFrequency (%)
53.6 1
0.8%
50.0 1
0.8%
48.0 1
0.8%
47.4 1
0.8%
44.7 1
0.8%
44.6 1
0.8%
41.7 2
1.5%
41.5 1
0.8%
41.4 1
0.8%
40.6 1
0.8%
Distinct105
Distinct (%)78.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.934586
Minimum0
Maximum65.5
Zeros1
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T20:41:41.973300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile21
Q130.7
median37.2
Q341.7
95-th percentile52.74
Maximum65.5
Range65.5
Interquartile range (IQR)11

Descriptive statistics

Standard deviation9.9346428
Coefficient of variation (CV)0.2689794
Kurtosis1.3697368
Mean36.934586
Median Absolute Deviation (MAD)5.5
Skewness0.04474297
Sum4912.3
Variance98.697128
MonotonicityNot monotonic
2023-12-12T20:41:42.227781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.3 5
 
3.8%
50.0 3
 
2.3%
40.0 3
 
2.3%
39.1 3
 
2.3%
34.5 2
 
1.5%
42.7 2
 
1.5%
51.7 2
 
1.5%
23.1 2
 
1.5%
36.3 2
 
1.5%
20.0 2
 
1.5%
Other values (95) 107
80.5%
ValueCountFrequency (%)
0.0 1
0.8%
17.6 1
0.8%
18.0 1
0.8%
19.3 1
0.8%
20.0 2
1.5%
20.4 1
0.8%
21.4 1
0.8%
22.6 1
0.8%
23.1 2
1.5%
23.2 1
0.8%
ValueCountFrequency (%)
65.5 1
 
0.8%
62.5 1
 
0.8%
61.2 1
 
0.8%
60.0 1
 
0.8%
58.6 1
 
0.8%
58.5 1
 
0.8%
54.3 1
 
0.8%
51.7 2
1.5%
50.8 1
 
0.8%
50.0 3
2.3%

Interactions

2023-12-12T20:41:36.914624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:33.729965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.453798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:35.281868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:36.138201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:37.074531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:33.878243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.618412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:35.445461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:36.293228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:37.230672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.017716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.781033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:35.634763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:36.457181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:37.388916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.166094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.948284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:35.805540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:36.616972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:37.548641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:34.305733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:35.100362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:35.965449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:41:36.769777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:41:42.920247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번조사연도국가분류구분사례수(명)방송서비스(지상파_유료방송)를더많이이용(비율)비슷함(비율)온라인동영상서비스(OTT)를더많이이용(비율)
연번1.0000.9960.9070.8220.7870.5630.2060.2880.110
조사연도0.9961.0001.0000.6080.2580.2430.0000.2200.020
국가0.9071.0001.0000.3110.0000.4200.3740.4970.191
분류0.8220.6080.3111.0000.9940.5730.1330.2570.000
구분0.7870.2580.0000.9941.0000.6650.8150.6800.657
사례수(명)0.5630.2430.4200.5730.6651.0000.0000.0000.090
방송서비스(지상파_유료방송)를더많이이용(비율)0.2060.0000.3740.1330.8150.0001.0000.6570.789
비슷함(비율)0.2880.2200.4970.2570.6800.0000.6571.0000.660
온라인동영상서비스(OTT)를더많이이용(비율)0.1100.0200.1910.0000.6570.0900.7890.6601.000
2023-12-12T20:41:43.113659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
조사연도국가분류구분
조사연도1.0000.9690.5960.160
국가0.9691.0000.1440.000
분류0.5960.1441.0000.789
구분0.1600.0000.7891.000
2023-12-12T20:41:43.290025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사례수(명)방송서비스(지상파_유료방송)를더많이이용(비율)비슷함(비율)온라인동영상서비스(OTT)를더많이이용(비율)조사연도국가분류구분
연번1.0000.0000.272-0.4580.0270.9160.5110.5610.350
사례수(명)0.0001.000-0.0350.1090.0500.1780.1230.3100.258
방송서비스(지상파_유료방송)를더많이이용(비율)0.272-0.0351.000-0.494-0.7270.0000.2000.0780.395
비슷함(비율)-0.4580.109-0.4941.000-0.1400.2140.2470.0790.256
온라인동영상서비스(OTT)를더많이이용(비율)0.0270.050-0.727-0.1401.0000.0000.0840.0000.246
조사연도0.9160.1780.0000.2140.0001.0000.9690.5960.160
국가0.5110.1230.2000.2470.0840.9691.0000.1440.000
분류0.5610.3100.0780.0790.0000.5960.1441.0000.789
구분0.3500.2580.3950.2560.2460.1600.0000.7891.000

Missing values

2023-12-12T20:41:37.775679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:41:37.996322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번조사연도국가분류구분사례수(명)방송서비스(지상파_유료방송)를더많이이용(비율)비슷함(비율)온라인동영상서비스(OTT)를더많이이용(비율)
012018말레이시아연령별전체19225.036.538.5
122018말레이시아연령별10대3622.241.736.1
232018말레이시아연령별20대4831.333.335.4
342018말레이시아연령별30대4520.040.040.0
452018말레이시아연령별40대4422.731.845.5
562018말레이시아연령별50대 이상1931.636.831.6
672018베트남연령별전체26641.732.026.3
782018베트남연령별10대3633.341.725.0
892018베트남연령별20대6038.328.333.3
9102018베트남연령별30대7843.626.929.5
연번조사연도국가분류구분사례수(명)방송서비스(지상파_유료방송)를더많이이용(비율)비슷함(비율)온라인동영상서비스(OTT)를더많이이용(비율)
1231242019미국직업별서비스업1637.525.037.5
1241252019미국직업별학생5817.224.158.6
1251262019미국직업별주부1450.021.428.6
1261272019미국직업별농림어업/군인40.050.050.0
1271282019미국직업별기타/무직2343.517.439.1
1281292019미국소득별3,000$ 미만5628.623.248.2
1291302019미국소득별3,000$~5,000$ 미만6132.816.450.8
1301312019미국소득별5,000$~7,000$ 미만5550.925.523.6
1311322019미국소득별7,000$ 이상~10,000$ 미만7243.129.227.8
1321332019미국소득별10,000$ 이상8746.024.129.9