Overview

Dataset statistics

Number of variables8
Number of observations167
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.4 KiB
Average record size in memory69.8 B

Variable types

Numeric4
Categorical4

Dataset

Description해외방송시장조사 보고서 중 OTT관련 이용행태 및 만족도 등에 대한 데이터 중 국가별(영국, 브라질, 헝가리 캐나다 등 9개국), 연령, 소득, 성별,고객가치, 직업 등에 따른 OTT서비스 가입 행태에 대한 통계데이터
URLhttps://www.data.go.kr/data/15102281/fileData.do

Alerts

분류 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
국가 is highly overall correlated with 조사연도High correlation
구분 is highly overall correlated with 조사연도 and 1 other fieldsHigh correlation
조사연도 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 조사연도 and 1 other fieldsHigh correlation
사례수(명) is highly overall correlated with 분류High correlation
예(비율) is highly overall correlated with 아니요(비율)High correlation
아니요(비율) is highly overall correlated with 예(비율)High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:43:09.559081
Analysis finished2023-12-12 14:43:11.832492
Duration2.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct167
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84
Minimum1
Maximum167
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T23:43:11.903635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.3
Q142.5
median84
Q3125.5
95-th percentile158.7
Maximum167
Range166
Interquartile range (IQR)83

Descriptive statistics

Standard deviation48.35287
Coefficient of variation (CV)0.5756294
Kurtosis-1.2
Mean84
Median Absolute Deviation (MAD)42
Skewness0
Sum14028
Variance2338
MonotonicityStrictly increasing
2023-12-12T23:43:12.041566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
116 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
Other values (157) 157
94.0%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
167 1
0.6%
166 1
0.6%
165 1
0.6%
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%
160 1
0.6%
159 1
0.6%
158 1
0.6%

조사연도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2020
95 
2019
72 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2020 95
56.9%
2019 72
43.1%

Length

2023-12-12T23:43:12.160679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:43:12.252285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 95
56.9%
2019 72
43.1%

국가
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
영국
19 
호주
19 
러시아
19 
브라질
19 
UAE
19 
Other values (4)
72 

Length

Max length3
Median length3
Mean length2.5568862
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인도
2nd row터키
3rd row캐나다
4th row헝가리
5th row인도

Common Values

ValueCountFrequency (%)
영국 19
11.4%
호주 19
11.4%
러시아 19
11.4%
브라질 19
11.4%
UAE 19
11.4%
인도 18
10.8%
터키 18
10.8%
캐나다 18
10.8%
헝가리 18
10.8%

Length

2023-12-12T23:43:12.356990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:43:12.523090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영국 19
11.4%
호주 19
11.4%
러시아 19
11.4%
브라질 19
11.4%
uae 19
11.4%
인도 18
10.8%
터키 18
10.8%
캐나다 18
10.8%
헝가리 18
10.8%

분류
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
직업별
47 
연령별
45 
성별
18 
학력별
18 
소득별
18 
Other values (2)
21 

Length

Max length5
Median length3
Mean length3.0359281
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국가별
2nd row국가별
3rd row국가별
4th row국가별
5th row성별

Common Values

ValueCountFrequency (%)
직업별 47
28.1%
연령별 45
26.9%
성별 18
 
10.8%
학력별 18
 
10.8%
소득별 18
 
10.8%
고객가치별 12
 
7.2%
국가별 9
 
5.4%

Length

2023-12-12T23:43:12.648361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:43:12.757164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
직업별 47
28.1%
연령별 45
26.9%
성별 18
 
10.8%
학력별 18
 
10.8%
소득별 18
 
10.8%
고객가치별 12
 
7.2%
국가별 9
 
5.4%

구분
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)19.8%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
전체
 
9
남성
 
9
고졸 이하
 
9
50대 이상
 
9
대졸 이상
 
9
Other values (28)
122 

Length

Max length10
Median length8
Mean length4.3293413
Min length2

Unique

Unique8 ?
Unique (%)4.8%

Sample

1st row전체
2nd row전체
3rd row전체
4th row전체
5th row남성

Common Values

ValueCountFrequency (%)
전체 9
 
5.4%
남성 9
 
5.4%
고졸 이하 9
 
5.4%
50대 이상 9
 
5.4%
대졸 이상 9
 
5.4%
30대 9
 
5.4%
20대 9
 
5.4%
10대 9
 
5.4%
여성 9
 
5.4%
40대 9
 
5.4%
Other values (23) 77
46.1%

Length

2023-12-12T23:43:12.886956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
이상 27
 
12.4%
이하 14
 
6.5%
평균 10
 
4.6%
전체 9
 
4.1%
10대 9
 
4.1%
40대 9
 
4.1%
여성 9
 
4.1%
20대 9
 
4.1%
30대 9
 
4.1%
대졸 9
 
4.1%
Other values (22) 103
47.5%

사례수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct116
Distinct (%)69.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108.68263
Minimum10
Maximum373
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T23:43:13.004551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile18.3
Q150.5
median82
Q3154
95-th percentile278.9
Maximum373
Range363
Interquartile range (IQR)103.5

Descriptive statistics

Standard deviation81.232975
Coefficient of variation (CV)0.74743288
Kurtosis1.3387207
Mean108.68263
Median Absolute Deviation (MAD)46
Skewness1.2548902
Sum18150
Variance6598.7963
MonotonicityNot monotonic
2023-12-12T23:43:13.500566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65 5
 
3.0%
52 4
 
2.4%
64 4
 
2.4%
48 3
 
1.8%
84 3
 
1.8%
34 3
 
1.8%
82 3
 
1.8%
87 3
 
1.8%
103 3
 
1.8%
68 3
 
1.8%
Other values (106) 133
79.6%
ValueCountFrequency (%)
10 1
0.6%
11 1
0.6%
12 1
0.6%
14 1
0.6%
15 2
1.2%
16 1
0.6%
17 1
0.6%
18 1
0.6%
19 1
0.6%
20 1
0.6%
ValueCountFrequency (%)
373 1
0.6%
368 1
0.6%
367 1
0.6%
355 1
0.6%
322 1
0.6%
321 1
0.6%
301 1
0.6%
293 1
0.6%
281 1
0.6%
274 1
0.6%

예(비율)
Real number (ℝ)

HIGH CORRELATION 

Distinct129
Distinct (%)77.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.767066
Minimum21.4
Maximum97.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T23:43:13.713781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum21.4
5-th percentile40.96
Q165.5
median77.1
Q383.95
95-th percentile90.8
Maximum97.1
Range75.7
Interquartile range (IQR)18.45

Descriptive statistics

Standard deviation15.559538
Coefficient of variation (CV)0.21382665
Kurtosis0.96507397
Mean72.767066
Median Absolute Deviation (MAD)8.2
Skewness-1.1396108
Sum12152.1
Variance242.09921
MonotonicityNot monotonic
2023-12-12T23:43:13.975941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
83.6 4
 
2.4%
84.0 3
 
1.8%
72.9 3
 
1.8%
84.6 3
 
1.8%
77.6 3
 
1.8%
85.3 3
 
1.8%
82.3 2
 
1.2%
60.0 2
 
1.2%
66.7 2
 
1.2%
79.2 2
 
1.2%
Other values (119) 140
83.8%
ValueCountFrequency (%)
21.4 1
0.6%
23.9 1
0.6%
24.2 1
0.6%
35.9 1
0.6%
36.8 1
0.6%
37.7 1
0.6%
38.5 1
0.6%
40.4 1
0.6%
40.9 1
0.6%
41.1 1
0.6%
ValueCountFrequency (%)
97.1 1
0.6%
95.7 1
0.6%
95.1 1
0.6%
93.5 2
1.2%
93.3 1
0.6%
91.7 1
0.6%
91.2 1
0.6%
90.8 2
1.2%
90.0 1
0.6%
89.9 1
0.6%

아니요(비율)
Real number (ℝ)

HIGH CORRELATION 

Distinct129
Distinct (%)77.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.233533
Minimum2.9
Maximum78.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T23:43:14.240635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.9
5-th percentile9.2
Q116.05
median22.9
Q334.5
95-th percentile59.04
Maximum78.6
Range75.7
Interquartile range (IQR)18.45

Descriptive statistics

Standard deviation15.559209
Coefficient of variation (CV)0.5713254
Kurtosis0.96520768
Mean27.233533
Median Absolute Deviation (MAD)8.2
Skewness1.1396011
Sum4548
Variance242.08899
MonotonicityNot monotonic
2023-12-12T23:43:14.539161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16.4 4
 
2.4%
16.0 3
 
1.8%
27.1 3
 
1.8%
15.4 3
 
1.8%
22.4 3
 
1.8%
14.7 3
 
1.8%
17.7 2
 
1.2%
40.0 2
 
1.2%
33.3 2
 
1.2%
20.8 2
 
1.2%
Other values (119) 140
83.8%
ValueCountFrequency (%)
2.9 1
0.6%
4.3 1
0.6%
4.9 1
0.6%
6.5 2
1.2%
6.7 1
0.6%
8.3 1
0.6%
8.8 1
0.6%
9.2 2
1.2%
10.0 1
0.6%
10.1 1
0.6%
ValueCountFrequency (%)
78.6 1
0.6%
76.1 1
0.6%
75.8 1
0.6%
64.1 1
0.6%
63.2 1
0.6%
62.3 1
0.6%
61.5 1
0.6%
59.6 1
0.6%
59.1 1
0.6%
58.9 1
0.6%

Interactions

2023-12-12T23:43:11.252443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:09.996382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.417275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.787207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:11.348717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.086577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.501456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.872284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:11.436414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.197082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.593934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.984318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:11.530088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.311333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:10.685680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:43:11.115093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:43:14.695487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번조사연도국가분류구분사례수(명)예(비율)아니요(비율)
연번1.0000.9970.6550.8140.8490.6880.4490.449
조사연도0.9971.0001.0000.2930.6650.3630.4630.463
국가0.6551.0001.0000.0000.0000.0000.6580.658
분류0.8140.2930.0001.0001.0000.7840.0000.000
구분0.8490.6650.0001.0001.0000.8500.0000.000
사례수(명)0.6880.3630.0000.7840.8501.0000.0000.000
예(비율)0.4490.4630.6580.0000.0000.0001.0001.000
아니요(비율)0.4490.4630.6580.0000.0000.0001.0001.000
2023-12-12T23:43:14.814594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류국가구분조사연도
분류1.0000.0000.9150.309
국가0.0001.0000.0000.979
구분0.9150.0001.0000.516
조사연도0.3090.9790.5161.000
2023-12-12T23:43:14.948078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사례수(명)예(비율)아니요(비율)조사연도국가분류구분
연번1.000-0.231-0.2890.2890.9300.3690.5910.466
사례수(명)-0.2311.0000.200-0.2000.2710.0000.5470.467
예(비율)-0.2890.2001.000-1.0000.3470.3710.0000.000
아니요(비율)0.289-0.200-1.0001.0000.3470.3710.0000.000
조사연도0.9300.2710.3470.3471.0000.9790.3090.516
국가0.3690.0000.3710.3710.9791.0000.0000.000
분류0.5910.5470.0000.0000.3090.0001.0000.915
구분0.4660.4670.0000.0000.5160.0000.9151.000

Missing values

2023-12-12T23:43:11.655317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:43:11.789769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번조사연도국가분류구분사례수(명)예(비율)아니요(비율)
012019인도국가별전체32282.317.7
122019터키국가별전체32189.410.6
232019캐나다국가별전체28184.016.0
342019헝가리국가별전체22363.736.3
452019인도성별남성15979.920.1
562019인도성별여성16384.715.3
672019터키성별남성12886.713.3
782019터키성별여성19391.28.8
892019캐나다성별남성13584.415.6
9102019캐나다성별여성14683.616.4
연번조사연도국가분류구분사례수(명)예(비율)아니요(비율)
1571582020영국소득별평균 이하16174.525.5
1581592020영국소득별평균 이상11475.424.6
1591602020호주소득별평균 이하11964.735.3
1601612020호주소득별평균 이상8377.122.9
1611622020러시아소득별평균 이하19137.762.3
1621632020러시아소득별평균 이상12252.547.5
1631642020브라질소득별평균 이하17878.121.9
1641652020브라질소득별평균 이상8789.710.3
1651662020UAE소득별평균 이하16581.818.2
1661672020UAE소득별평균 이상12283.616.4