Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 122 |
Missing cells | 410 |
Missing cells (%) | 56.0% |
Duplicate rows | 4 |
Duplicate rows (%) | 3.3% |
Total size in memory | 6.1 KiB |
Average record size in memory | 51.1 B |
Variable types
Unsupported | 2 |
---|---|
Text | 2 |
Numeric | 1 |
Categorical | 1 |
Dataset
Description | Sample data |
---|---|
Author | 컨슈머인사이트 (Consumer Insight) |
URL | http://www.datastore.or.kr/product/file/d79bd378-0b0a-406d-9b59-df472f66b6c7 |
Dataset has 4 (3.3%) duplicate rows | Duplicates |
Unnamed: 3 is highly overall correlated with Unnamed: 4 | High correlation |
Unnamed: 4 is highly overall correlated with Unnamed: 3 | High correlation |
Unnamed: 0 has 122 (100.0%) missing values | Missing |
Column 정의서 has 82 (67.2%) missing values | Missing |
Unnamed: 2 has 83 (68.0%) missing values | Missing |
Unnamed: 3 has 40 (32.8%) missing values | Missing |
Unnamed: 5 has 83 (68.0%) missing values | Missing |
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
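The headline missing-cell figures can be cross-checked against the per-variable alerts. The following is a minimal sketch, assuming the per-variable counts listed in the alerts above and zero missing values for Unnamed: 4 (as reported in that variable's own section); the dictionary and variable names are illustrative only.

```python
# Per-variable missing counts, copied from the alerts in this report.
# Unnamed: 4 has no "Missing" alert; its section reports 0 missing values.
missing_per_variable = {
    "Unnamed: 0": 122,
    "Column 정의서": 82,
    "Unnamed: 2": 83,
    "Unnamed: 3": 40,
    "Unnamed: 4": 0,
    "Unnamed: 5": 83,
}
n_observations = 122

# Total cells = observations x variables.
total_cells = n_observations * len(missing_per_variable)
missing_cells = sum(missing_per_variable.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 410
print(f"{missing_pct:.1f}%")  # 56.0%
```

The per-variable counts sum to the 410 missing cells reported in the dataset statistics, out of 122 × 6 = 732 cells.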
Reproduction
Analysis started | 2024-03-11 03:18:48.043105 |
---|---|
Analysis finished | 2024-03-11 03:18:49.645628 |
Duration | 1.6 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
Unnamed: 0
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 122 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.2 KiB |
Column 정의서
Text
MISSING
 
Distinct | 40 |
---|---|
Distinct (%) | 100.0% |
Missing | 82 |
Missing (%) | 67.2% |
Memory size | 1.1 KiB |
Length
Max length | 44 |
---|---|
Median length | 8 |
Mean length | 6.275 |
Min length | 2 |
Characters and Unicode
Total characters | 251 |
---|---|
Distinct characters | 44 |
Distinct categories | 7 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 40 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 1. 컨슈머인사이트 이동통신 기획조사 _ 통신사 및 통신서비스 브랜드 Index |
---|---|
2nd row | 항목 |
3rd row | idx |
4th row | A01011 |
5th row | A0101 |
Words
Value | Count | Frequency (%) |
1 | 1 | 2.0% |
g0102 | 1 | 2.0% |
g0104 | 1 | 2.0% |
g0105 | 1 | 2.0% |
g0106 | 1 | 2.0% |
g0107 | 1 | 2.0% |
g0108 | 1 | 2.0% |
g0109 | 1 | 2.0% |
g010101 | 1 | 2.0% |
g010601 | 1 | 2.0% |
Other values (39) | 39 | 79.6% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 74 | 29.5% |
1 | 51 | 20.3% |
G | 25 | 10.0% |
A | 12 | 4.8% |
4 | 11 | 4.4% |
(space) | 9 | 3.6% |
2 | 7 | 2.8% |
6 | 6 | 2.4% |
3 | 6 | 2.4% |
통 | 3 | 1.2% |
Other values (34) | 47 | 18.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 163 | 64.9% |
Uppercase Letter | 41 | 16.3% |
Other Letter | 29 | 11.6% |
Space Separator | 9 | 3.6% |
Lowercase Letter | 7 | 2.8% |
Other Punctuation | 1 | 0.4% |
Connector Punctuation | 1 | 0.4% |
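The category names in this table are Unicode general categories. As an illustration of how such counts arise, the sketch below classifies the characters of the first sample row of this column with Python's standard `unicodedata` module. It is only an assumption that the profiler works in a comparable way; the `CATEGORY_NAMES` mapping is introduced here for readability.

```python
import unicodedata
from collections import Counter

# First sample row of "Column 정의서", copied from this report.
text = "1. 컨슈머인사이트 이동통신 기획조사 _ 통신사 및 통신서비스 브랜드 Index"

# Map every character to its two-letter Unicode general category code.
category_counts = Counter(unicodedata.category(ch) for ch in text)

# Long names for the codes that occur in this string.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
}

for code, count in category_counts.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)} | {count}")
```

This one value is 44 characters long, which agrees with the reported maximum length, and its 9 spaces equal the column-wide Space Separator count.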
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
통 | 3 | 10.3% |
신 | 3 | 10.3% |
사 | 3 | 10.3% |
이 | 2 | 6.9% |
항 | 1 | 3.4% |
컨 | 1 | 3.4% |
슈 | 1 | 3.4% |
머 | 1 | 3.4% |
인 | 1 | 3.4% |
트 | 1 | 3.4% |
Other values (12) | 12 | 41.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 74 | 45.4% |
1 | 51 | 31.3% |
4 | 11 | 6.7% |
2 | 7 | 4.3% |
6 | 6 | 3.7% |
3 | 6 | 3.7% |
9 | 2 | 1.2% |
8 | 2 | 1.2% |
7 | 2 | 1.2% |
5 | 2 | 1.2% |
Lowercase Letter
Value | Count | Frequency (%) |
x | 2 | 28.6% |
d | 2 | 28.6% |
i | 1 | 14.3% |
e | 1 | 14.3% |
n | 1 | 14.3% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 25 | 61.0% |
A | 12 | 29.3% |
M | 3 | 7.3% |
I | 1 | 2.4% |
Space Separator
Value | Count | Frequency (%) |
(space) | 9 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 | 100.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 174 | 69.3% |
Latin | 48 | 19.1% |
Hangul | 29 | 11.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
통 | 3 | 10.3% |
신 | 3 | 10.3% |
사 | 3 | 10.3% |
이 | 2 | 6.9% |
항 | 1 | 3.4% |
컨 | 1 | 3.4% |
슈 | 1 | 3.4% |
머 | 1 | 3.4% |
인 | 1 | 3.4% |
트 | 1 | 3.4% |
Other values (12) | 12 | 41.4% |
Common
Value | Count | Frequency (%) |
0 | 74 | 42.5% |
1 | 51 | 29.3% |
4 | 11 | 6.3% |
(space) | 9 | 5.2% |
2 | 7 | 4.0% |
6 | 6 | 3.4% |
3 | 6 | 3.4% |
9 | 2 | 1.1% |
8 | 2 | 1.1% |
7 | 2 | 1.1% |
Other values (3) | 4 | 2.3% |
Latin
Value | Count | Frequency (%) |
G | 25 | 52.1% |
A | 12 | 25.0% |
M | 3 | 6.2% |
x | 2 | 4.2% |
d | 2 | 4.2% |
i | 1 | 2.1% |
e | 1 | 2.1% |
n | 1 | 2.1% |
I | 1 | 2.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 222 | 88.4% |
Hangul | 29 | 11.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 74 | 33.3% |
1 | 51 | 23.0% |
G | 25 | 11.3% |
A | 12 | 5.4% |
4 | 11 | 5.0% |
(space) | 9 | 4.1% |
2 | 7 | 3.2% |
6 | 6 | 2.7% |
3 | 6 | 2.7% |
M | 3 | 1.4% |
Other values (12) | 18 | 8.1% |
Hangul
Value | Count | Frequency (%) |
통 | 3 | 10.3% |
신 | 3 | 10.3% |
사 | 3 | 10.3% |
이 | 2 | 6.9% |
항 | 1 | 3.4% |
컨 | 1 | 3.4% |
슈 | 1 | 3.4% |
머 | 1 | 3.4% |
인 | 1 | 3.4% |
트 | 1 | 3.4% |
Other values (12) | 12 | 41.4% |
Unnamed: 2
Text
MISSING
 
Distinct | 39 |
---|---|
Distinct (%) | 100.0% |
Missing | 83 |
Missing (%) | 68.0% |
Memory size | 1.1 KiB |
Length
Max length | 31 |
---|---|
Median length | 20 |
Mean length | 14.076923 |
Min length | 2 |
Characters and Unicode
Total characters | 549 |
---|---|
Distinct characters | 107 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 39 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 변수명 |
---|---|
2nd row | 사용자 식별 번호 |
3rd row | 통신사 비보조 인지 [최초 인지] |
4th row | 통신사 비보조 인지 [1+2+3순위] |
5th row | 5G 서비스 비보조 인지 [최초 인지] |
Words
Value | Count | Frequency (%) |
인지 | 18 | 12.7% |
비보조 | 12 | 8.5% |
브랜드 | 10 | 7.0% |
5g | 8 | 5.6% |
서비스 | 6 | 4.2% |
데이터 | 6 | 4.2% |
최초 | 6 | 4.2% |
1+2+3순위 | 5 | 3.5% |
통신사 | 4 | 2.8% |
최선호 | 4 | 2.8% |
Other values (47) | 63 | 44.4% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 103 | 18.8% |
인 | 21 | 3.8% |
지 | 20 | 3.6% |
비 | 20 | 3.6% |
이 | 14 | 2.6% |
사 | 13 | 2.4% |
보 | 12 | 2.2% |
] | 12 | 2.2% |
[ | 12 | 2.2% |
조 | 12 | 2.2% |
Other values (97) | 310 | 56.5% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 355 | 64.7% |
Space Separator | 103 | 18.8% |
Decimal Number | 26 | 4.7% |
Uppercase Letter | 24 | 4.4% |
Close Punctuation | 14 | 2.6% |
Open Punctuation | 14 | 2.6% |
Math Symbol | 11 | 2.0% |
Other Punctuation | 2 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
인 | 21 | 5.9% |
지 | 20 | 5.6% |
비 | 20 | 5.6% |
이 | 14 | 3.9% |
사 | 13 | 3.7% |
보 | 12 | 3.4% |
조 | 12 | 3.4% |
통 | 12 | 3.4% |
드 | 10 | 2.8% |
최 | 10 | 2.8% |
Other values (79) | 211 | 59.4% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 8 | 33.3% |
I | 5 | 20.8% |
P | 3 | 12.5% |
T | 3 | 12.5% |
V | 3 | 12.5% |
A | 2 | 8.3% |
Decimal Number
Value | Count | Frequency (%) |
5 | 9 | 34.6% |
1 | 7 | 26.9% |
2 | 5 | 19.2% |
3 | 5 | 19.2% |
Close Punctuation
Value | Count | Frequency (%) |
] | 12 | 85.7% |
) | 2 | 14.3% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 12 | 85.7% |
( | 2 | 14.3% |
Math Symbol
Value | Count | Frequency (%) |
+ | 10 | 90.9% |
~ | 1 | 9.1% |
Space Separator
Value | Count | Frequency (%) |
(space) | 103 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 355 | 64.7% |
Common | 170 | 31.0% |
Latin | 24 | 4.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
인 | 21 | 5.9% |
지 | 20 | 5.6% |
비 | 20 | 5.6% |
이 | 14 | 3.9% |
사 | 13 | 3.7% |
보 | 12 | 3.4% |
조 | 12 | 3.4% |
통 | 12 | 3.4% |
드 | 10 | 2.8% |
최 | 10 | 2.8% |
Other values (79) | 211 | 59.4% |
Common
Value | Count | Frequency (%) |
(space) | 103 | 60.6% |
] | 12 | 7.1% |
[ | 12 | 7.1% |
+ | 10 | 5.9% |
5 | 9 | 5.3% |
1 | 7 | 4.1% |
2 | 5 | 2.9% |
3 | 5 | 2.9% |
/ | 2 | 1.2% |
( | 2 | 1.2% |
Other values (2) | 3 | 1.8% |
Latin
Value | Count | Frequency (%) |
G | 8 | 33.3% |
I | 5 | 20.8% |
P | 3 | 12.5% |
T | 3 | 12.5% |
V | 3 | 12.5% |
A | 2 | 8.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 355 | 64.7% |
ASCII | 194 | 35.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 103 | 53.1% |
] | 12 | 6.2% |
[ | 12 | 6.2% |
+ | 10 | 5.2% |
5 | 9 | 4.6% |
G | 8 | 4.1% |
1 | 7 | 3.6% |
I | 5 | 2.6% |
2 | 5 | 2.6% |
3 | 5 | 2.6% |
Other values (8) | 18 | 9.3% |
Hangul
Value | Count | Frequency (%) |
인 | 21 | 5.9% |
지 | 20 | 5.6% |
비 | 20 | 5.6% |
이 | 14 | 3.9% |
사 | 13 | 3.7% |
보 | 12 | 3.4% |
조 | 12 | 3.4% |
통 | 12 | 3.4% |
드 | 10 | 2.8% |
최 | 10 | 2.8% |
Other values (79) | 211 | 59.4% |
Unnamed: 3
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 13.4% |
Missing | 40 |
Missing (%) | 32.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.7073171 |
Minimum | 1 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 4.75 |
95-th percentile | 10 |
Maximum | 99 |
Range | 98 |
Interquartile range (IQR) | 3.75 |
Descriptive statistics
Standard deviation | 18.218487 |
---|---|
Coefficient of variation (CV) | 2.7162108 |
Kurtosis | 22.052087 |
Mean | 6.7073171 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 4.7596449 |
Sum | 550 |
Variance | 331.91328 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1 | 37 | 30.3% |
2 | 22 | 18.0% |
10 | 15 | 12.3% |
99 | 1 | 0.8% |
3 | 1 | 0.8% |
4 | 1 | 0.8% |
5 | 1 | 0.8% |
6 | 1 | 0.8% |
7 | 1 | 0.8% |
97 | 1 | 0.8% |
(Missing) | 40 | 32.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 37 | 30.3% |
2 | 22 | 18.0% |
3 | 1 | 0.8% |
4 | 1 | 0.8% |
5 | 1 | 0.8% |
6 | 1 | 0.8% |
7 | 1 | 0.8% |
10 | 15 | 12.3% |
97 | 1 | 0.8% |
98 | 1 | 0.8% |
Maximum 10 values
Value | Count | Frequency (%) |
99 | 1 | 0.8% |
98 | 1 | 0.8% |
97 | 1 | 0.8% |
10 | 15 | 12.3% |
7 | 1 | 0.8% |
6 | 1 | 0.8% |
5 | 1 | 0.8% |
4 | 1 | 0.8% |
3 | 1 | 0.8% |
2 | 22 | 18.0% |
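The quantile and descriptive statistics for Unnamed: 3 can be cross-checked from its value counts. The sketch below rebuilds the 82 non-missing observations from the frequency tables and recomputes several figures with Python's standard `statistics` module. It assumes a sample (n − 1) standard deviation and linearly interpolated quartiles (the `inclusive` method); those choices reproduce the reported values but are not confirmed by the report itself.

```python
import statistics

# Value counts for "Unnamed: 3", taken from the frequency tables in this report.
value_counts = {1: 37, 2: 22, 3: 1, 4: 1, 5: 1, 6: 1, 7: 1,
                10: 15, 97: 1, 98: 1, 99: 1}

# Expand the counts back into the individual non-missing observations.
values = sorted(v for v, n in value_counts.items() for _ in range(n))

mean = statistics.mean(values)
std = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
mad = statistics.median(abs(v - median) for v in values)
cv = std / mean                 # coefficient of variation

print(len(values), sum(values))  # 82 550
print(round(mean, 7))            # 6.7073171
print(q1, median, q3)            # 1.0 2.0 4.75
print(q3 - q1)                   # 3.75
print(mad)                       # 1.0
print(round(std, 6), round(cv, 4))
```

The three codes 97, 98 and 99 each occur once, yet they account for most of the variance, which is why the mean (about 6.7) sits well above the median (2).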
Unnamed: 4
Categorical
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 23.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
… | 35 |
---|---|
KT | 20 |
매우 불만족한다 | 14 |
매우 만족한다 | 14 |
SKT | 6 |
Other values (24) | 33 |
Length
Max length | 37 |
---|---|
Median length | 27 |
Mean length | 5.2704918 |
Min length | 1 |
Unique
Unique | 19 ? |
---|---|
Unique (%) | 15.6% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | 내용 |
5th row | - |
Common Values
Value | Count | Frequency (%) |
… | 35 | 28.7% |
KT | 20 | 16.4% |
매우 불만족한다 | 14 | 11.5% |
매우 만족한다 | 14 | 11.5% |
SKT | 6 | 4.9% |
SK텔레콤 | 5 | 4.1% |
<NA> | 3 | 2.5% |
SKB(B TV) | 2 | 1.6% |
SK | 2 | 1.6% |
SKB | 2 | 1.6% |
Other values (19) | 19 | 15.6% |
Words
Value | Count | Frequency (%) |
… | 35 | 16.9% |
매우 | 28 | 13.5% |
kt | 20 | 9.7% |
불만족한다 | 14 | 6.8% |
만족한다 | 14 | 6.8% |
5g | 6 | 2.9% |
skt | 6 | 2.9% |
sk텔레콤 | 5 | 2.4% |
sk | 4 | 1.9% |
것 | 4 | 1.9% |
Other values (54) | 71 | 34.3% |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 83 |
---|---|
Missing (%) | 68.0% |
Memory size | 1.1 KiB |
 | Column 정의서 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 |
---|---|---|---|---|
Column 정의서 | 1.000 | 1.000 | NaN | 1.000 |
Unnamed: 2 | 1.000 | 1.000 | NaN | 1.000 |
Unnamed: 3 | NaN | NaN | 1.000 | 1.000 |
Unnamed: 4 | 1.000 | 1.000 | 1.000 | 1.000 |
 | Unnamed: 3 | Unnamed: 4 |
---|---|---|
Unnamed: 3 | 1.000 | 0.844 |
Unnamed: 4 | 0.844 | 1.000 |
First rows
 | Unnamed: 0 | Column 정의서 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | NaN |
1 | <NA> | 1. 컨슈머인사이트 이동통신 기획조사 _ 통신사 및 통신서비스 브랜드 Index | <NA> | <NA> | <NA> | NaN |
2 | <NA> | <NA> | <NA> | <NA> | <NA> | NaN |
3 | <NA> | 항목 | 변수명 | <NA> | 내용 | DATA 예시 |
4 | <NA> | idx | 사용자 식별 번호 | <NA> | - | 29906 |
5 | <NA> | A01011 | 통신사 비보조 인지 [최초 인지] | 1 | SKT | SKT |
6 | <NA> | <NA> | <NA> | 2 | KT | NaN |
7 | <NA> | <NA> | <NA> | <NA> | … | NaN |
8 | <NA> | A0101 | 통신사 비보조 인지 [1+2+3순위] | 1 | SKT | SKT |
9 | <NA> | <NA> | <NA> | 2 | KT | NaN |
Last rows
 | Unnamed: 0 | Column 정의서 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 |
---|---|---|---|---|---|---|
112 | <NA> | <NA> | <NA> | 98 | 특별한 이유 없음 | NaN |
113 | <NA> | G10 | 최선호 유/무선 통합 브랜드 | 1 | SK통신계열사(SK텔레콤, SK브로드밴드) | 1 |
114 | <NA> | <NA> | <NA> | 2 | KT | NaN |
115 | <NA> | <NA> | <NA> | <NA> | … | NaN |
116 | <NA> | G11 | 최선호 유선 초고속 인터넷 브랜드 | 1 | SK B인터넷 | 1 |
117 | <NA> | <NA> | <NA> | 2 | KT | NaN |
118 | <NA> | <NA> | <NA> | <NA> | … | NaN |
119 | <NA> | G12 | 최선호 IPTV 브랜드 | 1 | SK B TV | 1 |
120 | <NA> | <NA> | <NA> | 2 | KT | NaN |
121 | <NA> | <NA> | <NA> | <NA> | … | NaN |
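The sample rows suggest that the sheet is a codebook in which the item code and variable name appear only on the first row of each item, with the following rows listing that item's value codes. This would explain the high missing rates for Column 정의서 and Unnamed: 2. One possible cleaning step is to carry the item code and name down to the continuation rows. The sketch below does this for rows 4 to 9 of the first-rows sample. It assumes that a blank item cell means "same item as the row above"; the tuple layout and the function name `forward_fill_items` are hypothetical.

```python
# Rows 4-9 of the sheet as shown in the first-rows sample:
# (item code, variable name, value code, value label); None marks a blank cell.
rows = [
    ("idx", "사용자 식별 번호", None, "-"),
    ("A01011", "통신사 비보조 인지 [최초 인지]", 1, "SKT"),
    (None, None, 2, "KT"),
    (None, None, None, "…"),
    ("A0101", "통신사 비보조 인지 [1+2+3순위]", 1, "SKT"),
    (None, None, 2, "KT"),
]


def forward_fill_items(rows):
    """Carry the last seen item code and variable name down to blank rows."""
    filled = []
    item, name = None, None
    for row_item, row_name, code, label in rows:
        if row_item is not None:
            # A non-blank item cell starts a new item.
            item, name = row_item, row_name
        filled.append((item, name, code, label))
    return filled


tidy = forward_fill_items(rows)
for record in tidy:
    print(record)
```

Rows whose label is "…" are elisions in the source sheet and are left untouched here; they would still need to be handled separately.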
Most frequently occurring
 | Column 정의서 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | # duplicates |
---|---|---|---|---|---|
2 | <NA> | <NA> | <NA> | … | 35 |
0 | <NA> | <NA> | 2 | KT | 20 |
1 | <NA> | <NA> | 10 | 매우 만족한다 | 14 |
3 | <NA> | <NA> | <NA> | <NA> | 2 |