Overview

Dataset statistics

Number of variables4
Number of observations63
Missing cells130
Missing cells (%)51.6%
Duplicate rows6
Duplicate rows (%)9.5%
Total size in memory2.2 KiB
Average record size in memory36.1 B

Variable types

Unsupported1
Numeric1
Categorical1
Text1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-2743/F/1/datasetView.do

Alerts

Dataset has 6 (9.5%) duplicate rowsDuplicates
2017년 서울시 브랜드에 대한 온라인 여론조사 is highly overall correlated with Unnamed: 2High correlation
Unnamed: 2 is highly overall correlated with 2017년 서울시 브랜드에 대한 온라인 여론조사High correlation
Unnamed: 0 has 63 (100.0%) missing valuesMissing
2017년 서울시 브랜드에 대한 온라인 여론조사 has 53 (84.1%) missing valuesMissing
Unnamed: 3 has 14 (22.2%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 05:25:43.969550
Analysis finished2023-12-11 05:25:44.434810
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing63
Missing (%)100.0%
Memory size699.0 B

2017년 서울시 브랜드에 대한 온라인 여론조사
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct10
Distinct (%)100.0%
Missing53
Missing (%)84.1%
Infinite0
Infinite (%)0.0%
Mean5.5
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size699.0 B
2023-12-11T14:25:44.480419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.45
Q13.25
median5.5
Q37.75
95-th percentile9.55
Maximum10
Range9
Interquartile range (IQR)4.5

Descriptive statistics

Standard deviation3.0276504
Coefficient of variation (CV)0.55048188
Kurtosis-1.2
Mean5.5
Median Absolute Deviation (MAD)2.5
Skewness0
Sum55
Variance9.1666667
MonotonicityStrictly increasing
2023-12-11T14:25:44.577782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 1
 
1.6%
2 1
 
1.6%
3 1
 
1.6%
4 1
 
1.6%
5 1
 
1.6%
6 1
 
1.6%
7 1
 
1.6%
8 1
 
1.6%
9 1
 
1.6%
10 1
 
1.6%
(Missing) 53
84.1%
ValueCountFrequency (%)
1 1
1.6%
2 1
1.6%
3 1
1.6%
4 1
1.6%
5 1
1.6%
6 1
1.6%
7 1
1.6%
8 1
1.6%
9 1
1.6%
10 1
1.6%
ValueCountFrequency (%)
10 1
1.6%
9 1
1.6%
8 1
1.6%
7 1
1.6%
6 1
1.6%
5 1
1.6%
4 1
1.6%
3 1
1.6%
2 1
1.6%
1 1
1.6%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size636.0 B
1
10 
3
4
2
99
Other values (16)
21 

Length

Max length153
Median length1
Mean length13.396825
Min length1

Unique

Unique14 ?
Unique (%)22.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row선생님께서는 I·SEOUL·U(아이서울유)가 서울시 브랜드라는 것을 알고 계십니까?

Common Values

ValueCountFrequency (%)
1 10
15.9%
3 9
14.3%
4 9
14.3%
2 9
14.3%
99 5
7.9%
<NA> 4
 
6.3%
5 3
 
4.8%
서울 브랜드를 활용한 다양한 활동 사례를 공유하고 시민들과의 소통을 강화하기 위해 다음 중 어떤 사업을 우선적으로 추진해야 한다고 생각하십니까? 1
 
1.6%
그럼, 주로 어떤 경로를 통해 서울브랜드를 듣거나 알게 되었습니까? 1
 
1.6%
서울 브랜드 I·SEOUL·U(아이서울유)에 대해 어느 정도 호감이 가시나요? 1
 
1.6%
Other values (11) 11
17.5%

Length

2023-12-11T14:25:44.703700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 10
 
4.8%
4 9
 
4.3%
2 9
 
4.3%
3 9
 
4.3%
서울 6
 
2.9%
99 5
 
2.4%
생각하십니까 4
 
1.9%
서울브랜드 4
 
1.9%
na 4
 
1.9%
어떤 3
 
1.4%
Other values (123) 147
70.0%

Unnamed: 3
Text

MISSING 

Distinct41
Distinct (%)83.7%
Missing14
Missing (%)22.2%
Memory size636.0 B
2023-12-11T14:25:44.960466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length11.959184
Min length2

Characters and Unicode

Total characters586
Distinct characters169
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)73.5%

Sample

1st row잘 알고 있다
2nd row대충은 알고 있다
3rd row들어는 보았다
4th row잘 몰랐다
5th row오늘 처음 들어보았다
ValueCountFrequency (%)
도움이 8
 
4.9%
것이다 8
 
4.9%
7
 
4.3%
서울브랜드 6
 
3.7%
기타 5
 
3.1%
4
 
2.5%
호감이 4
 
2.5%
않을 4
 
2.5%
되지 4
 
2.5%
약간 3
 
1.9%
Other values (90) 109
67.3%
2023-12-11T14:25:45.338638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
113
 
19.3%
29
 
4.9%
18
 
3.1%
12
 
2.0%
12
 
2.0%
10
 
1.7%
9
 
1.5%
9
 
1.5%
8
 
1.4%
8
 
1.4%
Other values (159) 358
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 446
76.1%
Space Separator 113
 
19.3%
Other Punctuation 15
 
2.6%
Uppercase Letter 8
 
1.4%
Open Punctuation 2
 
0.3%
Close Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
6.5%
18
 
4.0%
12
 
2.7%
12
 
2.7%
10
 
2.2%
9
 
2.0%
9
 
2.0%
8
 
1.8%
8
 
1.8%
8
 
1.8%
Other values (149) 323
72.4%
Uppercase Letter
ValueCountFrequency (%)
S 4
50.0%
N 2
25.0%
V 1
 
12.5%
T 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 8
53.3%
/ 6
40.0%
1
 
6.7%
Space Separator
ValueCountFrequency (%)
113
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 446
76.1%
Common 132
 
22.5%
Latin 8
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
6.5%
18
 
4.0%
12
 
2.7%
12
 
2.7%
10
 
2.2%
9
 
2.0%
9
 
2.0%
8
 
1.8%
8
 
1.8%
8
 
1.8%
Other values (149) 323
72.4%
Common
ValueCountFrequency (%)
113
85.6%
, 8
 
6.1%
/ 6
 
4.5%
( 2
 
1.5%
) 2
 
1.5%
1
 
0.8%
Latin
ValueCountFrequency (%)
S 4
50.0%
N 2
25.0%
V 1
 
12.5%
T 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 446
76.1%
ASCII 139
 
23.7%
Katakana 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
113
81.3%
, 8
 
5.8%
/ 6
 
4.3%
S 4
 
2.9%
N 2
 
1.4%
( 2
 
1.4%
) 2
 
1.4%
V 1
 
0.7%
T 1
 
0.7%
Hangul
ValueCountFrequency (%)
29
 
6.5%
18
 
4.0%
12
 
2.7%
12
 
2.7%
10
 
2.2%
9
 
2.0%
9
 
2.0%
8
 
1.8%
8
 
1.8%
8
 
1.8%
Other values (149) 323
72.4%
Katakana
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-11T14:25:44.116119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T14:25:45.480299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2017년 서울시 브랜드에 대한 온라인 여론조사Unnamed: 2Unnamed: 3
2017년 서울시 브랜드에 대한 온라인 여론조사1.0001.000NaN
Unnamed: 21.0001.0001.000
Unnamed: 3NaN1.0001.000
2023-12-11T14:25:45.605208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2017년 서울시 브랜드에 대한 온라인 여론조사Unnamed: 2
2017년 서울시 브랜드에 대한 온라인 여론조사1.0001.000
Unnamed: 21.0001.000

Missing values

2023-12-11T14:25:44.211111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T14:25:44.293477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T14:25:44.381130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 02017년 서울시 브랜드에 대한 온라인 여론조사Unnamed: 2Unnamed: 3
0<NA><NA><NA><NA>
1<NA><NA><NA><NA>
2<NA><NA><NA><NA>
3<NA><NA><NA><NA>
4<NA>1선생님께서는 I·SEOUL·U(아이서울유)가 서울시 브랜드라는 것을 알고 계십니까?<NA>
5<NA><NA>1잘 알고 있다
6<NA><NA>2대충은 알고 있다
7<NA><NA>3들어는 보았다
8<NA><NA>4잘 몰랐다
9<NA><NA>5오늘 처음 들어보았다
Unnamed: 02017년 서울시 브랜드에 대한 온라인 여론조사Unnamed: 2Unnamed: 3
53<NA><NA>3판매직/서비스직
54<NA><NA>4기능직/노무직
55<NA><NA>5자영업
56<NA><NA>6전업주부
57<NA><NA>7학생
58<NA><NA>8무직/구직 중
59<NA><NA>9응답하고 싶지 않음
60<NA><NA>99기타
61<NA>10서울 거주기간은 대략 몇 년 정도 되셨습니까?<NA>
62<NA><NA>1주관식분기용보기

Duplicate rows

Most frequently occurring

2017년 서울시 브랜드에 대한 온라인 여론조사Unnamed: 2Unnamed: 3# duplicates
4<NA>99기타5
5<NA><NA><NA>4
0<NA>1매우 도움이 될 것이다2
1<NA>2약간 도움이 될 것이다2
2<NA>3별로 도움이 되지 않을 것이다2
3<NA>4전혀 도움이 되지 않을 것이다2