Overview

Dataset statistics

Number of variables4
Number of observations124
Missing cells252
Missing cells (%)50.8%
Duplicate rows11
Duplicate rows (%)8.9%
Total size in memory4.2 KiB
Average record size in memory35.1 B

Variable types

Unsupported1
Numeric1
Categorical1
Text1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-2743/F/1/datasetView.do

Alerts

Dataset has 11 (8.9%) duplicate rowsDuplicates
2017년 서울시 홈페이지 이용자 만족도 조사 is highly overall correlated with Unnamed: 2High correlation
Unnamed: 2 is highly overall correlated with 2017년 서울시 홈페이지 이용자 만족도 조사High correlation
Unnamed: 0 has 124 (100.0%) missing valuesMissing
2017년 서울시 홈페이지 이용자 만족도 조사 has 103 (83.1%) missing valuesMissing
Unnamed: 3 has 25 (20.2%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 05:25:23.204914
Analysis finished2023-12-11 05:25:23.924562
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing124
Missing (%)100.0%
Memory size1.2 KiB

2017년 서울시 홈페이지 이용자 만족도 조사
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct21
Distinct (%)100.0%
Missing103
Missing (%)83.1%
Infinite0
Infinite (%)0.0%
Mean11
Minimum1
Maximum21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-11T14:25:24.008081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median11
Q316
95-th percentile20
Maximum21
Range20
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.2048368
Coefficient of variation (CV)0.56407607
Kurtosis-1.2
Mean11
Median Absolute Deviation (MAD)5
Skewness0
Sum231
Variance38.5
MonotonicityStrictly increasing
2023-12-11T14:25:24.206282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
2 1
 
0.8%
21 1
 
0.8%
20 1
 
0.8%
19 1
 
0.8%
18 1
 
0.8%
17 1
 
0.8%
16 1
 
0.8%
15 1
 
0.8%
14 1
 
0.8%
13 1
 
0.8%
Other values (11) 11
 
8.9%
(Missing) 103
83.1%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
21 1
0.8%
20 1
0.8%
19 1
0.8%
18 1
0.8%
17 1
0.8%
16 1
0.8%
15 1
0.8%
14 1
0.8%
13 1
0.8%
12 1
0.8%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)26.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
1
21 
2
18 
3
15 
4
13 
5
12 
Other values (28)
45 

Length

Max length89
Median length1
Mean length10.362903
Min length1

Unique

Unique22 ?
Unique (%)17.7%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row선생님께서는 2017년 한 해 동안 서울시 홈페이지를 이용하신 경험이 있습니까?

Common Values

ValueCountFrequency (%)
1 21
16.9%
2 18
14.5%
3 15
12.1%
4 13
10.5%
5 12
9.7%
99 6
 
4.8%
<NA> 4
 
3.2%
6 4
 
3.2%
8 3
 
2.4%
9 3
 
2.4%
Other values (23) 25
20.2%

Length

2023-12-11T14:25:24.382641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 21
 
6.1%
2 18
 
5.2%
서울시 16
 
4.6%
3 15
 
4.3%
4 13
 
3.8%
5 12
 
3.5%
선생님께서는 12
 
3.5%
대해 9
 
2.6%
정도 7
 
2.0%
어느 7
 
2.0%
Other values (139) 215
62.3%

Unnamed: 3
Text

MISSING 

Distinct67
Distinct (%)67.7%
Missing25
Missing (%)20.2%
Memory size1.1 KiB
2023-12-11T14:25:24.661097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length39
Mean length13.40404
Min length2

Characters and Unicode

Total characters1327
Distinct characters251
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)57.6%

Sample

1st row있다
2nd row없다
3rd row인터넷(포털사이트) ‘서울시’ 키워드 검색을 통해
4th row‘www.seoul.go.kr’ 서울시 홈페이지 주소(URL) 직접입력
5th row관련 기관 홈페이지 방문을 통해 (산하기관, 서울도서관, 서울시 온라인여론조사 등)
ValueCountFrequency (%)
20
 
6.2%
만족한다 14
 
4.4%
매우 14
 
4.4%
관련 9
 
2.8%
다소 7
 
2.2%
불만이다 7
 
2.2%
서울시 7
 
2.2%
불만인 7
 
2.2%
편이다 7
 
2.2%
약간 7
 
2.2%
Other values (176) 222
69.2%
2023-12-11T14:25:25.184337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
227
 
17.1%
47
 
3.5%
, 47
 
3.5%
32
 
2.4%
31
 
2.3%
30
 
2.3%
28
 
2.1%
24
 
1.8%
21
 
1.6%
) 21
 
1.6%
Other values (241) 819
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 968
72.9%
Space Separator 227
 
17.1%
Other Punctuation 58
 
4.4%
Close Punctuation 21
 
1.6%
Open Punctuation 21
 
1.6%
Lowercase Letter 12
 
0.9%
Uppercase Letter 9
 
0.7%
Control 5
 
0.4%
Initial Punctuation 3
 
0.2%
Final Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
4.9%
32
 
3.3%
31
 
3.2%
30
 
3.1%
28
 
2.9%
24
 
2.5%
21
 
2.2%
20
 
2.1%
20
 
2.1%
18
 
1.9%
Other values (213) 697
72.0%
Lowercase Letter
ValueCountFrequency (%)
w 3
25.0%
o 2
16.7%
s 1
 
8.3%
e 1
 
8.3%
r 1
 
8.3%
k 1
 
8.3%
g 1
 
8.3%
l 1
 
8.3%
u 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
I 2
22.2%
F 1
11.1%
W 1
11.1%
P 1
11.1%
C 1
11.1%
L 1
11.1%
R 1
11.1%
U 1
11.1%
Other Punctuation
ValueCountFrequency (%)
, 47
81.0%
/ 5
 
8.6%
. 3
 
5.2%
· 2
 
3.4%
& 1
 
1.7%
Space Separator
ValueCountFrequency (%)
227
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Control
ValueCountFrequency (%)
5
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 968
72.9%
Common 338
 
25.5%
Latin 21
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
4.9%
32
 
3.3%
31
 
3.2%
30
 
3.1%
28
 
2.9%
24
 
2.5%
21
 
2.2%
20
 
2.1%
20
 
2.1%
18
 
1.9%
Other values (213) 697
72.0%
Latin
ValueCountFrequency (%)
w 3
14.3%
I 2
 
9.5%
o 2
 
9.5%
F 1
 
4.8%
W 1
 
4.8%
P 1
 
4.8%
C 1
 
4.8%
s 1
 
4.8%
e 1
 
4.8%
L 1
 
4.8%
Other values (7) 7
33.3%
Common
ValueCountFrequency (%)
227
67.2%
, 47
 
13.9%
) 21
 
6.2%
( 21
 
6.2%
5
 
1.5%
/ 5
 
1.5%
. 3
 
0.9%
3
 
0.9%
3
 
0.9%
· 2
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 968
72.9%
ASCII 351
 
26.5%
Punctuation 6
 
0.5%
None 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
227
64.7%
, 47
 
13.4%
) 21
 
6.0%
( 21
 
6.0%
5
 
1.4%
/ 5
 
1.4%
w 3
 
0.9%
. 3
 
0.9%
I 2
 
0.6%
o 2
 
0.6%
Other values (15) 15
 
4.3%
Hangul
ValueCountFrequency (%)
47
 
4.9%
32
 
3.3%
31
 
3.2%
30
 
3.1%
28
 
2.9%
24
 
2.5%
21
 
2.2%
20
 
2.1%
20
 
2.1%
18
 
1.9%
Other values (213) 697
72.0%
Punctuation
ValueCountFrequency (%)
3
50.0%
3
50.0%
None
ValueCountFrequency (%)
· 2
100.0%

Interactions

2023-12-11T14:25:23.452776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T14:25:25.346778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2017년 서울시 홈페이지 이용자 만족도 조사Unnamed: 2Unnamed: 3
2017년 서울시 홈페이지 이용자 만족도 조사1.0001.000NaN
Unnamed: 21.0001.0001.000
Unnamed: 3NaN1.0001.000
2023-12-11T14:25:25.473309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2017년 서울시 홈페이지 이용자 만족도 조사Unnamed: 2
2017년 서울시 홈페이지 이용자 만족도 조사1.0001.000
Unnamed: 21.0001.000

Missing values

2023-12-11T14:25:23.639099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T14:25:23.756562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T14:25:23.863416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 02017년 서울시 홈페이지 이용자 만족도 조사Unnamed: 2Unnamed: 3
0<NA><NA><NA><NA>
1<NA><NA><NA><NA>
2<NA><NA><NA><NA>
3<NA><NA><NA><NA>
4<NA>1선생님께서는 2017년 한 해 동안 서울시 홈페이지를 이용하신 경험이 있습니까?<NA>
5<NA><NA>1있다
6<NA><NA>2없다
7<NA>2선생님께서는 서울시 홈페이지를 이용하실 때 주로 어떤 경로로 방문(접근)하십니까?<NA>
8<NA><NA>1인터넷(포털사이트) ‘서울시’ 키워드 검색을 통해
9<NA><NA>2‘www.seoul.go.kr’ 서울시 홈페이지 주소(URL) 직접입력
Unnamed: 02017년 서울시 홈페이지 이용자 만족도 조사Unnamed: 2Unnamed: 3
114<NA><NA>7학생
115<NA><NA>8무직/기타
116<NA><NA>9응답하고 싶지 않음
117<NA>21선생님께서 거주하고 계신 지역은 다음 중 어디입니까? 아래 지역구분을 참고하여 답해주십시오.<NA>
118<NA><NA>1도심권
119<NA><NA>2서북권
120<NA><NA>3동북권
121<NA><NA>4서남권
122<NA><NA>5동남권
123<NA><NA>6서울시 외 지역

Duplicate rows

Most frequently occurring

2017년 서울시 홈페이지 이용자 만족도 조사Unnamed: 2Unnamed: 3# duplicates
3<NA>2약간 만족한다7
4<NA>3다소 불만인 편이다7
9<NA>99기타6
0<NA>1매우 만족한다5
6<NA>4매우 불만이다4
10<NA><NA><NA>4
2<NA>1주관식분기용보기3
5<NA>4매우 불만이다3
7<NA>5잘 모름3
1<NA>1매우 만족한다2