Overview

Dataset statistics

Number of variables3
Number of observations297
Missing cells10
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory25.4 B

Variable types

Text2
Numeric1

Dataset

Description한국언론진흥재단 미디어이슈 (19년 5호)에서 시민을 대상으로 언론자유와 기사삭제 청구에 대한 인식을 조사한 데이터입니다.
Author한국언론진흥재단
URLhttps://www.data.go.kr/data/15086110/fileData.do

Alerts

사례수 has 10 (3.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:01:37.415188
Analysis finished2023-12-12 14:01:37.836150
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct293
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2023-12-12T23:01:38.087441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length6.7104377
Min length2

Characters and Unicode

Total characters1993
Distinct characters118
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique292 ?
Unique (%)98.3%

Sample

1st row성별1
2nd row성별2
3rd row연령1
4th row연령2
5th row연령3
ValueCountFrequency (%)
이후 5
 
1.7%
이용자확산행위2 1
 
0.3%
신체정보_1 1
 
0.3%
주거지_4 1
 
0.3%
주거지_3 1
 
0.3%
주거지_2 1
 
0.3%
주거지_1 1
 
0.3%
나이_4 1
 
0.3%
나이_3 1
 
0.3%
나이_2 1
 
0.3%
Other values (283) 283
95.3%
2023-12-12T23:01:38.766881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 109
 
5.5%
80
 
4.0%
1 78
 
3.9%
74
 
3.7%
70
 
3.5%
69
 
3.5%
2 66
 
3.3%
3 65
 
3.3%
4 64
 
3.2%
62
 
3.1%
Other values (108) 1256
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1580
79.3%
Decimal Number 304
 
15.3%
Connector Punctuation 109
 
5.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
5.1%
74
 
4.7%
70
 
4.4%
69
 
4.4%
62
 
3.9%
50
 
3.2%
34
 
2.2%
34
 
2.2%
33
 
2.1%
33
 
2.1%
Other values (97) 1041
65.9%
Decimal Number
ValueCountFrequency (%)
1 78
25.7%
2 66
21.7%
3 65
21.4%
4 64
21.1%
5 15
 
4.9%
6 5
 
1.6%
7 4
 
1.3%
8 3
 
1.0%
9 2
 
0.7%
0 2
 
0.7%
Connector Punctuation
ValueCountFrequency (%)
_ 109
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1580
79.3%
Common 413
 
20.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
5.1%
74
 
4.7%
70
 
4.4%
69
 
4.4%
62
 
3.9%
50
 
3.2%
34
 
2.2%
34
 
2.2%
33
 
2.1%
33
 
2.1%
Other values (97) 1041
65.9%
Common
ValueCountFrequency (%)
_ 109
26.4%
1 78
18.9%
2 66
16.0%
3 65
15.7%
4 64
15.5%
5 15
 
3.6%
6 5
 
1.2%
7 4
 
1.0%
8 3
 
0.7%
9 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1580
79.3%
ASCII 413
 
20.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 109
26.4%
1 78
18.9%
2 66
16.0%
3 65
15.7%
4 64
15.5%
5 15
 
3.6%
6 5
 
1.2%
7 4
 
1.0%
8 3
 
0.7%
9 2
 
0.5%
Hangul
ValueCountFrequency (%)
80
 
5.1%
74
 
4.7%
70
 
4.4%
69
 
4.4%
62
 
3.9%
50
 
3.2%
34
 
2.2%
34
 
2.2%
33
 
2.1%
33
 
2.1%
Other values (97) 1041
65.9%
Distinct89
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2023-12-12T23:01:39.083484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length36
Mean length6.6127946
Min length2

Characters and Unicode

Total characters1964
Distinct characters148
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)25.6%

Sample

1st row남성
2nd row여성
3rd row만20~29세
4th row만30~39세
5th row만40~49세
ValueCountFrequency (%)
안함 83
13.3%
동의 83
13.3%
동의함 82
13.1%
별로 42
 
6.7%
전혀 41
 
6.5%
매우 41
 
6.5%
약간 41
 
6.5%
8
 
1.3%
전혀동의안함 8
 
1.3%
매우동의함 8
 
1.3%
Other values (129) 189
30.2%
2023-12-12T23:01:39.519847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
329
16.8%
200
 
10.2%
197
 
10.0%
197
 
10.0%
103
 
5.2%
53
 
2.7%
50
 
2.5%
50
 
2.5%
50
 
2.5%
50
 
2.5%
Other values (138) 685
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1516
77.2%
Space Separator 329
 
16.8%
Decimal Number 98
 
5.0%
Math Symbol 10
 
0.5%
Close Punctuation 4
 
0.2%
Open Punctuation 4
 
0.2%
Other Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
200
 
13.2%
197
 
13.0%
197
 
13.0%
103
 
6.8%
53
 
3.5%
50
 
3.3%
50
 
3.3%
50
 
3.3%
50
 
3.3%
50
 
3.3%
Other values (123) 516
34.0%
Decimal Number
ValueCountFrequency (%)
0 25
25.5%
9 19
19.4%
1 15
15.3%
5 11
11.2%
3 11
11.2%
2 5
 
5.1%
4 4
 
4.1%
8 3
 
3.1%
6 3
 
3.1%
7 2
 
2.0%
Space Separator
ValueCountFrequency (%)
329
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1516
77.2%
Common 448
 
22.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
200
 
13.2%
197
 
13.0%
197
 
13.0%
103
 
6.8%
53
 
3.5%
50
 
3.3%
50
 
3.3%
50
 
3.3%
50
 
3.3%
50
 
3.3%
Other values (123) 516
34.0%
Common
ValueCountFrequency (%)
329
73.4%
0 25
 
5.6%
9 19
 
4.2%
1 15
 
3.3%
5 11
 
2.5%
3 11
 
2.5%
~ 10
 
2.2%
2 5
 
1.1%
4 4
 
0.9%
) 4
 
0.9%
Other values (5) 15
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1516
77.2%
ASCII 448
 
22.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
329
73.4%
0 25
 
5.6%
9 19
 
4.2%
1 15
 
3.3%
5 11
 
2.5%
3 11
 
2.5%
~ 10
 
2.2%
2 5
 
1.1%
4 4
 
0.9%
) 4
 
0.9%
Other values (5) 15
 
3.3%
Hangul
ValueCountFrequency (%)
200
 
13.2%
197
 
13.0%
197
 
13.0%
103
 
6.8%
53
 
3.5%
50
 
3.3%
50
 
3.3%
50
 
3.3%
50
 
3.3%
50
 
3.3%
Other values (123) 516
34.0%

사례수
Real number (ℝ)

MISSING 

Distinct220
Distinct (%)76.7%
Missing10
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean225.00697
Minimum2
Maximum844
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2023-12-12T23:01:39.658508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile23.3
Q178
median192
Q3354.5
95-th percentile524.7
Maximum844
Range842
Interquartile range (IQR)276.5

Descriptive statistics

Standard deviation173.24168
Coefficient of variation (CV)0.76993916
Kurtosis0.45328447
Mean225.00697
Median Absolute Deviation (MAD)128
Skewness0.8950196
Sum64577
Variance30012.678
MonotonicityNot monotonic
2023-12-12T23:01:39.788597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
357 4
 
1.3%
308 4
 
1.3%
64 4
 
1.3%
63 3
 
1.0%
315 3
 
1.0%
100 3
 
1.0%
55 3
 
1.0%
48 3
 
1.0%
32 3
 
1.0%
94 3
 
1.0%
Other values (210) 254
85.5%
(Missing) 10
 
3.4%
ValueCountFrequency (%)
2 1
0.3%
4 1
0.3%
6 1
0.3%
8 1
0.3%
9 2
0.7%
11 1
0.3%
12 1
0.3%
13 1
0.3%
14 1
0.3%
18 1
0.3%
ValueCountFrequency (%)
844 1
0.3%
771 1
0.3%
769 1
0.3%
748 1
0.3%
718 1
0.3%
685 1
0.3%
681 1
0.3%
645 1
0.3%
623 1
0.3%
614 1
0.3%

Interactions

2023-12-12T23:01:37.614951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:01:39.871285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중분류사례수
중분류1.0000.694
사례수0.6941.000

Missing values

2023-12-12T23:01:37.718029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:01:37.792630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대분류중분류사례수
0성별1남성510
1성별2여성490
2연령1만20~29세192
3연령2만30~39세207
4연령3만40~49세248
5연령4만50~59세243
6연령5만60세 이상110
7거주지역1서울197
8거주지역2인천58
9거주지역3경기253
대분류중분류사례수
287사회계층1하층94
288사회계층2중하층396
289사회계층3중간층440
290사회계층4중상층68
291사회계층5상층2
292정치적성향1보수29
293정치적성향2보수에 가까움144
294정치적성향3중도537
295정치적성향4진보에 가까움249
296정치적성향5진보41