Overview

Dataset statistics

Number of variables5
Number of observations189
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.2 KiB
Average record size in memory44.7 B

Variable types

Numeric4
Text1

Dataset

Description창원시 창원-i잉글리시와 관련된 회원 설문 객관식 지문 문항 리스트를 위한 정보입니다.(식별값, 설문번호, 문항순서, 문항내용, 답변자수)
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15089071

Alerts

식별값 is highly overall correlated with 설문번호High correlation
설문번호 is highly overall correlated with 식별값High correlation
식별값 has unique valuesUnique
답변자수 has 4 (2.1%) zerosZeros

Reproduction

Analysis started2023-12-10 23:44:03.419177
Analysis finished2023-12-10 23:44:05.284741
Duration1.87 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

식별값
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct189
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5176.5608
Minimum5070
Maximum5364
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T08:44:05.358992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5070
5-th percentile5079.4
Q15119
median5171
Q35231
95-th percentile5274.6
Maximum5364
Range294
Interquartile range (IQR)112

Descriptive statistics

Standard deviation68.586199
Coefficient of variation (CV)0.013249376
Kurtosis-0.38218048
Mean5176.5608
Median Absolute Deviation (MAD)55
Skewness0.42518597
Sum978370
Variance4704.0668
MonotonicityStrictly increasing
2023-12-11T08:44:05.522045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5070 1
 
0.5%
5217 1
 
0.5%
5205 1
 
0.5%
5206 1
 
0.5%
5207 1
 
0.5%
5208 1
 
0.5%
5210 1
 
0.5%
5211 1
 
0.5%
5212 1
 
0.5%
5213 1
 
0.5%
Other values (179) 179
94.7%
ValueCountFrequency (%)
5070 1
0.5%
5071 1
0.5%
5072 1
0.5%
5073 1
0.5%
5074 1
0.5%
5075 1
0.5%
5076 1
0.5%
5077 1
0.5%
5078 1
0.5%
5079 1
0.5%
ValueCountFrequency (%)
5364 1
0.5%
5363 1
0.5%
5362 1
0.5%
5361 1
0.5%
5360 1
0.5%
5279 1
0.5%
5278 1
0.5%
5277 1
0.5%
5276 1
0.5%
5275 1
0.5%

설문번호
Real number (ℝ)

HIGH CORRELATION 

Distinct37
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.608466
Minimum1
Maximum41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T08:44:05.698781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q111
median22
Q332
95-th percentile40
Maximum41
Range40
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.048445
Coefficient of variation (CV)0.55757986
Kurtosis-1.1915748
Mean21.608466
Median Absolute Deviation (MAD)10
Skewness-0.10852979
Sum4084
Variance145.16503
MonotonicityNot monotonic
2023-12-11T08:44:05.855473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
41 7
 
3.7%
40 7
 
3.7%
3 6
 
3.2%
32 5
 
2.6%
26 5
 
2.6%
27 5
 
2.6%
28 5
 
2.6%
29 5
 
2.6%
30 5
 
2.6%
31 5
 
2.6%
Other values (27) 134
70.9%
ValueCountFrequency (%)
1 4
2.1%
2 5
2.6%
3 6
3.2%
4 5
2.6%
5 5
2.6%
6 5
2.6%
7 5
2.6%
8 5
2.6%
10 5
2.6%
11 5
2.6%
ValueCountFrequency (%)
41 7
3.7%
40 7
3.7%
38 5
2.6%
37 5
2.6%
36 5
2.6%
35 5
2.6%
34 5
2.6%
33 5
2.6%
32 5
2.6%
31 5
2.6%

문항순서
Real number (ℝ)

Distinct7
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0793651
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T08:44:06.000682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.5014346
Coefficient of variation (CV)0.48757929
Kurtosis-0.94207572
Mean3.0793651
Median Absolute Deviation (MAD)1
Skewness0.15883145
Sum582
Variance2.254306
MonotonicityNot monotonic
2023-12-11T08:44:06.130777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 37
19.6%
2 37
19.6%
3 37
19.6%
4 37
19.6%
5 36
19.0%
6 3
 
1.6%
7 2
 
1.1%
ValueCountFrequency (%)
1 37
19.6%
2 37
19.6%
3 37
19.6%
4 37
19.6%
5 36
19.0%
6 3
 
1.6%
7 2
 
1.1%
ValueCountFrequency (%)
7 2
 
1.1%
6 3
 
1.6%
5 36
19.0%
4 37
19.6%
3 37
19.6%
2 37
19.6%
1 37
19.6%
Distinct75
Distinct (%)39.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-11T08:44:06.512529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length29
Mean length9.8994709
Min length4

Characters and Unicode

Total characters1871
Distinct characters181
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)31.2%

Sample

1st row① 초등학교 1~3학년
2nd row② 초등학교 4~6학년
3rd row③ 중학교 1~2학년
4th row④ 중학교 3학년
5th row① 의창구
ValueCountFrequency (%)
매우 46
 
7.4%
그렇지 46
 
7.4%
않다 46
 
7.4%
그렇다 46
 
7.4%
37
 
5.9%
37
 
5.9%
37
 
5.9%
37
 
5.9%
36
 
5.8%
보통이다 23
 
3.7%
Other values (139) 233
37.3%
2023-12-11T08:44:07.053292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
491
26.2%
129
 
6.9%
92
 
4.9%
92
 
4.9%
49
 
2.6%
48
 
2.6%
47
 
2.5%
46
 
2.5%
41
 
2.2%
37
 
2.0%
Other values (171) 799
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1061
56.7%
Space Separator 491
26.2%
Other Number 189
 
10.1%
Lowercase Letter 47
 
2.5%
Decimal Number 27
 
1.4%
Uppercase Letter 16
 
0.9%
Math Symbol 12
 
0.6%
Close Punctuation 10
 
0.5%
Open Punctuation 10
 
0.5%
Other Punctuation 7
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
129
 
12.2%
92
 
8.7%
92
 
8.7%
49
 
4.6%
48
 
4.5%
47
 
4.4%
46
 
4.3%
41
 
3.9%
26
 
2.5%
26
 
2.5%
Other values (126) 465
43.8%
Lowercase Letter
ValueCountFrequency (%)
t 7
14.9%
i 6
12.8%
o 5
10.6%
e 4
8.5%
v 4
8.5%
a 4
8.5%
n 3
 
6.4%
r 2
 
4.3%
u 2
 
4.3%
p 2
 
4.3%
Other values (6) 8
17.0%
Uppercase Letter
ValueCountFrequency (%)
A 4
25.0%
M 3
18.8%
B 2
12.5%
S 1
 
6.2%
D 1
 
6.2%
K 1
 
6.2%
I 1
 
6.2%
R 1
 
6.2%
P 1
 
6.2%
J 1
 
6.2%
Other Number
ValueCountFrequency (%)
37
19.6%
37
19.6%
37
19.6%
37
19.6%
36
19.0%
3
 
1.6%
2
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 10
37.0%
3 6
22.2%
2 4
 
14.8%
6 4
 
14.8%
4 2
 
7.4%
5 1
 
3.7%
Space Separator
ValueCountFrequency (%)
491
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1061
56.7%
Common 747
39.9%
Latin 63
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
129
 
12.2%
92
 
8.7%
92
 
8.7%
49
 
4.6%
48
 
4.5%
47
 
4.4%
46
 
4.3%
41
 
3.9%
26
 
2.5%
26
 
2.5%
Other values (126) 465
43.8%
Latin
ValueCountFrequency (%)
t 7
 
11.1%
i 6
 
9.5%
o 5
 
7.9%
A 4
 
6.3%
e 4
 
6.3%
v 4
 
6.3%
a 4
 
6.3%
M 3
 
4.8%
n 3
 
4.8%
r 2
 
3.2%
Other values (16) 21
33.3%
Common
ValueCountFrequency (%)
491
65.7%
37
 
5.0%
37
 
5.0%
37
 
5.0%
37
 
5.0%
36
 
4.8%
~ 12
 
1.6%
1 10
 
1.3%
) 10
 
1.3%
( 10
 
1.3%
Other values (9) 30
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1061
56.7%
ASCII 621
33.2%
Enclosed Alphanum 189
 
10.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
491
79.1%
~ 12
 
1.9%
1 10
 
1.6%
) 10
 
1.6%
( 10
 
1.6%
t 7
 
1.1%
. 7
 
1.1%
3 6
 
1.0%
i 6
 
1.0%
o 5
 
0.8%
Other values (28) 57
 
9.2%
Hangul
ValueCountFrequency (%)
129
 
12.2%
92
 
8.7%
92
 
8.7%
49
 
4.6%
48
 
4.5%
47
 
4.4%
46
 
4.3%
41
 
3.9%
26
 
2.5%
26
 
2.5%
Other values (126) 465
43.8%
Enclosed Alphanum
ValueCountFrequency (%)
37
19.6%
37
19.6%
37
19.6%
37
19.6%
36
19.0%
3
 
1.6%
2
 
1.1%

답변자수
Real number (ℝ)

ZEROS 

Distinct62
Distinct (%)32.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.714286
Minimum0
Maximum111
Zeros4
Zeros (%)2.1%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-11T08:44:07.216115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q110
median25
Q336
95-th percentile59.6
Maximum111
Range111
Interquartile range (IQR)26

Descriptive statistics

Standard deviation19.508123
Coefficient of variation (CV)0.75864923
Kurtosis2.0631156
Mean25.714286
Median Absolute Deviation (MAD)13
Skewness1.085701
Sum4860
Variance380.56687
MonotonicityNot monotonic
2023-12-11T08:44:07.367639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 8
 
4.2%
31 7
 
3.7%
1 7
 
3.7%
14 6
 
3.2%
19 6
 
3.2%
36 6
 
3.2%
12 6
 
3.2%
38 6
 
3.2%
32 5
 
2.6%
9 5
 
2.6%
Other values (52) 127
67.2%
ValueCountFrequency (%)
0 4
2.1%
1 7
3.7%
2 8
4.2%
3 4
2.1%
4 3
 
1.6%
5 5
2.6%
6 2
 
1.1%
7 4
2.1%
8 5
2.6%
9 5
2.6%
ValueCountFrequency (%)
111 1
0.5%
99 1
0.5%
86 1
0.5%
71 1
0.5%
69 2
1.1%
66 1
0.5%
63 1
0.5%
60 2
1.1%
59 1
0.5%
57 1
0.5%

Interactions

2023-12-11T08:44:04.498681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:03.615016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:03.896911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.177746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.564462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:03.678648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:03.965971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.262981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.638999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:03.756387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.041337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.359160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.997115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:03.829952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.109563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:04.430538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:44:07.465628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식별값설문번호문항순서문항내용답변자수
식별값1.0000.9010.0000.0000.149
설문번호0.9011.0000.0000.7510.000
문항순서0.0000.0001.0001.0000.360
문항내용0.0000.7511.0001.0000.886
답변자수0.1490.0000.3600.8861.000
2023-12-11T08:44:07.561772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식별값설문번호문항순서답변자수
식별값1.0000.9390.0810.028
설문번호0.9391.0000.0580.049
문항순서0.0810.0581.000-0.297
답변자수0.0280.049-0.2971.000

Missing values

2023-12-11T08:44:05.135136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:44:05.243358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

식별값설문번호문항순서문항내용답변자수
0507011① 초등학교 1~3학년21
1507112② 초등학교 4~6학년56
2507213③ 중학교 1~2학년26
3507314④ 중학교 3학년19
4507421① 의창구36
5507522② 성산구39
6507623③ 마산합포구12
7507724④ 마산회원구26
8507825⑤ 진해구10
9507931① 1코스 (Phonics)5
식별값설문번호문항순서문항내용답변자수
1795275413③ 실시간 수학 학습30
1805276414④ 화상영어71
1815277415⑤ 영어 골든벨28
1825278416⑥ 영어토론대회24
1835279417⑦ 해외캠프56
1845360181① 매우 그렇다63
1855361182② 그렇다44
1865362183③ 보통이다12
1875363184④ 그렇지 않다2
1885364185⑤ 매우 그렇지 않다1