Overview

Dataset statistics

Number of variables3
Number of observations2488
Missing cells0
Missing cells (%)0.0%
Duplicate rows91
Duplicate rows (%)3.7%
Total size in memory63.3 KiB
Average record size in memory26.1 B

Variable types

Numeric2
Text1

Dataset

Description홈페이지에 메뉴, 회원, 콘텐츠 관련 기본정보DB에 대한 내용입니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15071877/fileData.do

Alerts

Dataset has 91 (3.7%) duplicate rowsDuplicates
조회수 has 55 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-12 07:38:00.227993
Analysis finished2023-12-12 07:38:01.168448
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

태그ID
Real number (ℝ)

Distinct12
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3784124 × 1014
Minimum1.33 × 1014
Maximum1.44 × 1014
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size22.0 KiB
2023-12-12T16:38:01.543008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.33 × 1014
5-th percentile1.34 × 1014
Q11.35 × 1014
median1.37 × 1014
Q31.41 × 1014
95-th percentile1.43 × 1014
Maximum1.44 × 1014
Range1.1 × 1013
Interquartile range (IQR)6 × 1012

Descriptive statistics

Standard deviation3.422077 × 1012
Coefficient of variation (CV)0.024826221
Kurtosis-1.4374664
Mean1.3784124 × 1014
Median Absolute Deviation (MAD)3 × 1012
Skewness0.27339792
Sum3.42949 × 1017
Variance1.1710611 × 1025
MonotonicityIncreasing
2023-12-12T16:38:01.656820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
134000000000000 454
18.2%
135000000000000 418
16.8%
141000000000000 296
11.9%
143000000000000 225
9.0%
136000000000000 224
9.0%
142000000000000 224
9.0%
140000000000000 142
 
5.7%
137000000000000 137
 
5.5%
138000000000000 125
 
5.0%
139000000000000 89
 
3.6%
Other values (2) 154
 
6.2%
ValueCountFrequency (%)
133000000000000 86
 
3.5%
134000000000000 454
18.2%
135000000000000 418
16.8%
136000000000000 224
9.0%
137000000000000 137
 
5.5%
138000000000000 125
 
5.0%
139000000000000 89
 
3.6%
140000000000000 142
 
5.7%
141000000000000 296
11.9%
142000000000000 224
9.0%
ValueCountFrequency (%)
144000000000000 68
 
2.7%
143000000000000 225
9.0%
142000000000000 224
9.0%
141000000000000 296
11.9%
140000000000000 142
 
5.7%
139000000000000 89
 
3.6%
138000000000000 125
 
5.0%
137000000000000 137
 
5.5%
136000000000000 224
9.0%
135000000000000 418
16.8%
Distinct2096
Distinct (%)84.2%
Missing0
Missing (%)0.0%
Memory size19.6 KiB
2023-12-12T16:38:02.076895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length3.6567524
Min length1

Characters and Unicode

Total characters9098
Distinct characters627
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1947 ?
Unique (%)78.3%

Sample

1st row전국학부모지원센터
2nd row111
3rd row학부모
4th row학부모지원
5th row학부모정책
ValueCountFrequency (%)
콘텐츠 72
 
2.7%
드림레터 23
 
0.9%
제주 21
 
0.8%
학부모교육 20
 
0.8%
학부모리더 11
 
0.4%
경남 10
 
0.4%
경북 9
 
0.3%
충남 9
 
0.3%
소식지 9
 
0.3%
수능 9
 
0.3%
Other values (2083) 2447
92.7%
2023-12-12T16:38:02.741266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
373
 
4.1%
285
 
3.1%
172
 
1.9%
162
 
1.8%
157
 
1.7%
152
 
1.7%
121
 
1.3%
115
 
1.3%
114
 
1.3%
107
 
1.2%
Other values (617) 7340
80.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8553
94.0%
Decimal Number 216
 
2.4%
Space Separator 152
 
1.7%
Uppercase Letter 84
 
0.9%
Lowercase Letter 74
 
0.8%
Other Punctuation 14
 
0.2%
Modifier Symbol 2
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
373
 
4.4%
285
 
3.3%
172
 
2.0%
162
 
1.9%
157
 
1.8%
121
 
1.4%
115
 
1.3%
114
 
1.3%
107
 
1.3%
101
 
1.2%
Other values (559) 6846
80.0%
Uppercase Letter
ValueCountFrequency (%)
E 15
17.9%
N 9
10.7%
S 9
10.7%
T 7
8.3%
I 6
 
7.1%
K 5
 
6.0%
Q 5
 
6.0%
M 4
 
4.8%
A 4
 
4.8%
B 3
 
3.6%
Other values (11) 17
20.2%
Lowercase Letter
ValueCountFrequency (%)
e 15
20.3%
s 8
10.8%
n 8
10.8%
o 8
10.8%
t 6
 
8.1%
i 3
 
4.1%
k 3
 
4.1%
r 3
 
4.1%
l 3
 
4.1%
v 3
 
4.1%
Other values (9) 14
18.9%
Decimal Number
ValueCountFrequency (%)
2 57
26.4%
1 53
24.5%
0 40
18.5%
3 19
 
8.8%
5 14
 
6.5%
4 14
 
6.5%
6 7
 
3.2%
9 6
 
2.8%
7 3
 
1.4%
8 3
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 10
71.4%
· 3
 
21.4%
1
 
7.1%
Space Separator
ValueCountFrequency (%)
152
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8553
94.0%
Common 387
 
4.3%
Latin 158
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
373
 
4.4%
285
 
3.3%
172
 
2.0%
162
 
1.9%
157
 
1.8%
121
 
1.4%
115
 
1.3%
114
 
1.3%
107
 
1.3%
101
 
1.2%
Other values (559) 6846
80.0%
Latin
ValueCountFrequency (%)
e 15
 
9.5%
E 15
 
9.5%
N 9
 
5.7%
S 9
 
5.7%
s 8
 
5.1%
n 8
 
5.1%
o 8
 
5.1%
T 7
 
4.4%
t 6
 
3.8%
I 6
 
3.8%
Other values (30) 67
42.4%
Common
ValueCountFrequency (%)
152
39.3%
2 57
 
14.7%
1 53
 
13.7%
0 40
 
10.3%
3 19
 
4.9%
5 14
 
3.6%
4 14
 
3.6%
. 10
 
2.6%
6 7
 
1.8%
9 6
 
1.6%
Other values (8) 15
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8549
94.0%
ASCII 541
 
5.9%
None 4
 
< 0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
373
 
4.4%
285
 
3.3%
172
 
2.0%
162
 
1.9%
157
 
1.8%
121
 
1.4%
115
 
1.3%
114
 
1.3%
107
 
1.3%
101
 
1.2%
Other values (556) 6842
80.0%
ASCII
ValueCountFrequency (%)
152
28.1%
2 57
 
10.5%
1 53
 
9.8%
0 40
 
7.4%
3 19
 
3.5%
e 15
 
2.8%
E 15
 
2.8%
5 14
 
2.6%
4 14
 
2.6%
. 10
 
1.8%
Other values (46) 152
28.1%
None
ValueCountFrequency (%)
· 3
75.0%
1
 
25.0%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

조회수
Real number (ℝ)

ZEROS 

Distinct61
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.892283
Minimum0
Maximum77
Zeros55
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size22.0 KiB
2023-12-12T16:38:02.912457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile29
Maximum77
Range77
Interquartile range (IQR)1

Descriptive statistics

Standard deviation11.818884
Coefficient of variation (CV)2.4158218
Kurtosis15.589418
Mean4.892283
Median Absolute Deviation (MAD)0
Skewness3.9595662
Sum12172
Variance139.68602
MonotonicityNot monotonic
2023-12-12T16:38:03.064090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1542
62.0%
2 296
 
11.9%
3 125
 
5.0%
4 83
 
3.3%
0 55
 
2.2%
5 53
 
2.1%
6 40
 
1.6%
7 30
 
1.2%
53 20
 
0.8%
8 16
 
0.6%
Other values (51) 228
 
9.2%
ValueCountFrequency (%)
0 55
 
2.2%
1 1542
62.0%
2 296
 
11.9%
3 125
 
5.0%
4 83
 
3.3%
5 53
 
2.1%
6 40
 
1.6%
7 30
 
1.2%
8 16
 
0.6%
9 14
 
0.6%
ValueCountFrequency (%)
77 1
 
< 0.1%
74 5
0.2%
73 3
0.1%
72 5
0.2%
71 5
0.2%
70 1
 
< 0.1%
68 1
 
< 0.1%
65 1
 
< 0.1%
62 2
 
0.1%
60 4
0.2%

Interactions

2023-12-12T16:38:00.785011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:38:00.554740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:38:00.897223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:38:00.655197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:38:03.200166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태그ID조회수
태그ID1.0000.441
조회수0.4411.000
2023-12-12T16:38:03.292183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태그ID조회수
태그ID1.000-0.428
조회수-0.4281.000

Missing values

2023-12-12T16:38:01.037601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:38:01.136278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

태그ID태그명조회수
0133000000000000전국학부모지원센터3
11330000000000001111
2133000000000000학부모52
3133000000000000학부모지원0
4133000000000000학부모정책1
5133000000000000학부모지원정책0
6133000000000000개편0
7133000000000000이벤트1
8133000000000000학교12
9133000000000000가나다0
태그ID태그명조회수
2478144000000000000사람다움1
2479144000000000000올바른인성1
2480144000000000000vol121
2481144000000000000vol120
2482144000000000000웹진1
2483144000000000000밥상머리교육0
2484144000000000000vol120
2485144000000000000인성교육 프로그램1
2486144000000000000공모전1
24871440000000000002015 인성교육 프로그램 인증 공모전1

Duplicate rows

Most frequently occurring

태그ID태그명조회수# duplicates
49134000000000000콘텐츠5319
33134000000000000제주2910
48134000000000000콘텐츠529
58134000000000000학부모리더48
67135000000000000드림레터38
70135000000000000소식지18
11134000000000000광주47
47134000000000000콘텐츠517
27134000000000000울산16
52134000000000000콘텐츠596