Overview

Dataset statistics

Number of variables: 3
Number of observations: 75
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 KiB
Average record size in memory: 26.8 B

Variable types

Numeric: 1
Text: 1
Categorical: 1

Dataset

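Description: Promotional content of education and training institutions (application number, major name)
Author: 국가평생교육진흥원 (National Institute for Lifelong Education)
URL: https://www.data.go.kr/data/15071174/fileData.do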

Alerts

일련번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 09:37:39.513600
Analysis finished: 2023-12-12 09:37:39.990699
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

일련번호 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 75
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38
Minimum: 1
Maximum: 75
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 807.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.7
Q1: 19.5
Median: 38
Q3: 56.5
95-th percentile: 71.3
Maximum: 75
Range: 74
Interquartile range (IQR): 37
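
The quantile figures above can be checked by hand. The following is a minimal sketch, assuming that 일련번호 takes the integer values 1 to 75 (consistent with the reported minimum, maximum, distinct count and sum, but not stated outright) and that percentiles use linear interpolation between neighbouring sorted values. The helper name `quantile_linear` is hypothetical.

```python
import math


def quantile_linear(values, q):
    """Return the q-quantile (0 <= q <= 1) using linear interpolation."""
    data = sorted(values)
    pos = q * (len(data) - 1)   # fractional index into the sorted data
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return data[lower] + frac * (data[upper] - data[lower])


# Assumption: the column holds the integers 1..75.
serial_numbers = list(range(1, 76))

p5 = quantile_linear(serial_numbers, 0.05)
q1 = quantile_linear(serial_numbers, 0.25)
q3 = quantile_linear(serial_numbers, 0.75)
p95 = quantile_linear(serial_numbers, 0.95)
iqr = q3 - q1
value_range = max(serial_numbers) - min(serial_numbers)

print(p5, q1, q3, p95, iqr, value_range)
```

Under these assumptions the results agree with the report up to floating-point rounding: Q1 is 19.5, Q3 is 56.5, and the IQR is their difference, 37.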

Descriptive statistics

Standard deviation: 21.794495
Coefficient of variation (CV): 0.57353933
Kurtosis: -1.2
Mean: 38
Median Absolute Deviation (MAD): 19
Skewness: 0
Sum: 2850
Variance: 475
Monotonicity: Strictly increasing
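
Most of the descriptive statistics above can be reproduced with the Python standard library. This is a sketch under the same assumption as before, namely that the column holds the integers 1 to 75. The variance is computed as the sample variance (divisor n - 1), which is the definition that matches the reported value of 475.

```python
import math
import statistics

# Assumption: the column holds the integers 1..75.
serial_numbers = list(range(1, 76))

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)   # sample variance, divisor n - 1
std_dev = math.sqrt(variance)
cv = std_dev / mean                              # coefficient of variation
median = statistics.median(serial_numbers)
# Median absolute deviation: median of the absolute distances to the median.
mad = statistics.median([abs(x - median) for x in serial_numbers])
# Strictly increasing: every value is smaller than its successor.
strictly_increasing = all(
    a < b for a, b in zip(serial_numbers, serial_numbers[1:])
)

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```

Kurtosis and skewness are left out because their values depend on which bias correction the estimator applies.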
Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
1                  1      1.3%
49                 1      1.3%
56                 1      1.3%
55                 1      1.3%
54                 1      1.3%
53                 1      1.3%
52                 1      1.3%
51                 1      1.3%
50                 1      1.3%
48                 1      1.3%
Other values (65)  65     86.7%

Minimum 10 values

Value  Count  Frequency (%)
1      1      1.3%
2      1      1.3%
3      1      1.3%
4      1      1.3%
5      1      1.3%
6      1      1.3%
7      1      1.3%
8      1      1.3%
9      1      1.3%
10     1      1.3%

Maximum 10 values

Value  Count  Frequency (%)
75     1      1.3%
74     1      1.3%
73     1      1.3%
72     1      1.3%
71     1      1.3%
70     1      1.3%
69     1      1.3%
68     1      1.3%
67     1      1.3%
66     1      1.3%

신청번호 (application number)
Text

Distinct: 71
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 732.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 375
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
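
As an illustration of these character properties, the sketch below looks up the Unicode general category of every character in the first five sample values from this report, using the standard `unicodedata` module. The category codes `Nd` and `Po` correspond to the "Decimal Number" and "Other Punctuation" labels used in the tables that follow. This is not necessarily how the profiling tool computes its counts internally.

```python
import unicodedata
from collections import Counter

# First five sample values of the text column, as listed in this report.
sample_values = ["4,442", "4,443", "4,445", "4,446", "4,447"]

# Count characters by Unicode general category (e.g. "Nd", "Po").
category_counts = Counter(
    unicodedata.category(ch) for value in sample_values for ch in value
)

# Count the individual characters as well.
char_counts = Counter(ch for value in sample_values for ch in value)

print(category_counts)
print(char_counts.most_common(3))
```

Each five-character value contains exactly one comma, which is why punctuation accounts for one character in five (20.0%) over the full column.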

Unique

Unique: 68
Unique (%): 90.7%
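
The report distinguishes "distinct" (the number of different values) from "unique" (values that occur exactly once). The figures are consistent with that reading: 68 values occur once and three values repeat (5,141 three times, 4,461 and 4,476 twice each), giving 68 + 3 = 71 distinct values over 68 + 3 + 2 + 2 = 75 rows. A minimal sketch on an illustrative subset follows; the helper name `distinct_and_unique` is hypothetical and the list is not the full column.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Illustrative subset only: three repeated values plus two singletons.
illustrative = [
    "5,141", "5,141", "5,141",
    "4,461", "4,461",
    "4,476", "4,476",
    "5,235", "5,192",
]

distinct, unique = distinct_and_unique(illustrative)
print(distinct, unique)
```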

Sample

1st row: 4,442
2nd row: 4,443
3rd row: 4,445
4th row: 4,446
5th row: 4,447
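
The sample values show why this column was typed as Text rather than Numeric: each value carries a comma. Assuming the comma is a thousands separator (an interpretation, not something the report states), the values could be converted to integers before profiling. The function name `parse_application_number` is hypothetical.

```python
def parse_application_number(text):
    """Convert a string such as "4,442" to the integer 4442.

    Assumption: the comma is a thousands separator.
    """
    return int(text.replace(",", ""))


# Sample rows as listed in this report.
sample_rows = ["4,442", "4,443", "4,445", "4,446", "4,447"]
parsed = [parse_application_number(s) for s in sample_rows]
print(parsed)
```
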
Value              Count  Frequency (%)
5,141              3      4.0%
4,461              2      2.7%
4,476              2      2.7%
5,235              1      1.3%
5,192              1      1.3%
5,179              1      1.3%
4,490              1      1.3%
5,140              1      1.3%
5,146              1      1.3%
5,168              1      1.3%
Other values (61)  61     81.3%

Most occurring characters

Value  Count  Frequency (%)
4      107    28.5%
,      75     20.0%
5      52     13.9%
2      29     7.7%
1      22     5.9%
6      21     5.6%
7      19     5.1%
8      15     4.0%
3      13     3.5%
9      13     3.5%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     300    80.0%
Other Punctuation  75     20.0%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
4      107    35.7%
5      52     17.3%
2      29     9.7%
1      22     7.3%
6      21     7.0%
7      19     6.3%
8      15     5.0%
3      13     4.3%
9      13     4.3%
0      9      3.0%

Other Punctuation

Value  Count  Frequency (%)
,      75     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  375    100.0%

Most frequent character per script

Common

Value  Count  Frequency (%)
4      107    28.5%
,      75     20.0%
5      52     13.9%
2      29     7.7%
1      22     5.9%
6      21     5.6%
7      19     5.1%
8      15     4.0%
3      13     3.5%
9      13     3.5%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  375    100.0%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
4      107    28.5%
,      75     20.0%
5      52     13.9%
2      29     7.7%
1      22     5.9%
6      21     5.6%
7      19     5.1%
8      15     4.0%
3      13     3.5%
9      13     3.5%

전공명 (major name)
Categorical

Distinct: 33
Distinct (%): 44.0%
Missing: 0
Missing (%): 0.0%
Memory size: 732.0 B
[Bar chart of category counts: 간호학 전공 22, 사회복지학 전공 5, 경영학 전공 4, 실용음악학 전공 3, 체육학 전공 3, Other values (28) 38]

Length

Max length: 11
Median length: 6
Mean length: 6.7333333
Min length: 5

Unique

Unique: 20
Unique (%): 26.7%

Sample

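1st row: 간호학 전공 (Nursing major)
2nd row: 신학 전공 (Theology major)
3rd row: 심리학 전공 (Psychology major)
4th row: 간호학 전공 (Nursing major)
5th row: 기악 전공 (Instrumental Music major)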

Common Values

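Value                                      Count  Frequency (%)
간호학 전공 (Nursing major)                  22     29.3%
사회복지학 전공 (Social Welfare major)        5      6.7%
경영학 전공 (Business Administration major)  4      5.3%
실용음악학 전공 (Applied Music major)         3      4.0%
체육학 전공 (Physical Education major)       3      4.0%
신학 전공 (Theology major)                   3      4.0%
미용학 전공 (Cosmetology major)              3      4.0%
연극학 전공 (Theater Studies major)          2      2.7%
호텔조리 전공 (Hotel Culinary Arts major)     2      2.7%
심리학 전공 (Psychology major)               2      2.7%
Other values (23)                          26     34.7%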
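
The frequency column is each count divided by the 75 observations. A short sketch of that arithmetic, using the counts from the table above (keys are kept in the original Korean, as they appear in the data):

```python
# Counts copied from the Common Values table in this report.
counts = {
    "간호학 전공": 22,
    "사회복지학 전공": 5,
    "경영학 전공": 4,
    "실용음악학 전공": 3,
    "체육학 전공": 3,
    "신학 전공": 3,
    "미용학 전공": 3,
    "연극학 전공": 2,
    "호텔조리 전공": 2,
    "심리학 전공": 2,
    "Other values (23)": 26,
}

total = sum(counts.values())  # should equal the number of observations

# Express each count as a percentage of the total, one decimal place.
percentages = {name: f"{100 * n / total:.1f}%" for name, n in counts.items()}

for name, pct in percentages.items():
    print(name, counts[name], pct)
```

The counts sum to 75, which confirms that the ten listed categories plus the 23 grouped under "Other values" cover every row.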

Length

Histogram of lengths of the category
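Value                          Count  Frequency (%)
전공 (major)                    75     50.0%
간호학 (nursing)                22     14.7%
사회복지학 (social welfare)      5      3.3%
경영학 (business administration)  4    2.7%
실용음악학 (applied music)       3      2.0%
체육학 (physical education)     3      2.0%
신학 (theology)                 3      2.0%
미용학 (cosmetology)            3      2.0%
아동학 (child studies)          2      1.3%
패션디자인학 (fashion design)     2      1.3%
Other values (24)              28     18.7%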
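
The table above counts individual words rather than whole category values. Its counts sum to 150, twice the number of rows, which suggests each value splits into two tokens and explains why 전공 accounts for exactly 50.0%. The sketch below assumes simple whitespace tokenization, which may differ from what the profiling tool does, and uses only the five sample rows shown earlier in this report.

```python
from collections import Counter

# Five sample rows of the categorical column, as listed in this report.
sample_values = ["간호학 전공", "신학 전공", "심리학 전공", "간호학 전공", "기악 전공"]

# Assumption: words are separated by whitespace.
words = [word for value in sample_values for word in value.split()]
word_counts = Counter(words)

# Share of all tokens taken by the suffix word "전공".
suffix_share = word_counts["전공"] / len(words)

print(word_counts.most_common(2), suffix_share)
```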

Interactions


Correlations

            일련번호  신청번호  전공명
일련번호    1.000     0.979     0.435
신청번호    0.979     1.000     0.891
전공명      0.435     0.891     1.000

            일련번호  전공명
일련번호    1.000     0.109
전공명      0.109     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

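First rows

Index  일련번호  신청번호  전공명
0      1         4,442     간호학 전공 (Nursing major)
1      2         4,443     신학 전공 (Theology major)
2      3         4,445     심리학 전공 (Psychology major)
3      4         4,446     간호학 전공 (Nursing major)
4      5         4,447     기악 전공 (Instrumental Music major)
5      6         4,448     실용음악학 전공 (Applied Music major)
6      7         4,449     호텔조리 전공 (Hotel Culinary Arts major)
7      8         4,450     사회복지학 전공 (Social Welfare major)
8      9         4,451     관현악 전공 (Orchestral Music major)
9      10        4,452     경영학 전공 (Business Administration major)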
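
Last rows

Index  일련번호  신청번호  전공명
65     66        5,248     간호학 전공 (Nursing major)
66     67        5,258     연극학 전공 (Theater Studies major)
67     68        5,269     관광경영학 전공 (Tourism Management major)
68     69        5,276     간호학 전공 (Nursing major)
69     70        5,277     간호학 전공 (Nursing major)
70     71        5,335     체육학 전공 (Physical Education major)
71     72        5,349     호텔조리 전공 (Hotel Culinary Arts major)
72     73        5,393     간호학 전공 (Nursing major)
73     74        5,394     간호학 전공 (Nursing major)
74     75        5,897     사회복지학 전공 (Social Welfare major)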