Overview

Dataset statistics

Number of variables8
Number of observations990
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory63.0 KiB
Average record size in memory65.1 B

Variable types

Categorical4
Text1
DateTime2
Numeric1

Dataset

Description전라북도 군산시 대표홈페이지의 시민정보화교육 강의정보(장소명,강의명,클래스,강의시작일,강의종료일,강의시작시간, 강의종료시간,수강인원)
Author전라북도
URLhttps://www.bigdatahub.go.kr/index.jeonbuk?startPage=3&menuCd=DOM_000000103007001000&pListTypeStr=&pId=15063780

Alerts

클래스 is highly overall correlated with 강의시작시간 and 1 other fieldsHigh correlation
강의종료시간 is highly overall correlated with 장소명 and 2 other fieldsHigh correlation
강의시작시간 is highly overall correlated with 장소명 and 2 other fieldsHigh correlation
장소명 is highly overall correlated with 강의시작시간 and 1 other fieldsHigh correlation
강의시작시간 is highly imbalanced (58.8%)Imbalance

Reproduction

Analysis started2024-03-14 00:35:47.702468
Analysis finished2024-03-14 00:35:48.412959
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

장소명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
시립도서관
431 
시청(8층)전산교육장
328 
시민정보화교육장
219 
온라인교육
 
12

Length

Max length11
Median length8
Mean length7.6515152
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시청(8층)전산교육장
2nd row시민정보화교육장
3rd row시민정보화교육장
4th row시청(8층)전산교육장
5th row시청(8층)전산교육장

Common Values

ValueCountFrequency (%)
시립도서관 431
43.5%
시청(8층)전산교육장 328
33.1%
시민정보화교육장 219
22.1%
온라인교육 12
 
1.2%

Length

2024-03-14T09:35:48.468514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:35:48.584564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시립도서관 431
43.5%
시청(8층)전산교육장 328
33.1%
시민정보화교육장 219
22.1%
온라인교육 12
 
1.2%
Distinct241
Distinct (%)24.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-03-14T09:35:48.769022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length8.1656566
Min length2

Characters and Unicode

Total characters8084
Distinct characters143
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)17.4%

Sample

1st row컴퓨터기초3주 (오후반)
2nd row컴퓨터기초및운영체제3주 (오전반)
3rd row컴퓨터기초및운영체제3주 (오후반)
4th row컴퓨터기초3주 (오전반)
5th row인터넷활용(3주)
ValueCountFrequency (%)
문서작성 77
 
5.9%
엑셀 74
 
5.7%
파워포인트 68
 
5.2%
인터넷활용 65
 
5.0%
전산교육장 55
 
4.2%
컴퓨터기초 52
 
4.0%
스마트폰활용 49
 
3.7%
포토샵 45
 
3.4%
수강생 42
 
3.2%
디지털생활 32
 
2.4%
Other values (200) 748
57.2%
2024-03-14T09:35:49.071593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
317
 
3.9%
) 286
 
3.5%
( 286
 
3.5%
260
 
3.2%
245
 
3.0%
238
 
2.9%
213
 
2.6%
201
 
2.5%
199
 
2.5%
198
 
2.4%
Other values (133) 5641
69.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6658
82.4%
Decimal Number 332
 
4.1%
Space Separator 317
 
3.9%
Close Punctuation 286
 
3.5%
Open Punctuation 286
 
3.5%
Uppercase Letter 60
 
0.7%
Dash Punctuation 45
 
0.6%
Lowercase Letter 41
 
0.5%
Other Punctuation 39
 
0.5%
Connector Punctuation 11
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
260
 
3.9%
245
 
3.7%
238
 
3.6%
213
 
3.2%
201
 
3.0%
199
 
3.0%
198
 
3.0%
192
 
2.9%
191
 
2.9%
189
 
2.8%
Other values (100) 4532
68.1%
Decimal Number
ValueCountFrequency (%)
8 95
28.6%
0 84
25.3%
1 63
19.0%
3 41
12.3%
2 36
 
10.8%
7 6
 
1.8%
4 5
 
1.5%
5 1
 
0.3%
6 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
t 7
17.1%
k 7
17.1%
n 6
14.6%
b 6
14.6%
s 6
14.6%
p 6
14.6%
c 2
 
4.9%
u 1
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
C 32
53.3%
U 16
26.7%
S 4
 
6.7%
T 3
 
5.0%
K 3
 
5.0%
N 2
 
3.3%
Other Punctuation
ValueCountFrequency (%)
: 18
46.2%
. 9
23.1%
& 6
 
15.4%
; 6
 
15.4%
Space Separator
ValueCountFrequency (%)
317
100.0%
Close Punctuation
ValueCountFrequency (%)
) 286
100.0%
Open Punctuation
ValueCountFrequency (%)
( 286
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6658
82.4%
Common 1325
 
16.4%
Latin 101
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
260
 
3.9%
245
 
3.7%
238
 
3.6%
213
 
3.2%
201
 
3.0%
199
 
3.0%
198
 
3.0%
192
 
2.9%
191
 
2.9%
189
 
2.8%
Other values (100) 4532
68.1%
Common
ValueCountFrequency (%)
317
23.9%
) 286
21.6%
( 286
21.6%
8 95
 
7.2%
0 84
 
6.3%
1 63
 
4.8%
- 45
 
3.4%
3 41
 
3.1%
2 36
 
2.7%
: 18
 
1.4%
Other values (9) 54
 
4.1%
Latin
ValueCountFrequency (%)
C 32
31.7%
U 16
15.8%
t 7
 
6.9%
k 7
 
6.9%
n 6
 
5.9%
b 6
 
5.9%
s 6
 
5.9%
p 6
 
5.9%
S 4
 
4.0%
T 3
 
3.0%
Other values (4) 8
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6658
82.4%
ASCII 1426
 
17.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
317
22.2%
) 286
20.1%
( 286
20.1%
8 95
 
6.7%
0 84
 
5.9%
1 63
 
4.4%
- 45
 
3.2%
3 41
 
2.9%
2 36
 
2.5%
C 32
 
2.2%
Other values (23) 141
9.9%
Hangul
ValueCountFrequency (%)
260
 
3.9%
245
 
3.7%
238
 
3.6%
213
 
3.2%
201
 
3.0%
199
 
3.0%
198
 
3.0%
192
 
2.9%
191
 
2.9%
189
 
2.8%
Other values (100) 4532
68.1%

클래스
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
오전반
609 
오후반
380 
기타
 
1

Length

Max length3
Median length3
Mean length2.9989899
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row오후반
2nd row오전반
3rd row오후반
4th row오전반
5th row오후반

Common Values

ValueCountFrequency (%)
오전반 609
61.5%
오후반 380
38.4%
기타 1
 
0.1%

Length

2024-03-14T09:35:49.174706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:35:49.248008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
오전반 609
61.5%
오후반 380
38.4%
기타 1
 
0.1%
Distinct562
Distinct (%)56.8%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minimum2006-02-06 00:00:00
Maximum2023-12-04 00:00:00
2024-03-14T09:35:49.344432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:35:49.467859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct564
Distinct (%)57.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minimum2006-02-24 00:00:00
Maximum2023-12-15 00:00:00
2024-03-14T09:35:49.600999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:35:49.720156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

강의시작시간
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
10:00
586 
13:30
354 
11:00
 
19
14:00
 
12
19:00
 
12
Other values (4)
 
7

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row13:30
2nd row10:00
3rd row13:30
4th row10:00
5th row14:00

Common Values

ValueCountFrequency (%)
10:00 586
59.2%
13:30 354
35.8%
11:00 19
 
1.9%
14:00 12
 
1.2%
19:00 12
 
1.2%
15:30 3
 
0.3%
13:00 2
 
0.2%
11:11 1
 
0.1%
18:30 1
 
0.1%

Length

2024-03-14T09:35:49.830525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:35:49.921625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10:00 586
59.2%
13:30 354
35.8%
11:00 19
 
1.9%
14:00 12
 
1.2%
19:00 12
 
1.2%
15:30 3
 
0.3%
13:00 2
 
0.2%
11:11 1
 
0.1%
18:30 1
 
0.1%

강의종료시간
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
12:00
378 
11:30
205 
14:50
191 
15:30
164 
21:00
 
12
Other values (9)
40 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st row14:50
2nd row11:30
3rd row14:50
4th row11:30
5th row15:00

Common Values

ValueCountFrequency (%)
12:00 378
38.2%
11:30 205
20.7%
14:50 191
19.3%
15:30 164
16.6%
21:00 12
 
1.2%
15:00 11
 
1.1%
10:50 10
 
1.0%
13:00 9
 
0.9%
18:00 5
 
0.5%
11:11 1
 
0.1%
Other values (4) 4
 
0.4%

Length

2024-03-14T09:35:50.013851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
12:00 378
38.2%
11:30 205
20.7%
14:50 191
19.3%
15:30 164
16.6%
21:00 12
 
1.2%
15:00 11
 
1.1%
10:50 10
 
1.0%
13:00 9
 
0.9%
18:00 5
 
0.5%
11:11 1
 
0.1%
Other values (4) 4
 
0.4%

수강인원
Real number (ℝ)

Distinct14
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.464646
Minimum9
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.8 KiB
2024-03-14T09:35:50.094347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile17
Q130
median34
Q334
95-th percentile35
Maximum100
Range91
Interquartile range (IQR)4

Descriptive statistics

Standard deviation5.5847422
Coefficient of variation (CV)0.17749261
Kurtosis26.63502
Mean31.464646
Median Absolute Deviation (MAD)4
Skewness0.53249564
Sum31150
Variance31.189345
MonotonicityNot monotonic
2024-03-14T09:35:50.170792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
34 417
42.1%
30 414
41.8%
35 61
 
6.2%
15 38
 
3.8%
17 19
 
1.9%
40 16
 
1.6%
50 7
 
0.7%
31 4
 
0.4%
9 4
 
0.4%
45 3
 
0.3%
Other values (4) 7
 
0.7%
ValueCountFrequency (%)
9 4
 
0.4%
10 2
 
0.2%
11 1
 
0.1%
15 38
 
3.8%
17 19
 
1.9%
30 414
41.8%
31 4
 
0.4%
34 417
42.1%
35 61
 
6.2%
36 3
 
0.3%
ValueCountFrequency (%)
100 1
 
0.1%
50 7
 
0.7%
45 3
 
0.3%
40 16
 
1.6%
36 3
 
0.3%
35 61
 
6.2%
34 417
42.1%
31 4
 
0.4%
30 414
41.8%
17 19
 
1.9%

Interactions

2024-03-14T09:35:48.184645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T09:35:50.229554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장소명클래스강의시작시간강의종료시간수강인원
장소명1.0000.2730.7710.8950.153
클래스0.2731.0001.0001.0000.000
강의시작시간0.7711.0001.0000.9670.169
강의종료시간0.8951.0000.9671.0000.271
수강인원0.1530.0000.1690.2711.000
2024-03-14T09:35:50.311295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
클래스강의종료시간장소명강의시작시간
클래스1.0000.9940.2610.997
강의종료시간0.9941.0000.7440.867
장소명0.2610.7441.0000.620
강의시작시간0.9970.8670.6201.000
2024-03-14T09:35:50.388048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수강인원장소명클래스강의시작시간강의종료시간
수강인원1.0000.1250.0000.0980.144
장소명0.1251.0000.2610.6200.744
클래스0.0000.2611.0000.9970.994
강의시작시간0.0980.6200.9971.0000.867
강의종료시간0.1440.7440.9940.8671.000

Missing values

2024-03-14T09:35:48.277987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T09:35:48.374756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

장소명강의명클래스강의시작일강의종료일강의시작시간강의종료시간수강인원
0시청(8층)전산교육장컴퓨터기초3주 (오후반)오후반2006-02-062006-02-2413:3014:5034
1시민정보화교육장컴퓨터기초및운영체제3주 (오전반)오전반2006-02-062006-02-2410:0011:3030
2시민정보화교육장컴퓨터기초및운영체제3주 (오후반)오후반2006-02-062006-02-2413:3014:5030
3시청(8층)전산교육장컴퓨터기초3주 (오전반)오전반2006-02-062006-02-2410:0011:3034
4시청(8층)전산교육장인터넷활용(3주)오후반2006-02-272006-03-1714:0015:0035
5시청(8층)전산교육장문서작성(4주)오전반2006-02-272006-03-2410:0010:5035
6시청(8층)전산교육장컴퓨터그래픽(포토샵)오전반2006-02-272006-04-1411:0012:0034
7시민정보화교육장문서작성(4주)오전반2006-02-272006-03-2410:0011:3030
8시민정보화교육장인터넷활용(오후반)오후반2006-02-272006-03-1713:3014:5030
9시청(8층)전산교육장엑셀오후반2006-03-202006-04-1414:0015:0034
장소명강의명클래스강의시작일강의종료일강의시작시간강의종료시간수강인원
980시립도서관엑셀오후반2023-10-092023-10-2713:3015:3030
981시청(8층)전산교육장문서작성오전반2023-10-162023-11-0310:0012:0034
982시립도서관인터넷활용오전반2023-10-232023-11-1010:0012:0030
983시립도서관파워포인트오후반2023-10-302023-11-1713:3015:3030
984시청(8층)전산교육장엑셀오전반2023-11-062023-11-2410:0012:0034
985시립도서관문서작성오전반2023-11-132023-12-0110:0012:0030
986시립도서관스마트폰입문오후반2023-11-202022-12-0113:3015:3030
987시청(8층)전산교육장파워포인트오전반2023-11-272023-12-1510:0012:0034
988시립도서관스마트폰입문오전반2023-12-042023-12-1510:0012:0030
989시립도서관스마트폰활용오후반2023-12-042023-12-1513:3015:3030