Overview

Dataset statistics

Number of variables2
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows6
Duplicate rows (%)6.0%
Total size in memory1.7 KiB
Average record size in memory17.3 B

Variable types

Categorical2

Dataset

Description공공데이터 중장기개방계획에 따른 경상남도 도립남해대학학사행정시스템 데이터입니다. 학사 프로그램의 정보를 포함하고 있습니다.
Author경상남도
URLhttps://www.data.go.kr/data/15092366/fileData.do

Alerts

Dataset has 6 (6.0%) duplicate rowsDuplicates
단위업무 is highly overall correlated with 업무High correlation
업무 is highly overall correlated with 단위업무High correlation
업무 is highly imbalanced (91.9%)Imbalance

Reproduction

Analysis started2023-12-12 21:58:08.520696
Analysis finished2023-12-12 21:58:08.698919
Duration0.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업무
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
HS
99 
BS
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st rowHS
2nd rowHS
3rd rowHS
4th rowHS
5th rowHS

Common Values

ValueCountFrequency (%)
HS 99
99.0%
BS 1
 
1.0%

Length

2023-12-13T06:58:08.768732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:58:08.858134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
hs 99
99.0%
bs 1
 
1.0%

단위업무
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
DR
34 
HJ
28 
SG
15 
SJ
JH
Other values (3)

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st rowHJ
2nd rowHJ
3rd rowJH
4th rowJH
5th rowJH

Common Values

ValueCountFrequency (%)
DR 34
34.0%
HJ 28
28.0%
SG 15
15.0%
SJ 8
 
8.0%
JH 7
 
7.0%
SI 6
 
6.0%
SY 1
 
1.0%
HS 1
 
1.0%

Length

2023-12-13T06:58:08.953100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:58:09.048907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
dr 34
34.0%
hj 28
28.0%
sg 15
15.0%
sj 8
 
8.0%
jh 7
 
7.0%
si 6
 
6.0%
sy 1
 
1.0%
hs 1
 
1.0%

Correlations

2023-12-13T06:58:09.124694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무단위업무
업무1.0001.000
단위업무1.0001.000
2023-12-13T06:58:09.197234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위업무업무
단위업무1.0000.969
업무0.9691.000
2023-12-13T06:58:09.268586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무단위업무
업무1.0000.969
단위업무0.9691.000

Missing values

2023-12-13T06:58:08.612520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:58:08.674575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업무단위업무
0HSHJ
1HSHJ
2HSJH
3HSJH
4HSJH
5HSJH
6HSJH
7HSJH
8HSJH
9HSDR
업무단위업무
90HSSG
91HSSG
92HSSG
93HSSJ
94HSSJ
95HSSJ
96HSSJ
97HSSJ
98HSSJ
99HSSJ

Duplicate rows

Most frequently occurring

업무단위업무# duplicates
0HSDR34
1HSHJ28
3HSSG15
5HSSJ8
2HSJH7
4HSSI6