Overview

Dataset statistics

Number of variables14
Number of observations35
Missing cells105
Missing cells (%)21.4%
Duplicate rows9
Duplicate rows (%)25.7%
Total size in memory4.4 KiB
Average record size in memory128.8 B

Variable types

Categorical10
DateTime1
Unsupported3

Dataset

Description경상남도 도립남해대학 입학정원 DB입니다. (입학년도, 정원외인원)
Author경상남도
URLhttps://www.data.go.kr/data/15039595/fileData.do

Alerts

정원내_합격인원(주)1 has constant value ""Constant
정원내_합격인원(주)2 has constant value ""Constant
정원내_합격인원(야)2 has constant value ""Constant
정원외인원(야)2 has constant value ""Constant
Dataset has 9 (25.7%) duplicate rowsDuplicates
정원내_입학정원(야)1 is highly overall correlated with 정원내_입학정원(주)1 and 2 other fieldsHigh correlation
정원내_입학정원(야)2 is highly overall correlated with 정원내_입학정원(주)1 and 2 other fieldsHigh correlation
정원내_입학정원(주)2 is highly overall correlated with 정원내_입학정원(주)1 and 3 other fieldsHigh correlation
정원내_입학정원(주)1 is highly overall correlated with 정원내_입학정원(야)1 and 3 other fieldsHigh correlation
정원외인원(주)2 is highly overall correlated with 정원내_입학정원(주)1 and 1 other fieldsHigh correlation
정원내_입학정원(주)1 is highly imbalanced (56.2%)Imbalance
정원내_입학정원(주)2 is highly imbalanced (56.2%)Imbalance
사용자아이디 has 35 (100.0%) missing valuesMissing
사용날짜 has 35 (100.0%) missing valuesMissing
시간 has 35 (100.0%) missing valuesMissing
사용자아이디 is an unsupported type, check if it needs cleaning or further analysisUnsupported
사용날짜 is an unsupported type, check if it needs cleaning or further analysisUnsupported
시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 20:02:03.908812
Analysis finished2023-12-12 20:02:04.502488
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

입학년도
Categorical

Distinct5
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
1999
2000
1998
1997
1996

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1997
2nd row1997
3rd row1997
4th row1997
5th row1997

Common Values

ValueCountFrequency (%)
1999 9
25.7%
2000 9
25.7%
1998 7
20.0%
1997 6
17.1%
1996 4
11.4%

Length

2023-12-13T05:02:04.575089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:04.691455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1999 9
25.7%
2000 9
25.7%
1998 7
20.0%
1997 6
17.1%
1996 4
11.4%

정원내_입학정원(주)1
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
80
30 
0
40
 
1

Length

Max length2
Median length2
Mean length1.8857143
Min length1

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row80
2nd row80
3rd row80
4th row80
5th row80

Common Values

ValueCountFrequency (%)
80 30
85.7%
0 4
 
11.4%
40 1
 
2.9%

Length

2023-12-13T05:02:04.821012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:04.915550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
80 30
85.7%
0 4
 
11.4%
40 1
 
2.9%

정원내_입학정원(야)1
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
0
31 
40

Length

Max length2
Median length1
Mean length1.1142857
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 31
88.6%
40 4
 
11.4%

Length

2023-12-13T05:02:05.018948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:05.110840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 31
88.6%
40 4
 
11.4%

정원내_합격인원(주)1
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
0
35 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 35
100.0%

Length

2023-12-13T05:02:05.209636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:05.305866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 35
100.0%

정원내_입학정원(주)2
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
80
30 
0
40
 
1

Length

Max length2
Median length2
Mean length1.8857143
Min length1

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row80
2nd row80
3rd row80
4th row80
5th row80

Common Values

ValueCountFrequency (%)
80 30
85.7%
0 4
 
11.4%
40 1
 
2.9%

Length

2023-12-13T05:02:05.412183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:05.544706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
80 30
85.7%
0 4
 
11.4%
40 1
 
2.9%

정원내_입학정원(야)2
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
0
31 
40

Length

Max length2
Median length1
Mean length1.1142857
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 31
88.6%
40 4
 
11.4%

Length

2023-12-13T05:02:05.692968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:05.821390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 31
88.6%
40 4
 
11.4%

정원내_합격인원(주)2
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
0
35 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 35
100.0%

Length

2023-12-13T05:02:05.935119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:06.033202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 35
100.0%

정원내_합격인원(야)2
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
0
35 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 35
100.0%

Length

2023-12-13T05:02:06.142555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:06.279677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 35
100.0%

정원외인원(주)2
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
3
13 
2
10 
0
10 
1
 
1
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)5.7%

Sample

1st row3
2nd row3
3rd row2
4th row0
5th row3

Common Values

ValueCountFrequency (%)
3 13
37.1%
2 10
28.6%
0 10
28.6%
1 1
 
2.9%
4 1
 
2.9%

Length

2023-12-13T05:02:06.434256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:06.576290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 13
37.1%
2 10
28.6%
0 10
28.6%
1 1
 
2.9%
4 1
 
2.9%

정원외인원(야)2
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
0
35 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 35
100.0%

Length

2023-12-13T05:02:06.753927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:06.896883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 35
100.0%
Distinct5
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
Minimum1996-03-07 00:00:00
Maximum2000-03-03 00:00:00
2023-12-13T05:02:06.995740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:02:07.120901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=5)

사용자아이디
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing35
Missing (%)100.0%
Memory size447.0 B

사용날짜
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing35
Missing (%)100.0%
Memory size447.0 B

시간
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing35
Missing (%)100.0%
Memory size447.0 B

Correlations

2023-12-13T05:02:07.255141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입학년도정원내_입학정원(주)1정원내_입학정원(야)1정원내_입학정원(주)2정원내_입학정원(야)2정원외인원(주)2입학일
입학년도1.0000.0650.0760.0650.0760.0731.000
정원내_입학정원(주)10.0651.0001.0001.0001.0000.7740.065
정원내_입학정원(야)10.0761.0001.0001.0000.9740.3980.076
정원내_입학정원(주)20.0651.0001.0001.0001.0000.7740.065
정원내_입학정원(야)20.0761.0000.9741.0001.0000.3980.076
정원외인원(주)20.0730.7740.3980.7740.3981.0000.073
입학일1.0000.0650.0760.0650.0760.0731.000
2023-12-13T05:02:07.385327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정원외인원(주)2정원내_입학정원(야)1입학년도정원내_입학정원(야)2정원내_입학정원(주)2정원내_입학정원(주)1
정원외인원(주)21.0000.4600.0000.4600.7590.759
정원내_입학정원(야)10.4601.0000.0660.8540.9850.985
입학년도0.0000.0661.0000.0660.0000.000
정원내_입학정원(야)20.4600.8540.0661.0000.9850.985
정원내_입학정원(주)20.7590.9850.0000.9851.0001.000
정원내_입학정원(주)10.7590.9850.0000.9851.0001.000
2023-12-13T05:02:07.552953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입학년도정원내_입학정원(주)1정원내_입학정원(야)1정원내_입학정원(주)2정원내_입학정원(야)2정원외인원(주)2
입학년도1.0000.0000.0660.0000.0660.000
정원내_입학정원(주)10.0001.0000.9851.0000.9850.759
정원내_입학정원(야)10.0660.9851.0000.9850.8540.460
정원내_입학정원(주)20.0001.0000.9851.0000.9850.759
정원내_입학정원(야)20.0660.9850.8540.9851.0000.460
정원외인원(주)20.0000.7590.4600.7590.4601.000

Missing values

2023-12-13T05:02:04.257982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:02:04.425211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

입학년도정원내_입학정원(주)1정원내_입학정원(야)1정원내_합격인원(주)1정원내_입학정원(주)2정원내_입학정원(야)2정원내_합격인원(주)2정원내_합격인원(야)2정원외인원(주)2정원외인원(야)2입학일사용자아이디사용날짜시간
01997800080000301997-03-05<NA><NA><NA>
11997800080000301997-03-05<NA><NA><NA>
21997800080000201997-03-05<NA><NA><NA>
31997800080000001997-03-05<NA><NA><NA>
41997800080000301997-03-05<NA><NA><NA>
51997800080000301997-03-05<NA><NA><NA>
61996800080000301996-03-07<NA><NA><NA>
71996800080000201996-03-07<NA><NA><NA>
81996800080000101996-03-07<NA><NA><NA>
91996800080000001996-03-07<NA><NA><NA>
입학년도정원내_입학정원(주)1정원내_입학정원(야)1정원내_합격인원(주)1정원내_입학정원(주)2정원내_입학정원(야)2정원내_합격인원(주)2정원내_합격인원(야)2정원외인원(주)2정원외인원(야)2입학일사용자아이디사용날짜시간
251999040004000001999-03-02<NA><NA><NA>
262000800080000202000-03-03<NA><NA><NA>
272000800080000302000-03-03<NA><NA><NA>
282000800080000202000-03-03<NA><NA><NA>
292000800080000002000-03-03<NA><NA><NA>
302000800080000202000-03-03<NA><NA><NA>
312000800080000302000-03-03<NA><NA><NA>
322000800080000202000-03-03<NA><NA><NA>
332000040004000002000-03-03<NA><NA><NA>
342000040004000002000-03-03<NA><NA><NA>

Duplicate rows

Most frequently occurring

입학년도정원내_입학정원(주)1정원내_입학정원(야)1정원내_합격인원(주)1정원내_입학정원(주)2정원내_입학정원(야)2정원내_합격인원(주)2정원내_합격인원(야)2정원외인원(주)2정원외인원(야)2입학일# duplicates
01997800080000301997-03-054
72000800080000202000-03-034
21998800080000301998-03-033
41999800080000201999-03-023
51999800080000301999-03-023
11998800080000001998-03-032
31999040004000001999-03-022
62000040004000002000-03-032
82000800080000302000-03-032