Overview

Dataset statistics

Number of variables6
Number of observations21
Missing cells1
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory54.3 B

Variable types

Text3
Categorical3

Dataset

Description충청북도 내에 소재하고 있는 대학교에 대한 파일데이터 정보를 제공합니다. (대학명, 총장, 유형, 설립주체, 소재, 홈페이지)
URLhttps://www.data.go.kr/data/3066312/fileData.do

Alerts

설립주체 is highly overall correlated with 소재High correlation
소재 is highly overall correlated with 설립주체High correlation
총장 has 1 (4.8%) missing valuesMissing
대학명 has unique valuesUnique
홈페이지 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:23:42.148822
Analysis finished2023-12-12 07:23:42.604208
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대학명
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T16:23:42.742763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length5.8095238
Min length3

Characters and Unicode

Total characters122
Distinct characters45
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row강동대
2nd row건국대(글로컬)
3rd row극동대
4th row꽃동네대
5th row대원대
ValueCountFrequency (%)
한국폴리텍 2
 
8.7%
강동대 1
 
4.3%
건국대(글로컬 1
 
4.3%
ⅳ대학(충주 1
 
4.3%
ⅳ대학(청주 1
 
4.3%
우석대 1
 
4.3%
한국교통대 1
 
4.3%
한국교원대 1
 
4.3%
충청대 1
 
4.3%
충북보건과학대 1
 
4.3%
Other values (12) 12
52.2%
2023-12-12T16:23:43.110358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
17.2%
17
 
13.9%
5
 
4.1%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
4
 
3.3%
4
 
3.3%
Other values (35) 49
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95
77.9%
Control 17
 
13.9%
Open Punctuation 3
 
2.5%
Close Punctuation 3
 
2.5%
Space Separator 2
 
1.6%
Letter Number 2
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
22.1%
5
 
5.3%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
Other values (30) 36
37.9%
Control
ValueCountFrequency (%)
17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95
77.9%
Common 25
 
20.5%
Latin 2
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
22.1%
5
 
5.3%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
Other values (30) 36
37.9%
Common
ValueCountFrequency (%)
17
68.0%
( 3
 
12.0%
) 3
 
12.0%
2
 
8.0%
Latin
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95
77.9%
ASCII 25
 
20.5%
Number Forms 2
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
22.1%
5
 
5.3%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
Other values (30) 36
37.9%
ASCII
ValueCountFrequency (%)
17
68.0%
( 3
 
12.0%
) 3
 
12.0%
2
 
8.0%
Number Forms
ValueCountFrequency (%)
2
100.0%

총장
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing1
Missing (%)4.8%
Memory size300.0 B
2023-12-12T16:23:43.334828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters60
Distinct characters43
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row서석해
2nd row문상호
3rd row류기일
4th row이종서
5th row김영철
ValueCountFrequency (%)
서석해 1
 
5.0%
문상호 1
 
5.0%
홍석원 1
 
5.0%
양기용 1
 
5.0%
남천현 1
 
5.0%
윤승조 1
 
5.0%
김종우 1
 
5.0%
박용석 1
 
5.0%
김용수 1
 
5.0%
고창섭 1
 
5.0%
Other values (10) 10
50.0%
2023-12-12T16:23:43.713734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
6.7%
4
 
6.7%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (33) 34
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
6.7%
4
 
6.7%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (33) 34
56.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
6.7%
4
 
6.7%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (33) 34
56.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
 
6.7%
4
 
6.7%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (33) 34
56.7%

유형
Categorical

Distinct4
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
대학
13 
전문대
기능대학
각종학교
 
1

Length

Max length4
Median length2
Mean length2.5238095
Min length2

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row전문대
2nd row대학
3rd row대학
4th row대학
5th row전문대

Common Values

ValueCountFrequency (%)
대학 13
61.9%
전문대 5
 
23.8%
기능대학 2
 
9.5%
각종학교 1
 
4.8%

Length

2023-12-12T16:23:43.891707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:23:44.034052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대학 13
61.9%
전문대 5
 
23.8%
기능대학 2
 
9.5%
각종학교 1
 
4.8%

설립주체
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size300.0 B
사립
16 
국립
공립
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row사립
2nd row사립
3rd row사립
4th row사립
5th row사립

Common Values

ValueCountFrequency (%)
사립 16
76.2%
국립 4
 
19.0%
공립 1
 
4.8%

Length

2023-12-12T16:23:44.184284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:23:44.312843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사립 16
76.2%
국립 4
 
19.0%
공립 1
 
4.8%

소재
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)38.1%
Missing0
Missing (%)0.0%
Memory size300.0 B
청주
충주
제천
음성
영동
Other values (3)

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique4 ?
Unique (%)19.0%

Sample

1st row음성
2nd row충주
3rd row음성
4th row청주
5th row제천

Common Values

ValueCountFrequency (%)
청주 9
42.9%
충주 3
 
14.3%
제천 3
 
14.3%
음성 2
 
9.5%
영동 1
 
4.8%
괴산 1
 
4.8%
옥천 1
 
4.8%
진천 1
 
4.8%

Length

2023-12-12T16:23:44.744306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:23:44.873832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청주 9
42.9%
충주 3
 
14.3%
제천 3
 
14.3%
음성 2
 
9.5%
영동 1
 
4.8%
괴산 1
 
4.8%
옥천 1
 
4.8%
진천 1
 
4.8%

홈페이지
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T16:23:45.166837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length38
Mean length35.571429
Min length17

Characters and Unicode

Total characters747
Distinct characters34
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st rowhttps://www.gangdong.ac.kr/intro.jsp
2nd rowhttps://www.kku.ac.kr/mbshome/mbs/wwwkr/index.do
3rd rowhttps://www.kdu.ac.kr/home/intro.do
4th rowhttp://www.kkot.ac.kr/
5th rowhttps://www.daewon.ac.kr/mbs/daewon/
ValueCountFrequency (%)
https://www.gangdong.ac.kr/intro.jsp 1
 
4.8%
https://www.chungbuk.ac.kr/site/www/main.do 1
 
4.8%
https://www.kopo.ac.kr/chungju/index.do 1
 
4.8%
https://www.kopo.ac.kr/cheongju/index.do 1
 
4.8%
https://jc.woosuk.ac.kr/woosukjcmain.do 1
 
4.8%
https://www.ut.ac.kr/kor.do 1
 
4.8%
https://knue.ac.kr/smain.html 1
 
4.8%
https://www.ok.ac.kr/www/index.do 1
 
4.8%
https://www.chsu.ac.kr/cmshome/maindefault.aspx 1
 
4.8%
http://www.cpu.ac.kr/home/main.do 1
 
4.8%
Other values (11) 11
52.4%
2023-12-12T16:23:45.593889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 83
 
11.1%
. 78
 
10.4%
w 74
 
9.9%
t 56
 
7.5%
o 41
 
5.5%
k 37
 
5.0%
s 34
 
4.6%
a 32
 
4.3%
c 32
 
4.3%
h 31
 
4.1%
Other values (24) 249
33.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 556
74.4%
Other Punctuation 182
 
24.4%
Uppercase Letter 8
 
1.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 74
13.3%
t 56
 
10.1%
o 41
 
7.4%
k 37
 
6.7%
s 34
 
6.1%
a 32
 
5.8%
c 32
 
5.8%
h 31
 
5.6%
r 29
 
5.2%
n 27
 
4.9%
Other values (13) 163
29.3%
Uppercase Letter
ValueCountFrequency (%)
M 2
25.0%
V 1
12.5%
C 1
12.5%
H 1
12.5%
D 1
12.5%
W 1
12.5%
J 1
12.5%
Other Punctuation
ValueCountFrequency (%)
/ 83
45.6%
. 78
42.9%
: 21
 
11.5%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 564
75.5%
Common 183
 
24.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 74
13.1%
t 56
 
9.9%
o 41
 
7.3%
k 37
 
6.6%
s 34
 
6.0%
a 32
 
5.7%
c 32
 
5.7%
h 31
 
5.5%
r 29
 
5.1%
n 27
 
4.8%
Other values (20) 171
30.3%
Common
ValueCountFrequency (%)
/ 83
45.4%
. 78
42.6%
: 21
 
11.5%
1 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 747
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 83
 
11.1%
. 78
 
10.4%
w 74
 
9.9%
t 56
 
7.5%
o 41
 
5.5%
k 37
 
5.0%
s 34
 
4.6%
a 32
 
4.3%
c 32
 
4.3%
h 31
 
4.1%
Other values (24) 249
33.3%

Correlations

2023-12-12T16:23:45.693776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대학명총장유형설립주체소재홈페이지
대학명\t1.0001.0001.0001.0001.0001.000
총장1.0001.0001.0001.0001.0001.000
유형1.0001.0001.0000.0000.0001.000
설립주체1.0001.0000.0001.0000.7201.000
소재1.0001.0000.0000.7201.0001.000
홈페이지1.0001.0001.0001.0001.0001.000
2023-12-12T16:23:45.815364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재설립주체유형
소재1.0000.5090.000
설립주체0.5091.0000.000
유형0.0000.0001.000
2023-12-12T16:23:45.897182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형설립주체소재
유형1.0000.0000.000
설립주체0.0001.0000.509
소재0.0000.5091.000

Missing values

2023-12-12T16:23:42.437226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:23:42.560907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대학명총장유형설립주체소재홈페이지
0강동대서석해전문대사립음성https://www.gangdong.ac.kr/intro.jsp
1건국대(글로컬)문상호대학사립충주https://www.kku.ac.kr/mbshome/mbs/wwwkr/index.do
2극동대류기일대학사립음성https://www.kdu.ac.kr/home/intro.do
3꽃동네대이종서대학사립청주http://www.kkot.ac.kr/
4대원대김영철전문대사립제천https://www.daewon.ac.kr/mbs/daewon/
5서원대손석민대학사립청주https://www.seowon.ac.kr/sites/seowon/index.do
6세명대권동현대학사립제천http://www.semyung.ac.kr/kor.do
7유원대채훈관대학사립영동https://www.u1.ac.kr/html/intro/intro.html
8중원대황윤원대학사립괴산http://www.jwu.ac.kr/site/siteView.jwu
9청주교대이혁규대학국립청주https://www.cje.ac.kr/group/main
대학명총장유형설립주체소재홈페이지
11충북대고창섭대학국립청주https://www.chungbuk.ac.kr/site/www/main.do
12충북도립대김용수전문대공립옥천http://www.cpu.ac.kr/home/main.do
13충북보건과학대박용석전문대사립청주https://www.chsu.ac.kr/CmsHome/MainDefault.aspx
14충청대<NA>전문대사립청주https://www.ok.ac.kr/www/index.do
15한국교원대김종우대학국립청주https://knue.ac.kr/smain.html
16한국교통대윤승조대학국립충주https://www.ut.ac.kr/kor.do
17우석대남천현대학사립진천https://jc.woosuk.ac.kr/WoosukJcMain.do
18한국폴리텍 ⅳ대학(청주)양기용기능대학사립청주https://www.kopo.ac.kr/cheongju/index.do
19한국폴리텍 ⅳ대학(충주)홍석원기능대학사립충주https://www.kopo.ac.kr/chungju/index.do
20순복음총회신학교유영희각종학교사립제천http://kcc.ac.kr/