Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 2776 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 67.9 KiB |
Average record size in memory | 25.0 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 대학(대학교, 전문대학 및 사이버대학 포함) 정보에 대한 데이터로 일련번호, 상태, 학교명 등의 항목을 제공합니다. |
---|---|
Author | 국가평생교육진흥원 |
URL | https://www.data.go.kr/data/15070852/fileData.do |
일련번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 11:19:24.884353 |
---|---|
Analysis finished | 2023-12-12 11:19:25.852213 |
Duration | 0.97 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
일련번호
Real number (ℝ)
UNIQUE
 
Distinct | 2776 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1388.5 |
Minimum | 1 |
---|---|
Maximum | 2776 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 24.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 139.75 |
Q1 | 694.75 |
median | 1388.5 |
Q3 | 2082.25 |
95-th percentile | 2637.25 |
Maximum | 2776 |
Range | 2775 |
Interquartile range (IQR) | 1387.5 |
Descriptive statistics
Standard deviation | 801.5065 |
---|---|
Coefficient of variation (CV) | 0.57724631 |
Kurtosis | -1.2 |
Mean | 1388.5 |
Median Absolute Deviation (MAD) | 694 |
Skewness | 0 |
Sum | 3854476 |
Variance | 642412.67 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
1856 | 1 | < 0.1% |
1848 | 1 | < 0.1% |
1849 | 1 | < 0.1% |
1850 | 1 | < 0.1% |
1851 | 1 | < 0.1% |
1852 | 1 | < 0.1% |
1853 | 1 | < 0.1% |
1854 | 1 | < 0.1% |
1855 | 1 | < 0.1% |
Other values (2766) | 2766 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
2776 | 1 | |
2775 | 1 | |
2774 | 1 | |
2773 | 1 | |
2772 | 1 | |
2771 | 1 | |
2770 | 1 | |
2769 | 1 | |
2768 | 1 | |
2767 | 1 |
상태
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 21.8 KiB |
유효 | |
---|---|
변경 | |
삭제 | 81 |
통합됨 | 54 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.0194524 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 유효 |
---|---|
2nd row | 변경 |
3rd row | 변경 |
4th row | 변경 |
5th row | 유효 |
Common Values
Value | Count | Frequency (%) |
유효 | 2017 | |
변경 | 624 | 22.5% |
삭제 | 81 | 2.9% |
통합됨 | 54 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
유효 | 2017 | |
변경 | 624 | 22.5% |
삭제 | 81 | 2.9% |
통합됨 | 54 | 1.9% |
학교명
Text
Distinct | 2766 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 21.8 KiB |
Value | Count | Frequency (%) |
university | 923 | 15.1% |
of | 469 | 7.7% |
college | 215 | 3.5% |
state | 103 | 1.7% |
the | 102 | 1.7% |
institute | 54 | 0.9% |
and | 52 | 0.8% |
technology | 49 | 0.8% |
international | 43 | 0.7% |
california | 30 | 0.5% |
Other values (2740) | 4078 |
Most occurring characters
Value | Count | Frequency (%) |
i | 3499 | 7.7% |
3363 | 7.4% | |
e | 2981 | 6.5% |
n | 2774 | 6.1% |
t | 2246 | 4.9% |
o | 2046 | 4.5% |
a | 1933 | 4.2% |
r | 1782 | 3.9% |
s | 1736 | 3.8% |
학 | 1658 | 3.6% |
Other values (379) | 21708 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 26986 | |
Other Letter | 10429 | 22.8% |
Uppercase Letter | 4780 | 10.5% |
Space Separator | 3363 | 7.4% |
Other Punctuation | 94 | 0.2% |
Decimal Number | 22 | < 0.1% |
Close Punctuation | 17 | < 0.1% |
Open Punctuation | 17 | < 0.1% |
Dash Punctuation | 14 | < 0.1% |
Letter Number | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
학 | 1658 | 15.9% |
대 | 1514 | 14.5% |
교 | 791 | 7.6% |
전 | 468 | 4.5% |
문 | 417 | 4.0% |
업 | 205 | 2.0% |
원 | 196 | 1.9% |
산 | 159 | 1.5% |
공 | 159 | 1.5% |
한 | 154 | 1.5% |
Other values (306) | 4708 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 3499 | |
e | 2981 | |
n | 2774 | |
t | 2246 | |
o | 2046 | 7.6% |
a | 1933 | 7.2% |
r | 1782 | 6.6% |
s | 1736 | 6.4% |
y | 1291 | 4.8% |
l | 1181 | 4.4% |
Other values (16) | 5517 |
Uppercase Letter
Value | Count | Frequency (%) |
U | 1000 | |
C | 532 | |
S | 423 | 8.8% |
T | 321 | 6.7% |
I | 257 | 5.4% |
N | 241 | 5.0% |
M | 220 | 4.6% |
A | 205 | 4.3% |
E | 162 | 3.4% |
B | 142 | 3.0% |
Other values (16) | 1277 |
Decimal Number
Value | Count | Frequency (%) |
1 | 8 | |
2 | 7 | |
4 | 2 | 9.1% |
3 | 2 | 9.1% |
5 | 1 | 4.5% |
7 | 1 | 4.5% |
6 | 1 | 4.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 27 | |
. | 25 | |
' | 23 | |
& | 12 | |
? | 5 | 5.3% |
" | 2 | 2.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅳ | 1 | |
Ⅱ | 1 | |
Ⅲ | 1 | |
Ⅶ | 1 |
Space Separator
Value | Count | Frequency (%) |
3363 |
Close Punctuation
Value | Count | Frequency (%) |
) | 17 |
Open Punctuation
Value | Count | Frequency (%) |
( | 17 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 31770 | |
Hangul | 10425 | 22.8% |
Common | 3527 | 7.7% |
Han | 4 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
학 | 1658 | 15.9% |
대 | 1514 | 14.5% |
교 | 791 | 7.6% |
전 | 468 | 4.5% |
문 | 417 | 4.0% |
업 | 205 | 2.0% |
원 | 196 | 1.9% |
산 | 159 | 1.5% |
공 | 159 | 1.5% |
한 | 154 | 1.5% |
Other values (302) | 4704 |
Latin
Value | Count | Frequency (%) |
i | 3499 | 11.0% |
e | 2981 | 9.4% |
n | 2774 | 8.7% |
t | 2246 | 7.1% |
o | 2046 | 6.4% |
a | 1933 | 6.1% |
r | 1782 | 5.6% |
s | 1736 | 5.5% |
y | 1291 | 4.1% |
l | 1181 | 3.7% |
Other values (46) | 10301 |
Common
Value | Count | Frequency (%) |
3363 | ||
, | 27 | 0.8% |
. | 25 | 0.7% |
' | 23 | 0.7% |
) | 17 | 0.5% |
( | 17 | 0.5% |
- | 14 | 0.4% |
& | 12 | 0.3% |
1 | 8 | 0.2% |
2 | 7 | 0.2% |
Other values (7) | 14 | 0.4% |
Han
Value | Count | Frequency (%) |
院 | 1 | |
美 | 1 | |
央 | 1 | |
中 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35288 | |
Hangul | 10425 | 22.8% |
None | 5 | < 0.1% |
Number Forms | 4 | < 0.1% |
CJK | 4 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 3499 | 9.9% |
3363 | 9.5% | |
e | 2981 | 8.4% |
n | 2774 | 7.9% |
t | 2246 | 6.4% |
o | 2046 | 5.8% |
a | 1933 | 5.5% |
r | 1782 | 5.0% |
s | 1736 | 4.9% |
y | 1291 | 3.7% |
Other values (58) | 11637 |
Hangul
Value | Count | Frequency (%) |
학 | 1658 | 15.9% |
대 | 1514 | 14.5% |
교 | 791 | 7.6% |
전 | 468 | 4.5% |
문 | 417 | 4.0% |
업 | 205 | 2.0% |
원 | 196 | 1.9% |
산 | 159 | 1.5% |
공 | 159 | 1.5% |
한 | 154 | 1.5% |
Other values (302) | 4704 |
None
Value | Count | Frequency (%) |
? | 5 |
Number Forms
Value | Count | Frequency (%) |
Ⅳ | 1 | |
Ⅱ | 1 | |
Ⅲ | 1 | |
Ⅶ | 1 |
CJK
Value | Count | Frequency (%) |
院 | 1 | |
美 | 1 | |
央 | 1 | |
中 | 1 |
일련번호 | 상태 | |
---|---|---|
일련번호 | 1.000 | 0.443 |
상태 | 0.443 | 1.000 |
일련번호 | 상태 | |
---|---|---|
일련번호 | 1.000 | 0.279 |
상태 | 0.279 | 1.000 |
일련번호 | 상태 | 학교명 | |
---|---|---|---|
0 | 1 | 유효 | 서울여자대학교 |
1 | 2 | 변경 | 서울예술전문대학 |
2 | 3 | 변경 | 서울예술대학 |
3 | 4 | 변경 | 서울장로회신학교 |
4 | 5 | 유효 | 서울장신대학교 |
5 | 6 | 유효 | 서원대학교 |
6 | 7 | 변경 | 서일전문대학 |
7 | 8 | 변경 | 서일대학 |
8 | 9 | 변경 | 서정대학 |
9 | 10 | 변경 | 군산전문대학 |
일련번호 | 상태 | 학교명 | |
---|---|---|---|
2766 | 2767 | 유효 | Bukhara Technological Institute of Food and L.I |
2767 | 2768 | 유효 | University of Greenwich |
2768 | 2769 | 유효 | Ramkhamhaeng University |
2769 | 2770 | 유효 | Montgomery County Community College |
2770 | 2771 | 유효 | American University of Central Asia |
2771 | 2772 | 유효 | Graffith University |
2772 | 2773 | 유효 | Deakin University |
2773 | 2774 | 유효 | Health Sciences University of Hokkaido |
2774 | 2775 | 유효 | San Jose Christian College |
2775 | 2776 | 유효 | INTI International College Subang |