Overview

Dataset statistics

Number of variables5
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory49.3 B

Variable types

Text1
Numeric4

Dataset

Description국가과학기술연구회 소관 25개 출연연구기관의 정규인력에 대한 학위별(학사,석사,박사) 인력 정보를 제공합니다.
Author국가과학기술연구회
URLhttps://www.data.go.kr/data/15045199/fileData.do

Alerts

박사 is highly overall correlated with 석사(박사수료 포함) and 2 other fieldsHigh correlation
석사(박사수료 포함) is highly overall correlated with 박사 and 2 other fieldsHigh correlation
학사이하 is highly overall correlated with 박사 and 2 other fieldsHigh correlation
총현원 is highly overall correlated with 박사 and 2 other fieldsHigh correlation
기관명 has unique valuesUnique
박사 has unique valuesUnique
학사이하 has unique valuesUnique
총현원 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:16:11.397294
Analysis finished2023-12-12 09:16:13.665721
Duration2.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T18:16:13.830953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length8.28
Min length5

Characters and Unicode

Total characters207
Distinct characters58
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row한국과학기술연구원
2nd row녹색기술센터
3rd row한국기초과학지원연구원
4th row국가핵융합연구소
5th row한국천문연구원
ValueCountFrequency (%)
한국과학기술연구원 1
 
4.0%
한국표준과학연구원 1
 
4.0%
안전성평가연구소 1
 
4.0%
한국화학연구원 1
 
4.0%
한국전기연구원 1
 
4.0%
한국에너지기술연구원 1
 
4.0%
한국항공우주연구원 1
 
4.0%
재료연구소 1
 
4.0%
한국기계연구원 1
 
4.0%
한국지질자원연구원 1
 
4.0%
Other values (15) 15
60.0%
2023-12-12T18:16:14.239143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 207
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 207
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 207
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

박사
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean329.72
Minimum28
Maximum1112
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T18:16:14.380530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum28
5-th percentile48.8
Q1173
median265
Q3353
95-th percentile845.8
Maximum1112
Range1084
Interquartile range (IQR)180

Descriptive statistics

Standard deviation253.34553
Coefficient of variation (CV)0.76836569
Kurtosis3.3414442
Mean329.72
Median Absolute Deviation (MAD)92
Skewness1.7192227
Sum8243
Variance64183.96
MonotonicityNot monotonic
2023-12-12T18:16:14.535607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
597 1
 
4.0%
28 1
 
4.0%
908 1
 
4.0%
84 1
 
4.0%
344 1
 
4.0%
265 1
 
4.0%
346 1
 
4.0%
506 1
 
4.0%
209 1
 
4.0%
353 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
28 1
4.0%
40 1
4.0%
84 1
4.0%
141 1
4.0%
161 1
4.0%
167 1
4.0%
173 1
4.0%
181 1
4.0%
189 1
4.0%
209 1
4.0%
ValueCountFrequency (%)
1112 1
4.0%
908 1
4.0%
597 1
4.0%
582 1
4.0%
506 1
4.0%
372 1
4.0%
353 1
4.0%
346 1
4.0%
344 1
4.0%
339 1
4.0%

석사(박사수료 포함)
Real number (ℝ)

HIGH CORRELATION 

Distinct22
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean171.12
Minimum17
Maximum913
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T18:16:14.679954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17
5-th percentile40
Q191
median119
Q3177
95-th percentile337.4
Maximum913
Range896
Interquartile range (IQR)86

Descriptive statistics

Standard deviation177.31524
Coefficient of variation (CV)1.0362041
Kurtosis13.168186
Mean171.12
Median Absolute Deviation (MAD)45
Skewness3.302752
Sum4278
Variance31440.693
MonotonicityNot monotonic
2023-12-12T18:16:14.857365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
119 3
 
12.0%
40 2
 
8.0%
128 1
 
4.0%
93 1
 
4.0%
319 1
 
4.0%
104 1
 
4.0%
171 1
 
4.0%
177 1
 
4.0%
303 1
 
4.0%
91 1
 
4.0%
Other values (12) 12
48.0%
ValueCountFrequency (%)
17 1
4.0%
40 2
8.0%
68 1
4.0%
74 1
4.0%
85 1
4.0%
91 1
4.0%
93 1
4.0%
94 1
4.0%
104 1
4.0%
105 1
4.0%
ValueCountFrequency (%)
913 1
4.0%
342 1
4.0%
319 1
4.0%
303 1
4.0%
275 1
4.0%
191 1
4.0%
177 1
4.0%
171 1
4.0%
156 1
4.0%
135 1
4.0%

학사이하
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean132.48
Minimum15
Maximum482
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T18:16:14.990318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile27
Q174
median99
Q3177
95-th percentile253
Maximum482
Range467
Interquartile range (IQR)103

Descriptive statistics

Standard deviation99.940032
Coefficient of variation (CV)0.75437826
Kurtosis5.1711501
Mean132.48
Median Absolute Deviation (MAD)40
Skewness1.9183749
Sum3312
Variance9988.01
MonotonicityNot monotonic
2023-12-12T18:16:15.145648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
245 1
 
4.0%
15 1
 
4.0%
482 1
 
4.0%
158 1
 
4.0%
103 1
 
4.0%
232 1
 
4.0%
98 1
 
4.0%
207 1
 
4.0%
94 1
 
4.0%
81 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
15 1
4.0%
23 1
4.0%
43 1
4.0%
50 1
4.0%
59 1
4.0%
65 1
4.0%
74 1
4.0%
81 1
4.0%
84 1
4.0%
92 1
4.0%
ValueCountFrequency (%)
482 1
4.0%
255 1
4.0%
245 1
4.0%
232 1
4.0%
208 1
4.0%
207 1
4.0%
177 1
4.0%
158 1
4.0%
133 1
4.0%
131 1
4.0%

총현원
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean633.24
Minimum60
Maximum2280
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T18:16:15.282695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60
5-th percentile138.6
Q1348
median519
Q3674
95-th percentile1593.4
Maximum2280
Range2220
Interquartile range (IQR)326

Descriptive statistics

Standard deviation488.82617
Coefficient of variation (CV)0.77194455
Kurtosis5.0576021
Mean633.24
Median Absolute Deviation (MAD)171
Skewness2.0869322
Sum15831
Variance238951.02
MonotonicityNot monotonic
2023-12-12T18:16:15.409618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
970 1
 
4.0%
60 1
 
4.0%
1709 1
 
4.0%
346 1
 
4.0%
618 1
 
4.0%
674 1
 
4.0%
563 1
 
4.0%
1016 1
 
4.0%
394 1
 
4.0%
519 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
60 1
4.0%
103 1
4.0%
281 1
4.0%
293 1
4.0%
317 1
4.0%
346 1
4.0%
348 1
4.0%
392 1
4.0%
394 1
4.0%
413 1
4.0%
ValueCountFrequency (%)
2280 1
4.0%
1709 1
4.0%
1131 1
4.0%
1016 1
4.0%
970 1
4.0%
694 1
4.0%
674 1
4.0%
651 1
4.0%
618 1
4.0%
563 1
4.0%

Interactions

2023-12-12T18:16:13.095061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:11.599626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.019137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.696927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:13.196183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:11.718984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.107749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.800620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:13.302201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:11.821916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.215447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.910055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:13.394857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:11.919839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:12.592748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:16:13.006348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:16:15.509449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명박사석사(박사수료 포함)학사이하총현원
기관명1.0001.0001.0001.0001.000
박사1.0001.0000.7650.8610.930
석사(박사수료 포함)1.0000.7651.0000.7430.770
학사이하1.0000.8610.7431.0000.843
총현원1.0000.9300.7700.8431.000
2023-12-12T18:16:15.616629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
박사석사(박사수료 포함)학사이하총현원
박사1.0000.6940.7030.925
석사(박사수료 포함)0.6941.0000.7530.884
학사이하0.7030.7531.0000.828
총현원0.9250.8840.8281.000

Missing values

2023-12-12T18:16:13.518831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:16:13.623529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명박사석사(박사수료 포함)학사이하총현원
0한국과학기술연구원597128245970
1녹색기술센터28171560
2한국기초과학지원연구원18111992392
3국가핵융합연구소161119133413
4한국천문연구원1674074281
5한국생명공학연구원339135177651
6한국과학기술정보연구원25715699512
7한국한의학연구원1739450317
8한국생산기술연구원5823422081131
9한국전자통신연구원11129132552280
기관명박사석사(박사수료 포함)학사이하총현원
15세계김치연구소404023103
16한국지질자원연구원33310584522
17한국기계연구원3538581519
18재료연구소2099194394
19한국항공우주연구원5063032071016
20한국에너지기술연구원34611998563
21한국전기연구원265177232674
22한국화학연구원344171103618
23안전성평가연구소84104158346
24한국원자력연구원9083194821709