Overview

Dataset statistics

Number of variables6
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory58.3 B

Variable types

Text1
Numeric5

Dataset

Description국가과학기술연구회 소관 25개 출연연구기관의 연령대별(20대이하~60대이상) 정규인력 연구직 인력정보를 제공합니다.
Author국가과학기술연구회
URLhttps://www.data.go.kr/data/15045196/fileData.do

Alerts

20대 이하 is highly overall correlated with 30대 and 3 other fieldsHigh correlation
30대 is highly overall correlated with 20대 이하 and 3 other fieldsHigh correlation
40대 is highly overall correlated with 20대 이하 and 3 other fieldsHigh correlation
50대 is highly overall correlated with 20대 이하 and 3 other fieldsHigh correlation
60대 이상 is highly overall correlated with 20대 이하 and 3 other fieldsHigh correlation
기관명 has unique valuesUnique
20대 이하 has 5 (20.0%) zerosZeros
60대 이상 has 1 (4.0%) zerosZeros

Reproduction

Analysis started2023-12-12 01:13:01.689948
Analysis finished2023-12-12 01:13:05.244466
Duration3.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T10:13:05.419308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length8.28
Min length5

Characters and Unicode

Total characters207
Distinct characters58
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row한국과학기술연구원
2nd row녹색기술센터
3rd row한국기초과학지원연구원
4th row국가핵융합연구소
5th row한국천문연구원
ValueCountFrequency (%)
한국과학기술연구원 1
 
4.0%
한국표준과학연구원 1
 
4.0%
안전성평가연구소 1
 
4.0%
한국화학연구원 1
 
4.0%
한국전기연구원 1
 
4.0%
한국에너지기술연구원 1
 
4.0%
한국항공우주연구원 1
 
4.0%
재료연구소 1
 
4.0%
한국기계연구원 1
 
4.0%
한국지질자원연구원 1
 
4.0%
Other values (15) 15
60.0%
2023-12-12T10:13:05.938709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 207
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 207
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 207
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
11.6%
24
 
11.6%
22
 
10.6%
21
 
10.1%
20
 
9.7%
11
 
5.3%
8
 
3.9%
7
 
3.4%
5
 
2.4%
4
 
1.9%
Other values (48) 61
29.5%

20대 이하
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct12
Distinct (%)48.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.36
Minimum0
Maximum42
Zeros5
Zeros (%)20.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T10:13:06.137005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q36
95-th percentile28.6
Maximum42
Range42
Interquartile range (IQR)5

Descriptive statistics

Standard deviation10.750504
Coefficient of variation (CV)1.4606663
Kurtosis3.92928
Mean7.36
Median Absolute Deviation (MAD)3
Skewness2.0694264
Sum184
Variance115.57333
MonotonicityNot monotonic
2023-12-12T10:13:06.305204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
0 5
20.0%
2 3
12.0%
1 3
12.0%
3 3
12.0%
6 3
12.0%
4 2
 
8.0%
23 1
 
4.0%
42 1
 
4.0%
22 1
 
4.0%
10 1
 
4.0%
Other values (2) 2
 
8.0%
ValueCountFrequency (%)
0 5
20.0%
1 3
12.0%
2 3
12.0%
3 3
12.0%
4 2
 
8.0%
6 3
12.0%
10 1
 
4.0%
13 1
 
4.0%
22 1
 
4.0%
23 1
 
4.0%
ValueCountFrequency (%)
42 1
 
4.0%
30 1
 
4.0%
23 1
 
4.0%
22 1
 
4.0%
13 1
 
4.0%
10 1
 
4.0%
6 3
12.0%
4 2
8.0%
3 3
12.0%
2 3
12.0%

30대
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean111.84
Minimum20
Maximum357
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T10:13:06.445673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile24.6
Q140
median79
Q3150
95-th percentile334
Maximum357
Range337
Interquartile range (IQR)110

Descriptive statistics

Standard deviation100.02486
Coefficient of variation (CV)0.89435679
Kurtosis1.2010359
Mean111.84
Median Absolute Deviation (MAD)44
Skewness1.4282485
Sum2796
Variance10004.973
MonotonicityNot monotonic
2023-12-12T10:13:06.609633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
42 2
 
8.0%
197 1
 
4.0%
68 1
 
4.0%
322 1
 
4.0%
80 1
 
4.0%
126 1
 
4.0%
65 1
 
4.0%
111 1
 
4.0%
159 1
 
4.0%
93 1
 
4.0%
Other values (14) 14
56.0%
ValueCountFrequency (%)
20 1
4.0%
24 1
4.0%
27 1
4.0%
28 1
4.0%
35 1
4.0%
39 1
4.0%
40 1
4.0%
42 2
8.0%
43 1
4.0%
65 1
4.0%
ValueCountFrequency (%)
357 1
4.0%
337 1
4.0%
322 1
4.0%
197 1
4.0%
190 1
4.0%
159 1
4.0%
150 1
4.0%
126 1
4.0%
122 1
4.0%
111 1
4.0%

40대
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean156.52
Minimum12
Maximum732
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T10:13:06.748339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile28.2
Q181
median105
Q3192
95-th percentile351.2
Maximum732
Range720
Interquartile range (IQR)111

Descriptive statistics

Standard deviation150.37734
Coefficient of variation (CV)0.96075477
Kurtosis8.5310002
Mean156.52
Median Absolute Deviation (MAD)34
Skewness2.6339594
Sum3913
Variance22613.343
MonotonicityNot monotonic
2023-12-12T10:13:06.898770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
192 2
 
8.0%
218 1
 
4.0%
49 1
 
4.0%
336 1
 
4.0%
72 1
 
4.0%
118 1
 
4.0%
98 1
 
4.0%
139 1
 
4.0%
316 1
 
4.0%
71 1
 
4.0%
Other values (14) 14
56.0%
ValueCountFrequency (%)
12 1
4.0%
23 1
4.0%
49 1
4.0%
69 1
4.0%
71 1
4.0%
72 1
4.0%
81 1
4.0%
83 1
4.0%
86 1
4.0%
91 1
4.0%
ValueCountFrequency (%)
732 1
4.0%
355 1
4.0%
336 1
4.0%
316 1
4.0%
218 1
4.0%
192 2
8.0%
154 1
4.0%
139 1
4.0%
118 1
4.0%
116 1
4.0%

50대
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108.8
Minimum2
Maximum673
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T10:13:07.036851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile7.4
Q153
median76
Q3116
95-th percentile237.4
Maximum673
Range671
Interquartile range (IQR)63

Descriptive statistics

Standard deviation131.91822
Coefficient of variation (CV)1.2124836
Kurtosis14.654757
Mean108.8
Median Absolute Deviation (MAD)32
Skewness3.5222878
Sum2720
Variance17402.417
MonotonicityNot monotonic
2023-12-12T10:13:07.196251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
63 2
 
8.0%
120 1
 
4.0%
87 1
 
4.0%
249 1
 
4.0%
21 1
 
4.0%
67 1
 
4.0%
78 1
 
4.0%
53 1
 
4.0%
191 1
 
4.0%
76 1
 
4.0%
Other values (14) 14
56.0%
ValueCountFrequency (%)
2 1
4.0%
4 1
4.0%
21 1
4.0%
31 1
4.0%
36 1
4.0%
44 1
4.0%
53 1
4.0%
55 1
4.0%
58 1
4.0%
63 2
8.0%
ValueCountFrequency (%)
673 1
4.0%
249 1
4.0%
191 1
4.0%
188 1
4.0%
161 1
4.0%
120 1
4.0%
116 1
4.0%
104 1
4.0%
103 1
4.0%
87 1
4.0%

60대 이상
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct22
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.96
Minimum0
Maximum182
Zeros1
Zeros (%)4.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T10:13:07.321809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.2
Q17
median19
Q340
95-th percentile80.4
Maximum182
Range182
Interquartile range (IQR)33

Descriptive statistics

Standard deviation38.244259
Coefficient of variation (CV)1.2765106
Kurtosis10.328253
Mean29.96
Median Absolute Deviation (MAD)16
Skewness2.8821704
Sum749
Variance1462.6233
MonotonicityNot monotonic
2023-12-12T10:13:07.462907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
2 2
 
8.0%
23 2
 
8.0%
46 2
 
8.0%
62 1
 
4.0%
35 1
 
4.0%
182 1
 
4.0%
32 1
 
4.0%
49 1
 
4.0%
9 1
 
4.0%
40 1
 
4.0%
Other values (12) 12
48.0%
ValueCountFrequency (%)
0 1
4.0%
1 1
4.0%
2 2
8.0%
3 1
4.0%
5 1
4.0%
7 1
4.0%
9 1
4.0%
10 1
4.0%
14 1
4.0%
16 1
4.0%
ValueCountFrequency (%)
182 1
4.0%
85 1
4.0%
62 1
4.0%
49 1
4.0%
46 2
8.0%
40 1
4.0%
35 1
4.0%
32 1
4.0%
23 2
8.0%
20 1
4.0%

Interactions

2023-12-12T10:13:04.386752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:01.960052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:02.833364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.355736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.831441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.487117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:02.057961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:02.919612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.446943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.949302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.645369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:02.521399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.035855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.555673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.077365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.782286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:02.629044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.166796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.650076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.171676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.901787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:02.744525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.261241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:03.738745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:13:04.279520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:13:07.586045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명20대 이하30대40대50대60대 이상
기관명1.0001.0001.0001.0001.0001.000
20대 이하1.0001.0000.8530.7720.8350.820
30대1.0000.8531.0000.7720.7820.796
40대1.0000.7720.7721.0000.9570.744
50대1.0000.8350.7820.9571.0000.795
60대 이상1.0000.8200.7960.7440.7951.000
2023-12-12T10:13:07.727543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
20대 이하30대40대50대60대 이상
20대 이하1.0000.8370.6780.5060.552
30대0.8371.0000.8310.6010.608
40대0.6780.8311.0000.7790.679
50대0.5060.6010.7791.0000.750
60대 이상0.5520.6080.6790.7501.000

Missing values

2023-12-12T10:13:05.058790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:13:05.197426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명20대 이하30대40대50대60대 이상
0한국과학기술연구원219721812062
1녹색기술센터2271220
2한국기초과학지원연구원12469637
3국가핵융합연구소02891552
4한국천문연구원020835814
5한국생명공학연구원2421057723
6한국과학기술정보연구원14315410310
7한국한의학연구원03581315
8한국생산기술연구원2335735518819
9한국전자통신연구원4233773267385
기관명20대 이하30대40대50대60대 이상
15세계김치연구소1422341
16한국지질자원연구원6799610446
17한국기계연구원101221167640
18재료연구소39371639
19한국항공우주연구원1315931619149
20한국에너지기술연구원31111395332
21한국전기연구원665987823
22한국화학연구원61261186746
23안전성평가연구소48072212
24한국원자력연구원30322336249182