Overview

Dataset statistics

Number of variables4
Number of observations539
Missing cells539
Missing cells (%)25.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.5 KiB
Average record size in memory33.2 B

Variable types

Text3
Unsupported1

Dataset

Description산림과학기술정보서비스(FTIS) 시스템 협동연구기관에 대한 데이터로 협동연구기관, 연구책임자 등의 정보를 제공합니다.
Author산림청
URLhttps://www.data.go.kr/data/15091872/fileData.do

Alerts

Unnamed: 3 has 539 (100.0%) missing valuesMissing
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 03:39:23.643198
Analysis finished2023-12-12 03:39:24.060875
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct351
Distinct (%)65.1%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T12:39:24.309633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters6468
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique205 ?
Unique (%)38.0%

Sample

1st rowEC_000000006
2nd rowEC_000000056
3rd rowEC_000000006
4th rowEC_000000007
5th rowEC_000000007
ValueCountFrequency (%)
ec_000000413 5
 
0.9%
ec_000000600 4
 
0.7%
ec_000000407 4
 
0.7%
ec_000000523 4
 
0.7%
ec_000000598 4
 
0.7%
ec_000000283 3
 
0.6%
ec_000000292 3
 
0.6%
ec_000000162 3
 
0.6%
ec_000000294 3
 
0.6%
ec_000000421 3
 
0.6%
Other values (341) 503
93.3%
2023-12-12T12:39:24.863404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3429
53.0%
E 539
 
8.3%
C 539
 
8.3%
_ 539
 
8.3%
5 227
 
3.5%
2 207
 
3.2%
4 202
 
3.1%
1 198
 
3.1%
3 155
 
2.4%
6 114
 
1.8%
Other values (3) 319
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4851
75.0%
Uppercase Letter 1078
 
16.7%
Connector Punctuation 539
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3429
70.7%
5 227
 
4.7%
2 207
 
4.3%
4 202
 
4.2%
1 198
 
4.1%
3 155
 
3.2%
6 114
 
2.4%
8 108
 
2.2%
7 107
 
2.2%
9 104
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
E 539
50.0%
C 539
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 539
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5390
83.3%
Latin 1078
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3429
63.6%
_ 539
 
10.0%
5 227
 
4.2%
2 207
 
3.8%
4 202
 
3.7%
1 198
 
3.7%
3 155
 
2.9%
6 114
 
2.1%
8 108
 
2.0%
7 107
 
2.0%
Latin
ValueCountFrequency (%)
E 539
50.0%
C 539
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6468
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3429
53.0%
E 539
 
8.3%
C 539
 
8.3%
_ 539
 
8.3%
5 227
 
3.5%
2 207
 
3.2%
4 202
 
3.1%
1 198
 
3.1%
3 155
 
2.4%
6 114
 
1.8%
Other values (3) 319
 
4.9%
Distinct206
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T12:39:25.185724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length10.111317
Min length3

Characters and Unicode

Total characters5450
Distinct characters261
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)19.1%

Sample

1st row한국한의학연구원
2nd row국민대학교
3rd row한국임업진흥원
4th row한국임업진흥원
5th row한국한의학연구원
ValueCountFrequency (%)
산학협력단 136
 
18.3%
서울대학교 32
 
4.3%
주식회사 26
 
3.5%
국립산림과학원 20
 
2.7%
전라남도산림자원연구소 18
 
2.4%
강원대학교산학협력단 15
 
2.0%
경상대학교 13
 
1.7%
경북대학교 12
 
1.6%
충남대학교 11
 
1.5%
사단법인 10
 
1.3%
Other values (208) 450
60.6%
2023-12-12T12:39:25.657655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
465
 
8.5%
293
 
5.4%
255
 
4.7%
233
 
4.3%
218
 
4.0%
210
 
3.9%
204
 
3.7%
202
 
3.7%
( 179
 
3.3%
) 179
 
3.3%
Other values (251) 3012
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4833
88.7%
Space Separator 204
 
3.7%
Open Punctuation 184
 
3.4%
Close Punctuation 184
 
3.4%
Uppercase Letter 45
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
465
 
9.6%
293
 
6.1%
255
 
5.3%
233
 
4.8%
218
 
4.5%
210
 
4.3%
202
 
4.2%
172
 
3.6%
139
 
2.9%
97
 
2.0%
Other values (239) 2549
52.7%
Uppercase Letter
ValueCountFrequency (%)
O 11
24.4%
T 10
22.2%
I 10
22.2%
K 5
11.1%
Z 3
 
6.7%
N 3
 
6.7%
H 3
 
6.7%
Open Punctuation
ValueCountFrequency (%)
( 179
97.3%
[ 5
 
2.7%
Close Punctuation
ValueCountFrequency (%)
) 179
97.3%
] 5
 
2.7%
Space Separator
ValueCountFrequency (%)
204
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4833
88.7%
Common 572
 
10.5%
Latin 45
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
465
 
9.6%
293
 
6.1%
255
 
5.3%
233
 
4.8%
218
 
4.5%
210
 
4.3%
202
 
4.2%
172
 
3.6%
139
 
2.9%
97
 
2.0%
Other values (239) 2549
52.7%
Latin
ValueCountFrequency (%)
O 11
24.4%
T 10
22.2%
I 10
22.2%
K 5
11.1%
Z 3
 
6.7%
N 3
 
6.7%
H 3
 
6.7%
Common
ValueCountFrequency (%)
204
35.7%
( 179
31.3%
) 179
31.3%
] 5
 
0.9%
[ 5
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4833
88.7%
ASCII 617
 
11.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
465
 
9.6%
293
 
6.1%
255
 
5.3%
233
 
4.8%
218
 
4.5%
210
 
4.3%
202
 
4.2%
172
 
3.6%
139
 
2.9%
97
 
2.0%
Other values (239) 2549
52.7%
ASCII
ValueCountFrequency (%)
204
33.1%
( 179
29.0%
) 179
29.0%
O 11
 
1.8%
T 10
 
1.6%
I 10
 
1.6%
] 5
 
0.8%
[ 5
 
0.8%
K 5
 
0.8%
Z 3
 
0.5%
Other values (2) 6
 
1.0%
Distinct248
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T12:39:26.137581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.0241187
Min length2

Characters and Unicode

Total characters1630
Distinct characters125
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique125 ?
Unique (%)23.2%

Sample

1st row한국***연구원
2nd row신*용
3rd row김*정
4th row김*정
5th row한국***연구원
ValueCountFrequency (%)
김*진 11
 
2.0%
김*준 10
 
1.9%
이*원 9
 
1.7%
김*식 8
 
1.5%
박*석 7
 
1.3%
김*훈 7
 
1.3%
김*영 6
 
1.1%
김*수 6
 
1.1%
한*호 6
 
1.1%
전*화 6
 
1.1%
Other values (238) 463
85.9%
2023-12-12T12:39:26.772815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 545
33.4%
118
 
7.2%
86
 
5.3%
40
 
2.5%
33
 
2.0%
32
 
2.0%
27
 
1.7%
27
 
1.7%
27
 
1.7%
26
 
1.6%
Other values (115) 669
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1084
66.5%
Other Punctuation 545
33.4%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
118
 
10.9%
86
 
7.9%
40
 
3.7%
33
 
3.0%
32
 
3.0%
27
 
2.5%
27
 
2.5%
27
 
2.5%
26
 
2.4%
26
 
2.4%
Other values (113) 642
59.2%
Other Punctuation
ValueCountFrequency (%)
* 545
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1084
66.5%
Common 546
33.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
118
 
10.9%
86
 
7.9%
40
 
3.7%
33
 
3.0%
32
 
3.0%
27
 
2.5%
27
 
2.5%
27
 
2.5%
26
 
2.4%
26
 
2.4%
Other values (113) 642
59.2%
Common
ValueCountFrequency (%)
* 545
99.8%
1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1084
66.5%
ASCII 546
33.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 545
99.8%
1
 
0.2%
Hangul
ValueCountFrequency (%)
118
 
10.9%
86
 
7.9%
40
 
3.7%
33
 
3.0%
32
 
3.0%
27
 
2.5%
27
 
2.5%
27
 
2.5%
26
 
2.4%
26
 
2.4%
Other values (113) 642
59.2%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing539
Missing (%)100.0%
Memory size4.9 KiB

Missing values

2023-12-12T12:39:23.908861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:39:24.018729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

전자협약번호협동연구기관명협동 연구책임자명Unnamed: 3
0EC_000000006한국한의학연구원한국***연구원<NA>
1EC_000000056국민대학교신*용<NA>
2EC_000000006한국임업진흥원김*정<NA>
3EC_000000007한국임업진흥원김*정<NA>
4EC_000000007한국한의학연구원한국***연구원<NA>
5EC_000000009한국임업진흥원김*정<NA>
6EC_000000009한국한의학연구원한국***연구원<NA>
7EC_000000056고려대학교산학협력단전*우<NA>
8EC_000000056국립산림과학원임*수<NA>
9EC_000000065(학교)숭실대학교김*민<NA>
전자협약번호협동연구기관명협동 연구책임자명Unnamed: 3
529EC_000000597(주)디엠스튜디오권*준<NA>
530EC_000000598한국해양수산개발원이*아<NA>
531EC_000000598(주)아라종합기술김*은<NA>
532EC_000000598(주)마케시안고*원<NA>
533EC_000000598고려대학교산학협력단윤*철<NA>
534EC_000000599(재)한국화학융합시험연구원조*훈<NA>
535EC_000000599(주)해랑기술정책연구소백*규<NA>
536EC_000000600한국해양수산개발원홍*원<NA>
537EC_000000600환동해산업연구원우*희<NA>
538EC_000000600(주)해랑기술정책연구소백*규<NA>