Overview

Dataset statistics

Number of variables4
Number of observations469
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.2 KiB
Average record size in memory33.3 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description기술원에서 제공하는 환경기술개발목록(오염예방, 복원/재생, 오염방지 등에 분류에 대한 기술명, 개발자(기관))에 대한 정보
Author한국환경산업기술원
URLhttps://www.data.go.kr/data/15068320/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:01:29.294682
Analysis finished2023-12-11 23:01:29.960471
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct469
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean235
Minimum1
Maximum469
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T08:01:30.036100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.4
Q1118
median235
Q3352
95-th percentile445.6
Maximum469
Range468
Interquartile range (IQR)234

Descriptive statistics

Standard deviation135.5329
Coefficient of variation (CV)0.57673574
Kurtosis-1.2
Mean235
Median Absolute Deviation (MAD)117
Skewness0
Sum110215
Variance18369.167
MonotonicityStrictly increasing
2023-12-12T08:01:30.196167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
310 1
 
0.2%
322 1
 
0.2%
321 1
 
0.2%
320 1
 
0.2%
319 1
 
0.2%
318 1
 
0.2%
317 1
 
0.2%
316 1
 
0.2%
315 1
 
0.2%
Other values (459) 459
97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
469 1
0.2%
468 1
0.2%
467 1
0.2%
466 1
0.2%
465 1
0.2%
464 1
0.2%
463 1
0.2%
462 1
0.2%
461 1
0.2%
460 1
0.2%

분류
Categorical

Distinct6
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
오염방지
203 
오염예방
113 
복원/재생
75 
환경경영
38 
측정평가
35 

Length

Max length6
Median length4
Mean length4.1812367
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row오염예방
2nd row오염예방
3rd row복원/재생
4th row복원/재생
5th row오염방지

Common Values

ValueCountFrequency (%)
오염방지 203
43.3%
오염예방 113
24.1%
복원/재생 75
 
16.0%
환경경영 38
 
8.1%
측정평가 35
 
7.5%
지구환경보전 5
 
1.1%

Length

2023-12-12T08:01:30.385891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:01:30.499609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
오염방지 203
43.3%
오염예방 113
24.1%
복원/재생 75
 
16.0%
환경경영 38
 
8.1%
측정평가 35
 
7.5%
지구환경보전 5
 
1.1%
Distinct441
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-12T08:01:30.771165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length39
Mean length21.014925
Min length4

Characters and Unicode

Total characters9856
Distinct characters491
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique416 ?
Unique (%)88.7%

Sample

1st row생물학적 처리를 이용한 골판지 원지 제조공정 개설 기술
2nd row가정용 정수기 및 산업용에 사용되는 수처리용 중공섬유막 개발
3rd rowPCB Etching 폐액으로부터 산화구리 회수
4th row감량가공공정 폐기물로부터 TPA회수 실용화
5th row재순환형 열건조/소각시스템
ValueCountFrequency (%)
92
 
4.2%
이용한 65
 
3.0%
개발 52
 
2.4%
기술 46
 
2.1%
제조기술 23
 
1.0%
위한 20
 
0.9%
시스템 20
 
0.9%
장치 17
 
0.8%
의한 13
 
0.6%
설계기술 12
 
0.5%
Other values (1373) 1842
83.7%
2023-12-12T08:01:31.207391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1733
 
17.6%
352
 
3.6%
198
 
2.0%
168
 
1.7%
162
 
1.6%
149
 
1.5%
142
 
1.4%
140
 
1.4%
118
 
1.2%
110
 
1.1%
Other values (481) 6584
66.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7328
74.4%
Space Separator 1733
 
17.6%
Lowercase Letter 431
 
4.4%
Uppercase Letter 218
 
2.2%
Other Punctuation 38
 
0.4%
Decimal Number 32
 
0.3%
Close Punctuation 30
 
0.3%
Open Punctuation 30
 
0.3%
Dash Punctuation 14
 
0.1%
Other Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
352
 
4.8%
198
 
2.7%
168
 
2.3%
162
 
2.2%
149
 
2.0%
142
 
1.9%
140
 
1.9%
118
 
1.6%
110
 
1.5%
107
 
1.5%
Other values (415) 5682
77.5%
Lowercase Letter
ValueCountFrequency (%)
e 62
14.4%
t 45
10.4%
l 39
9.0%
a 34
 
7.9%
i 32
 
7.4%
o 30
 
7.0%
r 30
 
7.0%
n 27
 
6.3%
c 22
 
5.1%
s 15
 
3.5%
Other values (15) 95
22.0%
Uppercase Letter
ValueCountFrequency (%)
C 23
 
10.6%
S 18
 
8.3%
P 17
 
7.8%
T 17
 
7.8%
B 14
 
6.4%
O 14
 
6.4%
F 13
 
6.0%
N 11
 
5.0%
E 11
 
5.0%
R 10
 
4.6%
Other values (15) 70
32.1%
Other Punctuation
ValueCountFrequency (%)
, 20
52.6%
/ 14
36.8%
. 2
 
5.3%
: 1
 
2.6%
· 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
2 11
34.4%
0 8
25.0%
3 7
21.9%
1 6
18.8%
Close Punctuation
ValueCountFrequency (%)
) 29
96.7%
] 1
 
3.3%
Open Punctuation
ValueCountFrequency (%)
( 29
96.7%
[ 1
 
3.3%
Space Separator
ValueCountFrequency (%)
1733
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7326
74.3%
Common 1879
 
19.1%
Latin 649
 
6.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
352
 
4.8%
198
 
2.7%
168
 
2.3%
162
 
2.2%
149
 
2.0%
142
 
1.9%
140
 
1.9%
118
 
1.6%
110
 
1.5%
107
 
1.5%
Other values (413) 5680
77.5%
Latin
ValueCountFrequency (%)
e 62
 
9.6%
t 45
 
6.9%
l 39
 
6.0%
a 34
 
5.2%
i 32
 
4.9%
o 30
 
4.6%
r 30
 
4.6%
n 27
 
4.2%
C 23
 
3.5%
c 22
 
3.4%
Other values (40) 305
47.0%
Common
ValueCountFrequency (%)
1733
92.2%
) 29
 
1.5%
( 29
 
1.5%
, 20
 
1.1%
/ 14
 
0.7%
- 14
 
0.7%
2 11
 
0.6%
0 8
 
0.4%
3 7
 
0.4%
1 6
 
0.3%
Other values (6) 8
 
0.4%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7326
74.3%
ASCII 2525
 
25.6%
None 3
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1733
68.6%
e 62
 
2.5%
t 45
 
1.8%
l 39
 
1.5%
a 34
 
1.3%
i 32
 
1.3%
o 30
 
1.2%
r 30
 
1.2%
) 29
 
1.1%
( 29
 
1.1%
Other values (54) 462
 
18.3%
Hangul
ValueCountFrequency (%)
352
 
4.8%
198
 
2.7%
168
 
2.3%
162
 
2.2%
149
 
2.0%
142
 
1.9%
140
 
1.9%
118
 
1.6%
110
 
1.5%
107
 
1.5%
Other values (413) 5680
77.5%
None
ValueCountFrequency (%)
2
66.7%
· 1
33.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct126
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-12T08:01:31.508169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length14
Mean length6.5138593
Min length1

Characters and Unicode

Total characters3055
Distinct characters205
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)17.9%

Sample

1st row한국화학연구원
2nd row한국화학연구원
3rd row한국화학연구원
4th row한국화학연구원
5th row한국기계연구원
ValueCountFrequency (%)
한국화학연구원 72
 
15.0%
한국에너지기술연구소 62
 
12.9%
포항산업과학연구원 29
 
6.0%
한국기계연구원 25
 
5.2%
25
 
5.2%
한국지질자원연구원 19
 
4.0%
한국표준과학연구원 12
 
2.5%
이방희 12
 
2.5%
한국과학기술원 12
 
2.5%
한국생산기술연구원 10
 
2.1%
Other values (122) 202
42.1%
2023-12-12T08:01:31.924236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
253
 
8.3%
250
 
8.2%
238
 
7.8%
231
 
7.6%
225
 
7.4%
149
 
4.9%
146
 
4.8%
101
 
3.3%
95
 
3.1%
73
 
2.4%
Other values (195) 1294
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2867
93.8%
Uppercase Letter 50
 
1.6%
Lowercase Letter 40
 
1.3%
Close Punctuation 27
 
0.9%
Dash Punctuation 26
 
0.9%
Open Punctuation 25
 
0.8%
Space Separator 11
 
0.4%
Other Punctuation 8
 
0.3%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
253
 
8.8%
250
 
8.7%
238
 
8.3%
231
 
8.1%
225
 
7.8%
149
 
5.2%
146
 
5.1%
101
 
3.5%
95
 
3.3%
73
 
2.5%
Other values (161) 1106
38.6%
Uppercase Letter
ValueCountFrequency (%)
L 8
16.0%
W 6
12.0%
O 4
 
8.0%
R 4
 
8.0%
T 4
 
8.0%
E 3
 
6.0%
A 3
 
6.0%
C 3
 
6.0%
I 3
 
6.0%
F 2
 
4.0%
Other values (6) 10
20.0%
Lowercase Letter
ValueCountFrequency (%)
a 6
15.0%
n 6
15.0%
t 6
15.0%
o 4
10.0%
e 4
10.0%
r 2
 
5.0%
i 2
 
5.0%
l 2
 
5.0%
d 2
 
5.0%
b 2
 
5.0%
Other values (2) 4
10.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2868
93.9%
Common 97
 
3.2%
Latin 90
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
253
 
8.8%
250
 
8.7%
238
 
8.3%
231
 
8.1%
225
 
7.8%
149
 
5.2%
146
 
5.1%
101
 
3.5%
95
 
3.3%
73
 
2.5%
Other values (162) 1107
38.6%
Latin
ValueCountFrequency (%)
L 8
 
8.9%
a 6
 
6.7%
n 6
 
6.7%
t 6
 
6.7%
W 6
 
6.7%
o 4
 
4.4%
O 4
 
4.4%
R 4
 
4.4%
T 4
 
4.4%
e 4
 
4.4%
Other values (18) 38
42.2%
Common
ValueCountFrequency (%)
) 27
27.8%
- 26
26.8%
( 25
25.8%
11
11.3%
. 8
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2867
93.8%
ASCII 187
 
6.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
253
 
8.8%
250
 
8.7%
238
 
8.3%
231
 
8.1%
225
 
7.8%
149
 
5.2%
146
 
5.1%
101
 
3.5%
95
 
3.3%
73
 
2.5%
Other values (161) 1106
38.6%
ASCII
ValueCountFrequency (%)
) 27
14.4%
- 26
13.9%
( 25
13.4%
11
 
5.9%
. 8
 
4.3%
L 8
 
4.3%
a 6
 
3.2%
n 6
 
3.2%
t 6
 
3.2%
W 6
 
3.2%
Other values (23) 58
31.0%
None
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T08:01:29.675171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:01:32.025159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호분류
번호1.0000.444
분류0.4441.000
2023-12-12T08:01:32.094153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호분류
번호1.0000.251
분류0.2511.000

Missing values

2023-12-12T08:01:29.829764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:01:29.928143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호분류기술명개발자
01오염예방생물학적 처리를 이용한 골판지 원지 제조공정 개설 기술한국화학연구원
12오염예방가정용 정수기 및 산업용에 사용되는 수처리용 중공섬유막 개발한국화학연구원
23복원/재생PCB Etching 폐액으로부터 산화구리 회수한국화학연구원
34복원/재생감량가공공정 폐기물로부터 TPA회수 실용화한국화학연구원
45오염방지재순환형 열건조/소각시스템한국기계연구원
56오염방지무한궤도식 연속소각 기술한국기계연구원
67오염예방흡차음성능 평가 및 설계기술한국기계연구원
78오염방지고체음 차단 기술한국기계연구원
89오염방지배가스로부터 SOx와 NOx의 동시 제거기술한국에너지기술연구소
910오염예방분리형 고성능 콤팩트 가정용 콘덴싱 가스보일러 및 저공해 가스버너 설계기술한국에너지기술연구소
번호분류기술명개발자
459460오염예방연수 및 정수 통합시스템유영범
460461오염예방가변형 무동력 하수토출시스템김연권
461462오염방지수질정화용 취송류 하강유도장치신재기
462463측정평가수질측정장치김윤석
463464측정평가강수량 측정장치이경우
464465오염예방응집지 스컴제거수단과 용존공기부상수단을 구비한 방류수 처리장치권순범
465466오염예방정수장치 필터간 수압분배구조임재림
466467오염예방연속회분식반응조에서 질산화반응과 연계한 포기 동력제어 장치김지연
467468오염예방슬러지 평탄 기능을 갖는 스크래퍼, 이를 이용하는 슬러지 스크래핑 장치박덕준
468469오염예방교차흐름형 혼합장치황재우