Overview

Dataset statistics

Number of variables7
Number of observations86
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory58.5 B

Variable types

Numeric1
Categorical1
Text3
DateTime2

Dataset

Description농림축산분야 녹색인증 기술 및 사업 현황 제공 목록 : 녹색인증제 유효 녹색기술 현황, 녹색인증제 유효 녹색기술제품 현황, 녹색인증제 유효 녹색전문기업 현황
URLhttps://www.data.go.kr/data/3078079/fileData.do

Alerts

신규구분 is highly imbalanced (52.3%)Imbalance
순번 has unique valuesUnique
녹색기술인증번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:20:42.426248
Analysis finished2023-12-12 16:20:43.944610
Duration1.52 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.5
Minimum1
Maximum86
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-13T01:20:44.031609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.25
Q122.25
median43.5
Q364.75
95-th percentile81.75
Maximum86
Range85
Interquartile range (IQR)42.5

Descriptive statistics

Standard deviation24.969982
Coefficient of variation (CV)0.57402257
Kurtosis-1.2
Mean43.5
Median Absolute Deviation (MAD)21.5
Skewness0
Sum3741
Variance623.5
MonotonicityStrictly increasing
2023-12-13T01:20:44.246287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
56 1
 
1.2%
64 1
 
1.2%
63 1
 
1.2%
62 1
 
1.2%
61 1
 
1.2%
60 1
 
1.2%
59 1
 
1.2%
58 1
 
1.2%
57 1
 
1.2%
Other values (76) 76
88.4%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
ValueCountFrequency (%)
86 1
1.2%
85 1
1.2%
84 1
1.2%
83 1
1.2%
82 1
1.2%
81 1
1.2%
80 1
1.2%
79 1
1.2%
78 1
1.2%
77 1
1.2%

신규구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size820.0 B
신규
70 
연장
15 
신규
 
1

Length

Max length3
Median length2
Mean length2.0116279
Min length2

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row연장
2nd row연장
3rd row연장
4th row연장
5th row신규

Common Values

ValueCountFrequency (%)
신규 70
81.4%
연장 15
 
17.4%
신규 1
 
1.2%

Length

2023-12-13T01:20:44.399709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:20:44.559715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규 71
82.6%
연장 15
 
17.4%
Distinct83
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size820.0 B
2023-12-13T01:20:44.823523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length37
Mean length29.069767
Min length11

Characters and Unicode

Total characters2500
Distinct characters312
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)95.3%

Sample

1st row직압출 성형방식의 목재 플라스틱 복합재(합성목재) 제조기술
2nd row초고속 고액분리 장치를 이용한 재이용수 처리기술
3rd row에탄올 베이스 잉크 및 저심도 제판 인쇄 기술을 이용한 포장재 제조기술
4th row당귀, 천궁, 작약으로부터 추출된 건강기능식품 원료 제조기술 (헤모힘당귀등혼합추출물)
5th row축산유기자원을 활용한 바이오매스플라스틱 제조기술
ValueCountFrequency (%)
기술 39
 
6.4%
제조기술 32
 
5.3%
이용한 30
 
4.9%
포장재 18
 
3.0%
16
 
2.6%
제조 14
 
2.3%
유해성 9
 
1.5%
저감 9
 
1.5%
에탄올 8
 
1.3%
잉크를 7
 
1.2%
Other values (361) 426
70.1%
2023-12-13T01:20:45.280636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
534
 
21.4%
117
 
4.7%
86
 
3.4%
79
 
3.2%
70
 
2.8%
55
 
2.2%
53
 
2.1%
52
 
2.1%
44
 
1.8%
39
 
1.6%
Other values (302) 1371
54.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1868
74.7%
Space Separator 534
 
21.4%
Lowercase Letter 50
 
2.0%
Uppercase Letter 30
 
1.2%
Other Punctuation 8
 
0.3%
Open Punctuation 5
 
0.2%
Close Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
117
 
6.3%
86
 
4.6%
79
 
4.2%
70
 
3.7%
55
 
2.9%
53
 
2.8%
52
 
2.8%
44
 
2.4%
39
 
2.1%
30
 
1.6%
Other values (267) 1243
66.5%
Lowercase Letter
ValueCountFrequency (%)
l 6
12.0%
e 6
12.0%
a 5
10.0%
d 4
8.0%
y 4
8.0%
c 4
8.0%
o 4
8.0%
r 3
 
6.0%
i 3
 
6.0%
h 2
 
4.0%
Other values (7) 9
18.0%
Uppercase Letter
ValueCountFrequency (%)
P 10
33.3%
C 3
 
10.0%
E 3
 
10.0%
L 2
 
6.7%
A 2
 
6.7%
M 2
 
6.7%
F 2
 
6.7%
H 1
 
3.3%
O 1
 
3.3%
G 1
 
3.3%
Other values (3) 3
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 7
87.5%
/ 1
 
12.5%
Space Separator
ValueCountFrequency (%)
534
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1866
74.6%
Common 552
 
22.1%
Latin 80
 
3.2%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
117
 
6.3%
86
 
4.6%
79
 
4.2%
70
 
3.8%
55
 
2.9%
53
 
2.8%
52
 
2.8%
44
 
2.4%
39
 
2.1%
30
 
1.6%
Other values (265) 1241
66.5%
Latin
ValueCountFrequency (%)
P 10
 
12.5%
l 6
 
7.5%
e 6
 
7.5%
a 5
 
6.2%
d 4
 
5.0%
y 4
 
5.0%
c 4
 
5.0%
o 4
 
5.0%
C 3
 
3.8%
E 3
 
3.8%
Other values (20) 31
38.8%
Common
ValueCountFrequency (%)
534
96.7%
, 7
 
1.3%
( 5
 
0.9%
) 5
 
0.9%
/ 1
 
0.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1866
74.6%
ASCII 632
 
25.3%
CJK 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
534
84.5%
P 10
 
1.6%
, 7
 
1.1%
l 6
 
0.9%
e 6
 
0.9%
( 5
 
0.8%
a 5
 
0.8%
) 5
 
0.8%
d 4
 
0.6%
y 4
 
0.6%
Other values (25) 46
 
7.3%
Hangul
ValueCountFrequency (%)
117
 
6.3%
86
 
4.6%
79
 
4.2%
70
 
3.8%
55
 
2.9%
53
 
2.8%
52
 
2.8%
44
 
2.4%
39
 
2.1%
30
 
1.6%
Other values (265) 1241
66.5%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct81
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size820.0 B
2023-12-13T01:20:45.539568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length6.7674419
Min length2

Characters and Unicode

Total characters582
Distinct characters165
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)88.4%

Sample

1st row㈜본우드
2nd row㈜블루비에스
3rd row율촌화학㈜
4th row콜마비앤에이치㈜
5th row주식회사 더자연
ValueCountFrequency (%)
주식회사 19
 
17.0%
율촌화학㈜ 2
 
1.8%
아모레퍼시픽 2
 
1.8%
에스피씨팩 2
 
1.8%
㈜유상 2
 
1.8%
㈜뉴트렉스테크놀러지 2
 
1.8%
강청 1
 
0.9%
㈜본우드 1
 
0.9%
㈜키랜드디앤씨 1
 
0.9%
진성냉기산업㈜ 1
 
0.9%
Other values (79) 79
70.5%
2023-12-13T01:20:45.954850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
6.7%
31
 
5.3%
28
 
4.8%
23
 
4.0%
22
 
3.8%
20
 
3.4%
18
 
3.1%
17
 
2.9%
16
 
2.7%
( 11
 
1.9%
Other values (155) 357
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 488
83.8%
Other Symbol 39
 
6.7%
Space Separator 28
 
4.8%
Open Punctuation 11
 
1.9%
Close Punctuation 11
 
1.9%
Uppercase Letter 4
 
0.7%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
6.4%
23
 
4.7%
22
 
4.5%
20
 
4.1%
18
 
3.7%
17
 
3.5%
16
 
3.3%
9
 
1.8%
8
 
1.6%
8
 
1.6%
Other values (147) 316
64.8%
Uppercase Letter
ValueCountFrequency (%)
P 2
50.0%
J 1
25.0%
C 1
25.0%
Other Symbol
ValueCountFrequency (%)
39
100.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 527
90.5%
Common 51
 
8.8%
Latin 4
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
7.4%
31
 
5.9%
23
 
4.4%
22
 
4.2%
20
 
3.8%
18
 
3.4%
17
 
3.2%
16
 
3.0%
9
 
1.7%
8
 
1.5%
Other values (148) 324
61.5%
Common
ValueCountFrequency (%)
28
54.9%
( 11
 
21.6%
) 11
 
21.6%
& 1
 
2.0%
Latin
ValueCountFrequency (%)
P 2
50.0%
J 1
25.0%
C 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 488
83.8%
ASCII 55
 
9.5%
None 39
 
6.7%

Most frequent character per block

None
ValueCountFrequency (%)
39
100.0%
Hangul
ValueCountFrequency (%)
31
 
6.4%
23
 
4.7%
22
 
4.5%
20
 
4.1%
18
 
3.7%
17
 
3.5%
16
 
3.3%
9
 
1.8%
8
 
1.6%
8
 
1.6%
Other values (147) 316
64.8%
ASCII
ValueCountFrequency (%)
28
50.9%
( 11
 
20.0%
) 11
 
20.0%
P 2
 
3.6%
J 1
 
1.8%
C 1
 
1.8%
& 1
 
1.8%
Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size820.0 B
2023-12-13T01:20:46.243515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length11.186047
Min length11

Characters and Unicode

Total characters962
Distinct characters14
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)100.0%

Sample

1st rowGT-19-00594
2nd rowGT-20-00822
3rd rowGT-20-00835
4th rowGT-20-00876
5th rowGT-20-00900
ValueCountFrequency (%)
gt-23 2
 
2.3%
gt-19-00594 1
 
1.1%
gt-22-01530 1
 
1.1%
gt-22-01539 1
 
1.1%
gt-19-00753 1
 
1.1%
gt-22-01493 1
 
1.1%
gt-22-01496 1
 
1.1%
gt-22-01503 1
 
1.1%
gt-22-01506 1
 
1.1%
gt-22-01504 1
 
1.1%
Other values (77) 77
87.5%
2023-12-13T01:20:46.655016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 172
17.9%
0 153
15.9%
2 132
13.7%
1 112
11.6%
G 86
8.9%
T 86
8.9%
3 43
 
4.5%
5 39
 
4.1%
7 28
 
2.9%
9 25
 
2.6%
Other values (4) 86
8.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 602
62.6%
Dash Punctuation 172
 
17.9%
Uppercase Letter 172
 
17.9%
Space Separator 16
 
1.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 153
25.4%
2 132
21.9%
1 112
18.6%
3 43
 
7.1%
5 39
 
6.5%
7 28
 
4.7%
9 25
 
4.2%
4 24
 
4.0%
8 24
 
4.0%
6 22
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
G 86
50.0%
T 86
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 172
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 790
82.1%
Latin 172
 
17.9%

Most frequent character per script

Common
ValueCountFrequency (%)
- 172
21.8%
0 153
19.4%
2 132
16.7%
1 112
14.2%
3 43
 
5.4%
5 39
 
4.9%
7 28
 
3.5%
9 25
 
3.2%
4 24
 
3.0%
8 24
 
3.0%
Other values (2) 38
 
4.8%
Latin
ValueCountFrequency (%)
G 86
50.0%
T 86
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 962
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 172
17.9%
0 153
15.9%
2 132
13.7%
1 112
11.6%
G 86
8.9%
T 86
8.9%
3 43
 
4.5%
5 39
 
4.1%
7 28
 
2.9%
9 25
 
2.6%
Other values (4) 86
8.9%
Distinct34
Distinct (%)39.5%
Missing0
Missing (%)0.0%
Memory size820.0 B
Minimum2017-12-14 00:00:00
Maximum2023-06-15 00:00:00
2023-12-13T01:20:46.811705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:20:46.945107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
Distinct34
Distinct (%)39.5%
Missing0
Missing (%)0.0%
Memory size820.0 B
Minimum2023-09-02 00:00:00
Maximum2026-07-15 00:00:00
2023-12-13T01:20:47.069405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:20:47.203172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

Interactions

2023-12-13T01:20:43.484229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:20:47.292590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번신규구분기술명(국문)신청기관명녹색기술인증번호인증여부확정일자유효기간만료일자
순번1.0000.3800.8090.6901.0000.9750.975
신규구분0.3801.0001.0000.6151.0000.8420.842
기술명(국문)0.8091.0001.0000.9961.0000.9800.980
신청기관명0.6900.6150.9961.0001.0000.6390.639
녹색기술인증번호1.0001.0001.0001.0001.0001.0001.000
인증여부확정일자0.9750.8420.9800.6391.0001.0001.000
유효기간만료일자0.9750.8420.9800.6391.0001.0001.000
2023-12-13T01:20:47.422751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번신규구분
순번1.0000.253
신규구분0.2531.000

Missing values

2023-12-13T01:20:43.687690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:20:43.894493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번신규구분기술명(국문)신청기관명녹색기술인증번호인증여부확정일자유효기간만료일자
01연장직압출 성형방식의 목재 플라스틱 복합재(합성목재) 제조기술㈜본우드GT-19-005942019-01-242025-01-23
12연장초고속 고액분리 장치를 이용한 재이용수 처리기술㈜블루비에스GT-20-008222020-01-302026-01-29
23연장에탄올 베이스 잉크 및 저심도 제판 인쇄 기술을 이용한 포장재 제조기술율촌화학㈜GT-20-008352020-01-302026-01-29
34연장당귀, 천궁, 작약으로부터 추출된 건강기능식품 원료 제조기술 (헤모힘당귀등혼합추출물)콜마비앤에이치㈜GT-20-008762020-05-212026-05-20
45신규축산유기자원을 활용한 바이오매스플라스틱 제조기술주식회사 더자연GT-20-009002020-07-162026-07-15
56신규곡물 유래의 식물성 재료와 도자기 제조법을 응용한 식품용기 제조 기술㈜자연동화GT-20-009122020-07-162026-07-15
67신규초고압 인삼 가공 기술㈜아모레퍼시픽GT-20-009282020-09-032023-09-02
78신규밀폐형 식물공장 환경제어 기술㈜지플러스생명과학GT-20-009312020-09-032023-09-02
89연장영상기반 페로몬 트랩을 이용한 무인 해충 예찰 시스템㈜그린아그로텍GT-17-003772017-12-142023-12-13
910신규제독된 유황으로 제조되어 식물병 방제효과를 갖는 유기농자재 개발 기술농업회사법인㈜엘바이오텍GT-20-009742020-10-152023-10-14
순번신규구분기술명(국문)신청기관명녹색기술인증번호인증여부확정일자유효기간만료일자
7677신규스마트팜 복합환경시스템 제어기술주식회사 에너틱스GT-23-016612023-04-202026-04-19
7778신규풍압기반 제상시점 검출을 이용한 증발기 제상기술데스코전자GT-23-016632023-04-202026-04-19
7879신규친환경 후렉소 인쇄기를 활용한 유해성 저감 포장재 제조 기술주식회사 풍림P&PGT-23-016532023-04-202026-04-19
7980신규페트병 재활용이 용이한 복합재질 비접착식 수축라벨 제조기술위더스 케미칼(주)GT-23-017042023-06-152026-06-14
8081신규에탄올 베이스 친환경 그라비아잉크 이용한 정전인쇄 필름 포장재 제조기술오원색(주)GT-23-017052023-06-152026-06-14
8182신규농기계 및 산업용 장비 구동 엔진을 PM과 NOx를 저감하는 LPG 엔진으로 전환하는 기술주식회사 로GT-23-017072023-06-152026-06-14
8283신규동백유박 업사이클링을 통한 다용도 피부 기능성 신소재 제조기술아모레퍼시픽GT-23-017082023-06-152026-06-14
8384신규PP발포 식품용기 제조기술케미코 첨단소재 주식회사GT-23-017102023-06-152026-06-14
8485신규미생물 발효를 이용한 PHA(Polyhydroxyalkanoate) 생분해 플라스틱 제조기술CJ제일제당GT-23-017122023-06-152026-06-14
8586신규축산 분뇨 바이오차 생산 밀폐 다단건조 시스템 공정 기술(주)에코피트GT-23-017182023-06-152026-06-14