Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Categorical1
Text2
Numeric1

Dataset

Description녹색인증을 신청한 정보 현황 입니다. 녹색인증 신청년도, 신청구분, 신청번호, 명칭 등의 데이터 항목을 제공합니다.
Author한국산업기술진흥원
URLhttps://www.data.go.kr/data/15069722/fileData.do

Alerts

신청번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:10:32.612928
Analysis finished2023-12-12 13:10:33.630803
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

신청구분
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
녹색기술
5246 
녹색기술제품
4309 
녹색전문기업
 
346
녹색사업
 
99

Length

Max length6
Median length4
Mean length4.931
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색기술
2nd row녹색기술
3rd row녹색기술제품
4th row녹색기술
5th row녹색기술

Common Values

ValueCountFrequency (%)
녹색기술 5246
52.5%
녹색기술제품 4309
43.1%
녹색전문기업 346
 
3.5%
녹색사업 99
 
1.0%

Length

2023-12-12T22:10:33.727371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:10:33.846743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
녹색기술 5246
52.5%
녹색기술제품 4309
43.1%
녹색전문기업 346
 
3.5%
녹색사업 99
 
1.0%

신청번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:10:34.196975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length8.4309
Min length8

Characters and Unicode

Total characters84309
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowGT110482
2nd rowGT110154
3rd rowGTP160420
4th rowGT167762
5th rowGT169368
ValueCountFrequency (%)
gt110482 1
 
< 0.1%
gt120205 1
 
< 0.1%
gtp162881 1
 
< 0.1%
gtp161982 1
 
< 0.1%
gtp130235 1
 
< 0.1%
gtp165156 1
 
< 0.1%
gt130819 1
 
< 0.1%
gt100055 1
 
< 0.1%
gt164484 1
 
< 0.1%
gt167238 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T22:10:34.783832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 14390
17.1%
G 10000
11.9%
6 9643
11.4%
T 9555
11.3%
0 8669
10.3%
5 4532
 
5.4%
3 4481
 
5.3%
P 4408
 
5.2%
4 4396
 
5.2%
2 4274
 
5.1%
Other values (4) 9961
11.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60000
71.2%
Uppercase Letter 24309
28.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 14390
24.0%
6 9643
16.1%
0 8669
14.4%
5 4532
 
7.6%
3 4481
 
7.5%
4 4396
 
7.3%
2 4274
 
7.1%
7 3725
 
6.2%
8 3050
 
5.1%
9 2840
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
G 10000
41.1%
T 9555
39.3%
P 4408
18.1%
C 346
 
1.4%

Most occurring scripts

ValueCountFrequency (%)
Common 60000
71.2%
Latin 24309
28.8%

Most frequent character per script

Common
ValueCountFrequency (%)
1 14390
24.0%
6 9643
16.1%
0 8669
14.4%
5 4532
 
7.6%
3 4481
 
7.5%
4 4396
 
7.3%
2 4274
 
7.1%
7 3725
 
6.2%
8 3050
 
5.1%
9 2840
 
4.7%
Latin
ValueCountFrequency (%)
G 10000
41.1%
T 9555
39.3%
P 4408
18.1%
C 346
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 84309
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 14390
17.1%
G 10000
11.9%
6 9643
11.4%
T 9555
11.3%
0 8669
10.3%
5 4532
 
5.4%
3 4481
 
5.3%
P 4408
 
5.2%
4 4396
 
5.2%
2 4274
 
5.1%
Other values (4) 9961
11.8%

명칭
Text

Distinct5821
Distinct (%)58.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:10:35.161265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length133
Median length76
Mean length23.4726
Min length1

Characters and Unicode

Total characters234726
Distinct characters907
Distinct categories18 ?
Distinct scripts4 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4210 ?
Unique (%)42.1%

Sample

1st row폐 발포 합성수지 분쇄압축장치 기술
2nd rowUV경화 코팅방식에 의한 투명 열차단 단열 필름 제조 기술
3rd row금속제창, 합성수지제창
4th row중성자 스펙트럼 해석을 통한 중성자 차폐 재료 선정 및 설계 기술
5th row유동층 건조기를 이용한 고분자응집제 제조기술
ValueCountFrequency (%)
기술 2421
 
4.6%
1633
 
3.1%
이용한 1506
 
2.9%
제조기술 909
 
1.7%
컴퓨터 887
 
1.7%
led 715
 
1.4%
제조 537
 
1.0%
절전형 496
 
0.9%
시스템 432
 
0.8%
친환경 411
 
0.8%
Other values (12497) 42311
81.0%
2023-12-12T22:10:36.068361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42458
 
18.1%
8607
 
3.7%
4915
 
2.1%
4763
 
2.0%
3774
 
1.6%
3608
 
1.5%
3482
 
1.5%
3421
 
1.5%
3167
 
1.3%
2438
 
1.0%
Other values (897) 154093
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 166566
71.0%
Space Separator 42458
 
18.1%
Uppercase Letter 11481
 
4.9%
Lowercase Letter 8840
 
3.8%
Other Punctuation 1378
 
0.6%
Open Punctuation 1310
 
0.6%
Close Punctuation 1309
 
0.6%
Decimal Number 953
 
0.4%
Dash Punctuation 344
 
0.1%
Math Symbol 23
 
< 0.1%
Other values (8) 64
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8607
 
5.2%
4915
 
3.0%
4763
 
2.9%
3774
 
2.3%
3608
 
2.2%
3482
 
2.1%
3421
 
2.1%
3167
 
1.9%
2438
 
1.5%
2358
 
1.4%
Other values (784) 126033
75.7%
Uppercase Letter
ValueCountFrequency (%)
E 1790
15.6%
D 1766
15.4%
L 1598
13.9%
P 940
8.2%
C 802
 
7.0%
S 746
 
6.5%
M 482
 
4.2%
T 348
 
3.0%
A 344
 
3.0%
I 317
 
2.8%
Other values (16) 2348
20.5%
Lowercase Letter
ValueCountFrequency (%)
e 1155
13.1%
i 781
 
8.8%
o 749
 
8.5%
t 698
 
7.9%
a 693
 
7.8%
r 673
 
7.6%
n 612
 
6.9%
l 550
 
6.2%
s 462
 
5.2%
m 376
 
4.3%
Other values (16) 2091
23.7%
Other Punctuation
ValueCountFrequency (%)
, 703
51.0%
/ 298
21.6%
. 147
 
10.7%
· 96
 
7.0%
: 40
 
2.9%
% 30
 
2.2%
& 27
 
2.0%
; 14
 
1.0%
" 13
 
0.9%
' 8
 
0.6%
Other values (2) 2
 
0.1%
Other Number
ValueCountFrequency (%)
3
16.7%
2
11.1%
2
11.1%
2
11.1%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (2) 2
11.1%
Decimal Number
ValueCountFrequency (%)
2 240
25.2%
0 209
21.9%
3 143
15.0%
1 106
11.1%
5 77
 
8.1%
6 62
 
6.5%
4 34
 
3.6%
7 31
 
3.3%
9 29
 
3.0%
8 22
 
2.3%
Other Symbol
ValueCountFrequency (%)
® 3
21.4%
3
21.4%
2
14.3%
2
14.3%
° 2
14.3%
1
 
7.1%
1
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 1297
99.0%
[ 10
 
0.8%
3
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1296
99.0%
] 10
 
0.8%
3
 
0.2%
Letter Number
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Math Symbol
ValueCountFrequency (%)
+ 19
82.6%
~ 4
 
17.4%
Initial Punctuation
ValueCountFrequency (%)
12
85.7%
2
 
14.3%
Final Punctuation
ValueCountFrequency (%)
6
85.7%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
42458
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 344
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 3
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 166535
70.9%
Common 47836
 
20.4%
Latin 20324
 
8.7%
Han 31
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8607
 
5.2%
4915
 
3.0%
4763
 
2.9%
3774
 
2.3%
3608
 
2.2%
3482
 
2.1%
3421
 
2.1%
3167
 
1.9%
2438
 
1.5%
2358
 
1.4%
Other values (769) 126002
75.7%
Common
ValueCountFrequency (%)
42458
88.8%
( 1297
 
2.7%
) 1296
 
2.7%
, 703
 
1.5%
- 344
 
0.7%
/ 298
 
0.6%
2 240
 
0.5%
0 209
 
0.4%
. 147
 
0.3%
3 143
 
0.3%
Other values (48) 701
 
1.5%
Latin
ValueCountFrequency (%)
E 1790
 
8.8%
D 1766
 
8.7%
L 1598
 
7.9%
e 1155
 
5.7%
P 940
 
4.6%
C 802
 
3.9%
i 781
 
3.8%
o 749
 
3.7%
S 746
 
3.7%
t 698
 
3.4%
Other values (45) 9299
45.8%
Han
ValueCountFrequency (%)
14
45.2%
3
 
9.7%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (5) 5
 
16.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 166519
70.9%
ASCII 68002
29.0%
None 110
 
< 0.1%
CJK 31
 
< 0.1%
Punctuation 21
 
< 0.1%
Enclosed Alphanum 17
 
< 0.1%
Compat Jamo 16
 
< 0.1%
CJK Compat 5
 
< 0.1%
Number Forms 3
 
< 0.1%
Letterlike Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42458
62.4%
E 1790
 
2.6%
D 1766
 
2.6%
L 1598
 
2.3%
( 1297
 
1.9%
) 1296
 
1.9%
e 1155
 
1.7%
P 940
 
1.4%
C 802
 
1.2%
i 781
 
1.1%
Other values (74) 14119
 
20.8%
Hangul
ValueCountFrequency (%)
8607
 
5.2%
4915
 
3.0%
4763
 
2.9%
3774
 
2.3%
3608
 
2.2%
3482
 
2.1%
3421
 
2.1%
3167
 
1.9%
2438
 
1.5%
2358
 
1.4%
Other values (768) 125986
75.7%
None
ValueCountFrequency (%)
· 96
87.3%
® 3
 
2.7%
3
 
2.7%
3
 
2.7%
3
 
2.7%
° 2
 
1.8%
Compat Jamo
ValueCountFrequency (%)
16
100.0%
CJK
ValueCountFrequency (%)
14
45.2%
3
 
9.7%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (5) 5
 
16.1%
Punctuation
ValueCountFrequency (%)
12
57.1%
6
28.6%
2
 
9.5%
1
 
4.8%
CJK Compat
ValueCountFrequency (%)
3
60.0%
2
40.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
11.8%
2
11.8%
2
11.8%
2
11.8%
2
11.8%
1
5.9%
1
5.9%
1
5.9%
1
5.9%
1
5.9%
Other values (2) 2
11.8%
Number Forms
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Letterlike Symbols
ValueCountFrequency (%)
1
50.0%
1
50.0%

신청년도
Real number (ℝ)

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.0717
Minimum2010
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:10:36.232938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2011
Q12014
median2017
Q32020
95-th percentile2022
Maximum2023
Range13
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.7141
Coefficient of variation (CV)0.0018413326
Kurtosis-1.1053708
Mean2017.0717
Median Absolute Deviation (MAD)3
Skewness-0.18748884
Sum20170717
Variance13.794539
MonotonicityNot monotonic
2023-12-12T22:10:36.362348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2021 1098
11.0%
2022 913
9.1%
2015 886
8.9%
2020 874
8.7%
2019 789
7.9%
2018 776
7.8%
2014 763
7.6%
2016 746
7.5%
2013 739
7.4%
2017 667
 
6.7%
Other values (4) 1749
17.5%
ValueCountFrequency (%)
2010 391
3.9%
2011 469
4.7%
2012 476
4.8%
2013 739
7.4%
2014 763
7.6%
2015 886
8.9%
2016 746
7.5%
2017 667
6.7%
2018 776
7.8%
2019 789
7.9%
ValueCountFrequency (%)
2023 413
 
4.1%
2022 913
9.1%
2021 1098
11.0%
2020 874
8.7%
2019 789
7.9%
2018 776
7.8%
2017 667
6.7%
2016 746
7.5%
2015 886
8.9%
2014 763
7.6%

Interactions

2023-12-12T22:10:33.321026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:10:36.460037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청구분신청년도
신청구분1.0000.382
신청년도0.3821.000
2023-12-12T22:10:36.563236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청년도신청구분
신청년도1.0000.264
신청구분0.2641.000

Missing values

2023-12-12T22:10:33.470942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:10:33.579919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

신청구분신청번호명칭신청년도
14737녹색기술GT110482폐 발포 합성수지 분쇄압축장치 기술2011
15012녹색기술GT110154UV경화 코팅방식에 의한 투명 열차단 단열 필름 제조 기술2011
9390녹색기술제품GTP160420금속제창, 합성수지제창2016
2492녹색기술GT167762중성자 스펙트럼 해석을 통한 중성자 차폐 재료 선정 및 설계 기술2021
1019녹색기술GT169368유동층 건조기를 이용한 고분자응집제 제조기술2022
2924녹색기술제품GTP166095소비전력 절전형 컴퓨터2021
9398녹색기술제품GTP160793금속제창2016
11981녹색기술GT140161암반에서 확공을 이용한 앵커의 친환경공정 기술2014
2329녹색기술GT168283인버터를 이용한 축동력 제어기술2021
7574녹색기술제품GTP161838데스크톱 컴퓨터2018
신청구분신청번호명칭신청년도
5246녹색기술GT165495하이패스 유로구조와 다수개의 독립된 송풍 모듈을 이용한 에어컨 실내기의 저소음 고효율화 기술2019
11139녹색기술제품GTP150100열역학 펌프효율 측정장비(PEMS)2015
5713녹색기술GT164140전기분해를 이용한 냉각탑 살균용 염소 사용량 절감기술2019
9674녹색기술제품GTP160346절전형 컴퓨터2016
8308녹색기술제품GTP161567금속제 창(단열미서기창)2017
6387녹색기술제품GTP162890에너지절감형 컴퓨터2019
1421녹색기술제품GTP167065선박 및 기자재용 방수코팅제 (선박, 기자재 및 양식장 수조 등에 적용하는 방수코팅제)2022
7024녹색기술제품GTP162740데스크톱 컴퓨터2018
7270녹색기술제품GTP162323기능성 두발화장품2018
13804녹색기술GT120725저온에서 생산이 가능한 점착시트 생산기술(폴리올로부터 형성한 폴리우레탄 점착시트)2012