Overview

Dataset statistics

Number of variables5
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory42.5 B

Variable types

Text3
Categorical2

Dataset

Description2021년~2023년도 7월의 환경부, 한국환경산업기술원, 녹색소비자연대 등 민관이 공동으로 선정한 생활화학제품 화학물질 저감 우수제품 목록(구분, 기업명, 제품명, 품목, 신고번호) 입니다.
URLhttps://www.data.go.kr/data/15105279/fileData.do

Alerts

기업명 is highly overall correlated with 품목High correlation
품목 is highly overall correlated with 기업명High correlation
구분 has unique valuesUnique
제품명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:39:23.092032
Analysis finished2023-12-12 21:39:23.640133
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-13T06:39:23.825216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.8269231
Min length2

Characters and Unicode

Total characters147
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row1호
2nd row2호
3rd row3호
4th row4호
5th row5호
ValueCountFrequency (%)
1호 1
 
1.9%
2호 1
 
1.9%
39호 1
 
1.9%
29호 1
 
1.9%
30호 1
 
1.9%
31호 1
 
1.9%
32호 1
 
1.9%
33호 1
 
1.9%
34호 1
 
1.9%
35호 1
 
1.9%
Other values (42) 42
80.8%
2023-12-13T06:39:24.197577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
35.4%
1 16
 
10.9%
2 16
 
10.9%
3 15
 
10.2%
4 15
 
10.2%
5 8
 
5.4%
6 5
 
3.4%
7 5
 
3.4%
8 5
 
3.4%
9 5
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 95
64.6%
Other Letter 52
35.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 16
16.8%
2 16
16.8%
3 15
15.8%
4 15
15.8%
5 8
8.4%
6 5
 
5.3%
7 5
 
5.3%
8 5
 
5.3%
9 5
 
5.3%
0 5
 
5.3%
Other Letter
ValueCountFrequency (%)
52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 95
64.6%
Hangul 52
35.4%

Most frequent character per script

Common
ValueCountFrequency (%)
1 16
16.8%
2 16
16.8%
3 15
15.8%
4 15
15.8%
5 8
8.4%
6 5
 
5.3%
7 5
 
5.3%
8 5
 
5.3%
9 5
 
5.3%
0 5
 
5.3%
Hangul
ValueCountFrequency (%)
52
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 95
64.6%
Hangul 52
35.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
52
100.0%
ASCII
ValueCountFrequency (%)
1 16
16.8%
2 16
16.8%
3 15
15.8%
4 15
15.8%
5 8
8.4%
6 5
 
5.3%
7 5
 
5.3%
8 5
 
5.3%
9 5
 
5.3%
0 5
 
5.3%

기업명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)21.2%
Missing0
Missing (%)0.0%
Memory size548.0 B
㈜불스원
20 
㈜크린하우스
라이온코리아㈜
엘지생활건강
코웨이
Other values (6)
10 

Length

Max length14
Median length8
Mean length5.5576923
Min length3

Unique

Unique3 ?
Unique (%)5.8%

Sample

1st row유한크로락스
2nd row엘지생활건강
3rd row엘지생활건강
4th row라이온코리아㈜
5th row㈜불스원

Common Values

ValueCountFrequency (%)
㈜불스원 20
38.5%
㈜크린하우스 9
17.3%
라이온코리아㈜ 5
 
9.6%
엘지생활건강 4
 
7.7%
코웨이 4
 
7.7%
메디앙스(주) 3
 
5.8%
㈜이마트 2
 
3.8%
롯데쇼핑㈜ 롯데마트사업본부 2
 
3.8%
유한크로락스 1
 
1.9%
비앤디생활건강 1
 
1.9%

Length

2023-12-13T06:39:24.350203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
㈜불스원 20
37.0%
㈜크린하우스 9
16.7%
라이온코리아㈜ 5
 
9.3%
엘지생활건강 4
 
7.4%
코웨이 4
 
7.4%
메디앙스(주 3
 
5.6%
㈜이마트 2
 
3.7%
롯데쇼핑㈜ 2
 
3.7%
롯데마트사업본부 2
 
3.7%
유한크로락스 1
 
1.9%
Other values (2) 2
 
3.7%

제품명
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-13T06:39:24.623046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length22
Mean length15.115385
Min length2

Characters and Unicode

Total characters786
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row유한젠
2nd row피지 딥클린젤
3rd row홈스타 인덕션 클린티슈
4th row하이지아 다목적 살균 스프레이
5th row레인오케이 에탄올 그린워셔
ValueCountFrequency (%)
레인오케이(rainok 6
 
3.8%
코팅워셔 5
 
3.2%
습기제거제 5
 
3.2%
에탄올 5
 
3.2%
불스원 4
 
2.6%
노브랜드 4
 
2.6%
레인오케이 4
 
2.6%
다목적 3
 
1.9%
3
 
1.9%
참그린 3
 
1.9%
Other values (89) 114
73.1%
2023-12-13T06:39:25.330352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
105
 
13.4%
24
 
3.1%
19
 
2.4%
17
 
2.2%
( 17
 
2.2%
) 17
 
2.2%
16
 
2.0%
14
 
1.8%
14
 
1.8%
13
 
1.7%
Other values (172) 530
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 554
70.5%
Space Separator 105
 
13.4%
Lowercase Letter 36
 
4.6%
Decimal Number 26
 
3.3%
Uppercase Letter 25
 
3.2%
Open Punctuation 17
 
2.2%
Close Punctuation 17
 
2.2%
Other Punctuation 5
 
0.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
4.3%
19
 
3.4%
17
 
3.1%
16
 
2.9%
14
 
2.5%
14
 
2.5%
13
 
2.3%
12
 
2.2%
11
 
2.0%
10
 
1.8%
Other values (150) 404
72.9%
Lowercase Letter
ValueCountFrequency (%)
i 12
33.3%
n 11
30.6%
a 8
22.2%
m 2
 
5.6%
u 1
 
2.8%
e 1
 
2.8%
r 1
 
2.8%
Decimal Number
ValueCountFrequency (%)
1 8
30.8%
0 7
26.9%
3 5
19.2%
2 4
15.4%
5 2
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
O 8
32.0%
K 8
32.0%
R 8
32.0%
P 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
& 3
60.0%
% 2
40.0%
Space Separator
ValueCountFrequency (%)
105
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 554
70.5%
Common 171
 
21.8%
Latin 61
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
4.3%
19
 
3.4%
17
 
3.1%
16
 
2.9%
14
 
2.5%
14
 
2.5%
13
 
2.3%
12
 
2.2%
11
 
2.0%
10
 
1.8%
Other values (150) 404
72.9%
Common
ValueCountFrequency (%)
105
61.4%
( 17
 
9.9%
) 17
 
9.9%
1 8
 
4.7%
0 7
 
4.1%
3 5
 
2.9%
2 4
 
2.3%
& 3
 
1.8%
5 2
 
1.2%
% 2
 
1.2%
Latin
ValueCountFrequency (%)
i 12
19.7%
n 11
18.0%
a 8
13.1%
O 8
13.1%
K 8
13.1%
R 8
13.1%
m 2
 
3.3%
u 1
 
1.6%
e 1
 
1.6%
r 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 552
70.2%
ASCII 232
29.5%
Compat Jamo 2
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
105
45.3%
( 17
 
7.3%
) 17
 
7.3%
i 12
 
5.2%
n 11
 
4.7%
a 8
 
3.4%
O 8
 
3.4%
K 8
 
3.4%
1 8
 
3.4%
R 8
 
3.4%
Other values (12) 30
 
12.9%
Hangul
ValueCountFrequency (%)
24
 
4.3%
19
 
3.4%
17
 
3.1%
16
 
2.9%
14
 
2.5%
14
 
2.5%
13
 
2.4%
12
 
2.2%
11
 
2.0%
10
 
1.8%
Other values (149) 402
72.8%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

품목
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)30.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
자동차용 워셔액
10 
세정제
습기제거제
세탁세제
필터형보존처리제품
Other values (11)
16 

Length

Max length19
Median length15
Mean length7.0384615
Min length3

Unique

Unique7 ?
Unique (%)13.5%

Sample

1st row표백제, 살균제
2nd row세탁세제
3rd row세정제, 살균제
4th row살균제, 세정제
5th row자동차용 워셔액

Common Values

ValueCountFrequency (%)
자동차용 워셔액 10
19.2%
세정제 9
17.3%
습기제거제 8
15.4%
세탁세제 5
9.6%
필터형보존처리제품 4
 
7.7%
자동차용 부동액 3
 
5.8%
살균제, 세정제 2
 
3.8%
특수목적코팅제 2
 
3.8%
세정제, 특수목적 코팅제 2
 
3.8%
표백제, 살균제 1
 
1.9%
Other values (6) 6
11.5%

Length

2023-12-13T06:39:25.467705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
세정제 18
22.0%
자동차용 13
15.9%
워셔액 10
12.2%
습기제거제 9
11.0%
세탁세제 6
 
7.3%
필터형보존처리제품 4
 
4.9%
살균제 4
 
4.9%
부동액 3
 
3.7%
표백제 3
 
3.7%
탈취제 3
 
3.7%
Other values (5) 9
11.0%
Distinct34
Distinct (%)65.4%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-13T06:39:25.644308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.019231
Min length11

Characters and Unicode

Total characters625
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)51.9%

Sample

1st rowFB19-04-0005
2nd rowFB20-03-0008
3rd rowFB21-01-0200
4th rowEB20-01-096
5th rowDA20-16-0005
ValueCountFrequency (%)
fb21-27-0003 8
 
15.4%
da20-16-0005 4
 
7.7%
da20-16-0008 4
 
7.7%
hb21-25-0020 3
 
5.8%
da21-17-0005 2
 
3.8%
da20-16-0007 2
 
3.8%
cb21-01-1856 2
 
3.8%
da19-17-0009 1
 
1.9%
cb21-01-0832 1
 
1.9%
fb23-01-0424 1
 
1.9%
Other values (24) 24
46.2%
2023-12-13T06:39:25.973435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 153
24.5%
- 104
16.6%
2 73
11.7%
1 70
11.2%
B 39
 
6.2%
7 24
 
3.8%
3 22
 
3.5%
6 21
 
3.4%
5 19
 
3.0%
F 15
 
2.4%
Other values (10) 85
13.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 415
66.4%
Dash Punctuation 104
 
16.6%
Uppercase Letter 104
 
16.6%
Space Separator 2
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 153
36.9%
2 73
17.6%
1 70
16.9%
7 24
 
5.8%
3 22
 
5.3%
6 21
 
5.1%
5 19
 
4.6%
9 13
 
3.1%
8 12
 
2.9%
4 8
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
B 39
37.5%
F 15
 
14.4%
A 13
 
12.5%
D 13
 
12.5%
C 11
 
10.6%
H 7
 
6.7%
E 5
 
4.8%
G 1
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 104
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 521
83.4%
Latin 104
 
16.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 153
29.4%
- 104
20.0%
2 73
14.0%
1 70
13.4%
7 24
 
4.6%
3 22
 
4.2%
6 21
 
4.0%
5 19
 
3.6%
9 13
 
2.5%
8 12
 
2.3%
Other values (2) 10
 
1.9%
Latin
ValueCountFrequency (%)
B 39
37.5%
F 15
 
14.4%
A 13
 
12.5%
D 13
 
12.5%
C 11
 
10.6%
H 7
 
6.7%
E 5
 
4.8%
G 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 625
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 153
24.5%
- 104
16.6%
2 73
11.7%
1 70
11.2%
B 39
 
6.2%
7 24
 
3.8%
3 22
 
3.5%
6 21
 
3.4%
5 19
 
3.0%
F 15
 
2.4%
Other values (10) 85
13.6%

Correlations

2023-12-13T06:39:26.144432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분기업명제품명품목신고번호
구분1.0001.0001.0001.0001.000
기업명1.0001.0001.0000.9121.000
제품명1.0001.0001.0001.0001.000
품목1.0000.9121.0001.0001.000
신고번호1.0001.0001.0001.0001.000
2023-12-13T06:39:26.239997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목기업명
품목1.0000.617
기업명0.6171.000
2023-12-13T06:39:26.319135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업명품목
기업명1.0000.617
품목0.6171.000

Missing values

2023-12-13T06:39:23.490846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:39:23.599707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분기업명제품명품목신고번호
01호유한크로락스유한젠표백제, 살균제FB19-04-0005
12호엘지생활건강피지 딥클린젤세탁세제FB20-03-0008
23호엘지생활건강홈스타 인덕션 클린티슈세정제, 살균제FB21-01-0200
34호라이온코리아㈜하이지아 다목적 살균 스프레이살균제, 세정제EB20-01-096
45호㈜불스원레인오케이 에탄올 그린워셔자동차용 워셔액DA20-16-0005
56호㈜불스원레인오케이(RainOK) 에탄올 3인1(3in1) 코팅워셔자동차용 워셔액DA20-16-0008
67호㈜불스원불스원 다목적 세정제세정제CB19-01-0643
78호㈜불스원퍼스트클래스 초고농축 슈퍼버블폼세정제CB19-01-0247
89호㈜불스원레인오케이(RainOK) 프리미엄 에탄올 발수코팅 워셔자동차용 워셔액DA20-16-0007
910호비앤디생활건강슈맘세탁세제HB20-03-0187
구분기업명제품명품목신고번호
4243호㈜이마트노브랜드 물걸레 청소포 20매(대)세정제CB21-01-1856
4344호엘지생활건강한ㆍ입 100% 구연산 알파세정제FB19-01-0777
4445호엘지생활건강한ㆍ입 100% 베이킹소다 알파세정제FB19-01-0775
4546호㈜불스원레인오케이 발수코팅&세정 클린세정제, 특수목적 코팅제CB21-01-0832
4647호㈜불스원레인오케이(RainOK) 발수코팅&세정투인원(2in1)세정제, 특수목적 코팅제CB21-01-0828
4748호롯데쇼핑㈜ 롯데마트사업본부오늘좋은 과탄산소다세정제, 탈취제, 표백제FB23-01-0425
4849호롯데쇼핑㈜ 롯데마트사업본부오늘좋은 베이킹소다세정제FB23-01-0424
4950호애경산업㈜엄마의 선택 구연산세정제CB21-01-1302
5051호㈜불스원크리스탈 발수코팅제특수목적코팅제CB23-07-0066
5152호㈜불스원스피드업 발수 코팅워셔자동차용 워셔액DA20-16-0007