Overview

Dataset statistics

Number of variables6
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory50.6 B

Variable types

Categorical5
Text1

Dataset

Description2017년~2023년도 환경부, 한국환경산업기술원, 생활화학제품 기업, 시민단체가 참여하는"생활화학제품 안전관리 자발적 협약" 참여기업 목록(기업명, 협약 구분 등) 입니다.
URLhttps://www.data.go.kr/data/15105282/fileData.do

Alerts

구분 is highly overall correlated with 1기 자발적 협약(2017-02_2019-02) and 3 other fieldsHigh correlation
2기 자발적 협약(2019-06_2021-06) is highly overall correlated with 구분 and 2 other fieldsHigh correlation
비고 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
1기 자발적 협약(2017-02_2019-02) is highly overall correlated with 구분 and 2 other fieldsHigh correlation
3기 자발적 협약(2021-12_2023-12) is highly overall correlated with 구분 and 3 other fieldsHigh correlation
구분 is highly imbalanced (54.4%)Imbalance
기업명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:25:38.512881
Analysis finished2023-12-12 12:25:39.446591
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
제조·수입사
43 
유통사
 
4
시민단체
 
3

Length

Max length6
Median length6
Mean length5.64
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제조·수입사
2nd row제조·수입사
3rd row제조·수입사
4th row제조·수입사
5th row제조·수입사

Common Values

ValueCountFrequency (%)
제조·수입사 43
86.0%
유통사 4
 
8.0%
시민단체 3
 
6.0%

Length

2023-12-12T21:25:39.607960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:25:39.811900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조·수입사 43
86.0%
유통사 4
 
8.0%
시민단체 3
 
6.0%

기업명
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T21:25:40.104930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length6.62
Min length3

Characters and Unicode

Total characters331
Distinct characters165
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row라이온코리아㈜
2nd row㈜무궁화
3rd row보령메디앙스㈜
4th row㈜불스원
5th row애경산업㈜
ValueCountFrequency (%)
라이온코리아㈜ 1
 
2.0%
아리퓨어 1
 
2.0%
㈜씨앤지세븐 1
 
2.0%
㈜월드그린 1
 
2.0%
㈜천연살균의학처 1
 
2.0%
㈜프로세이프바이오 1
 
2.0%
허브에프앤씨(f&c 1
 
2.0%
㈜해피룸 1
 
2.0%
디오티큐 1
 
2.0%
다비디퓨저 1
 
2.0%
Other values (41) 41
80.4%
2023-12-12T21:25:40.603678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
 
9.4%
( 10
 
3.0%
10
 
3.0%
) 10
 
3.0%
9
 
2.7%
8
 
2.4%
7
 
2.1%
6
 
1.8%
6
 
1.8%
5
 
1.5%
Other values (155) 229
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 264
79.8%
Other Symbol 31
 
9.4%
Open Punctuation 10
 
3.0%
Close Punctuation 10
 
3.0%
Uppercase Letter 7
 
2.1%
Lowercase Letter 6
 
1.8%
Other Punctuation 2
 
0.6%
Space Separator 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
3.8%
9
 
3.4%
8
 
3.0%
7
 
2.7%
6
 
2.3%
6
 
2.3%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (136) 201
76.1%
Uppercase Letter
ValueCountFrequency (%)
S 1
14.3%
C 1
14.3%
F 1
14.3%
K 1
14.3%
A 1
14.3%
J 1
14.3%
Y 1
14.3%
Lowercase Letter
ValueCountFrequency (%)
c 1
16.7%
e 1
16.7%
u 1
16.7%
o 1
16.7%
t 1
16.7%
n 1
16.7%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
/ 1
50.0%
Other Symbol
ValueCountFrequency (%)
31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 295
89.1%
Common 23
 
6.9%
Latin 13
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
10.5%
10
 
3.4%
9
 
3.1%
8
 
2.7%
7
 
2.4%
6
 
2.0%
6
 
2.0%
5
 
1.7%
4
 
1.4%
4
 
1.4%
Other values (137) 205
69.5%
Latin
ValueCountFrequency (%)
S 1
 
7.7%
C 1
 
7.7%
F 1
 
7.7%
c 1
 
7.7%
e 1
 
7.7%
K 1
 
7.7%
A 1
 
7.7%
J 1
 
7.7%
u 1
 
7.7%
o 1
 
7.7%
Other values (3) 3
23.1%
Common
ValueCountFrequency (%)
( 10
43.5%
) 10
43.5%
& 1
 
4.3%
/ 1
 
4.3%
1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 264
79.8%
ASCII 36
 
10.9%
None 31
 
9.4%

Most frequent character per block

None
ValueCountFrequency (%)
31
100.0%
ASCII
ValueCountFrequency (%)
( 10
27.8%
) 10
27.8%
S 1
 
2.8%
C 1
 
2.8%
& 1
 
2.8%
F 1
 
2.8%
c 1
 
2.8%
e 1
 
2.8%
/ 1
 
2.8%
K 1
 
2.8%
Other values (8) 8
22.2%
Hangul
ValueCountFrequency (%)
10
 
3.8%
9
 
3.4%
8
 
3.0%
7
 
2.7%
6
 
2.3%
6
 
2.3%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (136) 201
76.1%

1기 자발적 협약(2017-02_2019-02)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
<NA>
32 
참여
18 

Length

Max length4
Median length4
Mean length3.28
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row참여
2nd row참여
3rd row참여
4th row참여
5th row참여

Common Values

ValueCountFrequency (%)
<NA> 32
64.0%
참여 18
36.0%

Length

2023-12-12T21:25:40.799184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:25:40.959306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 32
64.0%
참여 18
36.0%

2기 자발적 협약(2019-06_2021-06)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
<NA>
30 
참여
20 

Length

Max length4
Median length4
Mean length3.2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row참여
2nd row<NA>
3rd row참여
4th row참여
5th row참여

Common Values

ValueCountFrequency (%)
<NA> 30
60.0%
참여 20
40.0%

Length

2023-12-12T21:25:41.159323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:25:41.309861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 30
60.0%
참여 20
40.0%

3기 자발적 협약(2021-12_2023-12)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
참여
40 
<NA>
10 

Length

Max length4
Median length2
Mean length2.4
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row참여
2nd row참여
3rd row참여
4th row참여
5th row참여

Common Values

ValueCountFrequency (%)
참여 40
80.0%
<NA> 10
 
20.0%

Length

2023-12-12T21:25:41.466227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:25:41.613771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
참여 40
80.0%
na 10
 
20.0%

비고
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
<NA>
27 
신규
23 

Length

Max length4
Median length4
Mean length3.08
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 27
54.0%
신규 23
46.0%

Length

2023-12-12T21:25:41.780578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:25:41.949672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 27
54.0%
신규 23
46.0%

Correlations

2023-12-12T21:25:42.046947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분기업명
구분1.0001.000
기업명1.0001.000
2023-12-12T21:25:42.179277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분2기 자발적 협약(2019-06_2021-06)비고1기 자발적 협약(2017-02_2019-02)3기 자발적 협약(2021-12_2023-12)
구분1.0001.0001.0001.0001.000
2기 자발적 협약(2019-06_2021-06)1.0001.000NaN1.0001.000
비고1.000NaN1.000NaN1.000
1기 자발적 협약(2017-02_2019-02)1.0001.000NaN1.0001.000
3기 자발적 협약(2021-12_2023-12)1.0001.0001.0001.0001.000
2023-12-12T21:25:42.328841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분1기 자발적 협약(2017-02_2019-02)2기 자발적 협약(2019-06_2021-06)3기 자발적 협약(2021-12_2023-12)비고
구분1.0001.0001.0001.0001.000
1기 자발적 협약(2017-02_2019-02)1.0001.0001.0001.0000.000
2기 자발적 협약(2019-06_2021-06)1.0001.0001.0001.0000.000
3기 자발적 협약(2021-12_2023-12)1.0001.0001.0001.0001.000
비고1.0000.0000.0001.0001.000

Missing values

2023-12-12T21:25:39.084105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:25:39.352381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분기업명1기 자발적 협약(2017-02_2019-02)2기 자발적 협약(2019-06_2021-06)3기 자발적 협약(2021-12_2023-12)비고
0제조·수입사라이온코리아㈜참여참여참여<NA>
1제조·수입사㈜무궁화참여<NA>참여<NA>
2제조·수입사보령메디앙스㈜참여참여참여<NA>
3제조·수입사㈜불스원참여참여참여<NA>
4제조·수입사애경산업㈜참여참여참여<NA>
5제조·수입사에스씨존슨코리아(유)참여<NA><NA><NA>
6제조·수입사㈜엘지생활건강참여참여참여<NA>
7제조·수입사(유)옥시레킷벤키저참여참여<NA><NA>
8제조·수입사(유)유한양행참여<NA><NA><NA>
9제조·수입사유한킴벌리㈜참여<NA><NA><NA>
구분기업명1기 자발적 협약(2017-02_2019-02)2기 자발적 협약(2019-06_2021-06)3기 자발적 협약(2021-12_2023-12)비고
40제조·수입사포레스트오브퍼퓸<NA><NA>참여신규
41제조·수입사향기만드는가게<NA><NA>참여신규
42제조·수입사(유)강청<NA><NA>참여신규
43유통사롯데쇼핑㈜롯데마트참여참여참여<NA>
44유통사㈜아성다이소참여참여참여<NA>
45유통사㈜이마트참여참여참여<NA>
46유통사㈜홈플러스참여참여참여<NA>
47시민단체환경정의<NA>참여참여<NA>
48시민단체발암물질없는사회만들기국민행동<NA><NA>참여<NA>
49시민단체환경운동연합<NA>참여<NA><NA>