Overview

Dataset statistics

Number of variables3
Number of observations146
Missing cells6
Missing cells (%)1.4%
Duplicate rows1
Duplicate rows (%)0.7%
Total size in memory3.6 KiB
Average record size in memory24.9 B

Variable types

Text3

Dataset

Description경상남도 진주시 내의 사업장폐기물 배출자 신고 현황 자료로 사업장 명칭, 사업장 주소, 폐기물의 종류를 제공합니다.
Author경상남도 진주시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15064090

Alerts

Dataset has 1 (0.7%) duplicate rowsDuplicates
사업장명 has 2 (1.4%) missing valuesMissing
사업장주소 has 2 (1.4%) missing valuesMissing
폐기물의 종류 has 2 (1.4%) missing valuesMissing

Reproduction

Analysis started2024-04-17 21:36:25.503791
Analysis finished2024-04-17 21:36:26.241705
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업장명
Text

MISSING 

Distinct144
Distinct (%)100.0%
Missing2
Missing (%)1.4%
Memory size1.3 KiB
2024-04-18T06:36:26.400433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length8.1180556
Min length3

Characters and Unicode

Total characters1169
Distinct characters207
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)100.0%

Sample

1st row동일팩키지㈜진주공장
2nd row우성정공(주)
3rd row남강제지(주)
4th row설정식품(주)
5th row정암산업㈜
ValueCountFrequency (%)
진주점 4
 
2.3%
환경시설관리 3
 
1.7%
탑마트 3
 
1.7%
서원유통(주 2
 
1.2%
㈜성광 2
 
1.2%
성화산업㈜ 2
 
1.2%
농업회사법인 2
 
1.2%
진주공장 2
 
1.2%
진주지사 2
 
1.2%
진주지점 2
 
1.2%
Other values (149) 149
86.1%
2024-04-18T06:36:26.745955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
 
6.1%
53
 
4.5%
( 45
 
3.8%
) 45
 
3.8%
37
 
3.2%
36
 
3.1%
33
 
2.8%
30
 
2.6%
25
 
2.1%
23
 
2.0%
Other values (197) 771
66.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 987
84.4%
Other Symbol 53
 
4.5%
Open Punctuation 45
 
3.8%
Close Punctuation 45
 
3.8%
Space Separator 30
 
2.6%
Decimal Number 4
 
0.3%
Other Punctuation 2
 
0.2%
Uppercase Letter 2
 
0.2%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
7.2%
37
 
3.7%
36
 
3.6%
33
 
3.3%
25
 
2.5%
23
 
2.3%
22
 
2.2%
22
 
2.2%
21
 
2.1%
21
 
2.1%
Other values (186) 676
68.5%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
3 2
50.0%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
, 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
P 1
50.0%
T 1
50.0%
Other Symbol
ValueCountFrequency (%)
53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1040
89.0%
Common 127
 
10.9%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
6.8%
53
 
5.1%
37
 
3.6%
36
 
3.5%
33
 
3.2%
25
 
2.4%
23
 
2.2%
22
 
2.1%
22
 
2.1%
21
 
2.0%
Other values (187) 697
67.0%
Common
ValueCountFrequency (%)
( 45
35.4%
) 45
35.4%
30
23.6%
2 2
 
1.6%
3 2
 
1.6%
& 1
 
0.8%
, 1
 
0.8%
- 1
 
0.8%
Latin
ValueCountFrequency (%)
P 1
50.0%
T 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 987
84.4%
ASCII 129
 
11.0%
None 53
 
4.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
71
 
7.2%
37
 
3.7%
36
 
3.6%
33
 
3.3%
25
 
2.5%
23
 
2.3%
22
 
2.2%
22
 
2.2%
21
 
2.1%
21
 
2.1%
Other values (186) 676
68.5%
None
ValueCountFrequency (%)
53
100.0%
ASCII
ValueCountFrequency (%)
( 45
34.9%
) 45
34.9%
30
23.3%
2 2
 
1.6%
3 2
 
1.6%
& 1
 
0.8%
P 1
 
0.8%
, 1
 
0.8%
- 1
 
0.8%
T 1
 
0.8%

사업장주소
Text

MISSING 

Distinct139
Distinct (%)96.5%
Missing2
Missing (%)1.4%
Memory size1.3 KiB
2024-04-18T06:36:26.983619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length22
Mean length16.583333
Min length7

Characters and Unicode

Total characters2388
Distinct characters112
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)93.1%

Sample

1st row진주시 남강로 1303 (상평동)
2nd row진주시 진성면 동부로1259번길 54
3rd row진주시 남강로1367번길 36
4th row진주시 남강로1367번길 14-8 (상대동)
5th row진주시 진성면 진의로 471
ValueCountFrequency (%)
진주시 134
24.9%
대곡면 20
 
3.7%
사봉면 15
 
2.8%
진성면 14
 
2.6%
정촌면 14
 
2.6%
남강로 13
 
2.4%
지수면 8
 
1.5%
문산읍 7
 
1.3%
진주대로 7
 
1.3%
산업단지로 6
 
1.1%
Other values (204) 301
55.8%
2024-04-18T06:36:27.344762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
407
17.0%
173
 
7.2%
144
 
6.0%
134
 
5.6%
130
 
5.4%
1 111
 
4.6%
84
 
3.5%
2 70
 
2.9%
3 68
 
2.8%
58
 
2.4%
Other values (102) 1009
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1357
56.8%
Decimal Number 553
23.2%
Space Separator 407
 
17.0%
Close Punctuation 25
 
1.0%
Open Punctuation 25
 
1.0%
Dash Punctuation 20
 
0.8%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
12.7%
144
 
10.6%
134
 
9.9%
130
 
9.6%
84
 
6.2%
58
 
4.3%
57
 
4.2%
51
 
3.8%
45
 
3.3%
30
 
2.2%
Other values (87) 451
33.2%
Decimal Number
ValueCountFrequency (%)
1 111
20.1%
2 70
12.7%
3 68
12.3%
4 55
9.9%
5 49
8.9%
7 46
8.3%
8 43
 
7.8%
6 41
 
7.4%
0 38
 
6.9%
9 32
 
5.8%
Space Separator
ValueCountFrequency (%)
407
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1357
56.8%
Common 1031
43.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
12.7%
144
 
10.6%
134
 
9.9%
130
 
9.6%
84
 
6.2%
58
 
4.3%
57
 
4.2%
51
 
3.8%
45
 
3.3%
30
 
2.2%
Other values (87) 451
33.2%
Common
ValueCountFrequency (%)
407
39.5%
1 111
 
10.8%
2 70
 
6.8%
3 68
 
6.6%
4 55
 
5.3%
5 49
 
4.8%
7 46
 
4.5%
8 43
 
4.2%
6 41
 
4.0%
0 38
 
3.7%
Other values (5) 103
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1357
56.8%
ASCII 1031
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
407
39.5%
1 111
 
10.8%
2 70
 
6.8%
3 68
 
6.6%
4 55
 
5.3%
5 49
 
4.8%
7 46
 
4.5%
8 43
 
4.2%
6 41
 
4.0%
0 38
 
3.7%
Other values (5) 103
 
10.0%
Hangul
ValueCountFrequency (%)
173
 
12.7%
144
 
10.6%
134
 
9.9%
130
 
9.6%
84
 
6.2%
58
 
4.3%
57
 
4.2%
51
 
3.8%
45
 
3.3%
30
 
2.2%
Other values (87) 451
33.2%

폐기물의 종류
Text

MISSING 

Distinct104
Distinct (%)72.2%
Missing2
Missing (%)1.4%
Memory size1.3 KiB
2024-04-18T06:36:27.549467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length186
Median length91
Mean length29
Min length3

Characters and Unicode

Total characters4176
Distinct characters152
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)61.1%

Sample

1st row사업장폐기물 소각시설 바닥재,폐토사
2nd row그 밖의 공정오니,폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row펄프·제지폐수처리오니,사업장폐기물 소각시설 바닥재
4th row폐합성수지류(폐염화비닐수지류는 제외한다),그 밖의 식물성잔재물
5th row폐수처리오니,폐콘크리트
ValueCountFrequency (%)
밖의 88
 
15.3%
34
 
5.9%
폐기물 20
 
3.5%
폐합성수지류(폐염화비닐수지류는 20
 
3.5%
폐합성수지류 18
 
3.1%
과정에서 10
 
1.7%
처리하는 8
 
1.4%
폐목재류 8
 
1.4%
발생한 8
 
1.4%
분진 7
 
1.2%
Other values (173) 356
61.7%
2024-04-18T06:36:27.891618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
433
 
10.4%
333
 
8.0%
, 237
 
5.7%
177
 
4.2%
170
 
4.1%
132
 
3.2%
125
 
3.0%
110
 
2.6%
102
 
2.4%
95
 
2.3%
Other values (142) 2262
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3372
80.7%
Space Separator 433
 
10.4%
Other Punctuation 259
 
6.2%
Open Punctuation 56
 
1.3%
Close Punctuation 56
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
333
 
9.9%
177
 
5.2%
170
 
5.0%
132
 
3.9%
125
 
3.7%
110
 
3.3%
102
 
3.0%
95
 
2.8%
95
 
2.8%
91
 
2.7%
Other values (136) 1942
57.6%
Other Punctuation
ValueCountFrequency (%)
, 237
91.5%
· 21
 
8.1%
. 1
 
0.4%
Space Separator
ValueCountFrequency (%)
433
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3372
80.7%
Common 804
 
19.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
333
 
9.9%
177
 
5.2%
170
 
5.0%
132
 
3.9%
125
 
3.7%
110
 
3.3%
102
 
3.0%
95
 
2.8%
95
 
2.8%
91
 
2.7%
Other values (136) 1942
57.6%
Common
ValueCountFrequency (%)
433
53.9%
, 237
29.5%
( 56
 
7.0%
) 56
 
7.0%
· 21
 
2.6%
. 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3372
80.7%
ASCII 783
 
18.8%
None 21
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
433
55.3%
, 237
30.3%
( 56
 
7.2%
) 56
 
7.2%
. 1
 
0.1%
Hangul
ValueCountFrequency (%)
333
 
9.9%
177
 
5.2%
170
 
5.0%
132
 
3.9%
125
 
3.7%
110
 
3.3%
102
 
3.0%
95
 
2.8%
95
 
2.8%
91
 
2.7%
Other values (136) 1942
57.6%
None
ValueCountFrequency (%)
· 21
100.0%

Missing values

2024-04-18T06:36:26.134494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-18T06:36:26.201973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

사업장명사업장주소폐기물의 종류
0동일팩키지㈜진주공장진주시 남강로 1303 (상평동)사업장폐기물 소각시설 바닥재,폐토사
1우성정공(주)진주시 진성면 동부로1259번길 54그 밖의 공정오니,폐합성수지류(폐염화비닐수지류는 제외한다)
2남강제지(주)진주시 남강로1367번길 36펄프·제지폐수처리오니,사업장폐기물 소각시설 바닥재
3설정식품(주)진주시 남강로1367번길 14-8 (상대동)폐합성수지류(폐염화비닐수지류는 제외한다),그 밖의 식물성잔재물
4정암산업㈜진주시 진성면 진의로 471폐수처리오니,폐콘크리트
5진주시정수과진주시 남강로1번길 38 (판문동)정수처리오니
6농업회사법인 현대씨앤에프(주)진주시 지수면 청담길175번길 16-16그 밖의 폐수처리오니,폐합성수지류(폐염화비닐수지류는 제외한다),그 밖의 동물성잔재물
7동이메탈(주)진주시 남강로 1353 (상대동)점토점결폐주물사,음식물류폐기물,그 밖의 폐기물
8공군교육사령부진주시 금산면 송백로 46폐합성수지류(폐염화비닐수지류는 제외한다),폐유리섬유,음식물류폐기물,종량제봉투 배출 폐기물(합성수지 종량제 봉투에 배출되는 폐기물을 말한다)
9진주교도소진주시 대곡면 월암로23번길 39그 밖의 폐기물
사업장명사업장주소폐기물의 종류
136㈜비지에프로지스정촌면 화개천로 50폐합성수지류
137푸디스트㈜공군신교대점금산면 송백로 46음식물류폐기물
138아이티알인더스트리즈㈜3공장사봉면 사군로303번길 34폐활성탄
139㈜신흥기업사봉면 산업단지로27번길 21폐합성수지류, 그밖의 분진
140㈜석영지수면 금평로 156그밖의폐목재류
141하이파워㈜사봉면 산업단지로 36폐합성수지류
142하진산업이반성면 진마대로2520번길 46폐활성탄
143㈜천보철강동진로 324폐유리섬유
144<NA><NA><NA>
145<NA><NA><NA>

Duplicate rows

Most frequently occurring

사업장명사업장주소폐기물의 종류# duplicates
0<NA><NA><NA>2