Overview

Dataset statistics

Number of variables3
Number of observations217
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory24.6 B

Variable types

Text3

Dataset

Description경상북도 환경, 환경보호와 관련한 정보를 제공합니다.(경상북도 폐수배출시설의 상호명, 주소, 업종 현황입니다.)
Author경상북도
URLhttps://www.data.go.kr/data/15063149/fileData.do

Reproduction

Analysis started2023-12-12 22:33:43.735072
Analysis finished2023-12-12 22:33:44.129873
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct215
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T07:33:44.302582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length24
Mean length9.797235
Min length3

Characters and Unicode

Total characters2126
Distinct characters301
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique214 ?
Unique (%)98.6%

Sample

1st row한일신재생㈜
2nd row경산시청(자원회수시설)
3rd row㈜영신금속
4th row조일알미늄㈜
5th row매일유업㈜경산공장
ValueCountFrequency (%)
코오롱플라스틱㈜ 3
 
1.2%
제2공장 2
 
0.8%
㈜티케이케미칼 2
 
0.8%
구미공장 2
 
0.8%
김천공장 2
 
0.8%
유)클라리오스델코 2
 
0.8%
㈜대광 1
 
0.4%
세아산업㈜약목 1
 
0.4%
공장 1
 
0.4%
명신섬유공업㈜ 1
 
0.4%
Other values (226) 226
93.0%
2023-12-13T07:33:44.626162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
194
 
9.1%
79
 
3.7%
79
 
3.7%
65
 
3.1%
64
 
3.0%
) 49
 
2.3%
( 49
 
2.3%
43
 
2.0%
39
 
1.8%
33
 
1.6%
Other values (291) 1432
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1698
79.9%
Other Symbol 194
 
9.1%
Close Punctuation 52
 
2.4%
Open Punctuation 52
 
2.4%
Decimal Number 33
 
1.6%
Uppercase Letter 30
 
1.4%
Space Separator 27
 
1.3%
Other Punctuation 20
 
0.9%
Connector Punctuation 10
 
0.5%
Dash Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
4.7%
79
 
4.7%
65
 
3.8%
64
 
3.8%
43
 
2.5%
39
 
2.3%
33
 
1.9%
32
 
1.9%
31
 
1.8%
31
 
1.8%
Other values (253) 1202
70.8%
Uppercase Letter
ValueCountFrequency (%)
S 4
13.3%
C 4
13.3%
T 3
10.0%
K 3
10.0%
I 2
 
6.7%
P 2
 
6.7%
L 2
 
6.7%
D 2
 
6.7%
O 1
 
3.3%
W 1
 
3.3%
Other values (6) 6
20.0%
Decimal Number
ValueCountFrequency (%)
2 14
42.4%
1 10
30.3%
3 4
 
12.1%
6 2
 
6.1%
4 2
 
6.1%
5 1
 
3.0%
Lowercase Letter
ValueCountFrequency (%)
r 1
20.0%
a 1
20.0%
l 1
20.0%
o 1
20.0%
s 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 17
85.0%
, 2
 
10.0%
& 1
 
5.0%
Close Punctuation
ValueCountFrequency (%)
) 49
94.2%
] 3
 
5.8%
Open Punctuation
ValueCountFrequency (%)
( 49
94.2%
[ 3
 
5.8%
Other Symbol
ValueCountFrequency (%)
194
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1892
89.0%
Common 199
 
9.4%
Latin 35
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
194
 
10.3%
79
 
4.2%
79
 
4.2%
65
 
3.4%
64
 
3.4%
43
 
2.3%
39
 
2.1%
33
 
1.7%
32
 
1.7%
31
 
1.6%
Other values (254) 1233
65.2%
Latin
ValueCountFrequency (%)
S 4
 
11.4%
C 4
 
11.4%
T 3
 
8.6%
K 3
 
8.6%
I 2
 
5.7%
P 2
 
5.7%
L 2
 
5.7%
D 2
 
5.7%
O 1
 
2.9%
W 1
 
2.9%
Other values (11) 11
31.4%
Common
ValueCountFrequency (%)
) 49
24.6%
( 49
24.6%
27
13.6%
. 17
 
8.5%
2 14
 
7.0%
1 10
 
5.0%
_ 10
 
5.0%
- 5
 
2.5%
3 4
 
2.0%
] 3
 
1.5%
Other values (6) 11
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1698
79.9%
ASCII 234
 
11.0%
None 194
 
9.1%

Most frequent character per block

None
ValueCountFrequency (%)
194
100.0%
Hangul
ValueCountFrequency (%)
79
 
4.7%
79
 
4.7%
65
 
3.8%
64
 
3.8%
43
 
2.5%
39
 
2.3%
33
 
1.9%
32
 
1.9%
31
 
1.8%
31
 
1.8%
Other values (253) 1202
70.8%
ASCII
ValueCountFrequency (%)
) 49
20.9%
( 49
20.9%
27
11.5%
. 17
 
7.3%
2 14
 
6.0%
1 10
 
4.3%
_ 10
 
4.3%
- 5
 
2.1%
S 4
 
1.7%
3 4
 
1.7%
Other values (27) 45
19.2%

주소
Text

Distinct201
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T07:33:44.943326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length23
Mean length15.479263
Min length10

Characters and Unicode

Total characters3359
Distinct characters147
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique190 ?
Unique (%)87.6%

Sample

1st row경산시 남천면 남성현로 1022 외2필지
2nd row경산시 용성면 설총로 154-180
3rd row경산시 진량읍 공단4로 78
4th row경산시 진량읍 공단6로 98
5th row경산시 진량읍 대학로 1090
ValueCountFrequency (%)
구미시 71
 
9.0%
포항시 66
 
8.4%
남구 57
 
7.2%
김천시 16
 
2.0%
경주시 15
 
1.9%
고령군 12
 
1.5%
수출대로 11
 
1.4%
3공단2로 11
 
1.4%
칠곡군 10
 
1.3%
다산면 10
 
1.3%
Other values (338) 511
64.7%
2023-12-13T07:33:45.366115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
575
 
17.1%
186
 
5.5%
171
 
5.1%
1 163
 
4.9%
144
 
4.3%
2 130
 
3.9%
3 107
 
3.2%
4 90
 
2.7%
89
 
2.6%
84
 
2.5%
Other values (137) 1620
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1879
55.9%
Decimal Number 836
24.9%
Space Separator 575
 
17.1%
Dash Punctuation 55
 
1.6%
Open Punctuation 5
 
0.1%
Close Punctuation 5
 
0.1%
Connector Punctuation 2
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
186
 
9.9%
171
 
9.1%
144
 
7.7%
89
 
4.7%
84
 
4.5%
77
 
4.1%
70
 
3.7%
69
 
3.7%
68
 
3.6%
67
 
3.6%
Other values (120) 854
45.4%
Decimal Number
ValueCountFrequency (%)
1 163
19.5%
2 130
15.6%
3 107
12.8%
4 90
10.8%
6 74
8.9%
5 62
 
7.4%
8 57
 
6.8%
0 56
 
6.7%
9 49
 
5.9%
7 48
 
5.7%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
: 1
50.0%
Space Separator
ValueCountFrequency (%)
575
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1879
55.9%
Common 1480
44.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
186
 
9.9%
171
 
9.1%
144
 
7.7%
89
 
4.7%
84
 
4.5%
77
 
4.1%
70
 
3.7%
69
 
3.7%
68
 
3.6%
67
 
3.6%
Other values (120) 854
45.4%
Common
ValueCountFrequency (%)
575
38.9%
1 163
 
11.0%
2 130
 
8.8%
3 107
 
7.2%
4 90
 
6.1%
6 74
 
5.0%
5 62
 
4.2%
8 57
 
3.9%
0 56
 
3.8%
- 55
 
3.7%
Other values (7) 111
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1879
55.9%
ASCII 1480
44.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
575
38.9%
1 163
 
11.0%
2 130
 
8.8%
3 107
 
7.2%
4 90
 
6.1%
6 74
 
5.0%
5 62
 
4.2%
8 57
 
3.9%
0 56
 
3.8%
- 55
 
3.7%
Other values (7) 111
 
7.5%
Hangul
ValueCountFrequency (%)
186
 
9.9%
171
 
9.1%
144
 
7.7%
89
 
4.7%
84
 
4.5%
77
 
4.1%
70
 
3.7%
69
 
3.7%
68
 
3.6%
67
 
3.6%
Other values (120) 854
45.4%

업종
Text

Distinct150
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T07:33:45.552517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length43
Mean length19.165899
Min length3

Characters and Unicode

Total characters4159
Distinct characters189
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)53.0%

Sample

1st row지정외폐기물처리업(38210)
2nd row폐기물처리(382)
3rd row기타기초무기_화학물질제조업_(20129)
4th row알루미늄압연,압출 및 연신제품제조업(24222)
5th row액상시유 및 기타 낙농제품제조업(10501)
ValueCountFrequency (%)
67
 
12.4%
기타 43
 
7.9%
합성수지 11
 
2.0%
그외 10
 
1.8%
제조업 8
 
1.5%
정련 8
 
1.5%
비금속광물제품제조업 8
 
1.5%
제련 7
 
1.3%
축전지제조업(28202 5
 
0.9%
5
 
0.9%
Other values (254) 370
68.3%
2023-12-13T07:33:45.911174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
330
 
7.9%
2 310
 
7.5%
248
 
6.0%
197
 
4.7%
) 187
 
4.5%
( 187
 
4.5%
180
 
4.3%
1 175
 
4.2%
3 124
 
3.0%
0 97
 
2.3%
Other values (179) 2124
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2428
58.4%
Decimal Number 931
 
22.4%
Space Separator 330
 
7.9%
Close Punctuation 187
 
4.5%
Open Punctuation 187
 
4.5%
Other Punctuation 75
 
1.8%
Connector Punctuation 20
 
0.5%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
248
 
10.2%
197
 
8.1%
180
 
7.4%
96
 
4.0%
84
 
3.5%
80
 
3.3%
68
 
2.8%
60
 
2.5%
52
 
2.1%
51
 
2.1%
Other values (162) 1312
54.0%
Decimal Number
ValueCountFrequency (%)
2 310
33.3%
1 175
18.8%
3 124
 
13.3%
0 97
 
10.4%
9 94
 
10.1%
4 52
 
5.6%
5 30
 
3.2%
6 23
 
2.5%
8 19
 
2.0%
7 7
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 67
89.3%
: 8
 
10.7%
Space Separator
ValueCountFrequency (%)
330
100.0%
Close Punctuation
ValueCountFrequency (%)
) 187
100.0%
Open Punctuation
ValueCountFrequency (%)
( 187
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2428
58.4%
Common 1731
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
248
 
10.2%
197
 
8.1%
180
 
7.4%
96
 
4.0%
84
 
3.5%
80
 
3.3%
68
 
2.8%
60
 
2.5%
52
 
2.1%
51
 
2.1%
Other values (162) 1312
54.0%
Common
ValueCountFrequency (%)
330
19.1%
2 310
17.9%
) 187
10.8%
( 187
10.8%
1 175
10.1%
3 124
 
7.2%
0 97
 
5.6%
9 94
 
5.4%
, 67
 
3.9%
4 52
 
3.0%
Other values (7) 108
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2428
58.4%
ASCII 1731
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
330
19.1%
2 310
17.9%
) 187
10.8%
( 187
10.8%
1 175
10.1%
3 124
 
7.2%
0 97
 
5.6%
9 94
 
5.4%
, 67
 
3.9%
4 52
 
3.0%
Other values (7) 108
 
6.2%
Hangul
ValueCountFrequency (%)
248
 
10.2%
197
 
8.1%
180
 
7.4%
96
 
4.0%
84
 
3.5%
80
 
3.3%
68
 
2.8%
60
 
2.5%
52
 
2.1%
51
 
2.1%
Other values (162) 1312
54.0%

Missing values

2023-12-13T07:33:44.033351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:33:44.099031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호명주소업종
0한일신재생㈜경산시 남천면 남성현로 1022 외2필지지정외폐기물처리업(38210)
1경산시청(자원회수시설)경산시 용성면 설총로 154-180폐기물처리(382)
2㈜영신금속경산시 진량읍 공단4로 78기타기초무기_화학물질제조업_(20129)
3조일알미늄㈜경산시 진량읍 공단6로 98알루미늄압연,압출 및 연신제품제조업(24222)
4매일유업㈜경산공장경산시 진량읍 대학로 1090액상시유 및 기타 낙농제품제조업(10501)
5코오롱인더스트리㈜경산공장경산시 진량읍 대학로 1298화학섬유직물직조업(13213)외6종_(13992, 13993, 20501, 22213, 29175, 30399)
6㈜하이필 경산공장경산시 진량읍 북리1길 55액체여과기제조업(29175)
7베페사징크코리아㈜경주시 천북산단로 265-15연 및 아연 제련, 정련 및 합금제조업(24213)
8(주)더이한에스티이경주시 강동면 강동산단로2길 50-17연 및 아연 제련, 정련 및 합금제조업(24213), 기타비철금속제련, 정련 및 합금제조업
9㈜엔케이합금메탈(구.㈜상영합금메탈)경주시 건천읍 용명공단길 173-120연 및 아연 제련, 정련 및 합금제조업(24213)
상호명주소업종
207한국수자원공사포항권지사학야정수장포항시 북구 기계면 학야리 730수도사업
208동해화학공업㈜포항시 북구 동해대로 2694-71비금속광물제조
209우정건설㈜포항시 북구 동해대로1954번길 23-1아스콘제조업(23991)
210㈜화진철강_(구 항진제강)포항시 북구 송라면 대전길 97-88열간압연압출제품제조업(24121)
211이비덴그라파이트코리아㈜포항시 북구 영일만일반산단남로75번길 41그 외 기타 분류 안 된 비금속, 광물제품제조업(23999)
212현대힘스㈜포항공장포항시 북구 흥해읍 해안로 1001금속제품제조
213강림중공업㈜포항공장포항시 북구 흥해읍 용한리 811-30선박구성부분품제조업(31114)
214알펙㈜포항시 북구 흥해읍 해안로 964설치용금속탱크 및 저장용기제조업(25122)
215㈜에스앤지포항시 남구 동촌동 5비금속광물제품제조업
216㈜에코프로지이엠포항시 북구 흥해읍 영일만산단남로75번길 15축전지제조업(28202)