Overview

Dataset statistics

Number of variables4
Number of observations831
Missing cells95
Missing cells (%)2.9%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory26.1 KiB
Average record size in memory32.2 B

Variable types

Text4

Dataset

Description경상남도 양산시 대기 및 폐수 배출업소의 업소명, 사업장 소재지주소, 업종, 전화번호 등을 읍면동별로 배출업소사업장현황을 확인할 수 있습니다.
Author경상남도 양산시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3040406

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
전화번호 has 92 (11.1%) missing valuesMissing

Reproduction

Analysis started2023-12-11 00:27:44.820921
Analysis finished2023-12-11 00:27:45.480299
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct816
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2023-12-11T09:27:45.673000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length20
Mean length6.5415162
Min length2

Characters and Unicode

Total characters5436
Distinct characters385
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique802 ?
Unique (%)96.5%

Sample

1st row명광기업㈜
2nd row㈜동호산업
3rd row에스케이정밀㈜
4th row한미공업사
5th row(주)화승소재
ValueCountFrequency (%)
제2공장 6
 
0.7%
양산시 4
 
0.4%
양산지점 4
 
0.4%
태광산업 3
 
0.3%
양산공장 3
 
0.3%
덕성인더스트리㈜ 2
 
0.2%
현대드럼 2
 
0.2%
부산지점 2
 
0.2%
㈜서비스코리아 2
 
0.2%
㈜신성에너지 2
 
0.2%
Other values (853) 871
96.7%
2023-12-11T09:27:46.143430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
347
 
6.4%
204
 
3.8%
152
 
2.8%
151
 
2.8%
) 127
 
2.3%
( 127
 
2.3%
120
 
2.2%
105
 
1.9%
98
 
1.8%
98
 
1.8%
Other values (375) 3907
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4559
83.9%
Other Symbol 347
 
6.4%
Close Punctuation 127
 
2.3%
Open Punctuation 127
 
2.3%
Space Separator 120
 
2.2%
Uppercase Letter 86
 
1.6%
Decimal Number 41
 
0.8%
Other Punctuation 27
 
0.5%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
204
 
4.5%
152
 
3.3%
151
 
3.3%
105
 
2.3%
98
 
2.1%
98
 
2.1%
95
 
2.1%
93
 
2.0%
93
 
2.0%
92
 
2.0%
Other values (344) 3378
74.1%
Uppercase Letter
ValueCountFrequency (%)
C 17
19.8%
T 9
10.5%
S 8
9.3%
R 6
 
7.0%
D 6
 
7.0%
H 5
 
5.8%
A 5
 
5.8%
M 5
 
5.8%
I 4
 
4.7%
P 3
 
3.5%
Other values (9) 18
20.9%
Decimal Number
ValueCountFrequency (%)
2 27
65.9%
1 9
 
22.0%
3 3
 
7.3%
4 2
 
4.9%
Other Punctuation
ValueCountFrequency (%)
. 22
81.5%
& 4
 
14.8%
1
 
3.7%
Other Symbol
ValueCountFrequency (%)
347
100.0%
Close Punctuation
ValueCountFrequency (%)
) 127
100.0%
Open Punctuation
ValueCountFrequency (%)
( 127
100.0%
Space Separator
ValueCountFrequency (%)
120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4906
90.3%
Common 444
 
8.2%
Latin 86
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
347
 
7.1%
204
 
4.2%
152
 
3.1%
151
 
3.1%
105
 
2.1%
98
 
2.0%
98
 
2.0%
95
 
1.9%
93
 
1.9%
93
 
1.9%
Other values (345) 3470
70.7%
Latin
ValueCountFrequency (%)
C 17
19.8%
T 9
10.5%
S 8
9.3%
R 6
 
7.0%
D 6
 
7.0%
H 5
 
5.8%
A 5
 
5.8%
M 5
 
5.8%
I 4
 
4.7%
P 3
 
3.5%
Other values (9) 18
20.9%
Common
ValueCountFrequency (%)
) 127
28.6%
( 127
28.6%
120
27.0%
2 27
 
6.1%
. 22
 
5.0%
1 9
 
2.0%
& 4
 
0.9%
3 3
 
0.7%
- 2
 
0.5%
4 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4559
83.9%
ASCII 529
 
9.7%
None 348
 
6.4%

Most frequent character per block

None
ValueCountFrequency (%)
347
99.7%
1
 
0.3%
Hangul
ValueCountFrequency (%)
204
 
4.5%
152
 
3.3%
151
 
3.3%
105
 
2.3%
98
 
2.1%
98
 
2.1%
95
 
2.1%
93
 
2.0%
93
 
2.0%
92
 
2.0%
Other values (344) 3378
74.1%
ASCII
ValueCountFrequency (%)
) 127
24.0%
( 127
24.0%
120
22.7%
2 27
 
5.1%
. 22
 
4.2%
C 17
 
3.2%
T 9
 
1.7%
1 9
 
1.7%
S 8
 
1.5%
R 6
 
1.1%
Other values (19) 57
10.8%
Distinct795
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2023-12-11T09:27:46.409034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length19.323706
Min length13

Characters and Unicode

Total characters16058
Distinct characters118
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique766 ?
Unique (%)92.2%

Sample

1st row경상남도 양산시 교동 114-2
2nd row경상남도 양산시 교동 117
3rd row경상남도 양산시 교동 117-11
4th row경상남도 양산시 교동 129
5th row경상남도 양산시 교동 147-1
ValueCountFrequency (%)
경상남도 831
23.1%
양산시 828
23.0%
상북면 120
 
3.3%
어곡동 80
 
2.2%
북정동 79
 
2.2%
소토리 75
 
2.1%
소주동 75
 
2.1%
유산동 62
 
1.7%
산막동 60
 
1.7%
주남동 42
 
1.2%
Other values (893) 1349
37.5%
2023-12-11T09:27:46.864103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2789
17.4%
1067
 
6.6%
956
 
6.0%
883
 
5.5%
834
 
5.2%
832
 
5.2%
832
 
5.2%
831
 
5.2%
680
 
4.2%
1 648
 
4.0%
Other values (108) 5706
35.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9299
57.9%
Decimal Number 3256
 
20.3%
Space Separator 2789
 
17.4%
Dash Punctuation 594
 
3.7%
Uppercase Letter 41
 
0.3%
Other Punctuation 34
 
0.2%
Close Punctuation 22
 
0.1%
Open Punctuation 22
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1067
11.5%
956
10.3%
883
9.5%
834
9.0%
832
8.9%
832
8.9%
831
8.9%
680
 
7.3%
244
 
2.6%
195
 
2.1%
Other values (85) 1945
20.9%
Decimal Number
ValueCountFrequency (%)
1 648
19.9%
2 466
14.3%
3 370
11.4%
4 319
9.8%
8 258
 
7.9%
5 250
 
7.7%
6 249
 
7.6%
7 247
 
7.6%
9 241
 
7.4%
0 208
 
6.4%
Uppercase Letter
ValueCountFrequency (%)
L 19
46.3%
B 18
43.9%
A 1
 
2.4%
I 1
 
2.4%
C 1
 
2.4%
D 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 18
52.9%
. 16
47.1%
Space Separator
ValueCountFrequency (%)
2789
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 594
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Lowercase Letter
ValueCountFrequency (%)
w 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9299
57.9%
Common 6717
41.8%
Latin 42
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1067
11.5%
956
10.3%
883
9.5%
834
9.0%
832
8.9%
832
8.9%
831
8.9%
680
 
7.3%
244
 
2.6%
195
 
2.1%
Other values (85) 1945
20.9%
Common
ValueCountFrequency (%)
2789
41.5%
1 648
 
9.6%
- 594
 
8.8%
2 466
 
6.9%
3 370
 
5.5%
4 319
 
4.7%
8 258
 
3.8%
5 250
 
3.7%
6 249
 
3.7%
7 247
 
3.7%
Other values (6) 527
 
7.8%
Latin
ValueCountFrequency (%)
L 19
45.2%
B 18
42.9%
w 1
 
2.4%
A 1
 
2.4%
I 1
 
2.4%
C 1
 
2.4%
D 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9299
57.9%
ASCII 6759
42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2789
41.3%
1 648
 
9.6%
- 594
 
8.8%
2 466
 
6.9%
3 370
 
5.5%
4 319
 
4.7%
8 258
 
3.8%
5 250
 
3.7%
6 249
 
3.7%
7 247
 
3.7%
Other values (13) 569
 
8.4%
Hangul
ValueCountFrequency (%)
1067
11.5%
956
10.3%
883
9.5%
834
9.0%
832
8.9%
832
8.9%
831
8.9%
680
 
7.3%
244
 
2.6%
195
 
2.1%
Other values (85) 1945
20.9%

업종
Text

Distinct371
Distinct (%)44.8%
Missing3
Missing (%)0.4%
Memory size6.6 KiB
2023-12-11T09:27:47.147350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length22
Mean length7.6944444
Min length2

Characters and Unicode

Total characters6371
Distinct characters226
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique266 ?
Unique (%)32.1%

Sample

1st row기타자동차부품제조업
2nd row플라스틱호스제조업
3rd row비철금속
4th row장비수선.세차
5th row고무및플라스틱
ValueCountFrequency (%)
세차시설 76
 
6.8%
73
 
6.6%
고무및플라스틱 31
 
2.8%
장비수선.세차 24
 
2.2%
플라스틱 23
 
2.1%
기타화학 22
 
2.0%
제조업 21
 
1.9%
조립금속 20
 
1.8%
도금업 19
 
1.7%
비금속광물 16
 
1.4%
Other values (421) 785
70.7%
2023-12-11T09:27:47.574375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
470
 
7.4%
326
 
5.1%
296
 
4.6%
295
 
4.6%
237
 
3.7%
206
 
3.2%
199
 
3.1%
169
 
2.7%
163
 
2.6%
159
 
2.5%
Other values (216) 3851
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5963
93.6%
Space Separator 296
 
4.6%
Other Punctuation 62
 
1.0%
Open Punctuation 23
 
0.4%
Close Punctuation 23
 
0.4%
Decimal Number 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
470
 
7.9%
326
 
5.5%
295
 
4.9%
237
 
4.0%
206
 
3.5%
199
 
3.3%
169
 
2.8%
163
 
2.7%
159
 
2.7%
139
 
2.3%
Other values (208) 3600
60.4%
Other Punctuation
ValueCountFrequency (%)
. 32
51.6%
, 29
46.8%
· 1
 
1.6%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
3 2
50.0%
Space Separator
ValueCountFrequency (%)
296
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5963
93.6%
Common 408
 
6.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
470
 
7.9%
326
 
5.5%
295
 
4.9%
237
 
4.0%
206
 
3.5%
199
 
3.3%
169
 
2.8%
163
 
2.7%
159
 
2.7%
139
 
2.3%
Other values (208) 3600
60.4%
Common
ValueCountFrequency (%)
296
72.5%
. 32
 
7.8%
, 29
 
7.1%
( 23
 
5.6%
) 23
 
5.6%
1 2
 
0.5%
3 2
 
0.5%
· 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5963
93.6%
ASCII 407
 
6.4%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
470
 
7.9%
326
 
5.5%
295
 
4.9%
237
 
4.0%
206
 
3.5%
199
 
3.3%
169
 
2.8%
163
 
2.7%
159
 
2.7%
139
 
2.3%
Other values (208) 3600
60.4%
ASCII
ValueCountFrequency (%)
296
72.7%
. 32
 
7.9%
, 29
 
7.1%
( 23
 
5.7%
) 23
 
5.7%
1 2
 
0.5%
3 2
 
0.5%
None
ValueCountFrequency (%)
· 1
100.0%

전화번호
Text

MISSING 

Distinct701
Distinct (%)94.9%
Missing92
Missing (%)11.1%
Memory size6.6 KiB
2023-12-11T09:27:47.820468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.00406
Min length12

Characters and Unicode

Total characters8871
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique668 ?
Unique (%)90.4%

Sample

1st row055-383-8700
2nd row055-388-6966
3rd row055-387-0477
4th row055-384-0012
5th row055-370-3247
ValueCountFrequency (%)
055-388-1101 3
 
0.4%
055-388-3319 3
 
0.4%
055-388-7100 3
 
0.4%
055-366-9991 3
 
0.4%
055-386-0500 3
 
0.4%
055-389-1900 2
 
0.3%
055-374-9566 2
 
0.3%
055-386-4061 2
 
0.3%
055-366-2601 2
 
0.3%
055-367-3641 2
 
0.3%
Other values (691) 714
96.6%
2023-12-11T09:27:48.168579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 1914
21.6%
- 1478
16.7%
0 1258
14.2%
3 1067
12.0%
8 645
 
7.3%
6 535
 
6.0%
1 516
 
5.8%
7 460
 
5.2%
2 378
 
4.3%
4 364
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7393
83.3%
Dash Punctuation 1478
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1914
25.9%
0 1258
17.0%
3 1067
14.4%
8 645
 
8.7%
6 535
 
7.2%
1 516
 
7.0%
7 460
 
6.2%
2 378
 
5.1%
4 364
 
4.9%
9 256
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 1478
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8871
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1914
21.6%
- 1478
16.7%
0 1258
14.2%
3 1067
12.0%
8 645
 
7.3%
6 535
 
6.0%
1 516
 
5.8%
7 460
 
5.2%
2 378
 
4.3%
4 364
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8871
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1914
21.6%
- 1478
16.7%
0 1258
14.2%
3 1067
12.0%
8 645
 
7.3%
6 535
 
6.0%
1 516
 
5.8%
7 460
 
5.2%
2 378
 
4.3%
4 364
 
4.1%

Missing values

2023-12-11T09:27:45.248252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:27:45.341108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T09:27:45.420581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업소명소재지업종전화번호
0명광기업㈜경상남도 양산시 교동 114-2기타자동차부품제조업055-383-8700
1㈜동호산업경상남도 양산시 교동 117플라스틱호스제조업055-388-6966
2에스케이정밀㈜경상남도 양산시 교동 117-11비철금속055-387-0477
3한미공업사경상남도 양산시 교동 129장비수선.세차055-384-0012
4(주)화승소재경상남도 양산시 교동 147-1고무및플라스틱055-370-3247
5(주)화승R&A경상남도 양산시 교동 147-1고무제품제조055-370-3242
6대창산업경상남도 양산시 교동 147-1고무제품제조055-364-2756
7승호경상남도 양산시 교동 147-1고무제품제조055-381-8198
8㈜화승엑스윌경상남도 양산시 교동 147-1고무제품제조055-370-3242
9태광산업경상남도 양산시 교동 147-1고무제품제조055-381-1780
업소명소재지업종전화번호
821(주)동흥포장경상남도 양산시 평산동 163-3종이제품055-365-4732
822대남FM자동차서비스경상남도 양산시 평산동 18-3자동차정비055-365-1234
823골든24시셀프세차장경상남도 양산시 평산동 19B-4L(w모텔 앞)세차시설<NA>
824혜인요양병원경상남도 양산시 평산동 31-5, 23번지일반 병원055-385-7551
825영창목재산업경상남도 양산시 평산동 58-4목제가공업055-366-2123
826웅상정비센터경상남도 양산시 평산동 76-4자동차정비업055-364-8285
827양산시 웅상정수장경상남도 양산시 평산동 800수도사업시설055-382-9005
828문화세차장경상남도 양산시 평산동 9-1세차시설<NA>
829은진자동차종합정비경상남도 양산시 평산동 토지구획 26B 9L세차시설055-388-0670
830평산셀프경상남도 양산시 평산동 108-7운수장비수선 및 세차시설<NA>

Duplicate rows

Most frequently occurring

업소명소재지업종전화번호# duplicates
0베데스다병원경상남도 양산시 신기동 396-2병원시설055-384-99012