Overview

Dataset statistics

Number of variables8
Number of observations140
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.9 KiB
Average record size in memory64.9 B

Variable types

Text3
Boolean5

Dataset

Description부산광역시영도구환경인허가업소현황에대한데이터로대기,폐수,소음,휘발성유기화합물,기타수질배출사업장에대한자료를제공합니다.
Author부산광역시 영도구
URLhttps://www.data.go.kr/data/15083311/fileData.do

Reproduction

Analysis started2024-03-14 19:44:52.939218
Analysis finished2024-03-14 19:44:54.687681
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct139
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-03-15T04:44:55.571737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length7.9571429
Min length4

Characters and Unicode

Total characters1114
Distinct characters220
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138 ?
Unique (%)98.6%

Sample

1st row남항주유소
2nd row현대자동차 남항점
3rd row케이에스브이
4th row(주)일성조선
5th row금강조선(주)
ValueCountFrequency (%)
안경원 7
 
4.1%
신한여객자동차(주 2
 
1.2%
실로암안경원 2
 
1.2%
관리사무소 2
 
1.2%
주식회사 2
 
1.2%
기숙사 1
 
0.6%
시원메티컬의원 1
 
0.6%
한국해양수산연수원 1
 
0.6%
영도남항1급종합정비 1
 
0.6%
㈜한진중공업 1
 
0.6%
Other values (150) 150
88.2%
2024-03-15T04:44:57.148249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
5.2%
( 44
 
3.9%
) 44
 
3.9%
30
 
2.7%
25
 
2.2%
25
 
2.2%
22
 
2.0%
20
 
1.8%
20
 
1.8%
20
 
1.8%
Other values (210) 806
72.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 942
84.6%
Open Punctuation 45
 
4.0%
Close Punctuation 45
 
4.0%
Space Separator 30
 
2.7%
Other Symbol 22
 
2.0%
Uppercase Letter 14
 
1.3%
Decimal Number 12
 
1.1%
Lowercase Letter 3
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
 
6.2%
25
 
2.7%
25
 
2.7%
20
 
2.1%
20
 
2.1%
20
 
2.1%
19
 
2.0%
19
 
2.0%
18
 
1.9%
16
 
1.7%
Other values (186) 702
74.5%
Uppercase Letter
ValueCountFrequency (%)
H 3
21.4%
E 2
14.3%
T 2
14.3%
C 1
 
7.1%
K 1
 
7.1%
J 1
 
7.1%
S 1
 
7.1%
G 1
 
7.1%
P 1
 
7.1%
L 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
1 5
41.7%
2 3
25.0%
3 2
 
16.7%
9 1
 
8.3%
6 1
 
8.3%
Open Punctuation
ValueCountFrequency (%)
( 44
97.8%
[ 1
 
2.2%
Close Punctuation
ValueCountFrequency (%)
) 44
97.8%
] 1
 
2.2%
Lowercase Letter
ValueCountFrequency (%)
k 2
66.7%
s 1
33.3%
Space Separator
ValueCountFrequency (%)
30
100.0%
Other Symbol
ValueCountFrequency (%)
22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 964
86.5%
Common 133
 
11.9%
Latin 17
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
58
 
6.0%
25
 
2.6%
25
 
2.6%
22
 
2.3%
20
 
2.1%
20
 
2.1%
20
 
2.1%
19
 
2.0%
19
 
2.0%
18
 
1.9%
Other values (187) 718
74.5%
Latin
ValueCountFrequency (%)
H 3
17.6%
E 2
11.8%
T 2
11.8%
k 2
11.8%
C 1
 
5.9%
K 1
 
5.9%
J 1
 
5.9%
S 1
 
5.9%
G 1
 
5.9%
P 1
 
5.9%
Other values (2) 2
11.8%
Common
ValueCountFrequency (%)
( 44
33.1%
) 44
33.1%
30
22.6%
1 5
 
3.8%
2 3
 
2.3%
3 2
 
1.5%
9 1
 
0.8%
6 1
 
0.8%
- 1
 
0.8%
] 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 942
84.6%
ASCII 150
 
13.5%
None 22
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
58
 
6.2%
25
 
2.7%
25
 
2.7%
20
 
2.1%
20
 
2.1%
20
 
2.1%
19
 
2.0%
19
 
2.0%
18
 
1.9%
16
 
1.7%
Other values (186) 702
74.5%
ASCII
ValueCountFrequency (%)
( 44
29.3%
) 44
29.3%
30
20.0%
1 5
 
3.3%
H 3
 
2.0%
2 3
 
2.0%
E 2
 
1.3%
T 2
 
1.3%
k 2
 
1.3%
3 2
 
1.3%
Other values (13) 13
 
8.7%
None
ValueCountFrequency (%)
22
100.0%
Distinct139
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-03-15T04:44:58.473310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18.5
Mean length8.9357143
Min length4

Characters and Unicode

Total characters1251
Distinct characters60
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138 ?
Unique (%)98.6%

Sample

1st row남항서로96
2nd row절영로42
3rd row남항서로 104번길 6
4th row남항서로 69-1
5th row남항서로 63
ValueCountFrequency (%)
해양로 12
 
5.0%
태종로 11
 
4.6%
남항서로 6
 
2.5%
동삼로 5
 
2.1%
절영로 4
 
1.7%
대평남로 4
 
1.7%
남항로 3
 
1.3%
와치로 3
 
1.3%
45 3
 
1.3%
봉래나루로 3
 
1.3%
Other values (170) 184
77.3%
2024-03-15T04:45:00.322366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
130
 
10.4%
1 102
 
8.2%
98
 
7.8%
2 73
 
5.8%
53
 
4.2%
48
 
3.8%
3 47
 
3.8%
6 45
 
3.6%
5 39
 
3.1%
32
 
2.6%
Other values (50) 584
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 621
49.6%
Decimal Number 445
35.6%
Space Separator 98
 
7.8%
Dash Punctuation 31
 
2.5%
Close Punctuation 26
 
2.1%
Open Punctuation 26
 
2.1%
Other Punctuation 2
 
0.2%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
20.9%
53
 
8.5%
48
 
7.7%
32
 
5.2%
29
 
4.7%
29
 
4.7%
29
 
4.7%
27
 
4.3%
23
 
3.7%
23
 
3.7%
Other values (33) 198
31.9%
Decimal Number
ValueCountFrequency (%)
1 102
22.9%
2 73
16.4%
3 47
10.6%
6 45
10.1%
5 39
 
8.8%
4 32
 
7.2%
7 28
 
6.3%
8 28
 
6.3%
9 26
 
5.8%
0 25
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
M 1
50.0%
Space Separator
ValueCountFrequency (%)
98
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 628
50.2%
Hangul 621
49.6%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
130
20.9%
53
 
8.5%
48
 
7.7%
32
 
5.2%
29
 
4.7%
29
 
4.7%
29
 
4.7%
27
 
4.3%
23
 
3.7%
23
 
3.7%
Other values (33) 198
31.9%
Common
ValueCountFrequency (%)
1 102
16.2%
98
15.6%
2 73
11.6%
3 47
7.5%
6 45
7.2%
5 39
 
6.2%
4 32
 
5.1%
- 31
 
4.9%
7 28
 
4.5%
8 28
 
4.5%
Other values (5) 105
16.7%
Latin
ValueCountFrequency (%)
L 1
50.0%
M 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 630
50.4%
Hangul 621
49.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
130
20.9%
53
 
8.5%
48
 
7.7%
32
 
5.2%
29
 
4.7%
29
 
4.7%
29
 
4.7%
27
 
4.3%
23
 
3.7%
23
 
3.7%
Other values (33) 198
31.9%
ASCII
ValueCountFrequency (%)
1 102
16.2%
98
15.6%
2 73
11.6%
3 47
7.5%
6 45
7.1%
5 39
 
6.2%
4 32
 
5.1%
- 31
 
4.9%
7 28
 
4.4%
8 28
 
4.4%
Other values (7) 107
17.0%

업종
Text

Distinct51
Distinct (%)36.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-03-15T04:45:01.267462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length4.9285714
Min length2

Characters and Unicode

Total characters690
Distinct characters102
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)22.9%

Sample

1st row세차업
2nd row자동차정비
3rd row밸브제조업
4th row선박수리
5th row선박수리
ValueCountFrequency (%)
세차업 23
 
14.6%
안경원 18
 
11.5%
선박수리 12
 
7.6%
선박구성부분품제조 10
 
6.4%
자동차정비 7
 
4.5%
의원 6
 
3.8%
6
 
3.8%
조립금속 4
 
2.5%
세차업,주유 4
 
2.5%
자동차세차업 3
 
1.9%
Other values (49) 64
40.8%
2024-03-15T04:45:02.656388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
6.1%
41
 
5.9%
38
 
5.5%
32
 
4.6%
31
 
4.5%
30
 
4.3%
29
 
4.2%
27
 
3.9%
19
 
2.8%
18
 
2.6%
Other values (92) 383
55.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 667
96.7%
Space Separator 17
 
2.5%
Other Punctuation 4
 
0.6%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
6.3%
41
 
6.1%
38
 
5.7%
32
 
4.8%
31
 
4.6%
30
 
4.5%
29
 
4.3%
27
 
4.0%
19
 
2.8%
18
 
2.7%
Other values (88) 360
54.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 667
96.7%
Common 23
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
6.3%
41
 
6.1%
38
 
5.7%
32
 
4.8%
31
 
4.6%
30
 
4.5%
29
 
4.3%
27
 
4.0%
19
 
2.8%
18
 
2.7%
Other values (88) 360
54.0%
Common
ValueCountFrequency (%)
17
73.9%
, 4
 
17.4%
( 1
 
4.3%
) 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 667
96.7%
ASCII 23
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
6.3%
41
 
6.1%
38
 
5.7%
32
 
4.8%
31
 
4.6%
30
 
4.5%
29
 
4.3%
27
 
4.0%
19
 
2.8%
18
 
2.7%
Other values (88) 360
54.0%
ASCII
ValueCountFrequency (%)
17
73.9%
, 4
 
17.4%
( 1
 
4.3%
) 1
 
4.3%

대기
Boolean

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size268.0 B
False
92 
True
48 
ValueCountFrequency (%)
False 92
65.7%
True 48
34.3%
2024-03-15T04:45:03.040902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

수질
Boolean

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size268.0 B
False
87 
True
53 
ValueCountFrequency (%)
False 87
62.1%
True 53
37.9%
2024-03-15T04:45:03.311253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

소음
Boolean

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size268.0 B
False
107 
True
33 
ValueCountFrequency (%)
False 107
76.4%
True 33
 
23.6%
2024-03-15T04:45:03.804628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size268.0 B
False
115 
True
25 
ValueCountFrequency (%)
False 115
82.1%
True 25
 
17.9%
2024-03-15T04:45:04.109124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size268.0 B
False
106 
True
34 
ValueCountFrequency (%)
False 106
75.7%
True 34
 
24.3%
2024-03-15T04:45:04.434209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T04:45:04.707492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종대기수질소음휘발성유기화합물(VOC)기타수질
업종1.0000.9200.9700.9380.9661.000
대기0.9201.0000.0000.0000.4050.250
수질0.9700.0001.0000.2360.1990.610
소음0.9380.0000.2361.0000.2860.371
휘발성유기화합물(VOC)0.9660.4050.1990.2861.0000.350
기타수질1.0000.2500.6100.3710.3501.000
2024-03-15T04:45:05.000716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대기휘발성유기화합물(VOC)수질기타수질소음
대기1.0000.2660.0000.1600.000
휘발성유기화합물(VOC)0.2661.0000.1270.2280.184
수질0.0000.1271.0000.4180.152
기타수질0.1600.2280.4181.0000.242
소음0.0000.1840.1520.2421.000
2024-03-15T04:45:05.313262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대기수질소음휘발성유기화합물(VOC)기타수질
대기1.0000.0000.0000.2660.160
수질0.0001.0000.1520.1270.418
소음0.0000.1521.0000.1840.242
휘발성유기화합물(VOC)0.2660.1270.1841.0000.228
기타수질0.1600.4180.2420.2281.000

Missing values

2024-03-15T04:44:53.866128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T04:44:54.525807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지업종대기수질소음휘발성유기화합물(VOC)기타수질
0남항주유소남항서로96세차업NYNNN
1현대자동차 남항점절영로42자동차정비NNNNY
2케이에스브이남항서로 104번길 6밸브제조업YYYNN
3(주)일성조선남항서로 69-1선박수리NNYYN
4금강조선(주)남항서로 63선박수리NNYYN
5보은산업(주)남항남로50세차업NYNNN
6덕성교통(주)남항서로 12-12세차업NYNNN
7대동메탈공업(주)남항남로40조립금속제품제조YYYNN
8(주)종합해사남항남로22선박구성부분품제조YYYNN
9우진공업사남항남로6선박구성부분품제조NNYNN
업소명소재지업종대기수질소음휘발성유기화합물(VOC)기타수질
130다비치 안경원태종로 92안경원NNNNY
131바른눈 안경원영선대로 87안경원NNNNY
132은하안경원절영로 123안경원NNNNY
133실로암안경원태종로 108-1안경원NNNNY
134스타글라스 렌즈아이대교동1가 150-2안경원NNNNY
135실로암안경원 동삼점동삼로 67안경원NNNNY
136모모스커피 주식회사봉래나루로 160커피가공업YNNNN
137㈜나부코봉래동4가 10-1, 38자동차세차업NYNNN
138영도구청태종로 423공공기관YNNNN
139동삼2지구 도시공사 영구임대아파트 관리사무소함지로79번길 76임대아파트YNNNN