Overview

Dataset statistics

Number of variables: 6
Number of observations: 1323
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 64.7 KiB
Average record size in memory: 50.1 B
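
The derived figures in this block follow from the raw counts. The sketch below recomputes them from the values printed above; the variable names are illustrative, and it assumes that "cells" means variables times observations and that 1 KiB is 1024 bytes.

```python
# Values copied from the dataset statistics in this report.
n_variables = 6
n_observations = 1323
missing_cells = 1
total_size_kib = 64.7

# Assumption: a "cell" is one variable value in one observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the single missing cell is roughly 0.013% of all cells, which is why the report shows "< 0.1%".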

Variable types

Numeric: 1
Text: 3
Categorical: 1
DateTime: 1

Dataset

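Description: Provides data on the status of environmental pollutant discharge facilities (water quality) in Gimhae-si, including business name (업체명), industry type (업종), facility class (종수), telephone number, road-name address (도로명주소), and other fields.
Author: Gimhae-si, Gyeongsangnam-do (경상남도 김해시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15033436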

Alerts

종수 is highly imbalanced (92.3%) [Imbalance]
연번 has unique values [Unique]
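
The 92.3% figure in the imbalance alert can be reproduced from the 종수 category counts reported in the Variables section (1294, 21, 4, 2, 2). The sketch below assumes the score is one minus the Shannon entropy normalized by the logarithm of the number of classes. That assumption matches the reported value, but the function name and the single-class fallback are choices made for this illustration, not a statement about the tool's internals.

```python
import math


def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of a list of class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class
    dominates. Assumption: this is the definition behind the alert.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # Choice made for this sketch: a single class has no imbalance score.
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(n_classes)


# Counts for 종수 classes 5, 4, 2, 3, 1 as listed in this report.
jongsu_counts = [1294, 21, 4, 2, 2]
score = imbalance_score(jongsu_counts)
print(f"{score:.1%}")
```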

Reproduction

Analysis started: 2023-12-11 00:32:33.605858
Analysis finished: 2023-12-11 00:32:34.387449
Duration: 0.78 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
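
The reported duration is the difference between the two timestamps above. A minimal check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction block.
started = datetime.strptime("2023-12-11 00:32:33.605858", FMT)
finished = datetime.strptime("2023-12-11 00:32:34.387449", FMT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```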

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 1323
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 662
Minimum: 1
Maximum: 1323
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 67.1
Q1: 331.5
Median: 662
Q3: 992.5
95-th percentile: 1256.9
Maximum: 1323
Range: 1322
Interquartile range (IQR): 661
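
Because 연번 is a strictly increasing serial number from 1 to 1323, these quantiles can be checked by hand. The sketch below assumes linear interpolation between order statistics at position (n - 1) * q, which reproduces the reported values. The helper name `linear_quantile` is illustrative.

```python
def linear_quantile(sorted_values, q):
    """Quantile by linear interpolation at position (n - 1) * q."""
    pos = (len(sorted_values) - 1) * q
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Assumption: 연번 takes every integer from 1 to 1323 exactly once.
values = list(range(1, 1324))

quantiles = {q: linear_quantile(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)}
iqr = quantiles[0.75] - quantiles[0.25]
value_range = values[-1] - values[0]

for q, v in quantiles.items():
    print(f"q={q:.2f}: {v:.1f}")
print(f"IQR: {iqr:.1f}, range: {value_range}")
```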

Descriptive statistics

Standard deviation: 382.06151
Coefficient of variation (CV): 0.5771322
Kurtosis: -1.2
Mean: 662
Median Absolute Deviation (MAD): 331
Skewness: 0
Sum: 875826
Variance: 145971
Monotonicity: Strictly increasing
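
These descriptive statistics are consistent with a plain serial number. The sketch below recomputes them with the Python standard library, assuming 연번 is exactly the integers 1 to 1323 and that the variance and standard deviation are the sample versions (divisor n - 1), which is what matches the reported 145971.

```python
import statistics

# Assumption: 연번 takes every integer from 1 to 1323 exactly once.
values = list(range(1, 1324))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divisor n - 1
std_dev = statistics.stdev(values)      # sample standard deviation
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std_dev, 5), round(cv, 7), mad)
```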
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.1%
890 | 1 | 0.1%
888 | 1 | 0.1%
887 | 1 | 0.1%
886 | 1 | 0.1%
885 | 1 | 0.1%
884 | 1 | 0.1%
883 | 1 | 0.1%
882 | 1 | 0.1%
881 | 1 | 0.1%
Other values (1313) | 1313 | 99.2%

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Value | Count | Frequency (%)
1323 | 1 | 0.1%
1322 | 1 | 0.1%
1321 | 1 | 0.1%
1320 | 1 | 0.1%
1319 | 1 | 0.1%
1318 | 1 | 0.1%
1317 | 1 | 0.1%
1316 | 1 | 0.1%
1315 | 1 | 0.1%
1314 | 1 | 0.1%

업체명
Text

Distinct: 1299
Distinct (%): 98.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.5 KiB

Length

Max length: 33
Median length: 20
Mean length: 6.1352986
Min length: 2

Characters and Unicode

Total characters: 8117
Distinct characters: 478
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
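
As an illustration of how such per-category counts can be derived, the sketch below classifies the characters of one sample value from this report using Python's `unicodedata` module. The two-letter codes it returns correspond to the category names used in the tables: `Lo` is Other Letter, `So` is Other Symbol, and `Zs` is Space Separator. This is a minimal sketch and not necessarily how the profiling tool computes its counts; the helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Sample value taken from the 업체명 column of this report.
sample = "㈜빙그레 김해공장"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

The enclosed symbol ㈜ falls under Other Symbol rather than Other Letter, which is consistent with the Other Symbol category in this variable consisting of a single distinct character.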

Unique

Unique: 1279
Unique (%): 96.7%

Sample

1st row: ㈜씨앤엠
2nd row: ㈜한보메디팜
3rd row: 아세아식품
4th row: 부산카세차장
5th row: ㈜빙그레 김해공장
Value | Count | Frequency (%)
2공장 | 9 | 0.6%
주식회사 | 8 | 0.6%
김해공장 | 6 | 0.4%
삼부정밀화학㈜ | 4 | 0.3%
의료법인 | 4 | 0.3%
김해지점 | 4 | 0.3%
㈜경동냉열산업 | 3 | 0.2%
㈜한성 | 3 | 0.2%
화인케미칼㈜ | 3 | 0.2%
㈜태창포징 | 3 | 0.2%
Other values (1330) | 1370 | 96.7%

Most occurring characters

ValueCountFrequency (%)
681
 
8.4%
185
 
2.3%
173
 
2.1%
170
 
2.1%
156
 
1.9%
152
 
1.9%
150
 
1.8%
123
 
1.5%
120
 
1.5%
119
 
1.5%
Other values (468) 6088
75.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6988
86.1%
Other Symbol 681
 
8.4%
Uppercase Letter 183
 
2.3%
Space Separator 94
 
1.2%
Decimal Number 51
 
0.6%
Other Punctuation 33
 
0.4%
Close Punctuation 30
 
0.4%
Open Punctuation 30
 
0.4%
Lowercase Letter 25
 
0.3%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
185
 
2.6%
173
 
2.5%
170
 
2.4%
156
 
2.2%
152
 
2.2%
150
 
2.1%
123
 
1.8%
120
 
1.7%
119
 
1.7%
119
 
1.7%
Other values (411) 5521
79.0%
Uppercase Letter
ValueCountFrequency (%)
S 19
 
10.4%
C 18
 
9.8%
K 17
 
9.3%
E 14
 
7.7%
P 13
 
7.1%
T 13
 
7.1%
N 10
 
5.5%
G 10
 
5.5%
M 8
 
4.4%
H 8
 
4.4%
Other values (14) 53
29.0%
Lowercase Letter
ValueCountFrequency (%)
c 4
16.0%
o 3
12.0%
i 3
12.0%
a 3
12.0%
r 2
8.0%
e 2
8.0%
t 1
 
4.0%
y 1
 
4.0%
m 1
 
4.0%
s 1
 
4.0%
Other values (4) 4
16.0%
Decimal Number
ValueCountFrequency (%)
2 24
47.1%
1 9
 
17.6%
4 5
 
9.8%
3 5
 
9.8%
7 4
 
7.8%
0 2
 
3.9%
8 1
 
2.0%
9 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
. 22
66.7%
& 6
 
18.2%
, 2
 
6.1%
: 1
 
3.0%
? 1
 
3.0%
/ 1
 
3.0%
Other Symbol
ValueCountFrequency (%)
681
100.0%
Space Separator
ValueCountFrequency (%)
94
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7669
94.5%
Common 240
 
3.0%
Latin 208
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
681
 
8.9%
185
 
2.4%
173
 
2.3%
170
 
2.2%
156
 
2.0%
152
 
2.0%
150
 
2.0%
123
 
1.6%
120
 
1.6%
119
 
1.6%
Other values (412) 5640
73.5%
Latin
ValueCountFrequency (%)
S 19
 
9.1%
C 18
 
8.7%
K 17
 
8.2%
E 14
 
6.7%
P 13
 
6.2%
T 13
 
6.2%
N 10
 
4.8%
G 10
 
4.8%
M 8
 
3.8%
H 8
 
3.8%
Other values (28) 78
37.5%
Common
ValueCountFrequency (%)
94
39.2%
) 30
 
12.5%
( 30
 
12.5%
2 24
 
10.0%
. 22
 
9.2%
1 9
 
3.8%
& 6
 
2.5%
4 5
 
2.1%
3 5
 
2.1%
7 4
 
1.7%
Other values (8) 11
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6988
86.1%
None 681
 
8.4%
ASCII 448
 
5.5%

Most frequent character per block

None
ValueCountFrequency (%)
681
100.0%
Hangul
ValueCountFrequency (%)
185
 
2.6%
173
 
2.5%
170
 
2.4%
156
 
2.2%
152
 
2.2%
150
 
2.1%
123
 
1.8%
120
 
1.7%
119
 
1.7%
119
 
1.7%
Other values (411) 5521
79.0%
ASCII
ValueCountFrequency (%)
94
21.0%
) 30
 
6.7%
( 30
 
6.7%
2 24
 
5.4%
. 22
 
4.9%
S 19
 
4.2%
C 18
 
4.0%
K 17
 
3.8%
E 14
 
3.1%
P 13
 
2.9%
Other values (46) 167
37.3%

도로명주소
Text

Distinct: 1279
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Memory size: 10.5 KiB

Length

Max length: 46
Median length: 35
Mean length: 23.164021
Min length: 14

Characters and Unicode

Total characters: 30646
Distinct characters: 141
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1246
Unique (%): 94.2%

Sample

1st row: 경상남도 김해시 김해대로2635번길 29
2nd row: 경상남도 김해시 진례면 진례로371번길 71
3rd row: 경상남도 김해시 생림면 장재로520번안길 8
4th row: 경상남도 김해시 생림면 장재로520번안길 8
5th row: 경상남도 김해시 한림면 고모로 768
Value | Count | Frequency (%)
경상남도 | 1323 | 21.2%
김해시 | 1323 | 21.2%
주촌면 | 250 | 4.0%
한림면 | 191 | 3.1%
진영읍 | 136 | 2.2%
진례면 | 104 | 1.7%
생림면 | 103 | 1.6%
상동면 | 85 | 1.4%
김해대로 | 75 | 1.2%
서부로1499번길 | 53 | 0.8%
Other values (1224) | 2604 | 41.7%

Most occurring characters

ValueCountFrequency (%)
4941
 
16.1%
1549
 
5.1%
1548
 
5.1%
1456
 
4.8%
1 1374
 
4.5%
1324
 
4.3%
1324
 
4.3%
1323
 
4.3%
1323
 
4.3%
1288
 
4.2%
Other values (131) 13196
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18067
59.0%
Decimal Number 6915
 
22.6%
Space Separator 4941
 
16.1%
Dash Punctuation 602
 
2.0%
Open Punctuation 42
 
0.1%
Close Punctuation 42
 
0.1%
Other Punctuation 31
 
0.1%
Uppercase Letter 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1549
 
8.6%
1548
 
8.6%
1456
 
8.1%
1324
 
7.3%
1324
 
7.3%
1323
 
7.3%
1323
 
7.3%
1288
 
7.1%
775
 
4.3%
769
 
4.3%
Other values (111) 5388
29.8%
Decimal Number
ValueCountFrequency (%)
1 1374
19.9%
2 873
12.6%
3 805
11.6%
5 707
10.2%
4 698
10.1%
6 603
8.7%
9 596
8.6%
7 467
 
6.8%
0 409
 
5.9%
8 383
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
C 1
25.0%
F 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 28
90.3%
: 3
 
9.7%
Space Separator
ValueCountFrequency (%)
4941
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 602
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18067
59.0%
Common 12575
41.0%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1549
 
8.6%
1548
 
8.6%
1456
 
8.1%
1324
 
7.3%
1324
 
7.3%
1323
 
7.3%
1323
 
7.3%
1288
 
7.1%
775
 
4.3%
769
 
4.3%
Other values (111) 5388
29.8%
Common
ValueCountFrequency (%)
4941
39.3%
1 1374
 
10.9%
2 873
 
6.9%
3 805
 
6.4%
5 707
 
5.6%
4 698
 
5.6%
6 603
 
4.8%
- 602
 
4.8%
9 596
 
4.7%
7 467
 
3.7%
Other values (7) 909
 
7.2%
Latin
ValueCountFrequency (%)
A 2
50.0%
C 1
25.0%
F 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18067
59.0%
ASCII 12579
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4941
39.3%
1 1374
 
10.9%
2 873
 
6.9%
3 805
 
6.4%
5 707
 
5.6%
4 698
 
5.5%
6 603
 
4.8%
- 602
 
4.8%
9 596
 
4.7%
7 467
 
3.7%
Other values (10) 913
 
7.3%
Hangul
ValueCountFrequency (%)
1549
 
8.6%
1548
 
8.6%
1456
 
8.1%
1324
 
7.3%
1324
 
7.3%
1323
 
7.3%
1323
 
7.3%
1288
 
7.1%
775
 
4.3%
769
 
4.3%
Other values (111) 5388
29.8%

업종
Text

Distinct: 585
Distinct (%): 44.3%
Missing: 1
Missing (%): 0.1%
Memory size: 10.5 KiB

Length

Max length: 45
Median length: 31
Mean length: 10.954614
Min length: 2

Characters and Unicode

Total characters: 14482
Distinct characters: 285
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 415
Unique (%): 31.4%

Sample

1st row: 전동기및발전기제조업
2nd row: 기타비알콜음료제조업
3rd row: 식품제조
4th row: 세차장
5th row: 낙농제품및아이스크림제조
Value | Count | Frequency (%)
(blank in source) | 53 | 3.4%
주유소운영업 | 51 | 3.2%
자동차세차업 | 49 | 3.1%
세차시설 | 42 | 2.7%
도장및기타피막처리업 | 33 | 2.1%
기타자동차부품제조업 | 26 | 1.7%
자동차세차업(95213 | 23 | 1.5%
그외기타자동차부품제조업 | 23 | 1.5%
제조업 | 21 | 1.3%
차량용주유소운영업 | 20 | 1.3%
Other values (635) | 1230 | 78.3%

Most occurring characters

ValueCountFrequency (%)
1112
 
7.7%
943
 
6.5%
781
 
5.4%
498
 
3.4%
404
 
2.8%
399
 
2.8%
342
 
2.4%
2 311
 
2.1%
297
 
2.1%
249
 
1.7%
Other values (275) 9146
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12582
86.9%
Decimal Number 1093
 
7.5%
Space Separator 249
 
1.7%
Open Punctuation 210
 
1.5%
Close Punctuation 210
 
1.5%
Other Punctuation 138
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1112
 
8.8%
943
 
7.5%
781
 
6.2%
498
 
4.0%
404
 
3.2%
399
 
3.2%
342
 
2.7%
297
 
2.4%
233
 
1.9%
220
 
1.7%
Other values (260) 7353
58.4%
Decimal Number
ValueCountFrequency (%)
2 311
28.5%
1 205
18.8%
3 166
15.2%
9 157
14.4%
0 82
 
7.5%
5 60
 
5.5%
4 50
 
4.6%
8 31
 
2.8%
7 20
 
1.8%
6 11
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 134
97.1%
. 4
 
2.9%
Space Separator
ValueCountFrequency (%)
249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 210
100.0%
Close Punctuation
ValueCountFrequency (%)
) 210
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12582
86.9%
Common 1900
 
13.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1112
 
8.8%
943
 
7.5%
781
 
6.2%
498
 
4.0%
404
 
3.2%
399
 
3.2%
342
 
2.7%
297
 
2.4%
233
 
1.9%
220
 
1.7%
Other values (260) 7353
58.4%
Common
ValueCountFrequency (%)
2 311
16.4%
249
13.1%
( 210
11.1%
) 210
11.1%
1 205
10.8%
3 166
8.7%
9 157
8.3%
, 134
7.1%
0 82
 
4.3%
5 60
 
3.2%
Other values (5) 116
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12582
86.9%
ASCII 1900
 
13.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1112
 
8.8%
943
 
7.5%
781
 
6.2%
498
 
4.0%
404
 
3.2%
399
 
3.2%
342
 
2.7%
297
 
2.4%
233
 
1.9%
220
 
1.7%
Other values (260) 7353
58.4%
ASCII
ValueCountFrequency (%)
2 311
16.4%
249
13.1%
( 210
11.1%
) 210
11.1%
1 205
10.8%
3 166
8.7%
9 157
8.3%
, 134
7.1%
0 82
 
4.3%
5 60
 
3.2%
Other values (5) 116
 
6.1%

종수
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 10.5 KiB
Category counts: 5 (1294), 4 (21), 2 (4), 3 (2), 1 (2)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5
2nd row: 5
3rd row: 5
4th row: 5
5th row: 2

Common Values

Value | Count | Frequency (%)
5 | 1294 | 97.8%
4 | 21 | 1.6%
2 | 4 | 0.3%
3 | 2 | 0.2%
1 | 2 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)


신고일자
DateTime

Distinct: 1114
Distinct (%): 84.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.5 KiB
Minimum: 1979-08-10 00:00:00
Maximum: 2018-12-31 00:00:00
Histogram with fixed size bins (bins=50)

Interactions


Correlations

 | 연번 | 종수
연번 | 1.000 | 0.186
종수 | 0.186 | 1.000

 | 연번 | 종수
연번 | 1.000 | 0.078
종수 | 0.078 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 연번 | 업체명 | 도로명주소 | 업종 | 종수 | 신고일자
0 | 1 | ㈜씨앤엠 | 경상남도 김해시 김해대로2635번길 29 | 전동기및발전기제조업 | 5 | 1979-08-10
1 | 2 | ㈜한보메디팜 | 경상남도 김해시 진례면 진례로371번길 71 | 기타비알콜음료제조업 | 5 | 1980-06-30
2 | 3 | 아세아식품 | 경상남도 김해시 생림면 장재로520번안길 8 | 식품제조 | 5 | 1980-07-10
3 | 4 | 부산카세차장 | 경상남도 김해시 생림면 장재로520번안길 8 | 세차장 | 5 | 1980-12-12
4 | 5 | ㈜빙그레 김해공장 | 경상남도 김해시 한림면 고모로 768 | 낙농제품및아이스크림제조 | 2 | 1981-06-13
5 | 6 | 김해복음병원 | 경상남도 김해시 활천로 33 | 종합병원 | 5 | 1982-07-07
6 | 7 | ㈜동남정유 | 경상남도 김해시 김해대로2579번길 36 | 윤활유및그리스제조업 | 5 | 1984-04-10
7 | 8 | 한성기업㈜김해공장 | 경상남도 김해시 삼안로 51 | 음식료품제조시설 | 3 | 1984-06-22
8 | 9 | 인제대학교 | 경상남도 김해시 인제로 197 | 대학교 | 5 | 1985-01-12
9 | 10 | 삼영산업㈜ | 경상남도 김해시 진영읍 하계로138번길 51-62 | 타일및유사비내화요업제품제조업 | 5 | 1986-08-01

 | 연번 | 업체명 | 도로명주소 | 업종 | 종수 | 신고일자
1313 | 1314 | ㈜해피콜 제1공장지점 | 경상남도 김해시 한림면 안곡로 497-34 | 수동식식품가공기기및금속주방용기제조업(25992) | 5 | 2018-11-21
1314 | 1315 | ㈜태창포징 | 경상남도 김해시 주촌면 서부로1409번길 74 | 절삭가공 및 유사 처리업(25924) | 5 | 2018-12-12
1315 | 1316 | ㈜삼영검사엔지니어링 | 경상남도 김해시 진영읍 진산대로 238 | 전자기측정,시험및분석기구제조업 외 | 5 | 2018-12-10
1316 | 1317 | ㈜해동솔라텍 | 경상남도 김해시 생림면 마사로 26 | 플라스틱선,봉,관및호스제조업 외(22211, 22299) | 5 | 2018-12-10
1317 | 1318 | ㈜바이저 2공장 | 경상남도 김해시 한림면 김해대로1022번길 105 | 고무패킹류제조업(22191) | 5 | 2018-12-11
1318 | 1319 | 경진단조 | 경상남도 김해시 진영읍 본산로212번길 32 | 금속단조제품제조업(25912) | 5 | 2018-12-21
1319 | 1320 | 대동기업 | 경상남도 김해시 생림면 봉림리 산 194 외 8필지 | 건설폐기물중간처리업 | 5 | 2018-12-18
1320 | 1321 | 지엠피 | 경상남도 김해시 생림면 생림대로669번길 3 | 액체펌프제조업(29131) | 5 | 2018-12-31
1321 | 1322 | 황금빛식품 | 경상남도 김해시 한림면 가산리 682-6 | 도시락류 제조업 외 1 | 5 | 2018-12-31
1322 | 1323 | 피피엠테크㈜ | 경상남도 김해시 한림면 안하로116번길 58-36 | 폐기물종합재활용업 | 5 | 2018-12-31