Overview

Dataset statistics

Number of variables6
Number of observations2031
Missing cells51
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory99.3 KiB
Average record size in memory50.1 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description창원시 대기배출시설, 폐수배출시설, 소음진동 배출시설 현황 데이터로서 상호명, 소재지, 업종, 배출업소 종별 정보가 제공됩니다.
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15080991

Alerts

종별 is highly imbalanced (52.4%)Imbalance
업종 has 51 (2.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:43:56.174239
Analysis finished2023-12-10 22:43:56.935807
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

Distinct1048
Distinct (%)51.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean430.94239
Minimum1
Maximum1048
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size18.0 KiB
2023-12-11T07:43:57.002720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile34.5
Q1169
median407
Q3661
95-th percentile946.5
Maximum1048
Range1047
Interquartile range (IQR)492

Descriptive statistics

Standard deviation288.39071
Coefficient of variation (CV)0.66920942
Kurtosis-1.0186153
Mean430.94239
Median Absolute Deviation (MAD)244
Skewness0.30804911
Sum875244
Variance83169.2
MonotonicityNot monotonic
2023-12-11T07:43:57.131850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
49 4
 
0.2%
48 4
 
0.2%
1 3
 
0.1%
140 3
 
0.1%
131 3
 
0.1%
132 3
 
0.1%
133 3
 
0.1%
134 3
 
0.1%
135 3
 
0.1%
136 3
 
0.1%
Other values (1038) 1999
98.4%
ValueCountFrequency (%)
1 3
0.1%
2 3
0.1%
3 3
0.1%
4 3
0.1%
5 3
0.1%
6 3
0.1%
7 3
0.1%
8 3
0.1%
9 3
0.1%
10 3
0.1%
ValueCountFrequency (%)
1048 1
< 0.1%
1047 1
< 0.1%
1046 1
< 0.1%
1045 1
< 0.1%
1044 1
< 0.1%
1043 1
< 0.1%
1042 1
< 0.1%
1041 1
< 0.1%
1040 1
< 0.1%
1039 1
< 0.1%
Distinct1656
Distinct (%)81.5%
Missing0
Missing (%)0.0%
Memory size16.0 KiB
2023-12-11T07:43:57.360480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length33
Mean length7.7592319
Min length2

Characters and Unicode

Total characters15759
Distinct characters510
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1305 ?
Unique (%)64.3%

Sample

1st row동현자동차정비
2nd row(주)케이비아이테크
3rd row성우하이텍
4th row현대환경개발(주)
5th row(주)가현
ValueCountFrequency (%)
주식회사 38
 
1.6%
제2공장 13
 
0.6%
의료법인 13
 
0.6%
12
 
0.5%
창원공장 8
 
0.3%
주)대동사 6
 
0.3%
주)경신 6
 
0.3%
마산공장 6
 
0.3%
마산점 5
 
0.2%
제3공장 5
 
0.2%
Other values (1781) 2219
95.2%
2023-12-11T07:43:57.717055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
553
 
3.5%
547
 
3.5%
) 516
 
3.3%
( 514
 
3.3%
371
 
2.4%
326
 
2.1%
311
 
2.0%
295
 
1.9%
287
 
1.8%
282
 
1.8%
Other values (500) 11757
74.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13040
82.7%
Other Symbol 547
 
3.5%
Close Punctuation 523
 
3.3%
Open Punctuation 521
 
3.3%
Space Separator 371
 
2.4%
Decimal Number 316
 
2.0%
Uppercase Letter 225
 
1.4%
Other Punctuation 177
 
1.1%
Lowercase Letter 37
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
553
 
4.2%
326
 
2.5%
311
 
2.4%
295
 
2.3%
287
 
2.2%
282
 
2.2%
282
 
2.2%
267
 
2.0%
240
 
1.8%
238
 
1.8%
Other values (439) 9959
76.4%
Uppercase Letter
ValueCountFrequency (%)
M 23
 
10.2%
S 23
 
10.2%
C 20
 
8.9%
G 18
 
8.0%
H 17
 
7.6%
E 16
 
7.1%
T 15
 
6.7%
I 13
 
5.8%
N 13
 
5.8%
P 9
 
4.0%
Other values (15) 58
25.8%
Lowercase Letter
ValueCountFrequency (%)
i 8
21.6%
g 5
13.5%
n 5
13.5%
m 3
 
8.1%
e 3
 
8.1%
t 3
 
8.1%
a 2
 
5.4%
l 2
 
5.4%
o 2
 
5.4%
r 2
 
5.4%
Decimal Number
ValueCountFrequency (%)
2 111
35.1%
1 52
16.5%
3 52
16.5%
5 26
 
8.2%
6 19
 
6.0%
4 19
 
6.0%
0 10
 
3.2%
9 10
 
3.2%
8 10
 
3.2%
7 7
 
2.2%
Other Punctuation
ValueCountFrequency (%)
. 119
67.2%
: 37
 
20.9%
& 9
 
5.1%
, 7
 
4.0%
/ 2
 
1.1%
· 2
 
1.1%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 516
98.7%
] 7
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 514
98.7%
[ 7
 
1.3%
Other Symbol
ValueCountFrequency (%)
547
100.0%
Space Separator
ValueCountFrequency (%)
371
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13587
86.2%
Common 1910
 
12.1%
Latin 262
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
553
 
4.1%
547
 
4.0%
326
 
2.4%
311
 
2.3%
295
 
2.2%
287
 
2.1%
282
 
2.1%
282
 
2.1%
267
 
2.0%
240
 
1.8%
Other values (440) 10197
75.0%
Latin
ValueCountFrequency (%)
M 23
 
8.8%
S 23
 
8.8%
C 20
 
7.6%
G 18
 
6.9%
H 17
 
6.5%
E 16
 
6.1%
T 15
 
5.7%
I 13
 
5.0%
N 13
 
5.0%
P 9
 
3.4%
Other values (26) 95
36.3%
Common
ValueCountFrequency (%)
) 516
27.0%
( 514
26.9%
371
19.4%
. 119
 
6.2%
2 111
 
5.8%
1 52
 
2.7%
3 52
 
2.7%
: 37
 
1.9%
5 26
 
1.4%
6 19
 
1.0%
Other values (14) 93
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13040
82.7%
ASCII 2169
 
13.8%
None 550
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
553
 
4.2%
326
 
2.5%
311
 
2.4%
295
 
2.3%
287
 
2.2%
282
 
2.2%
282
 
2.2%
267
 
2.0%
240
 
1.8%
238
 
1.8%
Other values (439) 9959
76.4%
None
ValueCountFrequency (%)
547
99.5%
· 2
 
0.4%
1
 
0.2%
ASCII
ValueCountFrequency (%)
) 516
23.8%
( 514
23.7%
371
17.1%
. 119
 
5.5%
2 111
 
5.1%
1 52
 
2.4%
3 52
 
2.4%
: 37
 
1.7%
5 26
 
1.2%
M 23
 
1.1%
Other values (48) 348
16.0%
Distinct1629
Distinct (%)80.2%
Missing0
Missing (%)0.0%
Memory size16.0 KiB
2023-12-11T07:43:57.992302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length45
Mean length23.133432
Min length13

Characters and Unicode

Total characters46984
Distinct characters281
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1281 ?
Unique (%)63.1%

Sample

1st row창원시 의창구 차상로18번길 50 (팔용동)
2nd row창원시 의창구 대산면 봉강가술로559번길 8
3rd row창원시 의창구 대산면 봉강가술로537번길 16
4th row창원시 의창구 북면 천주로 991-22
5th row창원시 의창구 평산로38번길 13 (팔용동)
ValueCountFrequency (%)
창원시 2032
20.6%
마산회원구 519
 
5.3%
의창구 497
 
5.0%
성산구 456
 
4.6%
마산합포구 365
 
3.7%
팔용동 217
 
2.2%
봉암동 203
 
2.1%
진해구 192
 
2.0%
진북면 163
 
1.7%
내서읍 162
 
1.6%
Other values (1610) 5038
51.2%
2023-12-11T07:43:58.398849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8163
 
17.4%
2696
 
5.7%
2614
 
5.6%
2061
 
4.4%
2040
 
4.3%
1745
 
3.7%
1 1597
 
3.4%
1545
 
3.3%
1449
 
3.1%
) 1150
 
2.4%
Other values (271) 21924
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28435
60.5%
Space Separator 8163
 
17.4%
Decimal Number 7476
 
15.9%
Close Punctuation 1152
 
2.5%
Open Punctuation 1152
 
2.5%
Dash Punctuation 481
 
1.0%
Other Punctuation 109
 
0.2%
Uppercase Letter 12
 
< 0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2696
 
9.5%
2614
 
9.2%
2061
 
7.2%
2040
 
7.2%
1745
 
6.1%
1545
 
5.4%
1449
 
5.1%
914
 
3.2%
896
 
3.2%
651
 
2.3%
Other values (242) 11824
41.6%
Decimal Number
ValueCountFrequency (%)
1 1597
21.4%
2 1056
14.1%
3 888
11.9%
4 695
9.3%
5 632
 
8.5%
6 597
 
8.0%
7 571
 
7.6%
0 486
 
6.5%
9 484
 
6.5%
8 470
 
6.3%
Other Punctuation
ValueCountFrequency (%)
, 67
61.5%
. 19
 
17.4%
· 10
 
9.2%
; 5
 
4.6%
: 5
 
4.6%
* 3
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
A 4
33.3%
K 2
16.7%
C 2
16.7%
T 2
16.7%
S 1
 
8.3%
B 1
 
8.3%
Close Punctuation
ValueCountFrequency (%)
) 1150
99.8%
] 2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1150
99.8%
[ 2
 
0.2%
Space Separator
ValueCountFrequency (%)
8163
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 481
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28439
60.5%
Common 18533
39.4%
Latin 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2696
 
9.5%
2614
 
9.2%
2061
 
7.2%
2040
 
7.2%
1745
 
6.1%
1545
 
5.4%
1449
 
5.1%
914
 
3.2%
896
 
3.2%
651
 
2.3%
Other values (243) 11828
41.6%
Common
ValueCountFrequency (%)
8163
44.0%
1 1597
 
8.6%
) 1150
 
6.2%
( 1150
 
6.2%
2 1056
 
5.7%
3 888
 
4.8%
4 695
 
3.8%
5 632
 
3.4%
6 597
 
3.2%
7 571
 
3.1%
Other values (12) 2034
 
11.0%
Latin
ValueCountFrequency (%)
A 4
33.3%
K 2
16.7%
C 2
16.7%
T 2
16.7%
S 1
 
8.3%
B 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28435
60.5%
ASCII 18535
39.4%
None 14
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8163
44.0%
1 1597
 
8.6%
) 1150
 
6.2%
( 1150
 
6.2%
2 1056
 
5.7%
3 888
 
4.8%
4 695
 
3.7%
5 632
 
3.4%
6 597
 
3.2%
7 571
 
3.1%
Other values (17) 2036
 
11.0%
Hangul
ValueCountFrequency (%)
2696
 
9.5%
2614
 
9.2%
2061
 
7.2%
2040
 
7.2%
1745
 
6.1%
1545
 
5.4%
1449
 
5.1%
914
 
3.2%
896
 
3.2%
651
 
2.3%
Other values (242) 11824
41.6%
None
ValueCountFrequency (%)
· 10
71.4%
4
 
28.6%

종별
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size16.0 KiB
5
1446 
4
353 
<NA>
202 
3
 
14
2
 
9

Length

Max length4
Median length1
Mean length1.2983752
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row4
3rd row5
4th row4
5th row4

Common Values

ValueCountFrequency (%)
5 1446
71.2%
4 353
 
17.4%
<NA> 202
 
9.9%
3 14
 
0.7%
2 9
 
0.4%
1 7
 
0.3%

Length

2023-12-11T07:43:58.525048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:43:58.637090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 1446
71.2%
4 353
 
17.4%
na 202
 
9.9%
3 14
 
0.7%
2 9
 
0.4%
1 7
 
0.3%

업종
Text

MISSING 

Distinct609
Distinct (%)30.8%
Missing51
Missing (%)2.5%
Memory size16.0 KiB
2023-12-11T07:43:58.854320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length23
Mean length9.3691919
Min length1

Characters and Unicode

Total characters18551
Distinct characters292
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique319 ?
Unique (%)16.1%

Sample

1st row자동차종합수리업
2nd row철도차량부품및관련장치물제조업
3rd row금속절삭기계제조업
4th row비금속광물분쇄물생산업
5th row기타일반목적용기계제조업
ValueCountFrequency (%)
259
 
7.5%
제조업 256
 
7.4%
도금업 137
 
4.0%
기타 125
 
3.6%
자동차수리업 101
 
2.9%
자동차 89
 
2.6%
운수장비수선및세차 70
 
2.0%
세차업 62
 
1.8%
자동차종합수리업 53
 
1.5%
그외 50
 
1.4%
Other values (721) 2248
65.2%
2023-12-11T07:43:59.220181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1637
 
8.8%
1498
 
8.1%
1147
 
6.2%
984
 
5.3%
702
 
3.8%
654
 
3.5%
605
 
3.3%
541
 
2.9%
491
 
2.6%
458
 
2.5%
Other values (282) 9834
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16885
91.0%
Space Separator 1498
 
8.1%
Other Punctuation 64
 
0.3%
Close Punctuation 34
 
0.2%
Open Punctuation 34
 
0.2%
Decimal Number 31
 
0.2%
Dash Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1637
 
9.7%
1147
 
6.8%
984
 
5.8%
702
 
4.2%
654
 
3.9%
605
 
3.6%
541
 
3.2%
491
 
2.9%
458
 
2.7%
444
 
2.6%
Other values (269) 9222
54.6%
Decimal Number
ValueCountFrequency (%)
1 17
54.8%
3 5
 
16.1%
6 4
 
12.9%
2 3
 
9.7%
4 1
 
3.2%
9 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
, 41
64.1%
. 21
32.8%
· 2
 
3.1%
Space Separator
ValueCountFrequency (%)
1498
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16885
91.0%
Common 1666
 
9.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1637
 
9.7%
1147
 
6.8%
984
 
5.8%
702
 
4.2%
654
 
3.9%
605
 
3.6%
541
 
3.2%
491
 
2.9%
458
 
2.7%
444
 
2.6%
Other values (269) 9222
54.6%
Common
ValueCountFrequency (%)
1498
89.9%
, 41
 
2.5%
) 34
 
2.0%
( 34
 
2.0%
. 21
 
1.3%
1 17
 
1.0%
3 5
 
0.3%
- 5
 
0.3%
6 4
 
0.2%
2 3
 
0.2%
Other values (3) 4
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16885
91.0%
ASCII 1664
 
9.0%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1637
 
9.7%
1147
 
6.8%
984
 
5.8%
702
 
4.2%
654
 
3.9%
605
 
3.6%
541
 
3.2%
491
 
2.9%
458
 
2.7%
444
 
2.6%
Other values (269) 9222
54.6%
ASCII
ValueCountFrequency (%)
1498
90.0%
, 41
 
2.5%
) 34
 
2.0%
( 34
 
2.0%
. 21
 
1.3%
1 17
 
1.0%
3 5
 
0.3%
- 5
 
0.3%
6 4
 
0.2%
2 3
 
0.2%
Other values (2) 2
 
0.1%
None
ValueCountFrequency (%)
· 2
100.0%

구분
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.0 KiB
폐수
1048 
대기
781 
소음진동
202 

Length

Max length4
Median length2
Mean length2.1989168
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기
2nd row대기
3rd row대기
4th row대기
5th row대기

Common Values

ValueCountFrequency (%)
폐수 1048
51.6%
대기 781
38.5%
소음진동 202
 
9.9%

Length

2023-12-11T07:43:59.338383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:43:59.429702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐수 1048
51.6%
대기 781
38.5%
소음진동 202
 
9.9%

Interactions

2023-12-11T07:43:56.709744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:43:59.486149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종별구분
연번1.0000.2690.574
종별0.2691.0000.356
구분0.5740.3561.000
2023-12-11T07:43:59.550804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종별구분
종별1.0000.433
구분0.4331.000
2023-12-11T07:43:59.617877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종별구분
연번1.0000.1150.417
종별0.1151.0000.433
구분0.4170.4331.000

Missing values

2023-12-11T07:43:56.805898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:43:56.888676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명소재지종별업종구분
01동현자동차정비창원시 의창구 차상로18번길 50 (팔용동)5자동차종합수리업대기
12(주)케이비아이테크창원시 의창구 대산면 봉강가술로559번길 84철도차량부품및관련장치물제조업대기
23성우하이텍창원시 의창구 대산면 봉강가술로537번길 165금속절삭기계제조업대기
34현대환경개발(주)창원시 의창구 북면 천주로 991-224비금속광물분쇄물생산업대기
45(주)가현창원시 의창구 평산로38번길 13 (팔용동)4기타일반목적용기계제조업대기
56미래금속창원시 의창구 팔용로359번길 10 (팔용동)5도금업대기
67삼광기계 제2공장창원시 의창구 창원대로204번길 20 (팔용동)5조립금속제품제조업대기
78우성산업사창원시 의창구 팔용로346번길 8 (팔용동)4도금업대기
89유창정밀공업창원시 의창구 평산로8번길 19-40 (팔용동)5주형및금형제조업대기
910(주)피텍창원시 의창구 팔용로346번길 30 (팔용동)5도장및기타피막처리업대기
연번업체명소재지종별업종구분
2021191㈜성일엔케어 진해공장창원시 진해구 가주로15번길 23(가주동)<NA>금속제품가공소음진동
2022192용원레미콘㈜창원시 진해구 용원동 055-547<NA>레미콘제조업소음진동
2023193동오식품상사창원시 진해구 가주동 50외 5필지<NA>식용해조류가공소음진동
2024194㈜삼경글로벌창원시 진해구 성내동 180외 5필지<NA>일반제재업소음진동
2025195부경산업창원시 진해구 웅동로 87(마천동)<NA>플라스틱제품제조업소음진동
2026196㈜해성아이엔티엘창원시 진해구 안골로 108번길 70(안골동)<NA>기타수산동물가공및저장시설소음진동
2027197산양알앤에이㈜창원시 진해구 용원북로 19(용원동)<NA>비금속광물(시멘트제품)소음진동
2028198항도파일㈜창원시 진해구 용원북로 19(용원동)<NA>콘크리트 관 및 기타 구조용 콘크리트제품 제조업소음진동
2029199(주)에스제이티창원시 진해구 안골로 64-10(안골동)<NA>사료제조업소음진동
2030200창성적재함창원시 진해구 안골로 82-5<NA><NA>소음진동