Overview

Dataset statistics

Number of variables11
Number of observations170
Missing cells170
Missing cells (%)9.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.6 KiB
Average record size in memory93.8 B

Variable types

Categorical2
Text4
Unsupported1
Numeric4

Dataset

Description경기도_굴뚝 자동 측정기기 설치 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=H0Z8L0VN8HK3538X7OLL11949113&infSeq=1

Alerts

소재지우편번호 is highly overall correlated with WGS84위도 and 1 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 1 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
구분명 is highly imbalanced (67.1%)Imbalance
사업자등록번호 has 170 (100.0%) missing valuesMissing
사업장명 has unique valuesUnique
사업자등록번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 23:06:15.703459
Analysis finished2023-12-10 23:06:18.484445
Duration2.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)17.1%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
안산시
25 
평택시
21 
화성시
12 
시흥시
10 
양주시
10 
Other values (24)
92 

Length

Max length4
Median length3
Mean length3.0352941
Min length3

Unique

Unique5 ?
Unique (%)2.9%

Sample

1st row고양시
2nd row고양시
3rd row고양시
4th row고양시
5th row고양시

Common Values

ValueCountFrequency (%)
안산시 25
14.7%
평택시 21
 
12.4%
화성시 12
 
7.1%
시흥시 10
 
5.9%
양주시 10
 
5.9%
포천시 9
 
5.3%
용인시 8
 
4.7%
파주시 7
 
4.1%
수원시 7
 
4.1%
이천시 6
 
3.5%
Other values (19) 55
32.4%

Length

2023-12-11T08:06:18.557831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안산시 25
14.7%
평택시 21
 
12.4%
화성시 12
 
7.1%
시흥시 10
 
5.9%
양주시 10
 
5.9%
포천시 9
 
5.3%
용인시 8
 
4.7%
파주시 7
 
4.1%
수원시 7
 
4.1%
이천시 6
 
3.5%
Other values (19) 55
32.4%

사업장명
Text

UNIQUE 

Distinct170
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T08:06:18.789834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length16
Mean length8.4235294
Min length3

Characters and Unicode

Total characters1432
Distinct characters228
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique170 ?
Unique (%)100.0%

Sample

1st row한국동서발전㈜일산발전본부
2nd row서울특별시 난지물재생센터
3rd row한국지역난방공사 삼송지사
4th row한국지역난방공사 중앙지사(난지)
5th row고양시환경에너지시설
ValueCountFrequency (%)
한국지역난방공사 9
 
4.4%
여주공장 3
 
1.5%
주식회사 2
 
1.0%
㈜애니테크 2
 
1.0%
㈜농심 2
 
1.0%
이천지점 1
 
0.5%
㈜스테리싸이클코리아 1
 
0.5%
㈜뉴그린 1
 
0.5%
삼성전자㈜기흥사업장 1
 
0.5%
용인시환경자원화시설 1
 
0.5%
Other values (180) 180
88.7%
2023-12-11T08:06:19.151578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
119
 
8.3%
65
 
4.5%
48
 
3.4%
35
 
2.4%
33
 
2.3%
31
 
2.2%
29
 
2.0%
26
 
1.8%
24
 
1.7%
24
 
1.7%
Other values (218) 998
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1247
87.1%
Other Symbol 119
 
8.3%
Space Separator 33
 
2.3%
Uppercase Letter 15
 
1.0%
Close Punctuation 8
 
0.6%
Open Punctuation 8
 
0.6%
Other Punctuation 1
 
0.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
5.2%
48
 
3.8%
35
 
2.8%
31
 
2.5%
29
 
2.3%
26
 
2.1%
24
 
1.9%
24
 
1.9%
23
 
1.8%
22
 
1.8%
Other values (205) 920
73.8%
Uppercase Letter
ValueCountFrequency (%)
S 6
40.0%
C 2
 
13.3%
G 2
 
13.3%
K 2
 
13.3%
D 1
 
6.7%
L 1
 
6.7%
T 1
 
6.7%
Other Symbol
ValueCountFrequency (%)
119
100.0%
Space Separator
ValueCountFrequency (%)
33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Decimal Number
ValueCountFrequency (%)
4 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1366
95.4%
Common 51
 
3.6%
Latin 15
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
8.7%
65
 
4.8%
48
 
3.5%
35
 
2.6%
31
 
2.3%
29
 
2.1%
26
 
1.9%
24
 
1.8%
24
 
1.8%
23
 
1.7%
Other values (206) 942
69.0%
Latin
ValueCountFrequency (%)
S 6
40.0%
C 2
 
13.3%
G 2
 
13.3%
K 2
 
13.3%
D 1
 
6.7%
L 1
 
6.7%
T 1
 
6.7%
Common
ValueCountFrequency (%)
33
64.7%
) 8
 
15.7%
( 8
 
15.7%
, 1
 
2.0%
4 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1247
87.1%
None 119
 
8.3%
ASCII 66
 
4.6%

Most frequent character per block

None
ValueCountFrequency (%)
119
100.0%
Hangul
ValueCountFrequency (%)
65
 
5.2%
48
 
3.8%
35
 
2.8%
31
 
2.5%
29
 
2.3%
26
 
2.1%
24
 
1.9%
24
 
1.9%
23
 
1.8%
22
 
1.8%
Other values (205) 920
73.8%
ASCII
ValueCountFrequency (%)
33
50.0%
) 8
 
12.1%
( 8
 
12.1%
S 6
 
9.1%
C 2
 
3.0%
G 2
 
3.0%
K 2
 
3.0%
D 1
 
1.5%
L 1
 
1.5%
, 1
 
1.5%
Other values (2) 2
 
3.0%

사업자등록번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing170
Missing (%)100.0%
Memory size1.6 KiB

구분명
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
1종
153 
2종
 
15
3종
 
2

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1종
2nd row1종
3rd row1종
4th row1종
5th row1종

Common Values

ValueCountFrequency (%)
1종 153
90.0%
2종 15
 
8.8%
3종 2
 
1.2%

Length

2023-12-11T08:06:19.292959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:06:19.391354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1종 153
90.0%
2종 15
 
8.8%
3종 2
 
1.2%
Distinct57
Distinct (%)33.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T08:06:19.671713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length28
Mean length13.676471
Min length6

Characters and Unicode

Total characters2325
Distinct characters150
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)20.0%

Sample

1st row화력 발전업
2nd row하수 처리업
3rd row화력 발전업
4th row증기, 냉ㆍ온수 및 공기 조절 공급업
5th row지정 외 폐기물 처리업
ValueCountFrequency (%)
제조업 66
 
9.1%
59
 
8.1%
폐기물 50
 
6.9%
지정 49
 
6.8%
처리업 48
 
6.6%
44
 
6.1%
기타 43
 
5.9%
발전업 21
 
2.9%
증기 20
 
2.8%
냉ㆍ온수 20
 
2.8%
Other values (120) 304
42.0%
2023-12-11T08:06:20.136611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
554
23.8%
170
 
7.3%
142
 
6.1%
94
 
4.0%
91
 
3.9%
62
 
2.7%
61
 
2.6%
59
 
2.5%
59
 
2.5%
55
 
2.4%
Other values (140) 978
42.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1728
74.3%
Space Separator 554
 
23.8%
Other Punctuation 41
 
1.8%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
9.8%
142
 
8.2%
94
 
5.4%
91
 
5.3%
62
 
3.6%
61
 
3.5%
59
 
3.4%
59
 
3.4%
55
 
3.2%
52
 
3.0%
Other values (137) 883
51.1%
Space Separator
ValueCountFrequency (%)
554
100.0%
Other Punctuation
ValueCountFrequency (%)
, 41
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1728
74.3%
Common 597
 
25.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
9.8%
142
 
8.2%
94
 
5.4%
91
 
5.3%
62
 
3.6%
61
 
3.5%
59
 
3.4%
59
 
3.4%
55
 
3.2%
52
 
3.0%
Other values (137) 883
51.1%
Common
ValueCountFrequency (%)
554
92.8%
, 41
 
6.9%
1 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1708
73.5%
ASCII 597
 
25.7%
Compat Jamo 20
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
554
92.8%
, 41
 
6.9%
1 2
 
0.3%
Hangul
ValueCountFrequency (%)
170
 
10.0%
142
 
8.3%
94
 
5.5%
91
 
5.3%
62
 
3.6%
61
 
3.6%
59
 
3.5%
59
 
3.5%
55
 
3.2%
52
 
3.0%
Other values (136) 863
50.5%
Compat Jamo
ValueCountFrequency (%)
20
100.0%

TMS부착굴뚝수
Real number (ℝ)

Distinct12
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.6352941
Minimum1
Maximum117
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-11T08:06:20.231515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q32
95-th percentile6
Maximum117
Range116
Interquartile range (IQR)1

Descriptive statistics

Standard deviation12.498376
Coefficient of variation (CV)3.4380645
Kurtosis69.142398
Mean3.6352941
Median Absolute Deviation (MAD)1
Skewness8.201144
Sum618
Variance156.2094
MonotonicityNot monotonic
2023-12-11T08:06:20.316816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 73
42.9%
2 55
32.4%
3 19
 
11.2%
4 10
 
5.9%
6 3
 
1.8%
5 3
 
1.8%
8 2
 
1.2%
47 1
 
0.6%
10 1
 
0.6%
7 1
 
0.6%
Other values (2) 2
 
1.2%
ValueCountFrequency (%)
1 73
42.9%
2 55
32.4%
3 19
 
11.2%
4 10
 
5.9%
5 3
 
1.8%
6 3
 
1.8%
7 1
 
0.6%
8 2
 
1.2%
10 1
 
0.6%
47 1
 
0.6%
ValueCountFrequency (%)
117 1
 
0.6%
108 1
 
0.6%
47 1
 
0.6%
10 1
 
0.6%
8 2
 
1.2%
7 1
 
0.6%
6 3
 
1.8%
5 3
 
1.8%
4 10
5.9%
3 19
11.2%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct131
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14824.676
Minimum10066
Maximum18626
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-11T08:06:20.422760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10066
5-th percentile10845
Q111993
median15412
Q317374.25
95-th percentile18348.1
Maximum18626
Range8560
Interquartile range (IQR)5381.25

Descriptive statistics

Standard deviation2666.838
Coefficient of variation (CV)0.17989182
Kurtosis-1.3452365
Mean14824.676
Median Absolute Deviation (MAD)2387
Skewness-0.25318442
Sum2520195
Variance7112025.1
MonotonicityNot monotonic
2023-12-11T08:06:20.535744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11022 4
 
2.4%
15619 4
 
2.4%
18126 4
 
2.4%
11138 4
 
2.4%
11426 3
 
1.8%
11413 3
 
1.8%
10542 2
 
1.2%
17113 2
 
1.2%
15093 2
 
1.2%
17956 2
 
1.2%
Other values (121) 140
82.4%
ValueCountFrequency (%)
10066 1
0.6%
10443 2
1.2%
10542 2
1.2%
10594 1
0.6%
10816 1
0.6%
10837 1
0.6%
10845 2
1.2%
10857 1
0.6%
10876 1
0.6%
10896 1
0.6%
ValueCountFrequency (%)
18626 1
0.6%
18623 1
0.6%
18571 1
0.6%
18570 1
0.6%
18512 1
0.6%
18487 1
0.6%
18450 1
0.6%
18448 1
0.6%
18358 1
0.6%
18336 1
0.6%
Distinct164
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T08:06:20.758552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length26
Mean length21.058824
Min length15

Characters and Unicode

Total characters3580
Distinct characters157
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique158 ?
Unique (%)92.9%

Sample

1st row경기도 고양시 일산동구 백석동 1143-1번지
2nd row경기도 고양시 덕양구 현천동 692-2번지
3rd row경기도 고양시 덕양구 동산동 378번지
4th row경기도 고양시 덕양구 현천동 692-2번지
5th row경기도 고양시 일산동구 백석동 1234번지
ValueCountFrequency (%)
경기도 170
 
21.1%
안산시 25
 
3.1%
단원구 24
 
3.0%
평택시 21
 
2.6%
성곡동 14
 
1.7%
화성시 12
 
1.5%
정왕동 10
 
1.2%
시흥시 10
 
1.2%
양주시 10
 
1.2%
포천시 9
 
1.1%
Other values (333) 499
62.1%
2023-12-11T08:06:21.098769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
634
 
17.7%
177
 
4.9%
177
 
4.9%
175
 
4.9%
172
 
4.8%
170
 
4.7%
170
 
4.7%
116
 
3.2%
1 115
 
3.2%
- 86
 
2.4%
Other values (147) 1588
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2257
63.0%
Space Separator 634
 
17.7%
Decimal Number 603
 
16.8%
Dash Punctuation 86
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
177
 
7.8%
177
 
7.8%
175
 
7.8%
172
 
7.6%
170
 
7.5%
170
 
7.5%
116
 
5.1%
70
 
3.1%
57
 
2.5%
46
 
2.0%
Other values (135) 927
41.1%
Decimal Number
ValueCountFrequency (%)
1 115
19.1%
2 78
12.9%
6 70
11.6%
3 57
9.5%
5 56
9.3%
4 52
8.6%
7 48
8.0%
0 47
7.8%
9 44
 
7.3%
8 36
 
6.0%
Space Separator
ValueCountFrequency (%)
634
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2257
63.0%
Common 1323
37.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
177
 
7.8%
177
 
7.8%
175
 
7.8%
172
 
7.6%
170
 
7.5%
170
 
7.5%
116
 
5.1%
70
 
3.1%
57
 
2.5%
46
 
2.0%
Other values (135) 927
41.1%
Common
ValueCountFrequency (%)
634
47.9%
1 115
 
8.7%
- 86
 
6.5%
2 78
 
5.9%
6 70
 
5.3%
3 57
 
4.3%
5 56
 
4.2%
4 52
 
3.9%
7 48
 
3.6%
0 47
 
3.6%
Other values (2) 80
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2257
63.0%
ASCII 1323
37.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
634
47.9%
1 115
 
8.7%
- 86
 
6.5%
2 78
 
5.9%
6 70
 
5.3%
3 57
 
4.3%
5 56
 
4.2%
4 52
 
3.9%
7 48
 
3.6%
0 47
 
3.6%
Other values (2) 80
 
6.0%
Hangul
ValueCountFrequency (%)
177
 
7.8%
177
 
7.8%
175
 
7.8%
172
 
7.6%
170
 
7.5%
170
 
7.5%
116
 
5.1%
70
 
3.1%
57
 
2.5%
46
 
2.0%
Other values (135) 927
41.1%
Distinct164
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-11T08:06:21.332889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length25
Mean length19.658824
Min length14

Characters and Unicode

Total characters3342
Distinct characters193
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique158 ?
Unique (%)92.9%

Sample

1st row경기도 고양시 일산동구 경의로 201
2nd row경기도 고양시 덕양구 대덕로 426
3rd row경기도 고양시 덕양구 동축로 16
4th row경기도 고양시 덕양구 대덕로 426
5th row경기도 고양시 일산동구 경의로 115
ValueCountFrequency (%)
경기도 170
 
21.1%
안산시 25
 
3.1%
단원구 24
 
3.0%
평택시 21
 
2.6%
화성시 12
 
1.5%
양주시 10
 
1.2%
시흥시 10
 
1.2%
포천시 9
 
1.1%
용인시 8
 
1.0%
포승읍 7
 
0.9%
Other values (347) 510
63.3%
2023-12-11T08:06:21.675280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
636
19.0%
179
 
5.4%
177
 
5.3%
176
 
5.3%
171
 
5.1%
160
 
4.8%
1 133
 
4.0%
2 81
 
2.4%
3 62
 
1.9%
4 60
 
1.8%
Other values (183) 1507
45.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2068
61.9%
Space Separator 636
 
19.0%
Decimal Number 608
 
18.2%
Dash Punctuation 28
 
0.8%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
179
 
8.7%
177
 
8.6%
176
 
8.5%
171
 
8.3%
160
 
7.7%
59
 
2.9%
53
 
2.6%
47
 
2.3%
42
 
2.0%
42
 
2.0%
Other values (169) 962
46.5%
Decimal Number
ValueCountFrequency (%)
1 133
21.9%
2 81
13.3%
3 62
10.2%
4 60
9.9%
5 56
9.2%
7 49
 
8.1%
0 48
 
7.9%
6 47
 
7.7%
8 37
 
6.1%
9 35
 
5.8%
Space Separator
ValueCountFrequency (%)
636
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2068
61.9%
Common 1274
38.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
179
 
8.7%
177
 
8.6%
176
 
8.5%
171
 
8.3%
160
 
7.7%
59
 
2.9%
53
 
2.6%
47
 
2.3%
42
 
2.0%
42
 
2.0%
Other values (169) 962
46.5%
Common
ValueCountFrequency (%)
636
49.9%
1 133
 
10.4%
2 81
 
6.4%
3 62
 
4.9%
4 60
 
4.7%
5 56
 
4.4%
7 49
 
3.8%
0 48
 
3.8%
6 47
 
3.7%
8 37
 
2.9%
Other values (4) 65
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2068
61.9%
ASCII 1274
38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
636
49.9%
1 133
 
10.4%
2 81
 
6.4%
3 62
 
4.9%
4 60
 
4.7%
5 56
 
4.4%
7 49
 
3.8%
0 48
 
3.8%
6 47
 
3.7%
8 37
 
2.9%
Other values (4) 65
 
5.1%
Hangul
ValueCountFrequency (%)
179
 
8.7%
177
 
8.6%
176
 
8.5%
171
 
8.3%
160
 
7.7%
59
 
2.9%
53
 
2.6%
47
 
2.3%
42
 
2.0%
42
 
2.0%
Other values (169) 962
46.5%

WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct164
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.40536
Minimum36.955851
Maximum38.049731
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-11T08:06:21.804105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.955851
5-th percentile37.010426
Q137.221978
median37.318929
Q337.58829
95-th percentile37.968867
Maximum38.049731
Range1.0938801
Interquartile range (IQR)0.3663118

Descriptive statistics

Standard deviation0.29946029
Coefficient of variation (CV)0.0080058124
Kurtosis-0.6316148
Mean37.40536
Median Absolute Deviation (MAD)0.17307144
Skewness0.64080167
Sum6358.9112
Variance0.089676467
MonotonicityNot monotonic
2023-12-11T08:06:21.923407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5865557413 2
 
1.2%
37.1404464258 2
 
1.2%
37.2529797883 2
 
1.2%
37.2286936056 2
 
1.2%
37.0077477207 2
 
1.2%
37.8767440783 2
 
1.2%
37.5091980903 1
 
0.6%
37.2787456368 1
 
0.6%
37.2118210121 1
 
0.6%
37.2942111936 1
 
0.6%
Other values (154) 154
90.6%
ValueCountFrequency (%)
36.9558508975 1
0.6%
36.9596905456 1
0.6%
36.9768651298 1
0.6%
36.9779988613 1
0.6%
36.9814490928 1
0.6%
36.9857918727 1
0.6%
36.9867483085 1
0.6%
37.0077477207 2
1.2%
37.0136998293 1
0.6%
37.0153386199 1
0.6%
ValueCountFrequency (%)
38.0497310076 1
0.6%
38.0077493853 1
0.6%
38.0069365787 1
0.6%
38.0065864705 1
0.6%
38.0056901945 1
0.6%
38.0052724215 1
0.6%
37.9970076759 1
0.6%
37.9911661267 1
0.6%
37.9698756561 1
0.6%
37.967634585 1
0.6%

WGS84경도
Real number (ℝ)

HIGH CORRELATION 

Distinct164
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.9866
Minimum126.63805
Maximum127.65347
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-11T08:06:22.043760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.63805
5-th percentile126.71467
Q1126.78652
median126.99505
Q3127.08932
95-th percentile127.40258
Maximum127.65347
Range1.0154206
Interquartile range (IQR)0.30279399

Descriptive statistics

Standard deviation0.21350278
Coefficient of variation (CV)0.0016813016
Kurtosis0.25773339
Mean126.9866
Median Absolute Deviation (MAD)0.1633896
Skewness0.71874144
Sum21587.722
Variance0.045583437
MonotonicityNot monotonic
2023-12-11T08:06:22.168403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.8471495102 2
 
1.2%
127.0558595496 2
 
1.2%
127.4806834787 2
 
1.2%
127.0830303783 2
 
1.2%
126.7982169096 2
 
1.2%
127.0073702677 2
 
1.2%
127.4491849261 1
 
0.6%
127.247998608 1
 
0.6%
127.3943070064 1
 
0.6%
127.4920909297 1
 
0.6%
Other values (154) 154
90.6%
ValueCountFrequency (%)
126.6380537927 1
0.6%
126.6947327633 1
0.6%
126.6998349146 1
0.6%
126.7046172017 1
0.6%
126.7064053189 1
0.6%
126.7072651919 1
0.6%
126.7090570674 1
0.6%
126.7114241379 1
0.6%
126.7131270074 1
0.6%
126.7165509778 1
0.6%
ValueCountFrequency (%)
127.653474376 1
0.6%
127.593153731 1
0.6%
127.5771767 1
0.6%
127.5285081815 1
0.6%
127.4920909297 1
0.6%
127.4806834787 2
1.2%
127.4491849261 1
0.6%
127.4093570094 1
0.6%
127.3943070064 1
0.6%
127.3644789194 1
0.6%

Interactions

2023-12-11T08:06:17.807231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:16.395020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:16.818289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.180577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.889228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:16.491289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:16.917990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.276356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.980563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:16.597941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.007811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.359917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:18.067124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:16.716129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.092386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:06:17.725495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:06:22.250810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명구분명업종명TMS부착굴뚝수소재지우편번호WGS84위도WGS84경도
시군명1.0000.0000.0000.0000.9920.9670.949
구분명0.0001.0000.9190.0000.0000.0000.000
업종명0.0000.9191.0000.7630.0000.0000.704
TMS부착굴뚝수0.0000.0000.7631.0000.0000.0000.000
소재지우편번호0.9920.0000.0000.0001.0000.9350.870
WGS84위도0.9670.0000.0000.0000.9351.0000.730
WGS84경도0.9490.0000.7040.0000.8700.7301.000
2023-12-11T08:06:22.345374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분명시군명
구분명1.0000.000
시군명0.0001.000
2023-12-11T08:06:22.420475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
TMS부착굴뚝수소재지우편번호WGS84위도WGS84경도시군명구분명
TMS부착굴뚝수1.000-0.001-0.089-0.0450.0000.000
소재지우편번호-0.0011.000-0.9240.0020.8860.000
WGS84위도-0.089-0.9241.000-0.0000.7550.000
WGS84경도-0.0450.002-0.0001.0000.6900.000
시군명0.0000.8860.7550.6901.0000.000
구분명0.0000.0000.0000.0000.0001.000

Missing values

2023-12-11T08:06:18.220863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:06:18.409688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명사업장명사업자등록번호구분명업종명TMS부착굴뚝수소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
0고양시한국동서발전㈜일산발전본부<NA>1종화력 발전업610443경기도 고양시 일산동구 백석동 1143-1번지경기도 고양시 일산동구 경의로 20137.645292126.797023
1고양시서울특별시 난지물재생센터<NA>1종하수 처리업210542경기도 고양시 덕양구 현천동 692-2번지경기도 고양시 덕양구 대덕로 42637.586556126.84715
2고양시한국지역난방공사 삼송지사<NA>1종화력 발전업310594경기도 고양시 덕양구 동산동 378번지경기도 고양시 덕양구 동축로 1637.647857126.90523
3고양시한국지역난방공사 중앙지사(난지)<NA>1종증기, 냉ㆍ온수 및 공기 조절 공급업110542경기도 고양시 덕양구 현천동 692-2번지경기도 고양시 덕양구 대덕로 42637.586556126.84715
4고양시고양시환경에너지시설<NA>1종지정 외 폐기물 처리업410443경기도 고양시 일산동구 백석동 1234번지경기도 고양시 일산동구 경의로 11537.643322126.796078
5과천시과천시자원정화센터<NA>1종지정 외 폐기물 처리업113824경기도 과천시 갈현동 205-1번지경기도 과천시 구리안로 17737.407178126.993275
6광명시광명시자원회수시설<NA>1종지정 외 폐기물 처리업214341경기도 광명시 가학동 27번지경기도 광명시 가학로85번길 14237.424668126.863444
7광명시㈜삼천리<NA>1종기타 발전업214354경기도 광명시 일직동 502번지경기도 광명시 자경로 1737.422526126.890848
8광명시기아㈜ 광명공장<NA>1종승용차 및 기타 여객용 자동차 제조업214323경기도 광명시 소하동 610-1번지경기도 광명시 기아로 11337.440639126.89417
9광주시경기환경에너지 주식회사<NA>1종지정 외 폐기물 처리업112717경기도 광주시 곤지암읍 열미리 440-5번지경기도 광주시 곤지암읍 열미길 2237.354049127.364479
시군명사업장명사업자등록번호구분명업종명TMS부착굴뚝수소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
160화성시㈜백철금속<NA>1종그 외 기타 금속 가공업118570경기도 화성시 우정읍 매향리 96-1번지경기도 화성시 우정읍 궁평항로 113-5637.051357126.753719
161화성시화성 그린환경센터<NA>1종지정 외 폐기물 처리업218336경기도 화성시 봉담읍 하가등리 107-1번지경기도 화성시 봉담읍 하가등안길 10037.151167126.937714
162화성시한국지역난방공사 화성지사<NA>1종기타 발전업218450경기도 화성시 석우동 39번지경기도 화성시 큰재봉길 1637.217776127.079991
163화성시㈜알엠 화성공장<NA>2종비금속류 원료 재생업118626경기도 화성시 양감면 사창리 775-1번지경기도 화성시 양감면 사격장길 88-4537.094284126.958221
164화성시신대원에너지㈜,화성그린에너지밸류<NA>1종지정 외 폐기물 처리업118623경기도 화성시 향남읍 구문천리 929-23번지경기도 화성시 향남읍 발안공단로 13937.085936126.903497
165화성시㈜삼부환경<NA>1종지정 외 폐기물 처리업118281경기도 화성시 남양읍 장덕리 765-39번지경기도 화성시 남양읍 매바위로 580-1837.156561126.804209
166화성시㈜신승에너지<NA>1종지정 폐기물 수집, 운반업118281경기도 화성시 남양읍 장덕리 765-1번지경기도 화성시 남양읍 매바위로 570-1037.155289126.804097
167화성시한국지역난방공사 동탄지사<NA>1종증기, 냉ㆍ온수 및 공기 조절 공급업218487경기도 화성시 방교동 830번지경기도 화성시 동탄기흥로 16637.175111127.092472
168화성시기아㈜화성공장<NA>1종승용차 및 기타 여객용 자동차 제조업818571경기도 화성시 우정읍 매향리 966번지경기도 화성시 우정읍 기아자동차로 9537.039269126.788976
169화성시삼성전자㈜화성사업장<NA>1종메모리용 전자집적회로 제조업10818448경기도 화성시 반월동 948번지경기도 화성시 삼성전자로 137.223467127.063344