Overview

Dataset statistics

Number of variables9
Number of observations5092
Missing cells601
Missing cells (%)1.3%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory373.1 KiB
Average record size in memory75.0 B

Variable types

Categorical2
Text4
Numeric3

Dataset

Description경기도_유해화학물질 취급 사업장 현황
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=ZS7P7P6A6I4XZDVB349W19085839&infSeq=1

Alerts

Dataset has 2 (< 0.1%) duplicate rowsDuplicates
소재지우편번호 is highly overall correlated with WGS84위도 and 1 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 1 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
전화번호 has 330 (6.5%) missing valuesMissing
소재지지번주소 has 61 (1.2%) missing valuesMissing
WGS84위도 has 82 (1.6%) missing valuesMissing
WGS84경도 has 82 (1.6%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:29:42.213784
Analysis finished2023-12-10 22:29:44.708675
Duration2.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size39.9 KiB
안산시
1030 
화성시
606 
시흥시
526 
평택시
317 
부천시
249 
Other values (26)
2364 

Length

Max length4
Median length3
Mean length3.0288688
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row고양시

Common Values

ValueCountFrequency (%)
안산시 1030
20.2%
화성시 606
11.9%
시흥시 526
 
10.3%
평택시 317
 
6.2%
부천시 249
 
4.9%
안양시 245
 
4.8%
성남시 211
 
4.1%
고양시 194
 
3.8%
용인시 186
 
3.7%
수원시 179
 
3.5%
Other values (21) 1349
26.5%

Length

2023-12-11T07:29:44.768745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안산시 1030
20.2%
화성시 606
11.9%
시흥시 526
 
10.3%
평택시 317
 
6.2%
부천시 249
 
4.9%
안양시 245
 
4.8%
성남시 211
 
4.1%
고양시 194
 
3.8%
용인시 186
 
3.7%
수원시 179
 
3.5%
Other values (21) 1349
26.5%
Distinct4559
Distinct (%)89.5%
Missing0
Missing (%)0.0%
Memory size39.9 KiB
2023-12-11T07:29:45.027450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length26
Mean length6.7743519
Min length2

Characters and Unicode

Total characters34495
Distinct characters613
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4139 ?
Unique (%)81.3%

Sample

1st row삼화페인트칠칠공사
2nd row대성우드
3rd row삼화페인트가평공사
4th row노루표페인트㈜
5th row주식회사솔팩서비스
ValueCountFrequency (%)
주식회사 691
 
11.3%
안산지점 14
 
0.2%
제2공장 13
 
0.2%
안산공장 13
 
0.2%
반월공장 11
 
0.2%
지점 11
 
0.2%
2공장 10
 
0.2%
삼화페인트 9
 
0.1%
유한회사 9
 
0.1%
현대상사 8
 
0.1%
Other values (4642) 5310
87.1%
2023-12-11T07:29:45.435337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1667
 
4.8%
1512
 
4.4%
1266
 
3.7%
1009
 
2.9%
1004
 
2.9%
891
 
2.6%
874
 
2.5%
756
 
2.2%
651
 
1.9%
554
 
1.6%
Other values (603) 24311
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30174
87.5%
Other Symbol 1667
 
4.8%
Space Separator 1009
 
2.9%
Close Punctuation 485
 
1.4%
Open Punctuation 485
 
1.4%
Uppercase Letter 417
 
1.2%
Decimal Number 101
 
0.3%
Lowercase Letter 98
 
0.3%
Other Punctuation 55
 
0.2%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1512
 
5.0%
1266
 
4.2%
1004
 
3.3%
891
 
3.0%
874
 
2.9%
756
 
2.5%
651
 
2.2%
554
 
1.8%
552
 
1.8%
508
 
1.7%
Other values (545) 21606
71.6%
Uppercase Letter
ValueCountFrequency (%)
C 73
17.5%
S 40
 
9.6%
E 28
 
6.7%
T 27
 
6.5%
K 27
 
6.5%
L 23
 
5.5%
P 22
 
5.3%
N 19
 
4.6%
M 19
 
4.6%
H 17
 
4.1%
Other values (13) 122
29.3%
Lowercase Letter
ValueCountFrequency (%)
e 19
19.4%
h 11
11.2%
c 11
11.2%
a 8
8.2%
m 8
8.2%
t 6
 
6.1%
n 6
 
6.1%
i 6
 
6.1%
r 6
 
6.1%
l 4
 
4.1%
Other values (6) 13
13.3%
Decimal Number
ValueCountFrequency (%)
2 55
54.5%
1 24
23.8%
3 11
 
10.9%
4 4
 
4.0%
5 3
 
3.0%
8 2
 
2.0%
6 1
 
1.0%
0 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 30
54.5%
& 20
36.4%
, 4
 
7.3%
? 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 484
99.8%
] 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 484
99.8%
[ 1
 
0.2%
Other Symbol
ValueCountFrequency (%)
1667
100.0%
Space Separator
ValueCountFrequency (%)
1009
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31841
92.3%
Common 2139
 
6.2%
Latin 515
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1667
 
5.2%
1512
 
4.7%
1266
 
4.0%
1004
 
3.2%
891
 
2.8%
874
 
2.7%
756
 
2.4%
651
 
2.0%
554
 
1.7%
552
 
1.7%
Other values (546) 22114
69.5%
Latin
ValueCountFrequency (%)
C 73
 
14.2%
S 40
 
7.8%
E 28
 
5.4%
T 27
 
5.2%
K 27
 
5.2%
L 23
 
4.5%
P 22
 
4.3%
N 19
 
3.7%
M 19
 
3.7%
e 19
 
3.7%
Other values (29) 218
42.3%
Common
ValueCountFrequency (%)
1009
47.2%
) 484
22.6%
( 484
22.6%
2 55
 
2.6%
. 30
 
1.4%
1 24
 
1.1%
& 20
 
0.9%
3 11
 
0.5%
, 4
 
0.2%
- 4
 
0.2%
Other values (8) 14
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30174
87.5%
ASCII 2654
 
7.7%
None 1667
 
4.8%

Most frequent character per block

None
ValueCountFrequency (%)
1667
100.0%
Hangul
ValueCountFrequency (%)
1512
 
5.0%
1266
 
4.2%
1004
 
3.3%
891
 
3.0%
874
 
2.9%
756
 
2.5%
651
 
2.2%
554
 
1.8%
552
 
1.8%
508
 
1.7%
Other values (545) 21606
71.6%
ASCII
ValueCountFrequency (%)
1009
38.0%
) 484
18.2%
( 484
18.2%
C 73
 
2.8%
2 55
 
2.1%
S 40
 
1.5%
. 30
 
1.1%
E 28
 
1.1%
T 27
 
1.0%
K 27
 
1.0%
Other values (47) 397
 
15.0%

전화번호
Text

MISSING 

Distinct4255
Distinct (%)89.4%
Missing330
Missing (%)6.5%
Memory size39.9 KiB
2023-12-11T07:29:45.677649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.053339
Min length9

Characters and Unicode

Total characters57398
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3855 ?
Unique (%)81.0%

Sample

1st row031-585-7704
2nd row031-585-8292
3rd row031-581-3377
4th row031-581-2557
5th row031-906-5551
ValueCountFrequency (%)
031-864-5778 8
 
0.2%
031-498-4555 6
 
0.1%
031-352-6177 5
 
0.1%
031-491-7979 4
 
0.1%
031-497-5700 4
 
0.1%
031-499-2311 4
 
0.1%
031-495-4055 4
 
0.1%
031-508-6700 4
 
0.1%
031-499-5334 4
 
0.1%
031-383-6525 4
 
0.1%
Other values (4245) 4715
99.0%
2023-12-11T07:29:46.307132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 9516
16.6%
0 8326
14.5%
3 8013
14.0%
1 7365
12.8%
4 4111
7.2%
7 3604
 
6.3%
2 3369
 
5.9%
9 3360
 
5.9%
8 3265
 
5.7%
5 3253
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 47882
83.4%
Dash Punctuation 9516
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8326
17.4%
3 8013
16.7%
1 7365
15.4%
4 4111
8.6%
7 3604
7.5%
2 3369
7.0%
9 3360
7.0%
8 3265
 
6.8%
5 3253
 
6.8%
6 3216
 
6.7%
Dash Punctuation
ValueCountFrequency (%)
- 9516
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 57398
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 9516
16.6%
0 8326
14.5%
3 8013
14.0%
1 7365
12.8%
4 4111
7.2%
7 3604
 
6.3%
2 3369
 
5.9%
9 3360
 
5.9%
8 3265
 
5.7%
5 3253
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 57398
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 9516
16.6%
0 8326
14.5%
3 8013
14.0%
1 7365
12.8%
4 4111
7.2%
7 3604
 
6.3%
2 3369
 
5.9%
9 3360
 
5.9%
8 3265
 
5.7%
5 3253
 
5.7%

업종명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size39.9 KiB
판매업
2744 
사용업
1895 
제조업
325 
운반업
 
69
보관저장업
 
59

Length

Max length5
Median length3
Mean length3.0231736
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row판매업
2nd row판매업
3rd row판매업
4th row판매업
5th row판매업

Common Values

ValueCountFrequency (%)
판매업 2744
53.9%
사용업 1895
37.2%
제조업 325
 
6.4%
운반업 69
 
1.4%
보관저장업 59
 
1.2%

Length

2023-12-11T07:29:46.477250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:29:46.582932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
판매업 2744
53.9%
사용업 1895
37.2%
제조업 325
 
6.4%
운반업 69
 
1.4%
보관저장업 59
 
1.2%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct1545
Distinct (%)30.6%
Missing46
Missing (%)0.9%
Infinite0
Infinite (%)0.0%
Mean15027.923
Minimum10003
Maximum18635
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.9 KiB
2023-12-11T07:29:46.697891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10003
5-th percentile10401
Q113595
median15423
Q317076
95-th percentile18554
Maximum18635
Range8632
Interquartile range (IQR)3481

Descriptive statistics

Standard deviation2481.4124
Coefficient of variation (CV)0.16512012
Kurtosis-0.69859026
Mean15027.923
Median Absolute Deviation (MAD)1720.5
Skewness-0.42884141
Sum75830899
Variance6157407.5
MonotonicityNot monotonic
2023-12-11T07:29:46.825313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15431 126
 
2.5%
15103 87
 
1.7%
15433 66
 
1.3%
14117 63
 
1.2%
15528 56
 
1.1%
15090 54
 
1.1%
18623 54
 
1.1%
15091 48
 
0.9%
18543 47
 
0.9%
15429 45
 
0.9%
Other values (1535) 4400
86.4%
(Missing) 46
 
0.9%
ValueCountFrequency (%)
10003 1
 
< 0.1%
10008 6
0.1%
10010 3
0.1%
10011 6
0.1%
10012 3
0.1%
10014 1
 
< 0.1%
10016 2
 
< 0.1%
10017 2
 
< 0.1%
10022 4
0.1%
10023 1
 
< 0.1%
ValueCountFrequency (%)
18635 7
0.1%
18634 1
 
< 0.1%
18633 4
0.1%
18632 2
 
< 0.1%
18631 3
0.1%
18630 2
 
< 0.1%
18628 5
0.1%
18627 6
0.1%
18626 4
0.1%
18625 4
0.1%

소재지지번주소
Text

MISSING 

Distinct3572
Distinct (%)71.0%
Missing61
Missing (%)1.2%
Memory size39.9 KiB
2023-12-11T07:29:47.120465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length44
Mean length21.921487
Min length11

Characters and Unicode

Total characters110287
Distinct characters378
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2942 ?
Unique (%)58.5%

Sample

1st row경기도 가평군 조종면 현리 322-4번지
2nd row경기도 가평군 설악면 선촌리 368-4번지
3rd row경기도 가평군 가평읍 달전리 338-1번지
4th row경기도 가평군 가평읍 읍내리 338-4번지
5th row경기도 고양시 일산동구 장항동 893-1번지
ValueCountFrequency (%)
경기도 5031
 
21.1%
안산시 1023
 
4.3%
단원구 916
 
3.8%
화성시 599
 
2.5%
시흥시 526
 
2.2%
정왕동 467
 
2.0%
평택시 314
 
1.3%
성곡동 254
 
1.1%
안양시 242
 
1.0%
부천시 236
 
1.0%
Other values (4233) 14248
59.7%
2023-12-11T07:29:47.610117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18825
 
17.1%
5738
 
5.2%
5241
 
4.8%
5137
 
4.7%
5080
 
4.6%
5044
 
4.6%
4878
 
4.4%
3933
 
3.6%
- 3852
 
3.5%
1 3841
 
3.5%
Other values (368) 48718
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67143
60.9%
Decimal Number 20423
 
18.5%
Space Separator 18825
 
17.1%
Dash Punctuation 3852
 
3.5%
Lowercase Letter 18
 
< 0.1%
Uppercase Letter 16
 
< 0.1%
Other Punctuation 6
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5738
 
8.5%
5241
 
7.8%
5137
 
7.7%
5080
 
7.6%
5044
 
7.5%
4878
 
7.3%
3933
 
5.9%
2179
 
3.2%
1829
 
2.7%
1762
 
2.6%
Other values (336) 26322
39.2%
Decimal Number
ValueCountFrequency (%)
1 3841
18.8%
2 2436
11.9%
3 2117
10.4%
5 2090
10.2%
4 2084
10.2%
6 2061
10.1%
7 1677
8.2%
9 1620
7.9%
8 1280
 
6.3%
0 1217
 
6.0%
Lowercase Letter
ValueCountFrequency (%)
m 4
22.2%
l 2
11.1%
i 2
11.1%
e 2
11.1%
c 2
11.1%
a 2
11.1%
t 2
11.1%
u 2
11.1%
Uppercase Letter
ValueCountFrequency (%)
C 5
31.2%
S 3
18.8%
I 2
 
12.5%
P 2
 
12.5%
B 2
 
12.5%
A 1
 
6.2%
E 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 3
50.0%
? 2
33.3%
/ 1
 
16.7%
Space Separator
ValueCountFrequency (%)
18825
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3852
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67143
60.9%
Common 43110
39.1%
Latin 34
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5738
 
8.5%
5241
 
7.8%
5137
 
7.7%
5080
 
7.6%
5044
 
7.5%
4878
 
7.3%
3933
 
5.9%
2179
 
3.2%
1829
 
2.7%
1762
 
2.6%
Other values (336) 26322
39.2%
Common
ValueCountFrequency (%)
18825
43.7%
- 3852
 
8.9%
1 3841
 
8.9%
2 2436
 
5.7%
3 2117
 
4.9%
5 2090
 
4.8%
4 2084
 
4.8%
6 2061
 
4.8%
7 1677
 
3.9%
9 1620
 
3.8%
Other values (7) 2507
 
5.8%
Latin
ValueCountFrequency (%)
C 5
14.7%
m 4
11.8%
S 3
 
8.8%
I 2
 
5.9%
l 2
 
5.9%
i 2
 
5.9%
e 2
 
5.9%
c 2
 
5.9%
a 2
 
5.9%
t 2
 
5.9%
Other values (5) 8
23.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67143
60.9%
ASCII 43144
39.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18825
43.6%
- 3852
 
8.9%
1 3841
 
8.9%
2 2436
 
5.6%
3 2117
 
4.9%
5 2090
 
4.8%
4 2084
 
4.8%
6 2061
 
4.8%
7 1677
 
3.9%
9 1620
 
3.8%
Other values (22) 2541
 
5.9%
Hangul
ValueCountFrequency (%)
5738
 
8.5%
5241
 
7.8%
5137
 
7.7%
5080
 
7.6%
5044
 
7.5%
4878
 
7.3%
3933
 
5.9%
2179
 
3.2%
1829
 
2.7%
1762
 
2.6%
Other values (336) 26322
39.2%
Distinct3839
Distinct (%)75.4%
Missing0
Missing (%)0.0%
Memory size39.9 KiB
2023-12-11T07:29:47.909606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length34
Mean length20.923409
Min length11

Characters and Unicode

Total characters106542
Distinct characters418
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3266 ?
Unique (%)64.1%

Sample

1st row경기도 가평군 하면 현창로73-1
2nd row경기도 가평군 설악면 유명로 1685
3rd row경기도 가평군 가평읍 달전로49
4th row경기도 가평군 가평읍 연인길12
5th row경기도 고양시 장항동 일산동구장백로194
ValueCountFrequency (%)
경기도 5095
 
21.6%
안산시 1030
 
4.4%
화성시 607
 
2.6%
시흥시 527
 
2.2%
정왕동 466
 
2.0%
평택시 317
 
1.3%
성곡동 255
 
1.1%
부천시 249
 
1.1%
안양시 245
 
1.0%
성남시 211
 
0.9%
Other values (4273) 14638
61.9%
2023-12-11T07:29:48.369629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18548
 
17.4%
5941
 
5.6%
5368
 
5.0%
5285
 
5.0%
5230
 
4.9%
4338
 
4.1%
1 3960
 
3.7%
3124
 
2.9%
2 2761
 
2.6%
2128
 
2.0%
Other values (408) 49859
46.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67726
63.6%
Decimal Number 19092
 
17.9%
Space Separator 18548
 
17.4%
Dash Punctuation 1146
 
1.1%
Other Punctuation 15
 
< 0.1%
Uppercase Letter 13
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5941
 
8.8%
5368
 
7.9%
5285
 
7.8%
5230
 
7.7%
4338
 
6.4%
3124
 
4.6%
2128
 
3.1%
2086
 
3.1%
2048
 
3.0%
1443
 
2.1%
Other values (382) 30735
45.4%
Decimal Number
ValueCountFrequency (%)
1 3960
20.7%
2 2761
14.5%
3 2114
11.1%
4 1891
9.9%
5 1546
 
8.1%
7 1517
 
7.9%
6 1445
 
7.6%
9 1304
 
6.8%
8 1280
 
6.7%
0 1274
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
B 2
15.4%
E 2
15.4%
L 2
15.4%
C 2
15.4%
V 1
7.7%
I 1
7.7%
T 1
7.7%
A 1
7.7%
Y 1
7.7%
Other Punctuation
ValueCountFrequency (%)
. 9
60.0%
? 5
33.3%
/ 1
 
6.7%
Close Punctuation
ValueCountFrequency (%)
) 1
50.0%
] 1
50.0%
Space Separator
ValueCountFrequency (%)
18548
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1146
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67726
63.6%
Common 38803
36.4%
Latin 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5941
 
8.8%
5368
 
7.9%
5285
 
7.8%
5230
 
7.7%
4338
 
6.4%
3124
 
4.6%
2128
 
3.1%
2086
 
3.1%
2048
 
3.0%
1443
 
2.1%
Other values (382) 30735
45.4%
Common
ValueCountFrequency (%)
18548
47.8%
1 3960
 
10.2%
2 2761
 
7.1%
3 2114
 
5.4%
4 1891
 
4.9%
5 1546
 
4.0%
7 1517
 
3.9%
6 1445
 
3.7%
9 1304
 
3.4%
8 1280
 
3.3%
Other values (7) 2437
 
6.3%
Latin
ValueCountFrequency (%)
B 2
15.4%
E 2
15.4%
L 2
15.4%
C 2
15.4%
V 1
7.7%
I 1
7.7%
T 1
7.7%
A 1
7.7%
Y 1
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67726
63.6%
ASCII 38816
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18548
47.8%
1 3960
 
10.2%
2 2761
 
7.1%
3 2114
 
5.4%
4 1891
 
4.9%
5 1546
 
4.0%
7 1517
 
3.9%
6 1445
 
3.7%
9 1304
 
3.4%
8 1280
 
3.3%
Other values (16) 2450
 
6.3%
Hangul
ValueCountFrequency (%)
5941
 
8.8%
5368
 
7.9%
5285
 
7.8%
5230
 
7.7%
4338
 
6.4%
3124
 
4.6%
2128
 
3.1%
2086
 
3.1%
2048
 
3.0%
1443
 
2.1%
Other values (382) 30735
45.4%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3518
Distinct (%)70.2%
Missing82
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean37.367957
Minimum36.921577
Maximum38.123564
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.9 KiB
2023-12-11T07:29:48.540444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.921577
5-th percentile37.024037
Q137.250564
median37.32482
Q337.444722
95-th percentile37.841529
Maximum38.123564
Range1.2019873
Interquartile range (IQR)0.19415741

Descriptive statistics

Standard deviation0.22689805
Coefficient of variation (CV)0.0060719952
Kurtosis0.22150713
Mean37.367957
Median Absolute Deviation (MAD)0.10180191
Skewness0.70268648
Sum187213.46
Variance0.051482726
MonotonicityNot monotonic
2023-12-11T07:29:48.680375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.3347248053 87
 
1.7%
37.3248199688 55
 
1.1%
37.3379338971 54
 
1.1%
37.3716127805 54
 
1.1%
37.3244038958 51
 
1.0%
37.0844123125 26
 
0.5%
37.3213219086 25
 
0.5%
37.3926182411 21
 
0.4%
37.3201262611 21
 
0.4%
37.438418567 19
 
0.4%
Other values (3508) 4597
90.3%
(Missing) 82
 
1.6%
ValueCountFrequency (%)
36.9215766277 1
 
< 0.1%
36.9494726346 1
 
< 0.1%
36.9523591039 1
 
< 0.1%
36.9528822167 1
 
< 0.1%
36.9531805494 2
< 0.1%
36.9547253645 1
 
< 0.1%
36.9550273911 3
0.1%
36.9553922829 1
 
< 0.1%
36.9556277524 1
 
< 0.1%
36.9558182704 2
< 0.1%
ValueCountFrequency (%)
38.1235639079 1
< 0.1%
38.0975323133 1
< 0.1%
38.0944284566 1
< 0.1%
38.0726009588 1
< 0.1%
38.0708166728 2
< 0.1%
38.0550284368 2
< 0.1%
38.0497310076 1
< 0.1%
38.046364633 1
< 0.1%
38.028333191 1
< 0.1%
38.0278453361 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct3518
Distinct (%)70.2%
Missing82
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean126.91421
Minimum126.54562
Maximum127.68684
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size44.9 KiB
2023-12-11T07:29:48.829473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.54562
5-th percentile126.70304
Q1126.77252
median126.86313
Q3127.05144
95-th percentile127.26063
Maximum127.68684
Range1.1412273
Interquartile range (IQR)0.27891282

Descriptive statistics

Standard deviation0.18969085
Coefficient of variation (CV)0.0014946385
Kurtosis0.41477008
Mean126.91421
Median Absolute Deviation (MAD)0.11122182
Skewness0.84733103
Sum635840.17
Variance0.035982619
MonotonicityNot monotonic
2023-12-11T07:29:48.992780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.7265416247 87
 
1.7%
126.7870055968 55
 
1.1%
126.7238219689 54
 
1.1%
126.9514327046 54
 
1.1%
126.789763526 51
 
1.0%
126.8980854718 26
 
0.5%
126.7898828225 25
 
0.5%
126.9541265594 21
 
0.4%
126.7904132393 21
 
0.4%
127.1780452538 19
 
0.4%
Other values (3508) 4597
90.3%
(Missing) 82
 
1.6%
ValueCountFrequency (%)
126.5456164043 2
< 0.1%
126.546660519 2
< 0.1%
126.5468561086 3
0.1%
126.5471841376 4
0.1%
126.5476160049 1
 
< 0.1%
126.548315156 1
 
< 0.1%
126.5495244569 1
 
< 0.1%
126.5520668386 1
 
< 0.1%
126.5529233926 1
 
< 0.1%
126.5574080063 1
 
< 0.1%
ValueCountFrequency (%)
127.686843709 1
< 0.1%
127.6618110691 1
< 0.1%
127.6502339557 1
< 0.1%
127.6438940339 1
< 0.1%
127.640326505 1
< 0.1%
127.6345924993 1
< 0.1%
127.6337572553 1
< 0.1%
127.632613107 1
< 0.1%
127.6310267958 1
< 0.1%
127.6250104963 1
< 0.1%

Interactions

2023-12-11T07:29:44.081054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:43.425890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:43.803542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:44.166463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:43.527312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:43.901592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:44.255626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:43.673254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:43.983150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:29:49.083788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명업종명소재지우편번호WGS84위도WGS84경도
시군명1.0000.4580.9920.9590.938
업종명0.4581.0000.4450.3470.305
소재지우편번호0.9920.4451.0000.9170.864
WGS84위도0.9590.3470.9171.0000.677
WGS84경도0.9380.3050.8640.6771.000
2023-12-11T07:29:49.205606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명시군명
업종명1.0000.235
시군명0.2351.000
2023-12-11T07:29:49.282964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지우편번호WGS84위도WGS84경도시군명업종명
소재지우편번호1.000-0.9350.1530.9330.200
WGS84위도-0.9351.000-0.1560.7670.151
WGS84경도0.153-0.1561.0000.6940.131
시군명0.9330.7670.6941.0000.235
업종명0.2000.1510.1310.2351.000

Missing values

2023-12-11T07:29:44.380740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:29:44.519905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:29:44.640549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명전화번호업종명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
0가평군삼화페인트칠칠공사031-585-7704판매업12437경기도 가평군 조종면 현리 322-4번지경기도 가평군 하면 현창로73-137.820486127.34586
1가평군대성우드031-585-8292판매업12466경기도 가평군 설악면 선촌리 368-4번지경기도 가평군 설악면 유명로 168537.677788127.483704
2가평군삼화페인트가평공사031-581-3377판매업12422경기도 가평군 가평읍 달전리 338-1번지경기도 가평군 가평읍 달전로4937.814537127.518667
3가평군노루표페인트㈜031-581-2557판매업12418경기도 가평군 가평읍 읍내리 338-4번지경기도 가평군 가평읍 연인길1237.8284127.512932
4고양시주식회사솔팩서비스031-906-5551판매업10414경기도 고양시 일산동구 장항동 893-1번지경기도 고양시 장항동 일산동구장백로19437.652091126.77669
5고양시제이에이치코리아031-813-5743판매업10364경기도 고양시 일산동구 장항동 750-1번지경기도 고양시 일산동구 호수로67237.661475126.765159
6고양시비에스글로벌 주식회사031-812-7113판매업10414경기도 고양시 일산동구 장항동 890-4번지경기도 고양시 일산동구 중앙로 119337.653092126.776623
7고양시홍주교역031-921-0273판매업10364경기도 고양시 일산동구 장항동 730-1번지경기도 고양시 일산동구 고봉로 32-937.664835126.76635
8고양시(주)켐트리070-5014-3032판매업10449경기도 고양시 일산동구 백석동 1323번지경기도 고양시 백석동 호수로358-3937.640363126.78671
9고양시(주)더피유031-924-3609판매업10366경기도 고양시 일산동구 장항동 727-2번지경기도 고양시 일산동구 중앙로134137.664649126.768336
시군명사업장명전화번호업종명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도
5082화성시하나페인트031-352-9744판매업18525경기도 화성시 팔탄면 구장리 12-1번지경기도 화성시 팔탄면 시청로94337.169129126.893235
5083화성시중부울트라켐텍031-354-6749판매업18624경기도 화성시 향남읍 동오리 208-2번지경기도 화성시 향남읍 발안로474번길237.130605126.958739
5084화성시페인트마트031-358-1477판매업18581경기도 화성시 장안면 수촌리 425-46번지경기도 화성시 장안면 수정로41737.101986126.864511
5085화성시(주)서현케미칼031-358-5555보관저장업18581경기도 화성시 장안면 수촌리 876-1번지경기도 화성시 장안면 금의솔안길43-737.105279126.841188
5086화성시유한회사 청명031-355-2887사용업18274경기도 화성시 남양읍 무송리 154-3번지경기도 화성시 남양읍 현대기아로487번길 35-2637.198069126.843435
5087화성시에스테크 주식회사031-351-9235사용업18559경기도 화성시 우정읍 주곡리 505-19번지경기도 화성시 우정읍 쌍봉로 422-1137.112332126.800123
5088화성시삼원공업 주식회사031-366-5805제조업18525경기도 화성시 팔탄면 창곡리 27-12번지경기도 화성시 팔탄면 주석로778번길 54-1937.192603126.893476
5089화성시주식회사 대명내이처오션031-355-4597제조업18554경기도 화성시 서신면 전곡리 1095-7번지경기도 화성시 서신면 전곡산단4길 3837.192116126.671686
5090화성시주식회사 케이앤티로지스틱스031-8059-5015보관저장업18633경기도 화성시 양감면 요당리 103-12번지경기도 화성시 양감면 은행나무로62번길 18-9937.062708126.939985
5091화성시페트로켐031-296-1209판매업18334경기도 화성시 봉담읍 당하리 157번지경기도 화성시 봉담읍 삼천병마로85937.184501126.93776

Duplicate rows

Most frequently occurring

시군명사업장명전화번호업종명소재지우편번호소재지지번주소소재지도로명주소WGS84위도WGS84경도# duplicates
0시흥시동인산업㈜031-499-5334제조업15096경기도 시흥시 정왕동 1357-3번지경기도 시흥시 정왕동 경제로 9437.336544126.7072422
1시흥시삼성켐텍031-498-1433판매업15090경기도 시흥시 정왕동 1364번지경기도 시흥시 정왕동 공단1대로 20437.337934126.7238222