Overview

Dataset statistics

Number of variables20
Number of observations1986
Missing cells11449
Missing cells (%)28.8%
Duplicate rows14
Duplicate rows (%)0.7%
Total size in memory327.9 KiB
Average record size in memory169.1 B

Variable types

Categorical6
Text4
DateTime2
Unsupported2
Numeric6

Alerts

Dataset has 14 (0.7%) duplicate rowsDuplicates
영업상태명 is highly overall correlated with 영업상태구분코드High correlation
영업상태구분코드 is highly overall correlated with 영업상태명High correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with X좌표값 and 1 other fieldsHigh correlation
X좌표값 is highly overall correlated with WGS84경도 and 1 other fieldsHigh correlation
Y좌표값 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 4 other fieldsHigh correlation
사업장부지용도구분명 is highly imbalanced (68.6%)Imbalance
인허가취소일자 has 1986 (100.0%) missing valuesMissing
폐업일자 has 1705 (85.9%) missing valuesMissing
소재지시설전화번호 has 1986 (100.0%) missing valuesMissing
소재지면적정보 has 1868 (94.1%) missing valuesMissing
도로명우편번호 has 1426 (71.8%) missing valuesMissing
소재지도로명주소 has 297 (15.0%) missing valuesMissing
소재지우편번호 has 92 (4.6%) missing valuesMissing
WGS84위도 has 104 (5.2%) missing valuesMissing
WGS84경도 has 104 (5.2%) missing valuesMissing
X좌표값 has 939 (47.3%) missing valuesMissing
Y좌표값 has 939 (47.3%) missing valuesMissing
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지시설전화번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 21:44:32.213905
Analysis finished2023-12-10 21:44:38.302680
Duration6.09 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct32
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
화성시
306 
평택시
239 
용인시
149 
안산시
126 
안성시
113 
Other values (27)
1053 

Length

Max length4
Median length3
Mean length3.0312185
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
화성시 306
15.4%
평택시 239
 
12.0%
용인시 149
 
7.5%
안산시 126
 
6.3%
안성시 113
 
5.7%
이천시 112
 
5.6%
김포시 100
 
5.0%
수원시 80
 
4.0%
포천시 72
 
3.6%
광주시 66
 
3.3%
Other values (22) 623
31.4%

Length

2023-12-11T06:44:38.373869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화성시 306
15.4%
평택시 239
 
12.0%
용인시 149
 
7.5%
안산시 126
 
6.3%
안성시 113
 
5.7%
이천시 112
 
5.6%
김포시 100
 
5.0%
수원시 80
 
4.0%
포천시 72
 
3.6%
광주시 66
 
3.3%
Other values (22) 623
31.4%
Distinct1552
Distinct (%)78.1%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
2023-12-11T06:44:38.622195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length29
Mean length9.6973817
Min length1

Characters and Unicode

Total characters19259
Distinct characters525
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1306 ?
Unique (%)65.8%

Sample

1st row대성의료종합가스(주) 현리공장
2nd row팜스코(주)
3rd row세계평화통일가정연합
4th rowSK종합가스
5th row삼성물산(주) 가평베네스트골프클럽
ValueCountFrequency (%)
주식회사 84
 
3.5%
에스피엘(주 31
 
1.3%
삼성전자(주 28
 
1.2%
주)맥서브 20
 
0.8%
수소충전소 18
 
0.7%
에어프로덕츠코리아(주 12
 
0.5%
주)제이투가스 12
 
0.5%
블루코브일반사모부동산투자신탁4호 11
 
0.5%
린데코리아(주 11
 
0.5%
sk하이닉스(주 10
 
0.4%
Other values (1671) 2194
90.3%
2023-12-11T06:44:39.334804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 1388
 
7.2%
( 1387
 
7.2%
1217
 
6.3%
839
 
4.4%
548
 
2.8%
447
 
2.3%
336
 
1.7%
332
 
1.7%
297
 
1.5%
263
 
1.4%
Other values (515) 12205
63.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14764
76.7%
Close Punctuation 1410
 
7.3%
Open Punctuation 1410
 
7.3%
Decimal Number 631
 
3.3%
Space Separator 447
 
2.3%
Uppercase Letter 372
 
1.9%
Dash Punctuation 142
 
0.7%
Lowercase Letter 33
 
0.2%
Other Punctuation 28
 
0.1%
Other Symbol 18
 
0.1%
Other values (3) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1217
 
8.2%
839
 
5.7%
548
 
3.7%
336
 
2.3%
332
 
2.2%
297
 
2.0%
263
 
1.8%
256
 
1.7%
252
 
1.7%
230
 
1.6%
Other values (456) 10194
69.0%
Uppercase Letter
ValueCountFrequency (%)
C 51
13.7%
S 48
12.9%
D 42
11.3%
U 37
9.9%
K 28
 
7.5%
M 17
 
4.6%
G 15
 
4.0%
L 15
 
4.0%
T 14
 
3.8%
N 14
 
3.8%
Other values (11) 91
24.5%
Lowercase Letter
ValueCountFrequency (%)
e 7
21.2%
r 6
18.2%
a 4
12.1%
n 3
9.1%
g 2
 
6.1%
o 2
 
6.1%
s 2
 
6.1%
i 1
 
3.0%
t 1
 
3.0%
c 1
 
3.0%
Other values (4) 4
12.1%
Decimal Number
ValueCountFrequency (%)
1 185
29.3%
2 128
20.3%
0 69
 
10.9%
3 62
 
9.8%
4 54
 
8.6%
9 48
 
7.6%
5 27
 
4.3%
8 20
 
3.2%
7 20
 
3.2%
6 18
 
2.9%
Other Punctuation
ValueCountFrequency (%)
# 11
39.3%
. 10
35.7%
, 5
17.9%
& 2
 
7.1%
Close Punctuation
ValueCountFrequency (%)
) 1388
98.4%
] 22
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 1387
98.4%
[ 23
 
1.6%
Space Separator
ValueCountFrequency (%)
447
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 142
100.0%
Other Symbol
ValueCountFrequency (%)
18
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14782
76.8%
Common 4072
 
21.1%
Latin 405
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1217
 
8.2%
839
 
5.7%
548
 
3.7%
336
 
2.3%
332
 
2.2%
297
 
2.0%
263
 
1.8%
256
 
1.7%
252
 
1.7%
230
 
1.6%
Other values (457) 10212
69.1%
Latin
ValueCountFrequency (%)
C 51
12.6%
S 48
 
11.9%
D 42
 
10.4%
U 37
 
9.1%
K 28
 
6.9%
M 17
 
4.2%
G 15
 
3.7%
L 15
 
3.7%
T 14
 
3.5%
N 14
 
3.5%
Other values (25) 124
30.6%
Common
ValueCountFrequency (%)
) 1388
34.1%
( 1387
34.1%
447
 
11.0%
1 185
 
4.5%
- 142
 
3.5%
2 128
 
3.1%
0 69
 
1.7%
3 62
 
1.5%
4 54
 
1.3%
9 48
 
1.2%
Other values (13) 162
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14763
76.7%
ASCII 4476
 
23.2%
None 19
 
0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 1388
31.0%
( 1387
31.0%
447
 
10.0%
1 185
 
4.1%
- 142
 
3.2%
2 128
 
2.9%
0 69
 
1.5%
3 62
 
1.4%
4 54
 
1.2%
C 51
 
1.1%
Other values (47) 563
12.6%
Hangul
ValueCountFrequency (%)
1217
 
8.2%
839
 
5.7%
548
 
3.7%
336
 
2.3%
332
 
2.2%
297
 
2.0%
263
 
1.8%
256
 
1.7%
252
 
1.7%
230
 
1.6%
Other values (455) 10193
69.0%
None
ValueCountFrequency (%)
18
94.7%
1
 
5.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct1450
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
Minimum1977-05-21 00:00:00
Maximum2023-11-27 00:00:00
2023-12-11T06:44:39.467808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:39.624601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1986
Missing (%)100.0%
Memory size17.6 KiB

영업상태구분코드
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
1
937 
<NA>
833 
3
139 
2
 
72
4
 
5

Length

Max length4
Median length1
Mean length2.2583082
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 937
47.2%
<NA> 833
41.9%
3 139
 
7.0%
2 72
 
3.6%
4 5
 
0.3%

Length

2023-12-11T06:44:39.782072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:44:39.905187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 937
47.2%
na 833
41.9%
3 139
 
7.0%
2 72
 
3.6%
4 5
 
0.3%

영업상태명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
영업중
937 
운영중
673 
폐업
139 
폐업 등
139 
휴업
 
72
Other values (2)
 
26

Length

Max length6
Median length3
Mean length2.9818731
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 937
47.2%
운영중 673
33.9%
폐업 139
 
7.0%
폐업 등 139
 
7.0%
휴업 72
 
3.6%
휴업 등 21
 
1.1%
타시군구이관 5
 
0.3%

Length

2023-12-11T06:44:40.036755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:44:40.166579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 937
43.7%
운영중 673
31.4%
폐업 278
 
13.0%
160
 
7.5%
휴업 93
 
4.3%
타시군구이관 5
 
0.2%

폐업일자
Date

MISSING 

Distinct228
Distinct (%)81.1%
Missing1705
Missing (%)85.9%
Memory size15.6 KiB
Minimum1988-11-07 00:00:00
Maximum2023-12-01 00:00:00
2023-12-11T06:44:40.328642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:40.490817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

소재지시설전화번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1986
Missing (%)100.0%
Memory size17.6 KiB

소재지면적정보
Real number (ℝ)

MISSING 

Distinct97
Distinct (%)82.2%
Missing1868
Missing (%)94.1%
Infinite0
Infinite (%)0.0%
Mean12078.417
Minimum0
Maximum100182
Zeros9
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-11T06:44:40.670723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1871
median2847.95
Q311306
95-th percentile60421.15
Maximum100182
Range100182
Interquartile range (IQR)10435

Descriptive statistics

Standard deviation20687.465
Coefficient of variation (CV)1.7127629
Kurtosis5.6359653
Mean12078.417
Median Absolute Deviation (MAD)2400.95
Skewness2.4270779
Sum1425253.2
Variance4.279712 × 108
MonotonicityNot monotonic
2023-12-11T06:44:40.831192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 9
 
0.5%
4978.0 6
 
0.3%
11306.0 4
 
0.2%
793.0 2
 
0.1%
19258.0 2
 
0.1%
918.0 2
 
0.1%
990.0 2
 
0.1%
29536.0 2
 
0.1%
84974.0 1
 
0.1%
12572.0 1
 
0.1%
Other values (87) 87
 
4.4%
(Missing) 1868
94.1%
ValueCountFrequency (%)
0.0 9
0.5%
10.0 1
 
0.1%
74.47 1
 
0.1%
147.0 1
 
0.1%
236.0 1
 
0.1%
270.0 1
 
0.1%
331.0 1
 
0.1%
407.0 1
 
0.1%
446.0 1
 
0.1%
448.0 1
 
0.1%
ValueCountFrequency (%)
100182.0 1
0.1%
88500.0 1
0.1%
84974.0 1
0.1%
76304.0 1
0.1%
73175.0 1
0.1%
64196.0 1
0.1%
59755.0 1
0.1%
55248.0 1
0.1%
48155.0 1
0.1%
48122.0 1
0.1%

도로명우편번호
Text

MISSING 

Distinct274
Distinct (%)48.9%
Missing1426
Missing (%)71.8%
Memory size15.6 KiB
2023-12-11T06:44:41.180323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0107143
Min length5

Characters and Unicode

Total characters2806
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique184 ?
Unique (%)32.9%

Sample

1st row12441
2nd row12430
3rd row12434
4th row12441
5th row12441
ValueCountFrequency (%)
17794 38
 
6.8%
18512 33
 
5.9%
17998 33
 
5.9%
17786 20
 
3.6%
18579 12
 
2.1%
18516 7
 
1.2%
17336 7
 
1.2%
16082 6
 
1.1%
16642 6
 
1.1%
12441 6
 
1.1%
Other values (264) 392
70.0%
2023-12-11T06:44:41.770442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 737
26.3%
7 379
13.5%
8 272
 
9.7%
5 242
 
8.6%
9 231
 
8.2%
0 218
 
7.8%
4 203
 
7.2%
6 188
 
6.7%
2 187
 
6.7%
3 146
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2803
99.9%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 737
26.3%
7 379
13.5%
8 272
 
9.7%
5 242
 
8.6%
9 231
 
8.2%
0 218
 
7.8%
4 203
 
7.2%
6 188
 
6.7%
2 187
 
6.7%
3 146
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2806
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 737
26.3%
7 379
13.5%
8 272
 
9.7%
5 242
 
8.6%
9 231
 
8.2%
0 218
 
7.8%
4 203
 
7.2%
6 188
 
6.7%
2 187
 
6.7%
3 146
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2806
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 737
26.3%
7 379
13.5%
8 272
 
9.7%
5 242
 
8.6%
9 231
 
8.2%
0 218
 
7.8%
4 203
 
7.2%
6 188
 
6.7%
2 187
 
6.7%
3 146
 
5.2%
Distinct1278
Distinct (%)75.7%
Missing297
Missing (%)15.0%
Memory size15.6 KiB
2023-12-11T06:44:42.043294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length44
Mean length24.320308
Min length13

Characters and Unicode

Total characters41077
Distinct characters453
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1076 ?
Unique (%)63.7%

Sample

1st row경기도 가평군 조종면 비득재길 152
2nd row경기도 가평군 설악면 미사리로 156-22
3rd row경기도 가평군 설악면 미사리로 324-275
4th row경기도 가평군 상면 조종로 1643
5th row경기도 가평군 상면 둔덕말길 232
ValueCountFrequency (%)
경기도 1688
 
18.4%
화성시 268
 
2.9%
평택시 217
 
2.4%
용인시 128
 
1.4%
안산시 121
 
1.3%
단원구 111
 
1.2%
안성시 99
 
1.1%
처인구 77
 
0.8%
이천시 70
 
0.8%
포천시 65
 
0.7%
Other values (2109) 6331
69.0%
2023-12-11T06:44:42.428455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7551
 
18.4%
1850
 
4.5%
1795
 
4.4%
1768
 
4.3%
1767
 
4.3%
1457
 
3.5%
1 1252
 
3.0%
2 951
 
2.3%
849
 
2.1%
) 734
 
1.8%
Other values (443) 21103
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25095
61.1%
Space Separator 7551
 
18.4%
Decimal Number 6280
 
15.3%
Close Punctuation 735
 
1.8%
Open Punctuation 734
 
1.8%
Dash Punctuation 356
 
0.9%
Other Punctuation 271
 
0.7%
Uppercase Letter 54
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1850
 
7.4%
1795
 
7.2%
1768
 
7.0%
1767
 
7.0%
1457
 
5.8%
849
 
3.4%
641
 
2.6%
608
 
2.4%
542
 
2.2%
501
 
2.0%
Other values (406) 13317
53.1%
Uppercase Letter
ValueCountFrequency (%)
L 9
16.7%
D 6
11.1%
C 5
9.3%
B 5
9.3%
T 4
7.4%
P 4
7.4%
S 3
 
5.6%
R 3
 
5.6%
K 3
 
5.6%
H 3
 
5.6%
Other values (5) 9
16.7%
Decimal Number
ValueCountFrequency (%)
1 1252
19.9%
2 951
15.1%
3 656
10.4%
5 648
10.3%
4 516
8.2%
7 482
 
7.7%
8 475
 
7.6%
9 463
 
7.4%
6 420
 
6.7%
0 417
 
6.6%
Other Punctuation
ValueCountFrequency (%)
, 258
95.2%
. 7
 
2.6%
/ 4
 
1.5%
* 1
 
0.4%
& 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 734
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 733
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
7551
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 356
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25095
61.1%
Common 15927
38.8%
Latin 55
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1850
 
7.4%
1795
 
7.2%
1768
 
7.0%
1767
 
7.0%
1457
 
5.8%
849
 
3.4%
641
 
2.6%
608
 
2.4%
542
 
2.2%
501
 
2.0%
Other values (406) 13317
53.1%
Common
ValueCountFrequency (%)
7551
47.4%
1 1252
 
7.9%
2 951
 
6.0%
) 734
 
4.6%
( 733
 
4.6%
3 656
 
4.1%
5 648
 
4.1%
4 516
 
3.2%
7 482
 
3.0%
8 475
 
3.0%
Other values (11) 1929
 
12.1%
Latin
ValueCountFrequency (%)
L 9
16.4%
D 6
10.9%
C 5
9.1%
B 5
9.1%
T 4
 
7.3%
P 4
 
7.3%
S 3
 
5.5%
R 3
 
5.5%
K 3
 
5.5%
H 3
 
5.5%
Other values (6) 10
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25095
61.1%
ASCII 15982
38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7551
47.2%
1 1252
 
7.8%
2 951
 
6.0%
) 734
 
4.6%
( 733
 
4.6%
3 656
 
4.1%
5 648
 
4.1%
4 516
 
3.2%
7 482
 
3.0%
8 475
 
3.0%
Other values (27) 1984
 
12.4%
Hangul
ValueCountFrequency (%)
1850
 
7.4%
1795
 
7.2%
1768
 
7.0%
1767
 
7.0%
1457
 
5.8%
849
 
3.4%
641
 
2.6%
608
 
2.4%
542
 
2.2%
501
 
2.0%
Other values (406) 13317
53.1%
Distinct1603
Distinct (%)80.8%
Missing3
Missing (%)0.2%
Memory size15.6 KiB
2023-12-11T06:44:42.712162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length54
Mean length23.437216
Min length12

Characters and Unicode

Total characters46476
Distinct characters428
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1419 ?
Unique (%)71.6%

Sample

1st row경기도 가평군 조종면 현리 464-1 외 464-2, 465, 465-8, 산72-7
2nd row경기도 가평군 설악면 송산리 675-1
3rd row경기도 가평군 설악면 송산리 산 99
4th row경기도 가평군 상면 율길리 295 ,295-4, 295-6, 295-8
5th row경기도 가평군 상면 상동리 산 52
ValueCountFrequency (%)
경기도 1982
 
19.2%
화성시 305
 
3.0%
평택시 239
 
2.3%
용인시 149
 
1.4%
안산시 126
 
1.2%
안성시 116
 
1.1%
단원구 114
 
1.1%
이천시 112
 
1.1%
김포시 99
 
1.0%
처인구 92
 
0.9%
Other values (2655) 6998
67.7%
2023-12-11T06:44:43.199367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9486
20.4%
2102
 
4.5%
2058
 
4.4%
2056
 
4.4%
1992
 
4.3%
1 1726
 
3.7%
- 1359
 
2.9%
1129
 
2.4%
1101
 
2.4%
2 1083
 
2.3%
Other values (418) 22384
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27167
58.5%
Space Separator 9486
 
20.4%
Decimal Number 8025
 
17.3%
Dash Punctuation 1359
 
2.9%
Other Punctuation 121
 
0.3%
Close Punctuation 114
 
0.2%
Open Punctuation 114
 
0.2%
Uppercase Letter 89
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2102
 
7.7%
2058
 
7.6%
2056
 
7.6%
1992
 
7.3%
1129
 
4.2%
1101
 
4.1%
1073
 
3.9%
861
 
3.2%
684
 
2.5%
656
 
2.4%
Other values (381) 13455
49.5%
Uppercase Letter
ValueCountFrequency (%)
B 15
16.9%
C 11
12.4%
L 10
11.2%
D 7
7.9%
P 7
7.9%
S 6
 
6.7%
K 5
 
5.6%
T 4
 
4.5%
R 4
 
4.5%
E 4
 
4.5%
Other values (7) 16
18.0%
Decimal Number
ValueCountFrequency (%)
1 1726
21.5%
2 1083
13.5%
3 875
10.9%
5 728
9.1%
6 707
8.8%
4 687
 
8.6%
7 594
 
7.4%
0 570
 
7.1%
9 538
 
6.7%
8 517
 
6.4%
Other Punctuation
ValueCountFrequency (%)
, 102
84.3%
/ 12
 
9.9%
& 4
 
3.3%
. 2
 
1.7%
: 1
 
0.8%
Space Separator
ValueCountFrequency (%)
9486
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1359
100.0%
Close Punctuation
ValueCountFrequency (%)
) 114
100.0%
Open Punctuation
ValueCountFrequency (%)
( 114
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27167
58.5%
Common 19219
41.4%
Latin 90
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2102
 
7.7%
2058
 
7.6%
2056
 
7.6%
1992
 
7.3%
1129
 
4.2%
1101
 
4.1%
1073
 
3.9%
861
 
3.2%
684
 
2.5%
656
 
2.4%
Other values (381) 13455
49.5%
Common
ValueCountFrequency (%)
9486
49.4%
1 1726
 
9.0%
- 1359
 
7.1%
2 1083
 
5.6%
3 875
 
4.6%
5 728
 
3.8%
6 707
 
3.7%
4 687
 
3.6%
7 594
 
3.1%
0 570
 
3.0%
Other values (9) 1404
 
7.3%
Latin
ValueCountFrequency (%)
B 15
16.7%
C 11
12.2%
L 10
11.1%
D 7
7.8%
P 7
7.8%
S 6
 
6.7%
K 5
 
5.6%
T 4
 
4.4%
R 4
 
4.4%
E 4
 
4.4%
Other values (8) 17
18.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27167
58.5%
ASCII 19309
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9486
49.1%
1 1726
 
8.9%
- 1359
 
7.0%
2 1083
 
5.6%
3 875
 
4.5%
5 728
 
3.8%
6 707
 
3.7%
4 687
 
3.6%
7 594
 
3.1%
0 570
 
3.0%
Other values (27) 1494
 
7.7%
Hangul
ValueCountFrequency (%)
2102
 
7.7%
2058
 
7.6%
2056
 
7.6%
1992
 
7.3%
1129
 
4.2%
1101
 
4.1%
1073
 
3.9%
861
 
3.2%
684
 
2.5%
656
 
2.4%
Other values (381) 13455
49.5%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct835
Distinct (%)44.1%
Missing92
Missing (%)4.6%
Infinite0
Infinite (%)0.0%
Mean15389.001
Minimum6133
Maximum18635
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-11T06:44:43.359084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6133
5-th percentile10311.15
Q112737.25
median16622
Q317786
95-th percentile18556.7
Maximum18635
Range12502
Interquartile range (IQR)5048.75

Descriptive statistics

Standard deviation2821.631
Coefficient of variation (CV)0.18335376
Kurtosis-1.0522945
Mean15389.001
Median Absolute Deviation (MAD)1742
Skewness-0.5879384
Sum29146767
Variance7961601.7
MonotonicityNot monotonic
2023-12-11T06:44:43.521918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17794 43
 
2.2%
17998 34
 
1.7%
18512 33
 
1.7%
17786 20
 
1.0%
18579 16
 
0.8%
17336 14
 
0.7%
15014 14
 
0.7%
16678 13
 
0.7%
14450 13
 
0.7%
18545 12
 
0.6%
Other values (825) 1682
84.7%
(Missing) 92
 
4.6%
ValueCountFrequency (%)
6133 2
0.1%
10003 4
0.2%
10005 2
0.1%
10007 1
 
0.1%
10008 2
0.1%
10009 1
 
0.1%
10013 1
 
0.1%
10016 3
0.2%
10018 1
 
0.1%
10020 2
0.1%
ValueCountFrequency (%)
18635 6
0.3%
18633 1
 
0.1%
18631 4
0.2%
18630 2
 
0.1%
18628 1
 
0.1%
18627 2
 
0.1%
18626 4
0.2%
18624 8
0.4%
18623 5
0.3%
18622 8
0.4%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1312
Distinct (%)69.7%
Missing104
Missing (%)5.2%
Infinite0
Infinite (%)0.0%
Mean37.333365
Minimum36.926192
Maximum38.122387
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-11T06:44:43.682618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.926192
5-th percentile36.980634
Q137.141458
median37.285544
Q337.455415
95-th percentile37.837781
Maximum38.122387
Range1.1961954
Interquartile range (IQR)0.31395662

Descriptive statistics

Standard deviation0.26187052
Coefficient of variation (CV)0.0070143831
Kurtosis-0.27134494
Mean37.333365
Median Absolute Deviation (MAD)0.15075687
Skewness0.73993225
Sum70261.393
Variance0.068576171
MonotonicityNot monotonic
2023-12-11T06:44:43.827127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.0548419187 37
 
1.9%
37.1649284099 33
 
1.7%
36.9523591039 31
 
1.6%
37.523254288 13
 
0.7%
37.6678691376 12
 
0.6%
37.3496542905 12
 
0.6%
37.7277062927 12
 
0.6%
37.035618718 11
 
0.6%
37.2529797883 10
 
0.5%
37.2623766897 8
 
0.4%
Other values (1302) 1703
85.8%
(Missing) 104
 
5.2%
ValueCountFrequency (%)
36.9261919049 1
0.1%
36.9322246938 2
0.1%
36.9335597905 1
0.1%
36.9361479463 1
0.1%
36.9369480574 1
0.1%
36.9434020033 1
0.1%
36.9438404736 1
0.1%
36.9494728556 1
0.1%
36.9520588289 1
0.1%
36.9522315921 1
0.1%
ValueCountFrequency (%)
38.122387343 1
 
0.1%
38.1118451261 1
 
0.1%
38.0479939082 1
 
0.1%
38.0449191732 1
 
0.1%
38.0172838914 4
0.2%
38.0159781217 1
 
0.1%
38.0128745985 2
0.1%
38.0061994464 1
 
0.1%
38.0057169564 1
 
0.1%
37.998179028 2
0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1312
Distinct (%)69.7%
Missing104
Missing (%)5.2%
Infinite0
Infinite (%)0.0%
Mean127.03524
Minimum126.53915
Maximum127.73068
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-11T06:44:44.001342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.53915
5-th percentile126.70253
Q1126.84512
median127.01226
Q3127.19464
95-th percentile127.49439
Maximum127.73068
Range1.1915336
Interquartile range (IQR)0.34951538

Descriptive statistics

Standard deviation0.24243236
Coefficient of variation (CV)0.0019083868
Kurtosis-0.30614368
Mean127.03524
Median Absolute Deviation (MAD)0.1723228
Skewness0.42737102
Sum239080.32
Variance0.058773451
MonotonicityNot monotonic
2023-12-11T06:44:44.144031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9320373155 37
 
1.9%
127.0122647357 33
 
1.7%
127.0769712801 31
 
1.6%
126.7693005811 13
 
0.7%
126.9230328971 12
 
0.6%
126.7025314364 12
 
0.6%
126.6148612552 12
 
0.6%
127.0531670514 11
 
0.6%
127.4806834787 10
 
0.5%
127.6438940339 8
 
0.4%
Other values (1302) 1703
85.8%
(Missing) 104
 
5.2%
ValueCountFrequency (%)
126.539145711 2
0.1%
126.540004757 1
0.1%
126.5522087631 1
0.1%
126.5534635533 1
0.1%
126.5535052305 2
0.1%
126.5593839774 1
0.1%
126.5601526416 1
0.1%
126.5610030469 2
0.1%
126.5645151591 1
0.1%
126.5656883917 1
0.1%
ValueCountFrequency (%)
127.7306793433 1
 
0.1%
127.7040936165 1
 
0.1%
127.7031262334 1
 
0.1%
127.6918987923 1
 
0.1%
127.6837697378 1
 
0.1%
127.6648029544 1
 
0.1%
127.6567315694 1
 
0.1%
127.6560224498 4
0.2%
127.64931189 2
0.1%
127.6479223552 1
 
0.1%
Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
제조
917 
<NA>
833 
저장소
149 
판매
 
87

Length

Max length4
Median length2
Mean length2.9138973
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제조
2nd row저장소
3rd row제조
4th row판매
5th row제조

Common Values

ValueCountFrequency (%)
제조 917
46.2%
<NA> 833
41.9%
저장소 149
 
7.5%
판매 87
 
4.4%

Length

2023-12-11T06:44:44.305382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:44:44.422324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조 917
46.2%
na 833
41.9%
저장소 149
 
7.5%
판매 87
 
4.4%

X좌표값
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct669
Distinct (%)63.9%
Missing939
Missing (%)47.3%
Infinite0
Infinite (%)0.0%
Mean202429.3
Minimum160553.67
Maximum261396.72
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-11T06:44:44.556774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum160553.67
5-th percentile173821.66
Q1186594.04
median201064.87
Q3213442.16
95-th percentile243104.08
Maximum261396.72
Range100843.06
Interquartile range (IQR)26848.126

Descriptive statistics

Standard deviation20426.391
Coefficient of variation (CV)0.10090629
Kurtosis-0.11867181
Mean202429.3
Median Absolute Deviation (MAD)13812.839
Skewness0.50032414
Sum2.1194348 × 108
Variance4.1723744 × 108
MonotonicityNot monotonic
2023-12-11T06:44:44.742876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
193886.930285785 37
 
1.9%
201064.868606751 33
 
1.7%
206711.085897129 30
 
1.5%
205276.542163998 14
 
0.7%
193149.241332339 12
 
0.6%
165990.906082822 12
 
0.6%
243104.079290009 10
 
0.5%
183629.624815046 7
 
0.4%
204522.879651792 7
 
0.4%
186156.338207414 7
 
0.4%
Other values (659) 878
44.2%
(Missing) 939
47.3%
ValueCountFrequency (%)
160553.665204455 1
0.1%
160629.096172044 1
0.1%
161086.572348385 1
0.1%
161144.458710812 1
0.1%
161599.603128501 1
0.1%
162637.162976021 1
0.1%
162652.205355125 1
0.1%
163001.732152244 1
0.1%
163290.542222638 2
0.1%
163735.640764745 1
0.1%
ValueCountFrequency (%)
261396.72187117 1
 
0.1%
258144.22893945 3
0.2%
257510.524735758 1
 
0.1%
257018.258976503 6
0.3%
255404.407117064 1
 
0.1%
254635.480051079 1
 
0.1%
253513.667693509 1
 
0.1%
252288.514752201 1
 
0.1%
251743.990723393 1
 
0.1%
251281.164110577 1
 
0.1%

Y좌표값
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct669
Distinct (%)63.9%
Missing939
Missing (%)47.3%
Infinite0
Infinite (%)0.0%
Mean421263.51
Minimum380529.63
Maximum513323.95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.6 KiB
2023-12-11T06:44:44.878495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum380529.63
5-th percentile384337.01
Q1399297.14
median416736.84
Q3432633.97
95-th percentile478868.08
Maximum513323.95
Range132794.32
Interquartile range (IQR)33336.826

Descriptive statistics

Standard deviation28046.95
Coefficient of variation (CV)0.066578161
Kurtosis0.034954193
Mean421263.51
Median Absolute Deviation (MAD)16376.857
Skewness0.84928993
Sum4.4106289 × 108
Variance7.8663139 × 108
MonotonicityNot monotonic
2023-12-11T06:44:45.048002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
394793.401043022 37
 
1.9%
407039.865534952 33
 
1.7%
383491.474001352 30
 
1.5%
392561.65560657 14
 
0.7%
462831.427474751 12
 
0.6%
469554.379065048 12
 
0.6%
416797.645145138 10
 
0.5%
405905.580491458 7
 
0.4%
391620.467576035 7
 
0.4%
400395.922088682 7
 
0.4%
Other values (659) 878
44.2%
(Missing) 939
47.3%
ValueCountFrequency (%)
380529.629191611 1
 
0.1%
381220.438504532 2
 
0.1%
383109.60756648 1
 
0.1%
383423.789506698 1
 
0.1%
383476.658593602 1
 
0.1%
383491.474001352 30
1.5%
383615.426390938 1
 
0.1%
383650.447760523 2
 
0.1%
383682.691253928 1
 
0.1%
383800.0610058 3
 
0.2%
ValueCountFrequency (%)
513323.949530588 1
0.1%
501646.544748831 2
0.1%
501475.963463682 1
0.1%
501230.930506746 1
0.1%
500387.82323695 1
0.1%
499531.812911478 1
0.1%
498650.38308043 2
0.1%
498599.881590121 1
0.1%
498295.370913404 1
0.1%
497749.212972921 1
0.1%

제조구분명
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
냉동
747 
일반
554 
<NA>
539 
충전
144 
특정
 
2

Length

Max length4
Median length2
Mean length2.5427996
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충전
2nd row<NA>
3rd row냉동
4th row<NA>
5th row냉동

Common Values

ValueCountFrequency (%)
냉동 747
37.6%
일반 554
27.9%
<NA> 539
27.1%
충전 144
 
7.3%
특정 2
 
0.1%

Length

2023-12-11T06:44:45.207246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:44:45.363908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
냉동 747
37.6%
일반 554
27.9%
na 539
27.1%
충전 144
 
7.3%
특정 2
 
0.1%

사업장부지용도구분명
Categorical

IMBALANCE 

Distinct29
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size15.6 KiB
<NA>
1544 
공업용
172 
기타
 
50
지정되지않음
 
33
주상나지
 
27
Other values (24)
160 

Length

Max length6
Median length4
Mean length3.8519637
Min length1

Unique

Unique5 ?
Unique (%)0.3%

Sample

1st row
2nd row임야
3rd row기타
4th row지정되지않음
5th row골프장

Common Values

ValueCountFrequency (%)
<NA> 1544
77.7%
공업용 172
 
8.7%
기타 50
 
2.5%
지정되지않음 33
 
1.7%
주상나지 27
 
1.4%
공업기타 21
 
1.1%
상업용 16
 
0.8%
위험시설 15
 
0.8%
특수토지 14
 
0.7%
13
 
0.7%
Other values (19) 81
 
4.1%

Length

2023-12-11T06:44:45.722549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 1544
77.7%
공업용 172
 
8.7%
기타 50
 
2.5%
지정되지않음 33
 
1.7%
주상나지 27
 
1.4%
공업기타 21
 
1.1%
상업용 16
 
0.8%
위험시설 15
 
0.8%
특수토지 14
 
0.7%
13
 
0.7%
Other values (19) 81
 
4.1%

Interactions

2023-12-11T06:44:36.916692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:33.841869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.414801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.000812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.593305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.216225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.994776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:33.923298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.493952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.092030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.697002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.347840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:37.099456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.024462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.595622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.200668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.798578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.462094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:37.186420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.116620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.700230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.289720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.884205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.578279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:37.297942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.199664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.803555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.402163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.969133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.697156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:37.391474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.315971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:34.903913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:35.496562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.074590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:44:36.798509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:44:45.814667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태구분코드영업상태명소재지면적정보소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값제조구분명사업장부지용도구분명
시군명1.0000.5270.5060.0000.9950.9430.9210.5260.9300.9520.3720.747
영업상태구분코드0.5271.0001.0000.0000.2550.2460.1690.0600.1930.2480.1290.093
영업상태명0.5061.0001.0000.0000.3780.2310.1860.0600.1930.2480.5590.093
소재지면적정보0.0000.0000.0001.0000.1070.0000.4890.3570.4240.0000.4390.000
소재지우편번호0.9950.2550.3780.1071.0000.8300.6770.2330.6740.8410.1620.710
WGS84위도0.9430.2460.2310.0000.8301.0000.6530.3710.6871.0000.1980.462
WGS84경도0.9210.1690.1860.4890.6770.6531.0000.3830.9960.6970.2480.590
업태구분명정보0.5260.0600.0600.3570.2330.3710.3831.0000.3530.3670.3990.588
X좌표값0.9300.1930.1930.4240.6740.6870.9960.3531.0000.6890.1520.598
Y좌표값0.9520.2480.2480.0000.8411.0000.6970.3670.6891.0000.1890.472
제조구분명0.3720.1290.5590.4390.1620.1980.2480.3990.1520.1891.0000.385
사업장부지용도구분명0.7470.0930.0930.0000.7100.4620.5900.5880.5980.4720.3851.000
2023-12-11T06:44:45.961926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제조구분명영업상태명업태구분명정보시군명사업장부지용도구분명영업상태구분코드
제조구분명1.0000.4200.3900.1990.2060.051
영업상태명0.4201.0000.0560.2360.0421.000
업태구분명정보0.3900.0561.0000.3100.3580.056
시군명0.1990.2360.3101.0000.2600.296
사업장부지용도구분명0.2060.0420.3580.2601.0000.042
영업상태구분코드0.0511.0000.0560.2960.0421.000
2023-12-11T06:44:46.077291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지면적정보소재지우편번호WGS84위도WGS84경도X좌표값Y좌표값시군명영업상태구분코드영업상태명업태구분명정보제조구분명사업장부지용도구분명
소재지면적정보1.0000.232-0.3300.0690.068-0.3330.0000.0000.0000.2210.2760.000
소재지우편번호0.2321.000-0.8850.0360.049-0.8490.9640.1980.1530.1580.1100.345
WGS84위도-0.330-0.8851.000-0.153-0.2011.0000.7090.1490.1190.2390.1190.180
WGS84경도0.0690.036-0.1531.0001.000-0.1980.6450.1010.0950.2480.1500.249
X좌표값0.0680.049-0.2011.0001.000-0.1980.6670.1140.1140.2320.0910.254
Y좌표값-0.333-0.8491.000-0.198-0.1981.0000.7350.1520.1520.2370.1140.184
시군명0.0000.9640.7090.6450.6670.7351.0000.2960.2360.3100.1990.260
영업상태구분코드0.0000.1980.1490.1010.1140.1520.2961.0001.0000.0560.0510.042
영업상태명0.0000.1530.1190.0950.1140.1520.2361.0001.0000.0560.4200.042
업태구분명정보0.2210.1580.2390.2480.2320.2370.3100.0560.0561.0000.3900.358
제조구분명0.2760.1100.1190.1500.0910.1140.1990.0510.4200.3901.0000.206
사업장부지용도구분명0.0000.3450.1800.2490.2540.1840.2600.0420.0420.3580.2061.000

Missing values

2023-12-11T06:44:37.587469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:44:37.842869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:44:38.108915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값제조구분명사업장부지용도구분명
0가평군대성의료종합가스(주) 현리공장2020-07-17<NA>1영업중<NA><NA>3884.0<NA>경기도 가평군 조종면 비득재길 152경기도 가평군 조종면 현리 464-1 외 464-2, 465, 465-8, 산72-71243837.810108127.343178제조230132.619882478672.460845충전
1가평군팜스코(주)2012-09-05<NA>1영업중<NA><NA>4156.0<NA>경기도 가평군 설악면 미사리로 156-22경기도 가평군 설악면 송산리 675-11246137.685414127.510025저장소244886.745355464897.556195<NA>임야
2가평군세계평화통일가정연합2006-06-21<NA>1영업중<NA><NA><NA><NA>경기도 가평군 설악면 미사리로 324-275경기도 가평군 설악면 송산리 산 991246137.68381127.547021제조248299.148984464700.249777냉동기타
3가평군SK종합가스2020-02-24<NA>1영업중<NA><NA>4978.012441경기도 가평군 상면 조종로 1643경기도 가평군 상면 율길리 295 ,295-4, 295-6, 295-81244137.827945127.295948판매225999.837472480646.668284<NA>지정되지않음
4가평군삼성물산(주) 가평베네스트골프클럽2006-04-13<NA>1영업중<NA><NA><NA><NA>경기도 가평군 상면 둔덕말길 232경기도 가평군 상면 상동리 산 521244337.797145127.305438제조226656.532694477303.071588냉동골프장
5가평군가평소방서(수난구조대)2023-10-13<NA>1영업중<NA><NA><NA>12430경기도 가평군 가평읍 호반로 1580, 가평소방서수난구조대경기도 가평군 가평읍 복장리 381-2 가평소방서수난구조대1243037.737042127.50799제조244703.914838470621.495295충전<NA>
6가평군디에스프리져(D. S Freezer)2009-06-18<NA>1영업중<NA><NA>635.012434경기도 가평군 조종면 세곡로 230-1경기도 가평군 조종면 신상리 7491243437.837604127.328254판매228819.469131481718.783274일반하천등
7가평군샘가스코리아 유한회사2020-02-24<NA>1영업중<NA><NA>4978.012441경기도 가평군 상면 조종로 1643경기도 가평군 상면 율길리 295 ,295-4, 295-6, 295-81244137.827945127.295948판매225999.837472480646.668284<NA>지정되지않음
8가평군(주)청평가스마트2009-05-18<NA>1영업중<NA><NA><NA><NA>경기도 가평군 청평면 경춘로 1091-2경기도 가평군 청평면 하천리 328-41245037.757954127.435596판매238304.644269472919.288233일반다세대
9가평군설악가스20050218<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 설악면 신천중앙로28번길 46경기도 가평군 설악면 신천리 326-2번지1246737.667982127.4895<NA><NA><NA>일반<NA>
시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값제조구분명사업장부지용도구분명
1976화성시선기종합가스20111013<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 팔탄면 시청로 1084경기도 화성시 팔탄면 기천리 962번지1852637.172556126.905959<NA><NA><NA><NA><NA>
1977화성시한일에너지20111202<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 우정읍 남양만로 607경기도 화성시 우정읍 이화리 965-10번지1857337.038434126.801578<NA><NA><NA><NA><NA>
1978화성시에스엘지(주)화성사업소20120503<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 송산면 송산포도로 357-7경기도 화성시 송산면 중송리 101-3번지 101-51854537.220483126.711378<NA><NA><NA><NA><NA>
1979화성시국제특수종합가스20041116<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 남양읍 주석로 262경기도 화성시 북양동 548번지1825537.20988126.857116<NA><NA><NA>일반<NA>
1980화성시(주)남양에너텍20111111<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 팔탄면 서해로 1334-6경기도 화성시 송산면 중송리 101-2번지1852537.179566126.884196<NA><NA><NA><NA><NA>
1981화성시한일에너지20071010<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 우정읍 남양만로 607경기도 화성시 우정읍 이화리 965-10번지1857337.038434126.801578<NA><NA><NA><NA><NA>
1982화성시에코보보스(주)-판매20150821<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 남양읍 남양로 239경기도 화성시 남양읍 1065-5번지1828137.163766126.80777<NA><NA><NA><NA><NA>
1983화성시에코보보스(주)20091006<NA><NA>휴업 등<NA><NA><NA><NA>경기도 화성시 장안면 3.1만세로 558경기도 화성시 장안면 수촌리 425-8번지1858137.102369126.86372<NA><NA><NA><NA><NA>
1984<NA>(주)하나자산신탁2023-10-23<NA>2휴업<NA><NA><NA>06133서울특별시 강남구 테헤란로 127, 하나금융그룹 강남사옥 (역삼동)서울특별시 강남구 역삼동 648-19 하나금융그룹 강남사옥613337.499701127.032043제조202775.815444173.79냉동<NA>
1985<NA>㈜하나자산신탁2023-10-23<NA>2휴업<NA><NA><NA>06133서울특별시 강남구 테헤란로 127, 하나금융그룹 강남사옥 (역삼동)서울특별시 강남구 역삼동 648-19 하나금융그룹 강남사옥613337.499701127.032043제조202775.815444173.79냉동<NA>

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태구분코드영업상태명폐업일자소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값제조구분명사업장부지용도구분명# duplicates
1고양시농협고양농산물종합유통센터2012-12-103폐업2015-03-23<NA><NA>경기도 고양시 일산서구 대화로 362 (대화동)경기도 고양시 일산서구 대화동 2324번지1022637.683592126.746341제조177648.225192464571.275236냉동<NA>4
12화성시브릭화성물류센터(주)2023-10-263폐업2023-11-07<NA><NA><NA>경기도 화성시 양감면 송산리 산 83-1<NA>37.104955126.982711제조<NA><NA>냉동<NA>4
7이천시(주)맥서브2021-08-053폐업2023-07-18<NA><NA><NA>경기도 이천시 백사면 도립리 512-11730937.339762127.468184제조<NA><NA>냉동<NA>3
0고양시(주)원마운트2013-03-121영업중<NA><NA>10392경기도 고양시 일산서구 한류월드로 300, 고양 원마운트 (대화동)경기도 고양시 일산서구 대화동 2606 고양 원마운트1039237.664554126.754527제조178268.135569462480.734363냉동<NA>2
2김포시케이비부동산신탁(주)2023-11-241영업중<NA><NA><NA><NA>경기도 김포시 양촌읍 학운리 38851004937.610547126.601812제조164780.289143456553.760543냉동<NA>2
3시흥시버슘머트리얼즈코리아(주)2018-09-133폐업2023-11-14<NA><NA>경기도 시흥시 협력로 143경기도 시흥시 정왕동 1238-3 에어프로덕츠코리아(주)1509437.341889126.70582제조173822.71312426758.591696냉동<NA>2
4용인시(주)맥서브2022-09-051영업중<NA><NA><NA>경기도 용인시 처인구 남사읍 형제로 26경기도 용인시 처인구 남사읍 북리 45-11711837.144116127.159693제조214119.897168404726.158193냉동<NA>2
5이천시(주)대신자산신탁2023-11-101영업중<NA><NA><NA><NA>경기도 이천시 마장면 회억리 산 104-5 외 20필지<NA><NA><NA>제조<NA><NA>냉동<NA>2
6이천시(주)맥서브2021-07-303폐업2023-07-18<NA><NA><NA>경기도 이천시 백사면 도립리 512-11730937.339762127.468184제조<NA><NA>냉동<NA>2
8이천시(주)맥서브2022-10-111영업중<NA><NA><NA><NA>경기도 이천시 대월면 군량리 622-1 외 13필지1740237.18176127.494544제조243802.015162408993.382535냉동<NA>2