Overview

Dataset statistics

Number of variables19
Number of observations5568
Missing cells27146
Missing cells (%)25.7%
Duplicate rows20
Duplicate rows (%)0.4%
Total size in memory870.1 KiB
Average record size in memory160.0 B

Variable types

Categorical4
Text5
DateTime2
Numeric6
Unsupported2

Alerts

Dataset has 20 (0.4%) duplicate rowsDuplicates
영업상태명 is highly overall correlated with 영업상태구분코드High correlation
영업상태구분코드 is highly overall correlated with 영업상태명High correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with X좌표값 and 1 other fieldsHigh correlation
X좌표값 is highly overall correlated with WGS84경도 and 1 other fieldsHigh correlation
Y좌표값 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 4 other fieldsHigh correlation
업태구분명정보 is highly imbalanced (58.3%)Imbalance
인허가취소일자 has 5410 (97.2%) missing valuesMissing
소재지시설전화번호 has 830 (14.9%) missing valuesMissing
소재지면적정보 has 1894 (34.0%) missing valuesMissing
도로명우편번호 has 4950 (88.9%) missing valuesMissing
소재지도로명주소 has 501 (9.0%) missing valuesMissing
소재지우편번호 has 414 (7.4%) missing valuesMissing
WGS84위도 has 245 (4.4%) missing valuesMissing
WGS84경도 has 245 (4.4%) missing valuesMissing
X좌표값 has 757 (13.6%) missing valuesMissing
Y좌표값 has 757 (13.6%) missing valuesMissing
자본금 has 5568 (100.0%) missing valuesMissing
거래처 has 5568 (100.0%) missing valuesMissing
소재지면적정보 is highly skewed (γ1 = 60.61352837)Skewed
자본금 is an unsupported type, check if it needs cleaning or further analysisUnsupported
거래처 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 22:25:17.810612
Analysis finished2023-12-10 22:25:23.784261
Duration5.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct32
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size43.6 KiB
화성시
473 
평택시
 
374
용인시
 
367
양주시
 
314
고양시
 
293
Other values (27)
3747 

Length

Max length4
Median length3
Mean length3.0915948
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
화성시 473
 
8.5%
평택시 374
 
6.7%
용인시 367
 
6.6%
양주시 314
 
5.6%
고양시 293
 
5.3%
수원시 288
 
5.2%
남양주시 281
 
5.0%
부천시 266
 
4.8%
파주시 232
 
4.2%
광주시 232
 
4.2%
Other values (22) 2448
44.0%

Length

2023-12-11T07:25:23.851332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화성시 473
 
8.5%
평택시 374
 
6.7%
용인시 367
 
6.6%
양주시 314
 
5.6%
고양시 293
 
5.3%
수원시 288
 
5.2%
남양주시 281
 
5.0%
부천시 266
 
4.8%
파주시 232
 
4.2%
광주시 232
 
4.2%
Other values (22) 2448
44.0%
Distinct4415
Distinct (%)79.3%
Missing0
Missing (%)0.0%
Memory size43.6 KiB
2023-12-11T07:25:24.097949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length7.6470905
Min length2

Characters and Unicode

Total characters42579
Distinct characters649
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3774 ?
Unique (%)67.8%

Sample

1st row경기주유소
2nd row유명산주유소
3rd row대성주유소
4th row스카이주유소
5th row고동산주유소
ValueCountFrequency (%)
주유소 118
 
1.7%
주식회사 84
 
1.2%
에이치디현대오일뱅크(주)직영 67
 
1.0%
직영 67
 
1.0%
에이치디현대오일뱅크(주 33
 
0.5%
지에스칼텍스(주 32
 
0.5%
구도일주유소 22
 
0.3%
sk에너지(주 22
 
0.3%
kh에너지(주)직영 18
 
0.3%
현대석유 17
 
0.3%
Other values (4598) 6295
92.9%
2023-12-11T07:25:24.484398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5055
 
11.9%
4737
 
11.1%
3695
 
8.7%
) 1361
 
3.2%
( 1360
 
3.2%
1210
 
2.8%
1130
 
2.7%
1111
 
2.6%
974
 
2.3%
821
 
1.9%
Other values (639) 21125
49.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37451
88.0%
Close Punctuation 1363
 
3.2%
Open Punctuation 1362
 
3.2%
Space Separator 1210
 
2.8%
Uppercase Letter 802
 
1.9%
Decimal Number 187
 
0.4%
Lowercase Letter 149
 
0.3%
Other Punctuation 21
 
< 0.1%
Other Symbol 19
 
< 0.1%
Dash Punctuation 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5055
 
13.5%
4737
 
12.6%
3695
 
9.9%
1130
 
3.0%
1111
 
3.0%
974
 
2.6%
821
 
2.2%
678
 
1.8%
587
 
1.6%
502
 
1.3%
Other values (577) 18161
48.5%
Uppercase Letter
ValueCountFrequency (%)
K 211
26.3%
S 210
26.2%
C 76
 
9.5%
I 71
 
8.9%
G 56
 
7.0%
H 34
 
4.2%
L 33
 
4.1%
N 15
 
1.9%
E 14
 
1.7%
O 14
 
1.7%
Other values (15) 68
 
8.5%
Lowercase Letter
ValueCountFrequency (%)
s 33
22.1%
e 29
19.5%
l 27
18.1%
f 25
16.8%
k 16
10.7%
c 4
 
2.7%
i 4
 
2.7%
t 3
 
2.0%
a 2
 
1.3%
x 1
 
0.7%
Other values (5) 5
 
3.4%
Decimal Number
ValueCountFrequency (%)
2 61
32.6%
1 45
24.1%
3 13
 
7.0%
4 13
 
7.0%
5 12
 
6.4%
9 11
 
5.9%
0 11
 
5.9%
8 10
 
5.3%
7 7
 
3.7%
6 4
 
2.1%
Other Punctuation
ValueCountFrequency (%)
. 7
33.3%
& 6
28.6%
, 4
19.0%
· 3
14.3%
/ 1
 
4.8%
Close Punctuation
ValueCountFrequency (%)
) 1361
99.9%
] 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1360
99.9%
[ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
1210
100.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37470
88.0%
Common 4158
 
9.8%
Latin 951
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5055
 
13.5%
4737
 
12.6%
3695
 
9.9%
1130
 
3.0%
1111
 
3.0%
974
 
2.6%
821
 
2.2%
678
 
1.8%
587
 
1.6%
502
 
1.3%
Other values (578) 18180
48.5%
Latin
ValueCountFrequency (%)
K 211
22.2%
S 210
22.1%
C 76
 
8.0%
I 71
 
7.5%
G 56
 
5.9%
H 34
 
3.6%
s 33
 
3.5%
L 33
 
3.5%
e 29
 
3.0%
l 27
 
2.8%
Other values (30) 171
18.0%
Common
ValueCountFrequency (%)
) 1361
32.7%
( 1360
32.7%
1210
29.1%
2 61
 
1.5%
1 45
 
1.1%
- 15
 
0.4%
3 13
 
0.3%
4 13
 
0.3%
5 12
 
0.3%
9 11
 
0.3%
Other values (11) 57
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37404
87.8%
ASCII 5106
 
12.0%
Compat Jamo 47
 
0.1%
None 22
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5055
 
13.5%
4737
 
12.7%
3695
 
9.9%
1130
 
3.0%
1111
 
3.0%
974
 
2.6%
821
 
2.2%
678
 
1.8%
587
 
1.6%
502
 
1.3%
Other values (558) 18114
48.4%
ASCII
ValueCountFrequency (%)
) 1361
26.7%
( 1360
26.6%
1210
23.7%
K 211
 
4.1%
S 210
 
4.1%
C 76
 
1.5%
I 71
 
1.4%
2 61
 
1.2%
G 56
 
1.1%
1 45
 
0.9%
Other values (50) 445
 
8.7%
None
ValueCountFrequency (%)
19
86.4%
· 3
 
13.6%
Compat Jamo
ValueCountFrequency (%)
8
17.0%
7
14.9%
5
10.6%
3
 
6.4%
3
 
6.4%
3
 
6.4%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (9) 10
21.3%
Distinct3514
Distinct (%)63.1%
Missing0
Missing (%)0.0%
Memory size43.6 KiB
Minimum1963-03-15 00:00:00
Maximum2023-12-06 00:00:00
2023-12-11T07:25:24.599372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:24.726537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

MISSING 

Distinct144
Distinct (%)91.1%
Missing5410
Missing (%)97.2%
Memory size43.6 KiB
Minimum1989-10-16 00:00:00
Maximum2023-05-10 00:00:00
2023-12-11T07:25:24.875084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:25.003201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태구분코드
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size43.6 KiB
03
2208 
01
1676 
07
1022 
06
354 
02
229 
Other values (3)
 
79

Length

Max length4
Median length2
Mean length2.0003592
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row05
2nd row01
3rd row01
4th row01
5th row01

Common Values

ValueCountFrequency (%)
03 2208
39.7%
01 1676
30.1%
07 1022
18.4%
06 354
 
6.4%
02 229
 
4.1%
05 72
 
1.3%
04 6
 
0.1%
BBBB 1
 
< 0.1%

Length

2023-12-11T07:25:25.343056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:25:25.446995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
03 2208
39.7%
01 1676
30.1%
07 1022
18.4%
06 354
 
6.4%
02 229
 
4.1%
05 72
 
1.3%
04 6
 
0.1%
bbbb 1
 
< 0.1%

영업상태명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size43.6 KiB
폐지
2208 
신규등록
1676 
영업개시
1022 
휴지사업재개
354 
등록취소
229 
Other values (3)
 
79

Length

Max length6
Median length4
Mean length3.3340517
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row사업휴지
2nd row신규등록
3rd row신규등록
4th row신규등록
5th row신규등록

Common Values

ValueCountFrequency (%)
폐지 2208
39.7%
신규등록 1676
30.1%
영업개시 1022
18.4%
휴지사업재개 354
 
6.4%
등록취소 229
 
4.1%
사업휴지 72
 
1.3%
사업정지 6
 
0.1%
<NA> 1
 
< 0.1%

Length

2023-12-11T07:25:25.569944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:25:25.678209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐지 2208
39.7%
신규등록 1676
30.1%
영업개시 1022
18.4%
휴지사업재개 354
 
6.4%
등록취소 229
 
4.1%
사업휴지 72
 
1.3%
사업정지 6
 
0.1%
na 1
 
< 0.1%
Distinct4527
Distinct (%)95.5%
Missing830
Missing (%)14.9%
Memory size43.6 KiB
2023-12-11T07:25:25.990153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.275433
Min length1

Characters and Unicode

Total characters53423
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4373 ?
Unique (%)92.3%

Sample

1st row031 5847776
2nd row031 5850236
3rd row031 5848339
4th row031 5840587
5th row031 5844300
ValueCountFrequency (%)
031 2994
31.4%
02 76
 
0.8%
032 66
 
0.7%
5151 43
 
0.5%
5189 40
 
0.4%
0031 34
 
0.4%
0343 26
 
0.3%
5182 22
 
0.2%
5145 19
 
0.2%
5149 17
 
0.2%
Other values (4681) 6204
65.0%
2023-12-11T07:25:26.441722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8218
15.4%
3 7834
14.7%
1 7797
14.6%
5071
9.5%
5 4997
9.4%
2 3582
6.7%
6 3407
6.4%
8 3207
 
6.0%
4 3171
 
5.9%
7 3084
 
5.8%
Other values (2) 3055
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 47717
89.3%
Space Separator 5071
 
9.5%
Dash Punctuation 635
 
1.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8218
17.2%
3 7834
16.4%
1 7797
16.3%
5 4997
10.5%
2 3582
7.5%
6 3407
7.1%
8 3207
 
6.7%
4 3171
 
6.6%
7 3084
 
6.5%
9 2420
 
5.1%
Space Separator
ValueCountFrequency (%)
5071
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 635
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 53423
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8218
15.4%
3 7834
14.7%
1 7797
14.6%
5071
9.5%
5 4997
9.4%
2 3582
6.7%
6 3407
6.4%
8 3207
 
6.0%
4 3171
 
5.9%
7 3084
 
5.8%
Other values (2) 3055
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 53423
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8218
15.4%
3 7834
14.7%
1 7797
14.6%
5071
9.5%
5 4997
9.4%
2 3582
6.7%
6 3407
6.4%
8 3207
 
6.0%
4 3171
 
5.9%
7 3084
 
5.8%
Other values (2) 3055
 
5.7%

소재지면적정보
Real number (ℝ)

MISSING  SKEWED 

Distinct1788
Distinct (%)48.7%
Missing1894
Missing (%)34.0%
Infinite0
Infinite (%)0.0%
Mean3025919.8
Minimum0
Maximum1.1111111 × 1010
Zeros26
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size49.1 KiB
2023-12-11T07:25:26.598080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile230.559
Q1774
median1010
Q31475.75
95-th percentile2486
Maximum1.1111111 × 1010
Range1.1111111 × 1010
Interquartile range (IQR)701.75

Descriptive statistics

Standard deviation1.8331072 × 108
Coefficient of variation (CV)60.580163
Kurtosis3673.9999
Mean3025919.8
Median Absolute Deviation (MAD)350
Skewness60.613528
Sum1.1117229 × 1010
Variance3.3602818 × 1016
MonotonicityNot monotonic
2023-12-11T07:25:26.741911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
990.0 78
 
1.4%
660.0 66
 
1.2%
1495.0 36
 
0.6%
998.0 35
 
0.6%
1490.0 34
 
0.6%
1000.0 29
 
0.5%
992.0 29
 
0.5%
995.0 28
 
0.5%
1498.0 28
 
0.5%
0.0 26
 
0.5%
Other values (1778) 3285
59.0%
(Missing) 1894
34.0%
ValueCountFrequency (%)
0.0 26
0.5%
1.0 7
 
0.1%
1.37 1
 
< 0.1%
1.45 1
 
< 0.1%
4.9 8
 
0.1%
5.0 1
 
< 0.1%
6.0 1
 
< 0.1%
6.12 1
 
< 0.1%
6.4 2
 
< 0.1%
7.2 1
 
< 0.1%
ValueCountFrequency (%)
11111111111.0 1
< 0.1%
1401569.0 1
< 0.1%
145970.0 1
< 0.1%
103380.0 1
< 0.1%
29487.0 1
< 0.1%
17282.0 1
< 0.1%
16852.0 2
< 0.1%
12000.0 1
< 0.1%
11832.0 1
< 0.1%
10314.0 1
< 0.1%

도로명우편번호
Text

MISSING 

Distinct441
Distinct (%)71.4%
Missing4950
Missing (%)88.9%
Memory size43.6 KiB
2023-12-11T07:25:27.039541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0032362
Min length5

Characters and Unicode

Total characters3092
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique353 ?
Unique (%)57.1%

Sample

1st row12438
2nd row10245
3rd row10252
4th row10386
5th row10271
ValueCountFrequency (%)
12532 27
 
4.4%
12611 16
 
2.6%
10801 9
 
1.5%
17118 7
 
1.1%
14058 7
 
1.1%
10857 5
 
0.8%
18255 5
 
0.8%
12774 4
 
0.6%
17797 4
 
0.6%
18284 4
 
0.6%
Other values (431) 530
85.8%
2023-12-11T07:25:27.479401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 924
29.9%
2 304
 
9.8%
7 295
 
9.5%
0 294
 
9.5%
8 293
 
9.5%
5 240
 
7.8%
4 217
 
7.0%
3 191
 
6.2%
6 173
 
5.6%
9 160
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3091
> 99.9%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 924
29.9%
2 304
 
9.8%
7 295
 
9.5%
0 294
 
9.5%
8 293
 
9.5%
5 240
 
7.8%
4 217
 
7.0%
3 191
 
6.2%
6 173
 
5.6%
9 160
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3092
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 924
29.9%
2 304
 
9.8%
7 295
 
9.5%
0 294
 
9.5%
8 293
 
9.5%
5 240
 
7.8%
4 217
 
7.0%
3 191
 
6.2%
6 173
 
5.6%
9 160
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3092
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 924
29.9%
2 304
 
9.8%
7 295
 
9.5%
0 294
 
9.5%
8 293
 
9.5%
5 240
 
7.8%
4 217
 
7.0%
3 191
 
6.2%
6 173
 
5.6%
9 160
 
5.2%
Distinct4539
Distinct (%)89.6%
Missing501
Missing (%)9.0%
Memory size43.6 KiB
2023-12-11T07:25:27.754770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length51
Mean length23.820012
Min length14

Characters and Unicode

Total characters120696
Distinct characters465
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4097 ?
Unique (%)80.9%

Sample

1st row경기도 가평군 청평면 경춘로 1013
2nd row경기도 가평군 설악면 유명로 867
3rd row경기도 가평군 조종면 청군로 1186
4th row경기도 가평군 설악면 유명로 ****
5th row경기도 가평군 청평면 북한강로 1659
ValueCountFrequency (%)
경기도 4983
 
18.2%
화성시 413
 
1.5%
용인시 352
 
1.3%
평택시 340
 
1.2%
고양시 261
 
1.0%
수원시 258
 
0.9%
부천시 252
 
0.9%
남양주시 248
 
0.9%
양주시 244
 
0.9%
파주시 215
 
0.8%
Other values (4897) 19865
72.4%
2023-12-11T07:25:28.130111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22704
 
18.8%
5383
 
4.5%
5255
 
4.4%
5218
 
4.3%
5090
 
4.2%
4784
 
4.0%
3641
 
3.0%
1 3513
 
2.9%
( 3056
 
2.5%
) 3056
 
2.5%
Other values (455) 58996
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71979
59.6%
Space Separator 22704
 
18.8%
Decimal Number 17951
 
14.9%
Open Punctuation 3058
 
2.5%
Close Punctuation 3058
 
2.5%
Other Punctuation 1241
 
1.0%
Dash Punctuation 634
 
0.5%
Uppercase Letter 54
 
< 0.1%
Lowercase Letter 16
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5383
 
7.5%
5255
 
7.3%
5218
 
7.2%
5090
 
7.1%
4784
 
6.6%
3641
 
5.1%
1623
 
2.3%
1548
 
2.2%
1337
 
1.9%
1281
 
1.8%
Other values (411) 36819
51.2%
Uppercase Letter
ValueCountFrequency (%)
S 16
29.6%
G 9
16.7%
K 9
16.7%
B 6
 
11.1%
C 3
 
5.6%
I 3
 
5.6%
D 2
 
3.7%
O 2
 
3.7%
N 1
 
1.9%
A 1
 
1.9%
Other values (2) 2
 
3.7%
Lowercase Letter
ValueCountFrequency (%)
e 4
25.0%
l 2
12.5%
n 2
12.5%
f 1
 
6.2%
s 1
 
6.2%
k 1
 
6.2%
a 1
 
6.2%
i 1
 
6.2%
r 1
 
6.2%
t 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
1 3513
19.6%
2 2447
13.6%
3 1878
10.5%
4 1666
9.3%
5 1574
8.8%
6 1474
8.2%
7 1423
7.9%
0 1416
7.9%
9 1303
 
7.3%
8 1257
 
7.0%
Other Punctuation
ValueCountFrequency (%)
* 667
53.7%
, 553
44.6%
. 19
 
1.5%
@ 2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 3056
99.9%
[ 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 3056
99.9%
] 2
 
0.1%
Space Separator
ValueCountFrequency (%)
22704
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 634
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71979
59.6%
Common 48646
40.3%
Latin 71
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5383
 
7.5%
5255
 
7.3%
5218
 
7.2%
5090
 
7.1%
4784
 
6.6%
3641
 
5.1%
1623
 
2.3%
1548
 
2.2%
1337
 
1.9%
1281
 
1.8%
Other values (411) 36819
51.2%
Latin
ValueCountFrequency (%)
S 16
22.5%
G 9
12.7%
K 9
12.7%
B 6
 
8.5%
e 4
 
5.6%
C 3
 
4.2%
I 3
 
4.2%
D 2
 
2.8%
l 2
 
2.8%
O 2
 
2.8%
Other values (14) 15
21.1%
Common
ValueCountFrequency (%)
22704
46.7%
1 3513
 
7.2%
( 3056
 
6.3%
) 3056
 
6.3%
2 2447
 
5.0%
3 1878
 
3.9%
4 1666
 
3.4%
5 1574
 
3.2%
6 1474
 
3.0%
7 1423
 
2.9%
Other values (10) 5855
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71979
59.6%
ASCII 48716
40.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22704
46.6%
1 3513
 
7.2%
( 3056
 
6.3%
) 3056
 
6.3%
2 2447
 
5.0%
3 1878
 
3.9%
4 1666
 
3.4%
5 1574
 
3.2%
6 1474
 
3.0%
7 1423
 
2.9%
Other values (33) 5925
 
12.2%
Hangul
ValueCountFrequency (%)
5383
 
7.5%
5255
 
7.3%
5218
 
7.2%
5090
 
7.1%
4784
 
6.6%
3641
 
5.1%
1623
 
2.3%
1548
 
2.2%
1337
 
1.9%
1281
 
1.8%
Other values (411) 36819
51.2%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct5207
Distinct (%)93.6%
Missing7
Missing (%)0.1%
Memory size43.6 KiB
2023-12-11T07:25:28.427945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length57
Mean length22.976443
Min length5

Characters and Unicode

Total characters127772
Distinct characters439
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4934 ?
Unique (%)88.7%

Sample

1st row경기도 가평군 청평면 하천리 243-2
2nd row경기도 가평군 설악면 방일리 248-1
3rd row경기도 가평군 조종면 현리 477-252번지
4th row경기도 가평군 설악면 천안리 ***-*
5th row경기도 가평군 청평면 삼회리 474-3번지
ValueCountFrequency (%)
경기도 5466
 
19.6%
화성시 404
 
1.4%
평택시 374
 
1.3%
용인시 362
 
1.3%
양주시 303
 
1.1%
고양시 293
 
1.0%
수원시 287
 
1.0%
남양주시 281
 
1.0%
부천시 265
 
0.9%
파주시 231
 
0.8%
Other values (6785) 19650
70.4%
2023-12-11T07:25:28.851012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25410
19.9%
5681
 
4.4%
5616
 
4.4%
5488
 
4.3%
5378
 
4.2%
1 4814
 
3.8%
- 4809
 
3.8%
3790
 
3.0%
2 3237
 
2.5%
3171
 
2.5%
Other values (429) 60378
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71710
56.1%
Space Separator 25410
 
19.9%
Decimal Number 24312
 
19.0%
Dash Punctuation 4809
 
3.8%
Other Punctuation 1250
 
1.0%
Close Punctuation 87
 
0.1%
Open Punctuation 87
 
0.1%
Uppercase Letter 85
 
0.1%
Lowercase Letter 16
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5681
 
7.9%
5616
 
7.8%
5488
 
7.7%
5378
 
7.5%
3790
 
5.3%
3171
 
4.4%
2768
 
3.9%
2556
 
3.6%
1689
 
2.4%
1645
 
2.3%
Other values (383) 33928
47.3%
Uppercase Letter
ValueCountFrequency (%)
S 22
25.9%
G 13
15.3%
K 12
14.1%
B 8
 
9.4%
L 7
 
8.2%
A 6
 
7.1%
C 4
 
4.7%
I 3
 
3.5%
T 3
 
3.5%
O 2
 
2.4%
Other values (4) 5
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
e 4
25.0%
l 2
12.5%
n 2
12.5%
f 1
 
6.2%
s 1
 
6.2%
i 1
 
6.2%
r 1
 
6.2%
c 1
 
6.2%
t 1
 
6.2%
k 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
1 4814
19.8%
2 3237
13.3%
3 2762
11.4%
4 2512
10.3%
5 2199
9.0%
6 2038
8.4%
7 1903
 
7.8%
0 1722
 
7.1%
9 1577
 
6.5%
8 1548
 
6.4%
Other Punctuation
ValueCountFrequency (%)
* 912
73.0%
, 320
 
25.6%
. 13
 
1.0%
@ 3
 
0.2%
/ 2
 
0.2%
Space Separator
ValueCountFrequency (%)
25410
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4809
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 87
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71710
56.1%
Common 55960
43.8%
Latin 102
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5681
 
7.9%
5616
 
7.8%
5488
 
7.7%
5378
 
7.5%
3790
 
5.3%
3171
 
4.4%
2768
 
3.9%
2556
 
3.6%
1689
 
2.4%
1645
 
2.3%
Other values (383) 33928
47.3%
Latin
ValueCountFrequency (%)
S 22
21.6%
G 13
12.7%
K 12
11.8%
B 8
 
7.8%
L 7
 
6.9%
A 6
 
5.9%
e 4
 
3.9%
C 4
 
3.9%
I 3
 
2.9%
T 3
 
2.9%
Other values (16) 20
19.6%
Common
ValueCountFrequency (%)
25410
45.4%
1 4814
 
8.6%
- 4809
 
8.6%
2 3237
 
5.8%
3 2762
 
4.9%
4 2512
 
4.5%
5 2199
 
3.9%
6 2038
 
3.6%
7 1903
 
3.4%
0 1722
 
3.1%
Other values (10) 4554
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71710
56.1%
ASCII 56061
43.9%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25410
45.3%
1 4814
 
8.6%
- 4809
 
8.6%
2 3237
 
5.8%
3 2762
 
4.9%
4 2512
 
4.5%
5 2199
 
3.9%
6 2038
 
3.6%
7 1903
 
3.4%
0 1722
 
3.1%
Other values (35) 4655
 
8.3%
Hangul
ValueCountFrequency (%)
5681
 
7.9%
5616
 
7.8%
5488
 
7.7%
5378
 
7.5%
3790
 
5.3%
3171
 
4.4%
2768
 
3.9%
2556
 
3.6%
1689
 
2.4%
1645
 
2.3%
Other values (383) 33928
47.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2399
Distinct (%)46.5%
Missing414
Missing (%)7.4%
Infinite0
Infinite (%)0.0%
Mean14243.804
Minimum1198
Maximum63104
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size49.1 KiB
2023-12-11T07:25:28.967806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1198
5-th percentile10231
Q111508
median14059
Q317141
95-th percentile18487
Maximum63104
Range61906
Interquartile range (IQR)5633

Descriptive statistics

Standard deviation3223.9271
Coefficient of variation (CV)0.22633891
Kurtosis23.205135
Mean14243.804
Median Absolute Deviation (MAD)2704
Skewness1.6600151
Sum73412567
Variance10393706
MonotonicityNot monotonic
2023-12-11T07:25:29.080108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12532 28
 
0.5%
10801 19
 
0.3%
12611 19
 
0.3%
17770 16
 
0.3%
11167 16
 
0.3%
11188 15
 
0.3%
11426 13
 
0.2%
10857 13
 
0.2%
18537 12
 
0.2%
10949 12
 
0.2%
Other values (2389) 4991
89.6%
(Missing) 414
 
7.4%
ValueCountFrequency (%)
1198 1
< 0.1%
1410 1
< 0.1%
1624 2
< 0.1%
1817 1
< 0.1%
2161 1
< 0.1%
2410 1
< 0.1%
2795 1
< 0.1%
2835 1
< 0.1%
3010 1
< 0.1%
3188 1
< 0.1%
ValueCountFrequency (%)
63104 1
 
< 0.1%
59712 1
 
< 0.1%
55359 1
 
< 0.1%
34850 3
0.1%
31748 1
 
< 0.1%
28936 1
 
< 0.1%
27802 1
 
< 0.1%
23108 1
 
< 0.1%
22817 1
 
< 0.1%
22709 1
 
< 0.1%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct4795
Distinct (%)90.1%
Missing245
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean37.442753
Minimum33.484678
Maximum38.189902
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size49.1 KiB
2023-12-11T07:25:29.208092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.484678
5-th percentile37.018481
Q137.249956
median37.405869
Q337.662403
95-th percentile37.88114
Maximum38.189902
Range4.7052243
Interquartile range (IQR)0.41244685

Descriptive statistics

Standard deviation0.27614974
Coefficient of variation (CV)0.007375252
Kurtosis8.9624307
Mean37.442753
Median Absolute Deviation (MAD)0.20024624
Skewness-0.6013899
Sum199307.78
Variance0.076258679
MonotonicityNot monotonic
2023-12-11T07:25:29.338743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5339522939 27
 
0.5%
37.357175753 14
 
0.3%
37.5157043 8
 
0.1%
37.4974642 7
 
0.1%
37.2879031 6
 
0.1%
37.340885 6
 
0.1%
37.6634381 5
 
0.1%
37.3410282 5
 
0.1%
37.3409323 5
 
0.1%
37.3951731088 5
 
0.1%
Other values (4785) 5235
94.0%
(Missing) 245
 
4.4%
ValueCountFrequency (%)
33.4846778 1
 
< 0.1%
34.7532725566 1
 
< 0.1%
35.7688596712 1
 
< 0.1%
36.330636 3
0.1%
36.4900663 1
 
< 0.1%
36.8828528825 1
 
< 0.1%
36.9170539 1
 
< 0.1%
36.9274805 1
 
< 0.1%
36.929173597 1
 
< 0.1%
36.9312447791 1
 
< 0.1%
ValueCountFrequency (%)
38.1899021 1
< 0.1%
38.1764918 2
< 0.1%
38.1563638656 1
< 0.1%
38.1531461911 1
< 0.1%
38.1514267604 1
< 0.1%
38.1444386 1
< 0.1%
38.1320799 1
< 0.1%
38.1302151 1
< 0.1%
38.111308628 1
< 0.1%
38.1105237251 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct4794
Distinct (%)90.1%
Missing245
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean127.0496
Minimum125.7044
Maximum127.79182
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size49.1 KiB
2023-12-11T07:25:29.518463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum125.7044
5-th percentile126.72581
Q1126.85729
median127.04252
Q3127.19412
95-th percentile127.49571
Maximum127.79182
Range2.0874239
Interquartile range (IQR)0.33683038

Descriptive statistics

Standard deviation0.2373214
Coefficient of variation (CV)0.0018679429
Kurtosis0.30840923
Mean127.0496
Median Absolute Deviation (MAD)0.16675929
Skewness0.51080598
Sum676285.01
Variance0.056321447
MonotonicityNot monotonic
2023-12-11T07:25:29.670682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.787798124 27
 
0.5%
127.644424863 14
 
0.3%
127.0219186 8
 
0.1%
127.0392498 7
 
0.1%
126.8635999 6
 
0.1%
126.8247273 6
 
0.1%
126.8064829 5
 
0.1%
126.8240611 5
 
0.1%
126.8262506 5
 
0.1%
126.972884612 5
 
0.1%
Other values (4784) 5235
94.0%
(Missing) 245
 
4.4%
ValueCountFrequency (%)
125.7043953 1
< 0.1%
126.4683464 1
< 0.1%
126.53240288 1
< 0.1%
126.5331441 1
< 0.1%
126.5368709 1
< 0.1%
126.5398188 1
< 0.1%
126.5432931633 1
< 0.1%
126.5456164 1
< 0.1%
126.5456164043 1
< 0.1%
126.5476927796 1
< 0.1%
ValueCountFrequency (%)
127.7918192359 1
 
< 0.1%
127.787798124 27
0.5%
127.7730676 1
 
< 0.1%
127.7669864 1
 
< 0.1%
127.7552359639 1
 
< 0.1%
127.7491218 1
 
< 0.1%
127.7475903 1
 
< 0.1%
127.7473951497 1
 
< 0.1%
127.7438863012 1
 
< 0.1%
127.7334509 1
 
< 0.1%

업태구분명정보
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size43.6 KiB
주유소
3893 
일반판매소
1424 
용제판매소
 
242
항공유판매소
 
5
특수판매소
 
3

Length

Max length8
Median length3
Mean length3.6030891
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row주유소
2nd row주유소
3rd row주유소
4th row주유소
5th row주유소

Common Values

ValueCountFrequency (%)
주유소 3893
69.9%
일반판매소 1424
 
25.6%
용제판매소 242
 
4.3%
항공유판매소 5
 
0.1%
특수판매소 3
 
0.1%
부생연료유판매소 1
 
< 0.1%

Length

2023-12-11T07:25:29.859915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:25:29.960258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주유소 3893
69.9%
일반판매소 1424
 
25.6%
용제판매소 242
 
4.3%
항공유판매소 5
 
0.1%
특수판매소 3
 
0.1%
부생연료유판매소 1
 
< 0.1%

X좌표값
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct4230
Distinct (%)87.9%
Missing757
Missing (%)13.6%
Infinite0
Infinite (%)0.0%
Mean204722.92
Minimum85905.278
Maximum269906.32
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size49.1 KiB
2023-12-11T07:25:30.093024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum85905.278
5-th percentile175921
Q1187531.94
median203923.91
Q3217436.92
95-th percentile244526.66
Maximum269906.32
Range184001.04
Interquartile range (IQR)29904.977

Descriptive statistics

Standard deviation21185.519
Coefficient of variation (CV)0.10348387
Kurtosis0.26885243
Mean204722.92
Median Absolute Deviation (MAD)14683.996
Skewness0.50284153
Sum9.8492195 × 108
Variance4.4882623 × 108
MonotonicityNot monotonic
2023-12-11T07:25:30.223475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
269560.377508183 27
 
0.5%
257022.898297917 16
 
0.3%
184440.540505444 16
 
0.3%
201872.403440328 8
 
0.1%
203405.721138531 7
 
0.1%
195688.283833074 6
 
0.1%
182860.249041445 5
 
0.1%
180987.37358885 4
 
0.1%
175470.687063644 4
 
0.1%
203381.197031295 4
 
0.1%
Other values (4220) 4714
84.7%
(Missing) 757
 
13.6%
ValueCountFrequency (%)
85905.2776136714 1
< 0.1%
158725.556008307 1
< 0.1%
158791.111955628 1
< 0.1%
159103.930494512 1
< 0.1%
159365.312415585 1
< 0.1%
159665.710032875 1
< 0.1%
159858.355550259 1
< 0.1%
160049.837135948 1
< 0.1%
160146.374864699 2
< 0.1%
160252.961083209 1
< 0.1%
ValueCountFrequency (%)
269906.317372191 1
 
< 0.1%
269560.377508183 27
0.5%
268415.615947562 1
 
< 0.1%
267833.702373223 1
 
< 0.1%
266779.3956952 1
 
< 0.1%
266106.603792296 1
 
< 0.1%
266093.737305732 1
 
< 0.1%
265764.643927842 1
 
< 0.1%
265060.712119 1
 
< 0.1%
264751.210194496 1
 
< 0.1%

Y좌표값
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct4230
Distinct (%)87.9%
Missing757
Missing (%)13.6%
Infinite0
Infinite (%)0.0%
Mean438256.53
Minimum139648.2
Maximum520789.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size49.1 KiB
2023-12-11T07:25:30.371603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum139648.2
5-th percentile390920.94
Q1416437.85
median434572.6
Q3462860.68
95-th percentile487396.54
Maximum520789.7
Range381141.5
Interquartile range (IQR)46422.827

Descriptive statistics

Standard deviation30407.88
Coefficient of variation (CV)0.069383747
Kurtosis1.4400517
Mean438256.53
Median Absolute Deviation (MAD)22712.642
Skewness-0.092407516
Sum2.1084522 × 109
Variance9.2463919 × 108
MonotonicityNot monotonic
2023-12-11T07:25:30.531506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
448250.823117257 27
 
0.5%
426562.300646577 16
 
0.3%
428540.793441867 16
 
0.3%
445952.75898316 8
 
0.1%
443926.327308964 7
 
0.1%
429949.927814167 6
 
0.1%
462365.49736364 5
 
0.1%
381995.944871202 4
 
0.1%
487995.936237086 4
 
0.1%
489453.722194982 4
 
0.1%
Other values (4220) 4714
84.7%
(Missing) 757
 
13.6%
ValueCountFrequency (%)
139648.198762 1
 
< 0.1%
252109.053197 1
 
< 0.1%
314513.366319 3
0.1%
332369.750411 1
 
< 0.1%
379515.520853534 1
 
< 0.1%
381110.40594389 1
 
< 0.1%
381426.387293525 1
 
< 0.1%
381673.716219975 2
< 0.1%
381816.984159735 1
 
< 0.1%
381912.338691767 1
 
< 0.1%
ValueCountFrequency (%)
520789.699438722 1
< 0.1%
519299.425751985 2
< 0.1%
517104.065497182 1
< 0.1%
516733.793967712 1
< 0.1%
516511.013661792 1
< 0.1%
515741.409308819 1
< 0.1%
514368.573450798 1
< 0.1%
513067.890842246 1
< 0.1%
512097.881604141 1
< 0.1%
511974.626952408 1
< 0.1%

자본금
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5568
Missing (%)100.0%
Memory size49.1 KiB

거래처
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5568
Missing (%)100.0%
Memory size49.1 KiB

Interactions

2023-12-11T07:25:22.599370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:19.522531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.298280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.875041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.460900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.039245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.690214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:19.599049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.394492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.972185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.554507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.130425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.776815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:19.678133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.479779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.085142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.663246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.225332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.867115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:19.751898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.572239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.165584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.754941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.317788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.947298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:19.849473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.672826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.248527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.858163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.409114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:23.038707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.178710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:20.779589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.357194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:21.948232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:22.505542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:25:30.634851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태구분코드영업상태명소재지면적정보소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값
시군명1.0000.4580.4550.0000.8760.9730.9160.2850.9170.955
영업상태구분코드0.4581.0001.0000.0000.1840.1120.2350.2830.1670.102
영업상태명0.4551.0001.0000.0000.1280.1740.1630.2670.2530.159
소재지면적정보0.0000.0000.0001.0000.0000.0000.0000.045NaNNaN
소재지우편번호0.8760.1840.1280.0001.0000.8890.6020.0660.2980.967
WGS84위도0.9730.1120.1740.0000.8891.0000.4080.0540.2100.916
WGS84경도0.9160.2350.1630.0000.6020.4081.0000.1171.0000.260
업태구분명정보0.2850.2830.2670.0450.0660.0540.1171.0000.1200.060
X좌표값0.9170.1670.253NaN0.2980.2101.0000.1201.0000.387
Y좌표값0.9550.1020.159NaN0.9670.9160.2600.0600.3871.000
2023-12-11T07:25:30.760221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업태구분명정보시군명영업상태명영업상태구분코드
업태구분명정보1.0000.1290.1620.161
시군명0.1291.0000.2080.196
영업상태명0.1620.2081.0001.000
영업상태구분코드0.1610.1961.0001.000
2023-12-11T07:25:30.864098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지면적정보소재지우편번호WGS84위도WGS84경도X좌표값Y좌표값시군명영업상태구분코드영업상태명업태구분명정보
소재지면적정보1.0000.159-0.1450.0260.034-0.1370.0000.0000.0000.056
소재지우편번호0.1591.000-0.9060.0900.106-0.9050.7020.0620.0690.037
WGS84위도-0.145-0.9061.000-0.124-0.1230.9990.9060.0600.0610.032
WGS84경도0.0260.090-0.1241.0000.999-0.1210.6900.0810.0880.065
X좌표값0.0340.106-0.1230.9991.000-0.1210.6920.0820.0900.070
Y좌표값-0.137-0.9050.999-0.121-0.1211.0000.8320.0490.0550.045
시군명0.0000.7020.9060.6900.6920.8321.0000.1960.2080.129
영업상태구분코드0.0000.0620.0600.0810.0820.0490.1961.0001.0000.161
영업상태명0.0000.0690.0610.0880.0900.0550.2081.0001.0000.162
업태구분명정보0.0560.0370.0320.0650.0700.0450.1290.1610.1621.000

Missing values

2023-12-11T07:25:23.182921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:25:23.452137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:25:23.655578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값자본금거래처
0가평군경기주유소20051228<NA>05사업휴지<NA>679.44<NA>경기도 가평군 청평면 경춘로 1013경기도 가평군 청평면 하천리 243-21245037.751616127.432401주유소238040.212712472196.966674<NA><NA>
1가평군유명산주유소19920725<NA>01신규등록031 58477761845.0<NA>경기도 가평군 설악면 유명로 867경기도 가평군 설악면 방일리 248-11247237.62123127.494901주유소243627.667228457762.369037<NA><NA>
2가평군대성주유소19930614<NA>01신규등록031 5850236756.0<NA>경기도 가평군 조종면 청군로 1186경기도 가평군 조종면 현리 477-252번지1243837.81112127.352262주유소230952.040149478767.127157<NA><NA>
3가평군스카이주유소19950918<NA>01신규등록031 58483391480.0<NA>경기도 가평군 설악면 유명로 ****경기도 가평군 설악면 천안리 ***-*12469<NA><NA>주유소242324.357912461214.690315<NA><NA>
4가평군고동산주유소19940119<NA>01신규등록031 58405871487.0<NA>경기도 가평군 청평면 북한강로 1659경기도 가평군 청평면 삼회리 474-3번지1245837.669643127.381601주유소233602.599703463104.5086<NA><NA>
5가평군(주)설악주유소19890729<NA>01신규등록031 58443001128.0<NA>경기도 가평군 설악면 유명로 1711경기도 가평군 설악면 선촌리 472-3번지1246637.67695127.480665주유소242339.934756463955.596159<NA><NA>
6가평군호반주유소1994-12-06<NA>01신규등록031 58488751109.0<NA>경기도 가평군 청평면 북한강로 2187경기도 가평군 청평면 삼회리 85-11245837.710375127.400263주유소235240.699148467642.744258<NA><NA>
7가평군강변셀프주유소1993-03-27<NA>01신규등록031 58151511270.0<NA>경기도 가평군 청평면 경춘로 1191경기도 가평군 청평면 상천리 1151-111244937.76443127.443448주유소238999.531553473640.931662<NA><NA>
8가평군청풍주유소19930204<NA>01신규등록031 5854667<NA><NA>경기도 가평군 청평면 경춘로 257경기도 가평군 청평면 대성리 344-171245737.698472127.384318주유소233822.390784466306.857152<NA><NA>
9가평군KH에너지(주) 직영 가평주유소[춘천방향]2009-07-03<NA>01신규등록<NA>634.1<NA>경기도 가평군 설악면 미사리로540번길 51경기도 가평군 설악면 미사리 149-41246237.701936127.546794주유소248130.438975466721.141902<NA><NA>
시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값자본금거래처
5558<NA>대경티엘에스(주)대경주유소20040525<NA>03폐지031 3588267<NA><NA>서울특별시 양천구 신목로2길 11, 3동 1202호 (신정동,청구아파트101-)서울특별시 양천구 신정동 336 19통4반 청구아파트101- 3동 1202호801537.518852126.875489주유소188934.31909446320.477875<NA><NA>
5559<NA>기양주유소19890929<NA>03폐지031 35857581976.0<NA>서울특별시 서대문구 연세로9길 20서울특별시 서대문구 창천동 52-31378837.558616126.935765주유소<NA><NA><NA><NA>
5560<NA>sk문학주유소20050803<NA>03폐지031 35631801127.0<NA>인천광역시 남동구 오봉로 33인천광역시 남동구 도림동 630-1 31통4반 주공그린빌아파트 204동 1601호2165637.419016126.730094주유소175978.705139435127.025484<NA><NA>
5561<NA>비봉주유소20050627<NA>03폐지031 3565154<NA><NA>서울특별시 송파구 석촌호수로 278, 701호 (송파동,여흥레이크빌)서울특별시 송파구 송파동 31 6통7반 여흥레이크빌 701호562337.510383127.10652주유소209333.010334445350.002674<NA><NA>
5562<NA>sk동화주유소20050518<NA>03폐지0220577005918.0<NA>서울특별시 종로구 종로 26 (서린동)서울특별시 종로구 서린동 99318837.569691126.980309주유소198195.965393451925.631118<NA><NA>
5563<NA>보령개발(주)화성(하)주유소20031219<NA>03폐지031 3538015<NA><NA>서울특별시 서초구 방배중앙로9길 32 (방배동)서울특별시 서초구 방배동 947-42 2통1반667437.484496126.988828주유소198940.583575442478.024686<NA><NA>
5564<NA>서경유업상사20031001<NA>03폐지031 5625043<NA><NA>인천광역시 서구 완정로34번길 20 111동 403호 (마전동,동아아파트)인천광역시 서구 마전동 동아A 111/4032264237.597063126.672939일반판매소<NA><NA><NA><NA>
5565<NA>동우주유소19890523<NA>03폐지031 4252650<NA><NA>서울특별시 강남구 역삼로25길 21 (역삼동)서울특별시 강남구 역삼동 727-9번지 8통1반622437.497464127.03925일반판매소203405.721139443926.327309<NA><NA>
5566<NA>옥현주유소20040201<NA>03폐지0317715 175<NA><NA>서울특별시 서초구 사평대로14길 23 702호 (반포동,그랑빌아파트)서울특별시 서초구 반포동 110-6번지 4통2반 그랑빌 702호657537.497154126.994638일반판매소199459.714447443891.115702<NA><NA>
5567<NA>그랜드주유소19840302<NA>03폐지03105934426859.0<NA>충청북도 진천군 광혜원면 화랑1길 14충청북도 진천군 광혜원면 광혜원리 529-14번지2780236.993447127.437578주유소238887.152175388077.219737<NA><NA>

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값# duplicates
16<NA>(주)한석디앤아이20050302<NA>01신규등록03111111 111<NA><NA>서울특별시 강남구 도산대로6길 15, 302호 (논현동,평화빌딩)서울특별시 강남구 논현동 3-15번지 평화빌딩 302호603837.515704127.021919주유소201872.40344445952.7589836
0고양시대풍주유소20050104<NA>03폐지<NA>248.0<NA>경기도 고양시 일산동구 백마로 526 (풍동)경기도 고양시 일산동구 풍동 113-2번지 외1필지1030037.663438126.806483주유소182860.249041462365.4973643
1고양시대풍주유소20050104<NA>03폐지<NA>248.0<NA><NA>경기도 고양시 일산동구 풍동 113-1번지 외1필지<NA>37.663627126.806267주유소182841.595409462386.7444143
2고양시형진에너지20050315<NA>03폐지<NA>372.0<NA>경기도 고양시 일산동구 고봉로819번길 8 (설문동)경기도 고양시 일산동구 설문동 590-2번지1024537.726401126.798229주유소182147.61861469355.3539182
3남양주시강북강변주유소20060201<NA>01신규등록<NA>160.0<NA><NA>경기도 남양주시 금곡동 486번지 2통3반1224037.623451127.205757주유소<NA><NA>2
4남양주시굿서비스20050322<NA>01신규등록55555555<NA><NA><NA>경기도 남양주시 진건읍 신월리 649번지 15통1213137.661749127.153558주유소<NA><NA>2
5남양주시월문20050323<NA>03폐지555555555555<NA><NA>경기도 남양주시 화도읍 수레로1233번길 8 103동 1001호 (창현리,두산1차아파트)경기도 남양주시 화도읍 창현리 757번지 46통10반 두산아파트1단지 103동 1001호1218637.648439127.301404주유소226501.523373460651.1913222
6남양주시초ㅠ20050324<NA>01신규등록1111111111<NA><NA>경기도 남양주시 경춘로773번길 7-2 (일패동)경기도 남양주시 일패동 69-10번지 1통2반1224237.627145127.189613주유소216673.185956458335.5456452
7동두천시오일딜러20221104<NA>01신규등록<NA>995.0<NA><NA>경기도 동두천시 상패동 398-21134037.907629127.03148일반판매소202703.891194489453.7221952
8성남시세원주유소19830518<NA>03폐지0342753 88331039.5<NA><NA>경기도 성남시 수정구 태평동 3659,3666,3667번지1330537.441967127.134595주유소211845.273351437775.7507652