Overview

Dataset statistics

Number of variables: 8
Number of observations: 3613
Missing cells: 749
Missing cells (%): 2.6%
Duplicate rows: 8
Duplicate rows (%): 0.2%
Total size in memory: 225.9 KiB
Average record size in memory: 64.0 B
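
The derived figures in this overview can be cross-checked by hand. The following is a minimal sketch, assuming that missing cells are counted over variables × observations, that the per-variable missing counts are the ones reported in the Variables section (3, 58 and 688), and that 1 KiB = 1024 B. The variable names are illustrative only.

```python
# Figures copied from this report.
n_vars, n_rows = 8, 3613
missing_per_variable = [3, 58, 688]  # the three variables that report missing values
duplicate_rows = 8
total_kib = 225.9

missing_cells = sum(missing_per_variable)
missing_pct = 100 * missing_cells / (n_vars * n_rows)
duplicate_pct = 100 * duplicate_rows / n_rows
avg_record_bytes = total_kib * 1024 / n_rows  # assumes 1 KiB = 1024 B

print(missing_cells, round(missing_pct, 1), round(duplicate_pct, 1), round(avg_record_bytes, 1))
```

Under these assumptions the results agree with the values listed above (749 missing cells, 2.6%, 0.2%, 64.0 B).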

Variable types

Text: 3
DateTime: 1
Categorical: 3
Boolean: 1

Dataset

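Description: Data on licensed lodging businesses in normal operating status, taken from the Gyeongsangnam-do Big Data Hub Platform database. It provides the business name, address, license date, business status, business category (자동차야영장업 (car campground), 한옥체험업 (hanok stay), 농어촌민박업 (rural guesthouse), etc.), and whether the site is a multi-use facility.
Author: Gyeongsangnam-do (경상남도)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15122722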

Alerts

영업상태 has constant value "영업/정상" (Constant)
Dataset has 8 (0.2%) duplicate rows (Duplicates)
업종구분 is highly overall correlated with 상세영업상태 and 1 other field (High correlation)
상세영업상태 is highly overall correlated with 업종구분 and 1 other field (High correlation)
다중이용업소여부 is highly overall correlated with 상세영업상태 and 1 other field (High correlation)
상세영업상태 is highly imbalanced (56.0%) (Imbalance)
업종구분 is highly imbalanced (63.5%) (Imbalance)
다중이용업소여부 is highly imbalanced (95.4%) (Imbalance)
도로명주소 has 58 (1.6%) missing values (Missing)
다중이용업소여부 has 688 (19.0%) missing values (Missing)
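
The imbalance percentages above are consistent with a score of 1 minus the normalised Shannon entropy of the value counts. The sketch below is an assumption about how such a figure can be derived, not the profiling library's own code. `imbalance_score` is a hypothetical helper, and the counts are copied from the Common Values tables of this report (missing values excluded for 다중이용업소여부).

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform split, close to 1 for one dominant class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)

detail_status = [2980, 618, 15]                          # 상세영업상태
business_type = [2980, 193, 143, 123, 102, 31, 26, 15]   # 업종구분
multi_use = [2910, 15]                                   # 다중이용업소여부 (False, True)

for name, counts in [("상세영업상태", detail_status),
                     ("업종구분", business_type),
                     ("다중이용업소여부", multi_use)]:
    print(name, f"{100 * imbalance_score(counts):.1f}%")
```

With these counts the sketch yields scores that round to the 56.0%, 63.5% and 95.4% reported in the alerts.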

Reproduction

Analysis started: 2023-12-11 00:03:40.486441
Analysis finished: 2023-12-11 00:03:42.582374
Duration: 2.1 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업장명
Text

Distinct: 3037
Distinct (%): 84.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 35
Median length: 30
Mean length: 5.8527539
Min length: 1

Characters and Unicode

Total characters: 21146
Distinct characters: 689
Distinct categories: 14
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
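
As an illustration of these character properties, the following sketch counts Unicode general categories with Python's standard `unicodedata` module. It is not the report's implementation, and `category_counts` is a hypothetical helper. The input string is one of the business names shown in the Sample section of this report.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Category codes correspond to the names used in this report, for example:
# Lo = Other Letter, Lu = Uppercase Letter, Nd = Decimal Number,
# Zs = Space Separator, Ps = Open Punctuation, Pe = Close Punctuation.
sample = "안가(安家)"
counts = category_counts(sample)
print(counts)
```

Hangul syllables and Han ideographs both fall under Other Letter (`Lo`), which is why that category dominates the text variables here. Script and block properties are not exposed by `unicodedata` and would need an additional data source.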

Unique

Unique: 2664
Unique (%): 73.7%

Sample

1st row: 한일장여관
2nd row: 황산문화체육공원 국민여가캠핑장
3rd row: 대운산 자연휴양림 야영장
4th row: 호텔시카고
5th row: 바드리산장모텔

Value | Count | Frequency (%)
모텔 | 79 | 1.8%
호텔 | 46 | 1.1%
캠핑장 | 35 | 0.8%
펜션 | 29 | 0.7%
야영장 | 21 | 0.5%
오토캠핑장 | 14 | 0.3%
hotel | 13 | 0.3%
게스트하우스 | 13 | 0.3%
주식회사 | 12 | 0.3%
글램핑 | 12 | 0.3%
Other values (3220) | 4001 | 93.6%

Most occurring characters

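Value | Count | Frequency (%)
(not captured) | 1941 | 9.2%
(not captured) | 1352 | 6.4%
(space) | 664 | 3.1%
(not captured) | 628 | 3.0%
(not captured) | 614 | 2.9%
(not captured) | 575 | 2.7%
(not captured) | 476 | 2.3%
(not captured) | 457 | 2.2%
(not captured) | 356 | 1.7%
(not captured) | 326 | 1.5%
Other values (679) | 13757 | 65.1%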

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 18628 | 88.1%
Uppercase Letter | 780 | 3.7%
Space Separator | 664 | 3.1%
Lowercase Letter | 340 | 1.6%
Open Punctuation | 233 | 1.1%
Close Punctuation | 233 | 1.1%
Decimal Number | 188 | 0.9%
Other Punctuation | 54 | 0.3%
Dash Punctuation | 11 | 0.1%
Math Symbol | 8 | < 0.1%
Other values (4) | 7 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 1941 | 10.4%
(not captured) | 1352 | 7.3%
(not captured) | 628 | 3.4%
(not captured) | 614 | 3.3%
(not captured) | 575 | 3.1%
(not captured) | 476 | 2.6%
(not captured) | 457 | 2.5%
(not captured) | 356 | 1.9%
(not captured) | 326 | 1.8%
(not captured) | 220 | 1.2%
Other values (599) | 11683 | 62.7%

Uppercase Letter
Value | Count | Frequency (%)
O | 65 | 8.3%
E | 58 | 7.4%
M | 58 | 7.4%
T | 57 | 7.3%
H | 52 | 6.7%
L | 43 | 5.5%
A | 43 | 5.5%
S | 42 | 5.4%
N | 40 | 5.1%
B | 37 | 4.7%
Other values (16) | 285 | 36.5%

Lowercase Letter
Value | Count | Frequency (%)
e | 65 | 19.1%
o | 44 | 12.9%
t | 29 | 8.5%
n | 25 | 7.4%
l | 21 | 6.2%
i | 21 | 6.2%
s | 19 | 5.6%
a | 17 | 5.0%
h | 16 | 4.7%
r | 14 | 4.1%
Other values (12) | 69 | 20.3%

Decimal Number
Value | Count | Frequency (%)
2 | 46 | 24.5%
1 | 30 | 16.0%
5 | 25 | 13.3%
9 | 18 | 9.6%
7 | 14 | 7.4%
0 | 14 | 7.4%
3 | 13 | 6.9%
8 | 11 | 5.9%
4 | 10 | 5.3%
6 | 7 | 3.7%

Other Punctuation
Value | Count | Frequency (%)
& | 19 | 35.2%
. | 14 | 25.9%
· | 7 | 13.0%
' | 5 | 9.3%
, | 5 | 9.3%
: | 2 | 3.7%
@ | 1 | 1.9%
/ | 1 | 1.9%

Math Symbol
Value | Count | Frequency (%)
+ | 6 | 75.0%
< | 1 | 12.5%
> | 1 | 12.5%

Open Punctuation
Value | Count | Frequency (%)
( | 232 | 99.6%
(not captured) | 1 | 0.4%

Close Punctuation
Value | Count | Frequency (%)
) | 232 | 99.6%
(not captured) | 1 | 0.4%

Letter Number
Value | Count | Frequency (%)
(not captured) | 2 | 50.0%
(not captured) | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 664 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Other Number
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 18624 | 88.1%
Common | 1394 | 6.6%
Latin | 1124 | 5.3%
Han | 4 | < 0.1%

Most frequent character per script

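Hangul
Value | Count | Frequency (%)
(not captured) | 1941 | 10.4%
(not captured) | 1352 | 7.3%
(not captured) | 628 | 3.4%
(not captured) | 614 | 3.3%
(not captured) | 575 | 3.1%
(not captured) | 476 | 2.6%
(not captured) | 457 | 2.5%
(not captured) | 356 | 1.9%
(not captured) | 326 | 1.8%
(not captured) | 220 | 1.2%
Other values (596) | 11679 | 62.7%

Latin
Value | Count | Frequency (%)
O | 65 | 5.8%
e | 65 | 5.8%
E | 58 | 5.2%
M | 58 | 5.2%
T | 57 | 5.1%
H | 52 | 4.6%
o | 44 | 3.9%
L | 43 | 3.8%
A | 43 | 3.8%
S | 42 | 3.7%
Other values (40) | 597 | 53.1%

Common
Value | Count | Frequency (%)
(space) | 664 | 47.6%
( | 232 | 16.6%
) | 232 | 16.6%
2 | 46 | 3.3%
1 | 30 | 2.2%
5 | 25 | 1.8%
& | 19 | 1.4%
9 | 18 | 1.3%
. | 14 | 1.0%
7 | 14 | 1.0%
Other values (20) | 100 | 7.2%

Han
Value | Count | Frequency (%)
(not captured) | 2 | 50.0%
(not captured) | 1 | 25.0%
(not captured) | 1 | 25.0%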

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 18624 | 88.1%
ASCII | 2503 | 11.8%
None | 10 | < 0.1%
CJK | 4 | < 0.1%
Number Forms | 4 | < 0.1%
Letterlike Symbols | 1 | < 0.1%

Most frequent character per block

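Hangul
Value | Count | Frequency (%)
(not captured) | 1941 | 10.4%
(not captured) | 1352 | 7.3%
(not captured) | 628 | 3.4%
(not captured) | 614 | 3.3%
(not captured) | 575 | 3.1%
(not captured) | 476 | 2.6%
(not captured) | 457 | 2.5%
(not captured) | 356 | 1.9%
(not captured) | 326 | 1.8%
(not captured) | 220 | 1.2%
Other values (596) | 11679 | 62.7%

ASCII
Value | Count | Frequency (%)
(space) | 664 | 26.5%
( | 232 | 9.3%
) | 232 | 9.3%
O | 65 | 2.6%
e | 65 | 2.6%
E | 58 | 2.3%
M | 58 | 2.3%
T | 57 | 2.3%
H | 52 | 2.1%
2 | 46 | 1.8%
Other values (63) | 974 | 38.9%

None
Value | Count | Frequency (%)
· | 7 | 70.0%
(not captured) | 1 | 10.0%
(not captured) | 1 | 10.0%
(not captured) | 1 | 10.0%

CJK
Value | Count | Frequency (%)
(not captured) | 2 | 50.0%
(not captured) | 1 | 25.0%
(not captured) | 1 | 25.0%

Number Forms
Value | Count | Frequency (%)
(not captured) | 2 | 50.0%
(not captured) | 2 | 50.0%

Letterlike Symbols
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%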

지번주소
Text

Distinct: 3508
Distinct (%): 97.2%
Missing: 3
Missing (%): 0.1%
Memory size: 28.4 KiB

Length

Max length: 82
Median length: 52
Mean length: 24.614404
Min length: 15

Characters and Unicode

Total characters: 88858
Distinct characters: 401
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3

Unique

Unique: 3414
Unique (%): 94.6%

Sample

1st row: 경상남도 거창군 거창읍 대동리 698-4
2nd row: 경상남도 양산시 물금읍 물금리 225-1
3rd row: 경상남도 양산시 용당동 산 66번지
4th row: 경상남도 창원시 의창구 명서동 204-2번지 2,3,4,5,6층
5th row: 경상남도 밀양시 단장면 구천리 685-3번지

Value | Count | Frequency (%)
경상남도 | 3610 | 20.0%
창원시 | 864 | 4.8%
거제시 | 361 | 2.0%
통영시 | 345 | 1.9%
김해시 | 307 | 1.7%
진주시 | 291 | 1.6%
마산합포구 | 227 | 1.3%
성산구 | 205 | 1.1%
남해군 | 190 | 1.1%
사천시 | 185 | 1.0%
Other values (4627) | 11441 | 63.5%

Most occurring characters

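Value | Count | Frequency (%)
(space) | 17964 | 20.2%
(not captured) | 4198 | 4.7%
(not captured) | 3969 | 4.5%
(not captured) | 3719 | 4.2%
(not captured) | 3617 | 4.1%
1 | 3488 | 3.9%
- | 3063 | 3.4%
(not captured) | 2693 | 3.0%
(not captured) | 2478 | 2.8%
(not captured) | 2202 | 2.5%
Other values (391) | 41467 | 46.7%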

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 50827 | 57.2%
Space Separator | 17964 | 20.2%
Decimal Number | 16080 | 18.1%
Dash Punctuation | 3063 | 3.4%
Other Punctuation | 555 | 0.6%
Math Symbol | 127 | 0.1%
Close Punctuation | 107 | 0.1%
Open Punctuation | 107 | 0.1%
Uppercase Letter | 28 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 4198 | 8.3%
(not captured) | 3969 | 7.8%
(not captured) | 3719 | 7.3%
(not captured) | 3617 | 7.1%
(not captured) | 2693 | 5.3%
(not captured) | 2478 | 4.9%
(not captured) | 2202 | 4.3%
(not captured) | 1952 | 3.8%
(not captured) | 1728 | 3.4%
(not captured) | 1334 | 2.6%
Other values (360) | 22937 | 45.1%

Uppercase Letter
Value | Count | Frequency (%)
A | 5 | 17.9%
B | 4 | 14.3%
N | 4 | 14.3%
M | 3 | 10.7%
L | 2 | 7.1%
C | 2 | 7.1%
V | 1 | 3.6%
G | 1 | 3.6%
Y | 1 | 3.6%
T | 1 | 3.6%
Other values (4) | 4 | 14.3%

Decimal Number
Value | Count | Frequency (%)
1 | 3488 | 21.7%
2 | 2092 | 13.0%
3 | 1719 | 10.7%
4 | 1456 | 9.1%
5 | 1424 | 8.9%
6 | 1353 | 8.4%
7 | 1265 | 7.9%
8 | 1138 | 7.1%
0 | 1087 | 6.8%
9 | 1058 | 6.6%

Other Punctuation
Value | Count | Frequency (%)
, | 469 | 84.5%
. | 86 | 15.5%

Space Separator
Value | Count | Frequency (%)
(space) | 17964 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3063 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 127 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 107 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 107 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 50826 | 57.2%
Common | 38003 | 42.8%
Latin | 28 | < 0.1%
Han | 1 | < 0.1%

Most frequent character per script

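Hangul
Value | Count | Frequency (%)
(not captured) | 4198 | 8.3%
(not captured) | 3969 | 7.8%
(not captured) | 3719 | 7.3%
(not captured) | 3617 | 7.1%
(not captured) | 2693 | 5.3%
(not captured) | 2478 | 4.9%
(not captured) | 2202 | 4.3%
(not captured) | 1952 | 3.8%
(not captured) | 1728 | 3.4%
(not captured) | 1334 | 2.6%
Other values (359) | 22936 | 45.1%

Common
Value | Count | Frequency (%)
(space) | 17964 | 47.3%
1 | 3488 | 9.2%
- | 3063 | 8.1%
2 | 2092 | 5.5%
3 | 1719 | 4.5%
4 | 1456 | 3.8%
5 | 1424 | 3.7%
6 | 1353 | 3.6%
7 | 1265 | 3.3%
8 | 1138 | 3.0%
Other values (7) | 3041 | 8.0%

Latin
Value | Count | Frequency (%)
A | 5 | 17.9%
B | 4 | 14.3%
N | 4 | 14.3%
M | 3 | 10.7%
L | 2 | 7.1%
C | 2 | 7.1%
V | 1 | 3.6%
G | 1 | 3.6%
Y | 1 | 3.6%
T | 1 | 3.6%
Other values (4) | 4 | 14.3%

Han
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%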

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 50826 | 57.2%
ASCII | 38031 | 42.8%
CJK | 1 | < 0.1%

Most frequent character per block

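ASCII
Value | Count | Frequency (%)
(space) | 17964 | 47.2%
1 | 3488 | 9.2%
- | 3063 | 8.1%
2 | 2092 | 5.5%
3 | 1719 | 4.5%
4 | 1456 | 3.8%
5 | 1424 | 3.7%
6 | 1353 | 3.6%
7 | 1265 | 3.3%
8 | 1138 | 3.0%
Other values (21) | 3069 | 8.1%

Hangul
Value | Count | Frequency (%)
(not captured) | 4198 | 8.3%
(not captured) | 3969 | 7.8%
(not captured) | 3719 | 7.3%
(not captured) | 3617 | 7.1%
(not captured) | 2693 | 5.3%
(not captured) | 2478 | 4.9%
(not captured) | 2202 | 4.3%
(not captured) | 1952 | 3.8%
(not captured) | 1728 | 3.4%
(not captured) | 1334 | 2.6%
Other values (359) | 22936 | 45.1%

CJK
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%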

도로명주소
Text

MISSING 

Distinct: 3393
Distinct (%): 95.4%
Missing: 58
Missing (%): 1.6%
Memory size: 28.4 KiB

Length

Max length: 82
Median length: 54
Mean length: 26.728833
Min length: 16

Characters and Unicode

Total characters: 95021
Distinct characters: 446
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4

Unique

Unique: 3245
Unique (%): 91.3%

Sample

1st row: 경상남도 거창군 거창읍 거열로 188-1
2nd row: 경상남도 양산시 탑골길 208-124 (용당동, 대운산 자연휴양림)
3rd row: 경상남도 창원시 의창구 창이대로99번길 7, 2.3.4.5.6층 (명서동)
4th row: 경상남도 밀양시 단장면 바드리길 52
5th row: 경상남도 밀양시 초동면 검암3길 2

Value | Count | Frequency (%)
경상남도 | 3555 | 18.1%
창원시 | 861 | 4.4%
거제시 | 359 | 1.8%
통영시 | 341 | 1.7%
김해시 | 304 | 1.5%
진주시 | 291 | 1.5%
마산합포구 | 224 | 1.1%
성산구 | 205 | 1.0%
남해군 | 188 | 1.0%
사천시 | 186 | 0.9%
Other values (3747) | 13148 | 66.9%

Most occurring characters

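Value | Count | Frequency (%)
(space) | 16110 | 17.0%
(not captured) | 4412 | 4.6%
(not captured) | 3914 | 4.1%
(not captured) | 3715 | 3.9%
1 | 3611 | 3.8%
(not captured) | 3598 | 3.8%
(not captured) | 2728 | 2.9%
(not captured) | 2709 | 2.9%
(not captured) | 2473 | 2.6%
2 | 2114 | 2.2%
Other values (436) | 49637 | 52.2%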

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 56358 | 59.3%
Space Separator | 16110 | 17.0%
Decimal Number | 15294 | 16.1%
Close Punctuation | 2106 | 2.2%
Open Punctuation | 2106 | 2.2%
Other Punctuation | 1468 | 1.5%
Dash Punctuation | 1342 | 1.4%
Math Symbol | 208 | 0.2%
Uppercase Letter | 28 | < 0.1%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 4412 | 7.8%
(not captured) | 3914 | 6.9%
(not captured) | 3715 | 6.6%
(not captured) | 3598 | 6.4%
(not captured) | 2728 | 4.8%
(not captured) | 2709 | 4.8%
(not captured) | 2473 | 4.4%
(not captured) | 2083 | 3.7%
(not captured) | 1357 | 2.4%
(not captured) | 1307 | 2.3%
Other values (398) | 28062 | 49.8%

Uppercase Letter
Value | Count | Frequency (%)
A | 5 | 17.9%
B | 4 | 14.3%
O | 2 | 7.1%
K | 2 | 7.1%
N | 2 | 7.1%
T | 2 | 7.1%
L | 2 | 7.1%
M | 2 | 7.1%
G | 1 | 3.6%
V | 1 | 3.6%
Other values (5) | 5 | 17.9%

Decimal Number
Value | Count | Frequency (%)
1 | 3611 | 23.6%
2 | 2114 | 13.8%
3 | 1700 | 11.1%
5 | 1442 | 9.4%
4 | 1363 | 8.9%
7 | 1227 | 8.0%
6 | 1175 | 7.7%
0 | 946 | 6.2%
9 | 868 | 5.7%
8 | 848 | 5.5%

Other Punctuation
Value | Count | Frequency (%)
, | 1382 | 94.1%
. | 74 | 5.0%
· | 10 | 0.7%
* | 1 | 0.1%
/ | 1 | 0.1%

Close Punctuation
Value | Count | Frequency (%)
) | 2105 | > 99.9%
] | 1 | < 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 2105 | > 99.9%
[ | 1 | < 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 16110 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1342 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 208 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
v | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 56357 | 59.3%
Common | 38634 | 40.7%
Latin | 29 | < 0.1%
Han | 1 | < 0.1%

Most frequent character per script

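Hangul
Value | Count | Frequency (%)
(not captured) | 4412 | 7.8%
(not captured) | 3914 | 6.9%
(not captured) | 3715 | 6.6%
(not captured) | 3598 | 6.4%
(not captured) | 2728 | 4.8%
(not captured) | 2709 | 4.8%
(not captured) | 2473 | 4.4%
(not captured) | 2083 | 3.7%
(not captured) | 1357 | 2.4%
(not captured) | 1307 | 2.3%
Other values (397) | 28061 | 49.8%

Common
Value | Count | Frequency (%)
(space) | 16110 | 41.7%
1 | 3611 | 9.3%
2 | 2114 | 5.5%
) | 2105 | 5.4%
( | 2105 | 5.4%
3 | 1700 | 4.4%
5 | 1442 | 3.7%
, | 1382 | 3.6%
4 | 1363 | 3.5%
- | 1342 | 3.5%
Other values (12) | 5360 | 13.9%

Latin
Value | Count | Frequency (%)
A | 5 | 17.2%
B | 4 | 13.8%
O | 2 | 6.9%
K | 2 | 6.9%
N | 2 | 6.9%
T | 2 | 6.9%
L | 2 | 6.9%
M | 2 | 6.9%
G | 1 | 3.4%
V | 1 | 3.4%
Other values (6) | 6 | 20.7%

Han
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%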

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 56357 | 59.3%
ASCII | 38653 | 40.7%
None | 10 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

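ASCII
Value | Count | Frequency (%)
(space) | 16110 | 41.7%
1 | 3611 | 9.3%
2 | 2114 | 5.5%
) | 2105 | 5.4%
( | 2105 | 5.4%
3 | 1700 | 4.4%
5 | 1442 | 3.7%
, | 1382 | 3.6%
4 | 1363 | 3.5%
- | 1342 | 3.5%
Other values (27) | 5379 | 13.9%

Hangul
Value | Count | Frequency (%)
(not captured) | 4412 | 7.8%
(not captured) | 3914 | 6.9%
(not captured) | 3715 | 6.6%
(not captured) | 3598 | 6.4%
(not captured) | 2728 | 4.8%
(not captured) | 2709 | 4.8%
(not captured) | 2473 | 4.4%
(not captured) | 2083 | 3.7%
(not captured) | 1357 | 2.4%
(not captured) | 1307 | 2.3%
Other values (397) | 28061 | 49.8%

None
Value | Count | Frequency (%)
· | 10 | 100.0%

CJK
Value | Count | Frequency (%)
(not captured) | 1 | 100.0%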

인허가일자
DateTime

Distinct: 2690
Distinct (%): 74.5%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB
Minimum: 1958-05-19 00:00:00
Maximum: 2023-05-31 00:00:00
Histogram with fixed size bins (bins=50)
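
To give a sense of scale for the 50 bins, the sketch below computes the span between the reported minimum and maximum dates and the width of one bin. It assumes equal-width bins covering exactly the minimum-to-maximum range, which the report does not state explicitly.

```python
from datetime import datetime

# Minimum and maximum license dates reported for this variable.
minimum = datetime(1958, 5, 19)
maximum = datetime(2023, 5, 31)
bins = 50

span = maximum - minimum   # total range as a timedelta
bin_width = span / bins    # width of one equal-size bin

print(span.days, "days in total;", bin_width.days, "whole days per bin")
```

Each bar therefore covers a little over a year and a quarter of license dates.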

영업상태
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업/정상
2nd row: 영업/정상
3rd row: 영업/정상
4th row: 영업/정상
5th row: 영업/정상

Common Values

Value | Count | Frequency (%)
영업/정상 | 3613 | 100.0%

Length

Histogram of lengths of the category


상세영업상태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 3
Median length: 2
Mean length: 2.171049
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업
2nd row: 영업중
3rd row: 영업중
4th row: 영업
5th row: 영업

Common Values

Value | Count | Frequency (%)
영업 | 2980 | 82.5%
영업중 | 618 | 17.1%
정상 | 15 | 0.4%

Length

Histogram of lengths of the category


업종구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 28.4 KiB

Length

Max length: 10
Median length: 3
Mean length: 3.5120399
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 숙박업
2nd row: 자동차야영장업
3rd row: 일반야영장업
4th row: 숙박업
5th row: 숙박업

Common Values

Value | Count | Frequency (%)
숙박업 | 2980 | 82.5%
일반야영장업 | 193 | 5.3%
관광숙박업 | 143 | 4.0%
자동차야영장업 | 123 | 3.4%
관광펜션업 | 102 | 2.8%
한옥체험업 | 31 | 0.9%
외국인관광도시민박업 | 26 | 0.7%
농어촌민박업 | 15 | 0.4%

Length

Histogram of lengths of the category


다중이용업소여부
Boolean

HIGH CORRELATION  IMBALANCE  MISSING 

Distinct: 2
Distinct (%): 0.1%
Missing: 688
Missing (%): 19.0%
Memory size: 7.2 KiB
Value | Count | Frequency (%)
False | 2910 | 80.5%
True | 15 | 0.4%
(Missing) | 688 | 19.0%

Correlations

Variable | 상세영업상태 | 업종구분 | 다중이용업소여부
상세영업상태 | 1.000 | 1.000 | NaN
업종구분 | 1.000 | 1.000 | NaN
다중이용업소여부 | NaN | NaN | 1.000

Variable | 업종구분 | 상세영업상태 | 다중이용업소여부
업종구분 | 1.000 | 0.999 | 1.000
상세영업상태 | 0.999 | 1.000 | 1.000
다중이용업소여부 | 1.000 | 1.000 | 1.000

Variable | 상세영업상태 | 업종구분 | 다중이용업소여부
상세영업상태 | 1.000 | 0.999 | 1.000
업종구분 | 0.999 | 1.000 | 1.000
다중이용업소여부 | 1.000 | 1.000 | 1.000
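
One possible reading of the 0.999 entries is a bias-corrected Cramér's V between 업종구분 and 상세영업상태. The sketch below is illustrative only. The contingency table is an assumption inferred from matching marginal counts in this report (2980 숙박업 against 2980 영업, six categories totalling 618 against 영업중, 15 농어촌민박업 against 15 정상), not data read from the source file, and `cramers_v_corrected` is a hypothetical helper applying the Bergsma-Wicher correction.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(r_corr - 1, k_corr - 1))

# Columns: 영업, 영업중, 정상. Assumed one-to-one mapping (see the paragraph above).
table = [
    [2980, 0, 0],  # 숙박업
    [0, 193, 0],   # 일반야영장업
    [0, 143, 0],   # 관광숙박업
    [0, 123, 0],   # 자동차야영장업
    [0, 102, 0],   # 관광펜션업
    [0, 31, 0],    # 한옥체험업
    [0, 26, 0],    # 외국인관광도시민박업
    [0, 0, 15],    # 농어촌민박업
]
v = cramers_v_corrected(table)
print(round(v, 3))
```

Under this assumed table the association is perfect, and only the small-sample correction keeps the coefficient just below 1, which would match the reported 0.999.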

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Index | 사업장명 | 지번주소 | 도로명주소 | 인허가일자 | 영업상태 | 상세영업상태 | 업종구분 | 다중이용업소여부
0 | 한일장여관 | 경상남도 거창군 거창읍 대동리 698-4 | 경상남도 거창군 거창읍 거열로 188-1 | 1985-09-02 | 영업/정상 | 영업 | 숙박업 | N
1 | 황산문화체육공원 국민여가캠핑장 | 경상남도 양산시 물금읍 물금리 225-1 | <NA> | 2016-01-07 | 영업/정상 | 영업중 | 자동차야영장업 | <NA>
2 | 대운산 자연휴양림 야영장 | 경상남도 양산시 용당동 산 66번지 | 경상남도 양산시 탑골길 208-124 (용당동, 대운산 자연휴양림) | 2015-06-23 | 영업/정상 | 영업중 | 일반야영장업 | <NA>
3 | 호텔시카고 | 경상남도 창원시 의창구 명서동 204-2번지 2,3,4,5,6층 | 경상남도 창원시 의창구 창이대로99번길 7, 2.3.4.5.6층 (명서동) | 1996-10-01 | 영업/정상 | 영업 | 숙박업 | N
4 | 바드리산장모텔 | 경상남도 밀양시 단장면 구천리 685-3번지 | 경상남도 밀양시 단장면 바드리길 52 | 2002-07-09 | 영업/정상 | 영업 | 숙박업 | N
5 | 가야성모텔 | 경상남도 밀양시 초동면 검암리 232-1번지 | 경상남도 밀양시 초동면 검암3길 2 | 1991-12-12 | 영업/정상 | 영업 | 숙박업 | N
6 | 자자호텔 | 경상남도 고성군 영오면 오서리 1231-1번지 | 경상남도 고성군 영오면 오서1길 95 | 2017-03-08 | 영업/정상 | 영업 | 숙박업 | N
7 | 안가(安家) | 경상남도 거제시 고현동 860번지 거제롯데인벤스家 | 경상남도 거제시 서문로 30, 107동 402호 (고현동, 거제롯데인벤스家) | 2020-03-31 | 영업/정상 | 영업중 | 외국인관광도시민박업 | <NA>
8 | 합천관광모텔 | 경상남도 합천군 율곡면 임북리 744-3번지 | 경상남도 합천군 율곡면 임북길 16-7 | 2018-11-26 | 영업/정상 | 영업 | 숙박업 | N
9 | 탑모텔 | 경상남도 합천군 야로면 매촌리 197-1번지 | 경상남도 합천군 야로면 가야산로 572-5 | 2015-06-22 | 영업/정상 | 영업 | 숙박업 | N

Index | 사업장명 | 지번주소 | 도로명주소 | 인허가일자 | 영업상태 | 상세영업상태 | 업종구분 | 다중이용업소여부
3603 | 한산도 충무공교실 야영장 | 경상남도 통영시 한산면 창좌리 726-1 | 경상남도 통영시 한산면 한산일주로 622 | 2022-02-11 | 영업/정상 | 영업중 | 자동차야영장업 | <NA>
3604 | 남계 한옥체험동 | 경상남도 함양군 수동면 원평리 666 | 경상남도 함양군 수동면 남계서원길 13 | 2022-01-13 | 영업/정상 | 영업중 | 한옥체험업 | <NA>
3605 | 남계 한옥 게스트 하우스 | 경상남도 함양군 수동면 원평리 666 | 경상남도 함양군 수동면 남계서원길 13 | 2022-01-13 | 영업/정상 | 영업중 | 한옥체험업 | <NA>
3606 | The Cruise Guest House | 경상남도 거제시 장평동 5 삼성중공업사원아파트,삼성빌리지 | 경상남도 거제시 장평1로 86, B동 1503호 (장평동, 삼성중공업사원아파트,삼성빌리지) | 2022-02-23 | 영업/정상 | 영업중 | 외국인관광도시민박업 | <NA>
3607 | 우리집 | 경상남도 거제시 일운면 구조라리 371 | 경상남도 거제시 일운면 구조라로 42-1, 나동 2층 | 2022-07-05 | 영업/정상 | 영업 | 숙박업 | N
3608 | 씨앤리조트 신관 | 경상남도 남해군 남면 석교리 390 | 경상남도 남해군 남면 남면로 219-26 | 2022-06-28 | 영업/정상 | 영업 | 숙박업 | N
3609 | 반가운캠핑장 | 경상남도 거제시 동부면 구천리 647-8 외 6필지 | <NA> | 2022-03-11 | 영업/정상 | 영업중 | 일반야영장업 | <NA>
3610 | 옥천1382 | 경상남도 창녕군 창녕읍 옥천리 1382 | 경상남도 창녕군 창녕읍 전평길 15-807 | 2022-03-28 | 영업/정상 | 영업중 | 자동차야영장업 | <NA>
3611 | 스테이(STAY)3411 | 경상남도 고성군 회화면 배둔리 606-24 | 경상남도 고성군 회화면 당항길 34-11 | 2022-03-30 | 영업/정상 | 영업 | 숙박업 | N
3612 | 까사드발리(CASA DE BALI) | 경상남도 남해군 미조면 송정리 1030 | 경상남도 남해군 미조면 미송로 341-12 | 2022-03-30 | 영업/정상 | 영업 | 숙박업 | N

Duplicate rows

Most frequently occurring

Index | 사업장명 | 지번주소 | 도로명주소 | 인허가일자 | 영업상태 | 상세영업상태 | 업종구분 | 다중이용업소여부 | # duplicates
3 | 에비앙 | 경상남도 밀양시 부북면 용지리 177 | 경상남도 밀양시 부북면 춘화로 102, 에비앙 | 2013-10-04 | 영업/정상 | 영업 | 숙박업 | N | 3
0 | 바이더씨 | 경상남도 통영시 용남면 장문리 산 260-1 바이더씨 | 경상남도 통영시 용남면 기호바깥길 7-53, 바이더씨 | 2019-07-26 | 영업/정상 | 영업 | 숙박업 | N | 2
1 | 수류화개 | 경상남도 하동군 화개면 탑리 539-5 | 경상남도 하동군 화개면 쌍계로 70-17 | 2013-04-30 | 영업/정상 | 영업중 | 한옥체험업 | <NA> | 2
2 | 야놀자호텔 | 경상남도 통영시 항남동 239-66 | 경상남도 통영시 동충2길 9, 2층,7층 (항남동) | 2019-11-06 | 영업/정상 | 영업 | 숙박업 | N | 2
4 | 킹스빌딩 | 경상남도 양산시 북부동 169-20 킹스빌딩 | 경상남도 양산시 북안남7길 4-10, 1층 일부, 3,4,5층 (북부동) | 2021-01-25 | 영업/정상 | 영업 | 숙박업 | N | 2
5 | 통영게스트하우스 가고파 | 경상남도 통영시 도천동 278-19 | 경상남도 통영시 중앙로 47-2 (도천동) | 2016-05-11 | 영업/정상 | 영업중 | 외국인관광도시민박업 | <NA> | 2
6 | 하늘채 | 경상남도 양산시 북부동 689-6 | 경상남도 양산시 북안남8길 8-14 (북부동) | 1997-10-17 | 영업/정상 | 영업 | 숙박업 | N | 2
7 | 하동 유성준·이선유 판소리기념관 한옥체험관 | 경상남도 하동군 악양면 중대리 907-0 | 경상남도 하동군 악양면 하중대길 44-54 | 2016-11-05 | 영업/정상 | 영업중 | 한옥체험업 | <NA> | 2
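
The "# duplicates" column above can be summarised in more than one way. The sketch below separates three counts. It assumes the eight listed rows are all the duplicated records in the dataset, which the matching overview figure of 8 suggests but the report does not state. The variable names are illustrative.

```python
# "# duplicates" values from the table above, one entry per listed row.
group_sizes = [3, 2, 2, 2, 2, 2, 2, 2]
n_rows = 3613  # number of observations from the overview

distinct_duplicated_rows = len(group_sizes)              # distinct records occurring more than once
rows_involved = sum(group_sizes)                         # all rows that belong to a duplicated group
surplus_copies = sum(size - 1 for size in group_sizes)   # rows removed if each group kept one copy

print(distinct_duplicated_rows, rows_involved, surplus_copies)
print(f"{100 * distinct_duplicated_rows / n_rows:.1f}%")
```

The overview's "Duplicate rows: 8 (0.2%)" lines up with the first count, so deduplicating would remove the surplus copies rather than 8 rows.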