Overview

Dataset statistics

Number of variables: 15
Number of observations: 115
Missing cells: 166
Missing cells (%): 9.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.7 KiB
Average record size in memory: 122.1 B
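The missing-cell percentage follows from the other figures in this table. A minimal sketch of the arithmetic, using only the numbers reported above (the variable names are illustrative, not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 15
n_observations = 115
missing_cells = 166

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

The result rounds to the 9.6% shown in the table.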

Variable types

Text: 7
DateTime: 4
Categorical: 3
Numeric: 1

Dataset

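Description: Data on the tourism and travel industry in Yeonje-gu, Busan Metropolitan City, including fields such as business name, location, and telephone number. (As of June 5, 2023)
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: https://www.data.go.kr/data/3040711/fileData.do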

Alerts

영업상태 has constant value "영업중" (Constant)
우편번호 has 4 (3.5%) missing values (Missing)
시설면적(제곱미터) has 75 (65.2%) missing values (Missing)
보험시작일 has 18 (15.7%) missing values (Missing)
보험종료일 has 18 (15.7%) missing values (Missing)
조직(단체)명 has 50 (43.5%) missing values (Missing)
등록번호 has unique values (Unique)
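The percentages in the missing-value alerts are each column's missing count divided by the 115 observations. The sketch below recomputes them from the counts listed in the alerts; the dictionary is transcribed by hand and is not an output of the profiling tool.

```python
n_observations = 115

# Missing counts as listed in the alerts.
missing_by_column = {
    "우편번호": 4,
    "시설면적(제곱미터)": 75,
    "보험시작일": 18,
    "보험종료일": 18,
    "조직(단체)명": 50,
}

# Percentage of missing values per column, rounded as in the report.
missing_pct = {
    name: round(100 * count / n_observations, 1)
    for name, count in missing_by_column.items()
}
for name, pct in missing_pct.items():
    print(f"{name}: {missing_by_column[name]} ({pct}%)")

alert_total = sum(missing_by_column.values())
print("missing cells covered by alerts:", alert_total)
```

The alerts account for 165 of the 166 missing cells in the dataset statistics. The remaining cell appears to be the single missing value (0.9%) in the date summary near the end of the Variables section, which has no alert of its own.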

Reproduction

Analysis started: 2023-12-16 15:50:25.352762
Analysis finished: 2023-12-16 15:50:53.288345
Duration: 27.94 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

등록번호
Text

UNIQUE 

Distinct: 115
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 17
Median length: 17
Mean length: 17
Min length: 17

Characters and Unicode

Total characters: 1955
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 115
Unique (%): 100.0%

Sample

1st row: 26001-2011-000006
2nd row: 26001-2012-000005
3rd row: 26001-2012-000011
4th row: 26001-2013-000005
5th row: 26001-2014-000001
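Every 등록번호 value is 17 characters long and uses only digits and dashes, and the samples share the layout of five digits, four digits, and six digits separated by dashes. A minimal validation sketch under that assumption follows. The pattern is inferred from the samples rather than from an official specification, and `is_valid_registration` is a hypothetical helper name.

```python
import re

# Layout inferred from the sample rows: 5 digits, 4 digits, 6 digits.
REGISTRATION_RE = re.compile(r"[0-9]{5}-[0-9]{4}-[0-9]{6}")

def is_valid_registration(value: str) -> bool:
    """Return True when the whole string matches the inferred layout."""
    return REGISTRATION_RE.fullmatch(value) is not None

samples = [
    "26001-2011-000006",
    "26001-2012-000005",
    "26001-2012-000011",
    "26001-2013-000005",
    "26001-2014-000001",
]

print(all(is_valid_registration(s) for s in samples))
print(is_valid_registration("26001-2011-00006"))  # one digit short
```

The layout is consistent with the character counts in the report: 115 values of 17 characters give 1955 characters, of which 230 are dashes (two per value).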
ValueCountFrequency (%)
26001-2011-000006 1
 
0.9%
26002-2019-000009 1
 
0.9%
26004-2014-000003 1
 
0.9%
26003-2022-000001 1
 
0.9%
26003-2017-000002 1
 
0.9%
26003-2017-000001 1
 
0.9%
26003-2016-000001 1
 
0.9%
26003-2003-000001 1
 
0.9%
26002-2023-000005 1
 
0.9%
26002-2023-000004 1
 
0.9%
Other values (105) 105
91.3%

Most occurring characters

ValueCountFrequency (%)
0 916
46.9%
2 377
19.3%
- 230
 
11.8%
1 148
 
7.6%
6 128
 
6.5%
3 47
 
2.4%
4 41
 
2.1%
9 19
 
1.0%
5 19
 
1.0%
7 17
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1725
88.2%
Dash Punctuation 230
 
11.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 916
53.1%
2 377
21.9%
1 148
 
8.6%
6 128
 
7.4%
3 47
 
2.7%
4 41
 
2.4%
9 19
 
1.1%
5 19
 
1.1%
7 17
 
1.0%
8 13
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 230
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1955
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 916
46.9%
2 377
19.3%
- 230
 
11.8%
1 148
 
7.6%
6 128
 
6.5%
3 47
 
2.4%
4 41
 
2.1%
9 19
 
1.0%
5 19
 
1.0%
7 17
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1955
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 916
46.9%
2 377
19.3%
- 230
 
11.8%
1 148
 
7.6%
6 128
 
6.5%
3 47
 
2.4%
4 41
 
2.1%
9 19
 
1.0%
5 19
 
1.0%
7 17
 
0.9%

등록일자
Date

Distinct: 99
Distinct (%): 86.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB
Minimum: 1992-10-20 00:00:00
Maximum: 2023-12-07 00:00:00
Histogram with fixed size bins (bins=50)

업종
Categorical

Distinct: 6
Distinct (%): 5.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 11
Median length: 7
Mean length: 5.8173913
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 50
43.5%
국내여행업 29
25.2%
종합여행업 21
18.3%
외국인관광 도시민박업 6
 
5.2%
관광숙박업 5
 
4.3%
국제회의기획업 4
 
3.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
국내외여행업 50
41.3%
국내여행업 29
24.0%
종합여행업 21
17.4%
외국인관광 6
 
5.0%
도시민박업 6
 
5.0%
관광숙박업 5
 
4.1%
국제회의기획업 4
 
3.3%

상호
Text

Distinct: 99
Distinct (%): 86.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 16
Median length: 13
Mean length: 7.6434783
Min length: 2

Characters and Unicode

Total characters: 879
Distinct characters: 182
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 83
Unique (%): 72.2%

Sample

1st row: (주)애니투어
2nd row: (주)투어일번지
3rd row: 고려선투어(주)
4th row: 미래테마여행사
5th row: 자비정진회여행사 주식회사
ValueCountFrequency (%)
주식회사 16
 
10.5%
여행사 4
 
2.6%
주)애니투어 2
 
1.3%
부산시티투어 2
 
1.3%
house 2
 
1.3%
주)범주여행사 2
 
1.3%
스타골프 2
 
1.3%
주)미투어 2
 
1.3%
주)투어일번지 2
 
1.3%
주)에이비투어 2
 
1.3%
Other values (106) 117
76.5%

Most occurring characters

ValueCountFrequency (%)
71
 
8.1%
( 52
 
5.9%
) 52
 
5.9%
46
 
5.2%
46
 
5.2%
42
 
4.8%
39
 
4.4%
32
 
3.6%
31
 
3.5%
18
 
2.0%
Other values (172) 450
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 700
79.6%
Open Punctuation 52
 
5.9%
Close Punctuation 52
 
5.9%
Space Separator 39
 
4.4%
Uppercase Letter 26
 
3.0%
Lowercase Letter 8
 
0.9%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
10.1%
46
 
6.6%
46
 
6.6%
42
 
6.0%
32
 
4.6%
31
 
4.4%
18
 
2.6%
18
 
2.6%
16
 
2.3%
16
 
2.3%
Other values (150) 364
52.0%
Uppercase Letter
ValueCountFrequency (%)
J 4
15.4%
S 4
15.4%
U 4
15.4%
E 3
11.5%
O 2
7.7%
H 2
7.7%
B 2
7.7%
N 1
 
3.8%
G 1
 
3.8%
T 1
 
3.8%
Other values (2) 2
7.7%
Lowercase Letter
ValueCountFrequency (%)
n 2
25.0%
s 2
25.0%
i 1
12.5%
e 1
12.5%
c 1
12.5%
r 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Space Separator
ValueCountFrequency (%)
39
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 700
79.6%
Common 145
 
16.5%
Latin 34
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
10.1%
46
 
6.6%
46
 
6.6%
42
 
6.0%
32
 
4.6%
31
 
4.4%
18
 
2.6%
18
 
2.6%
16
 
2.3%
16
 
2.3%
Other values (150) 364
52.0%
Latin
ValueCountFrequency (%)
J 4
11.8%
S 4
11.8%
U 4
11.8%
E 3
 
8.8%
O 2
 
5.9%
n 2
 
5.9%
s 2
 
5.9%
H 2
 
5.9%
B 2
 
5.9%
N 1
 
2.9%
Other values (8) 8
23.5%
Common
ValueCountFrequency (%)
( 52
35.9%
) 52
35.9%
39
26.9%
. 2
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 700
79.6%
ASCII 179
 
20.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
71
 
10.1%
46
 
6.6%
46
 
6.6%
42
 
6.0%
32
 
4.6%
31
 
4.4%
18
 
2.6%
18
 
2.6%
16
 
2.3%
16
 
2.3%
Other values (150) 364
52.0%
ASCII
ValueCountFrequency (%)
( 52
29.1%
) 52
29.1%
39
21.8%
J 4
 
2.2%
S 4
 
2.2%
U 4
 
2.2%
E 3
 
1.7%
O 2
 
1.1%
n 2
 
1.1%
s 2
 
1.1%
Other values (12) 15
 
8.4%

우편번호
Real number (ℝ)

MISSING 

Distinct: 47
Distinct (%): 42.3%
Missing: 4
Missing (%): 3.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 88194.198
Minimum: 47500
Maximum: 611807
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 47500
5-th percentile: 47514
Q1: 47522
Median: 47542
Q3: 47580
95-th percentile: 611082
Maximum: 611807
Range: 564307
Interquartile range (IQR): 58

Descriptive statistics

Standard deviation: 146506.75
Coefficient of variation (CV): 1.6611836
Kurtosis: 9.4255123
Mean: 88194.198
Median Absolute Deviation (MAD): 22
Skewness: 3.3549928
Sum: 9789556
Variance: 2.1464229 × 10^10
Monotonicity: Not monotonic
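Several of the 우편번호 figures can be derived from the others, which makes a quick consistency check possible. The sketch below recomputes the mean, coefficient of variation, IQR, range, and variance from the reported values. It also includes a hypothetical helper, `postal_code_format`, for separating five-digit codes from six-digit ones, on the assumption that the six-digit values (611082 to 611807) are older-format codes stored alongside the current five-digit format. The report itself treats the column as a plain real number.

```python
# Figures copied from the 우편번호 summary.
n_observations = 115
n_missing = 4
total = 9_789_556             # "Sum"
std = 146_506.75              # "Standard deviation"
q1, q3 = 47_522, 47_580
minimum, maximum = 47_500, 611_807

count = n_observations - n_missing   # non-missing values
mean = total / count
cv = std / mean                      # coefficient of variation
iqr = q3 - q1
value_range = maximum - minimum
variance = std ** 2

print(f"mean={mean:.3f} cv={cv:.4f} iqr={iqr} "
      f"range={value_range} variance={variance:.4e}")

def postal_code_format(code: str) -> str:
    """Classify a postal code string by its digit count (hypothetical helper)."""
    digits = code.strip()
    if digits.isdigit() and len(digits) == 5:
        return "5-digit"
    if digits.isdigit() and len(digits) == 6:
        return "6-digit"
    return "other"

print(postal_code_format("47520"), postal_code_format("611800"))
```

Because the median (47542) and IQR (58) sit entirely within the five-digit codes while the mean is near 88194, the mean, standard deviation, and skewness here mostly reflect the mix of code formats. Profiling the column as text would likely be more informative.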
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
47520 13
 
11.3%
47542 6
 
5.2%
47603 6
 
5.2%
47541 5
 
4.3%
47540 5
 
4.3%
47580 4
 
3.5%
47525 4
 
3.5%
47522 4
 
3.5%
47514 4
 
3.5%
47521 3
 
2.6%
Other values (37) 57
49.6%
(Missing) 4
 
3.5%
ValueCountFrequency (%)
47500 2
 
1.7%
47507 1
 
0.9%
47511 2
 
1.7%
47514 4
 
3.5%
47515 1
 
0.9%
47519 1
 
0.9%
47520 13
11.3%
47521 3
 
2.6%
47522 4
 
3.5%
47524 2
 
1.7%
ValueCountFrequency (%)
611807 1
 
0.9%
611800 2
 
1.7%
611724 2
 
1.7%
611082 3
2.6%
47606 2
 
1.7%
47605 3
2.6%
47603 6
5.2%
47599 1
 
0.9%
47598 1
 
0.9%
47597 2
 
1.7%

소재지(지번)
Text

Distinct: 86
Distinct (%): 74.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 47
Median length: 37
Mean length: 24.373913
Min length: 19

Characters and Unicode

Total characters: 2803
Distinct characters: 110
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 65
Unique (%): 56.5%

Sample

1st row: 부산광역시 연제구 거제동 2-27
2nd row: 부산광역시 연제구 연산동 632-1
3rd row: 부산광역시 연제구 연산동 822-7
4th row: 부산광역시 연제구 연산동 775-7
5th row: 부산광역시 연제구 거제동 1029-2 거제동원타워
ValueCountFrequency (%)
부산광역시 115
21.2%
연제구 115
21.2%
연산동 83
15.3%
거제동 36
 
6.6%
587-8 9
 
1.7%
sk 7
 
1.3%
775-7 5
 
0.9%
view(2단지 5
 
0.9%
시청역 3
 
0.6%
588-6 3
 
0.6%
Other values (126) 162
29.8%

Most occurring characters

ValueCountFrequency (%)
530
18.9%
204
 
7.3%
201
 
7.2%
158
 
5.6%
130
 
4.6%
122
 
4.4%
120
 
4.3%
119
 
4.2%
116
 
4.1%
116
 
4.1%
Other values (100) 987
35.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1532
54.7%
Decimal Number 576
 
20.5%
Space Separator 530
 
18.9%
Dash Punctuation 109
 
3.9%
Uppercase Letter 44
 
1.6%
Open Punctuation 6
 
0.2%
Close Punctuation 6
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
204
13.3%
201
13.1%
158
10.3%
130
8.5%
122
8.0%
120
7.8%
119
7.8%
116
7.6%
116
7.6%
41
 
2.7%
Other values (80) 205
13.4%
Decimal Number
ValueCountFrequency (%)
1 109
18.9%
2 82
14.2%
7 70
12.2%
3 68
11.8%
8 57
9.9%
4 52
9.0%
5 46
8.0%
0 43
 
7.5%
6 28
 
4.9%
9 21
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
K 8
18.2%
S 8
18.2%
V 7
15.9%
I 7
15.9%
E 7
15.9%
W 7
15.9%
Space Separator
ValueCountFrequency (%)
530
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 109
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1532
54.7%
Common 1227
43.8%
Latin 44
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
204
13.3%
201
13.1%
158
10.3%
130
8.5%
122
8.0%
120
7.8%
119
7.8%
116
7.6%
116
7.6%
41
 
2.7%
Other values (80) 205
13.4%
Common
ValueCountFrequency (%)
530
43.2%
1 109
 
8.9%
- 109
 
8.9%
2 82
 
6.7%
7 70
 
5.7%
3 68
 
5.5%
8 57
 
4.6%
4 52
 
4.2%
5 46
 
3.7%
0 43
 
3.5%
Other values (4) 61
 
5.0%
Latin
ValueCountFrequency (%)
K 8
18.2%
S 8
18.2%
V 7
15.9%
I 7
15.9%
E 7
15.9%
W 7
15.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1532
54.7%
ASCII 1271
45.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
530
41.7%
1 109
 
8.6%
- 109
 
8.6%
2 82
 
6.5%
7 70
 
5.5%
3 68
 
5.4%
8 57
 
4.5%
4 52
 
4.1%
5 46
 
3.6%
0 43
 
3.4%
Other values (10) 105
 
8.3%
Hangul
ValueCountFrequency (%)
204
13.3%
201
13.1%
158
10.3%
130
8.5%
122
8.0%
120
7.8%
119
7.8%
116
7.6%
116
7.6%
41
 
2.7%
Other values (80) 205
13.4%

소재지(도로명)
Text

Distinct: 94
Distinct (%): 81.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 69
Median length: 47
Mean length: 35.469565
Min length: 21

Characters and Unicode

Total characters: 4079
Distinct characters: 151
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 77
Unique (%): 67.0%

Sample

1st row: 부산광역시 연제구 거제대로286번길 10 (거제동)
2nd row: 부산광역시 연제구 쌍미천로 149-1, 1층 (연산동)
3rd row: 부산광역시 연제구 연수로 89 (연산동)
4th row: 부산광역시 연제구 월드컵대로 25-2, 2층 (연산동)
5th row: 부산광역시 연제구 월드컵대로243번길 19, 102동 1605호 (거제동, 거제동원타워)
ValueCountFrequency (%)
부산광역시 115
 
14.7%
연제구 115
 
14.7%
연산동 83
 
10.6%
거제동 35
 
4.5%
2층 22
 
2.8%
월드컵대로 20
 
2.6%
중앙대로 20
 
2.6%
3층 12
 
1.5%
sk 9
 
1.2%
1130 9
 
1.2%
Other values (208) 342
43.7%

Most occurring characters

ValueCountFrequency (%)
667
 
16.4%
217
 
5.3%
206
 
5.1%
1 180
 
4.4%
174
 
4.3%
154
 
3.8%
, 126
 
3.1%
126
 
3.1%
) 124
 
3.0%
( 124
 
3.0%
Other values (141) 1981
48.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2275
55.8%
Decimal Number 680
 
16.7%
Space Separator 667
 
16.4%
Other Punctuation 127
 
3.1%
Close Punctuation 125
 
3.1%
Open Punctuation 125
 
3.1%
Uppercase Letter 62
 
1.5%
Dash Punctuation 18
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
217
 
9.5%
206
 
9.1%
174
 
7.6%
154
 
6.8%
126
 
5.5%
119
 
5.2%
116
 
5.1%
116
 
5.1%
116
 
5.1%
115
 
5.1%
Other values (113) 816
35.9%
Decimal Number
ValueCountFrequency (%)
1 180
26.5%
2 109
16.0%
0 97
14.3%
3 82
12.1%
4 52
 
7.6%
5 40
 
5.9%
7 38
 
5.6%
8 36
 
5.3%
6 27
 
4.0%
9 19
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
K 10
16.1%
S 10
16.1%
E 9
14.5%
I 9
14.5%
V 9
14.5%
W 9
14.5%
J 2
 
3.2%
H 2
 
3.2%
C 1
 
1.6%
B 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 126
99.2%
* 1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 124
99.2%
] 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 124
99.2%
[ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
667
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2275
55.8%
Common 1742
42.7%
Latin 62
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
217
 
9.5%
206
 
9.1%
174
 
7.6%
154
 
6.8%
126
 
5.5%
119
 
5.2%
116
 
5.1%
116
 
5.1%
116
 
5.1%
115
 
5.1%
Other values (113) 816
35.9%
Common
ValueCountFrequency (%)
667
38.3%
1 180
 
10.3%
, 126
 
7.2%
) 124
 
7.1%
( 124
 
7.1%
2 109
 
6.3%
0 97
 
5.6%
3 82
 
4.7%
4 52
 
3.0%
5 40
 
2.3%
Other values (8) 141
 
8.1%
Latin
ValueCountFrequency (%)
K 10
16.1%
S 10
16.1%
E 9
14.5%
I 9
14.5%
V 9
14.5%
W 9
14.5%
J 2
 
3.2%
H 2
 
3.2%
C 1
 
1.6%
B 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2275
55.8%
ASCII 1804
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
667
37.0%
1 180
 
10.0%
, 126
 
7.0%
) 124
 
6.9%
( 124
 
6.9%
2 109
 
6.0%
0 97
 
5.4%
3 82
 
4.5%
4 52
 
2.9%
5 40
 
2.2%
Other values (18) 203
 
11.3%
Hangul
ValueCountFrequency (%)
217
 
9.5%
206
 
9.1%
174
 
7.6%
154
 
6.8%
126
 
5.5%
119
 
5.2%
116
 
5.1%
116
 
5.1%
116
 
5.1%
115
 
5.1%
Other values (113) 816
35.9%

시설면적(제곱미터)
Text

Distinct: 35
Distinct (%): 87.5%
Missing: 75
Missing (%): 65.2%
Memory size: 1.0 KiB

Length

Max length: 9
Median length: 8
Mean length: 4.675
Min length: 2

Characters and Unicode

Total characters: 187
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 75.0%

Sample

1st row: 40.84
2nd row: 15.6
3rd row: 60
4th row: 128.4
5th row: 25
ValueCountFrequency (%)
49.91 2
 
5.0%
38.61 2
 
5.0%
60 2
 
5.0%
40.84 2
 
5.0%
108.06 2
 
5.0%
48 1
 
2.5%
3,955.82 1
 
2.5%
7,235.96 1
 
2.5%
4,997.12 1
 
2.5%
9,153.94 1
 
2.5%
Other values (25) 25
62.5%
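This variable holds areas in square metres but is profiled as text. A likely reason is that some values use a comma as a thousands separator (for example 3,955.82), which blocks direct numeric parsing. A minimal sketch of a conversion step follows, assuming the comma is only ever a thousands separator. `parse_area` is a hypothetical helper, and `None` stands in for a missing value.

```python
from typing import Optional

def parse_area(text: Optional[str]) -> Optional[float]:
    """Convert an area string such as '3,955.82' to a float.

    Assumes commas are thousands separators; missing values pass through.
    """
    if text is None:
        return None
    return float(text.replace(",", ""))

# Values taken from the sample rows and the value table.
raw_values = ["40.84", "15.6", "60", "128.4", "3,955.82", None]
areas = [parse_area(v) for v in raw_values]
print(areas)
```

After conversion the column could be profiled as a numeric variable, which would give quantiles and a histogram instead of character statistics.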

Most occurring characters

ValueCountFrequency (%)
. 31
16.6%
9 23
12.3%
6 21
11.2%
1 20
10.7%
3 19
10.2%
4 16
8.6%
8 15
8.0%
0 12
 
6.4%
2 11
 
5.9%
5 10
 
5.3%
Other values (2) 9
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 151
80.7%
Other Punctuation 36
 
19.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 23
15.2%
6 21
13.9%
1 20
13.2%
3 19
12.6%
4 16
10.6%
8 15
9.9%
0 12
7.9%
2 11
7.3%
5 10
6.6%
7 4
 
2.6%
Other Punctuation
ValueCountFrequency (%)
. 31
86.1%
, 5
 
13.9%

Most occurring scripts

ValueCountFrequency (%)
Common 187
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 31
16.6%
9 23
12.3%
6 21
11.2%
1 20
10.7%
3 19
10.2%
4 16
8.6%
8 15
8.0%
0 12
 
6.4%
2 11
 
5.9%
5 10
 
5.3%
Other values (2) 9
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 187
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 31
16.6%
9 23
12.3%
6 21
11.2%
1 20
10.7%
3 19
10.2%
4 16
8.6%
8 15
8.0%
0 12
 
6.4%
2 11
 
5.9%
5 10
 
5.3%
Other values (2) 9
 
4.8%

영업상태
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업중
2nd row: 영업중
3rd row: 영업중
4th row: 영업중
5th row: 영업중

Common Values

ValueCountFrequency (%)
영업중 115
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
영업중 115
100.0%

보험시작일
Date

MISSING 

Distinct: 78
Distinct (%): 80.4%
Missing: 18
Missing (%): 15.7%
Memory size: 1.0 KiB
Minimum: 2011-05-16 00:00:00
Maximum: 2023-12-14 00:00:00
Histogram with fixed size bins (bins=50)

보험종료일
Date

MISSING 

Distinct: 79
Distinct (%): 81.4%
Missing: 18
Missing (%): 15.7%
Memory size: 1.0 KiB
Minimum: 2012-05-16 00:00:00
Maximum: 2024-12-14 00:00:00
Histogram with fixed size bins (bins=50)

보험기관
Categorical

Distinct: 11
Distinct (%): 9.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 15
Median length: 11
Mean length: 8.2434783
Min length: 4

Unique

Unique: 3
Unique (%): 2.6%

Sample

1st row: 부산광역시관광협회
2nd row: 서울보증보험(주)
3rd row: 한국관광협회중앙회
4th row: 한국관광협회중앙회
5th row: 서울보증보험

Common Values

ValueCountFrequency (%)
서울보증보험 27
23.5%
서울보증보험주식회사 21
18.3%
서울보증보험(주) 18
15.7%
<NA> 18
15.7%
한국관광협회중앙회 13
11.3%
한국관광협회중앙회 여행공제회 6
 
5.2%
한국관광협회중앙회 관광공제회 5
 
4.3%
부산광역시관광협회 4
 
3.5%
한국관광협회 1
 
0.9%
서울보증보험 주식회사 1
 
0.9%
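Several of the values above differ only in how the corporate suffix is written (주식회사, (주), or no suffix). The sketch below shows one way to collapse such variants before counting. It assumes these spellings refer to the same insurer, which the report does not confirm, and `normalize_insurer` is a hypothetical helper. The literal `<NA>` category is left out because it is a placeholder, not a name.

```python
import re
from collections import Counter

# Corporate suffix spellings seen in the Common Values table.
SUFFIX_RE = re.compile(r"\(주\)|주식회사")

def normalize_insurer(name: str) -> str:
    """Drop corporate suffixes and surrounding whitespace."""
    return SUFFIX_RE.sub("", name).strip()

# Counts copied from the Common Values table (excluding "<NA>").
counts = {
    "서울보증보험": 27,
    "서울보증보험주식회사": 21,
    "서울보증보험(주)": 18,
    "한국관광협회중앙회": 13,
    "한국관광협회중앙회 여행공제회": 6,
    "한국관광협회중앙회 관광공제회": 5,
    "부산광역시관광협회": 4,
    "한국관광협회": 1,
    "서울보증보험 주식회사": 1,
}

merged = Counter()
for name, n in counts.items():
    merged[normalize_insurer(name)] += n

for name, n in merged.most_common():
    print(name, n)
```

Under this assumption the four 서울보증보험 spellings combine into a single category of 67 rows. Whether the 한국관광협회중앙회 variants should also be merged is a domain decision the sketch leaves open.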

Length

Histogram of lengths of the category
ValueCountFrequency (%)
서울보증보험 28
22.0%
한국관광협회중앙회 24
18.9%
서울보증보험주식회사 21
16.5%
서울보증보험(주 18
14.2%
na 18
14.2%
여행공제회 6
 
4.7%
관광공제회 5
 
3.9%
부산광역시관광협회 4
 
3.1%
한국관광협회 1
 
0.8%
주식회사 1
 
0.8%

대표자성명
Text

Distinct: 99
Distinct (%): 86.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9913043
Min length: 2

Characters and Unicode

Total characters: 344
Distinct characters: 110
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 83
Unique (%): 72.2%

Sample

1st row: 김거중
2nd row: 윤진국
3rd row: 김영곤
4th row: 김영순
5th row: 최성이
ValueCountFrequency (%)
김거중 2
 
1.7%
임채현 2
 
1.7%
이태진 2
 
1.7%
이상군 2
 
1.7%
정순부 2
 
1.7%
윤진국 2
 
1.7%
이준경 2
 
1.7%
김원영 2
 
1.7%
박세종 2
 
1.7%
김강석 2
 
1.7%
Other values (89) 95
82.6%

Most occurring characters

ValueCountFrequency (%)
24
 
7.0%
20
 
5.8%
19
 
5.5%
15
 
4.4%
13
 
3.8%
10
 
2.9%
9
 
2.6%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (100) 213
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 344
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
7.0%
20
 
5.8%
19
 
5.5%
15
 
4.4%
13
 
3.8%
10
 
2.9%
9
 
2.6%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (100) 213
61.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 344
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
7.0%
20
 
5.8%
19
 
5.5%
15
 
4.4%
13
 
3.8%
10
 
2.9%
9
 
2.6%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (100) 213
61.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 344
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
7.0%
20
 
5.8%
19
 
5.5%
15
 
4.4%
13
 
3.8%
10
 
2.9%
9
 
2.6%
7
 
2.0%
7
 
2.0%
7
 
2.0%
Other values (100) 213
61.9%

조직(단체)명
Text

MISSING 

Distinct: 55
Distinct (%): 84.6%
Missing: 50
Missing (%): 43.5%
Memory size: 1.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 9.0923077
Min length: 4

Characters and Unicode

Total characters: 591
Distinct characters: 116
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 45
Unique (%): 69.2%

Sample

1st row: (주)애니투어
2nd row: (주)투어일번지
3rd row: (주)고려선투어
4th row: (주)자비정진회 여행사
5th row: 에이스골프(주)
ValueCountFrequency (%)
주식회사 34
33.3%
주)투어일번지 2
 
2.0%
미투어 2
 
2.0%
주)애니투어 2
 
2.0%
나비투어 2
 
2.0%
스타골프 2
 
2.0%
부산시티투어협동조합 2
 
2.0%
에이스골프(주 2
 
2.0%
주)에이비투어 2
 
2.0%
라온투어 2
 
2.0%
Other values (49) 50
49.0%

Most occurring characters

ValueCountFrequency (%)
62
 
10.5%
46
 
7.8%
37
 
6.3%
37
 
6.3%
34
 
5.8%
34
 
5.8%
34
 
5.8%
( 26
 
4.4%
) 26
 
4.4%
16
 
2.7%
Other values (106) 239
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 502
84.9%
Space Separator 37
 
6.3%
Open Punctuation 26
 
4.4%
Close Punctuation 26
 
4.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
12.4%
46
 
9.2%
37
 
7.4%
34
 
6.8%
34
 
6.8%
34
 
6.8%
16
 
3.2%
15
 
3.0%
10
 
2.0%
10
 
2.0%
Other values (103) 204
40.6%
Space Separator
ValueCountFrequency (%)
37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 502
84.9%
Common 89
 
15.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
12.4%
46
 
9.2%
37
 
7.4%
34
 
6.8%
34
 
6.8%
34
 
6.8%
16
 
3.2%
15
 
3.0%
10
 
2.0%
10
 
2.0%
Other values (103) 204
40.6%
Common
ValueCountFrequency (%)
37
41.6%
( 26
29.2%
) 26
29.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 502
84.9%
ASCII 89
 
15.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
12.4%
46
 
9.2%
37
 
7.4%
34
 
6.8%
34
 
6.8%
34
 
6.8%
16
 
3.2%
15
 
3.0%
10
 
2.0%
10
 
2.0%
Other values (103) 204
40.6%
ASCII
ValueCountFrequency (%)
37
41.6%
( 26
29.2%
) 26
29.2%

변경일자
Date

Distinct: 92
Distinct (%): 80.7%
Missing: 1
Missing (%): 0.9%
Memory size: 1.0 KiB
Minimum: 2011-07-06 00:00:00
Maximum: 2023-12-11 00:00:00
Histogram with fixed size bins (bins=50)

Interactions

[Figure: interactions plot]

Correlations

 | 등록일자 | 업종 | 상호 | 우편번호 | 소재지(지번) | 소재지(도로명) | 시설면적(제곱미터) | 보험시작일 | 보험종료일 | 보험기관 | 대표자성명 | 조직(단체)명 | 변경일자
등록일자 | 1.000 | 0.871 | 1.000 | 1.000 | 0.997 | 0.999 | 0.999 | 0.998 | 0.998 | 0.998 | 1.000 | 1.000 | 0.999
업종 | 0.871 | 1.000 | 0.903 | 0.000 | 0.900 | 0.922 | 0.872 | 0.000 | 0.000 | 0.208 | 0.903 | 0.881 | 0.923
상호 | 1.000 | 0.903 | 1.000 | 0.868 | 0.999 | 1.000 | 0.999 | 0.998 | 0.998 | 0.999 | 1.000 | 1.000 | 1.000
우편번호 | 1.000 | 0.000 | 0.868 | 1.000 | 0.904 | 0.841 | 0.000 | 1.000 | 1.000 | 0.556 | 0.868 | 1.000 | 1.000
소재지(지번) | 0.997 | 0.900 | 0.999 | 0.904 | 1.000 | 0.999 | 0.990 | 0.989 | 0.991 | 0.907 | 0.999 | 0.996 | 0.998
소재지(도로명) | 0.999 | 0.922 | 1.000 | 0.841 | 0.999 | 1.000 | 0.999 | 0.997 | 0.997 | 0.984 | 1.000 | 1.000 | 0.999
시설면적(제곱미터) | 0.999 | 0.872 | 0.999 | 0.000 | 0.990 | 0.999 | 1.000 | 0.995 | 0.995 | 0.982 | 0.999 | 1.000 | 0.999
보험시작일 | 0.998 | 0.000 | 0.998 | 1.000 | 0.989 | 0.997 | 0.995 | 1.000 | 1.000 | 0.926 | 0.998 | 0.994 | 0.995
보험종료일 | 0.998 | 0.000 | 0.998 | 1.000 | 0.991 | 0.997 | 0.995 | 1.000 | 1.000 | 0.932 | 0.998 | 0.994 | 0.996
보험기관 | 0.998 | 0.208 | 0.999 | 0.556 | 0.907 | 0.984 | 0.982 | 0.926 | 0.932 | 1.000 | 0.999 | 0.998 | 0.948
대표자성명 | 1.000 | 0.903 | 1.000 | 0.868 | 0.999 | 1.000 | 0.999 | 0.998 | 0.998 | 0.999 | 1.000 | 1.000 | 1.000
조직(단체)명 | 1.000 | 0.881 | 1.000 | 1.000 | 0.996 | 1.000 | 1.000 | 0.994 | 0.994 | 0.998 | 1.000 | 1.000 | 1.000
변경일자 | 0.999 | 0.923 | 1.000 | 1.000 | 0.998 | 0.999 | 0.999 | 0.995 | 0.996 | 0.948 | 1.000 | 1.000 | 1.000

 | 업종 | 보험기관
업종 | 1.000 | 0.117
보험기관 | 0.117 | 1.000

 | 우편번호 | 업종 | 보험기관
우편번호 | 1.000 | 0.000 | 0.408
업종 | 0.000 | 1.000 | 0.117
보험기관 | 0.408 | 0.117 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
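Nullity correlation can be illustrated by turning each column into a missing/present indicator and correlating the indicators. The sketch below uses a plain Pearson correlation on 0/1 indicators, which is an assumption about how the heatmap is computed. The three short columns are illustrative values, not rows from the dataset, and `nullity_correlation` is a hypothetical helper.

```python
from math import sqrt
from typing import Optional, Sequence

def nullity_correlation(col_a: Sequence, col_b: Sequence) -> Optional[float]:
    """Pearson correlation between the missing-value indicators of two columns.

    Returns None when either column is fully present or fully missing,
    because the correlation is undefined without variation.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)

# Illustrative columns: start and end dates are missing together,
# while the area column is missing independently of them.
start = ["2011-05-16", None, "2023-01-22", None]
end = ["2012-05-16", None, "2024-01-22", None]
area = ["40.84", None, None, "60"]

print(nullity_correlation(start, end))   # missing in the same rows
print(nullity_correlation(start, area))  # no relationship in this toy data
```

A value near 1 means two columns tend to be missing in the same rows, as 보험시작일 and 보험종료일 are in the sample rows of this report. A value near 0 means missingness in one column says little about the other.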

Sample

First rows

 | 등록번호 | 등록일자 | 업종 | 상호 | 우편번호 | 소재지(지번) | 소재지(도로명) | 시설면적(제곱미터) | 영업상태 | 보험시작일 | 보험종료일 | 보험기관 | 대표자성명 | 조직(단체)명 | 변경일자
0 | 26001-2011-000006 | 2011-04-28 | 국내여행업 | (주)애니투어 | 611800 | 부산광역시 연제구 거제동 2-27 | 부산광역시 연제구 거제대로286번길 10 (거제동) | 40.84 | 영업중 | 2011-05-16 | 2012-05-16 | 부산광역시관광협회 | 김거중 | (주)애니투어 | 2019-01-11
1 | 26001-2012-000005 | 2007-10-17 | 국내여행업 | (주)투어일번지 | 47550 | 부산광역시 연제구 연산동 632-1 | 부산광역시 연제구 쌍미천로 149-1, 1층 (연산동) | <NA> | 영업중 | 2021-10-13 | 2022-10-12 | 서울보증보험(주) | 윤진국 | (주)투어일번지 | 2023-05-11
2 | 26001-2012-000011 | 2010-12-22 | 국내여행업 | 고려선투어(주) | 611082 | 부산광역시 연제구 연산동 822-7 | 부산광역시 연제구 연수로 89 (연산동) | <NA> | 영업중 | 2023-01-22 | 2024-01-22 | 한국관광협회중앙회 | 김영곤 | (주)고려선투어 | 2012-12-07
3 | 26001-2013-000005 | 2013-09-17 | 국내여행업 | 미래테마여행사 | 47603 | 부산광역시 연제구 연산동 775-7 | 부산광역시 연제구 월드컵대로 25-2, 2층 (연산동) | <NA> | 영업중 | 2023-11-10 | 2024-11-10 | 한국관광협회중앙회 | 김영순 | <NA> | 2023-02-01
4 | 26001-2014-000001 | 2013-09-11 | 국내여행업 | 자비정진회여행사 주식회사 | 47525 | 부산광역시 연제구 거제동 1029-2 거제동원타워 | 부산광역시 연제구 월드컵대로243번길 19, 102동 1605호 (거제동, 거제동원타워) | 15.6 | 영업중 | 2023-01-01 | 2023-12-31 | 서울보증보험 | 최성이 | (주)자비정진회 여행사 | 2019-07-29
5 | 26001-2014-000005 | 2014-12-12 | 국내여행업 | 에이스골프(주) | 611724 | 부산광역시 연제구 거제동 1491-1 801호 | 부산광역시 연제구 법원로 20, 801호 (거제동) | <NA> | 영업중 | 2022-08-08 | 2023-08-07 | 서울보증보험(주) | 김강석 | 에이스골프(주) | 2015-07-02
6 | 26001-2014-000007 | 2014-12-18 | 국내여행업 | (주)시원여행 | 47520 | 부산광역시 연제구 연산동 587-8 | 부산광역시 연제구 중앙대로 1130 (연산동, 207호(연산동, 에스케이뷰)) | <NA> | 영업중 | 2023-01-07 | 2024-01-06 | 서울보증보험(주) | 박영진 | 주식회사 시원여행 | 2017-03-23
7 | 26001-2015-000001 | 2013-09-05 | 국내여행업 | 동그라미 여행사 | 611082 | 부산광역시 연제구 연산동 775-7 | 부산광역시 연제구 월드컵대로 25-2, 2층 (연산동) | 60 | 영업중 | 2023-10-27 | 2024-10-27 | 한국관광협회중앙회 | 안성배 | <NA> | 2015-02-27
8 | 26001-2015-000004 | 2010-11-26 | 국내여행업 | (주)범주여행사 | 47606 | 부산광역시 연제구 연산동 859-24 | 부산광역시 연제구 연제로 1 (연산동) | <NA> | 영업중 | 2022-11-10 | 2023-11-09 | 서울보증보험 | 이태진 | <NA> | 2015-05-15
9 | 26001-2015-000009 | 2015-08-12 | 국내여행업 | (주)라온투어 | 47520 | 부산광역시 연제구 연산동 588-6 경보오피스텔 518호 | 부산광역시 연제구 중앙대로1134번길 34, 5층 518호 (연산동, 경보 오피스텔) | <NA> | 영업중 | 2019-04-28 | 2022-04-27 | 서울보증보험 | 이희원 | 주식회사 라온투어 | 2015-08-12

Last rows

 | 등록번호 | 등록일자 | 업종 | 상호 | 우편번호 | 소재지(지번) | 소재지(도로명) | 시설면적(제곱미터) | 영업상태 | 보험시작일 | 보험종료일 | 보험기관 | 대표자성명 | 조직(단체)명 | 변경일자
105 | 26221-2014-000002 | 2014-07-11 | 외국인관광 도시민박업 | JA GUEST HOUSE | <NA> | 부산광역시 연제구 연산동 587-4 | 부산광역시 연제구 중앙대로1124번길 15, 101동 1607호 (연산동) | 119.8 | 영업중 | <NA> | <NA> | <NA> | 임대현 | <NA> | 2020-04-03
106 | 26221-2017-000002 | 2017-07-17 | 외국인관광 도시민박업 | J. HOUSE | 47550 | 부산광역시 연제구 연산동 622-26 | 부산광역시 연제구 쌍미천로151번길 10-1 (연산동) | 49.43 | 영업중 | <NA> | <NA> | <NA> | 육진선 | <NA> | 2017-07-17
107 | 26221-2017-000003 | 2017-10-18 | 외국인관광 도시민박업 | Princess 게스트하우스 | 47569 | 부산광역시 연제구 연산동 471-13 | 부산광역시 연제구 고분로235번길 42 (연산동) | 97.65 | 영업중 | <NA> | <NA> | <NA> | 곽옥희 | <NA> | 2017-10-18
108 | 26221-2023-000001 | 2023-04-11 | 외국인관광 도시민박업 | 호산나 하우스 | 47561 | 부산광역시 연제구 연산동 307-4 모란 | 부산광역시 연제구 신금로17번길 31, 나동 2호 (연산동, 모란) | 62.14 | 영업중 | <NA> | <NA> | <NA> | 황지원 | <NA> | 2023-04-11
109 | 26221-2023-000002 | 2023-06-09 | 외국인관광 도시민박업 | JUN BnB | 47538 | 부산광역시 연제구 거제동 34-32 | 부산광역시 연제구 거제천로 147-3, 2층 (거제동) | 33.39 | 영업중 | <NA> | <NA> | <NA> | 황명준 | <NA> | 2023-06-09
110 | 26221-2023-000003 | 2023-11-30 | 외국인관광 도시민박업 | 스테이 단비 | 47580 | 부산광역시 연제구 연산동 1226-1 | 부산광역시 연제구 월드컵대로 90-5, 2층 (연산동) | 66.21 | 영업중 | <NA> | <NA> | <NA> | 홍주완 | <NA> | 2023-11-30
111 | 26310-2012-000002 | 2012-08-01 | 국제회의기획업 | 동서문화기획 | 47538 | 부산광역시 연제구 거제동 41-3 | 부산광역시 연제구 월드컵대로187번길 75 (거제동) | <NA> | 영업중 | <NA> | <NA> | <NA> | 조명수 | <NA> | 2015-11-12
112 | 26310-2019-000001 | 2019-01-17 | 국제회의기획업 | 케이프로모션 | 47570 | 부산광역시 연제구 연산동 487-31 | 부산광역시 연제구 고분로242번길 57-1, 2층 (연산동) | <NA> | 영업중 | <NA> | <NA> | <NA> | 윤난주 | <NA> | 2019-01-17
113 | 26310-2020-000001 | 2020-08-20 | 국제회의기획업 | (주)플랫폼제이 | 47562 | 부산광역시 연제구 연산동 365-3 | 부산광역시 연제구 과정로276번가길 32, 2층 (연산동) | <NA> | 영업중 | <NA> | <NA> | <NA> | 장지훈 | <NA> | 2020-08-20
114 | 26310-2023-000001 | 2021-07-13 | 국제회의기획업 | 주식회사 라쿤 | 47514 | 부산광역시 연제구 거제동 150-6 한양타워빌딩 | 부산광역시 연제구 명륜로 10, 한양타워빌딩 1104-1107호 (거제동) | <NA> | 영업중 | <NA> | <NA> | <NA> | 홍연택 | 주식회사 라쿤 | 2023-12-11