Overview

Dataset statistics

Number of variables14
Number of observations351
Missing cells1349
Missing cells (%)27.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory41.3 KiB
Average record size in memory120.4 B

Variable types

Text3
Categorical4
Numeric4
Unsupported2
DateTime1

Dataset

Description- 국외를 여행하는 내국인을 대상으로 여행의 일정, 비용산출, 숙박예약, 명소안내 등 여행의 정보를 제공하는 업종의 사업장명, 인허가일자, 영업상태, 소재지주소 등의 정보를 제공합니다. - 원본 파일의 좌표가 중부원점 TM(EPSH:2097) 좌표계를 따르고 있는데 위경도(WGS84) 좌표계로 변환 시 정확하게 변환이 되지 않아 파일에서 좌표 정보는 제외하였습니다. 데이터 제공 사이트로 가시면 원본 좌표를 확인하실 수 있습니다. - 데이터 제공처: LOCALDATA
Author제주데이터허브
URLhttps://www.jejudatahub.net/data/view/data/816

Alerts

업종구분대분류 has constant value ""Constant
인허가일자 is highly overall correlated with 폐업일자High correlation
인허가취소일자 is highly overall correlated with 영업상태명High correlation
폐업일자 is highly overall correlated with 인허가일자 and 2 other fieldsHigh correlation
도로명우편번호 is highly overall correlated with 폐업일자High correlation
영업상태명 is highly overall correlated with 인허가취소일자 and 1 other fieldsHigh correlation
휴업시작일자 is highly imbalanced (97.2%)Imbalance
휴업종료일자 is highly imbalanced (97.2%)Imbalance
인허가취소일자 has 320 (91.2%) missing valuesMissing
폐업일자 has 198 (56.4%) missing valuesMissing
재개업일자 has 351 (100.0%) missing valuesMissing
소재지면적 has 351 (100.0%) missing valuesMissing
도로명주소 has 11 (3.1%) missing valuesMissing
도로명우편번호 has 118 (33.6%) missing valuesMissing
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 19:35:27.205708
Analysis finished2023-12-11 19:35:30.481520
Duration3.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct345
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-12T04:35:30.997924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length16
Mean length8.6467236
Min length3

Characters and Unicode

Total characters3035
Distinct characters328
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique339 ?
Unique (%)96.6%

Sample

1st row보리여행사
2nd row제이제이골프
3rd row제주렌터카서비스협동조합
4th row주식회사 한스글로벌
5th row핑크투어
ValueCountFrequency (%)
주식회사 86
 
18.7%
유한회사 7
 
1.5%
유)그린삼육오 2
 
0.4%
나눔투어 2
 
0.4%
tour 2
 
0.4%
주)리플투어클럽 2
 
0.4%
세계의제주 2
 
0.4%
주)제이케이투어 2
 
0.4%
와제주다여행사 2
 
0.4%
주)초원여행클럽 2
 
0.4%
Other values (351) 351
76.3%
2023-12-12T04:35:31.765269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
315
 
10.4%
201
 
6.6%
) 179
 
5.9%
( 178
 
5.9%
134
 
4.4%
134
 
4.4%
113
 
3.7%
112
 
3.7%
109
 
3.6%
95
 
3.1%
Other values (318) 1465
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2488
82.0%
Close Punctuation 179
 
5.9%
Open Punctuation 178
 
5.9%
Space Separator 109
 
3.6%
Uppercase Letter 44
 
1.4%
Lowercase Letter 28
 
0.9%
Decimal Number 7
 
0.2%
Other Punctuation 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
315
 
12.7%
201
 
8.1%
134
 
5.4%
134
 
5.4%
113
 
4.5%
112
 
4.5%
95
 
3.8%
94
 
3.8%
87
 
3.5%
64
 
2.6%
Other values (274) 1139
45.8%
Uppercase Letter
ValueCountFrequency (%)
O 5
11.4%
A 5
11.4%
U 4
9.1%
N 4
9.1%
T 4
9.1%
E 4
9.1%
R 3
 
6.8%
F 2
 
4.5%
C 2
 
4.5%
D 2
 
4.5%
Other values (9) 9
20.5%
Lowercase Letter
ValueCountFrequency (%)
a 4
14.3%
l 3
10.7%
s 3
10.7%
c 2
 
7.1%
e 2
 
7.1%
r 2
 
7.1%
n 2
 
7.1%
i 2
 
7.1%
t 2
 
7.1%
d 1
 
3.6%
Other values (5) 5
17.9%
Decimal Number
ValueCountFrequency (%)
7 3
42.9%
1 1
 
14.3%
9 1
 
14.3%
5 1
 
14.3%
0 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 179
100.0%
Open Punctuation
ValueCountFrequency (%)
( 178
100.0%
Space Separator
ValueCountFrequency (%)
109
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2488
82.0%
Common 475
 
15.7%
Latin 72
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
315
 
12.7%
201
 
8.1%
134
 
5.4%
134
 
5.4%
113
 
4.5%
112
 
4.5%
95
 
3.8%
94
 
3.8%
87
 
3.5%
64
 
2.6%
Other values (274) 1139
45.8%
Latin
ValueCountFrequency (%)
O 5
 
6.9%
A 5
 
6.9%
U 4
 
5.6%
N 4
 
5.6%
T 4
 
5.6%
a 4
 
5.6%
E 4
 
5.6%
l 3
 
4.2%
s 3
 
4.2%
R 3
 
4.2%
Other values (24) 33
45.8%
Common
ValueCountFrequency (%)
) 179
37.7%
( 178
37.5%
109
22.9%
7 3
 
0.6%
& 1
 
0.2%
- 1
 
0.2%
1 1
 
0.2%
9 1
 
0.2%
5 1
 
0.2%
0 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2488
82.0%
ASCII 547
 
18.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
315
 
12.7%
201
 
8.1%
134
 
5.4%
134
 
5.4%
113
 
4.5%
112
 
4.5%
95
 
3.8%
94
 
3.8%
87
 
3.5%
64
 
2.6%
Other values (274) 1139
45.8%
ASCII
ValueCountFrequency (%)
) 179
32.7%
( 178
32.5%
109
19.9%
O 5
 
0.9%
A 5
 
0.9%
U 4
 
0.7%
N 4
 
0.7%
T 4
 
0.7%
a 4
 
0.7%
E 4
 
0.7%
Other values (34) 51
 
9.3%

업종구분대분류
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
국내외여행업
351 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내외여행업
2nd row국내외여행업
3rd row국내외여행업
4th row국내외여행업
5th row국내외여행업

Common Values

ValueCountFrequency (%)
국내외여행업 351
100.0%

Length

2023-12-12T04:35:31.992762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T04:35:32.157906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 351
100.0%

인허가일자
Real number (ℝ)

HIGH CORRELATION 

Distinct330
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20116179
Minimum19881216
Maximum20230905
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T04:35:32.342554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19881216
5-th percentile19990960
Q120061054
median20120910
Q320170865
95-th percentile20230116
Maximum20230905
Range349689
Interquartile range (IQR)109811.5

Descriptive statistics

Standard deviation74467.944
Coefficient of variation (CV)0.0037018932
Kurtosis-0.18466468
Mean20116179
Median Absolute Deviation (MAD)50487
Skewness-0.45689614
Sum7.0607787 × 109
Variance5.5454747 × 109
MonotonicityNot monotonic
2023-12-12T04:35:32.570088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20021010 4
 
1.1%
20221115 3
 
0.9%
20081023 2
 
0.6%
20140305 2
 
0.6%
20230102 2
 
0.6%
20180806 2
 
0.6%
20230711 2
 
0.6%
20120426 2
 
0.6%
20120730 2
 
0.6%
20090428 2
 
0.6%
Other values (320) 328
93.4%
ValueCountFrequency (%)
19881216 1
0.3%
19890415 1
0.3%
19910510 1
0.3%
19910710 1
0.3%
19911002 1
0.3%
19931222 1
0.3%
19940509 1
0.3%
19950808 1
0.3%
19960208 1
0.3%
19960314 1
0.3%
ValueCountFrequency (%)
20230905 1
0.3%
20230829 1
0.3%
20230801 1
0.3%
20230711 2
0.6%
20230623 1
0.3%
20230523 1
0.3%
20230515 1
0.3%
20230413 1
0.3%
20230406 1
0.3%
20230329 1
0.3%

인허가취소일자
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct29
Distinct (%)93.5%
Missing320
Missing (%)91.2%
Infinite0
Infinite (%)0.0%
Mean20131361
Minimum20070111
Maximum20220217
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T04:35:32.781437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20070111
5-th percentile20080321
Q120100703
median20120809
Q320165824
95-th percentile20205605
Maximum20220217
Range150106
Interquartile range (IQR)65120.5

Descriptive statistics

Standard deviation43324.309
Coefficient of variation (CV)0.0021520805
Kurtosis-0.7256381
Mean20131361
Median Absolute Deviation (MAD)29585
Skewness0.64868693
Sum6.2407219 × 108
Variance1.8769957 × 109
MonotonicityNot monotonic
2023-12-12T04:35:32.983585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
20171130 2
 
0.6%
20101109 2
 
0.6%
20120117 1
 
0.3%
20090226 1
 
0.3%
20160517 1
 
0.3%
20130520 1
 
0.3%
20191101 1
 
0.3%
20191128 1
 
0.3%
20110415 1
 
0.3%
20100604 1
 
0.3%
Other values (19) 19
 
5.4%
(Missing) 320
91.2%
ValueCountFrequency (%)
20070111 1
0.3%
20080218 1
0.3%
20080424 1
0.3%
20080824 1
0.3%
20090226 1
0.3%
20090325 1
0.3%
20091224 1
0.3%
20100604 1
0.3%
20100802 1
0.3%
20101109 2
0.6%
ValueCountFrequency (%)
20220217 1
0.3%
20210803 1
0.3%
20200407 1
0.3%
20200305 1
0.3%
20191128 1
0.3%
20191101 1
0.3%
20171130 2
0.6%
20160517 1
0.3%
20141211 1
0.3%
20131025 1
0.3%

영업상태명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
영업/정상
166 
폐업
153 
취소/말소/만료/정지/중지
31 
휴업
 
1

Length

Max length14
Median length5
Mean length4.4786325
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row휴업
2nd row폐업
3rd row폐업
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
영업/정상 166
47.3%
폐업 153
43.6%
취소/말소/만료/정지/중지 31
 
8.8%
휴업 1
 
0.3%

Length

2023-12-12T04:35:33.232092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T04:35:33.406077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 166
47.3%
폐업 153
43.6%
취소/말소/만료/정지/중지 31
 
8.8%
휴업 1
 
0.3%

폐업일자
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct148
Distinct (%)96.7%
Missing198
Missing (%)56.4%
Infinite0
Infinite (%)0.0%
Mean20144936
Minimum20050507
Maximum20230918
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T04:35:33.592737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20050507
5-th percentile20070938
Q120100622
median20150311
Q320181108
95-th percentile20210991
Maximum20230918
Range180411
Interquartile range (IQR)80486

Descriptive statistics

Standard deviation47245.865
Coefficient of variation (CV)0.0023452973
Kurtosis-1.1524295
Mean20144936
Median Absolute Deviation (MAD)40201
Skewness-0.089205569
Sum3.0821752 × 109
Variance2.2321717 × 109
MonotonicityNot monotonic
2023-12-12T04:35:33.767018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20150730 2
 
0.6%
20170316 2
 
0.6%
20191231 2
 
0.6%
20200331 2
 
0.6%
20221007 2
 
0.6%
20050630 1
 
0.3%
20210107 1
 
0.3%
20190116 1
 
0.3%
20201103 1
 
0.3%
20181108 1
 
0.3%
Other values (138) 138
39.3%
(Missing) 198
56.4%
ValueCountFrequency (%)
20050507 1
0.3%
20050630 1
0.3%
20060706 1
0.3%
20061002 1
0.3%
20070110 1
0.3%
20070313 1
0.3%
20070402 1
0.3%
20070531 1
0.3%
20071210 1
0.3%
20071220 1
0.3%
ValueCountFrequency (%)
20230918 1
0.3%
20230511 1
0.3%
20230509 1
0.3%
20221107 1
0.3%
20221007 2
0.6%
20220722 1
0.3%
20211126 1
0.3%
20210901 1
0.3%
20210624 1
0.3%
20210520 1
0.3%

휴업시작일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
<NA>
350 
20050819
 
1

Length

Max length8
Median length4
Mean length4.011396
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row20050819
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 350
99.7%
20050819 1
 
0.3%

Length

2023-12-12T04:35:33.981055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T04:35:34.152912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 350
99.7%
20050819 1
 
0.3%

휴업종료일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
<NA>
350 
20060818
 
1

Length

Max length8
Median length4
Mean length4.011396
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row20060818
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 350
99.7%
20060818 1
 
0.3%

Length

2023-12-12T04:35:34.360876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T04:35:34.773984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 350
99.7%
20060818 1
 
0.3%

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing351
Missing (%)100.0%
Memory size3.2 KiB

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing351
Missing (%)100.0%
Memory size3.2 KiB
Distinct332
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-12T04:35:35.170430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length41
Mean length27.002849
Min length16

Characters and Unicode

Total characters9478
Distinct characters197
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique317 ?
Unique (%)90.3%

Sample

1st row제주특별자치도 제주시 노형동 1256-4번지
2nd row제주특별자치도 제주시 오라이동 2185-2번지
3rd row제주특별자치도 제주시 노형동 711-1번지 신영빌딩
4th row제주특별자치도 제주시 이도이동 1997-5
5th row제주특별자치도 제주시 아라일동 6092-1번지
ValueCountFrequency (%)
제주특별자치도 349
20.6%
제주시 318
18.7%
연동 93
 
5.5%
노형동 50
 
2.9%
2층 45
 
2.7%
서귀포시 31
 
1.8%
이도이동 28
 
1.6%
일도이동 23
 
1.4%
1층 22
 
1.3%
용담이동 16
 
0.9%
Other values (527) 723
42.6%
2023-12-12T04:35:35.803867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1347
 
14.2%
676
 
7.1%
672
 
7.1%
452
 
4.8%
1 369
 
3.9%
365
 
3.9%
353
 
3.7%
351
 
3.7%
351
 
3.7%
350
 
3.7%
Other values (187) 4192
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5754
60.7%
Decimal Number 1948
 
20.6%
Space Separator 1347
 
14.2%
Dash Punctuation 306
 
3.2%
Open Punctuation 56
 
0.6%
Close Punctuation 56
 
0.6%
Uppercase Letter 7
 
0.1%
Other Punctuation 3
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
676
11.7%
672
11.7%
452
 
7.9%
365
 
6.3%
353
 
6.1%
351
 
6.1%
351
 
6.1%
350
 
6.1%
349
 
6.1%
156
 
2.7%
Other values (164) 1679
29.2%
Decimal Number
ValueCountFrequency (%)
1 369
18.9%
2 348
17.9%
3 241
12.4%
4 190
9.8%
0 184
9.4%
5 151
7.8%
9 140
 
7.2%
6 126
 
6.5%
7 118
 
6.1%
8 81
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
C 2
28.6%
J 1
14.3%
A 1
14.3%
S 1
14.3%
B 1
14.3%
D 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
: 1
33.3%
Space Separator
ValueCountFrequency (%)
1347
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 306
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5754
60.7%
Common 3716
39.2%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
676
11.7%
672
11.7%
452
 
7.9%
365
 
6.3%
353
 
6.1%
351
 
6.1%
351
 
6.1%
350
 
6.1%
349
 
6.1%
156
 
2.7%
Other values (164) 1679
29.2%
Common
ValueCountFrequency (%)
1347
36.2%
1 369
 
9.9%
2 348
 
9.4%
- 306
 
8.2%
3 241
 
6.5%
4 190
 
5.1%
0 184
 
5.0%
5 151
 
4.1%
9 140
 
3.8%
6 126
 
3.4%
Other values (6) 314
 
8.4%
Latin
ValueCountFrequency (%)
C 2
25.0%
J 1
12.5%
A 1
12.5%
S 1
12.5%
B 1
12.5%
1
12.5%
D 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5754
60.7%
ASCII 3723
39.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1347
36.2%
1 369
 
9.9%
2 348
 
9.3%
- 306
 
8.2%
3 241
 
6.5%
4 190
 
5.1%
0 184
 
4.9%
5 151
 
4.1%
9 140
 
3.8%
6 126
 
3.4%
Other values (12) 321
 
8.6%
Hangul
ValueCountFrequency (%)
676
11.7%
672
11.7%
452
 
7.9%
365
 
6.3%
353
 
6.1%
351
 
6.1%
351
 
6.1%
350
 
6.1%
349
 
6.1%
156
 
2.7%
Other values (164) 1679
29.2%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct321
Distinct (%)94.4%
Missing11
Missing (%)3.1%
Memory size2.9 KiB
2023-12-12T04:35:36.206958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length46
Mean length32.852941
Min length22

Characters and Unicode

Total characters11170
Distinct characters236
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique310 ?
Unique (%)91.2%

Sample

1st row제주특별자치도 제주시 연삼로 165, 2층 (오라이동)
2nd row제주특별자치도 제주시 연북로 10, 신영빌딩 3층 (노형동)
3rd row제주특별자치도 제주시 구남로8길 4-6 (이도이동)
4th row제주특별자치도 제주시 아란7길 1, 3층 (아라일동)
5th row제주특별자치도 제주시 신광로 20, 501호 (연동)
ValueCountFrequency (%)
제주특별자치도 339
 
16.3%
제주시 313
 
15.1%
2층 81
 
3.9%
연동 70
 
3.4%
1층 41
 
2.0%
노형동 34
 
1.6%
3층 29
 
1.4%
서귀포시 26
 
1.3%
이도이동 23
 
1.1%
도령로 16
 
0.8%
Other values (602) 1107
53.2%
2023-12-12T04:35:36.771062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1792
 
16.0%
674
 
6.0%
658
 
5.9%
469
 
4.2%
401
 
3.6%
( 384
 
3.4%
) 384
 
3.4%
1 359
 
3.2%
342
 
3.1%
341
 
3.1%
Other values (226) 5366
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6668
59.7%
Space Separator 1792
 
16.0%
Decimal Number 1570
 
14.1%
Open Punctuation 384
 
3.4%
Close Punctuation 384
 
3.4%
Other Punctuation 307
 
2.7%
Dash Punctuation 51
 
0.5%
Uppercase Letter 12
 
0.1%
Lowercase Letter 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
674
 
10.1%
658
 
9.9%
469
 
7.0%
401
 
6.0%
342
 
5.1%
341
 
5.1%
340
 
5.1%
339
 
5.1%
339
 
5.1%
320
 
4.8%
Other values (201) 2445
36.7%
Decimal Number
ValueCountFrequency (%)
1 359
22.9%
2 308
19.6%
3 168
10.7%
5 157
10.0%
0 156
9.9%
4 128
 
8.2%
6 99
 
6.3%
7 76
 
4.8%
9 61
 
3.9%
8 58
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
B 3
25.0%
C 2
16.7%
A 2
16.7%
D 1
 
8.3%
J 1
 
8.3%
S 1
 
8.3%
M 1
 
8.3%
E 1
 
8.3%
Space Separator
ValueCountFrequency (%)
1792
100.0%
Open Punctuation
ValueCountFrequency (%)
( 384
100.0%
Close Punctuation
ValueCountFrequency (%)
) 384
100.0%
Other Punctuation
ValueCountFrequency (%)
, 307
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 51
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6668
59.7%
Common 4488
40.2%
Latin 14
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
674
 
10.1%
658
 
9.9%
469
 
7.0%
401
 
6.0%
342
 
5.1%
341
 
5.1%
340
 
5.1%
339
 
5.1%
339
 
5.1%
320
 
4.8%
Other values (201) 2445
36.7%
Common
ValueCountFrequency (%)
1792
39.9%
( 384
 
8.6%
) 384
 
8.6%
1 359
 
8.0%
2 308
 
6.9%
, 307
 
6.8%
3 168
 
3.7%
5 157
 
3.5%
0 156
 
3.5%
4 128
 
2.9%
Other values (5) 345
 
7.7%
Latin
ValueCountFrequency (%)
B 3
21.4%
C 2
14.3%
A 2
14.3%
D 1
 
7.1%
J 1
 
7.1%
S 1
 
7.1%
e 1
 
7.1%
1
 
7.1%
M 1
 
7.1%
E 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6668
59.7%
ASCII 4501
40.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1792
39.8%
( 384
 
8.5%
) 384
 
8.5%
1 359
 
8.0%
2 308
 
6.8%
, 307
 
6.8%
3 168
 
3.7%
5 157
 
3.5%
0 156
 
3.5%
4 128
 
2.8%
Other values (14) 358
 
8.0%
Hangul
ValueCountFrequency (%)
674
 
10.1%
658
 
9.9%
469
 
7.0%
401
 
6.0%
342
 
5.1%
341
 
5.1%
340
 
5.1%
339
 
5.1%
339
 
5.1%
320
 
4.8%
Other values (201) 2445
36.7%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct138
Distinct (%)59.2%
Missing118
Missing (%)33.6%
Infinite0
Infinite (%)0.0%
Mean111649.85
Minimum63008
Maximum690819
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T04:35:37.236547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum63008
5-th percentile63081
Q163124
median63169
Q363257
95-th percentile690148.4
Maximum690819
Range627811
Interquartile range (IQR)133

Descriptive statistics

Standard deviation167795.56
Coefficient of variation (CV)1.5028731
Kurtosis8.2293442
Mean111649.85
Median Absolute Deviation (MAD)60
Skewness3.1872817
Sum26014416
Variance2.815535 × 1010
MonotonicityNot monotonic
2023-12-12T04:35:37.413675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
63081 10
 
2.8%
63139 9
 
2.6%
63124 8
 
2.3%
63118 6
 
1.7%
63132 5
 
1.4%
63125 5
 
1.4%
63134 5
 
1.4%
63568 4
 
1.1%
63126 4
 
1.1%
63188 3
 
0.9%
Other values (128) 174
49.6%
(Missing) 118
33.6%
ValueCountFrequency (%)
63008 1
 
0.3%
63031 1
 
0.3%
63038 1
 
0.3%
63050 1
 
0.3%
63069 2
 
0.6%
63071 1
 
0.3%
63073 1
 
0.3%
63081 10
2.8%
63082 3
 
0.9%
63083 1
 
0.3%
ValueCountFrequency (%)
690819 1
 
0.3%
690817 1
 
0.3%
690815 1
 
0.3%
690814 1
 
0.3%
690813 1
 
0.3%
690241 1
 
0.3%
690180 3
0.9%
690170 2
0.6%
690161 1
 
0.3%
690140 1
 
0.3%
Distinct150
Distinct (%)42.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2018-08-31 23:59:00
Maximum2023-09-29 02:40:00
2023-12-12T04:35:37.584410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:37.778187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T04:35:29.314440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:27.898931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.368109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.856789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:29.426892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:27.993187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.492418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.962063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:29.520185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.111673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.626406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:29.049943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:29.646753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.230434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:28.721124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T04:35:29.190339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T04:35:37.889229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가일자인허가취소일자영업상태명폐업일자도로명우편번호
인허가일자1.0000.6800.5300.7480.416
인허가취소일자0.6801.000NaNNaN0.429
영업상태명0.530NaN1.000NaN0.183
폐업일자0.748NaNNaN1.0000.740
도로명우편번호0.4160.4290.1830.7401.000
2023-12-12T04:35:38.013692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
휴업종료일자영업상태명휴업시작일자
휴업종료일자1.000NaNNaN
영업상태명NaN1.000NaN
휴업시작일자NaNNaN1.000
2023-12-12T04:35:38.142203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가일자인허가취소일자폐업일자도로명우편번호영업상태명휴업시작일자휴업종료일자
인허가일자1.0000.4920.556-0.1210.344NaNNaN
인허가취소일자0.4921.000NaN-0.4501.0000.0000.000
폐업일자0.556NaN1.000-0.5211.0000.0000.000
도로명우편번호-0.121-0.450-0.5211.0000.2960.0000.000
영업상태명0.3441.0001.0000.2961.000NaNNaN
휴업시작일자NaN0.0000.0000.000NaN1.000NaN
휴업종료일자NaN0.0000.0000.000NaNNaN1.000

Missing values

2023-12-12T04:35:29.853172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T04:35:30.085240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T04:35:30.313369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

사업장명업종구분대분류인허가일자인허가취소일자영업상태명폐업일자휴업시작일자휴업종료일자재개업일자소재지면적지번주소도로명주소도로명우편번호데이터갱신일자
0보리여행사국내외여행업19980430<NA>휴업<NA>2005081920060818<NA><NA>제주특별자치도 제주시 노형동 1256-4번지<NA><NA>2018-08-31 23:59
1제이제이골프국내외여행업20181115<NA>폐업20200403<NA><NA><NA><NA>제주특별자치도 제주시 오라이동 2185-2번지제주특별자치도 제주시 연삼로 165, 2층 (오라이동)631482020-04-08 2:40
2제주렌터카서비스협동조합국내외여행업20190114<NA>폐업20200331<NA><NA><NA><NA>제주특별자치도 제주시 노형동 711-1번지 신영빌딩제주특별자치도 제주시 연북로 10, 신영빌딩 3층 (노형동)630862020-04-18 2:40
3주식회사 한스글로벌국내외여행업20190416<NA>폐업20221007<NA><NA><NA><NA>제주특별자치도 제주시 이도이동 1997-5제주특별자치도 제주시 구남로8길 4-6 (이도이동)632322022-10-09 2:40
4핑크투어국내외여행업20190530<NA>폐업20200323<NA><NA><NA><NA>제주특별자치도 제주시 아라일동 6092-1번지제주특별자치도 제주시 아란7길 1, 3층 (아라일동)632372020-03-25 2:40
5(주)요요국내외여행업20140115<NA>폐업20140225<NA><NA><NA><NA>제주특별자치도 제주시 연동 260-50번지제주특별자치도 제주시 신광로 20, 501호 (연동)6908132018-08-31 23:59
6(주)탐라국여행사국내외여행업20130314<NA>폐업20140801<NA><NA><NA><NA>제주특별자치도 제주시 노형동 2520-18번지 2층제주특별자치도 제주시 노형로 355, 2층 (노형동)6901802018-08-31 23:59
7제주킹덤투어(주)국내외여행업20130405<NA>폐업20200519<NA><NA><NA><NA>제주특별자치도 제주시 이호일동 295-1번지 이호리조트제주특별자치도 제주시 도리로 106-16, 이호리조트 1층 (이호일동)631082020-05-22 2:40
8(주)신정관광여행사국내외여행업20131202<NA>폐업20150311<NA><NA><NA><NA>제주특별자치도 제주시 용담일동 84-2번지제주특별자치도 제주시 서문로5길 7 (용담일동)6908192018-08-31 23:59
9(주)제주디엠씨국내외여행업20110530<NA>폐업20130710<NA><NA><NA><NA>제주특별자치도 제주시 용담일동 238-2번지제주특별자치도 제주시 용남길 3 (용담일동)<NA>2018-08-31 23:59
사업장명업종구분대분류인허가일자인허가취소일자영업상태명폐업일자휴업시작일자휴업종료일자재개업일자소재지면적지번주소도로명주소도로명우편번호데이터갱신일자
341하나이글스투어국내외여행업20060327<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 서홍동 731-61제주특별자치도 서귀포시 남성로128번길 37-6 (서홍동)635942023-02-11 2:40
342투어리스여행사국내외여행업20120605<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 서홍동 349-8 제1호제주특별자치도 서귀포시 홍중로 53-3 (서홍동, 민우아파트)635842023-02-11 2:40
343제주아일랜드센터국내외여행업20150402<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 동홍동 416-8제주특별자치도 서귀포시 동홍중앙로66번길 35 (동홍동)635872023-02-11 2:40
344더힐링아일랜드국내외여행업20171128<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 호근동 409-4제주특별자치도 서귀포시 태평로120번길 20 (호근동)635712023-02-11 2:40
345누리해외여행국내외여행업20200221<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 하효동 701제주특별자치도 서귀포시 하신상로 7, 3층 (하효동)636062023-03-11 2:40
346(주)클릭제주국내외여행업20230413<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 서호동 1605제주특별자치도 서귀포시 서호중앙로 55, 제주유포리아지식산업센터 비존동 403호 (서호동)635682023-04-15 0:20
347바스톤투어국내외여행업20230302<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 서호동 1605제주특별자치도 서귀포시 서호중앙로 55, 유포리아지식산업센터 E동 104,105호 (서호동)635682023-03-11 2:40
348주식회사 뷰틱스국내외여행업20230220<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 서호동 1605제주특별자치도 서귀포시 서호중앙로 55, 유포리아지식센터 A동 515호 (서호동)635682023-02-22 0:41
349(주)클릭앤라이드국내외여행업20221230<NA>영업/정상<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 강정동 5009제주특별자치도 서귀포시 말질로 322, 1층 102호 (강정동)635632023-02-11 2:40
350(주)플러스세계여행국내외여행업2002042220090226취소/말소/만료/정지/중지<NA><NA><NA><NA><NA>제주특별자치도 서귀포시 동홍동 1485-25번지제주특별자치도 서귀포시 중앙로 108 (동홍동)<NA>2018-08-31 23:59