Overview

Dataset statistics

Number of variables: 4
Number of observations: 804
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 25.3 KiB
Average record size in memory: 32.2 B
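
The average record size can be cross-checked from the total size and the number of observations. The following is a minimal sketch, assuming that 1 KiB means 1024 bytes and that the report rounds to one decimal place; the variable names are illustrative and do not come from the report.

```python
# Values copied from the dataset statistics above.
total_size_kib = 25.3
n_observations = 804

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{avg_record_bytes:.1f} B")
```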

Variable types

Categorical: 2
Text: 2

Dataset

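Description: A file covering the travel business within Gyeongsangnam-do's tourism industry. It lists each travel agency's business category (comprehensive, domestic and overseas, or domestic), company name, location, and related details.
Author: Gyeongsangnam-do (경상남도)
URL: https://www.data.go.kr/data/15102944/fileData.do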

Reproduction

Analysis started: 2023-12-12 00:04:53.901858
Analysis finished: 2023-12-12 00:04:54.411151
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군명
Categorical

Distinct: 19
Distinct (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

창원시: 251
진주시: 147
김해시: 109
거제시: 65
통영시: 52
Other values (14): 180

Length

Max length: 4
Median length: 3
Mean length: 3.0074627
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 창원시
2nd row: 창원시
3rd row: 창원시
4th row: 창원시
5th row: 창원시

Common Values

Value | Count | Frequency (%)
창원시 | 251 | 31.2%
진주시 | 147 | 18.3%
김해시 | 109 | 13.6%
거제시 | 65 | 8.1%
통영시 | 52 | 6.5%
양산시 | 49 | 6.1%
사천시 | 17 | 2.1%
밀양시 | 15 | 1.9%
합천군 | 13 | 1.6%
거창군 | 12 | 1.5%
Other values (9) | 74 | 9.2%
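
The frequency column is each count divided by the 804 observations. The sketch below recomputes it from the counts in the table above, assuming the report rounds to one decimal place; the dictionary name `counts` is illustrative.

```python
# Counts copied from the Common Values table for 시군명.
counts = {
    "창원시": 251, "진주시": 147, "김해시": 109, "거제시": 65,
    "통영시": 52, "양산시": 49, "사천시": 17, "밀양시": 15,
    "합천군": 13, "거창군": 12, "Other values (9)": 74,
}

total = sum(counts.values())  # should equal the number of observations

# Percentage share of each value, rounded to one decimal place.
freq_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in freq_pct.items():
    print(f"{value}: {pct}%")
```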

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
창원시 | 251 | 31.2%
진주시 | 147 | 18.3%
김해시 | 109 | 13.6%
거제시 | 65 | 8.1%
통영시 | 52 | 6.5%
양산시 | 49 | 6.1%
사천시 | 23 | 2.9%
밀양시 | 15 | 1.9%
합천군 | 13 | 1.6%
거창군 | 12 | 1.5%
Other values (8) | 68 | 8.5%
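
This second table counts 사천시 23 times, whereas Common Values counts it 17 times, and the Length section reports a maximum length of 4 against a median of 3. One possible explanation, which the report itself does not state, is that six values carry a stray extra character such as a trailing space. The sketch below uses hypothetical data built on that assumption to show how exact-match and whitespace-stripped counts diverge, and checks that the assumption would reproduce the reported mean length if every other value has length 3.

```python
from collections import Counter

# Hypothetical raw values: 17 clean entries plus 6 with a trailing space.
raw = ["사천시"] * 17 + ["사천시 "] * 6

exact = Counter(raw)                        # counts values as stored
stripped = Counter(v.strip() for v in raw)  # counts after trimming whitespace

# Values that change when trimmed are whitespace variants.
variants = {v for v in exact if v != v.strip()}

# Assumption: the other 798 values all have length 3, and 6 have length 4.
mean_length = (798 * 3 + 6 * 4) / 804

print(exact["사천시"], stripped["사천시"], round(mean_length, 7))
```

If this holds for the real file, trimming the column before profiling would merge the two variants and reduce the distinct count from 19 to 18.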

업종분류
Categorical

Distinct: 5
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

국내외여행업: 437
국내여행업: 222
종합여행업: 133
일반여행업: 7
국외여행업: 5

Length

Max length: 6
Median length: 6
Mean length: 5.5435323
Min length: 5
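
These length statistics follow directly from the value counts of 업종분류. The sketch below is a cross-check rather than the profiler's own code. It assumes that length means the number of Unicode code points, which is what Python's `len` returns for these precomposed Hangul strings.

```python
from statistics import median

# Counts copied from the 업종분류 value listing in this report.
counts = {
    "국내외여행업": 437,
    "국내여행업": 222,
    "종합여행업": 133,
    "일반여행업": 7,
    "국외여행업": 5,
}

# Expand to one length per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
median_length = median(lengths)

print(min(lengths), max(lengths), median_length, round(mean_length, 7))
```

Because 437 of the 804 values have length 6, the median falls on 6 even though four of the five categories have length 5.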

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 종합여행업
2nd row: 종합여행업
3rd row: 종합여행업
4th row: 종합여행업
5th row: 종합여행업

Common Values

Value | Count | Frequency (%)
국내외여행업 | 437 | 54.4%
국내여행업 | 222 | 27.6%
종합여행업 | 133 | 16.5%
일반여행업 | 7 | 0.9%
국외여행업 | 5 | 0.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
국내외여행업 | 437 | 54.4%
국내여행업 | 222 | 27.6%
종합여행업 | 133 | 16.5%
일반여행업 | 7 | 0.9%
국외여행업 | 5 | 0.6%

업체명
Text

Distinct: 668
Distinct (%): 83.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 27
Median length: 20
Mean length: 8.2748756
Min length: 2

Characters and Unicode

Total characters: 6653
Distinct characters: 396
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
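
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below is not the profiler's implementation. It counts categories for one company name from the Sample section, and the helper name `category_counts` is hypothetical. `unicodedata.category` returns two-letter codes, so `Lo` corresponds to "Other Letter", `Ps` to "Open Punctuation", `Pe` to "Close Punctuation", and `So` to "Other Symbol".

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (two-letter codes) in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "(주)동림인터내셔널"
counts = category_counts(sample)
print(counts)

# The single-character corporate marker is a symbol, not a letter.
print(unicodedata.category("㈜"))
```

The "Other Symbol" rows in the category tables are consistent with names that use the single-character marker ㈜ instead of the three-character form (주), although the report does not list which characters fall into that category.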

Unique

Unique: 537
Unique (%): 66.8%

Sample

1st row: (주)동림인터내셔널
2nd row: (주)세원여행사
3rd row: 헤리츠투어
4th row: 신화항공여행사
5th row: (주)경일항공여행사

Value | Count | Frequency (%)
주식회사 | 116 | 11.5%
투어 | 11 | 1.1%
tour | 6 | 0.6%
여행이야기 | 5 | 0.5%
여행사 | 5 | 0.5%
협동조합 | 4 | 0.4%
골프 | 3 | 0.3%
유한회사 | 3 | 0.3%
서진항공여행사(주 | 3 | 0.3%
주)우리관광여행사 | 3 | 0.3%
Other values (706) | 849 | 84.2%

Most occurring characters

ValueCountFrequency (%)
489
 
7.4%
443
 
6.7%
364
 
5.5%
362
 
5.4%
) 351
 
5.3%
( 349
 
5.2%
230
 
3.5%
230
 
3.5%
205
 
3.1%
155
 
2.3%
Other values (386) 3475
52.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5503 | 82.7%
Close Punctuation | 351 | 5.3%
Open Punctuation | 349 | 5.2%
Space Separator | 205 | 3.1%
Uppercase Letter | 118 | 1.8%
Other Symbol | 64 | 1.0%
Lowercase Letter | 41 | 0.6%
Decimal Number | 12 | 0.2%
Other Punctuation | 7 | 0.1%
Dash Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
489
 
8.9%
443
 
8.1%
364
 
6.6%
362
 
6.6%
230
 
4.2%
230
 
4.2%
155
 
2.8%
151
 
2.7%
141
 
2.6%
135
 
2.5%
Other values (335) 2803
50.9%
Uppercase Letter
ValueCountFrequency (%)
T 18
15.3%
O 15
12.7%
A 10
 
8.5%
N 10
 
8.5%
U 9
 
7.6%
R 7
 
5.9%
E 7
 
5.9%
M 6
 
5.1%
V 5
 
4.2%
H 5
 
4.2%
Other values (13) 26
22.0%
Lowercase Letter
ValueCountFrequency (%)
r 6
14.6%
e 6
14.6%
i 5
12.2%
o 4
9.8%
u 4
9.8%
t 3
7.3%
d 3
7.3%
n 2
 
4.9%
c 2
 
4.9%
h 2
 
4.9%
Other values (4) 4
9.8%
Decimal Number
ValueCountFrequency (%)
3 4
33.3%
5 3
25.0%
6 3
25.0%
4 1
 
8.3%
7 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
& 4
57.1%
, 2
28.6%
. 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 351
100.0%
Open Punctuation
ValueCountFrequency (%)
( 349
100.0%
Space Separator
ValueCountFrequency (%)
205
100.0%
Other Symbol
ValueCountFrequency (%)
64
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5567 | 83.7%
Common | 927 | 13.9%
Latin | 159 | 2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
489
 
8.8%
443
 
8.0%
364
 
6.5%
362
 
6.5%
230
 
4.1%
230
 
4.1%
155
 
2.8%
151
 
2.7%
141
 
2.5%
135
 
2.4%
Other values (336) 2867
51.5%
Latin
ValueCountFrequency (%)
T 18
 
11.3%
O 15
 
9.4%
A 10
 
6.3%
N 10
 
6.3%
U 9
 
5.7%
R 7
 
4.4%
E 7
 
4.4%
r 6
 
3.8%
M 6
 
3.8%
e 6
 
3.8%
Other values (27) 65
40.9%
Common
ValueCountFrequency (%)
) 351
37.9%
( 349
37.6%
205
22.1%
& 4
 
0.4%
3 4
 
0.4%
5 3
 
0.3%
6 3
 
0.3%
- 2
 
0.2%
, 2
 
0.2%
. 1
 
0.1%
Other values (3) 3
 
0.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5503 | 82.7%
ASCII | 1086 | 16.3%
None | 64 | 1.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
489
 
8.9%
443
 
8.1%
364
 
6.6%
362
 
6.6%
230
 
4.2%
230
 
4.2%
155
 
2.8%
151
 
2.7%
141
 
2.6%
135
 
2.5%
Other values (335) 2803
50.9%
ASCII
ValueCountFrequency (%)
) 351
32.3%
( 349
32.1%
205
18.9%
T 18
 
1.7%
O 15
 
1.4%
A 10
 
0.9%
N 10
 
0.9%
U 9
 
0.8%
R 7
 
0.6%
E 7
 
0.6%
Other values (40) 105
 
9.7%
None
ValueCountFrequency (%)
64
100.0%

소재지
Text

Distinct: 677
Distinct (%): 84.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 56
Median length: 43
Mean length: 30.241294
Min length: 11

Characters and Unicode

Total characters: 24314
Distinct characters: 366
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 558
Unique (%): 69.4%

Sample

1st row: 경상남도 창원시 마산합포구 3·15대로 298, 경남대학교 창업보육관 405호 (서성동)
2nd row: 경상남도 창원시 성산구 신월로5번길 28 (신월동)
3rd row: 경상남도 창원시 성산구 중앙대로 33, 대흥인터빌 6층 632호 (중앙동)
4th row: 경상남도 창원시 성산구 상남로 35, 새롬아이포빌 111호 (상남동)
5th row: 경상남도 창원시 성산구 용지로 70, 성원오피스텔 316호 (중앙동)

Value | Count | Frequency (%)
경상남도 | 702 | 14.1%
창원시 | 250 | 5.0%
진주시 | 146 | 2.9%
김해시 | 109 | 2.2%
2층 | 105 | 2.1%
성산구 | 101 | 2.0%
1층 | 75 | 1.5%
거제시 | 65 | 1.3%
통영시 | 52 | 1.0%
마산회원구 | 52 | 1.0%
Other values (1421) | 3333 | 66.8%

Most occurring characters

ValueCountFrequency (%)
4217
 
17.3%
1 929
 
3.8%
870
 
3.6%
863
 
3.5%
756
 
3.1%
741
 
3.0%
740
 
3.0%
734
 
3.0%
713
 
2.9%
2 624
 
2.6%
Other values (356) 13127
54.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 14176 | 58.3%
Space Separator | 4217 | 17.3%
Decimal Number | 3879 | 16.0%
Other Punctuation | 624 | 2.6%
Open Punctuation | 605 | 2.5%
Close Punctuation | 605 | 2.5%
Dash Punctuation | 165 | 0.7%
Uppercase Letter | 41 | 0.2%
Lowercase Letter | 1 | < 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
870
 
6.1%
863
 
6.1%
756
 
5.3%
741
 
5.2%
740
 
5.2%
734
 
5.2%
713
 
5.0%
396
 
2.8%
372
 
2.6%
371
 
2.6%
Other values (324) 7620
53.8%
Uppercase Letter
ValueCountFrequency (%)
B 10
24.4%
W 6
14.6%
A 6
14.6%
R 3
 
7.3%
T 3
 
7.3%
C 2
 
4.9%
E 2
 
4.9%
O 2
 
4.9%
S 2
 
4.9%
G 2
 
4.9%
Other values (3) 3
 
7.3%
Decimal Number
ValueCountFrequency (%)
1 929
23.9%
2 624
16.1%
3 446
11.5%
0 414
10.7%
4 328
 
8.5%
5 308
 
7.9%
6 247
 
6.4%
7 241
 
6.2%
9 179
 
4.6%
8 163
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 614
98.4%
· 7
 
1.1%
. 3
 
0.5%
Space Separator
ValueCountFrequency (%)
4217
100.0%
Open Punctuation
ValueCountFrequency (%)
( 605
100.0%
Close Punctuation
ValueCountFrequency (%)
) 605
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 165
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 14176 | 58.3%
Common | 10096 | 41.5%
Latin | 42 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
870
 
6.1%
863
 
6.1%
756
 
5.3%
741
 
5.2%
740
 
5.2%
734
 
5.2%
713
 
5.0%
396
 
2.8%
372
 
2.6%
371
 
2.6%
Other values (324) 7620
53.8%
Common
ValueCountFrequency (%)
4217
41.8%
1 929
 
9.2%
2 624
 
6.2%
, 614
 
6.1%
( 605
 
6.0%
) 605
 
6.0%
3 446
 
4.4%
0 414
 
4.1%
4 328
 
3.2%
5 308
 
3.1%
Other values (8) 1006
 
10.0%
Latin
ValueCountFrequency (%)
B 10
23.8%
W 6
14.3%
A 6
14.3%
R 3
 
7.1%
T 3
 
7.1%
C 2
 
4.8%
E 2
 
4.8%
O 2
 
4.8%
S 2
 
4.8%
G 2
 
4.8%
Other values (4) 4
 
9.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 14175 | 58.3%
ASCII | 10131 | 41.7%
None | 7 | < 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4217
41.6%
1 929
 
9.2%
2 624
 
6.2%
, 614
 
6.1%
( 605
 
6.0%
) 605
 
6.0%
3 446
 
4.4%
0 414
 
4.1%
4 328
 
3.2%
5 308
 
3.0%
Other values (21) 1041
 
10.3%
Hangul
ValueCountFrequency (%)
870
 
6.1%
863
 
6.1%
756
 
5.3%
741
 
5.2%
740
 
5.2%
734
 
5.2%
713
 
5.0%
396
 
2.8%
372
 
2.6%
371
 
2.6%
Other values (323) 7619
53.7%
None
ValueCountFrequency (%)
· 7
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Correlations

Variable | 시군명 | 업종분류
시군명 | 1.000 | 0.681
업종분류 | 0.681 | 1.000

Variable | 시군명 | 업종분류
시군명 | 1.000 | 0.417
업종분류 | 0.417 | 1.000

Variable | 시군명 | 업종분류
시군명 | 1.000 | 0.417
업종분류 | 0.417 | 1.000
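
The report text does not say which coefficient each matrix shows. For two categorical variables, one widely used association measure is Cramér's V, which is derived from the chi-squared statistic of the contingency table. The sketch below implements the plain, uncorrected form on a hypothetical table. It is not necessarily how ydata-profiling computed the values above, and it assumes every row and column total is non-zero.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by n and the smaller table dimension minus one.
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical tables: perfect association and no association.
perfect = cramers_v([[10, 0], [0, 10]])
independent = cramers_v([[5, 5], [5, 5]])
print(perfect, independent)
```

Values range from 0 (no association) to 1 (one variable fully determines the other), so figures such as 0.417 and 0.681 indicate a moderate to fairly strong association between 시군명 and 업종분류 under whichever measure was used.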

Missing values

A simple visualization of nullity by column.

The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

Row | 시군명 | 업종분류 | 업체명 | 소재지
0 | 창원시 | 종합여행업 | (주)동림인터내셔널 | 경상남도 창원시 마산합포구 3·15대로 298, 경남대학교 창업보육관 405호 (서성동)
1 | 창원시 | 종합여행업 | (주)세원여행사 | 경상남도 창원시 성산구 신월로5번길 28 (신월동)
2 | 창원시 | 종합여행업 | 헤리츠투어 | 경상남도 창원시 성산구 중앙대로 33, 대흥인터빌 6층 632호 (중앙동)
3 | 창원시 | 종합여행업 | 신화항공여행사 | 경상남도 창원시 성산구 상남로 35, 새롬아이포빌 111호 (상남동)
4 | 창원시 | 종합여행업 | (주)경일항공여행사 | 경상남도 창원시 성산구 용지로 70, 성원오피스텔 316호 (중앙동)
5 | 창원시 | 종합여행업 | (주)태평양항공여행사 | 경상남도 창원시 성산구 원이대로 332 (대원동, 1층)
6 | 창원시 | 종합여행업 | ㈜트래블신나라 | 경상남도 창원시 의창구 태복산로 19-1(도계동)
7 | 창원시 | 종합여행업 | 마이스피플 주식회사 | 경상남도 창원시 성산구 원이대로 362, 창원컨벤션센터 1001호(대원동)
8 | 창원시 | 종합여행업 | (주)다모아투어 | 경상남도 창원시 성산구 용지로 161, 201호 (용호동, 경남빌딩)
9 | 창원시 | 종합여행업 | (주)잇츠코리아 에이전시 | 경상남도 창원시 마산회원구 내서읍 삼계4길 8, 201호 (오아시스빌딩)

Row | 시군명 | 업종분류 | 업체명 | 소재지
794 | 합천군 | 국내외여행업 | 합천새천년관광㈜ | 합천군 합천읍 옥산로 102, 2층
795 | 합천군 | 국내외여행업 | 금화고속관광 | 합천군 삼가면 삼가로 123-4
796 | 합천군 | 국내외여행업 | 경호관광㈜ | 합천군 합천읍 대야로 901
797 | 합천군 | 국내외여행업 | 주식회사매화고속관광여행사 | 합천군 합천읍 서산실 17-3
798 | 합천군 | 국내여행업 | 주식회사매화관광여행 | 합천군 합천읍 서산실 17-3
799 | 합천군 | 국내여행업 | ㈜해인고속관광 | 합천군 합천읍 대야로 886-1
800 | 합천군 | 국내여행업 | 경호관광주식회사 | 합천군 합천읍 대야로 901
801 | 합천군 | 국내여행업 | 위드합천협동조합 | 합천군 삼가면 삼가중앙길 21-7, 2층
802 | 합천군 | 국내여행업 | 패키지여행사 | 합천군 대병면 합천호수로 310
803 | 합천군 | 국내여행업 | 주민공정여행사 합천댕김 주식회사 | 합천군 가야면 가야산로 1183, 1층
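
Rows 796 and 800 share an address, and their company names differ only in how the corporate form is written (㈜ versus 주식회사), so they appear to refer to the same company registered under two business categories. When grouping or de-duplicating by 업체명, it can help to strip such markers first. The sketch below is one possible approach under stated assumptions: the helper name `strip_corporate_form` is hypothetical, and the marker list covers only forms visible in this report, not every Korean corporate designation.

```python
import re

# Corporate-form markers seen in this report; illustrative, not exhaustive.
_CORPORATE_FORMS = re.compile(r"\(주\)|㈜|주식회사")


def strip_corporate_form(name: str) -> str:
    """Remove corporate-form markers and surrounding whitespace from a name."""
    return _CORPORATE_FORMS.sub("", name).strip()


names = ["경호관광㈜", "경호관광주식회사", "(주)세원여행사", "마이스피플 주식회사"]
normalized = [strip_corporate_form(n) for n in names]
print(normalized)
```

Explicit replacement is used here instead of Unicode compatibility normalization (NFKC), because NFKC expands ㈜ to the Chinese-character form (株) and not to the Hangul form (주).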