Overview

Dataset statistics

Number of variables: 5
Number of observations: 961
Missing cells: 185
Missing cells (%): 3.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 38.6 KiB
Average record size in memory: 41.1 B
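
The percentage and per-record figures above follow from the raw counts. A minimal sketch of that arithmetic, using only the numbers reported in this table (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 961
missing_cells = 185
total_size_kib = 38.6

# Share of missing cells over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = missing_cells / total_cells * 100

# Average record size: total memory in bytes divided by the row count.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the table (3.9% and 41.1 B).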

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

Description: Provides data on the status of travel agencies in Gyeongsangnam-do (경상남도).
Author: Gyeongsangnam-do (경상남도)
URL: https://www.data.go.kr/data/3083313/fileData.do

Alerts

연번 is highly overall correlated with 업종 (High correlation)
업종 is highly overall correlated with 연번 (High correlation)
전화번호 has 185 (19.3%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 20:47:04.423053
Analysis finished: 2023-12-12 20:47:05.160918
Duration: 0.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
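
The reported duration is the difference between the two timestamps. A small sketch of that calculation with Python's standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-12 20:47:04.423053")
finished = datetime.fromisoformat("2023-12-12 20:47:05.160918")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```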

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 955
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 477.16233
Minimum: 1
Maximum: 955
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 48
Q1: 239
Median: 477
Q3: 715
95-th percentile: 907
Maximum: 955
Range: 954
Interquartile range (IQR): 476

Descriptive statistics

Standard deviation: 275.52778
Coefficient of variation (CV): 0.57742987
Kurtosis: -1.1951147
Mean: 477.16233
Median Absolute Deviation (MAD): 238
Skewness: 0.0035864421
Sum: 458553
Variance: 75915.559
Monotonicity: Increasing
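
Several of these statistics are related by simple identities (mean = sum / count, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum). The sketch below checks the reported figures against those identities; it is only a consistency check on the published numbers, not the profiler's own computation, and the dictionary names are illustrative.

```python
import math

# Values copied from the statistics reported for 연번.
n, total = 961, 458553
reported = {"mean": 477.16233, "std": 275.52778, "variance": 75915.559,
            "cv": 0.57742987, "iqr": 476, "range": 954}
q1, q3, minimum, maximum = 239, 715, 1, 955

derived = {
    "mean": total / n,                         # sum / observations
    "variance": reported["std"] ** 2,          # std squared
    "cv": reported["std"] / reported["mean"],  # std / mean
    "iqr": q3 - q1,                            # Q3 - Q1
    "range": maximum - minimum,                # max - min
}

# Compare each derived figure with the reported one, allowing for rounding.
checks = {name: math.isclose(value, reported[name], rel_tol=1e-6)
          for name, value in derived.items()}
print(checks)
```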
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
530                 3      0.3%
390                 2      0.2%
30                  2      0.2%
207                 2      0.2%
376                 2      0.2%
653                 1      0.1%
641                 1      0.1%
632                 1      0.1%
656                 1      0.1%
633                 1      0.1%
Other values (945)  945    98.3%
Minimum 10 values
Value  Count  Frequency (%)
1      1      0.1%
2      1      0.1%
3      1      0.1%
4      1      0.1%
5      1      0.1%
6      1      0.1%
7      1      0.1%
8      1      0.1%
9      1      0.1%
10     1      0.1%
Maximum 10 values
Value  Count  Frequency (%)
955    1      0.1%
954    1      0.1%
953    1      0.1%
952    1      0.1%
951    1      0.1%
950    1      0.1%
949    1      0.1%
948    1      0.1%
947    1      0.1%
946    1      0.1%

업종 (business type)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB
국내여행업: 445
국외여행업: 443
일반여행업: 73

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반여행업
2nd row: 일반여행업
3rd row: 일반여행업
4th row: 일반여행업
5th row: 일반여행업

Common Values

Value       Count  Frequency (%)
국내여행업  445    46.3%
국외여행업  443    46.1%
일반여행업  73     7.6%

Length

Histogram of lengths of the category

Common Values (Plot)


여행사명 (travel agency name)
Text

Distinct: 591
Distinct (%): 61.5%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Length

Max length: 27
Median length: 20
Mean length: 8.2965661
Min length: 3

Characters and Unicode

Total characters: 7973
Distinct characters: 343
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
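
As an illustration of such character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value from this column; the helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its tables. The standard library does not expose script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Sample value taken from the 여행사명 column.
counts = category_counts("(주)럭키항공여행사")

# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation.
print(counts)
```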

Unique

Unique: 235
Unique (%): 24.5%

Sample

1st row: 주식회사 스타일투어
2nd row: (주)럭키항공여행사
3rd row: 주식회사 엠에스투어
4th row: (주)다모아투어
5th row: (주)태평양항공여행사
Value               Count  Frequency (%)
주식회사            96     8.4%
투어                13     1.1%
여행사              8      0.7%
tour                7      0.6%
동백투어            4      0.4%
주)하나여행사       4      0.4%
주)티월드           4      0.4%
주)가든여행사       4      0.4%
여행이야기          4      0.4%
서진항공여행사(주   3      0.3%
Other values (614)  990    87.1%

Most occurring characters

ValueCountFrequency (%)
608
 
7.6%
546
 
6.8%
) 507
 
6.4%
( 506
 
6.3%
505
 
6.3%
504
 
6.3%
246
 
3.1%
245
 
3.1%
226
 
2.8%
220
 
2.8%
Other values (333) 3860
48.4%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       6550   82.2%
Close Punctuation  507    6.4%
Open Punctuation   506    6.3%
Space Separator    189    2.4%
Other Symbol       73     0.9%
Lowercase Letter   67     0.8%
Uppercase Letter   61     0.8%
Decimal Number     8      0.1%
Other Punctuation  6      0.1%
Dash Punctuation   5      0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
608
 
9.3%
546
 
8.3%
505
 
7.7%
504
 
7.7%
246
 
3.8%
245
 
3.7%
226
 
3.5%
220
 
3.4%
129
 
2.0%
123
 
1.9%
Other values (288) 3198
48.8%
Uppercase Letter
Value             Count  Frequency (%)
T                 12     19.7%
O                 8      13.1%
N                 7      11.5%
U                 6      9.8%
R                 4      6.6%
A                 3      4.9%
S                 3      4.9%
K                 3      4.9%
F                 2      3.3%
G                 2      3.3%
Other values (8)  11     18.0%
Lowercase Letter
Value             Count  Frequency (%)
e                 12     17.9%
r                 9      13.4%
o                 8      11.9%
i                 6      9.0%
u                 6      9.0%
n                 5      7.5%
h                 4      6.0%
t                 4      6.0%
d                 4      6.0%
c                 2      3.0%
Other values (5)  7      10.4%
Decimal Number
Value  Count  Frequency (%)
3      3      37.5%
6      2      25.0%
5      2      25.0%
7      1      12.5%
Other Punctuation
Value  Count  Frequency (%)
&      4      66.7%
.      2      33.3%
Close Punctuation
Value  Count  Frequency (%)
)      507    100.0%
Open Punctuation
Value  Count  Frequency (%)
(      506    100.0%
Space Separator
ValueCountFrequency (%)
189
100.0%
Other Symbol
ValueCountFrequency (%)
73
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  6623   83.1%
Common  1222   15.3%
Latin   128    1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
608
 
9.2%
546
 
8.2%
505
 
7.6%
504
 
7.6%
246
 
3.7%
245
 
3.7%
226
 
3.4%
220
 
3.3%
129
 
1.9%
123
 
1.9%
Other values (289) 3271
49.4%
Latin
ValueCountFrequency (%)
T 12
 
9.4%
e 12
 
9.4%
r 9
 
7.0%
O 8
 
6.2%
o 8
 
6.2%
N 7
 
5.5%
i 6
 
4.7%
u 6
 
4.7%
U 6
 
4.7%
n 5
 
3.9%
Other values (23) 49
38.3%
Common
ValueCountFrequency (%)
) 507
41.5%
( 506
41.4%
189
 
15.5%
- 5
 
0.4%
& 4
 
0.3%
3 3
 
0.2%
6 2
 
0.2%
5 2
 
0.2%
. 2
 
0.2%
7 1
 
0.1%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  6550   82.2%
ASCII   1349   16.9%
None    73     0.9%
Arrows  1      < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
608
 
9.3%
546
 
8.3%
505
 
7.7%
504
 
7.7%
246
 
3.8%
245
 
3.7%
226
 
3.5%
220
 
3.4%
129
 
2.0%
123
 
1.9%
Other values (288) 3198
48.8%
ASCII
ValueCountFrequency (%)
) 507
37.6%
( 506
37.5%
189
 
14.0%
T 12
 
0.9%
e 12
 
0.9%
r 9
 
0.7%
O 8
 
0.6%
o 8
 
0.6%
N 7
 
0.5%
i 6
 
0.4%
Other values (33) 85
 
6.3%
None
ValueCountFrequency (%)
73
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%

주소 (address)
Text

Distinct: 624
Distinct (%): 64.9%
Missing: 0
Missing (%): 0.0%
Memory size: 7.6 KiB

Length

Max length: 55
Median length: 40
Mean length: 27.170656
Min length: 9

Characters and Unicode

Total characters: 26111
Distinct characters: 323
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 300
Unique (%): 31.2%

Sample

1st row: 창원시 의창구 의창대로247번길 10, 1층 (소답동)
2nd row: 창원시 의창구 퇴촌로 15 (사림동, 1층)
3rd row: 창원시 의창구 용지로169번길 3 (용호동, 401호)
4th row: 창원시 의창구 용지로 161, 201호 (용호동, 경남빌딩)
5th row: 창원시 의창구 원이대로 332 (대원동, 1층)
Value                Count  Frequency (%)
경상남도             417    7.7%
창원시               356    6.6%
진주시               156    2.9%
성산구               124    2.3%
2층                  95     1.8%
거제시               92     1.7%
김해시               88     1.6%
의창구               82     1.5%
마산회원구           63     1.2%
1층                  62     1.1%
Other values (1177)  3873   71.6%

Most occurring characters

ValueCountFrequency (%)
4516
 
17.3%
1 997
 
3.8%
884
 
3.4%
881
 
3.4%
879
 
3.4%
, 747
 
2.9%
) 737
 
2.8%
( 737
 
2.8%
2 666
 
2.6%
648
 
2.5%
Other values (313) 14419
55.2%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       14875  57.0%
Space Separator    4516   17.3%
Decimal Number     4279   16.4%
Other Punctuation  761    2.9%
Close Punctuation  737    2.8%
Open Punctuation   737    2.8%
Dash Punctuation   162    0.6%
Uppercase Letter   38     0.1%
Lowercase Letter   4      < 0.1%
Other Symbol       2      < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
884
 
5.9%
881
 
5.9%
879
 
5.9%
648
 
4.4%
632
 
4.2%
544
 
3.7%
516
 
3.5%
455
 
3.1%
453
 
3.0%
447
 
3.0%
Other values (285) 8536
57.4%
Decimal Number
Value  Count  Frequency (%)
1      997    23.3%
2      666    15.6%
3      497    11.6%
0      415    9.7%
5      356    8.3%
4      336    7.9%
7      287    6.7%
6      263    6.1%
8      241    5.6%
9      221    5.2%
Uppercase Letter
Value  Count  Frequency (%)
A      9      23.7%
B      9      23.7%
T      7      18.4%
K      7      18.4%
W      3      7.9%
S      1      2.6%
J      1      2.6%
G      1      2.6%
Other Punctuation
Value  Count  Frequency (%)
,      747    98.2%
·      12     1.6%
.      2      0.3%
Lowercase Letter
Value  Count  Frequency (%)
r      2      50.0%
o      2      50.0%
Space Separator
Value    Count  Frequency (%)
(space)  4516   100.0%
Close Punctuation
Value  Count  Frequency (%)
)      737    100.0%
Open Punctuation
Value  Count  Frequency (%)
(      737    100.0%
Dash Punctuation
Value  Count  Frequency (%)
-      162    100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  14877  57.0%
Common  11192  42.9%
Latin   42     0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
884
 
5.9%
881
 
5.9%
879
 
5.9%
648
 
4.4%
632
 
4.2%
544
 
3.7%
516
 
3.5%
455
 
3.1%
453
 
3.0%
447
 
3.0%
Other values (286) 8538
57.4%
Common
ValueCountFrequency (%)
4516
40.4%
1 997
 
8.9%
, 747
 
6.7%
) 737
 
6.6%
( 737
 
6.6%
2 666
 
6.0%
3 497
 
4.4%
0 415
 
3.7%
5 356
 
3.2%
4 336
 
3.0%
Other values (7) 1188
 
10.6%
Latin
ValueCountFrequency (%)
A 9
21.4%
B 9
21.4%
T 7
16.7%
K 7
16.7%
W 3
 
7.1%
r 2
 
4.8%
o 2
 
4.8%
S 1
 
2.4%
J 1
 
2.4%
G 1
 
2.4%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  14875  57.0%
ASCII   11222  43.0%
None    14     0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4516
40.2%
1 997
 
8.9%
, 747
 
6.7%
) 737
 
6.6%
( 737
 
6.6%
2 666
 
5.9%
3 497
 
4.4%
0 415
 
3.7%
5 356
 
3.2%
4 336
 
3.0%
Other values (16) 1218
 
10.9%
Hangul
ValueCountFrequency (%)
884
 
5.9%
881
 
5.9%
879
 
5.9%
648
 
4.4%
632
 
4.2%
544
 
3.7%
516
 
3.5%
455
 
3.1%
453
 
3.0%
447
 
3.0%
Other values (285) 8536
57.4%
None
ValueCountFrequency (%)
· 12
85.7%
2
 
14.3%

전화번호 (phone number)
Text

MISSING 

Distinct: 488
Distinct (%): 62.9%
Missing: 185
Missing (%): 19.3%
Memory size: 7.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.997423
Min length: 9

Characters and Unicode

Total characters: 9310
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 210
Unique (%): 27.1%

Sample

1st row: 055-288-0015
2nd row: 055-266-1155
3rd row: 055-600-3737
4th row: 070-4035-2999
5th row: 055-601-1000
Value               Count  Frequency (%)
055-643-4011        4      0.5%
055-963-3999        3      0.4%
055-295-7001        3      0.4%
055-745-0088        3      0.4%
055-299-1234        3      0.4%
055-545-2226        3      0.4%
055-224-4448        3      0.4%
055-242-5544        3      0.4%
055-288-2200        3      0.4%
055-741-9909        2      0.3%
Other values (478)  746    96.1%

Most occurring characters

Value  Count  Frequency (%)
5      2030   21.8%
-      1546   16.6%
0      1418   15.2%
2      686    7.4%
3      629    6.8%
6      578    6.2%
7      565    6.1%
8      540    5.8%
4      504    5.4%
1      452    4.9%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    7764   83.4%
Dash Punctuation  1546   16.6%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
5      2030   26.1%
0      1418   18.3%
2      686    8.8%
3      629    8.1%
6      578    7.4%
7      565    7.3%
8      540    7.0%
4      504    6.5%
1      452    5.8%
9      362    4.7%
Dash Punctuation
Value  Count  Frequency (%)
-      1546   100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  9310   100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
5      2030   21.8%
-      1546   16.6%
0      1418   15.2%
2      686    7.4%
3      629    6.8%
6      578    6.2%
7      565    6.1%
8      540    5.8%
4      504    5.4%
1      452    4.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  9310   100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
5      2030   21.8%
-      1546   16.6%
0      1418   15.2%
2      686    7.4%
3      629    6.8%
6      578    6.2%
7      565    6.1%
8      540    5.8%
4      504    5.4%
1      452    4.9%

Interactions


Correlations

      연번   업종
연번  1.000  0.913
업종  0.913  1.000

      연번   업종
연번  1.000  0.873
업종  0.873  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
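
The per-column nullity these plots summarize can be reproduced by counting missing entries. A minimal sketch, assuming `None` stands in for `<NA>` and using the 전화번호 values of the first ten rows of this dataset; the helper name `missing_summary` is hypothetical and not part of the profiling tool:

```python
# 전화번호 values of the first ten rows, with None standing in for <NA>.
phone_numbers = [
    "055-288-0015", None, None, "055-266-1155", "055-600-3737",
    "070-4035-2999", None, "055-601-1000", "055-212-1319", "055-251-5519",
]

def missing_summary(values):
    """Return (missing count, missing share in percent) for one column."""
    missing = sum(v is None for v in values)
    return missing, missing / len(values) * 100

count, pct = missing_summary(phone_numbers)
print(f"{count} missing ({pct:.1f}%)")
```

Applied to the whole column, the same ratio is 185 missing out of 961 rows, which rounds to the 19.3% reported in the alerts.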

Sample

First rows
(index) | 연번 | 업종 | 여행사명 | 주소 | 전화번호
0 | 1 | 일반여행업 | 주식회사 스타일투어 | 창원시 의창구 의창대로247번길 10, 1층 (소답동) | 055-288-0015
1 | 2 | 일반여행업 | (주)럭키항공여행사 | 창원시 의창구 퇴촌로 15 (사림동, 1층) | <NA>
2 | 3 | 일반여행업 | 주식회사 엠에스투어 | 창원시 의창구 용지로169번길 3 (용호동, 401호) | <NA>
3 | 4 | 일반여행업 | (주)다모아투어 | 창원시 의창구 용지로 161, 201호 (용호동, 경남빌딩) | 055-266-1155
4 | 5 | 일반여행업 | (주)태평양항공여행사 | 창원시 의창구 원이대로 332 (대원동, 1층) | 055-600-3737
5 | 6 | 일반여행업 | 시원투어 주식회사 | 창원시 의창구 충혼로 53, 지하1층 (두대동, 골든힐 골프연습장) | 070-4035-2999
6 | 7 | 일반여행업 | (주)경남투어렌트카 | 창원시 의창구 중앙대로210번길 3 (신월동, 경남신문사 가동 4층) | <NA>
7 | 8 | 일반여행업 | (주)한찬코리아 | 창원시 의창구 천주로1번길 14 (서상동, 2층) | 055-601-1000
8 | 9 | 일반여행업 | (주)엠아이에스 | 창원시 의창구 원이대로 362 (대원동, 창원컨벤션센터1층 3호) | 055-212-1319
9 | 10 | 일반여행업 | 린월드투어 주식회사 | 창원시 의창구 동읍 의창대로915번길 17-16, 지하1층호 | 055-251-5519

Last rows
(index) | 연번 | 업종 | 여행사명 | 주소 | 전화번호
951 | 946 | 국내여행업 | ㈜대성항공여행사 | 거창군 거창읍 중앙로 135 | 055-945-5700
952 | 947 | 국내여행업 | (주)문수관광 | 거창군 거창읍 거창대로 76 | 055-945-0969
953 | 948 | 국내여행업 | ㈜거창백두관광 | 거창군 강남로 154 | 055-941-0102
954 | 949 | 국내여행업 | 이엠주식회사 | 거창군 대동리 69-1, 2층 | 055-944-6312
955 | 950 | 국내여행업 | 합천새천년관광㈜ | 합천군 합천읍 옥산로 102, 2층 | <NA>
956 | 951 | 국내여행업 | 주식회사매화관광여행 | 합천군 합천읍 서산실 17-3 | <NA>
957 | 952 | 국내여행업 | ㈜해인고속관광 | 합천군 합천읍 대야로 883 | <NA>
958 | 953 | 국내여행업 | 금화고속관광 | 합천군 삼가면 삼가로 123-4 | <NA>
959 | 954 | 국내여행업 | 경호관광주식회사 | 합천군 합천읍 대야로 901 | <NA>
960 | 955 | 국내여행업 | 위드합천협동조합 | 합천군 삼가면 삼가중앙길 21-7, 2층 | 055-934-2321