Overview

Dataset statistics

Number of variables6
Number of observations309
Missing cells58
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.9 KiB
Average record size in memory49.4 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description경상북도에 있는 여행 업체의 등록업종, 업체명, 대표자, 주소, 연락가능한 번호, 보증기관등을 공개함으로써 안전한 여행이 될 수 있도록 도움이 되는 정보입니다.
URLhttps://www.data.go.kr/data/3083600/fileData.do

Alerts

전화번호 has 58 (18.8%) missing valuesMissing
구분 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:50:10.654039
Analysis finished2023-12-12 06:50:11.409362
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Real number (ℝ)

UNIQUE 

Distinct309
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean155
Minimum1
Maximum309
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T15:50:11.498130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile16.4
Q178
median155
Q3232
95-th percentile293.6
Maximum309
Range308
Interquartile range (IQR)154

Descriptive statistics

Standard deviation89.344838
Coefficient of variation (CV)0.57641831
Kurtosis-1.2
Mean155
Median Absolute Deviation (MAD)77
Skewness0
Sum47895
Variance7982.5
MonotonicityStrictly increasing
2023-12-12T15:50:11.684812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
205 1
 
0.3%
212 1
 
0.3%
211 1
 
0.3%
210 1
 
0.3%
209 1
 
0.3%
208 1
 
0.3%
207 1
 
0.3%
206 1
 
0.3%
204 1
 
0.3%
Other values (299) 299
96.8%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
309 1
0.3%
308 1
0.3%
307 1
0.3%
306 1
0.3%
305 1
0.3%
304 1
0.3%
303 1
0.3%
302 1
0.3%
301 1
0.3%
300 1
0.3%

등록업종
Categorical

Distinct6
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
국내외 국내
144 
국내외
66 
국내
54 
종합
27 
종합 국내
 
9

Length

Max length9
Median length6
Mean length4.368932
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내외
2nd row국내외 국내
3rd row국내
4th row국내외 국내
5th row국내외 국내

Common Values

ValueCountFrequency (%)
국내외 국내 144
46.6%
국내외 66
21.4%
국내 54
 
17.5%
종합 27
 
8.7%
종합 국내 9
 
2.9%
종합 국내외 국내 9
 
2.9%

Length

2023-12-12T15:50:11.857422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:50:12.003066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외 219
45.6%
국내 216
45.0%
종합 45
 
9.4%
Distinct306
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T15:50:12.291333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length7.605178
Min length4

Characters and Unicode

Total characters2350
Distinct characters290
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique303 ?
Unique (%)98.1%

Sample

1st row주식21세기여행사
2nd row주식21세기여행사
3rd row개인237여행
4th row주식가람관광여행사
5th row개인가보투어
ValueCountFrequency (%)
주식21세기여행사 2
 
0.6%
개인투어코리아 2
 
0.6%
주식동양여행사 2
 
0.6%
주식하나여행사 2
 
0.6%
주식영천항공여행사 1
 
0.3%
주식신라교과서여행 1
 
0.3%
주식우진관광 1
 
0.3%
개인우주여행사 1
 
0.3%
주식우산국투어 1
 
0.3%
개인우리여행사 1
 
0.3%
Other values (299) 299
95.5%
2023-12-12T15:50:12.779534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
214
 
9.1%
198
 
8.4%
128
 
5.4%
125
 
5.3%
112
 
4.8%
103
 
4.4%
98
 
4.2%
89
 
3.8%
88
 
3.7%
69
 
2.9%
Other values (280) 1126
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2313
98.4%
Uppercase Letter 16
 
0.7%
Decimal Number 11
 
0.5%
Space Separator 4
 
0.2%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%
Other Symbol 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
214
 
9.3%
198
 
8.6%
128
 
5.5%
125
 
5.4%
112
 
4.8%
103
 
4.5%
98
 
4.2%
89
 
3.8%
88
 
3.8%
69
 
3.0%
Other values (258) 1089
47.1%
Uppercase Letter
ValueCountFrequency (%)
J 2
12.5%
S 2
12.5%
C 2
12.5%
A 2
12.5%
L 1
6.2%
E 1
6.2%
V 1
6.2%
R 1
6.2%
T 1
6.2%
G 1
6.2%
Other values (2) 2
12.5%
Decimal Number
ValueCountFrequency (%)
2 4
36.4%
1 3
27.3%
8 2
18.2%
3 1
 
9.1%
7 1
 
9.1%
Space Separator
ValueCountFrequency (%)
4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2314
98.5%
Common 20
 
0.9%
Latin 16
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
214
 
9.2%
198
 
8.6%
128
 
5.5%
125
 
5.4%
112
 
4.8%
103
 
4.5%
98
 
4.2%
89
 
3.8%
88
 
3.8%
69
 
3.0%
Other values (259) 1090
47.1%
Latin
ValueCountFrequency (%)
J 2
12.5%
S 2
12.5%
C 2
12.5%
A 2
12.5%
L 1
6.2%
E 1
6.2%
V 1
6.2%
R 1
6.2%
T 1
6.2%
G 1
6.2%
Other values (2) 2
12.5%
Common
ValueCountFrequency (%)
2 4
20.0%
4
20.0%
1 3
15.0%
8 2
10.0%
( 2
10.0%
) 2
10.0%
3 1
 
5.0%
7 1
 
5.0%
, 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2313
98.4%
ASCII 36
 
1.5%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
214
 
9.3%
198
 
8.6%
128
 
5.5%
125
 
5.4%
112
 
4.8%
103
 
4.5%
98
 
4.2%
89
 
3.8%
88
 
3.8%
69
 
3.0%
Other values (258) 1089
47.1%
ASCII
ValueCountFrequency (%)
2 4
 
11.1%
4
 
11.1%
1 3
 
8.3%
J 2
 
5.6%
S 2
 
5.6%
C 2
 
5.6%
A 2
 
5.6%
8 2
 
5.6%
( 2
 
5.6%
) 2
 
5.6%
Other values (11) 11
30.6%
None
ValueCountFrequency (%)
1
100.0%
Distinct52
Distinct (%)16.8%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T15:50:13.017071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.0647249
Min length3

Characters and Unicode

Total characters947
Distinct characters54
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)8.1%

Sample

1st row권ㅇㅇ
2nd row최ㅇㅇ
3rd row남ㅇㅇ
4th row김ㅇㅇ
5th row김ㅇㅇ
ValueCountFrequency (%)
김ㅇㅇ 78
24.6%
이ㅇㅇ 43
13.6%
박ㅇㅇ 27
 
8.5%
권ㅇㅇ 15
 
4.7%
정ㅇㅇ 14
 
4.4%
최ㅇㅇ 10
 
3.2%
조ㅇㅇ 10
 
3.2%
강ㅇㅇ 9
 
2.8%
임ㅇㅇ 9
 
2.8%
신ㅇㅇ 7
 
2.2%
Other values (41) 95
30.0%
2023-12-12T15:50:13.416426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
618
65.3%
78
 
8.2%
43
 
4.5%
27
 
2.9%
15
 
1.6%
14
 
1.5%
10
 
1.1%
10
 
1.1%
9
 
1.0%
9
 
1.0%
Other values (44) 114
 
12.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 935
98.7%
Space Separator 8
 
0.8%
Decimal Number 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
618
66.1%
78
 
8.3%
43
 
4.6%
27
 
2.9%
15
 
1.6%
14
 
1.5%
10
 
1.1%
10
 
1.1%
9
 
1.0%
9
 
1.0%
Other values (42) 102
 
10.9%
Space Separator
ValueCountFrequency (%)
8
100.0%
Decimal Number
ValueCountFrequency (%)
1 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 935
98.7%
Common 12
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
618
66.1%
78
 
8.3%
43
 
4.6%
27
 
2.9%
15
 
1.6%
14
 
1.5%
10
 
1.1%
10
 
1.1%
9
 
1.0%
9
 
1.0%
Other values (42) 102
 
10.9%
Common
ValueCountFrequency (%)
8
66.7%
1 4
33.3%

Most occurring blocks

ValueCountFrequency (%)
Compat Jamo 618
65.3%
Hangul 317
33.5%
ASCII 12
 
1.3%

Most frequent character per block

Compat Jamo
ValueCountFrequency (%)
618
100.0%
Hangul
ValueCountFrequency (%)
78
24.6%
43
13.6%
27
 
8.5%
15
 
4.7%
14
 
4.4%
10
 
3.2%
10
 
3.2%
9
 
2.8%
9
 
2.8%
7
 
2.2%
Other values (41) 95
30.0%
ASCII
ValueCountFrequency (%)
8
66.7%
1 4
33.3%
Distinct297
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T15:50:13.790210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length36
Mean length21.478964
Min length14

Characters and Unicode

Total characters6637
Distinct characters237
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique286 ?
Unique (%)92.6%

Sample

1st row경상북도 포항시 북구 천마로 93, 202호
2nd row경상북도 포항시 북구 대곡로 47, 효성빌딩 1층
3rd row경상북도 포항시 북구 신광면 신흥로 383
4th row경상북도 포항시 남구 대이로 47, 2층
5th row경상북도 상주시 왕산로 347
ValueCountFrequency (%)
경상북도 309
 
19.9%
포항시 61
 
3.9%
경주시 49
 
3.2%
안동시 48
 
3.1%
2층 48
 
3.1%
구미시 45
 
2.9%
북구 36
 
2.3%
남구 25
 
1.6%
울릉읍 17
 
1.1%
울릉군 17
 
1.1%
Other values (550) 896
57.8%
2023-12-12T15:50:14.282488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1242
18.7%
385
 
5.8%
360
 
5.4%
330
 
5.0%
328
 
4.9%
256
 
3.9%
1 252
 
3.8%
240
 
3.6%
2 230
 
3.5%
, 115
 
1.7%
Other values (227) 2899
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4023
60.6%
Space Separator 1242
 
18.7%
Decimal Number 1162
 
17.5%
Other Punctuation 115
 
1.7%
Dash Punctuation 80
 
1.2%
Uppercase Letter 7
 
0.1%
Close Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
385
 
9.6%
360
 
8.9%
330
 
8.2%
328
 
8.2%
256
 
6.4%
240
 
6.0%
115
 
2.9%
108
 
2.7%
108
 
2.7%
89
 
2.2%
Other values (204) 1704
42.4%
Decimal Number
ValueCountFrequency (%)
1 252
21.7%
2 230
19.8%
3 115
9.9%
0 103
8.9%
6 90
 
7.7%
4 89
 
7.7%
5 85
 
7.3%
8 72
 
6.2%
7 69
 
5.9%
9 57
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
28.6%
S 1
14.3%
G 1
14.3%
B 1
14.3%
H 1
14.3%
N 1
14.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
k 1
50.0%
Space Separator
ValueCountFrequency (%)
1242
100.0%
Other Punctuation
ValueCountFrequency (%)
, 115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4023
60.6%
Common 2605
39.2%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
385
 
9.6%
360
 
8.9%
330
 
8.2%
328
 
8.2%
256
 
6.4%
240
 
6.0%
115
 
2.9%
108
 
2.7%
108
 
2.7%
89
 
2.2%
Other values (204) 1704
42.4%
Common
ValueCountFrequency (%)
1242
47.7%
1 252
 
9.7%
2 230
 
8.8%
, 115
 
4.4%
3 115
 
4.4%
0 103
 
4.0%
6 90
 
3.5%
4 89
 
3.4%
5 85
 
3.3%
- 80
 
3.1%
Other values (5) 204
 
7.8%
Latin
ValueCountFrequency (%)
A 2
22.2%
S 1
11.1%
G 1
11.1%
B 1
11.1%
s 1
11.1%
k 1
11.1%
H 1
11.1%
N 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4023
60.6%
ASCII 2614
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1242
47.5%
1 252
 
9.6%
2 230
 
8.8%
, 115
 
4.4%
3 115
 
4.4%
0 103
 
3.9%
6 90
 
3.4%
4 89
 
3.4%
5 85
 
3.3%
- 80
 
3.1%
Other values (13) 213
 
8.1%
Hangul
ValueCountFrequency (%)
385
 
9.6%
360
 
8.9%
330
 
8.2%
328
 
8.2%
256
 
6.4%
240
 
6.0%
115
 
2.9%
108
 
2.7%
108
 
2.7%
89
 
2.2%
Other values (204) 1704
42.4%

전화번호
Text

MISSING 

Distinct245
Distinct (%)97.6%
Missing58
Missing (%)18.8%
Memory size2.5 KiB
2023-12-12T15:50:14.613088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.908367
Min length5

Characters and Unicode

Total characters2989
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique241 ?
Unique (%)96.0%

Sample

1st row054-244-0021
2nd row054-251-3800
3rd row054-535-8582
4th row054-741-4838
5th row054-462-2636
ValueCountFrequency (%)
054-434-4717 3
 
1.2%
054-843-7942 3
 
1.2%
054-462-2636 2
 
0.8%
054-275-3456 2
 
0.8%
054-255-5141 1
 
0.4%
054-462-8216 1
 
0.4%
054-791-3636 1
 
0.4%
054-372-5050 1
 
0.4%
054-742-0066 1
 
0.4%
10127 1
 
0.4%
Other values (235) 235
93.6%
2023-12-12T15:50:15.151306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 498
16.7%
0 473
15.8%
4 447
15.0%
5 408
13.7%
7 237
7.9%
2 183
 
6.1%
3 179
 
6.0%
8 178
 
6.0%
1 169
 
5.7%
6 118
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2491
83.3%
Dash Punctuation 498
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 473
19.0%
4 447
17.9%
5 408
16.4%
7 237
9.5%
2 183
 
7.3%
3 179
 
7.2%
8 178
 
7.1%
1 169
 
6.8%
6 118
 
4.7%
9 99
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 498
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2989
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 498
16.7%
0 473
15.8%
4 447
15.0%
5 408
13.7%
7 237
7.9%
2 183
 
6.1%
3 179
 
6.0%
8 178
 
6.0%
1 169
 
5.7%
6 118
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2989
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 498
16.7%
0 473
15.8%
4 447
15.0%
5 408
13.7%
7 237
7.9%
2 183
 
6.1%
3 179
 
6.0%
8 178
 
6.0%
1 169
 
5.7%
6 118
 
3.9%

Interactions

2023-12-12T15:50:11.109314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:50:15.312486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등록업종대표자
구분1.0000.0570.216
등록업종0.0571.0000.000
대표자0.2160.0001.000
2023-12-12T15:50:15.415381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등록업종
구분1.0000.028
등록업종0.0281.000

Missing values

2023-12-12T15:50:11.237478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:50:11.365289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분등록업종업체명대표자도로명 주소전화번호
01국내외주식21세기여행사권ㅇㅇ경상북도 포항시 북구 천마로 93, 202호<NA>
12국내외 국내주식21세기여행사최ㅇㅇ경상북도 포항시 북구 대곡로 47, 효성빌딩 1층054-244-0021
23국내개인237여행남ㅇㅇ경상북도 포항시 북구 신광면 신흥로 383<NA>
34국내외 국내주식가람관광여행사김ㅇㅇ경상북도 포항시 남구 대이로 47, 2층054-251-3800
45국내외 국내개인가보투어김ㅇㅇ경상북도 상주시 왕산로 347054-535-8582
56국내외 국내주식가자투어강ㅇㅇ경상북도 경주시 태종로685번길 35054-741-4838
67국내외개인감승투어권ㅇㅇ경상북도 구미시 신시로10길 110-5, 1층054-462-2636
78국내외 국내주식강남투어도ㅇㅇ경상북도 경주시 태종로 458, 3층<NA>
89종합 국내주식강산투어김ㅇㅇ경상북도 경주시 용담로 82054-743-5050
910국내주식같이가자여행사견ㅇㅇ경상북도 울릉군 울릉읍 봉래2길 30-4<NA>
구분등록업종업체명대표자도로명 주소전화번호
299300국내외 국내주식황악산관광김ㅇㅇ경상북도 김천시 김천로 23-1054-431-0707
300301국내주식히어로박ㅇㅇ경상북도 안동시 영가로 16, 경북문화콘텐츠진흥원 513호<NA>
301302국내외주식힐링여행사박ㅇㅇ경상북도 포항시 북구 천마로 42054-232-8858
302303국내외 국내주식힐링투어최ㅇㅇ경상북도 칠곡군 왜관읍 관문로 24-1054-973-0900
303304국내외개인힐링투어강ㅇㅇ경상북도 포항시 북구 흥해읍 초곡지구로58번길 82, 108동 1602호<NA>
304305국내외개인ABC여행사이ㅇㅇ경상북도 포항시 북구 삼호로 451054-762-9722
305306국내외개인CK디자인연구소정ㅇㅇ경상북도 칠곡군 왜관읍 2번도로길 52, 102호<NA>
306307국내외 국내개인GJS박ㅇㅇ경상북도 경주시 충효중앙길 22-9, 혁원오피스텔 202호054-753-0999
307308국내외 국내개인JS인터네셔널이ㅇㅇ경상북도 경주시 안강읍 구부랑4길 2<NA>
308309국내외개인TRAVEL882김ㅇㅇ경상북도 포항시 남구 지곡로 394, 101동 101호054-614-8700