Overview

Dataset statistics

Number of variables: 5
Number of observations: 225
Missing cells: 64
Missing cells (%): 5.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.1 KiB
Average record size in memory: 41.6 B
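
As a sanity check, the missing-cell percentage can be recomputed from the counts in this block: 64 missing cells out of 5 × 225 cells in total. The sketch below is illustrative only; the function name is hypothetical and is not part of ydata-profiling.

```python
def missing_cell_percentage(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of missing cells among all cells of the table, in percent."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


# Counts taken from the "Dataset statistics" block of this report.
pct = missing_cell_percentage(missing_cells=64, n_variables=5, n_observations=225)
print(f"{pct:.1f}%")
```

All 64 missing cells come from a single column (전화번호), which is why the per-column missing rate reported in the alerts is much higher than this table-wide figure.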

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

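Description: Status of travel agencies in Ulsan Metropolitan City (울산광역시), providing business type, business name, location, phone number, road-name address, and lot-number address.
Author: 울산광역시 (Ulsan Metropolitan City)
URL: https://www.data.go.kr/data/15052667/fileData.do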

Alerts

구분 is highly overall correlated with 등록업종 (High correlation)
등록업종 is highly overall correlated with 구분 (High correlation)
전화번호 has 64 (28.4%) missing values (Missing)
구분 has unique values (Unique)

Reproduction

Analysis started: 2024-03-15 02:14:04.745040
Analysis finished: 2024-03-15 02:14:05.905680
Duration: 1.16 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
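
The reported duration can be checked against the two timestamps in this block. A minimal sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started" / "Analysis finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-15 02:14:04.745040", FMT)
finished = datetime.strptime("2024-03-15 02:14:05.905680", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} s")
```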

Variables

구분 (row serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 225
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 113
Minimum: 1
Maximum: 225
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 12.2
Q1: 57
Median: 113
Q3: 169
95-th percentile: 213.8
Maximum: 225
Range: 224
Interquartile range (IQR): 112
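
These quantiles are consistent with linear interpolation between order statistics, assuming 구분 is simply the sequence 1, 2, ..., 225 (225 distinct values, minimum 1, maximum 225). The sketch below reproduces them under that assumption; the helper `percentile` is illustrative and is not the profiler's own implementation.

```python
import math


def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1] over pre-sorted data."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Assumption: 구분 takes every integer from 1 to 225 exactly once.
values = list(range(1, 226))

q05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
q95 = percentile(values, 0.95)
iqr = q3 - q1
value_range = values[-1] - values[0]

print(round(q05, 1), q1, median, q3, round(q95, 1), iqr, value_range)
```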

Descriptive statistics

Standard deviation: 65.096083
Coefficient of variation (CV): 0.57607153
Kurtosis: -1.2
Mean: 113
Median Absolute Deviation (MAD): 56
Skewness: 0
Sum: 25425
Variance: 4237.5
Monotonicity: Strictly increasing
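
The descriptive statistics match what one would expect if 구분 is the sequence 1, 2, ..., 225 and the variance uses the sample (n − 1) denominator. The following sketch recomputes them with the Python standard library under that assumption; it is a cross-check, not the profiler's code.

```python
import statistics

# Assumption: 구분 is the integer sequence 1..225 (strictly increasing, all unique).
values = list(range(1, 226))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances to the median.
mad = statistics.median(abs(v - median) for v in values)
total = sum(values)

print(mean, variance, round(std_dev, 6), round(cv, 8), mad, total)
```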
[Figure: Histogram with fixed size bins (bins=50)]
Common values

Value | Count | Frequency (%)
1 | 1 | 0.4%
170 | 1 | 0.4%
144 | 1 | 0.4%
145 | 1 | 0.4%
146 | 1 | 0.4%
147 | 1 | 0.4%
148 | 1 | 0.4%
149 | 1 | 0.4%
150 | 1 | 0.4%
151 | 1 | 0.4%
Other values (215) | 215 | 95.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
225 | 1 | 0.4%
224 | 1 | 0.4%
223 | 1 | 0.4%
222 | 1 | 0.4%
221 | 1 | 0.4%
220 | 1 | 0.4%
219 | 1 | 0.4%
218 | 1 | 0.4%
217 | 1 | 0.4%
216 | 1 | 0.4%

등록업종 (registered business type)
Categorical

HIGH CORRELATION 

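Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
국내외여행업 (domestic and overseas travel business): 137
종합여행업 (general travel business): 61
국내여행업 (domestic travel business): 27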

Length

Max length: 6
Median length: 6
Mean length: 5.6088889
Min length: 5
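
The length statistics follow directly from the three category counts reported for this variable: one label is 6 characters long and the other two are 5. A short illustrative sketch (variable names are hypothetical) that recomputes the mean length and the category shares:

```python
# Category counts as reported for 등록업종.
counts = {
    "국내외여행업": 137,  # 6 characters
    "종합여행업": 61,     # 5 characters
    "국내여행업": 27,     # 5 characters
}

total = sum(counts.values())

# Mean label length, weighted by how often each label occurs.
mean_length = sum(len(label) * n for label, n in counts.items()) / total

# Share of each category in percent, rounded to one decimal place.
shares = {label: round(100 * n / total, 1) for label, n in counts.items()}

print(total, round(mean_length, 7), shares)
```

The weighted mean is (137 × 6 + 88 × 5) / 225 = 1262 / 225, which agrees with the reported 5.6088889.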

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내외여행업
2nd row: 종합여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
국내외여행업 | 137 | 60.9%
종합여행업 | 61 | 27.1%
국내여행업 | 27 | 12.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
국내외여행업 | 137 | 60.9%
종합여행업 | 61 | 27.1%
국내여행업 | 27 | 12.0%

업체명 (business name)
Text

Distinct: 215
Distinct (%): 95.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 19
Median length: 15
Mean length: 7.9733333
Min length: 3

Characters and Unicode

Total characters: 1794
Distinct characters: 276
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
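
To illustrate how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the five sample values listed for this variable. It is a minimal illustration, not the profiler's own implementation; the two-letter codes correspond to the category names used in this report (`Lo` = Other Letter, `So` = Other Symbol, `Zs` = Space Separator).

```python
import unicodedata
from collections import Counter

# The five sample rows shown for this variable.
sample_names = [
    "㈜울산방송",
    "㈜울산방송",
    "㈜월드라온",
    "중앙관광여행사",
    "웰리힐리파크 탑스투어",
]

# Count the general category of every character across all sample values.
category_counts = Counter(
    unicodedata.category(ch) for name in sample_names for ch in name
)

print(dict(category_counts))
```

Hangul syllables fall under Other Letter, while the enclosed symbol ㈜ is classified as Other Symbol, which explains why that category appears in the tables for this variable.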

Unique

Unique: 205
Unique (%): 91.1%

Sample

1st row: ㈜울산방송
2nd row: ㈜울산방송
3rd row: ㈜월드라온
4th row: 중앙관광여행사
5th row: 웰리힐리파크 탑스투어

Value | Count | Frequency (%)
주식회사 | 34 | 11.1%
여행사 | 14 | 4.6%
 | 6 | 2.0%
투어 | 5 | 1.6%
월드투어 | 2 | 0.7%
주)힐링투어 | 2 | 0.7%
글로벌투어 | 2 | 0.7%
투어마켓 | 2 | 0.7%
㈜울산방송 | 2 | 0.7%
엔돌핀여행사 | 2 | 0.7%
Other values (229) | 234 | 76.7%

Most occurring characters

ValueCountFrequency (%)
127
 
7.1%
103
 
5.7%
103
 
5.7%
101
 
5.6%
80
 
4.5%
( 67
 
3.7%
) 67
 
3.7%
67
 
3.7%
63
 
3.5%
38
 
2.1%
Other values (266) 978
54.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1487 | 82.9%
Space Separator | 80 | 4.5%
Open Punctuation | 67 | 3.7%
Close Punctuation | 67 | 3.7%
Uppercase Letter | 43 | 2.4%
Other Symbol | 26 | 1.4%
Lowercase Letter | 19 | 1.1%
Decimal Number | 3 | 0.2%
Other Punctuation | 1 | 0.1%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
127
 
8.5%
103
 
6.9%
103
 
6.9%
101
 
6.8%
67
 
4.5%
63
 
4.2%
38
 
2.6%
37
 
2.5%
36
 
2.4%
25
 
1.7%
Other values (231) 787
52.9%
Uppercase Letter
ValueCountFrequency (%)
O 6
14.0%
K 4
9.3%
U 4
9.3%
T 4
9.3%
S 3
 
7.0%
A 3
 
7.0%
C 3
 
7.0%
R 3
 
7.0%
L 3
 
7.0%
H 2
 
4.7%
Other values (6) 8
18.6%
Lowercase Letter
ValueCountFrequency (%)
o 4
21.1%
m 2
10.5%
t 2
10.5%
i 2
10.5%
n 2
10.5%
u 2
10.5%
c 2
10.5%
r 1
 
5.3%
s 1
 
5.3%
a 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
5 1
33.3%
3 1
33.3%
6 1
33.3%
Space Separator
ValueCountFrequency (%)
80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Other Symbol
ValueCountFrequency (%)
26
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1513 | 84.3%
Common | 219 | 12.2%
Latin | 62 | 3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
127
 
8.4%
103
 
6.8%
103
 
6.8%
101
 
6.7%
67
 
4.4%
63
 
4.2%
38
 
2.5%
37
 
2.4%
36
 
2.4%
26
 
1.7%
Other values (232) 812
53.7%
Latin
ValueCountFrequency (%)
O 6
 
9.7%
K 4
 
6.5%
U 4
 
6.5%
o 4
 
6.5%
T 4
 
6.5%
S 3
 
4.8%
A 3
 
4.8%
C 3
 
4.8%
R 3
 
4.8%
L 3
 
4.8%
Other values (16) 25
40.3%
Common
ValueCountFrequency (%)
80
36.5%
( 67
30.6%
) 67
30.6%
5 1
 
0.5%
' 1
 
0.5%
3 1
 
0.5%
6 1
 
0.5%
- 1
 
0.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1487 | 82.9%
ASCII | 281 | 15.7%
None | 26 | 1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
127
 
8.5%
103
 
6.9%
103
 
6.9%
101
 
6.8%
67
 
4.5%
63
 
4.2%
38
 
2.6%
37
 
2.5%
36
 
2.4%
25
 
1.7%
Other values (231) 787
52.9%
ASCII
ValueCountFrequency (%)
80
28.5%
( 67
23.8%
) 67
23.8%
O 6
 
2.1%
K 4
 
1.4%
U 4
 
1.4%
o 4
 
1.4%
T 4
 
1.4%
S 3
 
1.1%
A 3
 
1.1%
Other values (24) 39
13.9%
None
ValueCountFrequency (%)
26
100.0%

도로명 주소 (road-name address)
Text

Distinct: 214
Distinct (%): 95.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 53
Median length: 43
Mean length: 28.031111
Min length: 20

Characters and Unicode

Total characters: 6307
Distinct characters: 237
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 204
Unique (%): 90.7%

Sample

1st row: 울산광역시 중구 구교로 41(학산동)
2nd row: 울산광역시 중구 구교로 41(학산동)
3rd row: 울산광역시 중구 신기5길 3-1, 1층(태화동)
4th row: 울산광역시 중구 태화로 133, 1층(태화동)
5th row: 울산광역시 중구 서원1길 37, 지하1층(반구동)

Value | Count | Frequency (%)
울산광역시 | 225 | 17.1%
남구 | 119 | 9.1%
중구 | 48 | 3.7%
삼산동 | 32 | 2.4%
신정동 | 29 | 2.2%
울주군 | 29 | 2.2%
1층 | 28 | 2.1%
2층 | 27 | 2.1%
달동 | 23 | 1.8%
동구 | 17 | 1.3%
Other values (456) | 735 | 56.0%

Most occurring characters

ValueCountFrequency (%)
1089
 
17.3%
322
 
5.1%
267
 
4.2%
236
 
3.7%
229
 
3.6%
228
 
3.6%
226
 
3.6%
1 223
 
3.5%
215
 
3.4%
( 201
 
3.2%
Other values (227) 3071
48.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3635 | 57.6%
Space Separator | 1089 | 17.3%
Decimal Number | 968 | 15.3%
Open Punctuation | 201 | 3.2%
Close Punctuation | 201 | 3.2%
Other Punctuation | 170 | 2.7%
Dash Punctuation | 27 | 0.4%
Uppercase Letter | 9 | 0.1%
Lowercase Letter | 7 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
322
 
8.9%
267
 
7.3%
236
 
6.5%
229
 
6.3%
228
 
6.3%
226
 
6.2%
215
 
5.9%
178
 
4.9%
142
 
3.9%
100
 
2.8%
Other values (202) 1492
41.0%
Decimal Number
ValueCountFrequency (%)
1 223
23.0%
2 182
18.8%
3 118
12.2%
0 93
9.6%
4 88
 
9.1%
5 70
 
7.2%
6 54
 
5.6%
7 53
 
5.5%
9 47
 
4.9%
8 40
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
B 5
55.6%
C 1
 
11.1%
T 1
 
11.1%
F 1
 
11.1%
H 1
 
11.1%
Lowercase Letter
ValueCountFrequency (%)
c 2
28.6%
h 2
28.6%
e 1
14.3%
g 1
14.3%
i 1
14.3%
Space Separator
ValueCountFrequency (%)
1089
100.0%
Open Punctuation
ValueCountFrequency (%)
( 201
100.0%
Close Punctuation
ValueCountFrequency (%)
) 201
100.0%
Other Punctuation
ValueCountFrequency (%)
, 170
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3635 | 57.6%
Common | 2656 | 42.1%
Latin | 16 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
322
 
8.9%
267
 
7.3%
236
 
6.5%
229
 
6.3%
228
 
6.3%
226
 
6.2%
215
 
5.9%
178
 
4.9%
142
 
3.9%
100
 
2.8%
Other values (202) 1492
41.0%
Common
ValueCountFrequency (%)
1089
41.0%
1 223
 
8.4%
( 201
 
7.6%
) 201
 
7.6%
2 182
 
6.9%
, 170
 
6.4%
3 118
 
4.4%
0 93
 
3.5%
4 88
 
3.3%
5 70
 
2.6%
Other values (5) 221
 
8.3%
Latin
ValueCountFrequency (%)
B 5
31.2%
c 2
 
12.5%
h 2
 
12.5%
C 1
 
6.2%
e 1
 
6.2%
T 1
 
6.2%
g 1
 
6.2%
i 1
 
6.2%
F 1
 
6.2%
H 1
 
6.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3635 | 57.6%
ASCII | 2672 | 42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1089
40.8%
1 223
 
8.3%
( 201
 
7.5%
) 201
 
7.5%
2 182
 
6.8%
, 170
 
6.4%
3 118
 
4.4%
0 93
 
3.5%
4 88
 
3.3%
5 70
 
2.6%
Other values (15) 237
 
8.9%
Hangul
ValueCountFrequency (%)
322
 
8.9%
267
 
7.3%
236
 
6.5%
229
 
6.3%
228
 
6.3%
226
 
6.2%
215
 
5.9%
178
 
4.9%
142
 
3.9%
100
 
2.8%
Other values (202) 1492
41.0%

전화번호 (phone number)
Text

MISSING 

Distinct: 155
Distinct (%): 96.3%
Missing: 64
Missing (%): 28.4%
Memory size: 1.9 KiB
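
The two percentages in this block use different denominators, which is easy to miss: the missing rate is taken over all 225 rows, while the distinct rate appears to be taken over the 161 non-missing values only (155 / 225 would give 68.9%, not 96.3%). A small illustrative check, with hypothetical variable names:

```python
n_observations = 225  # rows in the dataset
n_missing = 64        # rows where 전화번호 is missing
n_distinct = 155      # distinct non-missing phone numbers

n_present = n_observations - n_missing

# Missing rate relative to all rows.
missing_pct = 100 * n_missing / n_observations

# Distinct rate relative to non-missing rows (assumed from the reported figure).
distinct_pct = 100 * n_distinct / n_present

print(n_present, round(missing_pct, 1), round(distinct_pct, 1))
```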

Length

Max length: 13
Median length: 12
Mean length: 12.012422
Min length: 9

Characters and Unicode

Total characters: 1934
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 149
Unique (%): 92.5%

Sample

1st row: 052-228-6160
2nd row: 052-228-6160
3rd row: 052-977-7805
4th row: 052-242-3000
5th row: 052-267-7741

Value | Count | Frequency (%)
052-267-9190 | 2 | 1.2%
052-258-0325 | 2 | 1.2%
052-716-0043 | 2 | 1.2%
052-228-6160 | 2 | 1.2%
052-225-2788 | 2 | 1.2%
052-257-6997 | 2 | 1.2%
052-272-0021 | 1 | 0.6%
052-265-1778 | 1 | 0.6%
052-277-0055 | 1 | 0.6%
052-258-9977 | 1 | 0.6%
Other values (145) | 145 | 90.1%

Most occurring characters

Value | Count | Frequency (%)
2 | 404 | 20.9%
0 | 324 | 16.8%
- | 321 | 16.6%
5 | 255 | 13.2%
7 | 122 | 6.3%
1 | 90 | 4.7%
6 | 89 | 4.6%
8 | 84 | 4.3%
4 | 84 | 4.3%
3 | 83 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1613 | 83.4%
Dash Punctuation | 321 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 404 | 25.0%
0 | 324 | 20.1%
5 | 255 | 15.8%
7 | 122 | 7.6%
1 | 90 | 5.6%
6 | 89 | 5.5%
8 | 84 | 5.2%
4 | 84 | 5.2%
3 | 83 | 5.1%
9 | 78 | 4.8%
Dash Punctuation
Value | Count | Frequency (%)
- | 321 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1934 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 404 | 20.9%
0 | 324 | 16.8%
- | 321 | 16.6%
5 | 255 | 13.2%
7 | 122 | 6.3%
1 | 90 | 4.7%
6 | 89 | 4.6%
8 | 84 | 4.3%
4 | 84 | 4.3%
3 | 83 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1934 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 404 | 20.9%
0 | 324 | 16.8%
- | 321 | 16.6%
5 | 255 | 13.2%
7 | 122 | 6.3%
1 | 90 | 4.7%
6 | 89 | 4.6%
8 | 84 | 4.3%
4 | 84 | 4.3%
3 | 83 | 4.3%

Interactions


Correlations

Variable | 구분 | 등록업종
구분 | 1.000 | 0.758
등록업종 | 0.758 | 1.000

Variable | 구분 | 등록업종
구분 | 1.000 | 0.622
등록업종 | 0.622 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 구분 | 등록업종 | 업체명 | 도로명 주소 | 전화번호
0 | 1 | 국내외여행업 | ㈜울산방송 | 울산광역시 중구 구교로 41(학산동) | 052-228-6160
1 | 2 | 종합여행업 | ㈜울산방송 | 울산광역시 중구 구교로 41(학산동) | 052-228-6160
2 | 3 | 국내여행업 | ㈜월드라온 | 울산광역시 중구 신기5길 3-1, 1층(태화동) | 052-977-7805
3 | 4 | 국내여행업 | 중앙관광여행사 | 울산광역시 중구 태화로 133, 1층(태화동) | 052-242-3000
4 | 5 | 국내여행업 | 웰리힐리파크 탑스투어 | 울산광역시 중구 서원1길 37, 지하1층(반구동) | 052-267-7741
5 | 6 | 국내외여행업 | 미래로여행사 | 울산광역시 중구 번영로 539, 2층(약사동) | 052-282-2950
6 | 7 | 국내외여행업 | ㈜평화관광 | 울산광역시 중구 태화로 204(태화동) | 052-243-0013
7 | 8 | 국내외여행업 | ㈜현대관광 | 울산광역시 중구 염포로 62-1(반구동) | 052-249-4300
8 | 9 | 국내외여행업 | ㈜허브여행사 | 울산광역시 중구 함월4길 9, 101호(성안동) | 052-242-1253
9 | 10 | 국내외여행업 | ㈜투어엔 | 울산광역시 중구 난곡로 36, 1층(태화동) | 052-249-9303

Last rows

Index | 구분 | 등록업종 | 업체명 | 도로명 주소 | 전화번호
215 | 216 | 국내외여행업 | 365 국내외 전문여행사 | 울산광역시 울주군 온산읍 신경7길 4, 2층 | 052-239-9500
216 | 217 | 국내여행업 | (주)청녹관광여행사 | 울산광역시 울주군 청량읍 온산로 740-6 | 052-261-8080
217 | 218 | 국내여행업 | 세명여행사 | 울산광역시 울주군 삼남읍 서향교1길 37 | 052-254-5323
218 | 219 | 국내여행업 | 울산고속관광(주) | 울산광역시 울주군 삼남읍 봉당골2길 8 | 052-254-1345
219 | 220 | 국내여행업 | (주)글로벌 여행사 | 울산광역시 울주군 범서읍 구영로 82, 이프라자 | 052-257-6997
220 | 221 | 국내여행업 | 투어마켓 | 울산광역시 울주군 언양읍 서부2길 16 | <NA>
221 | 222 | 국내여행업 | (주)힐링투어 | 울산광역시 울주군 온양읍 남창역길 83 | 052-716-0043
222 | 223 | 국내여행업 | 주식회사 착한도공 | 울산광역시 울주군 온양읍 연안8길 23-3, 1층 | 052-238-0103
223 | 224 | 국내여행업 | 울산무장애관광지원센터 사회적협동조합 | 울산광역시 울주군 범서읍 천상중앙길 110, 천상종합상가 다동 141호 | 052-248-4860
224 | 225 | 국내여행업 | 울산출입국여행사 | 울산광역시 울주군 온산읍 덕신로 275-2 | 052-707-8555