Overview

Dataset statistics

Number of variables5
Number of observations24
Missing cells4
Missing cells (%)3.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory46.5 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description영천시에 있는 여행사의 현황을 조사하여 업종명, 상호명, 도로명주소, 지번주소, 전화번호 등의 항목을 제공합니다.
Author경상북도 영천시
URLhttps://www.data.go.kr/data/15110092/fileData.do

Alerts

연번 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번High correlation
전화번호 has 4 (16.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-16 16:03:41.330965
Analysis finished2023-12-16 16:03:43.019285
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.5
Minimum1
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-16T16:03:43.261850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.15
Q16.75
median12.5
Q318.25
95-th percentile22.85
Maximum24
Range23
Interquartile range (IQR)11.5

Descriptive statistics

Standard deviation7.0710678
Coefficient of variation (CV)0.56568542
Kurtosis-1.2
Mean12.5
Median Absolute Deviation (MAD)6
Skewness0
Sum300
Variance50
MonotonicityStrictly increasing
2023-12-16T16:03:43.731206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1 1
 
4.2%
14 1
 
4.2%
24 1
 
4.2%
23 1
 
4.2%
22 1
 
4.2%
21 1
 
4.2%
20 1
 
4.2%
19 1
 
4.2%
18 1
 
4.2%
17 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
1 1
4.2%
2 1
4.2%
3 1
4.2%
4 1
4.2%
5 1
4.2%
6 1
4.2%
7 1
4.2%
8 1
4.2%
9 1
4.2%
10 1
4.2%
ValueCountFrequency (%)
24 1
4.2%
23 1
4.2%
22 1
4.2%
21 1
4.2%
20 1
4.2%
19 1
4.2%
18 1
4.2%
17 1
4.2%
16 1
4.2%
15 1
4.2%

업종명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size324.0 B
국내여행업
10 
국내외여행업
국외여행업
종합여행업

Length

Max length6
Median length5
Mean length5.2916667
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내여행업 10
41.7%
국내외여행업 7
29.2%
국외여행업 4
 
16.7%
종합여행업 3
 
12.5%

Length

2023-12-16T16:03:44.172127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T16:03:44.688797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내여행업 10
41.7%
국내외여행업 7
29.2%
국외여행업 4
 
16.7%
종합여행업 3
 
12.5%
Distinct19
Distinct (%)79.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
2023-12-16T16:03:45.338644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length10
Mean length7.6666667
Min length4

Characters and Unicode

Total characters184
Distinct characters73
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)58.3%

Sample

1st row㈜천마관광여행사
2nd row㈜철마관광
3rd row뉴세화고속관광㈜
4th row㈜아세아여행사
5th row㈜노블투어
ValueCountFrequency (%)
㈜천마관광여행사 2
 
7.4%
㈜노블투어 2
 
7.4%
㈜영천항공여행사 2
 
7.4%
뉴세화고속관광㈜ 2
 
7.4%
㈜아세아여행사 2
 
7.4%
주식회사 2
 
7.4%
하람기획 1
 
3.7%
굿샤인투어(goodshine 1
 
3.7%
㈜영천관광해외여행사 1
 
3.7%
가영투어 1
 
3.7%
Other values (11) 11
40.7%
2023-12-16T16:03:47.209152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
 
7.6%
14
 
7.6%
12
 
6.5%
12
 
6.5%
8
 
4.3%
8
 
4.3%
6
 
3.3%
5
 
2.7%
5
 
2.7%
4
 
2.2%
Other values (63) 96
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 152
82.6%
Other Symbol 14
 
7.6%
Lowercase Letter 13
 
7.1%
Space Separator 3
 
1.6%
Open Punctuation 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
9.2%
12
 
7.9%
12
 
7.9%
8
 
5.3%
8
 
5.3%
6
 
3.9%
5
 
3.3%
5
 
3.3%
4
 
2.6%
4
 
2.6%
Other values (48) 74
48.7%
Lowercase Letter
ValueCountFrequency (%)
o 3
23.1%
n 1
 
7.7%
g 1
 
7.7%
d 1
 
7.7%
s 1
 
7.7%
h 1
 
7.7%
i 1
 
7.7%
e 1
 
7.7%
t 1
 
7.7%
u 1
 
7.7%
Other Symbol
ValueCountFrequency (%)
14
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 166
90.2%
Latin 13
 
7.1%
Common 5
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
8.4%
14
 
8.4%
12
 
7.2%
12
 
7.2%
8
 
4.8%
8
 
4.8%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
Other values (49) 78
47.0%
Latin
ValueCountFrequency (%)
o 3
23.1%
n 1
 
7.7%
g 1
 
7.7%
d 1
 
7.7%
s 1
 
7.7%
h 1
 
7.7%
i 1
 
7.7%
e 1
 
7.7%
t 1
 
7.7%
u 1
 
7.7%
Common
ValueCountFrequency (%)
3
60.0%
( 1
 
20.0%
) 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 152
82.6%
ASCII 18
 
9.8%
None 14
 
7.6%

Most frequent character per block

None
ValueCountFrequency (%)
14
100.0%
Hangul
ValueCountFrequency (%)
14
 
9.2%
12
 
7.9%
12
 
7.9%
8
 
5.3%
8
 
5.3%
6
 
3.9%
5
 
3.3%
5
 
3.3%
4
 
2.6%
4
 
2.6%
Other values (48) 74
48.7%
ASCII
ValueCountFrequency (%)
o 3
16.7%
3
16.7%
n 1
 
5.6%
( 1
 
5.6%
g 1
 
5.6%
d 1
 
5.6%
s 1
 
5.6%
h 1
 
5.6%
i 1
 
5.6%
e 1
 
5.6%
Other values (4) 4
22.2%
Distinct19
Distinct (%)79.2%
Missing0
Missing (%)0.0%
Memory size324.0 B
2023-12-16T16:03:48.049623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24.5
Mean length21.375
Min length15

Characters and Unicode

Total characters513
Distinct characters68
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)58.3%

Sample

1st row경상북도 영천시 강변로 35 (금노동)
2nd row경상북도 영천시 쌍계길4(오수동)
3rd row경상북도 영천시 강변로 65 (금노동)
4th row경상북도 영천시 완산로 34 (완산동)
5th row경상북도 영천시 시장로 57 3층 (완산동)
ValueCountFrequency (%)
경상북도 24
20.2%
영천시 24
20.2%
강변로 6
 
5.0%
금노동 6
 
5.0%
완산동 5
 
4.2%
완산로 3
 
2.5%
문외동 3
 
2.5%
55 2
 
1.7%
성내동 2
 
1.7%
천문로 2
 
1.7%
Other values (36) 42
35.3%
2023-12-16T16:03:49.855457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
95
18.5%
27
 
5.3%
26
 
5.1%
24
 
4.7%
24
 
4.7%
24
 
4.7%
24
 
4.7%
24
 
4.7%
20
 
3.9%
19
 
3.7%
Other values (58) 206
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 314
61.2%
Space Separator 95
 
18.5%
Decimal Number 66
 
12.9%
Open Punctuation 18
 
3.5%
Close Punctuation 18
 
3.5%
Dash Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
8.6%
26
 
8.3%
24
 
7.6%
24
 
7.6%
24
 
7.6%
24
 
7.6%
24
 
7.6%
20
 
6.4%
19
 
6.1%
8
 
2.5%
Other values (43) 94
29.9%
Decimal Number
ValueCountFrequency (%)
3 11
16.7%
5 10
15.2%
1 10
15.2%
4 9
13.6%
2 8
12.1%
6 6
9.1%
7 5
7.6%
0 3
 
4.5%
8 2
 
3.0%
9 2
 
3.0%
Space Separator
ValueCountFrequency (%)
95
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 314
61.2%
Common 199
38.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
8.6%
26
 
8.3%
24
 
7.6%
24
 
7.6%
24
 
7.6%
24
 
7.6%
24
 
7.6%
20
 
6.4%
19
 
6.1%
8
 
2.5%
Other values (43) 94
29.9%
Common
ValueCountFrequency (%)
95
47.7%
( 18
 
9.0%
) 18
 
9.0%
3 11
 
5.5%
5 10
 
5.0%
1 10
 
5.0%
4 9
 
4.5%
2 8
 
4.0%
6 6
 
3.0%
7 5
 
2.5%
Other values (5) 9
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 314
61.2%
ASCII 199
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
95
47.7%
( 18
 
9.0%
) 18
 
9.0%
3 11
 
5.5%
5 10
 
5.0%
1 10
 
5.0%
4 9
 
4.5%
2 8
 
4.0%
6 6
 
3.0%
7 5
 
2.5%
Other values (5) 9
 
4.5%
Hangul
ValueCountFrequency (%)
27
 
8.6%
26
 
8.3%
24
 
7.6%
24
 
7.6%
24
 
7.6%
24
 
7.6%
24
 
7.6%
20
 
6.4%
19
 
6.1%
8
 
2.5%
Other values (43) 94
29.9%

전화번호
Text

MISSING 

Distinct15
Distinct (%)75.0%
Missing4
Missing (%)16.7%
Memory size324.0 B
2023-12-16T16:03:50.462405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.05
Min length12

Characters and Unicode

Total characters241
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)50.0%

Sample

1st row054-333-7700
2nd row054-332-2500
3rd row054-333-6666
4th row054-338-1558
5th row054-336-0029
ValueCountFrequency (%)
054-333-7700 2
 
10.0%
054-333-6666 2
 
10.0%
054-338-1558 2
 
10.0%
054-336-0029 2
 
10.0%
054-337-7113 2
 
10.0%
054-332-2500 1
 
5.0%
054-338-0801 1
 
5.0%
054-336-4646 1
 
5.0%
054-332-3434 1
 
5.0%
054-337-2700 1
 
5.0%
Other values (5) 5
25.0%
2023-12-16T16:03:51.754297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 48
19.9%
- 40
16.6%
0 39
16.2%
4 26
10.8%
5 25
10.4%
6 16
 
6.6%
8 12
 
5.0%
1 12
 
5.0%
7 11
 
4.6%
2 10
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 201
83.4%
Dash Punctuation 40
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 48
23.9%
0 39
19.4%
4 26
12.9%
5 25
12.4%
6 16
 
8.0%
8 12
 
6.0%
1 12
 
6.0%
7 11
 
5.5%
2 10
 
5.0%
9 2
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 241
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 48
19.9%
- 40
16.6%
0 39
16.2%
4 26
10.8%
5 25
10.4%
6 16
 
6.6%
8 12
 
5.0%
1 12
 
5.0%
7 11
 
4.6%
2 10
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 241
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 48
19.9%
- 40
16.6%
0 39
16.2%
4 26
10.8%
5 25
10.4%
6 16
 
6.6%
8 12
 
5.0%
1 12
 
5.0%
7 11
 
4.6%
2 10
 
4.1%

Interactions

2023-12-16T16:03:42.049223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T16:03:52.182878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명상호명도로명주소전화번호
연번1.0000.9470.0000.0000.764
업종명0.9471.0000.5650.5650.602
상호명0.0000.5651.0001.0001.000
도로명주소0.0000.5651.0001.0001.000
전화번호0.7640.6021.0001.0001.000
2023-12-16T16:03:52.583091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.724
업종명0.7241.000

Missing values

2023-12-16T16:03:42.350956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T16:03:42.804401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명상호명도로명주소전화번호
01국내여행업㈜천마관광여행사경상북도 영천시 강변로 35 (금노동)054-333-7700
12국내여행업㈜철마관광경상북도 영천시 쌍계길4(오수동)054-332-2500
23국내여행업뉴세화고속관광㈜경상북도 영천시 강변로 65 (금노동)054-333-6666
34국내여행업㈜아세아여행사경상북도 영천시 완산로 34 (완산동)054-338-1558
45국내여행업㈜노블투어경상북도 영천시 시장로 57 3층 (완산동)054-336-0029
56국내여행업㈜영천항공여행사경상북도 영천시 강변로 55 (금노동)054-337-7113
67국내여행업지구마을산책경상북도 영천시 대창면 금박로 1021054-338-0801
78국내여행업별별곳간사회적협동조합경상북도 영천시 충효로 147, 103호<NA>
89국내여행업유창여행사경상북도 영천시 청통면 보성공단길 31-37054-336-4646
910국내여행업주식회사 하람기획경상북도 영천시 임고면 포은로 482<NA>
연번업종명상호명도로명주소전화번호
1415국내외여행업㈜영천항공여행사경상북도 영천시 강변로 55 (금노동)054-337-7113
1516국내외여행업㈜한진항공여행경상북도 영천시 시청남길 24 2층 (문외동)054-337-2700
1617국외여행업㈜영천관광해외여행사경상북도 영천시 중앙동2길 76 (문외동)054-338-1000
1718국외여행업뉴세화고속관광㈜경상북도 영천시 강변로 65 (금노동)054-333-6666
1819국외여행업로얄관광여행사경상북도 영천시 최무선로 266 (성내동)054-333-2220
1920국외여행업새천년여행사경상북도 영천시 천문로 319(괴전동)<NA>
2021국내외여행업동아산업 주식회사경상북도 영천시 향군로 29054-338-1888
2122종합여행업시민항공여행사경상북도 영천시 호국로 23 (문외동)054-331-3651
2223종합여행업가영투어경상북도 영천시 강변로4 (성내동)070-4166-2844
2324종합여행업굿샤인투어(goodshine tour)경상북도 영천시 천문로 181 4층 401호<NA>