Overview

Dataset statistics

Number of variables: 5
Number of observations: 38
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 44.5 B

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

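Description: 대구광역시_서구_여행업 현황_20210903 (Daegu Metropolitan City, Seo-gu, travel business status, 2021-09-03)
Author: 대구광역시 서구 (Seo-gu, Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15053841&dataSetDetailId=1505384119bf9c608a42a&provdMethod=FILE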

Alerts

High correlation: 연번 is highly overall correlated with 업종중분류
High correlation: 업종중분류 is highly overall correlated with 연번
Unique: 연번 has unique values
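
The two high-correlation alerts are what one would expect if the rows are sorted by 업종중분류, so that 연번 rises in step with the category. As a rough illustration (not necessarily the measure the profiler reports), the sketch below computes the correlation ratio η between a serial number and a category. It assumes the 38 rows fall into three contiguous blocks of 18, 16, and 4 rows; that layout is inferred from the category counts and the sample rows, and the labels A, B, C and the function name are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def correlation_ratio(values, labels):
    """Return eta: sqrt(between-group sum of squares / total sum of squares)."""
    groups = defaultdict(list)
    for value, label in zip(values, labels):
        groups[label].append(value)

    grand_mean = sum(values) / len(values)
    ss_total = sum((v - grand_mean) ** 2 for v in values)
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
    )
    return sqrt(ss_between / ss_total)


# Assumed layout: serial numbers 1..38, sorted by category (18 + 16 + 4 rows).
serial = list(range(1, 39))
category = ["A"] * 18 + ["B"] * 16 + ["C"] * 4

eta = correlation_ratio(serial, category)
print(f"eta = {eta:.3f}")
```

A value of η close to 1 means the category alone explains most of the variation in the serial number, which says more about row ordering than about any real relationship.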

Reproduction

Analysis started: 2024-04-19 05:19:07.644115
Analysis finished: 2024-04-19 05:19:08.033184
Duration: 0.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 38
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.5
Minimum: 1
Maximum: 38
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 474.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.85
Q1: 10.25
Median: 19.5
Q3: 28.75
95-th percentile: 36.15
Maximum: 38
Range: 37
Interquartile range (IQR): 18.5
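
The quantile values above are consistent with linear interpolation between order statistics on the values 1 to 38. The sketch below reproduces them under that assumption; the function name `percentile` is hypothetical and the data are reconstructed from the reported minimum, maximum, and uniqueness of 연번 rather than read from the original file.

```python
def percentile(sorted_values, p):
    """Linearly interpolated percentile (p in [0, 100]) of pre-sorted data."""
    position = (len(sorted_values) - 1) * p / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


# Assumed data: the serial numbers 1..38, one per observation.
serial = list(range(1, 39))

q1 = percentile(serial, 25)
q3 = percentile(serial, 75)
iqr = q3 - q1

for p in (5, 25, 50, 75, 95):
    print(f"{p:>2}-th percentile: {percentile(serial, p):.2f}")
print(f"IQR: {iqr:.2f}")
```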

Descriptive statistics

Standard deviation: 11.113055
Coefficient of variation (CV): 0.56990028
Kurtosis: -1.2
Mean: 19.5
Median Absolute Deviation (MAD): 9.5
Skewness: 0
Sum: 741
Variance: 123.5
Monotonicity: Strictly increasing
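
Most of these figures follow directly from the fact that 연번 takes each value from 1 to 38 exactly once. The sketch below recomputes them with Python's standard `statistics` module, assuming the sample (n − 1) definitions of variance and standard deviation, which match the reported numbers; the variable names are illustrative.

```python
import statistics

# Assumed data: the serial numbers 1..38, one per observation.
serial = list(range(1, 39))

mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance, divides by n - 1
std_dev = statistics.stdev(serial)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(serial)
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(x - median) for x in serial)
total = sum(serial)

print(f"mean={mean} variance={variance} std={std_dev:.6f}")
print(f"cv={cv:.8f} mad={mad} sum={total}")
```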
Histogram with fixed size bins (bins=38)
Value | Count | Frequency (%)
1 | 1 | 2.6%
30 | 1 | 2.6%
23 | 1 | 2.6%
24 | 1 | 2.6%
25 | 1 | 2.6%
26 | 1 | 2.6%
27 | 1 | 2.6%
28 | 1 | 2.6%
29 | 1 | 2.6%
31 | 1 | 2.6%
Other values (28) | 28 | 73.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.6%
2 | 1 | 2.6%
3 | 1 | 2.6%
4 | 1 | 2.6%
5 | 1 | 2.6%
6 | 1 | 2.6%
7 | 1 | 2.6%
8 | 1 | 2.6%
9 | 1 | 2.6%
10 | 1 | 2.6%

Maximum 10 values

Value | Count | Frequency (%)
38 | 1 | 2.6%
37 | 1 | 2.6%
36 | 1 | 2.6%
35 | 1 | 2.6%
34 | 1 | 2.6%
33 | 1 | 2.6%
32 | 1 | 2.6%
31 | 1 | 2.6%
30 | 1 | 2.6%
29 | 1 | 2.6%

업종중분류 (business sub-category)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

국내여행업 (domestic travel business): 18
국외여행업 (overseas travel business): 16
일반여행업 (general travel business): 4
Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
국내여행업 | 18 | 47.4%
국외여행업 | 16 | 42.1%
일반여행업 | 4 | 10.5%
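
The frequency column is simply each count divided by the 38 observations, rounded to one decimal place. A minimal sketch that rebuilds the table from the reported counts (the list of labels is reconstructed from those counts, not read from the source file):

```python
from collections import Counter

# Reconstructed from the reported counts: 18 + 16 + 4 = 38 observations.
labels = ["국내여행업"] * 18 + ["국외여행업"] * 16 + ["일반여행업"] * 4

counts = Counter(labels)
total = sum(counts.values())

# Share of each category as a percentage, most frequent first.
shares = {label: round(100 * n / total, 1) for label, n in counts.most_common()}

for label, share in shares.items():
    print(f"{label} | {counts[label]} | {share}%")
```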

Length

Histogram of lengths of the category


업체명 (business name)
Text

Distinct: 26
Distinct (%): 68.4%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 13
Median length: 10
Mean length: 7.2105263
Min length: 4

Characters and Unicode

Total characters: 274
Distinct characters: 66
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
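
As a small illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to count the general category of each character in three business names taken from the sample rows. The helper name `profile_characters` is hypothetical, and this is not necessarily how the profiler derives its counts. The two-letter codes map onto the category names used in this report: `Lo` is Other Letter, `So` is Other Symbol, `Ps` and `Pe` are Open and Close Punctuation, `Zs` is Space Separator, and `Lu` is Uppercase Letter.

```python
import unicodedata
from collections import Counter


def profile_characters(text):
    """Count the Unicode general categories of the characters in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# Business names copied from the sample rows of this report.
names = ["(주)신도관광여행사", "㈜신동아관광", "NK 투어"]

for name in names:
    print(name, dict(profile_characters(name)))
```

Note that the single code point ㈜ counts as one Other Symbol, while the spelled-out form (주) contributes one open parenthesis, one Hangul letter, and one close parenthesis.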

Unique

Unique: 14
Unique (%): 36.8%
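
The report distinguishes "distinct" (how many different values occur) from "unique" (how many values occur exactly once), which is why a column can have 26 distinct values but only 14 unique ones. A minimal sketch of the two counts, using a made-up list rather than the real column:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical values for illustration only.
example = ["a", "a", "b", "c", "c", "c", "d"]
distinct, unique = distinct_and_unique(example)
print(f"distinct={distinct} unique={unique}")
```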

Sample

1st row: (주)신도관광여행사
2nd row: ㈜신동아관광
3rd row: 무궁화고속관광㈜
4th row: 대구관광여행사(주)
5th row: 신지여행사

Value | Count | Frequency (%)
주)신도관광여행사 | 2 | 5.0%
㈜진에어투어 | 2 | 5.0%
㈜스타대구고속관광 | 2 | 5.0%
㈜에스엠투어 | 2 | 5.0%
한국공무연수개발원 | 2 | 5.0%
㈜신동아관광 | 2 | 5.0%
가자시네마투어 | 2 | 5.0%
신지여행사 | 2 | 5.0%
㈜나이스여행사 | 2 | 5.0%
대구관광여행사(주 | 2 | 5.0%
Other values (18) | 20 | 50.0%

Most occurring characters

ValueCountFrequency (%)
19
 
6.9%
15
 
5.5%
13
 
4.7%
12
 
4.4%
12
 
4.4%
12
 
4.4%
11
 
4.0%
11
 
4.0%
11
 
4.0%
( 9
 
3.3%
Other values (56) 149
54.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 233 | 85.0%
Other Symbol | 19 | 6.9%
Open Punctuation | 9 | 3.3%
Close Punctuation | 9 | 3.3%
Space Separator | 2 | 0.7%
Uppercase Letter | 2 | 0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
6.4%
13
 
5.6%
12
 
5.2%
12
 
5.2%
12
 
5.2%
11
 
4.7%
11
 
4.7%
11
 
4.7%
9
 
3.9%
9
 
3.9%
Other values (50) 118
50.6%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
N 1
50.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 252 | 92.0%
Common | 20 | 7.3%
Latin | 2 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
7.5%
15
 
6.0%
13
 
5.2%
12
 
4.8%
12
 
4.8%
12
 
4.8%
11
 
4.4%
11
 
4.4%
11
 
4.4%
9
 
3.6%
Other values (51) 127
50.4%
Common
ValueCountFrequency (%)
( 9
45.0%
) 9
45.0%
2
 
10.0%
Latin
ValueCountFrequency (%)
K 1
50.0%
N 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 233 | 85.0%
ASCII | 22 | 8.0%
None | 19 | 6.9%

Most frequent character per block

None
ValueCountFrequency (%)
19
100.0%
Hangul
ValueCountFrequency (%)
15
 
6.4%
13
 
5.6%
12
 
5.2%
12
 
5.2%
12
 
5.2%
11
 
4.7%
11
 
4.7%
11
 
4.7%
9
 
3.9%
9
 
3.9%
Other values (50) 118
50.6%
ASCII
ValueCountFrequency (%)
( 9
40.9%
) 9
40.9%
2
 
9.1%
K 1
 
4.5%
N 1
 
4.5%

소재지 (address)
Text

Distinct: 22
Distinct (%): 57.9%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 43
Median length: 29
Mean length: 28.236842
Min length: 21

Characters and Unicode

Total characters: 1073
Distinct characters: 54
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 10
Unique (%): 26.3%

Sample

1st row: 대구광역시 서구 달구벌대로 1791 (내당동)
2nd row: 대구광역시 서구 국채보상로 316, 201동 402호 (평리동, 평리롯데캐슬)
3rd row: 대구광역시 서구 통학로 30 (내당동)
4th row: 대구광역시 서구 평리로 403 (평리동)
5th row: 대구광역시 서구 고성로15길 25 (원대동3가)

Value | Count | Frequency (%)
대구광역시 | 38 | 16.9%
서구 | 38 | 16.9%
내당동 | 12 | 5.3%
평리동 | 12 | 5.3%
비산동 | 11 | 4.9%
2층 | 9 | 4.0%
국채보상로 | 8 | 3.6%
316 | 6 | 2.7%
평리롯데캐슬 | 6 | 2.7%
서대구로 | 5 | 2.2%
Other values (40) | 80 | 35.6%

Most occurring characters

ValueCountFrequency (%)
187
17.4%
86
 
8.0%
50
 
4.7%
49
 
4.6%
45
 
4.2%
( 38
 
3.5%
38
 
3.5%
38
 
3.5%
38
 
3.5%
38
 
3.5%
Other values (44) 466
43.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 605 | 56.4%
Space Separator | 187 | 17.4%
Decimal Number | 172 | 16.0%
Open Punctuation | 38 | 3.5%
Close Punctuation | 38 | 3.5%
Other Punctuation | 30 | 2.8%
Dash Punctuation | 3 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
14.2%
50
 
8.3%
49
 
8.1%
45
 
7.4%
38
 
6.3%
38
 
6.3%
38
 
6.3%
38
 
6.3%
23
 
3.8%
22
 
3.6%
Other values (29) 178
29.4%
Decimal Number
ValueCountFrequency (%)
1 36
20.9%
3 31
18.0%
2 30
17.4%
0 22
12.8%
4 19
11.0%
8 11
 
6.4%
6 8
 
4.7%
5 5
 
2.9%
9 5
 
2.9%
7 5
 
2.9%
Space Separator
ValueCountFrequency (%)
187
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Other Punctuation
ValueCountFrequency (%)
, 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 605 | 56.4%
Common | 468 | 43.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
14.2%
50
 
8.3%
49
 
8.1%
45
 
7.4%
38
 
6.3%
38
 
6.3%
38
 
6.3%
38
 
6.3%
23
 
3.8%
22
 
3.6%
Other values (29) 178
29.4%
Common
ValueCountFrequency (%)
187
40.0%
( 38
 
8.1%
) 38
 
8.1%
1 36
 
7.7%
3 31
 
6.6%
, 30
 
6.4%
2 30
 
6.4%
0 22
 
4.7%
4 19
 
4.1%
8 11
 
2.4%
Other values (5) 26
 
5.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 605 | 56.4%
ASCII | 468 | 43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
187
40.0%
( 38
 
8.1%
) 38
 
8.1%
1 36
 
7.7%
3 31
 
6.6%
, 30
 
6.4%
2 30
 
6.4%
0 22
 
4.7%
4 19
 
4.1%
8 11
 
2.4%
Other values (5) 26
 
5.6%
Hangul
ValueCountFrequency (%)
86
14.2%
50
 
8.3%
49
 
8.1%
45
 
7.4%
38
 
6.3%
38
 
6.3%
38
 
6.3%
38
 
6.3%
23
 
3.8%
22
 
3.6%
Other values (29) 178
29.4%

전화번호 (phone number)
Text

Distinct: 25
Distinct (%): 65.8%
Missing: 0
Missing (%): 0.0%
Memory size: 436.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.947368
Min length: 9

Characters and Unicode

Total characters: 454
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 13
Unique (%): 34.2%

Sample

1st row: 053-559-3131
2nd row: 053-572-7777
3rd row: 053-572-3838
4th row: 053-572-0011
5th row: 053-627-9858

Value | Count | Frequency (%)
053-744-4007 | 3 | 7.9%
053-524-8080 | 2 | 5.3%
053-761-0063 | 2 | 5.3%
053-427-8721 | 2 | 5.3%
053-215-8800 | 2 | 5.3%
053-476-5622 | 2 | 5.3%
053-551-6598 | 2 | 5.3%
053-639-7878 | 2 | 5.3%
053-526-0052 | 2 | 5.3%
053-572-4000 | 2 | 5.3%
Other values (15) | 17 | 44.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 75 | 16.5%
- | 75 | 16.5%
5 | 74 | 16.3%
3 | 54 | 11.9%
7 | 36 | 7.9%
2 | 33 | 7.3%
8 | 33 | 7.3%
4 | 23 | 5.1%
1 | 21 | 4.6%
6 | 18 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 379 | 83.5%
Dash Punctuation | 75 | 16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 75
19.8%
5 74
19.5%
3 54
14.2%
7 36
9.5%
2 33
8.7%
8 33
8.7%
4 23
 
6.1%
1 21
 
5.5%
6 18
 
4.7%
9 12
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 454
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 75
16.5%
- 75
16.5%
5 74
16.3%
3 54
11.9%
7 36
7.9%
2 33
7.3%
8 33
7.3%
4 23
 
5.1%
1 21
 
4.6%
6 18
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 454
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 75
16.5%
- 75
16.5%
5 74
16.3%
3 54
11.9%
7 36
7.9%
2 33
7.3%
8 33
7.3%
4 23
 
5.1%
1 21
 
4.6%
6 18
 
4.0%

Interactions


Correlations

Variable | 연번 | 업종중분류 | 업체명 | 소재지 | 전화번호
연번 | 1.000 | 0.966 | 0.000 | 0.000 | 0.000
업종중분류 | 0.966 | 1.000 | 0.000 | 0.000 | 0.211
업체명 | 0.000 | 0.000 | 1.000 | 0.995 | 0.993
소재지 | 0.000 | 0.000 | 0.995 | 1.000 | 0.995
전화번호 | 0.000 | 0.211 | 0.993 | 0.995 | 1.000

Variable | 연번 | 업종중분류
연번 | 1.000 | 0.868
업종중분류 | 0.868 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 업종중분류 | 업체명 | 소재지 | 전화번호
0 | 1 | 국내여행업 | (주)신도관광여행사 | 대구광역시 서구 달구벌대로 1791 (내당동) | 053-559-3131
1 | 2 | 국내여행업 | ㈜신동아관광 | 대구광역시 서구 국채보상로 316, 201동 402호 (평리동, 평리롯데캐슬) | 053-572-7777
2 | 3 | 국내여행업 | 무궁화고속관광㈜ | 대구광역시 서구 통학로 30 (내당동) | 053-572-3838
3 | 4 | 국내여행업 | 대구관광여행사(주) | 대구광역시 서구 평리로 403 (평리동) | 053-572-0011
4 | 5 | 국내여행업 | 신지여행사 | 대구광역시 서구 고성로15길 25 (원대동3가) | 053-627-9858
5 | 6 | 국내여행업 | (주)신동아고속관광 | 대구광역시 서구 국채보상로 316, 201동 402호 (평리동, 평리롯데캐슬) | 053-572-4000
6 | 7 | 국내여행업 | 위너스투어(주) | 대구광역시 서구 달서로 123 (비산동) | 053-639-7878
7 | 8 | 국내여행업 | (주)웰컴투어 | 대구광역시 서구 달구벌대로 1821, 2층 (내당동) | 053-526-0052
8 | 9 | 국내여행업 | ㈜고경자여행사 | 대구광역시 서구 서대구로10길 6 (내당동) | 053-427-8721
9 | 10 | 국내여행업 | ㈜여행돌 | 대구광역시 서구 북비산로 347, 3층 (비산동) | 1599-2952

Index | 연번 | 업종중분류 | 업체명 | 소재지 | 전화번호
28 | 29 | 국외여행업 | ㈜나이스여행사 | 대구광역시 서구 달서로 13, 2층 (내당동) | 053-761-0063
29 | 30 | 국외여행업 | 신원여행사 | 대구광역시 서구 서대구로 299, 12호 (비산동) | 053-352-5582
30 | 31 | 국외여행업 | 가자시네마투어 | 대구광역시 서구 서대구로 108, 1층 (평리동) | 053-476-5622
31 | 32 | 국외여행업 | (주)더블유아이씨씨코리아 | 대구광역시 서구 서대구로 13, 1층 (내당동) | 070-7434-2373
32 | 33 | 국외여행업 | 한국공무연수개발원 | 대구광역시 서구 북비산로74길 8-1, 2층 (비산동) | 053-551-6598
33 | 34 | 국외여행업 | ㈜에스엠투어 | 대구광역시 서구 통학로 4, 2층 (내당동) | 053-215-8800
34 | 35 | 일반여행업 | ㈜무궁화엠에스 | 대구광역시 서구 통학로 30 (내당동) | 053-431-8886
35 | 36 | 일반여행업 | NK 투어 | 대구광역시 서구 국채보상로 188, 3층 (중리동, 동우빌딩) | 053-522-0113
36 | 37 | 일반여행업 | ㈜엑스코투어 | 대구광역시 서구 통학로48길 34-3, 2층 (비산동) | 053-428-4002
37 | 38 | 일반여행업 | ㈜신천지투어 | 대구광역시 서구 서대구로 318, 3층 (비산동) | 053-359-0888