Overview

Dataset statistics

Number of variables: 7
Number of observations: 60
Missing cells: 28
Missing cells (%): 6.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.5 KiB
Average record size in memory: 59.2 B
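
The percentage and size figures follow from the raw counts. A minimal sketch of that arithmetic, assuming the missing-cell share is taken over all cells (variables × observations) and the total size is the average record size times the row count; the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 7
n_observations = 60
missing_cells = 28
avg_record_bytes = 59.2

# Share of missing cells over the whole table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate total size, converted from bytes to KiB.
total_kib = avg_record_bytes * n_observations / 1024

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size: {total_kib:.1f} KiB")
```

Both rounded results agree with the values reported (6.7% and 3.5 KiB).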

Variable types

Numeric: 1
Text: 3
Categorical: 1
DateTime: 2

Dataset

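Description: Data on the status of travel businesses located in Jung-gu, Incheon Metropolitan City. File name: 인천광역시_중구_여행업 현황 (Incheon Metropolitan City, Jung-gu, travel business status). File contents: business name, business type, business address, etc.
URL: https://www.data.go.kr/data/15038644/fileData.do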

Alerts

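Constant: 데이터기준일 (data reference date) has constant value ""
High correlation: 연번 (serial number) is highly overall correlated with 업종명 (business type)
High correlation: 업종명 (business type) is highly overall correlated with 연번 (serial number)
Missing: 소재지전화 (phone number) has 28 (46.7%) missing values
Unique: 연번 (serial number) has unique values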

Reproduction

Analysis started: 2023-12-12 16:20:15.046296
Analysis finished: 2023-12-12 16:20:15.816855
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
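
The reported duration is the difference between the two timestamps. A small sketch of that check with the standard library; the format string is an assumption about how the timestamps are written:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block of this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 16:20:15.046296", FMT)
finished = datetime.strptime("2023-12-12 16:20:15.816855", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```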

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 60
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 30.5
Minimum: 1
Maximum: 60
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 672.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.95
Q1: 15.75
median: 30.5
Q3: 45.25
95-th percentile: 57.05
Maximum: 60
Range: 59
Interquartile range (IQR): 29.5
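
These quantiles can be reproduced by hand. The sketch below assumes that 연번 holds the integers 1 to 60, which is consistent with the reported count, minimum, maximum and sum but is not stated outright. It uses linear interpolation between order statistics (`method="inclusive"` in Python's `statistics` module), which happens to match the reported figures; the report does not say which method the profiler itself uses.

```python
import statistics

# Assumption: the column is the sequence 1..60.
values = list(range(1, 61))

# "inclusive" interpolates linearly between neighbouring order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(q1, median, q3)
print(p5, p95)
print(iqr, value_range)
```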

Descriptive statistics

Standard deviation: 17.464249
Coefficient of variation (CV): 0.57259833
Kurtosis: -1.2
Mean: 30.5
Median Absolute Deviation (MAD): 15
Skewness: 0
Sum: 1830
Variance: 305
Monotonicity: Strictly increasing
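
Most of these descriptive statistics can be checked with the standard library. The sketch assumes, as an illustration, that 연번 is the sequence 1 to 60 and that the variance is the sample variance (denominator n - 1); both assumptions agree with the reported numbers.

```python
import math
import statistics

# Assumption: the column is the sequence 1..60.
values = list(range(1, 61))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Strictly increasing means every value exceeds its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, round(std, 6), round(cv, 8), mad, sum(values))
print(strictly_increasing)
```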
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 1.7%
32 | 1 | 1.7%
34 | 1 | 1.7%
35 | 1 | 1.7%
36 | 1 | 1.7%
37 | 1 | 1.7%
38 | 1 | 1.7%
39 | 1 | 1.7%
40 | 1 | 1.7%
41 | 1 | 1.7%
Other values (50) | 50 | 83.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.7%
2 | 1 | 1.7%
3 | 1 | 1.7%
4 | 1 | 1.7%
5 | 1 | 1.7%
6 | 1 | 1.7%
7 | 1 | 1.7%
8 | 1 | 1.7%
9 | 1 | 1.7%
10 | 1 | 1.7%

Maximum 10 values

Value | Count | Frequency (%)
60 | 1 | 1.7%
59 | 1 | 1.7%
58 | 1 | 1.7%
57 | 1 | 1.7%
56 | 1 | 1.7%
55 | 1 | 1.7%
54 | 1 | 1.7%
53 | 1 | 1.7%
52 | 1 | 1.7%
51 | 1 | 1.7%

업소명 (business name)
Text

Distinct: 56
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B

Length

Max length: 36
Median length: 13
Mean length: 8.7833333
Min length: 3

Characters and Unicode

Total characters: 527
Distinct characters: 151
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
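
As an illustration of what such a character-property analysis involves, the sketch below counts Unicode general categories with Python's `unicodedata` module for three business names taken from this report's sample rows. The mapping from two-letter codes to long names covers only the categories that appear in this report, and the helper name `category_counts` is hypothetical; this is not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# Long names for the general categories that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Ll": "Lowercase Letter", "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation", "Zs": "Space Separator",
    "Nd": "Decimal Number", "Po": "Other Punctuation", "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}

def category_counts(text):
    """Count the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)

# Three 업소명 values copied from the sample rows of this report.
names = ["(주)해운관광", "세진관광여행사", "(주)여행가기좋은날"]

totals = Counter()
for name in names:
    totals.update(category_counts(name))

for code, count in totals.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the text columns of this dataset.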

Unique

Unique: 52
Unique (%): 86.7%

Sample

1st row: (주)해운관광
2nd row: 세진관광여행사
3rd row: (주)여행가기좋은날
4th row: (주)청해진해운
5th row: 현대해양레져(주)
Value | Count | Frequency (%)
주식회사 | 4 | 5.8%
주)여행가기좋은날 | 2 | 2.9%
세진관광여행사 | 2 | 2.9%
주)프라미스컴퍼니 | 2 | 2.9%
현대하나여행사 | 2 | 2.9%
주)이솝여행사 | 1 | 1.4%
주)아이씨트레블 | 1 | 1.4%
서치어카인드홀딩스 | 1 | 1.4%
주)드림어스코리아 | 1 | 1.4%
한유취국제여행사 | 1 | 1.4%
Other values (52) | 52 | 75.4%

Most occurring characters

Value | Count | Frequency (%)
 | 42 | 8.0%
( | 37 | 7.0%
) | 37 | 7.0%
 | 23 | 4.4%
 | 21 | 4.0%
 | 21 | 4.0%
 | 14 | 2.7%
 | 13 | 2.5%
 | 12 | 2.3%
 | 10 | 1.9%
Other values (141) | 297 | 56.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 415 | 78.7%
Open Punctuation | 37 | 7.0%
Close Punctuation | 37 | 7.0%
Lowercase Letter | 23 | 4.4%
Space Separator | 9 | 1.7%
Uppercase Letter | 6 | 1.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 42 | 10.1%
 | 23 | 5.5%
 | 21 | 5.1%
 | 21 | 5.1%
 | 14 | 3.4%
 | 13 | 3.1%
 | 12 | 2.9%
 | 10 | 2.4%
 | 10 | 2.4%
 | 7 | 1.7%
Other values (123) | 242 | 58.3%

Lowercase Letter
Value | Count | Frequency (%)
n | 5 | 21.7%
i | 3 | 13.0%
s | 3 | 13.0%
t | 2 | 8.7%
a | 2 | 8.7%
o | 2 | 8.7%
e | 2 | 8.7%
u | 1 | 4.3%
l | 1 | 4.3%
r | 1 | 4.3%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 33.3%
B | 2 | 33.3%
D | 1 | 16.7%
I | 1 | 16.7%

Open Punctuation
Value | Count | Frequency (%)
( | 37 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 37 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 9 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 415 | 78.7%
Common | 83 | 15.7%
Latin | 29 | 5.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 42 | 10.1%
 | 23 | 5.5%
 | 21 | 5.1%
 | 21 | 5.1%
 | 14 | 3.4%
 | 13 | 3.1%
 | 12 | 2.9%
 | 10 | 2.4%
 | 10 | 2.4%
 | 7 | 1.7%
Other values (123) | 242 | 58.3%

Latin
Value | Count | Frequency (%)
n | 5 | 17.2%
i | 3 | 10.3%
s | 3 | 10.3%
S | 2 | 6.9%
B | 2 | 6.9%
t | 2 | 6.9%
a | 2 | 6.9%
o | 2 | 6.9%
e | 2 | 6.9%
u | 1 | 3.4%
Other values (5) | 5 | 17.2%

Common
Value | Count | Frequency (%)
( | 37 | 44.6%
) | 37 | 44.6%
(space) | 9 | 10.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 415 | 78.7%
ASCII | 112 | 21.3%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 42 | 10.1%
 | 23 | 5.5%
 | 21 | 5.1%
 | 21 | 5.1%
 | 14 | 3.4%
 | 13 | 3.1%
 | 12 | 2.9%
 | 10 | 2.4%
 | 10 | 2.4%
 | 7 | 1.7%
Other values (123) | 242 | 58.3%

ASCII
Value | Count | Frequency (%)
( | 37 | 33.0%
) | 37 | 33.0%
(space) | 9 | 8.0%
n | 5 | 4.5%
i | 3 | 2.7%
s | 3 | 2.7%
S | 2 | 1.8%
B | 2 | 1.8%
t | 2 | 1.8%
a | 2 | 1.8%
Other values (8) | 10 | 8.9%

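업종명 (business type)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B
종합여행업 (general travel business): 26
국내외여행업 (domestic and overseas travel business): 20
국내여행업 (domestic travel business): 14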

Length

Max length: 6
Median length: 5
Mean length: 5.3333333
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
종합여행업 | 26 | 43.3%
국내외여행업 | 20 | 33.3%
국내여행업 | 14 | 23.3%
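
The frequency column is each category's count divided by the 60 observations. A short sketch, in which the column is rebuilt from the reported counts purely for illustration (the real data would be read from the source file):

```python
from collections import Counter

# Assumption: a stand-in column rebuilt from the counts reported here.
column = ["종합여행업"] * 26 + ["국내외여행업"] * 20 + ["국내여행업"] * 14

counts = Counter(column)
total = sum(counts.values())

# Percentage share of each category, rounded as in the report.
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

for value, count in counts.most_common():
    print(value, count, f"{shares[value]}%")
```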

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
종합여행업 | 26 | 43.3%
국내외여행업 | 20 | 33.3%
국내여행업 | 14 | 23.3%

업소소재지 (business address)
Text

Distinct: 50
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B

Length

Max length: 45
Median length: 38
Mean length: 30.216667
Min length: 15

Characters and Unicode

Total characters: 1813
Distinct characters: 141
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 42
Unique (%): 70.0%

Sample

1st row: 인천광역시 중구 답동 18-4
2nd row: 인천광역시 중구 답동 13
3rd row: 인천광역시 중구 운서동 2850 아이비씨 월드게이트 927호
4th row: 인천광역시 중구 항동7가 88
5th row: 인천광역시 중구 항동7가 60
Value | Count | Frequency (%)
인천광역시 | 60 | 16.3%
중구 | 60 | 16.3%
2층 | 18 | 4.9%
운서동 | 11 | 3.0%
항동7가 | 9 | 2.4%
중산동 | 5 | 1.4%
1층 | 5 | 1.4%
월드게이트 | 5 | 1.4%
북성동1가 | 5 | 1.4%
연안부두로 | 5 | 1.4%
Other values (132) | 185 | 50.3%

Most occurring characters

Value | Count | Frequency (%)
(space) | 319 | 17.6%
 | 76 | 4.2%
 | 67 | 3.7%
2 | 67 | 3.7%
 | 64 | 3.5%
 | 63 | 3.5%
1 | 63 | 3.5%
 | 62 | 3.4%
 | 61 | 3.4%
 | 61 | 3.4%
Other values (131) | 910 | 50.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1012 | 55.8%
Space Separator | 319 | 17.6%
Decimal Number | 310 | 17.1%
Other Punctuation | 47 | 2.6%
Close Punctuation | 47 | 2.6%
Open Punctuation | 47 | 2.6%
Dash Punctuation | 16 | 0.9%
Uppercase Letter | 15 | 0.8%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 76 | 7.5%
 | 67 | 6.6%
 | 64 | 6.3%
 | 63 | 6.2%
 | 62 | 6.1%
 | 61 | 6.0%
 | 61 | 6.0%
 | 60 | 5.9%
 | 46 | 4.5%
 | 27 | 2.7%
Other values (103) | 425 | 42.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 3 | 20.0%
T | 1 | 6.7%
Y | 1 | 6.7%
O | 1 | 6.7%
H | 1 | 6.7%
E | 1 | 6.7%
L | 1 | 6.7%
S | 1 | 6.7%
K | 1 | 6.7%
G | 1 | 6.7%
Other values (3) | 3 | 20.0%

Decimal Number
Value | Count | Frequency (%)
2 | 67 | 21.6%
1 | 63 | 20.3%
0 | 30 | 9.7%
7 | 30 | 9.7%
4 | 30 | 9.7%
3 | 24 | 7.7%
6 | 21 | 6.8%
9 | 19 | 6.1%
5 | 13 | 4.2%
8 | 13 | 4.2%

Space Separator
Value | Count | Frequency (%)
(space) | 319 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 47 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 47 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 47 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 16 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1012 | 55.8%
Common | 786 | 43.4%
Latin | 15 | 0.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 76 | 7.5%
 | 67 | 6.6%
 | 64 | 6.3%
 | 63 | 6.2%
 | 62 | 6.1%
 | 61 | 6.0%
 | 61 | 6.0%
 | 60 | 5.9%
 | 46 | 4.5%
 | 27 | 2.7%
Other values (103) | 425 | 42.0%

Common
Value | Count | Frequency (%)
(space) | 319 | 40.6%
2 | 67 | 8.5%
1 | 63 | 8.0%
, | 47 | 6.0%
) | 47 | 6.0%
( | 47 | 6.0%
0 | 30 | 3.8%
7 | 30 | 3.8%
4 | 30 | 3.8%
3 | 24 | 3.1%
Other values (5) | 82 | 10.4%

Latin
Value | Count | Frequency (%)
B | 3 | 20.0%
T | 1 | 6.7%
Y | 1 | 6.7%
O | 1 | 6.7%
H | 1 | 6.7%
E | 1 | 6.7%
L | 1 | 6.7%
S | 1 | 6.7%
K | 1 | 6.7%
G | 1 | 6.7%
Other values (3) | 3 | 20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1012 | 55.8%
ASCII | 801 | 44.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 319 | 39.8%
2 | 67 | 8.4%
1 | 63 | 7.9%
, | 47 | 5.9%
) | 47 | 5.9%
( | 47 | 5.9%
0 | 30 | 3.7%
7 | 30 | 3.7%
4 | 30 | 3.7%
3 | 24 | 3.0%
Other values (18) | 97 | 12.1%

Hangul
Value | Count | Frequency (%)
 | 76 | 7.5%
 | 67 | 6.6%
 | 64 | 6.3%
 | 63 | 6.2%
 | 62 | 6.1%
 | 61 | 6.0%
 | 61 | 6.0%
 | 60 | 5.9%
 | 46 | 4.5%
 | 27 | 2.7%
Other values (103) | 425 | 42.0%

소재지전화 (phone number)
Text

MISSING

Distinct: 28
Distinct (%): 87.5%
Missing: 28
Missing (%): 46.7%
Memory size: 612.0 B

Length

Max length: 14
Median length: 12
Mean length: 12.09375
Min length: 9

Characters and Unicode

Total characters: 387
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 24
Unique (%): 75.0%
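
For this column the denominators differ between figures. Judging from the numbers alone, the missing share is taken over all 60 rows, while the distinct share, unique share and mean length appear to be taken over the 32 non-missing values. The sketch below checks that reading; it is an inference from the reported figures, not a statement about the profiler's internals.

```python
# Figures copied from the 소재지전화 block of this report.
n_rows = 60
n_missing = 28
n_distinct = 28
n_unique = 24
total_characters = 387

# Values that are actually present.
n_present = n_rows - n_missing

missing_pct = 100 * n_missing / n_rows        # over all rows
distinct_pct = 100 * n_distinct / n_present   # over non-missing values
unique_pct = 100 * n_unique / n_present       # over non-missing values
mean_length = total_characters / n_present    # characters per present value

print(n_present, round(missing_pct, 1), distinct_pct, unique_pct, mean_length)
```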

Sample

1st row: 032-743-3060~3
2nd row: 032-884-8700
3rd row: 032-882-5555
4th row: 032-761-1950
5th row: 1522-3609
Value | Count | Frequency (%)
032-761-1950 | 2 | 6.2%
032-743-3060~3 | 2 | 6.2%
032-884-8700 | 2 | 6.2%
032-766-9076 | 2 | 6.2%
032-777-2420 | 1 | 3.1%
032-773-0014 | 1 | 3.1%
032-889-1911 | 1 | 3.1%
032-345-5635 | 1 | 3.1%
032-746-8889 | 1 | 3.1%
02-2266-8631 | 1 | 3.1%
Other values (18) | 18 | 56.2%

Most occurring characters

ValueCountFrequency (%)
Value | Count | Frequency (%)
- | 63 | 16.3%
0 | 59 | 15.2%
3 | 51 | 13.2%
7 | 48 | 12.4%
2 | 43 | 11.1%
6 | 26 | 6.7%
8 | 25 | 6.5%
1 | 21 | 5.4%
4 | 17 | 4.4%
9 | 16 | 4.1%
Other values (2) | 18 | 4.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 322 | 83.2%
Dash Punctuation | 63 | 16.3%
Math Symbol | 2 | 0.5%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 59 | 18.3%
3 | 51 | 15.8%
7 | 48 | 14.9%
2 | 43 | 13.4%
6 | 26 | 8.1%
8 | 25 | 7.8%
1 | 21 | 6.5%
4 | 17 | 5.3%
9 | 16 | 5.0%
5 | 16 | 5.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 63 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 387 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 63 | 16.3%
0 | 59 | 15.2%
3 | 51 | 13.2%
7 | 48 | 12.4%
2 | 43 | 11.1%
6 | 26 | 6.7%
8 | 25 | 6.5%
1 | 21 | 5.4%
4 | 17 | 4.4%
9 | 16 | 4.1%
Other values (2) | 18 | 4.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 387 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 63 | 16.3%
0 | 59 | 15.2%
3 | 51 | 13.2%
7 | 48 | 12.4%
2 | 43 | 11.1%
6 | 26 | 6.7%
8 | 25 | 6.5%
1 | 21 | 5.4%
4 | 17 | 4.4%
9 | 16 | 4.1%
Other values (2) | 18 | 4.7%

신고일자 (filing date)
Date

Distinct: 56
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B
Minimum: 1997-10-08 00:00:00
Maximum: 2023-07-24 00:00:00
Histogram with fixed size bins (bins=50)

데이터기준일 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B
Minimum: 2023-08-12 00:00:00
Maximum: 2023-08-12 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 연번 | 업소명 | 업종명 | 업소소재지 | 소재지전화 | 신고일자
연번 | 1.000 | 0.717 | 0.934 | 0.897 | 0.915 | 0.717
업소명 | 0.717 | 1.000 | 0.000 | 0.998 | 1.000 | 1.000
업종명 | 0.934 | 0.000 | 1.000 | 0.000 | 0.326 | 0.000
업소소재지 | 0.897 | 0.998 | 0.000 | 1.000 | 1.000 | 1.000
소재지전화 | 0.915 | 1.000 | 0.326 | 1.000 | 1.000 | 1.000
신고일자 | 0.717 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 업종명
연번 | 1.000 | 0.856
업종명 | 0.856 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 연번 | 업소명 | 업종명 | 업소소재지 | 소재지전화 | 신고일자 | 데이터기준일
0 | 1 | (주)해운관광 | 국내여행업 | 인천광역시 중구 답동 18-4 | <NA> | 1999-09-15 | 2023-08-12
1 | 2 | 세진관광여행사 | 국내여행업 | 인천광역시 중구 답동 13 | <NA> | 2004-11-25 | 2023-08-12
2 | 3 | (주)여행가기좋은날 | 국내여행업 | 인천광역시 중구 운서동 2850 아이비씨 월드게이트 927호 | 032-743-3060~3 | 2008-05-16 | 2023-08-12
3 | 4 | (주)청해진해운 | 국내여행업 | 인천광역시 중구 항동7가 88 | 032-884-8700 | 2008-06-18 | 2023-08-12
4 | 5 | 현대해양레져(주) | 국내여행업 | 인천광역시 중구 항동7가 60 | 032-882-5555 | 2010-04-07 | 2023-08-12
5 | 6 | 제물포 항공 | 국내여행업 | 인천광역시 중구 신생동 7-24 | <NA> | 2008-11-14 | 2023-08-12
6 | 7 | 월미도해양관광(주) | 국내여행업 | 인천광역시 중구 월미로 199, 1층 103호 (북성동1가) | 032-761-1950 | 2015-05-01 | 2023-08-12
7 | 8 | 주식회사 후쿠코리아 | 국내여행업 | 인천광역시 중구 공항로424번길 50 (운서동, 월드게이트) | 1522-3609 | 2015-10-28 | 2023-08-12
8 | 9 | 섬 투어(주) | 국내여행업 | 인천광역시 중구 월미로 199, 1층 103호 (북성동1가) | 032-761-1950 | 2009-06-24 | 2023-08-12
9 | 10 | (주)프라미스컴퍼니 | 국내여행업 | 인천광역시 중구 개항로 62, 2층 (경동) | 032-766-9076 | 2018-07-09 | 2023-08-12

Last rows

(index) | 연번 | 업소명 | 업종명 | 업소소재지 | 소재지전화 | 신고일자 | 데이터기준일
50 | 51 | 여행마니아(주) | 종합여행업 | 인천광역시 중구 신포로 19-10, 2층 (중앙동4가) | <NA> | 2017-03-06 | 2023-08-12
51 | 52 | (주)제이씨투어스 | 종합여행업 | 인천광역시 중구 운남서로 7, 113동 1004호 (운남동, 영종 자이) | 02-2266-8631 | 2010-02-25 | 2023-08-12
52 | 53 | 트리퍼스트글로벌(주)인천지점 | 종합여행업 | 인천광역시 중구 운중로177번길 39, 우리 상가 3층 (중산동) | 032-746-8889 | 2019-10-31 | 2023-08-12
53 | 54 | 부두 여행사 | 종합여행업 | 인천광역시 중구 우현로20번길 5, 1층 (신생동) | <NA> | 2021-08-29 | 2023-08-12
54 | 55 | (주)아이플러스 | 종합여행업 | 인천광역시 중구 서해대로93번길 14-3, 2층 201호 (항동7가) | <NA> | 2022-01-13 | 2023-08-12
55 | 56 | (주)카툰캠퍼스 | 종합여행업 | 인천광역시 중구 자유공원로 29, 2층 (내동) | 032-345-5635 | 2022-09-28 | 2023-08-12
56 | 57 | (주)연평여행사 | 종합여행업 | 인천광역시 중구 연안부두로 70, 인천항 연안여객터미널 (항동7가) | 032-889-1911 | 2023-03-31 | 2023-08-12
57 | 58 | 시원국제상무(Siwon International Business) | 종합여행업 | 인천광역시 중구 영종대로196번길 15-7, 스카이탑 116호 (운서동) | <NA> | 2016-08-24 | 2023-08-12
58 | 59 | 인천그린모빌리티 주식회사 | 종합여행업 | 인천광역시 중구 축항대로290번길 124, 신흥교통(주) 1동 2층 (신흥동3가) | <NA> | 2023-06-08 | 2023-08-12
59 | 60 | 주식회사 한성투어 | 종합여행업 | 인천광역시 중구 연안부두로 27, 인천수산물센타 1014호 (항동7가) | 032-888-8973 | 2023-07-24 | 2023-08-12