Overview

Dataset statistics

Number of variables: 7
Number of observations: 85
Missing cells: 51
Missing cells (%): 8.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.8 KiB
Average record size in memory: 57.6 B
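
The percentage and size figures above follow from the raw counts. A minimal Python sketch of that arithmetic, assuming that the missing-cell percentage is taken over all cells (observations × variables) and that the total size is the average record size times the number of rows:

```python
# Counts copied from the Dataset statistics above.
n_variables = 7
n_observations = 85
missing_cells = 51
avg_record_bytes = 57.6

# Assumption: missing cells (%) = missing cells / (rows * columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size = average record size * number of rows.
total_bytes = avg_record_bytes * n_observations
total_kib = total_bytes / 1024

print(f"cells={total_cells} missing={missing_pct:.1f}% size={total_kib:.1f} KiB")
```

Under these assumptions the computed values round to the reported 8.6% and 4.8 KiB.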

Variable types

Categorical: 2
Text: 3
DateTime: 2

Dataset

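Description: Data on the status of tourism businesses in Hanam-si, Gyeonggi-do (경기도 하남시). It provides fields such as business type (업종), business name (상호), postal code, street address (소재지(도로명)), phone number (전화번호), and operating status (영업상태).
Author: 경기도 하남시 (Hanam-si, Gyeonggi-do)
URL: https://www.data.go.kr/data/3047295/fileData.do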

Alerts

영업상태 has constant value "영업중" (Constant)
기준일자 has constant value "2022-08-11" (Constant)
전화번호 has 51 (60.0%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 16:29:52.084762
Analysis finished: 2023-12-12 16:29:52.905317
Duration: 0.82 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종
Categorical

Distinct: 10
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 812.0 B

국내외여행업: 25
종합여행업: 20
기타유원시설업: 15
국내여행업: 13
국제회의기획업: 5
Other values (5): 7

Length

Max length: 11
Median length: 7
Mean length: 5.8823529
Min length: 5
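
The length statistics can be cross-checked against the value counts listed under Common Values for this variable. The sketch below is a reconstruction, not the tool's own code: it assumes that length means the number of Unicode code points per value (Python's `len`) and that every row is represented by the counts in that table.

```python
from statistics import mean, median

# Value counts copied from the Common Values table for 업종.
counts = {
    "국내외여행업": 25,
    "종합여행업": 20,
    "기타유원시설업": 15,
    "국내여행업": 13,
    "국제회의기획업": 5,
    "관광숙박업": 2,
    "일반야영장업": 2,
    "외국인관광 도시민박업": 1,
    "한옥체험업": 1,
    "종합유원시설업": 1,
}

# One length per observation; len() counts code points, including the space.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(len(lengths), min(lengths), max(lengths), round(mean(lengths), 7), median(lengths))
```

With these counts the minimum, maximum, and mean agree with the reported 5, 11, and 5.8823529. The plain median of the reconstructed lengths comes out as 6 rather than the reported 7, so the reported median may be computed differently from a simple median over row lengths.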

Unique

Unique: 3
Unique (%): 3.5%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
국내외여행업 | 25 | 29.4%
종합여행업 | 20 | 23.5%
기타유원시설업 | 15 | 17.6%
국내여행업 | 13 | 15.3%
국제회의기획업 | 5 | 5.9%
관광숙박업 | 2 | 2.4%
일반야영장업 | 2 | 2.4%
외국인관광 도시민박업 | 1 | 1.2%
한옥체험업 | 1 | 1.2%
종합유원시설업 | 1 | 1.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
국내외여행업 | 25 | 29.1%
종합여행업 | 20 | 23.3%
기타유원시설업 | 15 | 17.4%
국내여행업 | 13 | 15.1%
국제회의기획업 | 5 | 5.8%
관광숙박업 | 2 | 2.3%
일반야영장업 | 2 | 2.3%
외국인관광 | 1 | 1.2%
도시민박업 | 1 | 1.2%
한옥체험업 | 1 | 1.2%
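
The two frequency tables for 업종 carry the same counts but slightly different percentages (29.4% versus 29.1% for 국내외여행업). One reading, offered as an assumption rather than as documented tool behaviour, is that the plot-side table counts whitespace-separated tokens, so 외국인관광 도시민박업 contributes two tokens and the denominator becomes 86 instead of 85. A short Python sketch of both calculations:

```python
from collections import Counter

# Value counts copied from the Common Values table for 업종.
counts = {
    "국내외여행업": 25,
    "종합여행업": 20,
    "기타유원시설업": 15,
    "국내여행업": 13,
    "국제회의기획업": 5,
    "관광숙박업": 2,
    "일반야영장업": 2,
    "외국인관광 도시민박업": 1,
    "한옥체험업": 1,
    "종합유원시설업": 1,
}


def percentages(counter):
    """Return each key's share of the total, as a percentage rounded to 1 decimal."""
    total = sum(counter.values())
    return {key: round(100 * n / total, 1) for key, n in counter.items()}


# Per-row frequencies: every value counts once per row (denominator 85).
by_value = percentages(Counter(counts))

# Assumption: the plot table splits each value on whitespace before counting.
token_counts = Counter()
for value, n in counts.items():
    for token in value.split():
        token_counts[token] += n
by_token = percentages(token_counts)

print(by_value["국내외여행업"], by_token["국내외여행업"])
```

Under that assumption the token-based percentages reproduce the second table, including the separate 외국인관광 and 도시민박업 rows.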

상호
Text

Distinct: 77
Distinct (%): 90.6%
Missing: 0
Missing (%): 0.0%
Memory size: 812.0 B

Length

Max length: 21
Median length: 15
Mean length: 8.4235294
Min length: 2

Characters and Unicode

Total characters: 716
Distinct characters: 196
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
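
As an illustration of what a per-category count looks like, the following Python sketch tallies Unicode general categories with the standard-library `unicodedata` module. It runs only on the five sample values listed for this variable, so its totals are not the report's totals, and it is not claimed to be how the profiling tool computes them. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation, and `Zs` is Space Separator.

```python
import unicodedata
from collections import Counter

# The five sample values shown for 상호 in this report.
samples = [
    "(주)해피투어",
    "신화관광개발(주)",
    "(주)캐슬렉스서울",
    "(주)대현여행사",
    "주식회사 어울림씨앤씨",
]


def category_counts(values):
    """Count Unicode general categories over every character in the values."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


cats = category_counts(samples)
for code, n in cats.most_common():
    print(code, n)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category table for this variable.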

Unique

Unique: 69
Unique (%): 81.2%

Sample

1st row: (주)해피투어
2nd row: 신화관광개발(주)
3rd row: (주)캐슬렉스서울
4th row: (주)대현여행사
5th row: 주식회사 어울림씨앤씨

Value | Count | Frequency (%)
주식회사 | 14 | 11.8%
하남 | 3 | 2.5%
주)해피투어 | 2 | 1.7%
자이투어(주 | 2 | 1.7%
진선관광 | 2 | 1.7%
에이티월드(주 | 2 | 1.7%
신화관광개발(주 | 2 | 1.7%
열린투어 | 2 | 1.7%
주)캐슬렉스서울 | 2 | 1.7%
호텔 | 2 | 1.7%
Other values (84) | 86 | 72.3%

Most occurring characters

ValueCountFrequency (%)
48
 
6.7%
( 37
 
5.2%
) 37
 
5.2%
34
 
4.7%
27
 
3.8%
25
 
3.5%
22
 
3.1%
18
 
2.5%
17
 
2.4%
15
 
2.1%
Other values (186) 436
60.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 593
82.8%
Open Punctuation 37
 
5.2%
Close Punctuation 37
 
5.2%
Space Separator 34
 
4.7%
Uppercase Letter 8
 
1.1%
Lowercase Letter 7
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
8.1%
27
 
4.6%
25
 
4.2%
22
 
3.7%
18
 
3.0%
17
 
2.9%
15
 
2.5%
15
 
2.5%
12
 
2.0%
12
 
2.0%
Other values (172) 382
64.4%
Uppercase Letter
ValueCountFrequency (%)
N 2
25.0%
K 2
25.0%
B 1
12.5%
Z 1
12.5%
I 1
12.5%
O 1
12.5%
Lowercase Letter
ValueCountFrequency (%)
t 2
28.6%
o 2
28.6%
a 1
14.3%
u 1
14.3%
r 1
14.3%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 593
82.8%
Common 108
 
15.1%
Latin 15
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
8.1%
27
 
4.6%
25
 
4.2%
22
 
3.7%
18
 
3.0%
17
 
2.9%
15
 
2.5%
15
 
2.5%
12
 
2.0%
12
 
2.0%
Other values (172) 382
64.4%
Latin
ValueCountFrequency (%)
N 2
13.3%
t 2
13.3%
K 2
13.3%
o 2
13.3%
B 1
6.7%
Z 1
6.7%
I 1
6.7%
O 1
6.7%
a 1
6.7%
u 1
6.7%
Common
ValueCountFrequency (%)
( 37
34.3%
) 37
34.3%
34
31.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 593
82.8%
ASCII 123
 
17.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
48
 
8.1%
27
 
4.6%
25
 
4.2%
22
 
3.7%
18
 
3.0%
17
 
2.9%
15
 
2.5%
15
 
2.5%
12
 
2.0%
12
 
2.0%
Other values (172) 382
64.4%
ASCII
ValueCountFrequency (%)
( 37
30.1%
) 37
30.1%
34
27.6%
N 2
 
1.6%
t 2
 
1.6%
K 2
 
1.6%
o 2
 
1.6%
B 1
 
0.8%
Z 1
 
0.8%
I 1
 
0.8%
Other values (4) 4
 
3.3%

소재지(도로명)
Text

Distinct: 74
Distinct (%): 87.1%
Missing: 0
Missing (%): 0.0%
Memory size: 812.0 B

Length

Max length: 65
Median length: 45
Mean length: 38.611765
Min length: 20

Characters and Unicode

Total characters: 3282
Distinct characters: 170
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 64
Unique (%): 75.3%

Sample

1st row: 경기도 하남시 미사강변한강로 170, 1110동 1401호 (망월동, 미사강변 한신휴플러스)
2nd row: 경기도 하남시 대청로 26 (신장동)
3rd row: 경기도 하남시 감이로 317 (감이동)
4th row: 경기도 하남시 조정대로 150, 545호 (덕풍동, 아이테코)
5th row: 경기도 하남시 서하남로48번길 50, 3층 (감일동)

Value | Count | Frequency (%)
경기도 | 85 | 13.3%
하남시 | 85 | 13.3%
망월동 | 24 | 3.7%
풍산동 | 19 | 3.0%
덕풍동 | 15 | 2.3%
미사대로 | 14 | 2.2%
신장동 | 13 | 2.0%
미사강변서로 | 7 | 1.1%
미사 | 7 | 1.1%
25 | 7 | 1.1%
Other values (231) | 365 | 56.9%

Most occurring characters

ValueCountFrequency (%)
557
 
17.0%
1 129
 
3.9%
120
 
3.7%
111
 
3.4%
109
 
3.3%
0 97
 
3.0%
89
 
2.7%
87
 
2.7%
) 87
 
2.7%
, 87
 
2.7%
Other values (160) 1809
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1820
55.5%
Decimal Number 564
 
17.2%
Space Separator 557
 
17.0%
Close Punctuation 87
 
2.7%
Other Punctuation 87
 
2.7%
Open Punctuation 87
 
2.7%
Uppercase Letter 61
 
1.9%
Dash Punctuation 13
 
0.4%
Math Symbol 5
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
6.6%
111
 
6.1%
109
 
6.0%
89
 
4.9%
87
 
4.8%
86
 
4.7%
85
 
4.7%
84
 
4.6%
77
 
4.2%
76
 
4.2%
Other values (130) 896
49.2%
Uppercase Letter
ValueCountFrequency (%)
E 10
16.4%
B 8
13.1%
C 7
11.5%
A 6
9.8%
T 6
9.8%
N 5
8.2%
U 5
8.2%
R 5
8.2%
F 3
 
4.9%
D 3
 
4.9%
Other values (3) 3
 
4.9%
Decimal Number
ValueCountFrequency (%)
1 129
22.9%
0 97
17.2%
5 63
11.2%
2 63
11.2%
3 50
 
8.9%
4 44
 
7.8%
7 41
 
7.3%
9 29
 
5.1%
6 28
 
5.0%
8 20
 
3.5%
Space Separator
ValueCountFrequency (%)
557
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%
Other Punctuation
ValueCountFrequency (%)
, 87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 87
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1820
55.5%
Common 1400
42.7%
Latin 62
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
6.6%
111
 
6.1%
109
 
6.0%
89
 
4.9%
87
 
4.8%
86
 
4.7%
85
 
4.7%
84
 
4.6%
77
 
4.2%
76
 
4.2%
Other values (130) 896
49.2%
Common
ValueCountFrequency (%)
557
39.8%
1 129
 
9.2%
0 97
 
6.9%
) 87
 
6.2%
, 87
 
6.2%
( 87
 
6.2%
5 63
 
4.5%
2 63
 
4.5%
3 50
 
3.6%
4 44
 
3.1%
Other values (6) 136
 
9.7%
Latin
ValueCountFrequency (%)
E 10
16.1%
B 8
12.9%
C 7
11.3%
A 6
9.7%
T 6
9.7%
N 5
8.1%
U 5
8.1%
R 5
8.1%
F 3
 
4.8%
D 3
 
4.8%
Other values (4) 4
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1820
55.5%
ASCII 1462
44.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
557
38.1%
1 129
 
8.8%
0 97
 
6.6%
) 87
 
6.0%
, 87
 
6.0%
( 87
 
6.0%
5 63
 
4.3%
2 63
 
4.3%
3 50
 
3.4%
4 44
 
3.0%
Other values (20) 198
 
13.5%
Hangul
ValueCountFrequency (%)
120
 
6.6%
111
 
6.1%
109
 
6.0%
89
 
4.9%
87
 
4.8%
86
 
4.7%
85
 
4.7%
84
 
4.6%
77
 
4.2%
76
 
4.2%
Other values (130) 896
49.2%

전화번호
Text

MISSING 

Distinct: 31
Distinct (%): 91.2%
Missing: 51
Missing (%): 60.0%
Memory size: 812.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.794118
Min length: 8

Characters and Unicode

Total characters: 401
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 28
Unique (%): 82.4%
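
For a column with many missing values, the denominators matter. The figures reported for 전화번호 are consistent with the missing percentage being taken over all 85 rows, while the distinct and unique percentages are taken over the 34 non-missing rows. The Python sketch below checks that reading; the choice of denominators is an inference from the numbers, not something the report states.

```python
# Counts copied from the 전화번호 summary in this report.
n_observations = 85
n_missing = 51
n_distinct = 31
n_unique = 28

# Rows that actually carry a phone number.
n_present = n_observations - n_missing

# Assumption: missing (%) uses all rows; distinct/unique (%) use present rows.
missing_pct = round(100 * n_missing / n_observations, 1)
distinct_pct = round(100 * n_distinct / n_present, 1)
unique_pct = round(100 * n_unique / n_present, 1)

# Distinct values that are not unique occur more than once.
n_repeated_values = n_distinct - n_unique

print(n_present, missing_pct, distinct_pct, unique_pct, n_repeated_values)
```

The three repeated values implied here match the three phone numbers that appear twice in the value table for this variable (28 + 3 × 2 = 34 present rows).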

Sample

1st row: 031-792-8123
2nd row: 031-796-7300
3rd row: 02-480-5600
4th row: 031-794-2999
5th row: 031-591-7657

Value | Count | Frequency (%)
031-796-7300 | 2 | 5.9%
070-4814-1161 | 2 | 5.9%
031-792-8123 | 2 | 5.9%
1577-7355 | 1 | 2.9%
1833-9001 | 1 | 2.9%
793-7799 | 1 | 2.9%
031-794-4778 | 1 | 2.9%
031-8072-8451 | 1 | 2.9%
031-5175-6195 | 1 | 2.9%
031-791-2997 | 1 | 2.9%
Other values (21) | 21 | 61.8%

Most occurring characters

ValueCountFrequency (%)
0 75
18.7%
- 59
14.7%
1 49
12.2%
7 42
10.5%
3 40
10.0%
8 31
7.7%
2 29
 
7.2%
9 27
 
6.7%
5 19
 
4.7%
6 15
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 342
85.3%
Dash Punctuation 59
 
14.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 75
21.9%
1 49
14.3%
7 42
12.3%
3 40
11.7%
8 31
9.1%
2 29
 
8.5%
9 27
 
7.9%
5 19
 
5.6%
6 15
 
4.4%
4 15
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 401
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 75
18.7%
- 59
14.7%
1 49
12.2%
7 42
10.5%
3 40
10.0%
8 31
7.7%
2 29
 
7.2%
9 27
 
6.7%
5 19
 
4.7%
6 15
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 401
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 75
18.7%
- 59
14.7%
1 49
12.2%
7 42
10.5%
3 40
10.0%
8 31
7.7%
2 29
 
7.2%
9 27
 
6.7%
5 19
 
4.7%
6 15
 
3.7%

영업상태
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 812.0 B

영업중: 85

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업중
2nd row: 영업중
3rd row: 영업중
4th row: 영업중
5th row: 영업중

Common Values

Value | Count | Frequency (%)
영업중 | 85 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
영업중 | 85 | 100.0%

등록일자
Date

Distinct: 77
Distinct (%): 90.6%
Missing: 0
Missing (%): 0.0%
Memory size: 812.0 B
Minimum: 2002-10-16 00:00:00
Maximum: 2022-06-30 00:00:00
Histogram with fixed size bins (bins=50)
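
To give a sense of scale for the 50 fixed-size bins, the sketch below lays out equal-width bin edges between the reported minimum and maximum dates. It assumes the bins span exactly the minimum-to-maximum range with equal widths; the report does not show the tool's actual bin edges.

```python
from datetime import datetime

# Minimum and maximum copied from the date summary above.
minimum = datetime(2002, 10, 16)
maximum = datetime(2022, 6, 30)
bins = 50

# Assumption: equal-width bins covering [minimum, maximum].
span = maximum - minimum
bin_width = span / bins
edges = [minimum + i * bin_width for i in range(bins + 1)]

print("span in days:", span.days)
print("bin width:", bin_width)
print("first edges:", edges[0].date(), edges[1].date())
```

Under that assumption each bar covers a little under five months of registrations.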

기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 812.0 B
Minimum: 2022-08-11 00:00:00
Maximum: 2022-08-11 00:00:00
Histogram with fixed size bins (bins=1)

Correlations

 | 업종 | 상호 | 소재지(도로명) | 전화번호 | 등록일자
업종 | 1.000 | 0.952 | 0.503 | 0.940 | 0.963
상호 | 0.952 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 0.503 | 1.000 | 1.000 | 1.000 | 0.999
전화번호 | 0.940 | 1.000 | 1.000 | 1.000 | 1.000
등록일자 | 0.963 | 1.000 | 0.999 | 1.000 | 1.000
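
The report does not state which association measure this matrix uses. Assuming it is Cramér's V, a common choice for pairs of categorical variables, the following Python sketch shows why nearly unique columns such as 상호 or 전화번호 score close to 1.000 against almost anything: when one column has a different value on every row, it determines the other column perfectly. The function is a plain, uncorrected Cramér's V written for illustration; the toy data and the name `cramers_v` are hypothetical and not taken from the profiling tool.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("xs and ys must be non-empty and of equal length")
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        # A constant column has no variation, so V is undefined.
        return float("nan")
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Toy data (hypothetical): a category, an unrelated category, and a unique id.
category = ["a", "a", "b", "b"]
unrelated = ["x", "y", "x", "y"]
unique_id = ["1", "2", "3", "4"]

print(cramers_v(category, unrelated))
print(cramers_v(category, unique_id))
```

The same reasoning would explain why the constant columns 영업상태 and 기준일자 do not appear in the matrix: with a single distinct value the statistic is undefined.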

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 업종 | 상호 | 소재지(도로명) | 전화번호 | 영업상태 | 등록일자 | 기준일자
0 | 국내여행업 | (주)해피투어 | 경기도 하남시 미사강변한강로 170, 1110동 1401호 (망월동, 미사강변 한신휴플러스) | 031-792-8123 | 영업중 | 2006-10-24 | 2022-08-11
1 | 국내여행업 | 신화관광개발(주) | 경기도 하남시 대청로 26 (신장동) | 031-796-7300 | 영업중 | 2010-10-28 | 2022-08-11
2 | 국내여행업 | (주)캐슬렉스서울 | 경기도 하남시 감이로 317 (감이동) | 02-480-5600 | 영업중 | 2015-12-08 | 2022-08-11
3 | 국내여행업 | (주)대현여행사 | 경기도 하남시 조정대로 150, 545호 (덕풍동, 아이테코) | <NA> | 영업중 | 2007-12-24 | 2022-08-11
4 | 국내여행업 | 주식회사 어울림씨앤씨 | 경기도 하남시 서하남로48번길 50, 3층 (감일동) | <NA> | 영업중 | 2018-01-31 | 2022-08-11
5 | 국내여행업 | 자이투어(주) | 경기도 하남시 미사강변동로 73, 미사강변 노블레스 505호 (망월동) | 031-794-2999 | 영업중 | 2005-10-31 | 2022-08-11
6 | 국내여행업 | (주)에이케이레저 | 경기도 하남시 하남대로 947, 하남테크노밸리 U1 CENTER B동 1303호 (풍산동) | 031-591-7657 | 영업중 | 2016-11-03 | 2022-08-11
7 | 국내여행업 | 진선관광 | 경기도 하남시 위례중앙로 215, 위례롯데캐슬 6403동 804호 (학암동) | <NA> | 영업중 | 2020-03-30 | 2022-08-11
8 | 국내여행업 | 에이티월드(주) | 경기도 하남시 하남대로 947, 하남테크노밸리 U1 CENTER 비동 1211호 (풍산동) | <NA> | 영업중 | 2012-08-25 | 2022-08-11
9 | 국내여행업 | 주식회사 에이치씨인터내셔널 | 경기도 하남시 미사대로 410, 미사강변 오벨리스크 101동 638호 (망월동) | <NA> | 영업중 | 2021-01-06 | 2022-08-11

 | 업종 | 상호 | 소재지(도로명) | 전화번호 | 영업상태 | 등록일자 | 기준일자
75 | 기타유원시설업 | 점핑파크 미사점 | 경기도 하남시 미사강변중앙로204번길 45, 성산타워플러스 701~707호 (망월동) | 793-7799 | 영업중 | 2018-07-26 | 2022-08-11
76 | 기타유원시설업 | 디아망 하남미사점 | 경기도 하남시 미사강변동로 127, 경서타워 12층 1201~1206호 (망월동) | <NA> | 영업중 | 2018-10-12 | 2022-08-11
77 | 기타유원시설업 | 헬로방방 하남풍산점 | 경기도 하남시 덕풍동로 111-21, 케이엔몰 901~903호 (덕풍동) | <NA> | 영업중 | 2018-11-13 | 2022-08-11
78 | 기타유원시설업 | 기드온 주식회사(토이킹덤 스타필드하남) | 경기도 하남시 미사대로 750, 스타필드 하남 3층 토이킹덤호 (신장동) | 1833-9001 | 영업중 | 2019-06-12 | 2022-08-11
79 | 기타유원시설업 | 점핑몬스터 하남미사점 | 경기도 하남시 미사강변대로34번길 100, 미사타워 4층 (풍산동) | 031-794-4778 | 영업중 | 2019-08-07 | 2022-08-11
80 | 기타유원시설업 | (주)프렌즈 | 경기도 하남시 미사강변대로 38, 위너스프라자 비101호 (풍산동) | <NA> | 영업중 | 2020-01-16 | 2022-08-11
81 | 기타유원시설업 | (주)신창어뮤즈먼트 | 경기도 하남시 미사대로 750, 스타필드 하남 3층 (신장동) | <NA> | 영업중 | 2020-05-14 | 2022-08-11
82 | 기타유원시설업 | 효정패밀리아카페 | 경기도 하남시 미사강변대로226번안길 35, 청담프라자 2층 (망월동) | 031-699-1710 | 영업중 | 2021-06-01 | 2022-08-11
83 | 기타유원시설업 | 주식회사 지니아이 | 경기도 하남시 미사강변동로 95, 힐스테이트 미사역 그랑파사쥬(12-1BL) B1069호 (망월동) | <NA> | 영업중 | 2022-04-07 | 2022-08-11
84 | 기타유원시설업 | (주)바운스 하남센터 | 경기도 하남시 미사강변한강로 135, B151, B259호 (망월동) | <NA> | 영업중 | 2022-04-26 | 2022-08-11