Overview

Dataset statistics

Number of variables6
Number of observations319
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.4 KiB
Average record size in memory49.4 B

Variable types

Numeric1
DateTime1
Categorical1
Text3

Dataset

Description송파구 관광사업체 현황으로 문화관광업종, 상호명, 주소, 전화번호 등 정보
Author서울특별시 송파구
URLhttps://www.data.go.kr/data/15037186/fileData.do

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:40:05.948108
Analysis finished2023-12-12 19:40:06.777641
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct319
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean160
Minimum1
Maximum319
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2023-12-13T04:40:06.873504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile16.9
Q180.5
median160
Q3239.5
95-th percentile303.1
Maximum319
Range318
Interquartile range (IQR)159

Descriptive statistics

Standard deviation92.231593
Coefficient of variation (CV)0.57644745
Kurtosis-1.2
Mean160
Median Absolute Deviation (MAD)80
Skewness0
Sum51040
Variance8506.6667
MonotonicityStrictly increasing
2023-12-13T04:40:07.069340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
2 1
 
0.3%
219 1
 
0.3%
218 1
 
0.3%
217 1
 
0.3%
216 1
 
0.3%
215 1
 
0.3%
214 1
 
0.3%
213 1
 
0.3%
212 1
 
0.3%
Other values (309) 309
96.9%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
319 1
0.3%
318 1
0.3%
317 1
0.3%
316 1
0.3%
315 1
0.3%
314 1
0.3%
313 1
0.3%
312 1
0.3%
311 1
0.3%
310 1
0.3%
Distinct260
Distinct (%)81.5%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum1988-06-02 00:00:00
Maximum2020-09-10 00:00:00
2023-12-13T04:40:07.237667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:40:07.435090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
국외여행업
148 
일반여행업
100 
국내여행업
71 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국외여행업 148
46.4%
일반여행업 100
31.3%
국내여행업 71
22.3%

Length

2023-12-13T04:40:07.608915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:40:07.734403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국외여행업 148
46.4%
일반여행업 100
31.3%
국내여행업 71
22.3%

상호
Text

Distinct269
Distinct (%)84.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-13T04:40:08.025314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17
Mean length8.3793103
Min length2

Characters and Unicode

Total characters2673
Distinct characters335
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)69.0%

Sample

1st row세화여행사
2nd row(주)자유투어잠실
3rd row(주)여행상자
4th row(주)제세관광여행사
5th row(주)예스버스관광
ValueCountFrequency (%)
주식회사 72
 
16.9%
여행사 4
 
0.9%
주)제로쿨투어 3
 
0.7%
tour 3
 
0.7%
travel 3
 
0.7%
골프큐브 2
 
0.5%
더베스트여행사 2
 
0.5%
하나로여행사 2
 
0.5%
제이엠어게인투어 2
 
0.5%
주)시도여행사 2
 
0.5%
Other values (282) 330
77.6%
2023-12-13T04:40:08.553926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
212
 
7.9%
( 145
 
5.4%
) 145
 
5.4%
124
 
4.6%
109
 
4.1%
106
 
4.0%
104
 
3.9%
89
 
3.3%
74
 
2.8%
72
 
2.7%
Other values (325) 1493
55.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2176
81.4%
Open Punctuation 145
 
5.4%
Close Punctuation 145
 
5.4%
Space Separator 106
 
4.0%
Uppercase Letter 64
 
2.4%
Lowercase Letter 34
 
1.3%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
212
 
9.7%
124
 
5.7%
109
 
5.0%
104
 
4.8%
89
 
4.1%
74
 
3.4%
72
 
3.3%
68
 
3.1%
57
 
2.6%
56
 
2.6%
Other values (284) 1211
55.7%
Uppercase Letter
ValueCountFrequency (%)
T 9
14.1%
E 5
 
7.8%
A 5
 
7.8%
M 4
 
6.2%
B 4
 
6.2%
K 4
 
6.2%
R 4
 
6.2%
N 4
 
6.2%
S 3
 
4.7%
L 3
 
4.7%
Other values (11) 19
29.7%
Lowercase Letter
ValueCountFrequency (%)
o 6
17.6%
r 5
14.7%
u 4
11.8%
l 3
8.8%
a 3
8.8%
n 2
 
5.9%
v 2
 
5.9%
e 2
 
5.9%
k 1
 
2.9%
s 1
 
2.9%
Other values (5) 5
14.7%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
. 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 145
100.0%
Close Punctuation
ValueCountFrequency (%)
) 145
100.0%
Space Separator
ValueCountFrequency (%)
106
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2176
81.4%
Common 399
 
14.9%
Latin 98
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
212
 
9.7%
124
 
5.7%
109
 
5.0%
104
 
4.8%
89
 
4.1%
74
 
3.4%
72
 
3.3%
68
 
3.1%
57
 
2.6%
56
 
2.6%
Other values (284) 1211
55.7%
Latin
ValueCountFrequency (%)
T 9
 
9.2%
o 6
 
6.1%
E 5
 
5.1%
r 5
 
5.1%
A 5
 
5.1%
M 4
 
4.1%
B 4
 
4.1%
u 4
 
4.1%
K 4
 
4.1%
R 4
 
4.1%
Other values (26) 48
49.0%
Common
ValueCountFrequency (%)
( 145
36.3%
) 145
36.3%
106
26.6%
& 2
 
0.5%
. 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2176
81.4%
ASCII 497
 
18.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
212
 
9.7%
124
 
5.7%
109
 
5.0%
104
 
4.8%
89
 
4.1%
74
 
3.4%
72
 
3.3%
68
 
3.1%
57
 
2.6%
56
 
2.6%
Other values (284) 1211
55.7%
ASCII
ValueCountFrequency (%)
( 145
29.2%
) 145
29.2%
106
21.3%
T 9
 
1.8%
o 6
 
1.2%
E 5
 
1.0%
r 5
 
1.0%
A 5
 
1.0%
M 4
 
0.8%
B 4
 
0.8%
Other values (31) 63
12.7%
Distinct273
Distinct (%)85.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-13T04:40:08.932987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length47
Mean length38.275862
Min length21

Characters and Unicode

Total characters12210
Distinct characters250
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique228 ?
Unique (%)71.5%

Sample

1st row서울특별시 송파구 송이로20길 10, 3층 (가락동, 리치월드빌딩)
2nd row서울특별시 송파구 송파대로 468, 2층 202호 (송파동, 금남빌딩)
3rd row서울특별시 송파구 충민로 10, 가든파이브툴 지하 1층 (문정동)
4th row서울특별시 송파구 오금로31가길 20, 203호 (방이동, 대림아파트상가)
5th row서울특별시 송파구 위례순환로 478, 센트라포레 1004호 (장지동)
ValueCountFrequency (%)
서울특별시 319
 
13.8%
송파구 319
 
13.8%
문정동 95
 
4.1%
잠실동 52
 
2.3%
가락동 49
 
2.1%
충민로 35
 
1.5%
방이동 34
 
1.5%
송파대로 28
 
1.2%
66 27
 
1.2%
올림픽로 26
 
1.1%
Other values (576) 1320
57.3%
2023-12-13T04:40:09.556695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1990
 
16.3%
455
 
3.7%
1 420
 
3.4%
412
 
3.4%
, 412
 
3.4%
393
 
3.2%
344
 
2.8%
330
 
2.7%
322
 
2.6%
( 321
 
2.6%
Other values (240) 6811
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7035
57.6%
Space Separator 1990
 
16.3%
Decimal Number 1933
 
15.8%
Other Punctuation 412
 
3.4%
Open Punctuation 321
 
2.6%
Close Punctuation 321
 
2.6%
Uppercase Letter 126
 
1.0%
Dash Punctuation 60
 
0.5%
Lowercase Letter 10
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
455
 
6.5%
412
 
5.9%
393
 
5.6%
344
 
4.9%
330
 
4.7%
322
 
4.6%
320
 
4.5%
319
 
4.5%
319
 
4.5%
319
 
4.5%
Other values (203) 3502
49.8%
Uppercase Letter
ValueCountFrequency (%)
B 25
19.8%
A 18
14.3%
T 14
11.1%
S 11
8.7%
C 9
 
7.1%
F 8
 
6.3%
U 6
 
4.8%
N 5
 
4.0%
G 5
 
4.0%
D 4
 
3.2%
Other values (8) 21
16.7%
Decimal Number
ValueCountFrequency (%)
1 420
21.7%
2 303
15.7%
0 264
13.7%
3 181
9.4%
6 172
8.9%
5 145
 
7.5%
4 132
 
6.8%
8 122
 
6.3%
9 100
 
5.2%
7 94
 
4.9%
Lowercase Letter
ValueCountFrequency (%)
s 6
60.0%
m 3
30.0%
t 1
 
10.0%
Space Separator
ValueCountFrequency (%)
1990
100.0%
Other Punctuation
ValueCountFrequency (%)
, 412
100.0%
Open Punctuation
ValueCountFrequency (%)
( 321
100.0%
Close Punctuation
ValueCountFrequency (%)
) 321
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7035
57.6%
Common 5039
41.3%
Latin 136
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
455
 
6.5%
412
 
5.9%
393
 
5.6%
344
 
4.9%
330
 
4.7%
322
 
4.6%
320
 
4.5%
319
 
4.5%
319
 
4.5%
319
 
4.5%
Other values (203) 3502
49.8%
Latin
ValueCountFrequency (%)
B 25
18.4%
A 18
13.2%
T 14
10.3%
S 11
 
8.1%
C 9
 
6.6%
F 8
 
5.9%
s 6
 
4.4%
U 6
 
4.4%
N 5
 
3.7%
G 5
 
3.7%
Other values (11) 29
21.3%
Common
ValueCountFrequency (%)
1990
39.5%
1 420
 
8.3%
, 412
 
8.2%
( 321
 
6.4%
) 321
 
6.4%
2 303
 
6.0%
0 264
 
5.2%
3 181
 
3.6%
6 172
 
3.4%
5 145
 
2.9%
Other values (6) 510
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7035
57.6%
ASCII 5175
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1990
38.5%
1 420
 
8.1%
, 412
 
8.0%
( 321
 
6.2%
) 321
 
6.2%
2 303
 
5.9%
0 264
 
5.1%
3 181
 
3.5%
6 172
 
3.3%
5 145
 
2.8%
Other values (27) 646
 
12.5%
Hangul
ValueCountFrequency (%)
455
 
6.5%
412
 
5.9%
393
 
5.6%
344
 
4.9%
330
 
4.7%
322
 
4.6%
320
 
4.5%
319
 
4.5%
319
 
4.5%
319
 
4.5%
Other values (203) 3502
49.8%
Distinct262
Distinct (%)82.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-13T04:40:09.928234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length3
Mean length3.338558
Min length2

Characters and Unicode

Total characters1065
Distinct characters161
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique207 ?
Unique (%)64.9%

Sample

1st row홍근성
2nd row김형년
3rd row정윤
4th row김용민
5th row최준영
ValueCountFrequency (%)
13
 
3.6%
13
 
3.6%
1 9
 
2.5%
2 4
 
1.1%
김형년 3
 
0.8%
박광수 3
 
0.8%
김성광 2
 
0.6%
김향일 2
 
0.6%
이준혁 2
 
0.6%
김현진 2
 
0.6%
Other values (257) 309
85.4%
2023-12-13T04:40:10.517784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
74
 
6.9%
43
 
4.0%
43
 
4.0%
33
 
3.1%
30
 
2.8%
24
 
2.3%
24
 
2.3%
23
 
2.2%
20
 
1.9%
20
 
1.9%
Other values (151) 731
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 985
92.5%
Space Separator 43
 
4.0%
Uppercase Letter 20
 
1.9%
Decimal Number 13
 
1.2%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
74
 
7.5%
43
 
4.4%
33
 
3.4%
30
 
3.0%
24
 
2.4%
24
 
2.4%
23
 
2.3%
20
 
2.0%
20
 
2.0%
19
 
1.9%
Other values (137) 675
68.5%
Uppercase Letter
ValueCountFrequency (%)
N 4
20.0%
E 2
10.0%
I 2
10.0%
K 2
10.0%
O 2
10.0%
S 2
10.0%
U 2
10.0%
H 2
10.0%
C 2
10.0%
Decimal Number
ValueCountFrequency (%)
1 9
69.2%
2 4
30.8%
Space Separator
ValueCountFrequency (%)
43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 985
92.5%
Common 60
 
5.6%
Latin 20
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
74
 
7.5%
43
 
4.4%
33
 
3.4%
30
 
3.0%
24
 
2.4%
24
 
2.4%
23
 
2.3%
20
 
2.0%
20
 
2.0%
19
 
1.9%
Other values (137) 675
68.5%
Latin
ValueCountFrequency (%)
N 4
20.0%
E 2
10.0%
I 2
10.0%
K 2
10.0%
O 2
10.0%
S 2
10.0%
U 2
10.0%
H 2
10.0%
C 2
10.0%
Common
ValueCountFrequency (%)
43
71.7%
1 9
 
15.0%
2 4
 
6.7%
( 2
 
3.3%
) 2
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 985
92.5%
ASCII 80
 
7.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
74
 
7.5%
43
 
4.4%
33
 
3.4%
30
 
3.0%
24
 
2.4%
24
 
2.4%
23
 
2.3%
20
 
2.0%
20
 
2.0%
19
 
1.9%
Other values (137) 675
68.5%
ASCII
ValueCountFrequency (%)
43
53.8%
1 9
 
11.2%
N 4
 
5.0%
2 4
 
5.0%
E 2
 
2.5%
( 2
 
2.5%
I 2
 
2.5%
K 2
 
2.5%
O 2
 
2.5%
S 2
 
2.5%
Other values (4) 8
 
10.0%

Interactions

2023-12-13T04:40:06.429890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:40:10.647865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.952
업종0.9521.000
2023-12-13T04:40:10.765095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.936
업종0.9361.000

Missing values

2023-12-13T04:40:06.596140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:40:06.724364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번등록일자업종상호소재지(도로명)대표자성명
011996-11-22국내여행업세화여행사서울특별시 송파구 송이로20길 10, 3층 (가락동, 리치월드빌딩)홍근성
121999-03-02국내여행업(주)자유투어잠실서울특별시 송파구 송파대로 468, 2층 202호 (송파동, 금남빌딩)김형년
232002-01-15국내여행업(주)여행상자서울특별시 송파구 충민로 10, 가든파이브툴 지하 1층 (문정동)정윤
342003-05-12국내여행업(주)제세관광여행사서울특별시 송파구 오금로31가길 20, 203호 (방이동, 대림아파트상가)김용민
452004-12-15국내여행업(주)예스버스관광서울특별시 송파구 위례순환로 478, 센트라포레 1004호 (장지동)최준영
562005-01-17국내여행업(주)타임투트레블서울특별시 송파구 올림픽로 212 (잠실동,갤러리아팰리스 지하 1층 8호)현경환
672005-11-23국내여행업(주)하나티엔에스서울특별시 송파구 중대로 24, 110-1호 (문정동, 훼미리상가)임동수
782007-04-02국내여행업(주)호산나투어서울특별시 송파구 가락동 795 밀리아나오피스텔 1401호정성영
892007-07-03국내여행업인터내셔날 에스오에스 코리아(주)서울특별시 송파구 백제고분로 69 (잠실동,애플타워 15층)마크아타웨이
9102007-08-01국내여행업(주)용화관광서울특별시 송파구 송파대로49길 33, 세화빌딩 302호 (석촌동)이기석
연번등록일자업종상호소재지(도로명)대표자성명
3093102020-02-21일반여행업(주)호텔마켓서울특별시 송파구 정의로7길 13, 힐스테이트에코송파 717호 (문정동)전민수
3103112020-03-09일반여행업(주)더크라운서울특별시 송파구 위례성대로 6, 현대토픽스 1014호 (방이동)이다은
3113122016-11-30일반여행업(주)마음챙김여행 백락투어서울특별시 송파구 송파대로 260, 제일오피스텔 812호 (가락동)박동주
3123132020-05-01일반여행업연하국제여행사서울특별시 송파구 송파대로 145, 문정 오벨리스크 425호 (문정동)김지나
3133142020-07-20일반여행업토리브엔터테인먼트서울특별시 송파구 위례성대로 6, 현대토픽스 1904호 (방이동)권경상
3143152020-08-12일반여행업가든투어서울특별시 송파구 충민로 66, 가든파이브라이프 Y-9082호 (문정동)유병성
3153162008-01-09일반여행업(주)비룡항공여행사서울특별시 송파구 송파대로44길 5, 아산빌딩 2층 (송파동)오창석
3163172017-11-06일반여행업서울공항리무진(주)서울특별시 송파구 충민로6길 67, 삼광교통(주)사옥 3층호 (장지동)김석균
3173182013-12-16일반여행업그린유스투어서울특별시 송파구 송파대로 145, 문정 오벨리스크 220호 (문정동)김미연
3183192007-03-14국외여행업(주)채널라인서울특별시 송파구 위례성대로22길 27-22 (오금동,서울레저 관광타운)조정혜