Overview

Dataset statistics

Number of variables7
Number of observations118
Missing cells21
Missing cells (%)2.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory57.1 B

Variable types

Categorical3
DateTime1
Text3

Dataset

Description서울특별시 금천구 관내 유흥, 단란주점의 업종명, 인허가일자, 업소명, 소재지(도로명), 소새지전화번호 등의 항목을 제공하고 있습니다.
Author서울특별시 금천구
URLhttps://www.data.go.kr/data/3081161/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
기타사항 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 기타사항High correlation
소재지전화 has 21 (17.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 17:16:37.838926
Analysis finished2023-12-12 17:16:38.632789
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
단란주점
70 
유흥주점영업
48 

Length

Max length6
Median length4
Mean length4.8135593
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유흥주점영업
2nd row유흥주점영업
3rd row유흥주점영업
4th row유흥주점영업
5th row유흥주점영업

Common Values

ValueCountFrequency (%)
단란주점 70
59.3%
유흥주점영업 48
40.7%

Length

2023-12-13T02:16:38.752201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:16:38.870655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단란주점 70
59.3%
유흥주점영업 48
40.7%
Distinct113
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum1973-11-08 00:00:00
Maximum2018-11-13 00:00:00
2023-12-13T02:16:38.976642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:16:39.159537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct113
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T02:16:39.492839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length10
Mean length4.4576271
Min length1

Characters and Unicode

Total characters526
Distinct characters200
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)91.5%

Sample

1st row도화촌
2nd row여우노래바
3rd row보물섬
4th row국일관 스텐드빠
5th row헤네시룸비지니스
ValueCountFrequency (%)
동경 2
 
1.6%
홀리데이 2
 
1.6%
노래바 2
 
1.6%
에쿠스 2
 
1.6%
7080 2
 
1.6%
미팅 2
 
1.6%
맨체스터 2
 
1.6%
스타 2
 
1.6%
도화촌 1
 
0.8%
갈채단란주점 1
 
0.8%
Other values (111) 111
86.0%
2023-12-13T02:16:40.003440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
8.0%
42
 
8.0%
28
 
5.3%
0 20
 
3.8%
15
 
2.9%
11
 
2.1%
8 10
 
1.9%
7 10
 
1.9%
10
 
1.9%
9
 
1.7%
Other values (190) 329
62.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 458
87.1%
Decimal Number 40
 
7.6%
Space Separator 11
 
2.1%
Lowercase Letter 11
 
2.1%
Uppercase Letter 4
 
0.8%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
9.2%
42
 
9.2%
28
 
6.1%
15
 
3.3%
10
 
2.2%
9
 
2.0%
7
 
1.5%
7
 
1.5%
6
 
1.3%
5
 
1.1%
Other values (174) 287
62.7%
Lowercase Letter
ValueCountFrequency (%)
o 4
36.4%
n 2
18.2%
d 1
 
9.1%
e 1
 
9.1%
v 1
 
9.1%
i 1
 
9.1%
t 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
0 20
50.0%
8 10
25.0%
7 10
25.0%
Uppercase Letter
ValueCountFrequency (%)
L 2
50.0%
Z 1
25.0%
S 1
25.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 458
87.1%
Common 53
 
10.1%
Latin 15
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
9.2%
42
 
9.2%
28
 
6.1%
15
 
3.3%
10
 
2.2%
9
 
2.0%
7
 
1.5%
7
 
1.5%
6
 
1.3%
5
 
1.1%
Other values (174) 287
62.7%
Latin
ValueCountFrequency (%)
o 4
26.7%
L 2
13.3%
n 2
13.3%
d 1
 
6.7%
e 1
 
6.7%
v 1
 
6.7%
i 1
 
6.7%
Z 1
 
6.7%
t 1
 
6.7%
S 1
 
6.7%
Common
ValueCountFrequency (%)
0 20
37.7%
11
20.8%
8 10
18.9%
7 10
18.9%
( 1
 
1.9%
) 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 458
87.1%
ASCII 68
 
12.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
9.2%
42
 
9.2%
28
 
6.1%
15
 
3.3%
10
 
2.2%
9
 
2.0%
7
 
1.5%
7
 
1.5%
6
 
1.3%
5
 
1.1%
Other values (174) 287
62.7%
ASCII
ValueCountFrequency (%)
0 20
29.4%
11
16.2%
8 10
14.7%
7 10
14.7%
o 4
 
5.9%
L 2
 
2.9%
n 2
 
2.9%
d 1
 
1.5%
( 1
 
1.5%
e 1
 
1.5%
Other values (6) 6
 
8.8%
Distinct113
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T02:16:40.275620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length42
Mean length31.779661
Min length26

Characters and Unicode

Total characters3750
Distinct characters69
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)91.5%

Sample

1st row서울특별시 금천구 시흥대로59길 10 (시흥동,지하1층 (건영길 6))
2nd row서울특별시 금천구 독산로 364 (독산동,지하1층 (독산동길 3))
3rd row서울특별시 금천구 범안로 1252 (독산동,지하1층 (독산역길 5))
4th row서울특별시 금천구 독산로 353, 지하1층 (독산동)
5th row서울특별시 금천구 남부순환로 1372, 지하1층 (독산동)
ValueCountFrequency (%)
서울특별시 118
16.3%
금천구 118
16.3%
지하1층 77
 
10.6%
독산동 38
 
5.2%
시흥동 38
 
5.2%
시흥대로 25
 
3.4%
가산로 23
 
3.2%
가산동 19
 
2.6%
독산로 15
 
2.1%
금하로 12
 
1.7%
Other values (147) 243
33.5%
2023-12-13T02:16:40.784443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
610
 
16.3%
213
 
5.7%
1 174
 
4.6%
) 133
 
3.5%
( 133
 
3.5%
131
 
3.5%
126
 
3.4%
, 125
 
3.3%
121
 
3.2%
119
 
3.2%
Other values (59) 1865
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2181
58.2%
Space Separator 610
 
16.3%
Decimal Number 557
 
14.9%
Close Punctuation 133
 
3.5%
Open Punctuation 133
 
3.5%
Other Punctuation 125
 
3.3%
Dash Punctuation 7
 
0.2%
Uppercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
213
 
9.8%
131
 
6.0%
126
 
5.8%
121
 
5.5%
119
 
5.5%
119
 
5.5%
119
 
5.5%
118
 
5.4%
118
 
5.4%
118
 
5.4%
Other values (40) 879
40.3%
Decimal Number
ValueCountFrequency (%)
1 174
31.2%
2 87
15.6%
3 65
 
11.7%
5 53
 
9.5%
6 40
 
7.2%
4 35
 
6.3%
0 28
 
5.0%
8 28
 
5.0%
7 26
 
4.7%
9 21
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
D 1
25.0%
N 1
25.0%
I 1
25.0%
M 1
25.0%
Space Separator
ValueCountFrequency (%)
610
100.0%
Close Punctuation
ValueCountFrequency (%)
) 133
100.0%
Open Punctuation
ValueCountFrequency (%)
( 133
100.0%
Other Punctuation
ValueCountFrequency (%)
, 125
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2181
58.2%
Common 1565
41.7%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
213
 
9.8%
131
 
6.0%
126
 
5.8%
121
 
5.5%
119
 
5.5%
119
 
5.5%
119
 
5.5%
118
 
5.4%
118
 
5.4%
118
 
5.4%
Other values (40) 879
40.3%
Common
ValueCountFrequency (%)
610
39.0%
1 174
 
11.1%
) 133
 
8.5%
( 133
 
8.5%
, 125
 
8.0%
2 87
 
5.6%
3 65
 
4.2%
5 53
 
3.4%
6 40
 
2.6%
4 35
 
2.2%
Other values (5) 110
 
7.0%
Latin
ValueCountFrequency (%)
D 1
25.0%
N 1
25.0%
I 1
25.0%
M 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2181
58.2%
ASCII 1569
41.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
610
38.9%
1 174
 
11.1%
) 133
 
8.5%
( 133
 
8.5%
, 125
 
8.0%
2 87
 
5.5%
3 65
 
4.1%
5 53
 
3.4%
6 40
 
2.5%
4 35
 
2.2%
Other values (9) 114
 
7.3%
Hangul
ValueCountFrequency (%)
213
 
9.8%
131
 
6.0%
126
 
5.8%
121
 
5.5%
119
 
5.5%
119
 
5.5%
119
 
5.5%
118
 
5.4%
118
 
5.4%
118
 
5.4%
Other values (40) 879
40.3%

소재지전화
Text

MISSING 

Distinct96
Distinct (%)99.0%
Missing21
Missing (%)17.8%
Memory size1.1 KiB
2023-12-13T02:16:41.110353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length12.649485
Min length11

Characters and Unicode

Total characters1227
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)97.9%

Sample

1st row 02-896-2566
2nd row 02-859-3051
3rd row 02-808-1118
4th row 02-862-0421
5th row02-861-0031
ValueCountFrequency (%)
02-857-6366 2
 
2.1%
02-805-1557 1
 
1.0%
02-896-2566 1
 
1.0%
02-805-3863 1
 
1.0%
02-893-7800 1
 
1.0%
02-807-9237 1
 
1.0%
02-896-1197 1
 
1.0%
02-891-5442 1
 
1.0%
02-853-5890 1
 
1.0%
02-837-4567 1
 
1.0%
Other values (86) 86
88.7%
2023-12-13T02:16:41.648546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 194
15.8%
0 171
13.9%
159
13.0%
8 146
11.9%
2 142
11.6%
5 74
 
6.0%
6 71
 
5.8%
3 64
 
5.2%
1 63
 
5.1%
9 54
 
4.4%
Other values (2) 89
7.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 874
71.2%
Dash Punctuation 194
 
15.8%
Space Separator 159
 
13.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 171
19.6%
8 146
16.7%
2 142
16.2%
5 74
8.5%
6 71
8.1%
3 64
 
7.3%
1 63
 
7.2%
9 54
 
6.2%
7 52
 
5.9%
4 37
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 194
100.0%
Space Separator
ValueCountFrequency (%)
159
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1227
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 194
15.8%
0 171
13.9%
159
13.0%
8 146
11.9%
2 142
11.6%
5 74
 
6.0%
6 71
 
5.8%
3 64
 
5.2%
1 63
 
5.1%
9 54
 
4.4%
Other values (2) 89
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1227
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 194
15.8%
0 171
13.9%
159
13.0%
8 146
11.9%
2 142
11.6%
5 74
 
6.0%
6 71
 
5.8%
3 64
 
5.2%
1 63
 
5.1%
9 54
 
4.4%
Other values (2) 89
7.3%

기타사항
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
<NA>
97 
휴대폰번호임
21 

Length

Max length6
Median length4
Mean length4.3559322
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 97
82.2%
휴대폰번호임 21
 
17.8%

Length

2023-12-13T02:16:41.855419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:16:42.013093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 97
82.2%
휴대폰번호임 21
 
17.8%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-09-13
118 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-09-13
2nd row2023-09-13
3rd row2023-09-13
4th row2023-09-13
5th row2023-09-13

Common Values

ValueCountFrequency (%)
2023-09-13 118
100.0%

Length

2023-12-13T02:16:42.131026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:16:42.219743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-09-13 118
100.0%

Correlations

2023-12-13T02:16:42.281238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명소재지전화
업종명1.0001.000
소재지전화1.0001.000
2023-12-13T02:16:42.383903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기타사항업종명
기타사항1.0001.000
업종명1.0001.000
2023-12-13T02:16:42.487400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명기타사항
업종명1.0001.000
기타사항1.0001.000

Missing values

2023-12-13T02:16:38.464780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:16:38.588814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명인허가일자업소명소재지(도로명)소재지전화기타사항데이터기준일자
0유흥주점영업1973-11-08도화촌서울특별시 금천구 시흥대로59길 10 (시흥동,지하1층 (건영길 6))02-896-2566<NA>2023-09-13
1유흥주점영업1978-09-23여우노래바서울특별시 금천구 독산로 364 (독산동,지하1층 (독산동길 3))02-859-3051<NA>2023-09-13
2유흥주점영업1978-10-21보물섬서울특별시 금천구 범안로 1252 (독산동,지하1층 (독산역길 5))02-808-1118<NA>2023-09-13
3유흥주점영업1978-11-28국일관 스텐드빠서울특별시 금천구 독산로 353, 지하1층 (독산동)02-862-0421<NA>2023-09-13
4유흥주점영업1978-02-03헤네시룸비지니스서울특별시 금천구 남부순환로 1372, 지하1층 (독산동)02-861-0031<NA>2023-09-13
5유흥주점영업1978-10-28국빈관관광나이트크럽서울특별시 금천구 범안로 1209, 지상3층 (독산동)02-895-2245<NA>2023-09-13
6유흥주점영업1979-05-10떳다노래바서울특별시 금천구 시흥대로52길 62 (시흥동, 지하1층)02-893-4827<NA>2023-09-13
7유흥주점영업1980-08-14별난서울특별시 금천구 시흥대로 370 (독산동,지하1층)02-851-8131<NA>2023-09-13
8유흥주점영업1980-09-09보물썸노래바서울특별시 금천구 시흥대로52길 지하 24 (시흥동)02-892-3777<NA>2023-09-13
9유흥주점영업1980-08-14오페라하우스7080서울특별시 금천구 시흥대로 149 (시흥동,지하1층 (시흥대로 536))02-892-5124<NA>2023-09-13
업종명인허가일자업소명소재지(도로명)소재지전화기타사항데이터기준일자
108단란주점2011-06-27종이연 7080 단란주점서울특별시 금천구 시흥대로 228 (시흥동, 지하1층)<NA>휴대폰번호임2023-09-13
109단란주점2013-09-26모모7080서울특별시 금천구 범안로 1209, 지상5층 33호 (독산동, 협진식품빌딩)02-892-2626<NA>2023-09-13
110단란주점2013-10-04Zoot London(쥬트런던)서울특별시 금천구 금하로 614, 지하1층 (시흥동)02-806-7080<NA>2023-09-13
111단란주점2014-07-29힐링서울특별시 금천구 가산로 150, 지하1층 101호 (가산동)<NA>휴대폰번호임2023-09-13
112단란주점2016-12-21쎄시봉7080라이브서울특별시 금천구 시흥대로 220, 지하1층 (시흥동)02-802-3341<NA>2023-09-13
113단란주점2017-05-10맨체스터서울특별시 금천구 시흥대로 222, 지하1층 (시흥동)<NA>휴대폰번호임2023-09-13
114단란주점2017-12-22소풍서울특별시 금천구 금하로 632, 김안과의원 지하1층 (시흥동)02-892-7089<NA>2023-09-13
115단란주점2018-02-28수노래광장서울특별시 금천구 시흥대로52길 62, 지상2층 (시흥동)<NA>휴대폰번호임2023-09-13
116단란주점2018-07-04킹노래바서울특별시 금천구 금하로 617-2, 지하1층 (시흥동)<NA>휴대폰번호임2023-09-13
117단란주점2018-11-13아리조나7080서울특별시 금천구 금하로 631, 지하1층 (시흥동)<NA>휴대폰번호임2023-09-13