Overview

Dataset statistics

Number of variables6
Number of observations68
Missing cells5
Missing cells (%)1.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory49.9 B

Variable types

Categorical2
Text4

Dataset

Description서울시 양천구 유흥단란주점현황(업종명, 업소명, 소재지 도로명주소, 소재지 지번주소, 소재지 전화번호, 데이터 기준일자 등)에 대한 정보입니다.
URLhttps://www.data.go.kr/data/15038563/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
업종명 is highly imbalanced (67.7%)Imbalance
소재지전화 has 5 (7.4%) missing valuesMissing
소재지 도로명주소 has unique valuesUnique
소재지 지번주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:00:34.120621
Analysis finished2023-12-12 20:00:34.694118
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size676.0 B
단란주점
64 
유흥주점영업
 
4

Length

Max length6
Median length4
Mean length4.1176471
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유흥주점영업
2nd row유흥주점영업
3rd row유흥주점영업
4th row유흥주점영업
5th row단란주점

Common Values

ValueCountFrequency (%)
단란주점 64
94.1%
유흥주점영업 4
 
5.9%

Length

2023-12-13T05:00:34.772715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:00:34.895804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단란주점 64
94.1%
유흥주점영업 4
 
5.9%
Distinct67
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-13T05:00:35.133893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length4.7205882
Min length1

Characters and Unicode

Total characters321
Distinct characters144
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)97.1%

Sample

1st row킹노래바
2nd row터널
3rd row타임노래뱅크
4th row캉캉노래바
5th row라온뮤직타운
ValueCountFrequency (%)
터널 2
 
2.8%
보이스 1
 
1.4%
1
 
1.4%
썸노래주점 1
 
1.4%
카사블랑카 1
 
1.4%
황금단란주점 1
 
1.4%
황실 1
 
1.4%
스카이노래주점 1
 
1.4%
삼삼한청춘 1
 
1.4%
라이브사랑벌 1
 
1.4%
Other values (61) 61
84.7%
2023-12-13T05:00:35.598164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
6.9%
22
 
6.9%
9
 
2.8%
9
 
2.8%
9
 
2.8%
0 8
 
2.5%
8
 
2.5%
8
 
2.5%
6
 
1.9%
6
 
1.9%
Other values (134) 214
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 291
90.7%
Decimal Number 18
 
5.6%
Space Separator 4
 
1.2%
Open Punctuation 2
 
0.6%
Close Punctuation 2
 
0.6%
Lowercase Letter 2
 
0.6%
Uppercase Letter 2
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
7.6%
22
 
7.6%
9
 
3.1%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (122) 186
63.9%
Decimal Number
ValueCountFrequency (%)
0 8
44.4%
8 4
22.2%
7 4
22.2%
2 1
 
5.6%
1 1
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
d 1
50.0%
e 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
O 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 291
90.7%
Common 26
 
8.1%
Latin 4
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
7.6%
22
 
7.6%
9
 
3.1%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (122) 186
63.9%
Common
ValueCountFrequency (%)
0 8
30.8%
4
15.4%
8 4
15.4%
7 4
15.4%
( 2
 
7.7%
) 2
 
7.7%
2 1
 
3.8%
1 1
 
3.8%
Latin
ValueCountFrequency (%)
d 1
25.0%
e 1
25.0%
O 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 291
90.7%
ASCII 30
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
7.6%
22
 
7.6%
9
 
3.1%
9
 
3.1%
9
 
3.1%
8
 
2.7%
8
 
2.7%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (122) 186
63.9%
ASCII
ValueCountFrequency (%)
0 8
26.7%
4
13.3%
8 4
13.3%
7 4
13.3%
( 2
 
6.7%
) 2
 
6.7%
2 1
 
3.3%
1 1
 
3.3%
d 1
 
3.3%
e 1
 
3.3%
Other values (2) 2
 
6.7%
Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-13T05:00:35.921748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length42
Mean length30.926471
Min length27

Characters and Unicode

Total characters2103
Distinct characters64
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row서울특별시 양천구 공항대로 564, 지하 1층 (목동)
2nd row서울특별시 양천구 신월로 292, 지하 1층 (신정동)
3rd row서울특별시 양천구 등촌로 182, 지하 1층 (목동)
4th row서울특별시 양천구 목동서로 213, 세신비젼프라자 3층 306~309호 (목동)
5th row서울특별시 양천구 중앙로 263, 지하 1층 (신정동)
ValueCountFrequency (%)
서울특별시 68
14.2%
양천구 68
14.2%
지하 62
13.0%
1층 62
13.0%
신정동 38
 
7.9%
목동 17
 
3.6%
신월동 13
 
2.7%
중앙로 13
 
2.7%
신월로 10
 
2.1%
오목로 7
 
1.5%
Other values (96) 120
25.1%
2023-12-13T05:00:36.368256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
410
 
19.5%
1 95
 
4.5%
78
 
3.7%
71
 
3.4%
71
 
3.4%
71
 
3.4%
70
 
3.3%
, 69
 
3.3%
68
 
3.2%
68
 
3.2%
Other values (54) 1032
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1193
56.7%
Space Separator 410
 
19.5%
Decimal Number 284
 
13.5%
Other Punctuation 69
 
3.3%
Close Punctuation 68
 
3.2%
Open Punctuation 68
 
3.2%
Dash Punctuation 9
 
0.4%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
6.5%
71
 
6.0%
71
 
6.0%
71
 
6.0%
70
 
5.9%
68
 
5.7%
68
 
5.7%
68
 
5.7%
68
 
5.7%
68
 
5.7%
Other values (38) 492
41.2%
Decimal Number
ValueCountFrequency (%)
1 95
33.5%
2 45
15.8%
3 29
 
10.2%
5 18
 
6.3%
4 17
 
6.0%
0 17
 
6.0%
8 17
 
6.0%
6 17
 
6.0%
9 16
 
5.6%
7 13
 
4.6%
Space Separator
ValueCountFrequency (%)
410
100.0%
Other Punctuation
ValueCountFrequency (%)
, 69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 68
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1193
56.7%
Common 910
43.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
78
 
6.5%
71
 
6.0%
71
 
6.0%
71
 
6.0%
70
 
5.9%
68
 
5.7%
68
 
5.7%
68
 
5.7%
68
 
5.7%
68
 
5.7%
Other values (38) 492
41.2%
Common
ValueCountFrequency (%)
410
45.1%
1 95
 
10.4%
, 69
 
7.6%
) 68
 
7.5%
( 68
 
7.5%
2 45
 
4.9%
3 29
 
3.2%
5 18
 
2.0%
4 17
 
1.9%
0 17
 
1.9%
Other values (6) 74
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1193
56.7%
ASCII 910
43.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
410
45.1%
1 95
 
10.4%
, 69
 
7.6%
) 68
 
7.5%
( 68
 
7.5%
2 45
 
4.9%
3 29
 
3.2%
5 18
 
2.0%
4 17
 
1.9%
0 17
 
1.9%
Other values (6) 74
 
8.1%
Hangul
ValueCountFrequency (%)
78
 
6.5%
71
 
6.0%
71
 
6.0%
71
 
6.0%
70
 
5.9%
68
 
5.7%
68
 
5.7%
68
 
5.7%
68
 
5.7%
68
 
5.7%
Other values (38) 492
41.2%
Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-13T05:00:36.663495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length35
Mean length26.764706
Min length23

Characters and Unicode

Total characters1820
Distinct characters42
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row서울특별시 양천구 목동 602-1 지하 1층
2nd row서울특별시 양천구 신정동 1030-1 지하 1층
3rd row서울특별시 양천구 목동 651-9 지하 1층
4th row서울특별시 양천구 목동 923 세신비젼프라자 3층 306~309호
5th row서울특별시 양천구 신정동 1183-7 지하 1층
ValueCountFrequency (%)
서울특별시 68
16.5%
양천구 68
16.5%
지하 63
15.3%
1층 63
15.3%
신정동 38
9.2%
목동 17
 
4.1%
신월동 13
 
3.2%
2층 3
 
0.7%
972-1 2
 
0.5%
3층 2
 
0.5%
Other values (74) 74
18.0%
2023-12-13T05:00:37.160460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
407
22.4%
1 142
 
7.8%
70
 
3.8%
68
 
3.7%
68
 
3.7%
68
 
3.7%
68
 
3.7%
68
 
3.7%
68
 
3.7%
68
 
3.7%
Other values (32) 725
39.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 949
52.1%
Space Separator 407
22.4%
Decimal Number 395
21.7%
Dash Punctuation 67
 
3.7%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
 
7.4%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
Other values (19) 267
28.1%
Decimal Number
ValueCountFrequency (%)
1 142
35.9%
2 42
 
10.6%
3 38
 
9.6%
9 38
 
9.6%
0 34
 
8.6%
8 22
 
5.6%
6 21
 
5.3%
7 20
 
5.1%
4 19
 
4.8%
5 19
 
4.8%
Space Separator
ValueCountFrequency (%)
407
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 67
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 949
52.1%
Common 871
47.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
 
7.4%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
Other values (19) 267
28.1%
Common
ValueCountFrequency (%)
407
46.7%
1 142
 
16.3%
- 67
 
7.7%
2 42
 
4.8%
3 38
 
4.4%
9 38
 
4.4%
0 34
 
3.9%
8 22
 
2.5%
6 21
 
2.4%
7 20
 
2.3%
Other values (3) 40
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 949
52.1%
ASCII 871
47.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
407
46.7%
1 142
 
16.3%
- 67
 
7.7%
2 42
 
4.8%
3 38
 
4.4%
9 38
 
4.4%
0 34
 
3.9%
8 22
 
2.5%
6 21
 
2.4%
7 20
 
2.3%
Other values (3) 40
 
4.6%
Hangul
ValueCountFrequency (%)
70
 
7.4%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
68
 
7.2%
Other values (19) 267
28.1%

소재지전화
Text

MISSING 

Distinct63
Distinct (%)100.0%
Missing5
Missing (%)7.4%
Memory size676.0 B
2023-12-13T05:00:37.567925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.809524
Min length13

Characters and Unicode

Total characters870
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)100.0%

Sample

1st row 02-2652-8983
2nd row 02-2651-4028
3rd row 02-2651-4208
4th row 02-2653-2991
5th row 02-2695-8915
ValueCountFrequency (%)
02-2652-8983 1
 
1.6%
02-2601-4457 1
 
1.6%
02-2649-1215 1
 
1.6%
02-2651-4028 1
 
1.6%
02-2643-0553 1
 
1.6%
02-602-2035 1
 
1.6%
02-887-0839 1
 
1.6%
02-644-2445 1
 
1.6%
02-606-2606 1
 
1.6%
02-2645-1186 1
 
1.6%
Other values (54) 54
84.4%
2023-12-13T05:00:38.050756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 150
17.2%
127
14.6%
- 126
14.5%
0 103
11.8%
6 99
11.4%
9 53
 
6.1%
4 49
 
5.6%
5 47
 
5.4%
3 31
 
3.6%
8 30
 
3.4%
Other values (2) 55
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 617
70.9%
Space Separator 127
 
14.6%
Dash Punctuation 126
 
14.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 150
24.3%
0 103
16.7%
6 99
16.0%
9 53
 
8.6%
4 49
 
7.9%
5 47
 
7.6%
3 31
 
5.0%
8 30
 
4.9%
1 30
 
4.9%
7 25
 
4.1%
Space Separator
ValueCountFrequency (%)
127
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 126
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 870
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 150
17.2%
127
14.6%
- 126
14.5%
0 103
11.8%
6 99
11.4%
9 53
 
6.1%
4 49
 
5.6%
5 47
 
5.4%
3 31
 
3.6%
8 30
 
3.4%
Other values (2) 55
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 870
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 150
17.2%
127
14.6%
- 126
14.5%
0 103
11.8%
6 99
11.4%
9 53
 
6.1%
4 49
 
5.6%
5 47
 
5.4%
3 31
 
3.6%
8 30
 
3.4%
Other values (2) 55
 
6.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-08-10
68 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-10
2nd row2023-08-10
3rd row2023-08-10
4th row2023-08-10
5th row2023-08-10

Common Values

ValueCountFrequency (%)
2023-08-10 68
100.0%

Length

2023-12-13T05:00:38.245082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:00:38.376881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-10 68
100.0%

Correlations

2023-12-13T05:00:38.458765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명업소명소재지 도로명주소소재지 지번주소소재지전화
업종명1.0000.0001.0001.0001.000
업소명0.0001.0001.0001.0001.000
소재지 도로명주소1.0001.0001.0001.0001.000
소재지 지번주소1.0001.0001.0001.0001.000
소재지전화1.0001.0001.0001.0001.000

Missing values

2023-12-13T05:00:34.530887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:00:34.645291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지 도로명주소소재지 지번주소소재지전화데이터기준일자
0유흥주점영업킹노래바서울특별시 양천구 공항대로 564, 지하 1층 (목동)서울특별시 양천구 목동 602-1 지하 1층02-2652-89832023-08-10
1유흥주점영업터널서울특별시 양천구 신월로 292, 지하 1층 (신정동)서울특별시 양천구 신정동 1030-1 지하 1층02-2651-40282023-08-10
2유흥주점영업타임노래뱅크서울특별시 양천구 등촌로 182, 지하 1층 (목동)서울특별시 양천구 목동 651-9 지하 1층02-2651-42082023-08-10
3유흥주점영업캉캉노래바서울특별시 양천구 목동서로 213, 세신비젼프라자 3층 306~309호 (목동)서울특별시 양천구 목동 923 세신비젼프라자 3층 306~309호02-2653-29912023-08-10
4단란주점라온뮤직타운서울특별시 양천구 중앙로 263, 지하 1층 (신정동)서울특별시 양천구 신정동 1183-7 지하 1층02-2695-89152023-08-10
5단란주점가야서울특별시 양천구 신목로 94, 지하 1층 (목동)서울특별시 양천구 목동 404-115 지하 1층02-2646-22912023-08-10
6단란주점세인서울특별시 양천구 신월로 294, 지하 1층 (신정동)서울특별시 양천구 신정동 1030-2 지하 1층02-2645-88202023-08-10
7단란주점스마일서울특별시 양천구 신목로 78-1, 지하 1층 (신정동)서울특별시 양천구 신정동 117-36 지하 1층02-2647-73362023-08-10
8단란주점엠에스서울특별시 양천구 목동중앙서로7가길 39, 지하 1층 (목동)서울특별시 양천구 목동 792-1 지하 1층02-2654-21542023-08-10
9단란주점호박서울특별시 양천구 남부순환로 357-1, 지하 1층 (신월동)서울특별시 양천구 신월동 145-2 지하 1층02-693-15022023-08-10
업종명업소명소재지 도로명주소소재지 지번주소소재지전화데이터기준일자
58단란주점승리노래광장서울특별시 양천구 중앙로 272, 지하 1층 (신정동)서울특별시 양천구 신정동 1031-3 지하 1층<NA>2023-08-10
59단란주점타임노래광장서울특별시 양천구 신월로 282, 지하 1층 (신정동)서울특별시 양천구 신정동 1190-2 지하 1층02-2690-31402023-08-10
60단란주점퓨전노래주점서울특별시 양천구 중앙로 288, 2층 (신정동)서울특별시 양천구 신정동 972-1 2층02-2692-77122023-08-10
61단란주점아주노래광장서울특별시 양천구 중앙로34길 30, 2층 (신정동)서울특별시 양천구 신정동 1030-6 2층02-2649-99982023-08-10
62단란주점강남노래바서울특별시 양천구 중앙로 264, 지하 1층 (신정동)서울특별시 양천구 신정동 1031-8 지하 1층02-2646-88432023-08-10
63단란주점라이브7080 1박2일서울특별시 양천구 중앙로 284, 동성빌딩 지하 1층 (신정동)서울특별시 양천구 신정동 972-3 동성빌딩 지하 1층<NA>2023-08-10
64단란주점오페라노래바서울특별시 양천구 중앙로 257, 지하 1층 (신정동)서울특별시 양천구 신정동 1183-11 지하 1층02-2695-82592023-08-10
65단란주점라이브사랑벌서울특별시 양천구 중앙로 269, 지하 1층 (신정동)서울특별시 양천구 신정동 1190-6 지하 1층<NA>2023-08-10
66단란주점터널서울특별시 양천구 중앙로 261, 2층 (신정동)서울특별시 양천구 신정동 1183-8 2층02-2698-54542023-08-10
67단란주점보고싶다서울특별시 양천구 중앙로 247, 지하 1층 (신정동)서울특별시 양천구 신정동 1182-7 지하 1층<NA>2023-08-10