Overview

Dataset statistics

Number of variables: 9
Number of observations: 74
Missing cells: 8
Missing cells (%): 1.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.4 KiB
Average record size in memory: 74.8 B
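
The missing-cell percentage can be checked from the other figures in this block. The sketch below assumes the percentage is taken over all cells (variables times observations); the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
n_variables = 9
n_observations = 74
missing_cells = 8

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result rounds to the 1.2% shown in the report.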

Variable types

Categorical: 2
DateTime: 3
Text: 3
Numeric: 1

Dataset

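Description: Provides status information on entertainment bars (유흥주점) and karaoke bars (단란주점) among the food sanitation businesses in Yeoju-si, Gyeonggi-do: business type (업종명), license date (허가일자), establishment name (업소명), address (소재지), business floor area (영업장면적), telephone number (전화번호), operator start date (영업자시작일자), location start date (소재지시작일자), and business category (업태명).
Author: Yeoju-si, Gyeonggi-do (경기도 여주시)
URL: https://www.data.go.kr/data/15038680/fileData.do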

Alerts

업태명 is highly overall correlated with 업종명 (High correlation)
업종명 is highly overall correlated with 업태명 (High correlation)
소재지전화 has 8 (10.8%) missing values (Missing)
소재지시작일 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 07:42:01.416711
Analysis finished: 2023-12-12 07:42:02.774458
Duration: 1.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Value | Count
유흥주점영업 | 39
단란주점 | 35

Length

Max length: 6
Median length: 6
Mean length: 5.0540541
Min length: 4
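
The mean length follows from the two category counts listed for this variable. The sketch below weights each value's character count by its frequency; it is a check of the arithmetic, not the profiler's own code, and the names are illustrative.

```python
# Value counts copied from the 업종명 summary above.
counts = {"유흥주점영업": 39, "단란주점": 35}

n = sum(counts.values())  # 74 observations, none missing

# Share of each category, as a percentage of all observations.
freq_pct = {value: 100 * c / n for value, c in counts.items()}

# Mean length: character count of each value weighted by its frequency.
mean_length = sum(len(value) * c for value, c in counts.items()) / n

print(round(mean_length, 7))
for value, pct in freq_pct.items():
    print(value, f"{pct:.1f}%")
```

The weighted mean is 374 / 74, which matches the 5.0540541 reported, and the shares round to 52.7% and 47.3%.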

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유흥주점영업
2nd row: 유흥주점영업
3rd row: 유흥주점영업
4th row: 유흥주점영업
5th row: 유흥주점영업

Common Values

Value | Count | Frequency (%)
유흥주점영업 | 39 | 52.7%
단란주점 | 35 | 47.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

인허가일자
Date

Distinct: 73
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B
Minimum: 1977-03-11 00:00:00
Maximum: 2020-12-07 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

업소명
Text

Distinct: 73
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 18
Median length: 8
Mean length: 5.1486486
Min length: 2

Characters and Unicode

Total characters: 381
Distinct characters: 166
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 72
Unique (%): 97.3%

Sample

1st row: 노블레스 가요빠
2nd row: 쿵쿵따노래빵
3rd row: 캣츠
4th row: 벌떼노래빠
5th row: 금란주점

Value | Count | Frequency (%)
단란주점 | 4 | 4.5%
에이스 | 2 | 2.3%
유흥주점 | 2 | 2.3%
세시봉노래빠 | 1 | 1.1%
카니발단란주점 | 1 | 1.1%
모카단란주점 | 1 | 1.1%
홀인원단란주점 | 1 | 1.1%
올레노래빠 | 1 | 1.1%
쿨노래빠 | 1 | 1.1%
여우꽃 | 1 | 1.1%
Other values (73) | 73 | 83.0%

Most occurring characters

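Value | Count | Frequency (%)
(not shown) | 25 | 6.6%
(not shown) | 23 | 6.0%
(not shown) | 20 | 5.2%
(not shown) | 19 | 5.0%
(not shown) | 16 | 4.2%
(not shown) | 15 | 3.9%
(space) | 14 | 3.7%
(not shown) | 11 | 2.9%
(not shown) | 8 | 2.1%
(not shown) | 5 | 1.3%
Other values (156) | 225 | 59.1%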

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 334 | 87.7%
Uppercase Letter | 16 | 4.2%
Space Separator | 14 | 3.7%
Lowercase Letter | 9 | 2.4%
Close Punctuation | 3 | 0.8%
Open Punctuation | 3 | 0.8%
Other Punctuation | 2 | 0.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 25 | 7.5%
(not shown) | 23 | 6.9%
(not shown) | 20 | 6.0%
(not shown) | 19 | 5.7%
(not shown) | 16 | 4.8%
(not shown) | 15 | 4.5%
(not shown) | 11 | 3.3%
(not shown) | 8 | 2.4%
(not shown) | 5 | 1.5%
(not shown) | 5 | 1.5%
Other values (133) | 187 | 56.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 2 | 12.5%
W | 2 | 12.5%
M | 2 | 12.5%
S | 2 | 12.5%
U | 2 | 12.5%
E | 1 | 6.2%
O | 1 | 6.2%
H | 1 | 6.2%
R | 1 | 6.2%
A | 1 | 6.2%

Lowercase Letter
Value | Count | Frequency (%)
a | 2 | 22.2%
e | 1 | 11.1%
u | 1 | 11.1%
c | 1 | 11.1%
i | 1 | 11.1%
r | 1 | 11.1%
f | 1 | 11.1%
s | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 14 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 334 | 87.7%
Latin | 25 | 6.6%
Common | 22 | 5.8%

Most frequent character per script

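Hangul
Value | Count | Frequency (%)
(not shown) | 25 | 7.5%
(not shown) | 23 | 6.9%
(not shown) | 20 | 6.0%
(not shown) | 19 | 5.7%
(not shown) | 16 | 4.8%
(not shown) | 15 | 4.5%
(not shown) | 11 | 3.3%
(not shown) | 8 | 2.4%
(not shown) | 5 | 1.5%
(not shown) | 5 | 1.5%
Other values (133) | 187 | 56.0%

Latin
Value | Count | Frequency (%)
a | 2 | 8.0%
B | 2 | 8.0%
W | 2 | 8.0%
M | 2 | 8.0%
S | 2 | 8.0%
U | 2 | 8.0%
E | 1 | 4.0%
O | 1 | 4.0%
H | 1 | 4.0%
R | 1 | 4.0%
Other values (9) | 9 | 36.0%

Common
Value | Count | Frequency (%)
(space) | 14 | 63.6%
) | 3 | 13.6%
( | 3 | 13.6%
. | 2 | 9.1%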

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 334 | 87.7%
ASCII | 47 | 12.3%

Most frequent character per block

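Hangul
Value | Count | Frequency (%)
(not shown) | 25 | 7.5%
(not shown) | 23 | 6.9%
(not shown) | 20 | 6.0%
(not shown) | 19 | 5.7%
(not shown) | 16 | 4.8%
(not shown) | 15 | 4.5%
(not shown) | 11 | 3.3%
(not shown) | 8 | 2.4%
(not shown) | 5 | 1.5%
(not shown) | 5 | 1.5%
Other values (133) | 187 | 56.0%

ASCII
Value | Count | Frequency (%)
(space) | 14 | 29.8%
) | 3 | 6.4%
( | 3 | 6.4%
a | 2 | 4.3%
B | 2 | 4.3%
W | 2 | 4.3%
M | 2 | 4.3%
. | 2 | 4.3%
S | 2 | 4.3%
U | 2 | 4.3%
Other values (13) | 13 | 27.7%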

소재지
Text

Distinct: 67
Distinct (%): 90.5%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 27
Median length: 24
Mean length: 18.135135
Min length: 14

Characters and Unicode

Total characters: 1342
Distinct characters: 68
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
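
As an illustration of those character properties, the sketch below counts the Unicode general category of each character in one address from this column, using Python's standard `unicodedata` module. It is not the profiler's implementation; the variable names are illustrative.

```python
import unicodedata
from collections import Counter

# One address value from this column, copied from the report's sample rows.
address = "경기도 여주시 여흥로 115, 2층 (홍문동)"

# General category of every character, e.g. "Lo" (Other Letter),
# "Zs" (Space Separator), "Nd" (Decimal Number), "Po" (Other Punctuation),
# "Ps" (Open Punctuation), "Pe" (Close Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in address)

for category, count in category_counts.most_common():
    print(category, count)
```

Summing such counts over all 74 values would give a per-category breakdown of the kind the report tabulates for this column.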

Unique

Unique: 61
Unique (%): 82.4%

Sample

1st row: 경기도 여주시 여흥로 115, 2층 (홍문동)
2nd row: 경기도 여주시 세종로 19-1
3rd row: 경기도 여주시 세종로 38
4th row: 경기도 여주시 가남읍 태평중앙1길 3
5th row: 경기도 여주시 북내면 여양2로 271

Value | Count | Frequency (%)
경기도 | 74 | 21.7%
여주시 | 74 | 21.7%
가남읍 | 19 | 5.6%
여흥로 | 14 | 4.1%
세종로 | 10 | 2.9%
태평로 | 10 | 2.9%
태평중앙1길 | 8 | 2.3%
강변로 | 6 | 1.8%
홍문동 | 5 | 1.5%
우암로 | 4 | 1.2%
Other values (88) | 117 | 34.3%

Most occurring characters

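Value | Count | Frequency (%)
(space) | 268 | 20.0%
(not shown) | 97 | 7.2%
(not shown) | 74 | 5.5%
(not shown) | 74 | 5.5%
(not shown) | 74 | 5.5%
(not shown) | 74 | 5.5%
(not shown) | 74 | 5.5%
1 | 69 | 5.1%
(not shown) | 64 | 4.8%
2 | 30 | 2.2%
Other values (58) | 444 | 33.1%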

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 814 | 60.7%
Space Separator | 268 | 20.0%
Decimal Number | 217 | 16.2%
Dash Punctuation | 18 | 1.3%
Open Punctuation | 9 | 0.7%
Close Punctuation | 9 | 0.7%
Other Punctuation | 6 | 0.4%
Uppercase Letter | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 97 | 11.9%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 64 | 7.9%
(not shown) | 19 | 2.3%
(not shown) | 19 | 2.3%
(not shown) | 19 | 2.3%
Other values (42) | 226 | 27.8%

Decimal Number
Value | Count | Frequency (%)
1 | 69 | 31.8%
2 | 30 | 13.8%
8 | 22 | 10.1%
3 | 20 | 9.2%
6 | 19 | 8.8%
4 | 14 | 6.5%
5 | 13 | 6.0%
9 | 11 | 5.1%
0 | 10 | 4.6%
7 | 9 | 4.1%

Space Separator
Value | Count | Frequency (%)
(space) | 268 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 18 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 9 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 9 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 6 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 814 | 60.7%
Common | 527 | 39.3%
Latin | 1 | 0.1%

Most frequent character per script

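Hangul
Value | Count | Frequency (%)
(not shown) | 97 | 11.9%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 64 | 7.9%
(not shown) | 19 | 2.3%
(not shown) | 19 | 2.3%
(not shown) | 19 | 2.3%
Other values (42) | 226 | 27.8%

Common
Value | Count | Frequency (%)
(space) | 268 | 50.9%
1 | 69 | 13.1%
2 | 30 | 5.7%
8 | 22 | 4.2%
3 | 20 | 3.8%
6 | 19 | 3.6%
- | 18 | 3.4%
4 | 14 | 2.7%
5 | 13 | 2.5%
9 | 11 | 2.1%
Other values (5) | 43 | 8.2%

Latin
Value | Count | Frequency (%)
B | 1 | 100.0%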

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 814 | 60.7%
ASCII | 528 | 39.3%

Most frequent character per block

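ASCII
Value | Count | Frequency (%)
(space) | 268 | 50.8%
1 | 69 | 13.1%
2 | 30 | 5.7%
8 | 22 | 4.2%
3 | 20 | 3.8%
6 | 19 | 3.6%
- | 18 | 3.4%
4 | 14 | 2.7%
5 | 13 | 2.5%
9 | 11 | 2.1%
Other values (6) | 44 | 8.3%

Hangul
Value | Count | Frequency (%)
(not shown) | 97 | 11.9%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 74 | 9.1%
(not shown) | 64 | 7.9%
(not shown) | 19 | 2.3%
(not shown) | 19 | 2.3%
(not shown) | 19 | 2.3%
Other values (42) | 226 | 27.8%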

영업장면적
Real number (ℝ)

Distinct: 73
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 105.36649
Minimum: 42
Maximum: 218.68
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 798.0 B

Quantile statistics

Minimum: 42
5-th percentile: 57.095
Q1: 80.635
Median: 102.945
Q3: 126.7425
95-th percentile: 163.0325
Maximum: 218.68
Range: 176.68
Interquartile range (IQR): 46.1075
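
The last two rows are derived from the others: the range is the maximum minus the minimum, and the IQR is Q3 minus Q1. A minimal check with the reported figures (variable names are illustrative):

```python
# Quantile statistics copied from the report.
minimum, maximum = 42.0, 218.68
q1, q3 = 80.635, 126.7425

value_range = maximum - minimum  # spread of the whole column
iqr = q3 - q1                    # spread of the middle 50% of values

print(round(value_range, 2), round(iqr, 4))
```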

Descriptive statistics

Standard deviation: 35.256317
Coefficient of variation (CV): 0.33460655
Kurtosis: 1.4703243
Mean: 105.36649
Median Absolute Deviation (MAD): 23.06
Skewness: 0.93386791
Sum: 7797.12
Variance: 1243.0079
Monotonicity: Not monotonic
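
Several of these figures are linked by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, and the CV is the standard deviation divided by the mean. The sketch below re-derives them from the rounded values printed in the report, so agreement is only expected up to rounding.

```python
# Figures copied from the report (already rounded there).
n_observations = 74
total = 7797.12
std = 35.256317

mean = total / n_observations  # arithmetic mean
variance = std ** 2            # variance is the squared standard deviation
cv = std / mean                # coefficient of variation

print(round(mean, 5), round(variance, 4), round(cv, 6))
```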

[Figure: Histogram with fixed size bins (bins=50)]

Common values
Value | Count | Frequency (%)
87.24 | 2 | 2.7%
71.28 | 1 | 1.4%
105.99 | 1 | 1.4%
133.51 | 1 | 1.4%
87.29 | 1 | 1.4%
60.25 | 1 | 1.4%
102.99 | 1 | 1.4%
51.56 | 1 | 1.4%
92.26 | 1 | 1.4%
99.32 | 1 | 1.4%
Other values (63) | 63 | 85.1%

Minimum 10 values
Value | Count | Frequency (%)
42.0 | 1 | 1.4%
49.2 | 1 | 1.4%
51.56 | 1 | 1.4%
53.26 | 1 | 1.4%
59.16 | 1 | 1.4%
60.25 | 1 | 1.4%
62.1 | 1 | 1.4%
67.0 | 1 | 1.4%
68.1 | 1 | 1.4%
68.95 | 1 | 1.4%

Maximum 10 values
Value | Count | Frequency (%)
218.68 | 1 | 1.4%
208.72 | 1 | 1.4%
195.3 | 1 | 1.4%
188.48 | 1 | 1.4%
149.33 | 1 | 1.4%
143.96 | 1 | 1.4%
143.02 | 1 | 1.4%
142.16 | 1 | 1.4%
140.13 | 1 | 1.4%
139.41 | 1 | 1.4%

소재지전화
Text

MISSING 

Distinct: 65
Distinct (%): 98.5%
Missing: 8
Missing (%): 10.8%
Memory size: 724.0 B
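
The two percentages above use different denominators. The figures are consistent with the missing share being taken over all 74 observations and the distinct share over the non-missing values only; that reading is an assumption inferred from the numbers, not something the report states. A short check:

```python
# Figures copied from the report.
n_observations = 74
missing = 8
distinct = 65

present = n_observations - missing          # values that are actually filled in
missing_pct = 100 * missing / n_observations
distinct_pct = 100 * distinct / present     # assumption: computed over non-missing values

# Every phone number in the report is 12 characters long, so the
# non-missing values account for this many characters in total.
total_characters = present * 12

print(present, round(missing_pct, 1), round(distinct_pct, 1), total_characters)
```

The character total of 792 agrees with the figure reported for this column.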

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 792
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 64
Unique (%): 97.0%

Sample

1st row: 031-885-3850
2nd row: 031-886-4271
3rd row: 031-882-3407
4th row: 031-884-5523
5th row: 031-885-0855

Value | Count | Frequency (%)
031-881-0003 | 2 | 3.0%
031-883-8954 | 1 | 1.5%
031-883-5776 | 1 | 1.5%
031-886-6996 | 1 | 1.5%
031-885-8507 | 1 | 1.5%
031-883-8359 | 1 | 1.5%
031-885-5504 | 1 | 1.5%
031-632-5828 | 1 | 1.5%
031-886-1177 | 1 | 1.5%
031-881-2133 | 1 | 1.5%
Other values (55) | 55 | 83.3%

Most occurring characters

Value | Count | Frequency (%)
8 | 160 | 20.2%
- | 132 | 16.7%
0 | 106 | 13.4%
1 | 104 | 13.1%
3 | 102 | 12.9%
5 | 52 | 6.6%
2 | 31 | 3.9%
6 | 30 | 3.8%
7 | 28 | 3.5%
4 | 24 | 3.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 660 | 83.3%
Dash Punctuation | 132 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
8 | 160 | 24.2%
0 | 106 | 16.1%
1 | 104 | 15.8%
3 | 102 | 15.5%
5 | 52 | 7.9%
2 | 31 | 4.7%
6 | 30 | 4.5%
7 | 28 | 4.2%
4 | 24 | 3.6%
9 | 23 | 3.5%

Dash Punctuation
Value | Count | Frequency (%)
- | 132 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 792 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
8 | 160 | 20.2%
- | 132 | 16.7%
0 | 106 | 13.4%
1 | 104 | 13.1%
3 | 102 | 12.9%
5 | 52 | 6.6%
2 | 31 | 3.9%
6 | 30 | 3.8%
7 | 28 | 3.5%
4 | 24 | 3.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 792 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
8 | 160 | 20.2%
- | 132 | 16.7%
0 | 106 | 13.4%
1 | 104 | 13.1%
3 | 102 | 12.9%
5 | 52 | 6.6%
2 | 31 | 3.9%
6 | 30 | 3.8%
7 | 28 | 3.5%
4 | 24 | 3.0%

영업자시작일
Date

Distinct: 72
Distinct (%): 97.3%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B
Minimum: 1994-10-22 00:00:00
Maximum: 2021-07-07 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

소재지시작일
Date

UNIQUE 

Distinct: 74
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B
Minimum: 1977-03-11 00:00:00
Maximum: 2020-12-07 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

업태명
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 6.8%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Value | Count
룸살롱 | 35
단란주점 | 35
카바레 | 2
간이주점 | 1
기타 | 1

Length

Max length: 4
Median length: 3.5
Mean length: 3.472973
Min length: 2

Unique

Unique: 2
Unique (%): 2.7%

Sample

1st row: 룸살롱
2nd row: 룸살롱
3rd row: 룸살롱
4th row: 룸살롱
5th row: 카바레

Common Values

Value | Count | Frequency (%)
룸살롱 | 35 | 47.3%
단란주점 | 35 | 47.3%
카바레 | 2 | 2.7%
간이주점 | 1 | 1.4%
기타 | 1 | 1.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Interactions

[Figure: Interactions]

Correlations

Variable | 업종명 | 인허가일자 | 업소명 | 소재지 | 영업장면적 | 소재지전화 | 영업자시작일 | 소재지시작일 | 업태명
업종명 | 1.000 | 1.000 | 1.000 | 0.947 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
인허가일자 | 1.000 | 1.000 | 0.998 | 0.993 | 0.982 | 0.998 | 0.995 | 1.000 | 1.000
업소명 | 1.000 | 0.998 | 1.000 | 0.993 | 0.973 | 0.998 | 0.995 | 1.000 | 0.000
소재지 | 0.947 | 0.993 | 0.993 | 1.000 | 0.882 | 1.000 | 0.997 | 1.000 | 0.842
영업장면적 | 0.000 | 0.982 | 0.973 | 0.882 | 1.000 | 0.903 | 0.892 | 1.000 | 0.000
소재지전화 | 1.000 | 0.998 | 0.998 | 1.000 | 0.903 | 1.000 | 1.000 | 1.000 | 1.000
영업자시작일 | 1.000 | 0.995 | 0.995 | 0.997 | 0.892 | 1.000 | 1.000 | 1.000 | 1.000
소재지시작일 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
업태명 | 1.000 | 1.000 | 0.000 | 0.842 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 업태명 | 업종명
업태명 | 1.000 | 0.979
업종명 | 0.979 | 1.000

Variable | 영업장면적 | 업종명 | 업태명
영업장면적 | 1.000 | 0.000 | 0.000
업종명 | 0.000 | 1.000 | 0.979
업태명 | 0.000 | 0.979 | 1.000
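
The 0.979 between 업종명 and 업태명 is consistent with a bias-corrected Cramér's V, a common association measure for two categorical columns. The report does not name the statistic, so that is an assumption, as is the cross-tabulation below: it is inferred from the value counts and sample rows (all 단란주점 businesses falling in the 단란주점 category, everything else under 유흥주점영업) rather than printed in the report. The function name is hypothetical.

```python
import math

# Assumed cross-tabulation of 업종명 (rows) by 업태명 (columns), inferred
# from the report's value counts; it is not printed in the report itself.
# Rows:    유흥주점영업, 단란주점
# Columns: 룸살롱, 카바레, 간이주점, 기타, 단란주점
table = [
    [35, 2, 1, 1, 0],
    [0, 0, 0, 0, 35],
]


def corrected_cramers_v(table):
    """Bias-corrected Cramér's V for a contingency table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-square statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Bias correction: shrink phi-squared and the table dimensions.
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(r_corr - 1, k_corr - 1))


v = corrected_cramers_v(table)
print(round(v, 3))
```

With this table the two columns are perfectly associated, and the value falls just short of 1 only because of the small-sample correction.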

Missing values

[Figure: A simple visualization of nullity by column.]

[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

Index | 업종명 | 인허가일자 | 업소명 | 소재지 | 영업장면적 | 소재지전화 | 영업자시작일 | 소재지시작일 | 업태명
0 | 유흥주점영업 | 1984-04-03 | 노블레스 가요빠 | 경기도 여주시 여흥로 115, 2층 (홍문동) | 71.28 | <NA> | 2019-09-25 | 1984-04-03 | 룸살롱
1 | 유흥주점영업 | 1987-10-28 | 쿵쿵따노래빵 | 경기도 여주시 세종로 19-1 | 79.59 | 031-885-3850 | 2019-12-05 | 1987-10-28 | 룸살롱
2 | 유흥주점영업 | 1989-11-04 | 캣츠 | 경기도 여주시 세종로 38 | 59.16 | 031-886-4271 | 2018-10-08 | 1989-11-04 | 룸살롱
3 | 유흥주점영업 | 1989-09-23 | 벌떼노래빠 | 경기도 여주시 가남읍 태평중앙1길 3 | 77.76 | 031-882-3407 | 2021-07-07 | 1989-09-23 | 룸살롱
4 | 유흥주점영업 | 1989-01-04 | 금란주점 | 경기도 여주시 북내면 여양2로 271 | 110.41 | <NA> | 1998-07-31 | 1989-01-04 | 카바레
5 | 유흥주점영업 | 1989-12-27 | 벅시 | 경기도 여주시 여양로 210-6 | 208.72 | 031-884-5523 | 2021-01-12 | 1989-12-27 | 룸살롱
6 | 유흥주점영업 | 1977-03-11 | 에오스 | 경기도 여주시 여흥로 111-1 | 67.0 | 031-885-0855 | 2010-09-17 | 1977-03-11 | 룸살롱
7 | 유흥주점영업 | 1990-09-05 | 미희가요주점 | 경기도 여주시 세종로14번길 5 | 42.0 | 031-883-5959 | 2013-04-16 | 1990-09-05 | 룸살롱
8 | 유흥주점영업 | 1981-02-20 | 황후노래빠 | 경기도 여주시 가남읍 태평로 28 | 62.1 | 031-883-8878 | 2017-10-10 | 1981-02-20 | 간이주점
9 | 유흥주점영업 | 1994-06-21 | 비타민 | 경기도 여주시 가남읍 태평중앙1길 4 | 82.8 | 031-882-6985 | 2009-02-03 | 1994-06-21 | 룸살롱

Last rows

Index | 업종명 | 인허가일자 | 업소명 | 소재지 | 영업장면적 | 소재지전화 | 영업자시작일 | 소재지시작일 | 업태명
64 | 단란주점 | 1998-07-27 | BMW단란주점 | 경기도 여주시 여흥로 129 | 143.02 | 031-885-7578 | 2017-02-22 | 1998-07-27 | 단란주점
65 | 단란주점 | 1998-09-04 | 예목단란주점 | 경기도 여주시 금사면 이여로 1366 | 107.8 | 031-884-6900 | 2004-03-02 | 1998-09-04 | 단란주점
66 | 단란주점 | 1995-08-07 | 둥지 단란주점 | 경기도 여주시 청심로 175-37 | 75.66 | 031-884-2362 | 2017-11-22 | 1995-08-07 | 단란주점
67 | 단란주점 | 2006-04-26 | 천사노래주점 | 경기도 여주시 대신면 여양로 1980, 2층 | 68.95 | 031-883-7560 | 2014-08-25 | 2006-04-26 | 단란주점
68 | 단란주점 | 2009-09-10 | 와와와단란주점 | 경기도 여주시 가남읍 태평중앙1길 1 | 133.66 | 031-882-1855 | 2018-06-01 | 2009-09-10 | 단란주점
69 | 단란주점 | 2009-09-24 | 노래하는 바우 | 경기도 여주시 가남읍 태평중앙1길 3 | 123.45 | <NA> | 2019-10-28 | 2009-09-24 | 단란주점
70 | 단란주점 | 2009-10-23 | 토마토 노래팡 | 경기도 여주시 가남읍 태평중앙1길 2 | 139.41 | 031-884-9182 | 2020-11-30 | 2009-10-23 | 단란주점
71 | 단란주점 | 2010-09-20 | 피닉스 술래방 | 경기도 여주시 가남읍 태평로 23-8 | 116.16 | 031-881-1708 | 2017-05-30 | 2010-09-20 | 단란주점
72 | 단란주점 | 2012-12-10 | 얄개가요주점 | 경기도 여주시 금사면 이여로 1354 | 94.99 | 031-884-1301 | 2013-09-02 | 2012-12-10 | 단란주점
73 | 단란주점 | 2020-11-27 | 고향역 | 경기도 여주시 가남읍 태평로 15, B1층 | 85.15 | <NA> | 2020-11-27 | 2020-11-27 | 단란주점