Overview

Dataset statistics

Number of variables5
Number of observations48
Missing cells13
Missing cells (%)5.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory43.8 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description경상남도 하동군에 있는 유흥단란주점 현황 (연번, 업종명, 업체명, 소재지(도로명). 전화번호)의 정보를 제공하고 있습니다.
Author경상남도 하동군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15086268

Alerts

연번 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번High correlation
전화번호 has 13 (27.1%) missing valuesMissing
연번 has unique valuesUnique
업소명 has unique valuesUnique
소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:45:30.995589
Analysis finished2023-12-10 23:45:31.483869
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.5
Minimum1
Maximum48
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2023-12-11T08:45:31.571110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.35
Q112.75
median24.5
Q336.25
95-th percentile45.65
Maximum48
Range47
Interquartile range (IQR)23.5

Descriptive statistics

Standard deviation14
Coefficient of variation (CV)0.57142857
Kurtosis-1.2
Mean24.5
Median Absolute Deviation (MAD)12
Skewness0
Sum1176
Variance196
MonotonicityStrictly increasing
2023-12-11T08:45:31.725336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
1 1
 
2.1%
26 1
 
2.1%
28 1
 
2.1%
29 1
 
2.1%
30 1
 
2.1%
31 1
 
2.1%
32 1
 
2.1%
33 1
 
2.1%
34 1
 
2.1%
35 1
 
2.1%
Other values (38) 38
79.2%
ValueCountFrequency (%)
1 1
2.1%
2 1
2.1%
3 1
2.1%
4 1
2.1%
5 1
2.1%
6 1
2.1%
7 1
2.1%
8 1
2.1%
9 1
2.1%
10 1
2.1%
ValueCountFrequency (%)
48 1
2.1%
47 1
2.1%
46 1
2.1%
45 1
2.1%
44 1
2.1%
43 1
2.1%
42 1
2.1%
41 1
2.1%
40 1
2.1%
39 1
2.1%

업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size516.0 B
유흥주점영업
40 
단란주점

Length

Max length6
Median length6
Mean length5.6666667
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유흥주점영업
2nd row유흥주점영업
3rd row유흥주점영업
4th row유흥주점영업
5th row유흥주점영업

Common Values

ValueCountFrequency (%)
유흥주점영업 40
83.3%
단란주점 8
 
16.7%

Length

2023-12-11T08:45:31.905933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:45:32.005027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유흥주점영업 40
83.3%
단란주점 8
 
16.7%

업소명
Text

UNIQUE 

Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2023-12-11T08:45:32.244122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9
Mean length5.125
Min length1

Characters and Unicode

Total characters246
Distinct characters107
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row고전가요방
2nd row홍당무
3rd row태경가요방
4th row월드부일룸 가요방
5th row백야가요방
ValueCountFrequency (%)
고전가요방 1
 
1.8%
쌈바쌈바 1
 
1.8%
비체음악홀 1
 
1.8%
경일노래주점 1
 
1.8%
앵콜 1
 
1.8%
1
 
1.8%
싸이키가요방 1
 
1.8%
수궁 1
 
1.8%
1
 
1.8%
음악홀 1
 
1.8%
Other values (45) 45
81.8%
2023-12-11T08:45:32.615432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
6.9%
14
 
5.7%
14
 
5.7%
14
 
5.7%
13
 
5.3%
12
 
4.9%
11
 
4.5%
7
 
2.8%
5
 
2.0%
4
 
1.6%
Other values (97) 135
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 229
93.1%
Space Separator 7
 
2.8%
Decimal Number 6
 
2.4%
Uppercase Letter 2
 
0.8%
Letter Number 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
7.4%
14
 
6.1%
14
 
6.1%
14
 
6.1%
13
 
5.7%
12
 
5.2%
11
 
4.8%
5
 
2.2%
4
 
1.7%
4
 
1.7%
Other values (88) 121
52.8%
Decimal Number
ValueCountFrequency (%)
0 2
33.3%
1 1
16.7%
8 1
16.7%
7 1
16.7%
2 1
16.7%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Uppercase Letter
ValueCountFrequency (%)
G 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 229
93.1%
Common 13
 
5.3%
Latin 4
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
7.4%
14
 
6.1%
14
 
6.1%
14
 
6.1%
13
 
5.7%
12
 
5.2%
11
 
4.8%
5
 
2.2%
4
 
1.7%
4
 
1.7%
Other values (88) 121
52.8%
Common
ValueCountFrequency (%)
7
53.8%
0 2
 
15.4%
1 1
 
7.7%
8 1
 
7.7%
7 1
 
7.7%
2 1
 
7.7%
Latin
ValueCountFrequency (%)
G 2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 229
93.1%
ASCII 15
 
6.1%
Number Forms 2
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
7.4%
14
 
6.1%
14
 
6.1%
14
 
6.1%
13
 
5.7%
12
 
5.2%
11
 
4.8%
5
 
2.2%
4
 
1.7%
4
 
1.7%
Other values (88) 121
52.8%
ASCII
ValueCountFrequency (%)
7
46.7%
0 2
 
13.3%
G 2
 
13.3%
1 1
 
6.7%
8 1
 
6.7%
7 1
 
6.7%
2 1
 
6.7%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct48
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size516.0 B
2023-12-11T08:45:32.865389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length22.5
Min length19

Characters and Unicode

Total characters1080
Distinct characters67
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)100.0%

Sample

1st row경상남도 하동군 고전면 하동읍성로 550-5
2nd row경상남도 하동군 하동읍 중앙로 54
3rd row경상남도 하동군 화개면 화개로 2-15
4th row경상남도 하동군 진교면 민다리길 56-1
5th row경상남도 하동군 하동읍 중앙2길 11-1
ValueCountFrequency (%)
경상남도 48
19.4%
하동군 48
19.4%
하동읍 20
 
8.1%
진교면 11
 
4.4%
중앙2길 8
 
3.2%
화개면 6
 
2.4%
화개로 6
 
2.4%
진교중앙길 5
 
2.0%
민다리길 4
 
1.6%
중앙3길 4
 
1.6%
Other values (74) 88
35.5%
2023-12-11T08:45:33.291288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
200
18.5%
69
 
6.4%
69
 
6.4%
50
 
4.6%
50
 
4.6%
49
 
4.5%
48
 
4.4%
48
 
4.4%
1 44
 
4.1%
31
 
2.9%
Other values (57) 422
39.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 674
62.4%
Space Separator 200
 
18.5%
Decimal Number 165
 
15.3%
Dash Punctuation 27
 
2.5%
Close Punctuation 6
 
0.6%
Open Punctuation 6
 
0.6%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
10.2%
69
 
10.2%
50
 
7.4%
50
 
7.4%
49
 
7.3%
48
 
7.1%
48
 
7.1%
31
 
4.6%
28
 
4.2%
25
 
3.7%
Other values (42) 207
30.7%
Decimal Number
ValueCountFrequency (%)
1 44
26.7%
2 25
15.2%
5 17
 
10.3%
3 17
 
10.3%
6 15
 
9.1%
4 12
 
7.3%
0 11
 
6.7%
8 10
 
6.1%
7 8
 
4.8%
9 6
 
3.6%
Space Separator
ValueCountFrequency (%)
200
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 674
62.4%
Common 406
37.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
10.2%
69
 
10.2%
50
 
7.4%
50
 
7.4%
49
 
7.3%
48
 
7.1%
48
 
7.1%
31
 
4.6%
28
 
4.2%
25
 
3.7%
Other values (42) 207
30.7%
Common
ValueCountFrequency (%)
200
49.3%
1 44
 
10.8%
- 27
 
6.7%
2 25
 
6.2%
5 17
 
4.2%
3 17
 
4.2%
6 15
 
3.7%
4 12
 
3.0%
0 11
 
2.7%
8 10
 
2.5%
Other values (5) 28
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 674
62.4%
ASCII 406
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
200
49.3%
1 44
 
10.8%
- 27
 
6.7%
2 25
 
6.2%
5 17
 
4.2%
3 17
 
4.2%
6 15
 
3.7%
4 12
 
3.0%
0 11
 
2.7%
8 10
 
2.5%
Other values (5) 28
 
6.9%
Hangul
ValueCountFrequency (%)
69
 
10.2%
69
 
10.2%
50
 
7.4%
50
 
7.4%
49
 
7.3%
48
 
7.1%
48
 
7.1%
31
 
4.6%
28
 
4.2%
25
 
3.7%
Other values (42) 207
30.7%

전화번호
Text

MISSING 

Distinct35
Distinct (%)100.0%
Missing13
Missing (%)27.1%
Memory size516.0 B
2023-12-11T08:45:33.459025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters490
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row055 -883 -8861
2nd row 055-884 -4911
3rd row055 -883 -9636
4th row 055-884 -8085
5th row 055- 884-7557
ValueCountFrequency (%)
055 31
36.9%
883 7
 
8.3%
884 4
 
4.8%
055-884 3
 
3.6%
882 3
 
3.6%
0506 1
 
1.2%
7518 1
 
1.2%
883-1037 1
 
1.2%
882-4988 1
 
1.2%
2366 1
 
1.2%
Other values (31) 31
36.9%
2023-12-11T08:45:33.768452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 90
18.4%
8 84
17.1%
70
14.3%
- 70
14.3%
0 49
10.0%
4 28
 
5.7%
3 26
 
5.3%
1 20
 
4.1%
2 17
 
3.5%
6 13
 
2.7%
Other values (2) 23
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 350
71.4%
Space Separator 70
 
14.3%
Dash Punctuation 70
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 90
25.7%
8 84
24.0%
0 49
14.0%
4 28
 
8.0%
3 26
 
7.4%
1 20
 
5.7%
2 17
 
4.9%
6 13
 
3.7%
7 13
 
3.7%
9 10
 
2.9%
Space Separator
ValueCountFrequency (%)
70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 70
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 490
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 90
18.4%
8 84
17.1%
70
14.3%
- 70
14.3%
0 49
10.0%
4 28
 
5.7%
3 26
 
5.3%
1 20
 
4.1%
2 17
 
3.5%
6 13
 
2.7%
Other values (2) 23
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 490
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 90
18.4%
8 84
17.1%
70
14.3%
- 70
14.3%
0 49
10.0%
4 28
 
5.7%
3 26
 
5.3%
1 20
 
4.1%
2 17
 
3.5%
6 13
 
2.7%
Other values (2) 23
 
4.7%

Interactions

2023-12-11T08:45:31.239933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:45:33.878191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명업소명소재지(도로명)전화번호
연번1.0000.9821.0001.0001.000
업종명0.9821.0001.0001.0001.000
업소명1.0001.0001.0001.0001.000
소재지(도로명)1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
2023-12-11T08:45:33.981524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.801
업종명0.8011.000

Missing values

2023-12-11T08:45:31.348121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:45:31.449527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명소재지(도로명)전화번호
01유흥주점영업고전가요방경상남도 하동군 고전면 하동읍성로 550-5055 -883 -8861
12유흥주점영업홍당무경상남도 하동군 하동읍 중앙로 54055-884 -4911
23유흥주점영업태경가요방경상남도 하동군 화개면 화개로 2-15055 -883 -9636
34유흥주점영업월드부일룸 가요방경상남도 하동군 진교면 민다리길 56-1055-884 -8085
45유흥주점영업백야가요방경상남도 하동군 하동읍 중앙2길 11-1055- 884-7557
56유흥주점영업둥지 가요주점경상남도 하동군 금성면 금성로 270055 -882 -5225
67유흥주점영업로또스탠드빠경상남도 하동군 진교면 민다리길 51-2<NA>
78유흥주점영업오페라 7080 노래방경상남도 하동군 진교면 진교중앙길 14-7055- 884-1524
89유흥주점영업토심경상남도 하동군 진교면 민다리안길 107055- 884-0765
910유흥주점영업수일룸가요방경상남도 하동군 진교면 민다리길 53055 -884 -4121
연번업종명업소명소재지(도로명)전화번호
3839유흥주점영업폭포수유흥주점경상남도 하동군 진교면 진교중앙길 14-5, 폭포수유흥주점 2층<NA>
3940유흥주점영업GG가요주점경상남도 하동군 금남면 섬진강대로 963<NA>
4041단란주점정원경상남도 하동군 진교면 경충로 1103055- 883-7266
4142단란주점늘봄단란주점경상남도 하동군 화개면 화개로 239055-883 -8411
4243단란주점비너스단란주점경상남도 하동군 화개면 화개로 48, 3층055- 883-6303
4344단란주점청학골노래주점경상남도 하동군 청암면 청학로 2267055- 882-7094
4445단란주점유진노래주점경상남도 하동군 진교면 민다리길 83-1<NA>
4546단란주점삼덕가요방경상남도 하동군 화개면 화개로 886 (외1필지(지상2층))<NA>
4647단란주점고향노래주점경상남도 하동군 화개면 화개로 533055 -883 -9544
4748단란주점귀족단란주점경상남도 하동군 진교면 진교중앙길 11055 -884 -3358