Overview

Dataset statistics

Number of variables: 5
Number of observations: 4276
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 171.3 KiB
Average record size in memory: 41.0 B
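
The average record size can be recovered from the two figures above by dividing total memory by the number of observations. The following is a minimal check, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 171.3
n_observations = 4276

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{avg_record_bytes:.1f} B")
```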

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

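Description: Information on entertainment bar (유흥주점) establishments, that is, businesses that mainly prepare and sell alcoholic beverages, may employ entertainment workers or install entertainment facilities, and where patrons are permitted to sing or dance.
Author: 경상남도 (Gyeongsangnam-do)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15069260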

Alerts

업종 has constant value "유흥주점영업" (Constant)
연번 is highly overall correlated with 인허가관할기관 (High correlation)
인허가관할기관 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
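
The Constant and Unique alerts above come down to counting distinct values per column. The sketch below illustrates that idea on four rows taken from the Sample section (two from each end of the table). The `classify` helper is hypothetical and is not ydata-profiling's own implementation.

```python
# Four rows from the Sample section: (연번, 인허가관할기관, 업종).
columns = ["연번", "인허가관할기관", "업종"]
rows = [
    (1, "거제시", "유흥주점영업"),
    (2, "거제시", "유흥주점영업"),
    (4275, "합천군", "유흥주점영업"),
    (4276, "합천군", "유흥주점영업"),
]


def classify(values):
    """Return 'constant', 'unique', or None from the distinct-value count."""
    distinct = len(set(values))
    if distinct == 1:
        return "constant"
    if distinct == len(values):
        return "unique"
    return None


# Build one flag per column by slicing the rows column-wise.
flags = {name: classify([row[i] for row in rows]) for i, name in enumerate(columns)}
print(flags)
```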

Reproduction

Analysis started: 2023-12-10 22:53:58.449735
Analysis finished: 2023-12-10 22:53:59.549813
Duration: 1.1 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 4276
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2138.5
Minimum: 1
Maximum: 4276
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 37.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 214.75
Q1: 1069.75
Median: 2138.5
Q3: 3207.25
95-th percentile: 4062.25
Maximum: 4276
Range: 4275
Interquartile range (IQR): 2137.5

Descriptive statistics

Standard deviation: 1234.5192
Coefficient of variation (CV): 0.57728277
Kurtosis: -1.2
Mean: 2138.5
Median Absolute Deviation (MAD): 1069
Skewness: 0
Sum: 9144226
Variance: 1524037.7
Monotonicity: Strictly increasing
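
The quantile and descriptive statistics above are consistent with 연번 being the integers 1 through 4276. The sketch below recomputes them with Python's `statistics` module under that assumption. It uses linear-interpolation quantiles and the sample (n - 1) variance, which reproduce the reported figures. The report itself does not state which estimators it uses.

```python
import statistics

# Assumption: 연번 is exactly 1..4276, as the reported minimum, maximum,
# distinct count and strict monotonicity suggest.
values = list(range(1, 4277))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)
cv = std / mean

# Linear-interpolation quantiles over the inclusive range of the data.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, round(variance, 1), round(std, 4), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad, total)
```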
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2858 | 1 | < 0.1%
2844 | 1 | < 0.1%
2845 | 1 | < 0.1%
2846 | 1 | < 0.1%
2847 | 1 | < 0.1%
2848 | 1 | < 0.1%
2849 | 1 | < 0.1%
2850 | 1 | < 0.1%
2851 | 1 | < 0.1%
Other values (4266) | 4266 | 99.8%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Value | Count | Frequency (%)
4276 | 1 | < 0.1%
4275 | 1 | < 0.1%
4274 | 1 | < 0.1%
4273 | 1 | < 0.1%
4272 | 1 | < 0.1%
4271 | 1 | < 0.1%
4270 | 1 | < 0.1%
4269 | 1 | < 0.1%
4268 | 1 | < 0.1%
4267 | 1 | < 0.1%

인허가관할기관 (licensing authority)
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 33.5 KiB

김해시: 648
창원시 성산구: 531
창원시 마산합포구: 444
양산시: 327
거제시: 325
Other values (17): 2001

Length

Max length: 9
Median length: 3
Mean length: 4.9172123
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 거제시
2nd row: 거제시
3rd row: 거제시
4th row: 거제시
5th row: 거제시

Common Values

Value | Count | Frequency (%)
김해시 | 648 | 15.2%
창원시 성산구 | 531 | 12.4%
창원시 마산합포구 | 444 | 10.4%
양산시 | 327 | 7.6%
거제시 | 325 | 7.6%
창원시 의창구 | 292 | 6.8%
통영시 | 290 | 6.8%
창원시 진해구 | 256 | 6.0%
진주시 | 250 | 5.8%
창원시 마산회원구 | 203 | 4.7%
Other values (12) | 710 | 16.6%
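
The Frequency (%) column is each count divided by the 4276 observations. As a quick consistency check, the sketch below recomputes the shares from the counts listed above. The dictionary is transcribed from this table, and the one-decimal rounding is an assumption about how the report formats percentages.

```python
# Counts transcribed from the Common Values table above.
counts = {
    "김해시": 648,
    "창원시 성산구": 531,
    "창원시 마산합포구": 444,
    "양산시": 327,
    "거제시": 325,
    "창원시 의창구": 292,
    "통영시": 290,
    "창원시 진해구": 256,
    "진주시": 250,
    "창원시 마산회원구": 203,
    "Other values (12)": 710,
}

total = sum(counts.values())

# Share of each value, formatted to one decimal place.
shares = {name: f"{100 * n / total:.1f}%" for name, n in counts.items()}

for name, share in shares.items():
    print(name, counts[name], share)
```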

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
창원시 | 1726 | 28.8%
김해시 | 648 | 10.8%
성산구 | 531 | 8.8%
마산합포구 | 444 | 7.4%
양산시 | 327 | 5.4%
거제시 | 325 | 5.4%
의창구 | 292 | 4.9%
통영시 | 290 | 4.8%
진해구 | 256 | 4.3%
진주시 | 250 | 4.2%
Other values (13) | 913 | 15.2%

업소명 (business name)
Text

Distinct: 3567
Distinct (%): 83.4%
Missing: 0
Missing (%): 0.0%
Memory size: 33.5 KiB

Length

Max length: 41
Median length: 25
Mean length: 5.7284846
Min length: 1

Characters and Unicode

Total characters: 24495
Distinct characters: 754
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
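
To illustrate what "character properties" means here, the sketch below counts Unicode general categories for three business names taken from the Sample section, using Python's standard `unicodedata` module. The report does not say how it derives its own counts, so this is a stand-in. The `LABELS` mapping is an illustrative subset that covers only the categories these three names contain.

```python
import unicodedata
from collections import Counter

# Three business names taken from the sample rows of this report.
names = ["고구려", "준코뮤직타운 고현1호점", "VIP주점"]

# Two-letter general-category codes mapped to the labels used in the report.
LABELS = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}

# Count one category per character; unknown codes fall back to the raw code.
category_counts = Counter(
    LABELS.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)

for label, n in category_counts.most_common():
    print(label, n)
```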

Unique

Unique: 3135
Unique (%): 73.3%

Sample

1st row: 고구려
2nd row: 고운정가요주점
3rd row: 그린비
4th row: 갤러리
5th row: 꿀단지가요주점

Value | Count | Frequency (%)
노래주점 | 94 | 2.0%
노래방 | 52 | 1.1%
술마시는노래방 | 28 | 0.6%
술마시는 | 27 | 0.6%
가라오케 | 18 | 0.4%
유흥주점 | 11 | 0.2%
가요주점 | 10 | 0.2%
황진이 | 10 | 0.2%
bar | 9 | 0.2%
고구려 | 9 | 0.2%
Other values (3563) | 4434 | 94.3%

Most occurring characters

ValueCountFrequency (%)
1891
 
7.7%
1887
 
7.7%
1437
 
5.9%
1436
 
5.9%
791
 
3.2%
542
 
2.2%
437
 
1.8%
436
 
1.8%
432
 
1.8%
431
 
1.8%
Other values (744) 14775
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22617
92.3%
Uppercase Letter 533
 
2.2%
Space Separator 431
 
1.8%
Decimal Number 398
 
1.6%
Lowercase Letter 208
 
0.8%
Close Punctuation 130
 
0.5%
Open Punctuation 129
 
0.5%
Other Punctuation 39
 
0.2%
Letter Number 5
 
< 0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1891
 
8.4%
1887
 
8.3%
1437
 
6.4%
1436
 
6.3%
791
 
3.5%
542
 
2.4%
437
 
1.9%
436
 
1.9%
432
 
1.9%
410
 
1.8%
Other values (671) 12918
57.1%
Uppercase Letter
ValueCountFrequency (%)
B 51
 
9.6%
A 46
 
8.6%
M 34
 
6.4%
I 33
 
6.2%
O 33
 
6.2%
N 28
 
5.3%
L 28
 
5.3%
E 27
 
5.1%
C 27
 
5.1%
S 27
 
5.1%
Other values (15) 199
37.3%
Lowercase Letter
ValueCountFrequency (%)
o 31
14.9%
a 26
12.5%
r 22
10.6%
e 19
 
9.1%
n 14
 
6.7%
b 11
 
5.3%
m 10
 
4.8%
l 10
 
4.8%
s 8
 
3.8%
i 8
 
3.8%
Other values (14) 49
23.6%
Decimal Number
ValueCountFrequency (%)
0 117
29.4%
1 61
15.3%
8 61
15.3%
7 56
14.1%
2 49
12.3%
9 19
 
4.8%
3 15
 
3.8%
6 8
 
2.0%
5 6
 
1.5%
4 6
 
1.5%
Other Punctuation
ValueCountFrequency (%)
. 21
53.8%
& 9
23.1%
, 4
 
10.3%
! 3
 
7.7%
1
 
2.6%
/ 1
 
2.6%
Letter Number
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
431
100.0%
Close Punctuation
ValueCountFrequency (%)
) 130
100.0%
Open Punctuation
ValueCountFrequency (%)
( 129
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22615
92.3%
Common 1132
 
4.6%
Latin 746
 
3.0%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1891
 
8.4%
1887
 
8.3%
1437
 
6.4%
1436
 
6.3%
791
 
3.5%
542
 
2.4%
437
 
1.9%
436
 
1.9%
432
 
1.9%
410
 
1.8%
Other values (669) 12916
57.1%
Latin
ValueCountFrequency (%)
B 51
 
6.8%
A 46
 
6.2%
M 34
 
4.6%
I 33
 
4.4%
O 33
 
4.4%
o 31
 
4.2%
N 28
 
3.8%
L 28
 
3.8%
E 27
 
3.6%
C 27
 
3.6%
Other values (42) 408
54.7%
Common
ValueCountFrequency (%)
431
38.1%
) 130
 
11.5%
( 129
 
11.4%
0 117
 
10.3%
1 61
 
5.4%
8 61
 
5.4%
7 56
 
4.9%
2 49
 
4.3%
. 21
 
1.9%
9 19
 
1.7%
Other values (11) 58
 
5.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22615
92.3%
ASCII 1872
 
7.6%
Number Forms 5
 
< 0.1%
None 1
 
< 0.1%
CJK 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1891
 
8.4%
1887
 
8.3%
1437
 
6.4%
1436
 
6.3%
791
 
3.5%
542
 
2.4%
437
 
1.9%
436
 
1.9%
432
 
1.9%
410
 
1.8%
Other values (669) 12916
57.1%
ASCII
ValueCountFrequency (%)
431
23.0%
) 130
 
6.9%
( 129
 
6.9%
0 117
 
6.2%
1 61
 
3.3%
8 61
 
3.3%
7 56
 
3.0%
B 51
 
2.7%
2 49
 
2.6%
A 46
 
2.5%
Other values (59) 741
39.6%
Number Forms
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

업종 (business type)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 33.5 KiB

유흥주점영업: 4276

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유흥주점영업
2nd row: 유흥주점영업
3rd row: 유흥주점영업
4th row: 유흥주점영업
5th row: 유흥주점영업

Common Values

Value | Count | Frequency (%)
유흥주점영업 | 4276 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
유흥주점영업 | 4276 | 100.0%

업소주소 (business address)
Text

Distinct: 4069
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Memory size: 33.5 KiB

Length

Max length: 61
Median length: 49
Mean length: 29.755379
Min length: 14

Characters and Unicode

Total characters: 127234
Distinct characters: 391
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
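
For both text variables, the reported mean length agrees with total characters divided by the number of observations. The sketch below checks this using the figures printed in this report. It assumes every one of the 4276 rows contributes to the character total, which is consistent with the reported 0 missing values.

```python
n_observations = 4276

# Total characters per text variable, as reported in this document.
total_characters = {"업소명": 24495, "업소주소": 127234}

# Mean length = total characters / number of rows.
mean_length = {
    name: chars / n_observations for name, chars in total_characters.items()
}

for name, value in mean_length.items():
    print(name, round(value, 7))
```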

Unique

Unique: 3883
Unique (%): 90.8%

Sample

1st row: 경상남도 거제시 옥포로10길 6(옥포동)
2nd row: 경상남도 거제시 거제중앙로13길 15-2(고현동)
3rd row: 경상남도 거제시 능포로 153-1(능포동)
4th row: 경상남도 거제시 거제중앙로27길 5(고현동,1동)
5th row: 경상남도 거제시 옥포로6길 46(옥포동,2층)

Value | Count | Frequency (%)
경상남도 | 4276 | 18.9%
창원시 | 1726 | 7.6%
김해시 | 648 | 2.9%
성산구 | 514 | 2.3%
마산합포구 | 444 | 2.0%
양산시 | 327 | 1.4%
거제시 | 325 | 1.4%
의창구 | 309 | 1.4%
통영시 | 290 | 1.3%
진해구 | 256 | 1.1%
Other values (4723) | 13497 | 59.7%

Most occurring characters

ValueCountFrequency (%)
18344
 
14.4%
1 5178
 
4.1%
5126
 
4.0%
5004
 
3.9%
4372
 
3.4%
4317
 
3.4%
4062
 
3.2%
3957
 
3.1%
( 3803
 
3.0%
) 3803
 
3.0%
Other values (381) 69268
54.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 77310
60.8%
Decimal Number 20903
 
16.4%
Space Separator 18344
 
14.4%
Open Punctuation 3803
 
3.0%
Close Punctuation 3803
 
3.0%
Other Punctuation 1862
 
1.5%
Dash Punctuation 1092
 
0.9%
Uppercase Letter 99
 
0.1%
Lowercase Letter 13
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5126
 
6.6%
5004
 
6.5%
4372
 
5.7%
4317
 
5.6%
4062
 
5.3%
3957
 
5.1%
3106
 
4.0%
2733
 
3.5%
2468
 
3.2%
2324
 
3.0%
Other values (338) 39841
51.5%
Uppercase Letter
ValueCountFrequency (%)
B 25
25.3%
N 16
16.2%
A 9
 
9.1%
C 9
 
9.1%
M 8
 
8.1%
J 6
 
6.1%
O 4
 
4.0%
S 4
 
4.0%
G 3
 
3.0%
K 3
 
3.0%
Other values (6) 12
12.1%
Decimal Number
ValueCountFrequency (%)
1 5178
24.8%
2 3512
16.8%
3 2468
11.8%
0 2092
10.0%
5 1706
 
8.2%
4 1637
 
7.8%
6 1430
 
6.8%
7 1169
 
5.6%
9 870
 
4.2%
8 841
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
y 2
15.4%
t 2
15.4%
i 2
15.4%
c 2
15.4%
o 2
15.4%
p 2
15.4%
k 1
7.7%
Other Punctuation
ValueCountFrequency (%)
, 1791
96.2%
· 35
 
1.9%
. 27
 
1.5%
/ 9
 
0.5%
Math Symbol
ValueCountFrequency (%)
~ 4
80.0%
+ 1
 
20.0%
Space Separator
ValueCountFrequency (%)
18344
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3803
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3803
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1092
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 77310
60.8%
Common 49812
39.1%
Latin 112
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5126
 
6.6%
5004
 
6.5%
4372
 
5.7%
4317
 
5.6%
4062
 
5.3%
3957
 
5.1%
3106
 
4.0%
2733
 
3.5%
2468
 
3.2%
2324
 
3.0%
Other values (338) 39841
51.5%
Latin
ValueCountFrequency (%)
B 25
22.3%
N 16
14.3%
A 9
 
8.0%
C 9
 
8.0%
M 8
 
7.1%
J 6
 
5.4%
O 4
 
3.6%
S 4
 
3.6%
G 3
 
2.7%
K 3
 
2.7%
Other values (13) 25
22.3%
Common
ValueCountFrequency (%)
18344
36.8%
1 5178
 
10.4%
( 3803
 
7.6%
) 3803
 
7.6%
2 3512
 
7.1%
3 2468
 
5.0%
0 2092
 
4.2%
, 1791
 
3.6%
5 1706
 
3.4%
4 1637
 
3.3%
Other values (10) 5478
 
11.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 77310
60.8%
ASCII 49889
39.2%
None 35
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18344
36.8%
1 5178
 
10.4%
( 3803
 
7.6%
) 3803
 
7.6%
2 3512
 
7.0%
3 2468
 
4.9%
0 2092
 
4.2%
, 1791
 
3.6%
5 1706
 
3.4%
4 1637
 
3.3%
Other values (32) 5555
 
11.1%
Hangul
ValueCountFrequency (%)
5126
 
6.6%
5004
 
6.5%
4372
 
5.7%
4317
 
5.6%
4062
 
5.3%
3957
 
5.1%
3106
 
4.0%
2733
 
3.5%
2468
 
3.2%
2324
 
3.0%
Other values (338) 39841
51.5%
None
ValueCountFrequency (%)
· 35
100.0%

Interactions


Correlations

Variable | 연번 | 인허가관할기관
연번 | 1.000 | 0.974
인허가관할기관 | 0.974 | 1.000

Variable | 연번 | 인허가관할기관
연번 | 1.000 | 0.856
인허가관할기관 | 0.856 | 1.000
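
The Sample section suggests that rows are ordered by jurisdiction, which would explain why a serial number is strongly associated with a categorical column. The report does not state which coefficient the matrices above contain. As an illustration only, the sketch below bins a numeric column and computes Cramér's V (without bias correction) against a categorical one. The toy data and the helpers `to_bins` and `cramers_v` are hypothetical.

```python
import math
from collections import Counter


def to_bins(values, n_bins):
    """Assign each value to one of n_bins equal-width bins (0-based index)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def cramers_v(xs, ys):
    """Cramér's V (no bias correction) between two label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals, y_totals = Counter(xs), Counter(ys)
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            chi2 += (joint[(x, y)] - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical toy data: 12 serial numbers and three jurisdictions.
serial = list(range(1, 13))
sorted_groups = ["A"] * 4 + ["B"] * 4 + ["C"] * 4  # rows sorted by group
mixed_groups = ["A", "B", "C"] * 4                 # rows interleaved

bins = to_bins(serial, 3)
v_sorted = cramers_v(bins, sorted_groups)
v_mixed = cramers_v(bins, mixed_groups)
print(round(v_sorted, 3), round(v_mixed, 3))
```

In this toy setup, sorting by group makes the association perfect, while interleaving the same groups makes it weak. This suggests the high correlation alert here reflects row ordering rather than a property of the establishments.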

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 인허가관할기관 | 업소명 | 업종 | 업소주소
0 | 1 | 거제시 | 고구려 | 유흥주점영업 | 경상남도 거제시 옥포로10길 6(옥포동)
1 | 2 | 거제시 | 고운정가요주점 | 유흥주점영업 | 경상남도 거제시 거제중앙로13길 15-2(고현동)
2 | 3 | 거제시 | 그린비 | 유흥주점영업 | 경상남도 거제시 능포로 153-1(능포동)
3 | 4 | 거제시 | 갤러리 | 유흥주점영업 | 경상남도 거제시 거제중앙로27길 5(고현동,1동)
4 | 5 | 거제시 | 꿀단지가요주점 | 유흥주점영업 | 경상남도 거제시 옥포로6길 46(옥포동,2층)
5 | 6 | 거제시 | 꿀단지노래주점 | 유흥주점영업 | 경상남도 거제시 장평로8길 22-1(장평동,2층)
6 | 7 | 거제시 | 준노래주점 | 유흥주점영업 | 경상남도 거제시 장평로8길 11(장평동,장평프라자2층)
7 | 8 | 거제시 | 카우보이 | 유흥주점영업 | 경상남도 거제시 옥포로 182(지층 옥포동)
8 | 9 | 거제시 | 준코뮤직타운 고현1호점 | 유흥주점영업 | 경상남도 거제시 거제중앙로24길 5(고현동,2층)
9 | 10 | 거제시 | 준코뮤직타운 고현2호점 | 유흥주점영업 | 경상남도 거제시 거제중앙로24길 5(고현동,3층)

Index | 연번 | 인허가관할기관 | 업소명 | 업종 | 업소주소
4266 | 4267 | 합천군 | 백야 | 유흥주점영업 | 경상남도 합천군 동서로 65
4267 | 4268 | 합천군 | 발리 | 유흥주점영업 | 경상남도 합천군 치인1길 13-7(지하1층)
4268 | 4269 | 합천군 | 향기주점 | 유흥주점영업 | 경상남도 합천군 청덕면 의합대로 2909
4269 | 4270 | 합천군 | 현대 룸싸롱 | 유흥주점영업 | 경상남도 합천군 야로면 가야산로 346
4270 | 4271 | 합천군 | VIP주점 | 유흥주점영업 | 경상남도 합천군 야로면 가야산로 347(2층)
4271 | 4272 | 합천군 | 고래 | 유흥주점영업 | 경상남도 합천군 동서로 87
4272 | 4273 | 합천군 | 골든폭스 | 유흥주점영업 | 경상남도 합천군 합천읍 충효로 78-8
4273 | 4274 | 합천군 | 궁전싸롱 | 유흥주점영업 | 경상남도 합천군 가야면 치인1길 20
4274 | 4275 | 합천군 | 궁전유흥주점 | 유흥주점영업 | 경상남도 합천군 초계면 내동1길 4-3(1층)
4275 | 4276 | 합천군 | 감로가요주점 | 유흥주점영업 | 경상남도 합천군 일부4길 4(1층)