Overview

Dataset statistics

Number of variables5
Number of observations129
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory42.0 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description경상남도 남해군에 등록된 이미용업소현황입니다. 이미용업소의 업종명, 업소명, 업소소재지(도로명주소), 전화번호를 포함한 정보입니다.
Author경상남도 남해군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15065510

Alerts

연번 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:18:43.011012
Analysis finished2023-12-10 23:18:43.419026
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct129
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65
Minimum1
Maximum129
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-11T08:18:43.489140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.4
Q133
median65
Q397
95-th percentile122.6
Maximum129
Range128
Interquartile range (IQR)64

Descriptive statistics

Standard deviation37.383151
Coefficient of variation (CV)0.5751254
Kurtosis-1.2
Mean65
Median Absolute Deviation (MAD)32
Skewness0
Sum8385
Variance1397.5
MonotonicityStrictly increasing
2023-12-11T08:18:43.623872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
98 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
Other values (119) 119
92.2%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%

업종명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
미용업
61 
이용업
30 
미용업(일반)
17 
미용업(피부)
11 
미용업(종합)
 
3
Other values (6)

Length

Max length19
Median length3
Mean length4.496124
Min length3

Unique

Unique5 ?
Unique (%)3.9%

Sample

1st row이용업
2nd row이용업
3rd row이용업
4th row이용업
5th row이용업

Common Values

ValueCountFrequency (%)
미용업 61
47.3%
이용업 30
23.3%
미용업(일반) 17
 
13.2%
미용업(피부) 11
 
8.5%
미용업(종합) 3
 
2.3%
미용업(손톱,발톱) 2
 
1.6%
미용업(일반,피부) 1
 
0.8%
미용업(일반,손톱,발톱) 1
 
0.8%
미용업(피부,손톱,발톱) 1
 
0.8%
미용업(손톱,발,화장,분장) 1
 
0.8%

Length

2023-12-11T08:18:43.746388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미용업 61
47.3%
이용업 30
23.3%
미용업(일반 17
 
13.2%
미용업(피부 11
 
8.5%
미용업(종합 3
 
2.3%
미용업(손톱,발톱 2
 
1.6%
미용업(일반,피부 1
 
0.8%
미용업(일반,손톱,발톱 1
 
0.8%
미용업(피부,손톱,발톱 1
 
0.8%
미용업(손톱,발,화장,분장 1
 
0.8%
Distinct124
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T08:18:44.013062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length5
Mean length5.5503876
Min length2

Characters and Unicode

Total characters716
Distinct characters190
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique119 ?
Unique (%)92.2%

Sample

1st row금음이용원
2nd row신흥이용원
3rd row용소이용원
4th row고등이용원
5th row제일이용원
ValueCountFrequency (%)
신흥이용원 2
 
1.4%
행복미용실 2
 
1.4%
헤어 2
 
1.4%
영미용실 2
 
1.4%
salon 2
 
1.4%
서울미용실 2
 
1.4%
현대이용원 2
 
1.4%
금음이용원 1
 
0.7%
hair(유앤미헤어 1
 
0.7%
참신미용실 1
 
0.7%
Other values (130) 130
88.4%
2023-12-11T08:18:44.378999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
 
9.9%
51
 
7.1%
43
 
6.0%
36
 
5.0%
30
 
4.2%
29
 
4.1%
29
 
4.1%
18
 
2.5%
13
 
1.8%
12
 
1.7%
Other values (180) 384
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 625
87.3%
Uppercase Letter 37
 
5.2%
Lowercase Letter 20
 
2.8%
Space Separator 18
 
2.5%
Close Punctuation 7
 
1.0%
Open Punctuation 7
 
1.0%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
11.4%
51
 
8.2%
43
 
6.9%
36
 
5.8%
30
 
4.8%
29
 
4.6%
29
 
4.6%
13
 
2.1%
12
 
1.9%
9
 
1.4%
Other values (152) 302
48.3%
Uppercase Letter
ValueCountFrequency (%)
O 6
16.2%
A 6
16.2%
N 4
10.8%
I 4
10.8%
L 3
8.1%
S 3
8.1%
R 2
 
5.4%
H 2
 
5.4%
B 2
 
5.4%
M 2
 
5.4%
Other values (3) 3
8.1%
Lowercase Letter
ValueCountFrequency (%)
a 3
15.0%
l 3
15.0%
h 2
10.0%
o 2
10.0%
u 2
10.0%
r 2
10.0%
i 2
10.0%
n 1
 
5.0%
g 1
 
5.0%
e 1
 
5.0%
Space Separator
ValueCountFrequency (%)
18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 623
87.0%
Latin 57
 
8.0%
Common 34
 
4.7%
Han 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
11.4%
51
 
8.2%
43
 
6.9%
36
 
5.8%
30
 
4.8%
29
 
4.7%
29
 
4.7%
13
 
2.1%
12
 
1.9%
9
 
1.4%
Other values (150) 300
48.2%
Latin
ValueCountFrequency (%)
O 6
 
10.5%
A 6
 
10.5%
N 4
 
7.0%
I 4
 
7.0%
a 3
 
5.3%
L 3
 
5.3%
S 3
 
5.3%
l 3
 
5.3%
R 2
 
3.5%
H 2
 
3.5%
Other values (14) 21
36.8%
Common
ValueCountFrequency (%)
18
52.9%
) 7
 
20.6%
( 7
 
20.6%
& 2
 
5.9%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 623
87.0%
ASCII 91
 
12.7%
CJK 2
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
71
 
11.4%
51
 
8.2%
43
 
6.9%
36
 
5.8%
30
 
4.8%
29
 
4.7%
29
 
4.7%
13
 
2.1%
12
 
1.9%
9
 
1.4%
Other values (150) 300
48.2%
ASCII
ValueCountFrequency (%)
18
19.8%
) 7
 
7.7%
( 7
 
7.7%
O 6
 
6.6%
A 6
 
6.6%
N 4
 
4.4%
I 4
 
4.4%
a 3
 
3.3%
L 3
 
3.3%
S 3
 
3.3%
Other values (18) 30
33.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct121
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T08:18:44.713121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length36
Mean length23.790698
Min length18

Characters and Unicode

Total characters3069
Distinct characters79
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique113 ?
Unique (%)87.6%

Sample

1st row경상남도 남해군 설천면 설천로 702
2nd row경상남도 남해군 고현면 탑동로 65-20
3rd row경상남도 남해군 이동면 남서대로264번길 18-7
4th row경상남도 남해군 남면 남서대로 777-1
5th row경상남도 남해군 창선면 흥선로148번길 26-1
ValueCountFrequency (%)
경상남도 129
18.6%
남해군 129
18.6%
남해읍 79
 
11.4%
화전로 33
 
4.8%
2층 17
 
2.5%
창선면 11
 
1.6%
창선로 8
 
1.2%
1층 8
 
1.2%
이동면 8
 
1.2%
미조면 7
 
1.0%
Other values (156) 264
38.1%
2023-12-11T08:18:45.103720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
578
18.8%
356
 
11.6%
213
 
6.9%
135
 
4.4%
129
 
4.2%
129
 
4.2%
129
 
4.2%
128
 
4.2%
1 119
 
3.9%
79
 
2.6%
Other values (69) 1074
35.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1896
61.8%
Space Separator 578
 
18.8%
Decimal Number 502
 
16.4%
Dash Punctuation 39
 
1.3%
Close Punctuation 27
 
0.9%
Open Punctuation 27
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
356
18.8%
213
11.2%
135
 
7.1%
129
 
6.8%
129
 
6.8%
129
 
6.8%
128
 
6.8%
79
 
4.2%
69
 
3.6%
66
 
3.5%
Other values (55) 463
24.4%
Decimal Number
ValueCountFrequency (%)
1 119
23.7%
2 76
15.1%
6 58
11.6%
9 40
 
8.0%
3 40
 
8.0%
7 39
 
7.8%
4 38
 
7.6%
8 34
 
6.8%
5 31
 
6.2%
0 27
 
5.4%
Space Separator
ValueCountFrequency (%)
578
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1896
61.8%
Common 1173
38.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
356
18.8%
213
11.2%
135
 
7.1%
129
 
6.8%
129
 
6.8%
129
 
6.8%
128
 
6.8%
79
 
4.2%
69
 
3.6%
66
 
3.5%
Other values (55) 463
24.4%
Common
ValueCountFrequency (%)
578
49.3%
1 119
 
10.1%
2 76
 
6.5%
6 58
 
4.9%
9 40
 
3.4%
3 40
 
3.4%
7 39
 
3.3%
- 39
 
3.3%
4 38
 
3.2%
8 34
 
2.9%
Other values (4) 112
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1896
61.8%
ASCII 1173
38.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
578
49.3%
1 119
 
10.1%
2 76
 
6.5%
6 58
 
4.9%
9 40
 
3.4%
3 40
 
3.4%
7 39
 
3.3%
- 39
 
3.3%
4 38
 
3.2%
8 34
 
2.9%
Other values (4) 112
 
9.5%
Hangul
ValueCountFrequency (%)
356
18.8%
213
11.2%
135
 
7.1%
129
 
6.8%
129
 
6.8%
129
 
6.8%
128
 
6.8%
79
 
4.2%
69
 
3.6%
66
 
3.5%
Other values (55) 463
24.4%
Distinct102
Distinct (%)79.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-11T08:18:45.328939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1548
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)78.3%

Sample

1st row055-863-3123
2nd row055-863-3740
3rd row055-863-0139
4th row055-862-8448
5th row055-867-1138
ValueCountFrequency (%)
000-000-0000 28
 
21.7%
055-867-0984 1
 
0.8%
055-864-5972 1
 
0.8%
055-867-3928 1
 
0.8%
055-863-3234 1
 
0.8%
055-864-2893 1
 
0.8%
055-864-4905 1
 
0.8%
055-864-7361 1
 
0.8%
055-864-7868 1
 
0.8%
055-864-6399 1
 
0.8%
Other values (92) 92
71.3%
2023-12-11T08:18:45.663250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 411
26.6%
- 258
16.7%
5 253
16.3%
6 138
 
8.9%
8 133
 
8.6%
3 78
 
5.0%
4 74
 
4.8%
2 57
 
3.7%
7 56
 
3.6%
1 52
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1290
83.3%
Dash Punctuation 258
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 411
31.9%
5 253
19.6%
6 138
 
10.7%
8 133
 
10.3%
3 78
 
6.0%
4 74
 
5.7%
2 57
 
4.4%
7 56
 
4.3%
1 52
 
4.0%
9 38
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 258
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1548
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 411
26.6%
- 258
16.7%
5 253
16.3%
6 138
 
8.9%
8 133
 
8.6%
3 78
 
5.0%
4 74
 
4.8%
2 57
 
3.7%
7 56
 
3.6%
1 52
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1548
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 411
26.6%
- 258
16.7%
5 253
16.3%
6 138
 
8.9%
8 133
 
8.6%
3 78
 
5.0%
4 74
 
4.8%
2 57
 
3.7%
7 56
 
3.6%
1 52
 
3.4%

Interactions

2023-12-11T08:18:43.197391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:18:45.749892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.808
업종명0.8081.000
2023-12-11T08:18:45.823882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명
연번1.0000.511
업종명0.5111.000

Missing values

2023-12-11T08:18:43.296237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:18:43.387403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명업소소재지전화번호
01이용업금음이용원경상남도 남해군 설천면 설천로 702055-863-3123
12이용업신흥이용원경상남도 남해군 고현면 탑동로 65-20055-863-3740
23이용업용소이용원경상남도 남해군 이동면 남서대로264번길 18-7055-863-0139
34이용업고등이용원경상남도 남해군 남면 남서대로 777-1055-862-8448
45이용업제일이용원경상남도 남해군 창선면 흥선로148번길 26-1055-867-1138
56이용업미미이용원경상남도 남해군 남해읍 화전로 17055-864-6552
67이용업대성이용원경상남도 남해군 남해읍 화전로78번가길 3055-864-6482
78이용업삼성이용원경상남도 남해군 서면 남서대로2618번길 3055-862-1080
89이용업신광이용원경상남도 남해군 고현면 탑동로 61-1055-862-3305
910이용업해성이용원경상남도 남해군 창선면 서부로 1063055-867-0969
연번업종명업소명업소소재지전화번호
119120미용업(종합)뷰티네일경상남도 남해군 남해읍 화전로 103 4층000-000-0000
120121미용업(종합)베누스파경상남도 남해군 남해읍 화전로 123-3000-000-0000
121122미용업(종합)모모살롱(MOMO SALON)경상남도 남해군 남해읍 망운로21번가길 24-23 1층000-000-0000
122123미용업(손톱,발톱)라라네일경상남도 남해군 남해읍 화전로96번가길 10055-862-6664
123124미용업(손톱,발톱)네일해경상남도 남해군 남해읍 망운로 64000-000-0000
124125미용업(일반,피부)주노(Juno)경상남도 남해군 남해읍 화전로59번길 5000-000-0000
125126미용업(일반,손톱,발톱)네일스토리경상남도 남해군 남해읍 화전로 126 2층000-000-0000
126127미용업(피부,손톱,발톱)영스킨경상남도 남해군 남해읍 화전로 143 (2층)000-000-0000
127128미용업(손톱,발,화장,분장)제이속눈썹 네일경상남도 남해군 남해읍 화전로 75-1 (2층)000-000-0000
128129미용업(일반,손톱,발톱,화장,분장)수 헤어샵경상남도 남해군 상주면 남해대로697번길 5 1층000-000-0000