Overview

Dataset statistics

Number of variables4
Number of observations3998
Missing cells59
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory129.0 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description경상남도 창원시 미용업(종합, 일반, 네일, 화장.분장, 피부) 현황(업종명, 업소명, 영업장 소재지(도로명, 지번))입니다.
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15006846

Alerts

소재지(도로명) has 59 (1.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:14:07.766832
Analysis finished2023-12-11 00:14:08.621502
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct3998
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1999.5
Minimum1
Maximum3998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size35.3 KiB
2023-12-11T09:14:08.686932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile200.85
Q11000.25
median1999.5
Q32998.75
95-th percentile3798.15
Maximum3998
Range3997
Interquartile range (IQR)1998.5

Descriptive statistics

Standard deviation1154.2675
Coefficient of variation (CV)0.57727808
Kurtosis-1.2
Mean1999.5
Median Absolute Deviation (MAD)999.5
Skewness0
Sum7994001
Variance1332333.5
MonotonicityStrictly increasing
2023-12-11T09:14:08.817503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2672 1
 
< 0.1%
2659 1
 
< 0.1%
2660 1
 
< 0.1%
2661 1
 
< 0.1%
2662 1
 
< 0.1%
2663 1
 
< 0.1%
2664 1
 
< 0.1%
2665 1
 
< 0.1%
2666 1
 
< 0.1%
Other values (3988) 3988
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3998 1
< 0.1%
3997 1
< 0.1%
3996 1
< 0.1%
3995 1
< 0.1%
3994 1
< 0.1%
3993 1
< 0.1%
3992 1
< 0.1%
3991 1
< 0.1%
3990 1
< 0.1%
3989 1
< 0.1%

업종
Categorical

Distinct16
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size31.4 KiB
일반미용업
1881 
피부미용업
580 
미용업
547 
네일미용업
340 
종합미용업
264 
Other values (11)
386 

Length

Max length23
Median length5
Mean length5.6088044
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row일반미용업
2nd row일반미용업
3rd row일반미용업
4th row일반미용업
5th row일반미용업

Common Values

ValueCountFrequency (%)
일반미용업 1881
47.0%
피부미용업 580
 
14.5%
미용업 547
 
13.7%
네일미용업 340
 
8.5%
종합미용업 264
 
6.6%
화장ㆍ분장 미용업 101
 
2.5%
피부미용업, 네일미용업 58
 
1.5%
네일미용업, 화장ㆍ분장 미용업 54
 
1.4%
피부미용업, 화장ㆍ분장 미용업 44
 
1.1%
일반미용업, 네일미용업, 화장ㆍ분장 미용업 27
 
0.7%
Other values (6) 102
 
2.6%

Length

2023-12-11T09:14:08.968046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 1984
42.9%
미용업 827
17.9%
피부미용업 732
 
15.8%
네일미용업 535
 
11.6%
화장ㆍ분장 280
 
6.1%
종합미용업 264
 
5.7%
Distinct3717
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size31.4 KiB
2023-12-11T09:14:09.260851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length30
Mean length6.172086
Min length1

Characters and Unicode

Total characters24676
Distinct characters778
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3507 ?
Unique (%)87.7%

Sample

1st row춤추는머리나라미용실
2nd row뉴스타미용실
3rd row뉴신촌미용실
4th row모드니헤어라인
5th row제일미용실
ValueCountFrequency (%)
hair 48
 
1.0%
헤어 40
 
0.9%
네일 30
 
0.6%
에스테틱 22
 
0.5%
미용실 21
 
0.5%
헤어샵 20
 
0.4%
nail 18
 
0.4%
by 12
 
0.3%
beauty 12
 
0.3%
salon 10
 
0.2%
Other values (3917) 4388
95.0%
2023-12-11T09:14:09.666438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1620
 
6.6%
1535
 
6.2%
714
 
2.9%
624
 
2.5%
477
 
1.9%
474
 
1.9%
470
 
1.9%
466
 
1.9%
448
 
1.8%
441
 
1.8%
Other values (768) 17407
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20382
82.6%
Lowercase Letter 1354
 
5.5%
Uppercase Letter 1167
 
4.7%
Space Separator 624
 
2.5%
Close Punctuation 386
 
1.6%
Open Punctuation 385
 
1.6%
Other Punctuation 217
 
0.9%
Decimal Number 130
 
0.5%
Dash Punctuation 20
 
0.1%
Connector Punctuation 5
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1620
 
7.9%
1535
 
7.5%
714
 
3.5%
477
 
2.3%
474
 
2.3%
470
 
2.3%
466
 
2.3%
448
 
2.2%
441
 
2.2%
430
 
2.1%
Other values (686) 13307
65.3%
Lowercase Letter
ValueCountFrequency (%)
a 169
12.5%
i 136
10.0%
e 133
9.8%
o 126
9.3%
n 113
 
8.3%
l 95
 
7.0%
r 90
 
6.6%
y 72
 
5.3%
h 69
 
5.1%
u 55
 
4.1%
Other values (15) 296
21.9%
Uppercase Letter
ValueCountFrequency (%)
A 120
 
10.3%
I 91
 
7.8%
H 91
 
7.8%
S 88
 
7.5%
N 79
 
6.8%
L 71
 
6.1%
O 66
 
5.7%
R 66
 
5.7%
B 62
 
5.3%
E 61
 
5.2%
Other values (15) 372
31.9%
Other Punctuation
ValueCountFrequency (%)
& 57
26.3%
# 46
21.2%
. 43
19.8%
, 40
18.4%
' 11
 
5.1%
: 10
 
4.6%
! 3
 
1.4%
" 2
 
0.9%
· 2
 
0.9%
% 2
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 42
32.3%
0 19
14.6%
9 15
 
11.5%
2 15
 
11.5%
5 9
 
6.9%
4 9
 
6.9%
6 7
 
5.4%
8 6
 
4.6%
7 5
 
3.8%
3 3
 
2.3%
Math Symbol
ValueCountFrequency (%)
+ 1
25.0%
< 1
25.0%
> 1
25.0%
= 1
25.0%
Space Separator
ValueCountFrequency (%)
624
100.0%
Close Punctuation
ValueCountFrequency (%)
) 386
100.0%
Open Punctuation
ValueCountFrequency (%)
( 385
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20361
82.5%
Latin 2522
 
10.2%
Common 1772
 
7.2%
Han 21
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1620
 
8.0%
1535
 
7.5%
714
 
3.5%
477
 
2.3%
474
 
2.3%
470
 
2.3%
466
 
2.3%
448
 
2.2%
441
 
2.2%
430
 
2.1%
Other values (677) 13286
65.3%
Latin
ValueCountFrequency (%)
a 169
 
6.7%
i 136
 
5.4%
e 133
 
5.3%
o 126
 
5.0%
A 120
 
4.8%
n 113
 
4.5%
l 95
 
3.8%
I 91
 
3.6%
H 91
 
3.6%
r 90
 
3.6%
Other values (41) 1358
53.8%
Common
ValueCountFrequency (%)
624
35.2%
) 386
21.8%
( 385
21.7%
& 57
 
3.2%
# 46
 
2.6%
. 43
 
2.4%
1 42
 
2.4%
, 40
 
2.3%
- 20
 
1.1%
0 19
 
1.1%
Other values (21) 110
 
6.2%
Han
ValueCountFrequency (%)
12
57.1%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20361
82.5%
ASCII 4290
 
17.4%
CJK 20
 
0.1%
None 3
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1620
 
8.0%
1535
 
7.5%
714
 
3.5%
477
 
2.3%
474
 
2.3%
470
 
2.3%
466
 
2.3%
448
 
2.2%
441
 
2.2%
430
 
2.1%
Other values (677) 13286
65.3%
ASCII
ValueCountFrequency (%)
624
 
14.5%
) 386
 
9.0%
( 385
 
9.0%
a 169
 
3.9%
i 136
 
3.2%
e 133
 
3.1%
o 126
 
2.9%
A 120
 
2.8%
n 113
 
2.6%
l 95
 
2.2%
Other values (69) 2003
46.7%
CJK
ValueCountFrequency (%)
12
60.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지(도로명)
Text

MISSING 

Distinct3898
Distinct (%)99.0%
Missing59
Missing (%)1.5%
Memory size31.4 KiB
2023-12-11T09:14:09.972853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length53
Mean length38.396293
Min length22

Characters and Unicode

Total characters151243
Distinct characters458
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3859 ?
Unique (%)98.0%

Sample

1st row경상남도 창원시 의창구 금강로341번길 28, 1층 (소계동)
2nd row경상남도 창원시 의창구 금강로341번길 40 (소계동)
3rd row경상남도 창원시 의창구 금강로369번길 5 (소계동,번지 1층)
4th row경상남도 창원시 의창구 남산로 20 (팔용동)
5th row경상남도 창원시 의창구 남산로 20, 113호 (팔용동, 벽산C단지상가 )
ValueCountFrequency (%)
경상남도 3939
 
12.8%
창원시 3939
 
12.8%
1층 1352
 
4.4%
성산구 1052
 
3.4%
의창구 794
 
2.6%
마산회원구 761
 
2.5%
진해구 675
 
2.2%
마산합포구 657
 
2.1%
2층 495
 
1.6%
상남동 275
 
0.9%
Other values (3760) 16776
54.6%
2023-12-11T09:14:10.438283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26794
 
17.7%
1 6908
 
4.6%
5588
 
3.7%
5549
 
3.7%
5053
 
3.3%
5044
 
3.3%
4827
 
3.2%
4167
 
2.8%
, 4162
 
2.8%
4119
 
2.7%
Other values (448) 79032
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88801
58.7%
Space Separator 26794
 
17.7%
Decimal Number 22383
 
14.8%
Other Punctuation 4257
 
2.8%
Close Punctuation 3920
 
2.6%
Open Punctuation 3919
 
2.6%
Dash Punctuation 756
 
0.5%
Uppercase Letter 321
 
0.2%
Lowercase Letter 85
 
0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5588
 
6.3%
5549
 
6.2%
5053
 
5.7%
5044
 
5.7%
4827
 
5.4%
4167
 
4.7%
4119
 
4.6%
4099
 
4.6%
4064
 
4.6%
3097
 
3.5%
Other values (387) 43194
48.6%
Uppercase Letter
ValueCountFrequency (%)
A 86
26.8%
B 39
12.1%
S 23
 
7.2%
E 18
 
5.6%
T 16
 
5.0%
C 14
 
4.4%
L 12
 
3.7%
R 12
 
3.7%
K 11
 
3.4%
M 11
 
3.4%
Other values (15) 79
24.6%
Lowercase Letter
ValueCountFrequency (%)
a 32
37.6%
e 13
15.3%
l 10
 
11.8%
t 5
 
5.9%
s 4
 
4.7%
h 4
 
4.7%
m 4
 
4.7%
c 3
 
3.5%
n 3
 
3.5%
q 2
 
2.4%
Other values (3) 5
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 6908
30.9%
2 3678
16.4%
0 2282
 
10.2%
3 2216
 
9.9%
4 1548
 
6.9%
5 1448
 
6.5%
6 1162
 
5.2%
7 1146
 
5.1%
8 1074
 
4.8%
9 921
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 4162
97.8%
· 65
 
1.5%
' 14
 
0.3%
@ 8
 
0.2%
. 4
 
0.1%
/ 3
 
0.1%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
26794
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3920
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3919
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 756
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88801
58.7%
Common 62036
41.0%
Latin 406
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5588
 
6.3%
5549
 
6.2%
5053
 
5.7%
5044
 
5.7%
4827
 
5.4%
4167
 
4.7%
4119
 
4.6%
4099
 
4.6%
4064
 
4.6%
3097
 
3.5%
Other values (387) 43194
48.6%
Latin
ValueCountFrequency (%)
A 86
21.2%
B 39
 
9.6%
a 32
 
7.9%
S 23
 
5.7%
E 18
 
4.4%
T 16
 
3.9%
C 14
 
3.4%
e 13
 
3.2%
L 12
 
3.0%
R 12
 
3.0%
Other values (28) 141
34.7%
Common
ValueCountFrequency (%)
26794
43.2%
1 6908
 
11.1%
, 4162
 
6.7%
) 3920
 
6.3%
( 3919
 
6.3%
2 3678
 
5.9%
0 2282
 
3.7%
3 2216
 
3.6%
4 1548
 
2.5%
5 1448
 
2.3%
Other values (13) 5161
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88801
58.7%
ASCII 62377
41.2%
None 65
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
26794
43.0%
1 6908
 
11.1%
, 4162
 
6.7%
) 3920
 
6.3%
( 3919
 
6.3%
2 3678
 
5.9%
0 2282
 
3.7%
3 2216
 
3.6%
4 1548
 
2.5%
5 1448
 
2.3%
Other values (50) 5502
 
8.8%
Hangul
ValueCountFrequency (%)
5588
 
6.3%
5549
 
6.2%
5053
 
5.7%
5044
 
5.7%
4827
 
5.4%
4167
 
4.7%
4119
 
4.6%
4099
 
4.6%
4064
 
4.6%
3097
 
3.5%
Other values (387) 43194
48.6%
None
ValueCountFrequency (%)
· 65
100.0%

Interactions

2023-12-11T09:14:08.387785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:14:10.516640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.344
업종0.3441.000
2023-12-11T09:14:10.591085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.142
업종0.1421.000

Missing values

2023-12-11T09:14:08.510077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:14:08.587847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종업소명소재지(도로명)
01일반미용업춤추는머리나라미용실<NA>
12일반미용업뉴스타미용실<NA>
23일반미용업뉴신촌미용실<NA>
34일반미용업모드니헤어라인<NA>
45일반미용업제일미용실<NA>
56일반미용업댕기머리미용실<NA>
67일반미용업상아헤어<NA>
78일반미용업소계헤어경상남도 창원시 의창구 금강로341번길 28, 1층 (소계동)
89일반미용업화니핀헤어아트경상남도 창원시 의창구 금강로341번길 40 (소계동)
910일반미용업가인헤어경상남도 창원시 의창구 금강로369번길 5 (소계동,번지 1층)
연번업종업소명소재지(도로명)
39883989일반미용업나현헤어랜드경상남도 창원시 진해구 해원로 8, 2층 (이동)
39893990피부미용업뷰티샵더예쁘다경상남도 창원시 진해구 해원로 8, 2층 일부호 (이동)
39903991화장ㆍ분장 미용업속눈썹이긴여자경상남도 창원시 진해구 해원로32번길 30, 석정빌 1층 (이동)
39913992피부미용업제이블리경상남도 창원시 진해구 해원로32번길 34 (이동)
39923993일반미용업소윤헤어경상남도 창원시 진해구 해원로8번길 12-1 (이동)
39933994네일미용업네일은경상남도 창원시 진해구 해원로8번길 17, 1층 (이동)
39943995일반미용업네오리브경상남도 창원시 진해구 행암로 12, 4층 401호 (장천동)
39953996피부미용업, 화장ㆍ분장 미용업올가드림뷰티경상남도 창원시 진해구 행암로 12, 4층 402호 (장천동)
39963997화장ㆍ분장 미용업알로하스킨경상남도 창원시 진해구 행암로 12, 4층 406호 (장천동)
39973998일반미용업대동다숲헤어샵경상남도 창원시 진해구 행암로 25, 2층 211호 (장천동, 대동다숲상가)