Overview

Dataset statistics

Number of variables3
Number of observations3353
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory82.0 KiB
Average record size in memory25.0 B

Variable types

Text2
Numeric1

Dataset

Description양산시 2020년 9월 가맹점 현황
Author경상남도 양산시
URLhttps://www.data.go.kr/data/15067485/fileData.do

Alerts

가맹점수 is highly skewed (γ1 = 20.93428049)Skewed
업종코드 has unique valuesUnique
가맹점수 has 3026 (90.2%) zerosZeros

Reproduction

Analysis started2023-12-12 00:42:33.478251
Analysis finished2023-12-12 00:42:34.195558
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종코드
Text

UNIQUE 

Distinct3353
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size26.3 KiB
2023-12-12T09:42:34.468081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters20118
Distinct characters31
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3353 ?
Unique (%)100.0%

Sample

1st rowQ01A01
2nd rowF01A01
3rd rowQ09A07
4th rowQ06A01
5th rowR08A02
ValueCountFrequency (%)
q01a01 1
 
< 0.1%
e01a99 1
 
< 0.1%
f03a12 1
 
< 0.1%
f04a09 1
 
< 0.1%
f03a13 1
 
< 0.1%
f03a14 1
 
< 0.1%
f03a15 1
 
< 0.1%
f03a16 1
 
< 0.1%
f04a01 1
 
< 0.1%
f04a02 1
 
< 0.1%
Other values (3343) 3343
99.7%
2023-12-12T09:42:34.977878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4156
20.7%
B 2315
11.5%
1 2178
10.8%
2 1783
8.9%
A 1616
 
8.0%
3 1061
 
5.3%
6 888
 
4.4%
4 771
 
3.8%
9 754
 
3.7%
C 729
 
3.6%
Other values (21) 3867
19.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13412
66.7%
Uppercase Letter 6706
33.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 2315
34.5%
A 1616
24.1%
C 729
 
10.9%
D 414
 
6.2%
V 209
 
3.1%
F 209
 
3.1%
R 169
 
2.5%
G 167
 
2.5%
I 165
 
2.5%
E 159
 
2.4%
Other values (11) 554
 
8.3%
Decimal Number
ValueCountFrequency (%)
0 4156
31.0%
1 2178
16.2%
2 1783
13.3%
3 1061
 
7.9%
6 888
 
6.6%
4 771
 
5.7%
9 754
 
5.6%
7 628
 
4.7%
5 615
 
4.6%
8 578
 
4.3%

Most occurring scripts

ValueCountFrequency (%)
Common 13412
66.7%
Latin 6706
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 2315
34.5%
A 1616
24.1%
C 729
 
10.9%
D 414
 
6.2%
V 209
 
3.1%
F 209
 
3.1%
R 169
 
2.5%
G 167
 
2.5%
I 165
 
2.5%
E 159
 
2.4%
Other values (11) 554
 
8.3%
Common
ValueCountFrequency (%)
0 4156
31.0%
1 2178
16.2%
2 1783
13.3%
3 1061
 
7.9%
6 888
 
6.6%
4 771
 
5.7%
9 754
 
5.6%
7 628
 
4.7%
5 615
 
4.6%
8 578
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20118
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4156
20.7%
B 2315
11.5%
1 2178
10.8%
2 1783
8.9%
A 1616
 
8.0%
3 1061
 
5.3%
6 888
 
4.4%
4 771
 
3.8%
9 754
 
3.7%
C 729
 
3.6%
Other values (21) 3867
19.2%
Distinct3351
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size26.3 KiB
2023-12-12T09:42:35.250478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length6.7470922
Min length2

Characters and Unicode

Total characters22623
Distinct characters622
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3349 ?
Unique (%)99.9%

Sample

1st row한식/백반/한정식
2nd row여성미용실
3rd row기타일반유흥주점
4th row정통양식/경양식
5th row어린이집
ValueCountFrequency (%)
이미용기재도매 2
 
0.1%
박물관/사적지관리 2
 
0.1%
가스제조/공급-종합 1
 
< 0.1%
지도판매 1
 
< 0.1%
오락/스포츠용품대여 1
 
< 0.1%
종합이삿짐대행 1
 
< 0.1%
한식/백반/한정식 1
 
< 0.1%
피아노운반 1
 
< 0.1%
골동품수리 1
 
< 0.1%
피아노조율 1
 
< 0.1%
Other values (3342) 3342
99.6%
2023-12-12T09:42:35.670724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1233
 
5.5%
1135
 
5.0%
954
 
4.2%
- 843
 
3.7%
622
 
2.7%
/ 613
 
2.7%
541
 
2.4%
352
 
1.6%
350
 
1.5%
344
 
1.5%
Other values (612) 15636
69.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21064
93.1%
Dash Punctuation 843
 
3.7%
Other Punctuation 616
 
2.7%
Uppercase Letter 72
 
0.3%
Open Punctuation 13
 
0.1%
Close Punctuation 13
 
0.1%
Decimal Number 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1233
 
5.9%
1135
 
5.4%
954
 
4.5%
622
 
3.0%
541
 
2.6%
352
 
1.7%
350
 
1.7%
344
 
1.6%
305
 
1.4%
297
 
1.4%
Other values (590) 14931
70.9%
Uppercase Letter
ValueCountFrequency (%)
P 13
18.1%
V 11
15.3%
T 10
13.9%
C 10
13.9%
D 8
11.1%
L 5
 
6.9%
M 3
 
4.2%
R 3
 
4.2%
G 3
 
4.2%
E 2
 
2.8%
Other values (4) 4
 
5.6%
Other Punctuation
ValueCountFrequency (%)
/ 613
99.5%
, 2
 
0.3%
. 1
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 843
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21064
93.1%
Common 1487
 
6.6%
Latin 72
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1233
 
5.9%
1135
 
5.4%
954
 
4.5%
622
 
3.0%
541
 
2.6%
352
 
1.7%
350
 
1.7%
344
 
1.6%
305
 
1.4%
297
 
1.4%
Other values (590) 14931
70.9%
Latin
ValueCountFrequency (%)
P 13
18.1%
V 11
15.3%
T 10
13.9%
C 10
13.9%
D 8
11.1%
L 5
 
6.9%
M 3
 
4.2%
R 3
 
4.2%
G 3
 
4.2%
E 2
 
2.8%
Other values (4) 4
 
5.6%
Common
ValueCountFrequency (%)
- 843
56.7%
/ 613
41.2%
( 13
 
0.9%
) 13
 
0.9%
, 2
 
0.1%
5 1
 
0.1%
. 1
 
0.1%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21064
93.1%
ASCII 1559
 
6.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1233
 
5.9%
1135
 
5.4%
954
 
4.5%
622
 
3.0%
541
 
2.6%
352
 
1.7%
350
 
1.7%
344
 
1.6%
305
 
1.4%
297
 
1.4%
Other values (590) 14931
70.9%
ASCII
ValueCountFrequency (%)
- 843
54.1%
/ 613
39.3%
( 13
 
0.8%
P 13
 
0.8%
) 13
 
0.8%
V 11
 
0.7%
T 10
 
0.6%
C 10
 
0.6%
D 8
 
0.5%
L 5
 
0.3%
Other values (12) 20
 
1.3%

가맹점수
Real number (ℝ)

SKEWED  ZEROS 

Distinct109
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.954071
Minimum0
Maximum1488
Zeros3026
Zeros (%)90.2%
Negative0
Negative (%)0.0%
Memory size29.6 KiB
2023-12-12T09:42:35.839977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile13
Maximum1488
Range1488
Interquartile range (IQR)0

Descriptive statistics

Standard deviation40.13926
Coefficient of variation (CV)8.1022779
Kurtosis631.35925
Mean4.954071
Median Absolute Deviation (MAD)0
Skewness20.93428
Sum16611
Variance1611.1602
MonotonicityDecreasing
2023-12-12T09:42:36.057934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3026
90.2%
1 42
 
1.3%
3 23
 
0.7%
2 21
 
0.6%
6 15
 
0.4%
7 12
 
0.4%
4 8
 
0.2%
5 8
 
0.2%
18 7
 
0.2%
26 6
 
0.2%
Other values (99) 185
 
5.5%
ValueCountFrequency (%)
0 3026
90.2%
1 42
 
1.3%
2 21
 
0.6%
3 23
 
0.7%
4 8
 
0.2%
5 8
 
0.2%
6 15
 
0.4%
7 12
 
0.4%
8 6
 
0.2%
9 6
 
0.2%
ValueCountFrequency (%)
1488 1
< 0.1%
724 1
< 0.1%
664 1
< 0.1%
459 1
< 0.1%
457 1
< 0.1%
441 1
< 0.1%
408 1
< 0.1%
390 1
< 0.1%
311 1
< 0.1%
306 1
< 0.1%

Interactions

2023-12-12T09:42:33.895680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T09:42:34.054564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:42:34.157659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종코드업종명가맹점수
0Q01A01한식/백반/한정식1488
1F01A01여성미용실724
2Q09A07기타일반유흥주점664
3Q06A01정통양식/경양식459
4R08A02어린이집457
5D05A09남성의류전문점441
6D03A01편의점408
7Q01A02갈비/삼겹살390
8R09A01학원(종합)311
9Q05A08후라이드/양념치킨306
업종코드업종명가맹점수
3343V16C92산악회0
3344V16C93전우회0
3345V16C99기타사회단체0
3346V18B01세무서0
3347V18B02등기소0
3348V18B03경찰서/파출소/소방서0
3349V18B04시도군구청0
3350V18B05읍면동사무소0
3351V18B07운전면허시험장0
3352V18B08마을회관0