Overview

Dataset statistics

Number of variables4
Number of observations27
Missing cells42
Missing cells (%)38.9%
Duplicate rows1
Duplicate rows (%)3.7%
Total size in memory1023.0 B
Average record size in memory37.9 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description부산광역시강서구_동물판매업_20230810
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15085636

Alerts

Dataset has 1 (3.7%) duplicate rowsDuplicates
연번 is highly overall correlated with 데이터기준일자High correlation
데이터기준일자 is highly overall correlated with 연번High correlation
연번 has 14 (51.9%) missing valuesMissing
사업장명칭 has 14 (51.9%) missing valuesMissing
소재지주소(도로명) has 14 (51.9%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:59:09.632133
Analysis finished2023-12-10 16:59:10.480845
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct13
Distinct (%)100.0%
Missing14
Missing (%)51.9%
Infinite0
Infinite (%)0.0%
Mean7
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-11T01:59:10.572093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.6
Q14
median7
Q310
95-th percentile12.4
Maximum13
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.8944405
Coefficient of variation (CV)0.55634864
Kurtosis-1.2
Mean7
Median Absolute Deviation (MAD)3
Skewness0
Sum91
Variance15.166667
MonotonicityStrictly increasing
2023-12-11T01:59:10.758215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 1
 
3.7%
2 1
 
3.7%
3 1
 
3.7%
4 1
 
3.7%
5 1
 
3.7%
6 1
 
3.7%
7 1
 
3.7%
8 1
 
3.7%
9 1
 
3.7%
10 1
 
3.7%
Other values (3) 3
 
11.1%
(Missing) 14
51.9%
ValueCountFrequency (%)
1 1
3.7%
2 1
3.7%
3 1
3.7%
4 1
3.7%
5 1
3.7%
6 1
3.7%
7 1
3.7%
8 1
3.7%
9 1
3.7%
10 1
3.7%
ValueCountFrequency (%)
13 1
3.7%
12 1
3.7%
11 1
3.7%
10 1
3.7%
9 1
3.7%
8 1
3.7%
7 1
3.7%
6 1
3.7%
5 1
3.7%
4 1
3.7%

사업장명칭
Text

MISSING 

Distinct13
Distinct (%)100.0%
Missing14
Missing (%)51.9%
Memory size348.0 B
2023-12-11T01:59:10.935384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length9
Mean length8.2307692
Min length3

Characters and Unicode

Total characters107
Distinct characters59
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)100.0%

Sample

1st row쥬쥬펫살롱
2nd row홍's살롱
3rd row펫피플 스타필드명지점
4th row개츠비 켄넬
5th row에이펫
ValueCountFrequency (%)
부산점 3
 
13.0%
쥬쥬펫살롱 1
 
4.3%
뽀글리 1
 
4.3%
신호점 1
 
4.3%
장보는강아지와고양이 1
 
4.3%
명지본점 1
 
4.3%
테디럽 1
 
4.3%
부산강아지분양 1
 
4.3%
명지점 1
 
4.3%
올리스24 1
 
4.3%
Other values (11) 11
47.8%
2023-12-11T01:59:11.374902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
9.3%
7
 
6.5%
6
 
5.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (49) 60
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
86.9%
Space Separator 10
 
9.3%
Decimal Number 2
 
1.9%
Lowercase Letter 1
 
0.9%
Other Punctuation 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
7.5%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (44) 53
57.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Lowercase Letter
ValueCountFrequency (%)
s 1
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 93
86.9%
Common 13
 
12.1%
Latin 1
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
7.5%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (44) 53
57.0%
Common
ValueCountFrequency (%)
10
76.9%
2 1
 
7.7%
4 1
 
7.7%
' 1
 
7.7%
Latin
ValueCountFrequency (%)
s 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 93
86.9%
ASCII 14
 
13.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10
71.4%
2 1
 
7.1%
4 1
 
7.1%
s 1
 
7.1%
' 1
 
7.1%
Hangul
ValueCountFrequency (%)
7
 
7.5%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (44) 53
57.0%
Distinct13
Distinct (%)100.0%
Missing14
Missing (%)51.9%
Memory size348.0 B
2023-12-11T01:59:11.648989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length36
Mean length34.615385
Min length26

Characters and Unicode

Total characters450
Distinct characters54
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)100.0%

Sample

1st row부산광역시 강서구 신호산단4로64번가길 41 (신호동)
2nd row부산광역시 강서구 명지오션시티4로 73, 302호 (명지동)
3rd row부산광역시 강서구 명지국제6로 168, 스타필드 시티 명지점 4층 4104호 (명지동)
4th row부산광역시 강서구 명지국제6로 99, 127호 (명지동)
5th row부산광역시 강서구 경전철로 211-2(대저2동)
ValueCountFrequency (%)
부산광역시 13
15.7%
강서구 13
15.7%
명지동 10
 
12.0%
1층 4
 
4.8%
명지국제6로 3
 
3.6%
신호동 2
 
2.4%
명지국제2로28번길 2
 
2.4%
23 2
 
2.4%
10 1
 
1.2%
2층 1
 
1.2%
Other values (32) 32
38.6%
2023-12-11T01:59:12.055236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
 
15.6%
2 22
 
4.9%
1 21
 
4.7%
21
 
4.7%
21
 
4.7%
16
 
3.6%
15
 
3.3%
14
 
3.1%
( 13
 
2.9%
13
 
2.9%
Other values (44) 224
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 251
55.8%
Decimal Number 88
 
19.6%
Space Separator 70
 
15.6%
Open Punctuation 13
 
2.9%
Close Punctuation 13
 
2.9%
Other Punctuation 11
 
2.4%
Dash Punctuation 4
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
8.4%
21
 
8.4%
16
 
6.4%
15
 
6.0%
14
 
5.6%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
Other values (29) 99
39.4%
Decimal Number
ValueCountFrequency (%)
2 22
25.0%
1 21
23.9%
4 10
11.4%
6 8
 
9.1%
0 8
 
9.1%
3 6
 
6.8%
8 5
 
5.7%
9 3
 
3.4%
5 3
 
3.4%
7 2
 
2.3%
Space Separator
ValueCountFrequency (%)
70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 251
55.8%
Common 199
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
8.4%
21
 
8.4%
16
 
6.4%
15
 
6.0%
14
 
5.6%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
Other values (29) 99
39.4%
Common
ValueCountFrequency (%)
70
35.2%
2 22
 
11.1%
1 21
 
10.6%
( 13
 
6.5%
) 13
 
6.5%
, 11
 
5.5%
4 10
 
5.0%
6 8
 
4.0%
0 8
 
4.0%
3 6
 
3.0%
Other values (5) 17
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 251
55.8%
ASCII 199
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70
35.2%
2 22
 
11.1%
1 21
 
10.6%
( 13
 
6.5%
) 13
 
6.5%
, 11
 
5.5%
4 10
 
5.0%
6 8
 
4.0%
0 8
 
4.0%
3 6
 
3.0%
Other values (5) 17
 
8.5%
Hangul
ValueCountFrequency (%)
21
 
8.4%
21
 
8.4%
16
 
6.4%
15
 
6.0%
14
 
5.6%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
Other values (29) 99
39.4%

데이터기준일자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size348.0 B
<NA>
14 
2023-08-10
13 

Length

Max length10
Median length4
Mean length6.8888889
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-10
2nd row2023-08-10
3rd row2023-08-10
4th row2023-08-10
5th row2023-08-10

Common Values

ValueCountFrequency (%)
<NA> 14
51.9%
2023-08-10 13
48.1%

Length

2023-12-11T01:59:12.256507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:59:12.403043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 14
51.9%
2023-08-10 13
48.1%

Interactions

2023-12-11T01:59:09.908358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:59:12.537204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장명칭소재지주소(도로명)
연번1.0001.0001.000
사업장명칭1.0001.0001.000
소재지주소(도로명)1.0001.0001.000
2023-12-11T01:59:12.715662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번데이터기준일자
연번1.0001.000
데이터기준일자1.0001.000

Missing values

2023-12-11T01:59:10.074543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:59:10.214348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:59:10.374214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번사업장명칭소재지주소(도로명)데이터기준일자
01쥬쥬펫살롱부산광역시 강서구 신호산단4로64번가길 41 (신호동)2023-08-10
12홍's살롱부산광역시 강서구 명지오션시티4로 73, 302호 (명지동)2023-08-10
23펫피플 스타필드명지점부산광역시 강서구 명지국제6로 168, 스타필드 시티 명지점 4층 4104호 (명지동)2023-08-10
34개츠비 켄넬부산광역시 강서구 명지국제6로 99, 127호 (명지동)2023-08-10
45에이펫부산광역시 강서구 경전철로 211-2(대저2동)2023-08-10
56마이도그 강아지분양 부산점부산광역시 강서구 명지국제4로208번길 22-5, 1층 (명지동)2023-08-10
67와우펫앤쥬 부산점부산광역시 강서구 명지국제6로 21, 도형건설 201호호 (명지동)2023-08-10
78뽀글리부산광역시 강서구 명지국제2로28번길 23, 2동 1층 101호 (명지동)2023-08-10
89헬로퍼피 부산점부산광역시 강서구 명지국제5로 10, 엠시티 106호 (명지동)2023-08-10
910올리스24 명지점부산광역시 강서구 명지국제12로 69-1, 1층 (명지동)2023-08-10
연번사업장명칭소재지주소(도로명)데이터기준일자
17<NA><NA><NA><NA>
18<NA><NA><NA><NA>
19<NA><NA><NA><NA>
20<NA><NA><NA><NA>
21<NA><NA><NA><NA>
22<NA><NA><NA><NA>
23<NA><NA><NA><NA>
24<NA><NA><NA><NA>
25<NA><NA><NA><NA>
26<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

연번사업장명칭소재지주소(도로명)데이터기준일자# duplicates
0<NA><NA><NA><NA>14