Overview

Dataset statistics

Number of variables4
Number of observations27
Missing cells56
Missing cells (%)51.9%
Duplicate rows1
Duplicate rows (%)3.7%
Total size in memory1023.0 B
Average record size in memory37.9 B

Variable types

Numeric1
Text2
DateTime1

Dataset

Description부산광역시 강서구 관할내 동물판매업 현황(연번, 사업자명칭, 소재지주소(도로명), 데이터기준일자)에 대한 정보입니다.
URLhttps://www.data.go.kr/data/15085636/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (3.7%) duplicate rowsDuplicates
연번 has 14 (51.9%) missing valuesMissing
사업장명칭 has 14 (51.9%) missing valuesMissing
소재지주소(도로명) has 14 (51.9%) missing valuesMissing
데이터기준일자 has 14 (51.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 05:05:16.638812
Analysis finished2023-12-12 05:05:17.576140
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

MISSING 

Distinct13
Distinct (%)100.0%
Missing14
Missing (%)51.9%
Infinite0
Infinite (%)0.0%
Mean7
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-12T14:05:17.633849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.6
Q14
median7
Q310
95-th percentile12.4
Maximum13
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.8944405
Coefficient of variation (CV)0.55634864
Kurtosis-1.2
Mean7
Median Absolute Deviation (MAD)3
Skewness0
Sum91
Variance15.166667
MonotonicityStrictly increasing
2023-12-12T14:05:17.766992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 1
 
3.7%
2 1
 
3.7%
3 1
 
3.7%
4 1
 
3.7%
5 1
 
3.7%
6 1
 
3.7%
7 1
 
3.7%
8 1
 
3.7%
9 1
 
3.7%
10 1
 
3.7%
Other values (3) 3
 
11.1%
(Missing) 14
51.9%
ValueCountFrequency (%)
1 1
3.7%
2 1
3.7%
3 1
3.7%
4 1
3.7%
5 1
3.7%
6 1
3.7%
7 1
3.7%
8 1
3.7%
9 1
3.7%
10 1
3.7%
ValueCountFrequency (%)
13 1
3.7%
12 1
3.7%
11 1
3.7%
10 1
3.7%
9 1
3.7%
8 1
3.7%
7 1
3.7%
6 1
3.7%
5 1
3.7%
4 1
3.7%

사업장명칭
Text

MISSING 

Distinct13
Distinct (%)100.0%
Missing14
Missing (%)51.9%
Memory size348.0 B
2023-12-12T14:05:17.978126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length9
Mean length8.2307692
Min length3

Characters and Unicode

Total characters107
Distinct characters59
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)100.0%

Sample

1st row쥬쥬펫살롱
2nd row홍's살롱
3rd row펫피플 스타필드명지점
4th row개츠비 켄넬
5th row에이펫
ValueCountFrequency (%)
부산점 3
 
13.0%
쥬쥬펫살롱 1
 
4.3%
뽀글리 1
 
4.3%
신호점 1
 
4.3%
장보는강아지와고양이 1
 
4.3%
명지본점 1
 
4.3%
테디럽 1
 
4.3%
부산강아지분양 1
 
4.3%
명지점 1
 
4.3%
올리스24 1
 
4.3%
Other values (11) 11
47.8%
2023-12-12T14:05:18.339932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
9.3%
7
 
6.5%
6
 
5.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (49) 60
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
86.9%
Space Separator 10
 
9.3%
Decimal Number 2
 
1.9%
Lowercase Letter 1
 
0.9%
Other Punctuation 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
7.5%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (44) 53
57.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Lowercase Letter
ValueCountFrequency (%)
s 1
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 93
86.9%
Common 13
 
12.1%
Latin 1
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
7.5%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (44) 53
57.0%
Common
ValueCountFrequency (%)
10
76.9%
2 1
 
7.7%
4 1
 
7.7%
' 1
 
7.7%
Latin
ValueCountFrequency (%)
s 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 93
86.9%
ASCII 14
 
13.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10
71.4%
2 1
 
7.1%
4 1
 
7.1%
s 1
 
7.1%
' 1
 
7.1%
Hangul
ValueCountFrequency (%)
7
 
7.5%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (44) 53
57.0%
Distinct13
Distinct (%)100.0%
Missing14
Missing (%)51.9%
Memory size348.0 B
2023-12-12T14:05:18.577426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length36
Mean length34.615385
Min length26

Characters and Unicode

Total characters450
Distinct characters54
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)100.0%

Sample

1st row부산광역시 강서구 신호산단4로64번가길 41 (신호동)
2nd row부산광역시 강서구 명지오션시티4로 73, 302호 (명지동)
3rd row부산광역시 강서구 명지국제6로 168, 스타필드 시티 명지점 4층 4104호 (명지동)
4th row부산광역시 강서구 명지국제6로 99, 127호 (명지동)
5th row부산광역시 강서구 경전철로 211-2(대저2동)
ValueCountFrequency (%)
부산광역시 13
15.7%
강서구 13
15.7%
명지동 10
 
12.0%
1층 4
 
4.8%
명지국제6로 3
 
3.6%
신호동 2
 
2.4%
명지국제2로28번길 2
 
2.4%
23 2
 
2.4%
10 1
 
1.2%
2층 1
 
1.2%
Other values (32) 32
38.6%
2023-12-12T14:05:18.964960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
 
15.6%
2 22
 
4.9%
1 21
 
4.7%
21
 
4.7%
21
 
4.7%
16
 
3.6%
15
 
3.3%
14
 
3.1%
( 13
 
2.9%
13
 
2.9%
Other values (44) 224
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 251
55.8%
Decimal Number 88
 
19.6%
Space Separator 70
 
15.6%
Open Punctuation 13
 
2.9%
Close Punctuation 13
 
2.9%
Other Punctuation 11
 
2.4%
Dash Punctuation 4
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
8.4%
21
 
8.4%
16
 
6.4%
15
 
6.0%
14
 
5.6%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
Other values (29) 99
39.4%
Decimal Number
ValueCountFrequency (%)
2 22
25.0%
1 21
23.9%
4 10
11.4%
6 8
 
9.1%
0 8
 
9.1%
3 6
 
6.8%
8 5
 
5.7%
9 3
 
3.4%
5 3
 
3.4%
7 2
 
2.3%
Space Separator
ValueCountFrequency (%)
70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 251
55.8%
Common 199
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
8.4%
21
 
8.4%
16
 
6.4%
15
 
6.0%
14
 
5.6%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
Other values (29) 99
39.4%
Common
ValueCountFrequency (%)
70
35.2%
2 22
 
11.1%
1 21
 
10.6%
( 13
 
6.5%
) 13
 
6.5%
, 11
 
5.5%
4 10
 
5.0%
6 8
 
4.0%
0 8
 
4.0%
3 6
 
3.0%
Other values (5) 17
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 251
55.8%
ASCII 199
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70
35.2%
2 22
 
11.1%
1 21
 
10.6%
( 13
 
6.5%
) 13
 
6.5%
, 11
 
5.5%
4 10
 
5.0%
6 8
 
4.0%
0 8
 
4.0%
3 6
 
3.0%
Other values (5) 17
 
8.5%
Hangul
ValueCountFrequency (%)
21
 
8.4%
21
 
8.4%
16
 
6.4%
15
 
6.0%
14
 
5.6%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
13
 
5.2%
Other values (29) 99
39.4%

데이터기준일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)7.7%
Missing14
Missing (%)51.9%
Memory size348.0 B
Minimum2023-08-10 00:00:00
Maximum2023-08-10 00:00:00
2023-12-12T14:05:19.084163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:05:19.190398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T14:05:16.813090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:05:19.286341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장명칭소재지주소(도로명)
연번1.0001.0001.000
사업장명칭1.0001.0001.000
소재지주소(도로명)1.0001.0001.000

Missing values

2023-12-12T14:05:16.972795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:05:17.064294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:05:17.506773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번사업장명칭소재지주소(도로명)데이터기준일자
01쥬쥬펫살롱부산광역시 강서구 신호산단4로64번가길 41 (신호동)2023-08-10
12홍's살롱부산광역시 강서구 명지오션시티4로 73, 302호 (명지동)2023-08-10
23펫피플 스타필드명지점부산광역시 강서구 명지국제6로 168, 스타필드 시티 명지점 4층 4104호 (명지동)2023-08-10
34개츠비 켄넬부산광역시 강서구 명지국제6로 99, 127호 (명지동)2023-08-10
45에이펫부산광역시 강서구 경전철로 211-2(대저2동)2023-08-10
56마이도그 강아지분양 부산점부산광역시 강서구 명지국제4로208번길 22-5, 1층 (명지동)2023-08-10
67와우펫앤쥬 부산점부산광역시 강서구 명지국제6로 21, 도형건설 201호호 (명지동)2023-08-10
78뽀글리부산광역시 강서구 명지국제2로28번길 23, 2동 1층 101호 (명지동)2023-08-10
89헬로퍼피 부산점부산광역시 강서구 명지국제5로 10, 엠시티 106호 (명지동)2023-08-10
910올리스24 명지점부산광역시 강서구 명지국제12로 69-1, 1층 (명지동)2023-08-10
연번사업장명칭소재지주소(도로명)데이터기준일자
17<NA><NA><NA><NA>
18<NA><NA><NA><NA>
19<NA><NA><NA><NA>
20<NA><NA><NA><NA>
21<NA><NA><NA><NA>
22<NA><NA><NA><NA>
23<NA><NA><NA><NA>
24<NA><NA><NA><NA>
25<NA><NA><NA><NA>
26<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

연번사업장명칭소재지주소(도로명)데이터기준일자# duplicates
0<NA><NA><NA><NA>14