Overview

Dataset statistics

Number of variables: 5
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 45.4 B

Variable types

Numeric: 1
Categorical: 1
Text: 2
DateTime: 1

Dataset

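Description: Data on the status of animal sales businesses in Yangsan City, Gyeongsangnam-do. It provides fields such as serial number (연번), nature of business (영업의내용), business name (사업장명칭), street address (소재지주소(도로명)), and report date (신고일자).
Author: Yangsan City, Gyeongsangnam-do
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15105951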

Alerts

연번 is highly overall correlated with 영업의내용 (High correlation)
영업의내용 is highly overall correlated with 연번 (High correlation)
영업의내용 is highly imbalanced (64.7%) (Imbalance)
연번 has unique values (Unique)
사업장명칭 has unique values (Unique)
소재지주소(도로명) has unique values (Unique)
신고일자 has unique values (Unique)
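
The 64.7% imbalance figure for 영업의내용 is consistent with a normalized-entropy score: one minus the Shannon entropy of the class shares, divided by the maximum entropy possible for that number of classes. The sketch below assumes that definition and uses the class counts 28 and 2 reported for this variable. `imbalance_score` is a hypothetical helper name for illustration, not the ydata-profiling API.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/H_max, where H is the Shannon entropy of the class shares.

    0.0 means perfectly balanced classes; 1.0 means a single class.
    Assumed definition, chosen because it reproduces the reported 64.7%.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 1.0  # a single class has zero entropy
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1.0 - entropy / math.log2(n_classes)

# Class counts for 영업의내용 as listed in this report.
score = imbalance_score([28, 2])
print(f"{score:.1%}")  # 64.7%
```

Under this definition a 15/15 split would score 0%, so the value grows as one class dominates.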

Reproduction

Analysis started: 2023-12-11 00:25:57.595441
Analysis finished: 2023-12-11 00:25:58.057793
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
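
The reported duration follows directly from the two timestamps. A minimal check, using only the values printed above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-11 00:25:57.595441")
finished = datetime.fromisoformat("2023-12-11 00:25:58.057793")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 0.46 seconds
```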

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.5
Minimum: 1
Maximum: 30
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.45
Q1: 8.25
Median: 15.5
Q3: 22.75
95-th percentile: 28.55
Maximum: 30
Range: 29
Interquartile range (IQR): 14.5

Descriptive statistics

Standard deviation: 8.8034084
Coefficient of variation (CV): 0.56796183
Kurtosis: -1.2
Mean: 15.5
Median Absolute Deviation (MAD): 7.5
Skewness: 0
Sum: 465
Variance: 77.5
Monotonicity: Strictly increasing
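
Because 연번 is simply the sequence 1 to 30, most of the figures above can be reproduced by hand. The sketch below is one way to do so with the Python standard library. It assumes linearly interpolated percentiles and the sample (n - 1) variance, which match the reported values; `percentile` is a hypothetical helper, not part of the profiling tool.

```python
import math
import statistics

values = list(range(1, 31))  # 연번 runs from 1 to 30 with no gaps

def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] * (1 - frac) + sorted_values[upper] * frac

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
q1, q3 = percentile(values, 0.25), percentile(values, 0.75)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, round(std, 7), round(cv, 8))
print(q1, q3, q3 - q1, mad, strictly_increasing)
```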
Histogram with fixed size bins (bins=30)
Value | Count | Frequency (%)
1 | 1 | 3.3%
17 | 1 | 3.3%
30 | 1 | 3.3%
29 | 1 | 3.3%
28 | 1 | 3.3%
27 | 1 | 3.3%
26 | 1 | 3.3%
25 | 1 | 3.3%
24 | 1 | 3.3%
23 | 1 | 3.3%
Other values (20) | 20 | 66.7%
Value | Count | Frequency (%)
1 | 1 | 3.3%
2 | 1 | 3.3%
3 | 1 | 3.3%
4 | 1 | 3.3%
5 | 1 | 3.3%
6 | 1 | 3.3%
7 | 1 | 3.3%
8 | 1 | 3.3%
9 | 1 | 3.3%
10 | 1 | 3.3%
Value | Count | Frequency (%)
30 | 1 | 3.3%
29 | 1 | 3.3%
28 | 1 | 3.3%
27 | 1 | 3.3%
26 | 1 | 3.3%
25 | 1 | 3.3%
24 | 1 | 3.3%
23 | 1 | 3.3%
22 | 1 | 3.3%
21 | 1 | 3.3%

영업의내용
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
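동물 ( 판매 ): 28
동물 ( 판매 ): 2
(The two category labels render identically here. The length statistics for this variable give a minimum of 9 and a maximum of 10 characters, so the second label is one character longer, most likely an extra whitespace character.)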

Length

Max length: 10
Median length: 9
Mean length: 9.0666667
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 동물 ( 판매 )
2nd row: 동물 ( 판매 )
3rd row: 동물 ( 판매 )
4th row: 동물 ( 판매 )
5th row: 동물 ( 판매 )

Common Values

Value | Count | Frequency (%)
동물 ( 판매 ) | 28 | 93.3%
동물 ( 판매 ) | 2 | 6.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
(empty) | 60 | 50.0%
동물 | 30 | 25.0%
판매 | 30 | 25.0%

사업장명칭
Text

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 16
Median length: 10.5
Mean length: 6.7
Min length: 3

Characters and Unicode

Total characters: 201
Distinct characters: 114
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
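
As an illustration of those character properties, the sketch below counts Unicode General Category codes for one business name from this dataset using Python's standard `unicodedata` module. It covers categories only; the standard library does not expose script or block names, so how the report derives those is not shown here.

```python
import unicodedata
from collections import Counter

# One business name taken from the sample rows of this report.
name = "CATS&ME(캣츠 앤 미)"

# unicodedata.category returns the two-letter General Category code,
# e.g. "Lu" (Uppercase Letter), "Lo" (Other Letter), "Zs" (Space Separator).
category_counts = Counter(unicodedata.category(ch) for ch in name)

for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the tables for the two text variables.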

Unique

Unique: 30
Unique (%): 100.0%

Sample

1st row: 행복한강아지 틱독
2nd row: 동물농장
3rd row: 토니와 쁘니
4th row: 미미애견
5th row: 스타독스
Value | Count | Frequency (%)
양산점 | 2 | 4.8%
행복한강아지 | 1 | 2.4%
rags | 1 | 2.4%
양산동물메디컬센터 | 1 | 2.4%
이마트 | 1 | 2.4%
골든아쿠아펫 | 1 | 2.4%
제이엠펫하우스 | 1 | 2.4%
라이프펫(양산점 | 1 | 2.4%
도그스타일 | 1 | 2.4%
고양이숲 | 1 | 2.4%
Other values (31) | 31 | 73.8%

Most occurring characters

ValueCountFrequency (%)
12
 
6.0%
7
 
3.5%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
A 4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (104) 144
71.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 157 | 78.1%
Uppercase Letter | 17 | 8.5%
Space Separator | 12 | 6.0%
Lowercase Letter | 8 | 4.0%
Open Punctuation | 3 | 1.5%
Close Punctuation | 3 | 1.5%
Other Punctuation | 1 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
4.5%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (81) 108
68.8%
Uppercase Letter
ValueCountFrequency (%)
A 4
23.5%
L 2
11.8%
S 2
11.8%
C 1
 
5.9%
E 1
 
5.9%
M 1
 
5.9%
T 1
 
5.9%
N 1
 
5.9%
G 1
 
5.9%
R 1
 
5.9%
Other values (2) 2
11.8%
Lowercase Letter
ValueCountFrequency (%)
e 2
25.0%
p 1
12.5%
l 1
12.5%
s 1
12.5%
t 1
12.5%
n 1
12.5%
g 1
12.5%
Space Separator
ValueCountFrequency (%)
12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 157 | 78.1%
Latin | 25 | 12.4%
Common | 19 | 9.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
4.5%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (81) 108
68.8%
Latin
ValueCountFrequency (%)
A 4
16.0%
L 2
 
8.0%
S 2
 
8.0%
e 2
 
8.0%
p 1
 
4.0%
l 1
 
4.0%
s 1
 
4.0%
t 1
 
4.0%
C 1
 
4.0%
n 1
 
4.0%
Other values (9) 9
36.0%
Common
ValueCountFrequency (%)
12
63.2%
( 3
 
15.8%
) 3
 
15.8%
& 1
 
5.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 157 | 78.1%
ASCII | 44 | 21.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12
27.3%
A 4
 
9.1%
( 3
 
6.8%
) 3
 
6.8%
L 2
 
4.5%
S 2
 
4.5%
e 2
 
4.5%
p 1
 
2.3%
l 1
 
2.3%
s 1
 
2.3%
Other values (13) 13
29.5%
Hangul
ValueCountFrequency (%)
7
 
4.5%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (81) 108
68.8%

소재지주소(도로명)
Text

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B

Length

Max length: 37
Median length: 30.5
Mean length: 28.933333
Min length: 22

Characters and Unicode

Total characters: 868
Distinct characters: 97
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 100.0%

Sample

1st row: 경상남도 양산시 양주로 129, 101-2호 (중부동, 대범빌딩)
2nd row: 경상남도 양산시 삽량로 175, 101-2호 (중부동, 신진프라자)
3rd row: 경상남도 양산시 서창로 112, 2층 (삼호동)
4th row: 경상남도 양산시 서창로 114 (삼호동)
5th row: 경상남도 양산시 번영로 164, 1층 (평산동)
Value | Count | Frequency (%)
경상남도 | 30 | 15.9%
양산시 | 30 | 15.9%
물금읍 | 10 | 5.3%
삼호동 | 6 | 3.2%
중부동 | 5 | 2.6%
1층 | 5 | 2.6%
2층 | 4 | 2.1%
대운로 | 3 | 1.6%
102호 | 3 | 1.6%
야리3길 | 2 | 1.1%
Other values (77) | 91 | 48.1%

Most occurring characters

ValueCountFrequency (%)
159
 
18.3%
1 43
 
5.0%
38
 
4.4%
36
 
4.1%
31
 
3.6%
31
 
3.6%
31
 
3.6%
30
 
3.5%
30
 
3.5%
, 27
 
3.1%
Other values (87) 412
47.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 487 | 56.1%
Space Separator | 159 | 18.3%
Decimal Number | 148 | 17.1%
Other Punctuation | 27 | 3.1%
Close Punctuation | 19 | 2.2%
Open Punctuation | 19 | 2.2%
Dash Punctuation | 6 | 0.7%
Uppercase Letter | 3 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
7.8%
36
 
7.4%
31
 
6.4%
31
 
6.4%
31
 
6.4%
30
 
6.2%
30
 
6.2%
23
 
4.7%
21
 
4.3%
21
 
4.3%
Other values (69) 195
40.0%
Decimal Number
ValueCountFrequency (%)
1 43
29.1%
2 23
15.5%
0 18
12.2%
4 17
 
11.5%
3 15
 
10.1%
7 11
 
7.4%
5 7
 
4.7%
9 6
 
4.1%
8 5
 
3.4%
6 3
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
33.3%
B 1
33.3%
D 1
33.3%
Space Separator
ValueCountFrequency (%)
159
100.0%
Other Punctuation
ValueCountFrequency (%)
, 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 487 | 56.1%
Common | 378 | 43.5%
Latin | 3 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
7.8%
36
 
7.4%
31
 
6.4%
31
 
6.4%
31
 
6.4%
30
 
6.2%
30
 
6.2%
23
 
4.7%
21
 
4.3%
21
 
4.3%
Other values (69) 195
40.0%
Common
ValueCountFrequency (%)
159
42.1%
1 43
 
11.4%
, 27
 
7.1%
2 23
 
6.1%
) 19
 
5.0%
( 19
 
5.0%
0 18
 
4.8%
4 17
 
4.5%
3 15
 
4.0%
7 11
 
2.9%
Other values (5) 27
 
7.1%
Latin
ValueCountFrequency (%)
S 1
33.3%
B 1
33.3%
D 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 487 | 56.1%
ASCII | 381 | 43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
159
41.7%
1 43
 
11.3%
, 27
 
7.1%
2 23
 
6.0%
) 19
 
5.0%
( 19
 
5.0%
0 18
 
4.7%
4 17
 
4.5%
3 15
 
3.9%
7 11
 
2.9%
Other values (8) 30
 
7.9%
Hangul
ValueCountFrequency (%)
38
 
7.8%
36
 
7.4%
31
 
6.4%
31
 
6.4%
31
 
6.4%
30
 
6.2%
30
 
6.2%
23
 
4.7%
21
 
4.3%
21
 
4.3%
Other values (69) 195
40.0%

신고일자
Date

UNIQUE 

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2012-08-29 00:00:00
Maximum: 2021-04-13 00:00:00
Histogram with fixed size bins (bins=30)

Interactions


Correlations

Variable | 연번 | 영업의내용 | 사업장명칭 | 소재지주소(도로명) | 신고일자
연번 | 1.000 | 0.867 | 1.000 | 1.000 | 1.000
영업의내용 | 0.867 | 1.000 | 1.000 | 1.000 | 1.000
사업장명칭 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지주소(도로명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
신고일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
Variable | 연번 | 영업의내용
연번 | 1.000 | 0.587
영업의내용 | 0.587 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 영업의내용 | 사업장명칭 | 소재지주소(도로명) | 신고일자
0 | 1 | 동물 ( 판매 ) | 행복한강아지 틱독 | 경상남도 양산시 양주로 129, 101-2호 (중부동, 대범빌딩) | 2014-08-11
1 | 2 | 동물 ( 판매 ) | 동물농장 | 경상남도 양산시 삽량로 175, 101-2호 (중부동, 신진프라자) | 2015-07-21
2 | 3 | 동물 ( 판매 ) | 토니와 쁘니 | 경상남도 양산시 서창로 112, 2층 (삼호동) | 2015-11-17
3 | 4 | 동물 ( 판매 ) | 미미애견 | 경상남도 양산시 서창로 114 (삼호동) | 2015-12-07
4 | 5 | 동물 ( 판매 ) | 스타독스 | 경상남도 양산시 번영로 164, 1층 (평산동) | 2016-03-07
5 | 6 | 동물 ( 판매 ) | 해동물병원 | 경상남도 양산시 물금읍 청운로 349, 103동 2호 (세정에스타) | 2016-07-27
6 | 7 | 동물 ( 판매 ) | 냥이친구 | 경상남도 양산시 대운9길 5-7 (삼호동) | 2017-02-22
7 | 8 | 동물 ( 판매 ) | 개예쁘다 | 경상남도 양산시 연호13길 22, 101호 (삼호동) | 2017-03-31
8 | 9 | 동물 ( 판매 ) | 사료할인마트 | 경상남도 양산시 웅상대로 1138, 1층 (명동) | 2017-05-04
9 | 10 | 동물 ( 판매 ) | 하얀강아지 | 경상남도 양산시 하북면 신평로 22-1 (통도동물병원) | 2017-08-16

(index) | 연번 | 영업의내용 | 사업장명칭 | 소재지주소(도로명) | 신고일자
20 | 21 | 동물 ( 판매 ) | 제이엠펫하우스 | 경상남도 양산시 물금읍 야리3길 14, DS프라자 108-4호 | 2019-09-18
21 | 22 | 동물 ( 판매 ) | 라이프펫(양산점) | 경상남도 양산시 물금읍 청운로 177, 오션프라자 103호 | 2019-12-02
22 | 23 | 동물 ( 판매 ) | 도그스타일 | 경상남도 양산시 북안남8길 4, 1층 (북부동) | 2019-12-03
23 | 24 | 동물 ( 판매 ) | 고양이숲 | 경상남도 양산시 물금읍 범구로 11, 국보프라자 103호 | 2020-05-07
24 | 25 | 동물 ( 판매 ) | 모나미네코 양산점 | 경상남도 양산시 물금읍 백호로 92, 트윈에비뉴 B동 104호 | 2020-06-10
25 | 26 | 동물 ( 판매 ) | CATS&ME(캣츠 앤 미) | 경상남도 양산시 물금읍 증산역로 153, 정우프라자 2층 204호 | 2020-07-07
26 | 27 | 동물 ( 판매 ) | 버블버블수족관 | 경상남도 양산시 물금읍 물금로 9, 107호 | 2020-07-23
27 | 28 | 동물 ( 판매 ) | 엔젤스펫(Angels pet) | 경상남도 양산시 하북면 양산대로 2004 | 2020-10-16
28 | 29 | 동물 ( 판매 ) | 올리스 | 경상남도 양산시 동면 금오7길 33-16 | 2021-03-16
29 | 30 | 동물 ( 판매 ) | 멍멍친구 애견샵 | 경상남도 양산시 물금읍 목화로 55, 102호 | 2021-04-13