Overview

Dataset statistics

Number of variables5
Number of observations114
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory41.2 B

Variable types

Text3
Categorical2

Dataset

Description김해시 방문판매사업자에 대한 2022. 8. 19. 일자 현황으로 (사업장명, 법인구분, 사업장 소재지 주소, 취급품목 )에 대한 정보가 제공됩니다
Author경상남도 김해시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15033440

Alerts

법인구분 is highly overall correlated with 취급품목High correlation
취급품목 is highly overall correlated with 법인구분High correlation

Reproduction

Analysis started2023-12-11 00:16:54.746721
Analysis finished2023-12-11 00:16:55.214378
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct112
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-11T09:16:55.371176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length8.4385965
Min length2

Characters and Unicode

Total characters962
Distinct characters233
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)96.5%

Sample

1st row뉴질랜드 부원점
2nd row주식회사 슈퍼정보통신
3rd row다온 갤러리
4th row엘엔케이(LNK)
5th row야보아스
ValueCountFrequency (%)
주식회사 10
 
6.2%
마임 5
 
3.1%
인셀덤 4
 
2.5%
에치와이 3
 
1.9%
홍선생미술 2
 
1.2%
드림파 2
 
1.2%
기아 2
 
1.2%
김해지사 2
 
1.2%
ksi 1
 
0.6%
장유농업협동조합 1
 
0.6%
Other values (128) 128
80.0%
2023-12-11T09:16:55.783295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46
 
4.8%
35
 
3.6%
32
 
3.3%
31
 
3.2%
26
 
2.7%
25
 
2.6%
24
 
2.5%
24
 
2.5%
21
 
2.2%
21
 
2.2%
Other values (223) 677
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 843
87.6%
Space Separator 46
 
4.8%
Uppercase Letter 20
 
2.1%
Close Punctuation 15
 
1.6%
Open Punctuation 15
 
1.6%
Decimal Number 10
 
1.0%
Lowercase Letter 10
 
1.0%
Other Punctuation 2
 
0.2%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
4.2%
32
 
3.8%
31
 
3.7%
26
 
3.1%
25
 
3.0%
24
 
2.8%
24
 
2.8%
21
 
2.5%
21
 
2.5%
20
 
2.4%
Other values (192) 584
69.3%
Uppercase Letter
ValueCountFrequency (%)
K 5
25.0%
S 2
 
10.0%
N 2
 
10.0%
D 2
 
10.0%
O 2
 
10.0%
L 1
 
5.0%
H 1
 
5.0%
I 1
 
5.0%
C 1
 
5.0%
G 1
 
5.0%
Other values (2) 2
 
10.0%
Lowercase Letter
ValueCountFrequency (%)
c 2
20.0%
f 1
10.0%
a 1
10.0%
t 1
10.0%
o 1
10.0%
r 1
10.0%
h 1
10.0%
y 1
10.0%
e 1
10.0%
Decimal Number
ValueCountFrequency (%)
1 5
50.0%
0 2
 
20.0%
2 2
 
20.0%
4 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 841
87.4%
Common 88
 
9.1%
Latin 30
 
3.1%
Han 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
4.2%
32
 
3.8%
31
 
3.7%
26
 
3.1%
25
 
3.0%
24
 
2.9%
24
 
2.9%
21
 
2.5%
21
 
2.5%
20
 
2.4%
Other values (190) 582
69.2%
Latin
ValueCountFrequency (%)
K 5
16.7%
S 2
 
6.7%
N 2
 
6.7%
D 2
 
6.7%
O 2
 
6.7%
c 2
 
6.7%
L 1
 
3.3%
H 1
 
3.3%
I 1
 
3.3%
C 1
 
3.3%
Other values (11) 11
36.7%
Common
ValueCountFrequency (%)
46
52.3%
) 15
 
17.0%
( 15
 
17.0%
1 5
 
5.7%
0 2
 
2.3%
2 2
 
2.3%
/ 1
 
1.1%
4 1
 
1.1%
& 1
 
1.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 840
87.3%
ASCII 118
 
12.3%
CJK 3
 
0.3%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46
39.0%
) 15
 
12.7%
( 15
 
12.7%
K 5
 
4.2%
1 5
 
4.2%
S 2
 
1.7%
N 2
 
1.7%
D 2
 
1.7%
0 2
 
1.7%
O 2
 
1.7%
Other values (20) 22
18.6%
Hangul
ValueCountFrequency (%)
35
 
4.2%
32
 
3.8%
31
 
3.7%
26
 
3.1%
25
 
3.0%
24
 
2.9%
24
 
2.9%
21
 
2.5%
21
 
2.5%
20
 
2.4%
Other values (189) 581
69.2%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

법인구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
개인
87 
법인
27 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row법인
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 87
76.3%
법인 27
 
23.7%

Length

2023-12-11T09:16:55.913796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:16:56.001201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 87
76.3%
법인 27
 
23.7%
Distinct81
Distinct (%)71.1%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-11T09:16:56.229971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.3508772
Min length5

Characters and Unicode

Total characters610
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)49.1%

Sample

1st row50926
2nd row50922
3rd row50926
4th row50935
5th row50943
ValueCountFrequency (%)
50905 5
 
4.4%
50981 3
 
2.6%
50858 3
 
2.6%
50948 3
 
2.6%
50943 3
 
2.6%
50922 3
 
2.6%
50901 2
 
1.8%
50814 2
 
1.8%
50920 2
 
1.8%
50808 2
 
1.8%
Other values (71) 86
75.4%
2023-12-11T09:16:56.680324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 141
23.1%
5 120
19.7%
9 77
12.6%
8 57
9.3%
2 51
 
8.4%
1 51
 
8.4%
6 39
 
6.4%
4 21
 
3.4%
- 20
 
3.3%
3 18
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 590
96.7%
Dash Punctuation 20
 
3.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 141
23.9%
5 120
20.3%
9 77
13.1%
8 57
9.7%
2 51
 
8.6%
1 51
 
8.6%
6 39
 
6.6%
4 21
 
3.6%
3 18
 
3.1%
7 15
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 610
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 141
23.1%
5 120
19.7%
9 77
12.6%
8 57
9.3%
2 51
 
8.4%
1 51
 
8.4%
6 39
 
6.4%
4 21
 
3.4%
- 20
 
3.3%
3 18
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 610
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 141
23.1%
5 120
19.7%
9 77
12.6%
8 57
9.3%
2 51
 
8.4%
1 51
 
8.4%
6 39
 
6.4%
4 21
 
3.4%
- 20
 
3.3%
3 18
 
3.0%
Distinct109
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-11T09:16:56.951433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length41
Mean length30.587719
Min length19

Characters and Unicode

Total characters3487
Distinct characters179
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique104 ?
Unique (%)91.2%

Sample

1st row경상남도 김해시 가락로 19, 아름다운 뉴욕 메디컬센터 103호 (부원동)
2nd row경상남도 김해시 호계로438번길 11, KT김해지점 5층 (부원동)
3rd row경상남도 김해시 가락로 35, 동방빌딩 4층 (부원동)
4th row경상남도 김해시 김해대로2492번길 20, 메가마트 김해점 2층 프로스펙스호 (삼정동)
5th row경상남도 김해시 내외중앙로 137, 322호 (내동)
ValueCountFrequency (%)
경상남도 114
 
16.1%
김해시 114
 
16.1%
14
 
2.0%
1층 12
 
1.7%
2층 10
 
1.4%
10
 
1.4%
김해대로 9
 
1.3%
진영읍 9
 
1.3%
내동 8
 
1.1%
부원동 7
 
1.0%
Other values (277) 400
56.6%
2023-12-11T09:16:57.390680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
593
 
17.0%
137
 
3.9%
135
 
3.9%
1 134
 
3.8%
124
 
3.6%
120
 
3.4%
2 120
 
3.4%
116
 
3.3%
116
 
3.3%
116
 
3.3%
Other values (169) 1776
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1961
56.2%
Decimal Number 596
 
17.1%
Space Separator 593
 
17.0%
Other Punctuation 113
 
3.2%
Close Punctuation 95
 
2.7%
Open Punctuation 95
 
2.7%
Dash Punctuation 26
 
0.7%
Uppercase Letter 7
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
7.0%
135
 
6.9%
124
 
6.3%
120
 
6.1%
116
 
5.9%
116
 
5.9%
116
 
5.9%
115
 
5.9%
114
 
5.8%
57
 
2.9%
Other values (149) 811
41.4%
Decimal Number
ValueCountFrequency (%)
1 134
22.5%
2 120
20.1%
0 63
10.6%
3 62
10.4%
4 51
 
8.6%
5 43
 
7.2%
7 36
 
6.0%
9 33
 
5.5%
6 27
 
4.5%
8 27
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
T 3
42.9%
K 3
42.9%
S 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
88
77.9%
* 25
 
22.1%
Space Separator
ValueCountFrequency (%)
593
100.0%
Close Punctuation
ValueCountFrequency (%)
) 95
100.0%
Open Punctuation
ValueCountFrequency (%)
( 95
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1961
56.2%
Common 1518
43.5%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
7.0%
135
 
6.9%
124
 
6.3%
120
 
6.1%
116
 
5.9%
116
 
5.9%
116
 
5.9%
115
 
5.9%
114
 
5.8%
57
 
2.9%
Other values (149) 811
41.4%
Common
ValueCountFrequency (%)
593
39.1%
1 134
 
8.8%
2 120
 
7.9%
) 95
 
6.3%
( 95
 
6.3%
88
 
5.8%
0 63
 
4.2%
3 62
 
4.1%
4 51
 
3.4%
5 43
 
2.8%
Other values (6) 174
 
11.5%
Latin
ValueCountFrequency (%)
T 3
37.5%
K 3
37.5%
S 1
 
12.5%
e 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1961
56.2%
ASCII 1438
41.2%
None 88
 
2.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
593
41.2%
1 134
 
9.3%
2 120
 
8.3%
) 95
 
6.6%
( 95
 
6.6%
0 63
 
4.4%
3 62
 
4.3%
4 51
 
3.5%
5 43
 
3.0%
7 36
 
2.5%
Other values (9) 146
 
10.2%
Hangul
ValueCountFrequency (%)
137
 
7.0%
135
 
6.9%
124
 
6.3%
120
 
6.1%
116
 
5.9%
116
 
5.9%
116
 
5.9%
115
 
5.9%
114
 
5.8%
57
 
2.9%
Other values (149) 811
41.4%
None
ValueCountFrequency (%)
88
100.0%

취급품목
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)24.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
기타
20 
건강식품 화장품/미용용품
17 
자동차/자동차용품
15 
화장품/미용용품
14 
건강식품
14 
Other values (23)
34 

Length

Max length37
Median length28
Mean length8.4122807
Min length2

Unique

Unique19 ?
Unique (%)16.7%

Sample

1st row건강식품
2nd row통신기기 기타
3rd row화장품/미용용품 생활용품/세제류
4th row기타
5th row화장품/미용용품

Common Values

ValueCountFrequency (%)
기타 20
17.5%
건강식품 화장품/미용용품 17
14.9%
자동차/자동차용품 15
13.2%
화장품/미용용품 14
12.3%
건강식품 14
12.3%
통신기기 7
 
6.1%
교육/도서 4
 
3.5%
가전 2
 
1.8%
통신기기 기타 2
 
1.8%
기타 건강식품 1
 
0.9%
Other values (18) 18
15.8%

Length

2023-12-11T09:16:57.552066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화장품/미용용품 41
24.6%
건강식품 39
23.4%
기타 29
17.4%
자동차/자동차용품 15
 
9.0%
통신기기 13
 
7.8%
생활용품/세제류 10
 
6.0%
가전 6
 
3.6%
교육/도서 5
 
3.0%
컴퓨터/사무용품 5
 
3.0%
의류/패션 4
 
2.4%

Correlations

2023-12-11T09:16:57.646681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인구분소재지우편번호취급품목
법인구분1.0000.3410.836
소재지우편번호0.3411.0000.000
취급품목0.8360.0001.000
2023-12-11T09:16:57.750051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인구분취급품목
법인구분1.0000.610
취급품목0.6101.000
2023-12-11T09:16:57.854866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인구분취급품목
법인구분1.0000.610
취급품목0.6101.000

Missing values

2023-12-11T09:16:55.063206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:16:55.165351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

법인또는상호법인구분소재지우편번호소재지주소취급품목
0뉴질랜드 부원점개인50926경상남도 김해시 가락로 19, 아름다운 뉴욕 메디컬센터 103호 (부원동)건강식품
1주식회사 슈퍼정보통신법인50922경상남도 김해시 호계로438번길 11, KT김해지점 5층 (부원동)통신기기 기타
2다온 갤러리개인50926경상남도 김해시 가락로 35, 동방빌딩 4층 (부원동)화장품/미용용품 생활용품/세제류
3엘엔케이(LNK)개인50935경상남도 김해시 김해대로2492번길 20, 메가마트 김해점 2층 프로스펙스호 (삼정동)기타
4야보아스개인50943경상남도 김해시 내외중앙로 137, 322호 (내동)화장품/미용용품
5DH홀쇼핑개인50865경상남도 김해시 진영읍 여래로20번길 7-1, 금양종합상가 1층 2호건강식품 화장품/미용용품 생활용품/세제류
6(주)금해법인50973경상남도 김해시 내덕로148번길 28(내덕동)컴퓨터/사무용품 통신기기 기타
7비에스컴퍼니개인50989경상남도 김해시 번화1로 78, 미건빌딩 202호 (대청동)생활용품/세제류
8드림파개인50933경상남도 김해시 김해대로2511번길 8 (삼정동, 경성빌라)기타 건강식품
9(대성)OK120개인50881경상남도 김해시 김해대로 1784-14, *동 *호 (삼계동, 동신아파트)기타
법인또는상호법인구분소재지우편번호소재지주소취급품목
104스마트114 주식회사법인621-909경상남도 김해시 삼안로297번길 8-4 (삼방동)기타
105장유쉐보레자동차판매개인51001경상남도 김해시 장유로 406-1 (신문동)자동차/자동차용품
106김정문알로에장유지점개인621-250경상남도 김해시 장유로 194, 302호 (부곡동)건강식품
107쌍용자동차김해가야판매대리점개인50890경상남도 김해시 금관대로 1349 (내동)자동차/자동차용품
108현대자동차김해어방판매대리점개인621-917경상남도 김해시 인제로 130 (어방동)자동차/자동차용품
109김해남부쉐보레자동차판매개인621-040경상남도 김해시 김해대로 2301 (봉황동)자동차/자동차용품
110기아 김해가야대리점개인621-916경상남도 김해시 김해대로2541번길 2 (어방동)자동차/자동차용품
111현대가야판매대리점개인50936경상남도 김해시 김해대로 2634 (안동)자동차/자동차용품
112시즌글라스개인621-030경상남도 김해시 가락로 109-1 (서상동)건강식품
113기아 장유대리점개인621-280경상남도 김해시 대청로104번길 69 (대청동)자동차/자동차용품