Overview

Dataset statistics

Number of variables7
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory62.4 B

Variable types

Numeric2
Text2
Categorical3

Dataset

Description경상남도 사천시 방문판매사업자 현황 자료입니다.(연번, 법인 또는 상호, 법인구분, 소재지우편번호, 소재지주소, 취급품목)
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15114250

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 법인구분High correlation
법인구분 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
취급품목 is highly overall correlated with 법인구분High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:11:49.273947
Analysis finished2023-12-10 23:11:50.484160
Duration1.21 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-11T08:11:50.546986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q18.25
median15.5
Q322.75
95-th percentile28.55
Maximum30
Range29
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation8.8034084
Coefficient of variation (CV)0.56796183
Kurtosis-1.2
Mean15.5
Median Absolute Deviation (MAD)7.5
Skewness0
Sum465
Variance77.5
MonotonicityStrictly increasing
2023-12-11T08:11:50.690247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 1
 
3.3%
17 1
 
3.3%
30 1
 
3.3%
29 1
 
3.3%
28 1
 
3.3%
27 1
 
3.3%
26 1
 
3.3%
25 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1 1
3.3%
2 1
3.3%
3 1
3.3%
4 1
3.3%
5 1
3.3%
6 1
3.3%
7 1
3.3%
8 1
3.3%
9 1
3.3%
10 1
3.3%
ValueCountFrequency (%)
30 1
3.3%
29 1
3.3%
28 1
3.3%
27 1
3.3%
26 1
3.3%
25 1
3.3%
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%
Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-11T08:11:50.960150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length8.4666667
Min length3

Characters and Unicode

Total characters254
Distinct characters101
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st row숲에서
2nd row제이케이코스메틱 바비대리점
3rd row엘모코스메틱
4th row아주스토아
5th row아모레카운셀러
ValueCountFrequency (%)
아모레카운셀러 2
 
4.9%
정동농업협동조합 1
 
2.4%
가맹점 1
 
2.4%
숲에서 1
 
2.4%
사천축산업협동조합 1
 
2.4%
사천농업협동조합 1
 
2.4%
삼천포농업협동조합 1
 
2.4%
사남농업협동조합 1
 
2.4%
르노삼성자동차 1
 
2.4%
사천대리점 1
 
2.4%
Other values (30) 30
73.2%
2023-12-11T08:11:51.400164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
4.7%
12
 
4.7%
11
 
4.3%
9
 
3.5%
9
 
3.5%
9
 
3.5%
8
 
3.1%
8
 
3.1%
6
 
2.4%
6
 
2.4%
Other values (91) 164
64.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 241
94.9%
Space Separator 11
 
4.3%
Close Punctuation 1
 
0.4%
Open Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.0%
12
 
5.0%
9
 
3.7%
9
 
3.7%
9
 
3.7%
8
 
3.3%
8
 
3.3%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (88) 156
64.7%
Space Separator
ValueCountFrequency (%)
11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 241
94.9%
Common 13
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.0%
12
 
5.0%
9
 
3.7%
9
 
3.7%
9
 
3.7%
8
 
3.3%
8
 
3.3%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (88) 156
64.7%
Common
ValueCountFrequency (%)
11
84.6%
) 1
 
7.7%
( 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 241
94.9%
ASCII 13
 
5.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
5.0%
12
 
5.0%
9
 
3.7%
9
 
3.7%
9
 
3.7%
8
 
3.3%
8
 
3.3%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (88) 156
64.7%
ASCII
ValueCountFrequency (%)
11
84.6%
) 1
 
7.7%
( 1
 
7.7%

법인구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
개인
23 
법인

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 23
76.7%
법인 7
 
23.3%

Length

2023-12-11T08:11:51.582161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:11:51.703695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 23
76.7%
법인 7
 
23.3%

소재지우편번호
Real number (ℝ)

Distinct19
Distinct (%)63.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52539
Minimum52504
Maximum52568
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-11T08:11:51.792379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum52504
5-th percentile52509.85
Q152520.25
median52545
Q352557.25
95-th percentile52562
Maximum52568
Range64
Interquartile range (IQR)37

Descriptive statistics

Standard deviation19.27836
Coefficient of variation (CV)0.00036693428
Kurtosis-1.2954651
Mean52539
Median Absolute Deviation (MAD)15.5
Skewness-0.27263442
Sum1576170
Variance371.65517
MonotonicityNot monotonic
2023-12-11T08:11:51.898327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
52548 3
 
10.0%
52517 3
 
10.0%
52520 3
 
10.0%
52560 2
 
6.7%
52562 2
 
6.7%
52521 2
 
6.7%
52558 2
 
6.7%
52504 2
 
6.7%
52536 1
 
3.3%
52543 1
 
3.3%
Other values (9) 9
30.0%
ValueCountFrequency (%)
52504 2
6.7%
52517 3
10.0%
52520 3
10.0%
52521 2
6.7%
52523 1
 
3.3%
52532 1
 
3.3%
52536 1
 
3.3%
52539 1
 
3.3%
52543 1
 
3.3%
52547 1
 
3.3%
ValueCountFrequency (%)
52568 1
 
3.3%
52562 2
6.7%
52561 1
 
3.3%
52560 2
6.7%
52558 2
6.7%
52555 1
 
3.3%
52551 1
 
3.3%
52550 1
 
3.3%
52548 3
10.0%
52547 1
 
3.3%
Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-11T08:11:52.104027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length33
Mean length27.233333
Min length20

Characters and Unicode

Total characters817
Distinct characters95
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st row경상남도 사천시 사천읍 옥산로 71, 1층
2nd row경상남도 사천시 사천읍 옥산로 97, 104동 1102호 (덕진봄아파트)
3rd row경상남도 사천시 주공로 23, 2층 201호 (벌리동)
4th row경상남도 사천시 곤명면 경서대로 3417
5th row경상남도 사천시 중앙로 126, 금양빌딩 5층 (벌리동)
ValueCountFrequency (%)
경상남도 30
 
16.6%
사천시 30
 
16.6%
벌리동 6
 
3.3%
사천읍 6
 
3.3%
1층 5
 
2.8%
진삼로 4
 
2.2%
옥산로 4
 
2.2%
중앙로 3
 
1.7%
정동면 3
 
1.7%
주공로 3
 
1.7%
Other values (76) 87
48.1%
2023-12-11T08:11:52.412793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151
18.5%
43
 
5.3%
42
 
5.1%
33
 
4.0%
33
 
4.0%
1 32
 
3.9%
31
 
3.8%
31
 
3.8%
30
 
3.7%
26
 
3.2%
Other values (85) 365
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 477
58.4%
Space Separator 151
 
18.5%
Decimal Number 130
 
15.9%
Other Punctuation 18
 
2.2%
Close Punctuation 18
 
2.2%
Open Punctuation 18
 
2.2%
Dash Punctuation 5
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
9.0%
42
 
8.8%
33
 
6.9%
33
 
6.9%
31
 
6.5%
31
 
6.5%
30
 
6.3%
26
 
5.5%
24
 
5.0%
9
 
1.9%
Other values (70) 175
36.7%
Decimal Number
ValueCountFrequency (%)
1 32
24.6%
2 20
15.4%
0 19
14.6%
4 15
11.5%
3 11
 
8.5%
5 9
 
6.9%
7 8
 
6.2%
9 7
 
5.4%
6 5
 
3.8%
8 4
 
3.1%
Space Separator
ValueCountFrequency (%)
151
100.0%
Other Punctuation
ValueCountFrequency (%)
18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 477
58.4%
Common 340
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
9.0%
42
 
8.8%
33
 
6.9%
33
 
6.9%
31
 
6.5%
31
 
6.5%
30
 
6.3%
26
 
5.5%
24
 
5.0%
9
 
1.9%
Other values (70) 175
36.7%
Common
ValueCountFrequency (%)
151
44.4%
1 32
 
9.4%
2 20
 
5.9%
0 19
 
5.6%
18
 
5.3%
) 18
 
5.3%
( 18
 
5.3%
4 15
 
4.4%
3 11
 
3.2%
5 9
 
2.6%
Other values (5) 29
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 477
58.4%
ASCII 322
39.4%
None 18
 
2.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151
46.9%
1 32
 
9.9%
2 20
 
6.2%
0 19
 
5.9%
) 18
 
5.6%
( 18
 
5.6%
4 15
 
4.7%
3 11
 
3.4%
5 9
 
2.8%
7 8
 
2.5%
Other values (4) 21
 
6.5%
Hangul
ValueCountFrequency (%)
43
 
9.0%
42
 
8.8%
33
 
6.9%
33
 
6.9%
31
 
6.5%
31
 
6.5%
30
 
6.3%
26
 
5.5%
24
 
5.0%
9
 
1.9%
Other values (70) 175
36.7%
None
ValueCountFrequency (%)
18
100.0%

취급품목
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
기타
건강식품 화장품/미용용품
화장품/미용용품
건강식품
자동차/자동차용품
Other values (7)

Length

Max length31
Median length13
Mean length9.3666667
Min length2

Unique

Unique7 ?
Unique (%)23.3%

Sample

1st row건강식품 화장품/미용용품 가전 의류/패션 기타
2nd row화장품/미용용품
3rd row건강식품 화장품/미용용품 생활용품/세제류 의류/패션 기타
4th row건강식품 화장품/미용용품
5th row건강식품 화장품/미용용품

Common Values

ValueCountFrequency (%)
기타 7
23.3%
건강식품 화장품/미용용품 6
20.0%
화장품/미용용품 4
13.3%
건강식품 3
10.0%
자동차/자동차용품 3
10.0%
건강식품 화장품/미용용품 가전 의류/패션 기타 1
 
3.3%
건강식품 화장품/미용용품 생활용품/세제류 의류/패션 기타 1
 
3.3%
가전 1
 
3.3%
건강식품 가전 1
 
3.3%
건강식품 화장품/미용용품 생활용품/세제류 가전 의류/패션 1
 
3.3%
Other values (2) 2
 
6.7%

Length

2023-12-11T08:11:52.529988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
건강식품 13
25.5%
화장품/미용용품 13
25.5%
기타 11
21.6%
가전 4
 
7.8%
생활용품/세제류 4
 
7.8%
자동차/자동차용품 3
 
5.9%
의류/패션 3
 
5.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-10-31
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-31
2nd row2023-10-31
3rd row2023-10-31
4th row2023-10-31
5th row2023-10-31

Common Values

ValueCountFrequency (%)
2023-10-31 30
100.0%

Length

2023-12-11T08:11:52.631163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:11:52.716672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-31 30
100.0%

Interactions

2023-12-11T08:11:49.783278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:11:49.602393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:11:49.887882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:11:49.698451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:11:52.768558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번법인또는상호법인구분소재지우편번호소재지주소취급품목
연번1.0001.0000.8510.6241.0000.608
법인또는상호1.0001.0001.0000.9171.0001.000
법인구분0.8511.0001.0000.4421.0000.913
소재지우편번호0.6240.9170.4421.0000.9170.698
소재지주소1.0001.0001.0000.9171.0001.000
취급품목0.6081.0000.9130.6981.0001.000
2023-12-11T08:11:52.918259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
취급품목법인구분
취급품목1.0000.606
법인구분0.6061.000
2023-12-11T08:11:52.989876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지우편번호법인구분취급품목
연번1.000-0.2630.5730.262
소재지우편번호-0.2631.0000.4100.331
법인구분0.5730.4101.0000.606
취급품목0.2620.3310.6061.000

Missing values

2023-12-11T08:11:50.022780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:11:50.437830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번법인또는상호법인구분소재지우편번호소재지주소취급품목데이터기준일자
01숲에서개인52560경상남도 사천시 사천읍 옥산로 71, 1층건강식품 화장품/미용용품 가전 의류/패션 기타2023-10-31
12제이케이코스메틱 바비대리점개인52560경상남도 사천시 사천읍 옥산로 97, 104동 1102호 (덕진봄아파트)화장품/미용용품2023-10-31
23엘모코스메틱개인52517경상남도 사천시 주공로 23, 2층 201호 (벌리동)건강식품 화장품/미용용품 생활용품/세제류 의류/패션 기타2023-10-31
34아주스토아개인52568경상남도 사천시 곤명면 경서대로 3417건강식품 화장품/미용용품2023-10-31
45아모레카운셀러개인52562경상남도 사천시 중앙로 126, 금양빌딩 5층 (벌리동)건강식품 화장품/미용용품2023-10-31
56아모레카운셀러개인52517경상남도 사천시 중앙로 126, 금양빌딩 5층 (벌리동)건강식품 화장품/미용용품2023-10-31
67제이케이삼천포점개인52521경상남도 사천시 건어시장길 29, 그린가요방 1층 (선구동)건강식품 화장품/미용용품2023-10-31
78코웨이 사천 삼천포 대리점개인52539경상남도 사천시 신항로 64, 상가동 2층 204호 (동금동, 삼천포 예미지)가전2023-10-31
89코웨이누리대리점개인52558경상남도 사천시 사천읍 동문4길 79, 1층 102호건강식품 가전2023-10-31
910종근당건강 헬스벨스토리 사천정동점개인52551경상남도 사천시 정동면 옥산로 50-59, 1층 (동백빌)건강식품 화장품/미용용품2023-10-31
연번법인또는상호법인구분소재지우편번호소재지주소취급품목데이터기준일자
2021삼천포농업협동조합법인52548경상남도 사천시 주공로 2 (벌리동)기타2023-10-31
2122사남농업협동조합법인52520경상남도 사천시 사남면 진삼로 1113기타2023-10-31
2223르노삼성자동차 사천대리점개인52548경상남도 사천시 사남면 사천대로 1642자동차/자동차용품2023-10-31
2324삼천포남양우유가정대리점개인52504경상남도 사천시 주공로 80 (용강동)기타2023-10-31
2425여성의로망개인52548경상남도 사천시 용강길 37, 204동 804호 (용강동,용강2주공아파트)화장품/미용용품2023-10-31
2526생녹용직판장개인52504경상남도 사천시 곤양면 곤북로 491기타2023-10-31
2627에치와이 사천점개인52561경상남도 사천시 문선6길 20 (벌리동)건강식품2023-10-31
2728서포농협하나로마트법인52543경상남도 사천시 서포면 사천대교로 731생활용품/세제류 기타2023-10-31
2829기아가나대리점개인52520경상남도 사천시 사천읍 진삼로 1423자동차/자동차용품2023-10-31
2930현대동금판매대리점개인52558경상남도 사천시 중앙로 103-1 (벌리동)자동차/자동차용품2023-10-31