Overview

Dataset statistics

Number of variables8
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory67.6 B

Variable types

Numeric1
Text3
Categorical3
DateTime1

Dataset

Description관내 방문판매업 현황에 대한 데이터로 관리번호, 법인 또는 상호, 운영상태, 법인구분, 소재지주소 항목을 제공합니다.
URLhttps://www.data.go.kr/data/3080051/fileData.do

Alerts

관리기관명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
법인구분 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
운영상태 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
번호 is highly overall correlated with 운영상태 and 1 other fieldsHigh correlation
운영상태 is highly imbalanced (67.3%)Imbalance
번호 has unique valuesUnique
관리번호 has unique valuesUnique
법인 또는 상호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:16:01.124722
Analysis finished2023-12-12 21:16:01.745269
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.5
Minimum1
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2023-12-13T06:16:01.824162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.45
Q113.25
median25.5
Q337.75
95-th percentile47.55
Maximum50
Range49
Interquartile range (IQR)24.5

Descriptive statistics

Standard deviation14.57738
Coefficient of variation (CV)0.57166195
Kurtosis-1.2
Mean25.5
Median Absolute Deviation (MAD)12.5
Skewness0
Sum1275
Variance212.5
MonotonicityStrictly increasing
2023-12-13T06:16:01.982945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
2.0%
39 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%

관리번호
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-13T06:16:02.202736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.6
Min length10

Characters and Unicode

Total characters680
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row2023-경기양주-0004
2nd row2023-경기양주-0003
3rd row2023-경기양주-0001
4th row2022-경기양주-0012
5th row2022-경기양주-0011
ValueCountFrequency (%)
2023-경기양주-0004 1
 
2.0%
2018-경기양주-0008 1
 
2.0%
1999-00014 1
 
2.0%
2020-경기양주-0001 1
 
2.0%
2019-경기양주-0017 1
 
2.0%
2019-경기양주-0011 1
 
2.0%
2019-경기양주-0005 1
 
2.0%
2019-경기양주-0004 1
 
2.0%
2019-경기양주-0003 1
 
2.0%
2019-경기양주-0002 1
 
2.0%
Other values (40) 40
80.0%
2023-12-13T06:16:02.572197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 194
28.5%
- 97
14.3%
2 90
13.2%
1 50
 
7.4%
47
 
6.9%
47
 
6.9%
47
 
6.9%
47
 
6.9%
9 17
 
2.5%
3 9
 
1.3%
Other values (5) 35
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 395
58.1%
Other Letter 188
27.6%
Dash Punctuation 97
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 194
49.1%
2 90
22.8%
1 50
 
12.7%
9 17
 
4.3%
3 9
 
2.3%
8 9
 
2.3%
7 8
 
2.0%
4 7
 
1.8%
5 7
 
1.8%
6 4
 
1.0%
Other Letter
ValueCountFrequency (%)
47
25.0%
47
25.0%
47
25.0%
47
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 97
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 492
72.4%
Hangul 188
 
27.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 194
39.4%
- 97
19.7%
2 90
18.3%
1 50
 
10.2%
9 17
 
3.5%
3 9
 
1.8%
8 9
 
1.8%
7 8
 
1.6%
4 7
 
1.4%
5 7
 
1.4%
Hangul
ValueCountFrequency (%)
47
25.0%
47
25.0%
47
25.0%
47
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 492
72.4%
Hangul 188
 
27.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 194
39.4%
- 97
19.7%
2 90
18.3%
1 50
 
10.2%
9 17
 
3.5%
3 9
 
1.8%
8 9
 
1.8%
7 8
 
1.6%
4 7
 
1.4%
5 7
 
1.4%
Hangul
ValueCountFrequency (%)
47
25.0%
47
25.0%
47
25.0%
47
25.0%
Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-13T06:16:02.853232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length8.12
Min length2

Characters and Unicode

Total characters406
Distinct characters149
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row이오스
2nd row좋은 날들
3rd row오케이바이오
4th row하이리 코스메틱
5th row양주 이오스타
ValueCountFrequency (%)
마임 5
 
6.8%
주식회사 3
 
4.1%
이오스 1
 
1.4%
양주옥정지사 1
 
1.4%
주)라이브존 1
 
1.4%
백석농업협동조합 1
 
1.4%
양주중앙센터 1
 
1.4%
윤선생영어교실 1
 
1.4%
양주농업협동조합 1
 
1.4%
에스테틱 1
 
1.4%
Other values (57) 57
78.1%
2023-12-13T06:16:03.324325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
5.9%
23
 
5.7%
19
 
4.7%
12
 
3.0%
12
 
3.0%
11
 
2.7%
10
 
2.5%
9
 
2.2%
8
 
2.0%
8
 
2.0%
Other values (139) 270
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 360
88.7%
Space Separator 23
 
5.7%
Lowercase Letter 8
 
2.0%
Close Punctuation 5
 
1.2%
Open Punctuation 5
 
1.2%
Uppercase Letter 3
 
0.7%
Other Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
6.7%
19
 
5.3%
12
 
3.3%
12
 
3.3%
11
 
3.1%
10
 
2.8%
9
 
2.5%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (127) 240
66.7%
Lowercase Letter
ValueCountFrequency (%)
e 3
37.5%
r 2
25.0%
n 1
 
12.5%
g 1
 
12.5%
v 1
 
12.5%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
L 1
33.3%
K 1
33.3%
Space Separator
ValueCountFrequency (%)
23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 360
88.7%
Common 35
 
8.6%
Latin 11
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
6.7%
19
 
5.3%
12
 
3.3%
12
 
3.3%
11
 
3.1%
10
 
2.8%
9
 
2.5%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (127) 240
66.7%
Latin
ValueCountFrequency (%)
e 3
27.3%
r 2
18.2%
E 1
 
9.1%
n 1
 
9.1%
g 1
 
9.1%
v 1
 
9.1%
L 1
 
9.1%
K 1
 
9.1%
Common
ValueCountFrequency (%)
23
65.7%
) 5
 
14.3%
( 5
 
14.3%
. 2
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 360
88.7%
ASCII 46
 
11.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
6.7%
19
 
5.3%
12
 
3.3%
12
 
3.3%
11
 
3.1%
10
 
2.8%
9
 
2.5%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (127) 240
66.7%
ASCII
ValueCountFrequency (%)
23
50.0%
) 5
 
10.9%
( 5
 
10.9%
e 3
 
6.5%
. 2
 
4.3%
r 2
 
4.3%
E 1
 
2.2%
n 1
 
2.2%
g 1
 
2.2%
v 1
 
2.2%
Other values (2) 2
 
4.3%

운영상태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
정상영업
47 
<NA>
 
3

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 47
94.0%
<NA> 3
 
6.0%

Length

2023-12-13T06:16:03.453550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:16:03.543572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 47
94.0%
na 3
 
6.0%

법인구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
개인
38 
법인
12 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 38
76.0%
법인 12
 
24.0%

Length

2023-12-13T06:16:03.648762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:16:03.771780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 38
76.0%
법인 12
 
24.0%
Distinct47
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-13T06:16:04.071620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39.5
Mean length28.74
Min length18

Characters and Unicode

Total characters1437
Distinct characters125
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)90.0%

Sample

1st row경기도 양주시 고읍북로 120, 803동 1407호 (만송동, 은빛마을 휴먼시아)
2nd row경기도 양주시 고읍남로 6-6, 301호,302호 (광사동)
3rd row경기도 양주시 은현면 은현로 137-29
4th row경기도 양주시 평화로 1395, 2층 (덕계동)
5th row경기도 양주시 고읍남로 32, 양주프라임타워 7층 701호 (광사동)
ValueCountFrequency (%)
경기도 50
 
16.1%
양주시 50
 
16.1%
광사동 8
 
2.6%
백석읍 8
 
2.6%
덕정동 7
 
2.3%
고읍남로 6
 
1.9%
1층 5
 
1.6%
화합로 4
 
1.3%
옥정동 4
 
1.3%
광적면 4
 
1.3%
Other values (130) 164
52.9%
2023-12-13T06:16:04.878490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
260
 
18.1%
1 57
 
4.0%
56
 
3.9%
55
 
3.8%
51
 
3.5%
50
 
3.5%
50
 
3.5%
50
 
3.5%
45
 
3.1%
41
 
2.9%
Other values (115) 722
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 783
54.5%
Decimal Number 266
 
18.5%
Space Separator 260
 
18.1%
Other Punctuation 38
 
2.6%
Open Punctuation 37
 
2.6%
Close Punctuation 37
 
2.6%
Dash Punctuation 12
 
0.8%
Uppercase Letter 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
7.2%
55
 
7.0%
51
 
6.5%
50
 
6.4%
50
 
6.4%
50
 
6.4%
45
 
5.7%
41
 
5.2%
23
 
2.9%
21
 
2.7%
Other values (97) 341
43.6%
Decimal Number
ValueCountFrequency (%)
1 57
21.4%
0 38
14.3%
2 35
13.2%
5 25
9.4%
3 24
9.0%
6 23
8.6%
7 21
 
7.9%
9 17
 
6.4%
8 13
 
4.9%
4 13
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
T 1
25.0%
S 1
25.0%
Space Separator
ValueCountFrequency (%)
260
100.0%
Other Punctuation
ValueCountFrequency (%)
38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 783
54.5%
Common 650
45.2%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
7.2%
55
 
7.0%
51
 
6.5%
50
 
6.4%
50
 
6.4%
50
 
6.4%
45
 
5.7%
41
 
5.2%
23
 
2.9%
21
 
2.7%
Other values (97) 341
43.6%
Common
ValueCountFrequency (%)
260
40.0%
1 57
 
8.8%
38
 
5.8%
0 38
 
5.8%
( 37
 
5.7%
) 37
 
5.7%
2 35
 
5.4%
5 25
 
3.8%
3 24
 
3.7%
6 23
 
3.5%
Other values (5) 76
 
11.7%
Latin
ValueCountFrequency (%)
A 2
50.0%
T 1
25.0%
S 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 783
54.5%
ASCII 616
42.9%
None 38
 
2.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
260
42.2%
1 57
 
9.3%
0 38
 
6.2%
( 37
 
6.0%
) 37
 
6.0%
2 35
 
5.7%
5 25
 
4.1%
3 24
 
3.9%
6 23
 
3.7%
7 21
 
3.4%
Other values (7) 59
 
9.6%
Hangul
ValueCountFrequency (%)
56
 
7.2%
55
 
7.0%
51
 
6.5%
50
 
6.4%
50
 
6.4%
50
 
6.4%
45
 
5.7%
41
 
5.2%
23
 
2.9%
21
 
2.7%
Other values (97) 341
43.6%
None
ValueCountFrequency (%)
38
100.0%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
양주시 일자리경제과
50 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시 일자리경제과
2nd row양주시 일자리경제과
3rd row양주시 일자리경제과
4th row양주시 일자리경제과
5th row양주시 일자리경제과

Common Values

ValueCountFrequency (%)
양주시 일자리경제과 50
100.0%

Length

2023-12-13T06:16:05.038589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:16:05.131635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양주시 50
50.0%
일자리경제과 50
50.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
Minimum2023-08-04 00:00:00
Maximum2023-08-04 00:00:00
2023-12-13T06:16:05.213474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:16:05.311807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T06:16:01.472037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:16:05.388670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호관리번호법인 또는 상호법인구분소재지주소
번호1.0001.0001.0000.8850.904
관리번호1.0001.0001.0001.0001.000
법인 또는 상호1.0001.0001.0001.0001.000
법인구분0.8851.0001.0001.0000.662
소재지주소0.9041.0001.0000.6621.000
2023-12-13T06:16:05.484173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인구분운영상태
법인구분1.0001.000
운영상태1.0001.000
2023-12-13T06:16:05.565459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호운영상태법인구분
번호1.0001.0000.656
운영상태1.0001.0001.000
법인구분0.6561.0001.000

Missing values

2023-12-13T06:16:01.584175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:16:01.700397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호관리번호법인 또는 상호운영상태법인구분소재지주소관리기관명데이터기준일자
012023-경기양주-0004이오스정상영업개인경기도 양주시 고읍북로 120, 803동 1407호 (만송동, 은빛마을 휴먼시아)양주시 일자리경제과2023-08-04
122023-경기양주-0003좋은 날들정상영업개인경기도 양주시 고읍남로 6-6, 301호,302호 (광사동)양주시 일자리경제과2023-08-04
232023-경기양주-0001오케이바이오정상영업개인경기도 양주시 은현면 은현로 137-29양주시 일자리경제과2023-08-04
342022-경기양주-0012하이리 코스메틱정상영업개인경기도 양주시 평화로 1395, 2층 (덕계동)양주시 일자리경제과2023-08-04
452022-경기양주-0011양주 이오스타정상영업개인경기도 양주시 고읍남로 32, 양주프라임타워 7층 701호 (광사동)양주시 일자리경제과2023-08-04
562022-경기양주-0010에코이라이프정상영업개인경기도 양주시 백석읍 고릉말로56번길 60양주시 일자리경제과2023-08-04
672022-경기양주-0009상록수(Evergreen)정상영업개인경기도 양주시 부흥로 2096, 301동 1504호 (삼숭동, TS푸른솔3차아파트)양주시 일자리경제과2023-08-04
782022-경기양주-0008태양힐링센터정상영업개인경기도 양주시 고읍남로 6, 세원메디컬프라자 5층 502-1호 (광사동)양주시 일자리경제과2023-08-04
892022-경기양주-0007종근당건강 헬스벨스토리 양주고읍점정상영업개인경기도 양주시 고읍남로 6-9, A104호 (광사동)양주시 일자리경제과2023-08-04
9102022-경기양주-0003마임 양주옥정사랑지사정상영업개인경기도 양주시 옥정동로7다길 54, 5층 505호 (옥정동)양주시 일자리경제과2023-08-04
번호관리번호법인 또는 상호운영상태법인구분소재지주소관리기관명데이터기준일자
40412018-경기양주-0002브라보농원정상영업개인경기도 양주시 백석읍 양주산성로 865-20양주시 일자리경제과2023-08-04
41422017-경기양주-011주식회사 성심통상정상영업법인경기도 양주시 백석읍 고릉말로 15, 1층양주시 일자리경제과2023-08-04
42432017-경기양주-010회천농업협동조합정상영업법인경기도 양주시 화합로 1369 (덕정동)양주시 일자리경제과2023-08-04
43442017-경기양주-009장흥농업협동조합정상영업법인경기도 양주시 장흥면 호국로 176양주시 일자리경제과2023-08-04
44452015-경기양주-003(주)한빛디앤에스정상영업법인경기도 양주시 화합로 1609 (회암동)양주시 일자리경제과2023-08-04
45462015-경기양주-001동성 에너텍정상영업개인경기도 양주시 화합로 1609 (회암동)양주시 일자리경제과2023-08-04
46472009-경기양주-7홍선생미술정상영업개인경기도 양주시 삼숭로38번길 80, 양주자이프라자 2225호 (삼숭동)양주시 일자리경제과2023-08-04
47482005-00006르노코리아자동차 양주대리점<NA>개인경기도 양주시 부흥로 1530, 1층 (남방동)양주시 일자리경제과2023-08-04
48491999-00014쉐보레양주대리점<NA>개인경기도 양주시 회정로 91 (덕정동)양주시 일자리경제과2023-08-04
49501996-00004기아자동차 덕정대리점<NA>개인경기도 양주시 평화로 1574 (회정동)양주시 일자리경제과2023-08-04