Overview

Dataset statistics

Number of variables: 5
Number of observations: 595
Missing cells: 453
Missing cells (%): 15.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 24.0 KiB
Average record size in memory: 41.2 B
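
The missing-cell percentage can be checked from the counts above. The sketch below assumes a cell is one (row, column) position, so the total is variables × observations; the helper name `missing_share` is illustrative and is not part of ydata-profiling.

```python
def missing_share(missing: int, total: int) -> float:
    """Return missing / total as a percentage rounded to one decimal."""
    return round(100 * missing / total, 1)


n_variables = 5
n_observations = 595
missing_cells = 453

# Dataset level: every (row, column) position counts as one cell.
dataset_pct = missing_share(missing_cells, n_variables * n_observations)

# Column level: the report attributes all 453 missing cells to 전화번호.
column_pct = missing_share(missing_cells, n_observations)

print(dataset_pct, column_pct)
```

The two results correspond to the dataset-wide figure and to the per-column figure reported for 전화번호 (telephone number).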

Variable types

Numeric: 1
Text: 2
DateTime: 1
Categorical: 1

Dataset

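Description: The status of sponsored door-to-door sales businesses in Gyeongsangnam-do, providing each business's corporate name, registration date, operating status, and telephone number.
URL: https://www.data.go.kr/data/15102929/fileData.do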

Alerts

운영구분 (operating status) has constant value "운영중" (in operation) [Constant]
전화번호 (telephone number) has 453 (76.1%) missing values [Missing]
번호 (serial number) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 04:49:15.536558
Analysis finished: 2023-12-12 04:49:16.116675
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 595
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 298
Minimum: 1
Maximum: 595
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 30.7
Q1: 149.5
Median: 298
Q3: 446.5
95-th percentile: 565.3
Maximum: 595
Range: 594
Interquartile range (IQR): 297
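
The fractional quantiles (30.7, 149.5, and so on) are consistent with linear interpolation between order statistics, which is the default in pandas and NumPy. A minimal sketch, assuming 번호 holds the consecutive integers 1 to 595 (suggested by the minimum, maximum, and distinct count, but not stated outright); `percentile` is a hypothetical helper written for illustration.

```python
def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 100]."""
    pos = (len(sorted_values) - 1) * q / 100  # fractional index
    lower = int(pos)
    frac = pos - lower
    if lower + 1 >= len(sorted_values):
        return float(sorted_values[-1])
    step = sorted_values[lower + 1] - sorted_values[lower]
    return sorted_values[lower] + frac * step


# Assumption: the column is exactly 1, 2, ..., 595.
values = list(range(1, 596))

p5 = percentile(values, 5)
q1 = percentile(values, 25)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(p5, q1, q3, p95, iqr)
```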

Descriptive statistics

Standard deviation: 171.90598
Coefficient of variation (CV): 0.57686571
Kurtosis: -1.2
Mean: 298
Median Absolute Deviation (MAD): 149
Skewness: 0
Sum: 177310
Variance: 29551.667
Monotonicity: Strictly increasing
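
These figures can be reproduced with the standard library alone. The sketch assumes, as above, that 번호 is the sequence 1 to 595, and that variance and standard deviation use the sample (n - 1) denominator, which matches the reported values.

```python
import statistics

# Assumption: the column is exactly 1, 2, ..., 595.
values = list(range(1, 596))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1)
std = statistics.stdev(values)          # sample standard deviation
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Strictly increasing: each value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, round(variance, 3), round(std, 5), round(cv, 8), mad)
```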
Histogram with fixed size bins (bins=50)
Common values

Value               Count  Frequency (%)
1                   1      0.2%
393                 1      0.2%
395                 1      0.2%
396                 1      0.2%
397                 1      0.2%
398                 1      0.2%
399                 1      0.2%
400                 1      0.2%
401                 1      0.2%
402                 1      0.2%
Other values (585)  585    98.3%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.2%
2      1      0.2%
3      1      0.2%
4      1      0.2%
5      1      0.2%
6      1      0.2%
7      1      0.2%
8      1      0.2%
9      1      0.2%
10     1      0.2%

Maximum 10 values

Value  Count  Frequency (%)
595    1      0.2%
594    1      0.2%
593    1      0.2%
592    1      0.2%
591    1      0.2%
590    1      0.2%
589    1      0.2%
588    1      0.2%
587    1      0.2%
586    1      0.2%

법인명 (corporate name)
Text

Distinct: 586
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB

Length

Max length: 30
Median length: 16
Mean length: 8.5630252
Min length: 2

Characters and Unicode

Total characters: 5095
Distinct characters: 388
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
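
As an illustration of these character properties, Python's `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one corporate name taken from the sample rows of this report; the two-letter codes map to the category names used in the tables (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Zs` = Space Separator, `Ps` = Open Punctuation, `Pe` = Close Punctuation). The helper `category_counts` is illustrative and is not how ydata-profiling is necessarily implemented.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One value from the 법인명 column, as shown in the sample rows.
name = "브리앙뜨 인셀덤 (BRILLANT INCELLDERM)"
counts = category_counts(name)

for category, count in counts.most_common():
    print(category, count)
```

This name is 30 characters long, which matches the reported maximum length for the column.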

Unique

Unique: 580
Unique (%): 97.5%

Sample

1st row: 인셀덤어썸
2nd row: 인셀덤 경남셀대리점
3rd row: 다올인셀덤
4th row: 인셀덤 리치 대리점
5th row: 인셀덤 그린썸

Value               Count  Frequency (%)
인셀덤              250    23.9%
대리점              77     7.3%
유니베라            14     1.3%
주식회사            11     1.0%
엑소바이옴          9      0.9%
아모레퍼시픽        7      0.7%
럭셔리뷰티          7      0.7%
오휘                5      0.5%
쥬단학              4      0.4%
미라클              4      0.4%
Other values (619)  660    63.0%

Most occurring characters

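Value               Count  Frequency (%)
(space)             453    8.9%
[not preserved]     382    7.5%
[not preserved]     377    7.4%
[not preserved]     366    7.2%
[not preserved]     318    6.2%
[not preserved]     293    5.8%
[not preserved]     232    4.6%
[not preserved]     67     1.3%
[not preserved]     65     1.3%
[not preserved]     64     1.3%
Other values (378)  2478   48.6%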

Most occurring categories

Value              Count  Frequency (%)
Other Letter       4393   86.2%
Space Separator    453    8.9%
Lowercase Letter   89     1.7%
Uppercase Letter   67     1.3%
Close Punctuation  30     0.6%
Open Punctuation   30     0.6%
Decimal Number     22     0.4%
Other Punctuation  11     0.2%

Most frequent character per category

Other Letter

Value               Count  Frequency (%)
[not preserved]     382    8.7%
[not preserved]     377    8.6%
[not preserved]     366    8.3%
[not preserved]     318    7.2%
[not preserved]     293    6.7%
[not preserved]     232    5.3%
[not preserved]     67     1.5%
[not preserved]     65     1.5%
[not preserved]     64     1.5%
[not preserved]     50     1.1%
Other values (323)  2179   49.6%

Lowercase Letter

Value              Count  Frequency (%)
e                  12     13.5%
o                  11     12.4%
n                  8      9.0%
i                  8      9.0%
a                  8      9.0%
r                  6      6.7%
s                  5      5.6%
l                  4      4.5%
d                  4      4.5%
h                  4      4.5%
Other values (10)  19     21.3%

Uppercase Letter

Value             Count  Frequency (%)
L                 11     16.4%
I                 7      10.4%
B                 6      9.0%
N                 6      9.0%
A                 5      7.5%
E                 5      7.5%
G                 5      7.5%
R                 4      6.0%
T                 3      4.5%
M                 3      4.5%
Other values (8)  12     17.9%

Decimal Number

Value  Count  Frequency (%)
0      6      27.3%
1      5      22.7%
4      4      18.2%
2      3      13.6%
5      2      9.1%
3      1      4.5%
7      1      4.5%

Other Punctuation

Value  Count  Frequency (%)
&      3      27.3%
.      2      18.2%
:      2      18.2%
#      1      9.1%
,      1      9.1%
/      1      9.1%
'      1      9.1%

Space Separator

Value    Count  Frequency (%)
(space)  453    100.0%

Close Punctuation

Value  Count  Frequency (%)
)      30     100.0%

Open Punctuation

Value  Count  Frequency (%)
(      30     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  4393   86.2%
Common  546    10.7%
Latin   156    3.1%

Most frequent character per script

Hangul

Value               Count  Frequency (%)
[not preserved]     382    8.7%
[not preserved]     377    8.6%
[not preserved]     366    8.3%
[not preserved]     318    7.2%
[not preserved]     293    6.7%
[not preserved]     232    5.3%
[not preserved]     67     1.5%
[not preserved]     65     1.5%
[not preserved]     64     1.5%
[not preserved]     50     1.1%
Other values (323)  2179   49.6%

Latin

Value              Count  Frequency (%)
e                  12     7.7%
L                  11     7.1%
o                  11     7.1%
n                  8      5.1%
i                  8      5.1%
a                  8      5.1%
I                  7      4.5%
B                  6      3.8%
r                  6      3.8%
N                  6      3.8%
Other values (28)  73     46.8%

Common

Value             Count  Frequency (%)
(space)           453    83.0%
)                 30     5.5%
(                 30     5.5%
0                 6      1.1%
1                 5      0.9%
4                 4      0.7%
&                 3      0.5%
2                 3      0.5%
5                 2      0.4%
.                 2      0.4%
Other values (7)  8      1.5%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  4393   86.2%
ASCII   702    13.8%

Most frequent character per block

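ASCII

Value              Count  Frequency (%)
(space)            453    64.5%
)                  30     4.3%
(                  30     4.3%
e                  12     1.7%
L                  11     1.6%
o                  11     1.6%
n                  8      1.1%
i                  8      1.1%
a                  8      1.1%
I                  7      1.0%
Other values (45)  124    17.7%

Hangul

Value               Count  Frequency (%)
[not preserved]     382    8.7%
[not preserved]     377    8.6%
[not preserved]     366    8.3%
[not preserved]     318    7.2%
[not preserved]     293    6.7%
[not preserved]     232    5.3%
[not preserved]     67     1.5%
[not preserved]     65     1.5%
[not preserved]     64     1.5%
[not preserved]     50     1.1%
Other values (323)  2179   49.6%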

등록일자 (registration date)
DateTime

Distinct: 142
Distinct (%): 23.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
Minimum: 2013-07-09 00:00:00
Maximum: 2023-03-20 00:00:00
Histogram with fixed size bins (bins=50)
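
For a sense of scale, the 50 fixed-size bins can be related to the date range. The sketch below assumes equal-width bins spanning the reported minimum and maximum dates; the variable names are illustrative, and the report's exact bin edges are not shown in the source.

```python
from datetime import datetime

# Minimum and maximum registration dates from the report.
start = datetime(2013, 7, 9)
end = datetime(2023, 3, 20)
bins = 50

span = end - start        # total range as a timedelta
bin_width = span / bins   # width of one equal-size bin

# bins + 1 edges, from the earliest date to the latest.
edges = [start + i * bin_width for i in range(bins + 1)]

print(span.days, bin_width)
```

Under this assumption each bin covers a little under 71 days of registrations.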

운영구분 (operating status)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.8 KiB
운영중 (in operation): 595

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 운영중 (in operation)
2nd row: 운영중
3rd row: 운영중
4th row: 운영중
5th row: 운영중

Common Values

Value   Count  Frequency (%)
운영중  595    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count  Frequency (%)
운영중  595    100.0%

전화번호 (telephone number)
Text

MISSING

Distinct: 141
Distinct (%): 99.3%
Missing: 453
Missing (%): 76.1%
Memory size: 4.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.007042
Min length: 12

Characters and Unicode

Total characters: 1705
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 140
Unique (%): 98.6%

Sample

1st row: 055-646-5153
2nd row: 055-637-0120
3rd row: 055-343-0996
4th row: 055-283-2831
5th row: 055-543-2215
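
The sample values share one layout: the 055 area code, a three-digit exchange, and a four-digit line number, 12 characters in total. A hedged sketch of a format check follows. The pattern is an assumption drawn from the sample rows and the reported lengths (12 to 13 characters), not a rule stated by the dataset, and `is_valid_phone` is a hypothetical helper.

```python
import re

# Assumed layout: "055-", then 3 or 4 digits, "-", then 4 digits.
# The 4-digit middle group allows for the single 13-character value.
PHONE_RE = re.compile(r"055-\d{3,4}-\d{4}")


def is_valid_phone(value):
    """Return True if value matches the assumed layout; None counts as invalid."""
    return value is not None and PHONE_RE.fullmatch(value) is not None


samples = ["055-646-5153", "055-637-0120", "055-343-0996", None, "055-6465153"]
results = [is_valid_phone(s) for s in samples]
print(results)
```

Missing values are treated as invalid here; with 453 of 595 values missing, a real check would likely report them separately.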

Value               Count  Frequency (%)
055-637-0120        2      1.4%
055-388-7788        1      0.7%
055-332-1462        1      0.7%
055-255-9907        1      0.7%
055-288-9930        1      0.7%
055-336-5581        1      0.7%
055-333-6773        1      0.7%
055-339-4060        1      0.7%
055-372-4542        1      0.7%
055-388-0647        1      0.7%
Other values (131)  131    92.3%

Most occurring characters

Value  Count  Frequency (%)
5      403    23.6%
-      284    16.7%
0      214    12.6%
3      158    9.3%
2      121    7.1%
4      104    6.1%
7      103    6.0%
8      100    5.9%
6      92     5.4%
9      65     3.8%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    1421   83.3%
Dash Punctuation  284    16.7%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
5      403    28.4%
0      214    15.1%
3      158    11.1%
2      121    8.5%
4      104    7.3%
7      103    7.2%
8      100    7.0%
6      92     6.5%
9      65     4.6%
1      61     4.3%

Dash Punctuation

Value  Count  Frequency (%)
-      284    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  1705   100.0%

Most frequent character per script

Common

Value  Count  Frequency (%)
5      403    23.6%
-      284    16.7%
0      214    12.6%
3      158    9.3%
2      121    7.1%
4      104    6.1%
7      103    6.0%
8      100    5.9%
6      92     5.4%
9      65     3.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  1705   100.0%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
5      403    23.6%
-      284    16.7%
0      214    12.6%
3      158    9.3%
2      121    7.1%
4      104    6.1%
7      103    6.0%
8      100    5.9%
6      92     5.4%
9      65     3.8%

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Columns: 번호 (serial number), 법인명 (corporate name), 등록일자 (registration date), 운영구분 (operating status), 전화번호 (telephone number). The 운영구분 value 운영중 means "in operation".

First rows

index  번호  법인명                                  등록일자    운영구분  전화번호
0      1     인셀덤어썸                              2023-03-20  운영중    <NA>
1      2     인셀덤 경남셀대리점                     2023-03-20  운영중    <NA>
2      3     다올인셀덤                              2023-03-20  운영중    <NA>
3      4     인셀덤 리치 대리점                      2023-03-20  운영중    <NA>
4      5     인셀덤 그린썸                           2023-03-20  운영중    <NA>
5      6     아모레퍼시픽 뉴통영점                   2023-03-20  운영중    055-646-5153
6      7     아모레퍼시픽 창원센터 예진점            2023-03-20  운영중    <NA>
7      8     아모레퍼시픽 창원센터 합포점            2023-03-20  운영중    <NA>
8      9     엑소바이옴코리아안우지점                2023-03-09  운영중    <NA>
9      10    브리앙뜨 인셀덤 (BRILLANT INCELLDERM)   2023-03-08  운영중    <NA>

Last rows

index  번호  법인명                  등록일자    운영구분  전화번호
585    586   마산생활건강            2013-07-23  운영중    055-241-5302
586    587   건강생활 통영지점       2013-07-23  운영중    055-649-1665
587    588   건강생활거제지점        2013-07-23  운영중    055-635-4285
588    589   세원건강생활(풀무원)    2013-07-23  운영중    055-311-1515
589    590   한국화장품 남해지사     2013-07-18  운영중    055-864-7973
590    591   한국화장품 진주지사     2013-07-15  운영중    055-747-6668
591    592   쥬단학 화장품           2013-07-15  운영중    055-638-2473
592    593   쥬단학 충무대리점       2013-07-09  운영중    055-644-7338
593    594   쥬단학 통영대리점       2013-07-09  운영중    055-646-3247
594    595   쥬단학 거창방판대리점   2013-07-09  운영중    055-944-1866