Overview

Dataset statistics

Number of variables: 4
Number of observations: 116
Missing cells: 47
Missing cells (%): 10.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.9 KiB
Average record size in memory: 34.1 B
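
The missing-cell percentage can be checked from the counts in this section: 47 missing cells out of 116 observations × 4 variables. A minimal Python sketch of that arithmetic follows; the function name `missing_cell_pct` is illustrative and is not part of ydata-profiling.

```python
def missing_cell_pct(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_rows * n_cols
    return 100.0 * n_missing / total_cells


# Figures taken from the dataset statistics in this report.
pct = missing_cell_pct(n_missing=47, n_rows=116, n_cols=4)
print(f"{pct:.1f}%")
```

The result rounds to the 10.1% shown in the report.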

Variable types

Numeric: 1
Text: 3

Dataset

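Description: Data on the registration status of door-to-door sales businesses in Namdong-gu, Incheon Metropolitan City. It provides the serial number (연번), corporation or trade name (법인또는상호), business address (소재지주소), and business telephone number (소재지전화번호).
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15067141&srcSe=7661IVAWM27C61E190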

Alerts

소재지전화번호 has 47 (40.5%) missing values (Missing)
번호 has unique values (Unique)
법인또는상호 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 13:48:34.522189
Analysis finished: 2024-01-28 13:48:35.019171
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
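
The reported duration follows from the two timestamps. A small sketch using only the standard library, with the timestamp strings copied from this section:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps as printed in the Reproduction section.
started = datetime.strptime("2024-01-28 13:48:34.522189", FMT)
finished = datetime.strptime("2024-01-28 13:48:35.019171", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.1f} seconds")
```

The difference is just under half a second, which the report rounds to 0.5 seconds.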

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 116
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.5
Minimum: 1
Maximum: 116
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.75
Q1: 29.75
Median: 58.5
Q3: 87.25
95-th percentile: 110.25
Maximum: 116
Range: 115
Interquartile range (IQR): 57.5
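
These quantiles are consistent with linear interpolation between order statistics. The sketch below assumes that 번호 is the integer sequence 1 to 116 (which matches the reported minimum, maximum and distinct count) and that the report interpolates linearly, as pandas does by default. The helper `percentile` is illustrative, not the tool's own code.

```python
def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in [0, 100]) of pre-sorted data."""
    pos = (len(sorted_values) - 1) * q / 100.0
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Assumption: 번호 runs from 1 to 116 with no gaps.
values = list(range(1, 117))

p5 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```

Under those assumptions the values agree with the table: 6.75, 29.75, 58.5, 87.25, 110.25, and an IQR of 57.5.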

Descriptive statistics

Standard deviation: 33.630343
Coefficient of variation (CV): 0.57487767
Kurtosis: -1.2
Mean: 58.5
Median Absolute Deviation (MAD): 29
Skewness: 0
Sum: 6786
Variance: 1131
Monotonicity: Strictly increasing
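
Several of these figures can be reproduced with Python's `statistics` module. The sketch assumes that 번호 is the integer sequence 1 to 116 and that the variance is the sample variance (n - 1 denominator); both assumptions are consistent with the reported sum of 6786 and variance of 1131.

```python
import statistics

# Assumption: 번호 runs from 1 to 116 with no gaps.
values = list(range(1, 117))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

Skewness and kurtosis are left out because the standard library has no built-in functions for them.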
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.9%
75 | 1 | 0.9%
87 | 1 | 0.9%
86 | 1 | 0.9%
85 | 1 | 0.9%
84 | 1 | 0.9%
83 | 1 | 0.9%
82 | 1 | 0.9%
81 | 1 | 0.9%
80 | 1 | 0.9%
Other values (106) | 106 | 91.4%

Value | Count | Frequency (%)
1 | 1 | 0.9%
2 | 1 | 0.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
5 | 1 | 0.9%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
9 | 1 | 0.9%
10 | 1 | 0.9%

Value | Count | Frequency (%)
116 | 1 | 0.9%
115 | 1 | 0.9%
114 | 1 | 0.9%
113 | 1 | 0.9%
112 | 1 | 0.9%
111 | 1 | 0.9%
110 | 1 | 0.9%
109 | 1 | 0.9%
108 | 1 | 0.9%
107 | 1 | 0.9%

법인또는상호
Text

UNIQUE 

Distinct: 116
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 18
Median length: 14
Mean length: 7.7068966
Min length: 2

Characters and Unicode

Total characters: 894
Distinct characters: 241
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
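
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's `unicodedata` module. The mapping `CATEGORY_NAMES` and the function `category_counts` are illustrative helpers, not ydata-profiling APIs. The input is only the five sample rows shown for 법인또는상호, so the counts do not match the full-column totals.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator", "Ps": "Open Punctuation",
    "Pe": "Close Punctuation", "Pd": "Dash Punctuation",
    "Po": "Other Punctuation", "So": "Other Symbol",
}


def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample rows of 법인또는상호 (not the full column).
sample = ["바디닥터 비움사랑", "바디닥터", "주식회사 자연속요정", "주식회사 위스토리", "마임인천서창지사"]
sample_counts = category_counts(sample)
print(sample_counts)
```

Hangul syllables fall under "Other Letter" because Hangul has no upper or lower case, which is why that category dominates the text variables in this report.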

Unique

Unique: 116
Unique (%): 100.0%

Sample

1st row: 바디닥터 비움사랑
2nd row: 바디닥터
3rd row: 주식회사 자연속요정
4th row: 주식회사 위스토리
5th row: 마임인천서창지사
Value | Count | Frequency (%)
주식회사 | 14 | 8.5%
마임 | 3 | 1.8%
바디닥터 | 2 | 1.2%
알즈너 | 2 | 1.2%
인천 | 2 | 1.2%
건강마을 | 1 | 0.6%
문화쇼핑 | 1 | 0.6%
모래내 | 1 | 0.6%
중심코어 | 1 | 0.6%
오름정보통신 | 1 | 0.6%
Other values (137) | 137 | 83.0%

Most occurring characters

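Value | Count | Frequency (%)
(space) | 49 | 5.5%
(not preserved) | 31 | 3.5%
(not preserved) | 30 | 3.4%
(not preserved) | 28 | 3.1%
(not preserved) | 23 | 2.6%
(not preserved) | 22 | 2.5%
(not preserved) | 21 | 2.3%
(not preserved) | 17 | 1.9%
(not preserved) | 17 | 1.9%
(not preserved) | 16 | 1.8%
Other values (231) | 640 | 71.6%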

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 788 | 88.1%
Space Separator | 49 | 5.5%
Close Punctuation | 15 | 1.7%
Open Punctuation | 15 | 1.7%
Uppercase Letter | 15 | 1.7%
Lowercase Letter | 10 | 1.1%
Other Symbol | 1 | 0.1%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

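Other Letter
Value | Count | Frequency (%)
(not preserved) | 31 | 3.9%
(not preserved) | 30 | 3.8%
(not preserved) | 28 | 3.6%
(not preserved) | 23 | 2.9%
(not preserved) | 22 | 2.8%
(not preserved) | 21 | 2.7%
(not preserved) | 17 | 2.2%
(not preserved) | 17 | 2.2%
(not preserved) | 16 | 2.0%
(not preserved) | 15 | 1.9%
Other values (207) | 568 | 72.1%

Uppercase Letter
Value | Count | Frequency (%)
N | 2 | 13.3%
E | 2 | 13.3%
C | 2 | 13.3%
Z | 1 | 6.7%
G | 1 | 6.7%
M | 1 | 6.7%
T | 1 | 6.7%
R | 1 | 6.7%
S | 1 | 6.7%
J | 1 | 6.7%
Other values (2) | 2 | 13.3%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 20.0%
s | 2 | 20.0%
y | 2 | 20.0%
h | 1 | 10.0%
t | 1 | 10.0%
n | 1 | 10.0%
o | 1 | 10.0%

Space Separator
Value | Count | Frequency (%)
(space) | 49 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 15 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 15 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%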

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 789 | 88.3%
Common | 80 | 8.9%
Latin | 25 | 2.8%

Most frequent character per script

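Hangul
Value | Count | Frequency (%)
(not preserved) | 31 | 3.9%
(not preserved) | 30 | 3.8%
(not preserved) | 28 | 3.5%
(not preserved) | 23 | 2.9%
(not preserved) | 22 | 2.8%
(not preserved) | 21 | 2.7%
(not preserved) | 17 | 2.2%
(not preserved) | 17 | 2.2%
(not preserved) | 16 | 2.0%
(not preserved) | 15 | 1.9%
Other values (208) | 569 | 72.1%

Latin
Value | Count | Frequency (%)
N | 2 | 8.0%
E | 2 | 8.0%
C | 2 | 8.0%
e | 2 | 8.0%
s | 2 | 8.0%
y | 2 | 8.0%
h | 1 | 4.0%
t | 1 | 4.0%
Z | 1 | 4.0%
G | 1 | 4.0%
Other values (9) | 9 | 36.0%

Common
Value | Count | Frequency (%)
(space) | 49 | 61.3%
) | 15 | 18.8%
( | 15 | 18.8%
- | 1 | 1.2%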

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 788 | 88.1%
ASCII | 105 | 11.7%
None | 1 | 0.1%

Most frequent character per block

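ASCII
Value | Count | Frequency (%)
(space) | 49 | 46.7%
) | 15 | 14.3%
( | 15 | 14.3%
N | 2 | 1.9%
E | 2 | 1.9%
C | 2 | 1.9%
e | 2 | 1.9%
s | 2 | 1.9%
y | 2 | 1.9%
h | 1 | 1.0%
Other values (13) | 13 | 12.4%

Hangul
Value | Count | Frequency (%)
(not preserved) | 31 | 3.9%
(not preserved) | 30 | 3.8%
(not preserved) | 28 | 3.6%
(not preserved) | 23 | 2.9%
(not preserved) | 22 | 2.8%
(not preserved) | 21 | 2.7%
(not preserved) | 17 | 2.2%
(not preserved) | 17 | 2.2%
(not preserved) | 16 | 2.0%
(not preserved) | 15 | 1.9%
Other values (207) | 568 | 72.1%

None
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%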

소재지주소
Text

Distinct: 112
Distinct (%): 96.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 56
Median length: 44
Mean length: 34.543103
Min length: 22

Characters and Unicode

Total characters: 4007
Distinct characters: 166
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 108
Unique (%): 93.1%

Sample

1st row: 인천광역시 남동구 석산로 35, 104동 2202호 (간석동, 간석마을 풍림아이원아파트)
2nd row: 인천광역시 남동구 논고개로 81, 7층 704호 (논현동)
3rd row: 인천광역시 남동구 인주대로591번길 60, 3층 (구월동)
4th row: 인천광역시 남동구 구월로 293, 1층 101호 (만수동)
5th row: 인천광역시 남동구 서창남순환로216번길 30, 304호 (서창동)
Value | Count | Frequency (%)
인천광역시 | 116 | 15.1%
남동구 | 116 | 15.1%
구월동 | 30 | 3.9%
간석동 | 22 | 2.9%
만수동 | 14 | 1.8%
2층 | 14 | 1.8%
1층 | 13 | 1.7%
만수동, | 10 | 1.3%
논현동 | 9 | 1.2%
백범로 | 8 | 1.0%
Other values (305) | 417 | 54.2%

Most occurring characters

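Value | Count | Frequency (%)
(space) | 653 | 16.3%
(not preserved) | 269 | 6.7%
(not preserved) | 166 | 4.1%
1 | 144 | 3.6%
(not preserved) | 140 | 3.5%
(not preserved) | 139 | 3.5%
(not preserved) | 124 | 3.1%
(not preserved) | 124 | 3.1%
(not preserved) | 121 | 3.0%
(not preserved) | 118 | 2.9%
Other values (156) | 2009 | 50.1%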

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2238 | 55.9%
Decimal Number | 726 | 18.1%
Space Separator | 653 | 16.3%
Other Punctuation | 124 | 3.1%
Close Punctuation | 117 | 2.9%
Open Punctuation | 117 | 2.9%
Dash Punctuation | 17 | 0.4%
Uppercase Letter | 15 | 0.4%

Most frequent character per category

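Other Letter
Value | Count | Frequency (%)
(not preserved) | 269 | 12.0%
(not preserved) | 166 | 7.4%
(not preserved) | 140 | 6.3%
(not preserved) | 139 | 6.2%
(not preserved) | 124 | 5.5%
(not preserved) | 121 | 5.4%
(not preserved) | 118 | 5.3%
(not preserved) | 117 | 5.2%
(not preserved) | 116 | 5.2%
(not preserved) | 67 | 3.0%
Other values (135) | 861 | 38.5%

Decimal Number
Value | Count | Frequency (%)
1 | 144 | 19.8%
2 | 109 | 15.0%
0 | 98 | 13.5%
3 | 91 | 12.5%
5 | 65 | 9.0%
4 | 62 | 8.5%
8 | 47 | 6.5%
7 | 39 | 5.4%
6 | 38 | 5.2%
9 | 33 | 4.5%

Uppercase Letter
Value | Count | Frequency (%)
B | 5 | 33.3%
T | 3 | 20.0%
K | 3 | 20.0%
D | 2 | 13.3%
C | 1 | 6.7%
A | 1 | 6.7%

Space Separator
Value | Count | Frequency (%)
(space) | 653 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
(not preserved) | 124 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 117 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 117 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 17 | 100.0%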

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2238 | 55.9%
Common | 1754 | 43.8%
Latin | 15 | 0.4%

Most frequent character per script

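Hangul
Value | Count | Frequency (%)
(not preserved) | 269 | 12.0%
(not preserved) | 166 | 7.4%
(not preserved) | 140 | 6.3%
(not preserved) | 139 | 6.2%
(not preserved) | 124 | 5.5%
(not preserved) | 121 | 5.4%
(not preserved) | 118 | 5.3%
(not preserved) | 117 | 5.2%
(not preserved) | 116 | 5.2%
(not preserved) | 67 | 3.0%
Other values (135) | 861 | 38.5%

Common
Value | Count | Frequency (%)
(space) | 653 | 37.2%
1 | 144 | 8.2%
(not preserved) | 124 | 7.1%
) | 117 | 6.7%
( | 117 | 6.7%
2 | 109 | 6.2%
0 | 98 | 5.6%
3 | 91 | 5.2%
5 | 65 | 3.7%
4 | 62 | 3.5%
Other values (5) | 174 | 9.9%

Latin
Value | Count | Frequency (%)
B | 5 | 33.3%
T | 3 | 20.0%
K | 3 | 20.0%
D | 2 | 13.3%
C | 1 | 6.7%
A | 1 | 6.7%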

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2238 | 55.9%
ASCII | 1645 | 41.1%
None | 124 | 3.1%

Most frequent character per block

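ASCII
Value | Count | Frequency (%)
(space) | 653 | 39.7%
1 | 144 | 8.8%
) | 117 | 7.1%
( | 117 | 7.1%
2 | 109 | 6.6%
0 | 98 | 6.0%
3 | 91 | 5.5%
5 | 65 | 4.0%
4 | 62 | 3.8%
8 | 47 | 2.9%
Other values (10) | 142 | 8.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 269 | 12.0%
(not preserved) | 166 | 7.4%
(not preserved) | 140 | 6.3%
(not preserved) | 139 | 6.2%
(not preserved) | 124 | 5.5%
(not preserved) | 121 | 5.4%
(not preserved) | 118 | 5.3%
(not preserved) | 117 | 5.2%
(not preserved) | 116 | 5.2%
(not preserved) | 67 | 3.0%
Other values (135) | 861 | 38.5%

None
Value | Count | Frequency (%)
(not preserved) | 124 | 100.0%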

소재지전화번호
Text

MISSING 

Distinct: 69
Distinct (%): 100.0%
Missing: 47
Missing (%): 40.5%
Memory size: 1.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.927536
Min length: 9

Characters and Unicode

Total characters: 823
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 69
Unique (%): 100.0%

Sample

1st row: 02-1566-4029
2nd row: 032-467-7020
3rd row: 032-465-6002
4th row: 032-428-5552
5th row: 032-434-3360
Value | Count | Frequency (%)
032-202-1333 | 1 | 1.4%
032-464-3381 | 1 | 1.4%
032-816-7071 | 1 | 1.4%
032-819-9999 | 1 | 1.4%
032-821-7476 | 1 | 1.4%
032-815-3911 | 1 | 1.4%
070-7725-3360 | 1 | 1.4%
080-234-2899 | 1 | 1.4%
032-425-0404 | 1 | 1.4%
032-762-3759 | 1 | 1.4%
Other values (59) | 59 | 85.5%

Most occurring characters

Value | Count | Frequency (%)
- | 135 | 16.4%
0 | 129 | 15.7%
2 | 124 | 15.1%
3 | 108 | 13.1%
4 | 73 | 8.9%
1 | 48 | 5.8%
6 | 43 | 5.2%
5 | 43 | 5.2%
8 | 43 | 5.2%
7 | 43 | 5.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 688 | 83.6%
Dash Punctuation | 135 | 16.4%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 129 | 18.8%
2 | 124 | 18.0%
3 | 108 | 15.7%
4 | 73 | 10.6%
1 | 48 | 7.0%
6 | 43 | 6.2%
5 | 43 | 6.2%
8 | 43 | 6.2%
7 | 43 | 6.2%
9 | 34 | 4.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 135 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 823 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 135 | 16.4%
0 | 129 | 15.7%
2 | 124 | 15.1%
3 | 108 | 13.1%
4 | 73 | 8.9%
1 | 48 | 5.8%
6 | 43 | 5.2%
5 | 43 | 5.2%
8 | 43 | 5.2%
7 | 43 | 5.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 823 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 135 | 16.4%
0 | 129 | 15.7%
2 | 124 | 15.1%
3 | 108 | 13.1%
4 | 73 | 8.9%
1 | 48 | 5.8%
6 | 43 | 5.2%
5 | 43 | 5.2%
8 | 43 | 5.2%
7 | 43 | 5.2%

Interactions


Correlations

(variable) | 번호 | 소재지전화번호
번호 | 1.000 | 1.000
소재지전화번호 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
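
The per-column nullity that these plots summarise comes down to a count of missing entries. A minimal sketch follows, using `None` to stand for `<NA>`; the helper `missing_summary` is illustrative and is not the tool's own code. The input is the 소재지전화번호 column for the first ten rows of the Sample table.

```python
def missing_summary(column):
    """Return (missing count, missing percentage) for a list that uses None as NA."""
    n_missing = sum(1 for value in column if value is None)
    return n_missing, 100.0 * n_missing / len(column)


# 소재지전화번호 for the first ten sample rows; None stands for <NA>.
phone_head = [
    None, None, "02-1566-4029", "032-467-7020", None,
    None, "032-465-6002", "032-428-5552", "032-434-3360", None,
]

head_missing, head_pct = missing_summary(phone_head)
print(head_missing, head_pct)

# Full-column figure from the counts in this report: 47 missing of 116 rows.
full_pct = 100.0 * 47 / 116
print(f"{full_pct:.1f}%")
```

Five of the first ten rows have no phone number, a higher share than the 40.5% reported for the whole column.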

Sample

(index) | 번호 | 법인또는상호 | 소재지주소 | 소재지전화번호
0 | 1 | 바디닥터 비움사랑 | 인천광역시 남동구 석산로 35, 104동 2202호 (간석동, 간석마을 풍림아이원아파트) | <NA>
1 | 2 | 바디닥터 | 인천광역시 남동구 논고개로 81, 7층 704호 (논현동) | <NA>
2 | 3 | 주식회사 자연속요정 | 인천광역시 남동구 인주대로591번길 60, 3층 (구월동) | 02-1566-4029
3 | 4 | 주식회사 위스토리 | 인천광역시 남동구 구월로 293, 1층 101호 (만수동) | 032-467-7020
4 | 5 | 마임인천서창지사 | 인천광역시 남동구 서창남순환로216번길 30, 304호 (서창동) | <NA>
5 | 6 | 인셀덤 지브로점 | 인천광역시 남동구 인주대로591번길 74, 1201호 (구월동) | <NA>
6 | 7 | 에스케이브로드밴드 | 인천광역시 남동구 만수서로105번길 40-18, 1102동 102호 (만수동, 만수주공11단지아파트) | 032-465-6002
7 | 8 | 스워커코리아 | 인천광역시 남동구 문화로 153, 2층 (구월동) | 032-428-5552
8 | 9 | 르노코리아자동차 남동대리점 | 인천광역시 남동구 백범로 301, 1층 (간석동) | 032-434-3360
9 | 10 | 해인선원 | 인천광역시 남동구 복개서로89번길 54(구월동) | <NA>

(index) | 번호 | 법인또는상호 | 소재지주소 | 소재지전화번호
106 | 107 | 유니베라 | 인천광역시 남동구 장승남로 50, 301호 (만수동, 부건프라자) | 032-467-2855
107 | 108 | 현대남동판매대리점 | 인천광역시 남동구 청능대로289번길 73, 102호 (고잔동) | 032-821-5544
108 | 109 | 생그린인천지사 | 인천광역시 남동구 인주대로888번길 27 (만수동, 창대장터상가 제1동108호) | 032-468-8450
109 | 110 | 기아 간석대리점 | 인천광역시 남동구 남동대로 938 (간석동) | 032-424-4488
110 | 111 | GM쉐보레남동구대리점 | 인천광역시 남동구 선수촌공원로 1, D동 1층 (구월동) | 032-429-3000
111 | 112 | 현대자동차문학대리점 | 인천광역시 남동구 인하로 523 (구월동) | 032-424-5949
112 | 113 | 현대서창판매대리점 | 인천광역시 남동구 소래로 624 (만수동) | 032-472-2002
113 | 114 | 기아간석사거리대리점 | 인천광역시 남동구 석산로 248 (구월동, 외2필지) | 032-472-5656
114 | 115 | 기아주공대리점 | 인천광역시 남동구 구월로 336 (만수동) | 032-466-5505
115 | 116 | ㈜현대교육개발원 | 인천광역시 남동구 논고개로123번길 35, 제에이804호 (논현동) | 032-431-9939