Overview

Dataset statistics

Number of variables3
Number of observations8342
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows40
Duplicate rows (%)0.5%
Total size in memory195.6 KiB
Average record size in memory24.0 B

Variable types

Text2
Categorical1

Dataset

Description인천광역시 미추홀구에 신고된 통신판매업 업체 현황에 대한 데이터로서, 인터넷 통신판매업체의 상호명, 소재지주소, 데이터기준일 등의 항목을 제공합니다.
Author인천광역시 미추홀구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15016234&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일 has constant value ""Constant
Dataset has 40 (0.5%) duplicate rowsDuplicates

Reproduction

Analysis started2024-01-28 07:04:43.069131
Analysis finished2024-01-28 07:04:43.694214
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct8192
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size65.3 KiB
2024-01-28T16:04:43.914815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length43
Mean length6.4978422
Min length1

Characters and Unicode

Total characters54205
Distinct characters1094
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8061 ?
Unique (%)96.6%

Sample

1st row성지 홀딩스(주04)
2nd row모네 글로벌
3rd row디지커넥트
4th row엔젤퀸우먼
5th row중구컴퍼니
ValueCountFrequency (%)
주식회사 438
 
4.2%
컴퍼니 39
 
0.4%
37
 
0.4%
인셀덤 35
 
0.3%
company 31
 
0.3%
23
 
0.2%
스튜디오 18
 
0.2%
스토어 15
 
0.1%
코리아 13
 
0.1%
인천 12
 
0.1%
Other values (9115) 9855
93.7%
2024-01-28T16:04:44.312682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2195
 
4.0%
2021
 
3.7%
1497
 
2.8%
( 1317
 
2.4%
) 1317
 
2.4%
928
 
1.7%
793
 
1.5%
772
 
1.4%
706
 
1.3%
593
 
1.1%
Other values (1084) 42066
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39979
73.8%
Lowercase Letter 4653
 
8.6%
Uppercase Letter 4005
 
7.4%
Space Separator 2195
 
4.0%
Open Punctuation 1318
 
2.4%
Close Punctuation 1318
 
2.4%
Decimal Number 455
 
0.8%
Other Punctuation 194
 
0.4%
Dash Punctuation 42
 
0.1%
Other Symbol 30
 
0.1%
Other values (2) 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2021
 
5.1%
1497
 
3.7%
928
 
2.3%
793
 
2.0%
772
 
1.9%
706
 
1.8%
593
 
1.5%
568
 
1.4%
536
 
1.3%
527
 
1.3%
Other values (999) 31038
77.6%
Lowercase Letter
ValueCountFrequency (%)
e 528
11.3%
o 479
 
10.3%
a 461
 
9.9%
n 376
 
8.1%
i 329
 
7.1%
l 292
 
6.3%
r 266
 
5.7%
t 253
 
5.4%
s 225
 
4.8%
m 192
 
4.1%
Other values (16) 1252
26.9%
Uppercase Letter
ValueCountFrequency (%)
A 346
 
8.6%
O 296
 
7.4%
E 257
 
6.4%
S 252
 
6.3%
M 237
 
5.9%
N 224
 
5.6%
I 222
 
5.5%
T 215
 
5.4%
C 208
 
5.2%
L 187
 
4.7%
Other values (16) 1561
39.0%
Other Punctuation
ValueCountFrequency (%)
. 80
41.2%
& 66
34.0%
' 15
 
7.7%
? 10
 
5.2%
: 7
 
3.6%
/ 6
 
3.1%
# 3
 
1.5%
· 2
 
1.0%
% 2
 
1.0%
1
 
0.5%
Other values (2) 2
 
1.0%
Decimal Number
ValueCountFrequency (%)
1 105
23.1%
2 68
14.9%
0 57
12.5%
9 47
10.3%
3 36
 
7.9%
8 33
 
7.3%
7 31
 
6.8%
4 29
 
6.4%
5 26
 
5.7%
6 23
 
5.1%
Math Symbol
ValueCountFrequency (%)
+ 3
60.0%
< 1
 
20.0%
> 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 1317
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1317
99.9%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
2195
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Other Symbol
ValueCountFrequency (%)
30
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39993
73.8%
Latin 8658
 
16.0%
Common 5538
 
10.2%
Han 16
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2021
 
5.1%
1497
 
3.7%
928
 
2.3%
793
 
2.0%
772
 
1.9%
706
 
1.8%
593
 
1.5%
568
 
1.4%
536
 
1.3%
527
 
1.3%
Other values (985) 31052
77.6%
Latin
ValueCountFrequency (%)
e 528
 
6.1%
o 479
 
5.5%
a 461
 
5.3%
n 376
 
4.3%
A 346
 
4.0%
i 329
 
3.8%
O 296
 
3.4%
l 292
 
3.4%
r 266
 
3.1%
E 257
 
3.0%
Other values (42) 5028
58.1%
Common
ValueCountFrequency (%)
2195
39.6%
( 1317
23.8%
) 1317
23.8%
1 105
 
1.9%
. 80
 
1.4%
2 68
 
1.2%
& 66
 
1.2%
0 57
 
1.0%
9 47
 
0.8%
- 42
 
0.8%
Other values (22) 244
 
4.4%
Han
ValueCountFrequency (%)
2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (5) 5
31.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39963
73.7%
ASCII 14193
 
26.2%
None 33
 
0.1%
CJK 16
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2195
 
15.5%
( 1317
 
9.3%
) 1317
 
9.3%
e 528
 
3.7%
o 479
 
3.4%
a 461
 
3.2%
n 376
 
2.6%
A 346
 
2.4%
i 329
 
2.3%
O 296
 
2.1%
Other values (72) 6549
46.1%
Hangul
ValueCountFrequency (%)
2021
 
5.1%
1497
 
3.7%
928
 
2.3%
793
 
2.0%
772
 
1.9%
706
 
1.8%
593
 
1.5%
568
 
1.4%
536
 
1.3%
527
 
1.3%
Other values (984) 31022
77.6%
None
ValueCountFrequency (%)
30
90.9%
· 2
 
6.1%
1
 
3.0%
CJK
ValueCountFrequency (%)
2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (5) 5
31.2%
Distinct4387
Distinct (%)52.6%
Missing1
Missing (%)< 0.1%
Memory size65.3 KiB
2024-01-28T16:04:44.567445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length20.730248
Min length12

Characters and Unicode

Total characters172911
Distinct characters177
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3365 ?
Unique (%)40.3%

Sample

1st row인천광역시 미추홀구 주안로 75
2nd row인천광역시 미추홀구 매소홀로 68
3rd row인천광역시 미추홀구 경인로 38
4th row인천광역시 미추홀구 매소홀로 340
5th row인천광역시 미추홀구 주안로 75
ValueCountFrequency (%)
인천광역시 8342
24.9%
미추홀구 8328
24.8%
경인로 469
 
1.4%
주안로 340
 
1.0%
경원대로 273
 
0.8%
매소홀로 225
 
0.7%
석정로 207
 
0.6%
33 202
 
0.6%
소성로 184
 
0.5%
염전로 158
 
0.5%
Other values (2459) 14789
44.1%
2024-01-28T16:04:44.910334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25965
 
15.0%
10028
 
5.8%
9000
 
5.2%
8684
 
5.0%
8683
 
5.0%
8451
 
4.9%
8381
 
4.8%
8347
 
4.8%
8342
 
4.8%
8342
 
4.8%
Other values (167) 68688
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 113046
65.4%
Decimal Number 32047
 
18.5%
Space Separator 25965
 
15.0%
Dash Punctuation 1831
 
1.1%
Uppercase Letter 13
 
< 0.1%
Other Punctuation 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10028
 
8.9%
9000
 
8.0%
8684
 
7.7%
8683
 
7.7%
8451
 
7.5%
8381
 
7.4%
8347
 
7.4%
8342
 
7.4%
8342
 
7.4%
7585
 
6.7%
Other values (148) 27203
24.1%
Decimal Number
ValueCountFrequency (%)
1 6056
18.9%
2 4403
13.7%
3 4170
13.0%
4 3255
10.2%
8 2685
8.4%
5 2679
8.4%
6 2636
8.2%
7 2234
 
7.0%
0 1987
 
6.2%
9 1942
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
D 5
38.5%
B 5
38.5%
T 1
 
7.7%
I 1
 
7.7%
A 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
/ 8
88.9%
@ 1
 
11.1%
Space Separator
ValueCountFrequency (%)
25965
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1831
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 113046
65.4%
Common 59852
34.6%
Latin 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10028
 
8.9%
9000
 
8.0%
8684
 
7.7%
8683
 
7.7%
8451
 
7.5%
8381
 
7.4%
8347
 
7.4%
8342
 
7.4%
8342
 
7.4%
7585
 
6.7%
Other values (148) 27203
24.1%
Common
ValueCountFrequency (%)
25965
43.4%
1 6056
 
10.1%
2 4403
 
7.4%
3 4170
 
7.0%
4 3255
 
5.4%
8 2685
 
4.5%
5 2679
 
4.5%
6 2636
 
4.4%
7 2234
 
3.7%
0 1987
 
3.3%
Other values (4) 3782
 
6.3%
Latin
ValueCountFrequency (%)
D 5
38.5%
B 5
38.5%
T 1
 
7.7%
I 1
 
7.7%
A 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 113046
65.4%
ASCII 59865
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25965
43.4%
1 6056
 
10.1%
2 4403
 
7.4%
3 4170
 
7.0%
4 3255
 
5.4%
8 2685
 
4.5%
5 2679
 
4.5%
6 2636
 
4.4%
7 2234
 
3.7%
0 1987
 
3.3%
Other values (9) 3795
 
6.3%
Hangul
ValueCountFrequency (%)
10028
 
8.9%
9000
 
8.0%
8684
 
7.7%
8683
 
7.7%
8451
 
7.5%
8381
 
7.4%
8347
 
7.4%
8342
 
7.4%
8342
 
7.4%
7585
 
6.7%
Other values (148) 27203
24.1%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size65.3 KiB
2023-05-11
8342 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-05-11
2nd row2023-05-11
3rd row2023-05-11
4th row2023-05-11
5th row2023-05-11

Common Values

ValueCountFrequency (%)
2023-05-11 8342
100.0%

Length

2024-01-28T16:04:45.006985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T16:04:45.075830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-05-11 8342
100.0%

Missing values

2024-01-28T16:04:43.604161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T16:04:43.664344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호명소재지주소데이터기준일
0성지 홀딩스(주04)인천광역시 미추홀구 주안로 752023-05-11
1모네 글로벌인천광역시 미추홀구 매소홀로 682023-05-11
2디지커넥트인천광역시 미추홀구 경인로 382023-05-11
3엔젤퀸우먼인천광역시 미추홀구 매소홀로 3402023-05-11
4중구컴퍼니인천광역시 미추홀구 주안로 752023-05-11
5냠냠버터인천광역시 미추홀구 주안로 752023-05-11
6윈트인천광역시 미추홀구 석정로351번길 122023-05-11
7네일리네일인천광역시 미추홀구 수봉남로18번길 152023-05-11
8대성인인천광역시 미추홀구 신기길30번길 53-172023-05-11
9유포릭인천광역시 미추홀구 경인로176번길 32-172023-05-11
상호명소재지주소데이터기준일
8332지혜로운삶인천광역시 미추홀구 주안6동 1585-32023-05-11
8333㈜대성하이테크인천광역시 미추홀구 주안5동 17-12023-05-11
8334태솔인천계양지사인천광역시 미추홀구 주안5동 23-42023-05-11
8335그린메디칼인천광역시 미추홀구 주안231-12023-05-11
8336AUKO인천광역시 미추홀구 주안1동 264-12023-05-11
8337세계양행인천광역시 미추홀구 주안5동 19-252023-05-11
8338여명문화사인천광역시 미추홀구 주안1동 81-102023-05-11
8339자격고시인천광역시 미추홀구 주안1동 81-102023-05-11
8340건강식품점인천광역시 미추홀구 용현3동 1452023-05-11
8341삼보문화원인천광역시 미추홀구 주안1동 130-62023-05-11

Duplicate rows

Most frequently occurring

상호명소재지주소데이터기준일# duplicates
38해피복댕이인천광역시 미추홀구 주안로90번길 42-122023-05-113
0(Mz)펠릭스인천광역시 미추홀구 석정로 3882023-05-112
1(주)울림인천광역시 미추홀구 주안로 1152023-05-112
2고은한복인천광역시 미추홀구 인하로235번길 262023-05-112
3골드제이인천광역시 미추홀구 인하로 2922023-05-112
4굿플레이스인천광역시 미추홀구 한나루로 6102023-05-112
5넛지87인천광역시 미추홀구 경인로 842023-05-112
6네오피자 미추홀점인천광역시 미추홀구 경인로326번길 372023-05-112
7다잇소인천광역시 미추홀구 석바위로 1342023-05-112
8더 걸인천광역시 미추홀구 주안로 지하 862023-05-112