Overview

Dataset statistics

Number of variables4
Number of observations68
Missing cells50
Missing cells (%)18.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory33.9 B

Variable types

Categorical1
Text3

Dataset

Description인천광역시 연수구에 위치한 비디오물 제작업소에 대한 데이터로 상호, 영업소재지(도로명) 품목 데이터를 제공합니다.
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15055551&srcSe=7661IVAWM27C61E190

Alerts

업종 is highly imbalanced (61.1%)Imbalance
전화번호 has 50 (73.5%) missing valuesMissing

Reproduction

Analysis started2024-01-28 04:55:53.547448
Analysis finished2024-01-28 04:55:53.968151
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

IMBALANCE 

Distinct4
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size676.0 B
비디오물제작업
57 
비디오물배급업
비디오물감상실업
 
1
복합영상물제공업
 
1

Length

Max length8
Median length7
Mean length7.0294118
Min length7

Unique

Unique2 ?
Unique (%)2.9%

Sample

1st row비디오물제작업
2nd row비디오물제작업
3rd row비디오물제작업
4th row비디오물제작업
5th row비디오물제작업

Common Values

ValueCountFrequency (%)
비디오물제작업 57
83.8%
비디오물배급업 9
 
13.2%
비디오물감상실업 1
 
1.5%
복합영상물제공업 1
 
1.5%

Length

2024-01-28T13:55:54.017632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T13:55:54.094943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비디오물제작업 57
83.8%
비디오물배급업 9
 
13.2%
비디오물감상실업 1
 
1.5%
복합영상물제공업 1
 
1.5%

상호
Text

Distinct63
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size676.0 B
2024-01-28T13:55:54.269711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length14
Mean length8.3970588
Min length1

Characters and Unicode

Total characters571
Distinct characters175
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)85.3%

Sample

1st row주식회사 펜타컴
2nd row재화커뮤니케이션(주)
3rd row유디아이 주식회사
4th row유디아이 주식회사
5th row에이피건축(주)
ValueCountFrequency (%)
주식회사 20
 
20.6%
주)지피엠 2
 
2.1%
유디아이 2
 
2.1%
주)드루지야오페라단 2
 
2.1%
2
 
2.1%
jcd 2
 
2.1%
주)디에이치이비즈 2
 
2.1%
주)스튜디오도플 1
 
1.0%
지안커뮤니케이션 1
 
1.0%
슛버튼 1
 
1.0%
Other values (62) 62
63.9%
2024-01-28T13:55:54.588985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
7.2%
29
 
5.1%
( 24
 
4.2%
) 24
 
4.2%
23
 
4.0%
20
 
3.5%
20
 
3.5%
20
 
3.5%
17
 
3.0%
15
 
2.6%
Other values (165) 338
59.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 451
79.0%
Space Separator 29
 
5.1%
Open Punctuation 24
 
4.2%
Close Punctuation 24
 
4.2%
Lowercase Letter 22
 
3.9%
Uppercase Letter 18
 
3.2%
Other Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
9.1%
23
 
5.1%
20
 
4.4%
20
 
4.4%
20
 
4.4%
17
 
3.8%
15
 
3.3%
8
 
1.8%
8
 
1.8%
7
 
1.6%
Other values (138) 272
60.3%
Lowercase Letter
ValueCountFrequency (%)
o 4
18.2%
i 2
9.1%
d 2
9.1%
t 2
9.1%
m 2
9.1%
a 2
9.1%
n 2
9.1%
u 2
9.1%
l 1
 
4.5%
e 1
 
4.5%
Other values (2) 2
9.1%
Uppercase Letter
ValueCountFrequency (%)
D 4
22.2%
C 3
16.7%
L 2
11.1%
F 2
11.1%
J 2
11.1%
N 1
 
5.6%
S 1
 
5.6%
M 1
 
5.6%
O 1
 
5.6%
V 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 451
79.0%
Common 80
 
14.0%
Latin 40
 
7.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
9.1%
23
 
5.1%
20
 
4.4%
20
 
4.4%
20
 
4.4%
17
 
3.8%
15
 
3.3%
8
 
1.8%
8
 
1.8%
7
 
1.6%
Other values (138) 272
60.3%
Latin
ValueCountFrequency (%)
D 4
 
10.0%
o 4
 
10.0%
C 3
 
7.5%
i 2
 
5.0%
d 2
 
5.0%
t 2
 
5.0%
L 2
 
5.0%
m 2
 
5.0%
a 2
 
5.0%
n 2
 
5.0%
Other values (12) 15
37.5%
Common
ValueCountFrequency (%)
29
36.2%
( 24
30.0%
) 24
30.0%
. 2
 
2.5%
, 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 451
79.0%
ASCII 120
 
21.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
41
 
9.1%
23
 
5.1%
20
 
4.4%
20
 
4.4%
20
 
4.4%
17
 
3.8%
15
 
3.3%
8
 
1.8%
8
 
1.8%
7
 
1.6%
Other values (138) 272
60.3%
ASCII
ValueCountFrequency (%)
29
24.2%
( 24
20.0%
) 24
20.0%
D 4
 
3.3%
o 4
 
3.3%
C 3
 
2.5%
. 2
 
1.7%
i 2
 
1.7%
d 2
 
1.7%
t 2
 
1.7%
Other values (17) 24
20.0%
Distinct62
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size676.0 B
2024-01-28T13:55:54.782164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length51
Mean length45.338235
Min length24

Characters and Unicode

Total characters3083
Distinct characters163
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)82.4%

Sample

1st row인천광역시 연수구 해돋이로 107, F동 1113호 (송도동,송도 더샵 퍼스트월드)
2nd row인천광역시 연수구 컨벤시아대로 233, 407호 (송도동,송도브릿지호텔 상가시설내 4층)
3rd row인천광역시 연수구 컨벤시아대로 81, 512, 513호 (송도동,드림시티)
4th row인천광역시 연수구 컨벤시아대로 81, 512.513호 (송도동,두림시티)
5th row인천광역시 연수구 송도과학로 70, 1804,1805,1806호 (송도동)
ValueCountFrequency (%)
인천광역시 68
 
11.8%
연수구 68
 
11.8%
송도동 61
 
10.6%
송도 16
 
2.8%
30 15
 
2.6%
송도미래로 12
 
2.1%
스마트밸리 10
 
1.7%
컨벤시아대로 10
 
1.7%
센트럴로 9
 
1.6%
brc 8
 
1.4%
Other values (174) 301
52.1%
2024-01-28T13:55:55.091973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
514
 
16.7%
120
 
3.9%
119
 
3.9%
106
 
3.4%
1 103
 
3.3%
, 95
 
3.1%
88
 
2.9%
2 83
 
2.7%
0 82
 
2.7%
81
 
2.6%
Other values (153) 1692
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1748
56.7%
Space Separator 514
 
16.7%
Decimal Number 507
 
16.4%
Other Punctuation 96
 
3.1%
Open Punctuation 70
 
2.3%
Close Punctuation 70
 
2.3%
Uppercase Letter 59
 
1.9%
Dash Punctuation 17
 
0.6%
Math Symbol 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
6.9%
119
 
6.8%
106
 
6.1%
88
 
5.0%
81
 
4.6%
80
 
4.6%
77
 
4.4%
71
 
4.1%
70
 
4.0%
69
 
3.9%
Other values (123) 867
49.6%
Uppercase Letter
ValueCountFrequency (%)
B 13
22.0%
C 13
22.0%
R 9
15.3%
E 8
13.6%
A 6
10.2%
T 3
 
5.1%
I 2
 
3.4%
O 1
 
1.7%
U 1
 
1.7%
F 1
 
1.7%
Other values (2) 2
 
3.4%
Decimal Number
ValueCountFrequency (%)
1 103
20.3%
2 83
16.4%
0 82
16.2%
3 71
14.0%
6 41
 
8.1%
4 36
 
7.1%
5 33
 
6.5%
8 22
 
4.3%
7 20
 
3.9%
9 16
 
3.2%
Other Punctuation
ValueCountFrequency (%)
, 95
99.0%
. 1
 
1.0%
Space Separator
ValueCountFrequency (%)
514
100.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1748
56.7%
Common 1275
41.4%
Latin 60
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
6.9%
119
 
6.8%
106
 
6.1%
88
 
5.0%
81
 
4.6%
80
 
4.6%
77
 
4.4%
71
 
4.1%
70
 
4.0%
69
 
3.9%
Other values (123) 867
49.6%
Common
ValueCountFrequency (%)
514
40.3%
1 103
 
8.1%
, 95
 
7.5%
2 83
 
6.5%
0 82
 
6.4%
3 71
 
5.6%
( 70
 
5.5%
) 70
 
5.5%
6 41
 
3.2%
4 36
 
2.8%
Other values (7) 110
 
8.6%
Latin
ValueCountFrequency (%)
B 13
21.7%
C 13
21.7%
R 9
15.0%
E 8
13.3%
A 6
10.0%
T 3
 
5.0%
I 2
 
3.3%
O 1
 
1.7%
U 1
 
1.7%
F 1
 
1.7%
Other values (3) 3
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1748
56.7%
ASCII 1335
43.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
514
38.5%
1 103
 
7.7%
, 95
 
7.1%
2 83
 
6.2%
0 82
 
6.1%
3 71
 
5.3%
( 70
 
5.2%
) 70
 
5.2%
6 41
 
3.1%
4 36
 
2.7%
Other values (20) 170
 
12.7%
Hangul
ValueCountFrequency (%)
120
 
6.9%
119
 
6.8%
106
 
6.1%
88
 
5.0%
81
 
4.6%
80
 
4.6%
77
 
4.4%
71
 
4.1%
70
 
4.0%
69
 
3.9%
Other values (123) 867
49.6%

전화번호
Text

MISSING 

Distinct17
Distinct (%)94.4%
Missing50
Missing (%)73.5%
Memory size676.0 B
2024-01-28T13:55:55.239173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.222222
Min length12

Characters and Unicode

Total characters220
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)88.9%

Sample

1st row070-8192-1087
2nd row032-858-4437
3rd row032-833-5988
4th row032-720-4331
5th row032-716-9660
ValueCountFrequency (%)
032-243-3048 2
 
11.1%
070-4848-0078 1
 
5.6%
070-8192-1087 1
 
5.6%
032-720-5577 1
 
5.6%
032-459-2222 1
 
5.6%
032-721-5501 1
 
5.6%
032-832-2541 1
 
5.6%
032-812-0607 1
 
5.6%
032-260-1122 1
 
5.6%
032-833-5988 1
 
5.6%
Other values (7) 7
38.9%
2024-01-28T13:55:55.499831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 36
16.4%
- 36
16.4%
2 32
14.5%
3 24
10.9%
8 23
10.5%
7 17
7.7%
1 15
6.8%
4 14
 
6.4%
5 10
 
4.5%
6 9
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 184
83.6%
Dash Punctuation 36
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 36
19.6%
2 32
17.4%
3 24
13.0%
8 23
12.5%
7 17
9.2%
1 15
8.2%
4 14
 
7.6%
5 10
 
5.4%
6 9
 
4.9%
9 4
 
2.2%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 220
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 36
16.4%
- 36
16.4%
2 32
14.5%
3 24
10.9%
8 23
10.5%
7 17
7.7%
1 15
6.8%
4 14
 
6.4%
5 10
 
4.5%
6 9
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 220
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 36
16.4%
- 36
16.4%
2 32
14.5%
3 24
10.9%
8 23
10.5%
7 17
7.7%
1 15
6.8%
4 14
 
6.4%
5 10
 
4.5%
6 9
 
4.1%

Correlations

2024-01-28T13:55:55.574238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호영업소소재지(도로명)전화번호
업종1.0000.3260.0000.000
상호0.3261.0000.9991.000
영업소소재지(도로명)0.0000.9991.0001.000
전화번호0.0001.0001.0001.000

Missing values

2024-01-28T13:55:53.880716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T13:55:53.943118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호영업소소재지(도로명)전화번호
0비디오물제작업주식회사 펜타컴인천광역시 연수구 해돋이로 107, F동 1113호 (송도동,송도 더샵 퍼스트월드)070-8192-1087
1비디오물제작업재화커뮤니케이션(주)인천광역시 연수구 컨벤시아대로 233, 407호 (송도동,송도브릿지호텔 상가시설내 4층)032-858-4437
2비디오물제작업유디아이 주식회사인천광역시 연수구 컨벤시아대로 81, 512, 513호 (송도동,드림시티)032-833-5988
3비디오물제작업유디아이 주식회사인천광역시 연수구 컨벤시아대로 81, 512.513호 (송도동,두림시티)<NA>
4비디오물제작업에이피건축(주)인천광역시 연수구 송도과학로 70, 1804,1805,1806호 (송도동)<NA>
5비디오물제작업웨스트코(주)인천광역시 연수구 인천타워대로 99, 애니오션빌딩 701호 (송도동)<NA>
6비디오물제작업초록미디어 콘텐츠인천광역시 연수구 송도미래로 30, E동 1401호 (송도동, 송도 BRC 스마트밸리 지식산업센터)<NA>
7비디오물제작업(주)드루지야오페라단인천광역시 연수구 송도미래로 30, E동 12층 1201호 (송도동, 스마트밸리)032-720-4331
8비디오물제작업(주)아인픽춰스인천광역시 연수구 송도미래로 30, E동 9층 902호 (송도동, 송도비알씨스마트밸리지식산업센터)<NA>
9비디오물제작업라울앤풀인천광역시 연수구 센트럴로 263 (송도동)<NA>
업종상호영업소소재지(도로명)전화번호
58비디오물배급업(주) 아인픽춰스인천광역시 연수구 송도미래로 30, E동 9층 902호 (송도동, 송도비알씨스마트밸리지식산업센터)<NA>
59비디오물배급업라울엔폴인천광역시 연수구 센트럴로 263, 2층 (송도동, IBS 별관)<NA>
60비디오물배급업래빗네트웍스인천광역시 연수구 송도미래로 30, 이동 23층 2306호 (송도동, 송도 BRC 스마트밸리 지식산업센터)032-720-5577
61비디오물배급업(주)지피엠인천광역시 연수구 송도과학로84번길 24, 제너셈(주) 신축 사옥 4층 (송도동)<NA>
62비디오물배급업(주)디에이치이비즈인천광역시 연수구 송도미래로 30, 송도 BRC 스마트밸리 지식산업센터 디동 1604호 (송도동)02-1566-1674
63비디오물배급업주식회사 놀이동산엔터테인먼트인천광역시 연수구 아트센터대로97번길 30, 1603동 504호 (송도동, 더샵그린워크1차)032-243-3048
64비디오물배급업(주)라울앤폴인터내셔널인천광역시 연수구 센트럴로 263, 송도국제업무단지 C8-2블럭 업무복합시설 별관동 2층 201호 (송도동)<NA>
65비디오물배급업JCD인천광역시 연수구 컨벤시아대로 165, 포스코타워-송도 26층 2681호 (송도동)<NA>
66비디오물감상실업블루DVD영화관인천광역시 연수구 학나래로46번길 22 (선학동,3,4층)<NA>
67복합영상물제공업MOL 멀티룸인천광역시 연수구 인천타워대로132번길 30, 휴먼빌파크 501, 502-1호 (송도동)<NA>