Overview

Dataset statistics

Number of variables3
Number of observations56
Missing cells16
Missing cells (%)9.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory26.4 B

Variable types

Text3

Dataset

Description알뜰폰 사업자 현황에 대한 데이터로 사업자명과 사업자별 사업장 전화번호, 사업자별 사업자등록번호 항목으로 제공합니다.
URLhttps://www.data.go.kr/data/15107468/fileData.do

Alerts

사업장전화번호 has 14 (25.0%) missing valuesMissing
사업자등록번호 has 2 (3.6%) missing valuesMissing
사업자명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:39:03.281900
Analysis finished2023-12-12 22:39:03.688254
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업자명
Text

UNIQUE 

Distinct56
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size580.0 B
2023-12-13T07:39:03.912798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.0178571
Min length2

Characters and Unicode

Total characters281
Distinct characters117
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)100.0%

Sample

1st rowACN코리아
2nd rowCK커뮤스트리
3rd rowKB국민은행
4th rowKG모바일
5th rowKT엠모바일
ValueCountFrequency (%)
acn코리아 1
 
1.8%
ck커뮤스트리 1
 
1.8%
더피엔엘 1
 
1.8%
유니컴즈 1
 
1.8%
인스코리아 1
 
1.8%
인스코비 1
 
1.8%
제주방송 1
 
1.8%
조이텔 1
 
1.8%
코드모바일 1
 
1.8%
큰사람 1
 
1.8%
Other values (46) 46
82.1%
2023-12-13T07:39:04.331082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
5.7%
14
 
5.0%
13
 
4.6%
9
 
3.2%
8
 
2.8%
7
 
2.5%
6
 
2.1%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (107) 190
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 266
94.7%
Uppercase Letter 15
 
5.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
6.0%
14
 
5.3%
13
 
4.9%
9
 
3.4%
8
 
3.0%
7
 
2.6%
6
 
2.3%
6
 
2.3%
6
 
2.3%
6
 
2.3%
Other values (98) 175
65.8%
Uppercase Letter
ValueCountFrequency (%)
K 5
33.3%
C 2
 
13.3%
G 2
 
13.3%
A 1
 
6.7%
L 1
 
6.7%
S 1
 
6.7%
T 1
 
6.7%
B 1
 
6.7%
N 1
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 266
94.7%
Latin 15
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
6.0%
14
 
5.3%
13
 
4.9%
9
 
3.4%
8
 
3.0%
7
 
2.6%
6
 
2.3%
6
 
2.3%
6
 
2.3%
6
 
2.3%
Other values (98) 175
65.8%
Latin
ValueCountFrequency (%)
K 5
33.3%
C 2
 
13.3%
G 2
 
13.3%
A 1
 
6.7%
L 1
 
6.7%
S 1
 
6.7%
T 1
 
6.7%
B 1
 
6.7%
N 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 266
94.7%
ASCII 15
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
6.0%
14
 
5.3%
13
 
4.9%
9
 
3.4%
8
 
3.0%
7
 
2.6%
6
 
2.3%
6
 
2.3%
6
 
2.3%
6
 
2.3%
Other values (98) 175
65.8%
ASCII
ValueCountFrequency (%)
K 5
33.3%
C 2
 
13.3%
G 2
 
13.3%
A 1
 
6.7%
L 1
 
6.7%
S 1
 
6.7%
T 1
 
6.7%
B 1
 
6.7%
N 1
 
6.7%

사업장전화번호
Text

MISSING 

Distinct41
Distinct (%)97.6%
Missing14
Missing (%)25.0%
Memory size580.0 B
2023-12-13T07:39:04.536203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters378
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)95.2%

Sample

1st row1688-9800
2nd row1566-1246
3rd row1522-9999
4th row1644-9388
5th row1899-5000
ValueCountFrequency (%)
1661-2207 2
 
4.8%
1899-7700 1
 
2.4%
1833-2115 1
 
2.4%
1899-3633 1
 
2.4%
1588-1635 1
 
2.4%
1599-7114 1
 
2.4%
1644-3797 1
 
2.4%
1544-5237 1
 
2.4%
1566-9070 1
 
2.4%
1661-5646 1
 
2.4%
Other values (31) 31
73.8%
2023-12-13T07:39:04.885680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 67
17.7%
6 46
12.2%
- 42
11.1%
0 41
10.8%
8 40
10.6%
9 32
8.5%
5 29
7.7%
7 23
 
6.1%
4 20
 
5.3%
3 20
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 336
88.9%
Dash Punctuation 42
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 67
19.9%
6 46
13.7%
0 41
12.2%
8 40
11.9%
9 32
9.5%
5 29
8.6%
7 23
 
6.8%
4 20
 
6.0%
3 20
 
6.0%
2 18
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 378
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 67
17.7%
6 46
12.2%
- 42
11.1%
0 41
10.8%
8 40
10.6%
9 32
8.5%
5 29
7.7%
7 23
 
6.1%
4 20
 
5.3%
3 20
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 378
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 67
17.7%
6 46
12.2%
- 42
11.1%
0 41
10.8%
8 40
10.6%
9 32
8.5%
5 29
7.7%
7 23
 
6.1%
4 20
 
5.3%
3 20
 
5.3%

사업자등록번호
Text

MISSING 

Distinct53
Distinct (%)98.1%
Missing2
Missing (%)3.6%
Memory size580.0 B
2023-12-13T07:39:05.159110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters648
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)96.3%

Sample

1st row220-87-94670
2nd row101-86-20021
3rd row201-81-68693
4th row220-81-82546
5th row133-81-43410
ValueCountFrequency (%)
101-86-20021 2
 
3.7%
104-86-21733 1
 
1.9%
113-86-67771 1
 
1.9%
106-81-85679 1
 
1.9%
138-81-70324 1
 
1.9%
134-86-82309 1
 
1.9%
595-85-00145 1
 
1.9%
616-81-11863 1
 
1.9%
886-87-00313 1
 
1.9%
220-81-64820 1
 
1.9%
Other values (43) 43
79.6%
2023-12-13T07:39:05.626770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 108
16.7%
1 99
15.3%
8 86
13.3%
0 70
10.8%
2 65
10.0%
6 47
7.3%
5 41
 
6.3%
3 40
 
6.2%
7 38
 
5.9%
4 32
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 540
83.3%
Dash Punctuation 108
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 99
18.3%
8 86
15.9%
0 70
13.0%
2 65
12.0%
6 47
8.7%
5 41
7.6%
3 40
7.4%
7 38
 
7.0%
4 32
 
5.9%
9 22
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 648
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 108
16.7%
1 99
15.3%
8 86
13.3%
0 70
10.8%
2 65
10.0%
6 47
7.3%
5 41
 
6.3%
3 40
 
6.2%
7 38
 
5.9%
4 32
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 648
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 108
16.7%
1 99
15.3%
8 86
13.3%
0 70
10.8%
2 65
10.0%
6 47
7.3%
5 41
 
6.3%
3 40
 
6.2%
7 38
 
5.9%
4 32
 
4.9%

Correlations

2023-12-13T07:39:05.827919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자명사업장전화번호사업자등록번호
사업자명1.0001.0001.000
사업장전화번호1.0001.0001.000
사업자등록번호1.0001.0001.000

Missing values

2023-12-13T07:39:03.477976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:39:03.551716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:39:03.633620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

사업자명사업장전화번호사업자등록번호
0ACN코리아1688-9800220-87-94670
1CK커뮤스트리1566-1246101-86-20021
2KB국민은행1522-9999201-81-68693
3KG모바일1644-9388220-81-82546
4KT엠모바일1899-5000133-81-43410
5LG헬로비전1855-1000117-81-13423
6SK텔링크1599-0999104-81-43391
7고고팩토리1670-9098253-86-01062
8니즈텔레콤1688-5566220-86-52917
9더원플랫폼1599-0596206-86-38729
사업자명사업장전화번호사업자등록번호
46보스그룹<NA>582-86-00883
47에르엘<NA>704-88-02369
48엔페이넷<NA>518-81-00311
49엠티티텔레콤<NA>220-87-13963
50원텔레콤<NA>545-86-01239
51장성모바일<NA>264-81-45783
52제이씨티<NA>608-86-05254
53포인트파크<NA>128-81-41180
54핀샷<NA>112-86-00395
55사람과연결<NA><NA>