Overview

Dataset statistics

Number of variables6
Number of observations44
Missing cells31
Missing cells (%)11.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory51.0 B

Variable types

Text4
Categorical1
Boolean1

Dataset

Description한국도로공사 고속도로 하이패스 단말기 제조사 정보를 제공한다. (제조사명,주소,주소(상세),전화번호,팩스번호,지원금단말기 제조여부)
URLhttps://www.data.go.kr/data/15064220/fileData.do

Alerts

상세주소 is highly overall correlated with 지원금단말기 제조여부High correlation
지원금단말기 제조여부 is highly overall correlated with 상세주소High correlation
전화번호 has 3 (6.8%) missing valuesMissing
팩스번호 has 28 (63.6%) missing valuesMissing
제조사명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:51:16.652225
Analysis finished2023-12-12 20:51:17.281594
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

제조사명
Text

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-13T05:51:17.526318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length9.75
Min length5

Characters and Unicode

Total characters429
Distinct characters94
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row주식회사 하이원시스템(본사)
2nd row포스코아이씨티(주)
3rd row(주)테크나비
4th row(주)이씨스
5th row(주)세아네트웍스
ValueCountFrequency (%)
주식회사 1
 
2.1%
하이게인텔레콤(본사 1
 
2.1%
주)라닉스(본사 1
 
2.1%
코스페이스(본사 1
 
2.1%
주)아이트로닉스 1
 
2.1%
주)aits 1
 
2.1%
주)sd시스템 1
 
2.1%
한국인포콤(본사 1
 
2.1%
주)모토텍(본사 1
 
2.1%
주)휴먼케어(본사 1
 
2.1%
Other values (37) 37
78.7%
2023-12-13T05:51:17.968864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 59
 
13.8%
) 59
 
13.8%
33
 
7.7%
27
 
6.3%
26
 
6.1%
20
 
4.7%
18
 
4.2%
12
 
2.8%
9
 
2.1%
7
 
1.6%
Other values (84) 159
37.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 295
68.8%
Open Punctuation 59
 
13.8%
Close Punctuation 59
 
13.8%
Uppercase Letter 13
 
3.0%
Space Separator 3
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
11.2%
27
 
9.2%
26
 
8.8%
20
 
6.8%
18
 
6.1%
12
 
4.1%
9
 
3.1%
7
 
2.4%
6
 
2.0%
6
 
2.0%
Other values (73) 131
44.4%
Uppercase Letter
ValueCountFrequency (%)
S 5
38.5%
D 2
 
15.4%
A 1
 
7.7%
I 1
 
7.7%
T 1
 
7.7%
L 1
 
7.7%
G 1
 
7.7%
M 1
 
7.7%
Open Punctuation
ValueCountFrequency (%)
( 59
100.0%
Close Punctuation
ValueCountFrequency (%)
) 59
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 295
68.8%
Common 121
28.2%
Latin 13
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
11.2%
27
 
9.2%
26
 
8.8%
20
 
6.8%
18
 
6.1%
12
 
4.1%
9
 
3.1%
7
 
2.4%
6
 
2.0%
6
 
2.0%
Other values (73) 131
44.4%
Latin
ValueCountFrequency (%)
S 5
38.5%
D 2
 
15.4%
A 1
 
7.7%
I 1
 
7.7%
T 1
 
7.7%
L 1
 
7.7%
G 1
 
7.7%
M 1
 
7.7%
Common
ValueCountFrequency (%)
( 59
48.8%
) 59
48.8%
3
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 295
68.8%
ASCII 134
31.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 59
44.0%
) 59
44.0%
S 5
 
3.7%
3
 
2.2%
D 2
 
1.5%
A 1
 
0.7%
I 1
 
0.7%
T 1
 
0.7%
L 1
 
0.7%
G 1
 
0.7%
Hangul
ValueCountFrequency (%)
33
 
11.2%
27
 
9.2%
26
 
8.8%
20
 
6.8%
18
 
6.1%
12
 
4.1%
9
 
3.1%
7
 
2.4%
6
 
2.0%
6
 
2.0%
Other values (73) 131
44.4%

주소
Text

Distinct38
Distinct (%)86.4%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-13T05:51:18.371326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length37
Mean length23.477273
Min length2

Characters and Unicode

Total characters1033
Distinct characters161
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)77.3%

Sample

1st row인천광역시 남동구 장자로5번길 18-24
2nd row경기도 성남시 분당구 황새울로311번길 9 (서현동)
3rd row경기도 이천시 호법면 이섭대천로 749-12
4th row서울특별시 금천구 가산디지털1로 2, 우림라이온스밸리 2차 1109호1 (가산동)
5th row
ValueCountFrequency (%)
경기도 22
 
10.4%
인천광역시 8
 
3.8%
서울특별시 7
 
3.3%
부평구 5
 
2.4%
성남시 5
 
2.4%
분당구 4
 
1.9%
동안구 3
 
1.4%
정왕동 3
 
1.4%
시흥시 3
 
1.4%
안양시 3
 
1.4%
Other values (128) 149
70.3%
2023-12-13T05:51:18.838691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
186
 
18.0%
44
 
4.3%
35
 
3.4%
1 35
 
3.4%
31
 
3.0%
31
 
3.0%
2 25
 
2.4%
24
 
2.3%
24
 
2.3%
23
 
2.2%
Other values (151) 575
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 625
60.5%
Space Separator 186
 
18.0%
Decimal Number 155
 
15.0%
Close Punctuation 20
 
1.9%
Open Punctuation 20
 
1.9%
Other Punctuation 12
 
1.2%
Dash Punctuation 11
 
1.1%
Uppercase Letter 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
7.0%
35
 
5.6%
31
 
5.0%
31
 
5.0%
24
 
3.8%
24
 
3.8%
23
 
3.7%
16
 
2.6%
14
 
2.2%
13
 
2.1%
Other values (132) 370
59.2%
Decimal Number
ValueCountFrequency (%)
1 35
22.6%
2 25
16.1%
3 21
13.5%
0 16
10.3%
4 13
 
8.4%
9 11
 
7.1%
5 11
 
7.1%
6 9
 
5.8%
8 8
 
5.2%
7 6
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
S 1
25.0%
L 1
25.0%
D 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
186
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Other Punctuation
ValueCountFrequency (%)
, 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 625
60.5%
Common 404
39.1%
Latin 4
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
7.0%
35
 
5.6%
31
 
5.0%
31
 
5.0%
24
 
3.8%
24
 
3.8%
23
 
3.7%
16
 
2.6%
14
 
2.2%
13
 
2.1%
Other values (132) 370
59.2%
Common
ValueCountFrequency (%)
186
46.0%
1 35
 
8.7%
2 25
 
6.2%
3 21
 
5.2%
) 20
 
5.0%
( 20
 
5.0%
0 16
 
4.0%
4 13
 
3.2%
, 12
 
3.0%
9 11
 
2.7%
Other values (5) 45
 
11.1%
Latin
ValueCountFrequency (%)
S 1
25.0%
L 1
25.0%
D 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 625
60.5%
ASCII 408
39.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
186
45.6%
1 35
 
8.6%
2 25
 
6.1%
3 21
 
5.1%
) 20
 
4.9%
( 20
 
4.9%
0 16
 
3.9%
4 13
 
3.2%
, 12
 
2.9%
9 11
 
2.7%
Other values (9) 49
 
12.0%
Hangul
ValueCountFrequency (%)
44
 
7.0%
35
 
5.6%
31
 
5.0%
31
 
5.0%
24
 
3.8%
24
 
3.8%
23
 
3.7%
16
 
2.6%
14
 
2.2%
13
 
2.1%
Other values (132) 370
59.2%

상세주소
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)43.2%
Missing0
Missing (%)0.0%
Memory size484.0 B
23 
11
 
1
부평대로 301, 9층 913호
 
1
정민빌딩 5층
 
1
Other values (14)
14 

Length

Max length24
Median length2
Mean length5.2272727
Min length1

Unique

Unique17 ?
Unique (%)38.6%

Sample

1st row1층
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
23
52.3%
4
 
9.1%
11 1
 
2.3%
부평대로 301, 9층 913호 1
 
2.3%
정민빌딩 5층 1
 
2.3%
(주)이아이 1
 
2.3%
부평대로 293 1
 
2.3%
카네비모빌리티 5층 1
 
2.3%
서울인터내셔널타워 20층 1
 
2.3%
2층(청천동) 1
 
2.3%
Other values (9) 9
 
20.5%

Length

2023-12-13T05:51:18.978662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
5층 2
 
5.9%
3층 2
 
5.9%
부평대로 2
 
5.9%
11 1
 
2.9%
동일테크노타운7차 1
 
2.9%
1층 1
 
2.9%
한독빌딩 1
 
2.9%
도곡동 1
 
2.9%
엠피온 1
 
2.9%
j빌딩 1
 
2.9%
Other values (21) 21
61.8%

전화번호
Text

MISSING 

Distinct38
Distinct (%)92.7%
Missing3
Missing (%)6.8%
Memory size484.0 B
2023-12-13T05:51:19.165374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.926829
Min length9

Characters and Unicode

Total characters489
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)85.4%

Sample

1st row031-472-4534
2nd row031-779-2114
3rd row031-638-8374
4th row02-2027-2221
5th row02-2142-0881
ValueCountFrequency (%)
031-708-5469 2
 
4.9%
031-638-8374 2
 
4.9%
032-541-0658 2
 
4.9%
02-1600-6704 1
 
2.4%
031-421-7711 1
 
2.4%
02-1544-3061 1
 
2.4%
031-486-9015 1
 
2.4%
031-472-4534 1
 
2.4%
031-217-1063 1
 
2.4%
02-6247-6247 1
 
2.4%
Other values (28) 28
68.3%
2023-12-13T05:51:19.567435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 81
16.6%
0 77
15.7%
1 62
12.7%
3 52
10.6%
2 45
9.2%
4 34
7.0%
5 33
6.7%
6 32
 
6.5%
7 29
 
5.9%
8 28
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 408
83.4%
Dash Punctuation 81
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 77
18.9%
1 62
15.2%
3 52
12.7%
2 45
11.0%
4 34
8.3%
5 33
8.1%
6 32
7.8%
7 29
 
7.1%
8 28
 
6.9%
9 16
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 81
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 489
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 81
16.6%
0 77
15.7%
1 62
12.7%
3 52
10.6%
2 45
9.2%
4 34
7.0%
5 33
6.7%
6 32
 
6.5%
7 29
 
5.9%
8 28
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 489
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 81
16.6%
0 77
15.7%
1 62
12.7%
3 52
10.6%
2 45
9.2%
4 34
7.0%
5 33
6.7%
6 32
 
6.5%
7 29
 
5.9%
8 28
 
5.7%

팩스번호
Text

MISSING 

Distinct16
Distinct (%)100.0%
Missing28
Missing (%)63.6%
Memory size484.0 B
2023-12-13T05:51:19.758079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.875
Min length11

Characters and Unicode

Total characters190
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)100.0%

Sample

1st row032-232-5914
2nd row032-515-4605
3rd row02-2173-5453
4th row031-459-9039
5th row031-629-5318
ValueCountFrequency (%)
032-232-5914 1
 
6.2%
032-515-4605 1
 
6.2%
02-2173-5453 1
 
6.2%
031-459-9039 1
 
6.2%
031-629-5318 1
 
6.2%
031-205-0786 1
 
6.2%
031-486-9018 1
 
6.2%
02-584-5528 1
 
6.2%
031-496-1344 1
 
6.2%
02-3429-3981 1
 
6.2%
Other values (6) 6
37.5%
2023-12-13T05:51:20.084810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 32
16.8%
3 25
13.2%
0 24
12.6%
2 20
10.5%
1 18
9.5%
5 15
7.9%
4 14
7.4%
9 13
6.8%
8 11
 
5.8%
6 10
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 158
83.2%
Dash Punctuation 32
 
16.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 25
15.8%
0 24
15.2%
2 20
12.7%
1 18
11.4%
5 15
9.5%
4 14
8.9%
9 13
8.2%
8 11
7.0%
6 10
 
6.3%
7 8
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 190
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 32
16.8%
3 25
13.2%
0 24
12.6%
2 20
10.5%
1 18
9.5%
5 15
7.9%
4 14
7.4%
9 13
6.8%
8 11
 
5.8%
6 10
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 190
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 32
16.8%
3 25
13.2%
0 24
12.6%
2 20
10.5%
1 18
9.5%
5 15
7.9%
4 14
7.4%
9 13
6.8%
8 11
 
5.8%
6 10
 
5.3%

지원금단말기 제조여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size176.0 B
False
37 
True
ValueCountFrequency (%)
False 37
84.1%
True 7
 
15.9%
2023-12-13T05:51:20.230948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:51:20.306607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제조사명주소상세주소전화번호팩스번호지원금단말기 제조여부
제조사명1.0001.0001.0001.0001.0001.000
주소1.0001.0000.9930.9821.0000.556
상세주소1.0000.9931.0000.5031.0000.896
전화번호1.0000.9820.5031.0001.0000.000
팩스번호1.0001.0001.0001.0001.0001.000
지원금단말기 제조여부1.0000.5560.8960.0001.0001.000
2023-12-13T05:51:20.405203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지원금단말기 제조여부상세주소
지원금단말기 제조여부1.0000.655
상세주소0.6551.000
2023-12-13T05:51:20.483913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상세주소지원금단말기 제조여부
상세주소1.0000.655
지원금단말기 제조여부0.6551.000

Missing values

2023-12-13T05:51:17.030629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:51:17.130806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:51:17.237578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

제조사명주소상세주소전화번호팩스번호지원금단말기 제조여부
0주식회사 하이원시스템(본사)인천광역시 남동구 장자로5번길 18-241층031-472-4534<NA>N
1포스코아이씨티(주)경기도 성남시 분당구 황새울로311번길 9 (서현동)031-779-2114<NA>N
2(주)테크나비경기도 이천시 호법면 이섭대천로 749-12031-638-8374<NA>N
3(주)이씨스서울특별시 금천구 가산디지털1로 2, 우림라이온스밸리 2차 1109호1 (가산동)02-2027-2221<NA>N
4(주)세아네트웍스02-2142-0881<NA>N
5동선산업전자(주)(본사)경기도 안산시 상록구 건건2길 14 (건건동)031-437-5851<NA>N
6피앤티코리아(본사)인천광역시 부평구부평대로 301, 9층 913호070-8805-0914032-232-5914N
7씨앤씨일렉트론(본사)경기도 이천시 부발읍 부발중앙로 272번길 62031-636-8068<NA>N
8(주)에이티엔대전광역시 유성구 테크노9로 35 (탑립동)042-936-1133<NA>N
9(주)에어포인트대전광역시 유성구 가정로 316 (도룡동)031-708-5469<NA>N
제조사명주소상세주소전화번호팩스번호지원금단말기 제조여부
34삼성SDS(주)(본사)서울특별시 강남구 테헤란로 318 (역삼동)02-2225-629802-3429-3981N
35(주)카네비모빌리티인천광역시 부평구 경원대로 1190 (십정동)032-517-4600<NA>N
36(주)에어포인트(본사)경기도 성남시 분당구 장미로 42-0811호031-708-5469031-708-5462Y
37(주)에스에이치전자(본사)경기도 안양시 동안구 벌말로 1407306호(관양동, 동일테크노타운7차)1522-1540<NA>N
38엔씨게이트(본사)경기도 광명시 하안로 60E동 6층 602호(소하동, 광명테크노파크)<NA><NA>Y
39(주)이노카(본사)서울특별시 구로구 디지털로33길 55511호(구로동, 이앤씨벤쳐드림타워2차)02-1600-670402-6330-1217N
40삼성에스디에스(구 에스엔에스)서울특별시 송파구 올림픽로 509-0j빌딩 3층 엠피온02-1544-820202-487-6022Y
41포스데이타(본사)경기도 성남시 분당구 황새울로311번길 9 (서현동)031-779-1867031-779-2525N
42(주)현대유비스경기도 이천시 호법면 이섭대천로 749-12031-638-8374031-638-8374N
43(주)자임경기도 고양시 일산동구 동국로 32-0122호070-4312-7418031-994-3693Y