Overview

Dataset statistics

Number of variables3
Number of observations26
Missing cells11
Missing cells (%)14.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory756.0 B
Average record size in memory29.1 B

Variable types

Text3

Dataset

Description노후 경유차에서 발생하는 대기오염물질을 줄이기 위한 저감장치를 제작하는 저감장치 제작사 현황자료(저감장치 제작사 연락처, 주소 정보 등)
Author한국환경공단
URLhttps://www.data.go.kr/data/15069236/fileData.do

Alerts

연락처 has 11 (42.3%) missing valuesMissing
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:27:52.530182
Analysis finished2023-12-12 23:27:52.931602
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T08:27:53.083487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length12
Mean length8.8846154
Min length5

Characters and Unicode

Total characters231
Distinct characters100
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row(주)블루플래닛
2nd row(주)세라컴
3rd row(주)씨엠씨
4th row(주)알란텀
5th row(주)에코닉스
ValueCountFrequency (%)
주식회사 2
 
6.5%
주)블루플래닛 1
 
3.2%
주)세라컴 1
 
3.2%
화이버텍(주 1
 
3.2%
현대모비스(주 1
 
3.2%
system 1
 
3.2%
auto 1
 
3.2%
알오씨오토시스템(roc 1
 
3.2%
존슨매티카탈리스트코리아 1
 
3.2%
화성사업장 1
 
3.2%
Other values (20) 20
64.5%
2023-12-13T08:27:53.492681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
10.4%
( 22
 
9.5%
) 22
 
9.5%
10
 
4.3%
8
 
3.5%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.7%
Other values (90) 121
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 167
72.3%
Open Punctuation 22
 
9.5%
Close Punctuation 22
 
9.5%
Lowercase Letter 10
 
4.3%
Space Separator 5
 
2.2%
Uppercase Letter 5
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
14.4%
10
 
6.0%
8
 
4.8%
5
 
3.0%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
Other values (75) 95
56.9%
Lowercase Letter
ValueCountFrequency (%)
o 2
20.0%
t 2
20.0%
m 1
10.0%
s 1
10.0%
y 1
10.0%
u 1
10.0%
c 1
10.0%
e 1
10.0%
Uppercase Letter
ValueCountFrequency (%)
S 2
40.0%
A 1
20.0%
R 1
20.0%
K 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 167
72.3%
Common 49
 
21.2%
Latin 15
 
6.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
14.4%
10
 
6.0%
8
 
4.8%
5
 
3.0%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
Other values (75) 95
56.9%
Latin
ValueCountFrequency (%)
o 2
13.3%
S 2
13.3%
t 2
13.3%
m 1
6.7%
s 1
6.7%
y 1
6.7%
u 1
6.7%
A 1
6.7%
c 1
6.7%
R 1
6.7%
Other values (2) 2
13.3%
Common
ValueCountFrequency (%)
( 22
44.9%
) 22
44.9%
5
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 167
72.3%
ASCII 64
 
27.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
14.4%
10
 
6.0%
8
 
4.8%
5
 
3.0%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
Other values (75) 95
56.9%
ASCII
ValueCountFrequency (%)
( 22
34.4%
) 22
34.4%
5
 
7.8%
o 2
 
3.1%
S 2
 
3.1%
t 2
 
3.1%
m 1
 
1.6%
s 1
 
1.6%
y 1
 
1.6%
u 1
 
1.6%
Other values (5) 5
 
7.8%

주소
Text

Distinct25
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T08:27:53.748061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length41.5
Mean length34.5
Min length1

Characters and Unicode

Total characters897
Distinct characters183
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row경기도 화성시 마도면 마도공단로2길 46-1 (주)블루플래닛
2nd row충청남도 아산시 온천대로1122번길 46-5 (득산동) (주)세라컴
3rd row충청남도 아산시 선장면 학성로122번길 120 씨엠씨
4th row경기도 성남시 중원구 둔촌대로 400 (상대원동,STARWOOD아파트형공장) (주)알란텀
5th row경기도 파주시 탄현면 방촌로 1144-26 에코닉스
ValueCountFrequency (%)
경기도 13
 
8.2%
서울특별시 6
 
3.8%
5
 
3.1%
화성시 4
 
2.5%
탄현면 3
 
1.9%
파주시 3
 
1.9%
용인시 2
 
1.3%
산업로156번길 2
 
1.3%
권선구 2
 
1.3%
수원시 2
 
1.3%
Other values (110) 117
73.6%
2023-12-13T08:27:54.211566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
152
 
16.9%
1 32
 
3.6%
) 27
 
3.0%
( 27
 
3.0%
25
 
2.8%
2 23
 
2.6%
23
 
2.6%
19
 
2.1%
4 18
 
2.0%
18
 
2.0%
Other values (173) 533
59.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 503
56.1%
Space Separator 152
 
16.9%
Decimal Number 139
 
15.5%
Close Punctuation 27
 
3.0%
Open Punctuation 27
 
3.0%
Uppercase Letter 19
 
2.1%
Other Punctuation 17
 
1.9%
Dash Punctuation 11
 
1.2%
Lowercase Letter 1
 
0.1%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
5.0%
23
 
4.6%
19
 
3.8%
18
 
3.6%
16
 
3.2%
15
 
3.0%
15
 
3.0%
14
 
2.8%
11
 
2.2%
10
 
2.0%
Other values (143) 337
67.0%
Uppercase Letter
ValueCountFrequency (%)
O 3
15.8%
S 2
10.5%
T 2
10.5%
R 2
10.5%
W 2
10.5%
D 2
10.5%
M 1
 
5.3%
B 1
 
5.3%
U 1
 
5.3%
E 1
 
5.3%
Other values (2) 2
10.5%
Decimal Number
ValueCountFrequency (%)
1 32
23.0%
2 23
16.5%
4 18
12.9%
0 17
12.2%
6 14
10.1%
5 13
9.4%
9 7
 
5.0%
3 6
 
4.3%
8 5
 
3.6%
7 4
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 11
64.7%
. 6
35.3%
Space Separator
ValueCountFrequency (%)
152
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 503
56.1%
Common 374
41.7%
Latin 20
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
5.0%
23
 
4.6%
19
 
3.8%
18
 
3.6%
16
 
3.2%
15
 
3.0%
15
 
3.0%
14
 
2.8%
11
 
2.2%
10
 
2.0%
Other values (143) 337
67.0%
Common
ValueCountFrequency (%)
152
40.6%
1 32
 
8.6%
) 27
 
7.2%
( 27
 
7.2%
2 23
 
6.1%
4 18
 
4.8%
0 17
 
4.5%
6 14
 
3.7%
5 13
 
3.5%
- 11
 
2.9%
Other values (7) 40
 
10.7%
Latin
ValueCountFrequency (%)
O 3
15.0%
S 2
10.0%
T 2
10.0%
R 2
10.0%
W 2
10.0%
D 2
10.0%
M 1
 
5.0%
b 1
 
5.0%
B 1
 
5.0%
U 1
 
5.0%
Other values (3) 3
15.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 503
56.1%
ASCII 393
43.8%
Math Operators 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
152
38.7%
1 32
 
8.1%
) 27
 
6.9%
( 27
 
6.9%
2 23
 
5.9%
4 18
 
4.6%
0 17
 
4.3%
6 14
 
3.6%
5 13
 
3.3%
- 11
 
2.8%
Other values (19) 59
 
15.0%
Hangul
ValueCountFrequency (%)
25
 
5.0%
23
 
4.6%
19
 
3.8%
18
 
3.6%
16
 
3.2%
15
 
3.0%
15
 
3.0%
14
 
2.8%
11
 
2.2%
10
 
2.0%
Other values (143) 337
67.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

연락처
Text

MISSING 

Distinct15
Distinct (%)100.0%
Missing11
Missing (%)42.3%
Memory size340.0 B
2023-12-13T08:27:54.402521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12
Min length11

Characters and Unicode

Total characters180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)100.0%

Sample

1st row070-7012-6600
2nd row02-744-1777
3rd row031-358-6752
4th row042-867-2500
5th row051-1644-2402
ValueCountFrequency (%)
070-7012-6600 1
 
6.7%
02-744-1777 1
 
6.7%
031-358-6752 1
 
6.7%
042-867-2500 1
 
6.7%
051-1644-2402 1
 
6.7%
031-270-6300 1
 
6.7%
02-6925-0560 1
 
6.7%
02-565-6721 1
 
6.7%
054-931-6688 1
 
6.7%
02-2121-0114 1
 
6.7%
Other values (5) 5
33.3%
2023-12-13T08:27:54.749543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 33
18.3%
- 30
16.7%
1 22
12.2%
2 20
11.1%
6 16
8.9%
5 14
7.8%
3 13
 
7.2%
7 11
 
6.1%
4 9
 
5.0%
8 8
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 150
83.3%
Dash Punctuation 30
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 33
22.0%
1 22
14.7%
2 20
13.3%
6 16
10.7%
5 14
9.3%
3 13
 
8.7%
7 11
 
7.3%
4 9
 
6.0%
8 8
 
5.3%
9 4
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 33
18.3%
- 30
16.7%
1 22
12.2%
2 20
11.1%
6 16
8.9%
5 14
7.8%
3 13
 
7.2%
7 11
 
6.1%
4 9
 
5.0%
8 8
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 33
18.3%
- 30
16.7%
1 22
12.2%
2 20
11.1%
6 16
8.9%
5 14
7.8%
3 13
 
7.2%
7 11
 
6.1%
4 9
 
5.0%
8 8
 
4.4%

Correlations

2023-12-13T08:27:54.841800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명주소연락처
업체명1.0001.0001.000
주소1.0001.0001.000
연락처1.0001.0001.000

Missing values

2023-12-13T08:27:52.795249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:27:52.896615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명주소연락처
0(주)블루플래닛경기도 화성시 마도면 마도공단로2길 46-1 (주)블루플래닛070-7012-6600
1(주)세라컴충청남도 아산시 온천대로1122번길 46-5 (득산동) (주)세라컴02-744-1777
2(주)씨엠씨충청남도 아산시 선장면 학성로122번길 120 씨엠씨031-358-6752
3(주)알란텀경기도 성남시 중원구 둔촌대로 400 (상대원동,STARWOOD아파트형공장) (주)알란텀<NA>
4(주)에코닉스경기도 파주시 탄현면 방촌로 1144-26 에코닉스<NA>
5(주)에코마스터대전광역시 유성구 가정로 152 (장동,한국에너지기술연구소) 제3연구동 418호042-867-2500
6(주)에코앤드림서울특별시 금천구 가산디지털2로 14 (가산동,대륭테크노타운12차) 912051-1644-2402
7(주)엑시언경기도 수원시 권선구 산업로156번길 201, 4층 404호(고색동 , 디엠프라자)031-270-6300
8(주)이룸지엔지서울특별시 마포구 동교로 191 (동교동,D.B.M빌딩) 5층 501호02-6925-0560
9(주)이알인터내셔널경기도 파주시 탄현면 방촌로 1144-26 (주)이알인터내셔널<NA>
업체명주소연락처
16에이치케이엠엔에스(주)서울특별시 구로구 디지털로 288 (구로동,대륭포스트타워1차) 505호<NA>
17엠즈홀딩스02-3667-5191
18우주씨엔지(주)051-312-8888
19일진전기(주)경기도 화성시 만년로 905-17 (안녕동) .031-220-0500
20일진하이솔루스 화성사업장전라북도 완주군 봉동읍 완주산단5로 97-46 ((주)케이시알) 일진하이솔루스 화성사업장<NA>
21존슨매티카탈리스트코리아경기도 화성시 존슨매티카탈리스트코리아 .031-359-1613
22주식회사 알오씨오토시스템(Roc Auto System)경기도 수원시 권선구 산업로156번길 142-10 (고색동,수원벤처밸리∥) b동 3층 301호<NA>
23현대모비스(주)서울특별시 강남구 테헤란로 203 (역삼동,서울인터내셔널타워) .<NA>
24화이버텍(주)경기도 파주시 탄현면 방촌로995번길 56-0 (화이버텍) 화이버텍(주)<NA>
25후지노테크(주)경기도 안성시 원곡면 섬바위길 44 (반제리 620-1)031-654-2031