Overview

Dataset statistics

Number of variables: 4
Number of observations: 86
Missing cells: 2
Missing cells (%): 0.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 33.5 B
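
The missing-cell percentage can be re-derived from the counts. The sketch below assumes the percentage is the number of missing cells divided by variables × observations; that formula is an assumption and is not stated in the report.

```python
# Counts copied from the dataset statistics.
n_variables = 4
n_observations = 86
missing_cells = 2

# Assumed definition: missing cells over all cells in the table.
total_cells = n_variables * n_observations        # 344 cells
missing_pct = 100 * missing_cells / total_cells   # about 0.58

print(f"Missing cells (%): {missing_pct:.1f}%")
```

Rounded to one decimal place, this gives the reported 0.6%.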

Variable types

Text: 4

Dataset

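Description: Data on the member companies of the construction and building materials association in the Daejeon Metropolitan City area, providing fields such as business type, company name, location, and telephone number.
URL: https://www.data.go.kr/data/15061021/fileData.do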

Alerts

연락처 has 2 (2.3%) missing values [Missing]
상호명 has unique values [Unique]
소재지 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 18:21:35.838534
Analysis finished: 2023-12-12 18:21:36.400798
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종 (business type)
Text

Distinct: 83
Distinct (%): 96.5%
Missing: 0
Missing (%): 0.0%
Memory size: 820.0 B

Length

Max length: 36
Median length: 22
Mean length: 12.918605
Min length: 2

Characters and Unicode

Total characters: 1111
Distinct characters: 224
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
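
As a rough illustration of how such category counts can be obtained, the sketch below tallies the Unicode General_Category of each character in one value of this column (the first sample row). It uses Python's standard `unicodedata` module, which is an assumption about tooling rather than the report's own implementation; note that this module exposes categories but not script or block names.

```python
import unicodedata
from collections import Counter

# First sample value of the 업종 column, copied from the report.
value = "가구, 실내건축"

# General_Category code per character:
# Lo = Other Letter, Po = Other Punctuation, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in value)

print(categories)
```

For this value the tally is six Other Letter characters, one Other Punctuation character (the comma), and one Space Separator.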

Unique

Unique: 80
Unique (%): 93.0%
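
The Distinct and Unique figures differ. A plausible reading, assumed here rather than stated in the report, is that Distinct counts the different values present while Unique counts the values that occur exactly once. The sketch below shows the distinction on a hypothetical toy column that is not taken from the dataset.

```python
from collections import Counter

# Hypothetical toy column, not taken from the dataset.
values = ["철물", "파이프", "철물", "방화문"]

counts = Counter(values)
n_distinct = len(counts)                              # different values present
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(n_distinct, n_unique)
```

Under that reading, 83 distinct and 80 unique values across 86 observations would be consistent with three values each appearing twice (80 + 3 × 2 = 86).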

Sample

1st row: 가구, 실내건축
2nd row: 메트제작, 실리콘, 우레탄 폼
3rd row: 방화문
4th row: 차단기·스위치류·산업용제어장치
5th row: 가구용 하드웨어 및 가구원부자재

Value | Count | Frequency (%)
도소매 | 4 | 2.4%
가로등 | 3 | 1.8%
[?] | 3 | 1.8%
벽지,바닥재 | 2 | 1.2%
[?] | 2 | 1.2%
건설,제조,도소매 | 2 | 1.2%
파이프 | 2 | 1.2%
철물 | 2 | 1.2%
콘크리트 | 2 | 1.2%
led조명,led투광등,led보안등 | 2 | 1.2%
Other values (143) | 143 | 85.6%

Most occurring characters

Value | Count | Frequency (%)
, | 99 | 8.9%
(space) | 82 | 7.4%
[?] | 26 | 2.3%
[?] | 25 | 2.3%
[?] | 23 | 2.1%
[?] | 21 | 1.9%
[?] | 19 | 1.7%
[?] | 19 | 1.7%
[?] | 19 | 1.7%
[?] | 18 | 1.6%
Other values (214) | 760 | 68.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 859 | 77.3%
Other Punctuation | 120 | 10.8%
Space Separator | 82 | 7.4%
Uppercase Letter | 39 | 3.5%
Close Punctuation | 3 | 0.3%
Dash Punctuation | 3 | 0.3%
Open Punctuation | 3 | 0.3%
Lowercase Letter | 2 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 26 | 3.0%
[?] | 25 | 2.9%
[?] | 23 | 2.7%
[?] | 21 | 2.4%
[?] | 19 | 2.2%
[?] | 19 | 2.2%
[?] | 19 | 2.2%
[?] | 18 | 2.1%
[?] | 18 | 2.1%
[?] | 15 | 1.7%
Other values (191) | 656 | 76.4%

Uppercase Letter
Value | Count | Frequency (%)
L | 9 | 23.1%
D | 8 | 20.5%
E | 8 | 20.5%
P | 3 | 7.7%
F | 2 | 5.1%
G | 2 | 5.1%
X | 1 | 2.6%
O | 1 | 2.6%
B | 1 | 2.6%
H | 1 | 2.6%
Other values (3) | 3 | 7.7%

Other Punctuation
Value | Count | Frequency (%)
, | 99 | 82.5%
· | 12 | 10.0%
. | 8 | 6.7%
/ | 1 | 0.8%

Lowercase Letter
Value | Count | Frequency (%)
w | 1 | 50.0%
o | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 82 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 859 | 77.3%
Common | 211 | 19.0%
Latin | 41 | 3.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 26 | 3.0%
[?] | 25 | 2.9%
[?] | 23 | 2.7%
[?] | 21 | 2.4%
[?] | 19 | 2.2%
[?] | 19 | 2.2%
[?] | 19 | 2.2%
[?] | 18 | 2.1%
[?] | 18 | 2.1%
[?] | 15 | 1.7%
Other values (191) | 656 | 76.4%

Latin
Value | Count | Frequency (%)
L | 9 | 22.0%
D | 8 | 19.5%
E | 8 | 19.5%
P | 3 | 7.3%
F | 2 | 4.9%
G | 2 | 4.9%
X | 1 | 2.4%
O | 1 | 2.4%
B | 1 | 2.4%
H | 1 | 2.4%
Other values (5) | 5 | 12.2%

Common
Value | Count | Frequency (%)
, | 99 | 46.9%
(space) | 82 | 38.9%
· | 12 | 5.7%
. | 8 | 3.8%
) | 3 | 1.4%
- | 3 | 1.4%
( | 3 | 1.4%
/ | 1 | 0.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 859 | 77.3%
ASCII | 240 | 21.6%
None | 12 | 1.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
, | 99 | 41.2%
(space) | 82 | 34.2%
L | 9 | 3.8%
. | 8 | 3.3%
D | 8 | 3.3%
E | 8 | 3.3%
) | 3 | 1.2%
- | 3 | 1.2%
P | 3 | 1.2%
( | 3 | 1.2%
Other values (12) | 14 | 5.8%

Hangul
Value | Count | Frequency (%)
[?] | 26 | 3.0%
[?] | 25 | 2.9%
[?] | 23 | 2.7%
[?] | 21 | 2.4%
[?] | 19 | 2.2%
[?] | 19 | 2.2%
[?] | 19 | 2.2%
[?] | 18 | 2.1%
[?] | 18 | 2.1%
[?] | 15 | 1.7%
Other values (191) | 656 | 76.4%

None
Value | Count | Frequency (%)
· | 12 | 100.0%

상호명 (company name)
Text

UNIQUE 

Distinct: 86
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 820.0 B

Length

Max length: 22
Median length: 13
Mean length: 6.9186047
Min length: 3

Characters and Unicode

Total characters: 595
Distinct characters: 177
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
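
Company names in this column use the single-character sign ㈜ in some rows and the spelled-out form (주) in others. The sketch below, which is an illustration and not part of the report, shows that Python's `unicodedata` classifies ㈜ as Other Symbol (`So`) and that NFKC compatibility normalization expands it to the parenthesized form. Whether such normalization suits a given analysis is an assumption left to the reader.

```python
import unicodedata

# First sample value of the 상호명 column, copied from the report.
name = "㈜아트맥"

# The enclosed sign is a symbol, not a letter.
sign_category = unicodedata.category("㈜")  # "So" = Other Symbol

# NFKC replaces the compatibility character with "(" + "주" + ")".
normalized = unicodedata.normalize("NFKC", name)

print(sign_category, normalized)
```

After normalization, names written with ㈜ and names written with (주) share the same prefix, which can make string matching between the two spellings simpler.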

Unique

Unique: 86
Unique (%): 100.0%

Sample

1st row: ㈜아트맥
2nd row: 부광케미칼(산행간사)
3rd row: ㈜야무진
4th row: ㈜명일전기
5th row: 목림㈜

Value | Count | Frequency (%)
주식회사 | 2 | 2.2%
㈜아트맥 | 1 | 1.1%
㈜티에스알라딘 | 1 | 1.1%
주안산업 | 1 | 1.1%
와이디글라스산업㈜ | 1 | 1.1%
㈜삼덕케미칼 | 1 | 1.1%
대산벽난로 | 1 | 1.1%
㈜에이원시스템공조 | 1 | 1.1%
대원창호 | 1 | 1.1%
주)일심전기 | 1 | 1.1%
Other values (79) | 79 | 87.8%

Most occurring characters

Value | Count | Frequency (%)
[?] | 59 | 9.9%
[?] | 18 | 3.0%
[?] | 16 | 2.7%
[?] | 16 | 2.7%
[?] | 16 | 2.7%
[?] | 13 | 2.2%
[?] | 12 | 2.0%
[?] | 12 | 2.0%
[?] | 11 | 1.8%
[?] | 11 | 1.8%
Other values (167) | 411 | 69.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 504 | 84.7%
Other Symbol | 59 | 9.9%
Open Punctuation | 9 | 1.5%
Close Punctuation | 9 | 1.5%
Uppercase Letter | 8 | 1.3%
Space Separator | 5 | 0.8%
Connector Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 18 | 3.6%
[?] | 16 | 3.2%
[?] | 16 | 3.2%
[?] | 16 | 3.2%
[?] | 13 | 2.6%
[?] | 12 | 2.4%
[?] | 12 | 2.4%
[?] | 11 | 2.2%
[?] | 11 | 2.2%
[?] | 10 | 2.0%
Other values (155) | 369 | 73.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 25.0%
C | 1 | 12.5%
V | 1 | 12.5%
P | 1 | 12.5%
D | 1 | 12.5%
T | 1 | 12.5%
I | 1 | 12.5%

Other Symbol
Value | Count | Frequency (%)
[?] | 59 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 9 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 9 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 563 | 94.6%
Common | 24 | 4.0%
Latin | 8 | 1.3%

Most frequent character per script

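Hangul
Value | Count | Frequency (%)
[?] | 59 | 10.5%
[?] | 18 | 3.2%
[?] | 16 | 2.8%
[?] | 16 | 2.8%
[?] | 16 | 2.8%
[?] | 13 | 2.3%
[?] | 12 | 2.1%
[?] | 12 | 2.1%
[?] | 11 | 2.0%
[?] | 11 | 2.0%
Other values (156) | 379 | 67.3%

Latin
Value | Count | Frequency (%)
S | 2 | 25.0%
C | 1 | 12.5%
V | 1 | 12.5%
P | 1 | 12.5%
D | 1 | 12.5%
T | 1 | 12.5%
I | 1 | 12.5%

Common
Value | Count | Frequency (%)
( | 9 | 37.5%
) | 9 | 37.5%
(space) | 5 | 20.8%
_ | 1 | 4.2%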

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 504 | 84.7%
None | 59 | 9.9%
ASCII | 32 | 5.4%

Most frequent character per block

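None
Value | Count | Frequency (%)
[?] | 59 | 100.0%

Hangul
Value | Count | Frequency (%)
[?] | 18 | 3.6%
[?] | 16 | 3.2%
[?] | 16 | 3.2%
[?] | 16 | 3.2%
[?] | 13 | 2.6%
[?] | 12 | 2.4%
[?] | 12 | 2.4%
[?] | 11 | 2.2%
[?] | 11 | 2.2%
[?] | 10 | 2.0%
Other values (155) | 369 | 73.2%

ASCII
Value | Count | Frequency (%)
( | 9 | 28.1%
) | 9 | 28.1%
(space) | 5 | 15.6%
S | 2 | 6.2%
C | 1 | 3.1%
V | 1 | 3.1%
P | 1 | 3.1%
D | 1 | 3.1%
T | 1 | 3.1%
I | 1 | 3.1%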

소재지 (location)
Text

UNIQUE 

Distinct: 86
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 820.0 B

Length

Max length: 53
Median length: 35.5
Mean length: 22.337209
Min length: 12

Characters and Unicode

Total characters: 1921
Distinct characters: 159
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 86
Unique (%): 100.0%

Sample

1st row: 대전 대덕구 한밭대로 1136
2nd row: 대전 대덕구 대화로 135(대화동) 106호
3rd row: 대전광역시 대덕구 한남로150번길 76(오정동)
4th row: 대덕구 오정동 379-4
5th row: 대전 대덕구 오정동 305-10

Value | Count | Frequency (%)
대전 | 43 | 10.5%
대덕구 | 37 | 9.0%
대전광역시 | 22 | 5.4%
대전시 | 14 | 3.4%
유성구 | 14 | 3.4%
서구 | 14 | 3.4%
오정동 | 13 | 3.2%
동구 | 11 | 2.7%
중구 | 6 | 1.5%
대화로 | 5 | 1.2%
Other values (206) | 232 | 56.4%

Most occurring characters

Value | Count | Frequency (%)
(space) | 328 | 17.1%
[?] | 146 | 7.6%
1 | 97 | 5.0%
[?] | 86 | 4.5%
[?] | 85 | 4.4%
[?] | 66 | 3.4%
[?] | 61 | 3.2%
5 | 55 | 2.9%
2 | 46 | 2.4%
3 | 44 | 2.3%
Other values (149) | 907 | 47.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1063 | 55.3%
Decimal Number | 426 | 22.2%
Space Separator | 328 | 17.1%
Dash Punctuation | 32 | 1.7%
Other Punctuation | 22 | 1.1%
Open Punctuation | 21 | 1.1%
Close Punctuation | 20 | 1.0%
Uppercase Letter | 8 | 0.4%
Other Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 146 | 13.7%
[?] | 86 | 8.1%
[?] | 85 | 8.0%
[?] | 66 | 6.2%
[?] | 61 | 5.7%
[?] | 43 | 4.0%
[?] | 39 | 3.7%
[?] | 28 | 2.6%
[?] | 28 | 2.6%
[?] | 26 | 2.4%
Other values (125) | 455 | 42.8%

Decimal Number
Value | Count | Frequency (%)
1 | 97 | 22.8%
5 | 55 | 12.9%
2 | 46 | 10.8%
3 | 44 | 10.3%
0 | 37 | 8.7%
4 | 35 | 8.2%
7 | 32 | 7.5%
6 | 29 | 6.8%
8 | 28 | 6.6%
9 | 23 | 5.4%

Uppercase Letter
Value | Count | Frequency (%)
F | 3 | 37.5%
B | 2 | 25.0%
G | 1 | 12.5%
S | 1 | 12.5%
D | 1 | 12.5%

Other Punctuation
Value | Count | Frequency (%)
, | 15 | 68.2%
: | 5 | 22.7%
/ | 1 | 4.5%
. | 1 | 4.5%

Space Separator
Value | Count | Frequency (%)
(space) | 328 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 32 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 20 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[?] | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1064 | 55.4%
Common | 849 | 44.2%
Latin | 8 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 146 | 13.7%
[?] | 86 | 8.1%
[?] | 85 | 8.0%
[?] | 66 | 6.2%
[?] | 61 | 5.7%
[?] | 43 | 4.0%
[?] | 39 | 3.7%
[?] | 28 | 2.6%
[?] | 28 | 2.6%
[?] | 26 | 2.4%
Other values (126) | 456 | 42.9%

Common
Value | Count | Frequency (%)
(space) | 328 | 38.6%
1 | 97 | 11.4%
5 | 55 | 6.5%
2 | 46 | 5.4%
3 | 44 | 5.2%
0 | 37 | 4.4%
4 | 35 | 4.1%
7 | 32 | 3.8%
- | 32 | 3.8%
6 | 29 | 3.4%
Other values (8) | 114 | 13.4%

Latin
Value | Count | Frequency (%)
F | 3 | 37.5%
B | 2 | 25.0%
G | 1 | 12.5%
S | 1 | 12.5%
D | 1 | 12.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1063 | 55.3%
ASCII | 857 | 44.6%
None | 1 | 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 328 | 38.3%
1 | 97 | 11.3%
5 | 55 | 6.4%
2 | 46 | 5.4%
3 | 44 | 5.1%
0 | 37 | 4.3%
4 | 35 | 4.1%
7 | 32 | 3.7%
- | 32 | 3.7%
6 | 29 | 3.4%
Other values (13) | 122 | 14.2%

Hangul
Value | Count | Frequency (%)
[?] | 146 | 13.7%
[?] | 86 | 8.1%
[?] | 85 | 8.0%
[?] | 66 | 6.2%
[?] | 61 | 5.7%
[?] | 43 | 4.0%
[?] | 39 | 3.7%
[?] | 28 | 2.6%
[?] | 28 | 2.6%
[?] | 26 | 2.4%
Other values (125) | 455 | 42.8%

None
Value | Count | Frequency (%)
[?] | 1 | 100.0%

연락처 (contact number)
Text

MISSING 

Distinct: 84
Distinct (%): 100.0%
Missing: 2
Missing (%): 2.3%
Memory size: 820.0 B

Length

Max length: 14
Median length: 12
Mean length: 12.154762
Min length: 12

Characters and Unicode

Total characters: 1021
Distinct characters: 14
Distinct categories: 5
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 84
Unique (%): 100.0%

Sample

1st row: 042-826-0195
2nd row: 042-639-7900
3rd row: 042-638-0689
4th row: 042-631-5894
5th row: 042-712-6452

Value | Count | Frequency (%)
042-826-0195 | 2 | 2.4%
042-528-7688 | 1 | 1.2%
042-633-0422 | 1 | 1.2%
042-672-2800 | 1 | 1.2%
042-823-0408 | 1 | 1.2%
042-637-6084 | 1 | 1.2%
042-256-4554 | 1 | 1.2%
042-636-3005 | 1 | 1.2%
042-586-7114 | 1 | 1.2%
042-627-5710 | 1 | 1.2%
Other values (73) | 73 | 86.9%

Most occurring characters

Value | Count | Frequency (%)
- | 168 | 16.5%
2 | 154 | 15.1%
0 | 152 | 14.9%
4 | 140 | 13.7%
6 | 79 | 7.7%
1 | 64 | 6.3%
8 | 63 | 6.2%
3 | 57 | 5.6%
5 | 56 | 5.5%
7 | 52 | 5.1%
Other values (4) | 36 | 3.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 844 | 82.7%
Dash Punctuation | 168 | 16.5%
Space Separator | 7 | 0.7%
Math Symbol | 1 | 0.1%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 154 | 18.2%
0 | 152 | 18.0%
4 | 140 | 16.6%
6 | 79 | 9.4%
1 | 64 | 7.6%
8 | 63 | 7.5%
3 | 57 | 6.8%
5 | 56 | 6.6%
7 | 52 | 6.2%
9 | 27 | 3.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 168 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 7 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%
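
The stray `~`, `,` and space characters suggest that a few entries are not plain single phone numbers. A minimal sketch for flagging such entries follows; the pattern is an illustrative assumption about what a well-formed number looks like in this column, not a rule taken from the report.

```python
import re

# Assumed shape of a plain number such as 042-826-0195 (hypothetical rule).
PHONE = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

# Both values appear in the report's sample rows.
samples = ["042-826-0195", "042-627-3117~8"]

# True when the whole string is a single plain number.
flags = {s: bool(PHONE.fullmatch(s)) for s in samples}

print(flags)
```

Here the first value matches the pattern, while the range-style entry `042-627-3117~8` does not and would be flagged for manual review.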

Most occurring scripts

Value | Count | Frequency (%)
Common | 1021 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 168 | 16.5%
2 | 154 | 15.1%
0 | 152 | 14.9%
4 | 140 | 13.7%
6 | 79 | 7.7%
1 | 64 | 6.3%
8 | 63 | 6.2%
3 | 57 | 5.6%
5 | 56 | 5.5%
7 | 52 | 5.1%
Other values (4) | 36 | 3.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1021 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 168 | 16.5%
2 | 154 | 15.1%
0 | 152 | 14.9%
4 | 140 | 13.7%
6 | 79 | 7.7%
1 | 64 | 6.3%
8 | 63 | 6.2%
3 | 57 | 5.6%
5 | 56 | 5.5%
7 | 52 | 5.1%
Other values (4) | 36 | 3.5%

Correlations

Variable | 업종 | 상호명 | 소재지 | 연락처
업종 | 1.000 | 1.000 | 1.000 | 1.000
상호명 | 1.000 | 1.000 | 1.000 | 1.000
소재지 | 1.000 | 1.000 | 1.000 | 1.000
연락처 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 업종 | 상호명 | 소재지 | 연락처
0 | 가구, 실내건축 | ㈜아트맥 | 대전 대덕구 한밭대로 1136 | 042-826-0195
1 | 메트제작, 실리콘, 우레탄 폼 | 부광케미칼(산행간사) | 대전 대덕구 대화로 135(대화동) 106호 | 042-639-7900
2 | 방화문 | ㈜야무진 | 대전광역시 대덕구 한남로150번길 76(오정동) | 042-638-0689
3 | 차단기·스위치류·산업용제어장치 | ㈜명일전기 | 대덕구 오정동 379-4 | 042-631-5894
4 | 가구용 하드웨어 및 가구원부자재 | 목림㈜ | 대전 대덕구 오정동 305-10 | 042-712-6452
5 | 특수도료,페인트,테라코 | ㈜대전테라코 | 대전 중구 중촌동 391-10 | 042-255-7500
6 | 기능성단열재전문업체PF-보드이보드..Low-E 단열재 | (주)엘림IST | 대전시 대덕구 신탄진로 115번안길 23 | 042-627-0099
7 | 공구,산업자재 | ㈜대남기공사 | 대전 대덕구 오정동 339-8 | 042-626-4880
8 | 어닝,천막,파라솔 제조,도소매 | ㈜한미하이텍 | 대전 대덕구 아리랑로55번길 191(신대동) | 042-936-4116
9 | 제조,도매 | ㈜건설자재산업 | 대전 서구 대덕대로185번길63(둔산동) | 042-627-3117~8

Index | 업종 | 상호명 | 소재지 | 연락처
76 | LED, 조명기구 외 | 샤이닉스 | 대전 광역시 대덕구 한남로 135 | 042-623-7200
77 | 파이프 압출, 사출, 내화 충진제 고정틀,가요 전선관, 금형제작 | ㈜유진글로벌 | 대전시 동구 계족로 151 대전지식산업센터 406호 | 044-415-1827
78 | 실내건축공사업, 신축아파트-상가 도배, 장판, 마루공사, 수장공사 | (주)경도엔지니어링 | 대전광역시 서구 원도안로 242번길 15, 302 (GS빌딩) | 042-824-0925
79 | 공구, 철물, 건자재 | 와와툴코리아 | 대전광역시 동구 동대전로 313번지 | 042-625-8916
80 | 통신,제조,도소매 | 진광에스엔씨㈜ | 대전시 동구 대전로 288-48 | 042-272-4555
81 | 석공사업 | ㈜디에이치건설 | 대전광역시 서구 관저동로 170, 건양타워 403호 | 042-544-0508
82 | 기계설비공사, 전문소방공사,가스공사, 탱크검사 외 | ㈜윤진엔지니어링㈜한국이엔지㈜한국티에스아이 | 대전광역시 중구 대둔산로 184번길 57 | 042-586-1885
83 | 도,소매, 제조업 | 네이쳐카본㈜ | 대전광역시 서구 벌곡로 367 | <NA>
84 | 건축공사업, 시설물유지관리업, 금속 창호공사업 | ㈜씨엔에스종합건설 | 대전광역시 대덕구 대전로1087번길 17 | 042-637-8557
85 | 철강자재파이프,스텐파이프(배관자재 외) | ㈜광진종합배관 | 대전시 동구 홍도동 141-1 | 042-621-2441