Overview

Dataset statistics

Number of variables: 6
Number of observations: 376
Missing cells: 7
Missing cells (%): 0.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 17.8 KiB
Average record size in memory: 48.4 B
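
The missing-cell percentage can be cross-checked from the counts above. The following is a minimal sketch of that arithmetic, assuming the percentage is taken over all cells (variables times observations); the variable names are illustrative only.

```python
# Figures taken from the dataset statistics above.
n_variables = 6
n_observations = 376
missing_cells = 7

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations       # 2256 cells
missing_pct = 100 * missing_cells / total_cells  # share of cells that are missing

print(f"{missing_pct:.1f}%")  # prints 0.3%
```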

Variable types

Text: 4
Categorical: 2

Dataset

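Description: Data on the registration status of specialty construction businesses in Mungyeong-si, Gyeongsangbuk-do, providing the company name, representative, business type, business address, and telephone number.
Author: Mungyeong-si, Gyeongsangbuk-do
URL: https://www.data.go.kr/data/15084410/fileData.do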

Alerts

데이터기준일 has constant value "2023-06-28" (Constant)
전화번호 has 6 (1.6%) missing values (Missing)
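
As an illustration of how alerts of this kind could be derived, here is a minimal sketch that flags a column as constant or as having missing values. The function `column_alerts` is hypothetical and is not the profiling library's own rule set; the phone values are placeholders, not data from the file.

```python
def column_alerts(values):
    """Return alert labels for one column; None marks a missing cell."""
    alerts = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    if n_missing:
        pct = 100 * n_missing / len(values)
        alerts.append(f"Missing: {n_missing} ({pct:.1f}%)")
    # A column whose non-missing cells all share one value is constant.
    if len(set(present)) == 1:
        alerts.append("Constant")
    return alerts

# 데이터기준일: 376 identical values.
date_alerts = column_alerts(["2023-06-28"] * 376)
# 전화번호: 370 placeholder values plus 6 missing cells.
phone_alerts = column_alerts([f"054-000-{i:04d}" for i in range(370)] + [None] * 6)

print(date_alerts)   # ['Constant']
print(phone_alerts)  # ['Missing: 6 (1.6%)']
```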

Reproduction

Analysis started: 2023-12-12 23:42:52.443543
Analysis finished: 2023-12-12 23:42:52.979978
Duration: 0.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
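
A report of this kind could be regenerated roughly as follows. This is a sketch under stated assumptions: the CSV path, the `cp949` encoding (common for Korean public data files, but not confirmed for this one), and the report title are all placeholders, and `build_report` is a hypothetical helper. The third-party imports are placed inside the function so the definition itself has no install-time requirements.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Third-party dependencies: pandas and ydata-profiling.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # dtype=str keeps values such as 054-554-4307 as text.
    df = pd.read_csv(csv_path, encoding=encoding, dtype=str)
    profile = ProfileReport(df, title="Mungyeong specialty construction registrations")
    profile.to_file(html_path)
    return profile
```

A call might look like `build_report("registrations.csv", "report.html")`, where both file names are hypothetical.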

Variables

업체명
Text

Distinct: 240
Distinct (%): 63.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 21
Median length: 7
Mean length: 7.0319149
Min length: 4

Characters and Unicode

Total characters: 2644
Distinct characters: 210
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
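
To make the category counts concrete, here is a minimal sketch using Python's standard `unicodedata` module to tally characters by Unicode general category. The helper `category_counts` is illustrative; the standard library exposes the general category (for example `Lo` for Other Letter, `Ps` for Open Punctuation, `Pe` for Close Punctuation) but not the script or block properties.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of text by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)

# One company name from the sample rows of this variable.
counts = category_counts("(주)가야건설")
print(counts)  # five 'Lo' letters, one 'Ps', one 'Pe'
```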

Unique

Unique: 137
Unique (%): 36.4%
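
The difference between Distinct and Unique is easy to miss. The sketch below assumes that Distinct counts the different values present and Unique counts the values that occur exactly once; the helper name is illustrative, and the input is the five sample rows listed for this variable.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

sample = ["(유한회사)대성건업", "(주)가야건설", "(주)가야건설", "(주)가온", "(주)강유건설"]
result = distinct_and_unique(sample)
print(result)  # (4, 3)
```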

Sample

1st row: (유한회사)대성건업
2nd row: (주)가야건설
3rd row: (주)가야건설
4th row: (주)가온
5th row: (주)강유건설

Value | Count | Frequency (%)
주)대평개발 | 5 | 1.3%
주)지은건설 | 4 | 1.1%
명선건설(주 | 3 | 0.8%
대양건설(주 | 3 | 0.8%
주식회사대건 | 3 | 0.8%
탄탄(주 | 3 | 0.8%
주식회사덕명 | 3 | 0.8%
대고건설(주 | 3 | 0.8%
주)미래플러스 | 3 | 0.8%
주)태양개발 | 3 | 0.8%
Other values (230) | 343 | 91.2%
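
The values in the word-frequency table for this variable lack their outer parentheses (for example `주)대평개발` and `명선건설(주`). That pattern is consistent with splitting each value on whitespace and stripping ASCII punctuation from both ends of every token. The sketch below reproduces the effect; it is an assumption about the tokenisation, not a confirmed description of the library's implementation.

```python
import string
from collections import Counter

def word_counts(values):
    """Count whitespace-separated tokens after stripping edge punctuation."""
    counts = Counter()
    for value in values:
        for token in value.split():
            token = token.strip(string.punctuation)  # only the ends are stripped
            if token:
                counts[token] += 1
    return counts

counts = word_counts(["(주)가야건설", "(주)가야건설", "명선건설(주)"])
print(counts)
```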

Most occurring characters

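Value | Count | Frequency (%)
(not preserved) | 318 | 12.0%
( | 257 | 9.7%
) | 257 | 9.7%
(not preserved) | 218 | 8.2%
(not preserved) | 211 | 8.0%
(not preserved) | 70 | 2.6%
(not preserved) | 64 | 2.4%
(not preserved) | 60 | 2.3%
(not preserved) | 49 | 1.9%
(not preserved) | 38 | 1.4%
Other values (200) | 1102 | 41.7%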

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2109 | 79.8%
Open Punctuation | 257 | 9.7%
Close Punctuation | 257 | 9.7%
Uppercase Letter | 9 | 0.3%
Other Punctuation | 6 | 0.2%
Decimal Number | 6 | 0.2%

Most frequent character per category

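Other Letter
Value | Count | Frequency (%)
(not preserved) | 318 | 15.1%
(not preserved) | 218 | 10.3%
(not preserved) | 211 | 10.0%
(not preserved) | 70 | 3.3%
(not preserved) | 64 | 3.0%
(not preserved) | 60 | 2.8%
(not preserved) | 49 | 2.3%
(not preserved) | 38 | 1.8%
(not preserved) | 35 | 1.7%
(not preserved) | 34 | 1.6%
Other values (184) | 1012 | 48.0%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 22.2%
E | 1 | 11.1%
N | 1 | 11.1%
G | 1 | 11.1%
J | 1 | 11.1%
H | 1 | 11.1%
K | 1 | 11.1%
A | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
. | 3 | 50.0%
& | 2 | 33.3%
/ | 1 | 16.7%

Decimal Number
Value | Count | Frequency (%)
3 | 2 | 33.3%
2 | 2 | 33.3%
0 | 2 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 257 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 257 | 100.0%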

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2109 | 79.8%
Common | 526 | 19.9%
Latin | 9 | 0.3%

Most frequent character per script

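Hangul
Value | Count | Frequency (%)
(not preserved) | 318 | 15.1%
(not preserved) | 218 | 10.3%
(not preserved) | 211 | 10.0%
(not preserved) | 70 | 3.3%
(not preserved) | 64 | 3.0%
(not preserved) | 60 | 2.8%
(not preserved) | 49 | 2.3%
(not preserved) | 38 | 1.8%
(not preserved) | 35 | 1.7%
(not preserved) | 34 | 1.6%
Other values (184) | 1012 | 48.0%

Common
Value | Count | Frequency (%)
( | 257 | 48.9%
) | 257 | 48.9%
. | 3 | 0.6%
& | 2 | 0.4%
3 | 2 | 0.4%
2 | 2 | 0.4%
0 | 2 | 0.4%
/ | 1 | 0.2%

Latin
Value | Count | Frequency (%)
S | 2 | 22.2%
E | 1 | 11.1%
N | 1 | 11.1%
G | 1 | 11.1%
J | 1 | 11.1%
H | 1 | 11.1%
K | 1 | 11.1%
A | 1 | 11.1%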

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2109 | 79.8%
ASCII | 535 | 20.2%

Most frequent character per block

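Hangul
Value | Count | Frequency (%)
(not preserved) | 318 | 15.1%
(not preserved) | 218 | 10.3%
(not preserved) | 211 | 10.0%
(not preserved) | 70 | 3.3%
(not preserved) | 64 | 3.0%
(not preserved) | 60 | 2.8%
(not preserved) | 49 | 2.3%
(not preserved) | 38 | 1.8%
(not preserved) | 35 | 1.7%
(not preserved) | 34 | 1.6%
Other values (184) | 1012 | 48.0%

ASCII
Value | Count | Frequency (%)
( | 257 | 48.0%
) | 257 | 48.0%
. | 3 | 0.6%
S | 2 | 0.4%
& | 2 | 0.4%
3 | 2 | 0.4%
2 | 2 | 0.4%
0 | 2 | 0.4%
E | 1 | 0.2%
N | 1 | 0.2%
Other values (6) | 6 | 1.1%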

대표자
Text

Distinct: 233
Distinct (%): 62.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 7
Median length: 3
Mean length: 3.0478723
Min length: 2

Characters and Unicode

Total characters: 1146
Distinct characters: 136
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 131
Unique (%): 34.8%

Sample

1st row: 허만분
2nd row: 권재영
3rd row: 권재영
4th row: 이미화
5th row: 함정숙

Value | Count | Frequency (%)
김성식 | 5 | 1.3%
김호찬 | 5 | 1.3%
이용희 | 4 | 1.1%
장성두 | 4 | 1.1%
김종진 | 3 | 0.8%
박병용 | 3 | 0.8%
박종문 | 3 | 0.8%
곽경택 | 3 | 0.8%
김택희 | 3 | 0.8%
김시환 | 3 | 0.8%
Other values (223) | 340 | 90.4%

Most occurring characters

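Value | Count | Frequency (%)
(not preserved) | 85 | 7.4%
(not preserved) | 44 | 3.8%
(not preserved) | 44 | 3.8%
(not preserved) | 40 | 3.5%
(not preserved) | 36 | 3.1%
(not preserved) | 35 | 3.1%
(not preserved) | 28 | 2.4%
(not preserved) | 28 | 2.4%
(not preserved) | 22 | 1.9%
(not preserved) | 22 | 1.9%
Other values (126) | 762 | 66.5%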

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1141 | 99.6%
Other Punctuation | 5 | 0.4%

Most frequent character per category

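Other Letter
Value | Count | Frequency (%)
(not preserved) | 85 | 7.4%
(not preserved) | 44 | 3.9%
(not preserved) | 44 | 3.9%
(not preserved) | 40 | 3.5%
(not preserved) | 36 | 3.2%
(not preserved) | 35 | 3.1%
(not preserved) | 28 | 2.5%
(not preserved) | 28 | 2.5%
(not preserved) | 22 | 1.9%
(not preserved) | 22 | 1.9%
Other values (125) | 757 | 66.3%

Other Punctuation
Value | Count | Frequency (%)
, | 5 | 100.0%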

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1141 | 99.6%
Common | 5 | 0.4%

Most frequent character per script

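Hangul
Value | Count | Frequency (%)
(not preserved) | 85 | 7.4%
(not preserved) | 44 | 3.9%
(not preserved) | 44 | 3.9%
(not preserved) | 40 | 3.5%
(not preserved) | 36 | 3.2%
(not preserved) | 35 | 3.1%
(not preserved) | 28 | 2.5%
(not preserved) | 28 | 2.5%
(not preserved) | 22 | 1.9%
(not preserved) | 22 | 1.9%
Other values (125) | 757 | 66.3%

Common
Value | Count | Frequency (%)
, | 5 | 100.0%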

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1141 | 99.6%
ASCII | 5 | 0.4%

Most frequent character per block

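Hangul
Value | Count | Frequency (%)
(not preserved) | 85 | 7.4%
(not preserved) | 44 | 3.9%
(not preserved) | 44 | 3.9%
(not preserved) | 40 | 3.5%
(not preserved) | 36 | 3.2%
(not preserved) | 35 | 3.1%
(not preserved) | 28 | 2.5%
(not preserved) | 28 | 2.5%
(not preserved) | 22 | 1.9%
(not preserved) | 22 | 1.9%
Other values (125) | 757 | 66.3%

ASCII
Value | Count | Frequency (%)
, | 5 | 100.0%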

업종
Categorical

Distinct: 11
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
철근ㆍ콘크리트공사업 | 90
지반조성ㆍ포장공사업 | 90
가스난방공사업 | 52
상ㆍ하수도설비공사업 | 33
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 24
Other values (6) | 87

Length

Max length: 17
Median length: 10
Mean length: 10.082447
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 도장ㆍ습식ㆍ방수ㆍ석공사업
2nd row: 철근ㆍ콘크리트공사업
3rd row: 구조물해체ㆍ비계공사업
4th row: 실내건축공사업
5th row: 철근ㆍ콘크리트공사업

Common Values

Value | Count | Frequency (%)
철근ㆍ콘크리트공사업 | 90 | 23.9%
지반조성ㆍ포장공사업 | 90 | 23.9%
가스난방공사업 | 52 | 13.8%
상ㆍ하수도설비공사업 | 33 | 8.8%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 24 | 6.4%
도장ㆍ습식ㆍ방수ㆍ석공사업 | 21 | 5.6%
시설물유지관리업 | 16 | 4.3%
구조물해체ㆍ비계공사업 | 14 | 3.7%
조경식재ㆍ시설물공사업 | 14 | 3.7%
기계가스설비공사업 | 13 | 3.5%

Length

[Figure: Histogram of lengths of the category]

도로명주소
Text

Distinct: 222
Distinct (%): 59.2%
Missing: 1
Missing (%): 0.3%
Memory size: 3.1 KiB

Length

Max length: 41
Median length: 35
Mean length: 23.568
Min length: 15

Characters and Unicode

Total characters: 8838
Distinct characters: 162
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 119
Unique (%): 31.7%

Sample

1st row: 경상북도 문경시 영신로 18 (점촌동)
2nd row: 경상북도 문경시 진곡길 76,101호(공평동,가야빌딩주2)
3rd row: 경상북도 문경시 진곡길 76,101호(공평동,가야빌딩주2)
4th row: 경상북도 문경시 호서로 21 (점촌동)
5th row: 경상북도 문경시 반쟁이3길 21 (모전동)

Value | Count | Frequency (%)
문경시 | 375 | 19.0%
경상북도 | 370 | 18.8%
모전동 | 107 | 5.4%
점촌동 | 48 | 2.4%
산양면 | 37 | 1.9%
흥덕동 | 35 | 1.8%
신흥로 | 32 | 1.6%
중앙로 | 29 | 1.5%
2층 | 25 | 1.3%
1층 | 23 | 1.2%
Other values (301) | 890 | 45.2%

Most occurring characters

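Value | Count | Frequency (%)
(space) | 1596 | 18.1%
(not preserved) | 782 | 8.8%
(not preserved) | 402 | 4.5%
(not preserved) | 387 | 4.4%
(not preserved) | 380 | 4.3%
(not preserved) | 372 | 4.2%
(not preserved) | 370 | 4.2%
1 | 321 | 3.6%
2 | 272 | 3.1%
(not preserved) | 265 | 3.0%
Other values (152) | 3691 | 41.8%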

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5249 | 59.4%
Space Separator | 1596 | 18.1%
Decimal Number | 1320 | 14.9%
Close Punctuation | 253 | 2.9%
Open Punctuation | 248 | 2.8%
Dash Punctuation | 108 | 1.2%
Other Punctuation | 60 | 0.7%
Uppercase Letter | 4 | < 0.1%

Most frequent character per category

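Other Letter
Value | Count | Frequency (%)
(not preserved) | 782 | 14.9%
(not preserved) | 402 | 7.7%
(not preserved) | 387 | 7.4%
(not preserved) | 380 | 7.2%
(not preserved) | 372 | 7.1%
(not preserved) | 370 | 7.0%
(not preserved) | 265 | 5.0%
(not preserved) | 196 | 3.7%
(not preserved) | 180 | 3.4%
(not preserved) | 125 | 2.4%
Other values (133) | 1790 | 34.1%

Decimal Number
Value | Count | Frequency (%)
1 | 321 | 24.3%
2 | 272 | 20.6%
3 | 143 | 10.8%
0 | 136 | 10.3%
4 | 124 | 9.4%
6 | 102 | 7.7%
8 | 67 | 5.1%
5 | 66 | 5.0%
9 | 49 | 3.7%
7 | 40 | 3.0%

Uppercase Letter
Value | Count | Frequency (%)
F | 2 | 50.0%
S | 1 | 25.0%
K | 1 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
, | 35 | 58.3%
(not preserved) | 25 | 41.7%

Space Separator
Value | Count | Frequency (%)
(space) | 1596 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 253 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 248 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 108 | 100.0%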

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5249 | 59.4%
Common | 3585 | 40.6%
Latin | 4 | < 0.1%

Most frequent character per script

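Hangul
Value | Count | Frequency (%)
(not preserved) | 782 | 14.9%
(not preserved) | 402 | 7.7%
(not preserved) | 387 | 7.4%
(not preserved) | 380 | 7.2%
(not preserved) | 372 | 7.1%
(not preserved) | 370 | 7.0%
(not preserved) | 265 | 5.0%
(not preserved) | 196 | 3.7%
(not preserved) | 180 | 3.4%
(not preserved) | 125 | 2.4%
Other values (133) | 1790 | 34.1%

Common
Value | Count | Frequency (%)
(space) | 1596 | 44.5%
1 | 321 | 9.0%
2 | 272 | 7.6%
) | 253 | 7.1%
( | 248 | 6.9%
3 | 143 | 4.0%
0 | 136 | 3.8%
4 | 124 | 3.5%
- | 108 | 3.0%
6 | 102 | 2.8%
Other values (6) | 282 | 7.9%

Latin
Value | Count | Frequency (%)
F | 2 | 50.0%
S | 1 | 25.0%
K | 1 | 25.0%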

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5249 | 59.4%
ASCII | 3564 | 40.3%
None | 25 | 0.3%

Most frequent character per block

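ASCII
Value | Count | Frequency (%)
(space) | 1596 | 44.8%
1 | 321 | 9.0%
2 | 272 | 7.6%
) | 253 | 7.1%
( | 248 | 7.0%
3 | 143 | 4.0%
0 | 136 | 3.8%
4 | 124 | 3.5%
- | 108 | 3.0%
6 | 102 | 2.9%
Other values (8) | 261 | 7.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 782 | 14.9%
(not preserved) | 402 | 7.7%
(not preserved) | 387 | 7.4%
(not preserved) | 380 | 7.2%
(not preserved) | 372 | 7.1%
(not preserved) | 370 | 7.0%
(not preserved) | 265 | 5.0%
(not preserved) | 196 | 3.7%
(not preserved) | 180 | 3.4%
(not preserved) | 125 | 2.4%
Other values (133) | 1790 | 34.1%

None
Value | Count | Frequency (%)
(not preserved) | 25 | 100.0%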

전화번호
Text

MISSING 

Distinct: 229
Distinct (%): 61.9%
Missing: 6
Missing (%): 1.6%
Memory size: 3.1 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.024324
Min length: 12

Characters and Unicode

Total characters: 4449
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
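
The length and character figures for this variable can be cross-checked against each other. The sketch below is a worked calculation using only numbers stated in this section; it assumes the mean length is taken over non-missing values. Because every value is either 12 or 13 characters long (the stated minimum and maximum), the character total also fixes how many values have 13 characters.

```python
# Figures stated for 전화번호 in this report.
n_observations = 376
n_missing = 6
total_characters = 4449

n_present = n_observations - n_missing       # 370 non-missing values
mean_length = total_characters / n_present   # matches the reported mean length

# Lengths are 12 or 13 only, so each 13-character value adds one extra character.
n_len13 = total_characters - 12 * n_present

print(round(mean_length, 6), n_len13)  # prints 12.024324 9
```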

Unique

Unique: 128
Unique (%): 34.6%

Sample

1st row: 054-554-4307
2nd row: 054-554-5333
3rd row: 054-554-5333
4th row: 054-553-7708
5th row: 054-552-5540

Value | Count | Frequency (%)
054-554-5333 | 5 | 1.4%
054-571-8507 | 5 | 1.4%
054-554-3077 | 4 | 1.1%
054-552-9902 | 4 | 1.1%
054-571-3224 | 3 | 0.8%
070-4961-4416 | 3 | 0.8%
054-556-5197 | 3 | 0.8%
054-552-1179 | 3 | 0.8%
054-976-8033 | 3 | 0.8%
054-556-3383 | 3 | 0.8%
Other values (219) | 334 | 90.3%

Most occurring characters

Value | Count | Frequency (%)
5 | 1179 | 26.5%
- | 740 | 16.6%
4 | 636 | 14.3%
0 | 635 | 14.3%
3 | 237 | 5.3%
1 | 212 | 4.8%
7 | 205 | 4.6%
2 | 197 | 4.4%
6 | 163 | 3.7%
8 | 139 | 3.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3709 | 83.4%
Dash Punctuation | 740 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 1179 | 31.8%
4 | 636 | 17.1%
0 | 635 | 17.1%
3 | 237 | 6.4%
1 | 212 | 5.7%
7 | 205 | 5.5%
2 | 197 | 5.3%
6 | 163 | 4.4%
8 | 139 | 3.7%
9 | 106 | 2.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 740 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 4449 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 1179 | 26.5%
- | 740 | 16.6%
4 | 636 | 14.3%
0 | 635 | 14.3%
3 | 237 | 5.3%
1 | 212 | 4.8%
7 | 205 | 4.6%
2 | 197 | 4.4%
6 | 163 | 3.7%
8 | 139 | 3.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4449 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 1179 | 26.5%
- | 740 | 16.6%
4 | 636 | 14.3%
0 | 635 | 14.3%
3 | 237 | 5.3%
1 | 212 | 4.8%
7 | 205 | 4.6%
2 | 197 | 4.4%
6 | 163 | 3.7%
8 | 139 | 3.1%

데이터기준일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
2023-06-28 | 376

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-06-28
2nd row: 2023-06-28
3rd row: 2023-06-28
4th row: 2023-06-28
5th row: 2023-06-28

Common Values

Value | Count | Frequency (%)
2023-06-28 | 376 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
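
Nullity correlation can be illustrated as the Pearson correlation between the missing-value indicators of two columns. The sketch below uses that common definition; the function name and the toy columns are hypothetical, and the report's own heatmap may be computed differently.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None  # undefined when a column is never (or always) missing
    return cov / sqrt(var_a * var_b)

# Toy columns: None marks a missing cell.
phone = ["a", None, "c", None]
address = ["x", None, "z", "w"]
r = nullity_correlation(phone, address)
print(round(r, 3))  # prints 0.577
```

A value near 1 means the two columns tend to be missing in the same rows; a column with no missing cells has no defined nullity correlation, which the sketch reports as `None`.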

Sample

# | 업체명 | 대표자 | 업종 | 도로명주소 | 전화번호 | 데이터기준일
0 | (유한회사)대성건업 | 허만분 | 도장ㆍ습식ㆍ방수ㆍ석공사업 | 경상북도 문경시 영신로 18 (점촌동) | 054-554-4307 | 2023-06-28
1 | (주)가야건설 | 권재영 | 철근ㆍ콘크리트공사업 | 경상북도 문경시 진곡길 76,101호(공평동,가야빌딩주2) | 054-554-5333 | 2023-06-28
2 | (주)가야건설 | 권재영 | 구조물해체ㆍ비계공사업 | 경상북도 문경시 진곡길 76,101호(공평동,가야빌딩주2) | 054-554-5333 | 2023-06-28
3 | (주)가온 | 이미화 | 실내건축공사업 | 경상북도 문경시 호서로 21 (점촌동) | 054-553-7708 | 2023-06-28
4 | (주)강유건설 | 함정숙 | 철근ㆍ콘크리트공사업 | 경상북도 문경시 반쟁이3길 21 (모전동) | 054-552-5540 | 2023-06-28
5 | (주)강인환경건설 | 강구 | 상ㆍ하수도설비공사업 | 경상북도 문경시 신기공단2길 46 (신기동) | 054-553-4994 | 2023-06-28
6 | (주)건영산업 | 신숙희 | 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 경상북도 문경시 중앙로 168, 2층 (흥덕동) | 054-554-6701 | 2023-06-28
7 | (주)건화건설 | 박건화 | 상ㆍ하수도설비공사업 | 경상북도 문경시 임촌길 24-4 (공평동) | 054-556-0404 | 2023-06-28
8 | (주)건화건설 | 박건화 | 철근ㆍ콘크리트공사업 | 경상북도 문경시 임촌길 24-4 (공평동) | 054-556-0404 | 2023-06-28
9 | (주)건화건설 | 박건화 | 지반조성ㆍ포장공사업 | 경상북도 문경시 임촌길 24-4 (공평동) | 054-556-0404 | 2023-06-28

# | 업체명 | 대표자 | 업종 | 도로명주소 | 전화번호 | 데이터기준일
366 | 형진건설(주) | 고성진 | 지반조성ㆍ포장공사업 | 경상북도 문경시 매봉2길 26, 4층 402호 (모전동) | 054-556-8100 | 2023-06-28
367 | 형진건설(주) | 고성진 | 상ㆍ하수도설비공사업 | 경상북도 문경시 매봉2길 26, 4층 402호 (모전동) | 054-556-8100 | 2023-06-28
368 | 호진기계(주) | 권미향 | 조경식재ㆍ시설물공사업 | 경상북도 문경시 신기산단1길 100(신기동) | 054-552-2597 | 2023-06-28
369 | 홍익건설주식회사 | 홍주영 | 철근ㆍ콘크리트공사업 | 경상북도 문경시 배실앞길 3 (공평동) | 054-552-2892 | 2023-06-28
370 | 효성건설주식회사 | 장재환 | 금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 | 경상북도 문경시 반쟁이1길 1 (모전동) | 054-555-1644 | 2023-06-28
371 | 효성건설주식회사 | 장재환 | 지반조성ㆍ포장공사업 | 경상북도 문경시 반쟁이1길 1 (모전동) | 054-555-1644 | 2023-06-28
372 | 효성건설주식회사 | 장재환 | 철근ㆍ콘크리트공사업 | 경상북도 문경시 반쟁이1길 1 (모전동) | 054-555-1644 | 2023-06-28
373 | 흥남건설(주) | 김남진 | 철근ㆍ콘크리트공사업 | 경상북도 문경시 영순면 포내로 42-23 | 054-553-7997 | 2023-06-28
374 | 흥남건설(주) | 김남진 | 지반조성ㆍ포장공사업 | 경상북도 문경시 영순면 포내로 42-23 | 054-553-7997 | 2023-06-28
375 | 흥남건설(주) | 김남진 | 도장ㆍ습식ㆍ방수ㆍ석공사업 | 경상북도 문경시 영순면 포내로 42-23 | 054-553-7997 | 2023-06-28