Overview

Dataset statistics

Number of variables6
Number of observations472
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.7 KiB
Average record size in memory49.3 B

Variable types

Numeric1
Text4
Categorical1

Dataset

Description영천시에 위치하고 있는 전문 건설업의 현황정보(번호, 업체명, 대표자, 업종, 도로명주소,전화번호)를 공공데이터로 제공하고자 합니다.
Author경상북도 영천시
URLhttps://www.data.go.kr/data/15084365/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:36:40.857978
Analysis finished2023-12-12 19:36:41.562088
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct472
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean236.5
Minimum1
Maximum472
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-13T04:36:41.639574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.55
Q1118.75
median236.5
Q3354.25
95-th percentile448.45
Maximum472
Range471
Interquartile range (IQR)235.5

Descriptive statistics

Standard deviation136.39892
Coefficient of variation (CV)0.57673964
Kurtosis-1.2
Mean236.5
Median Absolute Deviation (MAD)118
Skewness0
Sum111628
Variance18604.667
MonotonicityStrictly increasing
2023-12-13T04:36:41.813218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
312 1
 
0.2%
324 1
 
0.2%
323 1
 
0.2%
322 1
 
0.2%
321 1
 
0.2%
320 1
 
0.2%
319 1
 
0.2%
318 1
 
0.2%
317 1
 
0.2%
Other values (462) 462
97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
472 1
0.2%
471 1
0.2%
470 1
0.2%
469 1
0.2%
468 1
0.2%
467 1
0.2%
466 1
0.2%
465 1
0.2%
464 1
0.2%
463 1
0.2%
Distinct312
Distinct (%)66.1%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-13T04:36:42.141418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length7
Mean length7.1525424
Min length4

Characters and Unicode

Total characters3376
Distinct characters215
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique193 ?
Unique (%)40.9%

Sample

1st row(유)기남건설
2nd row(유)기남건설
3rd row(주)건창
4th row(주)건창
5th row(주)건화건설
ValueCountFrequency (%)
주)서진산업개발 6
 
1.3%
주)대한피앤씨건설 4
 
0.8%
신오개발(주 4
 
0.8%
동해건설(주 4
 
0.8%
조양토건(주 4
 
0.8%
주)대안건설 3
 
0.6%
대학이엔씨(주 3
 
0.6%
주)삼조개발 3
 
0.6%
주)대의건설 3
 
0.6%
주)삼성조경 3
 
0.6%
Other values (302) 435
92.2%
2023-12-13T04:36:42.570673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 392
 
11.6%
) 392
 
11.6%
391
 
11.6%
246
 
7.3%
234
 
6.9%
52
 
1.5%
51
 
1.5%
48
 
1.4%
43
 
1.3%
43
 
1.3%
Other values (205) 1484
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2547
75.4%
Open Punctuation 392
 
11.6%
Close Punctuation 392
 
11.6%
Uppercase Letter 34
 
1.0%
Other Punctuation 7
 
0.2%
Other Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
391
 
15.4%
246
 
9.7%
234
 
9.2%
52
 
2.0%
51
 
2.0%
48
 
1.9%
43
 
1.7%
43
 
1.7%
43
 
1.7%
43
 
1.7%
Other values (188) 1353
53.1%
Uppercase Letter
ValueCountFrequency (%)
S 10
29.4%
T 4
 
11.8%
C 3
 
8.8%
E 3
 
8.8%
N 3
 
8.8%
G 3
 
8.8%
L 3
 
8.8%
A 2
 
5.9%
J 1
 
2.9%
H 1
 
2.9%
Other Punctuation
ValueCountFrequency (%)
& 3
42.9%
/ 2
28.6%
. 2
28.6%
Open Punctuation
ValueCountFrequency (%)
( 392
100.0%
Close Punctuation
ValueCountFrequency (%)
) 392
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2551
75.6%
Common 791
 
23.4%
Latin 34
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
391
 
15.3%
246
 
9.6%
234
 
9.2%
52
 
2.0%
51
 
2.0%
48
 
1.9%
43
 
1.7%
43
 
1.7%
43
 
1.7%
43
 
1.7%
Other values (189) 1357
53.2%
Latin
ValueCountFrequency (%)
S 10
29.4%
T 4
 
11.8%
C 3
 
8.8%
E 3
 
8.8%
N 3
 
8.8%
G 3
 
8.8%
L 3
 
8.8%
A 2
 
5.9%
J 1
 
2.9%
H 1
 
2.9%
Common
ValueCountFrequency (%)
( 392
49.6%
) 392
49.6%
& 3
 
0.4%
/ 2
 
0.3%
. 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2547
75.4%
ASCII 825
 
24.4%
None 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 392
47.5%
) 392
47.5%
S 10
 
1.2%
T 4
 
0.5%
C 3
 
0.4%
E 3
 
0.4%
N 3
 
0.4%
& 3
 
0.4%
G 3
 
0.4%
L 3
 
0.4%
Other values (6) 9
 
1.1%
Hangul
ValueCountFrequency (%)
391
 
15.4%
246
 
9.7%
234
 
9.2%
52
 
2.0%
51
 
2.0%
48
 
1.9%
43
 
1.7%
43
 
1.7%
43
 
1.7%
43
 
1.7%
Other values (188) 1353
53.1%
None
ValueCountFrequency (%)
4
100.0%
Distinct304
Distinct (%)64.4%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-13T04:36:42.971864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.1207627
Min length2

Characters and Unicode

Total characters1473
Distinct characters158
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)38.8%

Sample

1st row김기출
2nd row김기출
3rd row임호규
4th row임호규
5th row조만환
ValueCountFrequency (%)
김학태 6
 
1.3%
권기한 5
 
1.1%
김종수 4
 
0.8%
김학주,정영수 4
 
0.8%
홍종옥 4
 
0.8%
조정식 4
 
0.8%
정영목 4
 
0.8%
김일창 4
 
0.8%
정병창 3
 
0.6%
김태형 3
 
0.6%
Other values (294) 431
91.3%
2023-12-13T04:36:43.515237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
 
7.3%
79
 
5.4%
73
 
5.0%
49
 
3.3%
44
 
3.0%
30
 
2.0%
29
 
2.0%
28
 
1.9%
27
 
1.8%
26
 
1.8%
Other values (148) 980
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1458
99.0%
Other Punctuation 15
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
7.4%
79
 
5.4%
73
 
5.0%
49
 
3.4%
44
 
3.0%
30
 
2.1%
29
 
2.0%
28
 
1.9%
27
 
1.9%
26
 
1.8%
Other values (146) 965
66.2%
Other Punctuation
ValueCountFrequency (%)
, 11
73.3%
4
 
26.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1458
99.0%
Common 15
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
7.4%
79
 
5.4%
73
 
5.0%
49
 
3.4%
44
 
3.0%
30
 
2.1%
29
 
2.0%
28
 
1.9%
27
 
1.9%
26
 
1.8%
Other values (146) 965
66.2%
Common
ValueCountFrequency (%)
, 11
73.3%
4
 
26.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1458
99.0%
ASCII 11
 
0.7%
None 4
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
108
 
7.4%
79
 
5.4%
73
 
5.0%
49
 
3.4%
44
 
3.0%
30
 
2.1%
29
 
2.0%
28
 
1.9%
27
 
1.9%
26
 
1.8%
Other values (146) 965
66.2%
ASCII
ValueCountFrequency (%)
, 11
100.0%
None
ValueCountFrequency (%)
4
100.0%

업종
Categorical

Distinct22
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
철근ㆍ콘크리트공사업
90 
토공사업
58 
상ㆍ하수도설비공사업
48 
금속구조물ㆍ창호ㆍ온실공사업
38 
난방시공업 제2종
37 
Other values (17)
201 

Length

Max length14
Median length11
Mean length8.8072034
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row상ㆍ하수도설비공사업
2nd row철근ㆍ콘크리트공사업
3rd row상ㆍ하수도설비공사업
4th row철근ㆍ콘크리트공사업
5th row상ㆍ하수도설비공사업

Common Values

ValueCountFrequency (%)
철근ㆍ콘크리트공사업 90
19.1%
토공사업 58
12.3%
상ㆍ하수도설비공사업 48
10.2%
금속구조물ㆍ창호ㆍ온실공사업 38
8.1%
난방시공업 제2종 37
7.8%
시설물유지관리업 33
 
7.0%
조경식재공사업 23
 
4.9%
가스시설시공업 제2종 22
 
4.7%
포장공사업 18
 
3.8%
보링ㆍ그라우팅공사업 18
 
3.8%
Other values (12) 87
18.4%

Length

2023-12-13T04:36:43.695745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
철근ㆍ콘크리트공사업 90
16.5%
제2종 59
10.8%
토공사업 58
10.6%
상ㆍ하수도설비공사업 48
8.8%
난방시공업 40
 
7.3%
금속구조물ㆍ창호ㆍ온실공사업 38
 
7.0%
가스시설시공업 34
 
6.2%
시설물유지관리업 33
 
6.0%
조경식재공사업 23
 
4.2%
포장공사업 18
 
3.3%
Other values (12) 105
19.2%
Distinct258
Distinct (%)54.7%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-13T04:36:43.992391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length27
Mean length18.576271
Min length14

Characters and Unicode

Total characters8768
Distinct characters172
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique153 ?
Unique (%)32.4%

Sample

1st row 영천시 장천3길 28-14 (조교동)
2nd row 영천시 장천3길 28-14 (조교동)
3rd row 영천시 문외2길 49 ,2층 (문외동)
4th row 영천시 문외2길 49 ,2층 (문외동)
5th row 영천시 교창길 12 (창구동)
ValueCountFrequency (%)
영천시 472
24.4%
문외동 57
 
2.9%
금호읍 48
 
2.5%
야사동 45
 
2.3%
최무선로 40
 
2.1%
금호로 30
 
1.6%
완산동 28
 
1.4%
중앙동2길 26
 
1.3%
청통면 24
 
1.2%
성내동 23
 
1.2%
Other values (362) 1140
59.0%
2023-12-13T04:36:44.527867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1926
22.0%
508
 
5.8%
503
 
5.7%
492
 
5.6%
1 374
 
4.3%
370
 
4.2%
( 323
 
3.7%
) 323
 
3.7%
249
 
2.8%
2 238
 
2.7%
Other values (162) 3462
39.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4514
51.5%
Space Separator 1926
22.0%
Decimal Number 1520
 
17.3%
Open Punctuation 323
 
3.7%
Close Punctuation 323
 
3.7%
Dash Punctuation 127
 
1.4%
Other Punctuation 34
 
0.4%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
508
 
11.3%
503
 
11.1%
492
 
10.9%
370
 
8.2%
249
 
5.5%
222
 
4.9%
126
 
2.8%
105
 
2.3%
98
 
2.2%
94
 
2.1%
Other values (146) 1747
38.7%
Decimal Number
ValueCountFrequency (%)
1 374
24.6%
2 238
15.7%
4 156
10.3%
3 144
 
9.5%
5 117
 
7.7%
9 103
 
6.8%
0 100
 
6.6%
7 99
 
6.5%
8 95
 
6.2%
6 94
 
6.2%
Space Separator
ValueCountFrequency (%)
1926
100.0%
Open Punctuation
ValueCountFrequency (%)
( 323
100.0%
Close Punctuation
ValueCountFrequency (%)
) 323
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 127
100.0%
Other Punctuation
ValueCountFrequency (%)
, 34
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4514
51.5%
Common 4253
48.5%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
508
 
11.3%
503
 
11.1%
492
 
10.9%
370
 
8.2%
249
 
5.5%
222
 
4.9%
126
 
2.8%
105
 
2.3%
98
 
2.2%
94
 
2.1%
Other values (146) 1747
38.7%
Common
ValueCountFrequency (%)
1926
45.3%
1 374
 
8.8%
( 323
 
7.6%
) 323
 
7.6%
2 238
 
5.6%
4 156
 
3.7%
3 144
 
3.4%
- 127
 
3.0%
5 117
 
2.8%
9 103
 
2.4%
Other values (5) 422
 
9.9%
Latin
ValueCountFrequency (%)
C 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4514
51.5%
ASCII 4254
48.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1926
45.3%
1 374
 
8.8%
( 323
 
7.6%
) 323
 
7.6%
2 238
 
5.6%
4 156
 
3.7%
3 144
 
3.4%
- 127
 
3.0%
5 117
 
2.8%
9 103
 
2.4%
Other values (6) 423
 
9.9%
Hangul
ValueCountFrequency (%)
508
 
11.3%
503
 
11.1%
492
 
10.9%
370
 
8.2%
249
 
5.5%
222
 
4.9%
126
 
2.8%
105
 
2.3%
98
 
2.2%
94
 
2.1%
Other values (146) 1747
38.7%
Distinct301
Distinct (%)63.8%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-13T04:36:44.837555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.023305
Min length12

Characters and Unicode

Total characters5675
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)38.1%

Sample

1st row054-332-3305
2nd row054-332-3305
3rd row054-333-0645
4th row054-333-0645
5th row054-333-5674
ValueCountFrequency (%)
054-332-3689 6
 
1.3%
054-927-1718 6
 
1.3%
054-333-0464 5
 
1.1%
054-338-2345 4
 
0.8%
054-338-8898 4
 
0.8%
054-335-0456 4
 
0.8%
054-338-0085 4
 
0.8%
054-331-4480 3
 
0.6%
054-338-3006 3
 
0.6%
054-336-5570 3
 
0.6%
Other values (291) 430
91.1%
2023-12-13T04:36:45.293182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 1123
19.8%
- 944
16.6%
0 799
14.1%
5 696
12.3%
4 677
11.9%
1 304
 
5.4%
8 286
 
5.0%
7 261
 
4.6%
6 232
 
4.1%
2 214
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4731
83.4%
Dash Punctuation 944
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1123
23.7%
0 799
16.9%
5 696
14.7%
4 677
14.3%
1 304
 
6.4%
8 286
 
6.0%
7 261
 
5.5%
6 232
 
4.9%
2 214
 
4.5%
9 139
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 944
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5675
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1123
19.8%
- 944
16.6%
0 799
14.1%
5 696
12.3%
4 677
11.9%
1 304
 
5.4%
8 286
 
5.0%
7 261
 
4.6%
6 232
 
4.1%
2 214
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5675
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1123
19.8%
- 944
16.6%
0 799
14.1%
5 696
12.3%
4 677
11.9%
1 304
 
5.4%
8 286
 
5.0%
7 261
 
4.6%
6 232
 
4.1%
2 214
 
3.8%

Interactions

2023-12-13T04:36:41.249201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:36:45.433687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종
번호1.0000.349
업종0.3491.000
2023-12-13T04:36:45.547800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종
번호1.0000.134
업종0.1341.000

Missing values

2023-12-13T04:36:41.375797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:36:41.514652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업체명대표자업종도로명주소전화번호
01(유)기남건설김기출상ㆍ하수도설비공사업영천시 장천3길 28-14 (조교동)054-332-3305
12(유)기남건설김기출철근ㆍ콘크리트공사업영천시 장천3길 28-14 (조교동)054-332-3305
23(주)건창임호규상ㆍ하수도설비공사업영천시 문외2길 49 ,2층 (문외동)054-333-0645
34(주)건창임호규철근ㆍ콘크리트공사업영천시 문외2길 49 ,2층 (문외동)054-333-0645
45(주)건화건설조만환상ㆍ하수도설비공사업영천시 교창길 12 (창구동)054-333-5674
56(주)건화건설조만환철근ㆍ콘크리트공사업영천시 교창길 12 (창구동)054-333-5674
67(주)경농이엔씨장재욱기계설비공사업영천시 청통면 금송로 857-66054-338-9200
78(주)경림건설서예림시설물유지관리업영천시 범어1길 25 ,1층 101호 (작산동)054-336-8841
89(주)경북건설김용순도장공사업영천시 영화로 326 (조교동)054-335-2205
910(주)경북건설김용순금속구조물ㆍ창호ㆍ온실공사업영천시 영화로 326 (조교동)054-335-2205
번호업체명대표자업종도로명주소전화번호
462463혜인산업개발주식회사이갑숙석공사업영천시 고경면 호국로 811054-338-0350
463464혜인산업개발주식회사이갑숙철근ㆍ콘크리트공사업영천시 고경면 호국로 811054-338-0350
464465혜인산업개발주식회사이갑숙토공사업영천시 고경면 호국로 811054-338-0350
465466호황건설(주)최건재철근ㆍ콘크리트공사업영천시 안야사2길 7 (야사동)054-332-6081
466467호황건설(주)최건재상ㆍ하수도설비공사업영천시 안야사2길 7 (야사동)054-332-6081
467468홍건설(주)이경홍토공사업영천시 최무선로 204 (성내동)0707-624-2148
468469홍성건설(주)이경홍포장공사업영천시 최무선로 204 (성내동)0707-623-2148
469470화산가스홍재순가스시설시공업 제2종영천시 화산면 가일길 43054-335-0027
470471화산설비조분석가스시설시공업 제2종영천시 화산면 장수로 923054-335-7847
471472화산설비조분석난방시공업 제2종영천시 화산면 장수로 923054-335-7847