Overview

Dataset statistics

Number of variables: 6
Number of observations: 294
Missing cells: 4
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 14.5 KiB
Average record size in memory: 50.4 B
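
The percentages in this report can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is the number of missing cells divided by variables times observations, and that a per-variable percentage divides by the number of observations. That reading is consistent with the figures shown, but it is an assumption about how the tool computes them.

```python
# Counts copied from this report; the formulas are an assumed reading
# of how the percentages are derived.
n_variables = 6
n_observations = 294
missing_cells = 4
missing_postcode = 3  # missing values reported for 우편번호

total_cells = n_variables * n_observations
missing_cells_pct = 100 * missing_cells / total_cells
missing_postcode_pct = 100 * missing_postcode / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_cells_pct:.1f}%")
print(f"우편번호 missing: {missing_postcode_pct:.1f}%")
```

Rounded to one decimal place, the two results agree with the 0.2% and 1.0% reported here.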

Variable types

Numeric: 2
Text: 3
Categorical: 1

Dataset

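Description: Data on the status of specialty construction businesses (전문건설업) in Yeongju-si, Gyeongsangbuk-do. The fields include company name, address, telephone number, postal code, and business type.
Author: Yeongju-si, Gyeongsangbuk-do (경상북도 영주시)
URL: https://www.data.go.kr/data/15084369/fileData.do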

Alerts

우편번호 has 3 (1.0%) missing values (Missing)
번호 has unique values (Unique)
업체명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 06:55:10.853778
Analysis finished: 2023-12-12 06:55:11.823812
Duration: 0.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 294
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 147.5
Minimum: 1
Maximum: 294
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.65
Q1: 74.25
Median: 147.5
Q3: 220.75
95-th percentile: 279.35
Maximum: 294
Range: 293
Interquartile range (IQR): 146.5

Descriptive statistics

Standard deviation: 85.014705
Coefficient of variation (CV): 0.57637088
Kurtosis: -1.2
Mean: 147.5
Median Absolute Deviation (MAD): 73.5
Skewness: 0
Sum: 43365
Variance: 7227.5
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
186 1
 
0.3%
202 1
 
0.3%
201 1
 
0.3%
200 1
 
0.3%
199 1
 
0.3%
198 1
 
0.3%
197 1
 
0.3%
196 1
 
0.3%
195 1
 
0.3%
Other values (284) 284
96.6%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
294 1
0.3%
293 1
0.3%
292 1
0.3%
291 1
0.3%
290 1
0.3%
289 1
0.3%
288 1
0.3%
287 1
0.3%
286 1
0.3%
285 1
0.3%
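
Because 번호 is a strictly increasing, fully distinct column running from 1 to 294, its summary statistics can be reproduced from the sequence alone. The sketch below uses Python's standard `statistics` module. It assumes the report uses the sample (n - 1) variance and linearly interpolated quantiles, which is what the reported figures are consistent with.

```python
import statistics

# 번호 takes every integer from 1 to 294 exactly once.
values = list(range(1, 295))

total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation
# Median absolute deviation: median of distances from the median.
mad = statistics.median(abs(v - median) for v in values)
# "inclusive" interpolates linearly between the min and max of the data.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(total, mean, median, variance)
print(round(std_dev, 6), round(cv, 8), mad)
print(q1, q3, q3 - q1)
```

Under these assumptions the script yields the same sum, mean, variance, MAD, quartiles, and IQR as listed for 번호.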

업체명
Text

UNIQUE 

Distinct: 294
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 17
Median length: 13
Mean length: 7.0136054
Min length: 4

Characters and Unicode

Total characters: 2062
Distinct characters: 200
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
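
As an illustration of such character properties, the sketch below counts Unicode general categories for the five sample 업체명 values listed in this report. It uses Python's standard `unicodedata` module as a stand-in; the profiling tool may rely on a different library, and the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

# The five sample values shown for 업체명 in this report.
names = ["(유)대유건설", "(주)강경건설", "(주)경북건설", "(주)경북석재", "(주)경진건설"]


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(names)
for category, count in counts.most_common():
    print(category, count)
```

Here `Lo` corresponds to "Other Letter" (the Hangul syllables), `Ps` to "Open Punctuation", and `Pe` to "Close Punctuation". For these five names the counts are 25, 5, and 5 respectively.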

Unique

Unique: 294
Unique (%): 100.0%

Sample

1st row: (유)대유건설
2nd row: (주)강경건설
3rd row: (주)경북건설
4th row: (주)경북석재
5th row: (주)경진건설
ValueCountFrequency (%)
유)대유건설 1
 
0.3%
신아종합판매주식회사 1
 
0.3%
영주지하수개발주식회사 1
 
0.3%
영주주택공사 1
 
0.3%
영주조경(주 1
 
0.3%
영주시산림조합 1
 
0.3%
영주수도공사(주 1
 
0.3%
영주설비 1
 
0.3%
영주도시가스산업 1
 
0.3%
영문가스설비 1
 
0.3%
Other values (284) 284
96.6%

Most occurring characters

ValueCountFrequency (%)
208
 
10.1%
176
 
8.5%
( 137
 
6.6%
) 137
 
6.6%
130
 
6.3%
99
 
4.8%
76
 
3.7%
64
 
3.1%
61
 
3.0%
41
 
2.0%
Other values (190) 933
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1786
86.6%
Open Punctuation 137
 
6.6%
Close Punctuation 137
 
6.6%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
208
 
11.6%
176
 
9.9%
130
 
7.3%
99
 
5.5%
76
 
4.3%
64
 
3.6%
61
 
3.4%
41
 
2.3%
37
 
2.1%
35
 
2.0%
Other values (186) 859
48.1%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 137
100.0%
Close Punctuation
ValueCountFrequency (%)
) 137
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1786
86.6%
Common 276
 
13.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
208
 
11.6%
176
 
9.9%
130
 
7.3%
99
 
5.5%
76
 
4.3%
64
 
3.6%
61
 
3.4%
41
 
2.3%
37
 
2.1%
35
 
2.0%
Other values (186) 859
48.1%
Common
ValueCountFrequency (%)
( 137
49.6%
) 137
49.6%
2 1
 
0.4%
1 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1786
86.6%
ASCII 276
 
13.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
208
 
11.6%
176
 
9.9%
130
 
7.3%
99
 
5.5%
76
 
4.3%
64
 
3.6%
61
 
3.4%
41
 
2.3%
37
 
2.1%
35
 
2.0%
Other values (186) 859
48.1%
ASCII
ValueCountFrequency (%)
( 137
49.6%
) 137
49.6%
2 1
 
0.4%
1 1
 
0.4%

업종
Categorical

Distinct: 21
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB
철근ㆍ콘크리트공사업: 53
토공사업: 44
난방시공업 제2종: 34
조경식재공사업: 22
상ㆍ하수도설비공사업: 19
Other values (16): 122

Length

Max length: 14
Median length: 11
Mean length: 8.2687075
Min length: 4

Unique

Unique: 2
Unique (%): 0.7%

Sample

1st row: 토공사업
2nd row: 상ㆍ하수도설비공사업
3rd row: 철근ㆍ콘크리트공사업
4th row: 토공사업
5th row: 상ㆍ하수도설비공사업

Common Values

Value | Count | Frequency (%)
철근ㆍ콘크리트공사업 | 53 | 18.0%
토공사업 | 44 | 15.0%
난방시공업 제2종 | 34 | 11.6%
조경식재공사업 | 22 | 7.5%
상ㆍ하수도설비공사업 | 19 | 6.5%
석공사업 | 14 | 4.8%
가스시설시공업 제2종 | 14 | 4.8%
금속구조물ㆍ창호ㆍ온실공사업 | 14 | 4.8%
시설물유지관리업 | 14 | 4.8%
도장공사업 | 11 | 3.7%
Other values (11) | 55 | 18.7%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
철근ㆍ콘크리트공사업 53
14.8%
제2종 48
13.4%
토공사업 44
12.3%
난방시공업 38
10.6%
가스시설시공업 26
 
7.3%
조경식재공사업 22
 
6.1%
상ㆍ하수도설비공사업 19
 
5.3%
석공사업 14
 
3.9%
금속구조물ㆍ창호ㆍ온실공사업 14
 
3.9%
시설물유지관리업 14
 
3.9%
Other values (11) 66
18.4%

우편번호
Real number (ℝ)

MISSING 

Distinct: 88
Distinct (%): 30.2%
Missing: 3
Missing (%): 1.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36103.014
Minimum: 36000
Maximum: 36172
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 36000
5-th percentile: 36038.5
Q1: 36072
Median: 36100
Q3: 36136
95-th percentile: 36167
Maximum: 36172
Range: 172
Interquartile range (IQR): 64

Descriptive statistics

Standard deviation: 39.567835
Coefficient of variation (CV): 0.0010959704
Kurtosis: -0.85281013
Mean: 36103.014
Median Absolute Deviation (MAD): 35
Skewness: -0.11113125
Sum: 10505977
Variance: 1565.6136
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36136 22
 
7.5%
36145 15
 
5.1%
36057 12
 
4.1%
36167 10
 
3.4%
36099 9
 
3.1%
36077 8
 
2.7%
36065 8
 
2.7%
36088 8
 
2.7%
36101 7
 
2.4%
36064 7
 
2.4%
Other values (78) 185
62.9%
ValueCountFrequency (%)
36000 1
 
0.3%
36003 1
 
0.3%
36016 1
 
0.3%
36023 1
 
0.3%
36025 1
 
0.3%
36026 2
0.7%
36028 1
 
0.3%
36029 4
1.4%
36034 1
 
0.3%
36035 1
 
0.3%
ValueCountFrequency (%)
36172 2
 
0.7%
36171 2
 
0.7%
36170 3
 
1.0%
36167 10
3.4%
36166 1
 
0.3%
36165 2
 
0.7%
36163 1
 
0.3%
36162 2
 
0.7%
36161 2
 
0.7%
36160 2
 
0.7%

도로명주소
Text

Distinct: 170
Distinct (%): 57.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 48
Median length: 40
Mean length: 25.418367
Min length: 19

Characters and Unicode

Total characters: 7473
Distinct characters: 137
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 81
Unique (%): 27.6%

Sample

1st row: 경상북도 영주시 지천로116번길 9 (휴천동)
2nd row: 경상북도 영주시 지천로116번길 9 (휴천동)
3rd row: 경상북도 영주시 지천로116번길 9 (휴천동)
4th row: 경상북도 영주시 대학로284번길 9 (가흥동)
5th row: 경상북도 영주시 대학로284번길 9 (가흥동)
ValueCountFrequency (%)
경상북도 294
18.8%
영주시 294
18.8%
휴천동 68
 
4.3%
가흥동 67
 
4.3%
하망동 37
 
2.4%
영주동 30
 
1.9%
선비로 25
 
1.6%
2층 25
 
1.6%
원당로 21
 
1.3%
장수면 20
 
1.3%
Other values (266) 686
43.8%

Most occurring characters

ValueCountFrequency (%)
1273
 
17.0%
376
 
5.0%
364
 
4.9%
325
 
4.3%
294
 
3.9%
294
 
3.9%
294
 
3.9%
294
 
3.9%
291
 
3.9%
268
 
3.6%
Other values (127) 3400
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4282
57.3%
Space Separator 1273
 
17.0%
Decimal Number 1254
 
16.8%
Close Punctuation 249
 
3.3%
Open Punctuation 249
 
3.3%
Dash Punctuation 97
 
1.3%
Other Punctuation 67
 
0.9%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
 
8.8%
364
 
8.5%
325
 
7.6%
294
 
6.9%
294
 
6.9%
294
 
6.9%
294
 
6.9%
291
 
6.8%
268
 
6.3%
139
 
3.2%
Other values (110) 1343
31.4%
Decimal Number
ValueCountFrequency (%)
1 261
20.8%
2 244
19.5%
3 129
10.3%
4 117
9.3%
0 114
9.1%
6 95
 
7.6%
8 80
 
6.4%
7 79
 
6.3%
5 73
 
5.8%
9 62
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 46
68.7%
21
31.3%
Space Separator
ValueCountFrequency (%)
1273
100.0%
Close Punctuation
ValueCountFrequency (%)
) 249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 249
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 97
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4282
57.3%
Common 3189
42.7%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
 
8.8%
364
 
8.5%
325
 
7.6%
294
 
6.9%
294
 
6.9%
294
 
6.9%
294
 
6.9%
291
 
6.8%
268
 
6.3%
139
 
3.2%
Other values (110) 1343
31.4%
Common
ValueCountFrequency (%)
1273
39.9%
1 261
 
8.2%
) 249
 
7.8%
( 249
 
7.8%
2 244
 
7.7%
3 129
 
4.0%
4 117
 
3.7%
0 114
 
3.6%
- 97
 
3.0%
6 95
 
3.0%
Other values (6) 361
 
11.3%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4282
57.3%
ASCII 3170
42.4%
None 21
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1273
40.2%
1 261
 
8.2%
) 249
 
7.9%
( 249
 
7.9%
2 244
 
7.7%
3 129
 
4.1%
4 117
 
3.7%
0 114
 
3.6%
- 97
 
3.1%
6 95
 
3.0%
Other values (6) 342
 
10.8%
Hangul
ValueCountFrequency (%)
376
 
8.8%
364
 
8.5%
325
 
7.6%
294
 
6.9%
294
 
6.9%
294
 
6.9%
294
 
6.9%
291
 
6.8%
268
 
6.3%
139
 
3.2%
Other values (110) 1343
31.4%
None
ValueCountFrequency (%)
21
100.0%

전화번호
Text

Distinct: 168
Distinct (%): 57.3%
Missing: 1
Missing (%): 0.3%
Memory size: 2.4 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.003413
Min length: 11

Characters and Unicode

Total characters: 3517
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 77
Unique (%): 26.3%

Sample

1st row: 054-638-0225
2nd row: 054-638-0225
3rd row: 054-638-0225
4th row: 054-638-9933
5th row: 054-638-9933
ValueCountFrequency (%)
054-632-2200 6
 
2.0%
054-633-8002 5
 
1.7%
054-637-9927 5
 
1.7%
054-636-8645 4
 
1.4%
054-632-7042 4
 
1.4%
054-635-5577 4
 
1.4%
054-635-0586 4
 
1.4%
054-638-9330 3
 
1.0%
054-638-0225 3
 
1.0%
054-637-8204 3
 
1.0%
Other values (158) 252
86.0%

Most occurring characters

ValueCountFrequency (%)
- 586
16.7%
0 505
14.4%
5 458
13.0%
4 451
12.8%
3 436
12.4%
6 415
11.8%
7 184
 
5.2%
2 144
 
4.1%
1 126
 
3.6%
8 121
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2931
83.3%
Dash Punctuation 586
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 505
17.2%
5 458
15.6%
4 451
15.4%
3 436
14.9%
6 415
14.2%
7 184
 
6.3%
2 144
 
4.9%
1 126
 
4.3%
8 121
 
4.1%
9 91
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 586
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3517
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 586
16.7%
0 505
14.4%
5 458
13.0%
4 451
12.8%
3 436
12.4%
6 415
11.8%
7 184
 
5.2%
2 144
 
4.1%
1 126
 
3.6%
8 121
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3517
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 586
16.7%
0 505
14.4%
5 458
13.0%
4 451
12.8%
3 436
12.4%
6 415
11.8%
7 184
 
5.2%
2 144
 
4.1%
1 126
 
3.6%
8 121
 
3.4%

Interactions

[Four interaction plots; images not reproduced here]

Correlations

Variable | 번호 | 업종 | 우편번호
번호 | 1.000 | 0.489 | 0.506
업종 | 0.489 | 1.000 | 0.130
우편번호 | 0.506 | 0.130 | 1.000

Variable | 번호 | 우편번호 | 업종
번호 | 1.000 | -0.134 | 0.198
우편번호 | -0.134 | 1.000 | 0.026
업종 | 0.198 | 0.026 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
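
One common way to compute a nullity correlation is to turn each column into a 0/1 "value present" indicator and take the Pearson correlation of the indicators. The sketch below illustrates that idea only. The two toy columns are hypothetical and not taken from this dataset, the `pearson` helper is written here for illustration, and the report's own computation may differ.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical toy columns (NOT from the dataset); None marks a missing cell.
col_a = [1, None, 3, None, 5, 6]
col_b = ["u", None, "w", "x", "y", "z"]

# 1 where a value is present, 0 where it is missing.
present_a = [0 if v is None else 1 for v in col_a]
present_b = [0 if v is None else 1 for v in col_b]

missing_a = present_a.count(0)
missing_b = present_b.count(0)
nullity_corr = pearson(present_a, present_b)

print(missing_a, missing_b, round(nullity_corr, 3))
```

A value near 1 means the two columns tend to be missing in the same rows, and a value near 0 means their missingness is unrelated. The indicator correlation is undefined for a column with no missing values, since its indicator has zero variance.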

Sample

(index) | 번호 | 업체명 | 업종 | 우편번호 | 도로명주소 | 전화번호
0 | 1 | (유)대유건설 | 토공사업 | 36121 | 경상북도 영주시 지천로116번길 9 (휴천동) | 054-638-0225
1 | 2 | (주)강경건설 | 상ㆍ하수도설비공사업 | 36121 | 경상북도 영주시 지천로116번길 9 (휴천동) | 054-638-0225
2 | 3 | (주)경북건설 | 철근ㆍ콘크리트공사업 | 36121 | 경상북도 영주시 지천로116번길 9 (휴천동) | 054-638-0225
3 | 4 | (주)경북석재 | 토공사업 | 36136 | 경상북도 영주시 대학로284번길 9 (가흥동) | 054-638-9933
4 | 5 | (주)경진건설 | 상ㆍ하수도설비공사업 | 36136 | 경상북도 영주시 대학로284번길 9 (가흥동) | 054-638-9933
5 | 6 | (주)구주건설 | 토공사업 | 36075 | 경상북도 영주시 영주로 232-5 세화빌딩 (하망동) | 054-633-7911
6 | 7 | (주)금강산업 | 조경시설물설치공사업 | 36075 | 경상북도 영주시 영주로 232-5 세화빌딩 (하망동) | 054-633-7911
7 | 8 | (주)금송조경건설 | 조경식재공사업 | 36075 | 경상북도 영주시 영주로 232-5 세화빌딩 (하망동) | 054-633-7911
8 | 9 | (주)남광건설 | 석공사업 | 36144 | 경상북도 영주시 장수면 용주로 211 | 054-637-5807
9 | 10 | (주)대능산업개발 | 토공사업 | 36144 | 경상북도 영주시 장수면 용주로 211 | 054-637-5807

(index) | 번호 | 업체명 | 업종 | 우편번호 | 도로명주소 | 전화번호
284 | 285 | 해송기술건설주식회사 | 조경식재공사업 | 36065 | 경상북도 영주시 봉화로68번길 43 (상망동) | 054-635-5609
285 | 286 | 행복가스설비공사 | 조경식재공사업 | 36065 | 경상북도 영주시 봉화로68번길 43 (상망동) | 054-635-3533
286 | 287 | 현대그린산업주식회사 | 토공사업 | 36065 | 경상북도 영주시 봉화로68번길 43 (상망동) | 054-635-3533
287 | 288 | 현대설비공사 | 석공사업 | 36065 | 경상북도 영주시 봉화로68번길 43 (상망동) | 054-635-3533
288 | 289 | 현대조경(주) | 토공사업 | 36065 | 경상북도 영주시 원당로 348 (상망동) | 054-637-9077
289 | 290 | 현대환경산업주식회사 | 철근ㆍ콘크리트공사업 | 36065 | 경상북도 영주시 원당로 348 (상망동) | 054-637-9077
290 | 291 | 현산건설(주) | 철근ㆍ콘크리트공사업 | 36064 | 경상북도 영주시 원당로315번길 29 (상망동) | 054-631-0982
291 | 292 | 혜인건축설비 | 가스시설시공업 제2종 | 36016 | 경상북도 영주시 순흥면 순흥로 51 | 054-633-2563
292 | 293 | 홍덕건설(주) | 철근ㆍ콘크리트공사업 | 36044 | 경상북도 영주시 봉현면 오현로 42 | 054-635-3120
293 | 294 | 화성건축설비 | 석공사업 | 36044 | 경상북도 영주시 봉현면 오현로 42 | 054-635-3120