Overview

Dataset statistics

Number of variables: 5
Number of observations: 193
Missing cells: 14
Missing cells (%): 1.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.9 KiB
Average record size in memory: 41.7 B
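
The percentages above follow from the raw counts. The sketch below reproduces the arithmetic, assuming the missing-cell percentage is taken over every cell (rows × columns) and that all 14 missing cells sit in a single column, as the Alerts section reports for 전화번호. The variable names are illustrative only.

```python
# Counts copied from the dataset statistics in this report.
n_variables = 5
n_observations = 193
missing_cells = 14

# Assumption: the overall percentage is computed over all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Share of missing values within the single affected column.
column_missing_pct = 100 * missing_cells / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells overall: {missing_pct:.1f}%")
print(f"missing within one column: {column_missing_pct:.1f}%")
```

The two results line up with the 1.5% shown here and the 7.3% shown in the 전화번호 alert.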

Variable types

Numeric: 1
Text: 3
DateTime: 1

Dataset

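Description: Provides data on construction businesses located in Yeongdeok-gun, Gyeongsangbuk-do, including company name, address, telephone number, and representative.
Author: 공공데이터포털 (Public Data Portal)
URL: https://www.data.go.kr/data/15089493/fileData.do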

Alerts

데이터기준일자 has constant value "" (Constant)
전화번호 has 14 (7.3%) missing values (Missing)
번호 has unique values (Unique)
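
The three alerts can be derived from the per-column distinct and missing counts reported further down. The sketch below uses simplified rules that are an assumption for illustration, not the profiler's actual logic or thresholds. The assignment of the 190 and 153 distinct counts to 건설업체명 and 소재지 is inferred from the sample values in those sections.

```python
# Per-column summary copied from this report: (distinct count, missing count).
N_ROWS = 193  # number of observations
columns = {
    "번호": (193, 0),
    "건설업체명": (190, 0),
    "소재지": (153, 0),
    "전화번호": (164, 14),
    "데이터기준일자": (1, 0),
}

def alerts_for(distinct, missing, n_rows):
    """Return alert labels under simplified, assumed rules."""
    labels = []
    if distinct == 1:
        labels.append("Constant")
    if missing > 0:
        labels.append("Missing")
    if missing == 0 and distinct == n_rows:
        labels.append("Unique")
    return labels

alerts = {name: alerts_for(d, m, N_ROWS) for name, (d, m) in columns.items()}
for name, labels in alerts.items():
    if labels:
        print(name, labels)
```

Under these rules only the three columns listed above produce an alert.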

Reproduction

Analysis started: 2024-04-21 11:29:48.433350
Analysis finished: 2024-04-21 11:29:49.542009
Duration: 1.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 193
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 97
Minimum: 1
Maximum: 193
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10.6
Q1: 49
Median: 97
Q3: 145
95-th percentile: 183.4
Maximum: 193
Range: 192
Interquartile range (IQR): 96
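
The fractional percentiles (10.6 and 183.4) are consistent with linear interpolation between neighbouring values. The sketch below assumes that method and assumes 번호 is exactly the sequence 1 to 193, which the Distinct, Minimum, Maximum and Sum values suggest. The `percentile` helper is hypothetical and not part of any library.

```python
def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in [0, 100]) of a sorted list."""
    pos = (len(sorted_values) - 1) * q / 100
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

# Assumption: 번호 runs from 1 to 193 in steps of 1.
values = list(range(1, 194))

p5 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(round(p5, 1), q1, median, q3, round(p95, 1), iqr)
```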

Descriptive statistics

Standard deviation: 55.858452
Coefficient of variation (CV): 0.57586033
Kurtosis: -1.2
Mean: 97
Median Absolute Deviation (MAD): 48
Skewness: 0
Sum: 18721
Variance: 3120.1667
Monotonicity: Strictly increasing
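
Several of these figures can be checked with the standard library alone. The sketch below again assumes 번호 is the sequence 1 to 193 and uses the sample (n − 1) variance, which matches the reported value. Kurtosis and skewness are left out.

```python
import statistics

# Assumption: 번호 runs from 1 to 193 in steps of 1.
values = list(range(1, 194))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median of the distances from the median.
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])

print(total, mean, round(variance, 4), round(std, 6), round(cv, 8), mad)
```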
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.5%
146 | 1 | 0.5%
124 | 1 | 0.5%
125 | 1 | 0.5%
126 | 1 | 0.5%
127 | 1 | 0.5%
128 | 1 | 0.5%
129 | 1 | 0.5%
130 | 1 | 0.5%
131 | 1 | 0.5%
Other values (183) | 183 | 94.8%

Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Value | Count | Frequency (%)
193 | 1 | 0.5%
192 | 1 | 0.5%
191 | 1 | 0.5%
190 | 1 | 0.5%
189 | 1 | 0.5%
188 | 1 | 0.5%
187 | 1 | 0.5%
186 | 1 | 0.5%
185 | 1 | 0.5%
184 | 1 | 0.5%

건설업체명
Text

Distinct: 190
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 12
Median length: 10
Mean length: 6.642487
Min length: 2

Characters and Unicode

Total characters: 1282
Distinct characters: 150
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
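
As a rough illustration of the category counts in this section, the sketch below tallies Unicode general categories for the five sample names with Python's `unicodedata` module. The mapping from two-letter codes to the long names used in this report is written out by hand, and the parentheses are assumed to be plain ASCII, as the block breakdown below indicates.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable.
names = ["(주)경도조경", "(주)경동건설", "(주)경록건설", "(주)경신건설", "(주)국보산업"]

# Hand-written mapping from general category codes to the report's labels.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

# Count the general category of every character in every name.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the full column as well.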

Unique

Unique: 187
Unique (%): 96.9%

Sample

1st row: (주)경도조경
2nd row: (주)경동건설
3rd row: (주)경록건설
4th row: (주)경신건설
5th row: (주)국보산업

Value | Count | Frequency (%)
귀뚜라미보일러 | 2 | 1.0%
영광건설 | 2 | 1.0%
신흥개발(주 | 2 | 1.0%
현대가스 | 1 | 0.5%
성진설비공사 | 1 | 0.5%
삼성이엔씨(주 | 1 | 0.5%
삼양설비 | 1 | 0.5%
영진종합설비 | 1 | 0.5%
주)경도조경 | 1 | 0.5%
신영건설주식회사 | 1 | 0.5%
Other values (180) | 180 | 93.3%

Most occurring characters

ValueCountFrequency (%)
143
 
11.2%
) 122
 
9.5%
( 121
 
9.4%
104
 
8.1%
93
 
7.3%
26
 
2.0%
25
 
2.0%
24
 
1.9%
23
 
1.8%
21
 
1.6%
Other values (140) 580
45.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1031 | 80.4%
Close Punctuation | 122 | 9.5%
Open Punctuation | 121 | 9.4%
Uppercase Letter | 6 | 0.5%
Decimal Number | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
 
13.9%
104
 
10.1%
93
 
9.0%
26
 
2.5%
25
 
2.4%
24
 
2.3%
23
 
2.2%
21
 
2.0%
21
 
2.0%
21
 
2.0%
Other values (133) 530
51.4%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
N 2
33.3%
E 2
33.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 122
100.0%
Open Punctuation
ValueCountFrequency (%)
( 121
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1031 | 80.4%
Common | 245 | 19.1%
Latin | 6 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
143
 
13.9%
104
 
10.1%
93
 
9.0%
26
 
2.5%
25
 
2.4%
24
 
2.3%
23
 
2.2%
21
 
2.0%
21
 
2.0%
21
 
2.0%
Other values (133) 530
51.4%
Common
ValueCountFrequency (%)
) 122
49.8%
( 121
49.4%
2 1
 
0.4%
4 1
 
0.4%
Latin
ValueCountFrequency (%)
G 2
33.3%
N 2
33.3%
E 2
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1031 | 80.4%
ASCII | 251 | 19.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
143
 
13.9%
104
 
10.1%
93
 
9.0%
26
 
2.5%
25
 
2.4%
24
 
2.3%
23
 
2.2%
21
 
2.0%
21
 
2.0%
21
 
2.0%
Other values (133) 530
51.4%
ASCII
ValueCountFrequency (%)
) 122
48.6%
( 121
48.2%
G 2
 
0.8%
N 2
 
0.8%
E 2
 
0.8%
2 1
 
0.4%
4 1
 
0.4%

소재지
Text

Distinct: 153
Distinct (%): 79.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 38
Median length: 30
Mean length: 21.056995
Min length: 18

Characters and Unicode

Total characters: 4064
Distinct characters: 111
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 124
Unique (%): 64.2%

Sample

1st row: 경상북도 영덕군 영해면 영덕로 1831
2nd row: 경상북도 영덕군 영덕읍 중앙길 104-1
3rd row: 경상북도 영덕군 영해면 영덕로 1725
4th row: 경상북도 영덕군 영덕읍 중앙길 104-1
5th row: 경상북도 영덕군 영해면 예주목은길 349

Value | Count | Frequency (%)
영덕군 | 193 | 19.8%
경상북도 | 191 | 19.5%
영덕읍 | 122 | 12.5%
영해면 | 28 | 2.9%
영덕로 | 24 | 2.5%
강구면 | 16 | 1.6%
남정면 | 14 | 1.4%
경동로 | 12 | 1.2%
도매샛길 | 12 | 1.2%
군청길 | 10 | 1.0%
Other values (208) | 355 | 36.3%

Most occurring characters

ValueCountFrequency (%)
784
19.3%
376
 
9.3%
347
 
8.5%
205
 
5.0%
205
 
5.0%
203
 
5.0%
193
 
4.7%
192
 
4.7%
143
 
3.5%
1 140
 
3.4%
Other values (101) 1276
31.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2612 | 64.3%
Space Separator | 784 | 19.3%
Decimal Number | 599 | 14.7%
Dash Punctuation | 66 | 1.6%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
376
14.4%
347
13.3%
205
 
7.8%
205
 
7.8%
203
 
7.8%
193
 
7.4%
192
 
7.4%
143
 
5.5%
124
 
4.7%
71
 
2.7%
Other values (86) 553
21.2%
Decimal Number
ValueCountFrequency (%)
1 140
23.4%
3 79
13.2%
2 76
12.7%
4 55
 
9.2%
7 49
 
8.2%
0 49
 
8.2%
5 44
 
7.3%
8 41
 
6.8%
6 40
 
6.7%
9 26
 
4.3%
Space Separator
ValueCountFrequency (%)
784
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2612 | 64.3%
Common | 1452 | 35.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
376
14.4%
347
13.3%
205
 
7.8%
205
 
7.8%
203
 
7.8%
193
 
7.4%
192
 
7.4%
143
 
5.5%
124
 
4.7%
71
 
2.7%
Other values (86) 553
21.2%
Common
ValueCountFrequency (%)
784
54.0%
1 140
 
9.6%
3 79
 
5.4%
2 76
 
5.2%
- 66
 
4.5%
4 55
 
3.8%
7 49
 
3.4%
0 49
 
3.4%
5 44
 
3.0%
8 41
 
2.8%
Other values (5) 69
 
4.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2612 | 64.3%
ASCII | 1452 | 35.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
784
54.0%
1 140
 
9.6%
3 79
 
5.4%
2 76
 
5.2%
- 66
 
4.5%
4 55
 
3.8%
7 49
 
3.4%
0 49
 
3.4%
5 44
 
3.0%
8 41
 
2.8%
Other values (5) 69
 
4.8%
Hangul
ValueCountFrequency (%)
376
14.4%
347
13.3%
205
 
7.8%
205
 
7.8%
203
 
7.8%
193
 
7.4%
192
 
7.4%
143
 
5.5%
124
 
4.7%
71
 
2.7%
Other values (86) 553
21.2%

전화번호
Text

MISSING 

Distinct: 164
Distinct (%): 91.6%
Missing: 14
Missing (%): 7.3%
Memory size: 1.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12
Min length: 11

Characters and Unicode

Total characters: 2148
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 151
Unique (%): 84.4%

Sample

1st row: 054-734-9115
2nd row: 054-734-4463
3rd row: 054-733-0414
4th row: 054-734-4463
5th row: 054-732-1709
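
The Distinct and Unique counts above measure different things: Distinct is the number of different values, while Unique is the number of values that occur exactly once. The sketch below shows the difference on the five sample rows. It also checks that the reported percentages match a denominator of 179 non-missing rows (193 − 14), an inference from the numbers rather than documented behaviour.

```python
from collections import Counter

# The five sample values listed for 전화번호.
phones = [
    "054-734-9115",
    "054-734-4463",
    "054-733-0414",
    "054-734-4463",
    "054-732-1709",
]

counts = Counter(phones)
distinct = len(counts)                              # different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

# Reported totals for the full column, relative to non-missing rows.
non_missing = 193 - 14
distinct_pct = 100 * 164 / non_missing
unique_pct = 100 * 151 / non_missing

print(distinct, unique, round(distinct_pct, 1), round(unique_pct, 1))
```

In the sample, 054-734-4463 appears twice, so it counts toward Distinct but not toward Unique.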

Value | Count | Frequency (%)
054-732-7666 | 3 | 1.7%
054-734-4463 | 3 | 1.7%
054-733-1114 | 2 | 1.1%
054-733-9430 | 2 | 1.1%
054-732-6474 | 2 | 1.1%
054-732-0237 | 2 | 1.1%
054-733-7227 | 2 | 1.1%
054-733-8088 | 2 | 1.1%
054-733-1485 | 2 | 1.1%
054-733-9364 | 2 | 1.1%
Other values (154) | 157 | 87.7%

Most occurring characters

Value | Count | Frequency (%)
- | 358 | 16.7%
0 | 305 | 14.2%
4 | 299 | 13.9%
3 | 295 | 13.7%
5 | 242 | 11.3%
7 | 239 | 11.1%
2 | 115 | 5.4%
6 | 89 | 4.1%
1 | 83 | 3.9%
8 | 67 | 3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1790
83.3%
Dash Punctuation 358
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 305
17.0%
4 299
16.7%
3 295
16.5%
5 242
13.5%
7 239
13.4%
2 115
 
6.4%
6 89
 
5.0%
1 83
 
4.6%
8 67
 
3.7%
9 56
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 358
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2148
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 358
16.7%
0 305
14.2%
4 299
13.9%
3 295
13.7%
5 242
11.3%
7 239
11.1%
2 115
 
5.4%
6 89
 
4.1%
1 83
 
3.9%
8 67
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2148
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 358
16.7%
0 305
14.2%
4 299
13.9%
3 295
13.7%
5 242
11.3%
7 239
11.1%
2 115
 
5.4%
6 89
 
4.1%
1 83
 
3.9%
8 67
 
3.1%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
Minimum: 2023-08-17 00:00:00
Maximum: 2023-08-17 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
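
Both views can be computed from a per-cell "is present" flag. The sketch below builds them for three rows taken from the Sample section, with `None` standing in for `<NA>`. It is a simplified illustration, not the profiler's plotting code, and it keeps only three of the five columns.

```python
# Three rows copied from the Sample section; None stands in for <NA>.
rows = [
    {"번호": 5, "건설업체명": "(주)국보산업", "전화번호": "054-732-1709"},
    {"번호": 6, "건설업체명": "(주)금사", "전화번호": None},
    {"번호": 7, "건설업체명": "(주)길록", "전화번호": "054-733-1114"},
]
columns = list(rows[0])

# Per-column view: number of non-missing values in each column.
present_per_column = {c: sum(r[c] is not None for r in rows) for c in columns}

# Matrix view: one list of flags per record, True where a value is present.
nullity_matrix = [[r[c] is not None for c in columns] for r in rows]

print(present_per_column)
for flags in nullity_matrix:
    print(flags)
```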

Sample

Index | 번호 | 건설업체명 | 소재지 | 전화번호 | 데이터기준일자
0 | 1 | (주)경도조경 | 경상북도 영덕군 영해면 영덕로 1831 | 054-734-9115 | 2023-08-17
1 | 2 | (주)경동건설 | 경상북도 영덕군 영덕읍 중앙길 104-1 | 054-734-4463 | 2023-08-17
2 | 3 | (주)경록건설 | 경상북도 영덕군 영해면 영덕로 1725 | 054-733-0414 | 2023-08-17
3 | 4 | (주)경신건설 | 경상북도 영덕군 영덕읍 중앙길 104-1 | 054-734-4463 | 2023-08-17
4 | 5 | (주)국보산업 | 경상북도 영덕군 영해면 예주목은길 349 | 054-732-1709 | 2023-08-17
5 | 6 | (주)금사 | 경상북도 영덕군 영덕읍 영덕로 447 | <NA> | 2023-08-17
6 | 7 | (주)길록 | 경상북도 영덕군 남정면 진불1길 2-1 | 054-733-1114 | 2023-08-17
7 | 8 | (주)남정건설 | 경상북도 영덕군 남정면 진불길 40 | 054-732-5304 | 2023-08-17
8 | 9 | (주)대건산업개발 | 경상북도 영덕군 강구면 나비산길 14-1 | 053-812-2300 | 2023-08-17
9 | 10 | (주)대덕산업 | 경상북도 영덕군 영덕읍 남석길 53 | 054-734-2990 | 2023-08-17

Index | 번호 | 건설업체명 | 소재지 | 전화번호 | 데이터기준일자
183 | 184 | 팔팔가스 | 경상북도 영덕군 영해면 318만세길 129 | 054-733-1500 | 2023-08-17
184 | 185 | 포산건설(주) | 경상북도 영덕군 영덕읍 노물길 19-9 | 054-787-5366 | 2023-08-17
185 | 186 | 하나가스 | 경상북도 영덕군 영해면 벌영길 99 | 054-733-0505 | 2023-08-17
186 | 187 | 하나원수중중기(주) | 경상북도 영덕군 영덕읍 영덕로 320 | 054-733-1005 | 2023-08-17
187 | 188 | 한동건설(주) | 경상북도 영덕군 영덕읍 영덕로 320 | 054-733-8101 | 2023-08-17
188 | 189 | 한샘가스 | 경상북도 영덕군 영덕읍 남산2길 2-17 | 054-732-8898 | 2023-08-17
189 | 190 | 해밀 | 경상북도 영덕군 영덕읍 도매샛길 41-14 | <NA> | 2023-08-17
190 | 191 | 현대가스 | 경상북도 영덕군 영덕읍 하저길 12-6 | 054-732-8111 | 2023-08-17
191 | 192 | 현대설비 | 경상북도 영덕군 강구면 강영로 345 | 054-732-7402 | 2023-08-17
192 | 193 | 화남건설(주) | 경상북도 영덕군 영덕읍 신공업단지길 37 | 054-733-2924 | 2023-08-17