Overview

Dataset statistics

Number of variables: 5
Number of observations: 572
Missing cells: 26
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 23.0 KiB
Average record size in memory: 41.2 B
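
The two derived figures above can be rechecked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of rows, with 1 KiB = 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the Dataset statistics block above.
n_variables = 5
n_observations = 572
missing_cells = 26
total_size_kib = 23.0  # already rounded in the report

total_cells = n_variables * n_observations            # every cell in the table
missing_pct = 100 * missing_cells / total_cells       # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values agree with the report after rounding to one decimal place.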

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

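Description: Registered specialty construction businesses in Suseong-gu, as of August 2018 (수성구 관내 전문건설업 등록현황)
Author: Suseong-gu, Daegu Metropolitan City (대구광역시 수성구)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15054518&dataSetDetailId=150545181a01f25a608ef&provdMethod=FILE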

Alerts

도로명주소 (road-name address) has 6 (1.0%) missing values [Missing]
전화번호 (phone number) has 20 (3.5%) missing values [Missing]
번호 (row number) has unique values [Unique]

Reproduction

Analysis started: 2023-12-10 19:34:49.425068
Analysis finished: 2023-12-10 19:34:50.748689
Duration: 1.32 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호 (row number)
Real number (ℝ)

UNIQUE

Distinct: 572
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 286.5
Minimum: 1
Maximum: 572
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 29.55
Q1: 143.75
Median: 286.5
Q3: 429.25
95-th percentile: 543.45
Maximum: 572
Range: 571
Interquartile range (IQR): 285.5
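
The fractional percentiles above are what linear interpolation between neighbouring order statistics produces. The sketch below is a hand-written illustration rather than the profiler's own code; it assumes that 번호 is exactly the sequence 1..572 (consistent with the minimum, maximum and distinct count reported), and the helper name `quantile` is hypothetical.

```python
import math

def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)          # fractional 0-based position
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])

values = list(range(1, 573))  # assumption: 번호 runs 1..572 without gaps

for label, q in [("5%", 0.05), ("Q1", 0.25), ("median", 0.50), ("Q3", 0.75), ("95%", 0.95)]:
    print(label, round(quantile(values, q), 2))

iqr = quantile(values, 0.75) - quantile(values, 0.25)
print("IQR", iqr)
```

Under that assumption the sketch gives 29.55, 143.75, 286.5, 429.25 and 543.45, and an IQR of 285.5, matching the block above.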

Descriptive statistics

Standard deviation: 165.26645
Coefficient of variation (CV): 0.57684625
Kurtosis: -1.2
Mean: 286.5
Median Absolute Deviation (MAD): 143
Skewness: 0
Sum: 163878
Variance: 27313
Monotonicity: Strictly increasing
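
These figures are what a plain running index would give. As a rough cross-check, the sketch below recomputes them with Python's standard `statistics` module, assuming 번호 is exactly 1..572 and that the reported variance uses the sample (n - 1) denominator.

```python
import statistics

values = list(range(1, 573))  # assumption: 번호 is the sequence 1..572

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])  # median absolute deviation

print(total, mean, variance, round(std, 5), round(cv, 8), mad)
```

The sum (163878), mean (286.5), variance (27313), standard deviation (about 165.26645), CV (about 0.57684625) and MAD (143) all line up with the report under these assumptions.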
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.2%
386 | 1 | 0.2%
380 | 1 | 0.2%
381 | 1 | 0.2%
382 | 1 | 0.2%
383 | 1 | 0.2%
384 | 1 | 0.2%
385 | 1 | 0.2%
387 | 1 | 0.2%
378 | 1 | 0.2%
Other values (562) | 562 | 98.3%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%
Maximum 10 values
Value | Count | Frequency (%)
572 | 1 | 0.2%
571 | 1 | 0.2%
570 | 1 | 0.2%
569 | 1 | 0.2%
568 | 1 | 0.2%
567 | 1 | 0.2%
566 | 1 | 0.2%
565 | 1 | 0.2%
564 | 1 | 0.2%
563 | 1 | 0.2%

업체명 (company name)
Text

Distinct: 443
Distinct (%): 77.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 15
Median length: 13
Mean length: 7.1695804
Min length: 2

Characters and Unicode

Total characters: 4101
Distinct characters: 267
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
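
As a small illustration of the character properties mentioned above, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper `category_counts` is a hypothetical name, and the input is the first sample row of 업체명; the two-letter codes correspond to the long names used in this report (`Lo` = Other Letter, `Nd` = Decimal Number, `Ps`/`Pe` = Open/Close Punctuation, `Po` = Other Punctuation).

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (two-letter codes) of the characters in text."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "(주)1.2.3데코레이션"  # first sample row of 업체명
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Summing such counters over every value in a column would give a per-category breakdown of the kind listed in this section.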

Unique

Unique: 341
Unique (%): 59.6%
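
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of the distinction, using the five sample rows of this variable as input (not the full column):

```python
from collections import Counter

# The five sample rows of 업체명 shown in this report.
names = [
    "(주)1.2.3데코레이션",
    "(주)가야산업개발",
    "(주)가인건업",
    "(주)가인건업",
    "(주)가희에너지",
]

freq = Counter(names)
n_distinct = len(freq)                                 # different values
n_unique = sum(1 for c in freq.values() if c == 1)     # values seen exactly once

print(n_distinct, n_unique)  # 4 3
```

Here (주)가인건업 appears twice, so it counts towards the distinct total but not the unique one, which is why the report's unique count (341) is lower than its distinct count (443).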

Sample

1st row: (주)1.2.3데코레이션
2nd row: (주)가야산업개발
3rd row: (주)가인건업
4th row: (주)가인건업
5th row: (주)가희에너지
Value | Count | Frequency (%)
주)대은 | 8 | 1.4%
주식회사일해 | 5 | 0.9%
주)통일중원 | 4 | 0.7%
토문개발(주 | 4 | 0.7%
주)대맥건설산업 | 3 | 0.5%
대영설비 | 3 | 0.5%
주)영진토건 | 3 | 0.5%
대도토건(주 | 3 | 0.5%
주)삼마토건 | 3 | 0.5%
주)해원건설 | 3 | 0.5%
Other values (434) | 534 | 93.2%

Most occurring characters

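Value | Count | Frequency (%)
[character not preserved] | 437 | 10.7%
( | 402 | 9.8%
) | 402 | 9.8%
[character not preserved] | 196 | 4.8%
[character not preserved] | 194 | 4.7%
[character not preserved] | 94 | 2.3%
[character not preserved] | 79 | 1.9%
[character not preserved] | 75 | 1.8%
[character not preserved] | 61 | 1.5%
[character not preserved] | 59 | 1.4%
Other values (257) | 2102 | 51.3%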

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3275 | 79.9%
Open Punctuation | 402 | 9.8%
Close Punctuation | 402 | 9.8%
Uppercase Letter | 10 | 0.2%
Decimal Number | 8 | 0.2%
Other Punctuation | 3 | 0.1%
Space Separator | 1 | < 0.1%

Most frequent character per category

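Other Letter
Value | Count | Frequency (%)
[character not preserved] | 437 | 13.3%
[character not preserved] | 196 | 6.0%
[character not preserved] | 194 | 5.9%
[character not preserved] | 94 | 2.9%
[character not preserved] | 79 | 2.4%
[character not preserved] | 75 | 2.3%
[character not preserved] | 61 | 1.9%
[character not preserved] | 59 | 1.8%
[character not preserved] | 57 | 1.7%
[character not preserved] | 53 | 1.6%
Other values (244) | 1970 | 60.2%

Decimal Number
Value | Count | Frequency (%)
3 | 3 | 37.5%
1 | 2 | 25.0%
6 | 1 | 12.5%
7 | 1 | 12.5%
2 | 1 | 12.5%

Uppercase Letter
Value | Count | Frequency (%)
G | 3 | 30.0%
N | 3 | 30.0%
E | 3 | 30.0%
B | 1 | 10.0%

Open Punctuation
Value | Count | Frequency (%)
( | 402 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 402 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 3 | 100.0%

Space Separator
Value | Count | Frequency (%)
[space] | 1 | 100.0%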

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3275 | 79.9%
Common | 816 | 19.9%
Latin | 10 | 0.2%

Most frequent character per script

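Hangul
Value | Count | Frequency (%)
[character not preserved] | 437 | 13.3%
[character not preserved] | 196 | 6.0%
[character not preserved] | 194 | 5.9%
[character not preserved] | 94 | 2.9%
[character not preserved] | 79 | 2.4%
[character not preserved] | 75 | 2.3%
[character not preserved] | 61 | 1.9%
[character not preserved] | 59 | 1.8%
[character not preserved] | 57 | 1.7%
[character not preserved] | 53 | 1.6%
Other values (244) | 1970 | 60.2%

Common
Value | Count | Frequency (%)
( | 402 | 49.3%
) | 402 | 49.3%
. | 3 | 0.4%
3 | 3 | 0.4%
1 | 2 | 0.2%
[space] | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
2 | 1 | 0.1%

Latin
Value | Count | Frequency (%)
G | 3 | 30.0%
N | 3 | 30.0%
E | 3 | 30.0%
B | 1 | 10.0%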

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3275 | 79.9%
ASCII | 826 | 20.1%

Most frequent character per block

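Hangul
Value | Count | Frequency (%)
[character not preserved] | 437 | 13.3%
[character not preserved] | 196 | 6.0%
[character not preserved] | 194 | 5.9%
[character not preserved] | 94 | 2.9%
[character not preserved] | 79 | 2.4%
[character not preserved] | 75 | 2.3%
[character not preserved] | 61 | 1.9%
[character not preserved] | 59 | 1.8%
[character not preserved] | 57 | 1.7%
[character not preserved] | 53 | 1.6%
Other values (244) | 1970 | 60.2%

ASCII
Value | Count | Frequency (%)
( | 402 | 48.7%
) | 402 | 48.7%
G | 3 | 0.4%
N | 3 | 0.4%
. | 3 | 0.4%
E | 3 | 0.4%
3 | 3 | 0.4%
1 | 2 | 0.2%
[space] | 1 | 0.1%
6 | 1 | 0.1%
Other values (3) | 3 | 0.4%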

업종 (business type)
Categorical

Distinct: 25
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

난방시공업 제2종: 77
실내건축공사업: 76
기계설비공사업: 57
시설물유지관리업: 48
금속구조물ㆍ창호공사업: 41
Other values (20): 273

Length

Max length: 13
Median length: 10
Mean length: 8.4493007
Min length: 4

Unique

Unique: 2
Unique (%): 0.3%

Sample

1st row: 실내건축공사업
2nd row: 기계설비공사업
3rd row: 금속구조물ㆍ창호공사업
4th row: 실내건축공사업
5th row: 가스시설시공업 제1종

Common Values

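Value | Count | Frequency (%)
난방시공업 제2종 (heating installation, class 2) | 77 | 13.5%
실내건축공사업 (interior construction) | 76 | 13.3%
기계설비공사업 (mechanical equipment installation) | 57 | 10.0%
시설물유지관리업 (facility maintenance and management) | 48 | 8.4%
금속구조물ㆍ창호공사업 (metal structures and windows/doors) | 41 | 7.2%
가스시설시공업 제2종 (gas facility installation, class 2) | 32 | 5.6%
상ㆍ하수도설비공사업 (water supply and sewerage works) | 31 | 5.4%
철근ㆍ콘크리트공사업 (reinforced concrete works) | 30 | 5.2%
토공사업 (earthworks) | 28 | 4.9%
미장ㆍ방수ㆍ조적공사업 (plastering, waterproofing and masonry) | 24 | 4.2%
Other values (15) | 128 | 22.4%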
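
The percentages and the "Other values" row follow directly from the counts and the 572 observations. The sketch below recomputes them; the dictionary simply restates the ten listed categories, and the variable names are illustrative.

```python
n_rows = 572  # number of observations reported in the overview

# Counts of the ten most common 업종 values, as listed above.
counts = {
    "난방시공업 제2종": 77,
    "실내건축공사업": 76,
    "기계설비공사업": 57,
    "시설물유지관리업": 48,
    "금속구조물ㆍ창호공사업": 41,
    "가스시설시공업 제2종": 32,
    "상ㆍ하수도설비공사업": 31,
    "철근ㆍ콘크리트공사업": 30,
    "토공사업": 28,
    "미장ㆍ방수ㆍ조적공사업": 24,
}

pct = {name: round(100 * c / n_rows, 1) for name, c in counts.items()}
other = n_rows - sum(counts.values())        # rows in the remaining 15 categories
other_pct = round(100 * other / n_rows, 1)

for name, p in pct.items():
    print(f"{name}: {p}%")
print(f"Other values: {other} ({other_pct}%)")
```

The remainder comes out at 128 rows (22.4%), which matches the last row of the table.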

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
제2종 | 109 | 15.3%
난방시공업 | 89 | 12.5%
실내건축공사업 | 76 | 10.7%
기계설비공사업 | 57 | 8.0%
가스시설시공업 | 51 | 7.2%
시설물유지관리업 | 48 | 6.7%
금속구조물ㆍ창호공사업 | 41 | 5.8%
상ㆍ하수도설비공사업 | 31 | 4.4%
철근ㆍ콘크리트공사업 | 30 | 4.2%
토공사업 | 28 | 3.9%
Other values (15) | 152 | 21.3%

도로명주소 (road-name address)
Text

MISSING

Distinct: 419
Distinct (%): 74.0%
Missing: 6
Missing (%): 1.0%
Memory size: 4.6 KiB

Length

Max length: 53
Median length: 46
Mean length: 25.849823
Min length: 17

Characters and Unicode

Total characters: 14631
Distinct characters: 158
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 313
Unique (%): 55.3%

Sample

1st row: 대구광역시 수성구 들안로 218 (황금동)
2nd row: 대구광역시 수성구 공경로 12 (만촌동)
3rd row: 대구광역시 수성구 충의로6길 46 대경하이빌 제지하층102호 (만촌동)
4th row: 대구광역시 수성구 충의로6길 46 대경하이빌 제지하층102호 (만촌동)
5th row: 대구광역시 수성구 들안로28길 98 (황금동)
Value | Count | Frequency (%)
수성구 | 566 | 18.8%
대구광역시 | 558 | 18.5%
만촌동 | 122 | 4.1%
범어동 | 81 | 2.7%
지산동 | 58 | 1.9%
황금동 | 45 | 1.5%
두산동 | 38 | 1.3%
2층 | 37 | 1.2%
중동 | 35 | 1.2%
3층 | 35 | 1.2%
Other values (547) | 1435 | 47.7%

Most occurring characters

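Value | Count | Frequency (%)
[space] | 2444 | 16.7%
[character not preserved] | 1208 | 8.3%
[character not preserved] | 663 | 4.5%
[character not preserved] | 657 | 4.5%
[character not preserved] | 651 | 4.4%
[character not preserved] | 629 | 4.3%
[character not preserved] | 595 | 4.1%
[character not preserved] | 563 | 3.8%
[character not preserved] | 560 | 3.8%
[character not preserved] | 559 | 3.8%
Other values (148) | 6102 | 41.7%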

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8753 | 59.8%
Space Separator | 2444 | 16.7%
Decimal Number | 2217 | 15.2%
Open Punctuation | 535 | 3.7%
Close Punctuation | 534 | 3.6%
Dash Punctuation | 96 | 0.7%
Other Punctuation | 50 | 0.3%
Uppercase Letter | 2 | < 0.1%

Most frequent character per category

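Other Letter
Value | Count | Frequency (%)
[character not preserved] | 1208 | 13.8%
[character not preserved] | 663 | 7.6%
[character not preserved] | 657 | 7.5%
[character not preserved] | 651 | 7.4%
[character not preserved] | 629 | 7.2%
[character not preserved] | 595 | 6.8%
[character not preserved] | 563 | 6.4%
[character not preserved] | 560 | 6.4%
[character not preserved] | 559 | 6.4%
[character not preserved] | 253 | 2.9%
Other values (128) | 2415 | 27.6%

Decimal Number
Value | Count | Frequency (%)
1 | 424 | 19.1%
2 | 379 | 17.1%
3 | 272 | 12.3%
4 | 225 | 10.1%
5 | 170 | 7.7%
0 | 167 | 7.5%
6 | 165 | 7.4%
9 | 159 | 7.2%
8 | 135 | 6.1%
7 | 121 | 5.5%

Other Punctuation
Value | Count | Frequency (%)
, | 42 | 84.0%
[character not preserved] | 5 | 10.0%
/ | 2 | 4.0%
. | 1 | 2.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
[space] | 2444 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 535 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 534 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 96 | 100.0%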

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8753 | 59.8%
Common | 5876 | 40.2%
Latin | 2 | < 0.1%

Most frequent character per script

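Hangul
Value | Count | Frequency (%)
[character not preserved] | 1208 | 13.8%
[character not preserved] | 663 | 7.6%
[character not preserved] | 657 | 7.5%
[character not preserved] | 651 | 7.4%
[character not preserved] | 629 | 7.2%
[character not preserved] | 595 | 6.8%
[character not preserved] | 563 | 6.4%
[character not preserved] | 560 | 6.4%
[character not preserved] | 559 | 6.4%
[character not preserved] | 253 | 2.9%
Other values (128) | 2415 | 27.6%

Common
Value | Count | Frequency (%)
[space] | 2444 | 41.6%
( | 535 | 9.1%
) | 534 | 9.1%
1 | 424 | 7.2%
2 | 379 | 6.4%
3 | 272 | 4.6%
4 | 225 | 3.8%
5 | 170 | 2.9%
0 | 167 | 2.8%
6 | 165 | 2.8%
Other values (8) | 561 | 9.5%

Latin
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%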

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8753 | 59.8%
ASCII | 5873 | 40.1%
None | 5 | < 0.1%

Most frequent character per block

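ASCII
Value | Count | Frequency (%)
[space] | 2444 | 41.6%
( | 535 | 9.1%
) | 534 | 9.1%
1 | 424 | 7.2%
2 | 379 | 6.5%
3 | 272 | 4.6%
4 | 225 | 3.8%
5 | 170 | 2.9%
0 | 167 | 2.8%
6 | 165 | 2.8%
Other values (9) | 558 | 9.5%

Hangul
Value | Count | Frequency (%)
[character not preserved] | 1208 | 13.8%
[character not preserved] | 663 | 7.6%
[character not preserved] | 657 | 7.5%
[character not preserved] | 651 | 7.4%
[character not preserved] | 629 | 7.2%
[character not preserved] | 595 | 6.8%
[character not preserved] | 563 | 6.4%
[character not preserved] | 560 | 6.4%
[character not preserved] | 559 | 6.4%
[character not preserved] | 253 | 2.9%
Other values (128) | 2415 | 27.6%

None
Value | Count | Frequency (%)
[character not preserved] | 5 | 100.0%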

전화번호 (phone number)
Text

MISSING

Distinct: 416
Distinct (%): 75.4%
Missing: 20
Missing (%): 3.5%
Memory size: 4.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.007246
Min length: 11

Characters and Unicode

Total characters: 6628
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
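
The character totals for this variable are internally consistent, which the short check below illustrates. It assumes the mean length is computed over non-missing values only, and takes the digit and dash counts from the category table later in this section; the variable names are illustrative.

```python
n_observations = 572
n_missing = 20
total_chars = 6628
digits = 5524   # Decimal Number count
dashes = 1104   # Dash Punctuation count

n_present = n_observations - n_missing     # phone numbers actually recorded
mean_length = total_chars / n_present      # average characters per number
dashes_per_value = dashes / n_present      # hyphens per number

print(n_present, round(mean_length, 6), dashes_per_value)
```

With 552 recorded numbers, the mean length works out to about 12.007246 characters and exactly two hyphens per value on average, consistent with the `053-xxx-xxxx` pattern seen in the sample rows.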

Unique

Unique: 311
Unique (%): 56.3%

Sample

1st row: 053-422-0123
2nd row: 053-751-0426
3rd row: 053-794-1183
4th row: 053-794-1183
5th row: 053-766-2622
Value | Count | Frequency (%)
053-761-0174 | 8 | 1.4%
053-744-3500 | 5 | 0.9%
053-653-3935 | 4 | 0.7%
053-984-3000 | 4 | 0.7%
053-742-1141 | 4 | 0.7%
053-745-8530 | 3 | 0.5%
053-983-9590 | 3 | 0.5%
053-257-0010 | 3 | 0.5%
053-765-0404 | 3 | 0.5%
053-759-4101 | 3 | 0.5%
Other values (406) | 512 | 92.8%

Most occurring characters

Value | Count | Frequency (%)
- | 1104 | 16.7%
5 | 957 | 14.4%
0 | 904 | 13.6%
3 | 836 | 12.6%
7 | 694 | 10.5%
6 | 508 | 7.7%
4 | 413 | 6.2%
1 | 379 | 5.7%
8 | 329 | 5.0%
2 | 296 | 4.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5524 | 83.3%
Dash Punctuation | 1104 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 957 | 17.3%
0 | 904 | 16.4%
3 | 836 | 15.1%
7 | 694 | 12.6%
6 | 508 | 9.2%
4 | 413 | 7.5%
1 | 379 | 6.9%
8 | 329 | 6.0%
2 | 296 | 5.4%
9 | 208 | 3.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 1104 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 6628 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 1104 | 16.7%
5 | 957 | 14.4%
0 | 904 | 13.6%
3 | 836 | 12.6%
7 | 694 | 10.5%
6 | 508 | 7.7%
4 | 413 | 6.2%
1 | 379 | 5.7%
8 | 329 | 5.0%
2 | 296 | 4.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 6628 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 1104 | 16.7%
5 | 957 | 14.4%
0 | 904 | 13.6%
3 | 836 | 12.6%
7 | 694 | 10.5%
6 | 508 | 7.7%
4 | 413 | 6.2%
1 | 379 | 5.7%
8 | 329 | 5.0%
2 | 296 | 4.5%

Interactions


Correlations

     | 번호  | 업종
번호 | 1.000 | 0.543
업종 | 0.543 | 1.000

     | 번호  | 업종
번호 | 1.000 | 0.220
업종 | 0.220 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
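
One common way to express nullity correlation is the Pearson correlation between the present/missing indicators of two columns. The sketch below illustrates that idea with the standard library only. It is not the profiler's implementation, the helper name `nullity_correlation` is hypothetical, and the toy columns are made up rather than taken from this dataset.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the present/missing indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a column with no variation in missingness
    return cov / math.sqrt(var_a * var_b)

# Hypothetical toy columns (None marks a missing value).
address = ["a", None, "c", "d", None, "f"]
phone = ["1", None, "3", None, None, "6"]

r = nullity_correlation(address, phone)
print(round(r, 3))
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one tends to be present when the other is missing, and a value near 0 means their missingness is unrelated.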

Sample

First rows
  | 번호 | 업체명 | 업종 | 도로명주소 | 전화번호
0 | 1 | (주)1.2.3데코레이션 | 실내건축공사업 | 대구광역시 수성구 들안로 218 (황금동) | 053-422-0123
1 | 2 | (주)가야산업개발 | 기계설비공사업 | 대구광역시 수성구 공경로 12 (만촌동) | 053-751-0426
2 | 3 | (주)가인건업 | 금속구조물ㆍ창호공사업 | 대구광역시 수성구 충의로6길 46 대경하이빌 제지하층102호 (만촌동) | 053-794-1183
3 | 4 | (주)가인건업 | 실내건축공사업 | 대구광역시 수성구 충의로6길 46 대경하이빌 제지하층102호 (만촌동) | 053-794-1183
4 | 5 | (주)가희에너지 | 가스시설시공업 제1종 | 대구광역시 수성구 들안로28길 98 (황금동) | 053-766-2622
5 | 6 | (주)강동이앤씨 | 기계설비공사업 | 대구광역시 수성구 청호로85길 9 (범어동) | 053-746-1017
6 | 7 | (주)강산건설 | 미장ㆍ방수ㆍ조적공사업 | 대구광역시 수성구 들안로 220 (황금동) | 053-768-9006
7 | 8 | (주)거성 | 시설물유지관리업 | 대구광역시 수성구 달구벌대로 3300 2층 (신매동) | 053-752-9634
8 | 9 | (주)건영조경건설 | 조경식재공사업 | 대구광역시 수성구 세진로 43-3 (만촌동) | 053-793-5282
9 | 10 | (주)건원건설 | 시설물유지관리업 | 대구광역시 수성구 무열로 203 (만촌동) | 053-811-6935

Last rows
    | 번호 | 업체명 | 업종 | 도로명주소 | 전화번호
562 | 563 | 협진건축종합설비 | 가스시설시공업 제2종 | 대구광역시 수성구 들안로73길 25 (수성동4가) | 053-753-4689
563 | 564 | 협진건축종합설비 | 난방시공업 제1종 | 대구광역시 수성구 들안로73길 25 (수성동4가) | 053-753-4689
564 | 565 | 형제설비.엔지니어링 | 난방시공업 제2종 | 대구광역시 수성구 신천동로 426 | 053-742-7856
565 | 566 | 혜민설비 | 난방시공업 제2종 | 대구광역시 수성구 청수로3길 31 (중동) | <NA>
566 | 567 | 호청설비 | 난방시공업 제2종 | 대구광역시 수성구 청솔로12길 11 (범어동) | 053-766-1875
567 | 568 | 화산기업 | 난방시공업 제2종 | 대구광역시 수성구 파동로30길 11-1 (파동) | 053-764-8228
568 | 569 | 화성산업(주) | 철강재설치공사업 | 대구광역시 수성구 동대구로 111 (황금동) | 053-760-3730
569 | 570 | 화성이엔에이(주) | 금속구조물ㆍ창호공사업 | 대구광역시 수성구 수성로 262 (중동) 2층 | 053-766-2471
570 | 571 | 황금설비공사 | 난방시공업 제2종 | 대구광역시 수성구 파동로 179 (파동) | 053-768-1351
571 | 572 | 힐엔지니어링(주) | 강구조물공사업 | 대구광역시 수성구 동대구로 395 5층 (범어동) | 053-752-3353