Overview

Dataset statistics

Number of variables7
Number of observations203
Missing cells9
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.2 KiB
Average record size in memory56.7 B

Variable types

Text2
Categorical3
DateTime2

Dataset

Description광주광역시 동구 전문건설업 등록 현황입니다. 업체명, 업종, 등록일자, 도로명주소, 전화번호 등으로 구성되어있습니다.
URLhttps://www.data.go.kr/data/15117561/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
비고 is highly overall correlated with 업종 and 1 other fieldsHigh correlation
도로명주소 is highly overall correlated with 비고High correlation
업종 is highly overall correlated with 비고High correlation
비고 is highly imbalanced (73.8%)Imbalance
전화번호 has 9 (4.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:57:41.896704
Analysis finished2023-12-12 14:57:42.357340
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct150
Distinct (%)73.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-12T23:57:42.532956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length7.5566502
Min length4

Characters and Unicode

Total characters1534
Distinct characters190
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique118 ?
Unique (%)58.1%

Sample

1st row(유)영근산업
2nd row(유)영근산업
3rd row(유)영근산업
4th row(유)원앤드헤일리
5th row(자)삼화건설
ValueCountFrequency (%)
건진건설(주 6
 
3.0%
진영피앤씨(주 5
 
2.5%
영창종합건설(주 5
 
2.5%
주)동평토건 4
 
2.0%
주)제도엔지니어링 4
 
2.0%
안산조경(주 3
 
1.5%
유)영근산업 3
 
1.5%
주)거북상사 3
 
1.5%
대창건설(주 3
 
1.5%
주)이스터건설 3
 
1.5%
Other values (140) 164
80.8%
2023-12-12T23:57:42.887695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
 
11.4%
) 162
 
10.6%
( 161
 
10.5%
82
 
5.3%
82
 
5.3%
31
 
2.0%
25
 
1.6%
24
 
1.6%
22
 
1.4%
21
 
1.4%
Other values (180) 749
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1207
78.7%
Close Punctuation 162
 
10.6%
Open Punctuation 161
 
10.5%
Uppercase Letter 3
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
14.5%
82
 
6.8%
82
 
6.8%
31
 
2.6%
25
 
2.1%
24
 
2.0%
22
 
1.8%
21
 
1.7%
20
 
1.7%
18
 
1.5%
Other values (174) 707
58.6%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
G 1
33.3%
N 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 162
100.0%
Open Punctuation
ValueCountFrequency (%)
( 161
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1207
78.7%
Common 324
 
21.1%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
175
 
14.5%
82
 
6.8%
82
 
6.8%
31
 
2.6%
25
 
2.1%
24
 
2.0%
22
 
1.8%
21
 
1.7%
20
 
1.7%
18
 
1.5%
Other values (174) 707
58.6%
Common
ValueCountFrequency (%)
) 162
50.0%
( 161
49.7%
, 1
 
0.3%
Latin
ValueCountFrequency (%)
E 1
33.3%
G 1
33.3%
N 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1207
78.7%
ASCII 327
 
21.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
175
 
14.5%
82
 
6.8%
82
 
6.8%
31
 
2.6%
25
 
2.1%
24
 
2.0%
22
 
1.8%
21
 
1.7%
20
 
1.7%
18
 
1.5%
Other values (174) 707
58.6%
ASCII
ValueCountFrequency (%)
) 162
49.5%
( 161
49.2%
E 1
 
0.3%
G 1
 
0.3%
N 1
 
0.3%
, 1
 
0.3%

업종
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
실내건축공사업
32 
가스난방공사업
30 
도장ㆍ습식ㆍ방수ㆍ석공사업
27 
시설물유지관리업
17 
조경식재ㆍ시설물공사업
17 
Other values (8)
80 

Length

Max length17
Median length13
Mean length9.6896552
Min length7

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row상ㆍ하수도설비공사업
2nd row도장ㆍ습식ㆍ방수ㆍ석공사업
3rd row시설물유지관리업
4th row실내건축공사업
5th row도장ㆍ습식ㆍ방수ㆍ석공사업

Common Values

ValueCountFrequency (%)
실내건축공사업 32
15.8%
가스난방공사업 30
14.8%
도장ㆍ습식ㆍ방수ㆍ석공사업 27
13.3%
시설물유지관리업 17
8.4%
조경식재ㆍ시설물공사업 17
8.4%
상ㆍ하수도설비공사업 16
7.9%
기계가스설비공사업 15
7.4%
지반조성ㆍ포장공사업 15
7.4%
철근ㆍ콘크리트공사업 13
6.4%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 10
 
4.9%
Other values (3) 11
 
5.4%

Length

2023-12-12T23:57:43.011860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내건축공사업 32
15.8%
가스난방공사업 30
14.8%
도장ㆍ습식ㆍ방수ㆍ석공사업 27
13.3%
시설물유지관리업 17
8.4%
조경식재ㆍ시설물공사업 17
8.4%
상ㆍ하수도설비공사업 16
7.9%
기계가스설비공사업 15
7.4%
지반조성ㆍ포장공사업 15
7.4%
철근ㆍ콘크리트공사업 13
6.4%
금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업 10
 
4.9%
Other values (3) 11
 
5.4%
Distinct179
Distinct (%)88.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum1976-11-10 00:00:00
Maximum2023-06-22 00:00:00
2023-12-12T23:57:43.127970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:57:43.257646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

도로명주소
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)19.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
광주광역시 동구 필문대로 **********
27 
광주광역시 동구 천변우로 **********
21 
광주광역시 동구 남문로 **********
17 
광주광역시 동구 금남로 **********
14 
광주광역시 동구 무등로 **********
14 
Other values (34)
110 

Length

Max length24
Median length23
Mean length23.285714
Min length23

Unique

Unique12 ?
Unique (%)5.9%

Sample

1st row광주광역시 동구 백서로 **********
2nd row광주광역시 동구 백서로 **********
3rd row광주광역시 동구 백서로 **********
4th row광주광역시 동구 무등로 **********
5th row광주광역시 동구 구성로 **********

Common Values

ValueCountFrequency (%)
광주광역시 동구 필문대로 ********** 27
 
13.3%
광주광역시 동구 천변우로 ********** 21
 
10.3%
광주광역시 동구 남문로 ********** 17
 
8.4%
광주광역시 동구 금남로 ********** 14
 
6.9%
광주광역시 동구 무등로 ********** 14
 
6.9%
광주광역시 동구 독립로 ********** 13
 
6.4%
광주광역시 동구 밤실로 ********** 10
 
4.9%
광주광역시 동구 중앙로 ********** 7
 
3.4%
광주광역시 동구 지호로 ********** 6
 
3.0%
광주광역시 동구 제봉로 ********** 6
 
3.0%
Other values (29) 68
33.5%

Length

2023-12-12T23:57:43.390743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
광주광역시 203
25.0%
203
25.0%
동구 203
25.0%
필문대로 27
 
3.3%
천변우로 21
 
2.6%
남문로 18
 
2.2%
금남로 14
 
1.7%
무등로 14
 
1.7%
독립로 13
 
1.6%
밤실로 10
 
1.2%
Other values (31) 86
10.6%

전화번호
Text

MISSING 

Distinct141
Distinct (%)72.7%
Missing9
Missing (%)4.4%
Memory size1.7 KiB
2023-12-12T23:57:43.609522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.010309
Min length11

Characters and Unicode

Total characters2330
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)56.7%

Sample

1st row062-385-0238
2nd row062-385-0238
3rd row062-385-0238
4th row062-228-2858
5th row062-233-0235
ValueCountFrequency (%)
062-239-8189 6
 
3.1%
070-7121-9299 5
 
2.6%
062-574-8484 5
 
2.6%
02-532-7261 5
 
2.6%
062-514-4614 4
 
2.1%
062-372-7750 4
 
2.1%
062-233-0680 3
 
1.5%
062-224-1873 3
 
1.5%
062-385-0238 3
 
1.5%
062-220-2563 3
 
1.5%
Other values (131) 153
78.9%
2023-12-12T23:57:43.969085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 456
19.6%
- 388
16.7%
0 352
15.1%
6 275
11.8%
5 154
 
6.6%
3 142
 
6.1%
1 130
 
5.6%
7 128
 
5.5%
4 118
 
5.1%
8 108
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1942
83.3%
Dash Punctuation 388
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 456
23.5%
0 352
18.1%
6 275
14.2%
5 154
 
7.9%
3 142
 
7.3%
1 130
 
6.7%
7 128
 
6.6%
4 118
 
6.1%
8 108
 
5.6%
9 79
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 388
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2330
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 456
19.6%
- 388
16.7%
0 352
15.1%
6 275
11.8%
5 154
 
6.6%
3 142
 
6.1%
1 130
 
5.6%
7 128
 
5.5%
4 118
 
5.1%
8 108
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2330
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 456
19.6%
- 388
16.7%
0 352
15.1%
6 275
11.8%
5 154
 
6.6%
3 142
 
6.1%
1 130
 
5.6%
7 128
 
5.5%
4 118
 
5.1%
8 108
 
4.6%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
<NA>
194 
전화번호 데이터 미보유
 
9

Length

Max length12
Median length4
Mean length4.3546798
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row전화번호 데이터 미보유
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 194
95.6%
전화번호 데이터 미보유 9
 
4.4%

Length

2023-12-12T23:57:44.113454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:57:44.211720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 194
87.8%
전화번호 9
 
4.1%
데이터 9
 
4.1%
미보유 9
 
4.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2023-08-01 00:00:00
Maximum2023-08-01 00:00:00
2023-12-12T23:57:44.283875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:57:44.361200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T23:57:44.431367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종도로명주소
업종1.0000.000
도로명주소0.0001.000
2023-12-12T23:57:44.518670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비고도로명주소업종
비고1.0001.0001.000
도로명주소1.0001.0000.000
업종1.0000.0001.000
2023-12-12T23:57:44.598960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종도로명주소비고
업종1.0000.0001.000
도로명주소0.0001.0001.000
비고1.0001.0001.000

Missing values

2023-12-12T23:57:42.186946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:57:42.309134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명업종등록일자도로명주소전화번호비고데이터기준일자
0(유)영근산업상ㆍ하수도설비공사업2007-08-29광주광역시 동구 백서로 **********062-385-0238<NA>2023-08-01
1(유)영근산업도장ㆍ습식ㆍ방수ㆍ석공사업2021-02-16광주광역시 동구 백서로 **********062-385-0238<NA>2023-08-01
2(유)영근산업시설물유지관리업2008-09-22광주광역시 동구 백서로 **********062-385-0238<NA>2023-08-01
3(유)원앤드헤일리실내건축공사업2021-11-29광주광역시 동구 무등로 **********<NA>전화번호 데이터 미보유2023-08-01
4(자)삼화건설도장ㆍ습식ㆍ방수ㆍ석공사업1976-11-10광주광역시 동구 구성로 **********062-228-2858<NA>2023-08-01
5(주)가영세라믹스도장ㆍ습식ㆍ방수ㆍ석공사업2020-05-08광주광역시 동구 천변우로 **********062-233-0235<NA>2023-08-01
6(주)거북상사실내건축공사업2002-04-12광주광역시 동구 천변좌로 **********062-223-6835<NA>2023-08-01
7(주)거북상사시설물유지관리업2002-04-12광주광역시 동구 천변좌로 **********062-223-6835<NA>2023-08-01
8(주)거북상사금속ㆍ창호ㆍ지붕ㆍ건축물조립공사업2004-11-19광주광역시 동구 천변좌로 **********062-223-6835<NA>2023-08-01
9(주)경동이엔지가스난방공사업2005-12-14광주광역시 동구 서암대로 **********062-523-8879<NA>2023-08-01
업체명업종등록일자도로명주소전화번호비고데이터기준일자
193하이가스가스난방공사업2023-05-03광주광역시 동구 소태길 **********062-223-3345<NA>2023-08-01
194한국리페아(주)조경식재ㆍ시설물공사업2013-07-15광주광역시 동구 천변우로 **********062-233-0680<NA>2023-08-01
195한국리페아(주)시설물유지관리업2000-10-02광주광역시 동구 천변우로 **********062-233-0680<NA>2023-08-01
196한국리페아(주)도장ㆍ습식ㆍ방수ㆍ석공사업2020-04-01광주광역시 동구 천변우로 **********062-233-0680<NA>2023-08-01
197한아름개발(주)시설물유지관리업2009-07-15광주광역시 동구 밤실로 **********062-232-4100<NA>2023-08-01
198한아름개발(주)실내건축공사업2004-12-20광주광역시 동구 밤실로 **********062-232-4100<NA>2023-08-01
199현대종합관리(주)승강기ㆍ삭도공사업2004-11-16광주광역시 동구 천변우로 **********062-225-5599<NA>2023-08-01
200현성건설(주)철근ㆍ콘크리트공사업2017-10-19광주광역시 동구 천변우로 **********062-571-6021<NA>2023-08-01
201형제가스설비가스난방공사업2022-05-10광주광역시 동구 산수길 **********<NA>전화번호 데이터 미보유2023-08-01
202화일산업(주)도장ㆍ습식ㆍ방수ㆍ석공사업2001-09-06광주광역시 동구 남문로 **********062-226-4251<NA>2023-08-01