Overview

Dataset statistics

Number of variables4
Number of observations871
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory27.3 KiB
Average record size in memory32.2 B

Variable types

Text1
DateTime1
Categorical2

Dataset

Description인천광역시 서구 전문건설업 현황에 대한 데이터로 전문건설업의 상호, 등록일자, 업종 등의 정보가 포함되어 있습니다.
Author인천광역시 서구
URLhttps://www.data.go.kr/data/15090869/fileData.do

Alerts

데이터기준일자 has constant value ""Constant

Reproduction

Analysis started2023-12-12 06:37:23.270187
Analysis finished2023-12-12 06:37:23.649738
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct595
Distinct (%)68.3%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
2023-12-12T15:37:23.865349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length7.4753157
Min length2

Characters and Unicode

Total characters6511
Distinct characters323
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique396 ?
Unique (%)45.5%

Sample

1st row주식회사홍인
2nd row성심설비
3rd row이룸토건(주)
4th row주식회사대성설비
5th row진형건설(주)
ValueCountFrequency (%)
에스지이(주 8
 
0.9%
주)한국교량 8
 
0.9%
주)부일건설산업 7
 
0.8%
아주지오텍(주 6
 
0.7%
한밭건설(주 5
 
0.6%
토방이앤이(주 5
 
0.6%
홍진건설(주 5
 
0.6%
주)제이투이앤씨 5
 
0.6%
엘케이건설산업(주 4
 
0.5%
주)펜테크 4
 
0.5%
Other values (585) 814
93.5%
2023-12-12T15:37:24.332770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
700
 
10.8%
( 659
 
10.1%
) 659
 
10.1%
262
 
4.0%
249
 
3.8%
230
 
3.5%
113
 
1.7%
96
 
1.5%
96
 
1.5%
95
 
1.5%
Other values (313) 3352
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5149
79.1%
Open Punctuation 659
 
10.1%
Close Punctuation 659
 
10.1%
Uppercase Letter 31
 
0.5%
Decimal Number 12
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
700
 
13.6%
262
 
5.1%
249
 
4.8%
230
 
4.5%
113
 
2.2%
96
 
1.9%
96
 
1.9%
95
 
1.8%
93
 
1.8%
91
 
1.8%
Other values (298) 3124
60.7%
Uppercase Letter
ValueCountFrequency (%)
E 9
29.0%
N 9
29.0%
G 9
29.0%
J 2
 
6.5%
H 1
 
3.2%
C 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
1 6
50.0%
2 2
 
16.7%
5 1
 
8.3%
6 1
 
8.3%
3 1
 
8.3%
9 1
 
8.3%
Open Punctuation
ValueCountFrequency (%)
( 659
100.0%
Close Punctuation
ValueCountFrequency (%)
) 659
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5149
79.1%
Common 1331
 
20.4%
Latin 31
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
700
 
13.6%
262
 
5.1%
249
 
4.8%
230
 
4.5%
113
 
2.2%
96
 
1.9%
96
 
1.9%
95
 
1.8%
93
 
1.8%
91
 
1.8%
Other values (298) 3124
60.7%
Common
ValueCountFrequency (%)
( 659
49.5%
) 659
49.5%
1 6
 
0.5%
2 2
 
0.2%
- 1
 
0.1%
5 1
 
0.1%
6 1
 
0.1%
3 1
 
0.1%
9 1
 
0.1%
Latin
ValueCountFrequency (%)
E 9
29.0%
N 9
29.0%
G 9
29.0%
J 2
 
6.5%
H 1
 
3.2%
C 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5149
79.1%
ASCII 1362
 
20.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
700
 
13.6%
262
 
5.1%
249
 
4.8%
230
 
4.5%
113
 
2.2%
96
 
1.9%
96
 
1.9%
95
 
1.8%
93
 
1.8%
91
 
1.8%
Other values (298) 3124
60.7%
ASCII
ValueCountFrequency (%)
( 659
48.4%
) 659
48.4%
E 9
 
0.7%
N 9
 
0.7%
G 9
 
0.7%
1 6
 
0.4%
J 2
 
0.1%
2 2
 
0.1%
- 1
 
0.1%
5 1
 
0.1%
Other values (5) 5
 
0.4%
Distinct702
Distinct (%)80.6%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
Minimum1979-07-20 00:00:00
Maximum2021-09-03 00:00:00
2023-12-12T15:37:24.501316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:37:24.643490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업종
Categorical

Distinct26
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
기계설비공사업
98 
실내건축공사업
80 
금속구조물ㆍ창호ㆍ온실공사업
80 
가스시설시공업 제3종
65 
난방시공업 제2종
64 
Other values (21)
484 

Length

Max length14
Median length11
Mean length8.7726751
Min length4

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row조경시설물설치공사업
2nd row기계설비공사업
3rd row보링ㆍ그라우팅공사업
4th row난방시공업 제2종
5th row철근ㆍ콘크리트공사업

Common Values

ValueCountFrequency (%)
기계설비공사업 98
 
11.3%
실내건축공사업 80
 
9.2%
금속구조물ㆍ창호ㆍ온실공사업 80
 
9.2%
가스시설시공업 제3종 65
 
7.5%
난방시공업 제2종 64
 
7.3%
조경식재공사업 57
 
6.5%
상ㆍ하수도설비공사업 50
 
5.7%
시설물유지관리업 41
 
4.7%
철근ㆍ콘크리트공사업 40
 
4.6%
비계ㆍ구조물해체공사업 36
 
4.1%
Other values (16) 260
29.9%

Length

2023-12-12T15:37:24.798160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가스시설시공업 103
 
9.7%
기계설비공사업 98
 
9.2%
제2종 96
 
9.0%
난방시공업 89
 
8.4%
실내건축공사업 80
 
7.5%
금속구조물ㆍ창호ㆍ온실공사업 80
 
7.5%
제3종 67
 
6.3%
조경식재공사업 57
 
5.4%
상ㆍ하수도설비공사업 50
 
4.7%
시설물유지관리업 41
 
3.9%
Other values (15) 302
28.4%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.9 KiB
2021-09-10
871 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-09-10
2nd row2021-09-10
3rd row2021-09-10
4th row2021-09-10
5th row2021-09-10

Common Values

ValueCountFrequency (%)
2021-09-10 871
100.0%

Length

2023-12-12T15:37:24.925802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:37:25.010331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-09-10 871
100.0%

Missing values

2023-12-12T15:37:23.497681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:37:23.612736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명등록일자업종데이터기준일자
0주식회사홍인2021-09-03조경시설물설치공사업2021-09-10
1성심설비2021-09-02기계설비공사업2021-09-10
2이룸토건(주)2021-08-30보링ㆍ그라우팅공사업2021-09-10
3주식회사대성설비2021-08-27난방시공업 제2종2021-09-10
4진형건설(주)2021-08-25철근ㆍ콘크리트공사업2021-09-10
5주식회사서원종합건설2021-08-23실내건축공사업2021-09-10
6주식회사원기기공2021-08-12기계설비공사업2021-09-10
7삼양포장중기(주)2021-08-11포장공사업2021-09-10
8주식회사홍인2021-08-10조경식재공사업2021-09-10
9주식회사피움이노베이션2021-08-09도장공사업2021-09-10
업체명등록일자업종데이터기준일자
861(주)부일건설산업1994-12-22철근ㆍ콘크리트공사업2021-09-10
862승재설비1994-08-26가스시설시공업 제3종2021-09-10
863(주)석연기업1992-09-02금속구조물ㆍ창호ㆍ온실공사업2021-09-10
864정서진건설(주)1992-09-01철근ㆍ콘크리트공사업2021-09-10
865(주)모아건설1992-09-01상ㆍ하수도설비공사업2021-09-10
866(주)남강엔지니어링1992-08-31기계설비공사업2021-09-10
867(주)제이투이앤씨1992-08-31도장공사업2021-09-10
868덕원산업개발(주)1992-08-31상ㆍ하수도설비공사업2021-09-10
869(주)청우하이드로1982-12-18기계설비공사업2021-09-10
870진흥건업(주)1979-07-20금속구조물ㆍ창호ㆍ온실공사업2021-09-10