Overview

Dataset statistics

Number of variables6
Number of observations137
Missing cells104
Missing cells (%)12.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory49.9 B

Variable types

Numeric1
DateTime2
Text2
Categorical1

Dataset

Description전북특별자치도 건설 기술 용역업 등록 현황 데이터입니다.등록번호, 업채명, 대표자, 등록분야, 등록일자를 포함하고 있습니다.
Author전북특별자치도
URLhttps://www.data.go.kr/data/15045385/fileData.do

Alerts

등록번호 has 104 (75.9%) missing valuesMissing
연번 has unique valuesUnique
업 체 명 has unique valuesUnique

Reproduction

Analysis started2024-03-14 17:59:20.299703
Analysis finished2024-03-14 17:59:21.450053
Duration1.15 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct137
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69
Minimum1
Maximum137
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2024-03-15T02:59:21.635135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.8
Q135
median69
Q3103
95-th percentile130.2
Maximum137
Range136
Interquartile range (IQR)68

Descriptive statistics

Standard deviation39.692569
Coefficient of variation (CV)0.57525462
Kurtosis-1.2
Mean69
Median Absolute Deviation (MAD)34
Skewness0
Sum9453
Variance1575.5
MonotonicityStrictly increasing
2024-03-15T02:59:22.101511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
89 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
96 1
 
0.7%
104 1
 
0.7%
Other values (127) 127
92.7%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
137 1
0.7%
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%
128 1
0.7%

등록번호
Date

MISSING 

Distinct31
Distinct (%)93.9%
Missing104
Missing (%)75.9%
Memory size1.2 KiB
Minimum2014-02-03 00:00:00
Maximum2015-02-28 00:00:00
2024-03-15T02:59:22.463339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:59:23.147010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)

업 체 명
Text

UNIQUE 

Distinct137
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-03-15T02:59:23.959310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length8.5912409
Min length3

Characters and Unicode

Total characters1177
Distinct characters167
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique137 ?
Unique (%)100.0%

Sample

1st row㈜길종합건축사사무소이엔지
2nd row㈜세화엔지니어링
3rd row㈜지유엔지니어링
4th row㈜국성건설엔지니어링
5th row㈜현성엔지니어링
ValueCountFrequency (%)
주식회사 15
 
9.6%
유한회사 2
 
1.3%
한국농어촌공사 2
 
1.3%
㈜길종합건축사사무소이엔지 1
 
0.6%
㈜동서기술 1
 
0.6%
다온지앤아이 1
 
0.6%
주)천마종합건설 1
 
0.6%
㈜대성건축사사무소 1
 
0.6%
주)새롬이엔지 1
 
0.6%
㈜우향이앤씨 1
 
0.6%
Other values (130) 130
83.3%
2024-03-15T02:59:25.036854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 69
 
5.9%
( 69
 
5.9%
56
 
4.8%
51
 
4.3%
50
 
4.2%
46
 
3.9%
43
 
3.7%
41
 
3.5%
39
 
3.3%
35
 
3.0%
Other values (157) 678
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 972
82.6%
Close Punctuation 69
 
5.9%
Open Punctuation 69
 
5.9%
Other Symbol 46
 
3.9%
Space Separator 19
 
1.6%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
5.8%
51
 
5.2%
50
 
5.1%
43
 
4.4%
41
 
4.2%
39
 
4.0%
35
 
3.6%
33
 
3.4%
33
 
3.4%
32
 
3.3%
Other values (151) 559
57.5%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
G 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Other Symbol
ValueCountFrequency (%)
46
100.0%
Space Separator
ValueCountFrequency (%)
19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1018
86.5%
Common 157
 
13.3%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
5.5%
51
 
5.0%
50
 
4.9%
46
 
4.5%
43
 
4.2%
41
 
4.0%
39
 
3.8%
35
 
3.4%
33
 
3.2%
33
 
3.2%
Other values (152) 591
58.1%
Common
ValueCountFrequency (%)
) 69
43.9%
( 69
43.9%
19
 
12.1%
Latin
ValueCountFrequency (%)
S 1
50.0%
G 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 972
82.6%
ASCII 159
 
13.5%
None 46
 
3.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 69
43.4%
( 69
43.4%
19
 
11.9%
S 1
 
0.6%
G 1
 
0.6%
Hangul
ValueCountFrequency (%)
56
 
5.8%
51
 
5.2%
50
 
5.1%
43
 
4.4%
41
 
4.2%
39
 
4.0%
35
 
3.6%
33
 
3.4%
33
 
3.4%
32
 
3.3%
Other values (151) 559
57.5%
None
ValueCountFrequency (%)
46
100.0%
Distinct128
Distinct (%)93.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-03-15T02:59:26.514231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.2846715
Min length2

Characters and Unicode

Total characters450
Distinct characters103
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique120 ?
Unique (%)87.6%

Sample

1st row이*환
2nd row박*순
3rd row최*갑
4th row박*우
5th row천*율
ValueCountFrequency (%)
김*수 3
 
2.1%
이*행 2
 
1.4%
이*순 2
 
1.4%
박*철 2
 
1.4%
이*정 2
 
1.4%
이*호 2
 
1.4%
이*영 2
 
1.4%
김*희 2
 
1.4%
이*형 2
 
1.4%
김*영 2
 
1.4%
Other values (124) 124
85.5%
2024-03-15T02:59:28.441869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 145
32.2%
31
 
6.9%
24
 
5.3%
17
 
3.8%
9
 
2.0%
9
 
2.0%
8
 
1.8%
, 8
 
1.8%
8
 
1.8%
7
 
1.6%
Other values (93) 184
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 289
64.2%
Other Punctuation 153
34.0%
Space Separator 8
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
10.7%
24
 
8.3%
17
 
5.9%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (90) 166
57.4%
Other Punctuation
ValueCountFrequency (%)
* 145
94.8%
, 8
 
5.2%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 289
64.2%
Common 161
35.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
10.7%
24
 
8.3%
17
 
5.9%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (90) 166
57.4%
Common
ValueCountFrequency (%)
* 145
90.1%
, 8
 
5.0%
8
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 289
64.2%
ASCII 161
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 145
90.1%
, 8
 
5.0%
8
 
5.0%
Hangul
ValueCountFrequency (%)
31
 
10.7%
24
 
8.3%
17
 
5.9%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (90) 166
57.4%

등록분야
Categorical

Distinct8
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
설계등용역 일반
58 
일반
41 
설계등용역 일반(계획·조사·설계 제외)
11 
측량
11 
건설사업관리
Other values (3)
10 

Length

Max length21
Median length13
Mean length6.4087591
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row건설사업관리
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
설계등용역 일반 58
42.3%
일반 41
29.9%
설계등용역 일반(계획·조사·설계 제외) 11
 
8.0%
측량 11
 
8.0%
건설사업관리 6
 
4.4%
특수 6
 
4.4%
토목, 특수 3
 
2.2%
설계등용역, 토목, 특수 1
 
0.7%

Length

2024-03-15T02:59:28.903442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:59:29.293367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 99
44.6%
설계등용역 70
31.5%
일반(계획·조사·설계 11
 
5.0%
제외 11
 
5.0%
측량 11
 
5.0%
특수 10
 
4.5%
건설사업관리 6
 
2.7%
토목 4
 
1.8%
Distinct98
Distinct (%)71.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
Minimum2014-07-11 00:00:00
Maximum2018-10-29 00:00:00
2024-03-15T02:59:29.804694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:59:30.265617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-03-15T02:59:20.692940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:59:30.530649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록번호등록분야등록일자
연번1.0001.0000.6380.985
등록번호1.0001.0000.9480.978
등록분야0.6380.9481.0000.000
등록일자0.9850.9780.0001.000
2024-03-15T02:59:30.784560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록분야
연번1.0000.370
등록분야0.3701.000

Missing values

2024-03-15T02:59:21.097703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:59:21.351682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번등록번호업 체 명대표자등록분야등록일자
012014-02-03㈜길종합건축사사무소이엔지이*환건설사업관리2014-07-11
122014-02-04㈜세화엔지니어링박*순일반2014-07-14
232014-02-05㈜지유엔지니어링최*갑일반2014-07-17
342014-02-06㈜국성건설엔지니어링박*우일반2014-07-23
452014-02-07㈜현성엔지니어링천*율일반2014-08-12
562014-02-08㈜인우송*섭일반2014-09-12
672014-02-09기술사건축사사무소미문건설기술연구원㈜전*영건설사업관리2014-09-12
782014-02-10성원기술개발㈜온*수일반2014-09-26
892014-02-11㈜세종건설기술유*일일반2014-09-26
9102014-02-12(유)세종건설기술조*목일반2014-09-26
연번등록번호업 체 명대표자등록분야등록일자
1271282014-03-01한국품질검사시험원(주)김*철토목, 특수2014-07-11
1281292014-03-02태안특수건설(주)이*준특수2014-07-24
1291302014-03-03쏘일락이엔지(주)백*문, 송*량특수2014-08-26
1301312014-03-06(유)대한건설품질연구원백*헌토목, 특수2015-03-30
1311322014-03-08한국농어촌공사 전북지역본부박*만특수2015-09-08
1321332014-03-09한국농어촌공사 새만금사업단심*섭특수2015-09-08
1331342014-03-11원천개발(주)이*혜특수2016-11-28
1341352014-03-12유일검사엔지니어링(주)이*효특수2017-11-30
1351362014-03-13유한회사 지오건설품질연구원이*록토목, 특수2018-07-24
1361372014-04-02㈜건설품질시험원이*열설계등용역, 토목, 특수2015-07-02