Overview

Dataset statistics

Number of variables10
Number of observations120
Missing cells97
Missing cells (%)8.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.6 KiB
Average record size in memory82.1 B

Variable types

Text5
Categorical1
Numeric1
DateTime3

Dataset

Description경기주택도시공사_건축·토목·기계·전기분야 신기술·제품 정보
Author경기주택도시공사
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=0HCH2VHKU3SPCTZUCNM829283071&infSeq=1

Alerts

업체명 has 8 (6.7%) missing valuesMissing
사업자등록번호 has 89 (74.2%) missing valuesMissing

Reproduction

Analysis started2024-03-23 01:40:19.797676
Analysis finished2024-03-23 01:40:24.143499
Duration4.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공종
Text

Distinct86
Distinct (%)71.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-03-23T01:40:24.469200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length9.1583333
Min length6

Characters and Unicode

Total characters1099
Distinct characters177
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)57.5%

Sample

1st row건축(가설재, 가시설)
2nd row건축(목창호)
3rd row건축(무근 콘크리트)
4th row건축(바닥재)
5th row건축(방문)
ValueCountFrequency (%)
토목(교량 10
 
6.6%
토목(가시설 6
 
3.9%
토목(교량공 5
 
3.3%
토목(옹벽 4
 
2.6%
토목(옹벽공 3
 
2.0%
건축(방수 3
 
2.0%
건축(화재대피시설 3
 
2.0%
건축(철근콘크리트 3
 
2.0%
콘크리트 3
 
2.0%
토목(아스팔트 2
 
1.3%
Other values (97) 110
72.4%
2024-03-23T01:40:25.486304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 123
 
11.2%
) 123
 
11.2%
68
 
6.2%
67
 
6.1%
38
 
3.5%
32
 
2.9%
22
 
2.0%
22
 
2.0%
21
 
1.9%
19
 
1.7%
Other values (167) 564
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 795
72.3%
Open Punctuation 123
 
11.2%
Close Punctuation 123
 
11.2%
Space Separator 32
 
2.9%
Other Punctuation 13
 
1.2%
Uppercase Letter 13
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
 
8.6%
67
 
8.4%
38
 
4.8%
22
 
2.8%
22
 
2.8%
21
 
2.6%
19
 
2.4%
18
 
2.3%
18
 
2.3%
16
 
2.0%
Other values (153) 486
61.1%
Uppercase Letter
ValueCountFrequency (%)
D 3
23.1%
E 2
15.4%
L 2
15.4%
C 2
15.4%
T 1
 
7.7%
V 1
 
7.7%
P 1
 
7.7%
S 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 9
69.2%
. 3
 
23.1%
/ 1
 
7.7%
Open Punctuation
ValueCountFrequency (%)
( 123
100.0%
Close Punctuation
ValueCountFrequency (%)
) 123
100.0%
Space Separator
ValueCountFrequency (%)
32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 795
72.3%
Common 291
 
26.5%
Latin 13
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
 
8.6%
67
 
8.4%
38
 
4.8%
22
 
2.8%
22
 
2.8%
21
 
2.6%
19
 
2.4%
18
 
2.3%
18
 
2.3%
16
 
2.0%
Other values (153) 486
61.1%
Latin
ValueCountFrequency (%)
D 3
23.1%
E 2
15.4%
L 2
15.4%
C 2
15.4%
T 1
 
7.7%
V 1
 
7.7%
P 1
 
7.7%
S 1
 
7.7%
Common
ValueCountFrequency (%)
( 123
42.3%
) 123
42.3%
32
 
11.0%
, 9
 
3.1%
. 3
 
1.0%
/ 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 795
72.3%
ASCII 304
 
27.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 123
40.5%
) 123
40.5%
32
 
10.5%
, 9
 
3.0%
. 3
 
1.0%
D 3
 
1.0%
E 2
 
0.7%
L 2
 
0.7%
C 2
 
0.7%
T 1
 
0.3%
Other values (4) 4
 
1.3%
Hangul
ValueCountFrequency (%)
68
 
8.6%
67
 
8.4%
38
 
4.8%
22
 
2.8%
22
 
2.8%
21
 
2.6%
19
 
2.4%
18
 
2.3%
18
 
2.3%
16
 
2.0%
Other values (153) 486
61.1%

적용분야
Categorical

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
공법
67 
자재
53 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자재
2nd row자재
3rd row자재
4th row자재
5th row자재

Common Values

ValueCountFrequency (%)
공법 67
55.8%
자재 53
44.2%

Length

2024-03-23T01:40:25.938701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T01:40:26.294102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공법 67
55.8%
자재 53
44.2%
Distinct119
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-03-23T01:40:26.781443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length78
Median length43
Mean length25.75
Min length4

Characters and Unicode

Total characters3090
Distinct characters377
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique118 ?
Unique (%)98.3%

Sample

1st row특허받은 잭서포트, 센지주
2nd row손끼임방지 안전도어 및 안전손잡이
3rd row무근콘크리트(주차장, 옥상) 균열방지판
4th row2액형 에폭시 수지 조성물(무취 에폭시 논슬립 주차장 바닥재)
5th row손끼임방지기능이 추가된 다기능 문세트
ValueCountFrequency (%)
15
 
2.4%
이용한 14
 
2.3%
공법 10
 
1.6%
6
 
1.0%
기술 6
 
1.0%
거더 5
 
0.8%
콘크리트 5
 
0.8%
cctv 4
 
0.6%
시스템 4
 
0.6%
층간소음 4
 
0.6%
Other values (486) 545
88.2%
2024-03-23T01:40:27.855632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
500
 
16.2%
50
 
1.6%
49
 
1.6%
) 43
 
1.4%
( 42
 
1.4%
e 39
 
1.3%
38
 
1.2%
37
 
1.2%
35
 
1.1%
35
 
1.1%
Other values (367) 2222
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1912
61.9%
Space Separator 500
 
16.2%
Uppercase Letter 277
 
9.0%
Lowercase Letter 259
 
8.4%
Close Punctuation 44
 
1.4%
Open Punctuation 43
 
1.4%
Dash Punctuation 22
 
0.7%
Other Punctuation 19
 
0.6%
Decimal Number 13
 
0.4%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
2.6%
49
 
2.6%
38
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
34
 
1.8%
33
 
1.7%
31
 
1.6%
31
 
1.6%
Other values (304) 1539
80.5%
Uppercase Letter
ValueCountFrequency (%)
S 26
 
9.4%
A 26
 
9.4%
P 23
 
8.3%
C 21
 
7.6%
D 18
 
6.5%
T 18
 
6.5%
E 17
 
6.1%
L 14
 
5.1%
R 14
 
5.1%
B 14
 
5.1%
Other values (14) 86
31.0%
Lowercase Letter
ValueCountFrequency (%)
e 39
15.1%
r 24
9.3%
t 22
 
8.5%
i 20
 
7.7%
l 19
 
7.3%
a 19
 
7.3%
o 17
 
6.6%
n 16
 
6.2%
c 12
 
4.6%
d 12
 
4.6%
Other values (12) 59
22.8%
Other Punctuation
ValueCountFrequency (%)
, 12
63.2%
. 4
 
21.1%
/ 1
 
5.3%
: 1
 
5.3%
· 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
0 5
38.5%
1 4
30.8%
2 2
 
15.4%
8 1
 
7.7%
3 1
 
7.7%
Close Punctuation
ValueCountFrequency (%)
) 43
97.7%
] 1
 
2.3%
Open Punctuation
ValueCountFrequency (%)
( 42
97.7%
[ 1
 
2.3%
Space Separator
ValueCountFrequency (%)
500
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1912
61.9%
Common 642
 
20.8%
Latin 535
 
17.3%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
2.6%
49
 
2.6%
38
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
34
 
1.8%
33
 
1.7%
31
 
1.6%
31
 
1.6%
Other values (304) 1539
80.5%
Latin
ValueCountFrequency (%)
e 39
 
7.3%
S 26
 
4.9%
A 26
 
4.9%
r 24
 
4.5%
P 23
 
4.3%
t 22
 
4.1%
C 21
 
3.9%
i 20
 
3.7%
l 19
 
3.6%
a 19
 
3.6%
Other values (35) 296
55.3%
Common
ValueCountFrequency (%)
500
77.9%
) 43
 
6.7%
( 42
 
6.5%
- 22
 
3.4%
, 12
 
1.9%
0 5
 
0.8%
. 4
 
0.6%
1 4
 
0.6%
2 2
 
0.3%
8 1
 
0.2%
Other values (7) 7
 
1.1%
Greek
ValueCountFrequency (%)
γ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1912
61.9%
ASCII 1176
38.1%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
500
42.5%
) 43
 
3.7%
( 42
 
3.6%
e 39
 
3.3%
S 26
 
2.2%
A 26
 
2.2%
r 24
 
2.0%
P 23
 
2.0%
t 22
 
1.9%
- 22
 
1.9%
Other values (51) 409
34.8%
Hangul
ValueCountFrequency (%)
50
 
2.6%
49
 
2.6%
38
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
34
 
1.8%
33
 
1.7%
31
 
1.6%
31
 
1.6%
Other values (304) 1539
80.5%
None
ValueCountFrequency (%)
· 1
50.0%
γ 1
50.0%

용도
Text

Distinct111
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-03-23T01:40:28.446216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length136
Median length52
Mean length23.616667
Min length2

Characters and Unicode

Total characters2834
Distinct characters360
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)85.8%

Sample

1st row건설현장(신축,철거)에서 사용되는 지지대, 잭서포트의 단점을 보완한 특허받은 잭서포트
2nd row실내건축의 구조·시공방법등에 관한 기준에 의거한 방문 손끼임방지장치(안전문, 안전손잡이)
3rd row무근(보호용 누름) 콘크리트 사인장균열 방지
4th row주차장 바닥재
5th row건물의 각종 문
ValueCountFrequency (%)
40
 
6.1%
도로 8
 
1.2%
보도교 8
 
1.2%
도로교 7
 
1.1%
인도교 7
 
1.1%
공법 6
 
0.9%
보강 6
 
0.9%
6
 
0.9%
가시설 6
 
0.9%
방수 5
 
0.8%
Other values (465) 556
84.9%
2024-03-23T01:40:29.539915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
548
 
19.3%
, 69
 
2.4%
65
 
2.3%
46
 
1.6%
44
 
1.6%
42
 
1.5%
41
 
1.4%
40
 
1.4%
40
 
1.4%
39
 
1.4%
Other values (350) 1860
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2072
73.1%
Space Separator 548
 
19.3%
Other Punctuation 81
 
2.9%
Uppercase Letter 43
 
1.5%
Lowercase Letter 33
 
1.2%
Close Punctuation 21
 
0.7%
Open Punctuation 21
 
0.7%
Decimal Number 12
 
0.4%
Math Symbol 2
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
3.1%
46
 
2.2%
44
 
2.1%
42
 
2.0%
41
 
2.0%
40
 
1.9%
40
 
1.9%
39
 
1.9%
39
 
1.9%
38
 
1.8%
Other values (300) 1638
79.1%
Uppercase Letter
ValueCountFrequency (%)
C 9
20.9%
T 5
11.6%
V 4
9.3%
D 3
 
7.0%
E 3
 
7.0%
L 3
 
7.0%
O 3
 
7.0%
B 2
 
4.7%
X 2
 
4.7%
N 2
 
4.7%
Other values (6) 7
16.3%
Lowercase Letter
ValueCountFrequency (%)
t 6
18.2%
c 5
15.2%
n 4
12.1%
e 3
9.1%
r 2
 
6.1%
o 2
 
6.1%
u 2
 
6.1%
g 2
 
6.1%
v 2
 
6.1%
l 1
 
3.0%
Other values (4) 4
12.1%
Other Punctuation
ValueCountFrequency (%)
, 69
85.2%
/ 4
 
4.9%
. 2
 
2.5%
& 2
 
2.5%
; 2
 
2.5%
% 1
 
1.2%
· 1
 
1.2%
Decimal Number
ValueCountFrequency (%)
0 4
33.3%
5 2
16.7%
1 2
16.7%
9 1
 
8.3%
3 1
 
8.3%
4 1
 
8.3%
2 1
 
8.3%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
548
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2072
73.1%
Common 686
 
24.2%
Latin 76
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
3.1%
46
 
2.2%
44
 
2.1%
42
 
2.0%
41
 
2.0%
40
 
1.9%
40
 
1.9%
39
 
1.9%
39
 
1.9%
38
 
1.8%
Other values (300) 1638
79.1%
Latin
ValueCountFrequency (%)
C 9
 
11.8%
t 6
 
7.9%
T 5
 
6.6%
c 5
 
6.6%
n 4
 
5.3%
V 4
 
5.3%
D 3
 
3.9%
E 3
 
3.9%
L 3
 
3.9%
O 3
 
3.9%
Other values (20) 31
40.8%
Common
ValueCountFrequency (%)
548
79.9%
, 69
 
10.1%
) 21
 
3.1%
( 21
 
3.1%
0 4
 
0.6%
/ 4
 
0.6%
. 2
 
0.3%
5 2
 
0.3%
& 2
 
0.3%
1 2
 
0.3%
Other values (10) 11
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2072
73.1%
ASCII 761
 
26.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
548
72.0%
, 69
 
9.1%
) 21
 
2.8%
( 21
 
2.8%
C 9
 
1.2%
t 6
 
0.8%
T 5
 
0.7%
c 5
 
0.7%
0 4
 
0.5%
n 4
 
0.5%
Other values (39) 69
 
9.1%
Hangul
ValueCountFrequency (%)
65
 
3.1%
46
 
2.2%
44
 
2.1%
42
 
2.0%
41
 
2.0%
40
 
1.9%
40
 
1.9%
39
 
1.9%
39
 
1.9%
38
 
1.8%
Other values (300) 1638
79.1%
None
ValueCountFrequency (%)
· 1
100.0%

업체명
Text

MISSING 

Distinct77
Distinct (%)68.8%
Missing8
Missing (%)6.7%
Memory size1.1 KiB
2024-03-23T01:40:30.058288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length7.9017857
Min length3

Characters and Unicode

Total characters885
Distinct characters165
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)50.9%

Sample

1st row주식회사세원
2nd row건재테크
3rd row(주)한국콘젝트시스템
4th row김민재
5th row아하방수텍(주)
ValueCountFrequency (%)
주식회사 9
 
7.0%
주)대영비앤비스 6
 
4.7%
주)보강테크 5
 
3.9%
주)아이오컨스텍 5
 
3.9%
주)스틸코리아 5
 
3.9%
건재테크 4
 
3.1%
주)지성이씨에스 3
 
2.3%
성연건설엔지니어링(주 2
 
1.6%
2
 
1.6%
한국엘단트산업(주 2
 
1.6%
Other values (73) 85
66.4%
2024-03-23T01:40:30.975315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
89
 
10.1%
( 70
 
7.9%
) 70
 
7.9%
37
 
4.2%
35
 
4.0%
19
 
2.1%
19
 
2.1%
19
 
2.1%
19
 
2.1%
16
 
1.8%
Other values (155) 492
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 719
81.2%
Open Punctuation 70
 
7.9%
Close Punctuation 70
 
7.9%
Space Separator 16
 
1.8%
Other Punctuation 4
 
0.5%
Uppercase Letter 4
 
0.5%
Other Symbol 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
12.4%
37
 
5.1%
35
 
4.9%
19
 
2.6%
19
 
2.6%
19
 
2.6%
19
 
2.6%
16
 
2.2%
15
 
2.1%
15
 
2.1%
Other values (146) 436
60.6%
Uppercase Letter
ValueCountFrequency (%)
T 1
25.0%
C 1
25.0%
I 1
25.0%
K 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 721
81.5%
Common 160
 
18.1%
Latin 4
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
12.3%
37
 
5.1%
35
 
4.9%
19
 
2.6%
19
 
2.6%
19
 
2.6%
19
 
2.6%
16
 
2.2%
15
 
2.1%
15
 
2.1%
Other values (147) 438
60.7%
Common
ValueCountFrequency (%)
( 70
43.8%
) 70
43.8%
16
 
10.0%
, 4
 
2.5%
Latin
ValueCountFrequency (%)
T 1
25.0%
C 1
25.0%
I 1
25.0%
K 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 719
81.2%
ASCII 164
 
18.5%
None 2
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
89
 
12.4%
37
 
5.1%
35
 
4.9%
19
 
2.6%
19
 
2.6%
19
 
2.6%
19
 
2.6%
16
 
2.2%
15
 
2.1%
15
 
2.1%
Other values (146) 436
60.6%
ASCII
ValueCountFrequency (%)
( 70
42.7%
) 70
42.7%
16
 
9.8%
, 4
 
2.4%
T 1
 
0.6%
C 1
 
0.6%
I 1
 
0.6%
K 1
 
0.6%
None
ValueCountFrequency (%)
2
100.0%

사업자등록번호
Real number (ℝ)

MISSING 

Distinct21
Distinct (%)67.7%
Missing89
Missing (%)74.2%
Infinite0
Infinite (%)0.0%
Mean3.7326165 × 109
Minimum1.0517977 × 109
Maximum8.2288006 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-03-23T01:40:31.368382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0517977 × 109
5-th percentile1.1995996 × 109
Q11.3781701 × 109
median2.6688012 × 109
Q35.1381746 × 109
95-th percentile8.1481001 × 109
Maximum8.2288006 × 109
Range7.177003 × 109
Interquartile range (IQR)3.7600045 × 109

Descriptive statistics

Standard deviation2.4859757 × 109
Coefficient of variation (CV)0.66601422
Kurtosis-0.98980429
Mean3.7326165 × 109
Median Absolute Deviation (MAD)1.4677789 × 109
Skewness0.59803065
Sum1.1571111 × 1011
Variance6.1800751 × 1018
MonotonicityNot monotonic
2024-03-23T01:40:31.807580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
5138170277 3
 
2.5%
8148100053 2
 
1.7%
2668801209 2
 
1.7%
1388702022 2
 
1.7%
1201022345 2
 
1.7%
4748801606 2
 
1.7%
7808700150 2
 
1.7%
1378170087 2
 
1.7%
3688801204 2
 
1.7%
5718101418 1
 
0.8%
Other values (11) 11
 
9.2%
(Missing) 89
74.2%
ValueCountFrequency (%)
1051797680 1
0.8%
1198176894 1
0.8%
1201022345 2
1.7%
1298634713 1
0.8%
1348169470 1
0.8%
1358126741 1
0.8%
1378170087 2
1.7%
1388702022 2
1.7%
2158770341 1
0.8%
2218144564 1
0.8%
ValueCountFrequency (%)
8228800636 1
 
0.8%
8148100053 2
1.7%
7808700150 2
1.7%
5948800861 1
 
0.8%
5718101418 1
 
0.8%
5138178871 1
 
0.8%
5138170277 3
2.5%
4748801606 2
1.7%
3688801204 2
1.7%
2668801209 2
1.7%
Distinct114
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-03-23T01:40:32.544402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length24
Mean length17.491667
Min length8

Characters and Unicode

Total characters2099
Distinct characters74
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)90.0%

Sample

1st row특허제품(10-0979817)
2nd row특허제품(10-1776156)
3rd row특허제품(제30-0924173호)
4th row신기술(LH인증 신기술 제2021-건축3호)
5th row특허제품(10-2572696)
ValueCountFrequency (%)
특허제품(특허 14
 
9.2%
특허제품(10-2006305 2
 
1.3%
제10-1627553호 2
 
1.3%
특허제품(제10-2355089호 2
 
1.3%
10-2114593호 2
 
1.3%
신기술(nep-motie-2017-020 2
 
1.3%
건설신기술(742 2
 
1.3%
2
 
1.3%
특허제품(10-1631476 1
 
0.7%
특허제품(제10-2176576호 1
 
0.7%
Other values (122) 122
80.3%
2024-03-23T01:40:33.623690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 208
 
9.9%
0 183
 
8.7%
143
 
6.8%
) 120
 
5.7%
( 120
 
5.7%
2 119
 
5.7%
- 114
 
5.4%
86
 
4.1%
86
 
4.1%
84
 
4.0%
Other values (64) 836
39.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 920
43.8%
Other Letter 721
34.3%
Close Punctuation 120
 
5.7%
Open Punctuation 120
 
5.7%
Dash Punctuation 114
 
5.4%
Uppercase Letter 71
 
3.4%
Space Separator 32
 
1.5%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
19.8%
86
11.9%
86
11.9%
84
11.7%
67
9.3%
49
 
6.8%
46
 
6.4%
38
 
5.3%
19
 
2.6%
18
 
2.5%
Other values (37) 85
11.8%
Uppercase Letter
ValueCountFrequency (%)
E 14
19.7%
T 10
14.1%
O 8
11.3%
I 8
11.3%
N 7
9.9%
M 7
9.9%
P 7
9.9%
G 3
 
4.2%
H 2
 
2.8%
A 2
 
2.8%
Other values (2) 3
 
4.2%
Decimal Number
ValueCountFrequency (%)
1 208
22.6%
0 183
19.9%
2 119
12.9%
9 68
 
7.4%
6 64
 
7.0%
5 61
 
6.6%
7 61
 
6.6%
8 57
 
6.2%
3 50
 
5.4%
4 49
 
5.3%
Close Punctuation
ValueCountFrequency (%)
) 120
100.0%
Open Punctuation
ValueCountFrequency (%)
( 120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 114
100.0%
Space Separator
ValueCountFrequency (%)
32
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1307
62.3%
Hangul 721
34.3%
Latin 71
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
143
19.8%
86
11.9%
86
11.9%
84
11.7%
67
9.3%
49
 
6.8%
46
 
6.4%
38
 
5.3%
19
 
2.6%
18
 
2.5%
Other values (37) 85
11.8%
Common
ValueCountFrequency (%)
1 208
15.9%
0 183
14.0%
) 120
9.2%
( 120
9.2%
2 119
9.1%
- 114
8.7%
9 68
 
5.2%
6 64
 
4.9%
5 61
 
4.7%
7 61
 
4.7%
Other values (5) 189
14.5%
Latin
ValueCountFrequency (%)
E 14
19.7%
T 10
14.1%
O 8
11.3%
I 8
11.3%
N 7
9.9%
M 7
9.9%
P 7
9.9%
G 3
 
4.2%
H 2
 
2.8%
A 2
 
2.8%
Other values (2) 3
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1378
65.7%
Hangul 721
34.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 208
15.1%
0 183
13.3%
) 120
8.7%
( 120
8.7%
2 119
8.6%
- 114
8.3%
9 68
 
4.9%
6 64
 
4.6%
5 61
 
4.4%
7 61
 
4.4%
Other values (17) 260
18.9%
Hangul
ValueCountFrequency (%)
143
19.8%
86
11.9%
86
11.9%
84
11.7%
67
9.3%
49
 
6.8%
46
 
6.4%
38
 
5.3%
19
 
2.6%
18
 
2.5%
Other values (37) 85
11.8%
Distinct107
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2002-04-02 00:00:00
Maximum2024-01-23 00:00:00
2024-03-23T01:40:34.029552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:40:34.496662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct111
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2018-11-13 00:00:00
Maximum2043-01-01 00:00:00
2024-03-23T01:40:34.934961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:40:35.407168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct88
Distinct (%)73.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2017-05-08 00:00:00
Maximum2024-01-23 00:00:00
2024-03-23T01:40:35.893283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T01:40:36.348888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-03-23T01:40:22.471866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T01:40:36.670655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공종적용분야업체명사업자등록번호등록일
공종1.0000.9820.9970.0000.996
적용분야0.9821.0000.9730.2000.904
업체명0.9970.9731.0000.8400.999
사업자등록번호0.0000.2000.8401.0000.833
등록일0.9960.9040.9990.8331.000
2024-03-23T01:40:37.091295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자등록번호적용분야
사업자등록번호1.0000.200
적용분야0.2001.000

Missing values

2024-03-23T01:40:22.905725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T01:40:23.421633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-23T01:40:24.013298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

공종적용분야기술명용도업체명사업자등록번호인증현황인증시작일인증종료일등록일
0건축(가설재, 가시설)자재특허받은 잭서포트, 센지주건설현장(신축,철거)에서 사용되는 지지대, 잭서포트의 단점을 보완한 특허받은 잭서포트주식회사세원<NA>특허제품(10-0979817)2008-04-172028-04-162020-12-22
1건축(목창호)자재손끼임방지 안전도어 및 안전손잡이실내건축의 구조·시공방법등에 관한 기준에 의거한 방문 손끼임방지장치(안전문, 안전손잡이)<NA><NA>특허제품(10-1776156)2017-03-102037-03-092022-02-14
2건축(무근 콘크리트)자재무근콘크리트(주차장, 옥상) 균열방지판무근(보호용 누름) 콘크리트 사인장균열 방지건재테크<NA>특허제품(제30-0924173호)2017-01-132037-01-132023-01-17
3건축(바닥재)자재2액형 에폭시 수지 조성물(무취 에폭시 논슬립 주차장 바닥재)주차장 바닥재(주)한국콘젝트시스템<NA>신기술(LH인증 신기술 제2021-건축3호)2021-06-282026-12-312021-07-21
4건축(방문)자재손끼임방지기능이 추가된 다기능 문세트건물의 각종 문김민재<NA>특허제품(10-2572696)2023-01-012043-01-012023-09-20
5건축(방수)공법공장 생산된 박막형 점착 복합 방수시트와 콘크리트간 재료적 일체성을 가지는 건식화 복합방수 시공기술(Dry Waterproof System)건축 및 토목구조물의 방수아하방수텍(주)1198176894건설신기술(742)2014-08-252019-08-242017-05-08
6건축(방수)공법수팽창성 차수기능을 갖는 연질형 폴리우레탄을이용한 전면밀착 비노출 복합방수공법방수고장렬<NA>건설신기술(946호)2022-11-072030-11-062023-01-11
7건축(방수)공법연질형 수지를 적용한 FRP도막재와 시트를 이용한 인공지반녹화용 방근·방수복합공법(SMART GREEN SYSTEM)건축 구조물 방수주식회사 삼성건업<NA>건설신기술(710호)2013-09-272027-09-262021-11-08
8건축(방수시트)공법WPS 복합 방수 공법방수(주)한국콘젝트시스템<NA>녹색기술(제 GT-18-00388호)2018-01-182024-01-172021-07-21
9건축(에너지절감)공법지열 신재생에너지 시스템건축물에 적용하는 지열 냉난방 시스템(신재생에너지)(주)지앤지테크놀러지<NA>건설신기술(제929호)2022-01-012030-01-012023-01-27
공종적용분야기술명용도업체명사업자등록번호인증현황인증시작일인증종료일등록일
110토목(우수배제시스템)공법표면 유출된 우수와 보도블럭 하부로 침투된 우수를 동시에 집수하는 모듈러 방식의 집수정 기술도로 및 인도외측에 위치하여 우수에 의해 발생하는 침투와 표면유출을 배제하는 도시우수배제시스템<NA><NA>녹색제품(2014-142)2018-11-162021-11-152021-11-04
111토목(지반굴착)공법우드펠렛을 이용한 발파방법암발파<NA><NA>기타(제10-2225754)2021-03-042040-03-042021-08-09
112토목(콘크리트거더)공법강섬유와 철근집합체를 병용한 프리스트레스트 초고강도 콘크리트(UPC) I형 거더 제작 및 시공법보도교, 도로교, 라멘교, 아치교주식회사씨알디<NA>건설신기술(제884호)2020-03-132028-03-122020-12-30
113토목(콘크리트거더)공법단부구속을 통한 초고강도 콘크리트 라멘교 및 이의 시공방법라멘교주식회사씨알디<NA>특허제품(제10-1783326호)2017-03-232037-03-232020-12-30
114토목(터널)공법동시자동주입공법(SAG공법)터널보조공법 강관다단그라우팅(주)보강테크<NA>특허제품(10-0918681)2008-11-112028-11-112022-06-24
115토목(터널)공법입체교차 비개착공법(UPRS공법)입체교차(도로,철도 하부 통과) 비개착 UPRS공법(주)피디티건설, 윤인병<NA>신기술(특허 제10-1865858호, 10-0825074호)2018-06-012037-10-172022-05-09
116토목(토지보상 현장 서비스)공법AI 활용 지장물 조사 서비스토지 개발을 위한 타당성 검토 및 지장물 조사 단계를 기존보다 시간 및 비용 단축주식회사 업데이터5138170277기타(19년도 데이터 플래그십 선정 사업)2020-08-282021-08-282020-08-28
117토목(토질및기초)공법지반앵커 상대변위 측정장치 및 그 시공기술(STK지반앵커공법)사면 관리 및 보강주식회사 쏘일텍코리아<NA>건설신기술(제842호)2018-07-202026-07-192022-07-20
118토목(포장공사)자재무기질 바인더를 이용하여 투수성능과 강도를 증대시킨 무기질 투수콘크리트투수성 포장재이디씨라이프(주)<NA>방재신기술(제2018-9호)2018-09-112023-09-102020-04-28
119토목(표면보수, 단면복구)자재보수보강토목구조물의 보수보강㈜엔세라텍<NA>특허제품(제0931016호)2008-11-172028-11-172023-10-19