Overview

Dataset statistics

Number of variables11
Number of observations880
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory78.3 KiB
Average record size in memory91.2 B

Variable types

Numeric2
Categorical4
Text5

Dataset

Description국토교통분야의 사업화지원사업의 수행/참여기업명 , 과제명, 기관 유형, 사업자번호, 법인번호 등 참여기업 현황에 대한 정보를 제공합니다.
Author국토교통과학기술진흥원
URLhttps://www.data.go.kr/data/15088157/fileData.do

Alerts

번호 is highly overall correlated with 연도High correlation
연도 is highly overall correlated with 번호High correlation
체계 is highly overall correlated with 기관유형High correlation
기관유형 is highly overall correlated with 체계High correlation
기관유형 is highly imbalanced (55.5%)Imbalance
번호 has unique valuesUnique

Reproduction

Analysis started2024-04-17 19:05:11.898926
Analysis finished2024-04-17 19:05:13.230274
Duration1.33 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct880
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean440.5
Minimum1
Maximum880
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2024-04-18T04:05:13.284090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile44.95
Q1220.75
median440.5
Q3660.25
95-th percentile836.05
Maximum880
Range879
Interquartile range (IQR)439.5

Descriptive statistics

Standard deviation254.17842
Coefficient of variation (CV)0.57702251
Kurtosis-1.2
Mean440.5
Median Absolute Deviation (MAD)220
Skewness0
Sum387640
Variance64606.667
MonotonicityStrictly increasing
2024-04-18T04:05:13.386106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
593 1
 
0.1%
582 1
 
0.1%
583 1
 
0.1%
584 1
 
0.1%
585 1
 
0.1%
586 1
 
0.1%
587 1
 
0.1%
588 1
 
0.1%
589 1
 
0.1%
Other values (870) 870
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
880 1
0.1%
879 1
0.1%
878 1
0.1%
877 1
0.1%
876 1
0.1%
875 1
0.1%
874 1
0.1%
873 1
0.1%
872 1
0.1%
871 1
0.1%

연도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2021
453 
2022
427 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 453
51.5%
2022 427
48.5%

Length

2024-04-18T04:05:13.483000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:05:13.556184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 453
51.5%
2022 427
48.5%

체계
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
주관(구.협동)
444 
공동
429 
위탁
 
7

Length

Max length8
Median length8
Mean length5.0272727
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주관(구.협동)
2nd row주관(구.협동)
3rd row위탁
4th row공동
5th row공동

Common Values

ValueCountFrequency (%)
주관(구.협동) 444
50.5%
공동 429
48.8%
위탁 7
 
0.8%

Length

2024-04-18T04:05:13.637126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:05:13.721612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주관(구.협동 444
50.5%
공동 429
48.8%
위탁 7
 
0.8%
Distinct182
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2024-04-18T04:05:13.945709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length59
Mean length39.884091
Min length13

Characters and Unicode

Total characters35098
Distinct characters455
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.6%

Sample

1st row영구자석에 의한 누설자속 최소화 기술 적용을 통한 도시철도용 견인 전동기 및 추진제어시스템 기술개발
2nd row영구자석에 의한 누설자속 최소화 기술 적용을 통한 도시철도용 견인 전동기 및 추진제어시스템 기술개발
3rd rowIPM 전동기 및 추진제어시스템 현차시험
4th row도시철도차량 견인용 IPM 전동기 제작
5th row도시철도차량 견인용 IPM 전동기 제작
ValueCountFrequency (%)
개발 545
 
6.5%
472
 
5.7%
위한 164
 
2.0%
시스템 141
 
1.7%
기술 129
 
1.6%
기반 78
 
0.9%
스마트 77
 
0.9%
이용한 72
 
0.9%
중소기업 62
 
0.7%
사업화 56
 
0.7%
Other values (872) 6525
78.4%
2024-04-18T04:05:14.345815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7491
 
21.3%
862
 
2.5%
669
 
1.9%
635
 
1.8%
635
 
1.8%
582
 
1.7%
522
 
1.5%
515
 
1.5%
472
 
1.3%
437
 
1.2%
Other values (445) 22278
63.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24450
69.7%
Space Separator 7491
 
21.3%
Uppercase Letter 1355
 
3.9%
Lowercase Letter 614
 
1.7%
Decimal Number 464
 
1.3%
Other Punctuation 222
 
0.6%
Close Punctuation 215
 
0.6%
Open Punctuation 215
 
0.6%
Dash Punctuation 48
 
0.1%
Math Symbol 12
 
< 0.1%
Other values (2) 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
862
 
3.5%
669
 
2.7%
635
 
2.6%
635
 
2.6%
582
 
2.4%
522
 
2.1%
515
 
2.1%
472
 
1.9%
437
 
1.8%
410
 
1.7%
Other values (376) 18711
76.5%
Lowercase Letter
ValueCountFrequency (%)
e 72
11.7%
o 62
 
10.1%
a 56
 
9.1%
t 49
 
8.0%
k 46
 
7.5%
m 46
 
7.5%
l 36
 
5.9%
i 30
 
4.9%
r 30
 
4.9%
s 27
 
4.4%
Other values (13) 160
26.1%
Uppercase Letter
ValueCountFrequency (%)
A 152
11.2%
I 139
 
10.3%
S 130
 
9.6%
C 126
 
9.3%
T 100
 
7.4%
M 96
 
7.1%
P 73
 
5.4%
B 64
 
4.7%
O 60
 
4.4%
D 56
 
4.1%
Other values (12) 359
26.5%
Decimal Number
ValueCountFrequency (%)
0 157
33.8%
3 88
19.0%
2 70
15.1%
1 44
 
9.5%
4 32
 
6.9%
7 24
 
5.2%
5 22
 
4.7%
8 19
 
4.1%
6 8
 
1.7%
Other Punctuation
ValueCountFrequency (%)
, 82
36.9%
/ 66
29.7%
· 30
 
13.5%
% 20
 
9.0%
; 10
 
4.5%
& 10
 
4.5%
. 4
 
1.8%
Other Symbol
ValueCountFrequency (%)
4
50.0%
4
50.0%
Space Separator
ValueCountFrequency (%)
7491
100.0%
Close Punctuation
ValueCountFrequency (%)
) 215
100.0%
Open Punctuation
ValueCountFrequency (%)
( 215
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24442
69.6%
Common 8679
 
24.7%
Latin 1969
 
5.6%
Han 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
862
 
3.5%
669
 
2.7%
635
 
2.6%
635
 
2.6%
582
 
2.4%
522
 
2.1%
515
 
2.1%
472
 
1.9%
437
 
1.8%
410
 
1.7%
Other values (375) 18703
76.5%
Latin
ValueCountFrequency (%)
A 152
 
7.7%
I 139
 
7.1%
S 130
 
6.6%
C 126
 
6.4%
T 100
 
5.1%
M 96
 
4.9%
P 73
 
3.7%
e 72
 
3.7%
B 64
 
3.3%
o 62
 
3.1%
Other values (35) 955
48.5%
Common
ValueCountFrequency (%)
7491
86.3%
) 215
 
2.5%
( 215
 
2.5%
0 157
 
1.8%
3 88
 
1.0%
, 82
 
0.9%
2 70
 
0.8%
/ 66
 
0.8%
- 48
 
0.6%
1 44
 
0.5%
Other values (14) 203
 
2.3%
Han
ValueCountFrequency (%)
8
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24434
69.6%
ASCII 10606
30.2%
None 30
 
0.1%
CJK 8
 
< 0.1%
Compat Jamo 8
 
< 0.1%
CJK Compat 4
 
< 0.1%
Letterlike Symbols 4
 
< 0.1%
Punctuation 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7491
70.6%
) 215
 
2.0%
( 215
 
2.0%
0 157
 
1.5%
A 152
 
1.4%
I 139
 
1.3%
S 130
 
1.2%
C 126
 
1.2%
T 100
 
0.9%
M 96
 
0.9%
Other values (55) 1785
 
16.8%
Hangul
ValueCountFrequency (%)
862
 
3.5%
669
 
2.7%
635
 
2.6%
635
 
2.6%
582
 
2.4%
522
 
2.1%
515
 
2.1%
472
 
1.9%
437
 
1.8%
410
 
1.7%
Other values (374) 18695
76.5%
None
ValueCountFrequency (%)
· 30
100.0%
CJK
ValueCountFrequency (%)
8
100.0%
Compat Jamo
ValueCountFrequency (%)
8
100.0%
CJK Compat
ValueCountFrequency (%)
4
100.0%
Letterlike Symbols
ValueCountFrequency (%)
4
100.0%
Punctuation
ValueCountFrequency (%)
4
100.0%
Distinct256
Distinct (%)29.1%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2024-04-18T04:05:14.622798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length3
Mean length3.0784091
Min length2

Characters and Unicode

Total characters2709
Distinct characters142
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)0.9%

Sample

1st row이창희
2nd row이창희
3rd row나정민
4th row하창용
5th row하창용
ValueCountFrequency (%)
김형규 10
 
1.1%
이태영 10
 
1.1%
성범규 8
 
0.9%
서창원 8
 
0.9%
박상준 8
 
0.9%
황윤태 8
 
0.9%
서석구 8
 
0.9%
김문재 6
 
0.7%
윤창연 4
 
0.4%
한진규 4
 
0.4%
Other values (249) 818
91.7%
2024-04-18T04:05:14.994338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
210
 
7.8%
118
 
4.4%
101
 
3.7%
90
 
3.3%
81
 
3.0%
72
 
2.7%
59
 
2.2%
55
 
2.0%
54
 
2.0%
52
 
1.9%
Other values (132) 1817
67.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2617
96.6%
Lowercase Letter 80
 
3.0%
Space Separator 12
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
210
 
8.0%
118
 
4.5%
101
 
3.9%
90
 
3.4%
81
 
3.1%
72
 
2.8%
59
 
2.3%
55
 
2.1%
54
 
2.1%
52
 
2.0%
Other values (122) 1725
65.9%
Lowercase Letter
ValueCountFrequency (%)
o 24
30.0%
h 12
15.0%
k 12
15.0%
n 8
 
10.0%
p 8
 
10.0%
v 4
 
5.0%
s 4
 
5.0%
r 4
 
5.0%
i 4
 
5.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2617
96.6%
Latin 80
 
3.0%
Common 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
210
 
8.0%
118
 
4.5%
101
 
3.9%
90
 
3.4%
81
 
3.1%
72
 
2.8%
59
 
2.3%
55
 
2.1%
54
 
2.1%
52
 
2.0%
Other values (122) 1725
65.9%
Latin
ValueCountFrequency (%)
o 24
30.0%
h 12
15.0%
k 12
15.0%
n 8
 
10.0%
p 8
 
10.0%
v 4
 
5.0%
s 4
 
5.0%
r 4
 
5.0%
i 4
 
5.0%
Common
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2617
96.6%
ASCII 92
 
3.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
210
 
8.0%
118
 
4.5%
101
 
3.9%
90
 
3.4%
81
 
3.1%
72
 
2.8%
59
 
2.3%
55
 
2.1%
54
 
2.1%
52
 
2.0%
Other values (122) 1725
65.9%
ASCII
ValueCountFrequency (%)
o 24
26.1%
12
13.0%
h 12
13.0%
k 12
13.0%
n 8
 
8.7%
p 8
 
8.7%
v 4
 
4.3%
s 4
 
4.3%
r 4
 
4.3%
i 4
 
4.3%

참여구분
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
수행기관
511 
참여기관
369 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수행기관
2nd row참여기관
3rd row수행기관
4th row수행기관
5th row참여기관

Common Values

ValueCountFrequency (%)
수행기관 511
58.1%
참여기관 369
41.9%

Length

2024-04-18T04:05:15.113895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:05:15.203350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수행기관 511
58.1%
참여기관 369
41.9%
Distinct226
Distinct (%)25.7%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2024-04-18T04:05:15.369418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length8.8818182
Min length4

Characters and Unicode

Total characters7816
Distinct characters256
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)0.5%

Sample

1st row(주)다원시스
2nd row(주)다원시스
3rd row광주광역시도시철도공사
4th row(주)로텍
5th row(주)로텍
ValueCountFrequency (%)
주식회사 148
 
13.7%
28
 
2.6%
한국건설기술연구원 22
 
2.0%
산학협력단 16
 
1.5%
한국철도공사 14
 
1.3%
한국철도기술연구원 13
 
1.2%
재)한국건설생활환경시험연구원오창 10
 
0.9%
주)투트랙 8
 
0.7%
주)서린브릿지텍 8
 
0.7%
세이프웍스 8
 
0.7%
Other values (222) 805
74.5%
2024-04-18T04:05:15.680383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
718
 
9.2%
) 536
 
6.9%
( 536
 
6.9%
290
 
3.7%
223
 
2.9%
222
 
2.8%
200
 
2.6%
189
 
2.4%
180
 
2.3%
152
 
1.9%
Other values (246) 4570
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6536
83.6%
Close Punctuation 536
 
6.9%
Open Punctuation 536
 
6.9%
Space Separator 200
 
2.6%
Uppercase Letter 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
718
 
11.0%
290
 
4.4%
223
 
3.4%
222
 
3.4%
189
 
2.9%
180
 
2.8%
152
 
2.3%
141
 
2.2%
133
 
2.0%
108
 
1.7%
Other values (241) 4180
64.0%
Uppercase Letter
ValueCountFrequency (%)
G 4
50.0%
S 4
50.0%
Close Punctuation
ValueCountFrequency (%)
) 536
100.0%
Open Punctuation
ValueCountFrequency (%)
( 536
100.0%
Space Separator
ValueCountFrequency (%)
200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6536
83.6%
Common 1272
 
16.3%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
718
 
11.0%
290
 
4.4%
223
 
3.4%
222
 
3.4%
189
 
2.9%
180
 
2.8%
152
 
2.3%
141
 
2.2%
133
 
2.0%
108
 
1.7%
Other values (241) 4180
64.0%
Common
ValueCountFrequency (%)
) 536
42.1%
( 536
42.1%
200
 
15.7%
Latin
ValueCountFrequency (%)
G 4
50.0%
S 4
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6536
83.6%
ASCII 1280
 
16.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
718
 
11.0%
290
 
4.4%
223
 
3.4%
222
 
3.4%
189
 
2.9%
180
 
2.8%
152
 
2.3%
141
 
2.2%
133
 
2.0%
108
 
1.7%
Other values (241) 4180
64.0%
ASCII
ValueCountFrequency (%)
) 536
41.9%
( 536
41.9%
200
 
15.6%
G 4
 
0.3%
S 4
 
0.3%
Distinct221
Distinct (%)25.1%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2024-04-18T04:05:15.989496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.3272727
Min length2

Characters and Unicode

Total characters2928
Distinct characters150
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)0.5%

Sample

1st row박선순
2nd row박선순
3rd row윤진보
4th row김태훈
5th row김태훈
ValueCountFrequency (%)
1명 32
 
3.4%
김병석 22
 
2.3%
나희승 14
 
1.5%
데이터 14
 
1.5%
미집계 14
 
1.5%
한석윤 13
 
1.4%
조영태 10
 
1.1%
김영진 8
 
0.8%
황선유 8
 
0.8%
안나리 8
 
0.8%
Other values (218) 803
84.9%
2024-04-18T04:05:16.376741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
211
 
7.2%
141
 
4.8%
100
 
3.4%
88
 
3.0%
82
 
2.8%
70
 
2.4%
69
 
2.4%
53
 
1.8%
53
 
1.8%
53
 
1.8%
Other values (140) 2008
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2794
95.4%
Space Separator 82
 
2.8%
Decimal Number 36
 
1.2%
Other Punctuation 16
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
211
 
7.6%
141
 
5.0%
100
 
3.6%
88
 
3.1%
70
 
2.5%
69
 
2.5%
53
 
1.9%
53
 
1.9%
53
 
1.9%
52
 
1.9%
Other values (136) 1904
68.1%
Decimal Number
ValueCountFrequency (%)
1 32
88.9%
2 4
 
11.1%
Space Separator
ValueCountFrequency (%)
82
100.0%
Other Punctuation
ValueCountFrequency (%)
, 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2794
95.4%
Common 134
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
211
 
7.6%
141
 
5.0%
100
 
3.6%
88
 
3.1%
70
 
2.5%
69
 
2.5%
53
 
1.9%
53
 
1.9%
53
 
1.9%
52
 
1.9%
Other values (136) 1904
68.1%
Common
ValueCountFrequency (%)
82
61.2%
1 32
 
23.9%
, 16
 
11.9%
2 4
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2794
95.4%
ASCII 134
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
211
 
7.6%
141
 
5.0%
100
 
3.6%
88
 
3.1%
70
 
2.5%
69
 
2.5%
53
 
1.9%
53
 
1.9%
53
 
1.9%
52
 
1.9%
Other values (136) 1904
68.1%
ASCII
ValueCountFrequency (%)
82
61.2%
1 32
 
23.9%
, 16
 
11.9%
2 4
 
3.0%

기관유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
중소기업(연)
662 
대학교
76 
정부출연(연)
 
50
대기업(연)
 
32
중견기업(연)
 
32
Other values (4)
 
28

Length

Max length15
Median length7
Mean length6.6840909
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중소기업(연)
2nd row중소기업(연)
3rd row지자체
4th row중소기업(연)
5th row중소기업(연)

Common Values

ValueCountFrequency (%)
중소기업(연) 662
75.2%
대학교 76
 
8.6%
정부출연(연) 50
 
5.7%
대기업(연) 32
 
3.6%
중견기업(연) 32
 
3.6%
준정부기관(비영리기관)(연) 15
 
1.7%
기타 5
 
0.6%
협회 5
 
0.6%
지자체 3
 
0.3%

Length

2024-04-18T04:05:16.486139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:05:16.578698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중소기업(연 662
75.2%
대학교 76
 
8.6%
정부출연(연 50
 
5.7%
대기업(연 32
 
3.6%
중견기업(연 32
 
3.6%
준정부기관(비영리기관)(연 15
 
1.7%
기타 5
 
0.6%
협회 5
 
0.6%
지자체 3
 
0.3%

사업자번호
Real number (ℝ)

Distinct226
Distinct (%)25.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2212688 × 109
Minimum1.0181998 × 109
Maximum8.9386009 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2024-04-18T04:05:16.689918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0181998 × 109
5-th percentile1.0877339 × 109
Q11.378188 × 109
median2.2982011 × 109
Q34.5088012 × 109
95-th percentile7.1287008 × 109
Maximum8.9386009 × 109
Range7.920401 × 109
Interquartile range (IQR)3.1306132 × 109

Descriptive statistics

Standard deviation2.0826492 × 109
Coefficient of variation (CV)0.64653072
Kurtosis-0.072086493
Mean3.2212688 × 109
Median Absolute Deviation (MAD)1.0600411 × 109
Skewness0.96697562
Sum2.8347166 × 1012
Variance4.3374279 × 1018
MonotonicityNot monotonic
2024-04-18T04:05:16.829535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2298201135 22
 
2.5%
3148210024 14
 
1.6%
1388202013 13
 
1.5%
3178203895 10
 
1.1%
6208700713 8
 
0.9%
2838600730 8
 
0.9%
4058113107 8
 
0.9%
4108706712 8
 
0.9%
1148116377 8
 
0.9%
1148657882 8
 
0.9%
Other values (216) 773
87.8%
ValueCountFrequency (%)
1018199824 4
0.5%
1018212009 2
0.2%
1018684739 4
0.5%
1028131754 4
0.5%
1028138330 4
0.5%
1058213617 4
0.5%
1058779517 4
0.5%
1058799963 4
0.5%
1068212299 2
0.2%
1068632726 4
0.5%
ValueCountFrequency (%)
8938600861 4
0.5%
8888601631 4
0.5%
8758801702 4
0.5%
8578801683 4
0.5%
8578600074 4
0.5%
8518701709 2
0.2%
8468801441 4
0.5%
8198101232 4
0.5%
8108801404 4
0.5%
7538700901 4
0.5%
Distinct89
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
2024-04-18T04:05:17.011625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.840909
Min length7

Characters and Unicode

Total characters12180
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)0.3%

Sample

1st row1350110000000
2nd row1350110000000
3rd row2041710000000
4th row1942110000000
5th row1942110000000
ValueCountFrequency (%)
1101110000000 164
18.2%
1101120000000 112
 
12.4%
1311110000000 50
 
5.6%
1341110000000 34
 
3.8%
1601110000000 31
 
3.4%
1801110000000 24
 
2.7%
1112410000000 22
 
2.4%
2001110000000 20
 
2.2%
1201110000000 20
 
2.2%
데이터 20
 
2.2%
Other values (80) 403
44.8%
2024-04-18T04:05:17.294436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6533
53.6%
1 3192
26.2%
880
 
7.2%
2 398
 
3.3%
4 258
 
2.1%
3 217
 
1.8%
7 175
 
1.4%
5 159
 
1.3%
6 113
 
0.9%
8 110
 
0.9%
Other values (7) 145
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11180
91.8%
Space Separator 880
 
7.2%
Other Letter 120
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6533
58.4%
1 3192
28.6%
2 398
 
3.6%
4 258
 
2.3%
3 217
 
1.9%
7 175
 
1.6%
5 159
 
1.4%
6 113
 
1.0%
8 110
 
1.0%
9 25
 
0.2%
Other Letter
ValueCountFrequency (%)
20
16.7%
20
16.7%
20
16.7%
20
16.7%
20
16.7%
20
16.7%
Space Separator
ValueCountFrequency (%)
880
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12060
99.0%
Hangul 120
 
1.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6533
54.2%
1 3192
26.5%
880
 
7.3%
2 398
 
3.3%
4 258
 
2.1%
3 217
 
1.8%
7 175
 
1.5%
5 159
 
1.3%
6 113
 
0.9%
8 110
 
0.9%
Hangul
ValueCountFrequency (%)
20
16.7%
20
16.7%
20
16.7%
20
16.7%
20
16.7%
20
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12060
99.0%
Hangul 120
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6533
54.2%
1 3192
26.5%
880
 
7.3%
2 398
 
3.3%
4 258
 
2.1%
3 217
 
1.8%
7 175
 
1.5%
5 159
 
1.3%
6 113
 
0.9%
8 110
 
0.9%
Hangul
ValueCountFrequency (%)
20
16.7%
20
16.7%
20
16.7%
20
16.7%
20
16.7%
20
16.7%

Interactions

2024-04-18T04:05:12.887187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:05:12.750148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:05:12.959417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:05:12.810969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-18T04:05:17.373232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호연도체계참여구분기관유형사업자번호법인번호
번호1.0000.9990.2050.0000.0390.2600.572
연도0.9991.0000.0000.0000.0000.0000.000
체계0.2050.0001.0000.1020.8890.2470.878
참여구분0.0000.0000.1021.0000.3460.0000.199
기관유형0.0390.0000.8890.3461.0000.3360.986
사업자번호0.2600.0000.2470.0000.3361.0000.921
법인번호0.5720.0000.8780.1990.9860.9211.000
2024-04-18T04:05:17.452662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체계기관유형연도참여구분
체계1.0000.6180.0000.170
기관유형0.6181.0000.0000.345
연도0.0000.0001.0000.000
참여구분0.1700.3450.0001.000
2024-04-18T04:05:17.524821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호사업자번호연도체계참여구분기관유형
번호1.0000.0180.9700.1240.0000.017
사업자번호0.0181.0000.0000.1500.0000.159
연도0.9700.0001.0000.0000.0000.000
체계0.1240.1500.0001.0000.1700.618
참여구분0.0000.0000.0000.1701.0000.345
기관유형0.0170.1590.0000.6180.3451.000

Missing values

2024-04-18T04:05:13.068435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T04:05:13.185687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호연도체계과제명연구책임자참여구분수행-참여기관명대표자명기관유형사업자번호법인번호
012021주관(구.협동)영구자석에 의한 누설자속 최소화 기술 적용을 통한 도시철도용 견인 전동기 및 추진제어시스템 기술개발이창희수행기관(주)다원시스박선순중소기업(연)13481299351350110000000
122021주관(구.협동)영구자석에 의한 누설자속 최소화 기술 적용을 통한 도시철도용 견인 전동기 및 추진제어시스템 기술개발이창희참여기관(주)다원시스박선순중소기업(연)13481299351350110000000
232021위탁IPM 전동기 및 추진제어시스템 현차시험나정민수행기관광주광역시도시철도공사윤진보지자체41082126742041710000000
342021공동도시철도차량 견인용 IPM 전동기 제작하창용수행기관(주)로텍김태훈중소기업(연)60981484211942110000000
452021공동도시철도차량 견인용 IPM 전동기 제작하창용참여기관(주)로텍김태훈중소기업(연)60981484211942110000000
562021공동도시철도차량 견인용 IPM 전동기 설계이주수행기관한양대학교산학협력단하성규대학교20682073061111710000000
672021공동IPM 전동기 제어 알고리즘 개발김원희수행기관중앙대학교산학협력단고중혁대학교10882059791150710000000
782021공동IPM 전동기 시험평가 및 신제품 인증김승주수행기관(재단)한국기계전기전자시험연구원제대식기타12382140981341220000000
892021주관(구.협동)재난 대응형 센서 및 통신 구축 기술 개발김영민수행기관(주)이에스피김영민중소기업(연)13486764281314110000000
9102021주관(구.협동)재난 대응형 센서 및 통신 구축 기술 개발김영민참여기관(주)이에스피김영민중소기업(연)13486764281314110000000
번호연도체계과제명연구책임자참여구분수행-참여기관명대표자명기관유형사업자번호법인번호
8708712022주관(구.협동)이동통신망을 이용한 오염지도 기반 환경예측시스템 개발이상엽참여기관주식회사 알앤에스랩이상엽중소기업(연)13586423341358110000000
8718722022공동이동통신망을 이용한 오염지도 기반 환경예측시스템 개발김성신수행기관부산대학교 산학협력단최경민대학교62182065301847710000000
8728732022주관(구.협동)국도형 스마트신호운영 관리시스템 개발이기웅수행기관주식회사 동부아이씨티정진욱중소기업(연)24088000571311110000000
8738742022주관(구.협동)국도형 스마트신호운영 관리시스템 개발이기웅참여기관주식회사 동부아이씨티정진욱중소기업(연)24088000571311110000000
8748752022공동국도형 스마트신호운영 관리시스템 개발고광용수행기관도로교통공단이주민기타20382320861101710000000
8758762022공동국도형 스마트신호운영 관리시스템 개발남두희수행기관한성대학교산학협력단이관우대학교20982083191144710000000
8768772022주관(구.협동)중소기업 보유기술 사업화(스타트업)최관영수행기관(주)퀀텀바이오닉스최관영중소기업(연)37781015861341110000000
8778782022주관(구.협동)중소기업 보유기술 사업화(스타트업)최관영참여기관(주)퀀텀바이오닉스최관영중소기업(연)37781015861341110000000
8788792022주관(구.협동)(공공 공사 연계) 노측용 가드레일 개선 기술 개발이승렬수행기관(주)디앤에스테크놀로지이원우중소기업(연)11081246331101110000000
8798802022주관(구.협동)(공공 공사 연계) 노측용 가드레일 개선 기술 개발이승렬참여기관(주)디앤에스테크놀로지이원우중소기업(연)11081246331101110000000