Overview

Dataset statistics

Number of variables5
Number of observations89
Missing cells12
Missing cells (%)2.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory42.5 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description* 중소기업수출지원센터 개방자료* 해외규격 인증획득 지원사업 컨설턴트사 목록 정보입니다* 구성(사업자번호, 기관명, 연락처, 홈페이지, 지역)
Author중소벤처기업부
URLhttps://www.data.go.kr/data/15042505/fileData.do

Alerts

홈페이지 has 12 (13.5%) missing valuesMissing
사업자번호 has unique valuesUnique
기관명 has unique valuesUnique
연락처 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:09:46.451867
Analysis finished2024-04-21 01:09:48.432350
Duration1.98 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업자번호
Real number (ℝ)

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0068871 × 109
Minimum1.0403092 × 109
Maximum8.8186015 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2024-04-21T10:09:48.517146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0403092 × 109
5-th percentile1.0587766 × 109
Q11.2681454 × 109
median2.1188061 × 109
Q34.5187002 × 109
95-th percentile7.6724007 × 109
Maximum8.8186015 × 109
Range7.7782923 × 109
Interquartile range (IQR)3.2505548 × 109

Descriptive statistics

Standard deviation2.2876664 × 109
Coefficient of variation (CV)0.76080888
Kurtosis-0.11251862
Mean3.0068871 × 109
Median Absolute Deviation (MAD)9.5979647 × 108
Skewness1.096121
Sum2.6761296 × 1011
Variance5.2334177 × 1018
MonotonicityNot monotonic
2024-04-21T10:09:48.652392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1388132811 1
 
1.1%
5161300335 1
 
1.1%
2178126965 1
 
1.1%
1808700874 1
 
1.1%
8818601514 1
 
1.1%
4538601207 1
 
1.1%
2200148298 1
 
1.1%
2140912336 1
 
1.1%
2148159754 1
 
1.1%
1358128088 1
 
1.1%
Other values (79) 79
88.8%
ValueCountFrequency (%)
1040309250 1
1.1%
1048128783 1
1.1%
1048613679 1
1.1%
1058743019 1
1.1%
1058754103 1
1.1%
1058810306 1
1.1%
1068159002 1
1.1%
1071375272 1
1.1%
1078613213 1
1.1%
1132421802 1
1.1%
ValueCountFrequency (%)
8818601514 1
1.1%
8550100988 1
1.1%
8338701088 1
1.1%
8228100729 1
1.1%
7808600129 1
1.1%
7468101666 1
1.1%
7258100447 1
1.1%
7058600531 1
1.1%
6738601908 1
1.1%
6738101621 1
1.1%

기관명
Text

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size844.0 B
2024-04-21T10:09:48.852537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length9.011236
Min length3

Characters and Unicode

Total characters802
Distinct characters178
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)100.0%

Sample

1st row(주)엔트리연구원
2nd row주식회사 켐론에프디에이코리아
3rd row세이프월드엔지니어링
4th row글로벌표준인증원
5th row관세법인스카이브릿지
ValueCountFrequency (%)
주식회사 16
 
14.7%
korea 2
 
1.8%
주)에이치시티 1
 
0.9%
알란코리아 1
 
0.9%
하우스부띠끄 1
 
0.9%
온빅스 1
 
0.9%
하이큐컨설팅 1
 
0.9%
세창경영인증원 1
 
0.9%
주)원택 1
 
0.9%
주)씨티케이 1
 
0.9%
Other values (83) 83
76.1%
2024-04-21T10:09:49.180474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59
 
7.4%
( 45
 
5.6%
) 45
 
5.6%
29
 
3.6%
28
 
3.5%
22
 
2.7%
20
 
2.5%
19
 
2.4%
19
 
2.4%
18
 
2.2%
Other values (168) 498
62.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 659
82.2%
Open Punctuation 45
 
5.6%
Close Punctuation 45
 
5.6%
Uppercase Letter 26
 
3.2%
Space Separator 20
 
2.5%
Lowercase Letter 4
 
0.5%
Other Symbol 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
9.0%
29
 
4.4%
28
 
4.2%
22
 
3.3%
19
 
2.9%
19
 
2.9%
18
 
2.7%
18
 
2.7%
17
 
2.6%
17
 
2.6%
Other values (145) 413
62.7%
Uppercase Letter
ValueCountFrequency (%)
C 5
19.2%
I 4
15.4%
K 3
11.5%
T 2
 
7.7%
M 2
 
7.7%
F 1
 
3.8%
A 1
 
3.8%
E 1
 
3.8%
R 1
 
3.8%
O 1
 
3.8%
Other values (5) 5
19.2%
Lowercase Letter
ValueCountFrequency (%)
a 1
25.0%
e 1
25.0%
r 1
25.0%
o 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 662
82.5%
Common 110
 
13.7%
Latin 30
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
8.9%
29
 
4.4%
28
 
4.2%
22
 
3.3%
19
 
2.9%
19
 
2.9%
18
 
2.7%
18
 
2.7%
17
 
2.6%
17
 
2.6%
Other values (146) 416
62.8%
Latin
ValueCountFrequency (%)
C 5
16.7%
I 4
13.3%
K 3
 
10.0%
T 2
 
6.7%
M 2
 
6.7%
F 1
 
3.3%
A 1
 
3.3%
E 1
 
3.3%
R 1
 
3.3%
O 1
 
3.3%
Other values (9) 9
30.0%
Common
ValueCountFrequency (%)
( 45
40.9%
) 45
40.9%
20
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 659
82.2%
ASCII 140
 
17.5%
None 3
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
 
9.0%
29
 
4.4%
28
 
4.2%
22
 
3.3%
19
 
2.9%
19
 
2.9%
18
 
2.7%
18
 
2.7%
17
 
2.6%
17
 
2.6%
Other values (145) 413
62.7%
ASCII
ValueCountFrequency (%)
( 45
32.1%
) 45
32.1%
20
14.3%
C 5
 
3.6%
I 4
 
2.9%
K 3
 
2.1%
T 2
 
1.4%
M 2
 
1.4%
F 1
 
0.7%
A 1
 
0.7%
Other values (12) 12
 
8.6%
None
ValueCountFrequency (%)
3
100.0%

연락처
Text

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size844.0 B
2024-04-21T10:09:49.427475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.898876
Min length11

Characters and Unicode

Total characters1059
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)100.0%

Sample

1st row031-893-1000
2nd row02-568-7744
3rd row031-427-8051
4th row02-899-4265
5th row02-3663-7757
ValueCountFrequency (%)
031-893-1000 1
 
1.1%
02-2164-1493 1
 
1.1%
02-959-9004 1
 
1.1%
070-4801-6226 1
 
1.1%
070-7011-3131 1
 
1.1%
02-566-3360 1
 
1.1%
031-750-0150 1
 
1.1%
02-3472-9078 1
 
1.1%
031-799-9500 1
 
1.1%
031-339-9970 1
 
1.1%
Other values (79) 79
88.8%
2024-04-21T10:09:49.788730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 188
17.8%
- 177
16.7%
2 103
9.7%
3 99
9.3%
1 95
9.0%
7 78
7.4%
5 71
 
6.7%
6 67
 
6.3%
9 66
 
6.2%
8 53
 
5.0%
Other values (2) 62
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 871
82.2%
Dash Punctuation 177
 
16.7%
Other Punctuation 11
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 188
21.6%
2 103
11.8%
3 99
11.4%
1 95
10.9%
7 78
9.0%
5 71
 
8.2%
6 67
 
7.7%
9 66
 
7.6%
8 53
 
6.1%
4 51
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 177
100.0%
Other Punctuation
ValueCountFrequency (%)
* 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1059
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 188
17.8%
- 177
16.7%
2 103
9.7%
3 99
9.3%
1 95
9.0%
7 78
7.4%
5 71
 
6.7%
6 67
 
6.3%
9 66
 
6.2%
8 53
 
5.0%
Other values (2) 62
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1059
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 188
17.8%
- 177
16.7%
2 103
9.7%
3 99
9.3%
1 95
9.0%
7 78
7.4%
5 71
 
6.7%
6 67
 
6.3%
9 66
 
6.2%
8 53
 
5.0%
Other values (2) 62
 
5.9%

홈페이지
Text

MISSING 

Distinct77
Distinct (%)100.0%
Missing12
Missing (%)13.5%
Memory size844.0 B
2024-04-21T10:09:50.003962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length26
Mean length18.116883
Min length11

Characters and Unicode

Total characters1395
Distinct characters45
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)100.0%

Sample

1st rowwww.ntree.or.kr
2nd rowwww.dfda.co.kr
3rd rowwww.safe-worldeng.com
4th rowhttp://www.gsckorea.co.kr/
5th rowhttp://sky-bridge.co.kr/
ValueCountFrequency (%)
isoaudit.or.kr 1
 
1.3%
www.medi-guide.com 1
 
1.3%
www.kbiotechsolutions.com 1
 
1.3%
www.erms.co.kr 1
 
1.3%
www.beijingalan.com 1
 
1.3%
www.house-boutique.net 1
 
1.3%
https://blog.naver.com/onbix 1
 
1.3%
www.hiqconsulting.com 1
 
1.3%
www.onetech.co.kr 1
 
1.3%
www.e-ctk.com 1
 
1.3%
Other values (67) 67
87.0%
2024-04-21T10:09:50.393682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 188
13.5%
. 173
12.4%
o 102
 
7.3%
c 101
 
7.2%
r 88
 
6.3%
t 83
 
5.9%
e 65
 
4.7%
/ 64
 
4.6%
k 64
 
4.6%
m 57
 
4.1%
Other values (35) 410
29.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1103
79.1%
Other Punctuation 259
 
18.6%
Dash Punctuation 14
 
1.0%
Other Letter 12
 
0.9%
Decimal Number 7
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 188
17.0%
o 102
 
9.2%
c 101
 
9.2%
r 88
 
8.0%
t 83
 
7.5%
e 65
 
5.9%
k 64
 
5.8%
m 57
 
5.2%
s 49
 
4.4%
i 48
 
4.4%
Other values (16) 258
23.4%
Other Letter
ValueCountFrequency (%)
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Decimal Number
ValueCountFrequency (%)
1 3
42.9%
0 2
28.6%
2 1
 
14.3%
9 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
. 173
66.8%
/ 64
 
24.7%
: 22
 
8.5%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1103
79.1%
Common 280
 
20.1%
Hangul 12
 
0.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 188
17.0%
o 102
 
9.2%
c 101
 
9.2%
r 88
 
8.0%
t 83
 
7.5%
e 65
 
5.9%
k 64
 
5.8%
m 57
 
5.2%
s 49
 
4.4%
i 48
 
4.4%
Other values (16) 258
23.4%
Hangul
ValueCountFrequency (%)
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Common
ValueCountFrequency (%)
. 173
61.8%
/ 64
 
22.9%
: 22
 
7.9%
- 14
 
5.0%
1 3
 
1.1%
0 2
 
0.7%
2 1
 
0.4%
9 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1383
99.1%
Hangul 12
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 188
13.6%
. 173
12.5%
o 102
 
7.4%
c 101
 
7.3%
r 88
 
6.4%
t 83
 
6.0%
e 65
 
4.7%
/ 64
 
4.6%
k 64
 
4.6%
m 57
 
4.1%
Other values (24) 398
28.8%
Hangul
ValueCountFrequency (%)
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%

지역
Categorical

Distinct9
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Memory size844.0 B
서울
39 
경기
35 
대전·세종
부산
경기북부
 
2
Other values (4)

Length

Max length5
Median length2
Mean length2.2808989
Min length2

Unique

Unique3 ?
Unique (%)3.4%

Sample

1st row경기
2nd row서울
3rd row경기
4th row경기
5th row서울

Common Values

ValueCountFrequency (%)
서울 39
43.8%
경기 35
39.3%
대전·세종 4
 
4.5%
부산 4
 
4.5%
경기북부 2
 
2.2%
부산.울산 2
 
2.2%
광주.전남 1
 
1.1%
경남 1
 
1.1%
인천 1
 
1.1%

Length

2024-04-21T10:09:50.521704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:09:50.625774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울 39
43.8%
경기 35
39.3%
대전·세종 4
 
4.5%
부산 4
 
4.5%
경기북부 2
 
2.2%
부산.울산 2
 
2.2%
광주.전남 1
 
1.1%
경남 1
 
1.1%
인천 1
 
1.1%

Interactions

2024-04-21T10:09:48.138257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:09:50.709422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자번호기관명연락처홈페이지지역
사업자번호\t1.0001.0001.0001.0000.550
기관명1.0001.0001.0001.0001.000
연락처1.0001.0001.0001.0001.000
홈페이지1.0001.0001.0001.0001.000
지역0.5501.0001.0001.0001.000
2024-04-21T10:09:50.797651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자번호지역
사업자번호\t1.0000.287
지역0.2871.000

Missing values

2024-04-21T10:09:48.299432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:09:48.383928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업자번호기관명연락처홈페이지지역
01388132811(주)엔트리연구원031-893-1000www.ntree.or.kr경기
11208651336주식회사 켐론에프디에이코리아02-568-7744www.dfda.co.kr서울
21298674447세이프월드엔지니어링031-427-8051www.safe-worldeng.com경기
33918100768글로벌표준인증원02-899-4265http://www.gsckorea.co.kr/경기
42118806091관세법인스카이브릿지02-3663-7757http://sky-bridge.co.kr/서울
57808600129(주)씨엠디아이02-6919-0252www.icmdi.com서울
66738601908주식회사 씨디알아이02-6225-5253https://cdri.pro/서울
71138206228한국산업기술시험원02-860-1338https://www.ktl.re.kr/main.do서울
81058754103(주)GCS02-555-1537www.bestgcs.com경기
91238646943주식회사 에스엠지코리아070-5067-4886isoaudit.or.kr경기
사업자번호기관명연락처홈페이지지역
795823500289에이치경영컨설팅031-376-8112http://www.k-hmc.or.kr/경기
806738101621주식회사 매리스그룹코리아02-6264-1737http://www.maris-reg.kr/sns/서울
816518102294(주)한국방폭기술원031-5175-1472www.kext.kr경기
823858602301(주)타이거씨티031-784-8534https://blog.naver.com/tygerct경기
837258100447참조은㈜031-757-3485https://tgc-md.kr경기
844518700174(주)아라테크051-796-1361http://마루치아라치.com/부산.울산
851040309250글로벌스탠다드070-7510-3390<NA>경기
861071375272이큐에스02-6082-9001www.eqs119.com경기
871132421802한국의료기기인증원 (KMC)070-8965-5554www.kmcerti.com서울
881304272974스마트컨설팅070-4084-9177<NA>경기