Overview

Dataset statistics

Number of variables5
Number of observations322
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.0 KiB
Average record size in memory41.4 B

Variable types

Numeric1
Text2
Categorical1
DateTime1

Dataset

Description2023년 6월 12일 기준 한국데이터산업진흥원에서 운영 중인 데이터품질인증, 데이터관리인증, 데이터보안인증 등 데이터 인증 획득 기관 및 기업 현황입니다. 해당 데이터가 보유한 컬럼은 다음과 같습니다. 칼럼명 : 일련번호, 기관(기업)명, 인증부문, 인증대상, 인증취득일자
URLhttps://www.data.go.kr/data/15037527/fileData.do

Alerts

일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:35:25.553951
Analysis finished2023-12-12 07:35:26.235546
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct322
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean161.5
Minimum1
Maximum322
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2023-12-12T16:35:26.318409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.05
Q181.25
median161.5
Q3241.75
95-th percentile305.95
Maximum322
Range321
Interquartile range (IQR)160.5

Descriptive statistics

Standard deviation93.097619
Coefficient of variation (CV)0.57645585
Kurtosis-1.2
Mean161.5
Median Absolute Deviation (MAD)80.5
Skewness0
Sum52003
Variance8667.1667
MonotonicityStrictly increasing
2023-12-12T16:35:26.480223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
243 1
 
0.3%
221 1
 
0.3%
220 1
 
0.3%
219 1
 
0.3%
218 1
 
0.3%
217 1
 
0.3%
216 1
 
0.3%
215 1
 
0.3%
214 1
 
0.3%
Other values (312) 312
96.9%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
322 1
0.3%
321 1
0.3%
320 1
0.3%
319 1
0.3%
318 1
0.3%
317 1
0.3%
316 1
0.3%
315 1
0.3%
314 1
0.3%
313 1
0.3%
Distinct145
Distinct (%)45.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T16:35:26.777646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length7.7018634
Min length2

Characters and Unicode

Total characters2480
Distinct characters230
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)28.0%

Sample

1st row한국전력거래소
2nd row특허청
3rd row코리아크레딧뷰로
4th rowIBK 기업은행
5th row대한주택보증(주)
ValueCountFrequency (%)
건강보험심사평가원 29
 
8.6%
한국연구재단 14
 
4.1%
한국과학기술정보연구원 12
 
3.6%
한국산업기술평가관리원 10
 
3.0%
국민건강보험공단 10
 
3.0%
한국감정원 10
 
3.0%
국립환경과학원 8
 
2.4%
한국관광공사 7
 
2.1%
서울특별시청 7
 
2.1%
산학협력단 7
 
2.1%
Other values (141) 224
66.3%
2023-12-12T16:35:27.310080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
177
 
7.1%
157
 
6.3%
135
 
5.4%
88
 
3.5%
68
 
2.7%
64
 
2.6%
59
 
2.4%
58
 
2.3%
57
 
2.3%
54
 
2.2%
Other values (220) 1563
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2412
97.3%
Space Separator 16
 
0.6%
Lowercase Letter 16
 
0.6%
Uppercase Letter 11
 
0.4%
Open Punctuation 7
 
0.3%
Close Punctuation 7
 
0.3%
Other Symbol 5
 
0.2%
Other Punctuation 4
 
0.2%
Decimal Number 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
177
 
7.3%
157
 
6.5%
135
 
5.6%
88
 
3.6%
68
 
2.8%
64
 
2.7%
59
 
2.4%
58
 
2.4%
57
 
2.4%
54
 
2.2%
Other values (199) 1495
62.0%
Lowercase Letter
ValueCountFrequency (%)
r 4
25.0%
a 3
18.8%
t 3
18.8%
e 2
12.5%
d 1
 
6.2%
i 1
 
6.2%
w 1
 
6.2%
m 1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
K 4
36.4%
S 2
18.2%
B 2
18.2%
W 1
 
9.1%
I 1
 
9.1%
G 1
 
9.1%
Space Separator
ValueCountFrequency (%)
16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 4
100.0%
Decimal Number
ValueCountFrequency (%)
8 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2417
97.5%
Common 36
 
1.5%
Latin 27
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
177
 
7.3%
157
 
6.5%
135
 
5.6%
88
 
3.6%
68
 
2.8%
64
 
2.6%
59
 
2.4%
58
 
2.4%
57
 
2.4%
54
 
2.2%
Other values (200) 1500
62.1%
Latin
ValueCountFrequency (%)
r 4
14.8%
K 4
14.8%
a 3
11.1%
t 3
11.1%
S 2
7.4%
e 2
7.4%
B 2
7.4%
W 1
 
3.7%
d 1
 
3.7%
i 1
 
3.7%
Other values (4) 4
14.8%
Common
ValueCountFrequency (%)
16
44.4%
( 7
19.4%
) 7
19.4%
/ 4
 
11.1%
8 1
 
2.8%
- 1
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2412
97.3%
ASCII 63
 
2.5%
None 5
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
177
 
7.3%
157
 
6.5%
135
 
5.6%
88
 
3.6%
68
 
2.8%
64
 
2.7%
59
 
2.4%
58
 
2.4%
57
 
2.4%
54
 
2.2%
Other values (199) 1495
62.0%
ASCII
ValueCountFrequency (%)
16
25.4%
( 7
11.1%
) 7
11.1%
r 4
 
6.3%
K 4
 
6.3%
/ 4
 
6.3%
a 3
 
4.8%
t 3
 
4.8%
S 2
 
3.2%
e 2
 
3.2%
Other values (10) 11
17.5%
None
ValueCountFrequency (%)
5
100.0%

인증부문
Categorical

Distinct3
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
데이터품질
257 
데이터관리
39 
데이터보안
26 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row데이터관리
2nd row데이터관리
3rd row데이터관리
4th row데이터관리
5th row데이터관리

Common Values

ValueCountFrequency (%)
데이터품질 257
79.8%
데이터관리 39
 
12.1%
데이터보안 26
 
8.1%

Length

2023-12-12T16:35:27.449141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:35:27.545039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
데이터품질 257
79.8%
데이터관리 39
 
12.1%
데이터보안 26
 
8.1%
Distinct252
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T16:35:27.787424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length31
Mean length11.860248
Min length2

Characters and Unicode

Total characters3819
Distinct characters330
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique213 ?
Unique (%)66.1%

Sample

1st row전체 정보시스템
2nd row전체 정보시스템
3rd row전체 정보시스템
4th row전체 정보시스템
5th row전체 정보시스템
ValueCountFrequency (%)
db 24
 
4.0%
시스템 15
 
2.5%
정보시스템 14
 
2.4%
전체 11
 
1.8%
부동산가격정보db 8
 
1.3%
플랫폼 8
 
1.3%
홈페이지 8
 
1.3%
데이터베이스 7
 
1.2%
빅데이터 7
 
1.2%
cdm 7
 
1.2%
Other values (374) 486
81.7%
2023-12-12T16:35:28.268286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
273
 
7.1%
173
 
4.5%
147
 
3.8%
D 143
 
3.7%
138
 
3.6%
133
 
3.5%
115
 
3.0%
B 112
 
2.9%
73
 
1.9%
66
 
1.7%
Other values (320) 2446
64.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2877
75.3%
Uppercase Letter 471
 
12.3%
Space Separator 273
 
7.1%
Lowercase Letter 103
 
2.7%
Open Punctuation 33
 
0.9%
Close Punctuation 32
 
0.8%
Other Punctuation 20
 
0.5%
Dash Punctuation 9
 
0.2%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
6.0%
147
 
5.1%
138
 
4.8%
133
 
4.6%
115
 
4.0%
73
 
2.5%
66
 
2.3%
58
 
2.0%
58
 
2.0%
57
 
2.0%
Other values (269) 1859
64.6%
Uppercase Letter
ValueCountFrequency (%)
D 143
30.4%
B 112
23.8%
C 23
 
4.9%
S 22
 
4.7%
M 21
 
4.5%
W 17
 
3.6%
I 17
 
3.6%
R 16
 
3.4%
E 14
 
3.0%
P 13
 
2.8%
Other values (13) 73
15.5%
Lowercase Letter
ValueCountFrequency (%)
e 14
13.6%
a 13
12.6%
t 10
9.7%
o 9
8.7%
i 9
8.7%
r 8
7.8%
c 7
6.8%
s 7
6.8%
n 7
6.8%
p 4
 
3.9%
Other values (8) 15
14.6%
Other Punctuation
ValueCountFrequency (%)
, 8
40.0%
& 6
30.0%
/ 4
20.0%
· 1
 
5.0%
. 1
 
5.0%
Space Separator
ValueCountFrequency (%)
273
100.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2877
75.3%
Latin 574
 
15.0%
Common 368
 
9.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
6.0%
147
 
5.1%
138
 
4.8%
133
 
4.6%
115
 
4.0%
73
 
2.5%
66
 
2.3%
58
 
2.0%
58
 
2.0%
57
 
2.0%
Other values (269) 1859
64.6%
Latin
ValueCountFrequency (%)
D 143
24.9%
B 112
19.5%
C 23
 
4.0%
S 22
 
3.8%
M 21
 
3.7%
W 17
 
3.0%
I 17
 
3.0%
R 16
 
2.8%
e 14
 
2.4%
E 14
 
2.4%
Other values (31) 175
30.5%
Common
ValueCountFrequency (%)
273
74.2%
( 33
 
9.0%
) 32
 
8.7%
- 9
 
2.4%
, 8
 
2.2%
& 6
 
1.6%
/ 4
 
1.1%
2 1
 
0.3%
· 1
 
0.3%
. 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2877
75.3%
ASCII 941
 
24.6%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
273
29.0%
D 143
15.2%
B 112
11.9%
( 33
 
3.5%
) 32
 
3.4%
C 23
 
2.4%
S 22
 
2.3%
M 21
 
2.2%
W 17
 
1.8%
I 17
 
1.8%
Other values (40) 248
26.4%
Hangul
ValueCountFrequency (%)
173
 
6.0%
147
 
5.1%
138
 
4.8%
133
 
4.6%
115
 
4.0%
73
 
2.5%
66
 
2.3%
58
 
2.0%
58
 
2.0%
57
 
2.0%
Other values (269) 1859
64.6%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct160
Distinct (%)49.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum2008-12-23 00:00:00
Maximum2022-12-23 00:00:00
2023-12-12T16:35:28.418008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:35:28.568371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T16:35:25.956407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:35:28.690515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호인증부문
일련번호1.0000.302
인증부문0.3021.000
2023-12-12T16:35:28.787959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호인증부문
일련번호1.0000.187
인증부문0.1871.000

Missing values

2023-12-12T16:35:26.080395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:35:26.185213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호기관(기업)명인증부문인증대상인증취득일자
01한국전력거래소데이터관리전체 정보시스템2008-12-23
12특허청데이터관리전체 정보시스템2009-01-22
23코리아크레딧뷰로데이터관리전체 정보시스템2009-06-23
34IBK 기업은행데이터관리전체 정보시스템2010-05-20
45대한주택보증(주)데이터관리전체 정보시스템2010-07-16
56교육과학기술부/한국과학기술정보연구원데이터관리한국과학기술정보연구원 NTIS2010-11-25
67산림청데이터품질국가생물종지식정보DB2010-12-23
78한국과학기술정보연구원데이터품질한국과학기술인용색인DB2010-12-29
89정보통신산업진흥원데이터품질전자정보통신 연구자료DB2011-01-07
910서울특별시데이터품질종합민원관리DB2011-01-13
일련번호기관(기업)명인증부문인증대상인증취득일자
312313한국금형산업진흥회데이터품질금형제조데이터 활용 플랫폼 시스템2022-11-25
313314한국과학기술정보연구원데이터품질공공기술사업화 통합 DB2022-11-25
314315국립암센터데이터품질임상 연구 데이터베이스 (CRDW)2022-11-25
315316연세의료원데이터품질임상연구분석포털 (CDW,CDM)2022-11-25
316317한림대학교성심병원 외 8개기관데이터품질한림대학교성심병원 컨소시엄 OMOP-CDM 의료데이터2022-12-23
317318서울시청데이터품질생활복지정보시스템2022-12-23
318319한국남동발전데이터품질대표홈페이지2022-12-23
319320연세대학교 산학협력단데이터품질바이오 소재 데이터 플랫폼2022-12-23
320321한국부동산원데이터품질공동주택관리정보시스템(K-apt)2022-12-23
321322한국문화관광연구원데이터품질관광개발정보시스템(관광개발사업)2022-12-23