Overview

Dataset statistics

Number of variables9
Number of observations29
Missing cells28
Missing cells (%)10.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory77.6 B

Variable types

Numeric1
Text3
Categorical4
Boolean1

Dataset

Description환경경영정보포털 홈페이지의 테이블 코드 그룹 정보 제공(코드그룹코드, 그룹코드명, 설명, 등록자, 등록일자 등)
Author환경부
URLhttps://www.data.go.kr/data/15039237/fileData.do

Alerts

설명 has constant value ""Constant
삭제여부 has constant value ""Constant
수정일 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
등록일 is highly overall correlated with 등록자 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 수정자 and 1 other fieldsHigh correlation
등록자 is highly overall correlated with 등록일 and 1 other fieldsHigh correlation
수정자 is highly overall correlated with 연번 and 3 other fieldsHigh correlation
설명 has 28 (96.6%) missing valuesMissing
연번 has unique valuesUnique
코드그룹코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:29:14.974517
Analysis finished2023-12-12 04:29:15.716303
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-12T13:29:15.803482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.4
Q18
median15
Q322
95-th percentile27.6
Maximum29
Range28
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.5146932
Coefficient of variation (CV)0.56764621
Kurtosis-1.2
Mean15
Median Absolute Deviation (MAD)7
Skewness0
Sum435
Variance72.5
MonotonicityNot monotonic
2023-12-12T13:29:15.998374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
28 1
 
3.4%
29 1
 
3.4%
27 1
 
3.4%
26 1
 
3.4%
25 1
 
3.4%
24 1
 
3.4%
23 1
 
3.4%
22 1
 
3.4%
21 1
 
3.4%
20 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
1 1
3.4%
2 1
3.4%
3 1
3.4%
4 1
3.4%
5 1
3.4%
6 1
3.4%
7 1
3.4%
8 1
3.4%
9 1
3.4%
10 1
3.4%
ValueCountFrequency (%)
29 1
3.4%
28 1
3.4%
27 1
3.4%
26 1
3.4%
25 1
3.4%
24 1
3.4%
23 1
3.4%
22 1
3.4%
21 1
3.4%
20 1
3.4%

코드그룹코드
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-12T13:29:16.314367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length8.0344828
Min length5

Characters and Unicode

Total characters233
Distinct characters28
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st rowCONN_SITE
2nd rowFAMILY_SITE
3rd rowCS_CATE
4th rowSP_CATE
5th rowCS_CMP_CATE
ValueCountFrequency (%)
conn_site 1
 
3.4%
email 1
 
3.4%
g0003 1
 
3.4%
g0004 1
 
3.4%
mbiz_type 1
 
3.4%
cmp_type 1
 
3.4%
hs_test 1
 
3.4%
blng_type 1
 
3.4%
job_cate 1
 
3.4%
ipr_st 1
 
3.4%
Other values (19) 19
65.5%
2023-12-12T13:29:16.817665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 30
12.9%
E 26
11.2%
T 20
 
8.6%
P 18
 
7.7%
S 17
 
7.3%
C 16
 
6.9%
A 13
 
5.6%
I 12
 
5.2%
R 9
 
3.9%
Y 9
 
3.9%
Other values (18) 63
27.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 195
83.7%
Connector Punctuation 30
 
12.9%
Decimal Number 8
 
3.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 26
13.3%
T 20
10.3%
P 18
 
9.2%
S 17
 
8.7%
C 16
 
8.2%
A 13
 
6.7%
I 12
 
6.2%
R 9
 
4.6%
Y 9
 
4.6%
M 8
 
4.1%
Other values (14) 47
24.1%
Decimal Number
ValueCountFrequency (%)
0 6
75.0%
4 1
 
12.5%
3 1
 
12.5%
Connector Punctuation
ValueCountFrequency (%)
_ 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 195
83.7%
Common 38
 
16.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 26
13.3%
T 20
10.3%
P 18
 
9.2%
S 17
 
8.7%
C 16
 
8.2%
A 13
 
6.7%
I 12
 
6.2%
R 9
 
4.6%
Y 9
 
4.6%
M 8
 
4.1%
Other values (14) 47
24.1%
Common
ValueCountFrequency (%)
_ 30
78.9%
0 6
 
15.8%
4 1
 
2.6%
3 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 233
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 30
12.9%
E 26
11.2%
T 20
 
8.6%
P 18
 
7.7%
S 17
 
7.3%
C 16
 
6.9%
A 13
 
5.6%
I 12
 
5.2%
R 9
 
3.9%
Y 9
 
3.9%
Other values (18) 63
27.0%
Distinct27
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-12T13:29:17.125775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length8.5517241
Min length3

Characters and Unicode

Total characters248
Distinct characters88
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)86.2%

Sample

1st row유관기관및단체
2nd row코네틱 패밀리사이트
3rd row컨설턴트 분야
4th row전문인력 분야
5th row컨설팅 기업 분야
ValueCountFrequency (%)
컨설팅 4
 
6.3%
에코디자인 4
 
6.3%
분야 4
 
6.3%
항목 3
 
4.8%
지역 3
 
4.8%
기업 3
 
4.8%
자가진단 2
 
3.2%
국번 2
 
3.2%
체크리스트 2
 
3.2%
업종 2
 
3.2%
Other values (31) 34
54.0%
2023-12-12T13:29:17.584457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
13.7%
10
 
4.0%
9
 
3.6%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (78) 152
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 213
85.9%
Space Separator 34
 
13.7%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
4.7%
9
 
4.2%
8
 
3.8%
7
 
3.3%
6
 
2.8%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (76) 146
68.5%
Space Separator
ValueCountFrequency (%)
34
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 213
85.9%
Common 35
 
14.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
4.7%
9
 
4.2%
8
 
3.8%
7
 
3.3%
6
 
2.8%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (76) 146
68.5%
Common
ValueCountFrequency (%)
34
97.1%
/ 1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 213
85.9%
ASCII 35
 
14.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34
97.1%
/ 1
 
2.9%
Hangul
ValueCountFrequency (%)
10
 
4.7%
9
 
4.2%
8
 
3.8%
7
 
3.3%
6
 
2.8%
6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (76) 146
68.5%

설명
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing28
Missing (%)96.6%
Memory size364.0 B
2023-12-12T13:29:17.687298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row병원
ValueCountFrequency (%)
병원 1
100.0%
2023-12-12T13:29:17.949411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

등록자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
admin
24 
-

Length

Max length5
Median length5
Mean length4.3103448
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd rowadmin
4th rowadmin
5th rowadmin

Common Values

ValueCountFrequency (%)
admin 24
82.8%
- 5
 
17.2%

Length

2023-12-12T13:29:18.088585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:29:18.206713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
admin 24
82.8%
5
 
17.2%

등록일
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)17.2%
Missing0
Missing (%)0.0%
Memory size364.0 B
2017-09-27
16 
-
2017-10-26
2017-11-10
2017-11-03
 
1

Length

Max length10
Median length10
Mean length7.5172414
Min length1

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row-
2nd row-
3rd row2017-09-27
4th row2017-09-27
5th row2017-09-27

Common Values

ValueCountFrequency (%)
2017-09-27 16
55.2%
- 8
27.6%
2017-10-26 2
 
6.9%
2017-11-10 2
 
6.9%
2017-11-03 1
 
3.4%

Length

2023-12-12T13:29:18.334453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:29:18.445958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017-09-27 16
55.2%
8
27.6%
2017-10-26 2
 
6.9%
2017-11-10 2
 
6.9%
2017-11-03 1
 
3.4%

수정자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
admin
19 
-
10 

Length

Max length5
Median length5
Mean length3.6206897
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd rowadmin
4th rowadmin
5th rowadmin

Common Values

ValueCountFrequency (%)
admin 19
65.5%
- 10
34.5%

Length

2023-12-12T13:29:18.620038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:29:19.101394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
admin 19
65.5%
10
34.5%

수정일
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
2017-09-27
16 
-
13 

Length

Max length10
Median length10
Mean length5.9655172
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row2017-09-27
4th row2017-09-27
5th row2017-09-27

Common Values

ValueCountFrequency (%)
2017-09-27 16
55.2%
- 13
44.8%

Length

2023-12-12T13:29:19.206534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:29:19.328062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017-09-27 16
55.2%
13
44.8%

삭제여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size161.0 B
False
29 
ValueCountFrequency (%)
False 29
100.0%
2023-12-12T13:29:19.427534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T13:29:15.351906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:29:19.518003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번코드그룹코드코드그룹명등록자등록일수정자수정일
연번1.0001.0000.8610.7010.6560.9920.994
코드그룹코드1.0001.0001.0001.0001.0001.0001.000
코드그룹명0.8611.0001.0000.0000.0000.5901.000
등록자0.7011.0000.0001.0000.5700.7170.558
등록일0.6561.0000.0000.5701.0000.6841.000
수정자0.9921.0000.5900.7170.6841.0000.905
수정일0.9941.0001.0000.5581.0000.9051.000
2023-12-12T13:29:19.652544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수정일등록자등록일수정자
수정일1.0000.3760.9430.720
등록자0.3761.0000.6470.508
등록일0.9430.6471.0000.769
수정자0.7200.5080.7691.000
2023-12-12T13:29:19.771438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록자등록일수정자수정일
연번1.0000.4500.2890.7730.795
등록자0.4501.0000.6470.5080.376
등록일0.2890.6471.0000.7690.943
수정자0.7730.5080.7691.0000.720
수정일0.7950.3760.9430.7201.000

Missing values

2023-12-12T13:29:15.484446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:29:15.641875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번코드그룹코드코드그룹명설명등록자등록일수정자수정일삭제여부
028CONN_SITE유관기관및단체<NA>----N
129FAMILY_SITE코네틱 패밀리사이트<NA>----N
21CS_CATE컨설턴트 분야<NA>admin2017-09-27admin2017-09-27N
32SP_CATE전문인력 분야<NA>admin2017-09-27admin2017-09-27N
43CS_CMP_CATE컨설팅 기업 분야<NA>admin2017-09-27admin2017-09-27N
54CS_BIZ_GRP컨설팅 기업 업종<NA>admin2017-09-27admin2017-09-27N
65CS_CMP_AREA컨설팅 기업 지역<NA>admin2017-09-27admin2017-09-27N
76CS_OFF_AREA오프라인 컨설팅 희망 지역<NA>admin2017-09-27admin2017-09-27N
87PRD_TYPE에코디자인 제품 분류<NA>admin2017-09-27admin2017-09-27N
98IDEA_KWD에코디자인 아이디어 키워드<NA>admin2017-09-27admin2017-09-27N
연번코드그룹코드코드그룹명설명등록자등록일수정자수정일삭제여부
1918SITE_FTYPE코네틱 패밀리사이트<NA>admin-admin-N
2019IPR_ST지식재산권 상태<NA>admin-admin-N
2120JOB_CATE종사분야<NA>admin2017-10-26--N
2221BLNG_TYPE소속구분<NA>admin2017-10-26--N
2322HS_TEST자가진단 체크리스트 항목<NA>admin2017-11-03--N
2423CMP_TYPE기업형태<NA>----N
2524MBIZ_TYPE기업회원 업종<NA>----N
2625G0004자가진단 체크리스트 항목병원----N
2726G0003회원 구분<NA>admin2017-11-10--N
2827CM_BD공통게시판 항목<NA>admin2017-11-10--N