Overview

Dataset statistics

Number of variables4
Number of observations37
Missing cells2
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory35.6 B

Variable types

Text3
Boolean1

Dataset

Description한국공예디자인문화진흥원 대표홈페이지의 코드 등록/사용/수정에 사용되는 데이터로 코드아이디, 코드명, 코드설명, 사용여부 항목을 제공합니다.
Author한국공예디자인문화진흥원
URLhttps://www.data.go.kr/data/15072645/fileData.do

Alerts

사용여부 is highly imbalanced (82.1%)Imbalance
코드설명 has 2 (5.4%) missing valuesMissing
코드아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:27:26.774777
Analysis finished2023-12-12 06:27:27.289500
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

코드아이디
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-12T15:27:27.475909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters222
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st rowCOM001
2nd rowCOM003
3rd rowCOM004
4th rowCOM005
5th rowCOM009
ValueCountFrequency (%)
com001 1
 
2.7%
com038 1
 
2.7%
itn001 1
 
2.7%
itn002 1
 
2.7%
itn003 1
 
2.7%
itn004 1
 
2.7%
itn005 1
 
2.7%
itn006 1
 
2.7%
itn007 1
 
2.7%
itn008 1
 
2.7%
Other values (27) 27
73.0%
2023-12-12T15:27:27.919137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 53
23.9%
C 21
 
9.5%
O 21
 
9.5%
M 21
 
9.5%
I 16
 
7.2%
T 16
 
7.2%
N 16
 
7.2%
1 15
 
6.8%
3 14
 
6.3%
2 6
 
2.7%
Other values (6) 23
10.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 111
50.0%
Uppercase Letter 111
50.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 53
47.7%
1 15
 
13.5%
3 14
 
12.6%
2 6
 
5.4%
4 5
 
4.5%
5 5
 
4.5%
9 5
 
4.5%
8 3
 
2.7%
6 3
 
2.7%
7 2
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
C 21
18.9%
O 21
18.9%
M 21
18.9%
I 16
14.4%
T 16
14.4%
N 16
14.4%

Most occurring scripts

ValueCountFrequency (%)
Common 111
50.0%
Latin 111
50.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 53
47.7%
1 15
 
13.5%
3 14
 
12.6%
2 6
 
5.4%
4 5
 
4.5%
5 5
 
4.5%
9 5
 
4.5%
8 3
 
2.7%
6 3
 
2.7%
7 2
 
1.8%
Latin
ValueCountFrequency (%)
C 21
18.9%
O 21
18.9%
M 21
18.9%
I 16
14.4%
T 16
14.4%
N 16
14.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 222
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 53
23.9%
C 21
 
9.5%
O 21
 
9.5%
M 21
 
9.5%
I 16
 
7.2%
T 16
 
7.2%
N 16
 
7.2%
1 15
 
6.8%
3 14
 
6.3%
2 6
 
2.7%
Other values (6) 23
10.4%
Distinct36
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-12T15:27:28.188180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.972973
Min length4

Characters and Unicode

Total characters221
Distinct characters93
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)94.6%

Sample

1st row등록구분
2nd row업무구분
3rd row게시판유형
4th row템플릿유형
5th row게시판속성
ValueCountFrequency (%)
구분 4
 
8.5%
게시판유형 2
 
4.3%
정보공개자료 2
 
4.3%
종류 1
 
2.1%
메인이미지 1
 
2.1%
관리 1
 
2.1%
게시판템플릿 1
 
2.1%
게시판 1
 
2.1%
사용자 1
 
2.1%
신고 1
 
2.1%
Other values (32) 32
68.1%
2023-12-12T15:27:28.534328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
6.8%
15
 
6.8%
10
 
4.5%
9
 
4.1%
7
 
3.2%
7
 
3.2%
7
 
3.2%
7
 
3.2%
5
 
2.3%
5
 
2.3%
Other values (83) 134
60.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 209
94.6%
Space Separator 10
 
4.5%
Dash Punctuation 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
7.2%
15
 
7.2%
9
 
4.3%
7
 
3.3%
7
 
3.3%
7
 
3.3%
7
 
3.3%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (81) 127
60.8%
Space Separator
ValueCountFrequency (%)
10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 209
94.6%
Common 12
 
5.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
7.2%
15
 
7.2%
9
 
4.3%
7
 
3.3%
7
 
3.3%
7
 
3.3%
7
 
3.3%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (81) 127
60.8%
Common
ValueCountFrequency (%)
10
83.3%
- 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 209
94.6%
ASCII 12
 
5.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
7.2%
15
 
7.2%
9
 
4.3%
7
 
3.3%
7
 
3.3%
7
 
3.3%
7
 
3.3%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (81) 127
60.8%
ASCII
ValueCountFrequency (%)
10
83.3%
- 2
 
16.7%

코드설명
Text

MISSING 

Distinct35
Distinct (%)100.0%
Missing2
Missing (%)5.4%
Memory size428.0 B
2023-12-12T15:27:28.818985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length19
Mean length10.228571
Min length4

Characters and Unicode

Total characters358
Distinct characters130
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row게시판, 커뮤니티, 동호회 등록구분코드1
2nd row업무구분코드
3rd row게시판유형구분코드
4th row템플릿유형구분코드
5th row게시판 속성
ValueCountFrequency (%)
구분 6
 
8.3%
게시판 2
 
2.8%
코드 2
 
2.8%
상태구분 2
 
2.8%
정보공개자료 2
 
2.8%
대관관리-신청자구분 1
 
1.4%
신고 1
 
1.4%
내부직원/협력가족/일반사용자 1
 
1.4%
게시판템플릿 1
 
1.4%
매뉴화면구분 1
 
1.4%
Other values (53) 53
73.6%
2023-12-12T15:27:29.219881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37
 
10.3%
21
 
5.9%
21
 
5.9%
10
 
2.8%
10
 
2.8%
8
 
2.2%
8
 
2.2%
7
 
2.0%
7
 
2.0%
/ 7
 
2.0%
Other values (120) 222
62.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 306
85.5%
Space Separator 37
 
10.3%
Other Punctuation 11
 
3.1%
Dash Punctuation 1
 
0.3%
Decimal Number 1
 
0.3%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
6.9%
21
 
6.9%
10
 
3.3%
10
 
3.3%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (113) 202
66.0%
Other Punctuation
ValueCountFrequency (%)
/ 7
63.6%
, 4
36.4%
Space Separator
ValueCountFrequency (%)
37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 306
85.5%
Common 52
 
14.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
6.9%
21
 
6.9%
10
 
3.3%
10
 
3.3%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (113) 202
66.0%
Common
ValueCountFrequency (%)
37
71.2%
/ 7
 
13.5%
, 4
 
7.7%
- 1
 
1.9%
1 1
 
1.9%
) 1
 
1.9%
( 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 306
85.5%
ASCII 52
 
14.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37
71.2%
/ 7
 
13.5%
, 4
 
7.7%
- 1
 
1.9%
1 1
 
1.9%
) 1
 
1.9%
( 1
 
1.9%
Hangul
ValueCountFrequency (%)
21
 
6.9%
21
 
6.9%
10
 
3.3%
10
 
3.3%
8
 
2.6%
8
 
2.6%
7
 
2.3%
7
 
2.3%
6
 
2.0%
6
 
2.0%
Other values (113) 202
66.0%

사용여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size169.0 B
True
36 
False
 
1
ValueCountFrequency (%)
True 36
97.3%
False 1
 
2.7%
2023-12-12T15:27:29.361026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:27:29.443123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
코드아이디코드명코드설명사용여부
코드아이디1.0001.0001.0001.000
코드명1.0001.0001.0001.000
코드설명1.0001.0001.0001.000
사용여부1.0001.0001.0001.000

Missing values

2023-12-12T15:27:27.074102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:27:27.231409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

코드아이디코드명코드설명사용여부
0COM001등록구분게시판, 커뮤니티, 동호회 등록구분코드1Y
1COM003업무구분업무구분코드Y
2COM004게시판유형게시판유형구분코드Y
3COM005템플릿유형템플릿유형구분코드Y
4COM009게시판속성게시판 속성Y
5COM013회원상태회원 가입 신청/승인/삭제를 위한 상태 구분Y
6COM014성별구분남녀 성별 구분Y
7COM018질문유형질문유형 객관식/주관식 상태구분Y
8COM019일정중요도일정중요도 낮음/보통/높음 상태구분Y
9COM022비밀번호 힌트비밀번호 힌트 구분코드Y
코드아이디코드명코드설명사용여부
27ITN007대관관리-신청자구분대관관리-신청자구분Y
28ITN008채용게시판 구분채용게시판 구분Y
29ITN009사업구분사업구분Y
30ITN010사업게시판 구분사업게시판 구분Y
31ITN011보도자료구분보도자료구분Y
32ITN012이용문의 구분이용문의 구분Y
33ITN013메인이미지 종류메인이미지 종류Y
34ITN014정보공개자료 관리정보공개자료 관리Y
35ITN015정보공개자료 연도정보공개자료 연도Y
36ITN016외국어사업구분외국어사업구분Y