Overview

Dataset statistics

Number of variables2
Number of observations189
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory3.1 KiB
Average record size in memory16.7 B

Variable types

Categorical1
Text1

Dataset

Description학점은행제 정보시스템 기준 코드관리 중 신용카드 및 현금영수증 결과코드 리스트로에 대한 데이터로 결제구분별 코드 내용 등의 항목을 제공합니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15089559/fileData.do

Alerts

거래구분 has constant value ""Constant
Dataset has 1 (0.5%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 20:17:15.344135
Analysis finished2023-12-12 20:17:15.650092
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

거래구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
공통
189 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공통
2nd row공통
3rd row공통
4th row공통
5th row공통

Common Values

ValueCountFrequency (%)
공통 189
100.0%

Length

2023-12-13T05:17:15.721023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:17:15.823142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공통 189
100.0%

내용
Text

Distinct188
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T05:17:16.174289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length50
Mean length28.238095
Min length5

Characters and Unicode

Total characters5337
Distinct characters322
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique187 ?
Unique (%)98.9%

Sample

1st row거래성공
2nd row현금영수증 금액 데이타 타입 오류
3rd row현금영수증 금액 5000원보다 작음
4th row현금영수증을 발급할 수 있는 서비스타입이 아닙니다
5th row현금영수증 TID 없음
ValueCountFrequency (%)
바랍니다 39
 
3.9%
상점에 28
 
2.8%
문의하시기 26
 
2.6%
아닙니다 22
 
2.2%
20
 
2.0%
없습니다 19
 
1.9%
상점이 17
 
1.7%
서비스를 16
 
1.6%
환불 13
 
1.3%
서비스 12
 
1.2%
Other values (477) 784
78.7%
2023-12-13T05:17:16.776819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
995
 
18.6%
192
 
3.6%
187
 
3.5%
. 178
 
3.3%
111
 
2.1%
94
 
1.8%
e 86
 
1.6%
76
 
1.4%
75
 
1.4%
t 73
 
1.4%
Other values (312) 3270
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3171
59.4%
Space Separator 995
 
18.6%
Lowercase Letter 596
 
11.2%
Uppercase Letter 248
 
4.6%
Other Punctuation 199
 
3.7%
Decimal Number 65
 
1.2%
Close Punctuation 22
 
0.4%
Open Punctuation 22
 
0.4%
Dash Punctuation 10
 
0.2%
Connector Punctuation 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
 
6.1%
187
 
5.9%
111
 
3.5%
94
 
3.0%
76
 
2.4%
75
 
2.4%
69
 
2.2%
63
 
2.0%
56
 
1.8%
52
 
1.6%
Other values (244) 2196
69.3%
Lowercase Letter
ValueCountFrequency (%)
e 86
14.4%
t 73
12.2%
r 60
10.1%
o 46
 
7.7%
n 45
 
7.6%
a 40
 
6.7%
s 33
 
5.5%
h 30
 
5.0%
c 29
 
4.9%
i 28
 
4.7%
Other values (13) 126
21.1%
Uppercase Letter
ValueCountFrequency (%)
M 27
 
10.9%
C 22
 
8.9%
P 19
 
7.7%
D 18
 
7.3%
A 17
 
6.9%
R 16
 
6.5%
T 15
 
6.0%
B 14
 
5.6%
O 13
 
5.2%
S 13
 
5.2%
Other values (13) 74
29.8%
Decimal Number
ValueCountFrequency (%)
0 16
24.6%
4 9
13.8%
7 9
13.8%
1 8
12.3%
5 8
12.3%
8 7
10.8%
2 3
 
4.6%
3 2
 
3.1%
6 2
 
3.1%
9 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
. 178
89.4%
, 12
 
6.0%
' 7
 
3.5%
/ 2
 
1.0%
Close Punctuation
ValueCountFrequency (%)
) 20
90.9%
] 2
 
9.1%
Open Punctuation
ValueCountFrequency (%)
( 20
90.9%
[ 2
 
9.1%
Space Separator
ValueCountFrequency (%)
995
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3171
59.4%
Common 1322
24.8%
Latin 844
 
15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
 
6.1%
187
 
5.9%
111
 
3.5%
94
 
3.0%
76
 
2.4%
75
 
2.4%
69
 
2.2%
63
 
2.0%
56
 
1.8%
52
 
1.6%
Other values (244) 2196
69.3%
Latin
ValueCountFrequency (%)
e 86
 
10.2%
t 73
 
8.6%
r 60
 
7.1%
o 46
 
5.5%
n 45
 
5.3%
a 40
 
4.7%
s 33
 
3.9%
h 30
 
3.6%
c 29
 
3.4%
i 28
 
3.3%
Other values (36) 374
44.3%
Common
ValueCountFrequency (%)
995
75.3%
. 178
 
13.5%
) 20
 
1.5%
( 20
 
1.5%
0 16
 
1.2%
, 12
 
0.9%
- 10
 
0.8%
4 9
 
0.7%
7 9
 
0.7%
1 8
 
0.6%
Other values (12) 45
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3171
59.4%
ASCII 2166
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
995
45.9%
. 178
 
8.2%
e 86
 
4.0%
t 73
 
3.4%
r 60
 
2.8%
o 46
 
2.1%
n 45
 
2.1%
a 40
 
1.8%
s 33
 
1.5%
h 30
 
1.4%
Other values (58) 580
26.8%
Hangul
ValueCountFrequency (%)
192
 
6.1%
187
 
5.9%
111
 
3.5%
94
 
3.0%
76
 
2.4%
75
 
2.4%
69
 
2.2%
63
 
2.0%
56
 
1.8%
52
 
1.6%
Other values (244) 2196
69.3%

Missing values

2023-12-13T05:17:15.536343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:17:15.617741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

거래구분내용
0공통거래성공
1공통현금영수증 금액 데이타 타입 오류
2공통현금영수증 금액 5000원보다 작음
3공통현금영수증을 발급할 수 있는 서비스타입이 아닙니다
4공통현금영수증 TID 없음
5공통현금영수증 MID 없음
6공통현금영수증 사업자번호 없음
7공통현금영수증 주민번호 없음
8공통현금영수증 CASHID 오류
9공통부분취소값이 원거래 가격보다 큽니다.
거래구분내용
179공통데이터베이스 오류
180공통데이터베이스 업데이트 오류
181공통CCKVAN(WebPos) 통신에러.
182공통VAN 승인 응답 전문의 Format이 틀립니다.
183공통VAN 승인취소 응답 전문의 Format이 틀립니다.
184공통이미 취소 성공된 거래입니다.
185공통승인성공 되지 않은 건의 취소요청입니다.
186공통가맹대행거래를 단독으로 취소불가
187공통단독거래를 가맹대행으로 취소불가
188공통01개월 할부는 불가능합니다.

Duplicate rows

Most frequently occurring

거래구분내용# duplicates
0공통공인인증 검증이 실패되었습니다. 인증서에 이상(폐기, 유효기간만료)이 있는지 확인 바랍니다.2