Overview

Dataset statistics

Number of variables13
Number of observations74
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.0 KiB
Average record size in memory110.8 B

Variable types

Numeric1
Text1
Categorical9
Boolean2

Dataset

Description한국사학진흥재단에서 운영하고 있는 대학재정회계센터 DB와 연동된 시스템의 상품정보, 상품이미지 등에 대한 데이터입니다.
Author한국사학진흥재단
URLhttps://www.data.go.kr/data/15086003/fileData.do

Alerts

상품상태 has constant value ""Constant
상태명 has constant value ""Constant
기간사용 has constant value ""Constant
연동시스템코드 is highly overall correlated with 상품고유번호 and 4 other fieldsHigh correlation
연동시스템코드명 is highly overall correlated with 상품고유번호 and 4 other fieldsHigh correlation
삭제여부 is highly overall correlated with 등록일 and 2 other fieldsHigh correlation
등록일 is highly overall correlated with 상품고유번호 and 7 other fieldsHigh correlation
게시여부 is highly overall correlated with 등록일 and 2 other fieldsHigh correlation
상품종류 is highly overall correlated with 상품고유번호 and 4 other fieldsHigh correlation
삭제일 is highly overall correlated with 등록일 and 2 other fieldsHigh correlation
상품종류명 is highly overall correlated with 상품고유번호 and 4 other fieldsHigh correlation
상품고유번호 is highly overall correlated with 상품종류 and 4 other fieldsHigh correlation
삭제일 is highly imbalanced (73.4%)Imbalance
삭제여부 is highly imbalanced (64.3%)Imbalance
게시여부 is highly imbalanced (64.3%)Imbalance
상품고유번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:35:59.137046
Analysis finished2023-12-12 15:36:00.285123
Duration1.15 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상품고유번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct74
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6953.5946
Minimum1101
Maximum10062
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size798.0 B
2023-12-13T00:36:00.372540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1101
5-th percentile1104.65
Q11427.5
median10010.5
Q310028.75
95-th percentile10043.35
Maximum10062
Range8961
Interquartile range (IQR)8601.25

Descriptive statistics

Standard deviation4202.1515
Coefficient of variation (CV)0.60431356
Kurtosis-1.6359649
Mean6953.5946
Median Absolute Deviation (MAD)26
Skewness-0.63749917
Sum514566
Variance17658077
MonotonicityNot monotonic
2023-12-13T00:36:00.527952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10039 1
 
1.4%
10004 1
 
1.4%
10000 1
 
1.4%
1601 1
 
1.4%
1506 1
 
1.4%
1505 1
 
1.4%
1504 1
 
1.4%
1503 1
 
1.4%
1502 1
 
1.4%
1501 1
 
1.4%
Other values (64) 64
86.5%
ValueCountFrequency (%)
1101 1
1.4%
1102 1
1.4%
1103 1
1.4%
1104 1
1.4%
1105 1
1.4%
1106 1
1.4%
1107 1
1.4%
1108 1
1.4%
1201 1
1.4%
1202 1
1.4%
ValueCountFrequency (%)
10062 1
1.4%
10061 1
1.4%
10060 1
1.4%
10044 1
1.4%
10043 1
1.4%
10042 1
1.4%
10041 1
1.4%
10040 1
1.4%
10039 1
1.4%
10038 1
1.4%
Distinct73
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size724.0 B
2023-12-13T00:36:00.814962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length31.5
Mean length18.162162
Min length5

Characters and Unicode

Total characters1344
Distinct characters172
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)97.3%

Sample

1st row부가가치세법 해설
2nd row소득세법 해설
3rd row부속병원회계세무
4th row산학협력단회계 해설
5th row2009.2_회계원리 기초과정
ValueCountFrequency (%)
14
 
7.2%
해설 8
 
4.1%
실무 5
 
2.6%
중심으로 3
 
1.5%
학교회계 3
 
1.5%
사립대학 3
 
1.5%
3
 
1.5%
신고해설 2
 
1.0%
2009.11_사례로 2
 
1.0%
2009.7_부가가치세 2
 
1.0%
Other values (127) 149
76.8%
2023-12-13T00:36:01.248952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
120
 
8.9%
0 86
 
6.4%
- 50
 
3.7%
2 46
 
3.4%
9 44
 
3.3%
_ 42
 
3.1%
. 38
 
2.8%
29
 
2.2%
26
 
1.9%
25
 
1.9%
Other values (162) 838
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 837
62.3%
Decimal Number 222
 
16.5%
Space Separator 120
 
8.9%
Dash Punctuation 50
 
3.7%
Connector Punctuation 42
 
3.1%
Other Punctuation 39
 
2.9%
Close Punctuation 14
 
1.0%
Open Punctuation 14
 
1.0%
Uppercase Letter 6
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
3.5%
26
 
3.1%
25
 
3.0%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
23
 
2.7%
20
 
2.4%
19
 
2.3%
Other values (141) 602
71.9%
Decimal Number
ValueCountFrequency (%)
0 86
38.7%
2 46
20.7%
9 44
19.8%
1 24
 
10.8%
8 6
 
2.7%
7 4
 
1.8%
4 4
 
1.8%
5 3
 
1.4%
3 3
 
1.4%
6 2
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
T 2
33.3%
O 1
16.7%
L 1
16.7%
Other Punctuation
ValueCountFrequency (%)
. 38
97.4%
/ 1
 
2.6%
Space Separator
ValueCountFrequency (%)
120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 837
62.3%
Common 501
37.3%
Latin 6
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
3.5%
26
 
3.1%
25
 
3.0%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
23
 
2.7%
20
 
2.4%
19
 
2.3%
Other values (141) 602
71.9%
Common
ValueCountFrequency (%)
120
24.0%
0 86
17.2%
- 50
10.0%
2 46
 
9.2%
9 44
 
8.8%
_ 42
 
8.4%
. 38
 
7.6%
1 24
 
4.8%
) 14
 
2.8%
( 14
 
2.8%
Other values (7) 23
 
4.6%
Latin
ValueCountFrequency (%)
B 2
33.3%
T 2
33.3%
O 1
16.7%
L 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 837
62.3%
ASCII 507
37.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
120
23.7%
0 86
17.0%
- 50
9.9%
2 46
 
9.1%
9 44
 
8.7%
_ 42
 
8.3%
. 38
 
7.5%
1 24
 
4.7%
) 14
 
2.8%
( 14
 
2.8%
Other values (11) 29
 
5.7%
Hangul
ValueCountFrequency (%)
29
 
3.5%
26
 
3.1%
25
 
3.0%
24
 
2.9%
23
 
2.7%
23
 
2.7%
23
 
2.7%
23
 
2.7%
20
 
2.4%
19
 
2.3%
Other values (141) 602
71.9%

상품종류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size724.0 B
10
40 
30
26 
20

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row10

Common Values

ValueCountFrequency (%)
10 40
54.1%
30 26
35.1%
20 8
 
10.8%

Length

2023-12-13T00:36:01.400525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:01.501209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10 40
54.1%
30 26
35.1%
20 8
 
10.8%

상품종류명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size724.0 B
마일리지
40 
게시글관련
26 
상품

Length

Max length5
Median length4
Mean length4.1351351
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상품
2nd row상품
3rd row상품
4th row상품
5th row마일리지

Common Values

ValueCountFrequency (%)
마일리지 40
54.1%
게시글관련 26
35.1%
상품 8
 
10.8%

Length

2023-12-13T00:36:01.609906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:01.734228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
마일리지 40
54.1%
게시글관련 26
35.1%
상품 8
 
10.8%

연동시스템코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size724.0 B
1001
39 
1000
35 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1000
2nd row1000
3rd row1000
4th row1000
5th row1001

Common Values

ValueCountFrequency (%)
1001 39
52.7%
1000 35
47.3%

Length

2023-12-13T00:36:01.840865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:01.936038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1001 39
52.7%
1000 35
47.3%

연동시스템코드명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size724.0 B
교육연수센터
39 
학교경영지원센터
35 

Length

Max length8
Median length6
Mean length6.9459459
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학교경영지원센터
2nd row학교경영지원센터
3rd row학교경영지원센터
4th row학교경영지원센터
5th row교육연수센터

Common Values

ValueCountFrequency (%)
교육연수센터 39
52.7%
학교경영지원센터 35
47.3%

Length

2023-12-13T00:36:02.052639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:02.198604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육연수센터 39
52.7%
학교경영지원센터 35
47.3%

상품상태
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size724.0 B
30
74 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row30
2nd row30
3rd row30
4th row30
5th row30

Common Values

ValueCountFrequency (%)
30 74
100.0%

Length

2023-12-13T00:36:02.304177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:02.439042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
30 74
100.0%

상태명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size724.0 B
영구사용
74 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영구사용
2nd row영구사용
3rd row영구사용
4th row영구사용
5th row영구사용

Common Values

ValueCountFrequency (%)
영구사용 74
100.0%

Length

2023-12-13T00:36:02.579945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:02.712242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영구사용 74
100.0%

기간사용
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size724.0 B
0
74 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 74
100.0%

Length

2023-12-13T00:36:03.213549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:03.378605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 74
100.0%

등록일
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size724.0 B
2009-10-20
37 
2009-08-21
26 
2009-10-21
2009-12-09
 
3
2009-10-23
 
2

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2009-10-21
2nd row2009-10-21
3rd row2009-10-21
4th row2009-10-23
5th row2009-10-20

Common Values

ValueCountFrequency (%)
2009-10-20 37
50.0%
2009-08-21 26
35.1%
2009-10-21 4
 
5.4%
2009-12-09 3
 
4.1%
2009-10-23 2
 
2.7%
2009-10-19 2
 
2.7%

Length

2023-12-13T00:36:03.506391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:03.660718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2009-10-20 37
50.0%
2009-08-21 26
35.1%
2009-10-21 4
 
5.4%
2009-12-09 3
 
4.1%
2009-10-23 2
 
2.7%
2009-10-19 2
 
2.7%

삭제일
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size724.0 B
1999-12-31
69 
2009-12-09
 
3
2009-10-20
 
2

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1999-12-31
2nd row1999-12-31
3rd row1999-12-31
4th row1999-12-31
5th row1999-12-31

Common Values

ValueCountFrequency (%)
1999-12-31 69
93.2%
2009-12-09 3
 
4.1%
2009-10-20 2
 
2.7%

Length

2023-12-13T00:36:03.828464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:36:03.962126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1999-12-31 69
93.2%
2009-12-09 3
 
4.1%
2009-10-20 2
 
2.7%

삭제여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size206.0 B
False
69 
True
 
5
ValueCountFrequency (%)
False 69
93.2%
True 5
 
6.8%
2023-12-13T00:36:04.145485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

게시여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size206.0 B
True
69 
False
 
5
ValueCountFrequency (%)
True 69
93.2%
False 5
 
6.8%
2023-12-13T00:36:04.245643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-13T00:35:59.827619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:36:04.339470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상품고유번호상품명상품종류상품종류명연동시스템코드연동시스템코드명등록일삭제일삭제여부게시여부
상품고유번호1.0001.0001.0001.0000.9180.9181.0000.0640.1170.117
상품명1.0001.0001.0001.0001.0001.0001.0000.0000.0000.000
상품종류1.0001.0001.0001.0000.7470.7470.9980.3580.0700.070
상품종류명1.0001.0001.0001.0000.7470.7470.9980.3580.0700.070
연동시스템코드0.9181.0000.7470.7471.0000.9990.9990.1220.0000.000
연동시스템코드명0.9181.0000.7470.7470.9991.0000.9990.1220.0000.000
등록일1.0001.0000.9980.9980.9990.9991.0000.9810.9440.944
삭제일0.0640.0000.3580.3580.1220.1220.9811.0001.0001.000
삭제여부0.1170.0000.0700.0700.0000.0000.9441.0001.0000.985
게시여부0.1170.0000.0700.0700.0000.0000.9441.0000.9851.000
2023-12-13T00:36:04.535188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연동시스템코드연동시스템코드명삭제여부등록일게시여부상품종류삭제일상품종류명
연동시스템코드1.0000.9730.0000.9440.0000.9660.1990.966
연동시스템코드명0.9731.0000.0000.9440.0000.9660.1990.966
삭제여부0.0000.0001.0000.7670.8910.1140.9930.114
등록일0.9440.9440.7671.0000.7670.9190.8150.919
게시여부0.0000.0000.8910.7671.0000.1140.9930.114
상품종류0.9660.9660.1140.9190.1141.0000.1251.000
삭제일0.1990.1990.9930.8150.9930.1251.0000.125
상품종류명0.9660.9660.1140.9190.1141.0000.1251.000
2023-12-13T00:36:04.665057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상품고유번호상품종류상품종류명연동시스템코드연동시스템코드명등록일삭제일삭제여부게시여부
상품고유번호1.0000.9930.9930.7440.7440.9720.1100.0800.080
상품종류0.9931.0001.0000.9660.9660.9190.1250.1140.114
상품종류명0.9931.0001.0000.9660.9660.9190.1250.1140.114
연동시스템코드0.7440.9660.9661.0000.9730.9440.1990.0000.000
연동시스템코드명0.7440.9660.9660.9731.0000.9440.1990.0000.000
등록일0.9720.9190.9190.9440.9441.0000.8150.7670.767
삭제일0.1100.1250.1250.1990.1990.8151.0000.9930.993
삭제여부0.0800.1140.1140.0000.0000.7670.9931.0000.891
게시여부0.0800.1140.1140.0000.0000.7670.9930.8911.000

Missing values

2023-12-13T00:35:59.969560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:36:00.207110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상품고유번호상품명상품종류상품종류명연동시스템코드연동시스템코드명상품상태상태명기간사용등록일삭제일삭제여부게시여부
010039부가가치세법 해설20상품1000학교경영지원센터30영구사용02009-10-211999-12-31NY
110040소득세법 해설20상품1000학교경영지원센터30영구사용02009-10-211999-12-31NY
210042부속병원회계세무20상품1000학교경영지원센터30영구사용02009-10-211999-12-31NY
310044산학협력단회계 해설20상품1000학교경영지원센터30영구사용02009-10-231999-12-31NY
4100022009.2_회계원리 기초과정10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
5100032009.3_용역 계약 실무과정10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
6100052009.4_근로기준법해설10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
7100062009.4_학교의 합리적인 내자구매 실무과정10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
8100272009.9_세무기초과정(2차)10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
9100282009.10_대학의 민간투자사업(BTL/BTO) 실무10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
상품고유번호상품명상품종류상품종류명연동시스템코드연동시스템코드명상품상태상태명기간사용등록일삭제일삭제여부게시여부
64100182009.8_법인세 및 상속증여세법 해설_학교중심으로10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
65100192009.8_법인세 및 상속증여세법 해설_산학협력단 중심으로10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
66100202009.8_비정규직 및 기간제 근로자의 노무관리10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
67100212009.8_소득세 및 원천징수 사례_학교중심으로10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
68100222009.8_소득세 및 원천징수 사례_산학협력단 중심으로10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
69100232009.9_세무기초과정(1차)10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
70100242009.9_사학기관재무회계규칙에 대한 특례규칙 해설(1차)10마일리지1001교육연수센터30영구사용02009-10-201999-12-31NY
7110043사립대학 학교회계해설20상품1000학교경영지원센터30영구사용02009-10-231999-12-31NY
72100602009.11 사립대학 학교회계 해설10마일리지1001교육연수센터30영구사용02009-12-092009-12-09YN
73100622009.11 사립대학 학교회계 해설10마일리지1001교육연수센터30영구사용02009-12-091999-12-31NY