Overview

Dataset statistics

Number of variables4
Number of observations66
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory36.0 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description용인시 행정사료관 전시유물 목록으로 제공항목은 유물명, 수량, 가입금액 이며, 포은집, 성인록, 정암선생문집 등 총 67종의 자료 제공
Author경기도 용인시
URLhttps://www.data.go.kr/data/3047323/fileData.do

Alerts

번호 is highly overall correlated with 가입금액High correlation
가입금액 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
수량 is highly overall correlated with 가입금액High correlation
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:41:22.244580
Analysis finished2023-12-12 08:41:22.963165
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct66
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.5
Minimum1
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size726.0 B
2023-12-12T17:41:23.056061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.25
Q117.25
median33.5
Q349.75
95-th percentile62.75
Maximum66
Range65
Interquartile range (IQR)32.5

Descriptive statistics

Standard deviation19.196354
Coefficient of variation (CV)0.57302549
Kurtosis-1.2
Mean33.5
Median Absolute Deviation (MAD)16.5
Skewness0
Sum2211
Variance368.5
MonotonicityStrictly increasing
2023-12-12T17:41:23.279469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
51 1
 
1.5%
37 1
 
1.5%
38 1
 
1.5%
39 1
 
1.5%
40 1
 
1.5%
41 1
 
1.5%
42 1
 
1.5%
43 1
 
1.5%
44 1
 
1.5%
Other values (56) 56
84.8%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
66 1
1.5%
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%
58 1
1.5%
57 1
1.5%
Distinct65
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T17:41:23.606756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length12
Mean length7.5151515
Min length2

Characters and Unicode

Total characters496
Distinct characters153
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)97.0%

Sample

1st row포은집
2nd row포은선생문집
3rd row성인록
4th row포은선생집
5th row정암선생문집
ValueCountFrequency (%)
간찰 4
 
3.8%
통계연보 3
 
2.9%
용인군 3
 
2.9%
사마방목 2
 
1.9%
타자기 2
 
1.9%
전화기 2
 
1.9%
제1회 2
 
1.9%
오죽필 2
 
1.9%
호산자락명 2
 
1.9%
목침 2
 
1.9%
Other values (80) 80
76.9%
2023-12-12T17:41:24.126859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
7.7%
14
 
2.8%
) 14
 
2.8%
( 14
 
2.8%
13
 
2.6%
11
 
2.2%
10
 
2.0%
9
 
1.8%
9
 
1.8%
9
 
1.8%
Other values (143) 355
71.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 408
82.3%
Space Separator 38
 
7.7%
Decimal Number 18
 
3.6%
Close Punctuation 14
 
2.8%
Open Punctuation 14
 
2.8%
Other Punctuation 4
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
3.4%
13
 
3.2%
11
 
2.7%
10
 
2.5%
9
 
2.2%
9
 
2.2%
9
 
2.2%
8
 
2.0%
8
 
2.0%
8
 
2.0%
Other values (129) 309
75.7%
Decimal Number
ValueCountFrequency (%)
1 5
27.8%
3 3
16.7%
6 2
 
11.1%
5 2
 
11.1%
7 1
 
5.6%
4 1
 
5.6%
8 1
 
5.6%
0 1
 
5.6%
9 1
 
5.6%
2 1
 
5.6%
Space Separator
ValueCountFrequency (%)
38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 408
82.3%
Common 88
 
17.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
3.4%
13
 
3.2%
11
 
2.7%
10
 
2.5%
9
 
2.2%
9
 
2.2%
9
 
2.2%
8
 
2.0%
8
 
2.0%
8
 
2.0%
Other values (129) 309
75.7%
Common
ValueCountFrequency (%)
38
43.2%
) 14
 
15.9%
( 14
 
15.9%
1 5
 
5.7%
, 4
 
4.5%
3 3
 
3.4%
6 2
 
2.3%
5 2
 
2.3%
7 1
 
1.1%
4 1
 
1.1%
Other values (4) 4
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 408
82.3%
ASCII 88
 
17.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
43.2%
) 14
 
15.9%
( 14
 
15.9%
1 5
 
5.7%
, 4
 
4.5%
3 3
 
3.4%
6 2
 
2.3%
5 2
 
2.3%
7 1
 
1.1%
4 1
 
1.1%
Other values (4) 4
 
4.5%
Hangul
ValueCountFrequency (%)
14
 
3.4%
13
 
3.2%
11
 
2.7%
10
 
2.5%
9
 
2.2%
9
 
2.2%
9
 
2.2%
8
 
2.0%
8
 
2.0%
8
 
2.0%
Other values (129) 309
75.7%

수량
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Memory size660.0 B
1점
33 
1책
16 
2점
 
3
5책
 
2
3책
 
1
Other values (11)
11 

Length

Max length3
Median length2
Mean length2.0606061
Min length2

Unique

Unique12 ?
Unique (%)18.2%

Sample

1st row3책
2nd row5책
3rd row1책
4th row1책
5th row4책

Common Values

ValueCountFrequency (%)
1점 33
50.0%
1책 16
24.2%
2점 3
 
4.5%
5책 2
 
3.0%
3책 1
 
1.5%
4책 1
 
1.5%
30책 1
 
1.5%
6장 1
 
1.5%
7점 1
 
1.5%
3점 1
 
1.5%
Other values (6) 6
 
9.1%

Length

2023-12-12T17:41:24.321878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1점 33
50.0%
1책 16
24.2%
2점 3
 
4.5%
5책 2
 
3.0%
3책 1
 
1.5%
4책 1
 
1.5%
30책 1
 
1.5%
6장 1
 
1.5%
7점 1
 
1.5%
3점 1
 
1.5%
Other values (6) 6
 
9.1%

가입금액
Real number (ℝ)

HIGH CORRELATION 

Distinct36
Distinct (%)54.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean992196.97
Minimum80000
Maximum8330000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size726.0 B
2023-12-12T17:41:24.466838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum80000
5-th percentile100000
Q1300000
median515000
Q31222500
95-th percentile3390000
Maximum8330000
Range8250000
Interquartile range (IQR)922500

Descriptive statistics

Standard deviation1327695.6
Coefficient of variation (CV)1.3381371
Kurtosis14.522698
Mean992196.97
Median Absolute Deviation (MAD)415000
Skewness3.3301453
Sum65485000
Variance1.7627755 × 1012
MonotonicityNot monotonic
2023-12-12T17:41:24.616331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
300000 12
18.2%
100000 10
 
15.2%
1230000 3
 
4.5%
430000 2
 
3.0%
500000 2
 
3.0%
2000000 2
 
3.0%
1200000 2
 
3.0%
1000000 2
 
3.0%
600000 2
 
3.0%
850000 2
 
3.0%
Other values (26) 27
40.9%
ValueCountFrequency (%)
80000 1
 
1.5%
100000 10
15.2%
150000 2
 
3.0%
200000 1
 
1.5%
280000 1
 
1.5%
300000 12
18.2%
330000 1
 
1.5%
400000 1
 
1.5%
430000 2
 
3.0%
500000 2
 
3.0%
ValueCountFrequency (%)
8330000 1
1.5%
4500000 1
1.5%
4330000 1
1.5%
3670000 1
1.5%
2550000 1
1.5%
2500000 1
1.5%
2000000 2
3.0%
1930000 1
1.5%
1850000 1
1.5%
1830000 1
1.5%

Interactions

2023-12-12T17:41:22.625736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:41:22.454221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:41:22.714913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:41:22.541851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:41:24.729494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호유물명수량가입금액
번호1.0000.9360.5190.422
유물명0.9361.0001.0001.000
수량0.5191.0001.0000.819
가입금액0.4221.0000.8191.000
2023-12-12T17:41:24.834204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호가입금액수량
번호1.000-0.5620.203
가입금액-0.5621.0000.524
수량0.2030.5241.000

Missing values

2023-12-12T17:41:22.819875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:41:22.924059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호유물명수량가입금액
01포은집3책670000
12포은선생문집5책1500000
23성인록1책1830000
34포은선생집1책970000
45정암선생문집4책870000
56정암집5책1200000
67국조유선록(조광조편)1책1400000
78번암집30책8330000
89고문서(이지시)6장2550000
910시과지,호패7점1230000
번호유물명수량가입금액
56571960년대 용인시가지 전경(항공사진)1점300000
5758구형 전화기1점100000
5859비상용 랜턴1점150000
5960공인(3), 고무인(8), 인장함(1)12점500000
60614벌식 타자기1점500000
6162다이얼 전화기1점100000
6263전자 타자기1점300000
6364용인군지1책300000
6465제35회 용인군 통계연보1책100000
6566제1회 용인시 사회통계조사1책100000