Overview

Dataset statistics

Number of variables5
Number of observations232
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.6 KiB
Average record size in memory42.6 B

Variable types

Categorical2
Text1
Numeric2

Dataset

Description보훈휴양원에서 개방하는 보훈휴양원 객실 평형별 보유 비품정보로 평형, 비품, 비품구분, 비품단가, 보유수량이 포함된 데이터입니다.
URLhttps://www.data.go.kr/data/15117118/fileData.do

Alerts

비품단가 has 18 (7.8%) zerosZeros

Reproduction

Analysis started2023-12-12 17:41:44.503542
Analysis finished2023-12-12 17:41:45.386985
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

평형
Categorical

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
12평
79 
23평
78 
15평
75 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row12평
2nd row12평
3rd row23평
4th row12평
5th row15평

Common Values

ValueCountFrequency (%)
12평 79
34.1%
23평 78
33.6%
15평 75
32.3%

Length

2023-12-13T02:41:45.473614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:41:45.607270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
12평 79
34.1%
23평 78
33.6%
15평 75
32.3%

비품
Text

Distinct91
Distinct (%)39.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T02:41:45.924552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length3.6767241
Min length1

Characters and Unicode

Total characters853
Distinct characters155
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)6.5%

Sample

1st row점보롤케이스
2nd row비누받침대
3rd row냉장고 음식받침대
4th row선풍기 날개
5th row선풍기 날개
ValueCountFrequency (%)
에어컨 6
 
2.5%
냉장고 6
 
2.5%
선풍기 6
 
2.5%
밥주걱 3
 
1.2%
수저세트 3
 
1.2%
쟁반 3
 
1.2%
가위 3
 
1.2%
냄비받침 3
 
1.2%
소주잔 3
 
1.2%
냄비(小 3
 
1.2%
Other values (81) 204
84.0%
2023-12-13T02:41:46.379732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
3.8%
( 24
 
2.8%
24
 
2.8%
) 24
 
2.8%
22
 
2.6%
20
 
2.3%
18
 
2.1%
15
 
1.8%
15
 
1.8%
15
 
1.8%
Other values (145) 644
75.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 784
91.9%
Open Punctuation 24
 
2.8%
Close Punctuation 24
 
2.8%
Space Separator 12
 
1.4%
Lowercase Letter 6
 
0.7%
Uppercase Letter 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
4.1%
24
 
3.1%
22
 
2.8%
20
 
2.6%
18
 
2.3%
15
 
1.9%
15
 
1.9%
15
 
1.9%
15
 
1.9%
13
 
1.7%
Other values (137) 595
75.9%
Lowercase Letter
ValueCountFrequency (%)
v 3
50.0%
t 3
50.0%
Uppercase Letter
ValueCountFrequency (%)
V 1
50.0%
T 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 776
91.0%
Common 61
 
7.2%
Han 8
 
0.9%
Latin 8
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
4.1%
24
 
3.1%
22
 
2.8%
20
 
2.6%
18
 
2.3%
15
 
1.9%
15
 
1.9%
15
 
1.9%
15
 
1.9%
13
 
1.7%
Other values (134) 587
75.6%
Common
ValueCountFrequency (%)
( 24
39.3%
) 24
39.3%
12
19.7%
. 1
 
1.6%
Latin
ValueCountFrequency (%)
v 3
37.5%
t 3
37.5%
V 1
 
12.5%
T 1
 
12.5%
Han
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 776
91.0%
ASCII 69
 
8.1%
CJK 8
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
4.1%
24
 
3.1%
22
 
2.8%
20
 
2.6%
18
 
2.3%
15
 
1.9%
15
 
1.9%
15
 
1.9%
15
 
1.9%
13
 
1.7%
Other values (134) 587
75.6%
ASCII
ValueCountFrequency (%)
( 24
34.8%
) 24
34.8%
12
17.4%
v 3
 
4.3%
t 3
 
4.3%
. 1
 
1.4%
V 1
 
1.4%
T 1
 
1.4%
CJK
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%

비품구분
Categorical

Distinct8
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
주방용품
109 
기타
42 
가전비품
32 
소모품
18 
가구비품
16 
Other values (3)
15 

Length

Max length4
Median length4
Mean length3.5474138
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row기타
3rd row가전비품
4th row가전비품
5th row가전비품

Common Values

ValueCountFrequency (%)
주방용품 109
47.0%
기타 42
 
18.1%
가전비품 32
 
13.8%
소모품 18
 
7.8%
가구비품 16
 
6.9%
침구비품 9
 
3.9%
커텐류 3
 
1.3%
조리기구 3
 
1.3%

Length

2023-12-13T02:41:46.541986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:41:46.690838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주방용품 109
47.0%
기타 42
 
18.1%
가전비품 32
 
13.8%
소모품 18
 
7.8%
가구비품 16
 
6.9%
침구비품 9
 
3.9%
커텐류 3
 
1.3%
조리기구 3
 
1.3%

비품단가
Real number (ℝ)

ZEROS 

Distinct41
Distinct (%)17.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50934.914
Minimum0
Maximum675000
Zeros18
Zeros (%)7.8%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-13T02:41:46.847859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12000
median5000
Q325000
95-th percentile332700
Maximum675000
Range675000
Interquartile range (IQR)23000

Descriptive statistics

Standard deviation124192.55
Coefficient of variation (CV)2.4382597
Kurtosis11.230397
Mean50934.914
Median Absolute Deviation (MAD)5000
Skewness3.3267837
Sum11816900
Variance1.5423788 × 1010
MonotonicityNot monotonic
2023-12-13T02:41:47.031062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1000 32
13.8%
3000 23
 
9.9%
10000 21
 
9.1%
0 18
 
7.8%
5000 15
 
6.5%
2000 14
 
6.0%
4000 12
 
5.2%
25000 6
 
2.6%
40000 6
 
2.6%
20000 6
 
2.6%
Other values (31) 79
34.1%
ValueCountFrequency (%)
0 18
7.8%
1000 32
13.8%
2000 14
6.0%
2300 3
 
1.3%
3000 23
9.9%
4000 12
 
5.2%
5000 15
6.5%
6000 3
 
1.3%
7000 3
 
1.3%
8000 6
 
2.6%
ValueCountFrequency (%)
675000 1
 
0.4%
655000 3
1.3%
570000 1
 
0.4%
462000 2
0.9%
410000 1
 
0.4%
399000 1
 
0.4%
336000 3
1.3%
330000 1
 
0.4%
290000 2
0.9%
285000 3
1.3%

보유수량
Real number (ℝ)

Distinct6
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.75
Minimum1
Maximum20
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-13T02:41:47.181133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile5
Maximum20
Range19
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.3590115
Coefficient of variation (CV)1.3480065
Kurtosis44.95064
Mean1.75
Median Absolute Deviation (MAD)0
Skewness6.2200492
Sum406
Variance5.5649351
MonotonicityNot monotonic
2023-12-13T02:41:47.327514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 176
75.9%
2 19
 
8.2%
3 18
 
7.8%
5 14
 
6.0%
20 3
 
1.3%
4 2
 
0.9%
ValueCountFrequency (%)
1 176
75.9%
2 19
 
8.2%
3 18
 
7.8%
4 2
 
0.9%
5 14
 
6.0%
20 3
 
1.3%
ValueCountFrequency (%)
20 3
 
1.3%
5 14
 
6.0%
4 2
 
0.9%
3 18
 
7.8%
2 19
 
8.2%
1 176
75.9%

Interactions

2023-12-13T02:41:44.970644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:44.764085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:45.094046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:41:44.868732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:41:47.446819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평형비품비품구분비품단가보유수량
평형1.0000.0000.0000.0000.309
비품0.0001.0000.9990.9870.836
비품구분0.0000.9991.0000.7690.354
비품단가0.0000.9870.7691.0000.000
보유수량0.3090.8360.3540.0001.000
2023-12-13T02:41:47.561645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평형비품구분
평형1.0000.000
비품구분0.0001.000
2023-12-13T02:41:47.660363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비품단가보유수량평형비품구분
비품단가1.000-0.1640.0000.355
보유수량-0.1641.0000.2970.162
평형0.0000.2971.0000.000
비품구분0.3550.1620.0001.000

Missing values

2023-12-13T02:41:45.222734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:41:45.333862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

평형비품비품구분비품단가보유수량
012평점보롤케이스기타100001
112평비누받침대기타10001
223평냉장고 음식받침대가전비품100003
312평선풍기 날개가전비품50001
415평선풍기 날개가전비품50001
523평선풍기 날개가전비품50001
612평유리출입문기타2000001
723평접시(중)주방용품40001
823평접시(소)주방용품30001
915평접시(중)주방용품40001
평형비품비품구분비품단가보유수량
22215평드라이기가전비품250001
22323평드라이기가전비품250001
22423평화장대거울가전비품350001
22512평에어컨 리모컨가전비품290001
22615평에어컨 리모컨가전비품290001
22723평에어컨 리모컨가전비품290001
22823평변기커버기타100001
22915평변기커버기타100001
23012평변기커버기타100001
23123평침대가구비품4100001