Overview

Dataset statistics

Number of variables: 7
Number of observations: 8448
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 486.9 KiB
Average record size in memory: 59.0 B
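The average record size follows from the two figures listed above. A minimal sketch of that arithmetic, assuming the report divides total memory by the number of observations (the variable names are illustrative, not taken from the tool):

```python
# Values copied from the dataset statistics above.
total_kib = 486.9        # total size in memory, in KiB
n_observations = 8448    # number of rows

# 1 KiB is 1024 bytes; divide the total by the row count.
avg_record_bytes = total_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 59.0 B shown in the report.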

Variable types

Categorical: 4
Numeric: 3

Dataset

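Description: The Ministry of Health and Welfare provides information on organ donation status (sex, age, blood type, province/city, donation year and month, donation type).
Author: Ministry of Health and Welfare (보건복지부)
URL: https://www.data.go.kr/data/15075223/fileData.do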

Reproduction

Analysis started: 2023-12-23 06:55:52.783393
Analysis finished: 2023-12-23 06:56:02.153366
Duration: 9.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
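The reported duration can be checked against the two timestamps. A small sketch using only the standard library, assuming the duration is simply the difference between the finish and start times:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-23 06:55:52.783393")
finished = datetime.fromisoformat("2023-12-23 06:56:02.153366")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```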

Variables

성별 (sex)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 66.1 KiB
남자 (male): 4651
여자 (female): 3797

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남자 (male)
2nd row: 여자 (female)
3rd row: 여자 (female)
4th row: 여자 (female)
5th row: 남자 (male)

Common Values

Value            Count  Frequency (%)
남자 (male)       4651   55.1%
여자 (female)     3797   44.9%
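The frequency column is each count divided by the total number of observations. A minimal sketch of that calculation for the 성별 counts above (rounding to one decimal place is an assumption based on how the report displays the values):

```python
# Counts copied from the Common Values table for 성별.
counts = {"남자": 4651, "여자": 3797}

total = sum(counts.values())
# Share of each value as a percentage of all rows, rounded to one decimal.
freq_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}
print(total, freq_pct)
```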

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

연령 (age)
Real number (ℝ)

Distinct: 87
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44.472064
Minimum: 0
Maximum: 86
Zeros: 12
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 74.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 20
Q1: 32
Median: 46
Q3: 57
95-th percentile: 67
Maximum: 86
Range: 86
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 15.310014
Coefficient of variation (CV): 0.34426137
Kurtosis: -0.75437092
Mean: 44.472064
Median Absolute Deviation (MAD): 12
Skewness: -0.14029131
Sum: 375700
Variance: 234.39652
Monotonicity: Not monotonic
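Several of the 연령 statistics are derived from one another. The sketch below recomputes them from the reported figures, assuming the usual definitions (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1); the variable names are illustrative:

```python
# Figures copied from the report for 연령 (age).
n = 8448
total = 375700
std = 15.310014
q1, q3 = 32, 57
minimum, maximum = 0, 86

mean = total / n              # arithmetic mean
variance = std ** 2           # variance is the squared standard deviation
cv = std / mean               # coefficient of variation
iqr = q3 - q1                 # interquartile range
value_range = maximum - minimum

print(mean, variance, cv, iqr, value_range)
```

Each result agrees with the corresponding line in the report to the displayed precision.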
[Figure: Histogram with fixed size bins (bins=50)]
Common Values

Value              Count  Frequency (%)
56                   211   2.5%
50                   208   2.5%
52                   206   2.4%
58                   205   2.4%
51                   204   2.4%
49                   200   2.4%
48                   196   2.3%
53                   196   2.3%
47                   194   2.3%
59                   193   2.3%
Other values (77)   6435  76.2%

Minimum Values

Value  Count  Frequency (%)
0         12   0.1%
1          7   0.1%
2          3   < 0.1%
3          2   < 0.1%
4          5   0.1%
5          5   0.1%
6          5   0.1%
7          4   < 0.1%
8          1   < 0.1%
9          6   0.1%

Maximum Values

Value  Count  Frequency (%)
86         2   < 0.1%
85         1   < 0.1%
84         2   < 0.1%
83         2   < 0.1%
82         7   0.1%
81        10   0.1%
80         9   0.1%
79         6   0.1%
78        11   0.1%
77        19   0.2%

혈액형 (blood type)
Categorical

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 66.1 KiB
A: 2582
O: 2454
B: 2239
AB: 1173

Length

Max length: 2
Median length: 1
Mean length: 1.1388494
Min length: 1
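The length statistics can be reproduced from the value counts alone, since only the "AB" label has two characters. A minimal sketch, assuming length means the number of characters in each label:

```python
# Counts copied from the report for 혈액형 (blood type).
counts = {"A": 2582, "O": 2454, "B": 2239, "AB": 1173}

total = sum(counts.values())
# Weighted mean of label lengths: each label's length times its count.
mean_length = sum(len(label) * n for label, n in counts.items()) / total
max_length = max(len(label) for label in counts)
min_length = min(len(label) for label in counts)

print(mean_length, max_length, min_length)
```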

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: B
2nd row: O
3rd row: O
4th row: O
5th row: A

Common Values

Value  Count  Frequency (%)
A       2582   30.6%
O       2454   29.0%
B       2239   26.5%
AB      1173   13.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

시도 (province/city)
Categorical

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 66.1 KiB
서울 (Seoul): 2914
경기 (Gyeonggi): 1363
대구 (Daegu): 938
부산 (Busan): 878
인천 (Incheon): 411
Other values (12): 1944

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%
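The report distinguishes "Distinct" from "Unique": 시도 has 17 distinct values but only 1 unique value. The figures are consistent with "unique" meaning a value that occurs exactly once, which is the assumption behind the sketch below. The function name and the sample list are hypothetical and not taken from the dataset:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Hypothetical toy column: "X" and "Y" repeat, "Z" appears once.
sample = ["X", "X", "Y", "Y", "Z"]
result = distinct_and_unique(sample)
print(result)
```

Under this reading, a column where every value repeats (such as 성별) has zero unique values even though it has two distinct ones.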

Sample

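1st row: 서울 (Seoul)
2nd row: 강원 (Gangwon)
3rd row: 대전 (Daejeon)
4th row: 서울 (Seoul)
5th row: 인천 (Incheon)

Common Values

Value               Count  Frequency (%)
서울 (Seoul)         2914   34.5%
경기 (Gyeonggi)      1363   16.1%
대구 (Daegu)          938   11.1%
부산 (Busan)          878   10.4%
인천 (Incheon)        411    4.9%
경남 (Gyeongnam)      379    4.5%
울산 (Ulsan)          272    3.2%
광주 (Gwangju)        258    3.1%
전북 (Jeonbuk)        253    3.0%
강원 (Gangwon)        237    2.8%
Other values (7)      545    6.5%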

Length

[Figure: Histogram of lengths of the category]

기증연 (donation year)
Real number (ℝ)

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.4503
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 74.4 KiB

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2018
Median: 2019
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.6913179
Coefficient of variation (CV): 0.00083751402
Kurtosis: -1.2395396
Mean: 2019.4503
Median Absolute Deviation (MAD): 1
Skewness: 0.030591739
Sum: 17060316
Variance: 2.8605563
Monotonicity: Increasing
[Figure: Histogram with fixed size bins (bins=6)]
Common Values

Value  Count  Frequency (%)
2019    1479   17.5%
2017    1449   17.2%
2020    1439   17.0%
2018    1401   16.6%
2021    1376   16.3%
2022    1304   15.4%

Minimum Values

Value  Count  Frequency (%)
2017    1449   17.2%
2018    1401   16.6%
2019    1479   17.5%
2020    1439   17.0%
2021    1376   16.3%
2022    1304   15.4%

Maximum Values

Value  Count  Frequency (%)
2022    1304   15.4%
2021    1376   16.3%
2020    1439   17.0%
2019    1479   17.5%
2018    1401   16.6%
2017    1449   17.2%

기증형태 (donation type)
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 66.1 KiB
생존 (living donor): 5927
뇌사 (brain death): 2519
NHBD: 2

Length

Max length: 4
Median length: 2
Mean length: 2.0004735
Min length: 2

Unique

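Unique: 0
Unique (%): 0.0%

Sample

1st row: 뇌사 (brain death)
2nd row: 뇌사 (brain death)
3rd row: 뇌사 (brain death)
4th row: 뇌사 (brain death)
5th row: 뇌사 (brain death)

Common Values

Value                 Count  Frequency (%)
생존 (living donor)    5927   70.2%
뇌사 (brain death)     2519   29.8%
NHBD                      2   < 0.1%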

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

건수 (count)
Real number (ℝ)

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.1110322
Minimum: 1
Maximum: 18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 74.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 7.65
Maximum: 18
Range: 17
Interquartile range (IQR): 1
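The 95-th percentile of 7.65 is not an observed value, which suggests interpolation between neighbouring observations. The sketch below reproduces it from the value counts listed for 건수 in this report, assuming linear interpolation between order statistics (the default in NumPy and pandas; whether the report used exactly this method is an assumption). The function name is illustrative:

```python
# Value counts for 건수, taken from the frequency tables in this report.
counts = {1: 5643, 2: 1050, 3: 443, 4: 303, 5: 235, 6: 195, 7: 156, 8: 135,
          9: 100, 10: 72, 11: 47, 12: 32, 13: 20, 14: 8, 15: 6, 16: 2, 18: 1}

def quantile_from_counts(counts, q):
    """Quantile of the data described by {value: count}, linearly interpolated."""
    items = sorted(counts.items())
    n = sum(c for _, c in items)
    pos = q * (n - 1)            # fractional 0-based rank in the sorted data
    lo = int(pos)                # rank just below the position
    hi = min(lo + 1, n - 1)      # rank just above, clamped to the last row
    frac = pos - lo

    def value_at(rank):
        # Walk the cumulative counts until the rank falls inside a group.
        seen = 0
        for value, count in items:
            seen += count
            if rank < seen:
                return value
        raise IndexError(rank)

    v_lo, v_hi = value_at(lo), value_at(hi)
    return v_lo + frac * (v_hi - v_lo)

p95 = quantile_from_counts(counts, 0.95)
print(round(p95, 2))
```

With 8448 rows the 95% position is rank 8024.65; rank 8024 holds the last 7 and rank 8025 the first 8, giving 7 + 0.65 = 7.65.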

Descriptive statistics

Standard deviation: 2.2545969
Coefficient of variation (CV): 1.0680069
Kurtosis: 6.9642352
Mean: 2.1110322
Median Absolute Deviation (MAD): 0
Skewness: 2.5937175
Sum: 17834
Variance: 5.0832073
Monotonicity: Not monotonic
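The sum, mean, variance, standard deviation and CV can all be recomputed from the value counts for 건수 listed in this report. The sketch below assumes the sample variance (divisor n - 1), which matches the reported figure; the variable names are illustrative:

```python
import math

# Value counts for 건수, taken from the frequency tables in this report.
counts = {1: 5643, 2: 1050, 3: 443, 4: 303, 5: 235, 6: 195, 7: 156, 8: 135,
          9: 100, 10: 72, 11: 47, 12: 32, 13: 20, 14: 8, 15: 6, 16: 2, 18: 1}

n = sum(counts.values())
total = sum(value * c for value, c in counts.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (value - mean) ** 2 for value, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean

print(n, total, mean, variance, std, cv)
```

A CV above 1 together with a median of 1 reflects the long right tail: two thirds of the rows have a count of 1, while a few reach 18.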
[Figure: Histogram with fixed size bins (bins=17)]
Common Values

Value             Count  Frequency (%)
1                  5643   66.8%
2                  1050   12.4%
3                   443    5.2%
4                   303    3.6%
5                   235    2.8%
6                   195    2.3%
7                   156    1.8%
8                   135    1.6%
9                   100    1.2%
10                   72    0.9%
Other values (7)    116    1.4%

Minimum Values

Value  Count  Frequency (%)
1       5643   66.8%
2       1050   12.4%
3        443    5.2%
4        303    3.6%
5        235    2.8%
6        195    2.3%
7        156    1.8%
8        135    1.6%
9        100    1.2%
10        72    0.9%

Maximum Values

Value  Count  Frequency (%)
18         1   < 0.1%
16         2   < 0.1%
15         6   0.1%
14         8   0.1%
13        20   0.2%
12        32   0.4%
11        47   0.6%
10        72   0.9%
9        100   1.2%
8        135   1.6%

Interactions

[Figures: 9 interaction plots]

Correlations

[Figure: correlation heatmap]

            성별    연령    혈액형  시도    기증연  기증형태  건수
성별        1.000   0.144   0.014   0.075   0.000   0.080     0.052
연령        0.144   1.000   0.000   0.123   0.000   0.299     0.185
혈액형      0.014   0.000   1.000   0.110   0.000   0.018     0.121
시도        0.075   0.123   0.110   1.000   0.065   0.391     0.474
기증연      0.000   0.000   0.000   0.065   1.000   0.000     0.012
기증형태    0.080   0.299   0.018   0.391   0.000   1.000     0.350
건수        0.052   0.185   0.121   0.474   0.012   0.350     1.000
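A quick way to read the first correlation matrix above is to check that it is symmetric and to pick out the strongest off-diagonal pair. The sketch below does both with the values from that matrix; the helper name is illustrative, and the report does not state which correlation measure produced these numbers:

```python
# Variable order and values copied from the first correlation matrix.
names = ["성별", "연령", "혈액형", "시도", "기증연", "기증형태", "건수"]
matrix = [
    [1.000, 0.144, 0.014, 0.075, 0.000, 0.080, 0.052],
    [0.144, 1.000, 0.000, 0.123, 0.000, 0.299, 0.185],
    [0.014, 0.000, 1.000, 0.110, 0.000, 0.018, 0.121],
    [0.075, 0.123, 0.110, 1.000, 0.065, 0.391, 0.474],
    [0.000, 0.000, 0.000, 0.065, 1.000, 0.000, 0.012],
    [0.080, 0.299, 0.018, 0.391, 0.000, 1.000, 0.350],
    [0.052, 0.185, 0.121, 0.474, 0.012, 0.350, 1.000],
]

def strongest_pair(names, matrix):
    """Return (|value|, name_i, name_j) for the largest off-diagonal entry."""
    best = None
    for i in range(len(names)):
        for j in range(i + 1, len(names)):   # upper triangle only
            score = abs(matrix[i][j])
            if best is None or score > best[0]:
                best = (score, names[i], names[j])
    return best

is_symmetric = all(
    matrix[i][j] == matrix[j][i]
    for i in range(len(names)) for j in range(len(names))
)
strongest = strongest_pair(names, matrix)
print(is_symmetric, strongest)
```

The strongest association in this matrix is between 시도 (province/city) and 건수 (count), at 0.474.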
[Figure: correlation heatmap]

            시도    기증형태  혈액형  성별
시도        1.000   0.229     0.061   0.067
기증형태    0.229   1.000     0.017   0.133
혈액형      0.061   0.017     1.000   0.009
성별        0.067   0.133     0.009   1.000
[Figure: correlation heatmap]

            연령    기증연  건수    성별    혈액형  시도    기증형태
연령        1.000   0.032   -0.112  0.110   0.000   0.048   0.187
기증연      0.032   1.000   0.024   0.000   0.000   0.029   0.013
건수        -0.112  0.024   1.000   0.040   0.072   0.205   0.225
성별        0.110   0.000   0.040   1.000   0.009   0.067   0.133
혈액형      0.000   0.000   0.072   0.009   1.000   0.061   0.017
시도        0.048   0.029   0.205   0.067   0.061   1.000   0.229
기증형태    0.187   0.013   0.225   0.133   0.017   0.229   1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

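Columns: 성별 (sex), 연령 (age), 혈액형 (blood type), 시도 (province/city), 기증연 (donation year), 기증형태 (donation type), 건수 (count).
Values: 남자 = male, 여자 = female, 뇌사 = brain death; 시도 values are region names (서울 = Seoul, 강원 = Gangwon, 대전 = Daejeon, 인천 = Incheon, 대구 = Daegu, 경남 = Gyeongnam, 충남 = Chungnam, 부산 = Busan, 울산 = Ulsan, 경기 = Gyeonggi).

First rows

Index  성별  연령  혈액형  시도  기증연  기증형태  건수
0      남자  0     B       서울  2017    뇌사      1
1      여자  0     O       강원  2017    뇌사      1
2      여자  0     O       대전  2017    뇌사      1
3      여자  0     O       서울  2017    뇌사      1
4      남자  1     A       인천  2017    뇌사      1
5      남자  1     AB      대구  2017    뇌사      1
6      남자  1     B       경남  2017    뇌사      1
7      여자  1     B       대구  2017    뇌사      1
8      여자  2     B       충남  2017    뇌사      1
9      남자  3     A       서울  2017    뇌사      1

Last rows

Index  성별  연령  혈액형  시도  기증연  기증형태  건수
8438   남자  80    O       서울  2022    뇌사      1
8439   여자  80    O       서울  2022    뇌사      1
8440   여자  81    A       부산  2022    뇌사      1
8441   여자  81    A       서울  2022    뇌사      1
8442   남자  81    B       울산  2022    뇌사      1
8443   여자  82    B       경기  2022    뇌사      1
8444   남자  82    B       서울  2022    뇌사      1
8445   남자  83    O       부산  2022    뇌사      1
8446   여자  83    O       서울  2022    뇌사      1
8447   여자  85    O       서울  2022    뇌사      1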