Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 732.4 KiB
Average record size in memory: 75.0 B
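
The two memory figures are consistent with each other: dividing the total size by the number of observations gives the average record size. A minimal check, assuming 1 KiB = 1024 bytes:

```python
# Figures taken from the dataset statistics.
total_kib = 732.4
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes.
bytes_per_record = total_kib * 1024 / n_observations
rounded = round(bytes_per_record, 1)
print(rounded)
```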

Variable types

Categorical: 6
Numeric: 2

Dataset

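Description: The Ministry of Health and Welfare provides information on organ transplant status, broken down by sex, age, blood type, province/city, donation year, donation type, donated organ, and number of cases.
Author: Ministry of Health and Welfare (보건복지부)
URL: https://www.data.go.kr/data/15075224/fileData.do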

Reproduction

Analysis started: 2023-12-12 13:39:01.999224
Analysis finished: 2023-12-12 13:39:03.461335
Duration: 1.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
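
The report does not include the command that generated it. The following is a minimal sketch of how a comparable report could be produced with ydata-profiling, assuming the CSV has been downloaded from the dataset URL. The function name `build_report`, the file paths, the `cp949` encoding, and the report title are all assumptions, and the settings stored in config.json are not reproduced here.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write the result as an HTML report.

    Assumptions: the encoding default and the report title are
    illustrative and are not taken from the original report.
    """
    # Imported lazily so the function can be defined even when the
    # libraries are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Organ transplant profile")
    report.to_file(html_path)
    return report
```

A call such as `build_report("transplants.csv", "report.html")` would then write the report, where both file names are hypothetical.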

Variables

성별 (sex)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
남자: 6033
여자: 3967

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남자
2nd row: 여자
3rd row: 여자
4th row: 남자
5th row: 남자

Common Values

Value | Count | Frequency (%)
남자 (male) | 6033 | 60.3%
여자 (female) | 3967 | 39.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; the counts match the Common Values table]

연령 (age)
Real number (ℝ)

Distinct: 81
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 49.3851
Minimum: 0
Maximum: 80
Zeros: 43
Zeros (%): 0.4%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 21
Q1: 42
Median: 52
Q3: 60
95th percentile: 68
Maximum: 80
Range: 80
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 14.515001
Coefficient of variation (CV): 0.29391459
Kurtosis: 1.0183714
Mean: 49.3851
Median Absolute Deviation (MAD): 9
Skewness: -0.97415951
Sum: 493851
Variance: 210.68527
Monotonicity: Not monotonic
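
Several of these figures are derived from others in the report: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the IQR and range are differences of quantiles. A small sketch that recomputes them from the reported values for 연령 (small differences in the last digits come from rounding in the report):

```python
# Reported values for 연령 (age).
mean = 49.3851
std = 14.515001
q1, q3 = 42, 60
minimum, maximum = 0, 80

cv = std / mean              # coefficient of variation
variance = std ** 2          # variance is the squared standard deviation
iqr = q3 - q1                # interquartile range
value_range = maximum - minimum

print(cv, variance, iqr, value_range)
```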
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
57 | 343 | 3.4%
56 | 333 | 3.3%
55 | 322 | 3.2%
58 | 317 | 3.2%
54 | 313 | 3.1%
61 | 307 | 3.1%
53 | 307 | 3.1%
60 | 305 | 3.0%
59 | 300 | 3.0%
50 | 297 | 3.0%
Other values (71) | 6856 | 68.6%

Minimum 10 values

Value | Count | Frequency (%)
0 | 43 | 0.4%
1 | 33 | 0.3%
2 | 29 | 0.3%
3 | 18 | 0.2%
4 | 11 | 0.1%
5 | 17 | 0.2%
6 | 14 | 0.1%
7 | 14 | 0.1%
8 | 16 | 0.2%
9 | 9 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
80 | 3 | < 0.1%
79 | 5 | 0.1%
78 | 4 | < 0.1%
77 | 12 | 0.1%
76 | 6 | 0.1%
75 | 20 | 0.2%
74 | 25 | 0.2%
73 | 57 | 0.6%
72 | 61 | 0.6%
71 | 68 | 0.7%

혈액형 (blood type)
Categorical

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
A: 3133
B: 2760
O: 2467
AB: 1640

Length

Max length: 2
Median length: 1
Mean length: 1.164
Min length: 1
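
The length statistics follow directly from the category counts, since only AB has two characters. A short sketch that recomputes the mean, maximum, and minimum length from the reported counts:

```python
# Reported counts per blood type.
counts = {"A": 3133, "B": 2760, "O": 2467, "AB": 1640}

total = sum(counts.values())
# Weight each label's character length by how often it occurs.
mean_length = sum(len(value) * n for value, n in counts.items()) / total
max_length = max(len(value) for value in counts)
min_length = min(len(value) for value in counts)

print(total, mean_length, max_length, min_length)
```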

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: O
2nd row: O
3rd row: O
4th row: B
5th row: O

Common Values

Value | Count | Frequency (%)
A | 3133 | 31.3%
B | 2760 | 27.6%
O | 2467 | 24.7%
AB | 1640 | 16.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; the counts match the Common Values table]

시도 (province/city)
Categorical

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
서울: 4804
경기: 1315
대구: 965
부산: 824
경남: 551
Other values (11): 1541

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 부산
2nd row: 인천
3rd row: 경기
4th row: 서울
5th row: 경기

Common Values

Value | Count | Frequency (%)
서울 (Seoul) | 4804 | 48.0%
경기 (Gyeonggi) | 1315 | 13.2%
대구 (Daegu) | 965 | 9.7%
부산 (Busan) | 824 | 8.2%
경남 (Gyeongnam) | 551 | 5.5%
인천 (Incheon) | 360 | 3.6%
광주 (Gwangju) | 272 | 2.7%
울산 (Ulsan) | 227 | 2.3%
대전 (Daejeon) | 203 | 2.0%
전북 (Jeonbuk) | 181 | 1.8%
Other values (6) | 298 | 3.0%
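
The "Other values" rows are what remains after the top categories are listed. A small sketch, using the reported counts, that recomputes the bucket for the top-5 summary and the top-10 table; the helper name `other_bucket` is an illustration, not part of ydata-profiling:

```python
# Reported counts for the ten most frequent regions.
counts = {
    "서울": 4804, "경기": 1315, "대구": 965, "부산": 824, "경남": 551,
    "인천": 360, "광주": 272, "울산": 227, "대전": 203, "전북": 181,
}
n_observations = 10_000
n_distinct = 16

def other_bucket(counts, k, n_observations, n_distinct):
    """Return (number of remaining categories, their combined count)."""
    top = sorted(counts.values(), reverse=True)[:k]
    return n_distinct - k, n_observations - sum(top)

top5_other = other_bucket(counts, 5, n_observations, n_distinct)
top10_other = other_bucket(counts, 10, n_observations, n_distinct)
print(top5_other, top10_other)
```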

Length

[Figure: Histogram of lengths of the category]

이식연 (transplant year)
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
2019: 2117
2017: 2008
2020: 1987
2021: 1949
2018: 1939

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2019
3rd row: 2021
4th row: 2018
5th row: 2020

Common Values

Value | Count | Frequency (%)
2019 | 2117 | 21.2%
2017 | 2008 | 20.1%
2020 | 1987 | 19.9%
2021 | 1949 | 19.5%
2018 | 1939 | 19.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; the counts match the Common Values table]

기증형태 (donation type)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
생존: 5035
뇌사: 4965

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 뇌사
2nd row: 생존
3rd row: 생존
4th row: 생존
5th row: 생존

Common Values

Value | Count | Frequency (%)
생존 (living donor) | 5035 | 50.3%
뇌사 (brain death) | 4965 | 49.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; the counts match the Common Values table]

장기 (organ)
Categorical

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
신장: 5471
간장: 3274
심장: 635
폐장: 442
췌장: 174
Other values (3): 4

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 신장
2nd row: 간장
3rd row: 간장
4th row: 간장
5th row: 간장

Common Values

Value | Count | Frequency (%)
신장 (kidney) | 5471 | 54.7%
간장 (liver) | 3274 | 32.7%
심장 (heart) | 635 | 6.3%
폐장 (lung) | 442 | 4.4%
췌장 (pancreas) | 174 | 1.7%
소장 (small intestine) | 2 | < 0.1%
췌도 (pancreatic islet) | 1 | < 0.1%
팔( | 1 | < 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; the counts match the Common Values table]

건수 (number of cases)
Real number (ℝ)

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6239
Minimum: 1
Maximum: 16
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 1
Median: 1
Q3: 2
95th percentile: 4
Maximum: 16
Range: 15
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.4539808
Coefficient of variation (CV): 0.89536351
Kurtosis: 20.130197
Mean: 1.6239
Median Absolute Deviation (MAD): 0
Skewness: 3.9004296
Sum: 16239
Variance: 2.1140602
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=16)]

Common values

Value | Count | Frequency (%)
1 | 7169 | 71.7%
2 | 1530 | 15.3%
3 | 562 | 5.6%
4 | 266 | 2.7%
5 | 162 | 1.6%
6 | 109 | 1.1%
7 | 71 | 0.7%
8 | 46 | 0.5%
9 | 28 | 0.3%
10 | 15 | 0.1%
Other values (6) | 42 | 0.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 7169 | 71.7%
2 | 1530 | 15.3%
3 | 562 | 5.6%
4 | 266 | 2.7%
5 | 162 | 1.6%
6 | 109 | 1.1%
7 | 71 | 0.7%
8 | 46 | 0.5%
9 | 28 | 0.3%
10 | 15 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
16 | 2 | < 0.1%
15 | 1 | < 0.1%
14 | 8 | 0.1%
13 | 9 | 0.1%
12 | 11 | 0.1%
11 | 11 | 0.1%
10 | 15 | 0.1%
9 | 28 | 0.3%
8 | 46 | 0.5%
7 | 71 | 0.7%
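
Because 건수 has only 16 distinct values and the value tables cover all of them, the summary statistics can be recomputed from the frequency table alone. A sketch under one assumption: the reported variance is the sample variance (divisor n - 1), which is the pandas default.

```python
import math

# Value -> count, combined from the common-value and extreme-value tables.
freq = {1: 7169, 2: 1530, 3: 562, 4: 266, 5: 162, 6: 109, 7: 71, 8: 46,
        9: 28, 10: 15, 11: 11, 12: 11, 13: 9, 14: 8, 15: 1, 16: 2}

n = sum(freq.values())
total = sum(value * count for value, count in freq.items())
mean = total / n

# Assumption: sample variance with divisor n - 1.
variance = sum(count * (value - mean) ** 2 for value, count in freq.items()) / (n - 1)
std = math.sqrt(variance)

print(n, total, mean, variance, std)
```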

Interactions

[Figure: four interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

Variable | 성별 | 연령 | 혈액형 | 시도 | 이식연 | 기증형태 | 장기 | 건수
성별 | 1.000 | 0.068 | 0.012 | 0.079 | 0.000 | 0.070 | 0.058 | 0.121
연령 | 0.068 | 1.000 | 0.014 | 0.187 | 0.069 | 0.117 | 0.182 | 0.170
혈액형 | 0.012 | 0.014 | 1.000 | 0.075 | 0.000 | 0.044 | 0.000 | 0.098
시도 | 0.079 | 0.187 | 0.075 | 1.000 | 0.054 | 0.215 | 0.345 | 0.276
이식연 | 0.000 | 0.069 | 0.000 | 0.054 | 1.000 | 0.019 | 0.027 | 0.054
기증형태 | 0.070 | 0.117 | 0.044 | 0.215 | 0.019 | 1.000 | 0.513 | 0.327
장기 | 0.058 | 0.182 | 0.000 | 0.345 | 0.027 | 0.513 | 1.000 | 0.106
건수 | 0.121 | 0.170 | 0.098 | 0.276 | 0.054 | 0.327 | 0.106 | 1.000

[Figure: correlation heatmap]

Variable | 성별 | 장기 | 이식연 | 기증형태 | 혈액형 | 시도
성별 | 1.000 | 0.043 | 0.000 | 0.045 | 0.008 | 0.062
장기 | 0.043 | 1.000 | 0.017 | 0.386 | 0.000 | 0.127
이식연 | 0.000 | 0.017 | 1.000 | 0.023 | 0.000 | 0.028
기증형태 | 0.045 | 0.386 | 0.023 | 1.000 | 0.029 | 0.169
혈액형 | 0.008 | 0.000 | 0.000 | 0.029 | 1.000 | 0.035
시도 | 0.062 | 0.127 | 0.028 | 0.169 | 0.035 | 1.000

[Figure: correlation heatmap]

Variable | 연령 | 건수 | 성별 | 혈액형 | 시도 | 이식연 | 기증형태 | 장기
연령 | 1.000 | 0.097 | 0.052 | 0.000 | 0.074 | 0.027 | 0.088 | 0.087
건수 | 0.097 | 1.000 | 0.093 | 0.056 | 0.111 | 0.020 | 0.247 | 0.050
성별 | 0.052 | 0.093 | 1.000 | 0.008 | 0.062 | 0.000 | 0.045 | 0.043
혈액형 | 0.000 | 0.056 | 0.008 | 1.000 | 0.035 | 0.000 | 0.029 | 0.000
시도 | 0.074 | 0.111 | 0.062 | 0.035 | 1.000 | 0.028 | 0.169 | 0.127
이식연 | 0.027 | 0.020 | 0.000 | 0.000 | 0.028 | 1.000 | 0.023 | 0.017
기증형태 | 0.088 | 0.247 | 0.045 | 0.029 | 0.169 | 0.023 | 1.000 | 0.386
장기 | 0.087 | 0.050 | 0.043 | 0.000 | 0.127 | 0.017 | 0.386 | 1.000
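
A correlation matrix of this kind is expected to be symmetric with ones on the diagonal, which gives a quick sanity check on the extracted values. The sketch below applies that check to the six-variable matrix and picks out its strongest off-diagonal pair; the helper names are illustrative and not part of ydata-profiling.

```python
labels = ["성별", "장기", "이식연", "기증형태", "혈액형", "시도"]
matrix = [
    [1.000, 0.043, 0.000, 0.045, 0.008, 0.062],
    [0.043, 1.000, 0.017, 0.386, 0.000, 0.127],
    [0.000, 0.017, 1.000, 0.023, 0.000, 0.028],
    [0.045, 0.386, 0.023, 1.000, 0.029, 0.169],
    [0.008, 0.000, 0.000, 0.029, 1.000, 0.035],
    [0.062, 0.127, 0.028, 0.169, 0.035, 1.000],
]

def is_symmetric_with_unit_diagonal(m, tol=1e-9):
    """True if m is square, symmetric, and has ones on the diagonal."""
    n = len(m)
    if any(len(row) != n for row in m):
        return False
    return all(
        abs(m[i][j] - m[j][i]) <= tol and (i != j or abs(m[i][i] - 1.0) <= tol)
        for i in range(n) for j in range(n)
    )

def strongest_pair(names, m):
    """Return (name_a, name_b, value) for the largest off-diagonal entry."""
    value, a, b = max(
        (m[i][j], names[i], names[j])
        for i in range(len(m)) for j in range(i + 1, len(m))
    )
    return a, b, value

ok = is_symmetric_with_unit_diagonal(matrix)
pair = strongest_pair(labels, matrix)
print(ok, pair)
```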

Missing values

[Figure: nullity bar chart] A simple visualization of nullity by column.
[Figure: nullity matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

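Index | 성별 | 연령 | 혈액형 | 시도 | 이식연 | 기증형태 | 장기 | 건수
9387 | 남자 | 58 | O | 부산 | 2020 | 뇌사 | 신장 | 2
6819 | 여자 | 57 | O | 인천 | 2019 | 생존 | 간장 | 1
11838 | 여자 | 58 | O | 경기 | 2021 | 생존 | 간장 | 1
4439 | 남자 | 60 | B | 서울 | 2018 | 생존 | 간장 | 4
8972 | 남자 | 53 | O | 경기 | 2020 | 생존 | 간장 | 2
2780 | 여자 | 32 | A | 대구 | 2018 | 생존 | 신장 | 1
11414 | 남자 | 53 | B | 대구 | 2021 | 뇌사 | 간장 | 1
5867 | 남자 | 45 | A | 광주 | 2019 | 뇌사 | 신장 | 1
1070 | 남자 | 48 | O | 서울 | 2017 | 생존 | 간장 | 5
8408 | 남자 | 44 | O | 인천 | 2020 | 뇌사 | 신장 | 1

Index | 성별 | 연령 | 혈액형 | 시도 | 이식연 | 기증형태 | 장기 | 건수
1733 | 남자 | 57 | A | 대전 | 2017 | 뇌사 | 간장 | 1
6481 | 남자 | 53 | O | 대구 | 2019 | 생존 | 간장 | 2
8378 | 남자 | 44 | AB | 경기 | 2020 | 뇌사 | 간장 | 1
5490 | 여자 | 36 | O | 부산 | 2019 | 뇌사 | 신장 | 1
8863 | 남자 | 52 | A | 서울 | 2020 | 생존 | 간장 | 8
5385 | 남자 | 33 | B | 서울 | 2019 | 생존 | 신장 | 1
609 | 여자 | 41 | AB | 서울 | 2017 | 뇌사 | 신장 | 1
12383 | 남자 | 66 | B | 경기 | 2021 | 뇌사 | 신장 | 1
12439 | 여자 | 67 | B | 대구 | 2021 | 뇌사 | 신장 | 1
6318 | 남자 | 51 | O | 부산 | 2019 | 뇌사 | 신장 | 1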