Overview

Dataset statistics

Number of variables: 5
Number of observations: 138
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.2 KiB
Average record size in memory: 46.0 B
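The two memory figures are consistent with each other and with the per-variable memory sizes listed in the Variables section. A small check, assuming that KiB means 1024 bytes and that the rounded figures printed in the report are close enough to the true values:

```python
# Rounded figures copied from the report; KiB is assumed to mean 1024 bytes.
TOTAL_KIB = 6.2
N_ROWS = 138
# Memory size of each variable, in report order (신청년도 ... 신청건수).
PER_VARIABLE_KIB = [1.2, 1.3, 1.2, 1.2, 1.3]


def average_record_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row implied by the total in-memory size."""
    return total_kib * 1024 / n_rows


avg = average_record_bytes(TOTAL_KIB, N_ROWS)
per_variable_total = sum(PER_VARIABLE_KIB)

print(f"{avg:.1f} B per record")
print(f"{per_variable_total:.1f} KiB across variables")
```

Because the inputs are rounded to one decimal place, agreement to one decimal place is the most this check can show.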

Variable types

Categorical: 3
Numeric: 2

Dataset

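Description: □ Description of the '자료보관신청' (data retention application) dataset
o Content: status of data retention applications filed to reuse research outputs after the virtualization room usage period ends
o Scope: retention applications for research outputs that analyzed National Health Information data in 2021 and 2022
o Column information: 신청년도, 신청월, 보관심의결과, 보관개월수, 신청건수
- 신청년도: application year (year of the retention application date)
- 신청월: application month (month of the retention application date)
- 보관심의결과: retention review result (1: pending approval, 2: retention approved, 3: retention rejected)
- 보관개월수: retention months (number of months of retention requested)
- 신청건수: application count (number of retention applications per application year-month, review result, and retention months)
Author: 국민건강보험공단 (National Health Insurance Service)
URL: https://www.data.go.kr/data/15122147/fileData.do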
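The 보관심의결과 column stores numeric codes, so a lookup table makes the values readable. A minimal sketch: the codes come from the description, while the English labels and the helper name `decode_review_result` are assumptions introduced here for illustration.

```python
# Code-to-label mapping taken from the dataset description; the English
# labels are translations of the Korean originals.
REVIEW_RESULT_LABELS = {
    "1": "pending approval",    # 승인대기
    "2": "retention approved",  # 보관승인
    "3": "retention rejected",  # 보관반려
}


def decode_review_result(code: str) -> str:
    """Return a readable label, or 'unknown' for codes outside the mapping."""
    return REVIEW_RESULT_LABELS.get(code.strip(), "unknown")


print(decode_review_result("2"))
print(decode_review_result("<NA>"))
```

Falling back to 'unknown' instead of raising keeps values outside the three documented codes visible without stopping a batch run.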

Reproduction

Analysis started: 2023-12-12 21:41:46.408795
Analysis finished: 2023-12-12 21:41:47.028925
Duration: 0.62 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
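The reported duration follows directly from the two timestamps. A short check using only the standard library, with the timestamp format inferred from how the values are printed:

```python
from datetime import datetime

# Format inferred from the printed timestamps (an assumption).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 21:41:46.408795", FMT)
finished = datetime.strptime("2023-12-12 21:41:47.028925", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```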

Variables

신청년도
Categorical

Distinct: 2
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

2021: 73
2022: 65

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

Value  Count  Frequency (%)
2021   73     52.9%
2022   65     47.1%
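The Frequency (%) column is each count divided by the 138 observations. A minimal sketch that rebuilds the table from a list of values; the helper name `frequency_table` is hypothetical, and the input list is reconstructed from the counts above, not read from the source file.

```python
from collections import Counter


def frequency_table(values):
    """Return (value, count, percent) rows sorted by descending count."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(v, c, round(100 * c / total, 1)) for v, c in counts.most_common()]


# Column rebuilt from the reported counts (73 rows of 2021, 65 rows of 2022).
years = ["2021"] * 73 + ["2022"] * 65
table = frequency_table(years)

for value, count, percent in table:
    print(f"{value}  {count}  {percent}%")
```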

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot (2021: 73, 52.9%; 2022: 65, 47.1%)]

신청월
Real number (ℝ)

Distinct: 12
Distinct (%): 8.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.5652174
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
median: 7
Q3: 9.75
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 5.75
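The fractional Q3 of 9.75 is what linear interpolation between neighbouring sorted values produces. The sketch below rebuilds the 신청월 column from the per-month counts listed in this variable's value tables and uses the standard library's `statistics.quantiles` with `method="inclusive"`. That this is the interpolation rule used by the report is an assumption, though it reproduces the figures above.

```python
import statistics

# month -> count, taken from the value tables for 신청월 in this report.
MONTH_COUNTS = {1: 11, 2: 12, 3: 11, 4: 12, 5: 11, 6: 11,
                7: 11, 8: 12, 9: 12, 10: 11, 11: 10, 12: 14}

# Expand the counts back into one sorted value per observation.
months = sorted(m for m, c in MONTH_COUNTS.items() for _ in range(c))

# "inclusive" interpolates linearly at position (n - 1) * p.
q1, median, q3 = statistics.quantiles(months, n=4, method="inclusive")
iqr = q3 - q1

print(q1, median, q3, iqr)
```

With 138 observations, Q3 sits at position 102.75 (zero-based), three quarters of the way from a 9 to a 10, which gives 9.75.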

Descriptive statistics

Standard deviation: 3.4996486
Coefficient of variation (CV): 0.53305906
Kurtosis: -1.2257262
Mean: 6.5652174
Median Absolute Deviation (MAD): 3
Skewness: -0.0013954744
Sum: 906
Variance: 12.24754
Monotonicity: Not monotonic
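Several of these figures can be cross-checked from the per-month counts alone. The sketch below assumes the sample (n - 1) definitions of variance and standard deviation, which is what the reported numbers are consistent with. The column is rebuilt from the value tables for 신청월, not read from the source file.

```python
import statistics

# month -> count, taken from the value tables for 신청월 in this report.
MONTH_COUNTS = {1: 11, 2: 12, 3: 11, 4: 12, 5: 11, 6: 11,
                7: 11, 8: 12, 9: 12, 10: 11, 11: 10, 12: 14}
months = [m for m, c in MONTH_COUNTS.items() for _ in range(c)]

total = sum(months)
mean = statistics.mean(months)
variance = statistics.variance(months)  # sample variance, divides by n - 1
std = statistics.stdev(months)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median distance from the median.
med = statistics.median(months)
mad = statistics.median([abs(m - med) for m in months])

print(total, round(mean, 7), round(variance, 5), round(std, 7), round(cv, 8), mad)
```

Dividing by n instead of n - 1 would give a variance near 12.16, so the reported 12.24754 points to the sample definition.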
[Figure: histogram with fixed size bins (bins=12)]

Common Values

Value             Count  Frequency (%)
12                14     10.1%
2                 12     8.7%
4                 12     8.7%
8                 12     8.7%
9                 12     8.7%
1                 11     8.0%
3                 11     8.0%
5                 11     8.0%
6                 11     8.0%
7                 11     8.0%
Other values (2)  21     15.2%

Minimum 10 values

Value  Count  Frequency (%)
1      11     8.0%
2      12     8.7%
3      11     8.0%
4      12     8.7%
5      11     8.0%
6      11     8.0%
7      11     8.0%
8      12     8.7%
9      12     8.7%
10     11     8.0%

Maximum 10 values

Value  Count  Frequency (%)
12     14     10.1%
11     10     7.2%
10     11     8.0%
9      12     8.7%
8      12     8.7%
7      11     8.0%
6      11     8.0%
5      11     8.0%
4      12     8.7%
3      11     8.0%

보관심의결과
Categorical

Distinct: 4
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

2: 89
3: 44
<NA>: 3
1: 2

Length

Max length: 4
Median length: 1
Mean length: 1.0652174
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: 2

Common Values

Value  Count  Frequency (%)
2      89     64.5%
3      44     31.9%
<NA>   3      2.2%
1      2      1.4%
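The report shows Missing: 0 for this variable, yet "<NA>" appears as a category and the maximum length is 4. One reading, which is an inference from the report and not something it states, is that the placeholder was stored as the literal four-character text "<NA>". The sketch below checks that reading against the reported mean length and shows one way to convert the token to a real missing value. The names `NA_TOKENS` and `to_optional` are hypothetical.

```python
# value -> count, from the Common Values table for 보관심의결과.
VALUE_COUNTS = {"2": 89, "3": 44, "<NA>": 3, "1": 2}
values = [v for v, c in VALUE_COUNTS.items() for _ in range(c)]

# If "<NA>" is literal text, it contributes length 4 to the length statistics.
mean_length = sum(len(v) for v in values) / len(values)
max_length = max(len(v) for v in values)

NA_TOKENS = {"<NA>"}  # assumption: only this token marks a missing value


def to_optional(value):
    """Map placeholder text to None and leave real codes untouched."""
    return None if value in NA_TOKENS else value


cleaned = [to_optional(v) for v in values]
missing = sum(v is None for v in cleaned)

print(round(mean_length, 7), max_length, missing)
```

The computed mean length, (135 x 1 + 3 x 4) / 138, agrees with the reported 1.0652174, which supports the literal-text reading. After conversion the column would have 3 missing cells instead of 0.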

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot (2: 89, 64.5%; 3: 44, 31.9%; <NA>: 3, 2.2%; 1: 2, 1.4%)]

보관개월수
Categorical

Distinct: 4
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

3: 45
12: 33
6: 31
9: 29

Length

Max length: 2
Median length: 1
Mean length: 1.2391304
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 3
3rd row: 6
4th row: 9
5th row: 3

Common Values

Value  Count  Frequency (%)
3      45     32.6%
12     33     23.9%
6      31     22.5%
9      29     21.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot (3: 45, 32.6%; 12: 33, 23.9%; 6: 31, 22.5%; 9: 29, 21.0%)]

신청건수
Real number (ℝ)

Distinct: 21
Distinct (%): 15.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.1014493
Minimum: 1
Maximum: 24
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 6
95-th percentile: 17.3
Maximum: 24
Range: 23
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 5.3152993
Coefficient of variation (CV): 1.0419195
Kurtosis: 2.1784666
Mean: 5.1014493
Median Absolute Deviation (MAD): 2
Skewness: 1.70632
Sum: 704
Variance: 28.252407
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=21)]

Common Values

Value              Count  Frequency (%)
1                  34     24.6%
2                  27     19.6%
3                  18     13.0%
4                  11     8.0%
5                  10     7.2%
9                  6      4.3%
8                  4      2.9%
6                  4      2.9%
7                  3      2.2%
14                 3      2.2%
Other values (11)  18     13.0%

Minimum 10 values

Value  Count  Frequency (%)
1      34     24.6%
2      27     19.6%
3      18     13.0%
4      11     8.0%
5      10     7.2%
6      4      2.9%
7      3      2.2%
8      4      2.9%
9      6      4.3%
10     1      0.7%

Maximum 10 values

Value  Count  Frequency (%)
24     1      0.7%
21     2      1.4%
20     2      1.4%
19     2      1.4%
17     2      1.4%
16     1      0.7%
15     2      1.4%
14     3      2.2%
13     2      1.4%
12     2      1.4%
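The minimum and maximum value tables together list 20 of the 21 distinct values of 신청건수. The remaining value can be inferred from the reported totals (138 observations, Sum 704), and the result can be checked against the reported variance. This is a reconstruction from the summary figures, not a reading of the source file.

```python
import statistics

N_ROWS = 138
REPORTED_SUM = 704

# value -> count, merged from the minimum and maximum value tables.
LISTED = {1: 34, 2: 27, 3: 18, 4: 11, 5: 10, 6: 4, 7: 3, 8: 4, 9: 6, 10: 1,
          12: 2, 13: 2, 14: 3, 15: 2, 16: 1, 17: 2, 19: 2, 20: 2, 21: 2, 24: 1}

# Rows and total not accounted for by the listed values.
uncovered_rows = N_ROWS - sum(LISTED.values())
uncovered_total = REPORTED_SUM - sum(v * c for v, c in LISTED.items())

# With exactly one uncovered row, its value equals the uncovered total.
if uncovered_rows != 1:
    raise ValueError("expected exactly one row outside the listed values")
inferred_value = uncovered_total

counts = {**LISTED, inferred_value: 1}
data = [v for v, c in counts.items() for _ in range(c)]
variance = statistics.variance(data)  # sample variance, divides by n - 1

print(inferred_value, len(counts), round(variance, 6))
```

The inferred value falls between the two tables, and the rebuilt column reproduces both the distinct count of 21 and the reported variance.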

Interactions

[Figures: four interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

              신청년도  신청월  보관심의결과  보관개월수  신청건수
신청년도      1.000     0.000   0.059         0.000       0.000
신청월        0.000     1.000   0.000         0.000       0.000
보관심의결과  0.059     0.000   1.000         0.000       0.468
보관개월수    0.000     0.000   0.000         1.000       0.474
신청건수      0.000     0.000   0.468         0.474       1.000

[Figure: correlation heatmap]

              보관심의결과  신청년도  보관개월수
보관심의결과  1.000         0.097     0.000
신청년도      0.097         1.000     0.000
보관개월수    0.000         0.000     1.000

[Figure: correlation heatmap]

              신청월  신청건수  신청년도  보관심의결과  보관개월수
신청월        1.000   0.119     0.000     0.000         0.000
신청건수      0.119   1.000     0.000     0.309         0.295
신청년도      0.000   0.000     1.000     0.097         0.000
보관심의결과  0.000   0.309     0.097     1.000         0.000
보관개월수    0.000   0.295     0.000     0.000         1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     신청년도  신청월  보관심의결과  보관개월수  신청건수
0    2021      1       2             3           3
1    2021      1       <NA>          3           3
2    2021      1       <NA>          6           3
3    2021      1       <NA>          9           1
4    2021      2       2             3           7
5    2021      2       2             6           2
6    2021      2       2             9           1
7    2021      2       2             12          2
8    2021      2       3             3           3
9    2021      3       2             3           17

Last rows

     신청년도  신청월  보관심의결과  보관개월수  신청건수
128  2022      11      2             3           4
129  2022      11      2             6           1
130  2022      11      2             12          7
131  2022      12      1             3           5
132  2022      12      1             6           1
133  2022      12      2             3           11
134  2022      12      2             6           3
135  2022      12      2             9           2
136  2022      12      2             12          9
137  2022      12      3             3           3
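Rows like these can be parsed with the standard library so that the column types match the report: 신청년도, 보관심의결과 and 보관개월수 stay as text (categorical), while 신청월 and 신청건수 become integers. A minimal sketch: the CSV text below is written out from the first sample rows for illustration, the real file's name, encoding and delimiter are not given in the report, and `read_records` is a hypothetical helper.

```python
import csv
import io

# First sample rows written out as CSV text for illustration only.
SAMPLE_CSV = """신청년도,신청월,보관심의결과,보관개월수,신청건수
2021,1,2,3,3
2021,1,<NA>,3,3
2021,1,<NA>,6,3
"""


def read_records(fileobj):
    """Parse rows, keeping categorical columns as text and numeric ones as int."""
    records = []
    for row in csv.DictReader(fileobj):
        records.append({
            "신청년도": row["신청년도"],
            "신청월": int(row["신청월"]),
            "보관심의결과": row["보관심의결과"],
            "보관개월수": row["보관개월수"],
            "신청건수": int(row["신청건수"]),
        })
    return records


records = read_records(io.StringIO(SAMPLE_CSV))
print(records[0])
```

Passing a file object instead of a path keeps the helper independent of the unknown file name. An opened file would be passed the same way as the `io.StringIO` used here.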