Overview

Dataset statistics

Number of variables8
Number of observations765
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory50.9 KiB
Average record size in memory68.2 B

Variable types

DateTime1
Categorical3
Numeric4

Dataset

Description충청북도 시군구별 보훈대상자 인원현황 자료1. 적용 대상 국가유공자는 「국가유공자 등 예우 및 지원에 관한 법률」 제4조 참조2. 참전유공자는 「참전유공자예우 및 단체설립에 관한 법률」제2조에 의거 등록된 대상자 현황<참고>* 제적(국적상실), 등급기준 미달자, 단순 수훈자, 희생자력 제외* 합계는 등록대상별 현황의 합계임 (실인원이 아님)* (고엽제후유증)은 국가유공자예우법상 "전몰·전상·순직·공상군경"에 포함(중복합산을 하지 않음)
Author국가보훈부
URLhttps://www.data.go.kr/data/15098588/fileData.do

Alerts

기준년월 has constant value ""Constant
지역명 has constant value ""Constant
순서 is highly overall correlated with 대상구분High correlation
합계 is highly overall correlated with 본인 and 1 other fieldsHigh correlation
본인 is highly overall correlated with 합계High correlation
유족 is highly overall correlated with 합계High correlation
대상구분 is highly overall correlated with 순서High correlation
합계 has 273 (35.7%) zerosZeros
본인 has 428 (55.9%) zerosZeros
유족 has 424 (55.4%) zerosZeros

Reproduction

Analysis started2024-03-16 04:16:07.347778
Analysis finished2024-03-16 04:16:10.473289
Duration3.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년월
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
Minimum2023-12-31 00:00:00
Maximum2023-12-31 00:00:00
2024-03-16T13:16:10.640265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:11.211598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

지역명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
충청북도
765 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청북도
2nd row충청북도
3rd row충청북도
4th row충청북도
5th row충청북도

Common Values

ValueCountFrequency (%)
충청북도 765
100.0%

Length

2024-03-16T13:16:11.429394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T13:16:11.561221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청북도 765
100.0%

시군구명
Categorical

Distinct15
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
괴산군
 
51
단양군
 
51
보은군
 
51
영동군
 
51
옥천군
 
51
Other values (10)
510 

Length

Max length7
Median length3
Mean length4.0666667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row괴산군
2nd row괴산군
3rd row괴산군
4th row괴산군
5th row괴산군

Common Values

ValueCountFrequency (%)
괴산군 51
 
6.7%
단양군 51
 
6.7%
보은군 51
 
6.7%
영동군 51
 
6.7%
옥천군 51
 
6.7%
음성군 51
 
6.7%
제천시 51
 
6.7%
증평군 51
 
6.7%
진천군 51
 
6.7%
청원군 51
 
6.7%
Other values (5) 255
33.3%

Length

2024-03-16T13:16:11.710963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
청주시 204
21.1%
괴산군 51
 
5.3%
단양군 51
 
5.3%
보은군 51
 
5.3%
영동군 51
 
5.3%
옥천군 51
 
5.3%
음성군 51
 
5.3%
제천시 51
 
5.3%
증평군 51
 
5.3%
진천군 51
 
5.3%
Other values (6) 306
31.6%

순서
Real number (ℝ)

HIGH CORRELATION 

Distinct51
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2024-03-16T13:16:11.947933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q113
median26
Q339
95-th percentile49
Maximum51
Range50
Interquartile range (IQR)26

Descriptive statistics

Standard deviation14.729232
Coefficient of variation (CV)0.56650891
Kurtosis-1.200925
Mean26
Median Absolute Deviation (MAD)13
Skewness0
Sum19890
Variance216.95026
MonotonicityNot monotonic
2024-03-16T13:16:12.281522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 15
 
2.0%
2 15
 
2.0%
29 15
 
2.0%
30 15
 
2.0%
31 15
 
2.0%
32 15
 
2.0%
33 15
 
2.0%
34 15
 
2.0%
35 15
 
2.0%
36 15
 
2.0%
Other values (41) 615
80.4%
ValueCountFrequency (%)
1 15
2.0%
2 15
2.0%
3 15
2.0%
4 15
2.0%
5 15
2.0%
6 15
2.0%
7 15
2.0%
8 15
2.0%
9 15
2.0%
10 15
2.0%
ValueCountFrequency (%)
51 15
2.0%
50 15
2.0%
49 15
2.0%
48 15
2.0%
47 15
2.0%
46 15
2.0%
45 15
2.0%
44 15
2.0%
43 15
2.0%
42 15
2.0%

대상구분
Categorical

HIGH CORRELATION 

Distinct49
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
건국포장
 
30
대통령표창
 
30
6·18자유상이자
 
15
고엽제후유증
 
15
지원순직군경
 
15
Other values (44)
660 

Length

Max length15
Median length11
Mean length7
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row[순국선열]
2nd row건국훈장
3rd row건국포장
4th row대통령표창
5th row[애국지사]

Common Values

ValueCountFrequency (%)
건국포장 30
 
3.9%
대통령표창 30
 
3.9%
6·18자유상이자 15
 
2.0%
고엽제후유증 15
 
2.0%
지원순직군경 15
 
2.0%
보국수훈자 15
 
2.0%
[애국지사] 15
 
2.0%
건국훈장 15
 
2.0%
[전몰·전상·순직·공상군경] 15
 
2.0%
전몰군경 15
 
2.0%
Other values (39) 585
76.5%

Length

2024-03-16T13:16:12.500805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
건국포장 30
 
3.6%
행불자 30
 
3.6%
또는 30
 
3.6%
건국훈장 30
 
3.6%
대통령표창 30
 
3.6%
5·18부상자 15
 
1.8%
참전유공자 15
 
1.8%
순국선열 15
 
1.8%
6.25및월남전쟁 15
 
1.8%
지원공상공무원 15
 
1.8%
Other values (40) 600
72.7%

합계
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct173
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57.738562
Minimum0
Maximum1237
Zeros273
Zeros (%)35.7%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2024-03-16T13:16:12.717693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q330
95-th percentile323.4
Maximum1237
Range1237
Interquartile range (IQR)30

Descriptive statistics

Standard deviation146.33194
Coefficient of variation (CV)2.5343883
Kurtosis18.570978
Mean57.738562
Median Absolute Deviation (MAD)2
Skewness3.9541391
Sum44170
Variance21413.036
MonotonicityNot monotonic
2024-03-16T13:16:13.052705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 273
35.7%
1 81
 
10.6%
3 41
 
5.4%
2 34
 
4.4%
4 24
 
3.1%
6 20
 
2.6%
7 15
 
2.0%
5 12
 
1.6%
11 10
 
1.3%
12 8
 
1.0%
Other values (163) 247
32.3%
ValueCountFrequency (%)
0 273
35.7%
1 81
 
10.6%
2 34
 
4.4%
3 41
 
5.4%
4 24
 
3.1%
5 12
 
1.6%
6 20
 
2.6%
7 15
 
2.0%
8 4
 
0.5%
9 6
 
0.8%
ValueCountFrequency (%)
1237 1
0.1%
1032 1
0.1%
998 1
0.1%
950 1
0.1%
935 1
0.1%
895 1
0.1%
761 1
0.1%
731 1
0.1%
701 1
0.1%
689 1
0.1%

본인
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct139
Distinct (%)18.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.938562
Minimum0
Maximum935
Zeros428
Zeros (%)55.9%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2024-03-16T13:16:13.397818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q38
95-th percentile206.8
Maximum935
Range935
Interquartile range (IQR)8

Descriptive statistics

Standard deviation106.61081
Coefficient of variation (CV)2.8861656
Kurtosis21.293562
Mean36.938562
Median Absolute Deviation (MAD)0
Skewness4.3081033
Sum28258
Variance11365.864
MonotonicityNot monotonic
2024-03-16T13:16:13.595760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 428
55.9%
1 50
 
6.5%
2 26
 
3.4%
3 26
 
3.4%
4 17
 
2.2%
5 12
 
1.6%
6 9
 
1.2%
21 7
 
0.9%
8 6
 
0.8%
10 6
 
0.8%
Other values (129) 178
23.3%
ValueCountFrequency (%)
0 428
55.9%
1 50
 
6.5%
2 26
 
3.4%
3 26
 
3.4%
4 17
 
2.2%
5 12
 
1.6%
6 9
 
1.2%
7 3
 
0.4%
8 6
 
0.8%
9 4
 
0.5%
ValueCountFrequency (%)
935 1
0.1%
731 1
0.1%
689 1
0.1%
660 1
0.1%
646 1
0.1%
628 1
0.1%
623 1
0.1%
615 1
0.1%
576 1
0.1%
565 1
0.1%

유족
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct104
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.8
Minimum0
Maximum807
Zeros424
Zeros (%)55.4%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2024-03-16T13:16:13.784534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q34
95-th percentile110.6
Maximum807
Range807
Interquartile range (IQR)4

Descriptive statistics

Standard deviation74.90144
Coefficient of variation (CV)3.6010308
Kurtosis43.171195
Mean20.8
Median Absolute Deviation (MAD)0
Skewness6.0075383
Sum15912
Variance5610.2257
MonotonicityNot monotonic
2024-03-16T13:16:14.002012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 424
55.4%
1 86
 
11.2%
3 32
 
4.2%
2 30
 
3.9%
6 11
 
1.4%
7 11
 
1.4%
5 9
 
1.2%
4 9
 
1.2%
12 8
 
1.0%
13 6
 
0.8%
Other values (94) 139
 
18.2%
ValueCountFrequency (%)
0 424
55.4%
1 86
 
11.2%
2 30
 
3.9%
3 32
 
4.2%
4 9
 
1.2%
5 9
 
1.2%
6 11
 
1.4%
7 11
 
1.4%
8 4
 
0.5%
9 3
 
0.4%
ValueCountFrequency (%)
807 1
0.1%
668 1
0.1%
617 1
0.1%
611 1
0.1%
583 1
0.1%
487 1
0.1%
480 1
0.1%
376 1
0.1%
372 1
0.1%
348 1
0.1%

Interactions

2024-03-16T13:16:09.601105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:07.848158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:08.507604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:09.102080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:09.756597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:07.968822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:08.657498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:09.228657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:09.899431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:08.126959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:08.822992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:09.348623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:10.051003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:08.307079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:08.957177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-16T13:16:09.450333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T13:16:14.171244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명순서대상구분합계본인유족
시군구명1.0000.0000.0000.1120.0000.000
순서0.0001.0000.9980.5030.3840.512
대상구분0.0000.9981.0000.6840.6710.679
합계0.1120.5030.6841.0000.8400.965
본인0.0000.3840.6710.8401.0000.558
유족0.0000.5120.6790.9650.5581.000
2024-03-16T13:16:14.347392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상구분시군구명
대상구분1.0000.000
시군구명0.0001.000
2024-03-16T13:16:14.494518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서합계본인유족시군구명대상구분
순서1.000-0.0700.221-0.4350.0000.952
합계-0.0701.0000.7920.6560.0410.299
본인0.2210.7921.0000.2500.0000.304
유족-0.4350.6560.2501.0000.0000.295
시군구명0.0000.0410.0000.0001.0000.000
대상구분0.9520.2990.3040.2950.0001.000

Missing values

2024-03-16T13:16:10.242286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:16:10.395335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년월지역명시군구명순서대상구분합계본인유족
02023-12-31충청북도괴산군1[순국선열]000
12023-12-31충청북도괴산군2건국훈장000
22023-12-31충청북도괴산군3건국포장000
32023-12-31충청북도괴산군4대통령표창000
42023-12-31충청북도괴산군5[애국지사]10010
52023-12-31충청북도괴산군6건국훈장606
62023-12-31충청북도괴산군7건국포장101
72023-12-31충청북도괴산군8대통령표창303
82023-12-31충청북도괴산군9[전몰·전상·순직·공상군경]300100200
92023-12-31충청북도괴산군10전몰군경36036
기준년월지역명시군구명순서대상구분합계본인유족
7552023-12-31충청북도충주시42고엽제후유증2세110
7562023-12-31충청북도충주시43[5·18민주유공자]422
7572023-12-31충청북도충주시445·18사망자 또는 행불자000
7582023-12-31충청북도충주시455·18부상자312
7592023-12-31충청북도충주시465·18희생자110
7602023-12-31충청북도충주시47[특수임무유공자]963
7612023-12-31충청북도충주시48특수임무사망자 또는 행불자000
7622023-12-31충청북도충주시49특수임무부상자651
7632023-12-31충청북도충주시50특수임무공로자312
7642023-12-31충청북도충주시51중·장기복무제대군인6236230