Overview

Dataset statistics

Number of variables: 8
Number of observations: 2397
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 159.3 KiB
Average record size in memory: 68.1 B
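
The two memory figures above are consistent with each other. A minimal check, assuming 1 KiB = 1024 bytes and that the average record size is total memory divided by the number of observations (an assumption about how the report derives it; the variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
total_kib = 159.3
n_observations = 2397

# Assumption: average record size = total bytes / number of rows.
total_bytes = total_kib * 1024
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```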

Variable types

DateTime: 1
Categorical: 3
Numeric: 4

Dataset

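Description: Headcount of veterans-affairs beneficiaries (보훈대상자) in Gyeonggi-do (경기도), by city, county, and district (시군구).
1. For the persons of national merit (국가유공자) covered, see Article 4 of 「국가유공자 등 예우 및 지원에 관한 법률」 (Act on the Honorable Treatment of and Support for Persons of Distinguished Service to the State).
2. War veterans (참전유공자) are those registered under Article 2 of 「참전유공자예우 및 단체설립에 관한 법률」 (Act on Honorable Treatment of War Veterans and Establishment of Related Organizations).
Notes:
* Excluded: persons removed from the register (loss of nationality), persons below the grading criteria, persons who only hold a decoration, and records of sacrifice only (희생자력).
* 합계 (total) is the sum over registration categories, not a count of unique individuals.
* 고엽제후유증 (Agent Orange aftereffects) is included under "전몰·전상·순직·공상군경" in the persons-of-national-merit act and is not double-counted.
Author: 국가보훈부 (Ministry of Patriots and Veterans Affairs)
URL: https://www.data.go.kr/data/15098574/fileData.do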

Alerts

기준년월 has constant value "2023-12-31" [Constant]
지역명 has constant value "경기도" [Constant]
순서 is highly overall correlated with 대상구분 [High correlation]
합계 is highly overall correlated with 본인 and 1 other field [High correlation]
본인 is highly overall correlated with 합계 [High correlation]
유족 is highly overall correlated with 합계 [High correlation]
대상구분 is highly overall correlated with 순서 [High correlation]
합계 has 591 (24.7%) zeros [Zeros]
본인 has 1210 (50.5%) zeros [Zeros]
유족 has 1025 (42.8%) zeros [Zeros]
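
The percentages in the Zeros alerts follow from the zero counts and the 2397 observations. A small sketch of that arithmetic; the helper name `zero_share` is illustrative and not part of any library:

```python
n_rows = 2397
zero_counts = {"합계": 591, "본인": 1210, "유족": 1025}

def zero_share(count, total):
    """Percentage of rows that are zero, rounded to one decimal."""
    return round(100 * count / total, 1)

shares = {col: zero_share(c, n_rows) for col, c in zero_counts.items()}
print(shares)
```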

Reproduction

Analysis started: 2024-03-16 04:11:51.679460
Analysis finished: 2024-03-16 04:11:58.948495
Duration: 7.27 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
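
The reported duration is the difference between the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section above.
started = datetime.fromisoformat("2024-03-16 04:11:51.679460")
finished = datetime.fromisoformat("2024-03-16 04:11:58.948495")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```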

Variables

기준년월 (reference year-month)
Date

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 18.9 KiB
Minimum: 2023-12-31 00:00:00
Maximum: 2023-12-31 00:00:00

Histogram with fixed size bins (bins=1)

지역명 (region name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 18.9 KiB

경기도: 2397

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 2397 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
경기도 | 2397 | 100.0%

시군구명 (city/county/district name)
Categorical

Distinct: 47
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 18.9 KiB

가평군: 51
동두천시: 51
고양시 덕양구: 51
고양시 일산동구: 51
고양시 일산서구: 51
Other values (42): 2142

Length

Max length: 8
Median length: 7
Mean length: 4.9361702
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
가평군 | 51 | 2.1%
동두천시 | 51 | 2.1%
고양시 덕양구 | 51 | 2.1%
고양시 일산동구 | 51 | 2.1%
고양시 일산서구 | 51 | 2.1%
과천시 | 51 | 2.1%
광명시 | 51 | 2.1%
광주시(경기) | 51 | 2.1%
구리시 | 51 | 2.1%
군포시 | 51 | 2.1%
Other values (37) | 1887 | 78.7%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
수원시 | 204 | 6.0%
부천시 | 204 | 6.0%
고양시 | 153 | 4.5%
안양시 | 153 | 4.5%
성남시 | 153 | 4.5%
용인시 | 153 | 4.5%
안산시 | 102 | 3.0%
가평군 | 51 | 1.5%
연천군 | 51 | 1.5%
상록구 | 51 | 1.5%
Other values (42) | 2142 | 62.7%

순서 (sequence number)
Real number (ℝ)

HIGH CORRELATION

Distinct: 51
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26
Minimum: 1
Maximum: 51
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 21.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 13
median: 26
Q3: 39
95-th percentile: 49
Maximum: 51
Range: 50
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 14.722673
Coefficient of variation (CV): 0.56625665
Kurtosis: -1.2009246
Mean: 26
Median Absolute Deviation (MAD): 13
Skewness: 0
Sum: 62322
Variance: 216.7571
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
1 | 47 | 2.0%
2 | 47 | 2.0%
29 | 47 | 2.0%
30 | 47 | 2.0%
31 | 47 | 2.0%
32 | 47 | 2.0%
33 | 47 | 2.0%
34 | 47 | 2.0%
35 | 47 | 2.0%
36 | 47 | 2.0%
Other values (41) | 1927 | 80.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 47 | 2.0%
2 | 47 | 2.0%
3 | 47 | 2.0%
4 | 47 | 2.0%
5 | 47 | 2.0%
6 | 47 | 2.0%
7 | 47 | 2.0%
8 | 47 | 2.0%
9 | 47 | 2.0%
10 | 47 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
51 | 47 | 2.0%
50 | 47 | 2.0%
49 | 47 | 2.0%
48 | 47 | 2.0%
47 | 47 | 2.0%
46 | 47 | 2.0%
45 | 47 | 2.0%
44 | 47 | 2.0%
43 | 47 | 2.0%
42 | 47 | 2.0%
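
The statistics for 순서 follow from its layout: 51 distinct values across 2397 rows, with every listed value occurring 47 times (47 × 51 = 2397). The sketch below rebuilds such a column and recomputes a few of the reported figures. It assumes every value from 1 to 51 occurs exactly 47 times and that the report uses the sample (n − 1) variance, which is what matches the reported 216.7571.

```python
import statistics

# Rebuild the column: each value 1..51 appears once per district (47 districts).
n_districts = 47
values = [k for k in range(1, 52) for _ in range(n_districts)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation

print(len(values), total, mean)
print(round(variance, 4), round(cv, 4))
```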

대상구분 (beneficiary category)
Categorical

HIGH CORRELATION

Distinct: 49
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 18.9 KiB

건국포장: 94
대통령표창: 94
6·18자유상이자: 47
고엽제후유증: 47
지원순직군경: 47
Other values (44): 2068

Length

Max length: 15
Median length: 11
Mean length: 7
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: [순국선열]
2nd row: 건국훈장
3rd row: 건국포장
4th row: 대통령표창
5th row: [애국지사]

Common Values

Value | Count | Frequency (%)
건국포장 | 94 | 3.9%
대통령표창 | 94 | 3.9%
6·18자유상이자 | 47 | 2.0%
고엽제후유증 | 47 | 2.0%
지원순직군경 | 47 | 2.0%
보국수훈자 | 47 | 2.0%
[애국지사] | 47 | 2.0%
건국훈장 | 47 | 2.0%
[전몰·전상·순직·공상군경] | 47 | 2.0%
전몰군경 | 47 | 2.0%
Other values (39) | 1833 | 76.5%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
건국포장 | 94 | 3.6%
행불자 | 94 | 3.6%
또는 | 94 | 3.6%
건국훈장 | 94 | 3.6%
대통령표창 | 94 | 3.6%
5·18부상자 | 47 | 1.8%
참전유공자 | 47 | 1.8%
순국선열 | 47 | 1.8%
6.25및월남전쟁 | 47 | 1.8%
지원공상공무원 | 47 | 1.8%
Other values (40) | 1880 | 72.7%

합계 (total)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 482
Distinct (%): 20.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 136.11598
Minimum: 0
Maximum: 3521
Zeros: 591
Zeros (%): 24.7%
Negative: 0
Negative (%): 0.0%
Memory size: 21.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 6
Q3: 57
95-th percentile: 832.4
Maximum: 3521
Range: 3521
Interquartile range (IQR): 56

Descriptive statistics

Standard deviation: 349.47703
Coefficient of variation (CV): 2.5674946
Kurtosis: 21.040526
Mean: 136.11598
Median Absolute Deviation (MAD): 6
Skewness: 4.1108505
Sum: 326270
Variance: 122134.2
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
0 | 591 | 24.7%
1 | 236 | 9.8%
2 | 119 | 5.0%
5 | 73 | 3.0%
3 | 68 | 2.8%
6 | 59 | 2.5%
4 | 59 | 2.5%
8 | 45 | 1.9%
9 | 42 | 1.8%
7 | 34 | 1.4%
Other values (472) | 1071 | 44.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 591 | 24.7%
1 | 236 | 9.8%
2 | 119 | 5.0%
3 | 68 | 2.8%
4 | 59 | 2.5%
5 | 73 | 3.0%
6 | 59 | 2.5%
7 | 34 | 1.4%
8 | 45 | 1.9%
9 | 42 | 1.8%

Maximum 10 values

Value | Count | Frequency (%)
3521 | 1 | < 0.1%
3203 | 1 | < 0.1%
2940 | 1 | < 0.1%
2913 | 1 | < 0.1%
2747 | 1 | < 0.1%
2525 | 1 | < 0.1%
2380 | 1 | < 0.1%
2364 | 1 | < 0.1%
2299 | 1 | < 0.1%
2256 | 1 | < 0.1%
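
Several of the 합계 figures are derived from others in the same section. A short sketch of those relationships, using the standard definitions (mean = sum / n, CV = standard deviation / mean, IQR = Q3 − Q1); the inputs are copied from the statistics above:

```python
# Figures copied from the 합계 statistics above.
n_rows = 2397
total_sum = 326270
std = 349.47703
q1, q3 = 1, 57

mean = total_sum / n_rows  # arithmetic mean
cv = std / mean            # coefficient of variation
iqr = q3 - q1              # interquartile range

print(round(mean, 5), round(cv, 4), iqr)
```

A CV above 2 alongside a median of 6 and a mean of about 136 reflects the strong right skew reported for this column.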

본인 (beneficiaries themselves)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 387
Distinct (%): 16.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 91.398832
Minimum: 0
Maximum: 2913
Zeros: 1210
Zeros (%): 50.5%
Negative: 0
Negative (%): 0.0%
Memory size: 21.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 18
95-th percentile: 549
Maximum: 2913
Range: 2913
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 272.52336
Coefficient of variation (CV): 2.9816941
Kurtosis: 28.326399
Mean: 91.398832
Median Absolute Deviation (MAD): 0
Skewness: 4.7927989
Sum: 219083
Variance: 74268.982
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
0 | 1210 | 50.5%
1 | 109 | 4.5%
2 | 75 | 3.1%
3 | 53 | 2.2%
8 | 40 | 1.7%
5 | 39 | 1.6%
6 | 39 | 1.6%
4 | 33 | 1.4%
10 | 32 | 1.3%
7 | 29 | 1.2%
Other values (377) | 738 | 30.8%

Minimum 10 values

Value | Count | Frequency (%)
0 | 1210 | 50.5%
1 | 109 | 4.5%
2 | 75 | 3.1%
3 | 53 | 2.2%
4 | 33 | 1.4%
5 | 39 | 1.6%
6 | 39 | 1.6%
7 | 29 | 1.2%
8 | 40 | 1.7%
9 | 22 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
2913 | 1 | < 0.1%
2747 | 1 | < 0.1%
2380 | 1 | < 0.1%
2256 | 1 | < 0.1%
2183 | 1 | < 0.1%
2119 | 1 | < 0.1%
2064 | 1 | < 0.1%
2059 | 1 | < 0.1%
2048 | 1 | < 0.1%
1890 | 1 | < 0.1%

유족 (bereaved family members)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 278
Distinct (%): 11.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44.717146
Minimum: 0
Maximum: 2047
Zeros: 1025
Zeros (%): 42.8%
Negative: 0
Negative (%): 0.0%
Memory size: 21.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 10
95-th percentile: 254.2
Maximum: 2047
Range: 2047
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 156.53577
Coefficient of variation (CV): 3.500576
Kurtosis: 46.557275
Mean: 44.717146
Median Absolute Deviation (MAD): 1
Skewness: 6.0382662
Sum: 107187
Variance: 24503.448
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
0 | 1025 | 42.8%
1 | 279 | 11.6%
2 | 136 | 5.7%
3 | 82 | 3.4%
4 | 77 | 3.2%
5 | 67 | 2.8%
6 | 54 | 2.3%
9 | 31 | 1.3%
8 | 21 | 0.9%
10 | 19 | 0.8%
Other values (268) | 606 | 25.3%

Minimum 10 values

Value | Count | Frequency (%)
0 | 1025 | 42.8%
1 | 279 | 11.6%
2 | 136 | 5.7%
3 | 82 | 3.4%
4 | 77 | 3.2%
5 | 67 | 2.8%
6 | 54 | 2.3%
7 | 19 | 0.8%
8 | 21 | 0.9%
9 | 31 | 1.3%

Maximum 10 values

Value | Count | Frequency (%)
2047 | 1 | < 0.1%
1985 | 1 | < 0.1%
1607 | 1 | < 0.1%
1497 | 1 | < 0.1%
1405 | 1 | < 0.1%
1325 | 1 | < 0.1%
1226 | 1 | < 0.1%
1194 | 1 | < 0.1%
1190 | 1 | < 0.1%
1141 | 1 | < 0.1%
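
The column sums reported for 합계, 본인, and 유족 can be cross-checked against each other. The sketch below assumes that 합계 equals 본인 + 유족 on every row (an assumption about the data, though the reported sums are consistent with it), in which case the column sums and means must add up as well:

```python
n_rows = 2397
sums = {"합계": 326270, "본인": 219083, "유족": 107187}

# Assumption: 합계 = 본인 + 유족 on every row, so the column sums add up too.
parts = sums["본인"] + sums["유족"]
means = {col: s / n_rows for col, s in sums.items()}

print(parts == sums["합계"])
print({col: round(m, 4) for col, m in means.items()})
```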

Interactions

[Figures: 16 interaction plots between variable pairs, not reproduced here]

Correlations

Variable | 시군구명 | 순서 | 대상구분 | 합계 | 본인 | 유족
시군구명 | 1.000 | 0.000 | 0.000 | 0.072 | 0.000 | 0.000
순서 | 0.000 | 1.000 | 0.998 | 0.491 | 0.463 | 0.386
대상구분 | 0.000 | 0.998 | 1.000 | 0.657 | 0.675 | 0.683
합계 | 0.072 | 0.491 | 0.657 | 1.000 | 0.934 | 0.853
본인 | 0.000 | 0.463 | 0.675 | 0.934 | 1.000 | 0.472
유족 | 0.000 | 0.386 | 0.683 | 0.853 | 0.472 | 1.000

Variable | 시군구명 | 대상구분
시군구명 | 1.000 | 0.000
대상구분 | 0.000 | 1.000

Variable | 순서 | 합계 | 본인 | 유족 | 시군구명 | 대상구분
순서 | 1.000 | -0.041 | 0.286 | -0.431 | 0.000 | 0.971
합계 | -0.041 | 1.000 | 0.769 | 0.630 | 0.024 | 0.285
본인 | 0.286 | 0.769 | 1.000 | 0.210 | 0.000 | 0.297
유족 | -0.431 | 0.630 | 0.210 | 1.000 | 0.000 | 0.319
시군구명 | 0.000 | 0.024 | 0.000 | 0.000 | 1.000 | 0.000
대상구분 | 0.971 | 0.285 | 0.297 | 0.319 | 0.000 | 1.000
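
To read the first correlation matrix above against the High correlation alerts, one can list the variable pairs whose coefficient exceeds a cutoff. The sketch below uses a hypothetical cutoff of 0.8; the report does not state the threshold it applied, and `strong_pairs` is an illustrative helper, not a ydata-profiling function.

```python
columns = ["시군구명", "순서", "대상구분", "합계", "본인", "유족"]
matrix = [
    [1.000, 0.000, 0.000, 0.072, 0.000, 0.000],
    [0.000, 1.000, 0.998, 0.491, 0.463, 0.386],
    [0.000, 0.998, 1.000, 0.657, 0.675, 0.683],
    [0.072, 0.491, 0.657, 1.000, 0.934, 0.853],
    [0.000, 0.463, 0.675, 0.934, 1.000, 0.472],
    [0.000, 0.386, 0.683, 0.853, 0.472, 1.000],
]

def strong_pairs(names, corr, threshold):
    """Return (a, b, value) for each unordered pair with |corr| >= threshold."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(corr[i][j]) >= threshold:
                pairs.append((names[i], names[j], corr[i][j]))
    return pairs

THRESHOLD = 0.8  # hypothetical cutoff, not taken from the report
for a, b, value in strong_pairs(columns, matrix, THRESHOLD):
    print(a, b, value)
```

With this cutoff the selected pairs are the same ones named in the Alerts section (순서 with 대상구분, and 합계 with 본인 and 유족); a lower cutoff such as 0.5 would also pick up the 대상구분 pairs with the three count columns.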

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

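First rows

Index | 기준년월 | 지역명 | 시군구명 | 순서 | 대상구분 | 합계 | 본인 | 유족
0 | 2023-12-31 | 경기도 | 가평군 | 1 | [순국선열] | 2 | 0 | 2
1 | 2023-12-31 | 경기도 | 가평군 | 2 | 건국훈장 | 2 | 0 | 2
2 | 2023-12-31 | 경기도 | 가평군 | 3 | 건국포장 | 0 | 0 | 0
3 | 2023-12-31 | 경기도 | 가평군 | 4 | 대통령표창 | 0 | 0 | 0
4 | 2023-12-31 | 경기도 | 가평군 | 5 | [애국지사] | 17 | 0 | 17
5 | 2023-12-31 | 경기도 | 가평군 | 6 | 건국훈장 | 9 | 0 | 9
6 | 2023-12-31 | 경기도 | 가평군 | 7 | 건국포장 | 2 | 0 | 2
7 | 2023-12-31 | 경기도 | 가평군 | 8 | 대통령표창 | 6 | 0 | 6
8 | 2023-12-31 | 경기도 | 가평군 | 9 | [전몰·전상·순직·공상군경] | 524 | 193 | 331
9 | 2023-12-31 | 경기도 | 가평군 | 10 | 전몰군경 | 81 | 0 | 81

Last rows

Index | 기준년월 | 지역명 | 시군구명 | 순서 | 대상구분 | 합계 | 본인 | 유족
2387 | 2023-12-31 | 경기도 | 화성시 | 42 | 고엽제후유증2세 | 3 | 3 | 0
2388 | 2023-12-31 | 경기도 | 화성시 | 43 | [5·18민주유공자] | 27 | 21 | 6
2389 | 2023-12-31 | 경기도 | 화성시 | 44 | 5·18사망자 또는 행불자 | 0 | 0 | 0
2390 | 2023-12-31 | 경기도 | 화성시 | 45 | 5·18부상자 | 10 | 7 | 3
2391 | 2023-12-31 | 경기도 | 화성시 | 46 | 5·18희생자 | 17 | 14 | 3
2392 | 2023-12-31 | 경기도 | 화성시 | 47 | [특수임무유공자] | 64 | 50 | 14
2393 | 2023-12-31 | 경기도 | 화성시 | 48 | 특수임무사망자 또는 행불자 | 1 | 0 | 1
2394 | 2023-12-31 | 경기도 | 화성시 | 49 | 특수임무부상자 | 14 | 13 | 1
2395 | 2023-12-31 | 경기도 | 화성시 | 50 | 특수임무공로자 | 49 | 37 | 12
2396 | 2023-12-31 | 경기도 | 화성시 | 51 | 중·장기복무제대군인 | 2048 | 2048 | 0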
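
The sample rows suggest two structural rules: 합계 equals 본인 + 유족 on each row, and a bracketed 대상구분 label such as [순국선열] carries the subtotal of the unbracketed rows that follow it. Both rules are inferences from the rows shown, not documented properties of the dataset. A minimal sketch that checks them on the first eight sample rows (`check_rows` is an illustrative helper):

```python
# (대상구분, 합계, 본인, 유족) for the first eight sample rows above.
rows = [
    ("[순국선열]", 2, 0, 2),
    ("건국훈장", 2, 0, 2),
    ("건국포장", 0, 0, 0),
    ("대통령표창", 0, 0, 0),
    ("[애국지사]", 17, 0, 17),
    ("건국훈장", 9, 0, 9),
    ("건국포장", 2, 0, 2),
    ("대통령표창", 6, 0, 6),
]

def check_rows(rows):
    """Check 합계 == 본인 + 유족 per row, and bracketed subtotals per group."""
    row_ok = all(total == self_ + family for _, total, self_, family in rows)
    group_ok = True
    header_total, running = None, 0
    for label, total, _, _ in rows:
        if label.startswith("["):
            # A new bracketed header closes the previous group.
            if header_total is not None and header_total != running:
                group_ok = False
            header_total, running = total, 0
        else:
            running += total
    # Close the last group.
    if header_total is not None and header_total != running:
        group_ok = False
    return row_ok, group_ok

print(check_rows(rows))
```

Because bracketed rows repeat the totals of their sub-rows, summing 합계 over all rows of a district counts those people more than once, which is consistent with the note in the dataset description that the total is not a count of unique individuals.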