Overview

Dataset statistics

Number of variables7
Number of observations41
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory63.2 B

Variable types

DateTime1
Numeric4
Categorical1
Text1

Dataset

Description국가보훈부 보훈대상자별 (성별연령별) 실인원현황 자료1. 적용 대상 국가유공자는 「국가유공자 등 예우 및 지원에 관한 법률」 제4조 참조2. 참전유공자는 「참전유공자예우 및 단체설립에 관한 법률」제2조에 의거 등록된 대상자 현황
Author국가보훈부
URLhttps://www.data.go.kr/data/15072654/fileData.do

Alerts

기준일 has constant value ""Constant
순서 is highly overall correlated with and 1 other fieldsHigh correlation
합계 is highly overall correlated with and 1 other fieldsHigh correlation
is highly overall correlated with 합계 and 1 other fieldsHigh correlation
is highly overall correlated with 순서 and 2 other fieldsHigh correlation
본인유족구분 is highly overall correlated with 순서High correlation
순서 has unique valuesUnique
합계 has 3 (7.3%) zerosZeros
has 4 (9.8%) zerosZeros
has 3 (7.3%) zerosZeros

Reproduction

Analysis started2024-04-21 01:39:45.756474
Analysis finished2024-04-21 01:39:49.053899
Duration3.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준일
Date

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
Minimum2024-03-31 00:00:00
Maximum2024-03-31 00:00:00
2024-04-21T10:39:49.097297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:49.176786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

순서
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21
Minimum1
Maximum41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-04-21T10:39:49.283650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q111
median21
Q331
95-th percentile39
Maximum41
Range40
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.979149
Coefficient of variation (CV)0.57043565
Kurtosis-1.2
Mean21
Median Absolute Deviation (MAD)10
Skewness0
Sum861
Variance143.5
MonotonicityStrictly increasing
2024-04-21T10:39:49.417183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1 1
 
2.4%
32 1
 
2.4%
24 1
 
2.4%
25 1
 
2.4%
26 1
 
2.4%
27 1
 
2.4%
28 1
 
2.4%
29 1
 
2.4%
30 1
 
2.4%
31 1
 
2.4%
Other values (31) 31
75.6%
ValueCountFrequency (%)
1 1
2.4%
2 1
2.4%
3 1
2.4%
4 1
2.4%
5 1
2.4%
6 1
2.4%
7 1
2.4%
8 1
2.4%
9 1
2.4%
10 1
2.4%
ValueCountFrequency (%)
41 1
2.4%
40 1
2.4%
39 1
2.4%
38 1
2.4%
37 1
2.4%
36 1
2.4%
35 1
2.4%
34 1
2.4%
33 1
2.4%
32 1
2.4%

본인유족구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size460.0 B
유족
21 
본인
20 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본인
2nd row본인
3rd row본인
4th row본인
5th row본인

Common Values

ValueCountFrequency (%)
유족 21
51.2%
본인 20
48.8%

Length

2024-04-21T10:39:49.536270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:39:49.623483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유족 21
51.2%
본인 20
48.8%
Distinct21
Distinct (%)51.2%
Missing0
Missing (%)0.0%
Memory size460.0 B
2024-04-21T10:39:49.779088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.8536585
Min length4

Characters and Unicode

Total characters240
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)2.4%

Sample

1st row해당없음
2nd row10~14세
3rd row15~19세
4th row20~24세
5th row25~29세
ValueCountFrequency (%)
해당없음 2
 
4.9%
60~64세 2
 
4.9%
100세이상 2
 
4.9%
95~99세 2
 
4.9%
90~94세 2
 
4.9%
85~89세 2
 
4.9%
80~84세 2
 
4.9%
75~79세 2
 
4.9%
70~74세 2
 
4.9%
65~69세 2
 
4.9%
Other values (11) 21
51.2%
2024-04-21T10:39:50.109308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
16.2%
~ 37
15.4%
5 27
11.2%
9 27
11.2%
4 26
10.8%
0 22
9.2%
1 10
 
4.2%
6 8
 
3.3%
8 8
 
3.3%
7 8
 
3.3%
Other values (8) 28
11.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 152
63.3%
Other Letter 51
 
21.2%
Math Symbol 37
 
15.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 27
17.8%
9 27
17.8%
4 26
17.1%
0 22
14.5%
1 10
 
6.6%
6 8
 
5.3%
8 8
 
5.3%
7 8
 
5.3%
2 8
 
5.3%
3 8
 
5.3%
Other Letter
ValueCountFrequency (%)
39
76.5%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
Math Symbol
ValueCountFrequency (%)
~ 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 189
78.8%
Hangul 51
 
21.2%

Most frequent character per script

Common
ValueCountFrequency (%)
~ 37
19.6%
5 27
14.3%
9 27
14.3%
4 26
13.8%
0 22
11.6%
1 10
 
5.3%
6 8
 
4.2%
8 8
 
4.2%
7 8
 
4.2%
2 8
 
4.2%
Hangul
ValueCountFrequency (%)
39
76.5%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 189
78.8%
Hangul 51
 
21.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
76.5%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
ASCII
ValueCountFrequency (%)
~ 37
19.6%
5 27
14.3%
9 27
14.3%
4 26
13.8%
0 22
11.6%
1 10
 
5.3%
6 8
 
4.2%
8 8
 
4.2%
7 8
 
4.2%
2 8
 
4.2%

합계
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct39
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16167.195
Minimum0
Maximum126847
Zeros3
Zeros (%)7.3%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-04-21T10:39:50.224689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1333
median12736
Q323087
95-th percentile44200
Maximum126847
Range126847
Interquartile range (IQR)22754

Descriptive statistics

Standard deviation22720.758
Coefficient of variation (CV)1.4053617
Kurtosis13.508465
Mean16167.195
Median Absolute Deviation (MAD)12403
Skewness3.1043714
Sum662855
Variance5.1623282 × 108
MonotonicityNot monotonic
2024-04-21T10:39:50.350045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
0 3
 
7.3%
404 1
 
2.4%
83 1
 
2.4%
132 1
 
2.4%
327 1
 
2.4%
614 1
 
2.4%
1837 1
 
2.4%
4241 1
 
2.4%
8683 1
 
2.4%
13901 1
 
2.4%
Other values (29) 29
70.7%
ValueCountFrequency (%)
0 3
7.3%
1 1
 
2.4%
7 1
 
2.4%
14 1
 
2.4%
33 1
 
2.4%
83 1
 
2.4%
132 1
 
2.4%
327 1
 
2.4%
333 1
 
2.4%
404 1
 
2.4%
ValueCountFrequency (%)
126847 1
2.4%
48143 1
2.4%
44200 1
2.4%
42784 1
2.4%
34405 1
2.4%
34351 1
2.4%
33910 1
2.4%
29808 1
2.4%
26900 1
2.4%
25492 1
2.4%


Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct38
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11990.854
Minimum0
Maximum126654
Zeros4
Zeros (%)9.8%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-04-21T10:39:50.488095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q192
median5480
Q314485
95-th percentile34223
Maximum126654
Range126654
Interquartile range (IQR)14393

Descriptive statistics

Standard deviation21352.962
Coefficient of variation (CV)1.7807708
Kurtosis21.144925
Mean11990.854
Median Absolute Deviation (MAD)5476
Skewness4.1238982
Sum491625
Variance4.5594899 × 108
MonotonicityNot monotonic
2024-04-21T10:39:50.607310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
0 4
 
9.8%
17334 1
 
2.4%
77 1
 
2.4%
181 1
 
2.4%
353 1
 
2.4%
1073 1
 
2.4%
2661 1
 
2.4%
5598 1
 
2.4%
9292 1
 
2.4%
24728 1
 
2.4%
Other values (28) 28
68.3%
ValueCountFrequency (%)
0 4
9.8%
4 1
 
2.4%
10 1
 
2.4%
20 1
 
2.4%
45 1
 
2.4%
58 1
 
2.4%
77 1
 
2.4%
92 1
 
2.4%
181 1
 
2.4%
325 1
 
2.4%
ValueCountFrequency (%)
126654 1
2.4%
42600 1
2.4%
34223 1
2.4%
29231 1
2.4%
25132 1
2.4%
24728 1
2.4%
22564 1
2.4%
19658 1
2.4%
19105 1
2.4%
17334 1
2.4%


Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct39
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4176.3415
Minimum0
Maximum32578
Zeros3
Zeros (%)7.3%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-04-21T10:39:50.718111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q155
median360
Q31580
95-th percentile22992
Maximum32578
Range32578
Interquartile range (IQR)1525

Descriptive statistics

Standard deviation8555.0172
Coefficient of variation (CV)2.0484477
Kurtosis3.8538055
Mean4176.3415
Median Absolute Deviation (MAD)355
Skewness2.2188227
Sum171230
Variance73188319
MonotonicityNot monotonic
2024-04-21T10:39:50.828831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%)
0 3
 
7.3%
5 1
 
2.4%
38 1
 
2.4%
55 1
 
2.4%
146 1
 
2.4%
261 1
 
2.4%
764 1
 
2.4%
1580 1
 
2.4%
3085 1
 
2.4%
4609 1
 
2.4%
Other values (29) 29
70.7%
ValueCountFrequency (%)
0 3
7.3%
1 1
 
2.4%
3 1
 
2.4%
4 1
 
2.4%
5 1
 
2.4%
8 1
 
2.4%
13 1
 
2.4%
38 1
 
2.4%
55 1
 
2.4%
77 1
 
2.4%
ValueCountFrequency (%)
32578 1
2.4%
29038 1
2.4%
22992 1
2.4%
19472 1
2.4%
18988 1
2.4%
18218 1
2.4%
9566 1
2.4%
4609 1
2.4%
3085 1
2.4%
2939 1
2.4%

Interactions

2024-04-21T10:39:48.529234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:47.476730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:47.880479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.211217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.623111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:47.621909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:47.966208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.294318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.713939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:47.707004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.039403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.368809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.796067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:47.792778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.121067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:39:48.453923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:39:50.908438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서본인유족구분연령구분합계
순서1.0001.0000.0000.7720.7160.371
본인유족구분1.0001.0000.0000.1580.3330.294
연령구분0.0000.0001.0000.6770.3280.000
합계0.7720.1580.6771.0000.9510.510
0.7160.3330.3280.9511.0000.000
0.3710.2940.0000.5100.0001.000
2024-04-21T10:39:51.013921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서합계본인유족구분
순서1.0000.199-0.0890.5720.848
합계0.1991.0000.9100.7030.179
-0.0890.9101.0000.5130.387
0.5720.7030.5131.0000.288
본인유족구분0.8480.1790.3870.2881.000

Missing values

2024-04-21T10:39:48.897602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:39:49.011927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준일순서본인유족구분연령구분합계
02024-03-311본인해당없음000
12024-03-312본인10~14세000
22024-03-313본인15~19세000
32024-03-314본인20~24세4043995
42024-03-315본인25~29세61335859274
52024-03-316본인30~34세1427713424853
62024-03-317본인35~39세1395613096860
72024-03-318본인40~44세1479414096698
82024-03-319본인45~49세1502714415612
92024-03-3110본인50~54세1479014485305
기준일순서본인유족구분연령구분합계
312024-03-3132유족55~59세1390192924609
322024-03-3133유족60~64세26900173349566
332024-03-3134유족65~69세442002472819472
342024-03-3135유족70~74세481431910529038
352024-03-3136유족75~79세339101091822992
362024-03-3137유족80~84세22832384418988
372024-03-3138유족85~89세34351177332578
382024-03-3139유족90~94세1874652818218
392024-03-3140유족95~99세3031922939
402024-03-3141유족100세이상44858390