Overview

Dataset statistics

Number of variables5
Number of observations620
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.8 KiB
Average record size in memory44.2 B

Variable types

Categorical3
Numeric2

Alerts

대상자수 is highly overall correlated with 신청자수 and 1 other fieldsHigh correlation
신청자수 is highly overall correlated with 대상자수 and 1 other fieldsHigh correlation
시군명 is highly overall correlated with 대상자수 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-22 21:45:50.701788
Analysis finished2023-12-22 21:45:53.947192
Duration3.25 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Categorical

Distinct5
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2023
124 
2022
124 
2021
124 
2020
124 
2019
124 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 124
20.0%
2022 124
20.0%
2021 124
20.0%
2020 124
20.0%
2019 124
20.0%

Length

2023-12-22T21:45:54.291905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-22T21:45:54.840971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 124
20.0%
2022 124
20.0%
2021 124
20.0%
2020 124
20.0%
2019 124
20.0%

기준분기
Categorical

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
3
155 
4
155 
1
155 
2
155 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 155
25.0%
4 155
25.0%
1 155
25.0%
2 155
25.0%

Length

2023-12-22T21:45:55.538507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-22T21:45:55.965379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 155
25.0%
4 155
25.0%
1 155
25.0%
2 155
25.0%

시군명
Categorical

HIGH CORRELATION 

Distinct35
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
용인시
 
20
오산시
 
20
안양시
 
20
광주시
 
20
안산시
 
20
Other values (30)
520 

Length

Max length4
Median length3
Mean length3.0919355
Min length3

Unique

Unique3 ?
Unique (%)0.5%

Sample

1st row용인시
2nd row성남시
3rd row부천시
4th row안산시
5th row화성시

Common Values

ValueCountFrequency (%)
용인시 20
 
3.2%
오산시 20
 
3.2%
안양시 20
 
3.2%
광주시 20
 
3.2%
안산시 20
 
3.2%
평택시 20
 
3.2%
부천시 20
 
3.2%
파주시 20
 
3.2%
시흥시 20
 
3.2%
김포시 20
 
3.2%
Other values (25) 420
67.7%

Length

2023-12-22T21:45:56.301232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
용인시 20
 
3.2%
화성시 20
 
3.2%
구리시 20
 
3.2%
안성시 20
 
3.2%
포천시 20
 
3.2%
의왕시 20
 
3.2%
여주시 20
 
3.2%
수원시 20
 
3.2%
가평군 20
 
3.2%
연천군 20
 
3.2%
Other values (25) 420
67.7%

대상자수
Real number (ℝ)

HIGH CORRELATION 

Distinct545
Distinct (%)87.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4696.4484
Minimum311
Maximum15963
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-22T21:45:57.006451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum311
5-th percentile477.95
Q11791.75
median3432.5
Q37179.75
95-th percentile12322.5
Maximum15963
Range15652
Interquartile range (IQR)5388

Descriptive statistics

Standard deviation3883.1023
Coefficient of variation (CV)0.82681678
Kurtosis0.091296321
Mean4696.4484
Median Absolute Deviation (MAD)2238
Skewness1.0025111
Sum2911798
Variance15078484
MonotonicityNot monotonic
2023-12-22T21:45:57.680038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7574 5
 
0.8%
5547 4
 
0.6%
890 4
 
0.6%
3654 4
 
0.6%
1058 4
 
0.6%
710 4
 
0.6%
488 4
 
0.6%
2267 3
 
0.5%
5803 3
 
0.5%
1766 3
 
0.5%
Other values (535) 582
93.9%
ValueCountFrequency (%)
311 1
0.2%
334 1
0.2%
335 1
0.2%
337 1
0.2%
338 1
0.2%
345 1
0.2%
366 1
0.2%
368 1
0.2%
370 2
0.3%
377 1
0.2%
ValueCountFrequency (%)
15963 1
0.2%
15874 1
0.2%
15859 1
0.2%
15805 1
0.2%
15750 1
0.2%
15707 1
0.2%
15706 1
0.2%
15548 1
0.2%
15289 1
0.2%
14977 1
0.2%

신청자수
Real number (ℝ)

HIGH CORRELATION 

Distinct591
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4324.3065
Minimum241
Maximum15240
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-22T21:45:58.475363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum241
5-th percentile450.05
Q11624
median3165
Q36643.5
95-th percentile11173.5
Maximum15240
Range14999
Interquartile range (IQR)5019.5

Descriptive statistics

Standard deviation3567.3503
Coefficient of variation (CV)0.82495317
Kurtosis0.096444674
Mean4324.3065
Median Absolute Deviation (MAD)2203
Skewness0.99498895
Sum2681070
Variance12725988
MonotonicityNot monotonic
2023-12-22T21:45:59.257335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2134 3
 
0.5%
601 3
 
0.5%
888 2
 
0.3%
1671 2
 
0.3%
311 2
 
0.3%
681 2
 
0.3%
6130 2
 
0.3%
1624 2
 
0.3%
740 2
 
0.3%
423 2
 
0.3%
Other values (581) 598
96.5%
ValueCountFrequency (%)
241 1
0.2%
257 1
0.2%
268 1
0.2%
277 1
0.2%
279 1
0.2%
282 1
0.2%
311 2
0.3%
315 1
0.2%
324 1
0.2%
326 1
0.2%
ValueCountFrequency (%)
15240 1
0.2%
14924 1
0.2%
14900 1
0.2%
14700 1
0.2%
14532 1
0.2%
14454 1
0.2%
14248 1
0.2%
14227 1
0.2%
13988 1
0.2%
13951 1
0.2%

Interactions

2023-12-22T21:45:52.281855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-22T21:45:51.401850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-22T21:45:52.827943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-22T21:45:51.862210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-22T21:45:59.800092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도기준분기시군명대상자수신청자수
기준년도1.0000.0000.0000.2120.086
기준분기0.0001.0000.0000.0000.000
시군명0.0000.0001.0000.9760.952
대상자수0.2120.0000.9761.0000.978
신청자수0.0860.0000.9520.9781.000
2023-12-22T21:46:00.204339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명기준분기기준년도
시군명1.0000.0000.000
기준분기0.0001.0000.000
기준년도0.0000.0001.000
2023-12-22T21:46:00.603969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상자수신청자수기준년도기준분기시군명
대상자수1.0000.9950.0890.0000.805
신청자수0.9951.0000.0350.0000.711
기준년도0.0890.0351.0000.0000.000
기준분기0.0000.0000.0001.0000.000
시군명0.8050.7110.0000.0001.000

Missing values

2023-12-22T21:45:53.302159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-22T21:45:53.717463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년도기준분기시군명대상자수신청자수
020233용인시109969053
120233성남시92296542
220233부천시84177065
320233안산시82966953
420233화성시81676738
520233남양주시69525755
620233안양시58695122
720233평택시55014603
820233의정부시49544093
920233파주시45993791
기준년도기준분기시군명대상자수신청자수
61020193오산시23441849
61120193하남시20411611
61220193양주시22521923
61320193구리시24232172
61420193안성시16591404
61520193포천시15201310
61620193의왕시20501726
61720193여주시1240922
61820193양평시893706
61920193동두천시1045926