Overview

Dataset statistics

Number of variables8
Number of observations93
Missing cells186
Missing cells (%)25.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory71.4 B

Variable types

Categorical4
Unsupported2
Numeric2

Alerts

기준년도 has constant value ""Constant
기준월 has constant value ""Constant
노인수 is highly overall correlated with 일자리수High correlation
일자리수 is highly overall correlated with 노인수High correlation
학력 has 93 (100.0%) missing valuesMissing
연령 has 93 (100.0%) missing valuesMissing
학력 is an unsupported type, check if it needs cleaning or further analysisUnsupported
연령 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 21:14:29.674063
Analysis finished2023-12-10 21:14:30.378058
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023
93 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 93
100.0%

Length

2023-12-11T06:14:30.449155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:14:30.533416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 93
100.0%

기준월
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size876.0 B
6
93 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6
2nd row6
3rd row6
4th row6
5th row6

Common Values

ValueCountFrequency (%)
6 93
100.0%

Length

2023-12-11T06:14:30.636138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:14:30.713648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6 93
100.0%

시군명
Categorical

Distinct31
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size876.0 B
수원시
 
3
성남시
 
3
부천시
 
3
용인시
 
3
안산시
 
3
Other values (26)
78 

Length

Max length4
Median length3
Mean length3.0967742
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시
2nd row수원시
3rd row수원시
4th row성남시
5th row성남시

Common Values

ValueCountFrequency (%)
수원시 3
 
3.2%
성남시 3
 
3.2%
부천시 3
 
3.2%
용인시 3
 
3.2%
안산시 3
 
3.2%
안양시 3
 
3.2%
평택시 3
 
3.2%
시흥시 3
 
3.2%
화성시 3
 
3.2%
광명시 3
 
3.2%
Other values (21) 63
67.7%

Length

2023-12-11T06:14:30.793391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 3
 
3.2%
하남시 3
 
3.2%
가평군 3
 
3.2%
동두천시 3
 
3.2%
포천시 3
 
3.2%
양주시 3
 
3.2%
구리시 3
 
3.2%
파주시 3
 
3.2%
의정부시 3
 
3.2%
남양주시 3
 
3.2%
Other values (21) 63
67.7%

성별
Categorical

Distinct3
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size876.0 B
31 
31 
31 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
31
33.3%
31
33.3%
31
33.3%

Length

2023-12-11T06:14:30.885381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:14:30.960201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
31
33.3%
31
33.3%
31
33.3%

학력
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing93
Missing (%)100.0%
Memory size969.0 B

연령
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing93
Missing (%)100.0%
Memory size969.0 B

노인수
Real number (ℝ)

HIGH CORRELATION 

Distinct92
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44216.753
Minimum5187
Maximum168125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size969.0 B
2023-12-11T06:14:31.050272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5187
5-th percentile8798
Q118253
median33382
Q361640
95-th percentile123116.4
Maximum168125
Range162938
Interquartile range (IQR)43387

Descriptive statistics

Standard deviation36161.215
Coefficient of variation (CV)0.81781707
Kurtosis2.511009
Mean44216.753
Median Absolute Deviation (MAD)16899
Skewness1.5947471
Sum4112158
Variance1.3076335 × 109
MonotonicityNot monotonic
2023-12-11T06:14:31.168374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31306 2
 
2.2%
11504 1
 
1.1%
79768 1
 
1.1%
65725 1
 
1.1%
52755 1
 
1.1%
118480 1
 
1.1%
94660 1
 
1.1%
73465 1
 
1.1%
168125 1
 
1.1%
6317 1
 
1.1%
Other values (82) 82
88.2%
ValueCountFrequency (%)
5187 1
1.1%
5712 1
1.1%
6317 1
1.1%
6942 1
1.1%
8633 1
1.1%
8908 1
1.1%
9620 1
1.1%
11504 1
1.1%
11511 1
1.1%
11740 1
1.1%
ValueCountFrequency (%)
168125 1
1.1%
161277 1
1.1%
152003 1
1.1%
148688 1
1.1%
130071 1
1.1%
118480 1
1.1%
94660 1
1.1%
93105 1
1.1%
89699 1
1.1%
86996 1
1.1%

일자리수
Real number (ℝ)

HIGH CORRELATION 

Distinct90
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2371.9355
Minimum132
Maximum7158
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size969.0 B
2023-12-11T06:14:31.299890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum132
5-th percentile547.8
Q11116
median1978
Q33278
95-th percentile5530
Maximum7158
Range7026
Interquartile range (IQR)2162

Descriptive statistics

Standard deviation1598.2336
Coefficient of variation (CV)0.67380991
Kurtosis0.40403829
Mean2371.9355
Median Absolute Deviation (MAD)1042
Skewness0.96160771
Sum220590
Variance2554350.8
MonotonicityNot monotonic
2023-12-11T06:14:31.441543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3422 2
 
2.2%
3324 2
 
2.2%
4916 2
 
2.2%
5713 1
 
1.1%
2060 1
 
1.1%
3225 1
 
1.1%
1726 1
 
1.1%
4951 1
 
1.1%
2242 1
 
1.1%
7158 1
 
1.1%
Other values (80) 80
86.0%
ValueCountFrequency (%)
132 1
1.1%
320 1
1.1%
323 1
1.1%
455 1
1.1%
537 1
1.1%
555 1
1.1%
565 1
1.1%
593 1
1.1%
635 1
1.1%
668 1
1.1%
ValueCountFrequency (%)
7158 1
1.1%
6837 1
1.1%
6232 1
1.1%
5713 1
1.1%
5593 1
1.1%
5488 1
1.1%
4951 1
1.1%
4916 2
2.2%
4878 1
1.1%
4579 1
1.1%

Interactions

2023-12-11T06:14:30.010633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:14:29.830653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:14:30.097759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:14:29.922408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:14:31.522791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명성별노인수일자리수
시군명1.0000.0000.0970.000
성별0.0001.0000.2940.653
노인수0.0970.2941.0000.903
일자리수0.0000.6530.9031.000
2023-12-11T06:14:31.595827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명성별
시군명1.0000.000
성별0.0001.000
2023-12-11T06:14:31.669145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노인수일자리수시군명성별
노인수1.0000.7770.0430.185
일자리수0.7771.0000.0000.487
시군명0.0430.0001.0000.000
성별0.1850.4870.0001.000

Missing values

2023-12-11T06:14:30.194126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:14:30.311407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년도기준월시군명성별학력연령노인수일자리수
020236수원시<NA><NA>1520035713
120236수원시<NA><NA>670411588
220236수원시<NA><NA>849624125
320236성남시<NA><NA>1486886232
420236성남시<NA><NA>668831699
520236성남시<NA><NA>818054533
620236부천시<NA><NA>1300716837
720236부천시<NA><NA>585081959
820236부천시<NA><NA>715634878
920236용인시<NA><NA>1612774916
기준년도기준월시군명성별학력연령노인수일자리수
8320236포천시<NA><NA>178631516
8420236동두천시<NA><NA>209142648
8520236동두천시<NA><NA>8908752
8620236동두천시<NA><NA>120061896
8720236가평군<NA><NA>182531711
8820236가평군<NA><NA>8633555
8920236가평군<NA><NA>96201156
9020236연천군<NA><NA>126541436
9120236연천군<NA><NA>5712320
9220236연천군<NA><NA>69421116