Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Categorical6
Numeric2

Alerts

행정동코드 has constant value ""Constant
시도명 has constant value ""Constant
시군구명 has constant value ""Constant
행정동명 has constant value ""Constant
성별 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 성별High correlation

Reproduction

Analysis started2024-04-17 09:20:10.884483
Analysis finished2024-04-17 09:20:11.531057
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1114055000
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1114055000
2nd row1114055000
3rd row1114055000
4th row1114055000
5th row1114055000

Common Values

ValueCountFrequency (%)
1114055000 100
100.0%

Length

2024-04-17T18:20:11.586540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:11.673091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1114055000 100
100.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2024-04-17T18:20:11.764902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:11.846019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
중구
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
중구 100
100.0%

Length

2024-04-17T18:20:11.926927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:12.002328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중구 100
100.0%

행정동명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
명동
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row명동
2nd row명동
3rd row명동
4th row명동
5th row명동

Common Values

ValueCountFrequency (%)
명동 100
100.0%

Length

2024-04-17T18:20:12.077831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:12.151166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
명동 100
100.0%

기준일자
Real number (ℝ)

Distinct26
Distinct (%)26.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200811
Minimum20200801
Maximum20200829
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-17T18:20:12.228885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200801
5-th percentile20200801
Q120200806
median20200812
Q320200815
95-th percentile20200825
Maximum20200829
Range28
Interquartile range (IQR)9.25

Descriptive statistics

Standard deviation6.9405848
Coefficient of variation (CV)3.4357951 × 10-7
Kurtosis-0.33564253
Mean20200811
Median Absolute Deviation (MAD)4.5
Skewness0.47652025
Sum2.0200811 × 109
Variance48.171717
MonotonicityNot monotonic
2024-04-17T18:20:12.336537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
20200814 8
 
8.0%
20200815 8
 
8.0%
20200808 6
 
6.0%
20200801 6
 
6.0%
20200804 6
 
6.0%
20200805 5
 
5.0%
20200812 5
 
5.0%
20200806 5
 
5.0%
20200813 5
 
5.0%
20200816 5
 
5.0%
Other values (16) 41
41.0%
ValueCountFrequency (%)
20200801 6
6.0%
20200802 4
4.0%
20200803 4
4.0%
20200804 6
6.0%
20200805 5
5.0%
20200806 5
5.0%
20200807 5
5.0%
20200808 6
6.0%
20200809 3
3.0%
20200810 2
 
2.0%
ValueCountFrequency (%)
20200829 1
 
1.0%
20200828 1
 
1.0%
20200827 1
 
1.0%
20200826 1
 
1.0%
20200825 2
2.0%
20200823 3
3.0%
20200821 2
2.0%
20200819 2
2.0%
20200818 3
3.0%
20200817 3
3.0%

성별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
F
48 
M
39 
X
13 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowM
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
F 48
48.0%
M 39
39.0%
X 13
 
13.0%

Length

2024-04-17T18:20:12.441956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:12.521280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 48
48.0%
m 39
39.0%
x 13
 
13.0%

연령대
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
25
23 
30
19 
35
16 
xx
13 
20
10 
Other values (5)
19 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row35
2nd row20
3rd row25
4th row30
5th row35

Common Values

ValueCountFrequency (%)
25 23
23.0%
30 19
19.0%
35 16
16.0%
xx 13
13.0%
20 10
10.0%
40 8
 
8.0%
45 5
 
5.0%
15 3
 
3.0%
50 2
 
2.0%
55 1
 
1.0%

Length

2024-04-17T18:20:12.616807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:12.709384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
25 23
23.0%
30 19
19.0%
35 16
16.0%
xx 13
13.0%
20 10
10.0%
40 8
 
8.0%
45 5
 
5.0%
15 3
 
3.0%
50 2
 
2.0%
55 1
 
1.0%

소비인구(명)
Real number (ℝ)

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.021134
Minimum22.479082
Maximum67.437247
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-17T18:20:12.812553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22.479082
5-th percentile22.479082
Q122.479082
median29.97211
Q337.465137
95-th percentile52.825844
Maximum67.437247
Range44.958165
Interquartile range (IQR)14.986055

Descriptive statistics

Standard deviation11.221614
Coefficient of variation (CV)0.36174095
Kurtosis1.6473799
Mean31.021134
Median Absolute Deviation (MAD)7.4930275
Skewness1.4682443
Sum3102.1134
Variance125.92463
MonotonicityNot monotonic
2024-04-17T18:20:12.890998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
22.479082395 48
48.0%
29.97210986 22
22.0%
37.465137325 14
 
14.0%
44.95816479 7
 
7.0%
52.451192255 4
 
4.0%
59.94421972 3
 
3.0%
67.437247185 2
 
2.0%
ValueCountFrequency (%)
22.479082395 48
48.0%
29.97210986 22
22.0%
37.465137325 14
 
14.0%
44.95816479 7
 
7.0%
52.451192255 4
 
4.0%
59.94421972 3
 
3.0%
67.437247185 2
 
2.0%
ValueCountFrequency (%)
67.437247185 2
 
2.0%
59.94421972 3
 
3.0%
52.451192255 4
 
4.0%
44.95816479 7
 
7.0%
37.465137325 14
 
14.0%
29.97210986 22
22.0%
22.479082395 48
48.0%

Interactions

2024-04-17T18:20:11.184531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:20:11.036846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:20:11.261228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:20:11.115041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T18:20:12.953940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자성별연령대소비인구(명)
기준일자1.0000.0000.0000.309
성별0.0001.0000.8220.446
연령대0.0000.8221.0000.000
소비인구(명)0.3090.4460.0001.000
2024-04-17T18:20:13.034257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별연령대
성별1.0000.697
연령대0.6971.000
2024-04-17T18:20:13.387587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자소비인구(명)성별연령대
기준일자1.000-0.0100.0000.000
소비인구(명)-0.0101.0000.3280.000
성별0.0000.3281.0000.697
연령대0.0000.0000.6971.000

Missing values

2024-04-17T18:20:11.362916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T18:20:11.489748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01114055000서울특별시중구명동20200801F3537.465137
11114055000서울특별시중구명동20200801M2029.97211
21114055000서울특별시중구명동20200801M2552.451192
31114055000서울특별시중구명동20200801M3029.97211
41114055000서울특별시중구명동20200801M3529.97211
51114055000서울특별시중구명동20200802F3022.479082
61114055000서울특별시중구명동20200802F4522.479082
71114055000서울특별시중구명동20200802M2529.97211
81114055000서울특별시중구명동20200802M5522.479082
91114055000서울특별시중구명동20200803F2529.97211
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901114055000서울특별시중구명동20200823F2537.465137
911114055000서울특별시중구명동20200823M3029.97211
921114055000서울특별시중구명동20200823M3522.479082
931114055000서울특별시중구명동20200825F2537.465137
941114055000서울특별시중구명동20200825Xxx37.465137
951114055000서울특별시중구명동20200826F3522.479082
961114055000서울특별시중구명동20200827Xxx52.451192
971114055000서울특별시중구명동20200828M2522.479082
981114055000서울특별시중구명동20200829F2522.479082
991114055000서울특별시중구명동20200801F2559.94422