Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.7 KiB
Average record size in memory: 68.3 B

Variable types

Categorical: 6
Numeric: 2

Alerts

행정동코드 has constant value "1111053000" (Constant)
시도명 has constant value "서울특별시" (Constant)
시군구명 has constant value "종로구" (Constant)
행정동명 has constant value "사직동" (Constant)
성별 is highly overall correlated with 연령대 (High correlation)
연령대 is highly overall correlated with 성별 (High correlation)
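
The Constant alerts flag columns that hold a single distinct value in every row. A minimal sketch of that kind of check follows, using three rows copied from the Sample section of this report. The function name `constant_columns` is illustrative, and this is not claimed to be ydata-profiling's own implementation.

```python
from typing import Dict, List


def constant_columns(rows: List[Dict[str, object]]) -> List[str]:
    """Return the names of columns whose values are identical in every row."""
    if not rows:
        return []
    return [
        name
        for name in rows[0]
        if len({row[name] for row in rows}) == 1
    ]


# Three rows taken from the Sample section (only four of the eight columns shown).
rows = [
    {"행정동코드": "1111053000", "시도명": "서울특별시", "성별": "F", "연령대": "25"},
    {"행정동코드": "1111053000", "시도명": "서울특별시", "성별": "F", "연령대": "30"},
    {"행정동코드": "1111053000", "시도명": "서울특별시", "성별": "M", "연령대": "20"},
]

print(constant_columns(rows))
```

Columns flagged this way carry no information within this extract and are usually dropped or moved to metadata before modelling.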

Reproduction

Analysis started: 2023-12-10 13:10:27.567375
Analysis finished: 2023-12-10 13:10:28.699129
Duration: 1.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

행정동코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1111053000
2nd row: 1111053000
3rd row: 1111053000
4th row: 1111053000
5th row: 1111053000

Common Values

Value | Count | Frequency (%)
1111053000 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울특별시
2nd row: 서울특별시
3rd row: 서울특별시
4th row: 서울특별시
5th row: 서울특별시

Common Values

Value | Count | Frequency (%)
서울특별시 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 종로구
2nd row: 종로구
3rd row: 종로구
4th row: 종로구
5th row: 종로구

Common Values

Value | Count | Frequency (%)
종로구 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

행정동명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사직동
2nd row: 사직동
3rd row: 사직동
4th row: 사직동
5th row: 사직동

Common Values

Value | Count | Frequency (%)
사직동 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

기준일자
Real number (ℝ)

Distinct: 6
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20200803
Minimum: 20200801
Maximum: 20200806
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 20200801
5-th percentile: 20200801
Q1: 20200802
Median: 20200804
Q3: 20200805
95-th percentile: 20200806
Maximum: 20200806
Range: 5
Interquartile range (IQR): 3
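
기준일자 holds dates encoded as YYYYMMDD integers, so the profiler treats it as a real number. The Range of 5 equals the calendar span here only because every value falls in the same month. A minimal sketch of parsing the integers into real dates, assuming every value is a valid YYYYMMDD date (`parse_yyyymmdd` is a hypothetical helper, not part of the report):

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20200801 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


first = parse_yyyymmdd(20200801)  # Minimum reported for 기준일자
last = parse_yyyymmdd(20200806)   # Maximum reported for 기준일자

# Calendar span in days between the earliest and latest date.
span_days = (last - first).days
print(first, last, span_days)

# Across a month boundary the integer difference stops matching the day span.
int_gap = 20200901 - 20200831
day_gap = (parse_yyyymmdd(20200901) - parse_yyyymmdd(20200831)).days
print(int_gap, day_gap)
```

Converting the column to a date type before profiling avoids statistics such as a mean or variance over date-shaped integers, which have no calendar meaning.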

Descriptive statistics

Standard deviation: 1.6904664
Coefficient of variation (CV): 8.3683129 × 10^-8
Kurtosis: -1.2340709
Mean: 20200803
Median Absolute Deviation (MAD): 1
Skewness: -0.056083311
Sum: 2.0200803 × 10^9
Variance: 2.8576768
Monotonicity: Not monotonic
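
The descriptive statistics for 기준일자 can be cross-checked against the value counts listed for the same variable. The sketch below uses Python's standard `statistics` module and assumes the report uses the sample (n - 1) variance, which is what the printed figures are consistent with.

```python
import statistics

# Value counts for 기준일자 as listed in this report.
counts = {
    20200801: 18,
    20200802: 15,
    20200803: 15,
    20200804: 20,
    20200805: 18,
    20200806: 14,
}

# Expand the frequency table back into the 100 individual observations.
values = [value for value, n in counts.items() for _ in range(n)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)      # sample standard deviation
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)

print(len(values), total, mean)
print(round(variance, 7), round(std_dev, 7), cv)
print(median)
```

The computed mean is 20200803.47, which the report displays rounded as 20200803.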
[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value | Count | Frequency (%)
20200804 | 20 | 20.0%
20200801 | 18 | 18.0%
20200805 | 18 | 18.0%
20200802 | 15 | 15.0%
20200803 | 15 | 15.0%
20200806 | 14 | 14.0%

Smallest values

Value | Count | Frequency (%)
20200801 | 18 | 18.0%
20200802 | 15 | 15.0%
20200803 | 15 | 15.0%
20200804 | 20 | 20.0%
20200805 | 18 | 18.0%
20200806 | 14 | 14.0%

Largest values

Value | Count | Frequency (%)
20200806 | 14 | 14.0%
20200805 | 18 | 18.0%
20200804 | 20 | 20.0%
20200803 | 15 | 15.0%
20200802 | 15 | 15.0%
20200801 | 18 | 18.0%

성별
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: F
4th row: F
5th row: F

Common Values

Value | Count | Frequency (%)
M | 48 | 48.0%
F | 47 | 47.0%
X | 5 | 5.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
m | 48 | 48.0%
f | 47 | 47.0%
x | 5 | 5.0%

연령대
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 1.0%

Sample

1st row: 25
2nd row: 30
3rd row: 35
4th row: 40
5th row: 45

Common Values

Value | Count | Frequency (%)
25 | 12 | 12.0%
30 | 12 | 12.0%
35 | 12 | 12.0%
20 | 12 | 12.0%
45 | 11 | 11.0%
40 | 10 | 10.0%
50 | 9 | 9.0%
55 | 6 | 6.0%
15 | 6 | 6.0%
xx | 5 | 5.0%
Other values (3) | 5 | 5.0%

Length

[Figure: Histogram of lengths of the category]

소비인구(명)
Real number (ℝ)

Distinct: 16
Distinct (%): 16.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 54.174589
Minimum: 22.479082
Maximum: 172.33963
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 22.479082
5-th percentile: 22.479082
Q1: 29.97211
Median: 44.958165
Q3: 74.930275
95-th percentile: 112.39541
Maximum: 172.33963
Range: 149.86055
Interquartile range (IQR): 44.958165

Descriptive statistics

Standard deviation: 30.90106
Coefficient of variation (CV): 0.57039768
Kurtosis: 1.1441597
Mean: 54.174589
Median Absolute Deviation (MAD): 22.479082
Skewness: 1.0880738
Sum: 5417.4589
Variance: 954.87549
Monotonicity: Not monotonic
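
Several of the 소비인구(명) figures are derived from one another, which allows a quick consistency check. The sketch below uses the rounded values exactly as printed in this report, so each comparison allows a small relative tolerance; the tolerance of 1e-5 is an arbitrary choice for illustration.

```python
import math

# Values copied from the 소비인구(명) statistics in this report (rounded as printed).
minimum, maximum = 22.479082, 172.33963
q1, q3 = 29.97211, 74.930275
mean, std_dev = 54.174589, 30.90106
total, n_rows = 5417.4589, 100

# Each entry pairs a recomputed value with the value printed in the report.
checks = {
    "range": (maximum - minimum, 149.86055),
    "iqr": (q3 - q1, 44.958165),
    "cv": (std_dev / mean, 0.57039768),
    "variance": (std_dev ** 2, 954.87549),
    "mean": (total / n_rows, 54.174589),
}

for name, (computed, reported) in checks.items():
    ok = math.isclose(computed, reported, rel_tol=1e-5)
    print(f"{name}: computed={computed:.6f} reported={reported} match={ok}")
```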
[Figure: Histogram with fixed size bins (bins=16)]

Common values

Value | Count | Frequency (%)
22.479082395 | 22 | 22.0%
29.97210986 | 13 | 13.0%
37.465137325 | 10 | 10.0%
59.94421972 | 9 | 9.0%
44.95816479 | 8 | 8.0%
67.437247185 | 7 | 7.0%
82.423302115 | 7 | 7.0%
52.451192255 | 5 | 5.0%
89.91632958 | 5 | 5.0%
112.39541198 | 4 | 4.0%
Other values (6) | 10 | 10.0%

Smallest values

Value | Count | Frequency (%)
22.479082395 | 22 | 22.0%
29.97210986 | 13 | 13.0%
37.465137325 | 10 | 10.0%
44.95816479 | 8 | 8.0%
52.451192255 | 5 | 5.0%
59.94421972 | 9 | 9.0%
67.437247185 | 7 | 7.0%
74.93027465 | 3 | 3.0%
82.423302115 | 7 | 7.0%
89.91632958 | 5 | 5.0%

Largest values

Value | Count | Frequency (%)
172.3396317 | 1 | 1.0%
134.87449437 | 1 | 1.0%
119.88843944 | 1 | 1.0%
112.39541198 | 4 | 4.0%
104.90238451 | 1 | 1.0%
97.409357045 | 3 | 3.0%
89.91632958 | 5 | 5.0%
82.423302115 | 7 | 7.0%
74.93027465 | 3 | 3.0%
67.437247185 | 7 | 7.0%

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 기준일자 | 성별 | 연령대 | 소비인구(명)
기준일자 | 1.000 | 0.000 | 0.000 | 0.330
성별 | 0.000 | 1.000 | 0.806 | 0.433
연령대 | 0.000 | 0.806 | 1.000 | 0.000
소비인구(명) | 0.330 | 0.433 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 연령대 | 성별
연령대 | 1.000 | 0.630
성별 | 0.630 | 1.000

[Figure: correlation heatmap]

Variable | 기준일자 | 소비인구(명) | 성별 | 연령대
기준일자 | 1.000 | 0.330 | 0.000 | 0.000
소비인구(명) | 0.330 | 1.000 | 0.204 | 0.000
성별 | 0.000 | 0.204 | 1.000 | 0.630
연령대 | 0.000 | 0.000 | 0.630 | 1.000
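
The report text does not say which coefficient each matrix uses. For two categorical columns such as 성별 and 연령대, one common association measure is Cramér's V. The sketch below computes the uncorrected textbook version on made-up toy data; it is an illustration under that assumption and is not claimed to reproduce the values in the matrices.

```python
import math
from collections import Counter
from typing import Sequence


def cramers_v(xs: Sequence[str], ys: Sequence[str]) -> float:
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    pair_counts = Counter(zip(xs, ys))

    # Pearson chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by the sample size and the smaller table dimension minus one.
    k = min(len(x_counts), len(y_counts)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Toy data (hypothetical, not taken from the dataset).
same = cramers_v(["M", "M", "F", "F"], ["a", "a", "b", "b"])         # perfectly associated
independent = cramers_v(["M", "M", "F", "F"], ["a", "b", "a", "b"])  # no association
print(same, independent)
```

A value near 1 means one column almost determines the other, and a value near 0 means the two are close to independent.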

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

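First rows

Index | 행정동코드 | 시도명 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명)
0 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 25 | 59.94422
1 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 30 | 29.97211
2 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 35 | 22.479082
3 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 40 | 37.465137
4 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 45 | 52.451192
5 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 50 | 22.479082
6 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 55 | 22.479082
7 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | M | 20 | 67.437247
8 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | M | 25 | 82.423302
9 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | M | 30 | 37.465137

Last rows

Index | 행정동코드 | 시도명 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명)
90 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | F | 40 | 67.437247
91 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | F | 45 | 44.958165
92 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | F | 50 | 59.94422
93 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | F | 55 | 37.465137
94 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | M | 15 | 29.97211
95 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | M | 20 | 172.339632
96 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | M | 25 | 112.395412
97 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | M | 30 | 112.395412
98 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200806 | M | 35 | 112.395412
99 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200801 | F | 20 | 29.97211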