Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.7 KiB
Average record size in memory: 68.3 B

Variable types

Categorical: 6
Numeric: 2

Alerts

행정동코드 has constant value "1111061500" (Constant)
시도명 has constant value "서울특별시" (Constant)
시군구명 has constant value "종로구" (Constant)
행정동명 has constant value "종로1.2.3.4가동" (Constant)
성별 is highly overall correlated with 연령대 (High correlation)
연령대 is highly overall correlated with 성별 (High correlation)
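
A "Constant" alert means the column holds a single distinct value in every row. The check can be sketched in plain Python as below. The helper name `constant_columns` is hypothetical, the three rows are copied from the Sample section of this report (row indices 0, 2 and 3), and this is an illustration rather than ydata-profiling's own implementation.

```python
def constant_columns(rows):
    """Return {column: value} for every column with exactly one distinct value."""
    if not rows:
        return {}
    result = {}
    for column in rows[0]:
        distinct = {row[column] for row in rows}
        if len(distinct) == 1:
            result[column] = next(iter(distinct))
    return result


# Fields shared by every row in the Sample section of this report.
base = {
    "행정동코드": "1111061500",
    "시도명": "서울특별시",
    "시군구명": "종로구",
    "행정동명": "종로1.2.3.4가동",
}

# Rows 0, 2 and 3 from the Sample section.
rows = [
    {**base, "기준일자": 20200801, "성별": "F", "연령대": "25", "소비인구(명)": 37.465137},
    {**base, "기준일자": 20200801, "성별": "M", "연령대": "20", "소비인구(명)": 22.479082},
    {**base, "기준일자": 20200802, "성별": "F", "연령대": "20", "소비인구(명)": 22.479082},
]

constant = constant_columns(rows)
for name, value in constant.items():
    print(f"{name} has constant value {value!r}")
```

On these rows only the four location columns are reported, which matches the alerts listed above.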

Reproduction

Analysis started: 2023-12-10 13:37:55.418863
Analysis finished: 2023-12-10 13:37:56.300781
Duration: 0.88 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
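
A report of this kind is typically produced with a few lines of ydata-profiling. The sketch below assumes the data lives in a CSV file; the function name `build_report`, both path arguments, the report title and the `dtype` choice are assumptions for illustration, since the report does not record how the data was loaded.

```python
def build_report(csv_path, html_path):
    """Profile a CSV file and write the HTML report (illustrative sketch)."""
    # Imported inside the function so the module loads even where
    # pandas or ydata-profiling is not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: read the code column as text so that it is profiled as a
    # 10-character categorical value rather than as a number.
    df = pd.read_csv(csv_path, dtype={"행정동코드": str})

    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(html_path)
    return profile
```

Calling `build_report("input.csv", "report.html")` with real paths would write the HTML file; both file names here are placeholders.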

Variables

행정동코드 (administrative dong code)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value       Count
1111061500  100

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1111061500
2nd row: 1111061500
3rd row: 1111061500
4th row: 1111061500
5th row: 1111061500

Common Values

Value       Count  Frequency (%)
1111061500  100    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시도명 (province or metropolitan city name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value       Count
서울특별시  100

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울특별시
2nd row: 서울특별시
3rd row: 서울특별시
4th row: 서울특별시
5th row: 서울특별시

Common Values

Value       Count  Frequency (%)
서울특별시  100    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

시군구명 (city, county or district name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value   Count
종로구  100

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 종로구
2nd row: 종로구
3rd row: 종로구
4th row: 종로구
5th row: 종로구

Common Values

Value   Count  Frequency (%)
종로구  100    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

행정동명 (administrative dong name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value            Count
종로1.2.3.4가동  100

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 종로1.2.3.4가동
2nd row: 종로1.2.3.4가동
3rd row: 종로1.2.3.4가동
4th row: 종로1.2.3.4가동
5th row: 종로1.2.3.4가동

Common Values

Value            Count  Frequency (%)
종로1.2.3.4가동  100    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

기준일자 (reference date, stored as a YYYYMMDD integer)
Real number (ℝ)

Distinct: 19
Distinct (%): 19.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20200809
Minimum: 20200801
Maximum: 20200820
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 20200801
5th percentile: 20200803
Q1: 20200804
Median: 20200806
Q3: 20200814
95th percentile: 20200820
Maximum: 20200820
Range: 19
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 6.0834184
Coefficient of variation (CV): 3.0114727 × 10^-7
Kurtosis: -1.0938754
Mean: 20200809
Median Absolute Deviation (MAD): 3
Skewness: 0.60643804
Sum: 2.0200809 × 10^9
Variance: 37.00798
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=19)]
Common values

Value             Count  Frequency (%)
20200804          16     16.0%
20200803          15     15.0%
20200805          9      9.0%
20200806          6      6.0%
20200820          6      6.0%
20200818          6      6.0%
20200811          5      5.0%
20200819          5      5.0%
20200814          4      4.0%
20200817          4      4.0%
Other values (9)  24     24.0%

Minimum 10 values

Value     Count  Frequency (%)
20200801  4      4.0%
20200802  1      1.0%
20200803  15     15.0%
20200804  16     16.0%
20200805  9      9.0%
20200806  6      6.0%
20200807  4      4.0%
20200808  3      3.0%
20200809  1      1.0%
20200810  4      4.0%

Maximum 10 values

Value     Count  Frequency (%)
20200820  6      6.0%
20200819  5      5.0%
20200818  6      6.0%
20200817  4      4.0%
20200815  1      1.0%
20200814  4      4.0%
20200813  3      3.0%
20200812  3      3.0%
20200811  5      5.0%
20200810  4      4.0%
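
Because 기준일자 holds YYYYMMDD integers, the profiler treats it as a real number. The reported Range of 19 equals the span in days only because every value falls within August 2020. A minimal sketch of converting the values to calendar dates before doing date arithmetic follows; the helper name `parse_yyyymmdd` is hypothetical.

```python
from datetime import datetime


def parse_yyyymmdd(value):
    """Convert an integer such as 20200801 to a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


first = parse_yyyymmdd(20200801)  # Minimum reported for 기준일자
last = parse_yyyymmdd(20200820)   # Maximum reported for 기준일자
span_days = (last - first).days
print(first.isoformat(), last.isoformat(), span_days)

# Across a month boundary the integer difference stops matching the
# number of days: 20200901 - 20200831 is 70, yet the dates are 1 day apart.
integer_gap = 20200901 - 20200831
calendar_gap = (parse_yyyymmdd(20200901) - parse_yyyymmdd(20200831)).days
print(integer_gap, calendar_gap)
```

The same caveat applies to the mean, standard deviation and correlation values computed on the raw integers.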

성별 (sex)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value  Count
F      60
M      37
X      3

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: M
4th row: F
5th row: F

Common Values

Value  Count  Frequency (%)
F      60     60.0%
M      37     37.0%
X      3      3.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value  Count  Frequency (%)
f      60     60.0%
m      37     37.0%
x      3      3.0%

연령대 (age group)
Categorical

HIGH CORRELATION

Distinct: 11
Distinct (%): 11.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value             Count
20                31
25                18
50                15
45                10
55                7
Other values (6)  19

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 25
2nd row: 45
3rd row: 20
4th row: 20
5th row: 15

Common Values

Value  Count  Frequency (%)
20     31     31.0%
25     18     18.0%
50     15     15.0%
45     10     10.0%
55     7      7.0%
30     5      5.0%
35     4      4.0%
xx     3      3.0%
60     3      3.0%
15     2      2.0%

Length

[Figure: Histogram of lengths of the category]

소비인구(명) (consumer population, persons)
Real number (ℝ)

Distinct: 14
Distinct (%): 14.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 42.410535
Minimum: 22.479082
Maximum: 149.86055
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 22.479082
5th percentile: 22.479082
Q1: 22.479082
Median: 29.97211
Q3: 59.94422
95th percentile: 90.290981
Maximum: 149.86055
Range: 127.38147
Interquartile range (IQR): 37.465137

Descriptive statistics

Standard deviation: 26.13559
Coefficient of variation (CV): 0.6162523
Kurtosis: 4.0882192
Mean: 42.410535
Median Absolute Deviation (MAD): 7.4930275
Skewness: 1.8577519
Sum: 4241.0535
Variance: 683.06908
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=14)]
Common values

Value             Count  Frequency (%)
22.479082395      39     39.0%
29.97210986       13     13.0%
37.465137325      12     12.0%
59.94421972       9      9.0%
67.437247185      6      6.0%
44.95816479       5      5.0%
52.451192255      5      5.0%
82.423302115      3      3.0%
74.93027465       2      2.0%
97.409357045      2      2.0%
Other values (4)  4      4.0%

Minimum 10 values

Value         Count  Frequency (%)
22.479082395  39     39.0%
29.97210986   13     13.0%
37.465137325  12     12.0%
44.95816479   5      5.0%
52.451192255  5      5.0%
59.94421972   9      9.0%
67.437247185  6      6.0%
74.93027465   2      2.0%
82.423302115  3      3.0%
89.91632958   1      1.0%

Maximum 10 values

Value         Count  Frequency (%)
149.8605493   1      1.0%
142.36752184  1      1.0%
119.88843944  1      1.0%
97.409357045  2      2.0%
89.91632958   1      1.0%
82.423302115  3      3.0%
74.93027465   2      2.0%
67.437247185  6      6.0%
59.94421972   9      9.0%
52.451192255  5      5.0%
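
The frequency tables for 소비인구(명) list all 14 distinct values between them, so the summary statistics can be cross-checked by hand. The sketch below rebuilds the distribution from the (value, count) pairs shown above and recomputes the sum, mean, variance, standard deviation and coefficient of variation. It assumes the report uses the sample variance (dividing by n - 1), which is the convention that reproduces the reported figure.

```python
import math

# (value, count) pairs copied from the frequency tables of 소비인구(명).
# The four values with count 1 appear only in the extreme-value tables.
freq = [
    (22.479082395, 39), (29.97210986, 13), (37.465137325, 12),
    (44.95816479, 5), (52.451192255, 5), (59.94421972, 9),
    (67.437247185, 6), (74.93027465, 2), (82.423302115, 3),
    (89.91632958, 1), (97.409357045, 2), (119.88843944, 1),
    (142.36752184, 1), (149.8605493, 1),
]

n = sum(count for _, count in freq)
total = sum(value * count for value, count in freq)
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(count * (value - mean) ** 2 for value, count in freq) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation = standard deviation / mean

print(f"n={n} sum={total:.4f} mean={mean:.6f}")
print(f"variance={variance:.5f} std={std:.5f} cv={cv:.7f}")
```

The recomputed values agree with the reported Sum, Mean, Variance, Standard deviation and CV to the precision shown in the report.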

Interactions

[Figures: four interaction plots]

Correlations

              기준일자  성별   연령대  소비인구(명)
기준일자      1.000     0.000  0.000   0.000
성별          0.000     1.000  0.820   0.000
연령대        0.000     0.820  1.000   0.000
소비인구(명)  0.000     0.000  0.000   1.000

        성별   연령대
성별    1.000  0.676
연령대  0.676  1.000

              기준일자  소비인구(명)  성별   연령대
기준일자      1.000     -0.257        0.000  0.000
소비인구(명)  -0.257    1.000         0.000  0.000
성별          0.000     0.000         1.000  0.676
연령대        0.000     0.000         0.676  1.000
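
The report does not state which association measure produced each matrix. One common choice for a pair of categorical variables such as 성별 and 연령대 is Cramér's V, sketched below in plain Python without bias correction. The contingency tables in the example are hypothetical, because the report does not include the 성별 by 연령대 cross-tabulation, so the sketch cannot reproduce 0.676 or 0.820.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table (list of rows of counts)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    return math.sqrt(chi2 / (n * k))


# Hypothetical tables, not taken from the report.
perfect = cramers_v([[30, 0], [0, 20]])        # each row maps to one column
independent = cramers_v([[10, 10], [10, 10]])  # rows carry no information
print(round(perfect, 3), round(independent, 3))
```

Values near 1 indicate that one variable nearly determines the other, and values near 0 indicate no association.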

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    행정동코드  시도명      시군구명  행정동명         기준일자  성별  연령대  소비인구(명)
0   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200801  F     25      37.465137
1   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200801  F     45      37.465137
2   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200801  M     20      22.479082
3   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200802  F     20      22.479082
4   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200803  F     15      22.479082
5   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200803  F     20      149.860549
6   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200803  F     25      82.423302
7   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200803  F     30      74.930275
8   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200803  F     40      37.465137
9   1111061500  서울특별시  종로구    종로1.2.3.4가동  20200803  F     45      119.888439

Last rows

    행정동코드  시도명      시군구명  행정동명         기준일자  성별  연령대  소비인구(명)
90  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200819  F     55      22.479082
91  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200819  M     25      22.479082
92  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200819  M     60      22.479082
93  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200820  F     20      74.930275
94  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200820  F     30      22.479082
95  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200820  F     35      22.479082
96  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200820  F     45      52.451192
97  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200820  F     50      22.479082
98  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200820  M     20      59.94422
99  1111061500  서울특별시  종로구    종로1.2.3.4가동  20200801  F     20      22.479082