Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Categorical6
Numeric2

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
행정동코드 is highly overall correlated with 기준일자 and 1 other fieldsHigh correlation
행정동명 is highly overall correlated with 기준일자 and 1 other fieldsHigh correlation
기준일자 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
성별 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 성별High correlation

Reproduction

Analysis started2023-12-10 12:19:19.394766
Analysis finished2023-12-10 12:19:20.434433
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1111061500
55 
1111053000
45 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1111053000
2nd row1111053000
3rd row1111053000
4th row1111053000
5th row1111053000

Common Values

ValueCountFrequency (%)
1111061500 55
55.0%
1111053000 45
45.0%

Length

2023-12-10T21:19:20.506718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:19:20.640416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1111061500 55
55.0%
1111053000 45
45.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2023-12-10T21:19:20.746912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:19:20.838243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로구
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
종로구 100
100.0%

Length

2023-12-10T21:19:20.965316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:19:21.084500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로구 100
100.0%

행정동명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로1.2.3.4가동
55 
사직동
45 

Length

Max length11
Median length11
Mean length7.4
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사직동
2nd row사직동
3rd row사직동
4th row사직동
5th row사직동

Common Values

ValueCountFrequency (%)
종로1.2.3.4가동 55
55.0%
사직동 45
45.0%

Length

2023-12-10T21:19:21.209198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:19:21.323388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로1.2.3.4가동 55
55.0%
사직동 45
45.0%

기준일자
Real number (ℝ)

HIGH CORRELATION 

Distinct32
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200868
Minimum20200801
Maximum20201031
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:19:21.447618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200801
5-th percentile20200801
Q120200805
median20200808
Q320201014
95-th percentile20201026
Maximum20201031
Range230
Interquartile range (IQR)209

Descriptive statistics

Standard deviation97.579378
Coefficient of variation (CV)4.8304546 × 10-6
Kurtosis-1.130357
Mean20200868
Median Absolute Deviation (MAD)4
Skewness0.94012066
Sum2.0200868 × 109
Variance9521.7349
MonotonicityNot monotonic
2023-12-10T21:19:21.581443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
20200808 10
 
10.0%
20200809 8
 
8.0%
20200801 7
 
7.0%
20200807 7
 
7.0%
20200806 7
 
7.0%
20200805 7
 
7.0%
20200803 6
 
6.0%
20200804 6
 
6.0%
20200810 5
 
5.0%
20200802 3
 
3.0%
Other values (22) 34
34.0%
ValueCountFrequency (%)
20200801 7
7.0%
20200802 3
 
3.0%
20200803 6
6.0%
20200804 6
6.0%
20200805 7
7.0%
20200806 7
7.0%
20200807 7
7.0%
20200808 10
10.0%
20200809 8
8.0%
20200810 5
5.0%
ValueCountFrequency (%)
20201031 2
2.0%
20201029 1
 
1.0%
20201028 1
 
1.0%
20201026 2
2.0%
20201025 1
 
1.0%
20201024 1
 
1.0%
20201023 1
 
1.0%
20201022 1
 
1.0%
20201021 3
3.0%
20201020 1
 
1.0%

성별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
M
77 
F
20 
X
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowF
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 77
77.0%
F 20
 
20.0%
X 3
 
3.0%

Length

2023-12-10T21:19:21.719997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:19:21.821253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 77
77.0%
f 20
 
20.0%
x 3
 
3.0%

연령대
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20
36 
25
29 
30
16 
15
35
Other values (3)

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row20
2nd row25
3rd row20
4th row25
5th row20

Common Values

ValueCountFrequency (%)
20 36
36.0%
25 29
29.0%
30 16
16.0%
15 9
 
9.0%
35 4
 
4.0%
xx 3
 
3.0%
45 2
 
2.0%
50 1
 
1.0%

Length

2023-12-10T21:19:21.961107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:19:22.151222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 36
36.0%
25 29
29.0%
30 16
16.0%
15 9
 
9.0%
35 4
 
4.0%
xx 3
 
3.0%
45 2
 
2.0%
50 1
 
1.0%

소비인구(명)
Real number (ℝ)

Distinct14
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.785187
Minimum22.479082
Maximum142.36752
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:19:22.298929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22.479082
5-th percentile22.479082
Q122.479082
median29.97211
Q352.451192
95-th percentile112.77006
Maximum142.36752
Range119.88844
Interquartile range (IQR)29.97211

Descriptive statistics

Standard deviation28.243824
Coefficient of variation (CV)0.66013089
Kurtosis2.5425702
Mean42.785187
Median Absolute Deviation (MAD)7.4930275
Skewness1.7997256
Sum4278.5187
Variance797.71357
MonotonicityNot monotonic
2023-12-10T21:19:22.472631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
22.479082395 37
37.0%
29.97210986 18
18.0%
37.465137325 10
 
10.0%
44.95816479 9
 
9.0%
52.451192255 6
 
6.0%
59.94421972 6
 
6.0%
119.88843944 4
 
4.0%
97.409357045 2
 
2.0%
112.39541198 2
 
2.0%
74.93027465 2
 
2.0%
Other values (4) 4
 
4.0%
ValueCountFrequency (%)
22.479082395 37
37.0%
29.97210986 18
18.0%
37.465137325 10
 
10.0%
44.95816479 9
 
9.0%
52.451192255 6
 
6.0%
59.94421972 6
 
6.0%
67.437247185 1
 
1.0%
74.93027465 2
 
2.0%
89.91632958 1
 
1.0%
97.409357045 2
 
2.0%
ValueCountFrequency (%)
142.36752184 1
 
1.0%
119.88843944 4
4.0%
112.39541198 2
 
2.0%
104.90238451 1
 
1.0%
97.409357045 2
 
2.0%
89.91632958 1
 
1.0%
74.93027465 2
 
2.0%
67.437247185 1
 
1.0%
59.94421972 6
6.0%
52.451192255 6
6.0%

Interactions

2023-12-10T21:19:19.973192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:19:19.752803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:19:20.079887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:19:19.852261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:19:22.575986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드행정동명기준일자성별연령대소비인구(명)
행정동코드1.0000.9990.8790.2340.0000.568
행정동명0.9991.0000.8790.2340.0000.568
기준일자0.8790.8791.0000.1490.0000.131
성별0.2340.2340.1491.0000.7860.000
연령대0.0000.0000.0000.7861.0000.000
소비인구(명)0.5680.5680.1310.0000.0001.000
2023-12-10T21:19:22.702968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령대행정동코드행정동명성별
연령대1.0000.0000.0000.685
행정동코드0.0001.0000.9800.380
행정동명0.0000.9801.0000.380
성별0.6850.3800.3801.000
2023-12-10T21:19:22.824360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자소비인구(명)행정동코드행정동명성별연령대
기준일자1.000-0.3510.6800.6800.2560.000
소비인구(명)-0.3511.0000.4190.4190.0000.000
행정동코드0.6800.4191.0000.9800.3800.000
행정동명0.6800.4190.9801.0000.3800.000
성별0.2560.0000.3800.3801.0000.685
연령대0.0000.0000.0000.0000.6851.000

Missing values

2023-12-10T21:19:20.232850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:19:20.383884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01111053000서울특별시종로구사직동20201017M2029.97211
11111053000서울특별시종로구사직동20201018F2544.958165
21111053000서울특별시종로구사직동20201021M2052.451192
31111053000서울특별시종로구사직동20201012M2552.451192
41111053000서울특별시종로구사직동20201015M2022.479082
51111053000서울특별시종로구사직동20200804M2022.479082
61111053000서울특별시종로구사직동20201025M2022.479082
71111053000서울특별시종로구사직동20200811M3522.479082
81111053000서울특별시종로구사직동20201019M2029.97211
91111053000서울특별시종로구사직동20201017M2522.479082
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901111061500서울특별시종로구종로1.2.3.4가동20200804M3037.465137
911111061500서울특별시종로구종로1.2.3.4가동20200804M2552.451192
921111061500서울특별시종로구종로1.2.3.4가동20200804M2044.958165
931111061500서울특별시종로구종로1.2.3.4가동20200808M4522.479082
941111061500서울특별시종로구종로1.2.3.4가동20200804F2522.479082
951111061500서울특별시종로구종로1.2.3.4가동20200804F2029.97211
961111061500서울특별시종로구종로1.2.3.4가동20200803M3037.465137
971111061500서울특별시종로구종로1.2.3.4가동20200809F2529.97211
981111061500서울특별시종로구종로1.2.3.4가동20200809M3522.479082
991111061500서울특별시종로구종로1.2.3.4가동20200810M2537.465137