Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Categorical6
Numeric2

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
행정동명 is highly overall correlated with 기준일자 and 1 other fieldsHigh correlation
행정동코드 is highly overall correlated with 기준일자 and 1 other fieldsHigh correlation
기준일자 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
성별 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 성별High correlation
행정동코드 is highly imbalanced (89.8%)Imbalance
행정동명 is highly imbalanced (89.8%)Imbalance

Reproduction

Analysis started2023-12-10 13:37:59.191910
Analysis finished2023-12-10 13:38:00.201098
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1111061500
98 
1111055000
 
1
1111058000
 
1

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row1111055000
2nd row1111058000
3rd row1111061500
4th row1111061500
5th row1111061500

Common Values

ValueCountFrequency (%)
1111061500 98
98.0%
1111055000 1
 
1.0%
1111058000 1
 
1.0%

Length

2023-12-10T22:38:00.263706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:38:00.360797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1111061500 98
98.0%
1111055000 1
 
1.0%
1111058000 1
 
1.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2023-12-10T22:38:00.479587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:38:00.595853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로구
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
종로구 100
100.0%

Length

2023-12-10T22:38:00.747718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:38:00.867281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로구 100
100.0%

행정동명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로1.2.3.4가동
98 
부암동
 
1
교남동
 
1

Length

Max length11
Median length11
Mean length10.84
Min length3

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row부암동
2nd row교남동
3rd row종로1.2.3.4가동
4th row종로1.2.3.4가동
5th row종로1.2.3.4가동

Common Values

ValueCountFrequency (%)
종로1.2.3.4가동 98
98.0%
부암동 1
 
1.0%
교남동 1
 
1.0%

Length

2023-12-10T22:38:01.010588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:38:01.152773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로1.2.3.4가동 98
98.0%
부암동 1
 
1.0%
교남동 1
 
1.0%

기준일자
Real number (ℝ)

HIGH CORRELATION 

Distinct20
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200320
Minimum20200302
Maximum20200525
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:38:01.265580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200302
5-th percentile20200309
Q120200310
median20200316
Q320200323
95-th percentile20200324
Maximum20200525
Range223
Interquartile range (IQR)13

Descriptive statistics

Standard deviation28.901244
Coefficient of variation (CV)1.430732 × 10-6
Kurtosis43.333351
Mean20200320
Median Absolute Deviation (MAD)6
Skewness6.4945584
Sum2.020032 × 109
Variance835.28192
MonotonicityNot monotonic
2023-12-10T22:38:01.414793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
20200309 17
17.0%
20200324 12
12.0%
20200310 12
12.0%
20200323 11
11.0%
20200320 9
9.0%
20200312 5
 
5.0%
20200311 5
 
5.0%
20200318 4
 
4.0%
20200319 4
 
4.0%
20200317 4
 
4.0%
Other values (10) 17
17.0%
ValueCountFrequency (%)
20200302 1
 
1.0%
20200303 2
 
2.0%
20200306 1
 
1.0%
20200309 17
17.0%
20200310 12
12.0%
20200311 5
 
5.0%
20200312 5
 
5.0%
20200313 3
 
3.0%
20200314 2
 
2.0%
20200316 3
 
3.0%
ValueCountFrequency (%)
20200525 1
 
1.0%
20200507 1
 
1.0%
20200325 1
 
1.0%
20200324 12
12.0%
20200323 11
11.0%
20200321 2
 
2.0%
20200320 9
9.0%
20200319 4
 
4.0%
20200318 4
 
4.0%
20200317 4
 
4.0%

성별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
F
55 
M
42 
X
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowX
4th rowF
5th rowM

Common Values

ValueCountFrequency (%)
F 55
55.0%
M 42
42.0%
X 3
 
3.0%

Length

2023-12-10T22:38:01.556919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:38:01.654577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 55
55.0%
m 42
42.0%
x 3
 
3.0%

연령대
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20
25 
50
20 
25
17 
45
12 
55
Other values (6)
19 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row45
2nd row45
3rd rowxx
4th row25
5th row50

Common Values

ValueCountFrequency (%)
20 25
25.0%
50 20
20.0%
25 17
17.0%
45 12
12.0%
55 7
 
7.0%
30 6
 
6.0%
60 4
 
4.0%
xx 3
 
3.0%
40 3
 
3.0%
35 2
 
2.0%

Length

2023-12-10T22:38:01.782392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
20 25
25.0%
50 20
20.0%
25 17
17.0%
45 12
12.0%
55 7
 
7.0%
30 6
 
6.0%
60 4
 
4.0%
xx 3
 
3.0%
40 3
 
3.0%
35 2
 
2.0%

소비인구(명)
Real number (ℝ)

Distinct15
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.932797
Minimum22.861429
Maximum365.78287
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:38:01.896857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22.861429
5-th percentile22.861429
Q122.861429
median30.481906
Q353.343335
95-th percentile107.44872
Maximum365.78287
Range342.92144
Interquartile range (IQR)30.481906

Descriptive statistics

Standard deviation44.858933
Coefficient of variation (CV)0.93587138
Kurtosis26.922152
Mean47.932797
Median Absolute Deviation (MAD)7.6204765
Skewness4.5042285
Sum4793.2797
Variance2012.3239
MonotonicityNot monotonic
2023-12-10T22:38:02.032178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
22.86142941 33
33.0%
30.48190588 18
18.0%
45.72285882 11
 
11.0%
53.34333529 10
 
10.0%
38.10238235 9
 
9.0%
60.96381176 5
 
5.0%
76.2047647 3
 
3.0%
106.68667058 3
 
3.0%
137.16857646 2
 
2.0%
365.78287056 1
 
1.0%
Other values (5) 5
 
5.0%
ValueCountFrequency (%)
22.86142941 33
33.0%
30.48190588 18
18.0%
38.10238235 9
 
9.0%
45.72285882 11
 
11.0%
53.34333529 10
 
10.0%
60.96381176 5
 
5.0%
76.2047647 3
 
3.0%
83.82524117 1
 
1.0%
91.44571764 1
 
1.0%
99.06619411 1
 
1.0%
ValueCountFrequency (%)
365.78287056 1
 
1.0%
220.99381763 1
 
1.0%
137.16857646 2
 
2.0%
121.92762352 1
 
1.0%
106.68667058 3
3.0%
99.06619411 1
 
1.0%
91.44571764 1
 
1.0%
83.82524117 1
 
1.0%
76.2047647 3
3.0%
60.96381176 5
5.0%

Interactions

2023-12-10T22:37:59.732160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:37:59.549946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:37:59.821575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:37:59.629114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:38:02.160422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드행정동명기준일자성별연령대소비인구(명)
행정동코드1.0001.0000.9400.0000.0000.000
행정동명1.0001.0000.9400.0000.0000.000
기준일자0.9400.9401.0000.0000.0000.000
성별0.0000.0000.0001.0000.8160.000
연령대0.0000.0000.0000.8161.0000.000
소비인구(명)0.0000.0000.0000.0000.0001.000
2023-12-10T22:38:02.287730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동명성별행정동코드연령대
행정동명1.0000.0001.0000.000
성별0.0001.0000.0000.670
행정동코드1.0000.0001.0000.000
연령대0.0000.6700.0001.000
2023-12-10T22:38:02.415074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자소비인구(명)행정동코드행정동명성별연령대
기준일자1.000-0.2380.7000.7000.0000.000
소비인구(명)-0.2381.0000.0000.0000.0000.000
행정동코드0.7000.0001.0001.0000.0000.000
행정동명0.7000.0001.0001.0000.0000.000
성별0.0000.0000.0000.0001.0000.670
연령대0.0000.0000.0000.0000.6701.000

Missing values

2023-12-10T22:37:59.963271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:38:00.135343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01111055000서울특별시종로구부암동20200507F4522.861429
11111058000서울특별시종로구교남동20200525F4522.861429
21111061500서울특별시종로구종로1.2.3.4가동20200324Xxx22.861429
31111061500서울특별시종로구종로1.2.3.4가동20200303F2522.861429
41111061500서울특별시종로구종로1.2.3.4가동20200303M5030.481906
51111061500서울특별시종로구종로1.2.3.4가동20200306F2022.861429
61111061500서울특별시종로구종로1.2.3.4가동20200309F20365.782871
71111061500서울특별시종로구종로1.2.3.4가동20200309M2591.445718
81111061500서울특별시종로구종로1.2.3.4가동20200309F2599.066194
91111061500서울특별시종로구종로1.2.3.4가동20200309F3045.722859
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901111061500서울특별시종로구종로1.2.3.4가동20200324F3022.861429
911111061500서울특별시종로구종로1.2.3.4가동20200324F4522.861429
921111061500서울특별시종로구종로1.2.3.4가동20200324F5045.722859
931111061500서울특별시종로구종로1.2.3.4가동20200324F5530.481906
941111061500서울특별시종로구종로1.2.3.4가동20200302F5030.481906
951111061500서울특별시종로구종로1.2.3.4가동20200324M2530.481906
961111061500서울특별시종로구종로1.2.3.4가동20200324M3522.861429
971111061500서울특별시종로구종로1.2.3.4가동20200324M5030.481906
981111061500서울특별시종로구종로1.2.3.4가동20200324M5530.481906
991111061500서울특별시종로구종로1.2.3.4가동20200325F2045.722859