Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Categorical6
Numeric2

Alerts

행정동코드 has constant value ""Constant
시도명 has constant value ""Constant
시군구명 has constant value ""Constant
행정동명 has constant value ""Constant
성별 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 성별High correlation

Reproduction

Analysis started2023-12-10 12:02:34.545010
Analysis finished2023-12-10 12:02:36.643340
Duration2.1 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1111053000
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1111053000
2nd row1111053000
3rd row1111053000
4th row1111053000
5th row1111053000

Common Values

ValueCountFrequency (%)
1111053000 100
100.0%

Length

2023-12-10T21:02:36.750496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:02:36.927357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1111053000 100
100.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2023-12-10T21:02:37.104542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:02:37.269782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로구
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
종로구 100
100.0%

Length

2023-12-10T21:02:37.455181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:02:37.633697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로구 100
100.0%

행정동명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
사직동
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사직동
2nd row사직동
3rd row사직동
4th row사직동
5th row사직동

Common Values

ValueCountFrequency (%)
사직동 100
100.0%

Length

2023-12-10T21:02:37.856382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:02:38.022339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사직동 100
100.0%

기준일자
Real number (ℝ)

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200305
Minimum20200301
Maximum20200309
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:02:38.216820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200301
5-th percentile20200301
Q120200303
median20200305
Q320200307
95-th percentile20200308
Maximum20200309
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.382936
Coefficient of variation (CV)1.1796535 × 10-7
Kurtosis-1.1594821
Mean20200305
Median Absolute Deviation (MAD)2
Skewness0.038398555
Sum2.0200305 × 109
Variance5.6783838
MonotonicityNot monotonic
2023-12-10T21:02:38.468929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
20200306 14
14.0%
20200302 13
13.0%
20200303 13
13.0%
20200305 12
12.0%
20200308 12
12.0%
20200304 11
11.0%
20200307 11
11.0%
20200301 10
10.0%
20200309 4
 
4.0%
ValueCountFrequency (%)
20200301 10
10.0%
20200302 13
13.0%
20200303 13
13.0%
20200304 11
11.0%
20200305 12
12.0%
20200306 14
14.0%
20200307 11
11.0%
20200308 12
12.0%
20200309 4
 
4.0%
ValueCountFrequency (%)
20200309 4
 
4.0%
20200308 12
12.0%
20200307 11
11.0%
20200306 14
14.0%
20200305 12
12.0%
20200304 11
11.0%
20200303 13
13.0%
20200302 13
13.0%
20200301 10
10.0%

성별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
F
68 
M
28 
X
 
4

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowF
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
F 68
68.0%
M 28
28.0%
X 4
 
4.0%

Length

2023-12-10T21:02:39.122731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:02:39.722908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 68
68.0%
m 28
28.0%
x 4
 
4.0%

연령대
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
25
16 
20
14 
30
13 
35
12 
45
10 
Other values (7)
35 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row25
3rd row30
4th row40
5th row45

Common Values

ValueCountFrequency (%)
25 16
16.0%
20 14
14.0%
30 13
13.0%
35 12
12.0%
45 10
10.0%
40 8
8.0%
50 7
7.0%
55 7
7.0%
xx 4
 
4.0%
15 4
 
4.0%
Other values (2) 5
 
5.0%

Length

2023-12-10T21:02:40.377770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
25 16
16.0%
20 14
14.0%
30 13
13.0%
35 12
12.0%
45 10
10.0%
40 8
8.0%
50 7
7.0%
55 7
7.0%
xx 4
 
4.0%
15 4
 
4.0%
Other values (2) 5
 
5.0%

소비인구(명)
Real number (ℝ)

Distinct15
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.837916
Minimum22.861429
Maximum152.40953
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:02:40.685926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22.861429
5-th percentile22.861429
Q130.481906
median38.102382
Q368.584288
95-th percentile114.30715
Maximum152.40953
Range129.5481
Interquartile range (IQR)38.102382

Descriptive statistics

Standard deviation30.005169
Coefficient of variation (CV)0.60205506
Kurtosis1.834307
Mean49.837916
Median Absolute Deviation (MAD)15.240953
Skewness1.4912371
Sum4983.7916
Variance900.31019
MonotonicityNot monotonic
2023-12-10T21:02:40.977802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
22.86142941 22
22.0%
30.48190588 17
17.0%
38.10238235 17
17.0%
45.72285882 9
9.0%
68.58428823 9
9.0%
53.34333529 7
 
7.0%
106.68667058 4
 
4.0%
114.30714705 3
 
3.0%
76.2047647 3
 
3.0%
152.4095294 2
 
2.0%
Other values (5) 7
 
7.0%
ValueCountFrequency (%)
22.86142941 22
22.0%
30.48190588 17
17.0%
38.10238235 17
17.0%
45.72285882 9
9.0%
53.34333529 7
 
7.0%
60.96381176 2
 
2.0%
68.58428823 9
9.0%
76.2047647 3
 
3.0%
83.82524117 1
 
1.0%
91.44571764 2
 
2.0%
ValueCountFrequency (%)
152.4095294 2
 
2.0%
121.92762352 1
 
1.0%
114.30714705 3
 
3.0%
106.68667058 4
4.0%
99.06619411 1
 
1.0%
91.44571764 2
 
2.0%
83.82524117 1
 
1.0%
76.2047647 3
 
3.0%
68.58428823 9
9.0%
60.96381176 2
 
2.0%

Interactions

2023-12-10T21:02:35.596718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:02:35.282405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:02:36.118130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:02:35.441953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:02:41.206450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자성별연령대소비인구(명)
기준일자1.0000.0000.0000.000
성별0.0001.0000.9330.532
연령대0.0000.9331.0000.334
소비인구(명)0.0000.5320.3341.000
2023-12-10T21:02:41.393321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별연령대
성별1.0000.676
연령대0.6761.000
2023-12-10T21:02:41.549827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자소비인구(명)성별연령대
기준일자1.0000.0780.0000.000
소비인구(명)0.0781.0000.2660.142
성별0.0000.2661.0000.676
연령대0.0000.1420.6761.000

Missing values

2023-12-10T21:02:36.308205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:02:36.535631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01111053000서울특별시종로구사직동20200301F2045.722859
11111053000서울특별시종로구사직동20200301F2530.481906
21111053000서울특별시종로구사직동20200301F3038.102382
31111053000서울특별시종로구사직동20200301F4030.481906
41111053000서울특별시종로구사직동20200301F4530.481906
51111053000서울특별시종로구사직동20200301M2030.481906
61111053000서울특별시종로구사직동20200301M2522.861429
71111053000서울특별시종로구사직동20200301M3522.861429
81111053000서울특별시종로구사직동20200301Xxx38.102382
91111053000서울특별시종로구사직동20200302F1522.861429
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901111053000서울특별시종로구사직동20200308F5045.722859
911111053000서울특별시종로구사직동20200308M2538.102382
921111053000서울특별시종로구사직동20200308M3038.102382
931111053000서울특별시종로구사직동20200308M5030.481906
941111053000서울특별시종로구사직동20200308M5522.861429
951111053000서울특별시종로구사직동20200309F1530.481906
961111053000서울특별시종로구사직동20200309F2053.343335
971111053000서울특별시종로구사직동20200309F25114.307147
981111053000서울특별시종로구사직동20200309F3091.445718
991111053000서울특별시종로구사직동20200301F1522.861429