Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory69.3 B

Variable types

Numeric3
Categorical5

Alerts

시도명 has constant value ""Constant
행정동명 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
시군구명 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
행정동코드 is highly overall correlated with 시군구명 and 1 other fieldsHigh correlation
연령대 is highly overall correlated with 성별High correlation
성별 is highly overall correlated with 연령대High correlation

Reproduction

Analysis started2024-04-22 00:32:22.508615
Analysis finished2024-04-22 00:32:23.721834
Duration1.21 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1151372 × 109
Minimum1.111053 × 109
Maximum1.120056 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T09:32:23.780999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.111053 × 109
5-th percentile1.111053 × 109
Q11.1110615 × 109
median1.1110615 × 109
Q31.1200535 × 109
95-th percentile1.120056 × 109
Maximum1.120056 × 109
Range9003000
Interquartile range (IQR)8992000

Descriptive statistics

Standard deviation4336724.6
Coefficient of variation (CV)0.0038889605
Kurtosis-1.9351511
Mean1.1151372 × 109
Median Absolute Deviation (MAD)8500
Skewness0.18278435
Sum1.1151372 × 1011
Variance1.880718 × 1013
MonotonicityIncreasing
2024-04-22T09:32:23.903808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1111061500 29
29.0%
1120056000 20
20.0%
1120053500 16
16.0%
1111053000 13
13.0%
1111056000 5
 
5.0%
1114064500 3
 
3.0%
1117063000 3
 
3.0%
1120054000 3
 
3.0%
1111055000 2
 
2.0%
1111057000 2
 
2.0%
Other values (3) 4
 
4.0%
ValueCountFrequency (%)
1111053000 13
13.0%
1111055000 2
 
2.0%
1111056000 5
 
5.0%
1111057000 2
 
2.0%
1111061500 29
29.0%
1114064500 3
 
3.0%
1117053000 1
 
1.0%
1117063000 3
 
3.0%
1117069000 1
 
1.0%
1120052000 2
 
2.0%
ValueCountFrequency (%)
1120056000 20
20.0%
1120054000 3
 
3.0%
1120053500 16
16.0%
1120052000 2
 
2.0%
1117069000 1
 
1.0%
1117063000 3
 
3.0%
1117053000 1
 
1.0%
1114064500 3
 
3.0%
1111061500 29
29.0%
1111057000 2
 
2.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2024-04-22T09:32:24.015659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:32:24.105331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로구
51 
성동구
41 
용산구
 
5
중구
 
3

Length

Max length3
Median length3
Mean length2.97
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
종로구 51
51.0%
성동구 41
41.0%
용산구 5
 
5.0%
중구 3
 
3.0%

Length

2024-04-22T09:32:24.209395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:32:24.301455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로구 51
51.0%
성동구 41
41.0%
용산구 5
 
5.0%
중구 3
 
3.0%

행정동명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로1.2.3.4가동
29 
행당1동
20 
왕십리도선동
16 
사직동
13 
평창동
Other values (8)
17 

Length

Max length11
Median length6
Mean length6.08
Min length3

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row사직동
2nd row사직동
3rd row사직동
4th row사직동
5th row사직동

Common Values

ValueCountFrequency (%)
종로1.2.3.4가동 29
29.0%
행당1동 20
20.0%
왕십리도선동 16
16.0%
사직동 13
13.0%
평창동 5
 
5.0%
청구동 3
 
3.0%
이촌1동 3
 
3.0%
마장동 3
 
3.0%
부암동 2
 
2.0%
무악동 2
 
2.0%
Other values (3) 4
 
4.0%

Length

2024-04-22T09:32:24.416644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
종로1.2.3.4가동 29
29.0%
행당1동 20
20.0%
왕십리도선동 16
16.0%
사직동 13
13.0%
평창동 5
 
5.0%
청구동 3
 
3.0%
이촌1동 3
 
3.0%
마장동 3
 
3.0%
부암동 2
 
2.0%
무악동 2
 
2.0%
Other values (3) 4
 
4.0%

기준일자
Real number (ℝ)

Distinct51
Distinct (%)51.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200883
Minimum20200802
Maximum20201031
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T09:32:24.557462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200802
5-th percentile20200803
Q120200809
median20200825
Q320201004
95-th percentile20201028
Maximum20201031
Range229
Interquartile range (IQR)195

Descriptive statistics

Standard deviation90.456282
Coefficient of variation (CV)4.4778381 × 10-6
Kurtosis-1.348171
Mean20200883
Median Absolute Deviation (MAD)21
Skewness0.65677008
Sum2.0200883 × 109
Variance8182.339
MonotonicityNot monotonic
2024-04-22T09:32:24.716369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200803 7
 
7.0%
20200813 6
 
6.0%
20200804 5
 
5.0%
20201005 5
 
5.0%
20200808 5
 
5.0%
20200810 4
 
4.0%
20200827 3
 
3.0%
20201028 3
 
3.0%
20200825 3
 
3.0%
20200826 3
 
3.0%
Other values (41) 56
56.0%
ValueCountFrequency (%)
20200802 1
 
1.0%
20200803 7
7.0%
20200804 5
5.0%
20200805 2
 
2.0%
20200806 2
 
2.0%
20200807 2
 
2.0%
20200808 5
5.0%
20200809 2
 
2.0%
20200810 4
4.0%
20200811 2
 
2.0%
ValueCountFrequency (%)
20201031 1
 
1.0%
20201030 2
2.0%
20201029 2
2.0%
20201028 3
3.0%
20201027 2
2.0%
20201026 1
 
1.0%
20201023 1
 
1.0%
20201021 1
 
1.0%
20201020 1
 
1.0%
20201019 1
 
1.0%

성별
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
F
66 
M
34 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowF
4th rowM
5th rowF

Common Values

ValueCountFrequency (%)
F 66
66.0%
M 34
34.0%

Length

2024-04-22T09:32:24.840941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:32:24.928732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 66
66.0%
m 34
34.0%

연령대
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.25
Minimum15
Maximum55
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T09:32:25.007703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile20
Q140
median45
Q345
95-th percentile50
Maximum55
Range40
Interquartile range (IQR)5

Descriptive statistics

Standard deviation8.2380921
Coefficient of variation (CV)0.19498443
Kurtosis3.1837606
Mean42.25
Median Absolute Deviation (MAD)0
Skewness-1.9148318
Sum4225
Variance67.866162
MonotonicityNot monotonic
2024-04-22T09:32:25.111763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
45 52
52.0%
40 19
 
19.0%
50 16
 
16.0%
20 8
 
8.0%
35 2
 
2.0%
25 1
 
1.0%
15 1
 
1.0%
55 1
 
1.0%
ValueCountFrequency (%)
15 1
 
1.0%
20 8
 
8.0%
25 1
 
1.0%
35 2
 
2.0%
40 19
 
19.0%
45 52
52.0%
50 16
 
16.0%
55 1
 
1.0%
ValueCountFrequency (%)
55 1
 
1.0%
50 16
 
16.0%
45 52
52.0%
40 19
 
19.0%
35 2
 
2.0%
25 1
 
1.0%
20 8
 
8.0%
15 1
 
1.0%
Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
22.479082395
73 
29.97210986
19 
37.465137325
 
5
44.95816479
 
3

Length

Max length12
Median length12
Mean length11.78
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row22.479082395
2nd row22.479082395
3rd row22.479082395
4th row22.479082395
5th row22.479082395

Common Values

ValueCountFrequency (%)
22.479082395 73
73.0%
29.97210986 19
 
19.0%
37.465137325 5
 
5.0%
44.95816479 3
 
3.0%

Length

2024-04-22T09:32:25.232503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:32:25.322111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
22.479082395 73
73.0%
29.97210986 19
 
19.0%
37.465137325 5
 
5.0%
44.95816479 3
 
3.0%

Interactions

2024-04-22T09:32:23.232624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:22.773408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:23.011188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:23.312697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:22.847804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:23.081426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:23.393917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:22.930818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:32:23.151914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-22T09:32:25.395518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드시군구명행정동명기준일자성별연령대소비인구(명)
행정동코드1.0001.0001.0000.3060.1880.6740.000
시군구명1.0001.0001.0000.2340.1150.6800.000
행정동명1.0001.0001.0000.4800.2280.7180.000
기준일자0.3060.2340.4801.0000.1640.2020.000
성별0.1880.1150.2280.1641.0000.6890.142
연령대0.6740.6800.7180.2020.6891.0000.000
소비인구(명)0.0000.0000.0000.0000.1420.0001.000
2024-04-22T09:32:25.500343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소비인구(명)성별행정동명시군구명
소비인구(명)1.0000.0920.0000.000
성별0.0921.0000.1970.073
행정동명0.0000.1971.0000.952
시군구명0.0000.0730.9521.000
2024-04-22T09:32:25.590095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드기준일자연령대시군구명행정동명성별소비인구(명)
행정동코드1.000-0.165-0.1921.0000.9520.0730.000
기준일자-0.1651.000-0.1100.1490.2520.1160.000
연령대-0.192-0.1101.0000.3510.4170.5080.000
시군구명1.0000.1490.3511.0000.9520.0730.000
행정동명0.9520.2520.4170.9521.0000.1970.000
성별0.0730.1160.5080.0730.1971.0000.092
소비인구(명)0.0000.0000.0000.0000.0000.0921.000

Missing values

2024-04-22T09:32:23.542510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T09:32:23.674670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01111053000서울특별시종로구사직동20200920F4522.479082
11111053000서울특별시종로구사직동20200917F4522.479082
21111053000서울특별시종로구사직동20200809F4522.479082
31111053000서울특별시종로구사직동20201010M5022.479082
41111053000서울특별시종로구사직동20200807F4522.479082
51111053000서울특별시종로구사직동20200808F4544.958165
61111053000서울특별시종로구사직동20201018M5022.479082
71111053000서울특별시종로구사직동20200919M4522.479082
81111053000서울특별시종로구사직동20201004F4522.479082
91111053000서울특별시종로구사직동20200820F4522.479082
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901120056000서울특별시성동구행당1동20200808F4029.97211
911120056000서울특별시성동구행당1동20200807F4537.465137
921120056000서울특별시성동구행당1동20200806M4537.465137
931120056000서울특별시성동구행당1동20200818F4522.479082
941120056000서울특별시성동구행당1동20200806F4522.479082
951120056000서울특별시성동구행당1동20200804F4544.958165
961120056000서울특별시성동구행당1동20200804F4022.479082
971120056000서울특별시성동구행당1동20200803F4522.479082
981120056000서울특별시성동구행당1동20200803F4022.479082
991120056000서울특별시성동구행당1동20200802F4522.479082