Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory69.3 B

Variable types

Numeric3
Categorical5

Alerts

시도명 has constant value ""Constant
행정동명 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
시군구명 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
행정동코드 is highly overall correlated with 시군구명 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-10 13:44:59.910471
Analysis finished2023-12-10 13:45:02.197939
Duration2.29 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1145679 × 109
Minimum1.111053 × 109
Maximum1.120056 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:45:02.294452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.111053 × 109
5-th percentile1.111053 × 109
Q11.1110606 × 109
median1.1110615 × 109
Q31.1200535 × 109
95-th percentile1.120056 × 109
Maximum1.120056 × 109
Range9003000
Interquartile range (IQR)8992875

Descriptive statistics

Standard deviation4133739.2
Coefficient of variation (CV)0.0037088266
Kurtosis-1.7090381
Mean1.1145679 × 109
Median Absolute Deviation (MAD)8500
Skewness0.44255384
Sum1.1145679 × 1011
Variance1.70878 × 1013
MonotonicityIncreasing
2023-12-10T22:45:02.466281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1111061500 29
29.0%
1120056000 19
19.0%
1120053500 13
13.0%
1111056000 10
 
10.0%
1111053000 8
 
8.0%
1114064500 5
 
5.0%
1117057000 4
 
4.0%
1111057000 3
 
3.0%
1111055000 2
 
2.0%
1111058000 2
 
2.0%
Other values (5) 5
 
5.0%
ValueCountFrequency (%)
1111053000 8
 
8.0%
1111055000 2
 
2.0%
1111056000 10
 
10.0%
1111057000 3
 
3.0%
1111058000 2
 
2.0%
1111061500 29
29.0%
1111067000 1
 
1.0%
1114064500 5
 
5.0%
1117051000 1
 
1.0%
1117057000 4
 
4.0%
ValueCountFrequency (%)
1120056000 19
19.0%
1120053500 13
13.0%
1117070000 1
 
1.0%
1117069000 1
 
1.0%
1117063000 1
 
1.0%
1117057000 4
 
4.0%
1117051000 1
 
1.0%
1114064500 5
 
5.0%
1111067000 1
 
1.0%
1111061500 29
29.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2023-12-10T22:45:02.657386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:02.807858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로구
55 
성동구
32 
용산구
중구
 
5

Length

Max length3
Median length3
Mean length2.95
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
종로구 55
55.0%
성동구 32
32.0%
용산구 8
 
8.0%
중구 5
 
5.0%

Length

2023-12-10T22:45:03.150470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:03.499762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로구 55
55.0%
성동구 32
32.0%
용산구 8
 
8.0%
중구 5
 
5.0%

행정동명
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로1.2.3.4가동
29 
행당1동
19 
왕십리도선동
13 
평창동
10 
사직동
Other values (10)
21 

Length

Max length11
Median length6
Mean length6.01
Min length3

Unique

Unique5 ?
Unique (%)5.0%

Sample

1st row사직동
2nd row사직동
3rd row사직동
4th row사직동
5th row사직동

Common Values

ValueCountFrequency (%)
종로1.2.3.4가동 29
29.0%
행당1동 19
19.0%
왕십리도선동 13
13.0%
평창동 10
 
10.0%
사직동 8
 
8.0%
청구동 5
 
5.0%
원효로2동 4
 
4.0%
무악동 3
 
3.0%
부암동 2
 
2.0%
교남동 2
 
2.0%
Other values (5) 5
 
5.0%

Length

2023-12-10T22:45:03.704488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
종로1.2.3.4가동 29
29.0%
행당1동 19
19.0%
왕십리도선동 13
13.0%
평창동 10
 
10.0%
사직동 8
 
8.0%
청구동 5
 
5.0%
원효로2동 4
 
4.0%
무악동 3
 
3.0%
부암동 2
 
2.0%
교남동 2
 
2.0%
Other values (5) 5
 
5.0%

기준일자
Real number (ℝ)

Distinct54
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200390
Minimum20200302
Maximum20200528
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:45:04.045521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200302
5-th percentile20200302
Q120200308
median20200401
Q320200448
95-th percentile20200523
Maximum20200528
Range226
Interquartile range (IQR)139.25

Descriptive statistics

Standard deviation84.499823
Coefficient of variation (CV)4.1830789 × 10-6
Kurtosis-1.4279495
Mean20200390
Median Absolute Deviation (MAD)93
Skewness0.39551122
Sum2.020039 × 109
Variance7140.2201
MonotonicityNot monotonic
2023-12-10T22:45:04.290974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200302 6
 
6.0%
20200303 6
 
6.0%
20200304 5
 
5.0%
20200429 5
 
5.0%
20200427 4
 
4.0%
20200306 4
 
4.0%
20200309 4
 
4.0%
20200328 3
 
3.0%
20200307 3
 
3.0%
20200407 3
 
3.0%
Other values (44) 57
57.0%
ValueCountFrequency (%)
20200302 6
6.0%
20200303 6
6.0%
20200304 5
5.0%
20200305 1
 
1.0%
20200306 4
4.0%
20200307 3
3.0%
20200309 4
4.0%
20200310 3
3.0%
20200311 1
 
1.0%
20200312 2
 
2.0%
ValueCountFrequency (%)
20200528 1
1.0%
20200527 1
1.0%
20200526 1
1.0%
20200525 2
2.0%
20200523 1
1.0%
20200520 1
1.0%
20200518 1
1.0%
20200516 1
1.0%
20200514 1
1.0%
20200513 2
2.0%

성별
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
F
58 
M
42 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF
2nd rowF
3rd rowF
4th rowF
5th rowF

Common Values

ValueCountFrequency (%)
F 58
58.0%
M 42
42.0%

Length

2023-12-10T22:45:05.053402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:05.261053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
f 58
58.0%
m 42
42.0%

연령대
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
45
58 
40
21 
50
17 
20
 
3
35
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row45
2nd row45
3rd row45
4th row45
5th row45

Common Values

ValueCountFrequency (%)
45 58
58.0%
40 21
 
21.0%
50 17
 
17.0%
20 3
 
3.0%
35 1
 
1.0%

Length

2023-12-10T22:45:05.454015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:45:05.677518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
45 58
58.0%
40 21
 
21.0%
50 17
 
17.0%
20 3
 
3.0%
35 1
 
1.0%

소비인구(명)
Real number (ℝ)

Distinct6
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.452391
Minimum22.861429
Maximum68.584288
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:45:05.884716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22.861429
5-th percentile22.861429
Q122.861429
median22.861429
Q322.861429
95-th percentile30.481906
Maximum68.584288
Range45.722859
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6.5185913
Coefficient of variation (CV)0.25610919
Kurtosis22.323984
Mean25.452391
Median Absolute Deviation (MAD)0
Skewness4.2186203
Sum2545.2391
Variance42.492032
MonotonicityNot monotonic
2023-12-10T22:45:06.139430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
22.86142941 77
77.0%
30.48190588 19
 
19.0%
38.10238235 1
 
1.0%
53.34333529 1
 
1.0%
45.72285882 1
 
1.0%
68.58428823 1
 
1.0%
ValueCountFrequency (%)
22.86142941 77
77.0%
30.48190588 19
 
19.0%
38.10238235 1
 
1.0%
45.72285882 1
 
1.0%
53.34333529 1
 
1.0%
68.58428823 1
 
1.0%
ValueCountFrequency (%)
68.58428823 1
 
1.0%
53.34333529 1
 
1.0%
45.72285882 1
 
1.0%
38.10238235 1
 
1.0%
30.48190588 19
 
19.0%
22.86142941 77
77.0%

Interactions

2023-12-10T22:45:01.472636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:00.674816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:01.086451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:01.599833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:00.824733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:01.221730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:01.734516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:00.962180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:45:01.351634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:45:06.356223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드시군구명행정동명기준일자성별연령대소비인구(명)
행정동코드1.0001.0001.0000.4430.3430.3820.238
시군구명1.0001.0001.0000.4170.2520.3890.273
행정동명1.0001.0001.0000.6350.3250.6270.000
기준일자0.4430.4170.6351.0000.0000.2980.000
성별0.3430.2520.3250.0001.0000.2620.000
연령대0.3820.3890.6270.2980.2621.0000.000
소비인구(명)0.2380.2730.0000.0000.0000.0001.000
2023-12-10T22:45:06.576348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동명연령대시군구명성별
행정동명1.0000.2990.9410.274
연령대0.2991.0000.3240.314
시군구명0.9410.3241.0000.165
성별0.2740.3140.1651.000
2023-12-10T22:45:06.800068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드기준일자소비인구(명)시군구명행정동명성별연령대
행정동코드1.000-0.2870.3561.0000.9410.1650.324
기준일자-0.2871.000-0.1540.2740.3070.0000.190
소비인구(명)0.356-0.1541.0000.1750.0000.0000.000
시군구명1.0000.2740.1751.0000.9410.1650.324
행정동명0.9410.3070.0000.9411.0000.2740.299
성별0.1650.0000.0000.1650.2741.0000.314
연령대0.3240.1900.0000.3240.2990.3141.000

Missing values

2023-12-10T22:45:01.905658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:45:02.112816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01111053000서울특별시종로구사직동20200315F4522.861429
11111053000서울특별시종로구사직동20200418F4522.861429
21111053000서울특별시종로구사직동20200326F4522.861429
31111053000서울특별시종로구사직동20200502F4530.481906
41111053000서울특별시종로구사직동20200523F4522.861429
51111053000서울특별시종로구사직동20200503M5022.861429
61111053000서울특별시종로구사직동20200328F4538.102382
71111053000서울특별시종로구사직동20200423F4522.861429
81111055000서울특별시종로구부암동20200312F4022.861429
91111055000서울특별시종로구부암동20200508F4022.861429
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901120056000서울특별시성동구행당1동20200309F4022.861429
911120056000서울특별시성동구행당1동20200307M5030.481906
921120056000서울특별시성동구행당1동20200307M4530.481906
931120056000서울특별시성동구행당1동20200310F4568.584288
941120056000서울특별시성동구행당1동20200306F4522.861429
951120056000서울특별시성동구행당1동20200304M4530.481906
961120056000서울특별시성동구행당1동20200304F4530.481906
971120056000서울특별시성동구행당1동20200303M4522.861429
981120056000서울특별시성동구행당1동20200303F4530.481906
991120056000서울특별시성동구행당1동20200303F4022.861429