Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory68.3 B

Variable types

Numeric2
Categorical6

Alerts

시도명 has constant value ""Constant
시군구명 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
행정동명 is highly overall correlated with 행정동코드 and 1 other fieldsHigh correlation
행정동코드 is highly overall correlated with 시군구명 and 1 other fieldsHigh correlation
성별 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 성별High correlation
성별 is highly imbalanced (53.6%)Imbalance
연령대 is highly imbalanced (63.4%)Imbalance

Reproduction

Analysis started2023-12-10 11:23:05.821883
Analysis finished2023-12-10 11:23:07.594522
Duration1.77 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1125285 × 109
Minimum1.1110515 × 109
Maximum1.114059 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:23:07.693836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1110515 × 109
5-th percentile1.111053 × 109
Q11.1110615 × 109
median1.1110615 × 109
Q31.114059 × 109
95-th percentile1.114059 × 109
Maximum1.114059 × 109
Range3007500
Interquartile range (IQR)2997500

Descriptive statistics

Standard deviation1507353.4
Coefficient of variation (CV)0.0013548897
Kurtosis-2.0395385
Mean1.1125285 × 109
Median Absolute Deviation (MAD)10000
Skewness0.040609921
Sum1.1125285 × 1011
Variance2.2721143 × 1012
MonotonicityIncreasing
2023-12-10T20:23:07.906403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1114059000 42
42.0%
1111061500 31
31.0%
1111053000 11
 
11.0%
1111055000 6
 
6.0%
1114057000 4
 
4.0%
1114055000 3
 
3.0%
1111051500 2
 
2.0%
1111056000 1
 
1.0%
ValueCountFrequency (%)
1111051500 2
 
2.0%
1111053000 11
 
11.0%
1111055000 6
 
6.0%
1111056000 1
 
1.0%
1111061500 31
31.0%
1114055000 3
 
3.0%
1114057000 4
 
4.0%
1114059000 42
42.0%
ValueCountFrequency (%)
1114059000 42
42.0%
1114057000 4
 
4.0%
1114055000 3
 
3.0%
1111061500 31
31.0%
1111056000 1
 
1.0%
1111055000 6
 
6.0%
1111053000 11
 
11.0%
1111051500 2
 
2.0%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울특별시
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 100
100.0%

Length

2023-12-10T20:23:08.127684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:23:08.650456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 100
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
종로구
51 
중구
49 

Length

Max length3
Median length3
Mean length2.51
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row종로구
3rd row종로구
4th row종로구
5th row종로구

Common Values

ValueCountFrequency (%)
종로구 51
51.0%
중구 49
49.0%

Length

2023-12-10T20:23:08.837864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:23:09.040529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종로구 51
51.0%
중구 49
49.0%

행정동명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
광희동
42 
종로1.2.3.4가동
31 
사직동
11 
부암동
필동
 
4
Other values (3)

Length

Max length11
Median length3
Mean length5.45
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row청운효자동
2nd row청운효자동
3rd row사직동
4th row사직동
5th row사직동

Common Values

ValueCountFrequency (%)
광희동 42
42.0%
종로1.2.3.4가동 31
31.0%
사직동 11
 
11.0%
부암동 6
 
6.0%
필동 4
 
4.0%
명동 3
 
3.0%
청운효자동 2
 
2.0%
평창동 1
 
1.0%

Length

2023-12-10T20:23:09.225968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:23:09.434682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광희동 42
42.0%
종로1.2.3.4가동 31
31.0%
사직동 11
 
11.0%
부암동 6
 
6.0%
필동 4
 
4.0%
명동 3
 
3.0%
청운효자동 2
 
2.0%
평창동 1
 
1.0%

기준일자
Real number (ℝ)

Distinct55
Distinct (%)55.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200415
Minimum20200302
Maximum20200529
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:23:09.712658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200302
5-th percentile20200304
Q120200324
median20200414
Q320200507
95-th percentile20200526
Maximum20200529
Range227
Interquartile range (IQR)183.25

Descriptive statistics

Standard deviation81.026635
Coefficient of variation (CV)4.0111371 × 10-6
Kurtosis-1.41782
Mean20200415
Median Absolute Deviation (MAD)91.5
Skewness0.0081026257
Sum2.0200415 × 109
Variance6565.3156
MonotonicityNot monotonic
2023-12-10T20:23:09.989655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200428 4
 
4.0%
20200406 3
 
3.0%
20200429 3
 
3.0%
20200514 3
 
3.0%
20200303 3
 
3.0%
20200403 3
 
3.0%
20200306 3
 
3.0%
20200507 3
 
3.0%
20200410 3
 
3.0%
20200323 3
 
3.0%
Other values (45) 69
69.0%
ValueCountFrequency (%)
20200302 1
 
1.0%
20200303 3
3.0%
20200304 2
2.0%
20200306 3
3.0%
20200309 1
 
1.0%
20200310 1
 
1.0%
20200311 1
 
1.0%
20200312 2
2.0%
20200316 3
3.0%
20200317 2
2.0%
ValueCountFrequency (%)
20200529 2
2.0%
20200528 1
1.0%
20200527 2
2.0%
20200526 2
2.0%
20200523 2
2.0%
20200522 1
1.0%
20200520 1
1.0%
20200519 1
1.0%
20200518 2
2.0%
20200515 2
2.0%

성별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
X
82 
M
17 
F
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st rowF
2nd rowM
3rd rowM
4th rowX
5th rowX

Common Values

ValueCountFrequency (%)
X 82
82.0%
M 17
 
17.0%
F 1
 
1.0%

Length

2023-12-10T20:23:10.334147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:23:10.503558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
x 82
82.0%
m 17
 
17.0%
f 1
 
1.0%

연령대
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
xx
82 
55
 
7
45
 
5
50
 
2
35
 
1
Other values (3)
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st row50
2nd row45
3rd row35
4th rowxx
5th rowxx

Common Values

ValueCountFrequency (%)
xx 82
82.0%
55 7
 
7.0%
45 5
 
5.0%
50 2
 
2.0%
35 1
 
1.0%
60 1
 
1.0%
25 1
 
1.0%
30 1
 
1.0%

Length

2023-12-10T20:23:10.678026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:23:10.886120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
xx 82
82.0%
55 7
 
7.0%
45 5
 
5.0%
50 2
 
2.0%
35 1
 
1.0%
60 1
 
1.0%
25 1
 
1.0%
30 1
 
1.0%
Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
22.86142941
53 
30.48190588
29 
38.10238235
10 
45.72285882
53.34333529
 
1

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row22.86142941
2nd row22.86142941
3rd row22.86142941
4th row22.86142941
5th row22.86142941

Common Values

ValueCountFrequency (%)
22.86142941 53
53.0%
30.48190588 29
29.0%
38.10238235 10
 
10.0%
45.72285882 7
 
7.0%
53.34333529 1
 
1.0%

Length

2023-12-10T20:23:11.111909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:23:11.306661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
22.86142941 53
53.0%
30.48190588 29
29.0%
38.10238235 10
 
10.0%
45.72285882 7
 
7.0%
53.34333529 1
 
1.0%

Interactions

2023-12-10T20:23:06.917002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:23:06.585325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:23:07.078554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:23:06.759636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:23:11.517481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드시군구명행정동명기준일자성별연령대소비인구(명)
행정동코드1.0000.9991.0000.0000.1480.3260.212
시군구명0.9991.0001.0000.0000.0570.3550.220
행정동명1.0001.0001.0000.0000.6190.6730.131
기준일자0.0000.0000.0001.0000.0000.2650.000
성별0.1480.0570.6190.0001.0000.8770.000
연령대0.3260.3550.6730.2650.8771.0000.000
소비인구(명)0.2120.2200.1310.0000.0000.0001.000
2023-12-10T20:23:11.708896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별소비인구(명)연령대시군구명행정동명
성별1.0000.0000.8240.0930.476
소비인구(명)0.0001.0000.0000.2640.074
연령대0.8240.0001.0000.2570.281
시군구명0.0930.2640.2571.0000.969
행정동명0.4760.0740.2810.9691.000
2023-12-10T20:23:11.903050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드기준일자시군구명행정동명성별연령대소비인구(명)
행정동코드1.000-0.1030.9800.9690.0930.2570.264
기준일자-0.1031.0000.0000.0000.0000.1480.000
시군구명0.9800.0001.0000.9690.0930.2570.264
행정동명0.9690.0000.9691.0000.4760.2810.074
성별0.0930.0000.0930.4761.0000.8240.000
연령대0.2570.1480.2570.2810.8241.0000.000
소비인구(명)0.2640.0000.2640.0740.0000.0001.000

Missing values

2023-12-10T20:23:07.273907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:23:07.502006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
01111051500서울특별시종로구청운효자동20200406F5022.861429
11111051500서울특별시종로구청운효자동20200425M4522.861429
21111053000서울특별시종로구사직동20200323M3522.861429
31111053000서울특별시종로구사직동20200330Xxx22.861429
41111053000서울특별시종로구사직동20200316Xxx22.861429
51111053000서울특별시종로구사직동20200406Xxx30.481906
61111053000서울특별시종로구사직동20200410Xxx22.861429
71111053000서울특별시종로구사직동20200421Xxx22.861429
81111053000서울특별시종로구사직동20200422Xxx22.861429
91111053000서울특별시종로구사직동20200331M4522.861429
행정동코드시도명시군구명행정동명기준일자성별연령대소비인구(명)
901114059000서울특별시중구광희동20200323Xxx22.861429
911114059000서울특별시중구광희동20200318Xxx22.861429
921114059000서울특별시중구광희동20200317Xxx22.861429
931114059000서울특별시중구광희동20200511Xxx45.722859
941114059000서울특별시중구광희동20200316Xxx45.722859
951114059000서울특별시중구광희동20200312Xxx30.481906
961114059000서울특별시중구광희동20200312M5530.481906
971114059000서울특별시중구광희동20200311Xxx30.481906
981114059000서울특별시중구광희동20200309Xxx45.722859
991114059000서울특별시중구광희동20200302Xxx22.861429