Overview

Dataset statistics

Number of variables6
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory55.4 B

Variable types

Numeric3
Categorical2
Boolean1

Dataset

Description샘플 데이터
Author나이스지니데이타(주)
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=2

Alerts

정보미제공구간해당여부(PROVISION_YN) has constant value ""Constant
평균소득(만원)(INCOME) has unique valuesUnique

Reproduction

Analysis started2023-12-10 15:03:14.766268
Analysis finished2023-12-10 15:03:17.471230
Duration2.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정동코드(DONG_CD)
Real number (ℝ)

Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11413116
Minimum11110530
Maximum11740600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-11T00:03:17.606243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11110530
5-th percentile11110626
Q111219568
median11380632
Q311605584
95-th percentile11727055
Maximum11740600
Range630070
Interquartile range (IQR)386016.25

Descriptive statistics

Standard deviation220755.16
Coefficient of variation (CV)0.019342233
Kurtosis-1.4736725
Mean11413116
Median Absolute Deviation (MAD)195020
Skewness0.080068461
Sum3.4239348 × 108
Variance4.8732839 × 1010
MonotonicityNot monotonic
2023-12-11T00:03:17.916199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
11710550 2
 
6.7%
11110615 1
 
3.3%
11290610 1
 
3.3%
11530540 1
 
3.3%
11230750 1
 
3.3%
11410640 1
 
3.3%
11170580 1
 
3.3%
11710562 1
 
3.3%
11290600 1
 
3.3%
11530740 1
 
3.3%
Other values (19) 19
63.3%
ValueCountFrequency (%)
11110530 1
3.3%
11110615 1
3.3%
11110640 1
3.3%
11110680 1
3.3%
11170580 1
3.3%
11170630 1
3.3%
11170660 1
3.3%
11215840 1
3.3%
11230750 1
3.3%
11290555 1
3.3%
ValueCountFrequency (%)
11740600 1
3.3%
11740550 1
3.3%
11710562 1
3.3%
11710550 2
6.7%
11680600 1
3.3%
11620645 1
3.3%
11620545 1
3.3%
11560700 1
3.3%
11560660 1
3.3%
11545620 1
3.3%

구분(CATEGORY)
Categorical

Distinct8
Distinct (%)26.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
50대
60세이상
여성
30대
20대
Other values (3)

Length

Max length5
Median length3
Mean length3.0666667
Min length2

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row50대
2nd row여성
3rd row30대
4th row여성
5th row40대

Common Values

ValueCountFrequency (%)
50대 6
20.0%
60세이상 5
16.7%
여성 4
13.3%
30대 4
13.3%
20대 4
13.3%
40대 3
10.0%
전체 3
10.0%
남성 1
 
3.3%

Length

2023-12-11T00:03:18.211149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:03:18.994644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
50대 6
20.0%
60세이상 5
16.7%
여성 4
13.3%
30대 4
13.3%
20대 4
13.3%
40대 3
10.0%
전체 3
10.0%
남성 1
 
3.3%

평균소득(만원)(INCOME)
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4024.6333
Minimum2713
Maximum5411
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-11T00:03:19.281112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2713
5-th percentile2951.2
Q13583.75
median3973
Q34429
95-th percentile5285.8
Maximum5411
Range2698
Interquartile range (IQR)845.25

Descriptive statistics

Standard deviation719.38966
Coefficient of variation (CV)0.17874663
Kurtosis-0.46818621
Mean4024.6333
Median Absolute Deviation (MAD)457
Skewness0.28558956
Sum120739
Variance517521.48
MonotonicityNot monotonic
2023-12-11T00:03:19.603899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
4984 1
 
3.3%
3905 1
 
3.3%
5067 1
 
3.3%
3405 1
 
3.3%
5189 1
 
3.3%
3238 1
 
3.3%
4469 1
 
3.3%
3879 1
 
3.3%
3677 1
 
3.3%
4099 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
2713 1
3.3%
2944 1
3.3%
2960 1
3.3%
3175 1
3.3%
3238 1
3.3%
3405 1
3.3%
3423 1
3.3%
3555 1
3.3%
3670 1
3.3%
3677 1
3.3%
ValueCountFrequency (%)
5411 1
3.3%
5365 1
3.3%
5189 1
3.3%
5067 1
3.3%
4984 1
3.3%
4757 1
3.3%
4611 1
3.3%
4469 1
3.3%
4309 1
3.3%
4194 1
3.3%
Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
3천만원미만
5천만원미만
5천만원이상
4천만원미만
2천만원미만

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5천만원이상
2nd row5천만원이상
3rd row4천만원미만
4th row4천만원미만
5th row3천만원미만

Common Values

ValueCountFrequency (%)
3천만원미만 7
23.3%
5천만원미만 7
23.3%
5천만원이상 6
20.0%
4천만원미만 5
16.7%
2천만원미만 5
16.7%

Length

2023-12-11T00:03:19.923907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T00:03:20.262323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3천만원미만 7
23.3%
5천만원미만 7
23.3%
5천만원이상 6
20.0%
4천만원미만 5
16.7%
2천만원미만 5
16.7%
Distinct22
Distinct (%)73.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.4
Minimum4
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-11T00:03:20.513639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile7.35
Q116
median21
Q327
95-th percentile38.75
Maximum49
Range45
Interquartile range (IQR)11

Descriptive statistics

Standard deviation10.254015
Coefficient of variation (CV)0.45776854
Kurtosis0.41638895
Mean22.4
Median Absolute Deviation (MAD)5.5
Skewness0.59642849
Sum672
Variance105.14483
MonotonicityNot monotonic
2023-12-11T00:03:20.870825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
16 3
 
10.0%
19 2
 
6.7%
23 2
 
6.7%
17 2
 
6.7%
27 2
 
6.7%
21 2
 
6.7%
36 2
 
6.7%
15 1
 
3.3%
35 1
 
3.3%
41 1
 
3.3%
Other values (12) 12
40.0%
ValueCountFrequency (%)
4 1
 
3.3%
6 1
 
3.3%
9 1
 
3.3%
12 1
 
3.3%
13 1
 
3.3%
15 1
 
3.3%
16 3
10.0%
17 2
6.7%
19 2
6.7%
20 1
 
3.3%
ValueCountFrequency (%)
49 1
3.3%
41 1
3.3%
36 2
6.7%
35 1
3.3%
33 1
3.3%
29 1
3.3%
27 2
6.7%
26 1
3.3%
24 1
3.3%
23 2
6.7%
Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
30 
ValueCountFrequency (%)
False 30
100.0%
2023-12-11T00:03:21.076385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-11T00:03:16.270666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:15.137589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:15.714477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:16.454827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:15.344383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:15.926046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:16.703338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:15.526801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:16.104249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T00:03:21.209804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드(DONG_CD)구분(CATEGORY)평균소득(만원)(INCOME)소득금액구간(INCOME_INTERVAL)금액구간별인원비중(INTERVAL_RATIO)
행정동코드(DONG_CD)1.0000.3460.6080.4730.000
구분(CATEGORY)0.3461.0000.3110.2860.602
평균소득(만원)(INCOME)0.6080.3111.0000.6400.000
소득금액구간(INCOME_INTERVAL)0.4730.2860.6401.0000.000
금액구간별인원비중(INTERVAL_RATIO)0.0000.6020.0000.0001.000
2023-12-11T00:03:21.425362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소득금액구간(INCOME_INTERVAL)구분(CATEGORY)
소득금액구간(INCOME_INTERVAL)1.0000.139
구분(CATEGORY)0.1391.000
2023-12-11T00:03:21.602931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동코드(DONG_CD)평균소득(만원)(INCOME)금액구간별인원비중(INTERVAL_RATIO)구분(CATEGORY)소득금액구간(INCOME_INTERVAL)
행정동코드(DONG_CD)1.000-0.2180.0680.1470.063
평균소득(만원)(INCOME)-0.2181.0000.1340.2120.293
금액구간별인원비중(INTERVAL_RATIO)0.0680.1341.0000.2670.174
구분(CATEGORY)0.1470.2120.2671.0000.139
소득금액구간(INCOME_INTERVAL)0.0630.2930.1740.1391.000

Missing values

2023-12-11T00:03:16.909307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T00:03:17.301445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정동코드(DONG_CD)구분(CATEGORY)평균소득(만원)(INCOME)소득금액구간(INCOME_INTERVAL)금액구간별인원비중(INTERVAL_RATIO)정보미제공구간해당여부(PROVISION_YN)
01111061550대49845천만원이상15N
111710550여성36705천만원이상6N
21117066030대38704천만원미만13N
311560700여성34234천만원미만19N
41147061040대41943천만원미만23N
51135062530대35552천만원미만17N
61174060050대31755천만원이상27N
71171055050대47575천만원미만29N
81130553420대38345천만원미만21N
91154562060세이상40882천만원미만4N
행정동코드(DONG_CD)구분(CATEGORY)평균소득(만원)(INCOME)소득금액구간(INCOME_INTERVAL)금액구간별인원비중(INTERVAL_RATIO)정보미제공구간해당여부(PROVISION_YN)
201168060060세이상37884천만원미만36N
211174055050대53655천만원이상24N
221153074060세이상40993천만원미만21N
2311290600여성36772천만원미만17N
241171056240대38794천만원미만12N
251117058020대44695천만원미만19N
261141064040대32383천만원미만23N
271123075020대51895천만원이상16N
281153054060세이상34055천만원이상26N
291129061050대50672천만원미만41N