Overview

Dataset statistics

Number of variables3
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.3 KiB
Average record size in memory27.3 B

Variable types

Numeric3

Dataset

Description샘플 데이터
Author신한카드
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=3

Reproduction

Analysis started2023-12-10 15:03:08.070506
Analysis finished2023-12-10 15:03:10.462361
Duration2.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct488
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1145158 × 1012
Minimum1.101056 × 1012
Maximum1.1250741 × 1012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:10.622972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.101056 × 1012
5-th percentile1.104057 × 1012
Q11.1080825 × 1012
median1.115068 × 1012
Q31.120073 × 1012
95-th percentile1.1240752 × 1012
Maximum1.1250741 × 1012
Range2.401803 × 1010
Interquartile range (IQR)1.1990493 × 1010

Descriptive statistics

Standard deviation6.7602244 × 109
Coefficient of variation (CV)0.0060656157
Kurtosis-1.1809936
Mean1.1145158 × 1012
Median Absolute Deviation (MAD)5.9989952 × 109
Skewness-0.084536385
Sum5.5725789 × 1014
Variance4.5700633 × 1019
MonotonicityNot monotonic
2023-12-11T00:03:10.997563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1110060010014 2
 
0.4%
1106090040405 2
 
0.4%
1124053010008 2
 
0.4%
1118051050001 2
 
0.4%
1105063041103 2
 
0.4%
1108077030005 2
 
0.4%
1116052020301 2
 
0.4%
1123078050005 2
 
0.4%
1118051040304 2
 
0.4%
1108071030005 2
 
0.4%
Other values (478) 480
96.0%
ValueCountFrequency (%)
1101056020013 1
0.2%
1101057010006 1
0.2%
1101058010005 1
0.2%
1101060020006 1
0.2%
1101069010203 1
0.2%
1102054070001 1
0.2%
1102059010201 1
0.2%
1102060020001 1
0.2%
1102065010001 1
0.2%
1102067010406 1
0.2%
ValueCountFrequency (%)
1125074050012 1
0.2%
1125074020014 1
0.2%
1125074010020 1
0.2%
1125073020007 1
0.2%
1125073010006 1
0.2%
1125071020007 1
0.2%
1125071010502 1
0.2%
1125070030004 1
0.2%
1125067020017 1
0.2%
1125066010018 1
0.2%
Distinct446
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1138065 × 1012
Minimum1.101053 × 1012
Maximum1.1250741 × 1012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:11.399813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.101053 × 1012
5-th percentile1.102054 × 1012
Q11.107069 × 1012
median1.1140661 × 1012
Q31.1210803 × 1012
95-th percentile1.1240713 × 1012
Maximum1.1250741 × 1012
Range2.402104 × 1010
Interquartile range (IQR)1.4011218 × 1010

Descriptive statistics

Standard deviation7.595127 × 109
Coefficient of variation (CV)0.0068190724
Kurtosis-1.3114696
Mean1.1138065 × 1012
Median Absolute Deviation (MAD)7.00249 × 109
Skewness-0.091550272
Sum5.5690324 × 1014
Variance5.7685954 × 1019
MonotonicityNot monotonic
2023-12-11T00:03:11.807044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1123067030002 4
 
0.8%
1122064010003 4
 
0.8%
1112058020013 3
 
0.6%
1102054010001 3
 
0.6%
1102059080001 3
 
0.6%
1123065010002 3
 
0.6%
1108068020001 3
 
0.6%
1114070010007 2
 
0.4%
1124058010005 2
 
0.4%
1101061040001 2
 
0.4%
Other values (436) 471
94.2%
ValueCountFrequency (%)
1101053010003 1
0.2%
1101053040001 1
0.2%
1101054010001 1
0.2%
1101054010003 1
0.2%
1101056010002 1
0.2%
1101057010009 1
0.2%
1101058030002 1
0.2%
1101060010001 1
0.2%
1101061020001 1
0.2%
1101061030002 2
0.4%
ValueCountFrequency (%)
1125074050019 1
0.2%
1125074050015 2
0.4%
1125074020013 1
0.2%
1125074020001 1
0.2%
1125073020006 1
0.2%
1125072010004 1
0.2%
1125071030001 2
0.4%
1125071020004 1
0.2%
1125070010005 1
0.2%
1125067020016 1
0.2%
Distinct6
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.186
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-11T00:03:12.097603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile2
Maximum7
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.55499553
Coefficient of variation (CV)0.46795576
Kurtosis31.89931
Mean1.186
Median Absolute Deviation (MAD)0
Skewness4.6375271
Sum593
Variance0.30802004
MonotonicityNot monotonic
2023-12-11T00:03:12.288282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 431
86.2%
2 53
 
10.6%
3 12
 
2.4%
4 2
 
0.4%
7 1
 
0.2%
5 1
 
0.2%
ValueCountFrequency (%)
1 431
86.2%
2 53
 
10.6%
3 12
 
2.4%
4 2
 
0.4%
5 1
 
0.2%
7 1
 
0.2%
ValueCountFrequency (%)
7 1
 
0.2%
5 1
 
0.2%
4 2
 
0.4%
3 12
 
2.4%
2 53
 
10.6%
1 431
86.2%

Interactions

2023-12-11T00:03:09.355084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:08.230708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:08.784438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:09.799030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:08.408236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:08.980298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:10.005232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:08.596083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:03:09.166745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T00:03:12.452915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거주지_집계구단위(residence)소비지_집계구단위(spend)장애인수(disabled_cnt)
거주지_집계구단위(residence)1.0000.0510.148
소비지_집계구단위(spend)0.0511.0000.000
장애인수(disabled_cnt)0.1480.0001.000
2023-12-11T00:03:12.649829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거주지_집계구단위(residence)소비지_집계구단위(spend)장애인수(disabled_cnt)
거주지_집계구단위(residence)1.000-0.019-0.032
소비지_집계구단위(spend)-0.0191.000-0.082
장애인수(disabled_cnt)-0.032-0.0821.000

Missing values

2023-12-11T00:03:10.232475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T00:03:10.399832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

거주지_집계구단위(residence)소비지_집계구단위(spend)장애인수(disabled_cnt)
0111607103000711060830200041
1111606702003611220600200021
2111607303000211010540100011
3111907401000311250630212011
4110506304110311100620200011
5112105802001611080680100161
6110205407000111180570200031
7111105101000411080760200071
8110305805000411190760200011
9112407902001111010610300021
거주지_집계구단위(residence)소비지_집계구단위(spend)장애인수(disabled_cnt)
490110807702000611030700300021
491112505204001011180570300221
492111607401001811240690200011
493112007204000511020540700011
494112106201001111170540100021
495112407502060311230630200041
496112308002000111150710200021
497111207201000911220640100031
498110609004020111050610100141
499110707304000211230710200071