Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory45.3 B

Variable types

Text1
Numeric4

Dataset

Description샘플 데이터
Author서울시, 신한카드, KCB(코리아크레딧뷰로)
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=321

Alerts

영유아_보유_가구수(3인이상)(INDEX02_CNT) has 14 (14.0%) zerosZeros
영유아보유_가구비율(INDEX02_RT) has 18 (18.0%) zerosZeros
영유아지수(INDEX02) has 20 (20.0%) zerosZeros

Reproduction

Analysis started2023-12-10 15:01:33.128091
Analysis finished2023-12-10 15:01:38.821918
Duration5.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct93
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-11T00:01:39.323659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.68
Min length4

Characters and Unicode

Total characters568
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)86.0%

Sample

1st row2*3*5*
2nd row2*6*7*
3rd row2*5*5*
4th row1*1*3
5th row1*5*1
ValueCountFrequency (%)
2*3*5 2
 
2.0%
1*1*3 2
 
2.0%
1*8*8 2
 
2.0%
2*7*7 2
 
2.0%
2*2*9 2
 
2.0%
2*1*9 2
 
2.0%
2*2*5 2
 
2.0%
2*0*0 2
 
2.0%
4*1*7 1
 
1.0%
4*4 1
 
1.0%
Other values (82) 82
82.0%
2023-12-11T00:01:40.326367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 271
47.7%
2 57
 
10.0%
1 49
 
8.6%
3 32
 
5.6%
4 32
 
5.6%
5 28
 
4.9%
8 26
 
4.6%
7 20
 
3.5%
9 18
 
3.2%
0 18
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 297
52.3%
Other Punctuation 271
47.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 57
19.2%
1 49
16.5%
3 32
10.8%
4 32
10.8%
5 28
9.4%
8 26
8.8%
7 20
 
6.7%
9 18
 
6.1%
0 18
 
6.1%
6 17
 
5.7%
Other Punctuation
ValueCountFrequency (%)
* 271
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 568
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
* 271
47.7%
2 57
 
10.0%
1 49
 
8.6%
3 32
 
5.6%
4 32
 
5.6%
5 28
 
4.9%
8 26
 
4.6%
7 20
 
3.5%
9 18
 
3.2%
0 18
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 568
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 271
47.7%
2 57
 
10.0%
1 49
 
8.6%
3 32
 
5.6%
4 32
 
5.6%
5 28
 
4.9%
8 26
 
4.6%
7 20
 
3.5%
9 18
 
3.2%
0 18
 
3.2%
Distinct55
Distinct (%)55.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.78
Minimum1
Maximum154
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-11T00:01:40.653496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.95
Q110
median19
Q347.75
95-th percentile83.05
Maximum154
Range153
Interquartile range (IQR)37.75

Descriptive statistics

Standard deviation29.207575
Coefficient of variation (CV)0.94891406
Kurtosis2.7802095
Mean30.78
Median Absolute Deviation (MAD)12.5
Skewness1.5362861
Sum3078
Variance853.08242
MonotonicityNot monotonic
2023-12-11T00:01:41.016105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 7
 
7.0%
9 4
 
4.0%
1 4
 
4.0%
19 4
 
4.0%
14 4
 
4.0%
12 3
 
3.0%
15 3
 
3.0%
6 3
 
3.0%
5 3
 
3.0%
20 3
 
3.0%
Other values (45) 62
62.0%
ValueCountFrequency (%)
1 4
4.0%
2 1
 
1.0%
3 2
 
2.0%
4 2
 
2.0%
5 3
3.0%
6 3
3.0%
7 2
 
2.0%
8 2
 
2.0%
9 4
4.0%
10 7
7.0%
ValueCountFrequency (%)
154 1
1.0%
124 1
1.0%
100 1
1.0%
88 1
1.0%
84 1
1.0%
83 1
1.0%
80 1
1.0%
76 1
1.0%
73 1
1.0%
71 2
2.0%
Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.97
Minimum0
Maximum35
Zeros14
Zeros (%)14.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-11T00:01:41.320197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q37
95-th percentile15.1
Maximum35
Range35
Interquartile range (IQR)6

Descriptive statistics

Standard deviation6.1009272
Coefficient of variation (CV)1.2275508
Kurtosis6.682026
Mean4.97
Median Absolute Deviation (MAD)2
Skewness2.2788324
Sum497
Variance37.221313
MonotonicityNot monotonic
2023-12-11T00:01:41.603949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1 25
25.0%
0 14
14.0%
2 12
12.0%
4 7
 
7.0%
5 6
 
6.0%
7 6
 
6.0%
6 5
 
5.0%
9 4
 
4.0%
10 4
 
4.0%
3 3
 
3.0%
Other values (9) 14
14.0%
ValueCountFrequency (%)
0 14
14.0%
1 25
25.0%
2 12
12.0%
3 3
 
3.0%
4 7
 
7.0%
5 6
 
6.0%
6 5
 
5.0%
7 6
 
6.0%
8 2
 
2.0%
9 4
 
4.0%
ValueCountFrequency (%)
35 1
 
1.0%
24 2
2.0%
23 1
 
1.0%
17 1
 
1.0%
15 2
2.0%
14 1
 
1.0%
13 2
2.0%
12 2
2.0%
10 4
4.0%
9 4
4.0%
Distinct56
Distinct (%)56.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.181545
Minimum0
Maximum0.6667
Zeros18
Zeros (%)18.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-11T00:01:41.932115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.0817
median0.14905
Q30.25
95-th percentile0.5
Maximum0.6667
Range0.6667
Interquartile range (IQR)0.1683

Descriptive statistics

Standard deviation0.1493623
Coefficient of variation (CV)0.8227288
Kurtosis0.76532922
Mean0.181545
Median Absolute Deviation (MAD)0.0833
Skewness0.9851656
Sum18.1545
Variance0.022309097
MonotonicityNot monotonic
2023-12-11T00:01:42.223069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 18
 
18.0%
0.2 5
 
5.0%
0.25 5
 
5.0%
0.125 5
 
5.0%
0.1429 3
 
3.0%
0.5 3
 
3.0%
0.1364 3
 
3.0%
0.3333 3
 
3.0%
0.3077 2
 
2.0%
0.2222 2
 
2.0%
Other values (46) 51
51.0%
ValueCountFrequency (%)
0.0 18
18.0%
0.0455 1
 
1.0%
0.0556 2
 
2.0%
0.0667 1
 
1.0%
0.0741 1
 
1.0%
0.0769 2
 
2.0%
0.0833 1
 
1.0%
0.0889 1
 
1.0%
0.0909 2
 
2.0%
0.1 2
 
2.0%
ValueCountFrequency (%)
0.6667 1
 
1.0%
0.5429 1
 
1.0%
0.5385 1
 
1.0%
0.5294 1
 
1.0%
0.5 3
3.0%
0.4865 1
 
1.0%
0.45 1
 
1.0%
0.4286 1
 
1.0%
0.383 1
 
1.0%
0.3636 1
 
1.0%

영유아지수(INDEX02)
Real number (ℝ)

ZEROS 

Distinct10
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.32
Minimum0
Maximum9
Zeros20
Zeros (%)20.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-11T00:01:42.527321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median4
Q38
95-th percentile9
Maximum9
Range9
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.1682771
Coefficient of variation (CV)0.73339748
Kurtosis-1.3269251
Mean4.32
Median Absolute Deviation (MAD)3
Skewness0.048196993
Sum432
Variance10.03798
MonotonicityNot monotonic
2023-12-11T00:01:42.735824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0 20
20.0%
8 15
15.0%
4 14
14.0%
9 12
12.0%
3 10
10.0%
5 9
9.0%
6 6
 
6.0%
1 6
 
6.0%
2 5
 
5.0%
7 3
 
3.0%
ValueCountFrequency (%)
0 20
20.0%
1 6
 
6.0%
2 5
 
5.0%
3 10
10.0%
4 14
14.0%
5 9
9.0%
6 6
 
6.0%
7 3
 
3.0%
8 15
15.0%
9 12
12.0%
ValueCountFrequency (%)
9 12
12.0%
8 15
15.0%
7 3
 
3.0%
6 6
 
6.0%
5 9
9.0%
4 14
14.0%
3 10
10.0%
2 5
 
5.0%
1 6
 
6.0%
0 20
20.0%

Interactions

2023-12-11T00:01:36.674915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:33.649474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:34.594754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:35.660832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:36.919456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:33.936408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:34.863331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:35.932902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:37.152473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:34.174941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:35.143588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:36.203467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:37.487012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:34.424953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:35.436512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:36.451214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T00:01:42.887228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서울시_블록ID(BLK_CD)3인이상_가구수(GAGUSU_3O)영유아_보유_가구수(3인이상)(INDEX02_CNT)영유아보유_가구비율(INDEX02_RT)영유아지수(INDEX02)
서울시_블록ID(BLK_CD)1.0000.0000.7480.9580.658
3인이상_가구수(GAGUSU_3O)0.0001.0000.0000.0000.000
영유아_보유_가구수(3인이상)(INDEX02_CNT)0.7480.0001.0000.0000.392
영유아보유_가구비율(INDEX02_RT)0.9580.0000.0001.0000.000
영유아지수(INDEX02)0.6580.0000.3920.0001.000
2023-12-11T00:01:43.100318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
3인이상_가구수(GAGUSU_3O)영유아_보유_가구수(3인이상)(INDEX02_CNT)영유아보유_가구비율(INDEX02_RT)영유아지수(INDEX02)
3인이상_가구수(GAGUSU_3O)1.0000.1110.0690.137
영유아_보유_가구수(3인이상)(INDEX02_CNT)0.1111.000-0.071-0.008
영유아보유_가구비율(INDEX02_RT)0.069-0.0711.0000.071
영유아지수(INDEX02)0.137-0.0080.0711.000

Missing values

2023-12-11T00:01:37.753776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T00:01:38.710482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

서울시_블록ID(BLK_CD)3인이상_가구수(GAGUSU_3O)영유아_보유_가구수(3인이상)(INDEX02_CNT)영유아보유_가구비율(INDEX02_RT)영유아지수(INDEX02)
02*3*5*910.02
12*6*7*810.24
22*5*5*120.20
31*1*35110.20
41*5*12010.14813
52*7*7*4750.17896
61*8*81850.06670
71*0*68320.10
81*3*8*2210.05563
91*1*7*700.14293
서울시_블록ID(BLK_CD)3인이상_가구수(GAGUSU_3O)영유아_보유_가구수(3인이상)(INDEX02_CNT)영유아보유_가구비율(INDEX02_RT)영유아지수(INDEX02)
902*4*9*900.11764
912*5*6*510.00
924*1*0*47150.251
932*8*5510.01
941*2*0*1120.59
954*1*7*1670.19356
964*8*3*6410.12864
972*2*5*1240.24246
983*7*8*2330.00
992*4*7*10000.13468