Overview

Dataset statistics

Number of variables: 3
Number of observations: 111
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.9 KiB
Average record size in memory: 27.2 B

Variable types

Numeric: 2
Categorical: 1

Dataset

Description: 조사년도 (survey year), 항목이름 (item name), 조사값 (measured value)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15602/S/1/datasetView.do

Alerts

Zeros: 조사값 has 47 (42.3%) zeros
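
The percentage in this alert is the share of observations in 조사값 that are exactly zero. A minimal sketch of that arithmetic, using only the counts stated in this report (the variable names are illustrative, not part of the report):

```python
# Counts taken from this report: 47 zeros among 111 observations.
n_zeros = 47
n_observations = 111

zero_share = n_zeros / n_observations
print(f"{zero_share:.1%}")  # 42.3%
```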

Reproduction

Analysis started: 2024-03-13 12:45:36.046747
Analysis finished: 2024-03-13 12:45:36.901626
Duration: 0.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
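
The report does not say how it was invoked, so the following is only a sketch of how a comparable report might be generated with ydata-profiling. The helper name `build_report`, the CSV path, the output path, the report title, and the `utf-8` encoding are all assumptions; the settings in `config.json` are not reproduced here.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html",
                 encoding: str = "utf-8") -> Path:
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Third-party imports stay inside the function so this module can be
    # imported even where pandas or ydata-profiling is not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the dataset was downloaded as a CSV file with a header row.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default settings; the original run may have used a different configuration.
    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("data.csv")` with a local copy of the dataset would write `report.html` to the working directory; the file name `data.csv` is a placeholder.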

Variables

조사년도 (survey year)
Real number (ℝ)

Distinct: 27
Distinct (%): 24.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2010.1532
Minimum: 1997
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1997
5-th percentile: 1998
Q1: 2003.5
Median: 2010
Q3: 2017
95-th percentile: 2022
Maximum: 2023
Range: 26
Interquartile range (IQR): 13.5
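
Range and IQR follow directly from the quantiles listed for 조사년도. A short sketch of the arithmetic, with illustrative variable names and the values copied from this report:

```python
# Quantiles of 조사년도 as reported in this section.
minimum, q1, q3, maximum = 1997, 2003.5, 2017, 2023

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)

print(value_range, iqr)  # 26 13.5
```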

Descriptive statistics

Standard deviation: 7.8081521
Coefficient of variation (CV): 0.0038843568
Kurtosis: -1.1855168
Mean: 2010.1532
Median Absolute Deviation (MAD): 7
Skewness: -0.025645447
Sum: 223127
Variance: 60.96724
Monotonicity: Decreasing
Histogram with fixed size bins (bins=27)
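Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 2013 | 5 | 4.5% |
| 2012 | 5 | 4.5% |
| 2022 | 5 | 4.5% |
| 2023 | 4 | 3.6% |
| 2007 | 4 | 3.6% |
| 1997 | 4 | 3.6% |
| 1998 | 4 | 3.6% |
| 1999 | 4 | 3.6% |
| 2000 | 4 | 3.6% |
| 2001 | 4 | 3.6% |
| Other values (17) | 68 | 61.3% |
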
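Minimum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 1997 | 4 | 3.6% |
| 1998 | 4 | 3.6% |
| 1999 | 4 | 3.6% |
| 2000 | 4 | 3.6% |
| 2001 | 4 | 3.6% |
| 2002 | 4 | 3.6% |
| 2003 | 4 | 3.6% |
| 2004 | 4 | 3.6% |
| 2005 | 4 | 3.6% |
| 2006 | 4 | 3.6% |
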
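Maximum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 2023 | 4 | 3.6% |
| 2022 | 5 | 4.5% |
| 2021 | 4 | 3.6% |
| 2020 | 4 | 3.6% |
| 2019 | 4 | 3.6% |
| 2018 | 4 | 3.6% |
| 2017 | 4 | 3.6% |
| 2016 | 4 | 3.6% |
| 2015 | 4 | 3.6% |
| 2014 | 4 | 3.6% |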

항목이름 (item name)
Categorical

Distinct: 5
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1020.0 B
[Figure: bar chart of category frequencies; the counts are listed under Common Values]

Length

Max length: 3
Median length: 2
Mean length: 2
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%
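
Distinct and Unique measure different things here: Distinct is the number of different values, while Unique is understood to be the number of values that occur exactly once. A small sketch using the category counts from the Common Values table reproduces both figures. The dictionary key `"<label missing>"` is a placeholder for the category whose label did not survive in this text.

```python
from collections import Counter

# Category counts from the Common Values table of 항목이름.
# "<label missing>" stands in for the category whose name is not
# recoverable from this text.
counts = Counter({
    "구리": 27,
    "<label missing>": 27,
    "수은": 27,
    "카드뮴": 27,
    "비소": 3,
})

n_total = sum(counts.values())
n_distinct = len(counts)                                # different values
n_unique = sum(1 for c in counts.values() if c == 1)    # values seen once

print(n_distinct, f"{n_distinct / n_total:.1%}")  # 5 4.5%
print(n_unique, f"{n_unique / n_total:.1%}")      # 0 0.0%
```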

Sample

1st row: 구리 (copper)
2nd row: (label missing)
3rd row: 수은 (mercury)
4th row: 카드뮴 (cadmium)
5th row: 구리 (copper)

Common Values

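| Value | Count | Frequency (%) |
|---|---|---|
| 구리 (copper) | 27 | 24.3% |
| (label missing) | 27 | 24.3% |
| 수은 (mercury) | 27 | 24.3% |
| 카드뮴 (cadmium) | 27 | 24.3% |
| 비소 (arsenic) | 3 | 2.7% |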

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the category frequencies listed under Common Values]

조사값 (measured value)
Real number (ℝ)

Alert: Zeros

Distinct: 38
Distinct (%): 34.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.666667
Minimum: 0
Maximum: 154
Zeros: 47
Zeros (%): 42.3%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 5
Q3: 26
95-th percentile: 69.5
Maximum: 154
Range: 154
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 26.460033
Coefficient of variation (CV): 1.587602
Kurtosis: 6.8035986
Mean: 16.666667
Median Absolute Deviation (MAD): 5
Skewness: 2.3043312
Sum: 1850
Variance: 700.13333
Monotonicity: Not monotonic
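
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A sketch that checks the reported 조사값 numbers against each other, with illustrative variable names:

```python
import math

# Values copied from this report for 조사값.
n = 111                # Number of observations
total = 1850           # Sum
std = 26.460033        # Standard deviation
variance = 700.13333   # Variance

mean = total / n       # Mean
cv = std / mean        # Coefficient of variation (CV)

# The variance should equal the squared standard deviation up to rounding.
variance_matches = math.isclose(std ** 2, variance, rel_tol=1e-6)

print(round(mean, 6), round(cv, 6), variance_matches)
```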
Histogram with fixed size bins (bins=38)
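Common values

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 47 | 42.3% |
| 1 | 5 | 4.5% |
| 9 | 4 | 3.6% |
| 37 | 4 | 3.6% |
| 5 | 4 | 3.6% |
| 8 | 3 | 2.7% |
| 13 | 3 | 2.7% |
| 10 | 2 | 1.8% |
| 6 | 2 | 1.8% |
| 43 | 2 | 1.8% |
| Other values (28) | 35 | 31.5% |
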
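Minimum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 47 | 42.3% |
| 1 | 5 | 4.5% |
| 3 | 2 | 1.8% |
| 4 | 1 | 0.9% |
| 5 | 4 | 3.6% |
| 6 | 2 | 1.8% |
| 7 | 2 | 1.8% |
| 8 | 3 | 2.7% |
| 9 | 4 | 3.6% |
| 10 | 2 | 1.8% |
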
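Maximum 10 values

| Value | Count | Frequency (%) |
|---|---|---|
| 154 | 1 | 0.9% |
| 104 | 1 | 0.9% |
| 87 | 1 | 0.9% |
| 79 | 2 | 1.8% |
| 75 | 1 | 0.9% |
| 64 | 1 | 0.9% |
| 58 | 1 | 0.9% |
| 56 | 1 | 0.9% |
| 55 | 1 | 0.9% |
| 52 | 1 | 0.9% |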

Interactions

[Figure: interaction plots (4 panels)]

Correlations

| | 조사년도 | 항목이름 | 조사값 |
|---|---|---|---|
| 조사년도 | 1.000 | 0.000 | 0.262 |
| 항목이름 | 0.000 | 1.000 | 0.408 |
| 조사값 | 0.262 | 0.408 | 1.000 |

| | 조사년도 | 조사값 | 항목이름 |
|---|---|---|---|
| 조사년도 | 1.000 | 0.228 | 0.000 |
| 조사값 | 0.228 | 1.000 | 0.260 |
| 항목이름 | 0.000 | 0.260 | 1.000 |

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completeness.

Sample

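First rows

| | 조사년도 | 항목이름 | 조사값 |
|---|---|---|---|
| 0 | 2023 | 구리 | 41 |
| 1 | 2023 | (label missing) | 31 |
| 2 | 2023 | 수은 | 0 |
| 3 | 2023 | 카드뮴 | 3 |
| 4 | 2022 | 구리 | 75 |
| 5 | 2022 | (label missing) | 87 |
| 6 | 2022 | 비소 | 7 |
| 7 | 2022 | 수은 | 0 |
| 8 | 2022 | 카드뮴 | 3 |
| 9 | 2021 | 구리 | 79 |
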
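Last rows

| | 조사년도 | 항목이름 | 조사값 |
|---|---|---|---|
| 101 | 1999 | 수은 | 0 |
| 102 | 1999 | 카드뮴 | 0 |
| 103 | 1998 | 구리 | 9 |
| 104 | 1998 | (label missing) | 11 |
| 105 | 1998 | 수은 | 0 |
| 106 | 1998 | 카드뮴 | 0 |
| 107 | 1997 | 구리 | 13 |
| 108 | 1997 | (label missing) | 12 |
| 109 | 1997 | 수은 | 0 |
| 110 | 1997 | 카드뮴 | 0 |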
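
The sample rows are consistent with the "Decreasing" monotonicity reported for 조사년도: the rows run from the newest year to the oldest, with several rows per year. The sketch below checks this on the first ten sample rows only, so it illustrates the property rather than verifying it for all 111 observations. `None` marks the item label that is missing in this text, and the variable names are illustrative.

```python
# First ten sample rows: (index, 조사년도, 항목이름, 조사값).
# None marks the item label that is not recoverable from this text.
head = [
    (0, 2023, "구리", 41),
    (1, 2023, None, 31),
    (2, 2023, "수은", 0),
    (3, 2023, "카드뮴", 3),
    (4, 2022, "구리", 75),
    (5, 2022, None, 87),
    (6, 2022, "비소", 7),
    (7, 2022, "수은", 0),
    (8, 2022, "카드뮴", 3),
    (9, 2021, "구리", 79),
]

years = [row[1] for row in head]
pairs = list(zip(years, years[1:]))

# Each year is no later than the one before it ...
is_non_increasing = all(a >= b for a, b in pairs)
# ... but repeated years mean the sequence is not strictly decreasing.
is_strictly_decreasing = all(a > b for a, b in pairs)

print(is_non_increasing, is_strictly_decreasing)  # True False
```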