Overview

Dataset statistics

Number of variables3
Number of observations107
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory27.2 B

Variable types

Numeric2
Categorical1

Dataset

Description자료기준년도,조사항목명,평균값
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-22148/S/1/datasetView.do

Alerts

평균값 has unique valuesUnique

Reproduction

Analysis started2024-05-18 01:48:44.127798
Analysis finished2024-05-18 01:48:46.599480
Duration2.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자료기준년도
Real number (ℝ)

Distinct26
Distinct (%)24.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2009.6729
Minimum1997
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-05-18T10:48:46.810090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1997
5-th percentile1998
Q12003
median2010
Q32016
95-th percentile2021
Maximum2022
Range25
Interquartile range (IQR)13

Descriptive statistics

Standard deviation7.5370477
Coefficient of variation (CV)0.0037503853
Kurtosis-1.1828611
Mean2009.6729
Median Absolute Deviation (MAD)6
Skewness-0.024974086
Sum215035
Variance56.807089
MonotonicityDecreasing
2024-05-18T10:48:47.271881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
2022 5
 
4.7%
2013 5
 
4.7%
2012 5
 
4.7%
2007 4
 
3.7%
1997 4
 
3.7%
1998 4
 
3.7%
1999 4
 
3.7%
2000 4
 
3.7%
2001 4
 
3.7%
2002 4
 
3.7%
Other values (16) 64
59.8%
ValueCountFrequency (%)
1997 4
3.7%
1998 4
3.7%
1999 4
3.7%
2000 4
3.7%
2001 4
3.7%
2002 4
3.7%
2003 4
3.7%
2004 4
3.7%
2005 4
3.7%
2006 4
3.7%
ValueCountFrequency (%)
2022 5
4.7%
2021 4
3.7%
2020 4
3.7%
2019 4
3.7%
2018 4
3.7%
2017 4
3.7%
2016 4
3.7%
2015 4
3.7%
2014 4
3.7%
2013 5
4.7%

조사항목명
Categorical

Distinct5
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size988.0 B
구리
26 
26 
수은
26 
카드뮴
26 
비소

Length

Max length3
Median length2
Mean length2
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구리
2nd row
3rd row비소
4th row수은
5th row카드뮴

Common Values

ValueCountFrequency (%)
구리 26
24.3%
26
24.3%
수은 26
24.3%
카드뮴 26
24.3%
비소 3
 
2.8%

Length

2024-05-18T10:48:47.805368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T10:48:48.191824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구리 26
24.3%
26
24.3%
수은 26
24.3%
카드뮴 26
24.3%
비소 3
 
2.8%

평균값
Real number (ℝ)

UNIQUE 

Distinct107
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.621438
Minimum0.0016233766
Maximum153.69429
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-05-18T10:48:48.578045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.0016233766
5-th percentile0.04330346
Q10.12227414
median4.5088312
Q323.578719
95-th percentile71.588762
Maximum153.69429
Range153.69266
Interquartile range (IQR)23.456445

Descriptive statistics

Standard deviation26.697179
Coefficient of variation (CV)1.6061895
Kurtosis6.7854293
Mean16.621438
Median Absolute Deviation (MAD)4.4400645
Skewness2.3251576
Sum1778.4939
Variance712.73935
MonotonicityNot monotonic
2024-05-18T10:48:49.082498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
74.94448818897636 1
 
0.9%
4.3544939759036145 1
 
0.9%
8.331620689655171 1
 
0.9%
0.1357272727272727 1
 
0.9%
0.1008029197080291 1
 
0.9%
20.666532846715327 1
 
0.9%
21.906007299270073 1
 
0.9%
0.0820210526315789 1
 
0.9%
0.0016233766233766 1
 
0.9%
5.091526315789474 1
 
0.9%
Other values (97) 97
90.7%
ValueCountFrequency (%)
0.0016233766233766 1
0.9%
0.0242694805194805 1
0.9%
0.0288005390835579 1
0.9%
0.0372771084337349 1
0.9%
0.0401915584415584 1
0.9%
0.0420539083557951 1
0.9%
0.0462190812720848 1
0.9%
0.0495853658536585 1
0.9%
0.0525448275862068 1
0.9%
0.0567 1
0.9%
ValueCountFrequency (%)
153.6942857142857 1
0.9%
103.88603174603173 1
0.9%
87.2507874015748 1
0.9%
79.4711743772242 1
0.9%
79.16466942106871 1
0.9%
74.94448818897636 1
0.9%
63.75873493975904 1
0.9%
58.285582822085885 1
0.9%
55.59143835616438 1
0.9%
54.81506024096386 1
0.9%

Interactions

2024-05-18T10:48:45.274342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T10:48:44.382505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T10:48:45.660588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-18T10:48:44.895975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T10:48:49.371927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자료기준년도조사항목명평균값
자료기준년도1.0000.0000.282
조사항목명0.0001.0000.394
평균값0.2820.3941.000
2024-05-18T10:48:49.975762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자료기준년도평균값조사항목명
자료기준년도1.0000.2840.000
평균값0.2841.0000.250
조사항목명0.0000.2501.000

Missing values

2024-05-18T10:48:46.097425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T10:48:46.464110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자료기준년도조사항목명평균값
02022구리74.944488
1202287.250787
22022비소6.555709
32022수은0.116535
42022카드뮴2.519488
52021구리79.471174
6202152.048399
72021수은0.121815
82021카드뮴1.045943
92020구리58.285583
자료기준년도조사항목명평균값
971999수은0.077767
981999카드뮴0.1228
991998구리9.119289
100199811.385967
1011998수은0.0567
1021998카드뮴0.122733
1031997구리13.284111
104199712.108222
1051997수은0.117689
1061997카드뮴0.1106