Overview

Dataset statistics

Number of variables: 5
Number of observations: 122
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.4 KiB
Average record size in memory: 45.0 B

Variable types

Numeric: 3
Categorical: 2

Dataset

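Description: 영치년도, 영치월, 영치기관, 영치기관코드, 영치건수 (impoundment year, impoundment month, impounding agency, impounding agency code, number of impoundments)
Author: 강서구 (Gangseo-gu)
URL: https://data.seoul.go.kr/dataList/OA-20269/S/1/datasetView.do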

Alerts

영치기관 has constant value "강서구" (Constant)
영치기관코드 has constant value "11500" (Constant)
영치년도 is highly overall correlated with 영치건수 (High correlation)
영치건수 is highly overall correlated with 영치년도 (High correlation)
영치건수 has 18 (14.8%) zeros (Zeros)
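
The Constant and Zeros alerts come down to simple per-column checks. The sketch below is an illustration only, not ydata-profiling's own implementation: it applies two hypothetical helpers, `is_constant` and `zero_share`, to the last ten rows listed in the Sample section of this report, then recomputes the full-dataset zero percentage from the figures quoted in the alert.

```python
# Last ten rows from the report's Sample section (all from 2014):
# (영치년도, 영치월, 영치기관, 영치기관코드, 영치건수)
rows = [
    (2014, 10, "강서구", "11500", 2),
    (2014, 9, "강서구", "11500", 0),
    (2014, 8, "강서구", "11500", 15),
    (2014, 7, "강서구", "11500", 27),
    (2014, 6, "강서구", "11500", 3),
    (2014, 5, "강서구", "11500", 0),
    (2014, 4, "강서구", "11500", 0),
    (2014, 3, "강서구", "11500", 0),
    (2014, 2, "강서구", "11500", 0),
    (2014, 1, "강서구", "11500", 0),
]


def is_constant(values):
    """Hypothetical helper: True when every value in the column is identical."""
    return len(set(values)) == 1


def zero_share(values):
    """Hypothetical helper: fraction of entries equal to zero."""
    return sum(1 for v in values if v == 0) / len(values)


agency = [r[2] for r in rows]
agency_code = [r[3] for r in rows]
counts = [r[4] for r in rows]

# Both categorical columns hold a single value in this excerpt.
print(is_constant(agency), is_constant(agency_code))
print(f"{zero_share(counts):.1%} zeros in this 10-row excerpt")

# Full-dataset figure quoted in the alert: 18 zeros out of 122 rows.
print(f"{18 / 122:.1%}")
```

The excerpt covers only 2014, so its zero share is much higher than the 14.8% reported for the whole dataset.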

Reproduction

Analysis started: 2024-04-13 12:41:27.669671
Analysis finished: 2024-04-13 12:41:31.764276
Duration: 4.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
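
The report does not show the code that produced it. As a rough sketch, the snippet below first recomputes the duration from the two timestamps above, then outlines how a comparable report might be generated with ydata-profiling. The helper `build_report`, the title, and the output path are assumptions for illustration, and the input DataFrame would have to be loaded from the dataset URL separately.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-04-13 12:41:27.669671", FMT)
finished = datetime.strptime("2024-04-13 12:41:31.764276", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")


def build_report(df, out_path="report.html"):
    """Hypothetical helper: profile a pandas DataFrame and write an HTML report."""
    # Imported lazily so the timing check above runs without the package installed.
    from ydata_profiling import ProfileReport

    report = ProfileReport(df, title="Dataset profile")  # title is an assumption
    report.to_file(out_path)
    return report
```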

Variables

영치년도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 11
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2018.5902
Minimum: 2014
Maximum: 2024
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 2014
5-th percentile: 2014
Q1: 2016
Median: 2019
Q3: 2021
95-th percentile: 2023
Maximum: 2024
Range: 10
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.9451013
Coefficient of variation (CV): 0.0014589892
Kurtosis: -1.1982534
Mean: 2018.5902
Median Absolute Deviation (MAD): 3
Skewness: 0.016043883
Sum: 246268
Variance: 8.6736215
Monotonicity: Decreasing
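
The sum, mean, variance, and coefficient of variation above can be reproduced from the frequency table for this variable (12 rows for each year from 2014 to 2023, 2 rows for 2024). The sketch below uses only the standard library and assumes the sample (n - 1) variance, which is the convention that matches the reported figures.

```python
import statistics

# Row counts per year, taken from the frequency table for 영치년도.
year_counts = {year: 12 for year in range(2014, 2024)}
year_counts[2024] = 2

# Expand the frequency table back into one value per observation.
years = [year for year, n in year_counts.items() for _ in range(n)]

total = sum(years)
mean = statistics.mean(years)
variance = statistics.variance(years)  # sample variance (n - 1 denominator)
std = statistics.stdev(years)
cv = std / mean  # coefficient of variation

print(len(years), total)
print(round(mean, 4), round(variance, 4), round(std, 4))
print(cv)
```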
Histogram with fixed size bins (bins=11)

Common values

Value  Count  Frequency (%)
2023   12     9.8%
2022   12     9.8%
2021   12     9.8%
2020   12     9.8%
2019   12     9.8%
2018   12     9.8%
2017   12     9.8%
2016   12     9.8%
2015   12     9.8%
2014   12     9.8%

Minimum 10 values

Value  Count  Frequency (%)
2014   12     9.8%
2015   12     9.8%
2016   12     9.8%
2017   12     9.8%
2018   12     9.8%
2019   12     9.8%
2020   12     9.8%
2021   12     9.8%
2022   12     9.8%
2023   12     9.8%

Maximum 10 values

Value  Count  Frequency (%)
2024   2      1.6%
2023   12     9.8%
2022   12     9.8%
2021   12     9.8%
2020   12     9.8%
2019   12     9.8%
2018   12     9.8%
2017   12     9.8%
2016   12     9.8%
2015   12     9.8%

영치월
Real number (ℝ)

Distinct: 12
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.4180328
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 9
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.4969648
Coefficient of variation (CV): 0.54486553
Kurtosis: -1.23558
Mean: 6.4180328
Median Absolute Deviation (MAD): 3
Skewness: 0.020910043
Sum: 783
Variance: 12.228763
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)

Common values

Value             Count  Frequency (%)
2                 11     9.0%
1                 11     9.0%
12                10     8.2%
11                10     8.2%
10                10     8.2%
9                 10     8.2%
8                 10     8.2%
7                 10     8.2%
6                 10     8.2%
5                 10     8.2%
Other values (2)  20     16.4%

Minimum 10 values

Value  Count  Frequency (%)
1      11     9.0%
2      11     9.0%
3      10     8.2%
4      10     8.2%
5      10     8.2%
6      10     8.2%
7      10     8.2%
8      10     8.2%
9      10     8.2%
10     10     8.2%

Maximum 10 values

Value  Count  Frequency (%)
12     10     8.2%
11     10     8.2%
10     10     8.2%
9      10     8.2%
8      10     8.2%
7      10     8.2%
6      10     8.2%
5      10     8.2%
4      10     8.2%
3      10     8.2%
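
The percentages in the 영치월 value tables are each count divided by the 122 observations, rounded to one decimal place. A minimal sketch of that arithmetic follows; the helper name `pct` is hypothetical.

```python
# Observation counts per month, from the value tables for 영치월:
# months 1 and 2 occur 11 times, months 3 to 12 occur 10 times each.
month_counts = {1: 11, 2: 11, **{m: 10 for m in range(3, 13)}}

n = sum(month_counts.values())  # total number of observations
month_sum = sum(m * c for m, c in month_counts.items())  # the "Sum" statistic


def pct(count, total):
    """Hypothetical helper: share of the total as a one-decimal percentage."""
    return f"{100 * count / total:.1f}%"


for month, count in month_counts.items():
    print(month, count, pct(count, n))

# The "Other values (2)" row groups two months with 10 rows each.
print("Other values (2)", 20, pct(20, n))
```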

영치기관
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
강서구: 122

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강서구
2nd row: 강서구
3rd row: 강서구
4th row: 강서구
5th row: 강서구

Common Values

Value   Count  Frequency (%)
강서구  122    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count  Frequency (%)
강서구  122    100.0%

영치기관코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
11500: 122

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 11500
2nd row: 11500
3rd row: 11500
4th row: 11500
5th row: 11500

Common Values

Value  Count  Frequency (%)
11500  122    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
11500  122    100.0%

영치건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 72
Distinct (%): 59.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 51.45082
Minimum: 0
Maximum: 226
Zeros: 18
Zeros (%): 14.8%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 4
Median: 40.5
Q3: 79.5
95-th percentile: 140.85
Maximum: 226
Range: 226
Interquartile range (IQR): 75.5

Descriptive statistics

Standard deviation: 52.381642
Coefficient of variation (CV): 1.0180915
Kurtosis: 1.2756476
Mean: 51.45082
Median Absolute Deviation (MAD): 37
Skewness: 1.2094416
Sum: 6277
Variance: 2743.8364
Monotonicity: Not monotonic
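
Several of the 영치건수 statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, the coefficient of variation is the standard deviation divided by the mean, the IQR is Q3 minus Q1, and the range is the maximum minus the minimum. The sketch below checks the reported figures against those identities. The tolerance is an assumption chosen to allow for the rounding in the report.

```python
import math

# Figures copied from the 영치건수 section of this report.
n = 122
total = 6277
mean = 51.45082
std = 52.381642
variance = 2743.8364
cv = 1.0180915
q1, q3, iqr = 4, 79.5, 75.5
minimum, maximum, value_range = 0, 226, 226

TOL = 1e-6  # relative tolerance for values the report has rounded

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=TOL),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=TOL),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=TOL),
    "iqr = q3 - q1": q3 - q1 == iqr,
    "range = max - min": maximum - minimum == value_range,
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

A CV just above 1 means the standard deviation slightly exceeds the mean, which fits the right-skewed distribution with many zeros that the report describes.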
Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
0                  18     14.8%
1                  5      4.1%
4                  4      3.3%
3                  3      2.5%
128                3      2.5%
43                 3      2.5%
87                 2      1.6%
50                 2      1.6%
38                 2      1.6%
14                 2      1.6%
Other values (62)  78     63.9%

Minimum 10 values

Value  Count  Frequency (%)
0      18     14.8%
1      5      4.1%
2      2      1.6%
3      3      2.5%
4      4      3.3%
5      2      1.6%
6      2      1.6%
7      1      0.8%
10     2      1.6%
11     1      0.8%

Maximum 10 values

Value  Count  Frequency (%)
226    1      0.8%
219    1      0.8%
212    1      0.8%
195    1      0.8%
160    1      0.8%
159    1      0.8%
141    1      0.8%
138    1      0.8%
135    1      0.8%
128    3      2.5%

Interactions


Correlations

          영치년도  영치월  영치건수
영치년도  1.000     0.000   0.780
영치월    0.000     1.000   0.000
영치건수  0.780     0.000   1.000

          영치년도  영치월  영치건수
영치년도  1.000     -0.041  0.705
영치월    -0.041    1.000   -0.049
영치건수  0.705     -0.049  1.000
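
The report does not state which coefficient each matrix uses. Purely as an illustration of how a pairwise correlation between 영치년도 and 영치건수 can be computed, the sketch below implements the Pearson coefficient (an assumed choice) and applies it to the 20 rows listed in the Sample section. Because those rows are only the first and last ten of 122, the result is not expected to equal either value in the matrices above.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# (영치년도, 영치건수) pairs for the 20 rows listed in the Sample section.
pairs = [
    (2024, 62), (2024, 195), (2023, 10), (2023, 34), (2023, 51),
    (2023, 49), (2023, 39), (2023, 35), (2023, 138), (2023, 63),
    (2014, 2), (2014, 0), (2014, 15), (2014, 27), (2014, 3),
    (2014, 0), (2014, 0), (2014, 0), (2014, 0), (2014, 0),
]
years = [year for year, _ in pairs]
counts = [count for _, count in pairs]

r = pearson(years, counts)
print(f"Pearson r on the 20 sample rows: {r:.3f}")
```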

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     영치년도  영치월  영치기관  영치기관코드  영치건수
0    2024      2       강서구    11500         62
1    2024      1       강서구    11500         195
2    2023      12      강서구    11500         10
3    2023      11      강서구    11500         34
4    2023      10      강서구    11500         51
5    2023      9       강서구    11500         49
6    2023      8       강서구    11500         39
7    2023      7       강서구    11500         35
8    2023      6       강서구    11500         138
9    2023      5       강서구    11500         63

Last rows

     영치년도  영치월  영치기관  영치기관코드  영치건수
112  2014      10      강서구    11500         2
113  2014      9       강서구    11500         0
114  2014      8       강서구    11500         15
115  2014      7       강서구    11500         27
116  2014      6       강서구    11500         3
117  2014      5       강서구    11500         0
118  2014      4       강서구    11500         0
119  2014      3       강서구    11500         0
120  2014      2       강서구    11500         0
121  2014      1       강서구    11500         0