Overview

Dataset statistics

Number of variables: 5
Number of observations: 1270
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 53.5 KiB
Average record size in memory: 43.1 B
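
The average record size follows from the two figures reported above: the total size in memory divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the variable names are illustrative, not taken from the report):

```python
# Figures copied from the dataset statistics above.
total_size_kib = 53.5
n_observations = 1270

# Assumption: KiB means 1024 bytes.
total_bytes = total_size_kib * 1024
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```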

Variable types

Numeric: 3
Categorical: 1
DateTime: 1

Dataset

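Description: Information produced by updating the specification data for speed bumps in the spatial information system of Anyang-si, Gyeonggi-do. Columns: 관리번호 (management number), 폭원 (width), 연장 (length), 관할지역 (jurisdiction), 설치일자 (installation date).
URL: https://www.data.go.kr/data/15042417/fileData.do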

Alerts

관리번호 is highly overall correlated with 폭원 and 2 other fields (High correlation)
폭원 is highly overall correlated with 관리번호 and 1 other fields (High correlation)
연장 is highly overall correlated with 관리번호 and 1 other fields (High correlation)
관할지역 is highly overall correlated with 관리번호 (High correlation)
관할지역 is highly imbalanced (79.7%) (Imbalance)
연장 is highly skewed (γ1 = 35.63697612) (Skewed)
폭원 has 1098 (86.5%) zeros (Zeros)
연장 has 1098 (86.5%) zeros (Zeros)
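
The zero alerts above can be checked against the observation count. The sketch below recomputes the zero share and the number of rows that carry a non-zero measurement. Whether a zero means "not measured" rather than a true width or length of 0 is an assumption the report itself does not confirm.

```python
# Counts copied from the alerts and the dataset statistics.
n_observations = 1270
zero_counts = {"폭원": 1098, "연장": 1098}

for column, zeros in zero_counts.items():
    share = 100 * zeros / n_observations   # percentage of zero cells
    non_zero = n_observations - zeros      # rows with a non-zero value
    print(f"{column}: {share:.1f}% zeros, {non_zero} non-zero rows")
```

Both columns have the same zero count, so at most 172 rows can carry a non-zero width and length together.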

Reproduction

Analysis started: 2023-12-13 00:44:33.135866
Analysis finished: 2023-12-13 00:44:34.121751
Duration: 0.99 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

관리번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1266
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.4993366 × 10^8
Minimum: 1
Maximum: 2.1474836 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 69.45
Q1: 372.5
Median: 745.5
Q3: 1106.75
95-th percentile: 2.0191 × 10^9
Maximum: 2.1474836 × 10^9
Range: 2.1474836 × 10^9
Interquartile range (IQR): 734.25

Descriptive statistics

Standard deviation: 5.3050572 × 10^8
Coefficient of variation (CV): 3.5382697
Kurtosis: 8.6388769
Mean: 1.4993366 × 10^8
Median Absolute Deviation (MAD): 367.5
Skewness: 3.2592053
Sum: 1.9041574 × 10^11
Variance: 2.8143632 × 10^17
Monotonicity: Not monotonic
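
Several of the descriptive statistics for 관리번호 are tied together by simple identities: CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. A minimal sketch that recomputes them from the rounded figures in the report (small differences in the last digits come from that rounding):

```python
# Rounded figures copied from the 관리번호 statistics.
n_observations = 1270
mean = 1.4993366e8
std = 5.3050572e8

cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance is the squared standard deviation
total = mean * n_observations   # sum of all values

print(f"CV       = {cv:.5f}")
print(f"Variance = {variance:.5e}")
print(f"Sum      = {total:.5e}")
```

A mean of about 1.5 × 10^8 next to a median of 745.5 shows how strongly a handful of ten-digit identifiers dominate these moments.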
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
2147483647           4      0.3%
1000                 2      0.2%
849                  1      0.1%
514                  1      0.1%
65                   1      0.1%
66                   1      0.1%
1141                 1      0.1%
619                  1      0.1%
620                  1      0.1%
511                  1      0.1%
Other values (1256)  1256   98.9%

Value                Count  Frequency (%)
1                    1      0.1%
2                    1      0.1%
3                    1      0.1%
4                    1      0.1%
6                    1      0.1%
8                    1      0.1%
9                    1      0.1%
10                   1      0.1%
11                   1      0.1%
12                   1      0.1%

Value                Count  Frequency (%)
2147483647           4      0.3%
2023040001           1      0.1%
2021120018           1      0.1%
2021120017           1      0.1%
2021120016           1      0.1%
2021120015           1      0.1%
2021120014           1      0.1%
2021120013           1      0.1%
2021120012           1      0.1%
2021120011           1      0.1%
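
The largest 관리번호 value, 2147483647, equals 2^31 − 1, the maximum of a signed 32-bit integer. That makes it a candidate placeholder or overflow value rather than a real management number, although the report does not say so. The sketch below flags it among the largest values listed above; `looks_like_sentinel` is a hypothetical helper, not part of ydata-profiling.

```python
INT32_MAX = 2**31 - 1  # 2147483647

# Value -> count, copied from the table of largest values.
largest_ids = {
    2147483647: 4,
    2023040001: 1,
    2021120018: 1,
    2021120017: 1,
    2021120016: 1,
    2021120015: 1,
    2021120014: 1,
    2021120013: 1,
    2021120012: 1,
    2021120011: 1,
}


def looks_like_sentinel(value: int) -> bool:
    """Hypothetical rule: flag IDs equal to the signed 32-bit maximum."""
    return value == INT32_MAX


flagged_rows = sum(
    count for value, count in largest_ids.items() if looks_like_sentinel(value)
)
print(f"{flagged_rows} rows carry the value {INT32_MAX}")
```

If those four rows are indeed placeholders, they also explain why 관리번호 has only 1266 distinct values across 1270 rows once the duplicated value 1000 is counted.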

폭원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 145
Distinct (%): 11.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.279189
Minimum: 0
Maximum: 36.51
Zeros: 1098
Zeros (%): 86.5%
Negative: 0
Negative (%): 0.0%
Memory size: 11.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 8.779
Maximum: 36.51
Range: 36.51
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 4.0719025
Coefficient of variation (CV): 3.1831907
Kurtosis: 23.52821
Mean: 1.279189
Median Absolute Deviation (MAD): 0
Skewness: 4.3978452
Sum: 1624.57
Variance: 16.58039
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
0.0                 1098   86.5%
6.0                 16     1.3%
6.9                 3      0.2%
5.0                 3      0.2%
3.6                 3      0.2%
5.86                2      0.2%
6.6                 2      0.2%
10.0                2      0.2%
4.15                2      0.2%
6.8                 2      0.2%
Other values (135)  137    10.8%

Value               Count  Frequency (%)
0.0                 1098   86.5%
1.75                1      0.1%
1.8                 1      0.1%
2.64                1      0.1%
2.68                1      0.1%
2.71                1      0.1%
2.74                1      0.1%
2.93                1      0.1%
3.04                1      0.1%
3.1                 1      0.1%

Value               Count  Frequency (%)
36.51               1      0.1%
35.93               1      0.1%
33.49               1      0.1%
32.02               1      0.1%
29.79               1      0.1%
26.31               1      0.1%
24.45               1      0.1%
23.62               1      0.1%
23.07               1      0.1%
22.8                1      0.1%

연장
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct: 114
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.642252
Minimum: 0
Maximum: 99999.9
Zeros: 1098
Zeros (%): 86.5%
Negative: 0
Negative (%): 0.0%
Memory size: 11.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 4.1955
Maximum: 99999.9
Range: 99999.9
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2806.0417
Coefficient of variation (CV): 35.233078
Kurtosis: 1269.996
Mean: 79.642252
Median Absolute Deviation (MAD): 0
Skewness: 35.636976
Sum: 101145.66
Variance: 7873870.1
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
0.0                 1098   86.5%
3.5                 14     1.1%
3.6                 13     1.0%
3.62                8      0.6%
3.59                6      0.5%
3.61                6      0.5%
2.9                 3      0.2%
6.0                 3      0.2%
1.96                3      0.2%
3.0                 3      0.2%
Other values (104)  113    8.9%

Value               Count  Frequency (%)
0.0                 1098   86.5%
0.9                 1      0.1%
0.93                1      0.1%
0.98                1      0.1%
1.0                 1      0.1%
1.02                1      0.1%
1.04                2      0.2%
1.07                1      0.1%
1.21                1      0.1%
1.7                 1      0.1%

Value               Count  Frequency (%)
99999.9             1      0.1%
41.73               1      0.1%
38.82               1      0.1%
32.0                1      0.1%
30.28               1      0.1%
26.88               1      0.1%
25.99               1      0.1%
25.65               1      0.1%
25.25               1      0.1%
25.2                1      0.1%
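
The single value 99999.9 is more than two thousand times the next-largest 연장 value (41.73), which suggests a placeholder rather than a measured length. That reading is an assumption. Under it, the sketch below uses the reported sum and count to show how much that one row shifts the mean.

```python
# Figures copied from the 연장 statistics.
reported_sum = 101145.66
n_observations = 1270
suspected_placeholder = 99999.9  # assumption: not a real measurement

mean_all = reported_sum / n_observations
mean_without = (reported_sum - suspected_placeholder) / (n_observations - 1)

print(f"mean with the value:    {mean_all:.3f}")
print(f"mean without the value: {mean_without:.3f}")
```

Dropping the one row moves the mean from about 79.6 to about 0.9, so the skewness and kurtosis alerts for this column are best read as a consequence of that row rather than of the distribution as a whole.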

관할지역
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 16
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.1 KiB

안양시: 1140
호계1동: 38
비산2동: 18
박달2동: 17
석수1동: 9
Other values (11): 48

Length

Max length: 10
Median length: 3
Mean length: 3.119685
Min length: 3

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 안양시
2nd row: 안양시
3rd row: 안양시
4th row: 안양시
5th row: 안양시

Common Values

Value             Count  Frequency (%)
안양시            1140   89.8%
호계1동           38     3.0%
비산2동           18     1.4%
박달2동           17     1.3%
석수1동           9      0.7%
호계3동           9      0.7%
안양2동           8      0.6%
호계2동           6      0.5%
안양9동           5      0.4%
석수2동           5      0.4%
Other values (6)  15     1.2%

Length

Histogram of lengths of the category
Value             Count  Frequency (%)
안양시            1140   89.5%
호계1동           38     3.0%
비산2동           18     1.4%
박달2동           17     1.3%
석수1동           9      0.7%
호계3동           9      0.7%
안양2동           8      0.6%
호계2동           6      0.5%
석수2동           5      0.4%
안양9동           5      0.4%
Other values (7)  19     1.5%

설치일자
DateTime

Distinct: 22
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 10.1 KiB
Minimum: 2007-02-23 00:00:00
Maximum: 2023-01-01 00:00:00
Histogram with fixed size bins (bins=22)

Interactions

[Figures: nine pairwise interaction plots; images not reproduced here.]

Correlations

           관리번호  폭원   연장   관할지역  설치일자
관리번호   1.000     0.918  0.000  0.995     0.999
폭원       0.918     1.000  0.219  0.749     0.789
연장       0.000     0.219  1.000  0.000     0.000
관할지역   0.995     0.749  0.000  1.000     0.980
설치일자   0.999     0.789  0.000  0.980     1.000

           관리번호  폭원   연장   관할지역
관리번호   1.000     0.587  0.585  0.931
폭원       0.587     1.000  0.994  0.412
연장       0.585     0.994  1.000  0.000
관할지역   0.931     0.412  0.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index  관리번호    폭원   연장   관할지역  설치일자
0      849         0.0    0.0    안양시    2012-09-01
1      848         0.0    0.0    안양시    2012-09-01
2      847         0.0    0.0    안양시    2012-09-01
3      846         0.0    0.0    안양시    2012-09-01
4      845         0.0    0.0    안양시    2012-09-01
5      844         0.0    0.0    안양시    2012-09-01
6      922         0.0    0.0    안양시    2012-09-01
7      921         0.0    0.0    안양시    2012-09-01
8      920         0.0    0.0    안양시    2012-09-01
9      919         0.0    0.0    안양시    2012-09-01

Index  관리번호    폭원   연장   관할지역  설치일자
1260   2021120016  19.39  25.2   비산2동   2021-01-01
1261   2021120017  5.6    2.9    비산2동   2021-01-01
1262   2021120018  5.0    3.6    비산2동   2021-01-01
1263   2023040001  8.86   3.59   안양2동   2023-01-01
1264   2147483647  2.71   3.59   안양2동   2022-01-01
1265   2147483647  2.74   3.62   안양2동   2022-01-01
1266   1120        0.0    0.0    안양시    2012-09-01
1267   557         0.0    0.0    안양시    2007-02-23
1268   2147483647  2.68   3.62   안양2동   2022-01-01
1269   2147483647  8.68   3.61   안양2동   2022-01-01