Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows116
Duplicate rows (%)1.2%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Categorical2
DateTime1
Numeric2

Dataset

Description고양시 안심주차 서비스 정보
Author고양시
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=VQNTNO356OS2N9YGQR2W26347730&infSeq=1

Alerts

Dataset has 116 (1.2%) duplicate rowsDuplicates
구역명 is highly overall correlated with 설치장소명High correlation
센서번호아이디 is highly overall correlated with 주정차여부구분명 (on:주정차, off:비어있음)High correlation
설치장소명 is highly overall correlated with 구역명High correlation
주정차여부구분명 (on:주정차, off:비어있음) is highly overall correlated with 센서번호아이디High correlation

Reproduction

Analysis started2023-12-10 22:37:11.708761
Analysis finished2023-12-10 22:37:12.488260
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

설치장소명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
장항IC
8163 
냉천초등학교
938 
성저초등학교
899 

Length

Max length6
Median length4
Mean length4.3674
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row장항IC
2nd row장항IC
3rd row장항IC
4th row장항IC
5th row장항IC

Common Values

ValueCountFrequency (%)
장항IC 8163
81.6%
냉천초등학교 938
 
9.4%
성저초등학교 899
 
9.0%

Length

2023-12-11T07:37:12.555177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:37:12.653854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장항ic 8163
81.6%
냉천초등학교 938
 
9.4%
성저초등학교 899
 
9.0%
Distinct4519
Distinct (%)45.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-01-01 09:54:45
Maximum2022-05-05 10:17:52
2023-12-11T07:37:12.756540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:37:12.890952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

구역명
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.2458
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:37:12.981045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile3
Maximum6
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.72486889
Coefficient of variation (CV)0.58185013
Kurtosis9.8498246
Mean1.2458
Median Absolute Deviation (MAD)0
Skewness3.2212983
Sum12458
Variance0.5254349
MonotonicityNot monotonic
2023-12-11T07:37:13.082544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 8694
86.9%
2 647
 
6.5%
4 389
 
3.9%
3 219
 
2.2%
5 49
 
0.5%
6 2
 
< 0.1%
ValueCountFrequency (%)
1 8694
86.9%
2 647
 
6.5%
3 219
 
2.2%
4 389
 
3.9%
5 49
 
0.5%
6 2
 
< 0.1%
ValueCountFrequency (%)
6 2
 
< 0.1%
5 49
 
0.5%
4 389
 
3.9%
3 219
 
2.2%
2 647
 
6.5%
1 8694
86.9%

센서번호아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3959
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:37:13.158775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile7
Maximum9
Range8
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.1182096
Coefficient of variation (CV)0.623755
Kurtosis-0.60793409
Mean3.3959
Median Absolute Deviation (MAD)2
Skewness0.67082297
Sum33959
Variance4.4868119
MonotonicityNot monotonic
2023-12-11T07:37:13.240014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 2213
22.1%
2 2152
21.5%
3 1702
17.0%
4 993
9.9%
6 901
9.0%
5 881
 
8.8%
7 836
 
8.4%
8 197
 
2.0%
9 125
 
1.2%
ValueCountFrequency (%)
1 2213
22.1%
2 2152
21.5%
3 1702
17.0%
4 993
9.9%
5 881
 
8.8%
6 901
9.0%
7 836
 
8.4%
8 197
 
2.0%
9 125
 
1.2%
ValueCountFrequency (%)
9 125
 
1.2%
8 197
 
2.0%
7 836
 
8.4%
6 901
9.0%
5 881
 
8.8%
4 993
9.9%
3 1702
17.0%
2 2152
21.5%
1 2213
22.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
off
6863 
on
3137 

Length

Max length3
Median length3
Mean length2.6863
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowoff
2nd rowoff
3rd rowoff
4th rowoff
5th rowoff

Common Values

ValueCountFrequency (%)
off 6863
68.6%
on 3137
31.4%

Length

2023-12-11T07:37:13.336625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:37:13.412531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
off 6863
68.6%
on 3137
31.4%

Interactions

2023-12-11T07:37:12.137224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:37:11.964875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:37:12.217294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:37:12.041444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:37:13.470723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치장소명구역명센서번호아이디주정차여부구분명 (on:주정차, off:비어있음)
설치장소명1.0000.9390.1740.097
구역명0.9391.0000.1270.210
센서번호아이디0.1740.1271.0000.632
주정차여부구분명 (on:주정차, off:비어있음)0.0970.2100.6321.000
2023-12-11T07:37:13.544192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치장소명주정차여부구분명 (on:주정차, off:비어있음)
설치장소명1.0000.160
주정차여부구분명 (on:주정차, off:비어있음)0.1601.000
2023-12-11T07:37:13.606892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구역명센서번호아이디설치장소명주정차여부구분명 (on:주정차, off:비어있음)
구역명1.000-0.0070.7000.151
센서번호아이디-0.0071.0000.0770.638
설치장소명0.7000.0771.0000.160
주정차여부구분명 (on:주정차, off:비어있음)0.1510.6380.1601.000

Missing values

2023-12-11T07:37:12.355401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:37:12.442379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

설치장소명감지시간구역명센서번호아이디주정차여부구분명 (on:주정차, off:비어있음)
3672장항IC2020-04-06 14:10:2711off
23079장항IC2020-03-25 19:22:0117off
10316장항IC2020-04-02 15:34:3914off
17092장항IC2020-03-29 16:32:2412off
18384장항IC2020-03-28 17:45:5516off
15768장항IC2020-03-30 13:37:1914off
17867장항IC2020-03-29 00:26:1211off
9306장항IC2020-04-02 17:44:3512on
14329장항IC2020-03-31 10:09:0712on
17182장항IC2020-03-29 16:27:5312off
설치장소명감지시간구역명센서번호아이디주정차여부구분명 (on:주정차, off:비어있음)
1662성저초등학교2020-04-15 17:17:4632off
17835장항IC2020-03-29 00:45:4211off
20974장항IC2020-03-27 15:34:2712off
17939장항IC2020-03-28 23:52:0412on
10648장항IC2020-04-02 15:13:3713off
22414장항IC2020-03-27 09:49:5517off
23216장항IC2020-03-25 15:18:0812off
4780장항IC2020-04-06 07:14:3312on
19958장항IC2020-03-28 03:03:3413off
122냉천초등학교2020-05-30 11:04:2726off

Duplicate rows

Most frequently occurring

설치장소명감지시간구역명센서번호아이디주정차여부구분명 (on:주정차, off:비어있음)# duplicates
73성저초등학교2020-04-15 12:08:3232off6
79성저초등학교2020-04-15 16:53:5034on6
91성저초등학교2020-04-15 17:17:4632off6
19냉천초등학교2020-04-15 18:20:1944off5
26냉천초등학교2020-04-15 18:42:0343off5
28냉천초등학교2020-04-15 18:42:0346off5
32냉천초등학교2020-04-15 20:09:3511off5
36냉천초등학교2020-04-15 20:39:0544off5
70성저초등학교2020-04-15 11:32:5922off5
74성저초등학교2020-04-15 12:08:3233off5