Overview

Dataset statistics

Number of variables2
Number of observations177
Missing cells6
Missing cells (%)1.7%
Duplicate rows1
Duplicate rows (%)0.6%
Total size in memory3.1 KiB
Average record size in memory17.7 B

Variable types

DateTime1
Numeric1

Dataset

Description창원시 시민공영자전거 누비자에 대한 이용현황을 2008년 10월부터 2023년3월까지 연도별, 월별로 구분하여 데이터를 제공하고 있습니다
Author경상남도 창원시
URLhttps://www.data.go.kr/data/15075281/fileData.do

Alerts

Dataset has 1 (0.6%) duplicate rowsDuplicates
년월 has 3 (1.7%) missing valuesMissing
이용건수 has 3 (1.7%) missing valuesMissing

Reproduction

Analysis started2024-03-14 14:50:43.172306
Analysis finished2024-03-14 14:50:44.381247
Duration1.21 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월
Date

MISSING 

Distinct174
Distinct (%)100.0%
Missing3
Missing (%)1.7%
Memory size1.5 KiB
Minimum2008-10-01 00:00:00
Maximum2023-03-01 00:00:00
2024-03-14T23:50:44.601012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:50:45.021594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

이용건수
Real number (ℝ)

MISSING 

Distinct174
Distinct (%)100.0%
Missing3
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean381186.39
Minimum1393
Maximum682512
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-14T23:50:45.414645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1393
5-th percentile90476.85
Q1284941
median389510.5
Q3495580.75
95-th percentile614748.25
Maximum682512
Range681119
Interquartile range (IQR)210639.75

Descriptive statistics

Standard deviation153803.73
Coefficient of variation (CV)0.40348695
Kurtosis-0.0010259609
Mean381186.39
Median Absolute Deviation (MAD)106761
Skewness-0.45466126
Sum66326432
Variance2.3655589 × 1010
MonotonicityNot monotonic
2024-03-14T23:50:45.856714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
493345 1
 
0.6%
264463 1
 
0.6%
355620 1
 
0.6%
430053 1
 
0.6%
456216 1
 
0.6%
471500 1
 
0.6%
457700 1
 
0.6%
436014 1
 
0.6%
428609 1
 
0.6%
427194 1
 
0.6%
Other values (164) 164
92.7%
(Missing) 3
 
1.7%
ValueCountFrequency (%)
1393 1
0.6%
4039 1
0.6%
5327 1
0.6%
6554 1
0.6%
7123 1
0.6%
9126 1
0.6%
14230 1
0.6%
41403 1
0.6%
87871 1
0.6%
91880 1
0.6%
ValueCountFrequency (%)
682512 1
0.6%
673049 1
0.6%
663850 1
0.6%
660771 1
0.6%
632993 1
0.6%
631941 1
0.6%
631431 1
0.6%
627692 1
0.6%
615486 1
0.6%
614351 1
0.6%

Interactions

2024-03-14T23:50:43.237019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-14T23:50:43.807727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:50:44.008041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T23:50:44.254473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

년월이용건수
02008-10-011393
12008-11-017123
22008-12-015327
32009-01-014039
42009-02-016554
52009-03-019126
62009-04-0114230
72009-05-0141403
82009-06-01124037
92009-07-01202941
년월이용건수
1672022-09-01384104
1682022-10-01421501
1692022-11-01382729
1702022-12-01269849
1712023-01-01237051
1722023-02-01254091
1732023-03-01355137
174<NA><NA>
175<NA><NA>
176<NA><NA>

Duplicate rows

Most frequently occurring

년월이용건수# duplicates
0<NA><NA>3