Overview

Dataset statistics

Number of variables: 5
Number of observations: 1830
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 77.0 KiB
Average record size in memory: 43.1 B
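
The average record size follows from the two figures above it. A minimal arithmetic check, assuming the report uses binary units (1 KiB = 1024 bytes); the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" table.
total_kib = 77.0          # Total size in memory, in KiB
n_observations = 1830     # Number of observations

# Assumption: 1 KiB = 1024 bytes. Dividing by the row count gives bytes per record.
avg_record_bytes = total_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B per record")  # → 43.1 B per record
```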

Variable types

Categorical: 1
DateTime: 1
Numeric: 3

Dataset

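Description: Daily boarding and alighting counts for Incheon Subway Line 1 from April 1, 2023 to May 31, 2023. Columns: 역명 (station name), 일자 (date), 이용인원 (total riders), 승차인원 (boardings), 하차인원 (alightings).
Author: 인천교통공사 (Incheon Transit Corporation)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15060369&srcSe=7661IVAWM27C61E190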

Alerts

이용인원 is highly overall correlated with 승차인원 and 2 other fields (High correlation)
승차인원 is highly overall correlated with 이용인원 and 2 other fields (High correlation)
하차인원 is highly overall correlated with 이용인원 and 2 other fields (High correlation)
역명 is highly overall correlated with 이용인원 and 2 other fields (High correlation)

Reproduction

Analysis started: 2024-03-18 03:10:40.257340
Analysis finished: 2024-03-18 03:10:41.448045
Duration: 1.19 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

역명
Categorical

HIGH CORRELATION 

Distinct: 30
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB

Value  Count
계양  61
귤현  61
박촌  61
임학  61
계산  61
Other values (25)  1525

Length

Max length: 8
Median length: 6
Mean length: 3.7333333
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 계양
2nd row: 계양
3rd row: 계양
4th row: 계양
5th row: 계양

Common Values

Value  Count  Frequency (%)
계양  61  3.3%
귤현  61  3.3%
박촌  61  3.3%
임학  61  3.3%
계산  61  3.3%
경인교대입구  61  3.3%
작전  61  3.3%
갈산  61  3.3%
부평구청  61  3.3%
부평시장  61  3.3%
Other values (20)  1220  66.7%

Length

Histogram of lengths of the category
Value  Count  Frequency (%)
계양  61  3.3%
귤현  61  3.3%
국제업무지구  61  3.3%
센트럴파크  61  3.3%
인천대입구  61  3.3%
지식정보단지  61  3.3%
테크노파크  61  3.3%
캠퍼스타운  61  3.3%
동막  61  3.3%
동춘  61  3.3%
Other values (20)  1220  66.7%

일자
Date

Distinct: 61
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 14.4 KiB
Minimum: 2023-04-01 00:00:00
Maximum: 2023-05-31 00:00:00
Histogram with fixed size bins (bins=50)
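
The distinct counts reported for 역명 (30) and 일자 (61) are consistent with a complete station-by-day panel. A small sketch of that check, using only figures stated in this report; the variable names are illustrative:

```python
from datetime import date

# Date bounds from the 일자 section.
first_day = date(2023, 4, 1)
last_day = date(2023, 5, 31)

# Inclusive day count: 30 days in April plus 31 in May.
n_days = (last_day - first_day).days + 1

n_stations = 30        # Distinct values of 역명
n_observations = 1830  # Number of observations in the overview

# True when every station has exactly one row per day.
panel_is_complete = n_stations * n_days == n_observations
print(n_days, panel_is_complete)  # → 61 True
```

This agrees with the per-station count of 61 shown in the 역명 value tables, and with the report finding no missing cells and no duplicate rows.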

이용인원
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1735
Distinct (%): 94.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12753.533
Minimum: 628
Maximum: 33833
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.2 KiB

Quantile statistics

Minimum: 628
5-th percentile: 3209.45
Q1: 6958.75
median: 11543
Q3: 17734
95-th percentile: 27638.45
Maximum: 33833
Range: 33205
Interquartile range (IQR): 10775.25

Descriptive statistics

Standard deviation: 7503.897
Coefficient of variation (CV): 0.58837789
Kurtosis: -0.47645111
Mean: 12753.533
Median Absolute Deviation (MAD): 5174
Skewness: 0.63773817
Sum: 23338966
Variance: 56308470
Monotonicity: Not monotonic
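
Several of the 이용인원 figures above are derived from others, so they can be cross-checked. The sketch below recomputes them from the rounded values printed in this report, so small rounding differences are expected; the variable names are illustrative:

```python
import math

# Values copied from the 이용인원 statistics.
n = 1830
mean, std = 12753.533, 7503.897
q1, q3 = 6958.75, 17734
minimum, maximum = 628, 33833

iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n                 # sum is mean times number of observations

print(iqr, value_range)          # → 10775.25 33205
print(math.isclose(cv, 0.58837789, abs_tol=1e-6))
print(math.isclose(variance, 56308470, rel_tol=1e-6))
print(abs(total - 23338966) < 2)
```

The last three comparisons use tolerances because the mean and standard deviation are printed to only three decimal places.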
Histogram with fixed size bins (bins=50)

Common values

Value  Count  Frequency (%)
12119  3  0.2%
3258  3  0.2%
3659  3  0.2%
21794  2  0.1%
8275  2  0.1%
6572  2  0.1%
7990  2  0.1%
6259  2  0.1%
5393  2  0.1%
7790  2  0.1%
Other values (1725)  1807  98.7%

Minimum 10 values

Value  Count  Frequency (%)
628  1  0.1%
676  1  0.1%
685  1  0.1%
690  1  0.1%
732  1  0.1%
764  1  0.1%
765  1  0.1%
812  1  0.1%
858  1  0.1%
1151  1  0.1%

Maximum 10 values

Value  Count  Frequency (%)
33833  1  0.1%
33685  1  0.1%
33635  1  0.1%
33169  1  0.1%
32109  1  0.1%
31969  1  0.1%
31940  1  0.1%
31589  1  0.1%
31494  1  0.1%
31311  1  0.1%

승차인원
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1691
Distinct (%): 92.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6405.1153
Minimum: 318
Maximum: 16444
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.2 KiB

Quantile statistics

Minimum: 318
5-th percentile: 1368.95
Q1: 3529.75
median: 5915
Q3: 8900.75
95-th percentile: 13699.75
Maximum: 16444
Range: 16126
Interquartile range (IQR): 5371

Descriptive statistics

Standard deviation: 3757.8328
Coefficient of variation (CV): 0.58669246
Kurtosis: -0.48447033
Mean: 6405.1153
Median Absolute Deviation (MAD): 2573
Skewness: 0.62275479
Sum: 11721361
Variance: 14121308
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value  Count  Frequency (%)
5435  3  0.2%
1364  3  0.2%
6295  3  0.2%
834  3  0.2%
3496  3  0.2%
6101  3  0.2%
1318  3  0.2%
3446  2  0.1%
4033  2  0.1%
14954  2  0.1%
Other values (1681)  1803  98.5%

Minimum 10 values

Value  Count  Frequency (%)
318  1  0.1%
329  1  0.1%
356  1  0.1%
358  2  0.1%
388  1  0.1%
393  1  0.1%
415  1  0.1%
446  1  0.1%
561  1  0.1%
585  2  0.1%

Maximum 10 values

Value  Count  Frequency (%)
16444  1  0.1%
16404  1  0.1%
16295  1  0.1%
16191  1  0.1%
15970  1  0.1%
15890  1  0.1%
15644  1  0.1%
15614  1  0.1%
15596  1  0.1%
15570  1  0.1%

하차인원
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1689
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6348.418
Minimum: 310
Maximum: 17444
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.2 KiB

Quantile statistics

Minimum: 310
5-th percentile: 1627.7
Q1: 3461.25
median: 5682.5
Q3: 8801
95-th percentile: 13896.85
Maximum: 17444
Range: 17134
Interquartile range (IQR): 5339.75

Descriptive statistics

Standard deviation: 3754.9157
Coefficient of variation (CV): 0.59147267
Kurtosis: -0.44404552
Mean: 6348.418
Median Absolute Deviation (MAD): 2625
Skewness: 0.65799987
Sum: 11617605
Variance: 14099392
Monotonicity: Not monotonic
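
The column totals reported for the three numeric variables are consistent with 이용인원 being the sum of 승차인원 and 하차인원, which would also account for the high-correlation alerts. A quick check on the aggregate figures printed in this report; it confirms the totals only, not every individual row:

```python
# Sums copied from the descriptive statistics of each variable.
total_sum = 23338966       # 이용인원
boardings_sum = 11721361   # 승차인원
alightings_sum = 11617605  # 하차인원

sums_match = boardings_sum + alightings_sum == total_sum

# The means should agree too, up to the rounding used in the report.
mean_gap = abs((6405.1153 + 6348.418) - 12753.533)

print(sums_match, mean_gap < 0.001)  # → True True
```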
Histogram with fixed size bins (bins=50)

Common values

Value  Count  Frequency (%)
1892  3  0.2%
3279  3  0.2%
2630  3  0.2%
6482  3  0.2%
3881  3  0.2%
5936  3  0.2%
1701  3  0.2%
1770  3  0.2%
4163  3  0.2%
4814  2  0.1%
Other values (1679)  1801  98.4%

Minimum 10 values

Value  Count  Frequency (%)
310  1  0.1%
320  1  0.1%
332  1  0.1%
356  1  0.1%
372  1  0.1%
374  1  0.1%
376  1  0.1%
397  1  0.1%
412  1  0.1%
566  1  0.1%

Maximum 10 values

Value  Count  Frequency (%)
17444  1  0.1%
17429  1  0.1%
17390  1  0.1%
16725  1  0.1%
16383  1  0.1%
16325  1  0.1%
16207  1  0.1%
16139  1  0.1%
16107  1  0.1%
15937  1  0.1%

Interactions

[Figure: pairwise interaction plots, 9 panels]

Correlations

          역명     일자     이용인원   승차인원   하차인원
역명       1.000  0.000  0.931  0.938  0.929
일자       0.000  1.000  0.098  0.000  0.070
이용인원   0.931  0.098  1.000  0.990  0.995
승차인원   0.938  0.000  0.990  1.000  0.981
하차인원   0.929  0.070  0.995  0.981  1.000

          이용인원   승차인원   하차인원   역명
이용인원   1.000  0.999  0.999  0.609
승차인원   0.999  1.000  0.995  0.628
하차인원   0.999  0.995  1.000  0.604
역명       0.609  0.628  0.604  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

      역명    일자          이용인원  승차인원  하차인원
0     계양    2023-05-31  8399     4342     4057
1     계양    2023-05-30  8527     4383     4144
2     계양    2023-05-29  5105     2493     2612
3     계양    2023-05-28  3600     1595     2005
4     계양    2023-05-27  4953     2323     2630
5     계양    2023-05-26  8598     4394     4204
6     계양    2023-05-25  8936     4432     4504
7     계양    2023-05-24  8461     4308     4153
8     계양    2023-05-23  8429     4361     4068
9     계양    2023-05-22  8286     4284     4002

Last rows

      역명              일자          이용인원  승차인원  하차인원
1820  송도달빛축제공원  2023-04-10  10156    5075     5081
1821  송도달빛축제공원  2023-04-09  6356     3206     3150
1822  송도달빛축제공원  2023-04-08  8284     4224     4060
1823  송도달빛축제공원  2023-04-07  10296    5214     5082
1824  송도달빛축제공원  2023-04-06  9763     4926     4837
1825  송도달빛축제공원  2023-04-05  8744     4332     4412
1826  송도달빛축제공원  2023-04-04  9943     4962     4981
1827  송도달빛축제공원  2023-04-03  10132    5115     5017
1828  송도달빛축제공원  2023-04-02  6864     3409     3455
1829  송도달빛축제공원  2023-04-01  8281     4176     4105
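
At row level, the sample suggests that 이용인원 equals 승차인원 plus 하차인원. A minimal sketch that tests this on the first five 계양 rows, taking the column values as they read in the sample above; the tuple layout and names are illustrative:

```python
# (일자, 이용인원, 승차인원, 하차인원) for the first five sample rows of 계양.
rows = [
    ("2023-05-31", 8399, 4342, 4057),
    ("2023-05-30", 8527, 4383, 4144),
    ("2023-05-29", 5105, 2493, 2612),
    ("2023-05-28", 3600, 1595, 2005),
    ("2023-05-27", 4953, 2323, 2630),
]

# Collect any row where the total is not boardings plus alightings.
mismatches = [r for r in rows if r[1] != r[2] + r[3]]
print(mismatches)  # → []
```

An empty list here only covers the rows shown; running the same comparison over the full dataset would be needed to confirm it for all 1830 observations.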