Overview

Dataset statistics

Number of variables: 6
Number of observations: 21
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 56.3 B

Variable types

Categorical: 4
Numeric: 2

Dataset

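Description: This dataset provides information on Incheon Metropolitan City local bus fares (category (regular bus/seated bus/airport bus/M-bus), type, age, card, cash, and applicable routes).
- Distance-proportional fare system in effect
· Seated bus: base fare within 10 km; 10~40 km (100 won added per 5 km); over 40 km (100 won)
· M-bus: base fare within 30 km; 30~60 km (100 won added per 5 km); over 60 km (100 won)
· Subway: base fare within 10 km; 10~50 km (100 won added per 5 km); over 50 km (100 won per 8 km)
- Early-morning discount in effect
· M-bus, subway: 20% discount on the first boarding before 06:30
- Children (age 6 or older up to age 12), youth (age 13 or older up to age 18)
· 3 young children aged 6 or under ride free (except when a seat assignment is requested)
Author: 인천광역시 (Incheon Metropolitan City)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15069137&srcSe=7661IVAWM27C61E190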
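The distance-proportional rule in the description can be read as a small piece of arithmetic. The sketch below is one possible interpretation, not the operator's official fare calculation: the function name `distance_surcharge`, the rounding of partial 5 km steps up to a full step, and the treatment of the "over" band as a single flat 100 won are all assumptions. The subway's "100 won per 8 km" band is not modeled.

```python
import math

# Band limits (km) copied from the dataset description.
# Everything else in this sketch is an assumption made for illustration.
BANDS = {
    "seated_bus": (10, 40),  # base fare within 10 km, stepped band up to 40 km
    "m_bus": (30, 60),       # base fare within 30 km, stepped band up to 60 km
}


def distance_surcharge(distance_km, base_limit_km, band_end_km,
                       step_km=5, step_won=100, over_won=100):
    """Return the extra fare in won on top of the base fare.

    Assumptions: a started 5 km step is charged in full, and travelling
    beyond the band adds one flat `over_won` charge.
    """
    if distance_km <= base_limit_km:
        return 0
    # Distance travelled inside the stepped band, capped at the band end.
    in_band_km = min(distance_km, band_end_km) - base_limit_km
    surcharge = math.ceil(in_band_km / step_km) * step_won
    if distance_km > band_end_km:
        surcharge += over_won
    return surcharge


seated_12km = distance_surcharge(12, *BANDS["seated_bus"])
m_bus_35km = distance_surcharge(35, *BANDS["m_bus"])
print(seated_12km, m_bus_35km)
```

Under these assumptions a 12 km seated-bus trip and a 35 km M-bus trip each add one 100 won step to the base fare.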

Alerts

종류 is highly overall correlated with 구분 and 1 other field (High correlation)
해당노선 is highly overall correlated with 구분 and 1 other field (High correlation)
구분 is highly overall correlated with 종류 and 1 other field (High correlation)
카드 is highly overall correlated with 현금 (High correlation)
현금 is highly overall correlated with 카드 (High correlation)

Reproduction

Analysis started: 2024-01-28 06:43:42.508768
Analysis finished: 2024-01-28 06:43:43.101067
Duration: 0.59 seconds
Software version: ydata-profiling v4.5.1

Variables

구분
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 23.8%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B
일반버스
좌석버스
광역버스(직행좌석)
M버스(광역급행)
BRT

Length

Max length: 10
Median length: 4
Mean length: 5.4285714
Min length: 3
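The length statistics can be re-derived from the category counts reported for this variable. The sketch below is illustrative only; it assumes the profiler measures length in characters (Unicode code points, which is what Python's `len` counts), and the names `counts` and `length_stats` are made up for this example.

```python
from statistics import mean, median

# Category counts copied from the Common Values table for 구분.
counts = {
    "일반버스": 6,
    "좌석버스": 6,
    "광역버스(직행좌석)": 3,
    "M버스(광역급행)": 3,
    "BRT": 3,
}

# One length per observation (21 in total); len() counts code points.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```

The same approach applies to the other categorical variables in the report.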

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반버스
2nd row: 일반버스
3rd row: 일반버스
4th row: 일반버스
5th row: 일반버스

Common Values

Value | Count | Frequency (%)
일반버스 | 6 | 28.6%
좌석버스 | 6 | 28.6%
광역버스(직행좌석) | 3 | 14.3%
M버스(광역급행) | 3 | 14.3%
BRT | 3 | 14.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common Values plot]

Value | Count | Frequency (%)
일반버스 | 6 | 28.6%
좌석버스 | 6 | 28.6%
광역버스(직행좌석 | 3 | 14.3%
m버스(광역급행 | 3 | 14.3%
brt | 3 | 14.3%

종류
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B
간선형
지선형
타시도
영종행
공항버스(직행좌석)
Other values (2)

Length

Max length: 10
Median length: 3
Mean length: 4.8571429
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 간선형
2nd row: 간선형
3rd row: 간선형
4th row: 지선형
5th row: 지선형

Common Values

Value | Count | Frequency (%)
간선형 | 3 | 14.3%
지선형 | 3 | 14.3%
타시도 | 3 | 14.3%
영종행 | 3 | 14.3%
공항버스(직행좌석) | 3 | 14.3%
M버스(광역급행) | 3 | 14.3%
BRT | 3 | 14.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common Values plot]

Value | Count | Frequency (%)
간선형 | 3 | 14.3%
지선형 | 3 | 14.3%
타시도 | 3 | 14.3%
영종행 | 3 | 14.3%
공항버스(직행좌석 | 3 | 14.3%
m버스(광역급행 | 3 | 14.3%
brt | 3 | 14.3%

권종
Categorical

Distinct: 3
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B
일반
청소년
어린이

Length

Max length: 3
Median length: 3
Mean length: 2.6666667
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 청소년
3rd row: 어린이
4th row: 일반
5th row: 청소년

Common Values

Value | Count | Frequency (%)
일반 | 7 | 33.3%
청소년 | 7 | 33.3%
어린이 | 7 | 33.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common Values plot]

Value | Count | Frequency (%)
일반 | 7 | 33.3%
청소년 | 7 | 33.3%
어린이 | 7 | 33.3%

카드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 20
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1278.5714
Minimum: 350
Maximum: 2800
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 321.0 B

Quantile statistics

Minimum: 350
5-th percentile: 500
Q1: 870
median: 1200
Q3: 1600
95-th percentile: 2650
Maximum: 2800
Range: 2450
Interquartile range (IQR): 730

Descriptive statistics

Standard deviation: 678.994
Coefficient of variation (CV): 0.53105676
Kurtosis: 0.22050476
Mean: 1278.5714
Median Absolute Deviation (MAD): 400
Skewness: 0.87735758
Sum: 26850
Variance: 461032.86
Monotonicity: Not monotonic
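Most of the quantile and descriptive statistics for 카드 can be cross-checked with Python's standard library. The sketch below is a sanity check under stated assumptions, not the profiler's own code: the 21 values are reassembled from the lowest-value and highest-value tables in this report, the variance is taken as the sample variance (n - 1 denominator), and quartiles use linear interpolation (`method="inclusive"`). The variable names are illustrative.

```python
import statistics

# The 21 카드 values, reassembled from the lowest-10 and highest-10 value
# tables in this report (1200 occurs twice).
card = [350, 500, 530, 600, 700, 870, 900, 950, 1000, 1100,
        1200, 1200, 1250, 1300, 1500, 1600, 1650, 2000, 2200, 2650, 2800]

total = sum(card)
mean = statistics.mean(card)
med = statistics.median(card)
variance = statistics.variance(card)  # sample variance (n - 1 denominator)
std = statistics.stdev(card)          # square root of the sample variance
cv = std / mean                       # coefficient of variation

# Quartiles with linear interpolation between order statistics.
q1, _, q3 = statistics.quantiles(card, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median([abs(x - med) for x in card])

print(total, round(mean, 4), med, round(variance, 2), round(std, 3))
print(round(cv, 8), q1, q3, iqr, mad)
```

With these choices the results agree with the figures reported above for the sum, mean, median, variance, standard deviation, CV, Q1, Q3, IQR, and MAD.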
[Figure: Histogram with fixed size bins (bins=20)]

Value | Count | Frequency (%)
1200 | 2 | 9.5%
700 | 1 | 4.8%
1000 | 1 | 4.8%
2200 | 1 | 4.8%
1600 | 1 | 4.8%
2000 | 1 | 4.8%
2800 | 1 | 4.8%
1100 | 1 | 4.8%
1500 | 1 | 4.8%
2650 | 1 | 4.8%
Other values (10) | 10 | 47.6%

Value | Count | Frequency (%)
350 | 1 | 4.8%
500 | 1 | 4.8%
530 | 1 | 4.8%
600 | 1 | 4.8%
700 | 1 | 4.8%
870 | 1 | 4.8%
900 | 1 | 4.8%
950 | 1 | 4.8%
1000 | 1 | 4.8%
1100 | 1 | 4.8%

Value | Count | Frequency (%)
2800 | 1 | 4.8%
2650 | 1 | 4.8%
2200 | 1 | 4.8%
2000 | 1 | 4.8%
1650 | 1 | 4.8%
1600 | 1 | 4.8%
1500 | 1 | 4.8%
1300 | 1 | 4.8%
1250 | 1 | 4.8%
1200 | 2 | 9.5%

현금
Real number (ℝ)

HIGH CORRELATION 

Distinct: 16
Distinct (%): 76.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1488.0952
Minimum: 400
Maximum: 2900
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 321.0 B

Quantile statistics

Minimum: 400
5-th percentile: 500
Q1: 1000
median: 1500
Q3: 2000
95-th percentile: 2650
Maximum: 2900
Range: 2500
Interquartile range (IQR): 1000

Descriptive statistics

Standard deviation: 718.31483
Coefficient of variation (CV): 0.48270756
Kurtosis: -0.73601064
Mean: 1488.0952
Median Absolute Deviation (MAD): 500
Skewness: 0.44872522
Sum: 31250
Variance: 515976.19
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=16)]

Value | Count | Frequency (%)
1000 | 3 | 14.3%
1500 | 3 | 14.3%
900 | 2 | 9.5%
1300 | 1 | 4.8%
2650 | 1 | 4.8%
2500 | 1 | 4.8%
1600 | 1 | 4.8%
2100 | 1 | 4.8%
2900 | 1 | 4.8%
1100 | 1 | 4.8%
Other values (6) | 6 | 28.6%

Value | Count | Frequency (%)
400 | 1 | 4.8%
500 | 1 | 4.8%
700 | 1 | 4.8%
900 | 2 | 9.5%
1000 | 3 | 14.3%
1100 | 1 | 4.8%
1300 | 1 | 4.8%
1500 | 3 | 14.3%
1600 | 1 | 4.8%
1800 | 1 | 4.8%

Value | Count | Frequency (%)
2900 | 1 | 4.8%
2650 | 1 | 4.8%
2500 | 1 | 4.8%
2400 | 1 | 4.8%
2100 | 1 | 4.8%
2000 | 1 | 4.8%
1800 | 1 | 4.8%
1600 | 1 | 4.8%
1500 | 3 | 14.3%
1300 | 1 | 4.8%

해당노선
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B
[간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
[지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88
[타시도행] 60-5, 800, 800A,B, 790, 790A,B
[청라행] 302B [영종행] 117, 304, 320 [공항행] 111, 111B,C, 117A, 223A, 302, 303, 303-1, 306, 306A, 307, 308, 308A, 310 330
[서울역] 1000, 1100, 1101, 1200, 1300, 1301, 1302, 1400,1500, 1601 [강남역] 9100, 9200, 9201, 9300, 9500, 9501, 9802
Other values (2)

Length

Max length: 133
Median length: 103
Mean length: 77.142857
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: [간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
2nd row: [간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
3rd row: [간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
4th row: [지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88
5th row: [지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88

Common Values

Value | Count | Frequency (%)
[간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99 | 3 | 14.3%
[지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88 | 3 | 14.3%
[타시도행] 60-5, 800, 800A,B, 790, 790A,B | 3 | 14.3%
[청라행] 302B [영종행] 117, 304, 320 [공항행] 111, 111B,C, 117A, 223A, 302, 303, 303-1, 306, 306A, 307, 308, 308A, 310 330 | 3 | 14.3%
[서울역] 1000, 1100, 1101, 1200, 1300, 1301, 1302, 1400,1500, 1601 [강남역] 9100, 9200, 9201, 9300, 9500, 9501, 9802 | 3 | 14.3%
M6405, M6439, M6450, M6628, M6724, M6751 | 3 | 14.3%
7700 | 3 | 14.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common Values plot]

Value | Count | Frequency (%)
223 | 6 | 2.0%
간선형 | 3 | 1.0%
223a | 3 | 1.0%
330 | 3 | 1.0%
310 | 3 | 1.0%
308a | 3 | 1.0%
308 | 3 | 1.0%
307 | 3 | 1.0%
306a | 3 | 1.0%
306 | 3 | 1.0%
Other values (87) | 261 | 88.8%

Interactions

[Figures: interaction plots]

Correlations

 | 구분 | 종류 | 권종 | 카드 | 현금 | 해당노선
구분 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000
종류 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000
권종 | 0.000 | 0.000 | 1.000 | 0.379 | 0.661 | 0.000
카드 | 0.000 | 0.000 | 0.379 | 1.000 | 0.000 | 0.000
현금 | 0.000 | 0.000 | 0.661 | 0.000 | 1.000 | 0.000
해당노선 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000

 | 권종 | 종류 | 해당노선 | 구분
권종 | 1.000 | 0.000 | 0.000 | 0.000
종류 | 0.000 | 1.000 | 1.000 | 0.935
해당노선 | 0.000 | 1.000 | 1.000 | 0.935
구분 | 0.000 | 0.935 | 0.935 | 1.000

 | 카드 | 현금 | 구분 | 종류 | 권종 | 해당노선
카드 | 1.000 | 0.947 | 0.078 | 0.000 | 0.236 | 0.000
현금 | 0.947 | 1.000 | 0.000 | 0.000 | 0.433 | 0.000
구분 | 0.078 | 0.000 | 1.000 | 0.935 | 0.000 | 0.935
종류 | 0.000 | 0.000 | 0.935 | 1.000 | 0.000 | 1.000
권종 | 0.236 | 0.433 | 0.000 | 0.000 | 1.000 | 0.000
해당노선 | 0.000 | 0.000 | 0.935 | 1.000 | 0.000 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

 | 구분 | 종류 | 권종 | 카드 | 현금 | 해당노선
0 | 일반버스 | 간선형 | 일반 | 1250 | 1300 | [간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
1 | 일반버스 | 간선형 | 청소년 | 870 | 900 | [간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
2 | 일반버스 | 간선형 | 어린이 | 500 | 500 | [간선형] 1~93, 111-2, 223, 303~206, 222, 222A,B, 223, 300, 700-1 [좌석형] 103, 103-1 [급행형] 91, 95, 97, 98, 99
3 | 일반버스 | 지선형 | 일반 | 950 | 1000 | [지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88
4 | 일반버스 | 지선형 | 청소년 | 600 | 700 | [지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88
5 | 일반버스 | 지선형 | 어린이 | 350 | 400 | [지선형] 500번대 [순환형] 41~44, 51, 52, 54, 56, 83 [GRT] 701, 702 [마을버스] 533, 534 [e음버스] 11~13, 15~17, 21, 22, 31, 45, 53, 55, 61, 71, 84~88
6 | 좌석버스 | 타시도 | 일반 | 1300 | 2000 | [타시도행] 60-5, 800, 800A,B, 790, 790A,B
7 | 좌석버스 | 타시도 | 청소년 | 900 | 1500 | [타시도행] 60-5, 800, 800A,B, 790, 790A,B
8 | 좌석버스 | 타시도 | 어린이 | 530 | 900 | [타시도행] 60-5, 800, 800A,B, 790, 790A,B
9 | 좌석버스 | 영종행 | 일반 | 1650 | 2400 | [청라행] 302B [영종행] 117, 304, 320 [공항행] 111, 111B,C, 117A, 223A, 302, 303, 303-1, 306, 306A, 307, 308, 308A, 310 330

 | 구분 | 종류 | 권종 | 카드 | 현금 | 해당노선
11 | 좌석버스 | 영종행 | 어린이 | 700 | 1000 | [청라행] 302B [영종행] 117, 304, 320 [공항행] 111, 111B,C, 117A, 223A, 302, 303, 303-1, 306, 306A, 307, 308, 308A, 310 330
12 | 광역버스(직행좌석) | 공항버스(직행좌석) | 일반 | 2650 | 2650 | [서울역] 1000, 1100, 1101, 1200, 1300, 1301, 1302, 1400,1500, 1601 [강남역] 9100, 9200, 9201, 9300, 9500, 9501, 9802
13 | 광역버스(직행좌석) | 공항버스(직행좌석) | 청소년 | 1500 | 1500 | [서울역] 1000, 1100, 1101, 1200, 1300, 1301, 1302, 1400,1500, 1601 [강남역] 9100, 9200, 9201, 9300, 9500, 9501, 9802
14 | 광역버스(직행좌석) | 공항버스(직행좌석) | 어린이 | 1100 | 1100 | [서울역] 1000, 1100, 1101, 1200, 1300, 1301, 1302, 1400,1500, 1601 [강남역] 9100, 9200, 9201, 9300, 9500, 9501, 9802
15 | M버스(광역급행) | M버스(광역급행) | 일반 | 2800 | 2900 | M6405, M6439, M6450, M6628, M6724, M6751
16 | M버스(광역급행) | M버스(광역급행) | 청소년 | 2000 | 2100 | M6405, M6439, M6450, M6628, M6724, M6751
17 | M버스(광역급행) | M버스(광역급행) | 어린이 | 1600 | 1600 | M6405, M6439, M6450, M6628, M6724, M6751
18 | BRT | BRT | 일반 | 2200 | 2500 | 7700
19 | BRT | BRT | 청소년 | 1200 | 1500 | 7700
20 | BRT | BRT | 어린이 | 1000 | 1000 | 7700