Overview

Dataset statistics

Number of variables3
Number of observations666
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.0 KiB
Average record size in memory26.2 B

Variable types

Numeric2
Text1

Dataset

Description인천광역시 부평구 버스 정류장 현황 데이터는 부평구 내의 버스 정류장 명, 버스 정류소 ID에 대한 데이터를 제공합니다
Author인천광역시 부평구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15103787&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 정류소IDHigh correlation
정류소ID is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
정류소ID has unique valuesUnique

Reproduction

Analysis started2024-01-28 15:46:02.130112
Analysis finished2024-01-28 15:46:02.702078
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct666
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean333.5
Minimum1
Maximum666
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.0 KiB
2024-01-29T00:46:02.755724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile34.25
Q1167.25
median333.5
Q3499.75
95-th percentile632.75
Maximum666
Range665
Interquartile range (IQR)332.5

Descriptive statistics

Standard deviation192.40192
Coefficient of variation (CV)0.57691731
Kurtosis-1.2
Mean333.5
Median Absolute Deviation (MAD)166.5
Skewness0
Sum222111
Variance37018.5
MonotonicityStrictly increasing
2024-01-29T00:46:02.872301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
439 1
 
0.2%
441 1
 
0.2%
442 1
 
0.2%
443 1
 
0.2%
444 1
 
0.2%
445 1
 
0.2%
446 1
 
0.2%
447 1
 
0.2%
448 1
 
0.2%
Other values (656) 656
98.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
666 1
0.2%
665 1
0.2%
664 1
0.2%
663 1
0.2%
662 1
0.2%
661 1
0.2%
660 1
0.2%
659 1
0.2%
658 1
0.2%
657 1
0.2%
Distinct405
Distinct (%)60.8%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2024-01-29T00:46:03.083230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length6.8393393
Min length3

Characters and Unicode

Total characters4555
Distinct characters300
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique224 ?
Unique (%)33.6%

Sample

1st row동암사거리
2nd row종근당
3rd row동암사거리
4th row형제마트
5th row종근당
ValueCountFrequency (%)
부평역 8
 
1.2%
한국아파트 6
 
0.9%
삼보아파트 5
 
0.7%
부평시장 5
 
0.7%
청천푸르지오아파트 5
 
0.7%
현대아파트 5
 
0.7%
휴먼시아1단지 4
 
0.6%
새마을금고 4
 
0.6%
산곡여자중학교 4
 
0.6%
부평구청 4
 
0.6%
Other values (396) 621
92.5%
2024-01-29T00:46:03.399421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
189
 
4.1%
174
 
3.8%
164
 
3.6%
150
 
3.3%
122
 
2.7%
112
 
2.5%
101
 
2.2%
99
 
2.2%
96
 
2.1%
78
 
1.7%
Other values (290) 3270
71.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4168
91.5%
Decimal Number 197
 
4.3%
Open Punctuation 72
 
1.6%
Close Punctuation 72
 
1.6%
Uppercase Letter 23
 
0.5%
Other Punctuation 15
 
0.3%
Space Separator 5
 
0.1%
Modifier Symbol 2
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
189
 
4.5%
174
 
4.2%
164
 
3.9%
150
 
3.6%
122
 
2.9%
112
 
2.7%
101
 
2.4%
99
 
2.4%
96
 
2.3%
78
 
1.9%
Other values (264) 2883
69.2%
Decimal Number
ValueCountFrequency (%)
1 66
33.5%
2 31
15.7%
0 25
 
12.7%
3 19
 
9.6%
6 14
 
7.1%
4 13
 
6.6%
7 12
 
6.1%
5 11
 
5.6%
9 3
 
1.5%
8 3
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
G 4
17.4%
C 4
17.4%
V 3
13.0%
U 3
13.0%
T 2
8.7%
K 2
8.7%
L 2
8.7%
H 2
8.7%
M 1
 
4.3%
Other Punctuation
ValueCountFrequency (%)
. 8
53.3%
, 7
46.7%
Open Punctuation
ValueCountFrequency (%)
( 72
100.0%
Close Punctuation
ValueCountFrequency (%)
) 72
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4169
91.5%
Common 363
 
8.0%
Latin 23
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
189
 
4.5%
174
 
4.2%
164
 
3.9%
150
 
3.6%
122
 
2.9%
112
 
2.7%
101
 
2.4%
99
 
2.4%
96
 
2.3%
78
 
1.9%
Other values (265) 2884
69.2%
Common
ValueCountFrequency (%)
( 72
19.8%
) 72
19.8%
1 66
18.2%
2 31
8.5%
0 25
 
6.9%
3 19
 
5.2%
6 14
 
3.9%
4 13
 
3.6%
7 12
 
3.3%
5 11
 
3.0%
Other values (6) 28
 
7.7%
Latin
ValueCountFrequency (%)
G 4
17.4%
C 4
17.4%
V 3
13.0%
U 3
13.0%
T 2
8.7%
K 2
8.7%
L 2
8.7%
H 2
8.7%
M 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4168
91.5%
ASCII 386
 
8.5%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
189
 
4.5%
174
 
4.2%
164
 
3.9%
150
 
3.6%
122
 
2.9%
112
 
2.7%
101
 
2.4%
99
 
2.4%
96
 
2.3%
78
 
1.9%
Other values (264) 2883
69.2%
ASCII
ValueCountFrequency (%)
( 72
18.7%
) 72
18.7%
1 66
17.1%
2 31
8.0%
0 25
 
6.5%
3 19
 
4.9%
6 14
 
3.6%
4 13
 
3.4%
7 12
 
3.1%
5 11
 
2.8%
Other values (15) 51
13.2%
None
ValueCountFrequency (%)
1
100.0%

정류소ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct666
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40471.323
Minimum40001
Maximum41639
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.0 KiB
2024-01-29T00:46:03.506658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum40001
5-th percentile40048.25
Q140238.25
median40473.5
Q340676.75
95-th percentile40884.75
Maximum41639
Range1638
Interquartile range (IQR)438.5

Descriptive statistics

Standard deviation279.41093
Coefficient of variation (CV)0.0069039239
Kurtosis-0.092174929
Mean40471.323
Median Absolute Deviation (MAD)218.5
Skewness0.30556262
Sum26953901
Variance78070.469
MonotonicityStrictly increasing
2024-01-29T00:46:03.608096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40001 1
 
0.2%
40602 1
 
0.2%
40605 1
 
0.2%
40606 1
 
0.2%
40607 1
 
0.2%
40608 1
 
0.2%
40609 1
 
0.2%
40610 1
 
0.2%
40611 1
 
0.2%
40612 1
 
0.2%
Other values (656) 656
98.5%
ValueCountFrequency (%)
40001 1
0.2%
40002 1
0.2%
40003 1
0.2%
40004 1
0.2%
40005 1
0.2%
40006 1
0.2%
40007 1
0.2%
40008 1
0.2%
40009 1
0.2%
40010 1
0.2%
ValueCountFrequency (%)
41639 1
0.2%
41465 1
0.2%
41464 1
0.2%
41448 1
0.2%
41447 1
0.2%
40924 1
0.2%
40923 1
0.2%
40922 1
0.2%
40921 1
0.2%
40919 1
0.2%

Interactions

2024-01-29T00:46:02.470060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-29T00:46:02.326746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-29T00:46:02.539452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-29T00:46:02.401467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-29T00:46:03.677812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정류소ID
연번1.0000.902
정류소ID0.9021.000
2024-01-29T00:46:03.735894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정류소ID
연번1.0001.000
정류소ID1.0001.000

Missing values

2024-01-29T00:46:02.626848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T00:46:02.679435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번정류소명정류소ID
01동암사거리40001
12종근당40002
23동암사거리40003
34형제마트40004
45종근당40005
56형제마트40006
67십정시장입구40007
78십정시장입구40008
89수출산업단지6공단입구40009
910수출산업단지6공단입구40010
연번정류소명정류소ID
656657인천성모병원종점지주차장40919
657658남부고가교40921
658659부평4동성당40922
659660부개사거리40923
660661삼산사거리40924
661662광명아파트41447
662663광명아파트41448
663664인향아파트41464
664665인향아파트41465
665666청천천공영주차장41639