Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 429.7 KiB
Average record size in memory: 44.0 B

Variable types

Numeric: 4

Dataset

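Description: 노선_ID (route ID), 정류장_ID (stop ID), 링크_구간거리(m) (link segment distance in meters), 정류장_순서 (stop order)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-21233/S/1/datasetView.do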

Reproduction

Analysis started: 2024-05-03 20:32:15.623884
Analysis finished: 2024-05-03 20:32:22.036992
Duration: 6.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
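
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the configuration actually used: the CSV path, the encoding, the report title, and the sampling step are all assumptions (the row indices in the Sample section exceed 10000, which suggests, but does not confirm, that the 10000 profiled rows were drawn from a larger table). The function name `build_profile` is hypothetical.

```python
def build_profile(csv_path, out_html="report.html", n_rows=10000,
                  seed=0, encoding="utf-8"):
    """Profile a CSV file with ydata-profiling and write an HTML report.

    All parameter defaults are illustrative assumptions, not values
    recovered from the original report's config.json.
    """
    # Imported lazily so that defining the function needs no third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)

    # Assumed step: down-sample to n_rows if the table is larger.
    if len(df) > n_rows:
        df = df.sample(n=n_rows, random_state=seed)

    report = ProfileReport(df, title="Profiling report")
    report.to_file(out_html)
    return report
```

Calling `build_profile("bus_route_stops.csv")` (a hypothetical file name) would write `report.html` next to the script; the exact numbers will match this report only if the same rows and settings are used.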

Variables

노선_ID
Real number (ℝ)

Distinct: 768
Distinct (%): 7.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.1513444 × 10^8
Minimum: 1.1500001 × 10^8
Maximum: 2.4146102 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1.1500001 × 10^8
5-th percentile: 1.22 × 10^8
Q1: 2.1000001 × 10^8
Median: 2.2400006 × 10^8
Q3: 2.3400001 × 10^8
95-th percentile: 2.4100596 × 10^8
Maximum: 2.4146102 × 10^8
Range: 1.2646101 × 10^8
Interquartile range (IQR): 23999998

Descriptive statistics

Standard deviation: 31016234
Coefficient of variation (CV): 0.1441714
Kurtosis: 4.2455037
Mean: 2.1513444 × 10^8
Median Absolute Deviation (MAD): 10000819
Skewness: -2.2484926
Sum: 2.1513444 × 10^12
Variance: 9.6200675 × 10^14
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count   Frequency (%)
222000082               88   0.9%
236000048               83   0.8%
222000072               80   0.8%
218000116               77   0.8%
229000010               61   0.6%
222000022               60   0.6%
236000052               59   0.6%
213000009               57   0.6%
229000027               56   0.6%
204000013               56   0.6%
Other values (758)    9323   93.2%

Value                Count   Frequency (%)
115000006                5   0.1%
115000007               18   0.2%
115000008               15   0.1%
115000009               11   0.1%
115000010               19   0.2%
115000012               13   0.1%
115900001                4   < 0.1%
115900002                6   0.1%
115900003                4   < 0.1%
115900004               17   0.2%

Value                Count   Frequency (%)
241461015                2   < 0.1%
241461005                4   < 0.1%
241461002                5   0.1%
241457013               29   0.3%
241449011               13   0.1%
241449007               16   0.2%
241411001                9   0.1%
241409010                8   0.1%
241409009               11   0.1%
241409006                6   0.1%

정류장_ID
Real number (ℝ)

Distinct: 6142
Distinct (%): 61.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0471669 × 10^8
Minimum: 1.0000002 × 10^8
Maximum: 2.8850009 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1.0000002 × 10^8
5-th percentile: 1.1100001 × 10^8
Q1: 2.000001 × 10^8
Median: 2.2100008 × 10^8
Q3: 2.3200061 × 10^8
95-th percentile: 2.7710346 × 10^8
Maximum: 2.8850009 × 10^8
Range: 1.8850007 × 10^8
Interquartile range (IQR): 32000511

Descriptive statistics

Standard deviation: 53909996
Coefficient of variation (CV): 0.26333952
Kurtosis: -0.70474799
Mean: 2.0471669 × 10^8
Median Absolute Deviation (MAD): 14999776
Skewness: -0.62388227
Sum: 2.0471669 × 10^12
Variance: 2.9062877 × 10^15
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                 Count   Frequency (%)
277103440                25   0.2%
277103387                23   0.2%
277103150                22   0.2%
277103388                19   0.2%
277103678                19   0.2%
121000220                17   0.2%
277103151                15   0.1%
277103309                14   0.1%
277103360                13   0.1%
277103679                13   0.1%
Other values (6132)    9820   98.2%

Value                 Count   Frequency (%)
100000023                 1   < 0.1%
100000025                 1   < 0.1%
100000031                 1   < 0.1%
100000034                 1   < 0.1%
100000119                 1   < 0.1%
100000154                 1   < 0.1%
100000165                 1   < 0.1%
100000169                 1   < 0.1%
100000174                 2   < 0.1%
100000362                 2   < 0.1%

Value                 Count   Frequency (%)
288500090                 1   < 0.1%
285500030                 1   < 0.1%
285500010                 1   < 0.1%
277104758                 3   < 0.1%
277104757                 1   < 0.1%
277104746                 1   < 0.1%
277104745                 1   < 0.1%
277104740                 1   < 0.1%
277104739                 1   < 0.1%
277104737                 1   < 0.1%

링크_구간거리(m)
Real number (ℝ)

Distinct: 1866
Distinct (%): 18.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 930.144
Minimum: 0
Maximum: 24300
Zeros: 93
Zeros (%): 0.9%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 170
Q1: 310
Median: 480
Q3: 940
95-th percentile: 3334.6
Maximum: 24300
Range: 24300
Interquartile range (IQR): 630

Descriptive statistics

Standard deviation: 1290.4427
Coefficient of variation (CV): 1.387358
Kurtosis: 41.695037
Mean: 930.144
Median Absolute Deviation (MAD): 219
Skewness: 4.8423045
Sum: 9301440
Variance: 1665242.3
Monotonicity: Not monotonic
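
Several of the figures above are derived from others: range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. As a quick consistency check, the sketch below recomputes them from the reported 링크_구간거리(m) values. The numbers are copied from this report; the tolerance is an assumption chosen to absorb the rounding of the printed figures.

```python
import math

# Values copied from the 링크_구간거리(m) statistics in this report.
n = 10000
mean = 930.144
std = 1290.4427
minimum, q1, q3, maximum = 0, 310, 940, 24300

# Quantities that can be recomputed from the values above.
derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,
    "variance": std ** 2,
    "sum": mean * n,
}

# The corresponding figures as printed in the report.
reported = {
    "range": 24300,
    "iqr": 630,
    "cv": 1.387358,
    "variance": 1665242.3,
    "sum": 9301440,
}

# Assumed tolerance: the report rounds to about 8 significant digits.
REL_TOL = 1e-6

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=REL_TOL)
    print(f"{name}: derived={value:.8g} reported={reported[name]:.8g} consistent={ok}")
```

The same check applies to the other three variables by substituting their reported values.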
Histogram with fixed size bins (bins=50)
Value                 Count   Frequency (%)
300                     126   1.3%
360                     125   1.2%
320                     124   1.2%
280                     124   1.2%
420                     122   1.2%
240                     118   1.2%
380                     116   1.2%
350                     116   1.2%
290                     111   1.1%
430                     104   1.0%
Other values (1856)    8814   88.1%

Value                 Count   Frequency (%)
0                        93   0.9%
13                        1   < 0.1%
21                        3   < 0.1%
25                        1   < 0.1%
29                        1   < 0.1%
34                        1   < 0.1%
36                        1   < 0.1%
38                        1   < 0.1%
50                        1   < 0.1%
51                        5   0.1%

Value                 Count   Frequency (%)
24300                     1   < 0.1%
22492                     1   < 0.1%
18424                     1   < 0.1%
15170                     1   < 0.1%
14075                     1   < 0.1%
13604                     1   < 0.1%
13239                     1   < 0.1%
13195                     3   < 0.1%
13194                     1   < 0.1%
13159                     1   < 0.1%

정류장_순서
Real number (ℝ)

Distinct: 226
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 55.4708
Minimum: 1
Maximum: 240
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6
Q1: 24
Median: 46
Q3: 78
95-th percentile: 136
Maximum: 240
Range: 239
Interquartile range (IQR): 54

Descriptive statistics

Standard deviation: 40.676674
Coefficient of variation (CV): 0.73329885
Kurtosis: 1.1016826
Mean: 55.4708
Median Absolute Deviation (MAD): 25
Skewness: 1.0728292
Sum: 554708
Variance: 1654.5918
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count   Frequency (%)
34                     137   1.4%
35                     129   1.3%
28                     126   1.3%
20                     125   1.2%
30                     122   1.2%
21                     121   1.2%
39                     121   1.2%
31                     120   1.2%
41                     119   1.2%
27                     118   1.2%
Other values (216)    8762   87.6%

Value                Count   Frequency (%)
1                       93   0.9%
2                       82   0.8%
3                       95   0.9%
4                       91   0.9%
5                       98   1.0%
6                       91   0.9%
7                      107   1.1%
8                      102   1.0%
9                       97   1.0%
10                     103   1.0%

Value                Count   Frequency (%)
240                      2   < 0.1%
237                      2   < 0.1%
235                      2   < 0.1%
231                      1   < 0.1%
230                      1   < 0.1%
229                      1   < 0.1%
227                      1   < 0.1%
226                      1   < 0.1%
223                      1   < 0.1%
222                      1   < 0.1%

Interactions

[Figure: 16 pairwise interaction plots, one for each ordered pair of the four variables 노선_ID, 정류장_ID, 링크_구간거리(m), and 정류장_순서.]

Correlations

                   노선_ID   정류장_ID   링크_구간거리(m)   정류장_순서
노선_ID              1.000    0.627      0.138            0.200
정류장_ID            0.627    1.000      0.377            0.323
링크_구간거리(m)     0.138    0.377      1.000            0.124
정류장_순서          0.200    0.323      0.124            1.000

                   노선_ID   정류장_ID   링크_구간거리(m)   정류장_순서
노선_ID              1.000    0.419      0.215            0.059
정류장_ID            0.419    1.000      0.272           -0.024
링크_구간거리(m)     0.215    0.272      1.000           -0.119
정류장_순서          0.059   -0.024     -0.119            1.000
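
The two matrices above give different values for the same variable pairs, which is what one would expect if each was computed with a different correlation coefficient. The report text does not name the coefficients, so the following is only an illustrative sketch on made-up toy data (not values from this dataset) of how a linear coefficient (Pearson) and a rank-based one (Spearman) can disagree on skewed, monotone data. The function names are hypothetical helpers written for this example.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Toy data: y grows monotonically but non-linearly with x.
xs = [1, 2, 3, 4, 5, 6]
ys = [x ** 3 for x in xs]

print("pearson :", round(pearson(xs, ys), 3))
print("spearman:", round(spearman(xs, ys), 3))
```

On data like this the rank-based coefficient reports a perfect monotone relationship while the linear one does not, so differences between two correlation tables are not by themselves a sign of an error.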

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

         노선_ID      정류장_ID    링크_구간거리(m)정류장_순서
20802    234000069    234001168    38087
19716    234000148    277104179    179949
26159    233000328    233002213    533050
64945    219000005    218000067    418102
36948    229000102    277103470    287414
45962    228000183    277103443    398133
75239    210000013    210000100    38092
83457    204000141    121000260    371059
65562    218000116    218001362    59980
46635    228000179    206000043    450111

         노선_ID      정류장_ID    링크_구간거리(m)정류장_순서
18301    234000875    277104186    180318
20790    234000072    104000064    22073
170      241457013    235001099    43845
21575    234000050    277103156    303660
21920    234000042    228000332    220120
86169    204000013    206000392    570108
83199    204000153    204000155    28255
11693    236000048    207000181    220106
3606     241007069    277103779    114524
65907    218000116    218000521    37067