Overview

Dataset statistics

Number of variables7
Number of observations275
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.8 KiB
Average record size in memory62.5 B

Variable types

Numeric6
Text1

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do

Alerts

연번 is highly overall correlated with 호선 and 1 other fieldsHigh correlation
호선 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
역번호 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
경로 is highly overall correlated with 장애 and 1 other fieldsHigh correlation
장애 is highly overall correlated with 경로 and 1 other fieldsHigh correlation
유공자 is highly overall correlated with 경로 and 1 other fieldsHigh correlation
연번 has unique valuesUnique
역번호 has unique valuesUnique
경로 has unique valuesUnique
장애 has unique valuesUnique

Reproduction

Analysis started2024-04-29 21:10:10.012928
Analysis finished2024-04-29 21:10:13.375322
Duration3.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct275
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean138
Minimum1
Maximum275
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-04-30T06:10:13.632385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.7
Q169.5
median138
Q3206.5
95-th percentile261.3
Maximum275
Range274
Interquartile range (IQR)137

Descriptive statistics

Standard deviation79.529869
Coefficient of variation (CV)0.5763034
Kurtosis-1.2
Mean138
Median Absolute Deviation (MAD)69
Skewness0
Sum37950
Variance6325
MonotonicityStrictly increasing
2024-04-30T06:10:13.746140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
183 1
 
0.4%
189 1
 
0.4%
188 1
 
0.4%
187 1
 
0.4%
186 1
 
0.4%
185 1
 
0.4%
184 1
 
0.4%
182 1
 
0.4%
174 1
 
0.4%
Other values (265) 265
96.4%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
275 1
0.4%
274 1
0.4%
273 1
0.4%
272 1
0.4%
271 1
0.4%
270 1
0.4%
269 1
0.4%
268 1
0.4%
267 1
0.4%
266 1
0.4%

호선
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.6654545
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-04-30T06:10:13.851092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median5
Q36
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.0318826
Coefficient of variation (CV)0.43551653
Kurtosis-1.2116408
Mean4.6654545
Median Absolute Deviation (MAD)2
Skewness-0.10095916
Sum1283
Variance4.1285468
MonotonicityIncreasing
2024-04-30T06:10:13.936483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
5 51
18.5%
7 51
18.5%
2 50
18.2%
6 37
13.5%
3 33
12.0%
4 26
9.5%
8 17
 
6.2%
1 10
 
3.6%
ValueCountFrequency (%)
1 10
 
3.6%
2 50
18.2%
3 33
12.0%
4 26
9.5%
5 51
18.5%
6 37
13.5%
7 51
18.5%
8 17
 
6.2%
ValueCountFrequency (%)
8 17
 
6.2%
7 51
18.5%
6 37
13.5%
5 51
18.5%
4 26
9.5%
3 33
12.0%
2 50
18.2%
1 10
 
3.6%

역번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct275
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1631.3673
Minimum150
Maximum2827
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-04-30T06:10:14.047396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum150
5-th percentile204.7
Q1317.5
median2529
Q32647.5
95-th percentile2813.3
Maximum2827
Range2677
Interquartile range (IQR)2330

Descriptive statistics

Standard deviation1177.3932
Coefficient of variation (CV)0.72172173
Kurtosis-1.9158526
Mean1631.3673
Median Absolute Deviation (MAD)231
Skewness-0.26764389
Sum448626
Variance1386254.8
MonotonicityStrictly increasing
2024-04-30T06:10:14.174439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
150 1
 
0.4%
2624 1
 
0.4%
2630 1
 
0.4%
2629 1
 
0.4%
2628 1
 
0.4%
2627 1
 
0.4%
2626 1
 
0.4%
2625 1
 
0.4%
2623 1
 
0.4%
2614 1
 
0.4%
Other values (265) 265
96.4%
ValueCountFrequency (%)
150 1
0.4%
151 1
0.4%
152 1
0.4%
153 1
0.4%
154 1
0.4%
155 1
0.4%
156 1
0.4%
157 1
0.4%
158 1
0.4%
159 1
0.4%
ValueCountFrequency (%)
2827 1
0.4%
2826 1
0.4%
2825 1
0.4%
2824 1
0.4%
2823 1
0.4%
2822 1
0.4%
2821 1
0.4%
2820 1
0.4%
2819 1
0.4%
2818 1
0.4%

역명
Text

Distinct242
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-04-30T06:10:14.402325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length4.3127273
Min length2

Characters and Unicode

Total characters1186
Distinct characters236
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique211 ?
Unique (%)76.7%

Sample

1st row서울역
2nd row시청
3rd row종각
4th row종로3가
5th row종로5가
ValueCountFrequency (%)
종로3가 3
 
1.1%
동대문역사문화공원 3
 
1.1%
천호(풍납토성 2
 
0.7%
사당 2
 
0.7%
서울역 2
 
0.7%
영등포구청 2
 
0.7%
대림(구로구청 2
 
0.7%
불광 2
 
0.7%
약수 2
 
0.7%
오금 2
 
0.7%
Other values (232) 253
92.0%
2024-04-30T06:10:14.745088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 58
 
4.9%
( 58
 
4.9%
50
 
4.2%
49
 
4.1%
35
 
3.0%
31
 
2.6%
25
 
2.1%
22
 
1.9%
20
 
1.7%
20
 
1.7%
Other values (226) 818
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1059
89.3%
Close Punctuation 58
 
4.9%
Open Punctuation 58
 
4.9%
Decimal Number 8
 
0.7%
Other Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
4.7%
49
 
4.6%
35
 
3.3%
31
 
2.9%
25
 
2.4%
22
 
2.1%
20
 
1.9%
20
 
1.9%
19
 
1.8%
16
 
1.5%
Other values (220) 772
72.9%
Decimal Number
ValueCountFrequency (%)
3 5
62.5%
4 2
 
25.0%
5 1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1059
89.3%
Common 127
 
10.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
4.7%
49
 
4.6%
35
 
3.3%
31
 
2.9%
25
 
2.4%
22
 
2.1%
20
 
1.9%
20
 
1.9%
19
 
1.8%
16
 
1.5%
Other values (220) 772
72.9%
Common
ValueCountFrequency (%)
) 58
45.7%
( 58
45.7%
3 5
 
3.9%
. 3
 
2.4%
4 2
 
1.6%
5 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1059
89.3%
ASCII 127
 
10.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 58
45.7%
( 58
45.7%
3 5
 
3.9%
. 3
 
2.4%
4 2
 
1.6%
5 1
 
0.8%
Hangul
ValueCountFrequency (%)
50
 
4.7%
49
 
4.6%
35
 
3.3%
31
 
2.9%
25
 
2.4%
22
 
2.1%
20
 
1.9%
20
 
1.9%
19
 
1.8%
16
 
1.5%
Other values (220) 772
72.9%

경로
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct275
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean818523.21
Minimum71898
Maximum3972545
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-04-30T06:10:14.871936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum71898
5-th percentile179931.9
Q1434132.5
median669873
Q3998479
95-th percentile2069412.7
Maximum3972545
Range3900647
Interquartile range (IQR)564346.5

Descriptive statistics

Standard deviation592558.66
Coefficient of variation (CV)0.72393629
Kurtosis5.6975056
Mean818523.21
Median Absolute Deviation (MAD)268446
Skewness1.9993698
Sum2.2509388 × 108
Variance3.5112576 × 1011
MonotonicityNot monotonic
2024-04-30T06:10:14.998326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2091960 1
 
0.4%
234594 1
 
0.4%
203425 1
 
0.4%
174796 1
 
0.4%
423483 1
 
0.4%
761679 1
 
0.4%
509912 1
 
0.4%
617413 1
 
0.4%
429902 1
 
0.4%
217837 1
 
0.4%
Other values (265) 265
96.4%
ValueCountFrequency (%)
71898 1
0.4%
84971 1
0.4%
88631 1
0.4%
91292 1
0.4%
101958 1
0.4%
107252 1
0.4%
109672 1
0.4%
137457 1
0.4%
153453 1
0.4%
157787 1
0.4%
ValueCountFrequency (%)
3972545 1
0.4%
3566296 1
0.4%
3330151 1
0.4%
2926950 1
0.4%
2858251 1
0.4%
2433535 1
0.4%
2258915 1
0.4%
2251765 1
0.4%
2241971 1
0.4%
2157517 1
0.4%

장애
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct275
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean165029.78
Minimum14527
Maximum607798
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-04-30T06:10:15.136006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14527
5-th percentile39066.3
Q190154.5
median138392
Q3214224
95-th percentile397562.5
Maximum607798
Range593271
Interquartile range (IQR)124069.5

Descriptive statistics

Standard deviation111974.13
Coefficient of variation (CV)0.67850861
Kurtosis2.3126618
Mean165029.78
Median Absolute Deviation (MAD)54356
Skewness1.4548405
Sum45383190
Variance1.2538205 × 1010
MonotonicityNot monotonic
2024-04-30T06:10:15.272913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
590159 1
 
0.4%
41335 1
 
0.4%
40202 1
 
0.4%
39591 1
 
0.4%
83234 1
 
0.4%
149219 1
 
0.4%
89951 1
 
0.4%
84036 1
 
0.4%
95304 1
 
0.4%
36122 1
 
0.4%
Other values (265) 265
96.4%
ValueCountFrequency (%)
14527 1
0.4%
15602 1
0.4%
18708 1
0.4%
18856 1
0.4%
18898 1
0.4%
20654 1
0.4%
22913 1
0.4%
25659 1
0.4%
28399 1
0.4%
32815 1
0.4%
ValueCountFrequency (%)
607798 1
0.4%
590159 1
0.4%
570098 1
0.4%
519538 1
0.4%
518170 1
0.4%
496421 1
0.4%
493901 1
0.4%
462315 1
0.4%
451972 1
0.4%
446438 1
0.4%

유공자
Real number (ℝ)

HIGH CORRELATION 

Distinct272
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12232.651
Minimum647
Maximum66278
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2024-04-30T06:10:15.402395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum647
5-th percentile2323.6
Q15729
median9668
Q315180.5
95-th percentile33131.8
Maximum66278
Range65631
Interquartile range (IQR)9451.5

Descriptive statistics

Standard deviation10088.507
Coefficient of variation (CV)0.82471959
Kurtosis6.7311839
Mean12232.651
Median Absolute Deviation (MAD)4514
Skewness2.2251708
Sum3363979
Variance1.0177797 × 108
MonotonicityNot monotonic
2024-04-30T06:10:15.529297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10534 2
 
0.7%
4078 2
 
0.7%
4438 2
 
0.7%
7735 1
 
0.4%
9855 1
 
0.4%
6967 1
 
0.4%
9654 1
 
0.4%
8052 1
 
0.4%
6105 1
 
0.4%
4679 1
 
0.4%
Other values (262) 262
95.3%
ValueCountFrequency (%)
647 1
0.4%
817 1
0.4%
821 1
0.4%
1343 1
0.4%
1609 1
0.4%
1694 1
0.4%
1702 1
0.4%
1724 1
0.4%
1834 1
0.4%
1958 1
0.4%
ValueCountFrequency (%)
66278 1
0.4%
62324 1
0.4%
55719 1
0.4%
50744 1
0.4%
47798 1
0.4%
41992 1
0.4%
40322 1
0.4%
36687 1
0.4%
36083 1
0.4%
35014 1
0.4%

Interactions

2024-04-30T06:10:12.741110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.298073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.726531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.217507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.676772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.244348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.812649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.359245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.815600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.285295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.783782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.320718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.885981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.440436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.907882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.365171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.875263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.416284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.964149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.502661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.987416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.432150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.961066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.497178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:13.054771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.571593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.062219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.527451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.061232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.593180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:13.150977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:10.642229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.139675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:11.597385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.153551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T06:10:12.662548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T06:10:15.612154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선역번호경로장애유공자
연번1.0000.9180.9210.4380.4940.498
호선0.9181.0000.9960.4930.4500.474
역번호0.9210.9961.0000.3860.3700.397
경로0.4380.4930.3861.0000.9250.903
장애0.4940.4500.3700.9251.0000.920
유공자0.4980.4740.3970.9030.9201.000
2024-04-30T06:10:15.696654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선역번호경로장애유공자
연번1.0000.9881.000-0.411-0.348-0.389
호선0.9881.0000.988-0.385-0.318-0.361
역번호1.0000.9881.000-0.411-0.348-0.389
경로-0.411-0.385-0.4111.0000.9360.907
장애-0.348-0.318-0.3480.9361.0000.907
유공자-0.389-0.361-0.3890.9070.9071.000

Missing values

2024-04-30T06:10:13.256557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T06:10:13.342663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번호선역번호역명경로장애유공자
011150서울역209196059015962324
121151시청111828123782225435
231152종각165300835722733775
341153종로3가397254560779866278
451154종로5가292695043077336687
561155동대문136675828056316416
671156신설동123160225422715959
781157제기동356629644643830965
891158청량리(서울시립대입구)333015151953841992
9101159동묘앞151276630386521589
연번호선역번호역명경로장애유공자
26526682818가락시장6186101014188334
26626782819문정5418871292738813
26726882820장지74162918433513726
26826982821복정4869051058119523
26927082822산성334663775673708
27027182823남한산성입구(성남법원.검찰청)76962217268510112
27127282824단대오거리6566671739865087
27227382825신흥4245131059594416
27327482826수진4926461139343835
27427582827모란5413591127236928