Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 39 |
Missing cells | 4 |
Missing cells (%) | 1.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.4 KiB |
Average record size in memory | 63.4 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 1 |
Text | 1 |
DateTime | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-13317/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역개수 is highly overall correlated with 연장(km) | High correlation |
연장(km) is highly overall correlated with 역개수 | High correlation |
기관 is highly overall correlated with 연번 and 1 other fields | High correlation |
역개수 has 1 (2.6%) missing values | Missing |
연장(km) has 3 (7.7%) missing values | Missing |
연번 has unique values | Unique |
구간 has unique values | Unique |
Reproduction
Analysis started | 2023-12-11 06:13:18.126769 |
---|---|
Analysis finished | 2023-12-11 06:13:20.546797 |
Duration | 2.42 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 39 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20 |
Minimum | 1 |
---|---|
Maximum | 39 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 483.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.9 |
Q1 | 10.5 |
median | 20 |
Q3 | 29.5 |
95-th percentile | 37.1 |
Maximum | 39 |
Range | 38 |
Interquartile range (IQR) | 19 |
Descriptive statistics
Standard deviation | 11.401754 |
---|---|
Coefficient of variation (CV) | 0.57008771 |
Kurtosis | -1.2 |
Mean | 20 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0 |
Sum | 780 |
Variance | 130 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 2.6% |
2 | 1 | 2.6% |
23 | 1 | 2.6% |
24 | 1 | 2.6% |
25 | 1 | 2.6% |
26 | 1 | 2.6% |
27 | 1 | 2.6% |
28 | 1 | 2.6% |
29 | 1 | 2.6% |
30 | 1 | 2.6% |
Other values (29) | 29 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
39 | 1 | |
38 | 1 | |
37 | 1 | |
36 | 1 | |
35 | 1 | |
34 | 1 | |
33 | 1 | |
32 | 1 | |
31 | 1 | |
30 | 1 |
기관
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 10.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 444.0 B |
서울메트로 | |
---|---|
도시철도공사 | |
서울교통공사 | |
서울메트로9호선운영 | 1 |
Length
Max length | 10 |
---|---|
Median length | 5 |
Mean length | 5.5897436 |
Min length | 5 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 2.6% |
Sample
1st row | 서울메트로 |
---|---|
2nd row | 서울메트로 |
3rd row | 서울메트로 |
4th row | 서울메트로 |
5th row | 서울메트로 |
Common Values
Value | Count | Frequency (%) |
서울메트로 | 20 | |
도시철도공사 | 14 | |
서울교통공사 | 4 | 10.3% |
서울메트로9호선운영 | 1 | 2.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울메트로 | 20 | |
도시철도공사 | 14 | |
서울교통공사 | 4 | 10.3% |
서울메트로9호선운영 | 1 | 2.6% |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 23.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.4102564 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 483.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.9 |
Q1 | 2 |
median | 4 |
Q3 | 6 |
95-th percentile | 8.1 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.2561955 |
---|---|
Coefficient of variation (CV) | 0.51157922 |
Kurtosis | -0.84954215 |
Mean | 4.4102564 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.3700827 |
Sum | 172 |
Variance | 5.0904184 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2 | 9 | |
5 | 7 | |
3 | 5 | |
4 | 4 | |
6 | 4 | |
7 | 4 | |
1 | 2 | 5.1% |
8 | 2 | 5.1% |
9 | 2 | 5.1% |
Value | Count | Frequency (%) |
1 | 2 | 5.1% |
2 | 9 | |
3 | 5 | |
4 | 4 | |
5 | 7 | |
6 | 4 | |
7 | 4 | |
8 | 2 | 5.1% |
9 | 2 | 5.1% |
Value | Count | Frequency (%) |
9 | 2 | 5.1% |
8 | 2 | 5.1% |
7 | 4 | |
6 | 4 | |
5 | 7 | |
4 | 4 | |
3 | 5 | |
2 | 9 | |
1 | 2 | 5.1% |
구간
Text
UNIQUE
 
Distinct | 39 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 444.0 B |
Value | Count | Frequency (%) |
서울↔청량리 | 1 | 2.6% |
왕십리↔상일동 | 1 | 2.6% |
강동↔마천 | 1 | 2.6% |
까치산↔여의도 | 1 | 2.6% |
여의도↔왕십리 | 1 | 2.6% |
미사↔하남풍산 | 1 | 2.6% |
강일↔하남검단산 | 1 | 2.6% |
봉화산↔상월곡 | 1 | 2.6% |
응암↔상월곡 | 1 | 2.6% |
방화↔까치산 | 1 | 2.6% |
Other values (29) | 29 |
Most occurring characters
Value | Count | Frequency (%) |
↔ | 37 | 13.1% |
구 | 13 | 4.6% |
입 | 8 | 2.8% |
신 | 8 | 2.8% |
동 | 8 | 2.8% |
대 | 8 | 2.8% |
산 | 7 | 2.5% |
수 | 6 | 2.1% |
서 | 5 | 1.8% |
상 | 5 | 1.8% |
Other values (87) | 177 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 231 | |
Math Symbol | 37 | 13.1% |
Open Punctuation | 5 | 1.8% |
Close Punctuation | 5 | 1.8% |
Decimal Number | 4 | 1.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 13 | 5.6% |
입 | 8 | 3.5% |
신 | 8 | 3.5% |
동 | 8 | 3.5% |
대 | 8 | 3.5% |
산 | 7 | 3.0% |
수 | 6 | 2.6% |
서 | 5 | 2.2% |
상 | 5 | 2.2% |
리 | 5 | 2.2% |
Other values (82) | 158 |
Decimal Number
Value | Count | Frequency (%) |
2 | 3 | |
9 | 1 | 25.0% |
Math Symbol
Value | Count | Frequency (%) |
↔ | 37 |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 231 | |
Common | 51 | 18.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 13 | 5.6% |
입 | 8 | 3.5% |
신 | 8 | 3.5% |
동 | 8 | 3.5% |
대 | 8 | 3.5% |
산 | 7 | 3.0% |
수 | 6 | 2.6% |
서 | 5 | 2.2% |
상 | 5 | 2.2% |
리 | 5 | 2.2% |
Other values (82) | 158 |
Common
Value | Count | Frequency (%) |
↔ | 37 | |
( | 5 | 9.8% |
) | 5 | 9.8% |
2 | 3 | 5.9% |
9 | 1 | 2.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 231 | |
Arrows | 37 | 13.1% |
ASCII | 14 | 5.0% |
Most frequent character per block
Arrows
Value | Count | Frequency (%) |
↔ | 37 |
Hangul
Value | Count | Frequency (%) |
구 | 13 | 5.6% |
입 | 8 | 3.5% |
신 | 8 | 3.5% |
동 | 8 | 3.5% |
대 | 8 | 3.5% |
산 | 7 | 3.0% |
수 | 6 | 2.6% |
서 | 5 | 2.2% |
상 | 5 | 2.2% |
리 | 5 | 2.2% |
Other values (82) | 158 |
ASCII
Value | Count | Frequency (%) |
( | 5 | |
) | 5 | |
2 | 3 | |
9 | 1 | 7.1% |
역개수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 17 |
---|---|
Distinct (%) | 44.7% |
Missing | 1 |
Missing (%) | 2.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.7894737 |
Minimum | 1 |
---|---|
Maximum | 28 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 483.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 7 |
Q3 | 10.75 |
95-th percentile | 16.45 |
Maximum | 28 |
Range | 27 |
Interquartile range (IQR) | 7.75 |
Descriptive statistics
Standard deviation | 6.0767963 |
---|---|
Coefficient of variation (CV) | 0.78012926 |
Kurtosis | 1.8177581 |
Mean | 7.7894737 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 1.1415564 |
Sum | 296 |
Variance | 36.927454 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 7 | |
9 | 4 | |
5 | 3 | 7.7% |
8 | 3 | 7.7% |
7 | 3 | 7.7% |
4 | 2 | 5.1% |
13 | 2 | 5.1% |
14 | 2 | 5.1% |
3 | 2 | 5.1% |
2 | 2 | 5.1% |
Other values (7) | 8 |
Value | Count | Frequency (%) |
1 | 7 | |
2 | 2 | 5.1% |
3 | 2 | 5.1% |
4 | 2 | 5.1% |
5 | 3 | |
6 | 1 | 2.6% |
7 | 3 | |
8 | 3 | |
9 | 4 | |
10 | 1 | 2.6% |
Value | Count | Frequency (%) |
28 | 1 | 2.6% |
19 | 1 | 2.6% |
16 | 2 | |
15 | 1 | 2.6% |
14 | 2 | |
13 | 2 | |
11 | 1 | 2.6% |
10 | 1 | 2.6% |
9 | 4 | |
8 | 3 |
연장(km)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 34 |
---|---|
Distinct (%) | 94.4% |
Missing | 3 |
Missing (%) | 7.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.9583333 |
Minimum | 1.2 |
---|---|
Maximum | 30.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 483.0 B |
Quantile statistics
Minimum | 1.2 |
---|---|
5-th percentile | 1.375 |
Q1 | 3.9 |
median | 7.85 |
Q3 | 13.325 |
95-th percentile | 19.2 |
Maximum | 30.9 |
Range | 29.7 |
Interquartile range (IQR) | 9.425 |
Descriptive statistics
Standard deviation | 6.7475445 |
---|---|
Coefficient of variation (CV) | 0.75321427 |
Kurtosis | 1.6653815 |
Mean | 8.9583333 |
Median Absolute Deviation (MAD) | 4.9 |
Skewness | 1.181629 |
Sum | 322.5 |
Variance | 45.529357 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7.9 | 2 | 5.1% |
4.6 | 2 | 5.1% |
1.3 | 1 | 2.6% |
7.1 | 1 | 2.6% |
14.0 | 1 | 2.6% |
2.9 | 1 | 2.6% |
4.2 | 1 | 2.6% |
30.9 | 1 | 2.6% |
19.0 | 1 | 2.6% |
14.4 | 1 | 2.6% |
Other values (24) | 24 | |
(Missing) | 3 | 7.7% |
Value | Count | Frequency (%) |
1.2 | 1 | |
1.3 | 1 | |
1.4 | 1 | |
1.5 | 1 | |
1.9 | 1 | |
2.2 | 1 | |
2.7 | 1 | |
2.9 | 1 | |
3.0 | 1 | |
4.2 | 1 |
Value | Count | Frequency (%) |
30.9 | 1 | |
19.8 | 1 | |
19.0 | 1 | |
18.7 | 1 | |
18.2 | 1 | |
16.5 | 1 | |
14.4 | 1 | |
14.3 | 1 | |
14.0 | 1 | |
13.1 | 1 |
개통일자
Date
Distinct | 37 |
---|---|
Distinct (%) | 94.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 444.0 B |
Minimum | 1974-08-15 00:00:00 |
---|---|
Maximum | 2021-03-27 00:00:00 |
연번 | 기관 | 호선 | 구간 | 역개수 | 연장(km) | 개통일자 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.724 | 0.944 | 1.000 | 0.000 | 0.000 | 0.827 |
기관 | 0.724 | 1.000 | 0.804 | 1.000 | 0.365 | 0.000 | 0.967 |
호선 | 0.944 | 0.804 | 1.000 | 1.000 | 0.000 | 0.000 | 0.920 |
구간 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
역개수 | 0.000 | 0.365 | 0.000 | 1.000 | 1.000 | 0.972 | 0.794 |
연장(km) | 0.000 | 0.000 | 0.000 | 1.000 | 0.972 | 1.000 | 0.983 |
개통일자 | 0.827 | 0.967 | 0.920 | 1.000 | 0.794 | 0.983 | 1.000 |
연번 | 호선 | 역개수 | 연장(km) | 기관 | |
---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 0.152 | 0.133 | 0.510 |
호선 | 0.988 | 1.000 | 0.232 | 0.210 | 0.614 |
역개수 | 0.152 | 0.232 | 1.000 | 0.975 | 0.142 |
연장(km) | 0.133 | 0.210 | 0.975 | 1.000 | 0.000 |
기관 | 0.510 | 0.614 | 0.142 | 0.000 | 1.000 |
연번 | 기관 | 호선 | 구간 | 역개수 | 연장(km) | 개통일자 | |
---|---|---|---|---|---|---|---|
0 | 1 | 서울메트로 | 1 | 서울↔청량리 | 9 | 7.8 | 1974-08-15 |
1 | 2 | 서울메트로 | 1 | 동묘앞 | 1 | <NA> | 2005-12-21 |
2 | 3 | 서울메트로 | 2 | 신설동(2)↔종합운동장 | 11 | 14.3 | 1980-10-31 |
3 | 4 | 서울메트로 | 2 | 종합운동장↔교대(2) | 5 | 5.5 | 1982-12-23 |
4 | 5 | 서울메트로 | 2 | 을지입구↔성수 | 9 | 7.9 | 1983-09-16 |
5 | 6 | 서울메트로 | 2 | 교대(2)↔서울대입구 | 5 | 6.7 | 1983-12-17 |
6 | 7 | 서울메트로 | 2 | 서울대입구↔을지입구 | 16 | 19.8 | 1984-05-22 |
7 | 8 | 서울메트로 | 2 | 신도림↔양천구청 | 2 | 2.7 | 1992-05-22 |
8 | 9 | 서울메트로 | 2 | 양천구청↔신정네거리 | 1 | 1.9 | 1996-02-29 |
9 | 10 | 서울메트로 | 2 | 신정네거리↔까치산 | <NA> | 1.4 | 1996-03-20 |
연번 | 기관 | 호선 | 구간 | 역개수 | 연장(km) | 개통일자 | |
---|---|---|---|---|---|---|---|
29 | 30 | 도시철도공사 | 6 | 이태원↔약수 | 4 | <NA> | 2001-03-09 |
30 | 31 | 서울교통공사 | 6 | 봉화산↔신내 | 1 | 1.3 | 2019-12-21 |
31 | 32 | 도시철도공사 | 7 | 장암↔건대입구 | 19 | 19.0 | 1996-10-11 |
32 | 33 | 도시철도공사 | 7 | 온수↔신풍 | 8 | 9.2 | 2000-02-29 |
33 | 34 | 도시철도공사 | 7 | 온수↔부평구청 | 9 | 10.2 | 2012-10-27 |
34 | 35 | 도시철도공사 | 7 | 건대입구↔신풍 | 15 | 18.7 | 2000-08-01 |
35 | 36 | 도시철도공사 | 8 | 잠실↔모란 | 13 | 13.1 | 1996-11-23 |
36 | 37 | 도시철도공사 | 8 | 암사↔잠실 | 4 | 4.6 | 1999-07-02 |
37 | 38 | 서울메트로9호선운영 | 9 | 신논현↔종합운동장 | 5 | 4.5 | 2015-03-28 |
38 | 39 | 서울교통공사 | 9 | 종합운동장(9)↔중앙보훈병원 | 8 | 9.1 | 2018-12-01 |