Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 296 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 69.4 B |
Variable types
Numeric | 5 |
---|---|
Text | 1 |
Categorical | 2 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-11572/S/1/datasetView.do |
연번 is highly overall correlated with 호선 and 2 other fields | High correlation |
호선 is highly overall correlated with 연번 and 2 other fields | High correlation |
길이(M) is highly overall correlated with 연번 and 3 other fields | High correlation |
준공연도 is highly overall correlated with 연번 and 2 other fields | High correlation |
층수 is highly overall correlated with 길이(M) | High correlation |
연번 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 15:51:59.687658 |
---|---|
Analysis finished | 2024-04-29 15:52:04.050767 |
Duration | 4.36 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 296 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 148.5 |
Minimum | 1 |
---|---|
Maximum | 296 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.75 |
Q1 | 74.75 |
median | 148.5 |
Q3 | 222.25 |
95-th percentile | 281.25 |
Maximum | 296 |
Range | 295 |
Interquartile range (IQR) | 147.5 |
Descriptive statistics
Standard deviation | 85.592056 |
---|---|
Coefficient of variation (CV) | 0.57637748 |
Kurtosis | -1.2 |
Mean | 148.5 |
Median Absolute Deviation (MAD) | 74 |
Skewness | 0 |
Sum | 43956 |
Variance | 7326 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
205 | 1 | 0.3% |
203 | 1 | 0.3% |
202 | 1 | 0.3% |
201 | 1 | 0.3% |
200 | 1 | 0.3% |
199 | 1 | 0.3% |
198 | 1 | 0.3% |
197 | 1 | 0.3% |
196 | 1 | 0.3% |
Other values (286) | 286 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
296 | 1 | |
295 | 1 | |
294 | 1 | |
293 | 1 | |
292 | 1 | |
291 | 1 | |
290 | 1 | |
289 | 1 | |
288 | 1 | |
287 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.8648649 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.1554775 |
---|---|
Coefficient of variation (CV) | 0.44307038 |
Kurtosis | -1.0125135 |
Mean | 4.8648649 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.0021732148 |
Sum | 1440 |
Variance | 4.6460834 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 56 | |
7 | 51 | |
2 | 50 | |
6 | 39 | |
3 | 34 | |
4 | 26 | |
8 | 17 | 5.7% |
9 | 13 | 4.4% |
1 | 10 | 3.4% |
Value | Count | Frequency (%) |
1 | 10 | 3.4% |
2 | 50 | |
3 | 34 | |
4 | 26 | |
5 | 56 | |
6 | 39 | |
7 | 51 | |
8 | 17 | 5.7% |
9 | 13 | 4.4% |
Value | Count | Frequency (%) |
9 | 13 | 4.4% |
8 | 17 | 5.7% |
7 | 51 | |
6 | 39 | |
5 | 56 | |
4 | 26 | |
3 | 34 | |
2 | 50 | |
1 | 10 | 3.4% |
역명
Text
Distinct | 258 |
---|---|
Distinct (%) | 87.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.0% |
동대문역사문화공원 | 3 | 1.0% |
석촌 | 2 | 0.7% |
서울 | 2 | 0.7% |
충무로 | 2 | 0.7% |
충정로 | 2 | 0.7% |
시청 | 2 | 0.7% |
교대 | 2 | 0.7% |
사당 | 2 | 0.7% |
공덕 | 2 | 0.7% |
Other values (248) | 274 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 32 | 3.6% |
구 | 29 | 3.3% |
동 | 26 | 3.0% |
신 | 25 | 2.8% |
산 | 20 | 2.3% |
원 | 16 | 1.8% |
청 | 16 | 1.8% |
지 | 15 | 1.7% |
문 | 15 | 1.7% |
로 | 15 | 1.7% |
Other values (209) | 670 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 871 | |
Decimal Number | 8 | 0.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.3% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 20 | 2.3% |
원 | 16 | 1.8% |
청 | 16 | 1.8% |
지 | 15 | 1.7% |
문 | 15 | 1.7% |
로 | 15 | 1.7% |
Other values (206) | 662 |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 871 | |
Common | 8 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.3% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 20 | 2.3% |
원 | 16 | 1.8% |
청 | 16 | 1.8% |
지 | 15 | 1.7% |
문 | 15 | 1.7% |
로 | 15 | 1.7% |
Other values (206) | 662 |
Common
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 871 | |
ASCII | 8 | 0.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.3% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 20 | 2.3% |
원 | 16 | 1.8% |
청 | 16 | 1.8% |
지 | 15 | 1.7% |
문 | 15 | 1.7% |
로 | 15 | 1.7% |
Other values (206) | 662 |
ASCII
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
형식
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
상대식 | |
---|---|
섬식 | |
복합식 | 14 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.7364865 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 섬식 |
---|---|
2nd row | 상대식 |
3rd row | 상대식 |
4th row | 상대식 |
5th row | 상대식 |
Common Values
Value | Count | Frequency (%) |
상대식 | 204 | |
섬식 | 78 | 26.4% |
복합식 | 14 | 4.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
상대식 | 204 | |
섬식 | 78 | 26.4% |
복합식 | 14 | 4.7% |
길이(M)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 177.78716 |
Minimum | 90 |
---|---|
Maximum | 210 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 90 |
---|---|
5-th percentile | 125 |
Q1 | 165 |
median | 165 |
Q3 | 205 |
95-th percentile | 205 |
Maximum | 210 |
Range | 120 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 24.253296 |
---|---|
Coefficient of variation (CV) | 0.13641759 |
Kurtosis | -0.29170714 |
Mean | 177.78716 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -0.31194373 |
Sum | 52625 |
Variance | 588.22234 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
165 | 162 | |
205 | 104 | |
125 | 17 | 5.7% |
210 | 10 | 3.4% |
130 | 2 | 0.7% |
90 | 1 | 0.3% |
Value | Count | Frequency (%) |
90 | 1 | 0.3% |
125 | 17 | 5.7% |
130 | 2 | 0.7% |
165 | 162 | |
205 | 104 | |
210 | 10 | 3.4% |
Value | Count | Frequency (%) |
210 | 10 | 3.4% |
205 | 104 | |
165 | 162 | |
130 | 2 | 0.7% |
125 | 17 | 5.7% |
90 | 1 | 0.3% |
층수
Categorical
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 6.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
B2 | |
---|---|
B3 | |
B4 | |
B5 | |
3F | |
Other values (13) |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0540541 |
Min length | 2 |
Unique
Unique | 10 ? |
---|---|
Unique (%) | 3.4% |
Sample
1st row | B2 |
---|---|
2nd row | B2 |
3rd row | B2 |
4th row | B2 |
5th row | B2 |
Common Values
Value | Count | Frequency (%) |
B2 | 121 | |
B3 | 85 | |
B4 | 37 | 12.5% |
B5 | 17 | 5.7% |
3F | 14 | 4.7% |
2F | 7 | 2.4% |
B6 | 3 | 1.0% |
1FB3 | 2 | 0.7% |
5FB2 | 1 | 0.3% |
1F | 1 | 0.3% |
Other values (8) | 8 | 2.7% |
Length
Value | Count | Frequency (%) |
b2 | 121 | |
b3 | 85 | |
b4 | 37 | 12.5% |
b5 | 17 | 5.7% |
3f | 14 | 4.7% |
2f | 7 | 2.4% |
b6 | 3 | 1.0% |
1fb3 | 2 | 0.7% |
1fb5 | 1 | 0.3% |
2fb2 | 1 | 0.3% |
Other values (8) | 8 | 2.7% |
면적(m²)
Real number (ℝ)
Distinct | 294 |
---|---|
Distinct (%) | 99.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8825.7663 |
Minimum | 1069.5 |
---|---|
Maximum | 28768.4 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1069.5 |
---|---|
5-th percentile | 5074.575 |
Q1 | 6552.05 |
median | 8165.95 |
Q3 | 10087.35 |
95-th percentile | 14952.075 |
Maximum | 28768.4 |
Range | 27698.9 |
Interquartile range (IQR) | 3535.3 |
Descriptive statistics
Standard deviation | 3419.0103 |
---|---|
Coefficient of variation (CV) | 0.38738962 |
Kurtosis | 5.24256 |
Mean | 8825.7663 |
Median Absolute Deviation (MAD) | 1707.2 |
Skewness | 1.6113676 |
Sum | 2612426.8 |
Variance | 11689631 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6086.0 | 2 | 0.7% |
6439.0 | 2 | 0.7% |
6587.6 | 1 | 0.3% |
7085.2 | 1 | 0.3% |
14000.6 | 1 | 0.3% |
18195.2 | 1 | 0.3% |
5805.9 | 1 | 0.3% |
9093.3 | 1 | 0.3% |
6278.8 | 1 | 0.3% |
10805.0 | 1 | 0.3% |
Other values (284) | 284 |
Value | Count | Frequency (%) |
1069.5 | 1 | |
1423.0 | 1 | |
1503.1 | 1 | |
1583.0 | 1 | |
2203.0 | 1 | |
3860.0 | 1 | |
4496.9 | 1 | |
4691.0 | 1 | |
4838.6 | 1 | |
4844.8 | 1 |
Value | Count | Frequency (%) |
28768.4 | 1 | |
23052.8 | 1 | |
20302.8 | 1 | |
19246.0 | 1 | |
18984.6 | 1 | |
18812.7 | 1 | |
18506.0 | 1 | |
18459.4 | 1 | |
18195.2 | 1 | |
17268.9 | 1 |
준공연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 8.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1994.5135 |
Minimum | 1974 |
---|---|
Maximum | 2022 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1974 |
---|---|
5-th percentile | 1980 |
Q1 | 1985 |
median | 1996 |
Q3 | 2001 |
95-th percentile | 2015 |
Maximum | 2022 |
Range | 48 |
Interquartile range (IQR) | 16 |
Descriptive statistics
Standard deviation | 10.378587 |
---|---|
Coefficient of variation (CV) | 0.0052035682 |
Kurtosis | 0.022117884 |
Mean | 1994.5135 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.3904303 |
Sum | 590376 |
Variance | 107.71507 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1985 | 47 | |
1996 | 46 | |
2001 | 41 | |
1997 | 26 | |
2000 | 19 | 6.4% |
1984 | 16 | 5.4% |
1983 | 14 | 4.7% |
1995 | 12 | 4.1% |
1980 | 11 | 3.7% |
2012 | 9 | 3.0% |
Other values (16) | 55 |
Value | Count | Frequency (%) |
1974 | 9 | 3.0% |
1980 | 11 | 3.7% |
1982 | 5 | 1.7% |
1983 | 14 | 4.7% |
1984 | 16 | 5.4% |
1985 | 47 | |
1990 | 1 | 0.3% |
1992 | 2 | 0.7% |
1993 | 8 | 2.7% |
1994 | 1 | 0.3% |
Value | Count | Frequency (%) |
2022 | 1 | 0.3% |
2021 | 2 | 0.7% |
2020 | 2 | 0.7% |
2019 | 1 | 0.3% |
2018 | 8 | |
2015 | 5 | |
2012 | 9 | |
2010 | 3 | 1.0% |
2005 | 2 | 0.7% |
2002 | 1 | 0.3% |
연번 | 호선 | 형식 | 길이(M) | 층수 | 면적(m²) | 준공연도 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.947 | 0.258 | 0.908 | 0.503 | 0.152 | 0.823 |
호선 | 0.947 | 1.000 | 0.273 | 0.846 | 0.520 | 0.314 | 0.951 |
형식 | 0.258 | 0.273 | 1.000 | 0.077 | 0.000 | 0.517 | 0.335 |
길이(M) | 0.908 | 0.846 | 0.077 | 1.000 | 0.853 | 0.545 | 0.781 |
층수 | 0.503 | 0.520 | 0.000 | 0.853 | 1.000 | 0.779 | 0.804 |
면적(m²) | 0.152 | 0.314 | 0.517 | 0.545 | 0.779 | 1.000 | 0.325 |
준공연도 | 0.823 | 0.951 | 0.335 | 0.781 | 0.804 | 0.325 | 1.000 |
층수 | 형식 | |
---|---|---|
층수 | 1.000 | 0.000 |
형식 | 0.000 | 1.000 |
연번 | 호선 | 길이(M) | 면적(m²) | 준공연도 | 형식 | 층수 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.989 | -0.824 | 0.135 | 0.831 | 0.158 | 0.212 |
호선 | 0.989 | 1.000 | -0.821 | 0.136 | 0.812 | 0.123 | 0.195 |
길이(M) | -0.824 | -0.821 | 1.000 | 0.014 | -0.723 | 0.057 | 0.634 |
면적(m²) | 0.135 | 0.136 | 0.014 | 1.000 | 0.195 | 0.264 | 0.454 |
준공연도 | 0.831 | 0.812 | -0.723 | 0.195 | 1.000 | 0.145 | 0.376 |
형식 | 0.158 | 0.123 | 0.057 | 0.264 | 0.145 | 1.000 | 0.000 |
층수 | 0.212 | 0.195 | 0.634 | 0.454 | 0.376 | 0.000 | 1.000 |
연번 | 호선 | 역명 | 형식 | 길이(M) | 층수 | 면적(m²) | 준공연도 | |
---|---|---|---|---|---|---|---|---|
0 | 1 | 1 | 서울 | 섬식 | 210 | B2 | 10805.0 | 1974 |
1 | 2 | 1 | 시청 | 상대식 | 210 | B2 | 11317.0 | 1974 |
2 | 3 | 1 | 종각 | 상대식 | 210 | B2 | 10410.2 | 1974 |
3 | 4 | 1 | 종로3가 | 상대식 | 210 | B2 | 9311.0 | 1974 |
4 | 5 | 1 | 종로5가 | 상대식 | 210 | B2 | 10465.0 | 1974 |
5 | 6 | 1 | 동대문 | 상대식 | 210 | B2 | 5490.0 | 1974 |
6 | 7 | 1 | 동묘앞 | 상대식 | 210 | 5FB2 | 7031.7 | 2005 |
7 | 8 | 1 | 신설동 | 상대식 | 210 | B2 | 7240.0 | 1974 |
8 | 9 | 1 | 제기동 | 상대식 | 210 | B2 | 8662.0 | 1974 |
9 | 10 | 1 | 청량리 | 섬식 | 210 | B2 | 7125.0 | 1974 |
연번 | 호선 | 역명 | 형식 | 길이(M) | 층수 | 면적(m²) | 준공연도 | |
---|---|---|---|---|---|---|---|---|
286 | 287 | 9 | 봉은사 | 상대식 | 165 | B2 | 9825.3 | 2015 |
287 | 288 | 9 | 종합운동장 | 상대식 | 165 | B4 | 13976.5 | 2015 |
288 | 289 | 9 | 삼전 | 상대식 | 165 | B2 | 8644.1 | 2018 |
289 | 290 | 9 | 석촌고분 | 섬식 | 165 | B2 | 6833.6 | 2018 |
290 | 291 | 9 | 석촌 | 섬식 | 165 | B4 | 10105.5 | 2018 |
291 | 292 | 9 | 송파나루 | 섬식 | 165 | B2 | 7833.3 | 2018 |
292 | 293 | 9 | 한성백제 | 섬식 | 165 | B2 | 8955.0 | 2018 |
293 | 294 | 9 | 올림픽공원 | 섬식 | 165 | B3 | 8372.1 | 2018 |
294 | 295 | 9 | 둔촌오륜 | 섬식 | 165 | B2 | 7544.3 | 2018 |
295 | 296 | 9 | 중앙보훈병원 | 복합식 | 165 | B2 | 8956.0 | 2018 |