Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 291 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 19.5 KiB |
Average record size in memory | 68.5 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Categorical | 2 |
DateTime | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metro (서울교통공사) |
URL | https://data.seoul.go.kr/dataList/OA-11572/S/1/datasetView.do |
Alerts
연번 is highly overall correlated with 호선 and 1 other field | High correlation |
호선 is highly overall correlated with 연번 and 1 other field | High correlation |
길이(M) is highly overall correlated with 연번 and 2 other fields | High correlation |
층수 is highly overall correlated with 길이(M) | High correlation |
연번 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 15:52:07.958929 |
---|---|
Analysis finished | 2024-04-29 15:52:09.874171 |
Duration | 1.92 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
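A report like this one could be regenerated along the following lines. This is a minimal sketch, not the command actually used: the CSV path, the `cp949` encoding, the output file name, and the report title are all assumptions. `ProfileReport` and `to_file` are the ydata-profiling entry points. The imports are deferred into the function so the snippet loads even where pandas and ydata-profiling are not installed.

```python
def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile the station CSV and write an HTML report (illustrative sketch)."""
    # Imported lazily: both packages are third-party dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the download is a CSV whose 준공연도 column holds dates.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["준공연도"])

    # The title is a hypothetical label, not taken from the original report.
    profile = ProfileReport(df, title="Seoul Metro station facilities")
    profile.to_file(html_path)
    return profile
```

Calling `build_report("stations.csv")` with a hypothetical local copy of the download would write `report.html` next to the script.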
연번 (serial number)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 291 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 146 |
Minimum | 1 |
---|---|
Maximum | 291 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.5 |
Q1 | 73.5 |
median | 146 |
Q3 | 218.5 |
95-th percentile | 276.5 |
Maximum | 291 |
Range | 290 |
Interquartile range (IQR) | 145 |
Descriptive statistics
Standard deviation | 84.148678 |
---|---|
Coefficient of variation (CV) | 0.57636081 |
Kurtosis | -1.2 |
Mean | 146 |
Median Absolute Deviation (MAD) | 73 |
Skewness | 0 |
Sum | 42486 |
Variance | 7081 |
Monotonicity | Strictly increasing |
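Because 연번 is simply the sequence 1 to 291, the figures above can be checked by hand. The sketch below recomputes them with the Python standard library; the `percentile` helper is a hypothetical name and assumes linear interpolation between neighbouring ranks, which matches the quantiles shown here.

```python
import statistics

values = list(range(1, 292))  # 연번 runs from 1 to 291 with no gaps


def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    step = sorted_values[upper] - sorted_values[lower]
    return sorted_values[lower] + (pos - lower) * step


total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
q1, q3 = percentile(values, 0.25), percentile(values, 0.75)

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad, q1, q3, q3 - q1)
```

The variance uses the sample (n - 1) denominator; that choice is what reproduces the reported 7081.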
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
184 | 1 | 0.3% |
200 | 1 | 0.3% |
199 | 1 | 0.3% |
198 | 1 | 0.3% |
197 | 1 | 0.3% |
196 | 1 | 0.3% |
195 | 1 | 0.3% |
194 | 1 | 0.3% |
193 | 1 | 0.3% |
Other values (281) | 281 | 96.6% |
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
2 | 1 | 0.3% |
3 | 1 | 0.3% |
4 | 1 | 0.3% |
5 | 1 | 0.3% |
6 | 1 | 0.3% |
7 | 1 | 0.3% |
8 | 1 | 0.3% |
9 | 1 | 0.3% |
10 | 1 | 0.3% |
Value | Count | Frequency (%) |
291 | 1 | 0.3% |
290 | 1 | 0.3% |
289 | 1 | 0.3% |
288 | 1 | 0.3% |
287 | 1 | 0.3% |
286 | 1 | 0.3% |
285 | 1 | 0.3% |
284 | 1 | 0.3% |
283 | 1 | 0.3% |
282 | 1 | 0.3% |
호선 (subway line number)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.862543 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.1739062 |
---|---|
Coefficient of variation (CV) | 0.44707187 |
Kurtosis | -1.0462745 |
Mean | 4.862543 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.0053774411 |
Sum | 1415 |
Variance | 4.725868 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 51 | 17.5% |
7 | 51 | 17.5% |
2 | 50 | 17.2% |
6 | 39 | 13.4% |
3 | 34 | 11.7% |
4 | 26 | 8.9% |
8 | 17 | 5.8% |
9 | 13 | 4.5% |
1 | 10 | 3.4% |
Value | Count | Frequency (%) |
1 | 10 | 3.4% |
2 | 50 | 17.2% |
3 | 34 | 11.7% |
4 | 26 | 8.9% |
5 | 51 | 17.5% |
6 | 39 | 13.4% |
7 | 51 | 17.5% |
8 | 17 | 5.8% |
9 | 13 | 4.5% |
Value | Count | Frequency (%) |
9 | 13 | 4.5% |
8 | 17 | 5.8% |
7 | 51 | 17.5% |
6 | 39 | 13.4% |
5 | 51 | 17.5% |
4 | 26 | 8.9% |
3 | 34 | 11.7% |
2 | 50 | 17.2% |
1 | 10 | 3.4% |
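The count table is enough to recover the sum, the mean, and each line's share of the 291 stations. The following sketch does that from the counts listed above; the variable names are illustrative only.

```python
# Station counts per 호선 (line number), copied from the frequency table.
line_counts = {1: 10, 2: 50, 3: 34, 4: 26, 5: 51, 6: 39, 7: 51, 8: 17, 9: 13}

n_stations = sum(line_counts.values())
weighted_sum = sum(line * count for line, count in line_counts.items())
mean_line = weighted_sum / n_stations

# Share of all stations on each line, as a percentage with one decimal.
shares = {line: round(100 * count / n_stations, 1)
          for line, count in line_counts.items()}

print(n_stations, weighted_sum, round(mean_line, 6))
for line, share in sorted(shares.items()):
    print(f"line {line}: {share}%")
```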
역명 (station name)
Text
Distinct | 253 |
---|---|
Distinct (%) | 86.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
동대문역사문화공원 | 3 | 1.0% |
종로3가 | 3 | 1.0% |
충정로 | 2 | 0.7% |
고속터미널 | 2 | 0.7% |
노원 | 2 | 0.7% |
사당 | 2 | 0.7% |
석촌 | 2 | 0.7% |
영등포구청 | 2 | 0.7% |
합정 | 2 | 0.7% |
대림 | 2 | 0.7% |
Other values (243) | 269 | 92.4% |
Most occurring characters
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.4% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 18 | 2.1% |
원 | 16 | 1.9% |
지 | 15 | 1.7% |
문 | 15 | 1.7% |
로 | 15 | 1.7% |
청 | 15 | 1.7% |
Other values (208) | 656 | 76.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 854 | 99.1% |
Decimal Number | 8 | 0.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.4% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 18 | 2.1% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
로 | 15 | 1.8% |
청 | 15 | 1.8% |
Other values (205) | 648 | 75.9% |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | 62.5% |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 854 | 99.1% |
Common | 8 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.4% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 18 | 2.1% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
로 | 15 | 1.8% |
청 | 15 | 1.8% |
Other values (205) | 648 | 75.9% |
Common
Value | Count | Frequency (%) |
3 | 5 | 62.5% |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 854 | 99.1% |
ASCII | 8 | 0.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.7% |
구 | 29 | 3.4% |
동 | 26 | 3.0% |
신 | 25 | 2.9% |
산 | 18 | 2.1% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
로 | 15 | 1.8% |
청 | 15 | 1.8% |
Other values (205) | 648 | 75.9% |
ASCII
Value | Count | Frequency (%) |
3 | 5 | 62.5% |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
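The category and script labels in this section correspond to Unicode character properties: Hangul syllables fall under "Other Letter" (`Lo`) and digits under "Decimal Number" (`Nd`). As a rough illustration using only Python's `unicodedata` module (not the profiler's own implementation), the sketch below classifies the characters of three station names taken from the sample rows.

```python
import unicodedata
from collections import Counter

# Three station names from the first rows of the dataset.
station_names = ["종로3가", "종로5가", "동대문"]

# General category per character: 'Lo' = Other Letter, 'Nd' = Decimal Number.
category_counts = Counter(
    unicodedata.category(ch) for name in station_names for ch in name
)

# Characters whose Unicode name marks them as Hangul syllables.
hangul_chars = [
    ch for name in station_names for ch in name
    if unicodedata.name(ch).startswith("HANGUL SYLLABLE")
]

print(dict(category_counts), len(hangul_chars))
```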
형식 (platform type)
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
상대식 (side platform) | 199 |
---|---|
섬식 (island platform) | 78 |
복합식 (combined) | 14 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.7319588 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 섬식 |
---|---|
2nd row | 상대식 |
3rd row | 상대식 |
4th row | 상대식 |
5th row | 상대식 |
Common Values
Value | Count | Frequency (%) |
상대식 | 199 | 68.4% |
섬식 | 78 | 26.8% |
복합식 | 14 | 4.8% |
길이(M) (length, m)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 178.00687 |
Minimum | 90 |
---|---|
Maximum | 210 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 90 |
---|---|
5-th percentile | 125 |
Q1 | 165 |
median | 165 |
Q3 | 205 |
95-th percentile | 205 |
Maximum | 210 |
Range | 120 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 24.402797 |
---|---|
Coefficient of variation (CV) | 0.13708907 |
Kurtosis | -0.30126288 |
Mean | 178.00687 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -0.33626231 |
Sum | 51800 |
Variance | 595.4965 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
165 | 157 | 54.0% |
205 | 104 | 35.7% |
125 | 17 | 5.8% |
210 | 10 | 3.4% |
130 | 2 | 0.7% |
90 | 1 | 0.3% |
Value | Count | Frequency (%) |
90 | 1 | 0.3% |
125 | 17 | 5.8% |
130 | 2 | 0.7% |
165 | 157 | 54.0% |
205 | 104 | 35.7% |
210 | 10 | 3.4% |
Value | Count | Frequency (%) |
210 | 10 | 3.4% |
205 | 104 | 35.7% |
165 | 157 | 54.0% |
130 | 2 | 0.7% |
125 | 17 | 5.8% |
90 | 1 | 0.3% |
층수 (number of floors)
Categorical
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
B2 | 118 |
---|---|
B3 | 83 |
B4 | 37 |
B5 | 17 |
3F | 14 |
Other values (13) | 22 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0549828 |
Min length | 2 |
Unique
Unique | 10 |
---|---|
Unique (%) | 3.4% |
Sample
1st row | B2 |
---|---|
2nd row | B2 |
3rd row | B2 |
4th row | B2 |
5th row | B2 |
Common Values
Value | Count | Frequency (%) |
B2 | 118 | 40.5% |
B3 | 83 | 28.5% |
B4 | 37 | 12.7% |
B5 | 17 | 5.8% |
3F | 14 | 4.8% |
2F | 7 | 2.4% |
B6 | 3 | 1.0% |
1FB3 | 2 | 0.7% |
5FB2 | 1 | 0.3% |
1F | 1 | 0.3% |
Other values (8) | 8 | 2.7% |
Length
Value | Count | Frequency (%) |
b2 | 118 | 40.5% |
b3 | 83 | 28.5% |
b4 | 37 | 12.7% |
b5 | 17 | 5.8% |
3f | 14 | 4.8% |
2f | 7 | 2.4% |
b6 | 3 | 1.0% |
1fb3 | 2 | 0.7% |
1fb5 | 1 | 0.3% |
2fb2 | 1 | 0.3% |
Other values (8) | 8 | 2.7% |
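The 층수 codes are stored as text, which is why the profiler treats the column as categorical. If a numeric view is wanted, the codes could be split into two counts. The sketch below assumes, without confirmation from the data provider, that `nF` means n above-ground floors, `Bn` means n basement levels, and a combined code such as `5FB2` carries both. `parse_floor_code` is a hypothetical helper name.

```python
import re

# Optional "<n>F" part followed by an optional "B<n>" part, e.g. "3F", "B2", "5FB2".
FLOOR_CODE = re.compile(r"(?:(\d+)F)?(?:B(\d+))?")


def parse_floor_code(code):
    """Return (above_ground, below_ground) for a code like 'B2' or '5FB2'."""
    match = FLOOR_CODE.fullmatch(code.upper())
    # An empty string matches both optional groups, so reject it explicitly.
    if not code or match is None:
        raise ValueError(f"unrecognised floor code: {code!r}")
    above, below = match.groups()
    return int(above or 0), int(below or 0)


for sample in ["B2", "3F", "5FB2", "1FB3"]:
    print(sample, parse_floor_code(sample))
```

Splitting the column this way would let the two counts be profiled as numbers rather than as 18 unrelated categories.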
면적(m²) (area, m²)
Real number (ℝ)
Distinct | 289 |
---|---|
Distinct (%) | 99.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8789.2901 |
Minimum | 1069.48 |
---|---|
Maximum | 28768.4 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.7 KiB |
Quantile statistics
Minimum | 1069.48 |
---|---|
5-th percentile | 5072.155 |
Q1 | 6544.455 |
median | 8138.28 |
Q3 | 10048.14 |
95-th percentile | 15074.237 |
Maximum | 28768.4 |
Range | 27698.92 |
Interquartile range (IQR) | 3503.685 |
Descriptive statistics
Standard deviation | 3427.0069 |
---|---|
Coefficient of variation (CV) | 0.38990714 |
Kurtosis | 5.3564199 |
Mean | 8789.2901 |
Median Absolute Deviation (MAD) | 1698.06 |
Skewness | 1.6435784 |
Sum | 2557683.4 |
Variance | 11744376 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6086.0 | 2 | 0.7% |
6439.0 | 2 | 0.7% |
11098.9 | 1 | 0.3% |
6793.79 | 1 | 0.3% |
14457.44 | 1 | 0.3% |
10677.55 | 1 | 0.3% |
7799.15 | 1 | 0.3% |
9495.25 | 1 | 0.3% |
6278.79 | 1 | 0.3% |
6545.91 | 1 | 0.3% |
Other values (279) | 279 | 95.9% |
Value | Count | Frequency (%) |
1069.48 | 1 | 0.3% |
1423.0 | 1 | 0.3% |
1503.05 | 1 | 0.3% |
1583.0 | 1 | 0.3% |
2203.0 | 1 | 0.3% |
3860.0 | 1 | 0.3% |
4496.94 | 1 | 0.3% |
4691.0 | 1 | 0.3% |
4838.6 | 1 | 0.3% |
4844.77 | 1 | 0.3% |
Value | Count | Frequency (%) |
28768.4 | 1 | 0.3% |
23052.81 | 1 | 0.3% |
20302.8 | 1 | 0.3% |
19246.0 | 1 | 0.3% |
18984.55 | 1 | 0.3% |
18812.65 | 1 | 0.3% |
18506.0 | 1 | 0.3% |
18459.41 | 1 | 0.3% |
18195.21 | 1 | 0.3% |
17268.9 | 1 | 0.3% |
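The positive skewness and high kurtosis reported for 면적(m²) point to a long right tail. One common way to make that concrete is the 1.5 × IQR rule (Tukey fences). The report itself does not apply this rule; the sketch below is an illustration that uses only the quartiles and extreme values listed above.

```python
# Quartiles of 면적(m²) as reported in the quantile statistics.
q1, q3 = 6544.455, 10048.14
iqr = q3 - q1

# Tukey fences: values beyond 1.5 * IQR from the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten smallest and ten largest areas from the extreme-value tables.
smallest = [1069.48, 1423.0, 1503.05, 1583.0, 2203.0,
            3860.0, 4496.94, 4691.0, 4838.6, 4844.77]
largest = [28768.4, 23052.81, 20302.8, 19246.0, 18984.55,
           18812.65, 18506.0, 18459.41, 18195.21, 17268.9]

low_outliers = [v for v in smallest if v < lower_fence]
high_outliers = [v for v in largest if v > upper_fence]

print(round(lower_fence, 4), round(upper_fence, 4))
print(low_outliers, len(high_outliers))
```

Only the listed extremes are checked, so further stations between the upper fence and 17268.9 m² may also exceed it.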
준공연도 (completion year)
Date
Distinct | 61 |
---|---|
Distinct (%) | 21.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Minimum | 1974-08-15 00:00:00 |
---|---|
Maximum | 2019-10-28 00:00:00 |
Correlations
Variable | 연번 | 호선 | 형식 | 길이(M) | 층수 | 면적(m²) | 준공연도 |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.939 | 0.194 | 0.898 | 0.476 | 0.158 | 0.985 |
호선 | 0.939 | 1.000 | 0.272 | 0.846 | 0.529 | 0.311 | 0.995 |
형식 | 0.194 | 0.272 | 1.000 | 0.081 | 0.000 | 0.519 | 0.638 |
길이(M) | 0.898 | 0.846 | 0.081 | 1.000 | 0.854 | 0.543 | 0.976 |
층수 | 0.476 | 0.529 | 0.000 | 0.854 | 1.000 | 0.780 | 0.919 |
면적(m²) | 0.158 | 0.311 | 0.519 | 0.543 | 0.780 | 1.000 | 0.512 |
준공연도 | 0.985 | 0.995 | 0.638 | 0.976 | 0.919 | 0.512 | 1.000 |
Variable | 층수 | 형식 |
---|---|---|
층수 | 1.000 | 0.000 |
형식 | 0.000 | 1.000 |
Variable | 연번 | 호선 | 길이(M) | 면적(m²) | 형식 | 층수 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.990 | -0.826 | 0.135 | 0.121 | 0.207 |
호선 | 0.990 | 1.000 | -0.822 | 0.138 | 0.122 | 0.199 |
길이(M) | -0.826 | -0.822 | 1.000 | 0.024 | 0.060 | 0.635 |
면적(m²) | 0.135 | 0.138 | 0.024 | 1.000 | 0.265 | 0.455 |
형식 | 0.121 | 0.122 | 0.060 | 0.265 | 1.000 | 0.000 |
층수 | 0.207 | 0.199 | 0.635 | 0.455 | 0.000 | 1.000 |
First rows
Index | 연번 | 호선 | 역명 | 형식 | 길이(M) | 층수 | 면적(m²) | 준공연도 |
---|---|---|---|---|---|---|---|---|
0 | 1 | 1 | 서울 | 섬식 | 210 | B2 | 10805.0 | 1974-08-15 |
1 | 2 | 1 | 시청 | 상대식 | 210 | B2 | 11317.0 | 1974-08-15 |
2 | 3 | 1 | 종각 | 상대식 | 210 | B2 | 10410.24 | 1974-08-15 |
3 | 4 | 1 | 종로3가 | 상대식 | 210 | B2 | 9311.0 | 1974-08-15 |
4 | 5 | 1 | 종로5가 | 상대식 | 210 | B2 | 10465.0 | 1974-08-15 |
5 | 6 | 1 | 동대문 | 상대식 | 210 | B2 | 5490.0 | 1974-08-15 |
6 | 7 | 1 | 동묘앞 | 상대식 | 210 | 5FB2 | 7031.66 | 2005-12-21 |
7 | 8 | 1 | 신설동 | 상대식 | 210 | B2 | 7240.0 | 1974-08-15 |
8 | 9 | 1 | 제기동 | 상대식 | 210 | B2 | 8662.0 | 1974-08-15 |
9 | 10 | 1 | 청량리 | 섬식 | 210 | B2 | 7125.0 | 1974-08-15 |
Last rows
Index | 연번 | 호선 | 역명 | 형식 | 길이(M) | 층수 | 면적(m²) | 준공연도 |
---|---|---|---|---|---|---|---|---|
281 | 282 | 9 | 봉은사 | 상대식 | 165 | B2 | 9825.28 | 2015-03-28 |
282 | 283 | 9 | 종합운동장 | 상대식 | 165 | B4 | 13976.51 | 2015-03-28 |
283 | 284 | 9 | 삼전 | 상대식 | 165 | B2 | 8644.07 | 2018-12-01 |
284 | 285 | 9 | 석촌고분 | 섬식 | 165 | B2 | 6833.56 | 2018-12-01 |
285 | 286 | 9 | 석촌 | 섬식 | 165 | B4 | 10105.46 | 2018-12-01 |
286 | 287 | 9 | 송파나루 | 섬식 | 165 | B2 | 7833.29 | 2018-12-01 |
287 | 288 | 9 | 한성백제 | 섬식 | 165 | B2 | 8954.96 | 2018-12-01 |
288 | 289 | 9 | 올림픽공원 | 섬식 | 165 | B3 | 8372.08 | 2018-12-01 |
289 | 290 | 9 | 둔촌오륜 | 섬식 | 165 | B2 | 7544.33 | 2018-12-01 |
290 | 291 | 9 | 중앙보훈병원 | 복합식 | 165 | B2 | 8956.0 | 2018-12-01 |