Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 667 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 22.3 KiB |
Average record size in memory | 34.2 B |
Variable types
Numeric | 2 |
---|---|
Text | 1 |
Categorical | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-12764/A/1/datasetView.do |
Reproduction
Analysis started | 2024-05-04 00:29:34.249938 |
---|---|
Analysis finished | 2024-05-04 00:29:36.383497 |
Duration | 2.13 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
SUBWAY_ID
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1026.9055 |
Minimum | 1001 |
---|---|
Maximum | 1093 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.0 KiB |
Quantile statistics
Minimum | 1001 |
---|---|
5-th percentile | 1001 |
Q1 | 1003 |
median | 1006 |
Q3 | 1063 |
95-th percentile | 1088.7 |
Maximum | 1093 |
Range | 92 |
Interquartile range (IQR) | 60 |
Descriptive statistics
Standard deviation | 33.129898 |
---|---|
Coefficient of variation (CV) | 0.032261875 |
Kurtosis | -1.1501864 |
Mean | 1026.9055 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 0.82583454 |
Sum | 684946 |
Variance | 1097.5902 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
1001 | 102 | |
1075 | 63 | |
1005 | 56 | |
1063 | 55 | |
1007 | 53 | |
1002 | 51 | |
1004 | 48 | |
1003 | 44 | 6.6% |
1006 | 39 | 5.8% |
1009 | 38 | 5.7% |
Other values (7) | 118 |
Value | Count | Frequency (%) |
1001 | 102 | |
1002 | 51 | |
1003 | 44 | |
1004 | 48 | |
1005 | 56 | |
1006 | 39 | 5.8% |
1007 | 53 | |
1008 | 18 | 2.7% |
1009 | 38 | 5.7% |
1063 | 55 |
Value | Count | Frequency (%) |
1093 | 21 | 3.1% |
1092 | 13 | 1.9% |
1081 | 11 | 1.6% |
1077 | 16 | 2.4% |
1075 | 63 | |
1067 | 25 | 3.7% |
1065 | 14 | 2.1% |
1063 | 55 | |
1009 | 38 | |
1008 | 18 | 2.7% |
STATN_ID
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 667 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0269299 × 109 |
Minimum | 1.0010001 × 109 |
---|---|
Maximum | 1.093004 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.0 KiB |
Quantile statistics
Minimum | 1.0010001 × 109 |
---|---|
5-th percentile | 1.0010001 × 109 |
Q1 | 1.0030003 × 109 |
median | 1.0060006 × 109 |
Q3 | 1.0630753 × 109 |
95-th percentile | 1.0887145 × 109 |
Maximum | 1.093004 × 109 |
Range | 92003922 |
Interquartile range (IQR) | 60075012 |
Descriptive statistics
Standard deviation | 33147658 |
---|---|
Coefficient of variation (CV) | 0.032278404 |
Kurtosis | -1.1520386 |
Mean | 1.0269299 × 109 |
Median Absolute Deviation (MAD) | 4000428 |
Skewness | 0.82537834 |
Sum | 6.8496225 × 1011 |
Variance | 1.0987673 × 1015 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1001000100 | 1 | 0.1% |
1009000929 | 1 | 0.1% |
1009000931 | 1 | 0.1% |
1009000932 | 1 | 0.1% |
1009000933 | 1 | 0.1% |
1009000934 | 1 | 0.1% |
1009000935 | 1 | 0.1% |
1009000936 | 1 | 0.1% |
1009000937 | 1 | 0.1% |
1009000938 | 1 | 0.1% |
Other values (657) | 657 |
Value | Count | Frequency (%) |
1001000100 | 1 | |
1001000101 | 1 | |
1001000102 | 1 | |
1001000103 | 1 | |
1001000104 | 1 | |
1001000105 | 1 | |
1001000106 | 1 | |
1001000107 | 1 | |
1001000108 | 1 | |
1001000109 | 1 |
Value | Count | Frequency (%) |
1093004022 | 1 | |
1093004021 | 1 | |
1093004020 | 1 | |
1093004019 | 1 | |
1093004018 | 1 | |
1093004017 | 1 | |
1093004016 | 1 | |
1093004014 | 1 | |
1093004013 | 1 | |
1093004012 | 1 |
STATN_NM
Text
Distinct | 546 |
---|---|
Distinct (%) | 81.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.3 KiB |
Value | Count | Frequency (%) |
청량리 | 4 | 0.6% |
김포공항 | 4 | 0.6% |
공덕 | 4 | 0.6% |
서울 | 4 | 0.6% |
왕십리 | 4 | 0.6% |
홍대입구 | 3 | 0.4% |
대곡 | 3 | 0.4% |
고속터미널 | 3 | 0.4% |
초지 | 3 | 0.4% |
신설동 | 3 | 0.4% |
Other values (537) | 633 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 69 | 3.4% |
산 | 56 | 2.8% |
신 | 52 | 2.6% |
구 | 51 | 2.5% |
동 | 48 | 2.4% |
천 | 40 | 2.0% |
원 | 38 | 1.9% |
정 | 36 | 1.8% |
청 | 32 | 1.6% |
) | 29 | 1.4% |
Other values (283) | 1585 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1962 | |
Close Punctuation | 29 | 1.4% |
Open Punctuation | 29 | 1.4% |
Decimal Number | 13 | 0.6% |
Other Punctuation | 2 | 0.1% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 69 | 3.5% |
산 | 56 | 2.9% |
신 | 52 | 2.7% |
구 | 51 | 2.6% |
동 | 48 | 2.4% |
천 | 40 | 2.0% |
원 | 38 | 1.9% |
정 | 36 | 1.8% |
청 | 32 | 1.6% |
지 | 29 | 1.5% |
Other values (272) | 1511 |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | |
4 | 3 | |
1 | 2 | 15.4% |
9 | 1 | 7.7% |
2 | 1 | 7.7% |
5 | 1 | 7.7% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 | |
, | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 29 |
Open Punctuation
Value | Count | Frequency (%) |
( | 29 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1962 | |
Common | 74 | 3.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 69 | 3.5% |
산 | 56 | 2.9% |
신 | 52 | 2.7% |
구 | 51 | 2.6% |
동 | 48 | 2.4% |
천 | 40 | 2.0% |
원 | 38 | 1.9% |
정 | 36 | 1.8% |
청 | 32 | 1.6% |
지 | 29 | 1.5% |
Other values (272) | 1511 |
Common
Value | Count | Frequency (%) |
) | 29 | |
( | 29 | |
3 | 5 | 6.8% |
4 | 3 | 4.1% |
1 | 2 | 2.7% |
9 | 1 | 1.4% |
1 | 1.4% | |
2 | 1 | 1.4% |
. | 1 | 1.4% |
, | 1 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1962 | |
ASCII | 74 | 3.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 69 | 3.5% |
산 | 56 | 2.9% |
신 | 52 | 2.7% |
구 | 51 | 2.6% |
동 | 48 | 2.4% |
천 | 40 | 2.0% |
원 | 38 | 1.9% |
정 | 36 | 1.8% |
청 | 32 | 1.6% |
지 | 29 | 1.5% |
Other values (272) | 1511 |
ASCII
Value | Count | Frequency (%) |
) | 29 | |
( | 29 | |
3 | 5 | 6.8% |
4 | 3 | 4.1% |
1 | 2 | 2.7% |
9 | 1 | 1.4% |
1 | 1.4% | |
2 | 1 | 1.4% |
. | 1 | 1.4% |
, | 1 | 1.4% |
호선이름
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.3 KiB |
1호선 | |
---|---|
수인분당선 | |
5호선 | |
경의중앙선 | |
7호선 | |
Other values (12) |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.4377811 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1호선 |
---|---|
2nd row | 1호선 |
3rd row | 1호선 |
4th row | 1호선 |
5th row | 1호선 |
Common Values
Value | Count | Frequency (%) |
1호선 | 102 | |
수인분당선 | 63 | |
5호선 | 56 | |
경의중앙선 | 55 | |
7호선 | 53 | |
2호선 | 51 | |
4호선 | 48 | |
3호선 | 44 | 6.6% |
6호선 | 39 | 5.8% |
9호선 | 38 | 5.7% |
Other values (7) | 118 |
Length
Value | Count | Frequency (%) |
1호선 | 102 | |
수인분당선 | 63 | |
5호선 | 56 | |
경의중앙선 | 55 | |
7호선 | 53 | |
2호선 | 51 | |
4호선 | 48 | |
3호선 | 44 | 6.6% |
6호선 | 39 | 5.8% |
9호선 | 38 | 5.7% |
Other values (7) | 118 |
SUBWAY_ID | STATN_ID | 호선이름 | |
---|---|---|---|
SUBWAY_ID | 1.000 | 1.000 | 1.000 |
STATN_ID | 1.000 | 1.000 | 1.000 |
호선이름 | 1.000 | 1.000 | 1.000 |
SUBWAY_ID | STATN_ID | 호선이름 | |
---|---|---|---|
SUBWAY_ID | 1.000 | 0.996 | 0.991 |
STATN_ID | 0.996 | 1.000 | 0.991 |
호선이름 | 0.991 | 0.991 | 1.000 |
SUBWAY_ID | STATN_ID | STATN_NM | 호선이름 | |
---|---|---|---|---|
0 | 1001 | 1001000100 | 소요산 | 1호선 |
1 | 1001 | 1001000101 | 동두천 | 1호선 |
2 | 1001 | 1001000102 | 보산 | 1호선 |
3 | 1001 | 1001000103 | 동두천중앙 | 1호선 |
4 | 1001 | 1001000104 | 지행 | 1호선 |
5 | 1001 | 1001000105 | 덕정 | 1호선 |
6 | 1001 | 1001000106 | 덕계 | 1호선 |
7 | 1001 | 1001000107 | 양주 | 1호선 |
8 | 1001 | 1001000108 | 녹양 | 1호선 |
9 | 1001 | 1001000109 | 가능 | 1호선 |
SUBWAY_ID | STATN_ID | STATN_NM | 호선이름 | |
---|---|---|---|---|
657 | 1093 | 1093004012 | 시흥대야 | 서해선 |
658 | 1093 | 1093004013 | 신천 | 서해선 |
659 | 1093 | 1093004014 | 신현 | 서해선 |
660 | 1093 | 1093004016 | 시흥시청 | 서해선 |
661 | 1093 | 1093004017 | 시흥능곡 | 서해선 |
662 | 1093 | 1093004018 | 달미 | 서해선 |
663 | 1093 | 1093004019 | 선부 | 서해선 |
664 | 1093 | 1093004020 | 초지 | 서해선 |
665 | 1093 | 1093004021 | 시우 | 서해선 |
666 | 1093 | 1093004022 | 원시 | 서해선 |