Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Categorical | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15067/S/1/datasetView.do |
Alerts
NODE_ID is highly overall correlated with ARS_ID and 1 other fields | High correlation |
ARS_ID is highly overall correlated with NODE_ID and 1 other fields | High correlation |
Y좌표 is highly overall correlated with NODE_ID and 1 other fields | High correlation |
NODE_ID has unique values | Unique |
ARS_ID has unique values | Unique |
Reproduction
Analysis started | 2024-04-06 11:26:39.407311 |
---|---|
Analysis finished | 2024-04-06 11:26:44.145239 |
Duration | 4.74 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
NODE_ID
Real number (ℝ)
Alerts: High correlation, Unique
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.132309 × 10^8 |
Minimum | 1 × 10^8 |
---|---|
Maximum | 1.6700064 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 × 10^8 |
---|---|
5-th percentile | 1.0100029 × 10^8 |
Q1 | 1.0790011 × 10^8 |
median | 1.1390007 × 10^8 |
Q3 | 1.1990013 × 10^8 |
95-th percentile | 1.2300052 × 10^8 |
Maximum | 1.6700064 × 10^8 |
Range | 67000639 |
Interquartile range (IQR) | 12000015 |
Descriptive statistics
Standard deviation | 6997297.5 |
---|---|
Coefficient of variation (CV) | 0.061796713 |
Kurtosis | -0.80155146 |
Mean | 1.132309 × 10^8 |
Median Absolute Deviation (MAD) | 6000015 |
Skewness | -0.11528895 |
Sum | 1.132309 × 10^12 |
Variance | 4.8962172 × 10^13 |
Monotonicity | Not monotonic |
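Several of the NODE_ID figures above are derived from others by simple arithmetic. The following is a minimal Python sketch of those relationships, not the profiler's own code. The variable names are illustrative, and the inputs are the rounded values displayed in this report, so the results agree with the tables only to the displayed precision.

```python
# Values copied from the NODE_ID tables in this report (rounded as displayed).
mean = 1.132309e8
std = 6997297.5
smallest = 100000001   # smallest NODE_ID listed in this report
largest = 167000640    # largest NODE_ID listed in this report

# Range is the distance between the extremes.
value_range = largest - smallest

# Coefficient of variation is the standard deviation relative to the mean.
cv = std / mean

# Variance is the square of the standard deviation.
variance = std ** 2

print("Range:", value_range)
print("CV:", round(cv, 6))
print("Variance:", f"{variance:.4e}")
```

Because the mean is shown with only seven significant digits, the recomputed CV can differ from the reported one in its last digits.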
Value | Count | Frequency (%) |
115000162 | 1 | < 0.1% |
115000557 | 1 | < 0.1% |
102900020 | 1 | < 0.1% |
104000258 | 1 | < 0.1% |
103900297 | 1 | < 0.1% |
113900157 | 1 | < 0.1% |
118900183 | 1 | < 0.1% |
122000697 | 1 | < 0.1% |
117900070 | 1 | < 0.1% |
120900119 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Value | Count | Frequency (%) |
100000001 | 1 | < 0.1% |
100000002 | 1 | < 0.1% |
100000003 | 1 | < 0.1% |
100000004 | 1 | < 0.1% |
100000006 | 1 | < 0.1% |
100000007 | 1 | < 0.1% |
100000008 | 1 | < 0.1% |
100000009 | 1 | < 0.1% |
100000010 | 1 | < 0.1% |
100000011 | 1 | < 0.1% |
Value | Count | Frequency (%) |
167000640 | 1 | < 0.1% |
124900141 | 1 | < 0.1% |
124900140 | 1 | < 0.1% |
124900139 | 1 | < 0.1% |
124900138 | 1 | < 0.1% |
124900137 | 1 | < 0.1% |
124900136 | 1 | < 0.1% |
124900135 | 1 | < 0.1% |
124900134 | 1 | < 0.1% |
124900133 | 1 | < 0.1% |
ARS_ID
Real number (ℝ)
Alerts: High correlation, Unique
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14345.807 |
Minimum | 1001 |
---|---|
Maximum | 25999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1001 |
---|---|
5-th percentile | 2522.95 |
Q1 | 8589.75 |
median | 14603.5 |
Q3 | 20795.25 |
95-th percentile | 24429.05 |
Maximum | 25999 |
Range | 24998 |
Interquartile range (IQR) | 12205.5 |
Descriptive statistics
Standard deviation | 6984.6838 |
---|---|
Coefficient of variation (CV) | 0.48687981 |
Kurtosis | -1.1233465 |
Mean | 14345.807 |
Median Absolute Deviation (MAD) | 6071.5 |
Skewness | -0.1550181 |
Sum | 1.4345807 × 10^8 |
Variance | 48785808 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16259 | 1 | < 0.1% |
16851 | 1 | < 0.1% |
3551 | 1 | < 0.1% |
5697 | 1 | < 0.1% |
4533 | 1 | < 0.1% |
14889 | 1 | < 0.1% |
19881 | 1 | < 0.1% |
23179 | 1 | < 0.1% |
18565 | 1 | < 0.1% |
21785 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Value | Count | Frequency (%) |
1001 | 1 | < 0.1% |
1002 | 1 | < 0.1% |
1003 | 1 | < 0.1% |
1004 | 1 | < 0.1% |
1006 | 1 | < 0.1% |
1007 | 1 | < 0.1% |
1008 | 1 | < 0.1% |
1010 | 1 | < 0.1% |
1011 | 1 | < 0.1% |
1012 | 1 | < 0.1% |
Value | Count | Frequency (%) |
25999 | 1 | < 0.1% |
25998 | 1 | < 0.1% |
25997 | 1 | < 0.1% |
25996 | 1 | < 0.1% |
25995 | 1 | < 0.1% |
25994 | 1 | < 0.1% |
25990 | 1 | < 0.1% |
25784 | 1 | < 0.1% |
25782 | 1 | < 0.1% |
25781 | 1 | < 0.1% |
정류소명 (stop name)
Text
Distinct | 6694 |
---|---|
Distinct (%) | 66.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
벽산아파트 | 12 | 0.1% |
새마을금고 | 11 | 0.1% |
현대아파트 | 10 | 0.1% |
구로디지털단지역 | 9 | 0.1% |
북서울꿈의숲 | 9 | 0.1% |
삼성래미안아파트 | 8 | 0.1% |
가산디지털단지역 | 8 | 0.1% |
신대방역 | 8 | 0.1% |
합정역 | 8 | 0.1% |
우성아파트 | 8 | 0.1% |
Other values (6685) | 9910 | 99.1% |
Most occurring characters
Value | Count | Frequency (%) |
동 | 2251 | 2.9% |
파 | 2132 | 2.7% |
. | 2124 | 2.7% |
아 | 2116 | 2.7% |
트 | 2059 | 2.6% |
교 | 1808 | 2.3% |
구 | 1567 | 2.0% |
역 | 1482 | 1.9% |
대 | 1288 | 1.7% |
지 | 1235 | 1.6% |
Other values (652) | 59657 | 76.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 72092 | 92.8% |
Decimal Number | 2514 | 3.2% |
Other Punctuation | 2150 | 2.8% |
Uppercase Letter | 669 | 0.9% |
Close Punctuation | 127 | 0.2% |
Open Punctuation | 125 | 0.2% |
Lowercase Letter | 32 | < 0.1% |
Dash Punctuation | 9 | < 0.1% |
Space Separator | 1 | < 0.1% |
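The category labels above match the Unicode general categories (for example, Other Letter is `Lo` and Decimal Number is `Nd`). The sketch below shows how such a tally could be reproduced with Python's standard `unicodedata` module. It assumes the profiler's labels correspond to these categories and is not the profiler's own implementation. The function name `category_counts` is hypothetical, and the three stop names are taken from the sample rows in this report.

```python
import unicodedata
from collections import Counter

# Long names for the general categories that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}

def category_counts(names):
    """Count characters per Unicode general category across all names."""
    counts = Counter()
    for name in names:
        for ch in name:
            counts[unicodedata.category(ch)] += 1
    return counts

# Three stop names taken from the sample rows of this report.
sample = ["가양역9번출구.우성아파트", "서울대치과대학", "1호선구일역"]
counts = category_counts(sample)

for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the table.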
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2251 | 3.1% |
파 | 2132 | 3.0% |
아 | 2116 | 2.9% |
트 | 2059 | 2.9% |
교 | 1808 | 2.5% |
구 | 1567 | 2.2% |
역 | 1482 | 2.1% |
대 | 1288 | 1.8% |
지 | 1235 | 1.7% |
학 | 1226 | 1.7% |
Other values (605) | 54928 | 76.2% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 86 | 12.9% |
K | 71 | 10.6% |
S | 70 | 10.5% |
C | 66 | 9.9% |
A | 56 | 8.4% |
P | 52 | 7.8% |
G | 42 | 6.3% |
M | 38 | 5.7% |
D | 33 | 4.9% |
B | 29 | 4.3% |
Other values (14) | 126 | 18.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 749 | 29.8% |
2 | 472 | 18.8% |
3 | 342 | 13.6% |
4 | 211 | 8.4% |
5 | 168 | 6.7% |
0 | 160 | 6.4% |
7 | 124 | 4.9% |
6 | 122 | 4.9% |
9 | 105 | 4.2% |
8 | 61 | 2.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 2124 | 98.8% |
· | 12 | 0.6% |
& | 11 | 0.5% |
, | 2 | 0.1% |
? | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 24 | 75.0% |
k | 4 | 12.5% |
t | 2 | 6.2% |
s | 2 | 6.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 127 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 125 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 72092 | 92.8% |
Common | 4926 | 6.3% |
Latin | 701 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2251 | 3.1% |
파 | 2132 | 3.0% |
아 | 2116 | 2.9% |
트 | 2059 | 2.9% |
교 | 1808 | 2.5% |
구 | 1567 | 2.2% |
역 | 1482 | 2.1% |
대 | 1288 | 1.8% |
지 | 1235 | 1.7% |
학 | 1226 | 1.7% |
Other values (605) | 54928 | 76.2% |
Latin
Value | Count | Frequency (%) |
T | 86 | 12.3% |
K | 71 | 10.1% |
S | 70 | 10.0% |
C | 66 | 9.4% |
A | 56 | 8.0% |
P | 52 | 7.4% |
G | 42 | 6.0% |
M | 38 | 5.4% |
D | 33 | 4.7% |
B | 29 | 4.1% |
Other values (18) | 158 | 22.5% |
Common
Value | Count | Frequency (%) |
. | 2124 | 43.1% |
1 | 749 | 15.2% |
2 | 472 | 9.6% |
3 | 342 | 6.9% |
4 | 211 | 4.3% |
5 | 168 | 3.4% |
0 | 160 | 3.2% |
) | 127 | 2.6% |
( | 125 | 2.5% |
7 | 124 | 2.5% |
Other values (9) | 324 | 6.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 72092 | 92.8% |
ASCII | 5615 | 7.2% |
None | 12 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 2251 | 3.1% |
파 | 2132 | 3.0% |
아 | 2116 | 2.9% |
트 | 2059 | 2.9% |
교 | 1808 | 2.5% |
구 | 1567 | 2.2% |
역 | 1482 | 2.1% |
대 | 1288 | 1.8% |
지 | 1235 | 1.7% |
학 | 1226 | 1.7% |
Other values (605) | 54928 | 76.2% |
ASCII
Value | Count | Frequency (%) |
. | 2124 | 37.8% |
1 | 749 | 13.3% |
2 | 472 | 8.4% |
3 | 342 | 6.1% |
4 | 211 | 3.8% |
5 | 168 | 3.0% |
0 | 160 | 2.8% |
) | 127 | 2.3% |
( | 125 | 2.2% |
7 | 124 | 2.2% |
Other values (36) | 1013 | 18.0% |
None
Value | Count | Frequency (%) |
· | 12 | 100.0% |
X좌표 (X coordinate, longitude)
Real number (ℝ)
Distinct | 9992 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.98632 |
Minimum | 126.45723 |
---|---|
Maximum | 127.18176 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 126.45723 |
---|---|
5-th percentile | 126.84227 |
Q1 | 126.91706 |
median | 126.99511 |
Q3 | 127.05156 |
95-th percentile | 127.12836 |
Maximum | 127.18176 |
Range | 0.72453 |
Interquartile range (IQR) | 0.13450275 |
Descriptive statistics
Standard deviation | 0.086448895 |
---|---|
Coefficient of variation (CV) | 0.00068077329 |
Kurtosis | -0.73179472 |
Mean | 126.98632 |
Median Absolute Deviation (MAD) | 0.068365881 |
Skewness | -0.063892168 |
Sum | 1269863.2 |
Variance | 0.0074734115 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.18176 | 2 | < 0.1% |
127.040517 | 2 | < 0.1% |
127.0360924585 | 2 | < 0.1% |
127.0520250488 | 2 | < 0.1% |
127.1443598929 | 2 | < 0.1% |
127.1480886874 | 2 | < 0.1% |
127.013138 | 2 | < 0.1% |
127.013707 | 2 | < 0.1% |
126.8128627805 | 1 | < 0.1% |
127.1090124148 | 1 | < 0.1% |
Other values (9982) | 9982 | 99.8% |
Value | Count | Frequency (%) |
126.45723 | 1 | < 0.1% |
126.7210313414 | 1 | < 0.1% |
126.797811 | 1 | < 0.1% |
126.7978638462 | 1 | < 0.1% |
126.797978 | 1 | < 0.1% |
126.798335 | 1 | < 0.1% |
126.7984631135 | 1 | < 0.1% |
126.7985207144 | 1 | < 0.1% |
126.7985641294 | 1 | < 0.1% |
126.7987623811 | 1 | < 0.1% |
Value | Count | Frequency (%) |
127.18176 | 2 | < 0.1% |
127.1817343335 | 1 | < 0.1% |
127.1816669472 | 1 | < 0.1% |
127.18013794 | 1 | < 0.1% |
127.18013 | 1 | < 0.1% |
127.1799002887 | 1 | < 0.1% |
127.1798392415 | 1 | < 0.1% |
127.179726 | 1 | < 0.1% |
127.1797196537 | 1 | < 0.1% |
127.1794170581 | 1 | < 0.1% |
Y좌표 (Y coordinate, latitude)
Real number (ℝ)
Alerts: High correlation
Distinct | 9991 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.549975 |
Minimum | 37.43052 |
---|---|
Maximum | 37.690177 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 37.43052 |
---|---|
5-th percentile | 37.471265 |
Q1 | 37.50218 |
median | 37.549361 |
Q3 | 37.589644 |
95-th percentile | 37.646861 |
Maximum | 37.690177 |
Range | 0.25965706 |
Interquartile range (IQR) | 0.087464028 |
Descriptive statistics
Standard deviation | 0.054737504 |
---|---|
Coefficient of variation (CV) | 0.0014577241 |
Kurtosis | -0.76947379 |
Mean | 37.549975 |
Median Absolute Deviation (MAD) | 0.0442798 |
Skewness | 0.26675677 |
Sum | 375499.75 |
Variance | 0.0029961944 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.4763036239 | 2 | < 0.1% |
37.450172246 | 2 | < 0.1% |
37.4892113897 | 2 | < 0.1% |
37.6015704448 | 2 | < 0.1% |
37.5625221156 | 2 | < 0.1% |
37.5361286235 | 2 | < 0.1% |
37.5553932418 | 2 | < 0.1% |
37.5661382638 | 2 | < 0.1% |
37.553371595 | 2 | < 0.1% |
37.5039954584 | 1 | < 0.1% |
Other values (9981) | 9981 | 99.8% |
Value | Count | Frequency (%) |
37.4305199435 | 1 | < 0.1% |
37.4309469125 | 1 | < 0.1% |
37.4345128931 | 1 | < 0.1% |
37.4347964213 | 1 | < 0.1% |
37.4348585994 | 1 | < 0.1% |
37.4349735461 | 1 | < 0.1% |
37.4350042057 | 1 | < 0.1% |
37.4355241561 | 1 | < 0.1% |
37.4371542291 | 1 | < 0.1% |
37.4373210738 | 1 | < 0.1% |
Value | Count | Frequency (%) |
37.690177 | 1 | < 0.1% |
37.6899483575 | 1 | < 0.1% |
37.6898762161 | 1 | < 0.1% |
37.6893500743 | 1 | < 0.1% |
37.689202857 | 1 | < 0.1% |
37.6890118581 | 1 | < 0.1% |
37.688568 | 1 | < 0.1% |
37.6879883235 | 1 | < 0.1% |
37.6879397664 | 1 | < 0.1% |
37.6874938159 | 1 | < 0.1% |
정류소타입 (stop type)
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
일반차로 | 5494 |
---|---|
마을버스 | 3704 |
중앙차로 | 353 |
가로변시간 | 236 |
가로변전일 | 134 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.0449 |
Min length | 4 |
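The length statistics above can be cross-checked against the category counts reported for this variable. The sketch below is a minimal Python illustration under that assumption. The variable names are hypothetical, and the counts are copied from the Common Values table of this report.

```python
import statistics

# Category counts copied from the Common Values table of this report.
counts = {
    "일반차로": 5494,
    "마을버스": 3704,
    "중앙차로": 353,
    "가로변시간": 236,
    "가로변전일": 134,
    "가상정류장": 79,
}

total = sum(counts.values())

# Weight each label's character length by how often the label occurs.
mean_length = sum(len(label) * n for label, n in counts.items()) / total

# Expand to one length per observation to take the median and extremes.
lengths = [len(label) for label, n in counts.items() for _ in range(n)]
median_length = statistics.median(lengths)
max_length = max(lengths)
min_length = min(lengths)

print(mean_length, median_length, max_length, min_length)
```

The three five-character labels account for 449 of the 10000 observations, which is what lifts the mean slightly above 4.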
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반차로 |
---|---|
2nd row | 마을버스 |
3rd row | 일반차로 |
4th row | 마을버스 |
5th row | 일반차로 |
Common Values
Value | Count | Frequency (%) |
일반차로 | 5494 | 54.9% |
마을버스 | 3704 | 37.0% |
중앙차로 | 353 | 3.5% |
가로변시간 | 236 | 2.4% |
가로변전일 | 134 | 1.3% |
가상정류장 | 79 | 0.8% |
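Each frequency in this table is the category count divided by the number of observations. The sketch below is a minimal Python illustration of that calculation, with hypothetical variable names and counts copied from the table above. It rounds to one decimal place, which is an assumption about how the report displays percentages.

```python
# Category counts copied from the Common Values table above.
counts = {
    "일반차로": 5494,
    "마을버스": 3704,
    "중앙차로": 353,
    "가로변시간": 236,
    "가로변전일": 134,
    "가상정류장": 79,
}

# The counts cover every observation in the dataset.
total = sum(counts.values())

# Share of each category as a percentage, rounded to one decimal place.
shares = {label: round(100 * n / total, 1) for label, n in counts.items()}

for label, share in shares.items():
    print(f"{label} | {counts[label]} | {share}% |")
```

The two largest categories, 일반차로 and 마을버스, together cover about 92% of the stops.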
Variable | NODE_ID | ARS_ID | X좌표 | Y좌표 | 정류소타입 |
---|---|---|---|---|---|
NODE_ID | 1.000 | 0.977 | 0.822 | 0.835 | 0.132 |
ARS_ID | 0.977 | 1.000 | 0.792 | 0.859 | 0.285 |
X좌표 | 0.822 | 0.792 | 1.000 | 0.397 | 0.207 |
Y좌표 | 0.835 | 0.859 | 0.397 | 1.000 | 0.168 |
정류소타입 | 0.132 | 0.285 | 0.207 | 0.168 | 1.000 |
Variable | NODE_ID | ARS_ID | X좌표 | Y좌표 | 정류소타입 |
---|---|---|---|---|---|
NODE_ID | 1.000 | 0.998 | -0.051 | -0.674 | 0.090 |
ARS_ID | 0.998 | 1.000 | -0.052 | -0.674 | 0.154 |
X좌표 | -0.051 | -0.052 | 1.000 | 0.217 | 0.116 |
Y좌표 | -0.674 | -0.674 | 0.217 | 1.000 | 0.089 |
정류소타입 | 0.090 | 0.154 | 0.116 | 0.089 | 1.000 |
Index | NODE_ID | ARS_ID | 정류소명 | X좌표 | Y좌표 | 정류소타입 |
---|---|---|---|---|---|---|
6382 | 115000162 | 16259 | 가양역9번출구.우성아파트 | 126.853974 | 37.561452 | 일반차로 |
6512 | 115900235 | 16467 | 한광고 | 126.858481 | 37.537793 | 마을버스 |
9821 | 122000024 | 23124 | 언북중학교입구 | 127.030821 | 37.520298 | 일반차로 |
3290 | 108900093 | 9682 | 벽산아파트 | 127.019933 | 37.640982 | 마을버스 |
6326 | 115000106 | 16203 | 신월초등학교 | 126.838969 | 37.538635 | 일반차로 |
3977 | 110000197 | 11297 | 쌍용스윗닷홈아파트 | 127.045425 | 37.630398 | 일반차로 |
3228 | 108900138 | 9528 | 당진슈퍼 | 127.032306 | 37.625575 | 마을버스 |
10254 | 122000741 | 23816 | 봉은사역코엑스인터컨티넨탈 | 127.057443 | 37.513613 | 일반차로 |
2037 | 106000431 | 7011 | 금란교회 | 127.104033 | 37.60053 | 중앙차로 |
10528 | 123000174 | 24264 | 오금동대림아파트 | 127.127992 | 37.507916 | 일반차로 |
Index | NODE_ID | ARS_ID | 정류소명 | X좌표 | Y좌표 | 정류소타입 |
---|---|---|---|---|---|---|
3248 | 108900203 | 9551 | 수유1동주민센터.파출소 | 127.017525 | 37.630153 | 마을버스 |
5174 | 112900016 | 13839 | 금강빌라.인왕중학교 | 126.951626 | 37.592211 | 마을버스 |
363 | 100900030 | 1878 | 서울대치과대학 | 126.997801 | 37.577408 | 마을버스 |
4003 | 110000222 | 11323 | 월계삼호4차아파트 | 127.065982 | 37.626663 | 일반차로 |
2259 | 106000228 | 7324 | 아남리치카운티아파트 | 127.085382 | 37.590406 | 일반차로 |
749 | 102000152 | 3246 | 중앙하이츠빌라앞 | 126.959881 | 37.537974 | 일반차로 |
3392 | 108900007 | 9890 | 번3동주민센터 | 127.046675 | 37.626024 | 마을버스 |
5225 | 112900204 | 13912 | 홍제우체국 | 126.946333 | 37.586761 | 마을버스 |
8256 | 119000056 | 20149 | 상도초등학교입구 | 126.936766 | 37.503306 | 일반차로 |
7313 | 116900126 | 17954 | 1호선구일역 | 126.872215 | 37.495217 | 마을버스 |