Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 507.8 KiB |
Average record size in memory | 52.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-12912/S/1/datasetView.do |
NODE_ID(노드ID) is highly overall correlated with STTN_NO(정류소ID) and 1 other fields | High correlation |
STTN_NO(정류소ID) is highly overall correlated with NODE_ID(노드ID) and 1 other fields | High correlation |
CRDNT_Y(Y좌표) is highly overall correlated with NODE_ID(노드ID) and 1 other fields | High correlation |
NODE_ID(노드ID) has unique values | Unique |
STTN_NO(정류소ID) has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 16:28:37.388058 |
---|---|
Analysis finished | 2024-04-29 16:28:40.972988 |
Duration | 3.58 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
NODE_ID(노드ID)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1310799 × 108 |
Minimum | 1 × 108 |
---|---|
Maximum | 1.2900019 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 × 108 |
---|---|
5-th percentile | 1.0100028 × 108 |
Q1 | 1.0790014 × 108 |
median | 1.1300047 × 108 |
Q3 | 1.1900014 × 108 |
95-th percentile | 1.2300022 × 108 |
Maximum | 1.2900019 × 108 |
Range | 29000185 |
Interquartile range (IQR) | 11099994 |
Descriptive statistics
Standard deviation | 6872993.7 |
---|---|
Coefficient of variation (CV) | 0.06076488 |
Kurtosis | -1.1042772 |
Mean | 1.1310799 × 108 |
Median Absolute Deviation (MAD) | 5899727 |
Skewness | -0.14394119 |
Sum | 1.1310799 × 1012 |
Variance | 4.7238042 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
114000157 | 1 | < 0.1% |
121900217 | 1 | < 0.1% |
107900298 | 1 | < 0.1% |
124900077 | 1 | < 0.1% |
108000364 | 1 | < 0.1% |
116900273 | 1 | < 0.1% |
122000214 | 1 | < 0.1% |
116900006 | 1 | < 0.1% |
108000094 | 1 | < 0.1% |
107900004 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
100000001 | 1 | |
100000002 | 1 | |
100000003 | 1 | |
100000004 | 1 | |
100000005 | 1 | |
100000006 | 1 | |
100000007 | 1 | |
100000008 | 1 | |
100000010 | 1 | |
100000011 | 1 |
Value | Count | Frequency (%) |
129000186 | 1 | |
124900124 | 1 | |
124900123 | 1 | |
124900122 | 1 | |
124900120 | 1 | |
124900119 | 1 | |
124900118 | 1 | |
124900116 | 1 | |
124900113 | 1 | |
124900112 | 1 |
STTN_NM(정류소명칭)
Text
Distinct | 6436 |
---|---|
Distinct (%) | 64.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 19 |
Mean length | 7.4161 |
Min length | 2 |
Characters and Unicode
Total characters | 74161 |
---|---|
Distinct characters | 651 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 3762 ? |
---|---|
Unique (%) | 37.6% |
Sample
1st row | 서울남부지방법원.서울남부지방검찰청 |
---|---|
2nd row | 방배역.방배서리풀이편한세상 |
3rd row | 동성주택 |
4th row | 거성푸르뫼아파트 |
5th row | 동아아파트 |
Value | Count | Frequency (%) |
벽산아파트 | 11 | 0.1% |
현대아파트 | 11 | 0.1% |
국민은행 | 11 | 0.1% |
경남아파트 | 11 | 0.1% |
새마을금고 | 10 | 0.1% |
성원아파트 | 10 | 0.1% |
가산디지털단지역 | 10 | 0.1% |
우성아파트 | 9 | 0.1% |
신대방역 | 9 | 0.1% |
북서울꿈의숲 | 9 | 0.1% |
Other values (6434) | 9914 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 2278 | 3.1% |
아 | 2098 | 2.8% |
파 | 2037 | 2.7% |
트 | 2033 | 2.7% |
. | 1904 | 2.6% |
교 | 1712 | 2.3% |
구 | 1442 | 1.9% |
역 | 1425 | 1.9% |
대 | 1246 | 1.7% |
지 | 1233 | 1.7% |
Other values (641) | 56753 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 69197 | |
Decimal Number | 2274 | 3.1% |
Other Punctuation | 1914 | 2.6% |
Uppercase Letter | 651 | 0.9% |
Open Punctuation | 39 | 0.1% |
Close Punctuation | 39 | 0.1% |
Lowercase Letter | 22 | < 0.1% |
Space Separator | 15 | < 0.1% |
Dash Punctuation | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2278 | 3.3% |
아 | 2098 | 3.0% |
파 | 2037 | 2.9% |
트 | 2033 | 2.9% |
교 | 1712 | 2.5% |
구 | 1442 | 2.1% |
역 | 1425 | 2.1% |
대 | 1246 | 1.8% |
지 | 1233 | 1.8% |
학 | 1198 | 1.7% |
Other values (599) | 52495 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 99 | |
K | 83 | |
A | 64 | |
S | 56 | |
C | 54 | |
P | 50 | |
G | 39 | 6.0% |
B | 38 | 5.8% |
M | 29 | 4.5% |
L | 28 | 4.3% |
Other values (12) | 111 |
Decimal Number
Value | Count | Frequency (%) |
1 | 693 | |
2 | 446 | |
3 | 323 | |
4 | 181 | 8.0% |
5 | 149 | 6.6% |
0 | 138 | 6.1% |
7 | 109 | 4.8% |
6 | 99 | 4.4% |
9 | 86 | 3.8% |
8 | 50 | 2.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1904 | |
& | 6 | 0.3% |
· | 4 | 0.2% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 18 | |
k | 2 | 9.1% |
t | 2 | 9.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 39 |
Close Punctuation
Value | Count | Frequency (%) |
) | 39 |
Space Separator
Value | Count | Frequency (%) |
15 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 69197 | |
Common | 4291 | 5.8% |
Latin | 673 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2278 | 3.3% |
아 | 2098 | 3.0% |
파 | 2037 | 2.9% |
트 | 2033 | 2.9% |
교 | 1712 | 2.5% |
구 | 1442 | 2.1% |
역 | 1425 | 2.1% |
대 | 1246 | 1.8% |
지 | 1233 | 1.8% |
학 | 1198 | 1.7% |
Other values (599) | 52495 |
Latin
Value | Count | Frequency (%) |
T | 99 | |
K | 83 | |
A | 64 | |
S | 56 | |
C | 54 | |
P | 50 | 7.4% |
G | 39 | 5.8% |
B | 38 | 5.6% |
M | 29 | 4.3% |
L | 28 | 4.2% |
Other values (15) | 133 |
Common
Value | Count | Frequency (%) |
. | 1904 | |
1 | 693 | 16.2% |
2 | 446 | 10.4% |
3 | 323 | 7.5% |
4 | 181 | 4.2% |
5 | 149 | 3.5% |
0 | 138 | 3.2% |
7 | 109 | 2.5% |
6 | 99 | 2.3% |
9 | 86 | 2.0% |
Other values (7) | 163 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 69197 | |
ASCII | 4960 | 6.7% |
None | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 2278 | 3.3% |
아 | 2098 | 3.0% |
파 | 2037 | 2.9% |
트 | 2033 | 2.9% |
교 | 1712 | 2.5% |
구 | 1442 | 2.1% |
역 | 1425 | 2.1% |
대 | 1246 | 1.8% |
지 | 1233 | 1.8% |
학 | 1198 | 1.7% |
Other values (599) | 52495 |
ASCII
Value | Count | Frequency (%) |
. | 1904 | |
1 | 693 | 14.0% |
2 | 446 | 9.0% |
3 | 323 | 6.5% |
4 | 181 | 3.6% |
5 | 149 | 3.0% |
0 | 138 | 2.8% |
7 | 109 | 2.2% |
6 | 99 | 2.0% |
T | 99 | 2.0% |
Other values (31) | 819 |
None
Value | Count | Frequency (%) |
· | 4 |
STTN_NO(정류소ID)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14197.906 |
Minimum | 1001 |
---|---|
Maximum | 25990 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1001 |
---|---|
5-th percentile | 2521.95 |
Q1 | 8755.5 |
median | 14355.5 |
Q3 | 20228.5 |
95-th percentile | 24306.05 |
Maximum | 25990 |
Range | 24989 |
Interquartile range (IQR) | 11473 |
Descriptive statistics
Standard deviation | 6883.9191 |
---|---|
Coefficient of variation (CV) | 0.48485453 |
Kurtosis | -1.0987394 |
Mean | 14197.906 |
Median Absolute Deviation (MAD) | 5807.5 |
Skewness | -0.13831109 |
Sum | 1.4197906 × 108 |
Variance | 47388342 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15260 | 1 | < 0.1% |
22694 | 1 | < 0.1% |
8568 | 1 | < 0.1% |
25574 | 1 | < 0.1% |
9304 | 1 | < 0.1% |
17695 | 1 | < 0.1% |
23318 | 1 | < 0.1% |
17682 | 1 | < 0.1% |
9182 | 1 | < 0.1% |
8474 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
1001 | 1 | |
1002 | 1 | |
1003 | 1 | |
1004 | 1 | |
1005 | 1 | |
1006 | 1 | |
1008 | 1 | |
1009 | 1 | |
1010 | 1 | |
1011 | 1 |
Value | Count | Frequency (%) |
25990 | 1 | |
25989 | 1 | |
25988 | 1 | |
25784 | 1 | |
25782 | 1 | |
25781 | 1 | |
25752 | 1 | |
25749 | 1 | |
25746 | 1 | |
25740 | 1 |
CRDNT_X(X좌표)
Real number (ℝ)
Distinct | 9979 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.98484 |
Minimum | 126.79835 |
---|---|
Maximum | 127.18027 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 126.79835 |
---|---|
5-th percentile | 126.8438 |
Q1 | 126.9168 |
median | 126.9931 |
Q3 | 127.04987 |
95-th percentile | 127.12252 |
Maximum | 127.18027 |
Range | 0.38191232 |
Interquartile range (IQR) | 0.13307738 |
Descriptive statistics
Standard deviation | 0.084557844 |
---|---|
Coefficient of variation (CV) | 0.00066588931 |
Kurtosis | -0.86908477 |
Mean | 126.98484 |
Median Absolute Deviation (MAD) | 0.067004976 |
Skewness | -0.045195181 |
Sum | 1269848.4 |
Variance | 0.007150029 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.0324578678 | 3 | < 0.1% |
126.8992453681 | 2 | < 0.1% |
126.9375749794 | 2 | < 0.1% |
126.948037 | 2 | < 0.1% |
126.9478121337 | 2 | < 0.1% |
126.9275473964 | 2 | < 0.1% |
126.9760324524 | 2 | < 0.1% |
126.8995348053 | 2 | < 0.1% |
127.056404146 | 2 | < 0.1% |
126.840232155 | 2 | < 0.1% |
Other values (9969) | 9979 |
Value | Count | Frequency (%) |
126.7983534326 | 1 | |
126.7984749196 | 1 | |
126.798649 | 1 | |
126.7986847129 | 1 | |
126.798773 | 1 | |
126.799863985 | 1 | |
126.8000835715 | 1 | |
126.8013423185 | 1 | |
126.8017961457 | 1 | |
126.8019653455 | 1 |
Value | Count | Frequency (%) |
127.1802657501 | 1 | |
127.1800898791 | 1 | |
127.1795399999 | 1 | |
127.1795016106 | 1 | |
127.1783352627 | 1 | |
127.1780401265 | 1 | |
127.1779976114 | 1 | |
127.1779314283 | 1 | |
127.1776973726 | 1 | |
127.1774135685 | 1 |
CRDNT_Y(Y좌표)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9978 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.551092 |
Minimum | 37.43078 |
---|---|
Maximum | 37.781594 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 37.43078 |
---|---|
5-th percentile | 37.471178 |
Q1 | 37.502786 |
median | 37.54995 |
Q3 | 37.592236 |
95-th percentile | 37.648054 |
Maximum | 37.781594 |
Range | 0.35081385 |
Interquartile range (IQR) | 0.089449235 |
Descriptive statistics
Standard deviation | 0.055445342 |
---|---|
Coefficient of variation (CV) | 0.0014765307 |
Kurtosis | -0.78801501 |
Mean | 37.551092 |
Median Absolute Deviation (MAD) | 0.045286304 |
Skewness | 0.25694316 |
Sum | 375510.92 |
Variance | 0.003074186 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.6255057955 | 3 | < 0.1% |
37.5661890393 | 2 | < 0.1% |
37.4941827082 | 2 | < 0.1% |
37.5614344732 | 2 | < 0.1% |
37.490902 | 2 | < 0.1% |
37.5560249673 | 2 | < 0.1% |
37.5781517573 | 2 | < 0.1% |
37.5498210288 | 2 | < 0.1% |
37.520832794 | 2 | < 0.1% |
37.477212 | 2 | < 0.1% |
Other values (9968) | 9979 |
Value | Count | Frequency (%) |
37.430779662 | 1 | |
37.4337190645 | 1 | |
37.4346702667 | 1 | |
37.434793586 | 1 | |
37.4348444186 | 1 | |
37.4349898625 | 1 | |
37.4350042396 | 1 | |
37.4355268028 | 1 | |
37.436857042 | 1 | |
37.437324573 | 1 |
Value | Count | Frequency (%) |
37.7815935083 | 1 | |
37.690199 | 1 | |
37.6899469943 | 1 | |
37.6893523043 | 1 | |
37.6891947508 | 1 | |
37.689128 | 1 | |
37.6890060442 | 1 | |
37.6887853785 | 1 | |
37.6879849018 | 1 | |
37.6879546953 | 1 |
NODE_ID(노드ID) | STTN_NO(정류소ID) | CRDNT_X(X좌표) | CRDNT_Y(Y좌표) | |
---|---|---|---|---|
NODE_ID(노드ID) | 1.000 | 0.986 | 0.901 | 0.735 |
STTN_NO(정류소ID) | 0.986 | 1.000 | 0.904 | 0.734 |
CRDNT_X(X좌표) | 0.901 | 0.904 | 1.000 | 0.437 |
CRDNT_Y(Y좌표) | 0.735 | 0.734 | 0.437 | 1.000 |
NODE_ID(노드ID) | STTN_NO(정류소ID) | CRDNT_X(X좌표) | CRDNT_Y(Y좌표) | |
---|---|---|---|---|
NODE_ID(노드ID) | 1.000 | 0.998 | -0.095 | -0.676 |
STTN_NO(정류소ID) | 0.998 | 1.000 | -0.096 | -0.676 |
CRDNT_X(X좌표) | -0.095 | -0.096 | 1.000 | 0.247 |
CRDNT_Y(Y좌표) | -0.676 | -0.676 | 0.247 | 1.000 |
NODE_ID(노드ID) | STTN_NM(정류소명칭) | STTN_NO(정류소ID) | CRDNT_X(X좌표) | CRDNT_Y(Y좌표) | |
---|---|---|---|---|---|
5977 | 114000157 | 서울남부지방법원.서울남부지방검찰청 | 15260 | 126.865796 | 37.521801 |
9206 | 121000153 | 방배역.방배서리풀이편한세상 | 22229 | 126.99641 | 37.483396 |
3183 | 108900122 | 동성주택 | 9550 | 127.032208 | 37.623052 |
6958 | 116000287 | 거성푸르뫼아파트 | 17306 | 126.843676 | 37.500611 |
1901 | 105900022 | 동아아파트 | 6515 | 127.045485 | 37.574201 |
995 | 103000074 | 천주교성수동성당앞 | 4173 | 127.046912 | 37.53908 |
3422 | 109000035 | 도봉보건소 | 10118 | 127.040057 | 37.657774 |
3218 | 108900103 | 경남아파트 | 9588 | 127.033962 | 37.616613 |
2087 | 106000127 | 동서그랜드맨션 | 7222 | 127.078196 | 37.58618 |
1074 | 103000162 | 뚝섬서울숲 | 4264 | 127.037535 | 37.543533 |
NODE_ID(노드ID) | STTN_NM(정류소명칭) | STTN_NO(정류소ID) | CRDNT_X(X좌표) | CRDNT_Y(Y좌표) | |
---|---|---|---|---|---|
2440 | 107000105 | 석관중고등학교앞 | 8195 | 127.064083 | 37.609297 |
5368 | 113000080 | 서강대학교 | 14171 | 126.93743 | 37.551849 |
1573 | 104900018 | 장신대앞 | 5581 | 127.105198 | 37.547619 |
4376 | 111000057 | 806의무경찰대.우남아파트 | 12145 | 126.906366 | 37.617937 |
975 | 103000054 | 금남시장앞.백범학원터 | 4153 | 127.021885 | 37.548256 |
4333 | 111000929 | 동명여고.천주교불광동성당 | 12020 | 126.923892 | 37.616191 |
3202 | 108900167 | 롯데백화점 | 9570 | 127.030879 | 37.614569 |
3140 | 108900152 | 빨래골 | 9504 | 127.009954 | 37.627385 |
2729 | 107900081 | 풍림106동 | 8584 | 127.022015 | 37.597573 |
1702 | 105000058 | 장안동현대아파트앞 | 6144 | 127.068419 | 37.579746 |