Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 8334 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 9 |
Duplicate rows (%) | 0.1% |
Total size in memory | 268.7 KiB |
Average record size in memory | 33.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 2 |
Categorical | 1 |
Dataset
Description | 역코드(내부),역코드(외부),출구번호,건물명 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-15993/S/1/datasetView.do |
Dataset has 9 (0.1%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2024-04-06 12:52:22.070858 |
---|---|
Analysis finished | 2024-04-06 12:52:23.586035 |
Duration | 1.52 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
역코드(내부)
Real number (ℝ)
Distinct | 439 |
---|---|
Distinct (%) | 5.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1703.6866 |
Minimum | 150 |
---|---|
Maximum | 4138 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 73.4 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 206 |
Q1 | 332 |
median | 1956 |
Q3 | 2634 |
95-th percentile | 3110 |
Maximum | 4138 |
Range | 3988 |
Interquartile range (IQR) | 2302 |
Descriptive statistics
Standard deviation | 1151.3368 |
---|---|
Coefficient of variation (CV) | 0.67579146 |
Kurtosis | -1.363701 |
Mean | 1703.6866 |
Median Absolute Deviation (MAD) | 791 |
Skewness | -0.10557137 |
Sum | 14198524 |
Variance | 1325576.5 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
433 | 70 | 0.8% |
2533 | 53 | 0.6% |
2621 | 48 | 0.6% |
2534 | 48 | 0.6% |
318 | 48 | 0.6% |
203 | 46 | 0.6% |
151 | 44 | 0.5% |
2511 | 44 | 0.5% |
317 | 43 | 0.5% |
2731 | 43 | 0.5% |
Other values (429) | 7847 |
Value | Count | Frequency (%) |
150 | 27 | |
151 | 44 | |
152 | 25 | |
153 | 25 | |
154 | 27 | |
155 | 30 | |
156 | 22 | |
157 | 19 | |
158 | 15 | 0.2% |
159 | 9 | 0.1% |
Value | Count | Frequency (%) |
4138 | 15 | |
4137 | 10 | 0.1% |
4136 | 20 | |
4135 | 14 | |
4134 | 16 | |
4133 | 18 | |
4132 | 25 | |
4131 | 29 | |
4130 | 4 | < 0.1% |
4129 | 23 |
역코드(외부)
Text
Distinct | 439 |
---|---|
Distinct (%) | 5.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 65.2 KiB |
Value | Count | Frequency (%) |
433 | 70 | 0.8% |
532 | 53 | 0.6% |
328 | 48 | 0.6% |
620 | 48 | 0.6% |
533 | 48 | 0.6% |
203 | 46 | 0.6% |
510 | 44 | 0.5% |
132 | 44 | 0.5% |
729 | 43 | 0.5% |
327 | 43 | 0.5% |
Other values (429) | 7847 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 4587 | |
1 | 4048 | |
3 | 3770 | |
4 | 2990 | |
5 | 2807 | |
6 | 1805 | 6.9% |
7 | 1795 | 6.9% |
0 | 1226 | 4.7% |
8 | 1151 | 4.4% |
9 | 933 | 3.6% |
Other values (4) | 949 | 3.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 25112 | |
Uppercase Letter | 839 | 3.2% |
Dash Punctuation | 110 | 0.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 4587 | |
1 | 4048 | |
3 | 3770 | |
4 | 2990 | |
5 | 2807 | |
6 | 1805 | 7.2% |
7 | 1795 | 7.1% |
0 | 1226 | 4.9% |
8 | 1151 | 4.6% |
9 | 933 | 3.7% |
Uppercase Letter
Value | Count | Frequency (%) |
K | 348 | |
P | 309 | |
I | 182 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 110 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 25222 | |
Latin | 839 | 3.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 4587 | |
1 | 4048 | |
3 | 3770 | |
4 | 2990 | |
5 | 2807 | |
6 | 1805 | 7.2% |
7 | 1795 | 7.1% |
0 | 1226 | 4.9% |
8 | 1151 | 4.6% |
9 | 933 | 3.7% |
Latin
Value | Count | Frequency (%) |
K | 348 | |
P | 309 | |
I | 182 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 26061 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 4587 | |
1 | 4048 | |
3 | 3770 | |
4 | 2990 | |
5 | 2807 | |
6 | 1805 | 6.9% |
7 | 1795 | 6.9% |
0 | 1226 | 4.7% |
8 | 1151 | 4.4% |
9 | 933 | 3.6% |
Other values (4) | 949 | 3.6% |
출구번호
Categorical
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 65.2 KiB |
1 | |
---|---|
2 | |
3 | |
4 | |
5 | |
Other values (21) |
Length
Max length | 8 |
---|---|
Median length | 1 |
Mean length | 1.0517159 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 1839 | |
2 | 1543 | |
3 | 1304 | |
4 | 1136 | |
5 | 649 | 7.8% |
6 | 592 | 7.1% |
7 | 415 | 5.0% |
8 | 315 | 3.8% |
9 | 152 | 1.8% |
10 | 129 | 1.5% |
Other values (16) | 260 | 3.1% |
Length
Value | Count | Frequency (%) |
1 | 1839 | |
2 | 1543 | |
3 | 1304 | |
4 | 1136 | |
5 | 649 | 7.8% |
6 | 592 | 7.1% |
7 | 415 | 5.0% |
8 | 315 | 3.8% |
9 | 152 | 1.8% |
10 | 129 | 1.5% |
Other values (16) | 260 | 3.1% |
건물명
Text
Distinct | 6503 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 65.2 KiB |
Value | Count | Frequency (%) |
방면 | 152 | 1.6% |
주민센터 | 111 | 1.1% |
아파트 | 47 | 0.5% |
우체국 | 35 | 0.4% |
기업은행 | 30 | 0.3% |
청계천 | 30 | 0.3% |
국민건강보험공단 | 21 | 0.2% |
현대아파트 | 21 | 0.2% |
서울 | 18 | 0.2% |
우리은행 | 16 | 0.2% |
Other values (6707) | 9215 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 2016 | 3.6% |
교 | 1896 | 3.3% |
학 | 1655 | 2.9% |
1369 | 2.4% | |
서 | 1101 | 1.9% |
등 | 1027 | 1.8% |
파 | 980 | 1.7% |
원 | 936 | 1.6% |
대 | 875 | 1.5% |
아 | 855 | 1.5% |
Other values (659) | 44069 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 52328 | |
Decimal Number | 1582 | 2.8% |
Space Separator | 1369 | 2.4% |
Other Punctuation | 419 | 0.7% |
Uppercase Letter | 384 | 0.7% |
Close Punctuation | 291 | 0.5% |
Open Punctuation | 290 | 0.5% |
Lowercase Letter | 90 | 0.2% |
Dash Punctuation | 10 | < 0.1% |
Math Symbol | 9 | < 0.1% |
Other values (4) | 7 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2016 | 3.9% |
교 | 1896 | 3.6% |
학 | 1655 | 3.2% |
서 | 1101 | 2.1% |
등 | 1027 | 2.0% |
파 | 980 | 1.9% |
원 | 936 | 1.8% |
대 | 875 | 1.7% |
아 | 855 | 1.6% |
지 | 852 | 1.6% |
Other values (586) | 40135 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 64 | |
T | 47 | |
C | 37 | |
S | 25 | 6.5% |
B | 25 | 6.5% |
G | 24 | 6.2% |
I | 23 | 6.0% |
A | 19 | 4.9% |
E | 17 | 4.4% |
D | 16 | 4.2% |
Other values (13) | 87 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 42 | |
e | 13 | 14.4% |
k | 5 | 5.6% |
c | 4 | 4.4% |
n | 3 | 3.3% |
i | 3 | 3.3% |
t | 3 | 3.3% |
b | 2 | 2.2% |
u | 2 | 2.2% |
h | 2 | 2.2% |
Other values (8) | 11 | 12.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 528 | |
2 | 339 | |
3 | 199 | 12.6% |
4 | 135 | 8.5% |
5 | 95 | 6.0% |
0 | 94 | 5.9% |
9 | 67 | 4.2% |
6 | 53 | 3.4% |
7 | 49 | 3.1% |
8 | 23 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 249 | |
. | 97 | 23.2% |
? | 45 | 10.7% |
/ | 19 | 4.5% |
& | 6 | 1.4% |
: | 2 | 0.5% |
@ | 1 | 0.2% |
Other Number
Value | Count | Frequency (%) |
③ | 1 | |
⑥ | 1 | |
⑦ | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 278 | |
] | 13 | 4.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 277 | |
[ | 13 | 4.5% |
Math Symbol
Value | Count | Frequency (%) |
~ | 8 | |
+ | 1 | 11.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 1 | |
Ⅳ | 1 |
Space Separator
Value | Count | Frequency (%) |
1369 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 52328 | |
Common | 3974 | 7.0% |
Latin | 476 | 0.8% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2016 | 3.9% |
교 | 1896 | 3.6% |
학 | 1655 | 3.2% |
서 | 1101 | 2.1% |
등 | 1027 | 2.0% |
파 | 980 | 1.9% |
원 | 936 | 1.8% |
대 | 875 | 1.7% |
아 | 855 | 1.6% |
지 | 852 | 1.6% |
Other values (586) | 40135 |
Latin
Value | Count | Frequency (%) |
K | 64 | |
T | 47 | 9.9% |
m | 42 | 8.8% |
C | 37 | 7.8% |
S | 25 | 5.3% |
B | 25 | 5.3% |
G | 24 | 5.0% |
I | 23 | 4.8% |
A | 19 | 4.0% |
E | 17 | 3.6% |
Other values (33) | 153 |
Common
Value | Count | Frequency (%) |
1369 | ||
1 | 528 | 13.3% |
2 | 339 | 8.5% |
) | 278 | 7.0% |
( | 277 | 7.0% |
, | 249 | 6.3% |
3 | 199 | 5.0% |
4 | 135 | 3.4% |
. | 97 | 2.4% |
5 | 95 | 2.4% |
Other values (19) | 408 | 10.3% |
Han
Value | Count | Frequency (%) |
內 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 52321 | |
ASCII | 4445 | 7.8% |
Compat Jamo | 6 | < 0.1% |
Enclosed Alphanum | 3 | < 0.1% |
Number Forms | 2 | < 0.1% |
None | 1 | < 0.1% |
CJK | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 2016 | 3.9% |
교 | 1896 | 3.6% |
학 | 1655 | 3.2% |
서 | 1101 | 2.1% |
등 | 1027 | 2.0% |
파 | 980 | 1.9% |
원 | 936 | 1.8% |
대 | 875 | 1.7% |
아 | 855 | 1.6% |
지 | 852 | 1.6% |
Other values (584) | 40128 |
ASCII
Value | Count | Frequency (%) |
1369 | ||
1 | 528 | 11.9% |
2 | 339 | 7.6% |
) | 278 | 6.3% |
( | 277 | 6.2% |
, | 249 | 5.6% |
3 | 199 | 4.5% |
4 | 135 | 3.0% |
. | 97 | 2.2% |
5 | 95 | 2.1% |
Other values (57) | 879 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 6 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 1 | |
Ⅳ | 1 |
None
Value | Count | Frequency (%) |
㈜ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
③ | 1 | |
⑥ | 1 | |
⑦ | 1 |
CJK
Value | Count | Frequency (%) |
內 | 1 |
역코드(내부) | 출구번호 | |
---|---|---|
역코드(내부) | 1.000 | 0.227 |
출구번호 | 0.227 | 1.000 |
역코드(내부) | 출구번호 | |
---|---|---|
역코드(내부) | 1.000 | 0.092 |
출구번호 | 0.092 | 1.000 |
역코드(내부) | 역코드(외부) | 출구번호 | 건물명 | |
---|---|---|---|---|
0 | 150 | 133 | 1 | 동자동 |
1 | 150 | 133 | 1 | 서울시티투어버스 타는 곳 |
2 | 150 | 133 | 1 | 서울역 |
3 | 150 | 133 | 2 | 역전우체국 |
4 | 150 | 133 | 2 | 경의선 서울역 |
5 | 150 | 133 | 2 | 역전파출소 |
6 | 150 | 133 | 2 | 문화역서울284 |
7 | 150 | 133 | 2 | 서울로 |
8 | 150 | 133 | 2 | 국민권익위원회 |
9 | 150 | 133 | 3 | 한국주택금융공사 |
역코드(내부) | 역코드(외부) | 출구번호 | 건물명 | |
---|---|---|---|---|
8324 | 4138 | 938 | 1 | 서울한산초등학교 |
8325 | 4138 | 938 | 1 | 한산중학교 |
8326 | 4138 | 938 | 1 | 중앙보훈병원후문 |
8327 | 4138 | 938 | 2 | 중앙보훈병원 |
8328 | 4138 | 938 | 2 | 생태공원앞교차로 방면 |
8329 | 4138 | 938 | 3 | 일자산제1체육관 |
8330 | 4138 | 938 | 3 | 일자산 |
8331 | 4138 | 938 | 3 | 일자산도시자연공원(잔디광장) |
8332 | 4138 | 938 | 3 | 강동구도시농업공원 |
8333 | 4138 | 938 | 3 | 일자산허브천문공원 방면 |
Most frequently occurring
역코드(내부) | 역코드(외부) | 출구번호 | 건물명 | # duplicates | |
---|---|---|---|---|---|
0 | 224 | 224 | 1 | 대한법률구조공단 | 2 |
1 | 228 | 228 | 3 | 영락고등학교 | 2 |
2 | 419 | 419 | 3 | 한성대학교 | 2 |
3 | 1452 | 437 | 2 | 서울랜드 | 2 |
4 | 1452 | 437 | 3 | 국립현대미술관 | 2 |
5 | 2561 | P555 | 1 | 하늘산성교회 | 2 |
6 | 2716 | 714 | 6 | 상계주공1?2단지 | 2 |
7 | 2814 | 813 | 2 | 방이복지관 | 2 |
8 | 3117 | I117 | 1 | 북부소방서 | 2 |