Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 5468 |
Missing cells | 1 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 229.7 KiB |
Average record size in memory | 43.0 B |
Variable types
Numeric | 3 |
---|---|
Text | 2 |
Dataset
Description | 서울교통공사 1-8호선 275개역 5436개(하남선 5개역, 201개 포함) 지하철 시각장애인 음성유도기 설치 위치 정보입니다. 해당 데이터는 연번,호선,역번호,역명,음성유도기 설치 위치로 구성되어 있습니다. |
---|---|
URL | https://www.data.go.kr/data/15100171/fileData.do |
Reproduction
Analysis started | 2023-12-13 00:55:21.892116 |
---|---|
Analysis finished | 2023-12-13 00:55:23.142290 |
Duration | 1.25 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 5468 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2734.5 |
Minimum | 1 |
---|---|
Maximum | 5468 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 48.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 274.35 |
Q1 | 1367.75 |
median | 2734.5 |
Q3 | 4101.25 |
95-th percentile | 5194.65 |
Maximum | 5468 |
Range | 5467 |
Interquartile range (IQR) | 2733.5 |
Descriptive statistics
Standard deviation | 1578.62 |
---|---|
Coefficient of variation (CV) | 0.57729748 |
Kurtosis | -1.2 |
Mean | 2734.5 |
Median Absolute Deviation (MAD) | 1367 |
Skewness | 0 |
Sum | 14952246 |
Variance | 2492041 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
3645 | 1 | < 0.1% |
3653 | 1 | < 0.1% |
3652 | 1 | < 0.1% |
3651 | 1 | < 0.1% |
3650 | 1 | < 0.1% |
3649 | 1 | < 0.1% |
3648 | 1 | < 0.1% |
3647 | 1 | < 0.1% |
3646 | 1 | < 0.1% |
Other values (5458) | 5458 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
5468 | 1 | |
5467 | 1 | |
5466 | 1 | |
5465 | 1 | |
5464 | 1 | |
5463 | 1 | |
5462 | 1 | |
5461 | 1 | |
5460 | 1 | |
5459 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6305779 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 48.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0375911 |
---|---|
Coefficient of variation (CV) | 0.44002955 |
Kurtosis | -1.1115275 |
Mean | 4.6305779 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.12059121 |
Sum | 25320 |
Variance | 4.1517775 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 1145 | |
2 | 963 | |
7 | 885 | |
6 | 718 | |
4 | 602 | |
3 | 496 | |
8 | 373 | 6.8% |
1 | 286 | 5.2% |
Value | Count | Frequency (%) |
1 | 286 | 5.2% |
2 | 963 | |
3 | 496 | |
4 | 602 | |
5 | 1145 | |
6 | 718 | |
7 | 885 | |
8 | 373 | 6.8% |
Value | Count | Frequency (%) |
8 | 373 | 6.8% |
7 | 885 | |
6 | 718 | |
5 | 1145 | |
4 | 602 | |
3 | 496 | |
2 | 963 | |
1 | 286 | 5.2% |
외부역번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 238 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 502.76372 |
Minimum | 126 |
---|---|
Maximum | 2114 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 48.2 KiB |
Quantile statistics
Minimum | 126 |
---|---|
5-th percentile | 133 |
Q1 | 327 |
median | 530 |
Q3 | 644 |
95-th percentile | 815 |
Maximum | 2114 |
Range | 1988 |
Interquartile range (IQR) | 317 |
Descriptive statistics
Standard deviation | 241.93721 |
---|---|
Coefficient of variation (CV) | 0.48121455 |
Kurtosis | 10.912883 |
Mean | 502.76372 |
Median Absolute Deviation (MAD) | 185.5 |
Skewness | 1.7407462 |
Sum | 2749112 |
Variance | 58533.616 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
423 | 52 | 1.0% |
130 | 48 | 0.9% |
208 | 44 | 0.8% |
128 | 44 | 0.8% |
562 | 44 | 0.8% |
564 | 43 | 0.8% |
425 | 41 | 0.7% |
329 | 41 | 0.7% |
561 | 41 | 0.7% |
421 | 40 | 0.7% |
Other values (228) | 5030 |
Value | Count | Frequency (%) |
126 | 22 | |
127 | 37 | |
128 | 44 | |
129 | 32 | |
130 | 48 | |
131 | 38 | |
132 | 32 | |
133 | 33 | |
202 | 38 | |
203 | 32 |
Value | Count | Frequency (%) |
2114 | 17 | |
2113 | 19 | |
827 | 19 | |
826 | 13 | |
825 | 14 | |
824 | 25 | |
823 | 18 | |
822 | 13 | |
821 | 31 | |
820 | 14 |
역명
Text
Distinct | 220 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 42.8 KiB |
Value | Count | Frequency (%) |
1 | 113 | 1.9% |
종로3가 | 111 | 1.9% |
3 | 109 | 1.9% |
을지로3가 | 71 | 1.2% |
2 | 67 | 1.2% |
왕십리 | 65 | 1.1% |
잠실 | 62 | 1.1% |
동묘앞 | 59 | 1.0% |
불광 | 59 | 1.0% |
고속터미널 | 58 | 1.0% |
Other values (210) | 5045 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 740 | 3.9% |
) | 567 | 3.0% |
( | 567 | 3.0% |
구 | 529 | 2.8% |
동 | 486 | 2.6% |
로 | 432 | 2.3% |
문 | 392 | 2.1% |
384 | 2.0% | |
지 | 380 | 2.0% |
신 | 375 | 2.0% |
Other values (193) | 14054 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 16554 | |
Decimal Number | 834 | 4.4% |
Close Punctuation | 567 | 3.0% |
Open Punctuation | 567 | 3.0% |
Space Separator | 384 | 2.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 740 | 4.5% |
구 | 529 | 3.2% |
동 | 486 | 2.9% |
로 | 432 | 2.6% |
문 | 392 | 2.4% |
지 | 380 | 2.3% |
신 | 375 | 2.3% |
가 | 363 | 2.2% |
산 | 349 | 2.1% |
입 | 328 | 2.0% |
Other values (185) | 12180 |
Decimal Number
Value | Count | Frequency (%) |
3 | 291 | |
4 | 231 | |
1 | 157 | |
2 | 113 | 13.5% |
5 | 42 | 5.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 567 |
Open Punctuation
Value | Count | Frequency (%) |
( | 567 |
Space Separator
Value | Count | Frequency (%) |
384 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 16554 | |
Common | 2352 | 12.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 740 | 4.5% |
구 | 529 | 3.2% |
동 | 486 | 2.9% |
로 | 432 | 2.6% |
문 | 392 | 2.4% |
지 | 380 | 2.3% |
신 | 375 | 2.3% |
가 | 363 | 2.2% |
산 | 349 | 2.1% |
입 | 328 | 2.0% |
Other values (185) | 12180 |
Common
Value | Count | Frequency (%) |
) | 567 | |
( | 567 | |
384 | ||
3 | 291 | |
4 | 231 | |
1 | 157 | 6.7% |
2 | 113 | 4.8% |
5 | 42 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 16554 | |
ASCII | 2352 | 12.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 740 | 4.5% |
구 | 529 | 3.2% |
동 | 486 | 2.9% |
로 | 432 | 2.6% |
문 | 392 | 2.4% |
지 | 380 | 2.3% |
신 | 375 | 2.3% |
가 | 363 | 2.2% |
산 | 349 | 2.1% |
입 | 328 | 2.0% |
Other values (185) | 12180 |
ASCII
Value | Count | Frequency (%) |
) | 567 | |
( | 567 | |
384 | ||
3 | 291 | |
4 | 231 | |
1 | 157 | 6.7% |
2 | 113 | 4.8% |
5 | 42 | 1.8% |
설치위치
Text
Distinct | 3818 |
---|---|
Distinct (%) | 69.8% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 42.8 KiB |
Length
Max length | 41 |
---|---|
Median length | 32 |
Mean length | 12.67752 |
Min length | 3 |
Characters and Unicode
Total characters | 69308 |
---|---|
Distinct characters | 334 |
Distinct categories | 12 ? |
Distinct scripts | 4 ? |
Distinct blocks | 5 ? |
Unique
Unique | 3409 ? |
---|---|
Unique (%) | 62.4% |
Sample
1st row | 4호선 환승통로입구 |
---|---|
2nd row | 서울역쪽 개표소 |
3rd row | 서울역쪽 발매기 앞 |
4th row | 역무실 앞 발매기 앞기둥 |
5th row | 역무실 앞 중간개표소(내부) |
Value | Count | Frequency (%) |
앞 | 1459 | 7.3% |
출구 | 969 | 4.9% |
계단 | 931 | 4.7% |
승강장 | 824 | 4.1% |
상선 | 722 | 3.6% |
하선 | 683 | 3.4% |
e/v | 674 | 3.4% |
내부 | 571 | 2.9% |
위 | 468 | 2.3% |
e/s | 403 | 2.0% |
Other values (1422) | 12217 |
Most occurring characters
Value | Count | Frequency (%) |
14527 | 21.0% | |
선 | 2197 | 3.2% |
구 | 2119 | 3.1% |
번 | 2084 | 3.0% |
1 | 2000 | 2.9% |
앞 | 1865 | 2.7% |
E | 1592 | 2.3% |
- | 1509 | 2.2% |
출 | 1488 | 2.1% |
/ | 1451 | 2.1% |
Other values (324) | 38476 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 36574 | |
Space Separator | 14527 | 21.0% |
Decimal Number | 8050 | 11.6% |
Uppercase Letter | 3981 | 5.7% |
Other Punctuation | 2365 | 3.4% |
Dash Punctuation | 1509 | 2.2% |
Close Punctuation | 1116 | 1.6% |
Open Punctuation | 1114 | 1.6% |
Math Symbol | 49 | 0.1% |
Lowercase Letter | 16 | < 0.1% |
Other values (2) | 7 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
선 | 2197 | 6.0% |
구 | 2119 | 5.8% |
번 | 2084 | 5.7% |
앞 | 1865 | 5.1% |
출 | 1488 | 4.1% |
단 | 1356 | 3.7% |
내 | 1343 | 3.7% |
장 | 1342 | 3.7% |
계 | 1304 | 3.6% |
하 | 1141 | 3.1% |
Other values (267) | 20335 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 1592 | |
V | 847 | |
S | 554 | 13.9% |
B | 254 | 6.4% |
I | 237 | 6.0% |
L | 123 | 3.1% |
C | 98 | 2.5% |
A | 54 | 1.4% |
F | 45 | 1.1% |
G | 38 | 1.0% |
Other values (10) | 139 | 3.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 2000 | |
2 | 1408 | |
4 | 1294 | |
3 | 1188 | |
5 | 617 | 7.7% |
6 | 443 | 5.5% |
7 | 401 | 5.0% |
8 | 329 | 4.1% |
9 | 187 | 2.3% |
0 | 183 | 2.3% |
Lowercase Letter
Value | Count | Frequency (%) |
o | 2 | |
x | 2 | |
e | 2 | |
v | 2 | |
t | 2 | |
a | 2 | |
b | 1 | |
s | 1 | |
i | 1 | |
j | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1451 | |
, | 887 | |
# | 21 | 0.9% |
. | 5 | 0.2% |
& | 1 | < 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 2 | |
Ⅲ | 2 | |
Ⅱ | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1109 | |
] | 7 | 0.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1107 | |
[ | 7 | 0.6% |
Math Symbol
Value | Count | Frequency (%) |
~ | 46 | |
> | 3 | 6.1% |
Space Separator
Value | Count | Frequency (%) |
14527 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1509 |
Other Symbol
Value | Count | Frequency (%) |
○ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 36534 | |
Common | 28731 | |
Latin | 4003 | 5.8% |
Han | 40 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
선 | 2197 | 6.0% |
구 | 2119 | 5.8% |
번 | 2084 | 5.7% |
앞 | 1865 | 5.1% |
출 | 1488 | 4.1% |
단 | 1356 | 3.7% |
내 | 1343 | 3.7% |
장 | 1342 | 3.7% |
계 | 1304 | 3.6% |
하 | 1141 | 3.1% |
Other values (265) | 20295 |
Latin
Value | Count | Frequency (%) |
E | 1592 | |
V | 847 | |
S | 554 | 13.8% |
B | 254 | 6.3% |
I | 237 | 5.9% |
L | 123 | 3.1% |
C | 98 | 2.4% |
A | 54 | 1.3% |
F | 45 | 1.1% |
G | 38 | 0.9% |
Other values (23) | 161 | 4.0% |
Common
Value | Count | Frequency (%) |
14527 | ||
1 | 2000 | 7.0% |
- | 1509 | 5.3% |
/ | 1451 | 5.1% |
2 | 1408 | 4.9% |
4 | 1294 | 4.5% |
3 | 1188 | 4.1% |
) | 1109 | 3.9% |
( | 1107 | 3.9% |
, | 887 | 3.1% |
Other values (14) | 2251 | 7.8% |
Han
Value | Count | Frequency (%) |
下 | 23 | |
上 | 17 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 36534 | |
ASCII | 32727 | |
CJK | 40 | 0.1% |
Number Forms | 6 | < 0.1% |
Geometric Shapes | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
14527 | ||
1 | 2000 | 6.1% |
E | 1592 | 4.9% |
- | 1509 | 4.6% |
/ | 1451 | 4.4% |
2 | 1408 | 4.3% |
4 | 1294 | 4.0% |
3 | 1188 | 3.6% |
) | 1109 | 3.4% |
( | 1107 | 3.4% |
Other values (43) | 5542 | 16.9% |
Hangul
Value | Count | Frequency (%) |
선 | 2197 | 6.0% |
구 | 2119 | 5.8% |
번 | 2084 | 5.7% |
앞 | 1865 | 5.1% |
출 | 1488 | 4.1% |
단 | 1356 | 3.7% |
내 | 1343 | 3.7% |
장 | 1342 | 3.7% |
계 | 1304 | 3.6% |
하 | 1141 | 3.1% |
Other values (265) | 20295 |
CJK
Value | Count | Frequency (%) |
下 | 23 | |
上 | 17 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 2 | |
Ⅲ | 2 | |
Ⅱ | 2 |
Geometric Shapes
Value | Count | Frequency (%) |
○ | 1 |
연번 | 호선 | 외부역번호 | |
---|---|---|---|
연번 | 1.000 | 0.929 | 0.981 |
호선 | 0.929 | 1.000 | 0.860 |
외부역번호 | 0.981 | 0.860 | 1.000 |
연번 | 호선 | 외부역번호 | |
---|---|---|---|
연번 | 1.000 | 0.988 | 0.974 |
호선 | 0.988 | 1.000 | 0.960 |
외부역번호 | 0.974 | 0.960 | 1.000 |
연번 | 호선 | 외부역번호 | 역명 | 설치위치 | |
---|---|---|---|---|---|
0 | 1 | 1 | 133 | 서울역 (1) | 4호선 환승통로입구 |
1 | 2 | 1 | 133 | 서울역 (1) | 서울역쪽 개표소 |
2 | 3 | 1 | 133 | 서울역 (1) | 서울역쪽 발매기 앞 |
3 | 4 | 1 | 133 | 서울역 (1) | 역무실 앞 발매기 앞기둥 |
4 | 5 | 1 | 133 | 서울역 (1) | 역무실 앞 중간개표소(내부) |
5 | 6 | 1 | 133 | 서울역 (1) | 2번 출구 (내부) |
6 | 7 | 1 | 133 | 서울역 (1) | 2번 입구 (외부) |
7 | 8 | 1 | 133 | 서울역 (1) | E/V 앞 2번 출구 하단 |
8 | 9 | 1 | 133 | 서울역 (1) | E/V 앞 2번 출구 상단 |
9 | 10 | 1 | 133 | 서울역 (1) | 대합실에서 승강장가는 E/V 앞기둥 |
연번 | 호선 | 외부역번호 | 역명 | 설치위치 | |
---|---|---|---|---|---|
5458 | 5459 | 8 | 827 | 모란 | 분당선 환승통로 |
5459 | 5460 | 8 | 827 | 모란 | 상선 승강장 가는 계단 |
5460 | 5461 | 8 | 827 | 모란 | 상선 2-4 |
5461 | 5462 | 8 | 827 | 모란 | 하선 5-1 |
5462 | 5463 | 8 | 827 | 모란 | 상선 4-3 |
5463 | 5464 | 8 | 827 | 모란 | 상선 5-3 (E/V앞) |
5464 | 5465 | 8 | 827 | 모란 | 하선 3-2 |
5465 | 5466 | 8 | 827 | 모란 | 하선 2-3 (E/V앞) |
5466 | 5467 | 8 | 827 | 모란 | 상선 6-4 |
5467 | 5468 | 8 | 827 | 모란 | 하선 1-1 |