Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 27 |
Missing cells | 113 |
Missing cells (%) | 27.9% |
Duplicate rows | 3 |
Duplicate rows (%) | 11.1% |
Total size in memory | 3.3 KiB |
Average record size in memory | 124.9 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Unsupported | 11 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-13231/F/1/datasetView.do |
Dataset has 3 (11.1%) duplicate rows | Duplicates |
시 설 명 has 8 (29.6%) missing values | Missing |
Unnamed: 1 has 21 (77.8%) missing values | Missing |
Unnamed: 2 has 24 (88.9%) missing values | Missing |
계 has 7 (25.9%) missing values | Missing |
1~4호선 has 6 (22.2%) missing values | Missing |
Unnamed: 6 has 6 (22.2%) missing values | Missing |
Unnamed: 7 has 6 (22.2%) missing values | Missing |
Unnamed: 8 has 5 (18.5%) missing values | Missing |
Unnamed: 9 has 6 (22.2%) missing values | Missing |
5~8호선 has 6 (22.2%) missing values | Missing |
Unnamed: 11 has 6 (22.2%) missing values | Missing |
Unnamed: 12 has 6 (22.2%) missing values | Missing |
Unnamed: 13 has 2 (7.4%) missing values | Missing |
Unnamed: 14 has 4 (14.8%) missing values | Missing |
계 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
1~4호선 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
5~8호선 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-29 16:47:17.075716 |
---|---|
Analysis finished | 2024-04-29 16:47:19.007162 |
Duration | 1.93 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시 설 명
Text
MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 94.7% |
Missing | 8 |
Missing (%) | 29.6% |
Memory size | 348.0 B |
Value | Count | Frequency (%) |
본선/측선 | 2 | 10.5% |
콘크리트도상 | 1 | 5.3% |
b2s | 1 | 5.3% |
신축이음매 | 1 | 5.3% |
차량기지 | 1 | 5.3% |
장비유치선 | 1 | 5.3% |
구간 | 1 | 5.3% |
장치 | 1 | 5.3% |
체결 | 1 | 5.3% |
방진 | 1 | 5.3% |
Other values (8) | 8 |
Most occurring characters
Value | Count | Frequency (%) |
선 | 7 | 9.2% |
기 | 6 | 7.9% |
장 | 4 | 5.3% |
) | 3 | 3.9% |
도 | 3 | 3.9% |
( | 3 | 3.9% |
2 | 2 | 2.6% |
연 | 2 | 2.6% |
유 | 2 | 2.6% |
본 | 2 | 2.6% |
Other values (36) | 42 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 59 | |
Decimal Number | 5 | 6.6% |
Close Punctuation | 3 | 3.9% |
Open Punctuation | 3 | 3.9% |
Uppercase Letter | 3 | 3.9% |
Other Punctuation | 2 | 2.6% |
Math Symbol | 1 | 1.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
선 | 7 | 11.9% |
기 | 6 | 10.2% |
장 | 4 | 6.8% |
도 | 3 | 5.1% |
연 | 2 | 3.4% |
유 | 2 | 3.4% |
본 | 2 | 3.4% |
최 | 2 | 3.4% |
치 | 2 | 3.4% |
곡 | 2 | 3.4% |
Other values (26) | 27 |
Decimal Number
Value | Count | Frequency (%) |
2 | 2 | |
0 | 2 | |
1 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1 | |
R | 1 | |
S | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2 |
Math Symbol
Value | Count | Frequency (%) |
< | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 59 | |
Common | 14 | 18.4% |
Latin | 3 | 3.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
선 | 7 | 11.9% |
기 | 6 | 10.2% |
장 | 4 | 6.8% |
도 | 3 | 5.1% |
연 | 2 | 3.4% |
유 | 2 | 3.4% |
본 | 2 | 3.4% |
최 | 2 | 3.4% |
치 | 2 | 3.4% |
곡 | 2 | 3.4% |
Other values (26) | 27 |
Common
Value | Count | Frequency (%) |
) | 3 | |
( | 3 | |
2 | 2 | |
0 | 2 | |
/ | 2 | |
1 | 1 | 7.1% |
< | 1 | 7.1% |
Latin
Value | Count | Frequency (%) |
B | 1 | |
R | 1 | |
S | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 59 | |
ASCII | 17 | 22.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
선 | 7 | 11.9% |
기 | 6 | 10.2% |
장 | 4 | 6.8% |
도 | 3 | 5.1% |
연 | 2 | 3.4% |
유 | 2 | 3.4% |
본 | 2 | 3.4% |
최 | 2 | 3.4% |
치 | 2 | 3.4% |
곡 | 2 | 3.4% |
Other values (26) | 27 |
ASCII
Value | Count | Frequency (%) |
) | 3 | |
( | 3 | |
2 | 2 | |
0 | 2 | |
/ | 2 | |
B | 1 | 5.9% |
1 | 1 | 5.9% |
< | 1 | 5.9% |
R | 1 | 5.9% |
S | 1 | 5.9% |
Unnamed: 1
Text
MISSING
 
Distinct | 4 |
---|---|
Distinct (%) | 66.7% |
Missing | 21 |
Missing (%) | 77.8% |
Memory size | 348.0 B |
Value | Count | Frequency (%) |
연장 | 2 | |
m | 2 | |
구 | 2 | |
간 | 2 | |
곡선반경 | 1 | |
r | 1 | |
기울기 | 1 | |
‰ | 1 |
Most occurring characters
Value | Count | Frequency (%) |
6 | ||
( | 4 | |
) | 4 | |
연 | 2 | 6.1% |
장 | 2 | 6.1% |
m | 2 | 6.1% |
구 | 2 | 6.1% |
간 | 2 | 6.1% |
기 | 2 | 6.1% |
곡 | 1 | 3.0% |
Other values (6) | 6 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 15 | |
Space Separator | 6 | 18.2% |
Open Punctuation | 4 | 12.1% |
Close Punctuation | 4 | 12.1% |
Lowercase Letter | 2 | 6.1% |
Uppercase Letter | 1 | 3.0% |
Other Punctuation | 1 | 3.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
연 | 2 | |
장 | 2 | |
구 | 2 | |
간 | 2 | |
기 | 2 | |
곡 | 1 | |
선 | 1 | |
반 | 1 | |
경 | 1 | |
울 | 1 |
Space Separator
Value | Count | Frequency (%) |
6 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 2 |
Uppercase Letter
Value | Count | Frequency (%) |
R | 1 |
Other Punctuation
Value | Count | Frequency (%) |
‰ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 15 | |
Hangul | 15 | |
Latin | 3 | 9.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
연 | 2 | |
장 | 2 | |
구 | 2 | |
간 | 2 | |
기 | 2 | |
곡 | 1 | |
선 | 1 | |
반 | 1 | |
경 | 1 | |
울 | 1 |
Common
Value | Count | Frequency (%) |
6 | ||
( | 4 | |
) | 4 | |
‰ | 1 | 6.7% |
Latin
Value | Count | Frequency (%) |
m | 2 | |
R | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 17 | |
Hangul | 15 | |
Punctuation | 1 | 3.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6 | ||
( | 4 | |
) | 4 | |
m | 2 | 11.8% |
R | 1 | 5.9% |
Hangul
Value | Count | Frequency (%) |
연 | 2 | |
장 | 2 | |
구 | 2 | |
간 | 2 | |
기 | 2 | |
곡 | 1 | |
선 | 1 | |
반 | 1 | |
경 | 1 | |
울 | 1 |
Punctuation
Value | Count | Frequency (%) |
‰ | 1 |
Unnamed: 2
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | 100.0% |
Missing | 24 |
Missing (%) | 88.9% |
Memory size | 348.0 B |
Value | Count | Frequency (%) |
alt-ⅰ | 1 | |
alt-ⅱ | 1 | |
dff14 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
A | 2 | |
l | 2 | |
t | 2 | |
- | 2 | |
F | 2 | |
Ⅰ | 1 | |
Ⅱ | 1 | |
D | 1 | |
1 | 1 | |
4 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 5 | |
Lowercase Letter | 4 | |
Dash Punctuation | 2 | 13.3% |
Letter Number | 2 | 13.3% |
Decimal Number | 2 | 13.3% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 2 | |
F | 2 | |
D | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
l | 2 | |
t | 2 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 1 | |
Ⅱ | 1 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 | |
4 | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 11 | |
Common | 4 | 26.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 2 | |
l | 2 | |
t | 2 | |
F | 2 | |
Ⅰ | 1 | |
Ⅱ | 1 | |
D | 1 |
Common
Value | Count | Frequency (%) |
- | 2 | |
1 | 1 | |
4 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13 | |
Number Forms | 2 | 13.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 2 | |
l | 2 | |
t | 2 | |
- | 2 | |
F | 2 | |
D | 1 | |
1 | 1 | |
4 | 1 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 1 | |
Ⅱ | 1 |
단위
Categorical
Distinct | 9 |
---|---|
Distinct (%) | 33.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 348.0 B |
<NA> | |
---|---|
km | |
m | |
개 | |
개소 | |
Other values (4) |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.2592593 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 11.1% |
Sample
1st row | <NA> |
---|---|
2nd row | km |
3rd row | <NA> |
4th row | km |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9 | |
km | 4 | |
m | 3 | 11.1% |
개 | 3 | 11.1% |
개소 | 3 | 11.1% |
- | 2 | 7.4% |
‰ | 1 | 3.7% |
틀 | 1 | 3.7% |
대 | 1 | 3.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9 | |
km | 4 | |
m | 3 | 11.1% |
개 | 3 | 11.1% |
개소 | 3 | 11.1% |
2 | 7.4% | |
‰ | 1 | 3.7% |
틀 | 1 | 3.7% |
대 | 1 | 3.7% |
계
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 7 |
---|---|
Missing (%) | 25.9% |
Memory size | 348.0 B |
1~4호선
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
Unnamed: 6
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 5 |
---|---|
Missing (%) | 18.5% |
Memory size | 348.0 B |
Unnamed: 9
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
5~8호선
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
Unnamed: 11
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
Unnamed: 12
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6 |
---|---|
Missing (%) | 22.2% |
Memory size | 348.0 B |
Unnamed: 13
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | 7.4% |
Memory size | 348.0 B |
Unnamed: 14
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 4 |
---|---|
Missing (%) | 14.8% |
Memory size | 348.0 B |
시 설 명 | Unnamed: 1 | Unnamed: 2 | 단위 | |
---|---|---|---|---|
시 설 명 | 1.000 | 1.000 | 1.000 | 1.000 |
Unnamed: 1 | 1.000 | 1.000 | NaN | 1.000 |
Unnamed: 2 | 1.000 | NaN | 1.000 | NaN |
단위 | 1.000 | 1.000 | NaN | 1.000 |
시 설 명 | Unnamed: 1 | Unnamed: 2 | 단위 | 계 | 1~4호선 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | 5~8호선 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | NaN | 소계 | 1호선 | 2호선 | 3호선 | 4호선 | 소계 | 5호선 | 6호선 | 7호선 | 8호선 |
1 | 궤도연장 | <NA> | <NA> | km | 618.098 | 284.129 | 18.831 | 122.711 | 77.983 | 64.604 | 333.969 | 109.399 | 67.435 | 118.091 | 39.044 |
2 | (본선/측선) | <NA> | <NA> | <NA> | /225.258 | /120.567 | /1.043 | /51.806 | /45.398 | /22.320 | /104.691 | /35.097 | /20.489 | /37.980 | /11.125 |
3 | 곡선연장 | <NA> | <NA> | km | 254.922 | 124.25 | 9.769 | 49.565 | 35.872 | 29.044 | 130.672 | 46.18 | 31.107 | 38.242 | 15.143 |
4 | (R<1200) | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
5 | 최소 | 곡선반경 (R) | <NA> | m | - | - | 136 | 200 | 199 | 180 | - | 246 | 269 | 246 | 248 |
6 | 곡선 | 연장 (m) | <NA> | m | - | - | 318 | 311 | 434 | 180 | - | 438 | 311 | 278 | 524 |
7 | <NA> | 구 간 | <NA> | - | - | - | 시청~종각 | 서초~방배외1개소 | 안국~종로 | 당고개~상계 | - | 방이~오금 | 응암~역촌 | 장승 | 산성~단대 |
8 | <NA> | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | 3가 | NaN | NaN | NaN | NaN | 배기~ | 오거리 |
9 | <NA> | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 신대방 | NaN |
시 설 명 | Unnamed: 1 | Unnamed: 2 | 단위 | 계 | 1~4호선 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | 5~8호선 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
17 | 콘크리트도상 | <NA> | <NA> | km | 471.6 | 146.7 | 14.5 | 63.3 | 39.4 | 29.4 | 324.9 | 109.4 | 65.8 | 112.9 | 36.9 |
18 | B2S | <NA> | <NA> | km | 53.5 | 53.5 | 5.8 | 20.7 | 13.6 | 13.4 | - | - | - | - | - |
19 | 방진 | <NA> | Alt-Ⅰ | 개 | 50510 | 26732 | 4086 | 11344 | 6337 | 4965 | 23778 | - | 14336 | 7582 | 1860 |
20 | 체결 | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
21 | 장치 | <NA> | Alt-Ⅱ | 개 | 43320 | 43320 | 3437 | 17917 | 14205 | 7761 | - | - | - | - | - |
22 | 구간 | <NA> | DFF14 | 개 | 11941 | 11941 | - | 11235 | - | 706 | - | - | - | - | - |
23 | 장비유치선 | <NA> | <NA> | 개소 | 28 | 8 | - | 4 | 2 | 2 | 20 | 7 | 3 | 6 | 4 |
24 | 차량기지 | <NA> | <NA> | 개소 | 11 | 5 | - | 2 | 2 | 1 | 6 | 2 | 1 | 2 | 1 |
25 | 신축이음매 | <NA> | <NA> | 개소 | 409 | 375 | 21 | 152 | 98 | 104 | 34 | 4 | 2 | 24 | 4 |
26 | 도유기 | <NA> | <NA> | 대 | 253 | 131 | 6 | 52 | 33 | 41 | 121 | 68 | 15 | 32 | 6 |
Most frequently occurring
시 설 명 | Unnamed: 1 | Unnamed: 2 | 단위 | # duplicates | |
---|---|---|---|---|---|
2 | <NA> | <NA> | <NA> | <NA> | 5 |
0 | (본선/측선) | <NA> | <NA> | <NA> | 2 |
1 | <NA> | 구 간 | <NA> | - | 2 |