Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 39 |
Missing cells | 165 |
Missing cells (%) | 28.2% |
Duplicate rows | 2 |
Duplicate rows (%) | 5.1% |
Total size in memory | 4.7 KiB |
Average record size in memory | 124.4 B |
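The percentage rows in the table above can be re-derived from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and the duplicate percentage over all rows; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 15
n_observations = 39
missing_cells = 165
duplicate_rows = 2

# Assumption: percentages use total cells and total rows as denominators.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells:        {total_cells}")
print(f"Missing cells (%):  {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Under these assumptions the results round to the reported 28.2% and 5.1%.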
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Numeric | 1 |
Unsupported | 10 |
Dataset
Description | File download |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-13214/F/1/datasetView.do |
Alerts
Dataset has 2 (5.1%) duplicate rows | Duplicates |
구분 has 26 (66.7%) missing values | Missing |
설 비 명 has 21 (53.8%) missing values | Missing |
Unnamed: 2 has 7 (17.9%) missing values | Missing |
계 has 11 (28.2%) missing values | Missing |
1~4호선 has 10 (25.6%) missing values | Missing |
Unnamed: 6 has 10 (25.6%) missing values | Missing |
Unnamed: 7 has 10 (25.6%) missing values | Missing |
Unnamed: 8 has 10 (25.6%) missing values | Missing |
Unnamed: 9 has 10 (25.6%) missing values | Missing |
5~8호선 has 10 (25.6%) missing values | Missing |
Unnamed: 11 has 10 (25.6%) missing values | Missing |
Unnamed: 12 has 10 (25.6%) missing values | Missing |
Unnamed: 13 has 10 (25.6%) missing values | Missing |
Unnamed: 14 has 10 (25.6%) missing values | Missing |
1~4호선 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
5~8호선 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
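The "Unnamed: N" columns and the unsupported-type alerts are consistent with a two-row header in which a group heading (1~4호선, 5~8호선) spans several columns and the per-line labels sit in the first data row. A minimal sketch of one way to flatten such a header is shown below, using only the column names and first-row labels that appear in this report. The function name `flatten_header` and the `group_label` naming scheme are hypothetical choices, not something the report prescribes.

```python
def flatten_header(top, sub):
    """Combine a two-row header into one name per column.

    `top` holds the parsed column names, where "Unnamed: N" is assumed to
    mark a cell covered by the group heading to its left. `sub` holds the
    labels from the first data row (None where that row is empty).
    """
    names = []
    group = None
    for top_name, sub_name in zip(top, sub):
        if not top_name.startswith("Unnamed:"):
            group = top_name              # a new group heading starts here
        if sub_name is None:
            names.append(top_name)        # no sub-label: keep the name as-is
        else:
            names.append(f"{group}_{sub_name}")
    return names


top = ["구분", "설 비 명", "Unnamed: 2", "단위", "계",
       "1~4호선", "Unnamed: 6", "Unnamed: 7", "Unnamed: 8", "Unnamed: 9",
       "5~8호선", "Unnamed: 11", "Unnamed: 12", "Unnamed: 13", "Unnamed: 14"]
sub = [None] * 5 + ["소계", "1호선", "2호선", "3호선", "4호선",
                    "소계", "5호선", "6호선", "7호선", "8호선"]

names = flatten_header(top, sub)
print(names[5:])
```

After flattening, the label row would be dropped from the data before re-profiling.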
Reproduction
Analysis started | 2024-04-29 16:44:45.086251 |
---|---|
Analysis finished | 2024-04-29 16:44:47.315563 |
Duration | 2.23 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
구분
Text
MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 84.6% |
Missing | 26 |
Missing (%) | 66.7% |
Memory size | 444.0 B |
Value | Count | Frequency (%) |
전 | 3 | 23.1% |
변 | 1 | 7.7% |
설 | 1 | 7.7% |
비 | 1 | 7.7% |
역사 | 1 | 7.7% |
전기 | 1 | 7.7% |
설비 | 1 | 7.7% |
차 | 1 | 7.7% |
선 | 1 | 7.7% |
송 | 1 | 7.7% |
Most occurring characters
Value | Count | Frequency (%) |
전 | 4 | 25.0% |
설 | 2 | 12.5% |
비 | 2 | 12.5% |
변 | 1 | 6.2% |
역 | 1 | 6.2% |
사 | 1 | 6.2% |
기 | 1 | 6.2% |
차 | 1 | 6.2% |
선 | 1 | 6.2% |
송 | 1 | 6.2% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 16 | 100.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
전 | 4 | 25.0% |
설 | 2 | 12.5% |
비 | 2 | 12.5% |
변 | 1 | 6.2% |
역 | 1 | 6.2% |
사 | 1 | 6.2% |
기 | 1 | 6.2% |
차 | 1 | 6.2% |
선 | 1 | 6.2% |
송 | 1 | 6.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 16 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
전 | 4 | 25.0% |
설 | 2 | 12.5% |
비 | 2 | 12.5% |
변 | 1 | 6.2% |
역 | 1 | 6.2% |
사 | 1 | 6.2% |
기 | 1 | 6.2% |
차 | 1 | 6.2% |
선 | 1 | 6.2% |
송 | 1 | 6.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 16 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
전 | 4 | 25.0% |
설 | 2 | 12.5% |
비 | 2 | 12.5% |
변 | 1 | 6.2% |
역 | 1 | 6.2% |
사 | 1 | 6.2% |
기 | 1 | 6.2% |
차 | 1 | 6.2% |
선 | 1 | 6.2% |
송 | 1 | 6.2% |
설 비 명
Text
MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 83.3% |
Missing | 21 |
Missing (%) | 53.8% |
Memory size | 444.0 B |
Value | Count | Frequency (%) |
변압기 | 2 | 11.1% |
차단기 | 2 | 11.1% |
계 | 2 | 11.1% |
변전소 | 1 | 5.6% |
정류기 | 1 | 5.6% |
원제반 | 1 | 5.6% |
담당역사 | 1 | 5.6% |
역사전기실 | 1 | 5.6% |
본선(터널)전기실 | 1 | 5.6% |
강체 | 1 | 5.6% |
Other values (5) | 5 | 27.8% |
Most occurring characters
Value | Count | Frequency (%) |
기 | 7 | 13.2% |
전 | 5 | 9.4% |
변 | 3 | 5.7% |
압 | 2 | 3.8% |
사 | 2 | 3.8% |
역 | 2 | 3.8% |
실 | 2 | 3.8% |
계 | 2 | 3.8% |
단 | 2 | 3.8% |
차 | 2 | 3.8% |
Other values (24) | 24 | 45.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 51 | 96.2% |
Close Punctuation | 1 | 1.9% |
Open Punctuation | 1 | 1.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 7 | 13.7% |
전 | 5 | 9.8% |
변 | 3 | 5.9% |
압 | 2 | 3.9% |
사 | 2 | 3.9% |
역 | 2 | 3.9% |
실 | 2 | 3.9% |
계 | 2 | 3.9% |
단 | 2 | 3.9% |
차 | 2 | 3.9% |
Other values (22) | 22 | 43.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 51 | 96.2% |
Common | 2 | 3.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 7 | 13.7% |
전 | 5 | 9.8% |
변 | 3 | 5.9% |
압 | 2 | 3.9% |
사 | 2 | 3.9% |
역 | 2 | 3.9% |
실 | 2 | 3.9% |
계 | 2 | 3.9% |
단 | 2 | 3.9% |
차 | 2 | 3.9% |
Other values (22) | 22 | 43.1% |
Common
Value | Count | Frequency (%) |
) | 1 | 50.0% |
( | 1 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 51 | 96.2% |
ASCII | 2 | 3.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
기 | 7 | 13.7% |
전 | 5 | 9.8% |
변 | 3 | 5.9% |
압 | 2 | 3.9% |
사 | 2 | 3.9% |
역 | 2 | 3.9% |
실 | 2 | 3.9% |
계 | 2 | 3.9% |
단 | 2 | 3.9% |
차 | 2 | 3.9% |
Other values (22) | 22 | 43.1% |
ASCII
Value | Count | Frequency (%) |
) | 1 | 50.0% |
( | 1 | 50.0% |
Unnamed: 2
Text
MISSING
 
Distinct | 25 |
---|---|
Distinct (%) | 78.1% |
Missing | 7 |
Missing (%) | 17.9% |
Memory size | 444.0 B |
Value | Count | Frequency (%) |
계 | 5 | 15.6% |
vcb | 2 | 6.2% |
전기실 | 2 | 6.2% |
역사 | 2 | 6.2% |
22.9kv)연장 | 2 | 6.2% |
rtu | 1 | 3.1% |
6.6kv | 1 | 3.1% |
지상부 | 1 | 3.1% |
지하부 | 1 | 3.1% |
터널용 | 1 | 3.1% |
Other values (14) | 14 | 43.8% |
Most occurring characters
Value | Count | Frequency (%) |
) | 8 | 7.1% |
( | 8 | 7.1% |
용 | 7 | 6.2% |
전 | 6 | 5.4% |
계 | 5 | 4.5% |
V | 5 | 4.5% |
2 | 4 | 3.6% |
연 | 4 | 3.6% |
고 | 3 | 2.7% |
. | 3 | 2.7% |
Other values (37) | 59 | 52.7% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 69 | 61.6% |
Uppercase Letter | 12 | 10.7% |
Close Punctuation | 8 | 7.1% |
Open Punctuation | 8 | 7.1% |
Decimal Number | 8 | 7.1% |
Other Punctuation | 4 | 3.6% |
Lowercase Letter | 3 | 2.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
용 | 7 | 10.1% |
전 | 6 | 8.7% |
계 | 5 | 7.2% |
연 | 4 | 5.8% |
고 | 3 | 4.3% |
장 | 3 | 4.3% |
역 | 3 | 4.3% |
사 | 3 | 4.3% |
실 | 3 | 4.3% |
지 | 2 | 2.9% |
Other values (23) | 30 | 43.5% |
Uppercase Letter
Value | Count | Frequency (%) |
V | 5 | 41.7% |
C | 2 | 16.7% |
B | 2 | 16.7% |
U | 1 | 8.3% |
T | 1 | 8.3% |
R | 1 | 8.3% |
Decimal Number
Value | Count | Frequency (%) |
2 | 4 | 50.0% |
6 | 2 | 25.0% |
9 | 2 | 25.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | 75.0% |
, | 1 | 25.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 8 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 8 | 100.0% |
Lowercase Letter
Value | Count | Frequency (%) |
k | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 69 | 61.6% |
Common | 28 | 25.0% |
Latin | 15 | 13.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
용 | 7 | 10.1% |
전 | 6 | 8.7% |
계 | 5 | 7.2% |
연 | 4 | 5.8% |
고 | 3 | 4.3% |
장 | 3 | 4.3% |
역 | 3 | 4.3% |
사 | 3 | 4.3% |
실 | 3 | 4.3% |
지 | 2 | 2.9% |
Other values (23) | 30 | 43.5% |
Common
Value | Count | Frequency (%) |
) | 8 | 28.6% |
( | 8 | 28.6% |
2 | 4 | 14.3% |
. | 3 | 10.7% |
6 | 2 | 7.1% |
9 | 2 | 7.1% |
, | 1 | 3.6% |
Latin
Value | Count | Frequency (%) |
V | 5 | 33.3% |
k | 3 | 20.0% |
C | 2 | 13.3% |
B | 2 | 13.3% |
U | 1 | 6.7% |
T | 1 | 6.7% |
R | 1 | 6.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 69 | 61.6% |
ASCII | 43 | 38.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 8 | 18.6% |
( | 8 | 18.6% |
V | 5 | 11.6% |
2 | 4 | 9.3% |
. | 3 | 7.0% |
k | 3 | 7.0% |
6 | 2 | 4.7% |
9 | 2 | 4.7% |
C | 2 | 4.7% |
B | 2 | 4.7% |
Other values (4) | 4 | 9.3% |
Hangul
Value | Count | Frequency (%) |
용 | 7 | 10.1% |
전 | 6 | 8.7% |
계 | 5 | 7.2% |
연 | 4 | 5.8% |
고 | 3 | 4.3% |
장 | 3 | 4.3% |
역 | 3 | 4.3% |
사 | 3 | 4.3% |
실 | 3 | 4.3% |
지 | 2 | 2.9% |
Other values (23) | 30 | 43.5% |
단위
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 12.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 444.0 B |
대 | 14 |
---|---|
<NA> | 11 |
km | 7 |
개소 | 6 |
면 | 1 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.1794872 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 2.6% |
Sample
1st row | <NA> |
---|---|
2nd row | 개소 |
3rd row | 개소 |
4th row | 개소 |
5th row | 대 |
Common Values
Value | Count | Frequency (%) |
대 | 14 | 35.9% |
<NA> | 11 | 28.2% |
km | 7 | 17.9% |
개소 | 6 | 15.4% |
면 | 1 | 2.6% |
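The length statistics for 단위 can be cross-checked against the value counts listed above. The sketch below assumes that `<NA>` is counted as a literal four-character string rather than as a missing value, which would also be consistent with the column reporting 0 missing values; the variable names are illustrative only.

```python
from statistics import mean, median

# Value counts copied from the "Common Values" table for 단위.
value_counts = {"대": 14, "<NA>": 11, "km": 7, "개소": 6, "면": 1}

# Assumption: "<NA>" is treated as an ordinary 4-character category.
lengths = [len(value) for value, count in value_counts.items()
           for _ in range(count)]

print("Max length:   ", max(lengths))
print("Median length:", median(lengths))
print("Mean length:  ", round(mean(lengths), 7))
print("Min length:   ", min(lengths))
```

Under that assumption the mean is 85 / 39, which rounds to the reported 2.1794872.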
계
Real number (ℝ)
MISSING
 
Distinct | 26 |
---|---|
Distinct (%) | 92.9% |
Missing | 11 |
Missing (%) | 28.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 960.53571 |
Minimum | 10 |
---|---|
Maximum | 4437 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 483.0 B |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 91.5 |
Q1 | 198.5 |
median | 455 |
Q3 | 1439.25 |
95-th percentile | 2747.3 |
Maximum | 4437 |
Range | 4427 |
Interquartile range (IQR) | 1240.75 |
Descriptive statistics
Standard deviation | 1084.8095 |
---|---|
Coefficient of variation (CV) | 1.1293797 |
Kurtosis | 2.5677552 |
Mean | 960.53571 |
Median Absolute Deviation (MAD) | 356 |
Skewness | 1.6327804 |
Sum | 26895 |
Variance | 1176811.7 |
Monotonicity | Not monotonic |
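Several of the statistics reported for 계 are related by simple identities, so they can be checked against one another. The sketch below uses only the rounded figures printed in this report and assumes the mean is taken over the non-missing rows (39 − 11 = 28); the dictionary layout and tolerance are illustrative choices.

```python
import math

# Rounded figures copied from the 계 statistics tables.
mean, std = 960.53571, 1084.8095
q1, q3 = 198.5, 1439.25
minimum, maximum = 10, 4437
total, n_rows, n_missing = 26895, 39, 11

n_valid = n_rows - n_missing  # rows that actually carry a number

# (value recomputed from other fields, value reported in the tables)
checks = {
    "mean = sum / n":      (total / n_valid, 960.53571),
    "range = max - min":   (maximum - minimum, 4427),
    "IQR = Q3 - Q1":       (q3 - q1, 1240.75),
    "CV = std / mean":     (std / mean, 1.1293797),
    "variance = std ** 2": (std ** 2, 1176811.7),
}

for name, (computed, reported) in checks.items():
    # Loose tolerance because the inputs are already rounded.
    ok = math.isclose(computed, reported, rel_tol=1e-6)
    print(f"{name:22s} computed={computed:.7g} reported={reported} ok={ok}")
```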
Value | Count | Frequency (%) |
309 | 2 | 5.1% |
185 | 2 | 5.1% |
10 | 1 | 2.6% |
704 | 1 | 2.6% |
2832 | 1 | 2.6% |
996 | 1 | 2.6% |
609 | 1 | 2.6% |
4437 | 1 | 2.6% |
329 | 1 | 2.6% |
581 | 1 | 2.6% |
Other values (16) | 16 | 41.0% |
(Missing) | 11 | 28.2% |
Value | Count | Frequency (%) |
10 | 1 | 2.6% |
88 | 1 | 2.6% |
98 | 1 | 2.6% |
100 | 1 | 2.6% |
185 | 2 | 5.1% |
194 | 1 | 2.6% |
200 | 1 | 2.6% |
269 | 1 | 2.6% |
278 | 1 | 2.6% |
302 | 1 | 2.6% |
Value | Count | Frequency (%) |
4437 | 1 | 2.6% |
2832 | 1 | 2.6% |
2590 | 1 | 2.6% |
2321 | 1 | 2.6% |
2224 | 1 | 2.6% |
1898 | 1 | 2.6% |
1713 | 1 | 2.6% |
1348 | 1 | 2.6% |
996 | 1 | 2.6% |
910 | 1 | 2.6% |
1~4호선
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 6
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 7
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 8
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 9
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
5~8호선
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 11
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 12
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 13
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Unnamed: 14
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10 |
---|---|
Missing (%) | 25.6% |
Memory size | 444.0 B |
Variable | 구분 | 설 비 명 | Unnamed: 2 | 단위 | 계 |
---|---|---|---|---|---|
구분 | 1.000 | 0.977 | 0.964 | 0.829 | 0.799 |
설 비 명 | 0.977 | 1.000 | 1.000 | 1.000 | 0.000 |
Unnamed: 2 | 0.964 | 1.000 | 1.000 | 0.913 | 0.000 |
단위 | 0.829 | 1.000 | 0.913 | 1.000 | 0.540 |
계 | 0.799 | 0.000 | 0.000 | 0.540 | 1.000 |
Variable | 계 | 단위 |
---|---|---|
계 | 1.000 | 0.221 |
단위 | 0.221 | 1.000 |
Row | 구분 | 설 비 명 | Unnamed: 2 | 단위 | 계 | 1~4호선 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | 5~8호선 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 소계 | 1호선 | 2호선 | 3호선 | 4호선 | 소계 | 5호선 | 6호선 | 7호선 | 8호선 |
1 | 변 | 변전소 | 계 | 개소 | 98 | 42 | 3 | 15 | 13 | 11 | 56 | 19 | 12 | 20 | 5 |
2 | 전 | <NA> | 수전용 | 개소 | 88 | 42 | 3 | 15 | 13 | 11 | 46 | 14 | 12 | 17 | 3 |
3 | 설 | <NA> | 연락용 | 개소 | 10 | - | - | - | - | - | 10 | 5 | - | 3 | 2 |
4 | 비 | 정류기 | 실리콘 | 대 | 309 | 145 | 11 | 59 | 41 | 34 | 164 | 57 | 35 | 57 | 15 |
5 | <NA> | 변압기 | 계 | 대 | 704 | 312 | 22 | 116 | 93 | 81 | 392 | 135 | 83 | 138 | 36 |
6 | <NA> | <NA> | 정류용 | 대 | 309 | 145 | 11 | 59 | 41 | 34 | 164 | 57 | 35 | 57 | 15 |
7 | <NA> | <NA> | (전차선) | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
8 | <NA> | <NA> | 배전용 | 대 | 194 | 83 | 5 | 27 | 26 | 25 | 112 | 38 | 24 | 40 | 10 |
9 | <NA> | <NA> | (역사) | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
Row | 구분 | 설 비 명 | Unnamed: 2 | 단위 | 계 | 1~4호선 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | 5~8호선 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
29 | <NA> | <NA> | (VCB) | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
30 | 전 | 계 | <NA> | km | 910 | 436 | 21 | 193 | 129 | 93 | 474 | 157 | 92 | 168 | 57 |
31 | 차 | 강체 | 지하부 | km | 581 | 238 | 18 | 90 | 77 | 53 | 343 | 115 | 72 | 115 | 40 |
32 | 선 | 카테 | 지상부 | km | 329 | 198 | 3 | 103 | 52 | 40 | 131 | 42 | 20 | 53 | 17 |
33 | <NA> | 나리 | <NA> | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
34 | 송 | 계 | <NA> | km | 4437 | 1989 | 110 | 820 | 596 | 463 | 2448 | 833 | 485 | 804 | 326 |
35 | 배 | 수전 | (22.9kV)연장 | km | 609 | 238 | 12 | 76 | 77 | 74 | 371 | 156 | 52 | 100 | 64 |
36 | 전 | 연락 | (22.9kV)연장 | km | 996 | 435 | 18 | 188 | 122 | 107 | 561 | 191 | 101 | 198 | 71 |
37 | <NA> | 배전 | (6.6kV) | km | 2832 | 1316 | 81 | 556 | 397 | 282 | 1516 | 487 | 332 | 506 | 191 |
38 | <NA> | <NA> | 연장 | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
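The per-line columns in the sample rows mix numeric strings, `-` placeholders, and NaN markers, which may be why the profiler could not assign them a supported type. A minimal cleaning sketch follows, assuming that `-`, `NaN`, and `<NA>` all mean "no value"; the helper name `parse_count` is hypothetical, and the example cells are taken from row 3 of the first sample table (계 through Unnamed: 9, then 5~8호선 through Unnamed: 14, with 계 omitted).

```python
def parse_count(cell):
    """Convert one raw cell to an int, a float, or None.

    Assumption: "-", "NaN", "<NA>", and empty cells all mean "no value".
    """
    if cell is None:
        return None
    text = str(cell).strip()
    if text in {"", "-", "NaN", "nan", "<NA>"}:
        return None
    number = float(text)
    # Keep whole numbers as ints so counts stay readable.
    return int(number) if number.is_integer() else number


# Row 3 (연락용): 1~4호선 … Unnamed: 9, then 5~8호선 … Unnamed: 14.
row_3 = ["-", "-", "-", "-", "-", "10", "5", "-", "3", "2"]
cleaned = [parse_count(cell) for cell in row_3]
print(cleaned)
```

Applying such a converter column by column would leave only numbers and missing values, which the profiler should be able to treat as numeric.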
Most frequently occurring
Row | 구분 | 설 비 명 | Unnamed: 2 | 단위 | 계 | # duplicates |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | (VCB) | <NA> | <NA> | 2 |
1 | <NA> | <NA> | 전기실 | <NA> | <NA> | 2 |
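The duplicate rows listed above are label-only rows whose remaining cells are all empty. As an illustration of how such repeats can be counted, the sketch below tallies identical rows with `collections.Counter`. The `rows` list is a hypothetical, shortened stand-in (first three columns only), not the actual dataset.

```python
from collections import Counter

# Hypothetical stand-in rows: (구분, 설 비 명, Unnamed: 2), None = empty cell.
rows = [
    (None, None, "(VCB)"),
    (None, None, "전기실"),
    (None, None, "(역사)"),
    (None, None, "(VCB)"),
    (None, None, "전기실"),
]

counts = Counter(rows)

# Rows that occur more than once, with their occurrence counts.
repeated = {row: n for row, n in counts.items() if n > 1}

# Number of rows that repeat an earlier row.
extra = sum(n - 1 for n in repeated.values())

print(repeated)
print("Rows repeating an earlier row:", extra)
```

Such rows are likely continuation lines of wrapped labels in the source spreadsheet, so merging them into the row above may be more appropriate than dropping them.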