Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 28 |
Missing cells | 27 |
Missing cells (%) | 12.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.0 KiB |
Average record size in memory | 74.7 B |
Variable types
Text | 1 |
---|---|
Categorical | 2 |
Numeric | 5 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-13208/F/1/datasetView.do |
본사 is highly overall correlated with 수량 and 5 other fields | High correlation |
단위 is highly overall correlated with 수량 and 5 other fields | High correlation |
수량 is highly overall correlated with 1호선 and 5 other fields | High correlation |
1호선 is highly overall correlated with 수량 and 5 other fields | High correlation |
2호선 is highly overall correlated with 수량 and 5 other fields | High correlation |
3호선 is highly overall correlated with 수량 and 5 other fields | High correlation |
4호선 is highly overall correlated with 수량 and 5 other fields | High correlation |
단위 is highly imbalanced (77.8%) | Imbalance |
1호선 has 8 (28.6%) missing values | Missing |
2호선 has 6 (21.4%) missing values | Missing |
3호선 has 7 (25.0%) missing values | Missing |
4호선 has 6 (21.4%) missing values | Missing |
Reproduction
Analysis started | 2024-04-29 16:43:18.903677 |
---|---|
Analysis finished | 2024-04-29 16:43:24.249918 |
Duration | 5.35 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
설비명
Text
Distinct | 27 |
---|---|
Distinct (%) | 96.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 356.0 B |
Length
Max length | 12 |
---|---|
Median length | 9.5 |
Mean length | 6.7142857 |
Min length | 1 |
Characters and Unicode
Total characters | 188 |
---|---|
Distinct characters | 73 |
Distinct categories | 3 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 26 ? |
---|---|
Unique (%) | 92.9% |
Sample
1st row | 총계 |
---|---|
2nd row | 개집표기 턴스타일게이트 |
3rd row | 개집표기 슬림게이트 |
4th row | 개집표기 장애인게이트 |
5th row | 개집표기 스피드게이트 |
Value | Count | Frequency (%) |
개집표기 | 5 | 15.2% |
유인충전기 | 2 | 6.1% |
유지보수관리전산기 | 1 | 3.0% |
cctv모니터 | 1 | 3.0% |
cctv카메라 | 1 | 3.0% |
발권기 | 1 | 3.0% |
판매기 | 1 | 3.0% |
무인정산기 | 1 | 3.0% |
환급기 | 1 | 3.0% |
발매기 | 1 | 3.0% |
Other values (18) | 18 |
Most occurring characters
Value | Count | Frequency (%) |
기 | 17 | 9.0% |
전 | 8 | 4.3% |
스 | 8 | 4.3% |
집 | 7 | 3.7% |
산 | 6 | 3.2% |
C | 5 | 2.7% |
개 | 5 | 2.7% |
표 | 5 | 2.7% |
5 | 2.7% | |
시 | 5 | 2.7% |
Other values (63) | 117 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 170 | |
Uppercase Letter | 13 | 6.9% |
Space Separator | 5 | 2.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 17 | 10.0% |
전 | 8 | 4.7% |
스 | 8 | 4.7% |
집 | 7 | 4.1% |
산 | 6 | 3.5% |
개 | 5 | 2.9% |
표 | 5 | 2.9% |
시 | 5 | 2.9% |
템 | 5 | 2.9% |
이 | 4 | 2.4% |
Other values (55) | 100 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 5 | |
V | 2 | 15.4% |
T | 2 | 15.4% |
N | 1 | 7.7% |
S | 1 | 7.7% |
M | 1 | 7.7% |
P | 1 | 7.7% |
Space Separator
Value | Count | Frequency (%) |
5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 170 | |
Latin | 13 | 6.9% |
Common | 5 | 2.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 17 | 10.0% |
전 | 8 | 4.7% |
스 | 8 | 4.7% |
집 | 7 | 4.1% |
산 | 6 | 3.5% |
개 | 5 | 2.9% |
표 | 5 | 2.9% |
시 | 5 | 2.9% |
템 | 5 | 2.9% |
이 | 4 | 2.4% |
Other values (55) | 100 |
Latin
Value | Count | Frequency (%) |
C | 5 | |
V | 2 | 15.4% |
T | 2 | 15.4% |
N | 1 | 7.7% |
S | 1 | 7.7% |
M | 1 | 7.7% |
P | 1 | 7.7% |
Common
Value | Count | Frequency (%) |
5 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 170 | |
ASCII | 18 | 9.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
기 | 17 | 10.0% |
전 | 8 | 4.7% |
스 | 8 | 4.7% |
집 | 7 | 4.1% |
산 | 6 | 3.5% |
개 | 5 | 2.9% |
표 | 5 | 2.9% |
시 | 5 | 2.9% |
템 | 5 | 2.9% |
이 | 4 | 2.4% |
Other values (55) | 100 |
ASCII
Value | Count | Frequency (%) |
C | 5 | |
5 | ||
V | 2 | 11.1% |
T | 2 | 11.1% |
N | 1 | 5.6% |
S | 1 | 5.6% |
M | 1 | 5.6% |
P | 1 | 5.6% |
단위
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 7.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 356.0 B |
대 | |
---|---|
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1071429 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.6% |
Sample
1st row | 대 |
---|---|
2nd row | 대 |
3rd row | 대 |
4th row | 대 |
5th row | 대 |
Common Values
Value | Count | Frequency (%) |
대 | 27 | |
<NA> | 1 | 3.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
대 | 27 | |
na | 1 | 3.6% |
수량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 78.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 600.60714 |
Minimum | 1 |
---|---|
Maximum | 6524 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 384.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 9.25 |
median | 159 |
Q3 | 334.25 |
95-th percentile | 3219.35 |
Maximum | 6524 |
Range | 6523 |
Interquartile range (IQR) | 325 |
Descriptive statistics
Standard deviation | 1406.8203 |
---|---|
Coefficient of variation (CV) | 2.3423303 |
Kurtosis | 12.259621 |
Mean | 600.60714 |
Median Absolute Deviation (MAD) | 156 |
Skewness | 3.4157355 |
Sum | 16817 |
Variance | 1979143.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 5 | |
119 | 2 | 7.1% |
270 | 2 | 7.1% |
39 | 1 | 3.6% |
227 | 1 | 3.6% |
332 | 1 | 3.6% |
17 | 1 | 3.6% |
199 | 1 | 3.6% |
335 | 1 | 3.6% |
432 | 1 | 3.6% |
Other values (12) | 12 |
Value | Count | Frequency (%) |
1 | 5 | |
5 | 1 | 3.6% |
7 | 1 | 3.6% |
10 | 1 | 3.6% |
11 | 1 | 3.6% |
15 | 1 | 3.6% |
17 | 1 | 3.6% |
39 | 1 | 3.6% |
119 | 2 | 7.1% |
199 | 1 | 3.6% |
Value | Count | Frequency (%) |
6524 | 1 | |
3499 | 1 | |
2700 | 1 | |
601 | 1 | |
455 | 1 | |
432 | 1 | |
335 | 1 | |
334 | 1 | |
332 | 1 | |
292 | 1 |
본사
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 14.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 356.0 B |
<NA> | |
---|---|
1 | |
14 | 1 |
5 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 2.75 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 7.1% |
Sample
1st row | 14 |
---|---|
2nd row | 1 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 16 | |
1 | 10 | |
14 | 1 | 3.6% |
5 | 1 | 3.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 16 | |
1 | 10 | |
14 | 1 | 3.6% |
5 | 1 | 3.6% |
1호선
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 90.0% |
Missing | 8 |
Missing (%) | 28.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 93.65 |
Minimum | 1 |
---|---|
Maximum | 724 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 384.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.95 |
Q1 | 10 |
median | 28 |
Q3 | 57 |
95-th percentile | 418.1 |
Maximum | 724 |
Range | 723 |
Interquartile range (IQR) | 47 |
Descriptive statistics
Standard deviation | 179.57179 |
---|---|
Coefficient of variation (CV) | 1.9174778 |
Kurtosis | 8.1855439 |
Mean | 93.65 |
Median Absolute Deviation (MAD) | 20.5 |
Skewness | 2.8287786 |
Sum | 1873 |
Variance | 32246.029 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 2 | 7.1% |
23 | 2 | 7.1% |
51 | 1 | 3.6% |
1 | 1 | 3.6% |
25 | 1 | 3.6% |
36 | 1 | 3.6% |
3 | 1 | 3.6% |
18 | 1 | 3.6% |
37 | 1 | 3.6% |
724 | 1 | 3.6% |
Other values (8) | 8 | |
(Missing) | 8 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
10 | 2 | |
18 | 1 | |
23 | 2 | |
25 | 1 | |
31 | 1 | |
34 | 1 |
Value | Count | Frequency (%) |
724 | 1 | |
402 | 1 | |
284 | 1 | |
80 | 1 | |
75 | 1 | |
51 | 1 | |
37 | 1 | |
36 | 1 | |
34 | 1 | |
31 | 1 |
2호선
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 19 |
---|---|
Distinct (%) | 86.4% |
Missing | 6 |
Missing (%) | 21.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 351.54545 |
Minimum | 3 |
---|---|
Maximum | 2983 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 384.0 B |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 4.05 |
Q1 | 29.75 |
median | 118 |
Q3 | 180 |
95-th percentile | 1630.5 |
Maximum | 2983 |
Range | 2980 |
Interquartile range (IQR) | 150.25 |
Descriptive statistics
Standard deviation | 716.61345 |
---|---|
Coefficient of variation (CV) | 2.038466 |
Kurtosis | 8.9241826 |
Mean | 351.54545 |
Median Absolute Deviation (MAD) | 83 |
Skewness | 2.9551662 |
Sum | 7734 |
Variance | 513534.83 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7 | 2 | 7.1% |
50 | 2 | 7.1% |
118 | 2 | 7.1% |
153 | 1 | 3.6% |
1650 | 1 | 3.6% |
3 | 1 | 3.6% |
4 | 1 | 3.6% |
5 | 1 | 3.6% |
233 | 1 | 3.6% |
1260 | 1 | 3.6% |
Other values (9) | 9 | |
(Missing) | 6 |
Value | Count | Frequency (%) |
3 | 1 | |
4 | 1 | |
5 | 1 | |
7 | 2 | |
23 | 1 | |
50 | 2 | |
94 | 1 | |
103 | 1 | |
118 | 2 | |
122 | 1 |
Value | Count | Frequency (%) |
2983 | 1 | |
1650 | 1 | |
1260 | 1 | |
261 | 1 | |
233 | 1 | |
189 | 1 | |
153 | 1 | |
151 | 1 | |
150 | 1 | |
122 | 1 |
3호선
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 71.4% |
Missing | 7 |
Missing (%) | 25.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 170.7619 |
Minimum | 2 |
---|---|
Maximum | 1408 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 384.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2 |
Q1 | 33 |
median | 71 |
Q3 | 73 |
95-th percentile | 699 |
Maximum | 1408 |
Range | 1406 |
Interquartile range (IQR) | 40 |
Descriptive statistics
Standard deviation | 335.12728 |
---|---|
Coefficient of variation (CV) | 1.9625412 |
Kurtosis | 9.6122936 |
Mean | 170.7619 |
Median Absolute Deviation (MAD) | 38 |
Skewness | 3.0269447 |
Sum | 3586 |
Variance | 112310.29 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
71 | 3 | |
73 | 2 | 7.1% |
2 | 2 | 7.1% |
3 | 2 | 7.1% |
33 | 2 | 7.1% |
1408 | 1 | 3.6% |
562 | 1 | 3.6% |
64 | 1 | 3.6% |
699 | 1 | 3.6% |
72 | 1 | 3.6% |
Other values (5) | 5 | |
(Missing) | 7 |
Value | Count | Frequency (%) |
2 | 2 | |
3 | 2 | |
7 | 1 | 3.6% |
33 | 2 | |
43 | 1 | 3.6% |
58 | 1 | 3.6% |
64 | 1 | 3.6% |
71 | 3 | |
72 | 1 | 3.6% |
73 | 2 |
Value | Count | Frequency (%) |
1408 | 1 | 3.6% |
699 | 1 | 3.6% |
562 | 1 | 3.6% |
139 | 1 | 3.6% |
99 | 1 | 3.6% |
73 | 2 | |
72 | 1 | 3.6% |
71 | 3 | |
64 | 1 | 3.6% |
58 | 1 | 3.6% |
4호선
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 81.8% |
Missing | 6 |
Missing (%) | 21.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 163.81818 |
Minimum | 1 |
---|---|
Maximum | 1395 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 384.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.05 |
Q1 | 11.75 |
median | 62.5 |
Q3 | 77.25 |
95-th percentile | 739.3 |
Maximum | 1395 |
Range | 1394 |
Interquartile range (IQR) | 65.5 |
Descriptive statistics
Standard deviation | 333.08567 |
---|---|
Coefficient of variation (CV) | 2.0332643 |
Kurtosis | 9.154695 |
Mean | 163.81818 |
Median Absolute Deviation (MAD) | 36.5 |
Skewness | 2.9820463 |
Sum | 3604 |
Variance | 110946.06 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
74 | 2 | 7.1% |
3 | 2 | 7.1% |
26 | 2 | 7.1% |
67 | 2 | 7.1% |
93 | 1 | 3.6% |
7 | 1 | 3.6% |
41 | 1 | 3.6% |
75 | 1 | 3.6% |
4 | 1 | 3.6% |
44 | 1 | 3.6% |
Other values (8) | 8 | |
(Missing) | 6 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 2 | |
4 | 1 | |
7 | 1 | |
26 | 2 | |
41 | 1 | |
44 | 1 | |
58 | 1 | |
67 | 2 |
Value | Count | Frequency (%) |
1395 | 1 | |
747 | 1 | |
593 | 1 | |
126 | 1 | |
93 | 1 | |
78 | 1 | |
75 | 1 | |
74 | 2 | |
67 | 2 | |
58 | 1 |
설비명 | 수량 | 본사 | 1호선 | 2호선 | 3호선 | 4호선 | |
---|---|---|---|---|---|---|---|
설비명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
수량 | 1.000 | 1.000 | 0.579 | 1.000 | 1.000 | 1.000 | 1.000 |
본사 | 1.000 | 0.579 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
1호선 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
2호선 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
3호선 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
4호선 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
본사 | 단위 | |
---|---|---|
본사 | 1.000 | 1.000 |
단위 | 1.000 | 1.000 |
수량 | 1호선 | 2호선 | 3호선 | 4호선 | 단위 | 본사 | |
---|---|---|---|---|---|---|---|
수량 | 1.000 | 0.979 | 0.999 | 0.959 | 0.993 | 1.000 | 0.540 |
1호선 | 0.979 | 1.000 | 0.976 | 0.909 | 0.968 | 1.000 | 1.000 |
2호선 | 0.999 | 0.976 | 1.000 | 0.959 | 0.992 | 1.000 | 0.707 |
3호선 | 0.959 | 0.909 | 0.959 | 1.000 | 0.956 | 1.000 | 0.707 |
4호선 | 0.993 | 0.968 | 0.992 | 0.956 | 1.000 | 1.000 | 0.707 |
단위 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
본사 | 0.540 | 1.000 | 0.707 | 0.707 | 0.707 | 1.000 | 1.000 |
설비명 | 단위 | 수량 | 본사 | 1호선 | 2호선 | 3호선 | 4호선 | |
---|---|---|---|---|---|---|---|---|
0 | 총계 | 대 | 6524 | 14 | 724 | 2983 | 1408 | 1395 |
1 | 개집표기 턴스타일게이트 | 대 | 2700 | 1 | 284 | 1260 | 562 | 593 |
2 | 개집표기 슬림게이트 | 대 | 455 | <NA> | 80 | 233 | 64 | 78 |
3 | 개집표기 장애인게이트 | 대 | 10 | <NA> | 4 | 4 | <NA> | 2 |
4 | 개집표기 스피드게이트 | 대 | 334 | <NA> | 34 | 153 | 73 | 74 |
5 | 계 | <NA> | 3499 | 1 | 402 | 1650 | 699 | 747 |
6 | 센터시스템 | 대 | 1 | 1 | <NA> | <NA> | <NA> | <NA> |
7 | NMS서브메니저 | 대 | 1 | 1 | <NA> | <NA> | <NA> | <NA> |
8 | PC보안원격제어서버 | 대 | 1 | 1 | <NA> | <NA> | <NA> | <NA> |
9 | 원격정비시스템 | 대 | 7 | 1 | <NA> | 3 | 2 | 1 |
설비명 | 단위 | 수량 | 본사 | 1호선 | 2호선 | 3호선 | 4호선 | |
---|---|---|---|---|---|---|---|---|
18 | 유인충전기 | 대 | 270 | <NA> | 23 | 118 | 71 | 67 |
19 | 휴대용정산기 | 대 | 292 | <NA> | 31 | 122 | 72 | 67 |
20 | 발매기 | 대 | 601 | <NA> | 75 | 261 | 139 | 126 |
21 | 환급기 | 대 | 432 | <NA> | 51 | 189 | 99 | 93 |
22 | 무인정산기 | 대 | 335 | <NA> | 37 | 151 | 73 | 74 |
23 | 판매기 | 대 | 199 | <NA> | 18 | 94 | 43 | 44 |
24 | 발권기 | 대 | 17 | <NA> | 3 | 7 | 3 | 4 |
25 | CCTV카메라 | 대 | 332 | <NA> | 36 | 150 | 71 | 75 |
26 | CCTV모니터 | 대 | 227 | <NA> | 25 | 103 | 58 | 41 |
27 | 무정전전원장치 | 대 | 39 | 1 | 1 | 23 | 7 | 7 |