Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 810.5 KiB |
Average record size in memory | 83.0 B |
Variable types
Text | 1 |
---|---|
Categorical | 5 |
Numeric | 3 |
Dataset
Description | Records of unprocessed transactions, i.e. fares that were not paid through the fare gates, covering March 2006 through September 2023. The unprocessed status is reported monthly by station and card type. |
---|---|
Author | 대전교통공사 |
URL | https://www.data.go.kr/data/15122861/fileData.do |
Alerts
카드사구분 is highly overall correlated with 선후불구분 | High correlation |
선후불구분 is highly overall correlated with 카드사구분 | High correlation |
선후불구분 is highly imbalanced (52.0%) | Imbalance |
거래금액 is highly skewed (γ1 = 23.71684396) | Skewed |
선불잔액 is highly skewed (γ1 = 21.96614883) | Skewed |
거래금액 has 3223 (32.2%) zeros | Zeros |
선불잔액 has 8938 (89.4%) zeros | Zeros |
후불잔액 has 1341 (13.4%) zeros | Zeros |
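The 52.0% imbalance figure for 선후불구분 can be reproduced from the class counts listed later in this report (선불 = 8965, 후불 = 1035). The sketch below assumes the score is one minus the normalized Shannon entropy of the class distribution; that formula is an assumption about how the profiler derives the number, although it does reproduce the reported value. The function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    Assumed formula, not confirmed by the report itself.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    # Shannon entropy in bits, skipping empty classes.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    # The maximum entropy for n_classes equally likely classes is log2(n_classes).
    return 1.0 - entropy / math.log2(n_classes)


# Class counts for 선후불구분 as listed in this report.
score = imbalance_score([8965, 1035])
print(f"{score:.1%}")  # 52.0%
```

A perfectly balanced two-class column scores 0.0 under this definition, so 52.0% reflects the roughly 9:1 split between 선불 and 후불.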
Reproduction
Analysis started | 2023-12-12 03:57:18.034114 |
---|---|
Analysis finished | 2023-12-12 03:57:21.416125 |
Duration | 3.38 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
정산일자
Text
Distinct | 211 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
nov-18 | 887 | 8.9% |
feb-11 | 437 | 4.4% |
jan-14 | 340 | 3.4% |
aug-16 | 292 | 2.9% |
nov-11 | 243 | 2.4% |
jun-11 | 196 | 2.0% |
mar-11 | 163 | 1.6% |
mar-19 | 150 | 1.5% |
dec-07 | 144 | 1.4% |
may-07 | 135 | 1.4% |
Other values (201) | 7013 | 70.1% |
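정산일자 is profiled as Text because its values are month labels such as `Jul-09` rather than parsed dates. A minimal sketch of converting such a label to a year and month is shown here, assuming the `Mon-YY` pattern seen in the sample rows and an English month-name locale. The helper names and the `Mar-06` / `Sep-23` endpoint labels are hypothetical, derived from the period stated in the dataset description.

```python
from datetime import datetime


def parse_settlement_month(label):
    """Parse a label such as 'Jul-09' or 'nov-18' into a (year, month) tuple.

    %b matches abbreviated month names in the current locale (English by default);
    %y maps two-digit years 00-68 to 2000-2068.
    """
    parsed = datetime.strptime(label.strip().title(), "%b-%y")
    return parsed.year, parsed.month


def months_inclusive(start, end):
    """Count calendar months from start to end, both (year, month), inclusive."""
    (y0, m0), (y1, m1) = start, end
    return (y1 - y0) * 12 + (m1 - m0) + 1


first = parse_settlement_month("Mar-06")  # start of the period in the description
last = parse_settlement_month("Sep-23")   # end of the period in the description
print(months_inclusive(first, last))      # 211
```

The 211 months in that range match the 211 distinct values reported for this column, which suggests every month in the period appears at least once.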
Most occurring characters
Value | Count | Frequency (%) |
- | 10000 | 16.7% |
1 | 7716 | 12.9% |
0 | 3206 | 5.3% |
a | 2520 | 4.2% |
J | 2231 | 3.7% |
e | 2227 | 3.7% |
2 | 2223 | 3.7% |
u | 2218 | 3.7% |
8 | 2122 | 3.5% |
r | 1777 | 3.0% |
Other values (23) | 23760 | 39.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20000 | 33.3% |
Lowercase Letter | 20000 | 33.3% |
Dash Punctuation | 10000 | 16.7% |
Uppercase Letter | 10000 | 16.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2520 | 12.6% |
e | 2227 | 11.1% |
u | 2218 | 11.1% |
r | 1777 | 8.9% |
o | 1677 | 8.4% |
v | 1677 | 8.4% |
n | 1606 | 8.0% |
p | 1366 | 6.8% |
c | 1273 | 6.4% |
b | 946 | 4.7% |
Other values (4) | 2713 | 13.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 7716 | 38.6% |
0 | 3206 | 16.0% |
2 | 2223 | 11.1% |
8 | 2122 | 10.6% |
9 | 1325 | 6.6% |
7 | 1095 | 5.5% |
6 | 934 | 4.7% |
3 | 543 | 2.7% |
4 | 504 | 2.5% |
5 | 332 | 1.7% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 2231 | 22.3% |
N | 1677 | 16.8% |
M | 1659 | 16.6% |
A | 1650 | 16.5% |
F | 946 | 9.5% |
D | 717 | 7.2% |
S | 564 | 5.6% |
O | 556 | 5.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30000 | 50.0% |
Latin | 30000 | 50.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2520 | 8.4% |
J | 2231 | 7.4% |
e | 2227 | 7.4% |
u | 2218 | 7.4% |
r | 1777 | 5.9% |
N | 1677 | 5.6% |
o | 1677 | 5.6% |
v | 1677 | 5.6% |
M | 1659 | 5.5% |
A | 1650 | 5.5% |
Other values (12) | 10687 | 35.6% |
Common
Value | Count | Frequency (%) |
- | 10000 | 33.3% |
1 | 7716 | 25.7% |
0 | 3206 | 10.7% |
2 | 2223 | 7.4% |
8 | 2122 | 7.1% |
9 | 1325 | 4.4% |
7 | 1095 | 3.6% |
6 | 934 | 3.1% |
3 | 543 | 1.8% |
4 | 504 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 60000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 10000 | 16.7% |
1 | 7716 | 12.9% |
0 | 3206 | 5.3% |
a | 2520 | 4.2% |
J | 2231 | 3.7% |
e | 2227 | 3.7% |
2 | 2223 | 3.7% |
u | 2218 | 3.7% |
8 | 2122 | 3.5% |
r | 1777 | 3.0% |
Other values (23) | 23760 | 39.6% |
역이름
Categorical
Distinct | 22 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
대전역 | 922 |
---|---|
유성온천역 | 695 |
정부청사역 | 682 |
서대전네거리역 | 643 |
시청역 | 633 |
Other values (17) | 6425 |
Length
Max length | 7 |
---|---|
Median length | 3 |
Mean length | 3.7749 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 현충원역 |
---|---|
2nd row | 월드컵경기장역 |
3rd row | 용문역 |
4th row | 대동역 |
5th row | 대전역 |
Common Values
Value | Count | Frequency (%) |
대전역 | 922 | 9.2% |
유성온천역 | 695 | 7.0% |
정부청사역 | 682 | 6.8% |
서대전네거리역 | 643 | 6.4% |
시청역 | 633 | 6.3% |
용문역 | 601 | 6.0% |
탄방역 | 596 | 6.0% |
중앙로역 | 593 | 5.9% |
반석역 | 437 | 4.4% |
월평역 | 435 | 4.3% |
Other values (12) | 3763 | 37.6% |
선후불구분
Categorical
Alerts: High correlation, Imbalance
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
선불 | 8965 |
---|---|
후불 | 1035 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 선불 |
---|---|
2nd row | 선불 |
3rd row | 선불 |
4th row | 선불 |
5th row | 선불 |
Common Values
Value | Count | Frequency (%) |
선불 | 8965 | |
후불 | 1035 | 10.3% |
카드사구분
Categorical
Alerts: High correlation
Distinct | 17 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
교통카드운임(국민) | 1894 |
---|---|
교통카드운임(BC) | 1188 |
교통카드운임(하나) | 1046 |
교통카드운임(신한) | 1022 |
교통카드운임(삼성) | 952 |
Other values (12) | 3898 |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 10.0094 |
Min length | 9 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 교통카드운임(국민) |
---|---|
2nd row | 교통카드운임(신한) |
3rd row | 교통카드운임(LG) |
4th row | 교통카드운임(BC) |
5th row | 교통카드운임(신한) |
Common Values
Value | Count | Frequency (%) |
교통카드운임(국민) | 1894 | 18.9% |
교통카드운임(BC) | 1188 | 11.9% |
교통카드운임(하나) | 1046 | 10.5% |
교통카드운임(신한) | 1022 | 10.2% |
교통카드운임(삼성) | 952 | 9.5% |
교통카드운임(현대) | 601 | 6.0% |
교통카드운임(농협) | 592 | 5.9% |
교통카드운임(LG) | 581 | 5.8% |
한꿈이충전(선불) | 513 | 5.1% |
교통카드운임(외환) | 510 | 5.1% |
Other values (7) | 1101 | 11.0% |
거래금액
Real number (ℝ)
Alerts: Skewed, Zeros
Distinct | 472 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4370.9773 |
Minimum | 0 |
---|---|
Maximum | 1345160 |
Zeros | 3223 |
Zeros (%) | 32.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1100 |
Q3 | 1250 |
95-th percentile | 13750 |
Maximum | 1345160 |
Range | 1345160 |
Interquartile range (IQR) | 1250 |
Descriptive statistics
Standard deviation | 25363.355 |
---|---|
Coefficient of variation (CV) | 5.8026738 |
Kurtosis | 967.93883 |
Mean | 4370.9773 |
Median Absolute Deviation (MAD) | 1100 |
Skewness | 23.716844 |
Sum | 43709773 |
Variance | 6.432998 × 10⁸ |
Monotonicity | Not monotonic |
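The descriptive statistics above are related by simple formulas: variance is the square of the standard deviation, and the coefficient of variation (CV) is the standard deviation divided by the mean. The sketch below recomputes them, assuming the pandas conventions of a sample standard deviation (n - 1 denominator) and adjusted Fisher-Pearson skewness. That choice of conventions is an assumption, and `describe` and the toy list are hypothetical.

```python
import math


def describe(values):
    """Mean, sample std/variance, CV and adjusted skewness for a list of numbers.

    Needs at least three values, a non-zero mean and a non-zero variance.
    """
    n = len(values)
    mean = sum(values) / n
    # Sample variance: divide by n - 1 (assumed convention).
    variance = sum((x - mean) ** 2 for x in values) / (n - 1)
    std = math.sqrt(variance)
    # Biased central moments, then the adjusted Fisher-Pearson correction.
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    skewness = (m3 / m2 ** 1.5) * math.sqrt(n * (n - 1)) / (n - 2)
    return {"mean": mean, "std": std, "variance": variance,
            "cv": std / mean, "skewness": skewness}


# Toy zero-heavy column: three zeros and one large value give a positive skew.
toy = describe([0, 0, 0, 10])

# Cross-check the figures reported for 거래금액.
reported_mean = 4370.9773
reported_std = 25363.355
reported_cv = reported_std / reported_mean   # about 5.8027
reported_variance = reported_std ** 2        # about 6.433e8
```

Both derived values agree with the CV and variance rows of the table, which is a quick way to confirm that an exponent such as 10⁸ was read correctly.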
Common Values
Value | Count | Frequency (%) |
0 | 3223 | 32.2% |
1250 | 2226 | 22.3% |
950 | 600 | 6.0% |
1100 | 554 | 5.5% |
100 | 535 | 5.3% |
2500 | 373 | 3.7% |
1350 | 318 | 3.2% |
200 | 124 | 1.2% |
1900 | 118 | 1.2% |
2200 | 99 | 1.0% |
Other values (462) | 1830 | 18.3% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 3223 | 32.2% |
1 | 3 | < 0.1% |
80 | 8 | 0.1% |
100 | 535 | 5.3% |
150 | 10 | 0.1% |
180 | 1 | < 0.1% |
200 | 124 | 1.2% |
230 | 3 | < 0.1% |
250 | 4 | < 0.1% |
280 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
1345160 | 1 | < 0.1% |
867910 | 1 | < 0.1% |
445960 | 1 | < 0.1% |
432250 | 1 | < 0.1% |
354830 | 1 | < 0.1% |
345610 | 1 | < 0.1% |
336300 | 1 | < 0.1% |
329650 | 1 | < 0.1% |
328700 | 1 | < 0.1% |
320090 | 1 | < 0.1% |
선불잔액
Real number (ℝ)
Alerts: Skewed, Zeros
Distinct | 872 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26315.325 |
Minimum | 0 |
---|---|
Maximum | 11987730 |
Zeros | 8938 |
Zeros (%) | 89.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 28750 |
Maximum | 11987730 |
Range | 11987730 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 253011.28 |
---|---|
Coefficient of variation (CV) | 9.6145984 |
Kurtosis | 740.4862 |
Mean | 26315.325 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 21.966149 |
Sum | 2.6315325 × 10⁸ |
Variance | 6.401471 × 10¹⁰ |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 8938 | 89.4% |
28750 | 37 | 0.4% |
1800 | 8 | 0.1% |
1250 | 6 | 0.1% |
27500 | 6 | 0.1% |
2500 | 5 | 0.1% |
4050 | 5 | 0.1% |
57500 | 5 | 0.1% |
4800 | 4 | < 0.1% |
2100 | 4 | < 0.1% |
Other values (862) | 982 | 9.8% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 8938 | 89.4% |
8 | 1 | < 0.1% |
50 | 1 | < 0.1% |
140 | 2 | < 0.1% |
150 | 1 | < 0.1% |
200 | 2 | < 0.1% |
230 | 1 | < 0.1% |
250 | 1 | < 0.1% |
300 | 1 | < 0.1% |
340 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
11987730 | 1 | < 0.1% |
8771804 | 1 | < 0.1% |
5642659 | 1 | < 0.1% |
5075220 | 1 | < 0.1% |
3976253 | 1 | < 0.1% |
3971150 | 1 | < 0.1% |
3844580 | 1 | < 0.1% |
3542323 | 1 | < 0.1% |
3464730 | 1 | < 0.1% |
3427790 | 1 | < 0.1% |
후불잔액
Real number (ℝ)
Alerts: Zeros
Distinct | 2160 |
---|---|
Distinct (%) | 21.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 74747.3 |
Minimum | 0 |
---|---|
Maximum | 6500564 |
Zeros | 1341 |
Zeros (%) | 13.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1250 |
median | 2600 |
Q3 | 19312.5 |
95-th percentile | 371730 |
Maximum | 6500564 |
Range | 6500564 |
Interquartile range (IQR) | 18062.5 |
Descriptive statistics
Standard deviation | 302859.69 |
---|---|
Coefficient of variation (CV) | 4.0517811 |
Kurtosis | 98.062928 |
Mean | 74747.3 |
Median Absolute Deviation (MAD) | 2600 |
Skewness | 8.3560124 |
Sum | 7.47473 × 10⁸ |
Variance | 9.1723995 × 10¹⁰ |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1250 | 1513 | 15.1% |
0 | 1341 | 13.4% |
950 | 487 | 4.9% |
2500 | 427 | 4.3% |
1100 | 394 | 3.9% |
1350 | 263 | 2.6% |
3750 | 178 | 1.8% |
1900 | 167 | 1.7% |
2200 | 101 | 1.0% |
2850 | 95 | 0.9% |
Other values (2150) | 5034 | 50.3% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 1341 | 13.4% |
450 | 3 | < 0.1% |
500 | 2 | < 0.1% |
550 | 1 | < 0.1% |
800 | 38 | 0.4% |
880 | 1 | < 0.1% |
900 | 1 | < 0.1% |
950 | 487 | 4.9% |
1050 | 61 | 0.6% |
1100 | 394 | 3.9% |
Maximum 10 values
Value | Count | Frequency (%) |
6500564 | 1 | < 0.1% |
6055594 | 1 | < 0.1% |
5184020 | 1 | < 0.1% |
4854167 | 1 | < 0.1% |
4767292 | 1 | < 0.1% |
4341828 | 1 | < 0.1% |
3949214 | 1 | < 0.1% |
3695426 | 1 | < 0.1% |
3531414 | 1 | < 0.1% |
3482600 | 1 | < 0.1% |
승하차구분
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
승차 | 6637 |
---|---|
하차 | 3363 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 승차 |
---|---|
2nd row | 하차 |
3rd row | 승차 |
4th row | 하차 |
5th row | 승차 |
Common Values
Value | Count | Frequency (%) |
승차 | 6637 | 66.4% |
하차 | 3363 | 33.6% |
환승구분
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
X | 8363 |
---|---|
O | 1637 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | X |
---|---|
2nd row | X |
3rd row | X |
4th row | X |
5th row | X |
Common Values
Value | Count | Frequency (%) |
X | 8363 | 83.6% |
O | 1637 | 16.4% |
Correlations
Variable | 역이름 | 선후불구분 | 카드사구분 | 거래금액 | 선불잔액 | 후불잔액 | 승하차구분 | 환승구분 |
---|---|---|---|---|---|---|---|---|
역이름 | 1.000 | 0.061 | 0.086 | 0.000 | 0.000 | 0.060 | 0.090 | 0.098 |
선후불구분 | 0.061 | 1.000 | 0.963 | 0.155 | 0.220 | 0.053 | 0.349 | 0.092 |
카드사구분 | 0.086 | 0.963 | 1.000 | 0.148 | 0.210 | 0.055 | 0.343 | 0.117 |
거래금액 | 0.000 | 0.155 | 0.148 | 1.000 | 0.857 | 0.515 | 0.151 | 0.037 |
선불잔액 | 0.000 | 0.220 | 0.210 | 0.857 | 1.000 | 0.000 | 0.066 | 0.000 |
후불잔액 | 0.060 | 0.053 | 0.055 | 0.515 | 0.000 | 1.000 | 0.120 | 0.031 |
승하차구분 | 0.090 | 0.349 | 0.343 | 0.151 | 0.066 | 0.120 | 1.000 | 0.295 |
환승구분 | 0.098 | 0.092 | 0.117 | 0.037 | 0.000 | 0.031 | 0.295 | 1.000 |
Variable | 승하차구분 | 환승구분 | 카드사구분 | 역이름 | 선후불구분 |
---|---|---|---|---|---|
승하차구분 | 1.000 | 0.191 | 0.308 | 0.071 | 0.227 |
환승구분 | 0.191 | 1.000 | 0.105 | 0.078 | 0.059 |
카드사구분 | 0.308 | 0.105 | 1.000 | 0.026 | 0.960 |
역이름 | 0.071 | 0.078 | 0.026 | 1.000 | 0.048 |
선후불구분 | 0.227 | 0.059 | 0.960 | 0.048 | 1.000 |
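A matrix restricted to categorical columns, like the one above, is commonly built from Cramér's V, a chi-squared-based association measure ranging from 0 to 1. The report does not name the measure, so treating it as Cramér's V is an assumption. The sketch below implements the bias-corrected variant with the standard library only; the function name and the toy data are hypothetical and are not drawn from the dataset.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Bias-corrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    if n < 2:
        return 0.0
    pair_counts = Counter(zip(xs, ys))
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    r, k = len(x_counts), len(y_counts)
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, chi2 / n - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return math.sqrt(phi2_corr / denom) if denom > 0 else 0.0


# Toy data: the second column is fully determined by the first.
dependent = cramers_v(["a", "b"] * 50, ["p", "q"] * 50)
# Toy data: every combination occurs equally often, so there is no association.
independent = cramers_v(["a", "a", "b", "b"] * 25, ["p", "q", "p", "q"] * 25)
```

A value such as the 0.960 between 카드사구분 and 선후불구분 sits close to the fully dependent case, which is consistent with each card issuer label belonging almost entirely to one payment type.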
Variable | 거래금액 | 선불잔액 | 후불잔액 | 역이름 | 선후불구분 | 카드사구분 | 승하차구분 | 환승구분 |
---|---|---|---|---|---|---|---|---|
거래금액 | 1.000 | 0.047 | 0.026 | 0.000 | 0.111 | 0.070 | 0.109 | 0.027 |
선불잔액 | 0.047 | 1.000 | -0.487 | 0.000 | 0.235 | 0.096 | 0.071 | 0.000 |
후불잔액 | 0.026 | -0.487 | 1.000 | 0.023 | 0.053 | 0.022 | 0.120 | 0.031 |
역이름 | 0.000 | 0.000 | 0.023 | 1.000 | 0.048 | 0.026 | 0.071 | 0.078 |
선후불구분 | 0.111 | 0.235 | 0.053 | 0.048 | 1.000 | 0.960 | 0.227 | 0.059 |
카드사구분 | 0.070 | 0.096 | 0.022 | 0.026 | 0.960 | 1.000 | 0.308 | 0.105 |
승하차구분 | 0.109 | 0.071 | 0.120 | 0.071 | 0.227 | 0.308 | 1.000 | 0.191 |
환승구분 | 0.027 | 0.000 | 0.031 | 0.078 | 0.059 | 0.105 | 0.191 | 1.000 |
First rows
Index | 정산일자 | 역이름 | 선후불구분 | 카드사구분 | 거래금액 | 선불잔액 | 후불잔액 | 승하차구분 | 환승구분 |
---|---|---|---|---|---|---|---|---|---|
2333 | Jul-09 | 현충원역 | 선불 | 교통카드운임(국민) | 950 | 0 | 950 | 승차 | X |
5211 | Jan-14 | 월드컵경기장역 | 선불 | 교통카드운임(신한) | 6600 | 0 | 18650 | 하차 | X |
1499 | Jun-08 | 용문역 | 선불 | 교통카드운임(LG) | 0 | 0 | 1900 | 승차 | X |
140 | Apr-06 | 대동역 | 선불 | 교통카드운임(BC) | 2400 | 0 | 23000 | 하차 | X |
6552 | Mar-17 | 대전역 | 선불 | 교통카드운임(신한) | 1250 | 0 | 1250 | 승차 | X |
1963 | Jan-09 | 서대전네거리역 | 선불 | 교통카드운임(국민) | 100 | 0 | 50000 | 승차 | X |
421 | Mar-07 | 지족역 | 선불 | 교통카드운임(BC) | 0 | 0 | 52000 | 승차 | X |
7240 | Jul-18 | 탄방역 | 선불 | 교통카드운임(하나) | 2500 | 0 | 7500 | 하차 | X |
9832 | Apr-21 | 용문역 | 선불 | 교통카드운임(하나) | 1250 | 0 | 1250 | 승차 | X |
3603 | Mar-11 | 용문역 | 선불 | 교통카드운임(국민) | 3800 | 0 | 117250 | 하차 | X |
Last rows
Index | 정산일자 | 역이름 | 선후불구분 | 카드사구분 | 거래금액 | 선불잔액 | 후불잔액 | 승하차구분 | 환승구분 |
---|---|---|---|---|---|---|---|---|---|
1307 | Apr-08 | 대전역 | 선불 | 교통카드운임(국민) | 200 | 0 | 3150 | 승차 | O |
239 | Jun-06 | 시청역 | 선불 | 교통카드운임(BC) | 0 | 0 | 6850 | 승차 | X |
10829 | Jun-23 | 중앙로역 | 선불 | 교통카드운임(신한) | 1250 | 0 | 1250 | 승차 | X |
7156 | May-18 | 유성온천역 | 후불 | 교통카드운임(유페이) | 0 | 9440 | 0 | 승차 | X |
5467 | Feb-15 | 대동역 | 선불 | 교통카드운임(삼성) | 0 | 0 | 5600 | 승차 | X |
931 | Nov-07 | 중앙로역 | 선불 | 교통카드운임(국민) | 0 | 0 | 55450 | 승차 | X |
7556 | Nov-18 | 대전역 | 선불 | 교통카드운임(신한) | 4350 | 0 | 2338392 | 승차 | X |
1152 | Jan-08 | 용문역 | 선불 | 교통카드운임(BC) | 100 | 0 | 19400 | 승차 | X |
3907 | Jun-11 | 시청역 | 선불 | 교통카드운임(외환) | 5700 | 0 | 74300 | 하차 | X |
3370 | Feb-11 | 정부청사역 | 선불 | 교통카드운임(하나) | 16150 | 0 | 356800 | 하차 | X |