Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 51 |
Missing cells | 14 |
Missing cells (%) | 6.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 37.6 B |
Variable types
Text | 1 |
---|---|
Numeric | 3 |
Dataset
Description | 역별 KTX 상행 여객 승하차 실적 입니다. |
---|---|
Author | 한국철도공사 |
URL | https://www.data.go.kr/data/15068465/fileData.do |
Reproduction
Analysis started | 2023-12-12 04:50:56.323703 |
---|---|
Analysis finished | 2023-12-12 04:50:58.272611 |
Duration | 1.95 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
역명
Text
UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 540.0 B |
Value | Count | Frequency (%) |
강릉 | 1 | 2.0% |
순천 | 1 | 2.0% |
양평 | 1 | 2.0% |
여수엑스포 | 1 | 2.0% |
여천 | 1 | 2.0% |
영등포 | 1 | 2.0% |
오송 | 1 | 2.0% |
용산 | 1 | 2.0% |
울산 | 1 | 2.0% |
익산 | 1 | 2.0% |
Other values (41) | 41 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 8 | 6.4% |
주 | 6 | 4.8% |
천 | 6 | 4.8% |
포 | 5 | 4.0% |
구 | 5 | 4.0% |
원 | 4 | 3.2% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (60) | 79 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 123 | |
Uppercase Letter | 1 | 0.8% |
Decimal Number | 1 | 0.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 8 | 6.5% |
주 | 6 | 4.9% |
천 | 6 | 4.9% |
포 | 5 | 4.1% |
구 | 5 | 4.1% |
원 | 4 | 3.3% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (58) | 77 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 1 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 123 | |
Latin | 1 | 0.8% |
Common | 1 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 8 | 6.5% |
주 | 6 | 4.9% |
천 | 6 | 4.9% |
포 | 5 | 4.1% |
구 | 5 | 4.1% |
원 | 4 | 3.3% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (58) | 77 |
Latin
Value | Count | Frequency (%) |
T | 1 |
Common
Value | Count | Frequency (%) |
2 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 123 | |
ASCII | 2 | 1.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 8 | 6.5% |
주 | 6 | 4.9% |
천 | 6 | 4.9% |
포 | 5 | 4.1% |
구 | 5 | 4.1% |
원 | 4 | 3.3% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (58) | 77 |
ASCII
Value | Count | Frequency (%) |
T | 1 | |
2 | 1 |
승차
Real number (ℝ)
MISSING
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 3.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 688765.18 |
Minimum | 76 |
---|---|
Maximum | 5827046 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 76 |
---|---|
5-th percentile | 985.4 |
Q1 | 54376 |
median | 216129 |
Q3 | 706879 |
95-th percentile | 2727537 |
Maximum | 5827046 |
Range | 5826970 |
Interquartile range (IQR) | 652503 |
Descriptive statistics
Standard deviation | 1165156.2 |
---|---|
Coefficient of variation (CV) | 1.6916595 |
Kurtosis | 9.2130342 |
Mean | 688765.18 |
Median Absolute Deviation (MAD) | 188812 |
Skewness | 2.8883152 |
Sum | 33749494 |
Variance | 1.3575889 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
752729 | 1 | 2.0% |
541243 | 1 | 2.0% |
90776 | 1 | 2.0% |
506540 | 1 | 2.0% |
219640 | 1 | 2.0% |
249 | 1 | 2.0% |
1907081 | 1 | 2.0% |
14101 | 1 | 2.0% |
1957545 | 1 | 2.0% |
989507 | 1 | 2.0% |
Other values (39) | 39 | |
(Missing) | 2 | 3.9% |
Value | Count | Frequency (%) |
76 | 1 | |
249 | 1 | |
431 | 1 | |
1817 | 1 | |
6852 | 1 | |
14101 | 1 | |
32114 | 1 | |
33366 | 1 | |
34533 | 1 | |
40624 | 1 |
Value | Count | Frequency (%) |
5827046 | 1 | |
4527555 | 1 | |
3215735 | 1 | |
1995240 | 1 | |
1957545 | 1 | |
1907081 | 1 | |
1771169 | 1 | |
1634532 | 1 | |
1115682 | 1 | |
989507 | 1 |
하차
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 45 |
---|---|
Distinct (%) | 100.0% |
Missing | 6 |
Missing (%) | 11.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 749988.76 |
Minimum | 697 |
---|---|
Maximum | 14147344 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 697 |
---|---|
5-th percentile | 1528.6 |
Q1 | 11046 |
median | 37335 |
Q3 | 245550 |
95-th percentile | 3991137.2 |
Maximum | 14147344 |
Range | 14146647 |
Interquartile range (IQR) | 234504 |
Descriptive statistics
Standard deviation | 2297930.2 |
---|---|
Coefficient of variation (CV) | 3.0639529 |
Kurtosis | 27.435851 |
Mean | 749988.76 |
Median Absolute Deviation (MAD) | 35910 |
Skewness | 4.9682504 |
Sum | 33749494 |
Variance | 5.2804833 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
123266 | 1 | 2.0% |
32011 | 1 | 2.0% |
697 | 1 | 2.0% |
149077 | 1 | 2.0% |
943731 | 1 | 2.0% |
5134368 | 1 | 2.0% |
199254 | 1 | 2.0% |
373365 | 1 | 2.0% |
74752 | 1 | 2.0% |
13990 | 1 | 2.0% |
Other values (35) | 35 | |
(Missing) | 6 | 11.8% |
Value | Count | Frequency (%) |
697 | 1 | |
1294 | 1 | |
1425 | 1 | |
1943 | 1 | |
3640 | 1 | |
4086 | 1 | |
5448 | 1 | |
5500 | 1 | |
8058 | 1 | |
10015 | 1 |
Value | Count | Frequency (%) |
14147344 | 1 | |
5134368 | 1 | |
4574649 | 1 | |
1657090 | 1 | |
1582696 | 1 | |
1475801 | 1 | |
943731 | 1 | |
818527 | 1 | |
721378 | 1 | |
578420 | 1 |
인키로
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 45 |
---|---|
Distinct (%) | 100.0% |
Missing | 6 |
Missing (%) | 11.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.8035968 × 108 |
Minimum | 234486 |
---|---|
Maximum | 3.7494927 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 234486 |
---|---|
5-th percentile | 460416.4 |
Q1 | 2205368 |
median | 6308384 |
Q3 | 53973277 |
95-th percentile | 9.9228424 × 108 |
Maximum | 3.7494927 × 109 |
Range | 3.7492582 × 109 |
Interquartile range (IQR) | 51767909 |
Descriptive statistics
Standard deviation | 6.0283636 × 108 |
---|---|
Coefficient of variation (CV) | 3.3424121 |
Kurtosis | 29.292309 |
Mean | 1.8035968 × 108 |
Median Absolute Deviation (MAD) | 5922364 |
Skewness | 5.1734486 |
Sum | 8.1161854 × 109 |
Variance | 3.6341167 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24926453 | 1 | 2.0% |
3062668 | 1 | 2.0% |
234486 | 1 | 2.0% |
43540653 | 1 | 2.0% |
139283355 | 1 | 2.0% |
1305008712 | 1 | 2.0% |
53973277 | 1 | 2.0% |
63970243 | 1 | 2.0% |
16873772 | 1 | 2.0% |
3765912 | 1 | 2.0% |
Other values (35) | 35 | |
(Missing) | 6 | 11.8% |
Value | Count | Frequency (%) |
234486 | 1 | |
386020 | 1 | |
442688 | 1 | |
531330 | 1 | |
969741 | 1 | |
1087206 | 1 | |
1088340 | 1 | |
1298570 | 1 | |
1321289 | 1 | |
1674647 | 1 |
Value | Count | Frequency (%) |
3749492675 | 1 | |
1305008712 | 1 | |
1156636746 | 1 | |
334874235 | 1 | |
265572789 | 1 | |
210497000 | 1 | |
204827799 | 1 | |
163860144 | 1 | |
155406325 | 1 | |
139283355 | 1 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
역명 | 1.000 | 1.000 | 1.000 | 1.000 |
승차 | 1.000 | 1.000 | 0.484 | 0.000 |
하차 | 1.000 | 0.484 | 1.000 | 1.000 |
인키로 | 1.000 | 0.000 | 1.000 | 1.000 |
승차 | 하차 | 인키로 | |
---|---|---|---|
승차 | 1.000 | 0.182 | 0.199 |
하차 | 0.182 | 1.000 | 0.981 |
인키로 | 0.199 | 0.981 | 1.000 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
0 | 강릉 | 1634532 | <NA> | <NA> |
1 | 검암 | 76 | 20052 | 5791027 |
2 | 경산 | 40624 | 5448 | 1298570 |
3 | 계룡 | 115373 | 10513 | 2001790 |
4 | 곡성 | 32114 | 5500 | 1088340 |
5 | 공주 | 54376 | 30285 | 4405050 |
6 | 광명 | 169092 | 4574649 | 1156636746 |
7 | 광주송정 | 1995240 | 114212 | 30356807 |
8 | 구례구 | 33366 | 1943 | 531330 |
9 | 구포 | 476842 | 3640 | 1087206 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
41 | 진영 | 109294 | 1294 | 386020 |
42 | 진주 | 157353 | <NA> | <NA> |
43 | 창원 | 182919 | 1425 | 442688 |
44 | 창원중앙 | 750092 | 11773 | 3269376 |
45 | 천안아산 | 1771169 | 1475801 | 210497000 |
46 | 청량리 | 6852 | 818527 | 163860144 |
47 | 평창 | 92503 | 24671 | 3062297 |
48 | 포항 | 1115682 | <NA> | <NA> |
49 | 행신 | <NA> | 721378 | 204827799 |
50 | 횡성 | 66002 | 19240 | 2205368 |