Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 51 |
Missing cells | 9 |
Missing cells (%) | 4.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 37.6 B |
Variable types
Text | 1 |
---|---|
Numeric | 3 |
Dataset
Description | 역별 KTX 하행 여객 승하차 실적 입니다. |
---|---|
Author | 한국철도공사 |
URL | https://www.data.go.kr/data/15068468/fileData.do |
Reproduction
Analysis started | 2023-12-12 13:05:15.302579 |
---|---|
Analysis finished | 2023-12-12 13:05:17.106975 |
Duration | 1.8 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
역명
Text
UNIQUE
 
Distinct | 51 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 540.0 B |
Value | Count | Frequency (%) |
강릉 | 1 | 2.0% |
순천 | 1 | 2.0% |
양평 | 1 | 2.0% |
여수엑스포 | 1 | 2.0% |
여천 | 1 | 2.0% |
영등포 | 1 | 2.0% |
오송 | 1 | 2.0% |
용산 | 1 | 2.0% |
울산 | 1 | 2.0% |
익산 | 1 | 2.0% |
Other values (41) | 41 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 8 | 6.4% |
주 | 6 | 4.8% |
천 | 6 | 4.8% |
포 | 5 | 4.0% |
구 | 5 | 4.0% |
원 | 4 | 3.2% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (60) | 79 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 123 | |
Uppercase Letter | 1 | 0.8% |
Decimal Number | 1 | 0.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 8 | 6.5% |
주 | 6 | 4.9% |
천 | 6 | 4.9% |
포 | 5 | 4.1% |
구 | 5 | 4.1% |
원 | 4 | 3.3% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (58) | 77 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 1 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 123 | |
Latin | 1 | 0.8% |
Common | 1 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 8 | 6.5% |
주 | 6 | 4.9% |
천 | 6 | 4.9% |
포 | 5 | 4.1% |
구 | 5 | 4.1% |
원 | 4 | 3.3% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (58) | 77 |
Latin
Value | Count | Frequency (%) |
T | 1 |
Common
Value | Count | Frequency (%) |
2 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 123 | |
ASCII | 2 | 1.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 8 | 6.5% |
주 | 6 | 4.9% |
천 | 6 | 4.9% |
포 | 5 | 4.1% |
구 | 5 | 4.1% |
원 | 4 | 3.3% |
진 | 3 | 2.4% |
공 | 3 | 2.4% |
창 | 3 | 2.4% |
항 | 3 | 2.4% |
Other values (58) | 77 |
ASCII
Value | Count | Frequency (%) |
T | 1 | |
2 | 1 |
승차
Real number (ℝ)
MISSING
 
Distinct | 46 |
---|---|
Distinct (%) | 100.0% |
Missing | 5 |
Missing (%) | 9.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 723732.61 |
Minimum | 1 |
---|---|
Maximum | 13780902 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 947.75 |
Q1 | 12429.25 |
median | 35051 |
Q3 | 233942.25 |
95-th percentile | 3813676 |
Maximum | 13780902 |
Range | 13780901 |
Interquartile range (IQR) | 221513 |
Descriptive statistics
Standard deviation | 2216096.6 |
---|---|
Coefficient of variation (CV) | 3.0620378 |
Kurtosis | 27.945271 |
Mean | 723732.61 |
Median Absolute Deviation (MAD) | 33306 |
Skewness | 5.0065796 |
Sum | 33291700 |
Variance | 4.911084 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
130290 | 1 | 2.0% |
32307 | 1 | 2.0% |
476 | 1 | 2.0% |
147009 | 1 | 2.0% |
932884 | 1 | 2.0% |
4908924 | 1 | 2.0% |
178781 | 1 | 2.0% |
382861 | 1 | 2.0% |
59722 | 1 | 2.0% |
14696 | 1 | 2.0% |
Other values (36) | 36 | |
(Missing) | 5 | 9.8% |
Value | Count | Frequency (%) |
1 | 1 | |
476 | 1 | |
830 | 1 | |
1301 | 1 | |
2189 | 1 | |
2572 | 1 | |
4015 | 1 | |
5371 | 1 | |
8335 | 1 | |
8570 | 1 |
Value | Count | Frequency (%) |
13780902 | 1 | |
4908924 | 1 | |
4527436 | 1 | |
1672396 | 1 | |
1555762 | 1 | |
1527006 | 1 | |
948233 | 1 | |
932884 | 1 | |
770568 | 1 | |
548810 | 1 |
하차
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 3.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 679422.45 |
Minimum | 64 |
---|---|
Maximum | 5742556 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 64 |
---|---|
5-th percentile | 610.6 |
Q1 | 52820 |
median | 206069 |
Q3 | 711150 |
95-th percentile | 2751429.4 |
Maximum | 5742556 |
Range | 5742492 |
Interquartile range (IQR) | 658330 |
Descriptive statistics
Standard deviation | 1152269.9 |
---|---|
Coefficient of variation (CV) | 1.695955 |
Kurtosis | 9.1208762 |
Mean | 679422.45 |
Median Absolute Deviation (MAD) | 180712 |
Skewness | 2.8804005 |
Sum | 33291700 |
Variance | 1.3277259 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
735713 | 1 | 2.0% |
536578 | 1 | 2.0% |
70357 | 1 | 2.0% |
527737 | 1 | 2.0% |
212298 | 1 | 2.0% |
259 | 1 | 2.0% |
1799898 | 1 | 2.0% |
2653 | 1 | 2.0% |
1930224 | 1 | 2.0% |
959664 | 1 | 2.0% |
Other values (39) | 39 | |
(Missing) | 2 | 3.9% |
Value | Count | Frequency (%) |
64 | 1 | |
125 | 1 | |
259 | 1 | |
1138 | 1 | |
2653 | 1 | |
2669 | 1 | |
33887 | 1 | |
34021 | 1 | |
36883 | 1 | |
37215 | 1 |
Value | Count | Frequency (%) |
5742556 | 1 | |
4468446 | 1 | |
3245951 | 1 | |
2009647 | 1 | |
1930224 | 1 | |
1801251 | 1 | |
1799898 | 1 | |
1561776 | 1 | |
1085362 | 1 | |
959664 | 1 |
인키로
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 3.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6355107 × 108 |
Minimum | 18483 |
---|---|
Maximum | 2.0313909 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 591.0 B |
Quantile statistics
Minimum | 18483 |
---|---|
5-th percentile | 126299.6 |
Q1 | 8981125 |
median | 49928653 |
Q3 | 1.6737027 × 108 |
95-th percentile | 5.2963179 × 108 |
Maximum | 2.0313909 × 109 |
Range | 2.0313724 × 109 |
Interquartile range (IQR) | 1.5838914 × 108 |
Descriptive statistics
Standard deviation | 3.2833977 × 108 |
---|---|
Coefficient of variation (CV) | 2.0075672 |
Kurtosis | 22.421849 |
Mean | 1.6355107 × 108 |
Median Absolute Deviation (MAD) | 49254337 |
Skewness | 4.3297967 |
Sum | 8.0140024 × 109 |
Variance | 1.07807 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
148773512 | 1 | 2.0% |
139645375 | 1 | 2.0% |
6731440 | 1 | 2.0% |
167370269 | 1 | 2.0% |
71421641 | 1 | 2.0% |
75646 | 1 | 2.0% |
265643316 | 1 | 2.0% |
674316 | 1 | 2.0% |
522852816 | 1 | 2.0% |
164423391 | 1 | 2.0% |
Other values (39) | 39 | |
(Missing) | 2 | 3.9% |
Value | Count | Frequency (%) |
18483 | 1 | |
28216 | 1 | |
75646 | 1 | |
202280 | 1 | |
534305 | 1 | |
674316 | 1 | |
5646955 | 1 | |
6731440 | 1 | |
7324595 | 1 | |
7364105 | 1 |
Value | Count | Frequency (%) |
2031390876 | 1 | |
945454740 | 1 | |
534151106 | 1 | |
522852816 | 1 | |
520210887 | 1 | |
330951936 | 1 | |
289659833 | 1 | |
265643316 | 1 | |
256916706 | 1 | |
211995418 | 1 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
역명 | 1.000 | 1.000 | 1.000 | 1.000 |
승차 | 1.000 | 1.000 | 0.558 | 0.364 |
하차 | 1.000 | 0.558 | 1.000 | 0.915 |
인키로 | 1.000 | 0.364 | 0.915 | 1.000 |
승차 | 하차 | 인키로 | |
---|---|---|---|
승차 | 1.000 | 0.085 | 0.053 |
하차 | 0.085 | 1.000 | 0.979 |
인키로 | 0.053 | 0.979 | 1.000 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
0 | 강릉 | <NA> | 1561776 | 330951936 |
1 | 검암 | 21662 | 64 | 18483 |
2 | 경산 | 8335 | 36883 | 8791331 |
3 | 계룡 | 11342 | 98720 | 18797364 |
4 | 곡성 | 5371 | 37215 | 7364105 |
5 | 공주 | 31053 | 52820 | 7682837 |
6 | 광명 | 4527436 | 164049 | 41477521 |
7 | 광주송정 | 111645 | 2009647 | 534151106 |
8 | 구례구 | 2189 | 34021 | 9303339 |
9 | 구포 | 2572 | 486056 | 145176697 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
41 | 진영 | 830 | 99760 | 29759961 |
42 | 진주 | <NA> | 164421 | 48877496 |
43 | 창원 | 1301 | 179394 | 55730230 |
44 | 창원중앙 | 13185 | 763394 | 211995418 |
45 | 천안아산 | 1555762 | 1801251 | 256916706 |
46 | 청량리 | 948233 | 2669 | 534305 |
47 | 평창 | 23990 | 100967 | 12532567 |
48 | 포항 | <NA> | 1085362 | 289659833 |
49 | 행신 | 770568 | <NA> | <NA> |
50 | 횡성 | 20533 | 63901 | 7324595 |