Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 236 |
Missing cells | 40 |
Missing cells (%) | 4.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.2 KiB |
Average record size in memory | 35.6 B |
Variable types
Text | 1 |
---|---|
Numeric | 3 |
Dataset
Description | 역별 무궁화 상행 여객 승하차 실적 입니다. 역별 승차, 하차, 인키로 데이터로 구성되어 있습니다. |
---|---|
Author | 한국철도공사 |
URL | https://www.data.go.kr/data/15068479/fileData.do |
승차 is highly overall correlated with 하차 and 1 other fields | High correlation |
하차 is highly overall correlated with 승차 and 1 other fields | High correlation |
인키로 is highly overall correlated with 승차 and 1 other fields | High correlation |
승차 has 16 (6.8%) missing values | Missing |
하차 has 12 (5.1%) missing values | Missing |
인키로 has 12 (5.1%) missing values | Missing |
역명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 07:42:08.074243 |
---|---|
Analysis finished | 2023-12-12 07:42:09.456329 |
Duration | 1.38 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
역명
Text
UNIQUE
 
Distinct | 236 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.0 KiB |
Value | Count | Frequency (%) |
가평 | 1 | 0.4% |
의정부 | 1 | 0.4% |
전곡 | 1 | 0.4% |
용궁 | 1 | 0.4% |
용문 | 1 | 0.4% |
용산 | 1 | 0.4% |
웅천 | 1 | 0.4% |
원동 | 1 | 0.4% |
원주 | 1 | 0.4% |
월내 | 1 | 0.4% |
Other values (226) | 226 |
Most occurring characters
Value | Count | Frequency (%) |
천 | 26 | 4.9% |
동 | 18 | 3.4% |
산 | 17 | 3.2% |
주 | 16 | 3.0% |
양 | 12 | 2.3% |
원 | 12 | 2.3% |
성 | 11 | 2.1% |
신 | 11 | 2.1% |
사 | 8 | 1.5% |
강 | 8 | 1.5% |
Other values (171) | 387 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 526 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
천 | 26 | 4.9% |
동 | 18 | 3.4% |
산 | 17 | 3.2% |
주 | 16 | 3.0% |
양 | 12 | 2.3% |
원 | 12 | 2.3% |
성 | 11 | 2.1% |
신 | 11 | 2.1% |
사 | 8 | 1.5% |
강 | 8 | 1.5% |
Other values (171) | 387 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 526 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
천 | 26 | 4.9% |
동 | 18 | 3.4% |
산 | 17 | 3.2% |
주 | 16 | 3.0% |
양 | 12 | 2.3% |
원 | 12 | 2.3% |
성 | 11 | 2.1% |
신 | 11 | 2.1% |
사 | 8 | 1.5% |
강 | 8 | 1.5% |
Other values (171) | 387 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 526 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
천 | 26 | 4.9% |
동 | 18 | 3.4% |
산 | 17 | 3.2% |
주 | 16 | 3.0% |
양 | 12 | 2.3% |
원 | 12 | 2.3% |
성 | 11 | 2.1% |
신 | 11 | 2.1% |
사 | 8 | 1.5% |
강 | 8 | 1.5% |
Other values (171) | 387 |
승차
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 220 |
---|---|
Distinct (%) | 100.0% |
Missing | 16 |
Missing (%) | 6.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127359.32 |
Minimum | 1 |
---|---|
Maximum | 1998931 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 360.45 |
Q1 | 4382.5 |
median | 23487 |
Q3 | 90827.25 |
95-th percentile | 520678.1 |
Maximum | 1998931 |
Range | 1998930 |
Interquartile range (IQR) | 86444.75 |
Descriptive statistics
Standard deviation | 296613.89 |
---|---|
Coefficient of variation (CV) | 2.3289531 |
Kurtosis | 17.926595 |
Mean | 127359.32 |
Median Absolute Deviation (MAD) | 22665.5 |
Skewness | 4.0607666 |
Sum | 28019051 |
Variance | 8.7979799 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14070 | 1 | 0.4% |
86572 | 1 | 0.4% |
1 | 1 | 0.4% |
44161 | 1 | 0.4% |
34775 | 1 | 0.4% |
548078 | 1 | 0.4% |
13749 | 1 | 0.4% |
8544 | 1 | 0.4% |
49537 | 1 | 0.4% |
23223 | 1 | 0.4% |
Other values (210) | 210 | |
(Missing) | 16 | 6.8% |
Value | Count | Frequency (%) |
1 | 1 | |
3 | 1 | |
27 | 1 | |
123 | 1 | |
129 | 1 | |
186 | 1 | |
246 | 1 | |
316 | 1 | |
321 | 1 | |
322 | 1 |
Value | Count | Frequency (%) |
1998931 | 1 | |
1675082 | 1 | |
1609755 | 1 | |
1607107 | 1 | |
1473786 | 1 | |
1229098 | 1 | |
1148855 | 1 | |
1016564 | 1 | |
834551 | 1 | |
576776 | 1 |
하차
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 223 |
---|---|
Distinct (%) | 99.6% |
Missing | 12 |
Missing (%) | 5.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 125085.05 |
Minimum | 8 |
---|---|
Maximum | 3436201 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.2 KiB |
Quantile statistics
Minimum | 8 |
---|---|
5-th percentile | 321.9 |
Q1 | 1776 |
median | 11618 |
Q3 | 56035 |
95-th percentile | 622734.55 |
Maximum | 3436201 |
Range | 3436193 |
Interquartile range (IQR) | 54259 |
Descriptive statistics
Standard deviation | 387961.34 |
---|---|
Coefficient of variation (CV) | 3.1015804 |
Kurtosis | 33.934072 |
Mean | 125085.05 |
Median Absolute Deviation (MAD) | 11052.5 |
Skewness | 5.3592677 |
Sum | 28019051 |
Variance | 1.50514 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
464 | 2 | 0.8% |
1047 | 1 | 0.4% |
1534323 | 1 | 0.4% |
9654 | 1 | 0.4% |
49562 | 1 | 0.4% |
132203 | 1 | 0.4% |
15200 | 1 | 0.4% |
4224 | 1 | 0.4% |
18438 | 1 | 0.4% |
19671 | 1 | 0.4% |
Other values (213) | 213 | |
(Missing) | 12 | 5.1% |
Value | Count | Frequency (%) |
8 | 1 | |
11 | 1 | |
28 | 1 | |
32 | 1 | |
130 | 1 | |
157 | 1 | |
188 | 1 | |
210 | 1 | |
217 | 1 | |
243 | 1 |
Value | Count | Frequency (%) |
3436201 | 1 | |
2362352 | 1 | |
2310984 | 1 | |
1534323 | 1 | |
1484660 | 1 | |
1376504 | 1 | |
1243330 | 1 | |
1123420 | 1 | |
1114498 | 1 | |
686187 | 1 |
인키로
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 224 |
---|---|
Distinct (%) | 100.0% |
Missing | 12 |
Missing (%) | 5.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12292736 |
Minimum | 670 |
---|---|
Maximum | 4.0051176 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.2 KiB |
Quantile statistics
Minimum | 670 |
---|---|
5-th percentile | 40562 |
Q1 | 191303.25 |
median | 1043941 |
Q3 | 5123953.8 |
95-th percentile | 48054888 |
Maximum | 4.0051176 × 108 |
Range | 4.0051109 × 108 |
Interquartile range (IQR) | 4932650.5 |
Descriptive statistics
Standard deviation | 43691033 |
---|---|
Coefficient of variation (CV) | 3.5542154 |
Kurtosis | 43.037645 |
Mean | 12292736 |
Median Absolute Deviation (MAD) | 980054.5 |
Skewness | 6.137737 |
Sum | 2.7535729 × 109 |
Variance | 1.9089063 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
344526 | 1 | 0.4% |
2269072 | 1 | 0.4% |
211406103 | 1 | 0.4% |
1230520 | 1 | 0.4% |
1634821 | 1 | 0.4% |
12716595 | 1 | 0.4% |
683841 | 1 | 0.4% |
92360 | 1 | 0.4% |
1258145 | 1 | 0.4% |
1859815 | 1 | 0.4% |
Other values (214) | 214 | |
(Missing) | 12 | 5.1% |
Value | Count | Frequency (%) |
670 | 1 | |
1943 | 1 | |
2032 | 1 | |
8688 | 1 | |
10525 | 1 | |
15813 | 1 | |
19855 | 1 | |
22386 | 1 | |
25311 | 1 | |
25394 | 1 |
Value | Count | Frequency (%) |
400511758 | 1 | |
323093118 | 1 | |
211406103 | 1 | |
209344144 | 1 | |
189366065 | 1 | |
117556137 | 1 | |
109212451 | 1 | |
101519313 | 1 | |
95354220 | 1 | |
65108216 | 1 |
승차 | 하차 | 인키로 | |
---|---|---|---|
승차 | 1.000 | 0.837 | 0.756 |
하차 | 0.837 | 1.000 | 0.922 |
인키로 | 0.756 | 0.922 | 1.000 |
승차 | 하차 | 인키로 | |
---|---|---|---|
승차 | 1.000 | 0.748 | 0.741 |
하차 | 0.748 | 1.000 | 0.975 |
인키로 | 0.741 | 0.975 | 1.000 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
0 | 가평 | 2542 | <NA> | <NA> |
1 | 각계 | 405 | 8 | 15813 |
2 | 간석 | <NA> | 919 | 187922 |
3 | 강경 | 100493 | 24645 | 2637473 |
4 | 강구 | 17585 | 622 | 25311 |
5 | 강릉 | 35018 | 27578 | 2111313 |
6 | 개포 | 3 | 217 | 855332 |
7 | 건천 | 2375 | 513 | 22386 |
8 | 경산 | 576776 | 435374 | 31974705 |
9 | 경주 | 253274 | 260797 | 16523852 |
역명 | 승차 | 하차 | 인키로 | |
---|---|---|---|---|
226 | 현동 | 123 | 929 | 88540 |
227 | 호계 | 165016 | 190407 | 14257792 |
228 | 홍성 | 372565 | 57130 | 5149612 |
229 | 화명 | 48082 | 33098 | 3374444 |
230 | 화본 | 5564 | 6520 | 611561 |
231 | 화순 | 5786 | 5958 | 603650 |
232 | 황간 | 28328 | 11833 | 1267928 |
233 | 횡천 | 6053 | 713 | 62096 |
234 | 효천 | 5303 | 2309 | 267198 |
235 | 희방사 | 1212 | 130 | 8688 |