Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 286 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 14.7 KiB |
Average record size in memory | 52.5 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 4 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 지하철 : 서울시버스정류장 : 서울시(스마트카드사) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=15 |
지하철역코드(SUB_STA_SN) has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 14:53:05.298003 |
---|---|
Analysis finished | 2023-12-10 14:53:07.904035 |
Duration | 2.61 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구명(GU_NM)
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
강남구 | |
---|---|
송파구 | 19 |
영등포구 | 18 |
중구 | 18 |
마포구 | 15 |
Other values (20) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0734266 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 성북구 |
---|---|
2nd row | 영등포구 |
3rd row | 성북구 |
4th row | 종로구 |
5th row | 종로구 |
Common Values
Value | Count | Frequency (%) |
강남구 | 23 | 8.0% |
송파구 | 19 | 6.6% |
영등포구 | 18 | 6.3% |
중구 | 18 | 6.3% |
마포구 | 15 | 5.2% |
서초구 | 14 | 4.9% |
용산구 | 14 | 4.9% |
성동구 | 14 | 4.9% |
노원구 | 12 | 4.2% |
동작구 | 12 | 4.2% |
Other values (15) | 127 |
Length
Value | Count | Frequency (%) |
강남구 | 23 | 8.0% |
송파구 | 19 | 6.6% |
영등포구 | 18 | 6.3% |
중구 | 18 | 6.3% |
마포구 | 15 | 5.2% |
서초구 | 14 | 4.9% |
용산구 | 14 | 4.9% |
성동구 | 14 | 4.9% |
노원구 | 12 | 4.2% |
동작구 | 12 | 4.2% |
Other values (15) | 127 |
구코드(GU_CD)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11431.084 |
Minimum | 11110 |
---|---|
Maximum | 11740 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11140 |
Q1 | 11230 |
median | 11440 |
Q3 | 11620 |
95-th percentile | 11710 |
Maximum | 11740 |
Range | 630 |
Interquartile range (IQR) | 390 |
Descriptive statistics
Standard deviation | 200.28063 |
---|---|
Coefficient of variation (CV) | 0.017520703 |
Kurtosis | -1.3627262 |
Mean | 11431.084 |
Median Absolute Deviation (MAD) | 180 |
Skewness | -0.056964358 |
Sum | 3269290 |
Variance | 40112.33 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11680 | 23 | 8.0% |
11710 | 19 | 6.6% |
11560 | 18 | 6.3% |
11140 | 18 | 6.3% |
11440 | 15 | 5.2% |
11170 | 14 | 4.9% |
11650 | 14 | 4.9% |
11200 | 14 | 4.9% |
11350 | 12 | 4.2% |
11380 | 12 | 4.2% |
Other values (15) | 127 |
Value | Count | Frequency (%) |
11110 | 11 | |
11140 | 18 | |
11170 | 14 | |
11200 | 14 | |
11215 | 8 | |
11230 | 11 | |
11260 | 8 | |
11290 | 10 | |
11305 | 3 | 1.0% |
11320 | 6 | 2.1% |
Value | Count | Frequency (%) |
11740 | 10 | |
11710 | 19 | |
11680 | 23 | |
11650 | 14 | |
11620 | 7 | 2.4% |
11590 | 12 | |
11560 | 18 | |
11545 | 3 | 1.0% |
11530 | 11 | |
11500 | 9 | 3.1% |
지하철역코드(SUB_STA_SN)
Real number (ℝ)
UNIQUE
 
Distinct | 286 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 183.08042 |
Minimum | 40 |
---|---|
Maximum | 326 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 40 |
---|---|
5-th percentile | 54.25 |
Q1 | 111.25 |
median | 183.5 |
Q3 | 254.75 |
95-th percentile | 311.75 |
Maximum | 326 |
Range | 286 |
Interquartile range (IQR) | 143.5 |
Descriptive statistics
Standard deviation | 83.128249 |
---|---|
Coefficient of variation (CV) | 0.45405319 |
Kurtosis | -1.2054139 |
Mean | 183.08042 |
Median Absolute Deviation (MAD) | 72 |
Skewness | -0.0028478455 |
Sum | 52361 |
Variance | 6910.3058 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
64 | 1 | 0.3% |
256 | 1 | 0.3% |
158 | 1 | 0.3% |
87 | 1 | 0.3% |
219 | 1 | 0.3% |
128 | 1 | 0.3% |
62 | 1 | 0.3% |
258 | 1 | 0.3% |
159 | 1 | 0.3% |
55 | 1 | 0.3% |
Other values (276) | 276 |
Value | Count | Frequency (%) |
40 | 1 | |
41 | 1 | |
42 | 1 | |
43 | 1 | |
44 | 1 | |
45 | 1 | |
46 | 1 | |
47 | 1 | |
48 | 1 | |
49 | 1 |
Value | Count | Frequency (%) |
326 | 1 | |
325 | 1 | |
324 | 1 | |
323 | 1 | |
322 | 1 | |
321 | 1 | |
320 | 1 | |
319 | 1 | |
318 | 1 | |
317 | 1 |
Distinct | 261 |
---|---|
Distinct (%) | 91.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
신촌역 | 3 | 1.0% |
동대문운동장역 | 3 | 1.0% |
양재역 | 2 | 0.7% |
사당역 | 2 | 0.7% |
버티고개역 | 2 | 0.7% |
독립문역 | 2 | 0.7% |
남태령역 | 2 | 0.7% |
시청역 | 2 | 0.7% |
이대역 | 2 | 0.7% |
대방역 | 2 | 0.7% |
Other values (252) | 265 |
Most occurring characters
Value | Count | Frequency (%) |
역 | 287 | |
대 | 35 | 3.1% |
구 | 29 | 2.5% |
신 | 28 | 2.4% |
동 | 26 | 2.3% |
산 | 18 | 1.6% |
지 | 15 | 1.3% |
호 | 14 | 1.2% |
청 | 13 | 1.1% |
선 | 13 | 1.1% |
Other values (205) | 667 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1105 | |
Decimal Number | 15 | 1.3% |
Open Punctuation | 12 | 1.0% |
Close Punctuation | 12 | 1.0% |
Space Separator | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
역 | 287 | |
대 | 35 | 3.2% |
구 | 29 | 2.6% |
신 | 28 | 2.5% |
동 | 26 | 2.4% |
산 | 18 | 1.6% |
지 | 15 | 1.4% |
호 | 14 | 1.3% |
청 | 13 | 1.2% |
선 | 13 | 1.2% |
Other values (197) | 627 |
Decimal Number
Value | Count | Frequency (%) |
7 | 6 | |
3 | 3 | |
5 | 2 | 13.3% |
4 | 2 | 13.3% |
2 | 2 | 13.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 12 |
Close Punctuation
Value | Count | Frequency (%) |
) | 12 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1105 | |
Common | 40 | 3.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
역 | 287 | |
대 | 35 | 3.2% |
구 | 29 | 2.6% |
신 | 28 | 2.5% |
동 | 26 | 2.4% |
산 | 18 | 1.6% |
지 | 15 | 1.4% |
호 | 14 | 1.3% |
청 | 13 | 1.2% |
선 | 13 | 1.2% |
Other values (197) | 627 |
Common
Value | Count | Frequency (%) |
( | 12 | |
) | 12 | |
7 | 6 | |
3 | 3 | 7.5% |
5 | 2 | 5.0% |
4 | 2 | 5.0% |
2 | 2 | 5.0% |
1 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1105 | |
ASCII | 40 | 3.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
역 | 287 | |
대 | 35 | 3.2% |
구 | 29 | 2.6% |
신 | 28 | 2.5% |
동 | 26 | 2.4% |
산 | 18 | 1.6% |
지 | 15 | 1.4% |
호 | 14 | 1.3% |
청 | 13 | 1.2% |
선 | 13 | 1.2% |
Other values (197) | 627 |
ASCII
Value | Count | Frequency (%) |
( | 12 | |
) | 12 | |
7 | 6 | |
3 | 3 | 7.5% |
5 | 2 | 5.0% |
4 | 2 | 5.0% |
2 | 2 | 5.0% |
1 | 2.5% |
X좌표(POINT_X)
Real number (ℝ)
Distinct | 273 |
---|---|
Distinct (%) | 95.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 199670.74 |
Minimum | 182443 |
---|---|
Maximum | 214745 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 182443 |
---|---|
5-th percentile | 187672.75 |
Q1 | 194044.75 |
median | 200509 |
Q3 | 204939.75 |
95-th percentile | 211184.25 |
Maximum | 214745 |
Range | 32302 |
Interquartile range (IQR) | 10895 |
Descriptive statistics
Standard deviation | 7103.518 |
---|---|
Coefficient of variation (CV) | 0.035576159 |
Kurtosis | -0.62728954 |
Mean | 199670.74 |
Median Absolute Deviation (MAD) | 5302 |
Skewness | -0.19735579 |
Sum | 57105832 |
Variance | 50459968 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
198356 | 3 | 1.0% |
198423 | 2 | 0.7% |
196270 | 2 | 0.7% |
196116 | 2 | 0.7% |
195207 | 2 | 0.7% |
194358 | 2 | 0.7% |
190700 | 2 | 0.7% |
191242 | 2 | 0.7% |
210933 | 2 | 0.7% |
198841 | 2 | 0.7% |
Other values (263) | 265 |
Value | Count | Frequency (%) |
182443 | 1 | |
182905 | 1 | |
183450 | 1 | |
183488 | 1 | |
184273 | 1 | |
184483 | 1 | |
185542 | 1 | |
185613 | 1 | |
185638 | 1 | |
185906 | 1 |
Value | Count | Frequency (%) |
214745 | 1 | |
213678 | 1 | |
213526 | 1 | |
212731 | 1 | |
212613 | 1 | |
212611 | 1 | |
212382 | 1 | |
212036 | 1 | |
211955 | 1 | |
211729 | 1 |
Y좌표(POINT_Y)
Real number (ℝ)
Distinct | 274 |
---|---|
Distinct (%) | 95.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 449635.68 |
Minimum | 439550 |
---|---|
Maximum | 465554 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 439550 |
---|---|
5-th percentile | 442606.5 |
Q1 | 445464.25 |
median | 449761.5 |
Q3 | 452541.5 |
95-th percentile | 459570 |
Maximum | 465554 |
Range | 26004 |
Interquartile range (IQR) | 7077.25 |
Descriptive statistics
Standard deviation | 5213.7306 |
---|---|
Coefficient of variation (CV) | 0.011595456 |
Kurtosis | 0.022565237 |
Mean | 449635.68 |
Median Absolute Deviation (MAD) | 3613 |
Skewness | 0.55512736 |
Sum | 1.285958 × 108 |
Variance | 27182987 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
441875 | 3 | 1.0% |
451859 | 2 | 0.7% |
452972 | 2 | 0.7% |
450665 | 2 | 0.7% |
448746 | 2 | 0.7% |
450810 | 2 | 0.7% |
440897 | 2 | 0.7% |
443123 | 2 | 0.7% |
452760 | 2 | 0.7% |
450871 | 2 | 0.7% |
Other values (264) | 265 |
Value | Count | Frequency (%) |
439550 | 1 | 0.3% |
440761 | 1 | 0.3% |
440897 | 2 | |
441271 | 1 | 0.3% |
441875 | 3 | |
441899 | 1 | 0.3% |
441956 | 1 | 0.3% |
442044 | 1 | 0.3% |
442389 | 1 | 0.3% |
442410 | 1 | 0.3% |
Value | Count | Frequency (%) |
465554 | 1 | |
464409 | 1 | |
464220 | 1 | |
463433 | 1 | |
463083 | 1 | |
462826 | 1 | |
462370 | 1 | |
461734 | 1 | |
461487 | 1 | |
460974 | 1 |
구명(GU_NM) | 구코드(GU_CD) | 지하철역코드(SUB_STA_SN) | X좌표(POINT_X) | Y좌표(POINT_Y) | |
---|---|---|---|---|---|
구명(GU_NM) | 1.000 | 0.223 | 0.000 | 0.144 | 0.000 |
구코드(GU_CD) | 0.223 | 1.000 | 0.000 | 0.000 | 0.049 |
지하철역코드(SUB_STA_SN) | 0.000 | 0.000 | 1.000 | 0.089 | 0.000 |
X좌표(POINT_X) | 0.144 | 0.000 | 0.089 | 1.000 | 0.000 |
Y좌표(POINT_Y) | 0.000 | 0.049 | 0.000 | 0.000 | 1.000 |
구코드(GU_CD) | 지하철역코드(SUB_STA_SN) | X좌표(POINT_X) | Y좌표(POINT_Y) | 구명(GU_NM) | |
---|---|---|---|---|---|
구코드(GU_CD) | 1.000 | -0.049 | -0.026 | -0.011 | 0.093 |
지하철역코드(SUB_STA_SN) | -0.049 | 1.000 | 0.015 | 0.058 | 0.000 |
X좌표(POINT_X) | -0.026 | 0.015 | 1.000 | 0.014 | 0.035 |
Y좌표(POINT_Y) | -0.011 | 0.058 | 0.014 | 1.000 | 0.000 |
구명(GU_NM) | 0.093 | 0.000 | 0.035 | 0.000 | 1.000 |
구명(GU_NM) | 구코드(GU_CD) | 지하철역코드(SUB_STA_SN) | 지하철역명(KOR_SUB_NM) | X좌표(POINT_X) | Y좌표(POINT_Y) | |
---|---|---|---|---|---|---|
0 | 성북구 | 11440 | 64 | 사당역 | 198423 | 443117 |
1 | 영등포구 | 11200 | 214 | 성신여대입구역 | 196835 | 444084 |
2 | 성북구 | 11110 | 63 | 학동역 | 205670 | 453021 |
3 | 종로구 | 11530 | 47 | 중곡역 | 195074 | 448023 |
4 | 종로구 | 11590 | 192 | 을지로3가역 | 205575 | 441899 |
5 | 강남구 | 11440 | 233 | 양천구청역 | 202563 | 459649 |
6 | 은평구 | 11170 | 73 | 강변역 | 203357 | 450491 |
7 | 서초구 | 11620 | 154 | 이태원역 | 201102 | 443850 |
8 | 용산구 | 11380 | 150 | 종각역 | 197029 | 452157 |
9 | 성동구 | 11290 | 218 | 수락산역 | 201632 | 451222 |
구명(GU_NM) | 구코드(GU_CD) | 지하철역코드(SUB_STA_SN) | 지하철역명(KOR_SUB_NM) | X좌표(POINT_X) | Y좌표(POINT_Y) | |
---|---|---|---|---|---|---|
276 | 강남구 | 11650 | 285 | 신설동역 | 204537 | 443098 |
277 | 양천구 | 11500 | 200 | 당고개역 | 202798 | 452972 |
278 | 양천구 | 11380 | 49 | 상월곡역 | 203851 | 455885 |
279 | 중구 | 11560 | 157 | 독바위역 | 191598 | 455546 |
280 | 영등포구 | 11260 | 48 | 교대역 | 188942 | 450639 |
281 | 강남구 | 11470 | 284 | 무악재역 | 200159 | 460550 |
282 | 서초구 | 11710 | 134 | 아현역 | 204096 | 450665 |
283 | 광진구 | 11740 | 141 | 신림역 | 200525 | 454628 |
284 | 중구 | 11200 | 223 | 신용산역 | 207738 | 459652 |
285 | 노원구 | 11110 | 272 | 삼각지역 | 205556 | 451880 |