Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 1348 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 44.9 KiB |
Average record size in memory | 34.1 B |
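The average record size follows from the two figures above it: total memory divided by the number of observations. A minimal check in Python, assuming KiB means 1024 bytes and that the reported 44.9 KiB is already rounded (so agreement is only expected to one decimal place); the helper name `average_record_size` is illustrative and not part of the report:

```python
# Figures copied from the "Dataset statistics" table.
TOTAL_KIB = 44.9   # total size in memory, in KiB (rounded in the report)
N_ROWS = 1348      # number of observations


def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the average number of bytes per row, given a total in KiB."""
    return total_kib * 1024 / n_rows


avg_bytes = average_record_size(TOTAL_KIB, N_ROWS)
print(f"{avg_bytes:.1f} B")
```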
Variable types
Numeric | 2 |
---|---|
Text | 1 |
Categorical | 1 |
Dataset
Description | 노선_ID,노선_명칭,노선_유형,거리 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-21230/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-04 02:53:18.970453 |
---|---|
Analysis finished | 2024-05-04 02:53:21.903849 |
Duration | 2.93 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
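The reported duration is the difference between the two timestamps above. A short sketch using only the Python standard library; the variable names are illustrative:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2024-05-04 02:53:18.970453")
finished = datetime.fromisoformat("2024-05-04 02:53:21.903849")

# Subtracting two datetimes gives a timedelta; convert it to seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```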
노선_ID
Real number (ℝ)
Alerts: HIGH CORRELATION, UNIQUE
Distinct | 1348 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6112944 × 10⁸ |
Minimum | 1.0000002 × 10⁸ |
---|---|
Maximum | 2.4146102 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 12.0 KiB |
Quantile statistics
Minimum | 1.0000002 × 10⁸ |
---|---|
5-th percentile | 1.0010009 × 10⁸ |
Q1 | 1.0010058 × 10⁸ |
median | 1.2190001 × 10⁸ |
Q3 | 2.2725008 × 10⁸ |
95-th percentile | 2.4100588 × 10⁸ |
Maximum | 2.4146102 × 10⁸ |
Range | 1.41461 × 10⁸ |
Interquartile range (IQR) | 1.271495 × 10⁸ |
Descriptive statistics
Standard deviation | 59364451 |
---|---|
Coefficient of variation (CV) | 0.3684271 |
Kurtosis | -1.836749 |
Mean | 1.6112944 × 10⁸ |
Median Absolute Deviation (MAD) | 21799970 |
Skewness | 0.19164509 |
Sum | 2.1720248 × 10¹¹ |
Variance | 3.5241381 × 10¹⁵ |
Monotonicity | Strictly decreasing |
Common Values
Value | Count | Frequency (%) |
241461015 | 1 | 0.1% |
107900017 | 1 | 0.1% |
107900009 | 1 | 0.1% |
107900010 | 1 | 0.1% |
107900011 | 1 | 0.1% |
107900012 | 1 | 0.1% |
107900013 | 1 | 0.1% |
107900014 | 1 | 0.1% |
107900015 | 1 | 0.1% |
107900016 | 1 | 0.1% |
Other values (1338) | 1338 | 99.3% |
Minimum 10 values
Value | Count | Frequency (%) |
100000016 | 1 | 0.1% |
100000017 | 1 | 0.1% |
100000018 | 1 | 0.1% |
100000020 | 1 | 0.1% |
100100001 | 1 | 0.1% |
100100006 | 1 | 0.1% |
100100007 | 1 | 0.1% |
100100008 | 1 | 0.1% |
100100009 | 1 | 0.1% |
100100010 | 1 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
241461015 | 1 | 0.1% |
241461005 | 1 | 0.1% |
241461002 | 1 | 0.1% |
241457013 | 1 | 0.1% |
241449011 | 1 | 0.1% |
241449007 | 1 | 0.1% |
241411001 | 1 | 0.1% |
241409010 | 1 | 0.1% |
241409009 | 1 | 0.1% |
241409006 | 1 | 0.1% |
노선_명칭
Text
Distinct | 1344 |
---|---|
Distinct (%) | 99.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.7 KiB |
Value | Count | Frequency (%) |
01b | 2 | 0.1% |
01a | 2 | 0.1% |
m5443수원 | 2 | 0.1% |
8112 | 2 | 0.1% |
성북14-1 | 1 | 0.1% |
성북06 | 1 | 0.1% |
성북07 | 1 | 0.1% |
성북10-1 | 1 | 0.1% |
성북15 | 1 | 0.1% |
성북14-2 | 1 | 0.1% |
Other values (1334) | 1334 | 99.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 1024 | 15.0% |
1 | 883 | 12.9% |
2 | 423 | 6.2% |
3 | 411 | 6.0% |
5 | 348 | 5.1% |
6 | 346 | 5.1% |
7 | 325 | 4.8% |
4 | 265 | 3.9% |
8 | 179 | 2.6% |
양 | 172 | 2.5% |
Other values (110) | 2461 | 36.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 4357 | 63.7% |
Other Letter | 2065 | 30.2% |
Uppercase Letter | 207 | 3.0% |
Dash Punctuation | 120 | 1.8% |
Open Punctuation | 44 | 0.6% |
Close Punctuation | 44 | 0.6% |
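The category labels in this table correspond to Unicode general categories (for example `Nd` for Decimal Number and `Lo` for Other Letter, which covers Hangul syllables). The sketch below shows one way such per-category counts could be derived with Python's standard `unicodedata` module. It is an illustration, not necessarily how ydata-profiling computes them; the helper `category_counts` and the three sample names (taken from the sample rows of this report) are assumptions for the example.

```python
import unicodedata
from collections import Counter

# Map Unicode general-category codes to the labels used in the report.
CATEGORY_LABELS = {
    "Nd": "Decimal Number",
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(names):
    """Count characters per Unicode general category across all names."""
    counts = Counter()
    for name in names:
        for ch in name:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


# Illustrative subset of route names from the sample rows.
sample_names = ["양주15-1(구파발)", "TOUR12", "N876"]
sample_counts = category_counts(sample_names)
for label, count in sample_counts.most_common():
    print(label, count)
```

Run over the full 노선_명칭 column, the same loop would be expected to produce totals comparable to the table above, provided the names are not normalized beforehand.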
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
양 | 172 | 8.3% |
주 | 171 | 8.3% |
남 | 125 | 6.1% |
성 | 107 | 5.2% |
광 | 91 | 4.4% |
천 | 87 | 4.2% |
포 | 82 | 4.0% |
고 | 79 | 3.8% |
인 | 60 | 2.9% |
서 | 52 | 2.5% |
Other values (85) | 1039 | 50.3% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 46 | 22.2% |
N | 32 | 15.5% |
G | 29 | 14.0% |
A | 26 | 12.6% |
P | 26 | 12.6% |
B | 21 | 10.1% |
O | 6 | 2.9% |
R | 6 | 2.9% |
T | 6 | 2.9% |
U | 6 | 2.9% |
Other values (2) | 3 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 1024 | 23.5% |
1 | 883 | 20.3% |
2 | 423 | 9.7% |
3 | 411 | 9.4% |
5 | 348 | 8.0% |
6 | 346 | 7.9% |
7 | 325 | 7.5% |
4 | 265 | 6.1% |
8 | 179 | 4.1% |
9 | 153 | 3.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 120 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 44 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 44 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4565 | 66.8% |
Hangul | 2065 | 30.2% |
Latin | 207 | 3.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
양 | 172 | 8.3% |
주 | 171 | 8.3% |
남 | 125 | 6.1% |
성 | 107 | 5.2% |
광 | 91 | 4.4% |
천 | 87 | 4.2% |
포 | 82 | 4.0% |
고 | 79 | 3.8% |
인 | 60 | 2.9% |
서 | 52 | 2.5% |
Other values (85) | 1039 | 50.3% |
Common
Value | Count | Frequency (%) |
0 | 1024 | 22.4% |
1 | 883 | 19.3% |
2 | 423 | 9.3% |
3 | 411 | 9.0% |
5 | 348 | 7.6% |
6 | 346 | 7.6% |
7 | 325 | 7.1% |
4 | 265 | 5.8% |
8 | 179 | 3.9% |
9 | 153 | 3.4% |
Other values (3) | 208 | 4.6% |
Latin
Value | Count | Frequency (%) |
M | 46 | 22.2% |
N | 32 | 15.5% |
G | 29 | 14.0% |
A | 26 | 12.6% |
P | 26 | 12.6% |
B | 21 | 10.1% |
O | 6 | 2.9% |
R | 6 | 2.9% |
T | 6 | 2.9% |
U | 6 | 2.9% |
Other values (2) | 3 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4772 | 69.8% |
Hangul | 2065 | 30.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 1024 | 21.5% |
1 | 883 | 18.5% |
2 | 423 | 8.9% |
3 | 411 | 8.6% |
5 | 348 | 7.3% |
6 | 346 | 7.3% |
7 | 325 | 6.8% |
4 | 265 | 5.6% |
8 | 179 | 3.8% |
9 | 153 | 3.2% |
Other values (15) | 415 | 8.7% |
Hangul
Value | Count | Frequency (%) |
양 | 172 | 8.3% |
주 | 171 | 8.3% |
남 | 125 | 6.1% |
성 | 107 | 5.2% |
광 | 91 | 4.4% |
천 | 87 | 4.2% |
포 | 82 | 4.0% |
고 | 79 | 3.8% |
인 | 60 | 2.9% |
서 | 52 | 2.5% |
Other values (85) | 1039 | 50.3% |
노선_유형
Categorical
Alerts: HIGH CORRELATION
Distinct | 10 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.7 KiB |
경기 | 605 |
---|---|
마을 | 254 |
지선 | 241 |
간선 | 151 |
공항 | 38 |
Other values (5) | 59 |
Length
Max length | 6 |
---|---|
Median length | 2 |
Mean length | 2.0178042 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 경기 |
---|---|
2nd row | 경기 |
3rd row | 경기 |
4th row | 경기 |
5th row | 경기 |
Common Values
Value | Count | Frequency (%) |
경기 | 605 | 44.9% |
마을 | 254 | 18.8% |
지선 | 241 | 17.9% |
간선 | 151 | 11.2% |
공항 | 38 | 2.8% |
인천 | 30 | 2.2% |
광역 | 11 | 0.8% |
광역(서울) | 6 | 0.4% |
순환 | 6 | 0.4% |
관광 | 6 | 0.4% |
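Each frequency in the table above is the category count divided by the total number of observations (1348). A minimal sketch that recomputes the percentages from the listed counts; the dictionary literal and the helper `frequency_table` are illustrative, and rounding to one decimal place is assumed to match the report's display:

```python
# Counts copied from the "Common Values" table for 노선_유형.
type_counts = {
    "경기": 605,
    "마을": 254,
    "지선": 241,
    "간선": 151,
    "공항": 38,
    "인천": 30,
    "광역": 11,
    "광역(서울)": 6,
    "순환": 6,
    "관광": 6,
}


def frequency_table(counts):
    """Return {value: percentage of total}, rounded to one decimal place."""
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


type_total = sum(type_counts.values())
type_freq = frequency_table(type_counts)
for value, pct in type_freq.items():
    print(f"{value} | {type_counts[value]} | {pct}% |")
```

The ten counts sum to 1348, which matches the number of observations and is consistent with the column having no missing values.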
거리
Real number (ℝ)
Alerts: HIGH CORRELATION, ZEROS
Distinct | 456 |
---|---|
Distinct (%) | 33.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.522661 |
Minimum | 0 |
---|---|
Maximum | 220 |
Zeros | 635 |
Zeros (%) | 47.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 12.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 4.15 |
Q3 | 25.8 |
95-th percentile | 63.68 |
Maximum | 220 |
Range | 220 |
Interquartile range (IQR) | 25.8 |
Descriptive statistics
Standard deviation | 30.912761 |
---|---|
Coefficient of variation (CV) | 1.764159 |
Kurtosis | 12.775783 |
Mean | 17.522661 |
Median Absolute Deviation (MAD) | 4.15 |
Skewness | 3.180299 |
Sum | 23620.547 |
Variance | 955.59877 |
Monotonicity | Not monotonic |
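Several of the figures reported for 거리 are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, the sum is the mean times the number of observations, and the zero percentage is the zero count over the number of observations. The sketch below checks these against the reported values. The inputs are rounded figures copied from the report, so only approximate agreement is expected; the variable names are illustrative.

```python
# Figures copied from the 거리 statistics tables (already rounded).
n_obs = 1348
mean = 17.522661
std_dev = 30.912761
n_zeros = 635

cv = std_dev / mean              # coefficient of variation
variance = std_dev ** 2          # variance is the squared standard deviation
total = mean * n_obs             # sum of all values
zeros_pct = 100 * n_zeros / n_obs

print(f"CV        {cv:.6f}")
print(f"Variance  {variance:.4f}")
print(f"Sum       {total:.3f}")
print(f"Zeros     {zeros_pct:.1f}%")
```

With nearly half of the rows at zero, the mean (about 17.5) sits well above the median (4.15), which fits the strong positive skewness reported above.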
Value | Count | Frequency (%) |
0.0 | 635 | 47.1% |
7.0 | 11 | 0.8% |
13.0 | 10 | 0.7% |
12.0 | 8 | 0.6% |
5.5 | 6 | 0.4% |
39.0 | 6 | 0.4% |
7.8 | 5 | 0.4% |
5.9 | 5 | 0.4% |
7.2 | 5 | 0.4% |
20.0 | 5 | 0.4% |
Other values (446) | 652 | 48.4% |
Value | Count | Frequency (%) |
0.0 | 635 | 47.1% |
1.2 | 1 | 0.1% |
1.6 | 1 | 0.1% |
1.8 | 1 | 0.1% |
2.0 | 1 | 0.1% |
2.1 | 2 | 0.1% |
2.6 | 4 | 0.3% |
2.8 | 2 | 0.1% |
2.9 | 2 | 0.1% |
3.0 | 2 | 0.1% |
Value | Count | Frequency (%) |
220.0 | 1 | 0.1% |
207.0 | 1 | 0.1% |
204.4 | 1 | 0.1% |
196.0 | 1 | 0.1% |
193.0 | 1 | 0.1% |
192.0 | 1 | 0.1% |
190.0 | 1 | 0.1% |
188.0 | 1 | 0.1% |
187.6 | 1 | 0.1% |
184.0 | 2 | 0.1% |
Variable | 노선_ID | 노선_유형 | 거리 |
---|---|---|---|
노선_ID | 1.000 | 0.849 | 0.497 |
노선_유형 | 0.849 | 1.000 | 0.877 |
거리 | 0.497 | 0.877 | 1.000 |
Variable | 노선_ID | 거리 | 노선_유형 |
---|---|---|---|
노선_ID | 1.000 | -0.861 | 0.662 |
거리 | -0.861 | 1.000 | 0.457 |
노선_유형 | 0.662 | 0.457 | 1.000 |
Index | 노선_ID | 노선_명칭 | 노선_유형 | 거리 |
---|---|---|---|---|
0 | 241461015 | 김포16A | 경기 | 0.0 |
1 | 241461005 | 김포16-1 | 경기 | 0.0 |
2 | 241461002 | 김포16 | 경기 | 0.0 |
3 | 241457013 | 양주15-1(구파발) | 경기 | 0.0 |
4 | 241449011 | 양주15-1구파발 | 경기 | 0.0 |
5 | 241449007 | 양주15-1막차 | 경기 | 0.0 |
6 | 241411001 | 하남01 | 경기 | 0.0 |
7 | 241409010 | 하남감북-01 | 경기 | 0.0 |
8 | 241409009 | 하남위례-01 | 경기 | 0.0 |
9 | 241409006 | 하남08 | 경기 | 0.0 |
Index | 노선_ID | 노선_명칭 | 노선_유형 | 거리 |
---|---|---|---|---|
1338 | 100100010 | 105 | 간선 | 38.77 |
1339 | 100100009 | 104 | 간선 | 30.5 |
1340 | 100100008 | 103 | 간선 | 30.42 |
1341 | 100100007 | 102 | 간선 | 30.2 |
1342 | 100100006 | 101 | 간선 | 37.81 |
1343 | 100100001 | 01A | 순환 | 16.0 |
1344 | 100000020 | 청와대A01(자율주행) | 순환 | 2.6 |
1345 | 100000018 | TOUR12 | 관광 | 29.5 |
1346 | 100000017 | TOUR11 | 관광 | 25.0 |
1347 | 100000016 | N876 | 간선 | 36.0 |