Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 646 |
Missing cells | 585 |
Missing cells (%) | 18.1% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.2% |
Total size in memory | 27.3 KiB |
Average record size in memory | 43.2 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | 부산교통공사_승강기연도별설치현황_20211231 |
---|---|
Author | 부산교통공사 |
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15052663 |
Dataset has 1 (0.2%) duplicate rows | Duplicates |
설치년도 is highly overall correlated with 교체주기(개량년도) and 1 other fields | High correlation |
교체주기(개량년도) is highly overall correlated with 설치년도 and 1 other fields | High correlation |
호선 is highly overall correlated with 설치년도 and 1 other fields | High correlation |
교체주기(개량년도) has 585 (90.6%) missing values | Missing |
Reproduction
Analysis started | 2023-12-10 16:07:21.135968 |
---|---|
Analysis finished | 2023-12-10 16:07:21.832486 |
Duration | 0.7 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
호선
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.2 KiB |
2 | |
---|---|
3 | |
1 | |
4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 202 | |
3 | 174 | |
1 | 139 | |
4 | 131 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 202 | |
3 | 174 | |
1 | 139 | |
4 | 131 |
역명
Text
Distinct | 77 |
---|---|
Distinct (%) | 11.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.2 KiB |
Value | Count | Frequency (%) |
동래 | 26 | 4.0% |
다대포항 | 21 | 3.3% |
만덕 | 20 | 3.1% |
연산동 | 20 | 3.1% |
배산 | 16 | 2.5% |
서면 | 16 | 2.5% |
수안 | 16 | 2.5% |
장림 | 14 | 2.2% |
센텀시티 | 14 | 2.2% |
낫개 | 13 | 2.0% |
Other values (67) | 470 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 120 | 6.8% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.2% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.4% |
부 | 37 | 2.1% |
구 | 37 | 2.1% |
Other values (99) | 1103 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1755 | |
Close Punctuation | 4 | 0.2% |
Open Punctuation | 4 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 120 | 6.8% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.2% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
부 | 37 | 2.1% |
구 | 37 | 2.1% |
Other values (97) | 1095 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1755 | |
Common | 8 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 120 | 6.8% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.2% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
부 | 37 | 2.1% |
구 | 37 | 2.1% |
Other values (97) | 1095 |
Common
Value | Count | Frequency (%) |
) | 4 | |
( | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1755 | |
ASCII | 8 | 0.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 120 | 6.8% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.2% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
부 | 37 | 2.1% |
구 | 37 | 2.1% |
Other values (97) | 1095 |
ASCII
Value | Count | Frequency (%) |
) | 4 | |
( | 4 |
호기
Categorical
Distinct | 27 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.2 KiB |
1 | |
---|---|
2 | |
3 | |
4 | |
5 | |
Other values (22) |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.1919505 |
Min length | 1 |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 1.1% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 3 |
4th row | 4 |
5th row | 5 |
Common Values
Value | Count | Frequency (%) |
1 | 79 | |
2 | 77 | |
3 | 69 | |
4 | 68 | |
5 | 59 | |
6 | 57 | |
7 | 49 | |
8 | 43 | |
9 | 27 | 4.2% |
10 | 25 | 3.9% |
Other values (17) | 93 |
Length
Value | Count | Frequency (%) |
1 | 79 | |
2 | 77 | |
3 | 69 | |
4 | 68 | |
5 | 59 | |
6 | 57 | |
7 | 49 | |
8 | 43 | |
9 | 27 | 4.2% |
10 | 25 | 3.9% |
Other values (17) | 93 |
설치년도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2006.469 |
Minimum | 1985 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.8 KiB |
Quantile statistics
Minimum | 1985 |
---|---|
5-th percentile | 1989 |
Q1 | 2002 |
median | 2005 |
Q3 | 2011 |
95-th percentile | 2017 |
Maximum | 2021 |
Range | 36 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 7.3536239 |
---|---|
Coefficient of variation (CV) | 0.0036649576 |
Kurtosis | 0.77774392 |
Mean | 2006.469 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.66807875 |
Sum | 1296179 |
Variance | 54.075784 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2005 | 167 | |
2011 | 135 | |
2017 | 88 | |
2001 | 64 | 9.9% |
1998 | 52 | 8.0% |
2007 | 34 | 5.3% |
2008 | 18 | 2.8% |
1985 | 13 | 2.0% |
2002 | 12 | 1.9% |
2006 | 9 | 1.4% |
Other values (12) | 54 | 8.4% |
Value | Count | Frequency (%) |
1985 | 13 | 2.0% |
1987 | 6 | 0.9% |
1988 | 8 | 1.2% |
1989 | 7 | 1.1% |
1994 | 4 | 0.6% |
1998 | 52 | 8.0% |
2001 | 64 | 9.9% |
2002 | 12 | 1.9% |
2004 | 1 | 0.2% |
2005 | 167 |
Value | Count | Frequency (%) |
2021 | 4 | 0.6% |
2018 | 4 | 0.6% |
2017 | 88 | |
2016 | 5 | 0.8% |
2015 | 1 | 0.2% |
2014 | 4 | 0.6% |
2012 | 4 | 0.6% |
2011 | 135 | |
2009 | 6 | 0.9% |
2008 | 18 | 2.8% |
교체주기(개량년도)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 29.5% |
Missing | 585 |
Missing (%) | 90.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2009.8033 |
Minimum | 1998 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.8 KiB |
Quantile statistics
Minimum | 1998 |
---|---|
5-th percentile | 2004 |
Q1 | 2005 |
median | 2009 |
Q3 | 2016 |
95-th percentile | 2018 |
Maximum | 2021 |
Range | 23 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 5.8046524 |
---|---|
Coefficient of variation (CV) | 0.0028881694 |
Kurtosis | -1.3602124 |
Mean | 2009.8033 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 0.20295419 |
Sum | 122598 |
Variance | 33.693989 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2004 | 12 | 1.9% |
2005 | 10 | 1.5% |
2016 | 9 | 1.4% |
2006 | 4 | 0.6% |
2011 | 4 | 0.6% |
2015 | 4 | 0.6% |
2017 | 3 | 0.5% |
2003 | 2 | 0.3% |
2018 | 2 | 0.3% |
2013 | 2 | 0.3% |
Other values (8) | 9 | 1.4% |
(Missing) | 585 |
Value | Count | Frequency (%) |
1998 | 1 | 0.2% |
2003 | 2 | 0.3% |
2004 | 12 | |
2005 | 10 | |
2006 | 4 | 0.6% |
2008 | 1 | 0.2% |
2009 | 1 | 0.2% |
2010 | 1 | 0.2% |
2011 | 4 | 0.6% |
2012 | 2 | 0.3% |
Value | Count | Frequency (%) |
2021 | 1 | 0.2% |
2020 | 1 | 0.2% |
2019 | 1 | 0.2% |
2018 | 2 | 0.3% |
2017 | 3 | 0.5% |
2016 | 9 | |
2015 | 4 | |
2013 | 2 | 0.3% |
2012 | 2 | 0.3% |
2011 | 4 |
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | |
---|---|---|---|---|---|
호선 | 1.000 | 0.998 | 0.115 | 0.987 | 0.999 |
역명 | 0.998 | 1.000 | 0.000 | 0.998 | 0.928 |
호기 | 0.115 | 0.000 | 1.000 | 0.000 | 0.000 |
설치년도 | 0.987 | 0.998 | 0.000 | 1.000 | 0.722 |
교체주기(개량년도) | 0.999 | 0.928 | 0.000 | 0.722 | 1.000 |
호기 | 호선 | |
---|---|---|
호기 | 1.000 | 0.059 |
호선 | 0.059 | 1.000 |
설치년도 | 교체주기(개량년도) | 호선 | 호기 | |
---|---|---|---|---|
설치년도 | 1.000 | 0.660 | 0.941 | 0.000 |
교체주기(개량년도) | 0.660 | 1.000 | 0.912 | 0.000 |
호선 | 0.941 | 0.912 | 1.000 | 0.059 |
호기 | 0.000 | 0.000 | 0.059 | 1.000 |
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | |
---|---|---|---|---|---|
0 | 1 | 서면 | 1 | 1985 | 2004 |
1 | 1 | 서면 | 2 | 1985 | 2004 |
2 | 1 | 서면 | 3 | 1985 | 2004 |
3 | 1 | 서면 | 4 | 1985 | 2004 |
4 | 1 | 서면 | 5 | 1985 | 2004 |
5 | 1 | 서면 | 6 | 1985 | 2004 |
6 | 1 | 서면 | 7 | 1985 | 2004 |
7 | 1 | 서면 | 8 | 1985 | 2004 |
8 | 1 | 서면 | 9 | 1985 | 2004 |
9 | 1 | 서면 | 10 | 1985 | 2004 |
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | |
---|---|---|---|---|---|
636 | 4 | 금사 | 6 | 2011 | <NA> |
637 | 4 | 금사 | 7 | 2011 | <NA> |
638 | 4 | 금사 | 8 | 2011 | <NA> |
639 | 4 | 금사 | 9 | 2011 | <NA> |
640 | 4 | 고촌 | 1 | 2011 | <NA> |
641 | 4 | 고촌 | 2 | 2011 | <NA> |
642 | 4 | 고촌 | 3 | 2011 | <NA> |
643 | 4 | 고촌 | 4 | 2011 | <NA> |
644 | 4 | 고촌 | 5 | 2011 | <NA> |
645 | 4 | 고촌 | 6 | 2011 | <NA> |
Most frequently occurring
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | # duplicates | |
---|---|---|---|---|---|---|
0 | 1 | 남포 | 7 | 2017 | <NA> | 2 |