Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 640 |
Missing cells | 582 |
Missing cells (%) | 18.2% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.2% |
Total size in memory | 27.0 KiB |
Average record size in memory | 43.2 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | 부산교통공사_승강기연도별설치현황_20200527 |
---|---|
Author | 부산교통공사 |
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15052663 |
Dataset has 1 (0.2%) duplicate rows | Duplicates |
설치년도 is highly overall correlated with 교체주기(개량년도) and 1 other fields | High correlation |
교체주기(개량년도) is highly overall correlated with 설치년도 and 1 other fields | High correlation |
호선 is highly overall correlated with 설치년도 and 1 other fields | High correlation |
교체주기(개량년도) has 582 (90.9%) missing values | Missing |
Reproduction
Analysis started | 2023-12-10 16:07:29.922775 |
---|---|
Analysis finished | 2023-12-10 16:07:31.114993 |
Duration | 1.19 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
호선
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
2 | |
---|---|
3 | |
1 | |
4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 198 | |
3 | 174 | |
1 | 137 | |
4 | 131 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 198 | |
3 | 174 | |
1 | 137 | |
4 | 131 |
역명
Text
Distinct | 76 |
---|---|
Distinct (%) | 11.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
Value | Count | Frequency (%) |
동래 | 26 | 4.0% |
다대포항 | 21 | 3.2% |
연산동 | 20 | 3.1% |
만덕 | 20 | 3.1% |
배산 | 16 | 2.5% |
서면 | 16 | 2.5% |
수안 | 16 | 2.5% |
장림 | 14 | 2.2% |
센텀시티 | 14 | 2.2% |
낫개 | 13 | 2.0% |
Other values (67) | 475 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 118 | 6.7% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.2% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.4% |
구 | 37 | 2.1% |
부 | 35 | 2.0% |
Other values (99) | 1115 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1741 | |
Space Separator | 22 | 1.2% |
Close Punctuation | 4 | 0.2% |
Open Punctuation | 4 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 118 | 6.8% |
대 | 104 | 6.0% |
동 | 97 | 5.6% |
장 | 70 | 4.0% |
포 | 57 | 3.3% |
남 | 48 | 2.8% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
구 | 37 | 2.1% |
부 | 35 | 2.0% |
Other values (96) | 1085 |
Space Separator
Value | Count | Frequency (%) |
22 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1741 | |
Common | 30 | 1.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 118 | 6.8% |
대 | 104 | 6.0% |
동 | 97 | 5.6% |
장 | 70 | 4.0% |
포 | 57 | 3.3% |
남 | 48 | 2.8% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
구 | 37 | 2.1% |
부 | 35 | 2.0% |
Other values (96) | 1085 |
Common
Value | Count | Frequency (%) |
22 | ||
) | 4 | 13.3% |
( | 4 | 13.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1741 | |
ASCII | 30 | 1.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 118 | 6.8% |
대 | 104 | 6.0% |
동 | 97 | 5.6% |
장 | 70 | 4.0% |
포 | 57 | 3.3% |
남 | 48 | 2.8% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
구 | 37 | 2.1% |
부 | 35 | 2.0% |
Other values (96) | 1085 |
ASCII
Value | Count | Frequency (%) |
22 | ||
) | 4 | 13.3% |
( | 4 | 13.3% |
호기
Categorical
Distinct | 27 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
1 | |
---|---|
2 | |
3 | |
4 | |
5 | |
Other values (22) |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.19375 |
Min length | 1 |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 1.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 78 | |
2 | 76 | |
3 | 68 | |
4 | 67 | |
5 | 58 | |
6 | 56 | |
7 | 49 | |
8 | 43 | |
9 | 27 | 4.2% |
10 | 25 | 3.9% |
Other values (17) | 93 |
Length
Value | Count | Frequency (%) |
1 | 78 | |
2 | 76 | |
3 | 68 | |
4 | 67 | |
5 | 58 | |
6 | 56 | |
7 | 49 | |
8 | 43 | |
9 | 27 | 4.2% |
10 | 25 | 3.9% |
Other values (17) | 93 |
설치년도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2006.3422 |
Minimum | 1985 |
---|---|
Maximum | 2018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.8 KiB |
Quantile statistics
Minimum | 1985 |
---|---|
5-th percentile | 1989 |
Q1 | 2002 |
median | 2005 |
Q3 | 2011 |
95-th percentile | 2017 |
Maximum | 2018 |
Range | 33 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 7.2683916 |
---|---|
Coefficient of variation (CV) | 0.0036227078 |
Kurtosis | 0.82369968 |
Mean | 2006.3422 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.70855837 |
Sum | 1284059 |
Variance | 52.829516 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2005 | 167 | |
2011 | 135 | |
2017 | 88 | |
2001 | 64 | 10.0% |
1998 | 52 | 8.1% |
2007 | 34 | 5.3% |
2008 | 18 | 2.8% |
1985 | 13 | 2.0% |
2002 | 12 | 1.9% |
2006 | 9 | 1.4% |
Other values (11) | 48 | 7.5% |
Value | Count | Frequency (%) |
1985 | 13 | 2.0% |
1987 | 6 | 0.9% |
1988 | 8 | 1.2% |
1989 | 7 | 1.1% |
1994 | 4 | 0.6% |
1998 | 52 | 8.1% |
2001 | 64 | 10.0% |
2002 | 12 | 1.9% |
2004 | 1 | 0.2% |
2005 | 167 |
Value | Count | Frequency (%) |
2018 | 2 | 0.3% |
2017 | 88 | |
2016 | 5 | 0.8% |
2015 | 1 | 0.2% |
2014 | 4 | 0.6% |
2012 | 4 | 0.6% |
2011 | 135 | |
2009 | 6 | 0.9% |
2008 | 18 | 2.8% |
2007 | 34 | 5.3% |
교체주기(개량년도)
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 15 |
---|---|
Distinct (%) | 25.9% |
Missing | 582 |
Missing (%) | 90.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2009.2759 |
Minimum | 1998 |
---|---|
Maximum | 2018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.8 KiB |
Quantile statistics
Minimum | 1998 |
---|---|
5-th percentile | 2003.85 |
Q1 | 2004.25 |
median | 2007 |
Q3 | 2015 |
95-th percentile | 2017 |
Maximum | 2018 |
Range | 20 |
Interquartile range (IQR) | 10.75 |
Descriptive statistics
Standard deviation | 5.4476556 |
---|---|
Coefficient of variation (CV) | 0.0027112532 |
Kurtosis | -1.4414379 |
Mean | 2009.2759 |
Median Absolute Deviation (MAD) | 3.5 |
Skewness | 0.19633221 |
Sum | 116538 |
Variance | 29.676951 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2004 | 12 | 1.9% |
2005 | 10 | 1.6% |
2016 | 9 | 1.4% |
2011 | 4 | 0.6% |
2006 | 4 | 0.6% |
2015 | 4 | 0.6% |
2017 | 3 | 0.5% |
2012 | 2 | 0.3% |
2013 | 2 | 0.3% |
2003 | 2 | 0.3% |
Other values (5) | 6 | 0.9% |
(Missing) | 582 |
Value | Count | Frequency (%) |
1998 | 1 | 0.2% |
2003 | 2 | 0.3% |
2004 | 12 | |
2005 | 10 | |
2006 | 4 | 0.6% |
2008 | 1 | 0.2% |
2009 | 1 | 0.2% |
2010 | 1 | 0.2% |
2011 | 4 | 0.6% |
2012 | 2 | 0.3% |
Value | Count | Frequency (%) |
2018 | 2 | 0.3% |
2017 | 3 | 0.5% |
2016 | 9 | |
2015 | 4 | |
2013 | 2 | 0.3% |
2012 | 2 | 0.3% |
2011 | 4 | |
2010 | 1 | 0.2% |
2009 | 1 | 0.2% |
2008 | 1 | 0.2% |
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | |
---|---|---|---|---|---|
호선 | 1.000 | 1.000 | 0.108 | 0.949 | 1.000 |
역명 | 1.000 | 1.000 | 0.000 | 0.998 | 0.954 |
호기 | 0.108 | 0.000 | 1.000 | 0.000 | 0.000 |
설치년도 | 0.949 | 0.998 | 0.000 | 1.000 | 0.905 |
교체주기(개량년도) | 1.000 | 0.954 | 0.000 | 0.905 | 1.000 |
호기 | 호선 | |
---|---|---|
호기 | 1.000 | 0.056 |
호선 | 0.056 | 1.000 |
설치년도 | 교체주기(개량년도) | 호선 | 호기 | |
---|---|---|---|---|
설치년도 | 1.000 | 0.635 | 0.872 | 0.000 |
교체주기(개량년도) | 0.635 | 1.000 | 0.935 | 0.000 |
호선 | 0.872 | 0.935 | 1.000 | 0.056 |
호기 | 0.000 | 0.000 | 0.056 | 1.000 |
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | |
---|---|---|---|---|---|
0 | 1 | 노포 | 1 | 2004 | <NA> |
1 | 1 | 범어사 | 1 | 1985 | 2011 |
2 | 1 | 동래 | 1 | 2011 | <NA> |
3 | 1 | 동래 | 2 | 2011 | <NA> |
4 | 1 | 교대 | 1 | 2016 | <NA> |
5 | 1 | 연산 | 1 | 2006 | <NA> |
6 | 1 | 연산 | 2 | 2006 | <NA> |
7 | 1 | 연산 | 3 | 2008 | <NA> |
8 | 1 | 연산 | 4 | 2008 | <NA> |
9 | 1 | 서면 | 1 | 1985 | 2004 |
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | |
---|---|---|---|---|---|
630 | 4 | 고촌 | 3 | 2011 | <NA> |
631 | 4 | 고촌 | 4 | 2011 | <NA> |
632 | 4 | 고촌 | 5 | 2011 | <NA> |
633 | 4 | 고촌 | 6 | 2011 | <NA> |
634 | 4 | 안평 | 1 | 2011 | <NA> |
635 | 4 | 안평 | 2 | 2011 | <NA> |
636 | 4 | 안평 | 3 | 2011 | <NA> |
637 | 4 | 안평 | 4 | 2011 | <NA> |
638 | 4 | 안평 | 5 | 2011 | <NA> |
639 | 4 | 안평 | 6 | 2011 | <NA> |
Most frequently occurring
호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | # duplicates | |
---|---|---|---|---|---|---|
0 | 1 | 남포 | 7 | 2017 | <NA> | 2 |