Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 644 |
Missing cells | 584 |
Missing cells (%) | 18.1% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.2% |
Total size in memory | 27.2 KiB |
Average record size in memory | 43.2 B |
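The two percentages in this table can be recomputed from the raw counts. The sketch below is a quick arithmetic check, assuming that the missing-cell share is taken over all cells (variables × observations) and the duplicate share over all rows; the variable names are illustrative only.

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 5
n_observations = 644
missing_cells = 584
duplicate_rows = 1

# Assumption: the missing-cell share is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the duplicate share is taken over the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Both values round to the figures reported above, which supports the assumed denominators.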
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
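Description | 부산교통공사_승강기연도별설치현황_20210526 (Busan Transportation Corporation, elevator installations by year, 2021-05-26) |
---|---|
Author | 부산교통공사 (Busan Transportation Corporation) |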
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15052663 |
Dataset has 1 (0.2%) duplicate rows | Duplicates |
설치년도 is highly overall correlated with 교체주기(개량년도) and 1 other fields | High correlation |
교체주기(개량년도) is highly overall correlated with 설치년도 and 1 other fields | High correlation |
호선 is highly overall correlated with 설치년도 and 1 other fields | High correlation |
교체주기(개량년도) has 584 (90.7%) missing values | Missing |
Reproduction
Analysis started | 2023-12-10 16:07:25.198484 |
---|---|
Analysis finished | 2023-12-10 16:07:26.183882 |
Duration | 0.99 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
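A report like this one is typically produced with a few lines of Python. The following is a minimal sketch, not the exact invocation used here: the function name `build_report`, the file paths, and the `cp949` encoding are assumptions (the source file format and encoding are not stated in this report), while `ProfileReport` and `to_file` are the standard ydata-profiling entry points.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this sketch can be loaded even when the
    # third-party packages (pandas, ydata-profiling) are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source is a CSV file. cp949 is a common encoding for
    # Korean public datasets, but the real encoding is not given here.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is copied from the "Description" field of this report.
    report = ProfileReport(df, title="부산교통공사_승강기연도별설치현황_20210526")
    report.to_file(html_path)
    return report
```

A call such as `build_report("elevators.csv", "report.html")` (both paths hypothetical) would write the HTML report next to the script.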
호선
Categorical
HIGH CORRELATION
Distinct | 4 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 200 | 31.1% |
3 | 174 | 27.0% |
1 | 139 | 21.6% |
4 | 131 | 20.3% |
역명
Text
Distinct | 77 |
---|---|
Distinct (%) | 12.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.2 KiB |
Value | Count | Frequency (%) |
동래 | 26 | 4.0% |
다대포항 | 21 | 3.2% |
만덕 | 20 | 3.1% |
연산동 | 20 | 3.1% |
배산 | 16 | 2.4% |
서면 | 16 | 2.4% |
수안 | 16 | 2.4% |
센텀시티 | 14 | 2.1% |
장림 | 14 | 2.1% |
종합운동장 | 13 | 2.0% |
Other values (68) | 479 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 120 | 6.7% |
대 | 104 | 5.8% |
동 | 97 | 5.4% |
장 | 70 | 3.9% |
포 | 57 | 3.2% |
남 | 48 | 2.7% |
서 | 47 | 2.6% |
수 | 43 | 2.4% |
구 | 37 | 2.1% |
부 | 37 | 2.1% |
Other values (100) | 1121 | 62.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1751 | 98.3% |
Space Separator | 22 | 1.2% |
Close Punctuation | 4 | 0.2% |
Open Punctuation | 4 | 0.2% |
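The category labels in this table correspond to Unicode general categories (Other Letter = `Lo`, Space Separator = `Zs`, Open Punctuation = `Ps`, Close Punctuation = `Pe`). The sketch below shows one way such a tally could be computed with the Python standard library. It uses three station names from the table above plus one made-up string that is not from the dataset, so the counts do not match the report.

```python
import unicodedata
from collections import Counter

# Three station names from the report plus one hypothetical string
# (not from the dataset) that contains a space and parentheses.
names = ["동래", "다대포항", "종합운동장", "예시 (가상)"]

# Tally the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

total = sum(category_counts.values())
for code, count in category_counts.most_common():
    print(code, count, f"{100 * count / total:.1f}%")
```

Hangul syllables fall under `Lo`, which is why almost all characters in the 역명 column are reported as Other Letter.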
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 120 | 6.9% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.3% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
구 | 37 | 2.1% |
부 | 37 | 2.1% |
Other values (97) | 1091 | 62.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 22 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1751 | 98.3% |
Common | 30 | 1.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 120 | 6.9% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.3% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
구 | 37 | 2.1% |
부 | 37 | 2.1% |
Other values (97) | 1091 | 62.3% |
Common
Value | Count | Frequency (%) |
(space) | 22 | 73.3% |
) | 4 | 13.3% |
( | 4 | 13.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1751 | 98.3% |
ASCII | 30 | 1.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 120 | 6.9% |
대 | 104 | 5.9% |
동 | 97 | 5.5% |
장 | 70 | 4.0% |
포 | 57 | 3.3% |
남 | 48 | 2.7% |
서 | 47 | 2.7% |
수 | 43 | 2.5% |
구 | 37 | 2.1% |
부 | 37 | 2.1% |
Other values (97) | 1091 | 62.3% |
ASCII
Value | Count | Frequency (%) |
(space) | 22 | 73.3% |
) | 4 | 13.3% |
( | 4 | 13.3% |
호기
Categorical
Distinct | 27 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.2 KiB |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.1925466 |
Min length | 1 |
Unique
Unique | 7 |
---|---|
Unique (%) | 1.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 79 | 12.3% |
2 | 77 | 12.0% |
3 | 68 | 10.6% |
4 | 67 | 10.4% |
5 | 59 | 9.2% |
6 | 57 | 8.9% |
7 | 49 | 7.6% |
8 | 43 | 6.7% |
9 | 27 | 4.2% |
10 | 25 | 3.9% |
Other values (17) | 93 | 14.4% |
설치년도
Real number (ℝ)
HIGH CORRELATION
Distinct | 22 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2006.4239 |
Minimum | 1985 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.8 KiB |
Quantile statistics
Minimum | 1985 |
---|---|
5-th percentile | 1989 |
Q1 | 2002 |
median | 2005 |
Q3 | 2011 |
95-th percentile | 2017 |
Maximum | 2021 |
Range | 36 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 7.32019 |
---|---|
Coefficient of variation (CV) | 0.0036483766 |
Kurtosis | 0.79359092 |
Mean | 2006.4239 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.6852411 |
Sum | 1292137 |
Variance | 53.585182 |
Monotonicity | Not monotonic |
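Several of the figures above are derived from one another. As a quick consistency check, the sketch below recomputes the mean, variance, coefficient of variation, IQR and range for 설치년도 from the other reported values. It assumes the usual definitions (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean); small differences are expected because the inputs are already rounded.

```python
# Values copied from the 설치년도 statistics above.
n = 644
total = 1292137
std = 7.32019
q1, q3 = 2002, 2011
minimum, maximum = 1985, 2021

mean = total / n                 # reported Mean
variance = std ** 2              # reported Variance
cv = std / mean                  # reported Coefficient of variation (CV)
iqr = q3 - q1                    # reported Interquartile range (IQR)
value_range = maximum - minimum  # reported Range

print(round(mean, 4), round(variance, 3), round(cv, 7), iqr, value_range)
```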
Value | Count | Frequency (%) |
2005 | 167 | 25.9% |
2011 | 135 | 21.0% |
2017 | 88 | 13.7% |
2001 | 64 | 9.9% |
1998 | 52 | 8.1% |
2007 | 34 | 5.3% |
2008 | 18 | 2.8% |
1985 | 13 | 2.0% |
2002 | 12 | 1.9% |
2006 | 9 | 1.4% |
Other values (12) | 52 | 8.1% |
Value | Count | Frequency (%) |
1985 | 13 | 2.0% |
1987 | 6 | 0.9% |
1988 | 8 | 1.2% |
1989 | 7 | 1.1% |
1994 | 4 | 0.6% |
1998 | 52 | 8.1% |
2001 | 64 | 9.9% |
2002 | 12 | 1.9% |
2004 | 1 | 0.2% |
2005 | 167 | 25.9% |
Value | Count | Frequency (%) |
2021 | 2 | 0.3% |
2018 | 4 | 0.6% |
2017 | 88 | 13.7% |
2016 | 5 | 0.8% |
2015 | 1 | 0.2% |
2014 | 4 | 0.6% |
2012 | 4 | 0.6% |
2011 | 135 | 21.0% |
2009 | 6 | 0.9% |
2008 | 18 | 2.8% |
교체주기(개량년도)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 17 |
---|---|
Distinct (%) | 28.3% |
Missing | 584 |
Missing (%) | 90.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2009.6167 |
Minimum | 1998 |
---|---|
Maximum | 2020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.8 KiB |
Quantile statistics
Minimum | 1998 |
---|---|
5-th percentile | 2003.95 |
Q1 | 2004.75 |
median | 2008.5 |
Q3 | 2016 |
95-th percentile | 2018 |
Maximum | 2020 |
Range | 22 |
Interquartile range (IQR) | 11.25 |
Descriptive statistics
Standard deviation | 5.66611 |
---|---|
Coefficient of variation (CV) | 0.0028194979 |
Kurtosis | -1.4105201 |
Mean | 2009.6167 |
Median Absolute Deviation (MAD) | 4.5 |
Skewness | 0.18898412 |
Sum | 120577 |
Variance | 32.104802 |
Monotonicity | Not monotonic |
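Because 584 of the 644 values are missing, the statistics for 교체주기(개량년도) appear to be computed over the 60 non-missing values only. The sketch below checks this reading against the reported figures. The choice of denominator for Distinct (%) is an inference from the numbers, not something the report states.

```python
# Values copied from the 교체주기(개량년도) statistics above.
n_rows = 644
n_missing = 584
n_distinct = 17
total = 120577
std = 5.66611

n_present = n_rows - n_missing  # non-missing observations

missing_pct = 100 * n_missing / n_rows
# Assumption: Distinct (%) is taken over non-missing values, not all rows.
distinct_pct = 100 * n_distinct / n_present
mean = total / n_present
cv = std / mean

print(n_present, f"{missing_pct:.1f}%", f"{distinct_pct:.1f}%", round(mean, 4))
```

The frequency tables that follow, by contrast, use all 644 rows as the denominator (for example, 12 / 644 ≈ 1.9% for 2004).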
Value | Count | Frequency (%) |
2004 | 12 | 1.9% |
2005 | 10 | 1.6% |
2016 | 9 | 1.4% |
2006 | 4 | 0.6% |
2015 | 4 | 0.6% |
2011 | 4 | 0.6% |
2017 | 3 | 0.5% |
2012 | 2 | 0.3% |
2013 | 2 | 0.3% |
2003 | 2 | 0.3% |
Other values (7) | 8 | 1.2% |
(Missing) | 584 | 90.7% |
Value | Count | Frequency (%) |
1998 | 1 | 0.2% |
2003 | 2 | 0.3% |
2004 | 12 | 1.9% |
2005 | 10 | 1.6% |
2006 | 4 | 0.6% |
2008 | 1 | 0.2% |
2009 | 1 | 0.2% |
2010 | 1 | 0.2% |
2011 | 4 | 0.6% |
2012 | 2 | 0.3% |
Value | Count | Frequency (%) |
2020 | 1 | 0.2% |
2019 | 1 | 0.2% |
2018 | 2 | 0.3% |
2017 | 3 | 0.5% |
2016 | 9 | 1.4% |
2015 | 4 | 0.6% |
2013 | 2 | 0.3% |
2012 | 2 | 0.3% |
2011 | 4 | 0.6% |
2010 | 1 | 0.2% |
Variable | 호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) |
---|---|---|---|---|---|
호선 | 1.000 | 0.998 | 0.109 | 0.987 | 1.000 |
역명 | 0.998 | 1.000 | 0.000 | 0.997 | 0.883 |
호기 | 0.109 | 0.000 | 1.000 | 0.000 | 0.000 |
설치년도 | 0.987 | 0.997 | 0.000 | 1.000 | 0.720 |
교체주기(개량년도) | 1.000 | 0.883 | 0.000 | 0.720 | 1.000 |
Variable | 호기 | 호선 |
---|---|---|
호기 | 1.000 | 0.056 |
호선 | 0.056 | 1.000 |
Variable | 설치년도 | 교체주기(개량년도) | 호선 | 호기 |
---|---|---|---|---|
설치년도 | 1.000 | 0.659 | 0.941 | 0.000 |
교체주기(개량년도) | 0.659 | 1.000 | 0.938 | 0.000 |
호선 | 0.941 | 0.938 | 1.000 | 0.056 |
호기 | 0.000 | 0.000 | 0.056 | 1.000 |
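The report does not label which association measure each of the tables above uses. For a pair of categorical columns such as 호선 and 호기, one common choice is Cramér's V. The sketch below implements the plain, uncorrected statistic with the standard library on made-up contingency tables. It is an illustration of the technique only: ydata-profiling may apply a bias correction or a different measure, so these values need not match the report.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows.

    Assumes every row and every column has a non-zero total.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-squared statistic against the independence model.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by the sample size and the smaller table dimension minus one.
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical tables, not taken from the dataset.
perfect = [[10, 0], [0, 10]]     # categories fully determine each other
independent = [[5, 5], [5, 5]]   # no association at all

print(cramers_v(perfect), cramers_v(independent))
```

Values near 0, such as the 0.056 reported for 호기 and 호선, indicate that the two columns are close to independent under whichever measure was used.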
First rows
Row index | 호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) |
---|---|---|---|---|---|
0 | 1 | 노포 | 1 | 2004 | <NA> |
1 | 1 | 범어사 | 1 | 1985 | 2011 |
2 | 1 | 동래 | 1 | 2011 | <NA> |
3 | 1 | 동래 | 2 | 2011 | <NA> |
4 | 1 | 교대 | 1 | 2016 | <NA> |
5 | 1 | 연산 | 1 | 2006 | <NA> |
6 | 1 | 연산 | 2 | 2006 | <NA> |
7 | 1 | 연산 | 3 | 2008 | <NA> |
8 | 1 | 연산 | 4 | 2008 | <NA> |
9 | 1 | 서면 | 1 | 1985 | 2004 |
Last rows
Row index | 호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) |
---|---|---|---|---|---|
634 | 4 | 고촌 | 3 | 2011 | <NA> |
635 | 4 | 고촌 | 4 | 2011 | <NA> |
636 | 4 | 고촌 | 5 | 2011 | <NA> |
637 | 4 | 고촌 | 6 | 2011 | <NA> |
638 | 4 | 안평 | 1 | 2011 | <NA> |
639 | 4 | 안평 | 2 | 2011 | <NA> |
640 | 4 | 안평 | 3 | 2011 | <NA> |
641 | 4 | 안평 | 4 | 2011 | <NA> |
642 | 4 | 안평 | 5 | 2011 | <NA> |
643 | 4 | 안평 | 6 | 2011 | <NA> |
Most frequently occurring
Row index | 호선 | 역명 | 호기 | 설치년도 | 교체주기(개량년도) | # duplicates |
---|---|---|---|---|---|---|
0 | 1 | 남포 | 7 | 2017 | <NA> | 2 |