Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 1613 |
Missing cells | 1528 |
Missing cells (%) | 9.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 132.4 KiB |
Average record size in memory | 84.1 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 3 |
Text | 2 |
DateTime | 2 |
Dataset
Description | File download |
---|---|
Author | Seoul Metro (서울교통공사) |
URL | https://data.seoul.go.kr/dataList/OA-12927/F/1/datasetView.do |
호선 has constant value "1" | Constant |
면적(제곱미터) has 59 (3.7%) missing values | Missing |
계약시작일자 has 416 (25.8%) missing values | Missing |
계약종료일자 has 416 (25.8%) missing values | Missing |
월임대료 has 637 (39.5%) missing values | Missing |
면적(제곱미터) is highly skewed (γ1 = 31.31107565) | Skewed |
연번 has unique values | Unique |
상가번호 has unique values | Unique |
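The missing-cell total and percentage in the dataset statistics can be cross-checked against the per-column alerts above. A minimal sketch in plain Python, using only numbers printed in this report (the variable names are illustrative):

```python
# Per-column missing counts, copied from the alerts in this report.
missing_by_column = {
    "면적(제곱미터)": 59,
    "계약시작일자": 416,
    "계약종료일자": 416,
    "월임대료": 637,
}

n_rows, n_columns = 1613, 10        # observations and variables
total_cells = n_rows * n_columns    # every cell in the table

missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)                # 1528
print(f"{missing_pct:.1f}%")        # 9.5%
```

The four columns account for all 1528 missing cells, which suggests that the 416 `<NA>` entries in 영업업종 are stored as an ordinary category value rather than counted as missing.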
Reproduction
Analysis started | 2024-04-29 16:39:31.177748 |
---|---|
Analysis finished | 2024-04-29 16:39:32.896880 |
Duration | 1.72 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
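A report like this one can be regenerated with ydata-profiling. The following is only a sketch under stated assumptions: the input file name, its encoding, and the report title are hypothetical and are not taken from this report; only the column names 계약시작일자 and 계약종료일자 come from it.

```python
from pathlib import Path


def build_report(csv_path: str, html_path: str = "report.html") -> Path:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported inside the function so this module still loads when the
    # optional third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded, which is common for Korean
    # public-data CSVs; switch to "utf-8" if decoding fails.
    df = pd.read_csv(
        csv_path,
        encoding="cp949",
        parse_dates=["계약시작일자", "계약종료일자"],
    )

    report = ProfileReport(df, title="Dataset profile")  # hypothetical title
    out = Path(html_path)
    report.to_file(out)
    return out
```

Calling `build_report("stores.csv")` (a hypothetical file name) would write `report.html` next to the script.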
연번
Real number (ℝ)
UNIQUE
 
Distinct | 1613 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 807 |
Minimum | 1 |
---|---|
Maximum | 1613 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 14.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 81.6 |
Q1 | 404 |
median | 807 |
Q3 | 1210 |
95-th percentile | 1532.4 |
Maximum | 1613 |
Range | 1612 |
Interquartile range (IQR) | 806 |
Descriptive statistics
Standard deviation | 465.77731 |
---|---|
Coefficient of variation (CV) | 0.57717138 |
Kurtosis | -1.2 |
Mean | 807 |
Median Absolute Deviation (MAD) | 403 |
Skewness | 0 |
Sum | 1301691 |
Variance | 216948.5 |
Monotonicity | Strictly increasing |
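Because 연번 is simply the sequence 1 to 1613, its statistics above can be reproduced without the dataset. A small sketch with the Python standard library, assuming sample variance (n - 1) and linearly interpolated quantiles, which match the figures shown:

```python
import math
import statistics

values = list(range(1, 1614))    # 연번 runs from 1 to 1613 without gaps

mean = statistics.mean(values)
variance = statistics.variance(values)    # sample variance (n - 1)
std = math.sqrt(variance)
cv = std / mean                           # coefficient of variation

# "inclusive" interpolates linearly between the minimum and the maximum.
cuts = statistics.quantiles(values, n=20, method="inclusive")
p5, q1, median, q3, p95 = cuts[0], cuts[4], cuts[9], cuts[14], cuts[18]

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, variance, round(std, 5), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

A skewness of 0 and a kurtosis of -1.2 are likewise what a uniform sequence is expected to give, so this column behaves as a row counter rather than a measurement.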
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
1073 | 1 | 0.1% |
1083 | 1 | 0.1% |
1082 | 1 | 0.1% |
1081 | 1 | 0.1% |
1080 | 1 | 0.1% |
1079 | 1 | 0.1% |
1078 | 1 | 0.1% |
1077 | 1 | 0.1% |
1076 | 1 | 0.1% |
Other values (1603) | 1603 | 99.4% |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
1613 | 1 | |
1612 | 1 | |
1611 | 1 | |
1610 | 1 | |
1609 | 1 | |
1608 | 1 | |
1607 | 1 | |
1606 | 1 | |
1605 | 1 | |
1604 | 1 |
상가유형
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.7 KiB |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.3261004 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 개별(일반) |
---|---|
2nd row | 개별(일반) |
3rd row | 개별(일반) |
4th row | 개별(일반) |
5th row | 네트워크 |
Common Values
Value | Count | Frequency (%) |
개별(일반) | 600 | 37.2% |
네트워크 | 312 | 19.3% |
67일괄 | 262 | 16.2% |
복합 | 217 | 13.5% |
공실 | 154 | 9.5% |
소송상가 | 34 | 2.1% |
개별(대형) | 34 | 2.1% |
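The Frequency (%) column is each count divided by the number of observations. A short sketch that recomputes it from the counts in the table above (the dictionary is typed in by hand for illustration):

```python
# Counts per 상가유형 category, copied from the Common Values table.
counts = {
    "개별(일반)": 600,
    "네트워크": 312,
    "67일괄": 262,
    "복합": 217,
    "공실": 154,
    "소송상가": 34,
    "개별(대형)": 34,
}

total = sum(counts.values())    # equals the number of observations

# Percentage of all rows, rounded to one decimal as in the report.
percentages = {name: f"{100 * n / total:.1f}%" for name, n in counts.items()}

for name, pct in percentages.items():
    print(name, counts[name], pct)
```

The counts sum to 1613, so the seven categories cover every row with none missing.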
호선
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.7 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 1613 | 100.0% |
역사명
Text
Distinct | 245 |
---|---|
Distinct (%) | 15.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.7 KiB |
Value | Count | Frequency (%) |
오목교역 | 46 | 2.9% |
고속터미널(3)역 | 39 | 2.4% |
공덕(5)역 | 29 | 1.8% |
천호(5)역 | 27 | 1.7% |
잠실(8)역 | 26 | 1.6% |
사당(4)역 | 25 | 1.5% |
노원(7)역 | 22 | 1.4% |
강남구청역 | 21 | 1.3% |
마들역 | 20 | 1.2% |
미아사거리역 | 19 | 1.2% |
Other values (235) | 1339 | 83.0% |
Most occurring characters
Value | Count | Frequency (%) |
역 | 1639 | 20.3% |
) | 561 | 7.0% |
( | 561 | 7.0% |
대 | 211 | 2.6% |
구 | 188 | 2.3% |
신 | 129 | 1.6% |
2 | 114 | 1.4% |
입 | 113 | 1.4% |
사 | 110 | 1.4% |
5 | 107 | 1.3% |
Other values (199) | 4329 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 6337 | 78.6% |
Decimal Number | 603 | 7.5% |
Close Punctuation | 561 | 7.0% |
Open Punctuation | 561 | 7.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
역 | 1639 | |
대 | 211 | 3.3% |
구 | 188 | 3.0% |
신 | 129 | 2.0% |
입 | 113 | 1.8% |
사 | 110 | 1.7% |
산 | 98 | 1.5% |
동 | 96 | 1.5% |
공 | 94 | 1.5% |
미 | 86 | 1.4% |
Other values (189) | 3573 |
Decimal Number
Value | Count | Frequency (%) |
2 | 114 | 18.9% |
5 | 107 | 17.7% |
3 | 106 | 17.6% |
7 | 102 | 16.9% |
6 | 68 | 11.3% |
4 | 62 | 10.3% |
8 | 32 | 5.3% |
1 | 12 | 2.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 561 |
Open Punctuation
Value | Count | Frequency (%) |
( | 561 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 6337 | |
Common | 1725 | 21.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
역 | 1639 | |
대 | 211 | 3.3% |
구 | 188 | 3.0% |
신 | 129 | 2.0% |
입 | 113 | 1.8% |
사 | 110 | 1.7% |
산 | 98 | 1.5% |
동 | 96 | 1.5% |
공 | 94 | 1.5% |
미 | 86 | 1.4% |
Other values (189) | 3573 |
Common
Value | Count | Frequency (%) |
) | 561 | |
( | 561 | |
2 | 114 | 6.6% |
5 | 107 | 6.2% |
3 | 106 | 6.1% |
7 | 102 | 5.9% |
6 | 68 | 3.9% |
4 | 62 | 3.6% |
8 | 32 | 1.9% |
1 | 12 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 6337 | |
ASCII | 1725 | 21.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
역 | 1639 | |
대 | 211 | 3.3% |
구 | 188 | 3.0% |
신 | 129 | 2.0% |
입 | 113 | 1.8% |
사 | 110 | 1.7% |
산 | 98 | 1.5% |
동 | 96 | 1.5% |
공 | 94 | 1.5% |
미 | 86 | 1.4% |
Other values (189) | 3573 |
ASCII
Value | Count | Frequency (%) |
) | 561 | |
( | 561 | |
2 | 114 | 6.6% |
5 | 107 | 6.2% |
3 | 106 | 6.1% |
7 | 102 | 5.9% |
6 | 68 | 3.9% |
4 | 62 | 3.6% |
8 | 32 | 1.9% |
1 | 12 | 0.7% |
상가번호
Text
UNIQUE
 
Distinct | 1613 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.7 KiB |
Value | Count | Frequency (%) |
150-107 | 1 | 0.1% |
639-204 | 1 | 0.1% |
641-202 | 1 | 0.1% |
641-201 | 1 | 0.1% |
641-103 | 1 | 0.1% |
641-102 | 1 | 0.1% |
641-101 | 1 | 0.1% |
640-105 | 1 | 0.1% |
640-104 | 1 | 0.1% |
640-103 | 1 | 0.1% |
Other values (1603) | 1603 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2286 | |
2 | 1646 | |
- | 1613 | |
0 | 1581 | |
3 | 1068 | |
4 | 747 | 6.6% |
7 | 666 | 5.9% |
5 | 653 | 5.8% |
6 | 496 | 4.4% |
8 | 275 | 2.4% |
Other values (2) | 260 | 2.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 9676 | |
Dash Punctuation | 1613 | 14.3% |
Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2286 | |
2 | 1646 | |
0 | 1581 | |
3 | 1068 | |
4 | 747 | 7.7% |
7 | 666 | 6.9% |
5 | 653 | 6.7% |
6 | 496 | 5.1% |
8 | 275 | 2.8% |
9 | 258 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1613 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 11289 | |
Latin | 2 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2286 | |
2 | 1646 | |
- | 1613 | |
0 | 1581 | |
3 | 1068 | |
4 | 747 | 6.6% |
7 | 666 | 5.9% |
5 | 653 | 5.8% |
6 | 496 | 4.4% |
8 | 275 | 2.4% |
Latin
Value | Count | Frequency (%) |
M | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 11291 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2286 | |
2 | 1646 | |
- | 1613 | |
0 | 1581 | |
3 | 1068 | |
4 | 747 | 6.6% |
7 | 666 | 5.9% |
5 | 653 | 5.8% |
6 | 496 | 4.4% |
8 | 275 | 2.4% |
Other values (2) | 260 | 2.3% |
면적(제곱미터)
Real number (ℝ)
MISSING, SKEWED
Distinct | 823 |
---|---|
Distinct (%) | 53.0% |
Missing | 59 |
Missing (%) | 3.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.435418 |
Minimum | 7.61 |
---|---|
Maximum | 7475.19 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 14.3 KiB |
Quantile statistics
Minimum | 7.61 |
---|---|
5-th percentile | 13.186 |
Q1 | 22.4125 |
median | 32 |
Q3 | 44.37 |
95-th percentile | 100 |
Maximum | 7475.19 |
Range | 7467.58 |
Interquartile range (IQR) | 21.9575 |
Descriptive statistics
Standard deviation | 204.77191 |
---|---|
Coefficient of variation (CV) | 4.0600815 |
Kurtosis | 1117.0738 |
Mean | 50.435418 |
Median Absolute Deviation (MAD) | 10.7 |
Skewness | 31.311076 |
Sum | 78376.64 |
Variance | 41931.534 |
Monotonicity | Not monotonic |
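The descriptive statistics for 면적(제곱미터) can be checked against each other using only the figures printed above. A minimal sketch (the variable names are illustrative; small differences come from rounding in the report):

```python
mean = 50.435418
std = 204.77191
total = 78376.64              # the "Sum" row
n_rows, n_missing = 1613, 59

n_valid = n_rows - n_missing  # rows that actually have an area
implied_n = total / mean      # sum / mean should give the same count
cv = std / mean               # coefficient of variation
variance = std ** 2

print(n_valid, round(implied_n))
print(round(cv, 7), round(variance, 2))
```

The mean (about 50.4) lies above Q3 (44.37), which is consistent with the high skewness: a few very large units, such as the 7475.19 maximum, pull the mean upward while the median stays at 32.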
Value | Count | Frequency (%) |
30.0 | 47 | 2.9% |
33.0 | 38 | 2.4% |
40.0 | 29 | 1.8% |
20.0 | 20 | 1.2% |
35.0 | 19 | 1.2% |
50.0 | 18 | 1.1% |
37.0 | 17 | 1.1% |
32.0 | 15 | 0.9% |
25.0 | 15 | 0.9% |
31.0 | 15 | 0.9% |
Other values (813) | 1321 | 81.9% |
(Missing) | 59 | 3.7% |
Value | Count | Frequency (%) |
7.61 | 1 | |
8.0 | 1 | |
8.15 | 1 | |
8.25 | 1 | |
9.01 | 1 | |
9.05 | 1 | |
9.06 | 1 | |
9.2 | 1 | |
9.36 | 1 | |
9.41 | 1 |
Value | Count | Frequency (%) |
7475.19 | 1 | |
1351.0 | 1 | |
1260.58 | 1 | |
900.39 | 1 | |
871.4 | 1 | |
867.64 | 1 | |
849.0 | 1 | |
808.0 | 1 | |
708.0 | 1 | |
592.0 | 1 |
영업업종
Categorical
Distinct | 12 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.7 KiB |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 2.9578425 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 사무실 |
---|---|
2nd row | 의류 |
3rd row | 기타 |
4th row | 플라워 |
5th row | 식음료 |
Common Values
Value | Count | Frequency (%) |
<NA> | 416 | 25.8% |
의류 | 272 | 16.9% |
기타 | 197 | 12.2% |
편의점 | 174 | 10.8% |
식음료 | 148 | 9.2% |
제과 | 134 | 8.3% |
액세서리 | 101 | 6.3% |
플라워 | 52 | 3.2% |
화장품 | 48 | 3.0% |
사무실 | 32 | 2.0% |
Other values (2) | 39 | 2.4% |
계약시작일자
Date
MISSING
 
Distinct | 316 |
---|---|
Distinct (%) | 26.4% |
Missing | 416 |
Missing (%) | 25.8% |
Memory size | 12.7 KiB |
Minimum | 2010-01-28 00:00:00 |
---|---|
Maximum | 2021-12-13 00:00:00 |
계약종료일자
Date
MISSING
 
Distinct | 332 |
---|---|
Distinct (%) | 27.7% |
Missing | 416 |
Missing (%) | 25.8% |
Memory size | 12.7 KiB |
Minimum | 2017-04-27 00:00:00 |
---|---|
Maximum | 2027-01-21 00:00:00 |
월임대료
Real number (ℝ)
MISSING
 
Distinct | 870 |
---|---|
Distinct (%) | 89.1% |
Missing | 637 |
Missing (%) | 39.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6330099.3 |
Minimum | 153600 |
---|---|
Maximum | 2.8462293 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 14.3 KiB |
Quantile statistics
Minimum | 153600 |
---|---|
5-th percentile | 596625 |
Q1 | 1928872.9 |
median | 3821348 |
Q3 | 6975317.4 |
95-th percentile | 15565267 |
Maximum | 2.8462293 × 10^8 |
Range | 2.8446933 × 10^8 |
Interquartile range (IQR) | 5046444.5 |
Descriptive statistics
Standard deviation | 15111661 |
---|---|
Coefficient of variation (CV) | 2.3872707 |
Kurtosis | 188.74379 |
Mean | 6330099.3 |
Median Absolute Deviation (MAD) | 2230000 |
Skewness | 12.586932 |
Sum | 6.1781769 × 10^9 |
Variance | 2.2836229 × 10^14 |
Monotonicity | Not monotonic |
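The large 월임대료 figures are in scientific notation, and their exponents can be confirmed from the other rows. A brief sketch using the printed mean, standard deviation, and sum (variable names are illustrative):

```python
import math

mean = 6330099.3
std = 15111661.0
total = 6.1781769e9               # the "Sum" row, i.e. about 6.18 billion
n_rows, n_missing = 1613, 637

n_valid = n_rows - n_missing      # rows with a rent value
implied_n = total / mean          # sum / mean should match n_valid

variance = std ** 2               # variance is the squared standard deviation
exponent = math.floor(math.log10(variance))

print(n_valid, round(implied_n))
print(f"{variance:.4e}", exponent)
```

The squared standard deviation has exponent 14 and the sum divided by the mean gives the 976 non-missing rows, so the variance is on the order of 10^14 and the sum on the order of 10^9.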
Value | Count | Frequency (%) |
2500000.0 | 5 | 0.3% |
2810000.0 | 4 | 0.2% |
2300000.0 | 4 | 0.2% |
1700000.0 | 4 | 0.2% |
1200000.0 | 4 | 0.2% |
3550000.0 | 4 | 0.2% |
2200000.0 | 4 | 0.2% |
4310000.0 | 4 | 0.2% |
2150000.0 | 3 | 0.2% |
4200000.0 | 3 | 0.2% |
Other values (860) | 937 | 58.1% |
(Missing) | 637 | 39.5% |
Value | Count | Frequency (%) |
153600.0 | 1 | |
186000.0 | 1 | |
233500.0 | 1 | |
252000.0 | 1 | |
300000.0 | 1 | |
302500.0 | 1 | |
311666.6667 | 1 | |
328100.0 | 1 | |
330000.0 | 1 | |
337800.0 | 1 |
Value | Count | Frequency (%) |
284622927.0 | 1 | |
217793378.0 | 1 | |
176500000.0 | 1 | |
152935000.0 | 1 | |
145000000.0 | 1 | |
61517300.0 | 1 | |
55185100.0 | 1 | |
48204012.07 | 1 | |
40100000.0 | 1 | |
29358258.0 | 1 |
Variable | 연번 | 상가유형 | 면적(제곱미터) | 영업업종 | 월임대료 |
---|---|---|---|---|---|
연번 | 1.000 | 0.476 | 0.057 | 0.340 | 0.000 |
상가유형 | 0.476 | 1.000 | 0.346 | 0.569 | 0.368 |
면적(제곱미터) | 0.057 | 0.346 | 1.000 | 0.179 | 0.809 |
영업업종 | 0.340 | 0.569 | 0.179 | 1.000 | 0.000 |
월임대료 | 0.000 | 0.368 | 0.809 | 0.000 | 1.000 |
Variable | 영업업종 | 상가유형 |
---|---|---|
영업업종 | 1.000 | 0.356 |
상가유형 | 0.356 | 1.000 |
Variable | 연번 | 면적(제곱미터) | 월임대료 | 상가유형 | 영업업종 |
---|---|---|---|---|---|
연번 | 1.000 | 0.390 | 0.098 | 0.264 | 0.151 |
면적(제곱미터) | 0.390 | 1.000 | 0.382 | 0.248 | 0.105 |
월임대료 | 0.098 | 0.382 | 1.000 | 0.246 | 0.000 |
상가유형 | 0.264 | 0.248 | 0.246 | 1.000 | 0.356 |
영업업종 | 0.151 | 0.105 | 0.000 | 0.356 | 1.000 |
Row index | 연번 | 상가유형 | 호선 | 역사명 | 상가번호 | 면적(제곱미터) | 영업업종 | 계약시작일자 | 계약종료일자 | 월임대료 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 개별(일반) | 1 | 서울(1)역 | 150-107 | 33.0 | 사무실 | 2019-05-08 | 2024-06-06 | 527100.0 |
1 | 2 | 개별(일반) | 1 | 시청(1)역 | 151-101 | 29.73 | 의류 | 2017-04-04 | 2022-05-03 | 3858954.0 |
2 | 3 | 개별(일반) | 1 | 시청(1)역 | 151-103 | 57.6 | 기타 | 2020-02-01 | 2025-01-31 | 1858300.0 |
3 | 4 | 개별(일반) | 1 | 시청(1)역 | 151-104 | 25.0 | 플라워 | 2020-12-31 | 2026-01-30 | 2470600.0 |
4 | 5 | 네트워크 | 1 | 시청(1)역 | 151-105 | 25.0 | 식음료 | 2021-06-03 | 2026-08-02 | 4145884.24 |
5 | 6 | 개별(일반) | 1 | 시청(1)역 | 151-106 | 14.0 | 액세서리 | 2017-09-19 | 2022-11-17 | 1801800.0 |
6 | 7 | 개별(일반) | 1 | 시청(1)역 | 151-107 | 22.0 | 의류 | 2020-09-18 | 2025-10-18 | 2613800.0 |
7 | 8 | 공실 | 1 | 종각역 | 152-101 | 36.85 | <NA> | <NA> | <NA> | <NA> |
8 | 9 | 공실 | 1 | 종각역 | 152-104 | 18.64 | <NA> | <NA> | <NA> | <NA> |
9 | 10 | 개별(일반) | 1 | 종각역 | 152-105 | 29.3 | 편의점 | 2017-04-18 | 2022-04-17 | 6549400.0 |
Row index | 연번 | 상가유형 | 호선 | 역사명 | 상가번호 | 면적(제곱미터) | 영업업종 | 계약시작일자 | 계약종료일자 | 월임대료 |
---|---|---|---|---|---|---|---|---|---|---|
1603 | 1604 | 개별(일반) | 1 | 남한산성입구역 | 822-205 | 17.0 | 식음료 | 2019-10-29 | 2024-11-27 | 1666600.0 |
1604 | 1605 | 공실 | 1 | 단대오거리역 | 823-101 | 42.5 | <NA> | <NA> | <NA> | <NA> |
1605 | 1606 | 개별(일반) | 1 | 단대오거리역 | 823-102 | 36.78 | 기타 | 2021-01-21 | 2026-02-20 | 1700000.0 |
1606 | 1607 | 네트워크 | 1 | 단대오거리역 | 823-201 | 32.5 | 편의점 | 2016-07-25 | 2021-11-17 | 8712991.0 |
1607 | 1608 | 공실 | 1 | 단대오거리역 | 823-202 | 28.97 | <NA> | <NA> | <NA> | <NA> |
1608 | 1609 | 개별(일반) | 1 | 단대오거리역 | 823-203 | 54.03 | 식음료 | 2018-08-31 | 2023-09-29 | 7630000.0 |
1609 | 1610 | 개별(일반) | 1 | 단대오거리역 | 823-204 | 75.09 | 의류 | 2021-03-18 | 2026-04-17 | 3780000.0 |
1610 | 1611 | 네트워크 | 1 | 신흥역 | 824-101 | 40.0 | 편의점 | 2016-07-25 | 2021-11-17 | 6124682.0 |
1611 | 1612 | 네트워크 | 1 | 수진역 | 825-101 | 40.0 | 편의점 | 2016-07-25 | 2021-11-17 | 5575875.0 |
1612 | 1613 | 네트워크 | 1 | 모란역 | 826-101 | 50.0 | 편의점 | 2016-07-25 | 2021-11-17 | 5831070.0 |