Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 1942 |
Missing cells | 1943 |
Missing cells (%) | 8.3% |
Duplicate rows | 297 |
Duplicate rows (%) | 15.3% |
Total size in memory | 182.2 KiB |
Average record size in memory | 96.1 B |
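The percentages above follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing-cell percentage is taken over rows × variables and the average record size is total memory divided by the number of observations (assumptions that are consistent with the figures reported here, not definitions stated by the report):

```python
# Headline figures copied from the Dataset statistics table.
n_rows = 1942
n_vars = 12
missing_cells = 1943
duplicate_rows = 297
total_kib = 182.2

# Assumed definitions; they reproduce the reported values.
missing_pct = 100 * missing_cells / (n_rows * n_vars)
duplicate_pct = 100 * duplicate_rows / n_rows
avg_record_bytes = total_kib * 1024 / n_rows

print(f"{missing_pct:.1f}% missing, {duplicate_pct:.1f}% duplicates, "
      f"{avg_record_bytes:.1f} B per record")
```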
Variable types
Unsupported | 6 |
---|---|
Categorical | 3 |
Text | 2 |
DateTime | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metro (서울교통공사) |
URL | https://data.seoul.go.kr/dataList/OA-12927/F/1/datasetView.do |
Alerts
Unnamed: 11 has constant value "" | Constant |
Dataset has 297 (15.3%) duplicate rows | Duplicates |
Unnamed: 1 is highly overall correlated with Unnamed: 10 | High correlation |
Unnamed: 2 is highly overall correlated with Unnamed: 10 | High correlation |
Unnamed: 10 is highly overall correlated with Unnamed: 1 and 1 other fields | High correlation |
Unnamed: 10 is highly imbalanced (57.7%) | Imbalance |
Unnamed: 5 has 208 (10.7%) missing values | Missing |
Unnamed: 6 has 467 (24.0%) missing values | Missing |
Unnamed: 7 has 526 (27.1%) missing values | Missing |
Unnamed: 8 has 526 (27.1%) missing values | Missing |
Unnamed: 9 has 215 (11.1%) missing values | Missing |
상가현황(2017.10월) is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
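Several of these alerts are consistent with the sheet title having been read as the header row: the first column is named 상가현황(2017.10월), the rest are Unnamed: N, and the first sample row (index 0) holds what look like the real column names (NO, 상가유형, 호선, ...). A header string mixed into each column, plus entries such as 무상 in the rent column, would plausibly explain the Unsupported types. The following is a minimal standard-library sketch of promoting that first row to the header and coercing the rent column; `promote_header` and `parse_rent` are hypothetical helper names, and the rows are abridged from the sample in this report.

```python
def promote_header(rows):
    """Use the first row as column names and return the remaining rows as dicts."""
    header, *body = rows
    return [dict(zip(header, row)) for row in body]


def parse_rent(value):
    """Return the monthly rent as a float, or None for non-numeric entries such as '무상'."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


# Rows abridged from the sample shown in this report (a subset of columns).
raw = [
    ["NO", "상가유형", "호선", "역명", "면적", "월 임대료(원)"],
    ["1", "공실", "1호선", "서울(1)역", "33", "6254183.333333"],
    ["7", "개별(일반-무상)", "1호선", "시청(1)역", "57.6", "무상"],
]

records = promote_header(raw)
for rec in records:
    rec["면적"] = float(rec["면적"])
    rec["월 임대료(원)"] = parse_rent(rec["월 임대료(원)"])

print(records[1]["역명"], records[1]["월 임대료(원)"])
```

With a real header in place, the numeric and date columns can be typed consistently before profiling again.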
Reproduction
Analysis started | 2024-04-29 16:39:04.347364 |
---|---|
Analysis finished | 2024-04-29 16:39:05.271155 |
Duration | 0.92 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
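A report like this one can be regenerated with a few lines of Python. The sketch below assumes the data has already been loaded into a pandas DataFrame (the loading step is omitted because the report does not state the file format); `build_profile`, the title and the output path are illustrative placeholders.

```python
def build_profile(df, out_path="report.html"):
    """Profile a pandas DataFrame and write the HTML report to out_path."""
    # Imported lazily so this module loads even where ydata-profiling is absent.
    from ydata_profiling import ProfileReport

    report = ProfileReport(df, title="상가현황(2017.10월)")
    report.to_file(out_path)
    return report
```

Timings will differ from the 0.92 seconds recorded above, since they depend on the machine and the library version.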
상가현황(2017.10월)
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Unnamed: 1
Categorical
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Length
Max length | 12 |
---|---|
Median length | 2 |
Mean length | 4.5942327 |
Min length | 2 |
Unique
Unique | 2 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 상가유형 |
---|---|
2nd row | 공실 |
3rd row | 네트워크(브랜드) |
4th row | 네트워크(브랜드) |
5th row | 네트워크(브랜드) |
Common Values
Value | Count | Frequency (%) |
네트워크(브랜드) | 426 | 21.9% |
GS | 406 | 20.9% |
개별(일반) | 398 | 20.5% |
공실 | 309 | 15.9% |
복합 | 250 | 12.9% |
입찰공고중 | 82 | 4.2% |
개별(장기) | 29 | 1.5% |
개별(대형) | 19 | 1.0% |
기타 | 19 | 1.0% |
개별(일반-무상) | 2 | 0.1% |
Other values (2) | 2 | 0.1% |
Unnamed: 2
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.9994851 |
Min length | 2 |
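The mean length of 2.9994851 can be checked against the Common Values counts for this variable: 1941 values are three characters long and the single stray header value 호선 is two. A short sketch of that check, assuming length is counted in Unicode code points:

```python
# Value counts for Unnamed: 2, copied from its Common Values table.
counts = {
    "7호선": 519, "5호선": 358, "2호선": 329, "6호선": 265, "4호선": 195,
    "3호선": 170, "8호선": 55, "1호선": 50, "호선": 1,
}

n = sum(counts.values())
mean_length = sum(len(value) * c for value, c in counts.items()) / n

print(n, round(mean_length, 7))
```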
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 호선 |
---|---|
2nd row | 1호선 |
3rd row | 1호선 |
4th row | 1호선 |
5th row | 1호선 |
Common Values
Value | Count | Frequency (%) |
7호선 | 519 | 26.7% |
5호선 | 358 | 18.4% |
2호선 | 329 | 16.9% |
6호선 | 265 | 13.6% |
4호선 | 195 | 10.0% |
3호선 | 170 | 8.8% |
8호선 | 55 | 2.8% |
1호선 | 50 | 2.6% |
호선 | 1 | 0.1% |
Unnamed: 3
Text
Distinct | 249 |
---|---|
Distinct (%) | 12.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Value | Count | Frequency (%) |
오목교역 | 83 | 4.3% |
반포역 | 46 | 2.4% |
청담역 | 39 | 2.0% |
사당(4)역 | 33 | 1.7% |
잠실역 | 33 | 1.7% |
합정역 | 30 | 1.5% |
공덕역 | 29 | 1.5% |
천호역 | 28 | 1.4% |
고속터미널역 | 27 | 1.4% |
이수역 | 25 | 1.3% |
Other values (233) | 1569 |
Most occurring characters
Value | Count | Frequency (%) |
역 | 1965 | |
(space) | 624 | 7.1% |
대 | 234 | 2.7% |
) | 230 | 2.6% |
( | 230 | 2.6% |
구 | 193 | 2.2% |
신 | 152 | 1.7% |
입 | 127 | 1.4% |
사 | 123 | 1.4% |
동 | 120 | 1.4% |
Other values (202) | 4801 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7427 | |
Space Separator | 624 | 7.1% |
Decimal Number | 288 | 3.3% |
Close Punctuation | 230 | 2.6% |
Open Punctuation | 230 | 2.6% |
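The category names in this table correspond to Unicode general categories (Other Letter is `Lo`, Space Separator is `Zs`, Decimal Number is `Nd`, Open and Close Punctuation are `Ps` and `Pe`). A minimal sketch of the same kind of tally using Python's standard `unicodedata` module, applied to one station name from this dataset; `category_counts` is a hypothetical helper, not the profiler's own code:

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in text by Unicode general category (e.g. 'Lo', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


counts = category_counts("서울(1)역")
print(counts)
```

Hangul syllables fall under `Lo`, which is why that category dominates station names.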
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
역 | 1965 | |
대 | 234 | 3.2% |
구 | 193 | 2.6% |
신 | 152 | 2.0% |
입 | 127 | 1.7% |
사 | 123 | 1.7% |
동 | 120 | 1.6% |
산 | 113 | 1.5% |
목 | 106 | 1.4% |
교 | 103 | 1.4% |
Other values (192) | 4191 |
Decimal Number
Value | Count | Frequency (%) |
4 | 81 | |
3 | 65 | |
2 | 52 | |
7 | 28 | 9.7% |
6 | 28 | 9.7% |
1 | 23 | 8.0% |
5 | 11 | 3.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 624 | |
Close Punctuation
Value | Count | Frequency (%) |
) | 230 |
Open Punctuation
Value | Count | Frequency (%) |
( | 230 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7427 | |
Common | 1372 | 15.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
역 | 1965 | |
대 | 234 | 3.2% |
구 | 193 | 2.6% |
신 | 152 | 2.0% |
입 | 127 | 1.7% |
사 | 123 | 1.7% |
동 | 120 | 1.6% |
산 | 113 | 1.5% |
목 | 106 | 1.4% |
교 | 103 | 1.4% |
Other values (192) | 4191 |
Common
Value | Count | Frequency (%) |
(space) | 624 | |
) | 230 | 16.8% |
( | 230 | 16.8% |
4 | 81 | 5.9% |
3 | 65 | 4.7% |
2 | 52 | 3.8% |
7 | 28 | 2.0% |
6 | 28 | 2.0% |
1 | 23 | 1.7% |
5 | 11 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7427 | |
ASCII | 1372 | 15.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
역 | 1965 | |
대 | 234 | 3.2% |
구 | 193 | 2.6% |
신 | 152 | 2.0% |
입 | 127 | 1.7% |
사 | 123 | 1.7% |
동 | 120 | 1.6% |
산 | 113 | 1.5% |
목 | 106 | 1.4% |
교 | 103 | 1.4% |
Other values (192) | 4191 |
ASCII
Value | Count | Frequency (%) |
(space) | 624 | |
) | 230 | 16.8% |
( | 230 | 16.8% |
4 | 81 | 5.9% |
3 | 65 | 4.7% |
2 | 52 | 3.8% |
7 | 28 | 2.0% |
6 | 28 | 2.0% |
1 | 23 | 1.7% |
5 | 11 | 0.8% |
Unnamed: 4
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 208 |
---|---|
Missing (%) | 10.7% |
Memory size | 15.3 KiB |
Unnamed: 6
Text
MISSING
 
Distinct | 62 |
---|---|
Distinct (%) | 4.2% |
Missing | 467 |
Missing (%) | 24.0% |
Memory size | 15.3 KiB |
Value | Count | Frequency (%) |
화장품 | 203 | |
편의점 | 199 | |
의류 | 157 | 10.0% |
액세서리 | 111 | 7.0% |
제과 | 88 | 5.6% |
기타 | 86 | 5.5% |
의류(여성 | 83 | 5.3% |
복합상가 | 81 | 5.1% |
커피 | 77 | 4.9% |
공실 | 65 | 4.1% |
Other values (54) | 425 |
Most occurring characters
Value | Count | Frequency (%) |
의 | 462 | 8.5% |
화 | 260 | 4.8% |
류 | 260 | 4.8% |
품 | 217 | 4.0% |
장 | 204 | 3.7% |
편 | 201 | 3.7% |
점 | 200 | 3.7% |
과 | 149 | 2.7% |
제 | 144 | 2.6% |
리 | 138 | 2.5% |
Other values (128) | 3219 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4982 | |
Other Punctuation | 142 | 2.6% |
Close Punctuation | 113 | 2.1% |
Open Punctuation | 113 | 2.1% |
Space Separator | 102 | 1.9% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 462 | 9.3% |
화 | 260 | 5.2% |
류 | 260 | 5.2% |
품 | 217 | 4.4% |
장 | 204 | 4.1% |
편 | 201 | 4.0% |
점 | 200 | 4.0% |
과 | 149 | 3.0% |
제 | 144 | 2.9% |
리 | 138 | 2.8% |
Other values (121) | 2747 |
Other Punctuation
Value | Count | Frequency (%) |
, | 84 | |
. | 56 | |
@ | 2 | 1.4% |
Close Punctuation
Value | Count | Frequency (%) |
) | 113 |
Open Punctuation
Value | Count | Frequency (%) |
( | 113 |
Space Separator
Value | Count | Frequency (%) |
(space) | 102 | |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4982 | |
Common | 472 | 8.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 462 | 9.3% |
화 | 260 | 5.2% |
류 | 260 | 5.2% |
품 | 217 | 4.4% |
장 | 204 | 4.1% |
편 | 201 | 4.0% |
점 | 200 | 4.0% |
과 | 149 | 3.0% |
제 | 144 | 2.9% |
리 | 138 | 2.8% |
Other values (121) | 2747 |
Common
Value | Count | Frequency (%) |
) | 113 | |
( | 113 | |
(space) | 102 | |
, | 84 | |
. | 56 | |
@ | 2 | 0.4% |
+ | 2 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4982 | |
ASCII | 472 | 8.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
의 | 462 | 9.3% |
화 | 260 | 5.2% |
류 | 260 | 5.2% |
품 | 217 | 4.4% |
장 | 204 | 4.1% |
편 | 201 | 4.0% |
점 | 200 | 4.0% |
과 | 149 | 3.0% |
제 | 144 | 2.9% |
리 | 138 | 2.8% |
Other values (121) | 2747 |
ASCII
Value | Count | Frequency (%) |
) | 113 | |
( | 113 | |
(space) | 102 | |
, | 84 | |
. | 56 | |
@ | 2 | 0.4% |
+ | 2 | 0.4% |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 526 |
---|---|
Missing (%) | 27.1% |
Memory size | 15.3 KiB |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 526 |
---|---|
Missing (%) | 27.1% |
Memory size | 15.3 KiB |
Unnamed: 9
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 215 |
---|---|
Missing (%) | 11.1% |
Memory size | 15.3 KiB |
Unnamed: 10
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 3.722966 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 비고 |
---|---|
2nd row | 공실 |
3rd row | <NA> |
4th row | 명도거부 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 1490 | 76.7% |
공실 | 309 | 15.9% |
입찰공고중 | 82 | 4.2% |
명도거부 | 45 | 2.3% |
계약만료 | 15 | 0.8% |
비고 | 1 | 0.1% |
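The 57.7% imbalance reported for this variable can be reproduced from the counts above with an entropy-based score: one minus the Shannon entropy divided by its maximum for the same number of classes. This is a sketch under the assumption that the profiler uses this definition; the report itself does not state the formula, and `imbalance_score` is a hypothetical name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, near 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts for <NA>, 공실, 입찰공고중, 명도거부, 계약만료, 비고.
counts = [1490, 309, 82, 45, 15, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")  # prints 57.7%
```

The score is high mainly because <NA> accounts for 1490 of the 1942 values.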
Unnamed: 11
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 1 |
Missing (%) | 0.1% |
Memory size | 15.3 KiB |
Minimum | 2017-10-11 00:00:00 |
---|---|
Maximum | 2017-10-11 00:00:00 |
Correlations
Variable | Unnamed: 1 | Unnamed: 2 | Unnamed: 6 | Unnamed: 10 |
---|---|---|---|---|
Unnamed: 1 | 1.000 | 0.768 | 0.920 | 0.937 |
Unnamed: 2 | 0.768 | 1.000 | 0.860 | 0.765 |
Unnamed: 6 | 0.920 | 0.860 | 1.000 | 0.828 |
Unnamed: 10 | 0.937 | 0.765 | 0.828 | 1.000 |
Variable | Unnamed: 10 | Unnamed: 2 | Unnamed: 1 |
---|---|---|---|
Unnamed: 10 | 1.000 | 0.575 | 0.897 |
Unnamed: 2 | 0.575 | 1.000 | 0.460 |
Unnamed: 1 | 0.897 | 0.460 | 1.000 |
First rows
Index | 상가현황(2017.10월) | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | NO | 상가유형 | 호선 | 역명 | 상가번호 | 면적 | 업종 | 계약시작 | 계약종료 | 월 임대료(원) | 비고 | NaT |
1 | 1 | 공실 | 1호선 | 서울(1)역 | 150107 | 33 | <NA> | NaN | NaN | 6254183.333333 | 공실 | 2017-10-11 |
2 | 2 | 네트워크(브랜드) | 1호선 | 서울(1)역 | 150108 | 33 | 커피 | 2012-07-06 00:00:00 | 2017-08-15 00:00:00 | 5386912 | <NA> | 2017-10-11 |
3 | 3 | 네트워크(브랜드) | 1호선 | 서울(1)역 | 150109 | 12 | 커피 | 2012-05-21 00:00:00 | 2017-07-10 00:00:00 | 5178550 | 명도거부 | 2017-10-11 |
4 | 4 | 네트워크(브랜드) | 1호선 | 서울(1)역 | 150110 | 41.3 | 화장품 | 2015-09-05 00:00:00 | 2018-11-03 00:00:00 | 16970250 | <NA> | 2017-10-11 |
5 | 5 | 개별(일반) | 1호선 | 시청(1)역 | 151101 | 19.18 | 액세서리 | 2013-03-18 00:00:00 | 2018-03-17 00:00:00 | 4800900 | <NA> | 2017-10-11 |
6 | 6 | 공실 | 1호선 | 시청(1)역 | 151102 | 15.03 | <NA> | NaN | NaN | 3077250 | 공실 | 2017-10-11 |
7 | 7 | 개별(일반-무상) | 1호선 | 시청(1)역 | 151103 | 57.6 | 액세서리 | 2015-02-01 00:00:00 | 2020-01-31 00:00:00 | 무상 | <NA> | 2017-10-11 |
8 | 8 | 공실 | 1호선 | 시청(1)역 | 151104 | 25 | <NA> | NaN | NaN | 7266625 | 공실 | 2017-10-11 |
9 | 9 | 네트워크(브랜드) | 1호선 | 시청(1)역 | 151105 | 25 | 커피 | 2012-06-28 00:00:00 | 2017-08-07 00:00:00 | 6164224 | <NA> | 2017-10-11 |
Last rows
Index | 상가현황(2017.10월) | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
1932 | 1932 | 네트워크(브랜드) | 8호선 | 남한산성입구역 | 822-2005 | 17 | 음료.제과 | 2014-05-26 00:00:00 | 2019-06-25 00:00:00 | 2031500 | <NA> | 2017-10-11 |
1933 | 1933 | 공실 | 8호선 | 단대오거리역 | 823-1001 | 42.5 | <NA> | NaN | NaN | 3201000 | 공실 | 2017-10-11 |
1934 | 1934 | 네트워크(브랜드) | 8호선 | 단대오거리역 | 823-1002 | 36.78 | 화장품 | 2014-01-24 00:00:00 | 2019-03-25 00:00:00 | 15685314.383333 | <NA> | 2017-10-11 |
1935 | 1935 | 네트워크(브랜드) | 8호선 | 단대오거리역 | 823-2001 | 32.5 | 편의점 | 2016-07-25 00:00:00 | 2021-11.17 | 8712991 | <NA> | 2017-10-11 |
1936 | 1936 | 네트워크(브랜드) | 8호선 | 단대오거리역 | 823-2002 | 28.97 | 음료.제과 | 2014-10-06 00:00:00 | 2019-11-04 00:00:00 | 5225054.666667 | <NA> | 2017-10-11 |
1937 | 1937 | 네트워크(브랜드) | 8호선 | 단대오거리역 | 823-2003 | 54.03 | 음료.제과 | 2015-05-21 00:00:00 | 2020-06-20 00:00:00 | 9418666.666667 | <NA> | 2017-10-11 |
1938 | 1938 | 네트워크(브랜드) | 8호선 | 단대오거리역 | 823-2004 | 75.09 | 액세서리 | 2016-05-23 00:00:00 | 2021-06-22 00:00:00 | 4284455 | <NA> | 2017-10-11 |
1939 | 1939 | 네트워크(브랜드) | 8호선 | 신흥역 | 824-1001 | 40 | 편의점 | 2016-07-25 00:00:00 | 2021-11.17 | 6124682 | <NA> | 2017-10-11 |
1940 | 1940 | 네트워크(브랜드) | 8호선 | 수진역 | 825-1001 | 40 | 편의점 | 2016-07-25 00:00:00 | 2021-11.17 | 5575875 | <NA> | 2017-10-11 |
1941 | 1941 | 네트워크(브랜드) | 8호선 | 모란역 | 826-1001 | 50 | 편의점 | 2016-07-25 00:00:00 | 2021-11.17 | 5831070 | <NA> | 2017-10-11 |
Most frequently occurring
Index | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 6 | Unnamed: 10 | Unnamed: 11 | # duplicates |
---|---|---|---|---|---|---|---|
272 | 복합 | 5호선 | 오목교역 | <NA> | <NA> | 2017-10-11 | 67 |
296 | 입찰공고중 | 7호선 | 반포역 | <NA> | 입찰공고중 | 2017-10-11 | 40 |
273 | 복합 | 5호선 | 천호역 | 천호 복합상가 | <NA> | 2017-10-11 | 26 |
228 | 공실 | 7호선 | 청담역 | <NA> | 공실 | 2017-10-11 | 22 |
205 | 공실 | 5호선 | 오목교역 | <NA> | 공실 | 2017-10-11 | 15 |
278 | 복합 | 7호선 | 고속터미널역 | 고속터미널 복합상가 | <NA> | 2017-10-11 | 12 |
279 | 복합 | 7호선 | 노원역 | 테라피휴 복합상가 | <NA> | 2017-10-11 | 12 |
270 | 복합 | 5호선 | 공덕역 | 공덕,합정,영등포구청 스트리트몰 | <NA> | 2017-10-11 | 10 |
285 | 복합 | 8호선 | 잠실역 | 잠실스트리트몰 | <NA> | 2017-10-11 | 9 |
21 | GS | 6호선 | 석계역 | 공실 | <NA> | 2017-10-11 | 8 |
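The duplicates table lists only six columns, which suggests (an assumption; the report does not say so) that rows were compared on the supported columns only. Stores that differ only in number, area, contract dates or rent would then collapse into one pattern. The sketch below illustrates the effect with toy rows and a hypothetical helper, `duplicate_patterns`:

```python
from collections import Counter


def duplicate_patterns(rows, columns):
    """Return row patterns seen more than once when only `columns` are compared."""
    seen = Counter(tuple(row[c] for c in columns) for row in rows)
    return {key: n for key, n in seen.items() if n > 1}


# Toy rows (hypothetical): the first two match except for the store number.
rows = [
    {"상가유형": "복합", "호선": "5호선", "역명": "오목교역", "상가번호": "A"},
    {"상가유형": "복합", "호선": "5호선", "역명": "오목교역", "상가번호": "B"},
    {"상가유형": "공실", "호선": "7호선", "역명": "청담역", "상가번호": "C"},
]

dups = duplicate_patterns(rows, ["상가유형", "호선", "역명"])
print(dups)
```

Including the store number in the comparison leaves no duplicates in this toy data, so the 15.3% figure may overstate true duplication in the source file.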