Overview

Dataset statistics

Number of variables44
Number of observations30
Missing cells81
Missing cells (%)6.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.8 KiB
Average record size in memory369.4 B

Variable types

Text12
Categorical15
Unsupported2
Numeric11
Boolean4

Dataset

Description샘플 데이터
AuthorKB국민은행
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=1

Alerts

시도명 has constant value ""Constant
분양권여부 has constant value ""Constant
리모델링여부 has constant value ""Constant
주상복합여부 is highly imbalanced (78.9%)Imbalance
빌라연립여부 is highly imbalanced (78.9%)Imbalance
동사무소명 is highly imbalanced (59.9%)Imbalance
소방서명 is highly imbalanced (73.5%)Imbalance
은행명 is highly imbalanced (64.1%)Imbalance
스포츠센터명 is highly imbalanced (52.0%)Imbalance
좌석버스번호 is highly imbalanced (64.1%)Imbalance
구명 has 30 (100.0%) missing valuesMissing
리명 has 30 (100.0%) missing valuesMissing
건설업체명 has 5 (16.7%) missing valuesMissing
단지특성 has 16 (53.3%) missing valuesMissing
부동산물건uid has unique valuesUnique
번지 has unique valuesUnique
아파트명 has unique valuesUnique
총세대수 has unique valuesUnique
우편번호 has unique valuesUnique
단지좌표X값 has unique valuesUnique
단지좌표Y값 has unique valuesUnique
구명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
리명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
준공연월 has 1 (3.3%) zerosZeros
주차대수 has 9 (30.0%) zerosZeros

Reproduction

Analysis started2023-12-10 14:51:23.421986
Analysis finished2023-12-10 14:51:24.549432
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

부동산물건uid
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:24.690335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters270
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st rowKBA013357
2nd rowKBA010740
3rd rowKBA023933
4th rowKBA014516
5th rowKBA021376
ValueCountFrequency (%)
kba013357 1
 
3.3%
kba010740 1
 
3.3%
kba017483 1
 
3.3%
kba019167 1
 
3.3%
kba016209 1
 
3.3%
kba008372 1
 
3.3%
kba011630 1
 
3.3%
kba000361 1
 
3.3%
kba001221 1
 
3.3%
kba009210 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:51:25.052830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 62
23.0%
K 30
11.1%
B 30
11.1%
A 30
11.1%
1 28
10.4%
6 18
 
6.7%
3 15
 
5.6%
2 14
 
5.2%
7 11
 
4.1%
8 9
 
3.3%
Other values (3) 23
 
8.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 180
66.7%
Uppercase Letter 90
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 62
34.4%
1 28
15.6%
6 18
 
10.0%
3 15
 
8.3%
2 14
 
7.8%
7 11
 
6.1%
8 9
 
5.0%
4 8
 
4.4%
9 8
 
4.4%
5 7
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
K 30
33.3%
B 30
33.3%
A 30
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 180
66.7%
Latin 90
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 62
34.4%
1 28
15.6%
6 18
 
10.0%
3 15
 
8.3%
2 14
 
7.8%
7 11
 
6.1%
8 9
 
5.0%
4 8
 
4.4%
9 8
 
4.4%
5 7
 
3.9%
Latin
ValueCountFrequency (%)
K 30
33.3%
B 30
33.3%
A 30
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 270
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 62
23.0%
K 30
11.1%
B 30
11.1%
A 30
11.1%
1 28
10.4%
6 18
 
6.7%
3 15
 
5.6%
2 14
 
5.2%
7 11
 
4.1%
8 9
 
3.3%
Other values (3) 23
 
8.5%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
서울특별시
30 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 30
100.0%

Length

2023-12-10T23:51:25.192107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:25.291470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 30
100.0%
Distinct19
Distinct (%)63.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:25.431513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.1666667
Min length3

Characters and Unicode

Total characters95
Distinct characters29
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)36.7%

Sample

1st row도봉구
2nd row송파구
3rd row성동구
4th row동작구
5th row서초구
ValueCountFrequency (%)
송파구 4
13.3%
영등포구 3
 
10.0%
도봉구 2
 
6.7%
양천구 2
 
6.7%
강동구 2
 
6.7%
강남구 2
 
6.7%
마포구 2
 
6.7%
성동구 2
 
6.7%
중랑구 1
 
3.3%
동대문구 1
 
3.3%
Other values (9) 9
30.0%
2023-12-10T23:51:25.734255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
31.6%
6
 
6.3%
6
 
6.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (19) 28
29.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
31.6%
6
 
6.3%
6
 
6.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (19) 28
29.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
31.6%
6
 
6.3%
6
 
6.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (19) 28
29.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
31.6%
6
 
6.3%
6
 
6.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (19) 28
29.5%

구명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing30
Missing (%)100.0%
Memory size402.0 B
Distinct26
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:25.933416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0333333
Min length2

Characters and Unicode

Total characters91
Distinct characters44
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)73.3%

Sample

1st row공릉동
2nd row염창동
3rd row서초동
4th row한강로3가
5th row목동
ValueCountFrequency (%)
서초동 2
 
6.7%
신월동 2
 
6.7%
목동 2
 
6.7%
신정동 2
 
6.7%
삼성동 1
 
3.3%
공릉동 1
 
3.3%
신내동 1
 
3.3%
명일동 1
 
3.3%
상일동 1
 
3.3%
항동 1
 
3.3%
Other values (16) 16
53.3%
2023-12-10T23:51:26.256699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
30.8%
5
 
5.5%
3
 
3.3%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (34) 40
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89
97.8%
Decimal Number 2
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
31.5%
5
 
5.6%
3
 
3.4%
3
 
3.4%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (32) 38
42.7%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
3 1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 89
97.8%
Common 2
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
31.5%
5
 
5.6%
3
 
3.4%
3
 
3.4%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (32) 38
42.7%
Common
ValueCountFrequency (%)
1 1
50.0%
3 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 89
97.8%
ASCII 2
 
2.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
31.5%
5
 
5.6%
3
 
3.4%
3
 
3.4%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (32) 38
42.7%
ASCII
ValueCountFrequency (%)
1 1
50.0%
3 1
50.0%

리명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing30
Missing (%)100.0%
Memory size402.0 B

번지
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:26.506781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9.5
Mean length4.5666667
Min length2

Characters and Unicode

Total characters137
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row384
2nd row1029
3rd row1714
4th row877
5th row412-3
ValueCountFrequency (%)
384 1
 
3.0%
77 1
 
3.0%
974 1
 
3.0%
611 1
 
3.0%
463외1필지 1
 
3.0%
450 1
 
3.0%
970 1
 
3.0%
746 1
 
3.0%
454 1
 
3.0%
366 1
 
3.0%
Other values (23) 23
69.7%
2023-12-10T23:51:26.897141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 21
15.3%
4 16
11.7%
7 11
8.0%
8 10
 
7.3%
0 10
 
7.3%
2 10
 
7.3%
3 9
 
6.6%
6 9
 
6.6%
- 8
 
5.8%
5 8
 
5.8%
Other values (5) 25
18.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 108
78.8%
Other Letter 18
 
13.1%
Dash Punctuation 8
 
5.8%
Space Separator 3
 
2.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 21
19.4%
4 16
14.8%
7 11
10.2%
8 10
9.3%
0 10
9.3%
2 10
9.3%
3 9
8.3%
6 9
8.3%
5 8
 
7.4%
9 4
 
3.7%
Other Letter
ValueCountFrequency (%)
6
33.3%
6
33.3%
6
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 119
86.9%
Hangul 18
 
13.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 21
17.6%
4 16
13.4%
7 11
9.2%
8 10
8.4%
0 10
8.4%
2 10
8.4%
3 9
7.6%
6 9
7.6%
- 8
 
6.7%
5 8
 
6.7%
Other values (2) 7
 
5.9%
Hangul
ValueCountFrequency (%)
6
33.3%
6
33.3%
6
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119
86.9%
Hangul 18
 
13.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 21
17.6%
4 16
13.4%
7 11
9.2%
8 10
8.4%
0 10
8.4%
2 10
8.4%
3 9
7.6%
6 9
7.6%
- 8
 
6.7%
5 8
 
6.7%
Other values (2) 7
 
5.9%
Hangul
ValueCountFrequency (%)
6
33.3%
6
33.3%
6
33.3%

기타번지
Categorical

Distinct12
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
19 
746
 
1
106-133
 
1
1344,1344-1,1344-3
 
1
988-5,991-1
 
1
Other values (7)

Length

Max length23
Median length1
Mean length5.0333333
Min length1

Unique

Unique11 ?
Unique (%)36.7%

Sample

1st row
2nd row
3rd row
4th row
5th row746

Common Values

ValueCountFrequency (%)
19
63.3%
746 1
 
3.3%
106-133 1
 
3.3%
1344,1344-1,1344-3 1
 
3.3%
988-5,991-1 1
 
3.3%
1-15, 1-16,1-17,1-18 1
 
3.3%
576-1 1
 
3.3%
776-3,776-4 1
 
3.3%
(종전27-1) 1
 
3.3%
401 1
 
3.3%
Other values (2) 2
 
6.7%

Length

2023-12-10T23:51:27.036273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
746 1
 
7.1%
106-133 1
 
7.1%
1344,1344-1,1344-3 1
 
7.1%
988-5,991-1 1
 
7.1%
1-15 1
 
7.1%
1-16,1-17,1-18 1
 
7.1%
576-1 1
 
7.1%
776-3,776-4 1
 
7.1%
종전27-1 1
 
7.1%
401 1
 
7.1%
Other values (4) 4
28.6%

아파트명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:27.226845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length10
Mean length7.0666667
Min length2

Characters and Unicode

Total characters212
Distinct characters70
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row브*운*톤*도*
2nd row개*우*2*
3rd row창*쌍*(*차*
4th row경*
5th row신*포*한*2*차*
ValueCountFrequency (%)
브*운*톤*도 1
 
3.3%
개*우*2 1
 
3.3%
삼*동*성*니 1
 
3.3%
보*매*오*하*채 1
 
3.3%
한*(*차 1
 
3.3%
수*산*원*자 1
 
3.3%
현*2 1
 
3.3%
송*센*레 1
 
3.3%
1
 
3.3%
g*강*자 1
 
3.3%
Other values (20) 20
66.7%
2023-12-10T23:51:27.568555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 106
50.0%
2 4
 
1.9%
4
 
1.9%
( 4
 
1.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
3
 
1.4%
2
 
0.9%
2
 
0.9%
Other values (60) 77
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 107
50.5%
Other Letter 92
43.4%
Decimal Number 6
 
2.8%
Open Punctuation 4
 
1.9%
Close Punctuation 2
 
0.9%
Uppercase Letter 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
4.3%
4
 
4.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (52) 65
70.7%
Decimal Number
ValueCountFrequency (%)
2 4
66.7%
1 1
 
16.7%
4 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
* 106
99.1%
. 1
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 119
56.1%
Hangul 92
43.4%
Latin 1
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
4.3%
4
 
4.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (52) 65
70.7%
Common
ValueCountFrequency (%)
* 106
89.1%
2 4
 
3.4%
( 4
 
3.4%
) 2
 
1.7%
1 1
 
0.8%
. 1
 
0.8%
4 1
 
0.8%
Latin
ValueCountFrequency (%)
G 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120
56.6%
Hangul 92
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 106
88.3%
2 4
 
3.3%
( 4
 
3.3%
) 2
 
1.7%
1 1
 
0.8%
G 1
 
0.8%
. 1
 
0.8%
4 1
 
0.8%
Hangul
ValueCountFrequency (%)
4
 
4.3%
4
 
4.3%
3
 
3.3%
3
 
3.3%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (52) 65
70.7%

총세대수
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean463.7
Minimum60
Maximum2282
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:27.715736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60
5-th percentile65.6
Q1156
median305.5
Q3568.25
95-th percentile1284.85
Maximum2282
Range2222
Interquartile range (IQR)412.25

Descriptive statistics

Standard deviation489.87712
Coefficient of variation (CV)1.0564527
Kurtosis5.6568456
Mean463.7
Median Absolute Deviation (MAD)193.5
Skewness2.1860661
Sum13911
Variance239979.6
MonotonicityNot monotonic
2023-12-10T23:51:27.853626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
270 1
 
3.3%
312 1
 
3.3%
162 1
 
3.3%
298 1
 
3.3%
80 1
 
3.3%
154 1
 
3.3%
362 1
 
3.3%
62 1
 
3.3%
1297 1
 
3.3%
416 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
60 1
3.3%
62 1
3.3%
70 1
3.3%
80 1
3.3%
88 1
3.3%
98 1
3.3%
126 1
3.3%
154 1
3.3%
162 1
3.3%
192 1
3.3%
ValueCountFrequency (%)
2282 1
3.3%
1297 1
3.3%
1270 1
3.3%
1070 1
3.3%
893 1
3.3%
845 1
3.3%
625 1
3.3%
583 1
3.3%
524 1
3.3%
416 1
3.3%

총동수
Real number (ℝ)

Distinct12
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5
Minimum1
Maximum18
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:27.955618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2.5
Q36
95-th percentile12.55
Maximum18
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.4236823
Coefficient of variation (CV)0.98304052
Kurtosis1.7814283
Mean4.5
Median Absolute Deviation (MAD)1.5
Skewness1.4903479
Sum135
Variance19.568966
MonotonicityNot monotonic
2023-12-10T23:51:28.057281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 10
33.3%
2 5
16.7%
4 3
 
10.0%
6 3
 
10.0%
3 2
 
6.7%
13 1
 
3.3%
12 1
 
3.3%
11 1
 
3.3%
8 1
 
3.3%
10 1
 
3.3%
Other values (2) 2
 
6.7%
ValueCountFrequency (%)
1 10
33.3%
2 5
16.7%
3 2
 
6.7%
4 3
 
10.0%
6 3
 
10.0%
7 1
 
3.3%
8 1
 
3.3%
10 1
 
3.3%
11 1
 
3.3%
12 1
 
3.3%
ValueCountFrequency (%)
18 1
 
3.3%
13 1
 
3.3%
12 1
 
3.3%
11 1
 
3.3%
10 1
 
3.3%
8 1
 
3.3%
7 1
 
3.3%
6 3
10.0%
4 3
10.0%
3 2
6.7%

단지최고층
Real number (ℝ)

Distinct14
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.566667
Minimum6
Maximum25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:28.183206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile10.9
Q115
median16.5
Q321.75
95-th percentile25
Maximum25
Range19
Interquartile range (IQR)6.75

Descriptive statistics

Standard deviation4.889703
Coefficient of variation (CV)0.27835121
Kurtosis-0.36280611
Mean17.566667
Median Absolute Deviation (MAD)2.5
Skewness-0.040824468
Sum527
Variance23.909195
MonotonicityNot monotonic
2023-12-10T23:51:28.303253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
15 8
26.7%
25 4
13.3%
19 3
 
10.0%
18 2
 
6.7%
22 2
 
6.7%
24 2
 
6.7%
13 2
 
6.7%
14 1
 
3.3%
12 1
 
3.3%
16 1
 
3.3%
Other values (4) 4
13.3%
ValueCountFrequency (%)
6 1
 
3.3%
10 1
 
3.3%
12 1
 
3.3%
13 2
 
6.7%
14 1
 
3.3%
15 8
26.7%
16 1
 
3.3%
17 1
 
3.3%
18 2
 
6.7%
19 3
 
10.0%
ValueCountFrequency (%)
25 4
13.3%
24 2
 
6.7%
22 2
 
6.7%
21 1
 
3.3%
19 3
 
10.0%
18 2
 
6.7%
17 1
 
3.3%
16 1
 
3.3%
15 8
26.7%
14 1
 
3.3%

단지최저층
Real number (ℝ)

Distinct14
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.8
Minimum4
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:28.418263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile4.45
Q16.25
median11.5
Q315
95-th percentile21.55
Maximum22
Range18
Interquartile range (IQR)8.75

Descriptive statistics

Standard deviation5.7739009
Coefficient of variation (CV)0.48931363
Kurtosis-1.0424314
Mean11.8
Median Absolute Deviation (MAD)4.5
Skewness0.28873371
Sum354
Variance33.337931
MonotonicityNot monotonic
2023-12-10T23:51:28.542464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
5 5
16.7%
15 4
13.3%
9 3
10.0%
10 3
10.0%
13 3
10.0%
22 2
 
6.7%
21 2
 
6.7%
4 2
 
6.7%
6 1
 
3.3%
17 1
 
3.3%
Other values (4) 4
13.3%
ValueCountFrequency (%)
4 2
 
6.7%
5 5
16.7%
6 1
 
3.3%
7 1
 
3.3%
9 3
10.0%
10 3
10.0%
13 3
10.0%
14 1
 
3.3%
15 4
13.3%
16 1
 
3.3%
ValueCountFrequency (%)
22 2
6.7%
21 2
6.7%
19 1
 
3.3%
17 1
 
3.3%
16 1
 
3.3%
15 4
13.3%
14 1
 
3.3%
13 3
10.0%
10 3
10.0%
9 3
10.0%

준공연월
Real number (ℝ)

ZEROS 

Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean193234.63
Minimum0
Maximum201406
Zeros1
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:28.665921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile197944.35
Q1199604.5
median200011
Q3200404.5
95-th percentile201109.75
Maximum201406
Range201406
Interquartile range (IQR)800

Descriptive statistics

Standard deviation36506.168
Coefficient of variation (CV)0.18892145
Kurtosis29.963715
Mean193234.63
Median Absolute Deviation (MAD)400.5
Skewness-5.4724296
Sum5797039
Variance1.3327003 × 109
MonotonicityNot monotonic
2023-12-10T23:51:28.796739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
199912 2
 
6.7%
201112 1
 
3.3%
200403 1
 
3.3%
200109 1
 
3.3%
198809 1
 
3.3%
200411 1
 
3.3%
198607 1
 
3.3%
199712 1
 
3.3%
200405 1
 
3.3%
199905 1
 
3.3%
Other values (19) 19
63.3%
ValueCountFrequency (%)
0 1
3.3%
197403 1
3.3%
198606 1
3.3%
198607 1
3.3%
198806 1
3.3%
198808 1
3.3%
198809 1
3.3%
199604 1
3.3%
199606 1
3.3%
199708 1
3.3%
ValueCountFrequency (%)
201406 1
3.3%
201112 1
3.3%
201107 1
3.3%
200512 1
3.3%
200506 1
3.3%
200412 1
3.3%
200411 1
3.3%
200405 1
3.3%
200403 1
3.3%
200312 1
3.3%

건설업체명
Text

MISSING 

Distinct22
Distinct (%)88.0%
Missing5
Missing (%)16.7%
Memory size372.0 B
2023-12-10T23:51:28.983984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.24
Min length2

Characters and Unicode

Total characters106
Distinct characters30
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)76.0%

Sample

1st row코*롱*설*
2nd row삼*건*
3rd row대*건*
4th row월*건*
5th row동*건*
ValueCountFrequency (%)
현*산*개 2
 
8.0%
대*건 2
 
8.0%
삼*건 2
 
8.0%
영*산 1
 
4.0%
코*롱*설 1
 
4.0%
1
 
4.0%
삼*물 1
 
4.0%
극*건 1
 
4.0%
동*고*건 1
 
4.0%
s*공 1
 
4.0%
Other values (12) 12
48.0%
2023-12-10T23:51:29.365508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 53
50.0%
15
 
14.2%
4
 
3.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
1
 
0.9%
1
 
0.9%
Other values (20) 20
 
18.9%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 53
50.0%
Other Letter 52
49.1%
Uppercase Letter 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
28.8%
4
 
7.7%
3
 
5.8%
3
 
5.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
Other values (18) 18
34.6%
Other Punctuation
ValueCountFrequency (%)
* 53
100.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 53
50.0%
Hangul 52
49.1%
Latin 1
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
28.8%
4
 
7.7%
3
 
5.8%
3
 
5.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
Other values (18) 18
34.6%
Common
ValueCountFrequency (%)
* 53
100.0%
Latin
ValueCountFrequency (%)
S 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 54
50.9%
Hangul 52
49.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 53
98.1%
S 1
 
1.9%
Hangul
ValueCountFrequency (%)
15
28.8%
4
 
7.7%
3
 
5.8%
3
 
5.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
1
 
1.9%
Other values (18) 18
34.6%

난방방식
Categorical

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
개별
24 
중앙
지역

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중앙
2nd row개별
3rd row지역
4th row개별
5th row개별

Common Values

ValueCountFrequency (%)
개별 24
80.0%
중앙 3
 
10.0%
지역 3
 
10.0%

Length

2023-12-10T23:51:29.520201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:29.628615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개별 24
80.0%
중앙 3
 
10.0%
지역 3
 
10.0%

난방연료
Categorical

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
도시가스
23 
열병합

Length

Max length4
Median length4
Mean length3.7666667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도시가스
2nd row도시가스
3rd row도시가스
4th row도시가스
5th row도시가스

Common Values

ValueCountFrequency (%)
도시가스 23
76.7%
열병합 7
 
23.3%

Length

2023-12-10T23:51:29.735003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:29.836071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도시가스 23
76.7%
열병합 7
 
23.3%

주차대수
Real number (ℝ)

ZEROS 

Distinct19
Distinct (%)63.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean265.73333
Minimum0
Maximum1110
Zeros9
Zeros (%)30.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:29.938633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median208
Q3323.75
95-th percentile1005.15
Maximum1110
Range1110
Interquartile range (IQR)323.75

Descriptive statistics

Standard deviation312.32454
Coefficient of variation (CV)1.1753307
Kurtosis2.1261263
Mean265.73333
Median Absolute Deviation (MAD)193.5
Skewness1.5749984
Sum7972
Variance97546.616
MonotonicityNot monotonic
2023-12-10T23:51:30.075350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 9
30.0%
84 2
 
6.7%
300 2
 
6.7%
1110 2
 
6.7%
459 1
 
3.3%
54 1
 
3.3%
624 1
 
3.3%
877 1
 
3.3%
468 1
 
3.3%
107 1
 
3.3%
Other values (9) 9
30.0%
ValueCountFrequency (%)
0 9
30.0%
54 1
 
3.3%
84 2
 
6.7%
107 1
 
3.3%
137 1
 
3.3%
200 1
 
3.3%
216 1
 
3.3%
220 1
 
3.3%
276 1
 
3.3%
300 2
 
6.7%
ValueCountFrequency (%)
1110 2
6.7%
877 1
3.3%
624 1
3.3%
468 1
3.3%
459 1
3.3%
387 1
3.3%
325 1
3.3%
320 1
3.3%
314 1
3.3%
300 2
6.7%
Distinct22
Distinct (%)73.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:30.272955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.2666667
Min length1

Characters and Unicode

Total characters248
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)70.0%

Sample

1st row02-**7-**54
2nd row02-**27-**63
3rd row02-**2-**09
4th row02-**9-**79
5th row02-**5-**76
ValueCountFrequency (%)
02-**7-**54 1
 
4.8%
02-**63-**45 1
 
4.8%
02-**5-**52 1
 
4.8%
02-**6-**71 1
 
4.8%
02-**6-**16 1
 
4.8%
02-**93-**68 1
 
4.8%
02-**5-**95 1
 
4.8%
02-**6-**77 1
 
4.8%
02-**6-**63 1
 
4.8%
02-**71-**33 1
 
4.8%
Other values (11) 11
52.4%
2023-12-10T23:51:30.627033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 84
33.9%
- 42
16.9%
2 27
 
10.9%
0 25
 
10.1%
7 13
 
5.2%
6 11
 
4.4%
3 10
 
4.0%
9
 
3.6%
5 7
 
2.8%
9 7
 
2.8%
Other values (3) 13
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 113
45.6%
Other Punctuation 84
33.9%
Dash Punctuation 42
 
16.9%
Space Separator 9
 
3.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 27
23.9%
0 25
22.1%
7 13
11.5%
6 11
9.7%
3 10
 
8.8%
5 7
 
6.2%
9 7
 
6.2%
1 7
 
6.2%
4 5
 
4.4%
8 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
* 84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 248
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
* 84
33.9%
- 42
16.9%
2 27
 
10.9%
0 25
 
10.1%
7 13
 
5.2%
6 11
 
4.4%
3 10
 
4.0%
9
 
3.6%
5 7
 
2.8%
9 7
 
2.8%
Other values (3) 13
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 248
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 84
33.9%
- 42
16.9%
2 27
 
10.9%
0 25
 
10.1%
7 13
 
5.2%
6 11
 
4.4%
3 10
 
4.0%
9
 
3.6%
5 7
 
2.8%
9 7
 
2.8%
Other values (3) 13
 
5.2%

분양권여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
30 
ValueCountFrequency (%)
False 30
100.0%
2023-12-10T23:51:30.743316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

입주년월
Real number (ℝ)

Distinct29
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean199587.87
Minimum197010
Maximum201607
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:30.837442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum197010
5-th percentile197109.9
Q1198935.25
median199906
Q3200280
95-th percentile201067.8
Maximum201607
Range4597
Interquartile range (IQR)1344.75

Descriptive statistics

Standard deviation1158.1543
Coefficient of variation (CV)0.0058027288
Kurtosis0.4060303
Mean199587.87
Median Absolute Deviation (MAD)553.5
Skewness-0.84037637
Sum5987636
Variance1341321.3
MonotonicityNot monotonic
2023-12-10T23:51:30.968211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
199708 2
 
6.7%
200103 1
 
3.3%
197109 1
 
3.3%
199011 1
 
3.3%
199206 1
 
3.3%
199610 1
 
3.3%
200905 1
 
3.3%
198407 1
 
3.3%
197111 1
 
3.3%
198906 1
 
3.3%
Other values (19) 19
63.3%
ValueCountFrequency (%)
197010 1
3.3%
197109 1
3.3%
197111 1
3.3%
198012 1
3.3%
198407 1
3.3%
198810 1
3.3%
198906 1
3.3%
198910 1
3.3%
199011 1
3.3%
199206 1
3.3%
ValueCountFrequency (%)
201607 1
3.3%
201201 1
3.3%
200905 1
3.3%
200704 1
3.3%
200511 1
3.3%
200408 1
3.3%
200311 1
3.3%
200303 1
3.3%
200211 1
3.3%
200205 1
3.3%

단지특성
Text

MISSING 

Distinct14
Distinct (%)100.0%
Missing16
Missing (%)53.3%
Memory size372.0 B
2023-12-10T23:51:31.142190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length57
Mean length47.785714
Min length15

Characters and Unicode

Total characters669
Distinct characters101
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)100.0%

Sample

1st row2** **역**보**분**최** **(**고**상**,**덕**)** **산**남**조**,**심**전**
2nd row1**%**주**,**선**호**1** **철**망**
3rd row신**와**인**인** **대**향** **평**로**성**쾌** **단**동**리**인**도**접** **
4th row조**고**차** **움**
5th row남**한**망**최**복**파**건**
ValueCountFrequency (%)
11
 
19.6%
8 2
 
3.6%
2 1
 
1.8%
h 1
 
1.8%
도**역**보 1
 
1.8%
1
 
1.8%
1
 
1.8%
용**구**위**획**정**업**관**주**건**사**정**업 1
 
1.8%
0**8**3**)**류 1
 
1.8%
총**수**6**대**분 1
 
1.8%
Other values (35) 35
62.5%
2023-12-10T23:51:31.462638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 447
66.8%
42
 
6.3%
, 8
 
1.2%
1 6
 
0.9%
5
 
0.7%
5
 
0.7%
8 5
 
0.7%
0 5
 
0.7%
4
 
0.6%
4
 
0.6%
Other values (91) 138
 
20.6%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 461
68.9%
Other Letter 131
 
19.6%
Space Separator 42
 
6.3%
Decimal Number 25
 
3.7%
Open Punctuation 4
 
0.6%
Close Punctuation 3
 
0.4%
Dash Punctuation 1
 
0.1%
Uppercase Letter 1
 
0.1%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
3.8%
5
 
3.8%
4
 
3.1%
4
 
3.1%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (71) 93
71.0%
Decimal Number
ValueCountFrequency (%)
1 6
24.0%
8 5
20.0%
0 5
20.0%
2 2
 
8.0%
5 2
 
8.0%
6 2
 
8.0%
3 2
 
8.0%
4 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
* 447
97.0%
, 8
 
1.7%
. 3
 
0.7%
: 1
 
0.2%
% 1
 
0.2%
! 1
 
0.2%
Space Separator
ValueCountFrequency (%)
42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
H 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 537
80.3%
Hangul 131
 
19.6%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
3.8%
5
 
3.8%
4
 
3.1%
4
 
3.1%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (71) 93
71.0%
Common
ValueCountFrequency (%)
* 447
83.2%
42
 
7.8%
, 8
 
1.5%
1 6
 
1.1%
8 5
 
0.9%
0 5
 
0.9%
( 4
 
0.7%
. 3
 
0.6%
) 3
 
0.6%
2 2
 
0.4%
Other values (9) 12
 
2.2%
Latin
ValueCountFrequency (%)
H 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 538
80.4%
Hangul 131
 
19.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 447
83.1%
42
 
7.8%
, 8
 
1.5%
1 6
 
1.1%
8 5
 
0.9%
0 5
 
0.9%
( 4
 
0.7%
. 3
 
0.6%
) 3
 
0.6%
2 2
 
0.4%
Other values (10) 13
 
2.4%
Hangul
ValueCountFrequency (%)
5
 
3.8%
5
 
3.8%
4
 
3.1%
4
 
3.1%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (71) 93
71.0%

우편번호
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5544.0667
Minimum1647
Maximum8792
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:31.593597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1647
5-th percentile1923.1
Q13658.5
median5755
Q37553.75
95-th percentile8700.75
Maximum8792
Range7145
Interquartile range (IQR)3895.25

Descriptive statistics

Standard deviation2335.3308
Coefficient of variation (CV)0.42123065
Kurtosis-1.3369643
Mean5544.0667
Median Absolute Deviation (MAD)2086
Skewness-0.17770917
Sum166322
Variance5453769.9
MonotonicityNot monotonic
2023-12-10T23:51:31.730443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
5350 1
 
3.3%
4425 1
 
3.3%
5068 1
 
3.3%
8775 1
 
3.3%
8327 1
 
3.3%
4187 1
 
3.3%
7535 1
 
3.3%
5005 1
 
3.3%
2609 1
 
3.3%
8610 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1647 1
3.3%
1816 1
3.3%
2054 1
3.3%
2489 1
3.3%
2609 1
3.3%
2710 1
3.3%
3438 1
3.3%
3648 1
3.3%
3690 1
3.3%
4187 1
3.3%
ValueCountFrequency (%)
8792 1
3.3%
8775 1
3.3%
8610 1
3.3%
8393 1
3.3%
8327 1
3.3%
7983 1
3.3%
7962 1
3.3%
7560 1
3.3%
7535 1
3.3%
7376 1
3.3%

주상복합여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
29 
True
 
1
ValueCountFrequency (%)
False 29
96.7%
True 1
 
3.3%
2023-12-10T23:51:31.850370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

빌라연립여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
29 
True
 
1
ValueCountFrequency (%)
False 29
96.7%
True 1
 
3.3%
2023-12-10T23:51:31.937314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

구청명
Categorical

Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
20 
마포
 
2
강서
 
2
송파
 
1
강남
 
1
Other values (4)

Length

Max length3
Median length1
Mean length1.3666667
Min length1

Unique

Unique6 ?
Unique (%)20.0%

Sample

1st row송파
2nd row
3rd row마포
4th row
5th row강서

Common Values

ValueCountFrequency (%)
20
66.7%
마포 2
 
6.7%
강서 2
 
6.7%
송파 1
 
3.3%
강남 1
 
3.3%
도봉 1
 
3.3%
서대문 1
 
3.3%
서초 1
 
3.3%
관악 1
 
3.3%

Length

2023-12-10T23:51:32.063654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:32.208013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
마포 2
20.0%
강서 2
20.0%
송파 1
10.0%
강남 1
10.0%
도봉 1
10.0%
서대문 1
10.0%
서초 1
10.0%
관악 1
10.0%

동사무소명
Categorical

IMBALANCE 

Distinct6
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
25 
오금동
 
1
개포1
 
1
상도2동
 
1
송파2
 
1

Length

Max length4
Median length1
Mean length1.4
Min length1

Unique

Unique5 ?
Unique (%)16.7%

Sample

1st row
2nd row
3rd row오금동
4th row개포1
5th row

Common Values

ValueCountFrequency (%)
25
83.3%
오금동 1
 
3.3%
개포1 1
 
3.3%
상도2동 1
 
3.3%
송파2 1
 
3.3%
가양1동 1
 
3.3%

Length

2023-12-10T23:51:32.342545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:32.450838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
오금동 1
20.0%
개포1 1
20.0%
상도2동 1
20.0%
송파2 1
20.0%
가양1동 1
20.0%

파출소명
Categorical

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
22 
청량리
 
2
노량진
 
2
강남
 
1
방배경찰서
 
1
Other values (2)
 
2

Length

Max length9
Median length1
Mean length1.7666667
Min length1

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st row
2nd row청량리
3rd row
4th row노량진
5th row

Common Values

ValueCountFrequency (%)
22
73.3%
청량리 2
 
6.7%
노량진 2
 
6.7%
강남 1
 
3.3%
방배경찰서 1
 
3.3%
서대문 1
 
3.3%
한양파출소, 강남 1
 
3.3%

Length

2023-12-10T23:51:32.848932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:32.959000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청량리 2
22.2%
노량진 2
22.2%
강남 2
22.2%
방배경찰서 1
11.1%
서대문 1
11.1%
한양파출소 1
11.1%

소방서명
Categorical

IMBALANCE 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
28 
백운파출소
 
1
수궁119
 
1

Length

Max length5
Median length1
Mean length1.2666667
Min length1

Unique

Unique2 ?
Unique (%)6.7%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
28
93.3%
백운파출소 1
 
3.3%
수궁119 1
 
3.3%

Length

2023-12-10T23:51:33.086892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:33.220445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
백운파출소 1
50.0%
수궁119 1
50.0%

은행명
Categorical

IMBALANCE 

Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
26 
새마을금고.신한은행
 
1
우리
 
1
기업은행, 한빛은행
 
1
서울은행, 조흥은행
 
1

Length

Max length10
Median length1
Mean length1.9333333
Min length1

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
26
86.7%
새마을금고.신한은행 1
 
3.3%
우리 1
 
3.3%
기업은행, 한빛은행 1
 
3.3%
서울은행, 조흥은행 1
 
3.3%

Length

2023-12-10T23:51:33.314450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:33.416589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
새마을금고.신한은행 1
16.7%
우리 1
16.7%
기업은행 1
16.7%
한빛은행 1
16.7%
서울은행 1
16.7%
조흥은행 1
16.7%

병원명
Categorical

Distinct12
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
19 
오산당, 대항병원
 
1
이대목동병원,홍익병
 
1
금강메디칼, 원자력/백/을지
 
1
송영한치과, 연세정형정형외과, 원문섭피부과, 이소아과, 김오경내과, 임이비인후과, 강동성심
 
1
Other values (7)

Length

Max length50
Median length1
Mean length7.3666667
Min length1

Unique

Unique11 ?
Unique (%)36.7%

Sample

1st row오산당, 대항병원
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
19
63.3%
오산당, 대항병원 1
 
3.3%
이대목동병원,홍익병 1
 
3.3%
금강메디칼, 원자력/백/을지 1
 
3.3%
송영한치과, 연세정형정형외과, 원문섭피부과, 이소아과, 김오경내과, 임이비인후과, 강동성심 1
 
3.3%
황기은내과, 명제근치과, 국립경찰병원 1
 
3.3%
목동이대 1
 
3.3%
김현우내과, 그린치과, 서울성심 1
 
3.3%
장도훈치과, 백가정내과, 연세정형외과, 조소아과, 권이비인후과, 미즈메디 1
 
3.3%
한림대학강동성심, 현대아산 1
 
3.3%
Other values (2) 2
 
6.7%

Length

2023-12-10T23:51:33.540412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
오산당 1
 
3.4%
목동이대 1
 
3.4%
원자력병원,을지병원,태릉성심병원 1
 
3.4%
현대아산 1
 
3.4%
한림대학강동성심 1
 
3.4%
미즈메디 1
 
3.4%
권이비인후과 1
 
3.4%
조소아과 1
 
3.4%
연세정형외과 1
 
3.4%
백가정내과 1
 
3.4%
Other values (19) 19
65.5%

스포츠센터명
Categorical

IMBALANCE 

Distinct8
Distinct (%)26.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
23 
나산,보라매
 
1
싸파리스포렉스, 올림픽, 단지내
 
1
롯데백화점스포츠센
 
1
삼호스포츠센타, 우성스포츠센타
 
1
Other values (3)

Length

Max length22
Median length1
Mean length3.6
Min length1

Unique

Unique7 ?
Unique (%)23.3%

Sample

1st row
2nd row나산,보라매
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
23
76.7%
나산,보라매 1
 
3.3%
싸파리스포렉스, 올림픽, 단지내 1
 
3.3%
롯데백화점스포츠센 1
 
3.3%
삼호스포츠센타, 우성스포츠센타 1
 
3.3%
연세스포츠센타. 1
 
3.3%
건영스포츠센타 1
 
3.3%
대성 스포츠센터 성북 종합 레포츠타운 1
 
3.3%

Length

2023-12-10T23:51:33.662070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:33.762416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
나산,보라매 1
 
7.1%
싸파리스포렉스 1
 
7.1%
올림픽 1
 
7.1%
단지내 1
 
7.1%
롯데백화점스포츠센 1
 
7.1%
삼호스포츠센타 1
 
7.1%
우성스포츠센타 1
 
7.1%
연세스포츠센타 1
 
7.1%
건영스포츠센타 1
 
7.1%
대성 1
 
7.1%
Other values (4) 4
28.6%
Distinct17
Distinct (%)56.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:33.921743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length5.6
Min length1

Characters and Unicode

Total characters168
Distinct characters59
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)50.0%

Sample

1st row
2nd row이마트
3rd row까르푸, 그랜드마트
4th row
5th row
ValueCountFrequency (%)
이마트 4
13.8%
롯데 4
13.8%
신세계 3
 
10.3%
킴스클럽 2
 
6.9%
현대 2
 
6.9%
그랜드마트 1
 
3.4%
빅마켓.이마트.홈플러스.하나로클럽 1
 
3.4%
영동시장 1
 
3.4%
빅마켓 1
 
3.4%
까르푸 1
 
3.4%
Other values (9) 9
31.0%
2023-12-10T23:51:34.204416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
14.9%
, 17
 
10.1%
11
 
6.5%
8
 
4.8%
7
 
4.2%
7
 
4.2%
6
 
3.6%
6
 
3.6%
5
 
3.0%
5
 
3.0%
Other values (49) 71
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 123
73.2%
Space Separator 25
 
14.9%
Other Punctuation 20
 
11.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
8.9%
8
 
6.5%
7
 
5.7%
7
 
5.7%
6
 
4.9%
6
 
4.9%
5
 
4.1%
5
 
4.1%
5
 
4.1%
5
 
4.1%
Other values (46) 58
47.2%
Other Punctuation
ValueCountFrequency (%)
, 17
85.0%
. 3
 
15.0%
Space Separator
ValueCountFrequency (%)
25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 123
73.2%
Common 45
 
26.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
8.9%
8
 
6.5%
7
 
5.7%
7
 
5.7%
6
 
4.9%
6
 
4.9%
5
 
4.1%
5
 
4.1%
5
 
4.1%
5
 
4.1%
Other values (46) 58
47.2%
Common
ValueCountFrequency (%)
25
55.6%
, 17
37.8%
. 3
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 123
73.2%
ASCII 45
 
26.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25
55.6%
, 17
37.8%
. 3
 
6.7%
Hangul
ValueCountFrequency (%)
11
 
8.9%
8
 
6.5%
7
 
5.7%
7
 
5.7%
6
 
4.9%
6
 
4.9%
5
 
4.1%
5
 
4.1%
5
 
4.1%
5
 
4.1%
Other values (46) 58
47.2%
Distinct18
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:34.420302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length29
Mean length9.3333333
Min length1

Characters and Unicode

Total characters280
Distinct characters69
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)56.7%

Sample

1st row양전, 개원, 경기여자
2nd row계남, 은정 ,갈산, 목일, 봉영여중, 신목, 영천여고
3rd row
4th row삼성초등, 신림중/여중, 삼성고
5th row도곡초,도성초, 대명중,진선여중, 진선여고,휘문중고
ValueCountFrequency (%)
잠신 3
 
5.6%
중동 2
 
3.7%
당서 2
 
3.7%
재현 2
 
3.7%
방화 2
 
3.7%
둔춘고,강동고,한신고 1
 
1.9%
은정 1
 
1.9%
갈산 1
 
1.9%
목일 1
 
1.9%
봉영여중 1
 
1.9%
Other values (38) 38
70.4%
2023-12-10T23:51:34.801775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
17.9%
, 43
 
15.4%
17
 
6.1%
12
 
4.3%
11
 
3.9%
7
 
2.5%
7
 
2.5%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (59) 118
42.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 183
65.4%
Space Separator 50
 
17.9%
Other Punctuation 47
 
16.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
9.3%
12
 
6.6%
11
 
6.0%
7
 
3.8%
7
 
3.8%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.2%
Other values (55) 105
57.4%
Other Punctuation
ValueCountFrequency (%)
, 43
91.5%
. 3
 
6.4%
/ 1
 
2.1%
Space Separator
ValueCountFrequency (%)
50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 183
65.4%
Common 97
34.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
9.3%
12
 
6.6%
11
 
6.0%
7
 
3.8%
7
 
3.8%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.2%
Other values (55) 105
57.4%
Common
ValueCountFrequency (%)
50
51.5%
, 43
44.3%
. 3
 
3.1%
/ 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 183
65.4%
ASCII 97
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50
51.5%
, 43
44.3%
. 3
 
3.1%
/ 1
 
1.0%
Hangul
ValueCountFrequency (%)
17
 
9.3%
12
 
6.6%
11
 
6.0%
7
 
3.8%
7
 
3.8%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.2%
Other values (55) 105
57.4%

공원명
Categorical

Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
22 
단지공원, 근린공원
 
1
용산가족
 
1
까치산근린
 
1
뚝섬체육공원 어린이대공원
 
1
Other values (4)

Length

Max length13
Median length1
Mean length2.7333333
Min length1

Unique

Unique8 ?
Unique (%)26.7%

Sample

1st row
2nd row
3rd row
4th row단지공원, 근린공원
5th row

Common Values

ValueCountFrequency (%)
22
73.3%
단지공원, 근린공원 1
 
3.3%
용산가족 1
 
3.3%
까치산근린 1
 
3.3%
뚝섬체육공원 어린이대공원 1
 
3.3%
양천공원, 안양천체육공원 1
 
3.3%
건너말, 가락근린 1
 
3.3%
개웅산 1
 
3.3%
1-1 1
 
3.3%

Length

2023-12-10T23:51:34.944725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:35.060781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단지공원 1
8.3%
근린공원 1
8.3%
용산가족 1
8.3%
까치산근린 1
8.3%
뚝섬체육공원 1
8.3%
어린이대공원 1
8.3%
양천공원 1
8.3%
안양천체육공원 1
8.3%
건너말 1
8.3%
가락근린 1
8.3%
Other values (2) 2
16.7%
Distinct13
Distinct (%)43.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
18 
1132, 1135, 1139, 1143, 1154, 149
 
1
703, 72-1
 
1
17, 239-1, 28, 28-1, 289, 405, 414, 45, 452, 540, 552-2, 555-2, 66, 68, 716, 78-1, 78-3, 83-1, 97-2
 
1
202
 
1
Other values (8)

Length

Max length99
Median length1
Mean length8.3333333
Min length1

Unique

Unique12 ?
Unique (%)40.0%

Sample

1st row
2nd row
3rd row1132, 1135, 1139, 1143, 1154, 149
4th row
5th row

Common Values

ValueCountFrequency (%)
18
60.0%
1132, 1135, 1139, 1143, 1154, 149 1
 
3.3%
703, 72-1 1
 
3.3%
17, 239-1, 28, 28-1, 289, 405, 414, 45, 452, 540, 552-2, 555-2, 66, 68, 716, 78-1, 78-3, 83-1, 97-2 1
 
3.3%
202 1
 
3.3%
17, 47, 567 1
 
3.3%
2112, 202, 147, 1017, 370, 330, 300, 721 1
 
3.3%
간선 143 1
 
3.3%
8,28,6,333, 1
 
3.3%
752 1
 
3.3%
Other values (3) 3
 
10.0%

Length

2023-12-10T23:51:35.209856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
202 2
 
4.3%
17 2
 
4.3%
1132 1
 
2.2%
370 1
 
2.2%
83-1 1
 
2.2%
97-2 1
 
2.2%
47 1
 
2.2%
567 1
 
2.2%
2112 1
 
2.2%
147 1
 
2.2%
Other values (34) 34
73.9%

좌석버스번호
Categorical

IMBALANCE 

Distinct5
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
26 
1007, 1115-1
 
1
567
 
1
960
 
1
146
 
1

Length

Max length12
Median length1
Mean length1.5666667
Min length1

Unique

Unique4 ?
Unique (%)13.3%

Sample

1st row
2nd row
3rd row1007, 1115-1
4th row567
5th row

Common Values

ValueCountFrequency (%)
26
86.7%
1007, 1115-1 1
 
3.3%
567 1
 
3.3%
960 1
 
3.3%
146 1
 
3.3%

Length

2023-12-10T23:51:35.345785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:51:35.446946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1007 1
20.0%
1115-1 1
20.0%
567 1
20.0%
960 1
20.0%
146 1
20.0%
Distinct14
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
12 
5호선
2호선
7호선
3호선
Other values (9)

Length

Max length13
Median length8
Mean length4.0333333
Min length1

Unique

Unique9 ?
Unique (%)30.0%

Sample

1st row
2nd row5호선
3rd row
4th row1호선
5th row2호선, 8호선

Common Values

ValueCountFrequency (%)
12
40.0%
5호선 3
 
10.0%
2호선 2
 
6.7%
7호선 2
 
6.7%
3호선 2
 
6.7%
1호선 1
 
3.3%
2호선, 8호선 1
 
3.3%
1호선, 2호선 1
 
3.3%
1호선, 4호선 1
 
3.3%
6호선 1
 
3.3%
Other values (4) 4
 
13.3%

Length

2023-12-10T23:51:35.556355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1호선 7
24.1%
5호선 6
20.7%
2호선 6
20.7%
7호선 3
10.3%
3호선 2
 
6.9%
4호선 2
 
6.9%
6호선 2
 
6.9%
8호선 1
 
3.4%
Distinct23
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:35.751246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length13
Mean length4.8
Min length1

Characters and Unicode

Total characters144
Distinct characters63
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)73.3%

Sample

1st row
2nd row
3rd row면목
4th row
5th row
ValueCountFrequency (%)
까치산 2
 
5.6%
성수 2
 
5.6%
신도림 2
 
5.6%
창동역 2
 
5.6%
군자 2
 
5.6%
고덕 1
 
2.8%
장한평 1
 
2.8%
어린이대공원 1
 
2.8%
오목교 1
 
2.8%
면목 1
 
2.8%
Other values (21) 21
58.3%
2023-12-10T23:51:36.099429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
15.3%
, 14
 
9.7%
10
 
6.9%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (53) 72
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108
75.0%
Space Separator 22
 
15.3%
Other Punctuation 14
 
9.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
9.3%
5
 
4.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (51) 67
62.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Other Punctuation
ValueCountFrequency (%)
, 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 108
75.0%
Common 36
 
25.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
9.3%
5
 
4.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (51) 67
62.0%
Common
ValueCountFrequency (%)
22
61.1%
, 14
38.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 108
75.0%
ASCII 36
 
25.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22
61.1%
, 14
38.9%
Hangul
ValueCountFrequency (%)
10
 
9.3%
5
 
4.6%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (51) 67
62.0%
Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-10T23:51:36.242035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length6.3333333
Min length1

Characters and Unicode

Total characters190
Distinct characters12
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)40.0%

Sample

1st row버스10분
2nd row도보5 분
3rd row도보15분, 도보10분
4th row도보10분, 도보10분
5th row도보5 분
ValueCountFrequency (%)
도보10분 11
26.8%
10
24.4%
도보5 4
 
9.8%
버스5 4
 
9.8%
버스15분 4
 
9.8%
버스10분 3
 
7.3%
도보15분 3
 
7.3%
도보9 1
 
2.4%
도보7 1
 
2.4%
2023-12-10T23:51:36.516051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
17.4%
31
16.3%
1 21
11.1%
20
10.5%
20
10.5%
5 15
7.9%
0 14
7.4%
, 12
 
6.3%
11
 
5.8%
11
 
5.8%
Other values (2) 2
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
48.9%
Decimal Number 52
27.4%
Space Separator 33
 
17.4%
Other Punctuation 12
 
6.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
33.3%
20
21.5%
20
21.5%
11
 
11.8%
11
 
11.8%
Decimal Number
ValueCountFrequency (%)
1 21
40.4%
5 15
28.8%
0 14
26.9%
9 1
 
1.9%
7 1
 
1.9%
Space Separator
ValueCountFrequency (%)
33
100.0%
Other Punctuation
ValueCountFrequency (%)
, 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 97
51.1%
Hangul 93
48.9%

Most frequent character per script

Common
ValueCountFrequency (%)
33
34.0%
1 21
21.6%
5 15
15.5%
0 14
14.4%
, 12
 
12.4%
9 1
 
1.0%
7 1
 
1.0%
Hangul
ValueCountFrequency (%)
31
33.3%
20
21.5%
20
21.5%
11
 
11.8%
11
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 97
51.1%
Hangul 93
48.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33
34.0%
1 21
21.6%
5 15
15.5%
0 14
14.4%
, 12
 
12.4%
9 1
 
1.0%
7 1
 
1.0%
Hangul
ValueCountFrequency (%)
31
33.3%
20
21.5%
20
21.5%
11
 
11.8%
11
 
11.8%

법정동코드
Real number (ℝ)

Distinct27
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1495106 × 109
Minimum1.1230103 × 109
Maximum1.1740109 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:36.691192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1230103 × 109
5-th percentile1.1230106 × 109
Q11.1350102 × 109
median1.1515104 × 109
Q31.1680101 × 109
95-th percentile1.1740107 × 109
Maximum1.1740109 × 109
Range51000600
Interquartile range (IQR)32999900

Descriptive statistics

Standard deviation18325969
Coefficient of variation (CV)0.015942409
Kurtosis-1.4189701
Mean1.1495106 × 109
Median Absolute Deviation (MAD)16500150
Skewness-0.11244108
Sum3.4485319 × 1010
Variance3.3584113 × 1014
MonotonicityNot monotonic
2023-12-10T23:51:36.842927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
1135010200 2
 
6.7%
1123010600 2
 
6.7%
1168010100 2
 
6.7%
1129013300 1
 
3.3%
1174010600 1
 
3.3%
1165010100 1
 
3.3%
1168011000 1
 
3.3%
1168010500 1
 
3.3%
1171010900 1
 
3.3%
1150010600 1
 
3.3%
Other values (17) 17
56.7%
ValueCountFrequency (%)
1123010300 1
3.3%
1123010600 2
6.7%
1123010900 1
3.3%
1126010300 1
3.3%
1126010600 1
3.3%
1129013300 1
3.3%
1135010200 2
6.7%
1135010500 1
3.3%
1144010200 1
3.3%
1144010800 1
3.3%
ValueCountFrequency (%)
1174010900 1
3.3%
1174010800 1
3.3%
1174010600 1
3.3%
1174010500 1
3.3%
1171010900 1
3.3%
1168011000 1
3.3%
1168010500 1
3.3%
1168010100 2
6.7%
1165010100 1
3.3%
1162010100 1
3.3%

리모델링여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
30 
ValueCountFrequency (%)
False 30
100.0%
2023-12-10T23:51:36.969089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

단지좌표X값
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean955608.77
Minimum941041.83
Maximum968833.65
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:37.069148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum941041.83
5-th percentile942372.76
Q1949042.48
median956716.92
Q3961038.11
95-th percentile967730.39
Maximum968833.65
Range27791.823
Interquartile range (IQR)11995.63

Descriptive statistics

Standard deviation7905.1249
Coefficient of variation (CV)0.0082723445
Kurtosis-0.81462164
Mean955608.77
Median Absolute Deviation (MAD)5064.9772
Skewness-0.30192517
Sum28668263
Variance62491000
MonotonicityNot monotonic
2023-12-10T23:51:37.210329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
966908.4271357 1
 
3.3%
958299.777289 1
 
3.3%
953935.0 1
 
3.3%
955892.2471324 1
 
3.3%
956350.0078392 1
 
3.3%
945904.2959509 1
 
3.3%
941041.8280937 1
 
3.3%
968402.9102836 1
 
3.3%
963150.5254392 1
 
3.3%
944731.8418041 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
941041.8280937 1
3.3%
941761.5610395 1
3.3%
943119.7882597 1
3.3%
944731.8418041 1
3.3%
945630.2169673 1
3.3%
945904.2959509 1
3.3%
947996.9774297 1
3.3%
948179.3031604 1
3.3%
951632.0 1
3.3%
951671.8819229 1
3.3%
ValueCountFrequency (%)
968833.651177 1
3.3%
968402.9102836 1
3.3%
966908.4271357 1
3.3%
963379.1976985 1
3.3%
963150.5254392 1
3.3%
962081.6544194 1
3.3%
961551.2916145 1
3.3%
961182.9994295 1
3.3%
960603.4319954 1
3.3%
960310.1961204 1
3.3%

단지좌표Y값
Real number (ℝ)

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1949523.5
Minimum1941497.8
Maximum1964300
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-10T23:51:37.346215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1941497.8
5-th percentile1943198.5
Q11946379.7
median1947936.3
Q31952309.8
95-th percentile1959458.5
Maximum1964300
Range22802.212
Interquartile range (IQR)5930.0918

Descriptive statistics

Standard deviation5431.6969
Coefficient of variation (CV)0.0027861665
Kurtosis0.82972012
Mean1949523.5
Median Absolute Deviation (MAD)2965.8008
Skewness1.0342825
Sum58485704
Variance29503331
MonotonicityNot monotonic
2023-12-10T23:51:37.505432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1952398.0684939 1
 
3.3%
1947841.2527499 1
 
3.3%
1949475.925128 1
 
3.3%
1945130.0 1
 
3.3%
1947810.1681292 1
 
3.3%
1944810.9147491 1
 
3.3%
1950074.5848712 1
 
3.3%
1942817.1714337 1
 
3.3%
1946613.9140041 1
 
3.3%
1964300.0 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1941497.7878205 1
3.3%
1942817.1714337 1
3.3%
1943664.5041457 1
3.3%
1944069.0 1
3.3%
1944366.5193068 1
3.3%
1944810.9147491 1
3.3%
1945130.0 1
3.3%
1946342.5555938 1
3.3%
1946491.2826931 1
3.3%
1946613.9140041 1
3.3%
ValueCountFrequency (%)
1964300.0 1
3.3%
1961229.4139545 1
3.3%
1957294.0418841 1
3.3%
1956898.9877605 1
3.3%
1955593.6162735 1
3.3%
1954555.3793611 1
3.3%
1953036.3299035 1
3.3%
1952398.0684939 1
3.3%
1952045.1112457 1
3.3%
1950102.6228681 1
3.3%

Sample

부동산물건uid시도명시군구명구명읍면동명리명번지기타번지아파트명총세대수총동수단지최고층단지최저층준공연월건설업체명난방방식난방연료주차대수관리사무소전화번호분양권여부입주년월단지특성우편번호주상복합여부빌라연립여부구청명동사무소명파출소명소방서명은행명병원명스포츠센터명쇼핑센터명학교명공원명일반버스번호좌석버스번호지하철노선명지하철역명소요시간법정동코드리모델링여부단지좌표X값단지좌표Y값
0KBA013357서울특별시도봉구<NA>공릉동<NA>384브*운*톤*도*27011815201112코*롱*설*중앙도시가스45902-**7-**54F200103<NA>5350FF송파오산당, 대항병원양전, 개원, 경기여자버스10분1129013300F966908.4271361952398.068494
1KBA010740서울특별시송파구<NA>염창동<NA>1029개*우*2*70131922200308삼*건*개별도시가스32002-**27-**63F2005112** **역**보**분**최** **(**고**상**,**덕**)** **산**남**조**,**심**전**2710TF청량리나산,보라매이마트계남, 은정 ,갈산, 목일, 봉영여중, 신목, 영천여고5호선도보5 분1153010200F960310.196121956898.987761
2KBA023933서울특별시성동구<NA>서초동<NA>1714창*쌍*(*차*98122221198808대*건*지역도시가스8402-**2-**09F200303<NA>1647FF마포오금동까르푸, 그랜드마트1132, 1135, 1139, 1143, 1154, 1491007, 1115-1면목도보15분, 도보10분1144012300F941761.561041954555.379361
3KBA014516서울특별시동작구<NA>한강로3가<NA>877경*6251246198606<NA>개별도시가스30002-**9-**79F1997081**%**주**,**선**호**1** **철**망**6635FF개포1노량진삼성초등, 신림중/여중, 삼성고단지공원, 근린공원5671호선도보10분, 도보10분1174010900F945630.2169671957294.041884
4KBA021376서울특별시서초구<NA>목동<NA>412-3746신*포*한*2*차*1941189200412월*건*개별도시가스21602-**5-**76F199912신**와**인**인** **대**향** **평**로**성**쾌** **단**동**리**인**도**접** **6544FF강서도곡초,도성초, 대명중,진선여중, 진선여고,휘문중고2호선, 8호선도보5 분1153011200F943119.788261953036.329903
5KBA000034서울특별시용산구<NA>등촌동<NA>578목*트*빌*228241410200306동*건*개별열병합002-**2-**74F199708조**고**차** **움**3690FF703, 72-15호선성수도보9 분1147010100F955246.01946342.555594
6KBA006690서울특별시송파구<NA>잠원동<NA>126외 2필지창*쌍*스*닷*3583155201406세*건*개별도시가스0F201201<NA>8792FF청계, 중원, 대진여자용산가족17, 239-1, 28, 28-1, 289, 405, 414, 45, 452, 540, 552-2, 555-2, 66, 68, 716, 78-1, 78-3, 83-1, 97-2잠원역1123010600F951632.01948031.263574
7KBA007728서울특별시동대문구<NA>신정동<NA>647미*(*차*299111517200312임*토*개별도시가스22002-**37-**49F200211남**한**망**최**복**파**건**3648FF빅마켓.이마트.홈플러스.하나로클럽, 롯데상도초등학교, 장승중학교, 당곡중학교, 강남여중학교, 당곡고등학교2021호선, 2호선버스5 분, 도보10분1153010700F962081.6544191947216.0
8KBA000663서울특별시양천구<NA>홍제동<NA>500북*현*107042510199604건*개별도시가스111002-**4-**93F197010<NA>2054FF강남이대목동병원,홍익병여의초, 여의중, 여의고1호선, 4호선신도림, 도림천, 신도림도보7 분1135010500F951671.8819231941497.787821
9KBA009768서울특별시중랑구<NA>청량리동<NA>425-5외3필지삼*동*데*슬*덤*12708245200010롯*건*개별도시가스200F198910관** ** ** **하**있**공**고**적**,**장** **트**다**전** ** **!**4** **8393FF강남금강메디칼, 원자력/백/을지17, 47, 5676호선남부터미널역도보15분1162010100F961551.2916141946983.433565
부동산물건uid시도명시군구명구명읍면동명리명번지기타번지아파트명총세대수총동수단지최고층단지최저층준공연월건설업체명난방방식난방연료주차대수관리사무소전화번호분양권여부입주년월단지특성우편번호주상복합여부빌라연립여부구청명동사무소명파출소명소방서명은행명병원명스포츠센터명쇼핑센터명학교명공원명일반버스번호좌석버스번호지하철노선명지하철역명소요시간법정동코드리모델링여부단지좌표X값단지좌표Y값
20KBA006358서울특별시서대문구<NA>신정동<NA>311-13776-3,776-4미*89312515200506S*공*개별도시가스111002-**71-**33F198012총**수**6**대**분** **8**,** **5**.**조** **8** **분**1**세** **H**6705FT상도2동5호선버스5 분, 버스5 분1144010800F959952.9424311944366.519307
21KBA009210서울특별시강서구<NA>목동<NA>366(종전27-1)G*강*자*88181713199912현*산*개*개별도시가스32502-**6-**63F200005<NA>7560FF서대문원자력병원,을지병원,태릉성심병원킴스클럽중계, 재현, 상계제일, 재현양천공원, 안양천체육공원성수, 군자, 장한평, 어린이대공원, 군자1150010600F948179.303161952045.111246
22KBA001221서울특별시도봉구<NA>월계동<NA>454401고*.*4163615201107<NA>개별도시가스31402-**6-**77F198810<NA>8610FF송파2서울중앙병원홈에버,농산물직판장, 행복한세상,현대건너말, 가락근린66403호선오목교1171010900F944731.8418041964300.0
23KBA000361서울특별시노원구<NA>명륜1가<NA>746송*센*레*129721513199905동*고*건*중앙도시가스0F198906<NA>2609FF서초롯데아울렛,이마트,롯데마트고덕도보10분1168010100F963150.5254391946613.914004
24KBA011630서울특별시영등포구<NA>창동<NA>970260-1,260-2,260-3,260-4현*2*6221519200405현*산*개*지역열병합10702-**5-**95F197111<NA>5005FF건영스포츠센타천호현대, 이마트, 롯데마그넷개웅산오류동역1135010200F968402.9102841942817.171434
25KBA008372서울특별시강남구<NA>항동<NA>450수*산*원*자*3626199199712극*건*개별열병합46802-**93-**68F1984071**0**인**2**,**~**라** **평** **차** ** **봉** **에**치**아**단** **7535FF대성 스포츠센터 성북 종합 레포츠타운이마트용산, 신용산, 마포, 효창공원앞버스5 분, 도보10분, 도보10분1168010500F941041.8280941950074.584871
26KBA016209서울특별시성북구<NA>상일동<NA>463외1필지327-6, 공고시 269-31번지외3필지한*(*차*15471310198607삼*물*개별열병합0F200905-**전**능**반**1**세**조** **대**8**번** **지**0**0**:**지**4187FF한양파출소, 강남710수색버스10분, 버스10분, 버스15분1168011000F945904.2959511944810.914749
27KBA019167서울특별시양천구<NA>신월동<NA>611보*매*오*하*채*8042213200411우*건*개별도시가스002-**6-**16F199610<NA>8327FF관악가양1동현대,신세계대림, 문창1-15호선, 5호선, 5호선뚝섬역, 서울숲역1168010100F956350.0078391947810.168129
28KBA017483서울특별시성동구<NA>명일동<NA>974삼*동*성*니*29812516198809<NA>지역도시가스87702-**6-**71F199206<NA>8775FF서울은행, 조흥은행염동2호선도보5 분1165010100F955892.2471321945130.0
29KBA016522서울특별시영등포구<NA>구로동<NA>218서*네*처*4*지*1621104200109대*건*중앙도시가스62402-**21-**03F199011***세** **5**(**전**3**.**임**6**대**별**(**가**8**대**0**1**동**일**5068FF마포수궁119영희, 중동, 중동92.1.33.36146도보10분, 도보10분1174010600F953935.01949475.925128