Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells77777
Missing cells (%)26.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory250.0 B

Variable types

Categorical9
Numeric4
DateTime8
Text7
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author은평구
URLhttps://data.seoul.go.kr/dataList/OA-18813/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 has constant value ""Constant
자산규모 is highly imbalanced (58.6%)Imbalance
부채총액 is highly imbalanced (58.6%)Imbalance
자본금 is highly imbalanced (58.6%)Imbalance
판매방식명 is highly imbalanced (72.1%)Imbalance
인허가취소일자 has 9999 (> 99.9%) missing valuesMissing
폐업일자 has 7072 (70.7%) missing valuesMissing
휴업시작일자 has 9954 (99.5%) missing valuesMissing
휴업종료일자 has 9954 (99.5%) missing valuesMissing
재개업일자 has 9984 (99.8%) missing valuesMissing
전화번호 has 6431 (64.3%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7997 (80.0%) missing valuesMissing
지번주소 has 190 (1.9%) missing valuesMissing
도로명주소 has 1189 (11.9%) missing valuesMissing
도로명우편번호 has 2493 (24.9%) missing valuesMissing
좌표정보(X) has 1257 (12.6%) missing valuesMissing
좌표정보(Y) has 1257 (12.6%) missing valuesMissing
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 08:21:46.483738
Analysis finished2024-05-11 08:21:51.610276
Duration5.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3110000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3110000
2nd row3110000
3rd row3110000
4th row3110000
5th row3110000

Common Values

ValueCountFrequency (%)
3110000 10000
100.0%

Length

2024-05-11T08:21:51.918186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:21:52.190906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3110000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.016566 × 1018
Minimum1.998311 × 1018
Maximum2.024311 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:21:52.576852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.998311 × 1018
5-th percentile2.004311 × 1018
Q12.012311 × 1018
median2.018311 × 1018
Q32.021311 × 1018
95-th percentile2.023311 × 1018
Maximum2.024311 × 1018
Range2.6 × 1016
Interquartile range (IQR)9.0000086 × 1015

Descriptive statistics

Standard deviation6.102274 × 1015
Coefficient of variation (CV)0.003026072
Kurtosis-0.67675261
Mean2.016566 × 1018
Median Absolute Deviation (MAD)4.0000086 × 1015
Skewness-0.71542579
Sum3.3688984 × 1018
Variance3.7237748 × 1031
MonotonicityNot monotonic
2024-05-11T08:21:53.290455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2021311021730201951 1
 
< 0.1%
2012311013130200014 1
 
< 0.1%
2020311021730202378 1
 
< 0.1%
2022311021730201157 1
 
< 0.1%
2022311021730200079 1
 
< 0.1%
2021311021730202036 1
 
< 0.1%
2016311013130200662 1
 
< 0.1%
2021311021730200775 1
 
< 0.1%
2023311021730201919 1
 
< 0.1%
2023311021730200803 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1998311021730201131 1
< 0.1%
2002311011130200001 1
< 0.1%
2002311011130200002 1
< 0.1%
2002311011130200004 1
< 0.1%
2002311011130200005 1
< 0.1%
2002311011130200007 1
< 0.1%
2002311011130200010 1
< 0.1%
2002311011130200014 1
< 0.1%
2002311011130200015 1
< 0.1%
2002311011130200018 1
< 0.1%
ValueCountFrequency (%)
2024311021730200720 1
< 0.1%
2024311021730200719 1
< 0.1%
2024311021730200718 1
< 0.1%
2024311021730200717 1
< 0.1%
2024311021730200715 1
< 0.1%
2024311021730200714 1
< 0.1%
2024311021730200713 1
< 0.1%
2024311021730200710 1
< 0.1%
2024311021730200709 1
< 0.1%
2024311021730200708 1
< 0.1%
Distinct3853
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1998-02-19 00:00:00
Maximum2206-11-14 00:00:00
2024-05-11T08:21:53.869678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:21:54.529987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
Minimum2014-04-10 00:00:00
Maximum2014-04-10 00:00:00
2024-05-11T08:21:54.924469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:21:55.464010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
3902 
3
2904 
4
2741 
5
426 
2
 
27

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row3
4th row3
5th row4

Common Values

ValueCountFrequency (%)
1 3902
39.0%
3 2904
29.0%
4 2741
27.4%
5 426
 
4.3%
2 27
 
0.3%

Length

2024-05-11T08:21:56.025981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:21:56.589945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 3902
39.0%
3 2904
29.0%
4 2741
27.4%
5 426
 
4.3%
2 27
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
3902 
폐업
2904 
취소/말소/만료/정지/중지
2741 
제외/삭제/전출
426 
휴업
 
27

Length

Max length14
Median length8
Mean length6.7154
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row폐업
4th row폐업
5th row취소/말소/만료/정지/중지

Common Values

ValueCountFrequency (%)
영업/정상 3902
39.0%
폐업 2904
29.0%
취소/말소/만료/정지/중지 2741
27.4%
제외/삭제/전출 426
 
4.3%
휴업 27
 
0.3%

Length

2024-05-11T08:21:57.001572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:21:57.449052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 3902
39.0%
폐업 2904
29.0%
취소/말소/만료/정지/중지 2741
27.4%
제외/삭제/전출 426
 
4.3%
휴업 27
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3979
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:21:57.939967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q37
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)6

Descriptive statistics

Standard deviation2.4410595
Coefficient of variation (CV)0.71840239
Kurtosis-1.3293244
Mean3.3979
Median Absolute Deviation (MAD)2
Skewness0.52065505
Sum33979
Variance5.9587715
MonotonicityNot monotonic
2024-05-11T08:21:58.495375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 3902
39.0%
3 2904
29.0%
7 2739
27.4%
5 426
 
4.3%
2 27
 
0.3%
4 2
 
< 0.1%
ValueCountFrequency (%)
1 3902
39.0%
2 27
 
0.3%
3 2904
29.0%
4 2
 
< 0.1%
5 426
 
4.3%
7 2739
27.4%
ValueCountFrequency (%)
7 2739
27.4%
5 426
 
4.3%
4 2
 
< 0.1%
3 2904
29.0%
2 27
 
0.3%
1 3902
39.0%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
3902 
폐업처리
2904 
직권말소
2739 
타시군구이관
426 
휴업처리
 
27

Length

Max length6
Median length4
Mean length4.0852
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row폐업처리
4th row폐업처리
5th row직권말소

Common Values

ValueCountFrequency (%)
정상영업 3902
39.0%
폐업처리 2904
29.0%
직권말소 2739
27.4%
타시군구이관 426
 
4.3%
휴업처리 27
 
0.3%
직권취소 2
 
< 0.1%

Length

2024-05-11T08:21:59.091161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:21:59.718020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 3902
39.0%
폐업처리 2904
29.0%
직권말소 2739
27.4%
타시군구이관 426
 
4.3%
휴업처리 27
 
0.3%
직권취소 2
 
< 0.1%

폐업일자
Date

MISSING 

Distinct1778
Distinct (%)60.7%
Missing7072
Missing (%)70.7%
Memory size156.2 KiB
Minimum2002-05-11 00:00:00
Maximum2024-05-09 00:00:00
2024-05-11T08:22:00.747875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:22:01.339300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct45
Distinct (%)97.8%
Missing9954
Missing (%)99.5%
Memory size156.2 KiB
Minimum2007-08-20 00:00:00
Maximum2024-03-12 00:00:00
2024-05-11T08:22:01.915384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:22:02.338192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)

휴업종료일자
Date

MISSING 

Distinct44
Distinct (%)95.7%
Missing9954
Missing (%)99.5%
Memory size156.2 KiB
Minimum2008-06-01 00:00:00
Maximum2033-12-31 00:00:00
2024-05-11T08:22:02.760716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:22:03.419283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)

재개업일자
Date

MISSING 

Distinct15
Distinct (%)93.8%
Missing9984
Missing (%)99.8%
Memory size156.2 KiB
Minimum2007-06-04 00:00:00
Maximum2024-04-06 00:00:00
2024-05-11T08:22:03.804407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:22:04.106396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)

전화번호
Text

MISSING 

Distinct3361
Distinct (%)94.2%
Missing6431
Missing (%)64.3%
Memory size156.2 KiB
2024-05-11T08:22:04.848439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length11.597366
Min length1

Characters and Unicode

Total characters41391
Distinct characters15
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3307 ?
Unique (%)92.7%

Sample

1st row02 388 8832
2nd row02-6397-0687
3rd row1644 1020
4th row02 384 3669
5th row070-4066-0683
ValueCountFrequency (%)
02 1396
 
20.3%
070 230
 
3.3%
02-000-0000 154
 
2.2%
383 73
 
1.1%
355 70
 
1.0%
382 67
 
1.0%
388 63
 
0.9%
356 59
 
0.9%
353 58
 
0.8%
387 58
 
0.8%
Other values (3591) 4665
67.7%
2024-05-11T08:22:06.018889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7433
18.0%
2 4929
11.9%
4381
10.6%
3 4331
10.5%
- 3588
8.7%
7 3045
7.4%
8 2935
 
7.1%
5 2879
 
7.0%
6 2055
 
5.0%
4 2046
 
4.9%
Other values (5) 3769
9.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 33417
80.7%
Space Separator 4381
 
10.6%
Dash Punctuation 3588
 
8.7%
Other Punctuation 2
 
< 0.1%
Math Symbol 2
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7433
22.2%
2 4929
14.7%
3 4331
13.0%
7 3045
9.1%
8 2935
 
8.8%
5 2879
 
8.6%
6 2055
 
6.1%
4 2046
 
6.1%
1 1981
 
5.9%
9 1783
 
5.3%
Space Separator
ValueCountFrequency (%)
4381
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3588
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 41391
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7433
18.0%
2 4929
11.9%
4381
10.6%
3 4331
10.5%
- 3588
8.7%
7 3045
7.4%
8 2935
 
7.1%
5 2879
 
7.0%
6 2055
 
5.0%
4 2046
 
4.9%
Other values (5) 3769
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41391
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7433
18.0%
2 4929
11.9%
4381
10.6%
3 4331
10.5%
- 3588
8.7%
7 3045
7.4%
8 2935
 
7.1%
5 2879
 
7.0%
6 2055
 
5.0%
4 2046
 
4.9%
Other values (5) 3769
9.1%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Text

MISSING 

Distinct197
Distinct (%)9.8%
Missing7997
Missing (%)80.0%
Memory size156.2 KiB
2024-05-11T08:22:06.842593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.0004993
Min length6

Characters and Unicode

Total characters12019
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)2.2%

Sample

1st row122910
2nd row122010
3rd row122060
4th row122834
5th row122050
ValueCountFrequency (%)
122010 189
 
9.4%
122050 124
 
6.2%
122040 124
 
6.2%
122080 94
 
4.7%
122070 49
 
2.4%
122200 46
 
2.3%
122900 41
 
2.0%
122842 40
 
2.0%
122837 32
 
1.6%
122020 31
 
1.5%
Other values (187) 1233
61.6%
2024-05-11T08:22:07.915428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 4301
35.8%
1 2534
21.1%
0 1896
15.8%
8 1076
 
9.0%
9 616
 
5.1%
4 365
 
3.0%
3 362
 
3.0%
7 334
 
2.8%
5 332
 
2.8%
6 202
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12018
> 99.9%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 4301
35.8%
1 2534
21.1%
0 1896
15.8%
8 1076
 
9.0%
9 616
 
5.1%
4 365
 
3.0%
3 362
 
3.0%
7 334
 
2.8%
5 332
 
2.8%
6 202
 
1.7%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12019
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 4301
35.8%
1 2534
21.1%
0 1896
15.8%
8 1076
 
9.0%
9 616
 
5.1%
4 365
 
3.0%
3 362
 
3.0%
7 334
 
2.8%
5 332
 
2.8%
6 202
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12019
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 4301
35.8%
1 2534
21.1%
0 1896
15.8%
8 1076
 
9.0%
9 616
 
5.1%
4 365
 
3.0%
3 362
 
3.0%
7 334
 
2.8%
5 332
 
2.8%
6 202
 
1.7%

지번주소
Text

MISSING 

Distinct4019
Distinct (%)41.0%
Missing190
Missing (%)1.9%
Memory size156.2 KiB
2024-05-11T08:22:08.409128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length46
Mean length25.586239
Min length6

Characters and Unicode

Total characters251001
Distinct characters499
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3136 ?
Unique (%)32.0%

Sample

1st row서울특별시 은평구 갈현동 ***-** 대신홈타운
2nd row서울특별시 은평구 증산동 ***-* 한성벨라루체
3rd row서울특별시 은평구 녹번동 **-*** ***호
4th row서울특별시 은평구 응암동 ***번지 *호 수현맨션 ***호
5th row서울특별시 은평구 응암동 ***번지 **호
ValueCountFrequency (%)
서울특별시 9806
18.7%
은평구 9793
18.6%
5579
10.6%
5060
9.6%
번지 4554
 
8.7%
응암동 1693
 
3.2%
불광동 1226
 
2.3%
갈현동 1152
 
2.2%
역촌동 1022
 
1.9%
신사동 946
 
1.8%
Other values (2478) 11682
22.2%
2024-05-11T08:22:09.225059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 47271
18.8%
43874
17.5%
10646
 
4.2%
10532
 
4.2%
10433
 
4.2%
10385
 
4.1%
9956
 
4.0%
9912
 
3.9%
9856
 
3.9%
9810
 
3.9%
Other values (489) 78326
31.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 152790
60.9%
Other Punctuation 47304
 
18.8%
Space Separator 43874
 
17.5%
Dash Punctuation 4238
 
1.7%
Decimal Number 1712
 
0.7%
Uppercase Letter 875
 
0.3%
Lowercase Letter 131
 
0.1%
Close Punctuation 35
 
< 0.1%
Open Punctuation 35
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10646
 
7.0%
10532
 
6.9%
10433
 
6.8%
10385
 
6.8%
9956
 
6.5%
9912
 
6.5%
9856
 
6.5%
9810
 
6.4%
9806
 
6.4%
6001
 
3.9%
Other values (428) 55453
36.3%
Uppercase Letter
ValueCountFrequency (%)
B 158
18.1%
A 121
13.8%
C 112
12.8%
D 87
9.9%
M 83
9.5%
J 48
 
5.5%
S 47
 
5.4%
T 43
 
4.9%
P 27
 
3.1%
K 25
 
2.9%
Other values (13) 124
14.2%
Lowercase Letter
ValueCountFrequency (%)
e 91
69.5%
a 10
 
7.6%
s 8
 
6.1%
u 4
 
3.1%
i 3
 
2.3%
b 2
 
1.5%
c 2
 
1.5%
t 2
 
1.5%
l 2
 
1.5%
r 2
 
1.5%
Other values (5) 5
 
3.8%
Decimal Number
ValueCountFrequency (%)
1 344
20.1%
2 251
14.7%
3 188
11.0%
4 167
9.8%
0 160
9.3%
5 147
8.6%
6 132
 
7.7%
7 126
 
7.4%
8 103
 
6.0%
9 94
 
5.5%
Other Punctuation
ValueCountFrequency (%)
* 47271
99.9%
/ 12
 
< 0.1%
, 11
 
< 0.1%
& 6
 
< 0.1%
. 2
 
< 0.1%
@ 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
43874
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4238
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 152790
60.9%
Common 97202
38.7%
Latin 1009
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10646
 
7.0%
10532
 
6.9%
10433
 
6.8%
10385
 
6.8%
9956
 
6.5%
9912
 
6.5%
9856
 
6.5%
9810
 
6.4%
9806
 
6.4%
6001
 
3.9%
Other values (428) 55453
36.3%
Latin
ValueCountFrequency (%)
B 158
15.7%
A 121
12.0%
C 112
11.1%
e 91
9.0%
D 87
8.6%
M 83
8.2%
J 48
 
4.8%
S 47
 
4.7%
T 43
 
4.3%
P 27
 
2.7%
Other values (30) 192
19.0%
Common
ValueCountFrequency (%)
* 47271
48.6%
43874
45.1%
- 4238
 
4.4%
1 344
 
0.4%
2 251
 
0.3%
3 188
 
0.2%
4 167
 
0.2%
0 160
 
0.2%
5 147
 
0.2%
6 132
 
0.1%
Other values (11) 430
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 152790
60.9%
ASCII 98208
39.1%
Number Forms 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 47271
48.1%
43874
44.7%
- 4238
 
4.3%
1 344
 
0.4%
2 251
 
0.3%
3 188
 
0.2%
4 167
 
0.2%
0 160
 
0.2%
B 158
 
0.2%
5 147
 
0.1%
Other values (49) 1410
 
1.4%
Hangul
ValueCountFrequency (%)
10646
 
7.0%
10532
 
6.9%
10433
 
6.8%
10385
 
6.8%
9956
 
6.5%
9912
 
6.5%
9856
 
6.5%
9810
 
6.4%
9806
 
6.4%
6001
 
3.9%
Other values (428) 55453
36.3%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%

도로명주소
Text

MISSING 

Distinct5708
Distinct (%)64.8%
Missing1189
Missing (%)11.9%
Memory size156.2 KiB
2024-05-11T08:22:09.881764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length51
Mean length35.976847
Min length17

Characters and Unicode

Total characters316992
Distinct characters511
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4521 ?
Unique (%)51.3%

Sample

1st row서울특별시 은평구 연서로**길 **-*, ***호 (갈현동, 대신홈타운)
2nd row서울특별시 은평구 증산로 ***, *층 ***호 (증산동, 한성벨라루체)
3rd row서울특별시 은평구 은평로**길 **, 수현맨션 ***호 (응암동)
4th row서울특별시 은평구 불광천길 ***, ***호 (응암동, 미성쉐르빌)
5th row서울특별시 은평구 연서로 ***-**, ***호 (불광동, 도시휴먼빌)
ValueCountFrequency (%)
서울특별시 8810
14.7%
은평구 8794
14.7%
8397
14.0%
5342
 
8.9%
2106
 
3.5%
1721
 
2.9%
응암동 1390
 
2.3%
불광동 1011
 
1.7%
갈현동 935
 
1.6%
역촌동 804
 
1.3%
Other values (2972) 20636
34.4%
2024-05-11T08:22:10.893588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 57052
18.0%
51161
16.1%
, 11437
 
3.6%
11403
 
3.6%
10795
 
3.4%
10447
 
3.3%
10387
 
3.3%
9478
 
3.0%
8961
 
2.8%
8863
 
2.8%
Other values (501) 127008
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 171476
54.1%
Other Punctuation 68505
 
21.6%
Space Separator 51161
 
16.1%
Open Punctuation 8832
 
2.8%
Close Punctuation 8832
 
2.8%
Dash Punctuation 4104
 
1.3%
Decimal Number 2768
 
0.9%
Uppercase Letter 1128
 
0.4%
Lowercase Letter 176
 
0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11403
 
6.6%
10795
 
6.3%
10447
 
6.1%
10387
 
6.1%
9478
 
5.5%
8961
 
5.2%
8863
 
5.2%
8815
 
5.1%
8810
 
5.1%
8764
 
5.1%
Other values (436) 74753
43.6%
Uppercase Letter
ValueCountFrequency (%)
B 342
30.3%
A 178
15.8%
C 127
 
11.3%
M 90
 
8.0%
D 89
 
7.9%
S 50
 
4.4%
J 48
 
4.3%
T 35
 
3.1%
K 26
 
2.3%
R 24
 
2.1%
Other values (13) 119
 
10.5%
Lowercase Letter
ValueCountFrequency (%)
e 92
52.3%
a 16
 
9.1%
s 14
 
8.0%
b 10
 
5.7%
u 7
 
4.0%
i 5
 
2.8%
r 4
 
2.3%
c 4
 
2.3%
y 3
 
1.7%
l 3
 
1.7%
Other values (10) 18
 
10.2%
Decimal Number
ValueCountFrequency (%)
1 714
25.8%
2 470
17.0%
0 429
15.5%
3 304
11.0%
4 186
 
6.7%
7 156
 
5.6%
5 156
 
5.6%
6 152
 
5.5%
8 113
 
4.1%
9 88
 
3.2%
Other Punctuation
ValueCountFrequency (%)
* 57052
83.3%
, 11437
 
16.7%
. 7
 
< 0.1%
& 6
 
< 0.1%
/ 3
 
< 0.1%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
51161
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8832
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8832
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4104
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 171476
54.1%
Common 144209
45.5%
Latin 1307
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11403
 
6.6%
10795
 
6.3%
10447
 
6.1%
10387
 
6.1%
9478
 
5.5%
8961
 
5.2%
8863
 
5.2%
8815
 
5.1%
8810
 
5.1%
8764
 
5.1%
Other values (436) 74753
43.6%
Latin
ValueCountFrequency (%)
B 342
26.2%
A 178
13.6%
C 127
 
9.7%
e 92
 
7.0%
M 90
 
6.9%
D 89
 
6.8%
S 50
 
3.8%
J 48
 
3.7%
T 35
 
2.7%
K 26
 
2.0%
Other values (35) 230
17.6%
Common
ValueCountFrequency (%)
* 57052
39.6%
51161
35.5%
, 11437
 
7.9%
( 8832
 
6.1%
) 8832
 
6.1%
- 4104
 
2.8%
1 714
 
0.5%
2 470
 
0.3%
0 429
 
0.3%
3 304
 
0.2%
Other values (10) 874
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 171476
54.1%
ASCII 145513
45.9%
Number Forms 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 57052
39.2%
51161
35.2%
, 11437
 
7.9%
( 8832
 
6.1%
) 8832
 
6.1%
- 4104
 
2.8%
1 714
 
0.5%
2 470
 
0.3%
0 429
 
0.3%
B 342
 
0.2%
Other values (53) 2140
 
1.5%
Hangul
ValueCountFrequency (%)
11403
 
6.6%
10795
 
6.3%
10447
 
6.1%
10387
 
6.1%
9478
 
5.5%
8961
 
5.2%
8863
 
5.2%
8815
 
5.1%
8810
 
5.1%
8764
 
5.1%
Other values (436) 74753
43.6%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%

도로명우편번호
Text

MISSING 

Distinct390
Distinct (%)5.2%
Missing2493
Missing (%)24.9%
Memory size156.2 KiB
2024-05-11T08:22:11.744640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1409351
Min length5

Characters and Unicode

Total characters38593
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)0.5%

Sample

1st row03334
2nd row03500
3rd row03462
4th row122910
5th row03349
ValueCountFrequency (%)
03311 117
 
1.6%
03382 95
 
1.3%
03306 94
 
1.3%
03453 87
 
1.2%
03477 83
 
1.1%
03396 78
 
1.0%
03385 78
 
1.0%
03450 76
 
1.0%
03300 69
 
0.9%
03472 66
 
0.9%
Other values (380) 6664
88.8%
2024-05-11T08:22:13.372691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 11195
29.0%
0 8605
22.3%
4 4507
11.7%
2 3687
 
9.6%
1 2815
 
7.3%
8 1750
 
4.5%
5 1602
 
4.2%
9 1587
 
4.1%
7 1462
 
3.8%
6 1373
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38583
> 99.9%
Dash Punctuation 10
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 11195
29.0%
0 8605
22.3%
4 4507
11.7%
2 3687
 
9.6%
1 2815
 
7.3%
8 1750
 
4.5%
5 1602
 
4.2%
9 1587
 
4.1%
7 1462
 
3.8%
6 1373
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38593
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 11195
29.0%
0 8605
22.3%
4 4507
11.7%
2 3687
 
9.6%
1 2815
 
7.3%
8 1750
 
4.5%
5 1602
 
4.2%
9 1587
 
4.1%
7 1462
 
3.8%
6 1373
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38593
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 11195
29.0%
0 8605
22.3%
4 4507
11.7%
2 3687
 
9.6%
1 2815
 
7.3%
8 1750
 
4.5%
5 1602
 
4.2%
9 1587
 
4.1%
7 1462
 
3.8%
6 1373
 
3.6%
Distinct9786
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T08:22:14.387633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length41
Mean length6.6713
Min length1

Characters and Unicode

Total characters66713
Distinct characters1159
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9592 ?
Unique (%)95.9%

Sample

1st row중탱이직구
2nd row스페인상점 올레
3rd row엠와이피넷 (mypnet)
4th row아토
5th row딸기네 리본스쿨
ValueCountFrequency (%)
주식회사 312
 
2.5%
51
 
0.4%
company 30
 
0.2%
29
 
0.2%
컴퍼니 28
 
0.2%
26
 
0.2%
스튜디오 24
 
0.2%
22
 
0.2%
인셀덤 18
 
0.1%
korea 18
 
0.1%
Other values (10999) 12141
95.6%
2024-05-11T08:22:15.807931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2704
 
4.1%
2368
 
3.5%
1856
 
2.8%
) 1843
 
2.8%
( 1841
 
2.8%
1150
 
1.7%
e 1004
 
1.5%
829
 
1.2%
o 828
 
1.2%
a 744
 
1.1%
Other values (1149) 51546
77.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45069
67.6%
Lowercase Letter 8374
 
12.6%
Uppercase Letter 5967
 
8.9%
Space Separator 2704
 
4.1%
Close Punctuation 1846
 
2.8%
Open Punctuation 1844
 
2.8%
Decimal Number 502
 
0.8%
Other Punctuation 323
 
0.5%
Dash Punctuation 66
 
0.1%
Connector Punctuation 13
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2368
 
5.3%
1856
 
4.1%
1150
 
2.6%
829
 
1.8%
696
 
1.5%
685
 
1.5%
663
 
1.5%
610
 
1.4%
608
 
1.3%
607
 
1.3%
Other values (1064) 34997
77.7%
Lowercase Letter
ValueCountFrequency (%)
e 1004
12.0%
o 828
 
9.9%
a 744
 
8.9%
i 640
 
7.6%
n 631
 
7.5%
r 516
 
6.2%
l 486
 
5.8%
t 473
 
5.6%
s 427
 
5.1%
m 334
 
4.0%
Other values (16) 2291
27.4%
Uppercase Letter
ValueCountFrequency (%)
A 494
 
8.3%
O 475
 
8.0%
S 409
 
6.9%
E 400
 
6.7%
N 370
 
6.2%
I 350
 
5.9%
M 332
 
5.6%
L 319
 
5.3%
T 299
 
5.0%
R 278
 
4.7%
Other values (16) 2241
37.6%
Other Punctuation
ValueCountFrequency (%)
. 164
50.8%
& 84
26.0%
' 25
 
7.7%
, 22
 
6.8%
: 11
 
3.4%
? 7
 
2.2%
# 4
 
1.2%
/ 3
 
0.9%
@ 1
 
0.3%
% 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 87
17.3%
1 81
16.1%
3 60
12.0%
0 52
10.4%
5 48
9.6%
4 48
9.6%
9 40
8.0%
8 31
 
6.2%
7 30
 
6.0%
6 25
 
5.0%
Close Punctuation
ValueCountFrequency (%)
) 1843
99.8%
] 2
 
0.1%
} 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1841
99.8%
[ 2
 
0.1%
{ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
2704
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 13
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45051
67.5%
Latin 14341
 
21.5%
Common 7301
 
10.9%
Han 20
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2368
 
5.3%
1856
 
4.1%
1150
 
2.6%
829
 
1.8%
696
 
1.5%
685
 
1.5%
663
 
1.5%
610
 
1.4%
608
 
1.3%
607
 
1.3%
Other values (1047) 34979
77.6%
Latin
ValueCountFrequency (%)
e 1004
 
7.0%
o 828
 
5.8%
a 744
 
5.2%
i 640
 
4.5%
n 631
 
4.4%
r 516
 
3.6%
A 494
 
3.4%
l 486
 
3.4%
O 475
 
3.3%
t 473
 
3.3%
Other values (42) 8050
56.1%
Common
ValueCountFrequency (%)
2704
37.0%
) 1843
25.2%
( 1841
25.2%
. 164
 
2.2%
2 87
 
1.2%
& 84
 
1.2%
1 81
 
1.1%
- 66
 
0.9%
3 60
 
0.8%
0 52
 
0.7%
Other values (22) 319
 
4.4%
Han
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (8) 8
40.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45049
67.5%
ASCII 21642
32.4%
CJK 19
 
< 0.1%
None 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2704
 
12.5%
) 1843
 
8.5%
( 1841
 
8.5%
e 1004
 
4.6%
o 828
 
3.8%
a 744
 
3.4%
i 640
 
3.0%
n 631
 
2.9%
r 516
 
2.4%
A 494
 
2.3%
Other values (74) 10397
48.0%
Hangul
ValueCountFrequency (%)
2368
 
5.3%
1856
 
4.1%
1150
 
2.6%
829
 
1.8%
696
 
1.5%
685
 
1.5%
663
 
1.5%
610
 
1.4%
608
 
1.3%
607
 
1.3%
Other values (1046) 34977
77.6%
CJK
ValueCountFrequency (%)
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (7) 7
36.8%
None
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct9526
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-18 11:00:13
Maximum2024-05-09 16:10:19
2024-05-11T08:22:16.339214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:22:16.960261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7180 
U
2820 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7180
71.8%
U 2820
 
28.2%

Length

2024-05-11T08:22:17.404784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:22:17.773795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7180
71.8%
u 2820
 
28.2%
Distinct1424
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T08:22:18.237477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:22:18.773603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct411
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T08:22:19.287781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.7413
Min length1

Characters and Unicode

Total characters87413
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique247 ?
Unique (%)2.5%

Sample

1st row종합몰
2nd row종합몰
3rd row-
4th row가전 가구/수납용품
5th row종합몰
ValueCountFrequency (%)
의류/패션/잡화/뷰티 3827
27.9%
종합몰 3083
22.4%
기타 1741
12.7%
1114
 
8.1%
건강/식품 908
 
6.6%
교육/도서/완구/오락 728
 
5.3%
컴퓨터/사무용품 573
 
4.2%
가구/수납용품 508
 
3.7%
가전 454
 
3.3%
레져/여행/공연 337
 
2.5%
Other values (3) 462
 
3.4%
2024-05-11T08:22:20.520907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16699
19.1%
3827
 
4.4%
3827
 
4.4%
3827
 
4.4%
3827
 
4.4%
3827
 
4.4%
3827
 
4.4%
3827
 
4.4%
3827
 
4.4%
3735
 
4.3%
Other values (41) 36363
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65865
75.3%
Other Punctuation 16699
 
19.1%
Space Separator 3735
 
4.3%
Dash Punctuation 1114
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3083
 
4.7%
3083
 
4.7%
Other values (38) 29083
44.2%
Other Punctuation
ValueCountFrequency (%)
/ 16699
100.0%
Space Separator
ValueCountFrequency (%)
3735
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1114
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65865
75.3%
Common 21548
 
24.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3083
 
4.7%
3083
 
4.7%
Other values (38) 29083
44.2%
Common
ValueCountFrequency (%)
/ 16699
77.5%
3735
 
17.3%
- 1114
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65865
75.3%
ASCII 21548
 
24.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16699
77.5%
3735
 
17.3%
- 1114
 
5.2%
Hangul
ValueCountFrequency (%)
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3827
 
5.8%
3083
 
4.7%
3083
 
4.7%
Other values (38) 29083
44.2%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct5296
Distinct (%)60.6%
Missing1257
Missing (%)12.6%
Infinite0
Infinite (%)0.0%
Mean192786.39
Minimum173779.11
Maximum207026.17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:22:20.856350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum173779.11
5-th percentile191553.42
Q1192260.85
median192782.81
Q3193317.99
95-th percentile194101.13
Maximum207026.17
Range33247.053
Interquartile range (IQR)1057.1415

Descriptive statistics

Standard deviation881.48333
Coefficient of variation (CV)0.0045723317
Kurtosis44.214631
Mean192786.39
Median Absolute Deviation (MAD)527.41986
Skewness-0.33915074
Sum1.6855314 × 109
Variance777012.87
MonotonicityNot monotonic
2024-05-11T08:22:21.323310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
194183.438617956 50
 
0.5%
191100.037469089 49
 
0.5%
193600.888431741 45
 
0.4%
193326.661927575 38
 
0.4%
193178.165720379 37
 
0.4%
193914.648457333 34
 
0.3%
193226.593098986 33
 
0.3%
194029.235749604 31
 
0.3%
192998.202204135 30
 
0.3%
192600.451566859 28
 
0.3%
Other values (5286) 8368
83.7%
(Missing) 1257
 
12.6%
ValueCountFrequency (%)
173779.112241967 1
 
< 0.1%
183426.959933313 1
 
< 0.1%
185342.892387576 1
 
< 0.1%
185344.733441824 1
 
< 0.1%
187575.767405266 1
 
< 0.1%
189736.227998038 4
< 0.1%
189748.841711004 1
 
< 0.1%
189789.097246453 1
 
< 0.1%
189793.673057417 1
 
< 0.1%
189818.523319307 1
 
< 0.1%
ValueCountFrequency (%)
207026.165706533 1
< 0.1%
206099.711769712 1
< 0.1%
203204.755637424 1
< 0.1%
201913.141981906 1
< 0.1%
197567.747824202 1
< 0.1%
196576.705499528 1
< 0.1%
196111.168694145 1
< 0.1%
195504.203242847 1
< 0.1%
195498.306937492 1
< 0.1%
195485.478339888 1
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct5298
Distinct (%)60.6%
Missing1257
Missing (%)12.6%
Infinite0
Infinite (%)0.0%
Mean456195.74
Minimum440136.78
Maximum461624.52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:22:21.872975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum440136.78
5-th percentile453719.23
Q1455115.32
median456058.21
Q3457245.7
95-th percentile459266.82
Maximum461624.52
Range21487.738
Interquartile range (IQR)2130.3759

Descriptive statistics

Standard deviation1651.9279
Coefficient of variation (CV)0.0036210944
Kurtosis1.5528736
Mean456195.74
Median Absolute Deviation (MAD)1079.399
Skewness0.22284265
Sum3.9885194 × 109
Variance2728865.7
MonotonicityNot monotonic
2024-05-11T08:22:22.419570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
457350.932047179 50
 
0.5%
453842.444261662 49
 
0.5%
456957.233580498 45
 
0.4%
454458.493979132 38
 
0.4%
454282.485459616 37
 
0.4%
455188.446841774 34
 
0.3%
453966.677734224 33
 
0.3%
456185.615536089 31
 
0.3%
457479.99522553 30
 
0.3%
455197.91922534 28
 
0.3%
Other values (5288) 8368
83.7%
(Missing) 1257
 
12.6%
ValueCountFrequency (%)
440136.781183797 1
< 0.1%
444252.866602958 1
< 0.1%
445978.112229828 1
< 0.1%
447572.530804795 1
< 0.1%
449478.849222576 1
< 0.1%
449621.809378109 1
< 0.1%
450425.852790862 1
< 0.1%
450832.041460617 1
< 0.1%
450971.656176955 1
< 0.1%
451254.304299091 1
< 0.1%
ValueCountFrequency (%)
461624.519311438 2
 
< 0.1%
461610.341071848 1
 
< 0.1%
461584.700702146 2
 
< 0.1%
461536.107826838 1
 
< 0.1%
461516.537389431 1
 
< 0.1%
461485.037749866 3
 
< 0.1%
461458.190651603 1
 
< 0.1%
461455.876513588 8
0.1%
461445.126468997 1
 
< 0.1%
461423.011700226 1
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9165 
0
 
835

Length

Max length4
Median length4
Mean length3.7495
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9165
91.6%
0 835
 
8.3%

Length

2024-05-11T08:22:22.835116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:22:23.152363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9165
91.6%
0 835
 
8.3%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9165 
0
 
835

Length

Max length4
Median length4
Mean length3.7495
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9165
91.6%
0 835
 
8.3%

Length

2024-05-11T08:22:23.519436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:22:23.814762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9165
91.6%
0 835
 
8.3%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9165 
0
 
835

Length

Max length4
Median length4
Mean length3.7495
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9165
91.6%
0 835
 
8.3%

Length

2024-05-11T08:22:24.118507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:22:24.474600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9165
91.6%
0 835
 
8.3%

판매방식명
Categorical

IMBALANCE 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5120 
인터넷
4618 
인터넷, 기타
 
100
기타
 
48
TV홈쇼핑, 인터넷
 
22
Other values (16)
 
92

Length

Max length26
Median length4
Mean length3.6779
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row인터넷

Common Values

ValueCountFrequency (%)
<NA> 5120
51.2%
인터넷 4618
46.2%
인터넷, 기타 100
 
1.0%
기타 48
 
0.5%
TV홈쇼핑, 인터넷 22
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 19
 
0.2%
인터넷, 카다로그 18
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 8
 
0.1%
인터넷, 카다로그, 기타 7
 
0.1%
인터넷, 카다로그, 신문잡지 6
 
0.1%
Other values (11) 34
 
0.3%

Length

2024-05-11T08:22:24.984412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5120
49.6%
인터넷 4821
46.7%
기타 188
 
1.8%
카다로그 76
 
0.7%
tv홈쇼핑 68
 
0.7%
신문잡지 45
 
0.4%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
166763110000202131102173020195120211021<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 갈현동 ***-** 대신홈타운서울특별시 은평구 연서로**길 **-*, ***호 (갈현동, 대신홈타운)03334중탱이직구2021-10-21 11:15:59I2021-10-23 00:23:02.0종합몰192597.569446456961.804995000인터넷
21109311000020243110217302000632024-01-09<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 증산동 ***-* 한성벨라루체서울특별시 은평구 증산로 ***, *층 ***호 (증산동, 한성벨라루체)03500스페인상점 올레2024-01-09 14:56:06I2023-11-30 23:01:00.0종합몰191645.360057453195.62632<NA><NA><NA><NA>
6063110000200331101113020061420030902<NA>3폐업3폐업처리<NA><NA><NA><NA>02 388 8832<NA><NA>서울특별시 은평구 녹번동 **-*** ***호<NA><NA>엠와이피넷 (mypnet)2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
105603110000201831101313020064920180621<NA>3폐업3폐업처리20201112<NA><NA><NA>02-6397-0687<NA><NA>서울특별시 은평구 응암동 ***번지 *호 수현맨션 ***호서울특별시 은평구 은평로**길 **, 수현맨션 ***호 (응암동)03462아토2020-11-12 16:26:07I2021-12-03 22:02:00.0가전 가구/수납용품193362.619342455216.627207<NA><NA><NA><NA>
70433110000201431101313020036820140611<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA>122910서울특별시 은평구 응암동 ***번지 **호서울특별시 은평구 불광천길 ***, ***호 (응암동, 미성쉐르빌)122910딸기네 리본스쿨2017-12-14 09:03:20I2018-08-31 23:59:59.0종합몰192570.309083455178.13267<NA><NA><NA>인터넷
129203110000202031102173020046420200325<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 불광동 ***번지 **호 도시휴먼빌서울특별시 은평구 연서로 ***-**, ***호 (불광동, 도시휴먼빌)03349비채나월드2020-03-25 15:38:27I2021-12-03 22:02:00.0종합몰 건강/식품 의류/패션/잡화/뷰티193387.841191457542.791012<NA><NA><NA><NA>
63003110000201331101313020029320100414<NA>1영업/정상1정상영업<NA><NA><NA><NA>1644 1020<NA>122010서울특별시 은평구 응암동 ***번지 **호 *층서울특별시 은평구 가좌로*나길 * (응암동,*층)<NA>디앤와이스포츠2013-05-08 13:09:24I2021-12-03 22:02:00.0종합몰 기타192793.361854453687.777745<NA><NA><NA><NA>
21746311000020243110217302007092024-05-08<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 응암동 **-**서울특별시 은평구 은평로**나길 *-* (응암동)03471미소2024-05-08 14:50:25I2023-12-04 23:00:00.0종합몰 교육/도서/완구/오락 가전 컴퓨터/사무용품 건강/식품 레져/여행/공연 의류/패션/잡화/뷰티 가구/수납용품 성인/성인용품 상품권 자동차/자동차용품193693.517287455205.899815<NA><NA><NA><NA>
45893110000201031101313020070520101208<NA>3폐업3폐업처리20141124<NA><NA><NA><NA><NA>122060서울특별시 은평구 구산동 ***번지 **호 하나홈타운 ***호서울특별시 은평구 서오릉로**길 **-*, ***호 (구산동,하나홈타운)<NA>카시아(kasia)2014-11-24 17:49:14I2018-08-31 23:59:59.0종합몰192018.706536456743.181828<NA><NA><NA>인터넷
67243110000201331101313020078020131218<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 백련산로*길 **, ***동 ****호 (응암동, 백련산힐스테이트)122010어니스트 뷰티(honest B)2015-06-05 10:57:58I2018-08-31 23:59:59.0의류/패션/잡화/뷰티193238.672897454137.273838<NA><NA><NA>인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
76583110000201531101313020026320150324<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 387 5467<NA>122880서울특별시 은평구 신사동 **번지 **호서울특별시 은평구 갈현로*길 **-**, ***호 (신사동)122880앤큐브2017-12-13 23:06:08I2018-08-31 23:59:59.0종합몰191889.602612455428.043971<NA><NA><NA>인터넷
130293110000202031102173020058320200409<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-356-5532<NA><NA>서울특별시 은평구 응암동 ***번지 *호서울특별시 은평구 응암로 ***, *층 (응암동)03464예가악기2020-04-09 19:27:54I2020-04-11 00:23:21.0기타193084.571473455304.001885<NA><NA><NA>인터넷
155843110000202131102173020083420210413<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 응암동 ***-**서울특별시 은평구 가좌로*나길 **-**, *층 (응암동)03481지댕2021-04-13 16:14:40I2021-04-15 00:22:57.0의류/패션/잡화/뷰티192832.45618453539.035289<NA><NA><NA>인터넷
29333110000200831101113020012320080321<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA>122010서울특별시 은평구 응암동 **번지 **호 조이팰리스 ****호서울특별시 은평구 은평로 ***, ****호 (응암동,조이팰리스)<NA>레고머리2009-11-13 15:21:54I2018-08-31 23:59:59.0의류/패션/잡화/뷰티192800.079591455307.362525<NA><NA><NA>인터넷
17230311000020223110217302002312022-02-03<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 대조동 *-** 아이안빌서울특별시 은평구 통일로 ***-**, ****호 (대조동, 아이안빌)03396최셈상정2023-12-11 19:26:24U2022-11-01 23:03:00.0종합몰193542.444566456558.780144<NA><NA><NA><NA>
150803110000202131102173020031320210204<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 구산동 **-** 동일홈타운서울특별시 은평구 연서로*길 **-*, ***호 (구산동, 동일홈타운)03414개뽐뿌2021-02-04 17:10:31I2021-02-06 00:23:02.0기타192089.39194456151.089941<NA><NA><NA>인터넷
74153110000201431101313020080020141223<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 325 9562<NA>122060서울특별시 은평구 구산동 ***번지 경남아너스빌 ***동 ****호서울특별시 은평구 갈현로**길 **, ***동 ****호 (구산동, 경남아너스빌)122060디레인(D-rain)2014-12-23 17:35:39I2021-12-03 22:02:00.0종합몰 의류/패션/잡화/뷰티191575.605155455919.170241<NA><NA><NA><NA>
172503110000202231102173020025120220207<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 은평구 응암동 ***-**서울특별시 은평구 은평로*길 **-*, O.K 팰리스 (응암동)03453상호작용2022-02-07 13:02:18I2022-02-09 00:22:39.0종합몰192649.163626455002.485396000인터넷
110453110000201831101313020119220181213<NA>3폐업3폐업처리20211229<NA><NA><NA>02-737-1975<NA><NA>서울특별시 은평구 대조동 ***번지 **호서울특별시 은평구 통일로**길 * (대조동)03385남대문통상2021-12-29 10:32:35U2021-12-31 02:40:00.0기타193196.227873457042.670039000인터넷
11174311000020193110217302001132019-01-22<NA>3폐업3폐업처리2023-12-29<NA><NA><NA>02-373-1411<NA><NA>서울특별시 은평구 신사동 ***번지 **호서울특별시 은평구 가좌로 ***-*, *층 (신사동)03440우진상사2023-12-29 16:51:34U2022-11-01 21:01:00.0기타191856.992068454882.072737<NA><NA><NA><NA>