Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells78450
Missing cells (%)27.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory250.0 B

Variable types

Categorical9
Numeric5
DateTime8
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author동대문구
URLhttps://data.seoul.go.kr/dataList/OA-18807/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (68.6%)Imbalance
부채총액 is highly imbalanced (68.6%)Imbalance
자본금 is highly imbalanced (68.6%)Imbalance
판매방식명 is highly imbalanced (72.3%)Imbalance
인허가취소일자 has 9935 (99.4%) missing valuesMissing
폐업일자 has 6881 (68.8%) missing valuesMissing
휴업시작일자 has 9966 (99.7%) missing valuesMissing
휴업종료일자 has 9966 (99.7%) missing valuesMissing
재개업일자 has 9964 (99.6%) missing valuesMissing
전화번호 has 5576 (55.8%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8011 (80.1%) missing valuesMissing
지번주소 has 1318 (13.2%) missing valuesMissing
도로명주소 has 1288 (12.9%) missing valuesMissing
도로명우편번호 has 2984 (29.8%) missing valuesMissing
좌표정보(X) has 1280 (12.8%) missing valuesMissing
좌표정보(Y) has 1280 (12.8%) missing valuesMissing
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 11:09:34.303588
Analysis finished2024-04-06 11:09:37.884156
Duration3.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3050000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3050000
2nd row3050000
3rd row3050000
4th row3050000
5th row3050000

Common Values

ValueCountFrequency (%)
3050000 10000
100.0%

Length

2024-04-06T20:09:38.019075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:38.188017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3050000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0104494 × 1018
Minimum3.0501003 × 1014
Maximum2.024305 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:09:38.383785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.0501003 × 1014
5-th percentile2.004305 × 1018
Q12.010305 × 1018
median2.016305 × 1018
Q32.021305 × 1018
95-th percentile2.023305 × 1018
Maximum2.024305 × 1018
Range2.024 × 1018
Interquartile range (IQR)1.1000004 × 1016

Descriptive statistics

Standard deviation1.008368 × 1017
Coefficient of variation (CV)0.050156347
Kurtosis392.07174
Mean2.0104494 × 1018
Median Absolute Deviation (MAD)5 × 1015
Skewness-19.809871
Sum-2.4569039 × 1018
Variance1.016806 × 1034
MonotonicityNot monotonic
2024-04-06T20:09:38.680761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2023305021030201085 1
 
< 0.1%
2016305014030200232 1
 
< 0.1%
2021305014030200054 1
 
< 0.1%
2003305010030200572 1
 
< 0.1%
2019305014030201473 1
 
< 0.1%
2021305014030200891 1
 
< 0.1%
2023305021030200056 1
 
< 0.1%
2023305021030201972 1
 
< 0.1%
2004305010030201011 1
 
< 0.1%
2023305021030201046 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
305010030200154 1
< 0.1%
305010030200155 1
< 0.1%
305010030200157 1
< 0.1%
305010030200158 1
< 0.1%
305010030200160 1
< 0.1%
305010030200162 1
< 0.1%
305010030200165 1
< 0.1%
305010030200170 1
< 0.1%
305010030200177 1
< 0.1%
305010030200178 1
< 0.1%
ValueCountFrequency (%)
2024305021030200873 1
< 0.1%
2024305021030200869 1
< 0.1%
2024305021030200867 1
< 0.1%
2024305021030200862 1
< 0.1%
2024305021030200860 1
< 0.1%
2024305021030200858 1
< 0.1%
2024305021030200856 1
< 0.1%
2024305021030200855 1
< 0.1%
2024305021030200853 1
< 0.1%
2024305021030200850 1
< 0.1%
Distinct4036
Distinct (%)40.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1995-01-01 00:00:00
Maximum2024-04-03 00:00:00
2024-04-06T20:09:38.912124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:39.137334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

MISSING 

Distinct9
Distinct (%)13.8%
Missing9935
Missing (%)99.4%
Memory size156.2 KiB
Minimum2005-03-02 00:00:00
Maximum2023-04-10 00:00:00
2024-04-06T20:09:39.299445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:39.466166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5355 
3
2686 
4
1496 
5
 
439
2
 
24

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row4
4th row3
5th row1

Common Values

ValueCountFrequency (%)
1 5355
53.5%
3 2686
26.9%
4 1496
 
15.0%
5 439
 
4.4%
2 24
 
0.2%

Length

2024-04-06T20:09:39.665653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:39.841131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5355
53.5%
3 2686
26.9%
4 1496
 
15.0%
5 439
 
4.4%
2 24
 
0.2%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
5355 
폐업
2686 
취소/말소/만료/정지/중지
1496 
제외/삭제/전출
 
439
휴업
 
24

Length

Max length14
Median length5
Mean length5.6651
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row취소/말소/만료/정지/중지
4th row폐업
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 5355
53.5%
폐업 2686
26.9%
취소/말소/만료/정지/중지 1496
 
15.0%
제외/삭제/전출 439
 
4.4%
휴업 24
 
0.2%

Length

2024-04-06T20:09:40.011358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:40.173137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 5355
53.5%
폐업 2686
26.9%
취소/말소/만료/정지/중지 1496
 
15.0%
제외/삭제/전출 439
 
4.4%
휴업 24
 
0.2%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.5913
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:09:40.321032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.1060645
Coefficient of variation (CV)0.81274439
Kurtosis-0.017300843
Mean2.5913
Median Absolute Deviation (MAD)0
Skewness1.1436112
Sum25913
Variance4.4355079
MonotonicityNot monotonic
2024-04-06T20:09:40.530936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 5353
53.5%
3 2686
26.9%
7 1421
 
14.2%
5 439
 
4.4%
4 75
 
0.8%
2 24
 
0.2%
6 2
 
< 0.1%
ValueCountFrequency (%)
1 5353
53.5%
2 24
 
0.2%
3 2686
26.9%
4 75
 
0.8%
5 439
 
4.4%
6 2
 
< 0.1%
7 1421
 
14.2%
ValueCountFrequency (%)
7 1421
 
14.2%
6 2
 
< 0.1%
5 439
 
4.4%
4 75
 
0.8%
3 2686
26.9%
2 24
 
0.2%
1 5353
53.5%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
5353 
폐업처리
2686 
직권말소
1421 
타시군구이관
 
439
직권취소
 
75
Other values (2)
 
26

Length

Max length6
Median length4
Mean length4.0882
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row직권말소
4th row폐업처리
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 5353
53.5%
폐업처리 2686
26.9%
직권말소 1421
 
14.2%
타시군구이관 439
 
4.4%
직권취소 75
 
0.8%
휴업처리 24
 
0.2%
타시군구전입 2
 
< 0.1%

Length

2024-04-06T20:09:40.874610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:41.064993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 5353
53.5%
폐업처리 2686
26.9%
직권말소 1421
 
14.2%
타시군구이관 439
 
4.4%
직권취소 75
 
0.8%
휴업처리 24
 
0.2%
타시군구전입 2
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2050
Distinct (%)65.7%
Missing6881
Missing (%)68.8%
Memory size156.2 KiB
Minimum1997-04-18 00:00:00
Maximum2024-04-03 00:00:00
2024-04-06T20:09:41.264874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:41.629268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct33
Distinct (%)97.1%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-02-29 00:00:00
Maximum2024-03-25 00:00:00
2024-04-06T20:09:41.840805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:42.060450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

휴업종료일자
Date

MISSING 

Distinct31
Distinct (%)91.2%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-09-30 00:00:00
Maximum2099-12-31 00:00:00
2024-04-06T20:09:42.270815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:42.517262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)

재개업일자
Date

MISSING 

Distinct33
Distinct (%)91.7%
Missing9964
Missing (%)99.6%
Memory size156.2 KiB
Minimum2004-01-27 00:00:00
Maximum2023-08-09 00:00:00
2024-04-06T20:09:42.704643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:43.222268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

전화번호
Text

MISSING 

Distinct4118
Distinct (%)93.1%
Missing5576
Missing (%)55.8%
Memory size156.2 KiB
2024-04-06T20:09:43.556362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length9.738472
Min length1

Characters and Unicode

Total characters43083
Distinct characters19
Distinct categories7 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4020 ?
Unique (%)90.9%

Sample

1st row02-965-3333
2nd row070-4251-4932
3rd row2236-2311
4th row02-3394-5661
5th row3453-1992
ValueCountFrequency (%)
163
 
3.6%
02 63
 
1.4%
1 9
 
0.2%
070 7
 
0.2%
2212-8629 4
 
0.1%
6370-2020 4
 
0.1%
2242-8973 4
 
0.1%
2248-8386 3
 
0.1%
0 3
 
0.1%
070-5088-3947 3
 
0.1%
Other values (4126) 4214
94.1%
2024-04-06T20:09:44.173856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 6928
16.1%
- 6217
14.4%
0 5309
12.3%
7 3501
8.1%
4 3437
8.0%
9 3398
7.9%
6 2952
6.9%
1 2921
6.8%
5 2845
6.6%
3 2834
6.6%
Other values (9) 2741
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36761
85.3%
Dash Punctuation 6217
 
14.4%
Space Separator 64
 
0.1%
Other Punctuation 32
 
0.1%
Math Symbol 5
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 6928
18.8%
0 5309
14.4%
7 3501
9.5%
4 3437
9.3%
9 3398
9.2%
6 2952
8.0%
1 2921
7.9%
5 2845
7.7%
3 2834
7.7%
8 2636
 
7.2%
Other Punctuation
ValueCountFrequency (%)
. 26
81.2%
/ 3
 
9.4%
, 2
 
6.2%
' 1
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 6217
100.0%
Space Separator
ValueCountFrequency (%)
64
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 43083
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 6928
16.1%
- 6217
14.4%
0 5309
12.3%
7 3501
8.1%
4 3437
8.0%
9 3398
7.9%
6 2952
6.9%
1 2921
6.8%
5 2845
6.6%
3 2834
6.6%
Other values (9) 2741
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43083
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 6928
16.1%
- 6217
14.4%
0 5309
12.3%
7 3501
8.1%
4 3437
8.0%
9 3398
7.9%
6 2952
6.9%
1 2921
6.8%
5 2845
6.6%
3 2834
6.6%
Other values (9) 2741
 
6.4%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct155
Distinct (%)7.8%
Missing8011
Missing (%)80.1%
Infinite0
Infinite (%)0.0%
Mean130482.67
Minimum110843
Maximum140200
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:09:44.468678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum110843
5-th percentile130021
Q1130080
median130769
Q3130839
95-th percentile130872
Maximum140200
Range29357
Interquartile range (IQR)759

Descriptive statistics

Standard deviation632.07739
Coefficient of variation (CV)0.0048441481
Kurtosis498.22538
Mean130482.67
Median Absolute Deviation (MAD)103
Skewness-13.003035
Sum2.5953004 × 108
Variance399521.83
MonotonicityNot monotonic
2024-04-06T20:09:44.704417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
130100 171
 
1.7%
130070 120
 
1.2%
130030 101
 
1.0%
130060 78
 
0.8%
130110 64
 
0.6%
130842 56
 
0.6%
130020 55
 
0.5%
130823 55
 
0.5%
130864 55
 
0.5%
130090 50
 
0.5%
Other values (145) 1184
 
11.8%
(Missing) 8011
80.1%
ValueCountFrequency (%)
110843 1
 
< 0.1%
130010 28
 
0.3%
130011 4
 
< 0.1%
130012 2
 
< 0.1%
130020 55
0.5%
130021 13
 
0.1%
130022 5
 
0.1%
130023 4
 
< 0.1%
130024 2
 
< 0.1%
130030 101
1.0%
ValueCountFrequency (%)
140200 1
 
< 0.1%
135856 1
 
< 0.1%
130883 10
0.1%
130882 1
 
< 0.1%
130879 2
 
< 0.1%
130878 14
0.1%
130876 19
0.2%
130875 21
0.2%
130874 9
0.1%
130873 13
0.1%

지번주소
Text

MISSING 

Distinct3053
Distinct (%)35.2%
Missing1318
Missing (%)13.2%
Memory size156.2 KiB
2024-04-06T20:09:45.080921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length46
Mean length24.918337
Min length2

Characters and Unicode

Total characters216341
Distinct characters450
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2221 ?
Unique (%)25.6%

Sample

1st row서울특별시 동대문구 장안동 ***-* 광성빌딩
2nd row서울특별시 동대문구 용두동 ***번지 한방천하포스빌 ****호
3rd row***-* 신삼송빌딩*층
4th row서울특별시 동대문구 답십리동 ***번지 **호
5th row서울특별시 동대문구 이문*동 ***번지 **호
ValueCountFrequency (%)
서울특별시 7547
17.7%
동대문구 7543
17.7%
4701
11.0%
4554
10.7%
번지 4229
9.9%
장안동 1686
 
4.0%
답십리동 1047
 
2.5%
용두동 1010
 
2.4%
제기동 705
 
1.7%
휘경동 679
 
1.6%
Other values (1725) 8857
20.8%
2024-04-06T20:09:45.810182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 45216
20.9%
33980
15.7%
16360
 
7.6%
8265
 
3.8%
8104
 
3.7%
7692
 
3.6%
7634
 
3.5%
7610
 
3.5%
7558
 
3.5%
7549
 
3.5%
Other values (440) 66373
30.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 131140
60.6%
Other Punctuation 45303
 
20.9%
Space Separator 33980
 
15.7%
Dash Punctuation 4035
 
1.9%
Uppercase Letter 685
 
0.3%
Decimal Number 650
 
0.3%
Lowercase Letter 321
 
0.1%
Close Punctuation 109
 
0.1%
Open Punctuation 109
 
0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16360
 
12.5%
8265
 
6.3%
8104
 
6.2%
7692
 
5.9%
7634
 
5.8%
7610
 
5.8%
7558
 
5.8%
7549
 
5.8%
7547
 
5.8%
4927
 
3.8%
Other values (375) 47894
36.5%
Uppercase Letter
ValueCountFrequency (%)
K 131
19.1%
S 115
16.8%
B 79
11.5%
T 64
9.3%
A 53
7.7%
C 39
 
5.7%
Y 25
 
3.6%
R 23
 
3.4%
J 17
 
2.5%
E 17
 
2.5%
Other values (14) 122
17.8%
Lowercase Letter
ValueCountFrequency (%)
e 110
34.3%
o 51
15.9%
w 50
15.6%
r 49
15.3%
s 12
 
3.7%
a 6
 
1.9%
c 6
 
1.9%
l 6
 
1.9%
t 6
 
1.9%
b 4
 
1.2%
Other values (10) 21
 
6.5%
Decimal Number
ValueCountFrequency (%)
1 138
21.2%
3 78
12.0%
4 71
10.9%
2 68
10.5%
0 65
10.0%
5 56
8.6%
9 47
 
7.2%
8 43
 
6.6%
6 42
 
6.5%
7 42
 
6.5%
Other Punctuation
ValueCountFrequency (%)
* 45216
99.8%
, 66
 
0.1%
. 12
 
< 0.1%
& 5
 
< 0.1%
/ 4
 
< 0.1%
Space Separator
ValueCountFrequency (%)
33980
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4035
100.0%
Close Punctuation
ValueCountFrequency (%)
) 109
100.0%
Open Punctuation
ValueCountFrequency (%)
( 109
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 131140
60.6%
Common 84192
38.9%
Latin 1009
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16360
 
12.5%
8265
 
6.3%
8104
 
6.2%
7692
 
5.9%
7634
 
5.8%
7610
 
5.8%
7558
 
5.8%
7549
 
5.8%
7547
 
5.8%
4927
 
3.8%
Other values (375) 47894
36.5%
Latin
ValueCountFrequency (%)
K 131
13.0%
S 115
11.4%
e 110
10.9%
B 79
 
7.8%
T 64
 
6.3%
A 53
 
5.3%
o 51
 
5.1%
w 50
 
5.0%
r 49
 
4.9%
C 39
 
3.9%
Other values (35) 268
26.6%
Common
ValueCountFrequency (%)
* 45216
53.7%
33980
40.4%
- 4035
 
4.8%
1 138
 
0.2%
) 109
 
0.1%
( 109
 
0.1%
3 78
 
0.1%
4 71
 
0.1%
2 68
 
0.1%
, 66
 
0.1%
Other values (10) 322
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 131139
60.6%
ASCII 85198
39.4%
Number Forms 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 45216
53.1%
33980
39.9%
- 4035
 
4.7%
1 138
 
0.2%
K 131
 
0.2%
S 115
 
0.1%
e 110
 
0.1%
) 109
 
0.1%
( 109
 
0.1%
B 79
 
0.1%
Other values (54) 1176
 
1.4%
Hangul
ValueCountFrequency (%)
16360
 
12.5%
8265
 
6.3%
8104
 
6.2%
7692
 
5.9%
7634
 
5.8%
7610
 
5.8%
7558
 
5.8%
7549
 
5.8%
7547
 
5.8%
4927
 
3.8%
Other values (374) 47893
36.5%
Number Forms
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct4902
Distinct (%)56.3%
Missing1288
Missing (%)12.9%
Memory size156.2 KiB
2024-04-06T20:09:46.281898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length53
Mean length36.183884
Min length14

Characters and Unicode

Total characters315234
Distinct characters482
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3632 ?
Unique (%)41.7%

Sample

1st row서울특별시 동대문구 장한로 **, 광성빌딩 *층 ***호 (장안동)
2nd row서울특별시 동대문구 약령중앙로**길 *, 지층 *호 (제기동, 한성빌딩)
3rd row서울특별시 동대문구 왕산로 ***, ****호 (용두동,한방천하포스빌)
4th row서울특별시 동대문구 답십리로 ***, *층 (장안동)
5th row서울특별시 동대문구 천호대로**가길 * (답십리동)
ValueCountFrequency (%)
서울특별시 8712
15.0%
동대문구 8699
15.0%
8574
14.8%
4656
 
8.0%
2615
 
4.5%
장안동 1807
 
3.1%
1653
 
2.9%
답십리동 1106
 
1.9%
용두동 1037
 
1.8%
제기동 769
 
1.3%
Other values (2365) 18351
31.7%
2024-04-06T20:09:47.015360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 53859
17.1%
49463
 
15.7%
20500
 
6.5%
10685
 
3.4%
, 10367
 
3.3%
9834
 
3.1%
9295
 
2.9%
9077
 
2.9%
9015
 
2.9%
) 8779
 
2.8%
Other values (472) 124360
39.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 180095
57.1%
Other Punctuation 64244
 
20.4%
Space Separator 49463
 
15.7%
Close Punctuation 8779
 
2.8%
Open Punctuation 8779
 
2.8%
Dash Punctuation 1805
 
0.6%
Uppercase Letter 987
 
0.3%
Decimal Number 860
 
0.3%
Lowercase Letter 208
 
0.1%
Math Symbol 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20500
 
11.4%
10685
 
5.9%
9834
 
5.5%
9295
 
5.2%
9077
 
5.0%
9015
 
5.0%
8723
 
4.8%
8715
 
4.8%
8713
 
4.8%
8712
 
4.8%
Other values (406) 76826
42.7%
Uppercase Letter
ValueCountFrequency (%)
B 180
18.2%
C 144
14.6%
A 133
13.5%
K 127
12.9%
S 122
12.4%
R 36
 
3.6%
T 30
 
3.0%
Y 24
 
2.4%
J 20
 
2.0%
E 17
 
1.7%
Other values (14) 154
15.6%
Lowercase Letter
ValueCountFrequency (%)
e 81
38.9%
o 20
 
9.6%
w 18
 
8.7%
r 17
 
8.2%
s 14
 
6.7%
b 11
 
5.3%
a 9
 
4.3%
t 6
 
2.9%
l 6
 
2.9%
c 6
 
2.9%
Other values (10) 20
 
9.6%
Decimal Number
ValueCountFrequency (%)
1 199
23.1%
2 140
16.3%
0 124
14.4%
3 87
10.1%
4 70
 
8.1%
6 59
 
6.9%
5 53
 
6.2%
9 49
 
5.7%
7 42
 
4.9%
8 37
 
4.3%
Other Punctuation
ValueCountFrequency (%)
* 53859
83.8%
, 10367
 
16.1%
. 9
 
< 0.1%
& 5
 
< 0.1%
/ 4
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 10
90.9%
+ 1
 
9.1%
Space Separator
ValueCountFrequency (%)
49463
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8779
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8779
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1805
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 180095
57.1%
Common 133941
42.5%
Latin 1198
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20500
 
11.4%
10685
 
5.9%
9834
 
5.5%
9295
 
5.2%
9077
 
5.0%
9015
 
5.0%
8723
 
4.8%
8715
 
4.8%
8713
 
4.8%
8712
 
4.8%
Other values (406) 76826
42.7%
Latin
ValueCountFrequency (%)
B 180
15.0%
C 144
12.0%
A 133
11.1%
K 127
10.6%
S 122
10.2%
e 81
 
6.8%
R 36
 
3.0%
T 30
 
2.5%
Y 24
 
2.0%
o 20
 
1.7%
Other values (35) 301
25.1%
Common
ValueCountFrequency (%)
* 53859
40.2%
49463
36.9%
, 10367
 
7.7%
) 8779
 
6.6%
( 8779
 
6.6%
- 1805
 
1.3%
1 199
 
0.1%
2 140
 
0.1%
0 124
 
0.1%
3 87
 
0.1%
Other values (11) 339
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 180094
57.1%
ASCII 135136
42.9%
Number Forms 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 53859
39.9%
49463
36.6%
, 10367
 
7.7%
) 8779
 
6.5%
( 8779
 
6.5%
- 1805
 
1.3%
1 199
 
0.1%
B 180
 
0.1%
C 144
 
0.1%
2 140
 
0.1%
Other values (55) 1421
 
1.1%
Hangul
ValueCountFrequency (%)
20500
 
11.4%
10685
 
5.9%
9834
 
5.5%
9295
 
5.2%
9077
 
5.0%
9015
 
5.0%
8723
 
4.8%
8715
 
4.8%
8713
 
4.8%
8712
 
4.8%
Other values (405) 76825
42.7%
Number Forms
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct429
Distinct (%)6.1%
Missing2984
Missing (%)29.8%
Memory size156.2 KiB
2024-04-06T20:09:47.512910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.2112315
Min length5

Characters and Unicode

Total characters36562
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)0.8%

Sample

1st row02629
2nd row130864
3rd row02624
4th row130870
5th row02587
ValueCountFrequency (%)
02445 176
 
2.5%
02586 163
 
2.3%
02624 157
 
2.2%
02585 98
 
1.4%
130864 96
 
1.4%
02584 87
 
1.2%
02582 76
 
1.1%
02639 72
 
1.0%
130804 65
 
0.9%
02633 64
 
0.9%
Other values (419) 5962
85.0%
2024-04-06T20:09:48.238453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8408
23.0%
2 7394
20.2%
5 3991
10.9%
4 3255
 
8.9%
3 2981
 
8.2%
6 2872
 
7.9%
1 2506
 
6.9%
8 2445
 
6.7%
7 1438
 
3.9%
9 1160
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36450
99.7%
Dash Punctuation 112
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8408
23.1%
2 7394
20.3%
5 3991
10.9%
4 3255
 
8.9%
3 2981
 
8.2%
6 2872
 
7.9%
1 2506
 
6.9%
8 2445
 
6.7%
7 1438
 
3.9%
9 1160
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36562
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8408
23.0%
2 7394
20.2%
5 3991
10.9%
4 3255
 
8.9%
3 2981
 
8.2%
6 2872
 
7.9%
1 2506
 
6.9%
8 2445
 
6.7%
7 1438
 
3.9%
9 1160
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36562
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8408
23.0%
2 7394
20.2%
5 3991
10.9%
4 3255
 
8.9%
3 2981
 
8.2%
6 2872
 
7.9%
1 2506
 
6.9%
8 2445
 
6.7%
7 1438
 
3.9%
9 1160
 
3.2%
Distinct9830
Distinct (%)98.3%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-06T20:09:48.797431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length36
Mean length6.8027803
Min length1

Characters and Unicode

Total characters68021
Distinct characters1094
Distinct categories13 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9675 ?
Unique (%)96.8%

Sample

1st row주식회사 티엠에스티
2nd row스베나(SVENA)
3rd row폼나니
4th row삼성케녹스카메라서비스(주)
5th row인모엘
ValueCountFrequency (%)
주식회사 469
 
3.7%
137
 
1.1%
company 44
 
0.3%
컴퍼니 32
 
0.3%
20
 
0.2%
korea 20
 
0.2%
스튜디오 18
 
0.1%
16
 
0.1%
co.,ltd 16
 
0.1%
shop 14
 
0.1%
Other values (10974) 11826
93.8%
2024-04-06T20:09:49.590913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2619
 
3.9%
2314
 
3.4%
) 2207
 
3.2%
( 2201
 
3.2%
1808
 
2.7%
1270
 
1.9%
1048
 
1.5%
870
 
1.3%
827
 
1.2%
e 824
 
1.2%
Other values (1084) 52033
76.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46274
68.0%
Lowercase Letter 7614
 
11.2%
Uppercase Letter 6108
 
9.0%
Space Separator 2619
 
3.9%
Close Punctuation 2208
 
3.2%
Open Punctuation 2202
 
3.2%
Decimal Number 522
 
0.8%
Other Punctuation 340
 
0.5%
Dash Punctuation 96
 
0.1%
Other Symbol 27
 
< 0.1%
Other values (3) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2314
 
5.0%
1808
 
3.9%
1270
 
2.7%
1048
 
2.3%
870
 
1.9%
827
 
1.8%
661
 
1.4%
619
 
1.3%
615
 
1.3%
611
 
1.3%
Other values (999) 35631
77.0%
Lowercase Letter
ValueCountFrequency (%)
e 824
 
10.8%
o 797
 
10.5%
a 670
 
8.8%
n 572
 
7.5%
i 566
 
7.4%
r 469
 
6.2%
t 428
 
5.6%
l 400
 
5.3%
s 375
 
4.9%
m 327
 
4.3%
Other values (16) 2186
28.7%
Uppercase Letter
ValueCountFrequency (%)
O 518
 
8.5%
A 509
 
8.3%
E 444
 
7.3%
S 425
 
7.0%
N 372
 
6.1%
C 333
 
5.5%
T 331
 
5.4%
M 327
 
5.4%
I 309
 
5.1%
L 302
 
4.9%
Other values (16) 2238
36.6%
Other Punctuation
ValueCountFrequency (%)
. 187
55.0%
& 75
22.1%
, 39
 
11.5%
' 20
 
5.9%
/ 8
 
2.4%
# 3
 
0.9%
? 2
 
0.6%
: 2
 
0.6%
@ 1
 
0.3%
% 1
 
0.3%
Other values (2) 2
 
0.6%
Decimal Number
ValueCountFrequency (%)
1 111
21.3%
2 103
19.7%
9 56
10.7%
0 53
10.2%
5 46
8.8%
3 42
 
8.0%
4 41
 
7.9%
8 24
 
4.6%
7 23
 
4.4%
6 23
 
4.4%
Close Punctuation
ValueCountFrequency (%)
) 2207
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2201
> 99.9%
[ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 1
50.0%
~ 1
50.0%
Space Separator
ValueCountFrequency (%)
2619
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 96
100.0%
Other Symbol
ValueCountFrequency (%)
27
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46277
68.0%
Latin 13722
 
20.2%
Common 7998
 
11.8%
Han 19
 
< 0.1%
Hiragana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2314
 
5.0%
1808
 
3.9%
1270
 
2.7%
1048
 
2.3%
870
 
1.9%
827
 
1.8%
661
 
1.4%
619
 
1.3%
615
 
1.3%
611
 
1.3%
Other values (977) 35634
77.0%
Latin
ValueCountFrequency (%)
e 824
 
6.0%
o 797
 
5.8%
a 670
 
4.9%
n 572
 
4.2%
i 566
 
4.1%
O 518
 
3.8%
A 509
 
3.7%
r 469
 
3.4%
E 444
 
3.2%
t 428
 
3.1%
Other values (42) 7925
57.8%
Common
ValueCountFrequency (%)
2619
32.7%
) 2207
27.6%
( 2201
27.5%
. 187
 
2.3%
1 111
 
1.4%
2 103
 
1.3%
- 96
 
1.2%
& 75
 
0.9%
9 56
 
0.7%
0 53
 
0.7%
Other values (22) 290
 
3.6%
Han
ValueCountFrequency (%)
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (8) 8
42.1%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46250
68.0%
ASCII 21719
31.9%
None 28
 
< 0.1%
CJK 17
 
< 0.1%
Hiragana 5
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2619
 
12.1%
) 2207
 
10.2%
( 2201
 
10.1%
e 824
 
3.8%
o 797
 
3.7%
a 670
 
3.1%
n 572
 
2.6%
i 566
 
2.6%
O 518
 
2.4%
A 509
 
2.3%
Other values (73) 10236
47.1%
Hangul
ValueCountFrequency (%)
2314
 
5.0%
1808
 
3.9%
1270
 
2.7%
1048
 
2.3%
870
 
1.9%
827
 
1.8%
661
 
1.4%
619
 
1.3%
615
 
1.3%
611
 
1.3%
Other values (976) 35607
77.0%
None
ValueCountFrequency (%)
27
96.4%
1
 
3.6%
CJK
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Hiragana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Distinct9305
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-10-29 16:30:23
Maximum2024-04-03 17:27:48
2024-04-06T20:09:49.799073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:50.014511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
8105 
U
1895 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 8105
81.0%
U 1895
 
18.9%

Length

2024-04-06T20:09:50.238434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:50.391110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 8105
81.0%
u 1895
 
18.9%
Distinct1450
Distinct (%)14.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:05:00
2024-04-06T20:09:50.550123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:09:50.822067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct418
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T20:09:51.103180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.5726
Min length1

Characters and Unicode

Total characters85726
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique271 ?
Unique (%)2.7%

Sample

1st row의류/패션/잡화/뷰티
2nd row의류/패션/잡화/뷰티
3rd row의류/패션/잡화/뷰티
4th row-
5th row의류/패션/잡화/뷰티
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4353
33.3%
종합몰 2324
17.8%
1512
 
11.6%
기타 1483
 
11.3%
건강/식품 960
 
7.3%
교육/도서/완구/오락 529
 
4.0%
컴퓨터/사무용품 403
 
3.1%
가구/수납용품 387
 
3.0%
자동차/자동차용품 367
 
2.8%
가전 348
 
2.7%
Other values (3) 401
 
3.1%
2024-04-06T20:09:51.661685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 17351
20.2%
4353
 
5.1%
4353
 
5.1%
4353
 
5.1%
4353
 
5.1%
4353
 
5.1%
4353
 
5.1%
4353
 
5.1%
4353
 
5.1%
3067
 
3.6%
Other values (41) 30484
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63796
74.4%
Other Punctuation 17351
 
20.2%
Space Separator 3067
 
3.6%
Dash Punctuation 1512
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
2324
 
3.6%
2324
 
3.6%
Other values (38) 24324
38.1%
Other Punctuation
ValueCountFrequency (%)
/ 17351
100.0%
Space Separator
ValueCountFrequency (%)
3067
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1512
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63796
74.4%
Common 21930
 
25.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
2324
 
3.6%
2324
 
3.6%
Other values (38) 24324
38.1%
Common
ValueCountFrequency (%)
/ 17351
79.1%
3067
 
14.0%
- 1512
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63796
74.4%
ASCII 21930
 
25.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 17351
79.1%
3067
 
14.0%
- 1512
 
6.9%
Hangul
ValueCountFrequency (%)
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
4353
 
6.8%
2324
 
3.6%
2324
 
3.6%
Other values (38) 24324
38.1%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct4166
Distinct (%)47.8%
Missing1280
Missing (%)12.8%
Infinite0
Infinite (%)0.0%
Mean204586.1
Minimum199232.17
Maximum219309.84
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:09:51.881654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum199232.17
5-th percentile202257.99
Q1203340.69
median204870.94
Q3205746.05
95-th percentile206329.14
Maximum219309.84
Range20077.666
Interquartile range (IQR)2405.3646

Descriptive statistics

Standard deviation1327.8685
Coefficient of variation (CV)0.0064905116
Kurtosis0.64039101
Mean204586.1
Median Absolute Deviation (MAD)1034.6184
Skewness-0.21591879
Sum1.7839908 × 109
Variance1763234.7
MonotonicityNot monotonic
2024-04-06T20:09:52.134159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
205022.515415062 155
 
1.6%
203075.098504907 95
 
0.9%
204439.678555494 88
 
0.9%
206590.046202018 63
 
0.6%
206179.384647281 63
 
0.6%
205271.704936121 60
 
0.6%
202257.985695221 55
 
0.5%
206521.405320134 54
 
0.5%
206090.434292925 50
 
0.5%
205668.475124572 49
 
0.5%
Other values (4156) 7988
79.9%
(Missing) 1280
 
12.8%
ValueCountFrequency (%)
199232.174339254 1
 
< 0.1%
200501.092299817 1
 
< 0.1%
200788.034412516 1
 
< 0.1%
201995.643218229 3
 
< 0.1%
201997.95066552 1
 
< 0.1%
201998.969766069 1
 
< 0.1%
202012.621510207 1
 
< 0.1%
202013.401502902 3
 
< 0.1%
202014.988956307 13
0.1%
202022.650035503 2
 
< 0.1%
ValueCountFrequency (%)
219309.840028553 1
 
< 0.1%
210396.44570146 1
 
< 0.1%
207026.483549278 1
 
< 0.1%
206676.04734862 3
< 0.1%
206649.304119443 2
< 0.1%
206643.334997521 1
 
< 0.1%
206641.051820999 1
 
< 0.1%
206637.16421148 1
 
< 0.1%
206630.901669352 1
 
< 0.1%
206623.222570241 2
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct4164
Distinct (%)47.8%
Missing1280
Missing (%)12.8%
Infinite0
Infinite (%)0.0%
Mean452843.68
Minimum433205.91
Maximum477907.96
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:09:52.426173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum433205.91
5-th percentile451258.36
Q1452106.57
median452677.08
Q3453461.62
95-th percentile454895.8
Maximum477907.96
Range44702.047
Interquartile range (IQR)1355.0495

Descriptive statistics

Standard deviation1124.7325
Coefficient of variation (CV)0.0024837103
Kurtosis39.042616
Mean452843.68
Median Absolute Deviation (MAD)654.66262
Skewness1.0717521
Sum3.9487968 × 109
Variance1265023.2
MonotonicityNot monotonic
2024-04-06T20:09:52.645026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
454212.542182175 155
 
1.6%
452918.836458244 95
 
0.9%
453411.742856906 63
 
0.6%
452410.619464697 63
 
0.6%
452706.897879436 60
 
0.6%
452478.339896363 55
 
0.5%
451940.119059947 54
 
0.5%
452393.003188348 54
 
0.5%
452102.733064064 50
 
0.5%
452777.056578271 49
 
0.5%
Other values (4154) 8022
80.2%
(Missing) 1280
 
12.8%
ValueCountFrequency (%)
433205.912545272 1
 
< 0.1%
442893.091826117 1
 
< 0.1%
447789.670429863 1
 
< 0.1%
448396.985153331 1
 
< 0.1%
450994.913744005 10
0.1%
451004.764350926 1
 
< 0.1%
451011.194201757 3
 
< 0.1%
451014.511031732 1
 
< 0.1%
451016.227910914 3
 
< 0.1%
451024.506229475 1
 
< 0.1%
ValueCountFrequency (%)
477907.959332858 1
 
< 0.1%
456116.027162 1
 
< 0.1%
455955.382095144 1
 
< 0.1%
455840.326731611 1
 
< 0.1%
455813.713540339 2
 
< 0.1%
455811.048113466 1
 
< 0.1%
455808.419175247 1
 
< 0.1%
455801.638525525 34
0.3%
455797.417581881 1
 
< 0.1%
455779.74269345 3
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9434 
0
 
566

Length

Max length4
Median length4
Mean length3.8302
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9434
94.3%
0 566
 
5.7%

Length

2024-04-06T20:09:52.873830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:53.065142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9434
94.3%
0 566
 
5.7%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9434 
0
 
566

Length

Max length4
Median length4
Mean length3.8302
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9434
94.3%
0 566
 
5.7%

Length

2024-04-06T20:09:53.249818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:53.769377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9434
94.3%
0 566
 
5.7%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9434 
0
 
566

Length

Max length4
Median length4
Mean length3.8302
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9434
94.3%
0 566
 
5.7%

Length

2024-04-06T20:09:53.944750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:09:54.115351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9434
94.3%
0 566
 
5.7%

판매방식명
Categorical

IMBALANCE 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5002 
인터넷
4758 
인터넷, 기타
 
73
기타
 
35
TV홈쇼핑, 인터넷
 
35
Other values (16)
 
97

Length

Max length26
Median length4
Mean length3.6701
Min length2

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row인터넷
3rd row인터넷
4th row<NA>
5th row인터넷

Common Values

ValueCountFrequency (%)
<NA> 5002
50.0%
인터넷 4758
47.6%
인터넷, 기타 73
 
0.7%
기타 35
 
0.4%
TV홈쇼핑, 인터넷 35
 
0.4%
인터넷, 카다로그 21
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 19
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 10
 
0.1%
TV홈쇼핑 9
 
0.1%
인터넷, 카다로그, 신문잡지, 기타 8
 
0.1%
Other values (11) 30
 
0.3%

Length

2024-04-06T20:09:54.306119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5002
48.5%
인터넷 4950
48.0%
기타 146
 
1.4%
tv홈쇼핑 83
 
0.8%
카다로그 75
 
0.7%
신문잡지 54
 
0.5%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
25060305000020233050210302010852023-06-15<NA>1영업/정상01정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 장안동 ***-* 광성빌딩서울특별시 동대문구 장한로 **, 광성빌딩 *층 ***호 (장안동)02629주식회사 티엠에스티2023-06-15 13:38:55I2022-12-05 23:07:00.0의류/패션/잡화/뷰티205821.493944451425.917121<NA><NA><NA><NA>
109723050000201430501403020077220140808<NA>1영업/정상01정상영업<NA><NA><NA><NA>02-965-3333<NA><NA><NA>서울특별시 동대문구 약령중앙로**길 *, 지층 *호 (제기동, 한성빌딩)130864스베나(SVENA)2014-08-12 17:30:41I2018-08-31 23:59:59.0의류/패션/잡화/뷰티203275.594246453367.928423<NA><NA><NA>인터넷
81783050000201130501403020028220110919<NA>4취소/말소/만료/정지/중지07직권말소<NA><NA><NA><NA>070-4251-4932<NA>130070서울특별시 동대문구 용두동 ***번지 한방천하포스빌 ****호서울특별시 동대문구 왕산로 ***, ****호 (용두동,한방천하포스빌)<NA>폼나니2015-05-22 16:49:06I2018-08-31 23:59:59.0의류/패션/잡화/뷰티203188.427527452835.044819<NA><NA><NA>인터넷
11453050000200330501003020096520030331<NA>3폐업03폐업처리20120223<NA><NA><NA>2236-2311<NA><NA>***-* 신삼송빌딩*층<NA><NA>삼성케녹스카메라서비스(주)2012-02-23 17:47:41I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
116203050000201530501403020032920150327<NA>1영업/정상01정상영업<NA><NA><NA><NA>02-3394-5661<NA><NA><NA>서울특별시 동대문구 답십리로 ***, *층 (장안동)02624인모엘2016-10-06 13:15:03I2018-08-31 23:59:59.0의류/패션/잡화/뷰티206151.12568452213.55584<NA><NA><NA>인터넷
34303050000200630501003020314120061212<NA>3폐업03폐업처리20090924<NA><NA><NA>3453-1992<NA><NA>서울특별시 동대문구 답십리동 ***번지 **호서울특별시 동대문구 천호대로**가길 * (답십리동)<NA>OMS2009-09-24 15:09:24I2021-12-03 22:02:00.0-204569.371498451870.871119<NA><NA><NA><NA>
56623050000200930501003020029820090406<NA>3폐업03폐업처리20090708<NA><NA><NA>1644-6544<NA>130825서울특별시 동대문구 이문*동 ***번지 **호서울특별시 동대문구 한천로 ***-* (이문동)<NA>S.A.soft2009-07-08 14:24:32I2018-08-31 23:59:59.0의류/패션/잡화/뷰티205883.761378454954.320452<NA><NA><NA>인터넷
89123050000201230501403020054420120605<NA>4취소/말소/만료/정지/중지07직권말소<NA><NA><NA><NA>962-5858<NA><NA>서울특별시 동대문구 청량리동 738번지 청량리현대코아동관4층서울특별시 동대문구 홍릉로 13, 4층 (청량리동, 청량리현대코아동관)130870한국모링가2022-12-26 15:30:58U2021-11-01 22:08:00.0건강/식품203851.375425453238.594218<NA><NA><NA><NA>
36183050000200730501003020015920070530<NA>3폐업03폐업처리20071206<NA><NA><NA><NA><NA>130823서울특별시 동대문구 용두동 ***번지 *호 ***호서울특별시 동대문구 무학로**길 **-*, ***호 (용두동)<NA>매니쉬걸(manishgirl)2007-12-10 14:27:04I2018-08-31 23:59:59.0기타202638.051765453139.962527<NA><NA><NA>인터넷
26547305000020243050210302004402024-02-02<NA>1영업/정상01정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 용두동 *** e편한세상 청계센트럴포레 아파트서울특별시 동대문구 무학로 **, ***동 ***호 (용두동, e편한세상 청계센트럴포레 아파트)02587디케이컴퍼니2024-02-05 11:36:44I2023-12-02 00:07:00.0교육/도서/완구/오락 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티<NA><NA><NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
185883050000202030501403020112720200619<NA>1영업/정상01정상영업<NA><NA><NA><NA>--<NA><NA>서울특별시 동대문구 답십리동 ***번지 답십리한화아파트서울특별시 동대문구 고미술로 **, ***동 *층 *호 (답십리동, 답십리한화아파트)02604봄봄바스2020-06-19 17:02:27I2020-06-21 00:23:27.0종합몰204698.831171451628.940828<NA><NA><NA>인터넷
78143050000201130501003020060720110524<NA>3폐업03폐업처리20130527<NA><NA><NA>070-8628-1695<NA>130752서울특별시 동대문구 답십리동 ***번지 *통 *반 답십리대우아파트 ***동 ****호서울특별시 동대문구 답십리로**길 **, ***동 ****호 (답십리동,답십리대우아파트)<NA>더프라임2013-05-27 09:44:03I2018-08-31 23:59:59.0기타204650.613807452346.485061<NA><NA><NA>인터넷
11003050000200330501003020074120031124<NA>3폐업03폐업처리20060808<NA><NA><NA>2242-3548<NA><NA>*-* 두산(아)***-***<NA><NA>meihinn2007-12-11 15:33:33I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
26260305000020243050210302001532024-01-15<NA>1영업/정상01정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 제기동 *** 안암골 벽산아파트서울특별시 동대문구 약령시로 **, ***동 ***호 (제기동, 안암골 벽산아파트)02477지나랜드2024-01-15 08:52:31I2023-11-30 23:07:00.0종합몰202748.961555453461.616078<NA><NA><NA><NA>
23125305000020223050140302012262022-07-21<NA>5제외/삭제/전출05타시군구이관2023-10-26<NA><NA><NA><NA><NA><NA>서울특별시 동대문구 장안동 ***-**서울특별시 동대문구 장한로**길 **-**, 나동 (장안동)02519맑은하늘2023-10-26 13:30:18U2022-10-30 22:08:00.0종합몰206412.662381452862.640405<NA><NA><NA><NA>
22967305000020223050140302010682022-06-28<NA>3폐업03폐업처리2023-09-19<NA><NA><NA><NA><NA><NA>서울특별시 동대문구 장안동 *** 위더스빌서울특별시 동대문구 한천로 **, C동 ***호 (장안동, 위더스빌)02631영마마스토어2023-09-19 09:01:38U2022-12-08 22:01:00.0종합몰205423.969492451556.210578<NA><NA><NA><NA>
25525305000020233050210302015502023-09-07<NA>1영업/정상01정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 제기동 ***-**서울특별시 동대문구 고산자로 *** (제기동)02465인더헤세드2023-09-07 13:15:03I2022-12-09 00:09:00.0종합몰203201.885149454080.881211<NA><NA><NA><NA>
67193050000201030501003020050520100607<NA>4취소/말소/만료/정지/중지07직권말소<NA><NA><NA><NA><NA><NA>130070서울특별시 동대문구 용두동 ***번지 롯데캐슬피렌체 ***동 ****호서울특별시 동대문구 정릉천동로 **, ***동 ****호 (용두동,롯데캐슬피렌체)<NA>유토피아2014-02-10 12:41:50I2018-08-31 23:59:59.0종합몰203085.866786452766.014196<NA><NA><NA>인터넷
51123050000200830501003020068120080822<NA>1영업/정상01정상영업<NA><NA><NA><NA>966-7458<NA>130010서울특별시 동대문구 청량리동 ***번지 홍릉동부아파트 ***동 ***호서울특별시 동대문구 홍릉로**길 **, ***동 ***호 (청량리동,홍릉동부아파트)<NA>폰월드2008-08-22 17:13:38I2018-08-31 23:59:59.0기타203995.391422453762.252766<NA><NA><NA>인터넷
142303050000201730501403020083320130322<NA>1영업/정상01정상영업<NA><NA><NA><NA>307-4092<NA><NA>서울특별시 동대문구 장안동 ***번지 *호서울특별시 동대문구 장한로**가길 ***-* (장안동, 후암빌딩)02519(주)박의지2017-07-26 09:29:01I2018-08-31 23:59:59.0의류/패션/잡화/뷰티206448.775565452924.887247<NA><NA><NA>인터넷