Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells78768
Missing cells (%)27.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory251.0 B

Variable types

Categorical9
Numeric5
DateTime8
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author성동구
URLhttps://data.seoul.go.kr/dataList/OA-18805/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (66.3%)Imbalance
부채총액 is highly imbalanced (66.3%)Imbalance
자본금 is highly imbalanced (66.3%)Imbalance
판매방식명 is highly imbalanced (73.4%)Imbalance
인허가취소일자 has 9997 (> 99.9%) missing valuesMissing
폐업일자 has 6770 (67.7%) missing valuesMissing
휴업시작일자 has 9966 (99.7%) missing valuesMissing
휴업종료일자 has 9966 (99.7%) missing valuesMissing
재개업일자 has 9993 (99.9%) missing valuesMissing
전화번호 has 4772 (47.7%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 9297 (93.0%) missing valuesMissing
지번주소 has 1677 (16.8%) missing valuesMissing
도로명주소 has 1497 (15.0%) missing valuesMissing
도로명우편번호 has 2267 (22.7%) missing valuesMissing
좌표정보(X) has 1283 (12.8%) missing valuesMissing
좌표정보(Y) has 1283 (12.8%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = 26.46737292)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 08:30:48.180144
Analysis finished2024-05-11 08:30:49.995259
Duration1.82 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3030000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3030000
2nd row3030000
3rd row3030000
4th row3030000
5th row3030000

Common Values

ValueCountFrequency (%)
3030000 10000
100.0%

Length

2024-05-11T17:30:50.050670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:50.136170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3030000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0166278 × 1018
Minimum1.997303 × 1018
Maximum2.024303 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T17:30:50.238651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.997303 × 1018
5-th percentile2.005303 × 1018
Q12.012303 × 1018
median2.019303 × 1018
Q32.021303 × 1018
95-th percentile2.023303 × 1018
Maximum2.024303 × 1018
Range2.7 × 1016
Interquartile range (IQR)9 × 1015

Descriptive statistics

Standard deviation6.0947699 × 1015
Coefficient of variation (CV)0.0030222582
Kurtosis-0.53944467
Mean2.0166278 × 1018
Median Absolute Deviation (MAD)3 × 1015
Skewness-0.78479708
Sum3.9868307 × 1018
Variance3.714622 × 1031
MonotonicityNot monotonic
2024-05-11T17:30:50.369050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2022303010330200697 1
 
< 0.1%
2014303010330200319 1
 
< 0.1%
2023303010330201937 1
 
< 0.1%
2023303010330200943 1
 
< 0.1%
2023303010330201108 1
 
< 0.1%
2021303010330200312 1
 
< 0.1%
2009303010330202388 1
 
< 0.1%
2020303010330200406 1
 
< 0.1%
2003303010330200652 1
 
< 0.1%
2021303010330201942 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1997303010330200535 1
< 0.1%
1997303010330200564 1
< 0.1%
1997303010330201065 1
< 0.1%
1998303010330201234 1
< 0.1%
1998303010330201553 1
< 0.1%
1998303010330202294 1
< 0.1%
1999303010330201896 1
< 0.1%
1999303010330202077 1
< 0.1%
1999303010330202480 1
< 0.1%
1999303010330202497 1
< 0.1%
ValueCountFrequency (%)
2024303010330200939 1
< 0.1%
2024303010330200937 1
< 0.1%
2024303010330200935 1
< 0.1%
2024303010330200933 1
< 0.1%
2024303010330200932 1
< 0.1%
2024303010330200929 1
< 0.1%
2024303010330200927 1
< 0.1%
2024303010330200926 1
< 0.1%
2024303010330200922 1
< 0.1%
2024303010330200918 1
< 0.1%
Distinct4004
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1997-01-28 00:00:00
Maximum2024-05-09 00:00:00
2024-05-11T17:30:50.480204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:50.592864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

MISSING 

Distinct3
Distinct (%)100.0%
Missing9997
Missing (%)> 99.9%
Memory size156.2 KiB
Minimum2007-08-23 00:00:00
Maximum2024-03-26 00:00:00
2024-05-11T17:30:50.677690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:50.771240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4509 
3
2531 
4
2231 
5
715 
2
 
14

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row4
3rd row1
4th row1
5th row4

Common Values

ValueCountFrequency (%)
1 4509
45.1%
3 2531
25.3%
4 2231
22.3%
5 715
 
7.1%
2 14
 
0.1%

Length

2024-05-11T17:30:50.874956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:50.983133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4509
45.1%
3 2531
25.3%
4 2231
22.3%
5 715
 
7.1%
2 14
 
0.1%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4509 
폐업
2531 
취소/말소/만료/정지/중지
2231 
제외/삭제/전출
715 
휴업
 
14

Length

Max length14
Median length8
Mean length6.4589
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row취소/말소/만료/정지/중지
3rd row영업/정상
4th row영업/정상
5th row취소/말소/만료/정지/중지

Common Values

ValueCountFrequency (%)
영업/정상 4509
45.1%
폐업 2531
25.3%
취소/말소/만료/정지/중지 2231
22.3%
제외/삭제/전출 715
 
7.1%
휴업 14
 
0.1%

Length

2024-05-11T17:30:51.097150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:51.193582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4509
45.1%
폐업 2531
25.3%
취소/말소/만료/정지/중지 2231
22.3%
제외/삭제/전출 715
 
7.1%
휴업 14
 
0.1%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.131
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T17:30:51.286006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q35
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.3744479
Coefficient of variation (CV)0.75836726
Kurtosis-1.1096499
Mean3.131
Median Absolute Deviation (MAD)2
Skewness0.67240173
Sum31310
Variance5.6380028
MonotonicityNot monotonic
2024-05-11T17:30:51.386441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 4509
45.1%
3 2531
25.3%
7 2227
22.3%
5 715
 
7.1%
2 14
 
0.1%
4 4
 
< 0.1%
ValueCountFrequency (%)
1 4509
45.1%
2 14
 
0.1%
3 2531
25.3%
4 4
 
< 0.1%
5 715
 
7.1%
7 2227
22.3%
ValueCountFrequency (%)
7 2227
22.3%
5 715
 
7.1%
4 4
 
< 0.1%
3 2531
25.3%
2 14
 
0.1%
1 4509
45.1%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4509 
폐업처리
2531 
직권말소
2227 
타시군구이관
715 
휴업처리
 
14

Length

Max length6
Median length4
Mean length4.143
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업처리
2nd row직권말소
3rd row정상영업
4th row정상영업
5th row직권말소

Common Values

ValueCountFrequency (%)
정상영업 4509
45.1%
폐업처리 2531
25.3%
직권말소 2227
22.3%
타시군구이관 715
 
7.1%
휴업처리 14
 
0.1%
직권취소 4
 
< 0.1%

Length

2024-05-11T17:30:51.494206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:51.593504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4509
45.1%
폐업처리 2531
25.3%
직권말소 2227
22.3%
타시군구이관 715
 
7.1%
휴업처리 14
 
0.1%
직권취소 4
 
< 0.1%

폐업일자
Date

MISSING 

Distinct1933
Distinct (%)59.8%
Missing6770
Missing (%)67.7%
Memory size156.2 KiB
Minimum1999-02-11 00:00:00
Maximum2024-05-09 00:00:00
2024-05-11T17:30:51.701698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:51.826629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct34
Distinct (%)100.0%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2007-02-06 00:00:00
Maximum2024-05-10 00:00:00
2024-05-11T17:30:51.939085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:52.045461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

휴업종료일자
Date

MISSING 

Distinct32
Distinct (%)94.1%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-02-08 00:00:00
Maximum2050-01-01 00:00:00
2024-05-11T17:30:52.335744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:52.457646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)

재개업일자
Date

MISSING 

Distinct7
Distinct (%)100.0%
Missing9993
Missing (%)99.9%
Memory size156.2 KiB
Minimum2008-09-19 00:00:00
Maximum2024-05-07 00:00:00
2024-05-11T17:30:52.576950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:52.682377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

전화번호
Text

MISSING 

Distinct4671
Distinct (%)89.3%
Missing4772
Missing (%)47.7%
Memory size156.2 KiB
2024-05-11T17:30:52.846541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length10.73508
Min length1

Characters and Unicode

Total characters56123
Distinct characters16
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4581 ?
Unique (%)87.6%

Sample

1st row497-9111
2nd row02-6402-7535
3rd row070-7722-0880
4th row2009-2450
5th row02 498 0742
ValueCountFrequency (%)
02 1216
 
16.4%
346
 
4.7%
2299 44
 
0.6%
2295 36
 
0.5%
2281 34
 
0.5%
2298 34
 
0.5%
2297 33
 
0.4%
2296 32
 
0.4%
2294 32
 
0.4%
2293 28
 
0.4%
Other values (4904) 5561
75.2%
2024-05-11T17:30:53.171087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8799
15.7%
0 8477
15.1%
- 6495
11.6%
4 4139
7.4%
7 4133
7.4%
3922
7.0%
9 3526
6.3%
6 3509
 
6.3%
1 3366
 
6.0%
5 3307
 
5.9%
Other values (6) 6450
11.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 45374
80.8%
Dash Punctuation 6495
 
11.6%
Space Separator 3922
 
7.0%
Other Punctuation 323
 
0.6%
Math Symbol 8
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 8799
19.4%
0 8477
18.7%
4 4139
9.1%
7 4133
9.1%
9 3526
7.8%
6 3509
 
7.7%
1 3366
 
7.4%
5 3307
 
7.3%
8 3111
 
6.9%
3 3007
 
6.6%
Other Punctuation
ValueCountFrequency (%)
. 321
99.4%
, 2
 
0.6%
Dash Punctuation
ValueCountFrequency (%)
- 6495
100.0%
Space Separator
ValueCountFrequency (%)
3922
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 56123
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 8799
15.7%
0 8477
15.1%
- 6495
11.6%
4 4139
7.4%
7 4133
7.4%
3922
7.0%
9 3526
6.3%
6 3509
 
6.3%
1 3366
 
6.0%
5 3307
 
5.9%
Other values (6) 6450
11.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56123
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 8799
15.7%
0 8477
15.1%
- 6495
11.6%
4 4139
7.4%
7 4133
7.4%
3922
7.0%
9 3526
6.3%
6 3509
 
6.3%
1 3366
 
6.0%
5 3307
 
5.9%
Other values (6) 6450
11.5%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct121
Distinct (%)17.2%
Missing9297
Missing (%)93.0%
Infinite0
Infinite (%)0.0%
Mean133820.74
Minimum133010
Maximum415851
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T17:30:53.295259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum133010
5-th percentile133030
Q1133110
median133160
Q3133821.5
95-th percentile133864
Maximum415851
Range282841
Interquartile range (IQR)711.5

Descriptive statistics

Standard deviation10658.38
Coefficient of variation (CV)0.079646697
Kurtosis701.34409
Mean133820.74
Median Absolute Deviation (MAD)120
Skewness26.467373
Sum94075982
Variance1.1360107 × 108
MonotonicityNot monotonic
2024-05-11T17:30:53.426570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
133120 107
 
1.1%
133110 48
 
0.5%
133170 36
 
0.4%
133070 35
 
0.4%
133050 25
 
0.2%
133832 20
 
0.2%
133030 15
 
0.1%
133020 13
 
0.1%
133094 13
 
0.1%
133815 12
 
0.1%
Other values (111) 379
 
3.8%
(Missing) 9297
93.0%
ValueCountFrequency (%)
133010 3
 
< 0.1%
133020 13
 
0.1%
133021 5
 
0.1%
133022 4
 
< 0.1%
133030 15
0.1%
133040 8
 
0.1%
133050 25
0.2%
133060 4
 
< 0.1%
133070 35
0.4%
133071 6
 
0.1%
ValueCountFrequency (%)
415851 1
 
< 0.1%
135010 1
 
< 0.1%
133923 8
0.1%
133883 3
 
< 0.1%
133882 6
0.1%
133880 8
0.1%
133872 2
 
< 0.1%
133871 1
 
< 0.1%
133870 1
 
< 0.1%
133868 3
 
< 0.1%

지번주소
Text

MISSING 

Distinct3048
Distinct (%)36.6%
Missing1677
Missing (%)16.8%
Memory size156.2 KiB
2024-05-11T17:30:53.640346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length46
Mean length27.771356
Min length5

Characters and Unicode

Total characters231141
Distinct characters471
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2189 ?
Unique (%)26.3%

Sample

1st row서울특별시 성동구 하왕십리동 *** 벽산아파트
2nd row서울특별시 성동구 사근동 ***번지 *호
3rd row서울특별시 성동구 금호동*가 ***번지
4th row서울특별시 성동구 성수동*가 ***-*** 정안맨션*차
5th row서울특별시 성동구 성수동*가 ***번지 ***호 *층
ValueCountFrequency (%)
서울특별시 8271
18.5%
성동구 8266
18.5%
5421
12.1%
성수동*가 3683
 
8.2%
3007
 
6.7%
번지 2968
 
6.6%
금호동*가 711
 
1.6%
행당동 646
 
1.4%
529
 
1.2%
하왕십리동 517
 
1.2%
Other values (1697) 10726
24.0%
2024-05-11T17:30:54.018725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 45938
19.9%
39120
16.9%
17540
 
7.6%
13026
 
5.6%
8841
 
3.8%
8832
 
3.8%
8366
 
3.6%
8297
 
3.6%
8274
 
3.6%
8272
 
3.6%
Other values (461) 64635
28.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 137223
59.4%
Other Punctuation 46005
 
19.9%
Space Separator 39120
 
16.9%
Dash Punctuation 4043
 
1.7%
Uppercase Letter 2545
 
1.1%
Decimal Number 1491
 
0.6%
Lowercase Letter 446
 
0.2%
Close Punctuation 120
 
0.1%
Open Punctuation 115
 
< 0.1%
Letter Number 30
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17540
 
12.8%
13026
 
9.5%
8841
 
6.4%
8832
 
6.4%
8366
 
6.1%
8297
 
6.0%
8274
 
6.0%
8272
 
6.0%
4942
 
3.6%
4814
 
3.5%
Other values (392) 46019
33.5%
Uppercase Letter
ValueCountFrequency (%)
T 307
12.1%
E 266
 
10.5%
K 241
 
9.5%
S 183
 
7.2%
A 181
 
7.1%
R 176
 
6.9%
C 140
 
5.5%
I 136
 
5.3%
V 125
 
4.9%
O 123
 
4.8%
Other values (14) 667
26.2%
Lowercase Letter
ValueCountFrequency (%)
o 92
20.6%
e 90
20.2%
r 83
18.6%
w 71
15.9%
l 15
 
3.4%
a 14
 
3.1%
m 11
 
2.5%
h 10
 
2.2%
i 10
 
2.2%
z 9
 
2.0%
Other values (10) 41
9.2%
Decimal Number
ValueCountFrequency (%)
1 285
19.1%
2 275
18.4%
6 155
10.4%
3 148
9.9%
5 142
9.5%
8 109
 
7.3%
7 108
 
7.2%
4 108
 
7.2%
0 94
 
6.3%
9 67
 
4.5%
Other Punctuation
ValueCountFrequency (%)
* 45938
99.9%
, 40
 
0.1%
/ 19
 
< 0.1%
. 5
 
< 0.1%
& 2
 
< 0.1%
: 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 119
99.2%
] 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 114
99.1%
[ 1
 
0.9%
Letter Number
ValueCountFrequency (%)
16
53.3%
14
46.7%
Space Separator
ValueCountFrequency (%)
39120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4043
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 137223
59.4%
Common 90897
39.3%
Latin 3021
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17540
 
12.8%
13026
 
9.5%
8841
 
6.4%
8832
 
6.4%
8366
 
6.1%
8297
 
6.0%
8274
 
6.0%
8272
 
6.0%
4942
 
3.6%
4814
 
3.5%
Other values (392) 46019
33.5%
Latin
ValueCountFrequency (%)
T 307
 
10.2%
E 266
 
8.8%
K 241
 
8.0%
S 183
 
6.1%
A 181
 
6.0%
R 176
 
5.8%
C 140
 
4.6%
I 136
 
4.5%
V 125
 
4.1%
O 123
 
4.1%
Other values (36) 1143
37.8%
Common
ValueCountFrequency (%)
* 45938
50.5%
39120
43.0%
- 4043
 
4.4%
1 285
 
0.3%
2 275
 
0.3%
6 155
 
0.2%
3 148
 
0.2%
5 142
 
0.2%
) 119
 
0.1%
( 114
 
0.1%
Other values (13) 558
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 137223
59.4%
ASCII 93888
40.6%
Number Forms 30
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 45938
48.9%
39120
41.7%
- 4043
 
4.3%
T 307
 
0.3%
1 285
 
0.3%
2 275
 
0.3%
E 266
 
0.3%
K 241
 
0.3%
S 183
 
0.2%
A 181
 
0.2%
Other values (57) 3049
 
3.2%
Hangul
ValueCountFrequency (%)
17540
 
12.8%
13026
 
9.5%
8841
 
6.4%
8832
 
6.4%
8366
 
6.1%
8297
 
6.0%
8274
 
6.0%
8272
 
6.0%
4942
 
3.6%
4814
 
3.5%
Other values (392) 46019
33.5%
Number Forms
ValueCountFrequency (%)
16
53.3%
14
46.7%

도로명주소
Text

MISSING 

Distinct5138
Distinct (%)60.4%
Missing1497
Missing (%)15.0%
Memory size156.2 KiB
2024-05-11T17:30:54.279648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length94
Median length62
Mean length38.443843
Min length20

Characters and Unicode

Total characters326888
Distinct characters502
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3926 ?
Unique (%)46.2%

Sample

1st row서울특별시 성동구 무학로 **, ***동 ****호 (하왕십리동, 벽산아파트)
2nd row서울특별시 성동구 사근동**길 *, *층 ***호 (사근동)
3rd row서울특별시 성동구 무수막길 **-*, ***호 (금호동*가)
4th row서울특별시 성동구 뚝섬로**길 **, 정안 *동 ***호 부속 지하실*호 (성수동*가, 정안맨션*차)
5th row서울특별시 성동구 왕십리로*길 **-**, *층호 (성수동*가)
ValueCountFrequency (%)
서울특별시 8503
13.9%
성동구 8492
13.9%
8212
13.4%
5184
 
8.5%
성수동*가 3869
 
6.3%
3120
 
5.1%
1731
 
2.8%
금호동*가 683
 
1.1%
행당동 610
 
1.0%
왕십리로 512
 
0.8%
Other values (2425) 20190
33.0%
2024-05-11T17:30:54.682600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 58221
17.8%
52762
16.1%
20317
 
6.2%
14768
 
4.5%
, 10606
 
3.2%
9884
 
3.0%
9367
 
2.9%
8817
 
2.7%
) 8585
 
2.6%
( 8581
 
2.6%
Other values (492) 124980
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 180928
55.3%
Other Punctuation 68839
 
21.1%
Space Separator 52762
 
16.1%
Close Punctuation 8586
 
2.6%
Open Punctuation 8582
 
2.6%
Uppercase Letter 3003
 
0.9%
Decimal Number 2147
 
0.7%
Dash Punctuation 1520
 
0.5%
Lowercase Letter 463
 
0.1%
Letter Number 35
 
< 0.1%
Other values (2) 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20317
 
11.2%
14768
 
8.2%
9884
 
5.5%
9367
 
5.2%
8817
 
4.9%
8534
 
4.7%
8510
 
4.7%
8508
 
4.7%
7455
 
4.1%
6362
 
3.5%
Other values (420) 78406
43.3%
Uppercase Letter
ValueCountFrequency (%)
T 338
11.3%
B 276
 
9.2%
E 253
 
8.4%
S 250
 
8.3%
A 250
 
8.3%
K 242
 
8.1%
R 183
 
6.1%
I 163
 
5.4%
O 151
 
5.0%
C 143
 
4.8%
Other values (14) 754
25.1%
Lowercase Letter
ValueCountFrequency (%)
e 93
20.1%
o 92
19.9%
r 83
17.9%
w 74
16.0%
b 16
 
3.5%
l 13
 
2.8%
a 11
 
2.4%
m 10
 
2.2%
s 10
 
2.2%
t 9
 
1.9%
Other values (11) 52
11.2%
Decimal Number
ValueCountFrequency (%)
1 631
29.4%
2 358
16.7%
0 299
13.9%
3 198
 
9.2%
4 165
 
7.7%
5 161
 
7.5%
6 97
 
4.5%
7 87
 
4.1%
8 85
 
4.0%
9 66
 
3.1%
Other Punctuation
ValueCountFrequency (%)
* 58221
84.6%
, 10606
 
15.4%
& 5
 
< 0.1%
/ 3
 
< 0.1%
. 2
 
< 0.1%
# 1
 
< 0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 8585
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 8581
> 99.9%
[ 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
19
54.3%
16
45.7%
Space Separator
ValueCountFrequency (%)
52762
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1520
100.0%
Math Symbol
ValueCountFrequency (%)
~ 22
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 180928
55.3%
Common 142459
43.6%
Latin 3501
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20317
 
11.2%
14768
 
8.2%
9884
 
5.5%
9367
 
5.2%
8817
 
4.9%
8534
 
4.7%
8510
 
4.7%
8508
 
4.7%
7455
 
4.1%
6362
 
3.5%
Other values (420) 78406
43.3%
Latin
ValueCountFrequency (%)
T 338
 
9.7%
B 276
 
7.9%
E 253
 
7.2%
S 250
 
7.1%
A 250
 
7.1%
K 242
 
6.9%
R 183
 
5.2%
I 163
 
4.7%
O 151
 
4.3%
C 143
 
4.1%
Other values (37) 1252
35.8%
Common
ValueCountFrequency (%)
* 58221
40.9%
52762
37.0%
, 10606
 
7.4%
) 8585
 
6.0%
( 8581
 
6.0%
- 1520
 
1.1%
1 631
 
0.4%
2 358
 
0.3%
0 299
 
0.2%
3 198
 
0.1%
Other values (15) 698
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 180927
55.3%
ASCII 145924
44.6%
Number Forms 35
 
< 0.1%
Compat Jamo 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 58221
39.9%
52762
36.2%
, 10606
 
7.3%
) 8585
 
5.9%
( 8581
 
5.9%
- 1520
 
1.0%
1 631
 
0.4%
2 358
 
0.2%
T 338
 
0.2%
0 299
 
0.2%
Other values (59) 4023
 
2.8%
Hangul
ValueCountFrequency (%)
20317
 
11.2%
14768
 
8.2%
9884
 
5.5%
9367
 
5.2%
8817
 
4.9%
8534
 
4.7%
8510
 
4.7%
8508
 
4.7%
7455
 
4.1%
6362
 
3.5%
Other values (419) 78405
43.3%
Number Forms
ValueCountFrequency (%)
19
54.3%
16
45.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct343
Distinct (%)4.4%
Missing2267
Missing (%)22.7%
Memory size156.2 KiB
2024-05-11T17:30:54.959210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1783267
Min length5

Characters and Unicode

Total characters40044
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)1.0%

Sample

1st row04703
2nd row04761
3rd row04728
4th row04785
5th row133110
ValueCountFrequency (%)
04793 303
 
3.9%
04782 272
 
3.5%
04779 231
 
3.0%
04790 230
 
3.0%
04709 211
 
2.7%
04778 210
 
2.7%
04799 196
 
2.5%
04783 164
 
2.1%
04808 162
 
2.1%
04794 154
 
2.0%
Other values (333) 5600
72.4%
2024-05-11T17:30:55.420318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9042
22.6%
7 7817
19.5%
4 7703
19.2%
3 3463
 
8.6%
8 3222
 
8.0%
1 2722
 
6.8%
9 2392
 
6.0%
2 1436
 
3.6%
5 991
 
2.5%
6 924
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39712
99.2%
Dash Punctuation 332
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9042
22.8%
7 7817
19.7%
4 7703
19.4%
3 3463
 
8.7%
8 3222
 
8.1%
1 2722
 
6.9%
9 2392
 
6.0%
2 1436
 
3.6%
5 991
 
2.5%
6 924
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 332
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40044
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9042
22.6%
7 7817
19.5%
4 7703
19.2%
3 3463
 
8.6%
8 3222
 
8.0%
1 2722
 
6.8%
9 2392
 
6.0%
2 1436
 
3.6%
5 991
 
2.5%
6 924
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40044
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9042
22.6%
7 7817
19.5%
4 7703
19.2%
3 3463
 
8.6%
8 3222
 
8.0%
1 2722
 
6.8%
9 2392
 
6.0%
2 1436
 
3.6%
5 991
 
2.5%
6 924
 
2.3%
Distinct9894
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T17:30:55.726124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length41
Mean length7.9459
Min length1

Characters and Unicode

Total characters79459
Distinct characters1083
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9793 ?
Unique (%)97.9%

Sample

1st row더드림
2nd row벤자민
3rd row미드나잇1988
4th row플러스아츠 (+ARTS)
5th row알래
ValueCountFrequency (%)
주식회사 1319
 
9.5%
195
 
1.4%
co 56
 
0.4%
ltd 53
 
0.4%
inc 38
 
0.3%
컴퍼니 37
 
0.3%
company 35
 
0.3%
co.,ltd 33
 
0.2%
31
 
0.2%
스튜디오 30
 
0.2%
Other values (11188) 12124
86.9%
2024-05-11T17:30:56.127986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3965
 
5.0%
) 2893
 
3.6%
( 2890
 
3.6%
2611
 
3.3%
2595
 
3.3%
2192
 
2.8%
1680
 
2.1%
1445
 
1.8%
1401
 
1.8%
1153
 
1.5%
Other values (1073) 56634
71.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52212
65.7%
Lowercase Letter 8974
 
11.3%
Uppercase Letter 7279
 
9.2%
Space Separator 3965
 
5.0%
Close Punctuation 2894
 
3.6%
Open Punctuation 2891
 
3.6%
Other Punctuation 621
 
0.8%
Decimal Number 488
 
0.6%
Dash Punctuation 101
 
0.1%
Connector Punctuation 16
 
< 0.1%
Other values (3) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2611
 
5.0%
2595
 
5.0%
2192
 
4.2%
1680
 
3.2%
1445
 
2.8%
1401
 
2.7%
1153
 
2.2%
905
 
1.7%
831
 
1.6%
800
 
1.5%
Other values (990) 36599
70.1%
Lowercase Letter
ValueCountFrequency (%)
e 988
 
11.0%
o 973
 
10.8%
a 785
 
8.7%
n 679
 
7.6%
i 636
 
7.1%
t 582
 
6.5%
r 554
 
6.2%
l 541
 
6.0%
s 405
 
4.5%
d 344
 
3.8%
Other values (16) 2487
27.7%
Uppercase Letter
ValueCountFrequency (%)
A 582
 
8.0%
O 548
 
7.5%
E 537
 
7.4%
S 498
 
6.8%
N 449
 
6.2%
L 449
 
6.2%
C 427
 
5.9%
T 405
 
5.6%
I 392
 
5.4%
M 378
 
5.2%
Other values (16) 2614
35.9%
Other Punctuation
ValueCountFrequency (%)
. 374
60.2%
, 107
 
17.2%
& 102
 
16.4%
' 14
 
2.3%
? 8
 
1.3%
# 5
 
0.8%
3
 
0.5%
: 3
 
0.5%
/ 2
 
0.3%
! 2
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 92
18.9%
1 91
18.6%
3 46
9.4%
9 45
9.2%
0 44
9.0%
4 39
8.0%
5 36
 
7.4%
7 35
 
7.2%
6 30
 
6.1%
8 30
 
6.1%
Close Punctuation
ValueCountFrequency (%)
) 2893
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2890
> 99.9%
[ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
3965
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 101
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 16
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52204
65.7%
Latin 16253
 
20.5%
Common 10985
 
13.8%
Han 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2611
 
5.0%
2595
 
5.0%
2192
 
4.2%
1680
 
3.2%
1445
 
2.8%
1401
 
2.7%
1153
 
2.2%
905
 
1.7%
831
 
1.6%
800
 
1.5%
Other values (975) 36591
70.1%
Latin
ValueCountFrequency (%)
e 988
 
6.1%
o 973
 
6.0%
a 785
 
4.8%
n 679
 
4.2%
i 636
 
3.9%
A 582
 
3.6%
t 582
 
3.6%
r 554
 
3.4%
O 548
 
3.4%
l 541
 
3.3%
Other values (42) 9385
57.7%
Common
ValueCountFrequency (%)
3965
36.1%
) 2893
26.3%
( 2890
26.3%
. 374
 
3.4%
, 107
 
1.0%
& 102
 
0.9%
- 101
 
0.9%
2 92
 
0.8%
1 91
 
0.8%
3 46
 
0.4%
Other values (20) 324
 
2.9%
Han
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52194
65.7%
ASCII 27235
34.3%
CJK 15
 
< 0.1%
None 12
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3965
 
14.6%
) 2893
 
10.6%
( 2890
 
10.6%
e 988
 
3.6%
o 973
 
3.6%
a 785
 
2.9%
n 679
 
2.5%
i 636
 
2.3%
A 582
 
2.1%
t 582
 
2.1%
Other values (71) 12262
45.0%
Hangul
ValueCountFrequency (%)
2611
 
5.0%
2595
 
5.0%
2192
 
4.2%
1680
 
3.2%
1445
 
2.8%
1401
 
2.7%
1153
 
2.2%
905
 
1.7%
831
 
1.6%
800
 
1.5%
Other values (973) 36581
70.1%
None
ValueCountFrequency (%)
9
75.0%
3
 
25.0%
CJK
ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct9410
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-19 09:31:08
Maximum2024-05-09 16:48:02
2024-05-11T17:30:56.243170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:56.359379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
5800 
U
4200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowU
3rd rowI
4th rowI
5th rowU

Common Values

ValueCountFrequency (%)
I 5800
58.0%
U 4200
42.0%

Length

2024-05-11T17:30:56.475183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:56.567876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 5800
58.0%
u 4200
42.0%
Distinct1554
Distinct (%)15.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T17:30:56.663757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T17:30:56.797008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct472
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T17:30:56.963516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length83
Mean length9.259
Min length1

Characters and Unicode

Total characters92590
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique298 ?
Unique (%)3.0%

Sample

1st row종합몰
2nd row의류/패션/잡화/뷰티
3rd row의류/패션/잡화/뷰티
4th row기타
5th row의류/패션/잡화/뷰티
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4330
30.9%
종합몰 2502
17.9%
기타 2096
15.0%
건강/식품 1085
 
7.7%
981
 
7.0%
교육/도서/완구/오락 689
 
4.9%
컴퓨터/사무용품 545
 
3.9%
가구/수납용품 497
 
3.5%
가전 418
 
3.0%
레져/여행/공연 355
 
2.5%
Other values (3) 506
 
3.6%
2024-05-11T17:30:57.261389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 18297
19.8%
4330
 
4.7%
4330
 
4.7%
4330
 
4.7%
4330
 
4.7%
4330
 
4.7%
4330
 
4.7%
4330
 
4.7%
4330
 
4.7%
4004
 
4.3%
Other values (41) 35649
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69308
74.9%
Other Punctuation 18297
 
19.8%
Space Separator 4004
 
4.3%
Dash Punctuation 981
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
2633
 
3.8%
2502
 
3.6%
Other values (38) 29533
42.6%
Other Punctuation
ValueCountFrequency (%)
/ 18297
100.0%
Space Separator
ValueCountFrequency (%)
4004
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 981
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69308
74.9%
Common 23282
 
25.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
2633
 
3.8%
2502
 
3.6%
Other values (38) 29533
42.6%
Common
ValueCountFrequency (%)
/ 18297
78.6%
4004
 
17.2%
- 981
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69308
74.9%
ASCII 23282
 
25.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 18297
78.6%
4004
 
17.2%
- 981
 
4.2%
Hangul
ValueCountFrequency (%)
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
4330
 
6.2%
2633
 
3.8%
2502
 
3.6%
Other values (38) 29533
42.6%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct3143
Distinct (%)36.1%
Missing1283
Missing (%)12.8%
Infinite0
Infinite (%)0.0%
Mean203796.17
Minimum164927.63
Maximum210282.63
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T17:30:57.380852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum164927.63
5-th percentile201481.2
Q1202776.23
median204071.51
Q3204795.4
95-th percentile205638.68
Maximum210282.63
Range45354.999
Interquartile range (IQR)2019.1618

Descriptive statistics

Standard deviation1354.3336
Coefficient of variation (CV)0.0066455303
Kurtosis76.796867
Mean203796.17
Median Absolute Deviation (MAD)941.70084
Skewness-3.0318506
Sum1.7764912 × 109
Variance1834219.5
MonotonicityNot monotonic
2024-05-11T17:30:57.490962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202372.912023599 110
 
1.1%
202326.503044305 101
 
1.0%
204764.658079784 94
 
0.9%
204167.721292495 80
 
0.8%
203014.126371125 79
 
0.8%
202326.470897024 75
 
0.8%
205039.907145725 74
 
0.7%
202113.869605464 74
 
0.7%
204743.677915464 67
 
0.7%
202511.142930696 66
 
0.7%
Other values (3133) 7897
79.0%
(Missing) 1283
 
12.8%
ValueCountFrequency (%)
164927.626906141 1
 
< 0.1%
196881.746507124 1
 
< 0.1%
200782.353831759 1
 
< 0.1%
200794.15011761 2
 
< 0.1%
200807.704843164 1
 
< 0.1%
200812.992681398 16
0.2%
200813.000885632 3
 
< 0.1%
200832.281387425 1
 
< 0.1%
200834.31104693 1
 
< 0.1%
200841.945538402 1
 
< 0.1%
ValueCountFrequency (%)
210282.625548983 1
< 0.1%
206392.92125763 2
< 0.1%
206390.482595211 1
< 0.1%
206382.157826605 2
< 0.1%
206381.885182354 1
< 0.1%
206365.843579506 1
< 0.1%
206352.830451414 1
< 0.1%
206347.265568329 1
< 0.1%
206345.830722823 1
< 0.1%
206327.212793112 1
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct3145
Distinct (%)36.1%
Missing1283
Missing (%)12.8%
Infinite0
Infinite (%)0.0%
Mean449917.15
Minimum445675.5
Maximum462863.32
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T17:30:57.615304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum445675.5
5-th percentile448558.27
Q1449159.45
median449583.92
Q3450883.52
95-th percentile451805.57
Maximum462863.32
Range17187.821
Interquartile range (IQR)1724.0712

Descriptive statistics

Standard deviation1031.4054
Coefficient of variation (CV)0.0022924341
Kurtosis2.5897569
Mean449917.15
Median Absolute Deviation (MAD)649.05632
Skewness0.76007107
Sum3.9219278 × 109
Variance1063797.2
MonotonicityNot monotonic
2024-05-11T17:30:57.732954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
451536.680876573 110
 
1.1%
450625.58422744 101
 
1.0%
449437.628334972 94
 
0.9%
449290.778225677 80
 
0.8%
451106.855353223 79
 
0.8%
449954.683065141 75
 
0.8%
449216.223655036 74
 
0.7%
451897.581865192 74
 
0.7%
449357.9464135 67
 
0.7%
450401.303715561 66
 
0.7%
Other values (3135) 7897
79.0%
(Missing) 1283
 
12.8%
ValueCountFrequency (%)
445675.502928217 1
< 0.1%
447528.63482061 1
< 0.1%
448032.759033613 1
< 0.1%
448061.529326769 1
< 0.1%
448068.188938354 2
< 0.1%
448074.599156032 1
< 0.1%
448076.259608224 1
< 0.1%
448076.291065241 1
< 0.1%
448081.16757518 2
< 0.1%
448084.000347493 1
< 0.1%
ValueCountFrequency (%)
462863.323977861 1
 
< 0.1%
459169.464921021 1
 
< 0.1%
452148.339373773 1
 
< 0.1%
452138.17136839 4
< 0.1%
452136.43641437 1
 
< 0.1%
452134.616539343 1
 
< 0.1%
452132.397540245 1
 
< 0.1%
452126.951366987 1
 
< 0.1%
452123.823940035 1
 
< 0.1%
452118.573751193 2
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9377 
0
 
623

Length

Max length4
Median length4
Mean length3.8131
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9377
93.8%
0 623
 
6.2%

Length

2024-05-11T17:30:57.846331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:57.938126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9377
93.8%
0 623
 
6.2%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9377 
0
 
623

Length

Max length4
Median length4
Mean length3.8131
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9377
93.8%
0 623
 
6.2%

Length

2024-05-11T17:30:58.029759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:58.127850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9377
93.8%
0 623
 
6.2%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9377 
0
 
623

Length

Max length4
Median length4
Mean length3.8131
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9377
93.8%
0 623
 
6.2%

Length

2024-05-11T17:30:58.235383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T17:30:58.568636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9377
93.8%
0 623
 
6.2%

판매방식명
Categorical

IMBALANCE 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6675 
인터넷
3071 
인터넷, 기타
 
72
TV홈쇼핑, 인터넷
 
43
기타
 
42
Other values (14)
 
97

Length

Max length26
Median length4
Mean length3.8409
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row인터넷
4th row인터넷
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6675
66.8%
인터넷 3071
30.7%
인터넷, 기타 72
 
0.7%
TV홈쇼핑, 인터넷 43
 
0.4%
기타 42
 
0.4%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 22
 
0.2%
인터넷, 카다로그 14
 
0.1%
TV홈쇼핑 14
 
0.1%
인터넷, 카다로그, 신문잡지, 기타 9
 
0.1%
TV홈쇼핑, 인터넷, 카다로그 6
 
0.1%
Other values (9) 32
 
0.3%

Length

2024-05-11T17:30:58.660634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 6675
64.7%
인터넷 3263
31.6%
기타 159
 
1.5%
tv홈쇼핑 100
 
1.0%
카다로그 73
 
0.7%
신문잡지 45
 
0.4%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
21909303000020223030103302006972022-03-29<NA>3폐업3폐업처리2023-11-24<NA><NA><NA><NA><NA><NA>서울특별시 성동구 하왕십리동 *** 벽산아파트서울특별시 성동구 무학로 **, ***동 ****호 (하왕십리동, 벽산아파트)04703더드림2023-11-24 16:25:38U2022-10-31 22:06:00.0종합몰202727.770917451824.598496<NA><NA><NA><NA>
15115303000020193030103302017372019-09-27<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 사근동 ***번지 *호서울특별시 성동구 사근동**길 *, *층 ***호 (사근동)04761벤자민2023-10-04 16:09:40U2022-10-31 00:06:00.0의류/패션/잡화/뷰티204167.126585451046.730531<NA><NA><NA><NA>
163243030000202030301033020081020200402<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 금호동*가 ***번지서울특별시 성동구 무수막길 **-*, ***호 (금호동*가)04728미드나잇19882020-04-02 09:40:45I2020-04-04 00:23:22.0의류/패션/잡화/뷰티201771.242462450140.781718<NA><NA><NA>인터넷
171453030000202030301033020170520200717<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 성수동*가 ***-*** 정안맨션*차서울특별시 성동구 뚝섬로**길 **, 정안 *동 ***호 부속 지하실*호 (성수동*가, 정안맨션*차)04785플러스아츠 (+ARTS)2020-07-17 14:20:56I2020-07-21 00:23:30.0기타205198.392011448541.420292<NA><NA><NA>인터넷
71953030000201230301033020072320121023<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 성수동*가 ***번지 ***호 *층서울특별시 성동구 왕십리로*길 **-**, *층호 (성수동*가)133110알래2022-11-21 11:17:07U2021-10-31 22:03:00.0의류/패션/잡화/뷰티204046.152628449258.56607<NA><NA><NA><NA>
59363030000201130301033020013120110224<NA>1영업/정상1정상영업<NA><NA><NA><NA>497-9111<NA>133120서울특별시 성동구 성수동*가 ***번지 ***호 *층서울특별시 성동구 뚝섬로 *** (성수동*가,*층)<NA>화이어폭스(firefox)2011-02-24 14:04:28I2018-08-31 23:59:59.0기타204988.441963448430.261389<NA><NA><NA>인터넷
131493030000201830301033020129120180911<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6402-7535<NA><NA>서울특별시 성동구 용답동 ***번지 *호서울특별시 성동구 용답*가길 ** (용답동)04803시얀(SEEYARN)2018-09-11 13:05:49U2018-09-11 23:59:59.0의류/패션/잡화/뷰티204338.319699451496.384765<NA><NA><NA>인터넷
47703030000200930301033020220720090626<NA>3폐업3폐업처리20091008<NA><NA><NA>070-7722-0880<NA>133120서울특별시 성동구 성수동*가 ***번지 **호 동일빌딩 ***호<NA><NA>오리스 쿠폰2009-10-08 16:10:56I2018-08-31 23:59:59.0상품권204694.788012449274.734886<NA><NA><NA>인터넷
71933030000201230301033020072120090831<NA>3폐업3폐업처리20181226<NA><NA><NA>2009-2450<NA><NA>서울특별시 성동구 성수동*가 ***번지 *호 영동테크노타워***,***호서울특별시 성동구 광나루로*길 **, ***호 (성수동*가, 성수동 우림 이비즈센터)04799(주) 다인스케치2018-12-26 10:02:35U2018-12-28 02:40:00.0기타204859.957619449199.388104<NA><NA><NA>인터넷
95763030000201530301033020096820151030<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 살곶이길 **, ***동 ****호 (마장동, 현대아파트)04753링마트2015-10-30 13:21:20I2018-08-31 23:59:59.0종합몰203689.261216451908.86238<NA><NA><NA>인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
19656303000020213030103302012952021-05-10<NA>5제외/삭제/전출5타시군구이관2023-02-09<NA><NA><NA><NA><NA><NA>서울특별시 성동구 성수동*가 ***-** 더리브 세종타워서울특별시 성동구 아차산로 ***, 더리브 세종타워 B***호 (성수동*가)04783로종2023-02-09 15:50:05U2022-12-01 23:01:00.0종합몰205076.420527449018.135624<NA><NA><NA><NA>
204253030000202130301033020210220210819<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 성수동*가 *** 한진타운아파트서울특별시 성동구 뚝섬로 ***, ***동 *층 ***호 (성수동*가, 한진타운아파트)04773포쉬독2021-08-19 09:52:43I2021-08-21 00:22:50.0기타203933.697717448762.825899000인터넷
28623030000200630301033020274720061120<NA>3폐업3폐업처리20080716<NA><NA><NA>02 2293 6131<NA><NA>서울특별시 성동구 마장동 ***-**<NA><NA>지킬앤하이드2008-07-16 09:54:21I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
160763030000202030301033020054120200305<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7542-3099<NA><NA>서울특별시 성동구 마장동 ***번지 **호 군산빌딩서울특별시 성동구 마장로 ***-*, *층 (마장동)04756로지팩 주식회사2020-03-05 13:27:18I2020-03-07 00:23:22.0기타203930.337124451745.115766<NA><NA><NA>인터넷
177273030000202030301033020233320200924<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 옥수동 *** 훈빌딩서울특별시 성동구 한림말*길 **-*, 훈빌딩 ***호 (옥수동)04735비율컴퍼니2020-09-24 14:59:20I2020-09-26 00:23:11.0의류/패션/잡화/뷰티201359.174155448941.509587<NA><NA><NA>인터넷
206453030000202130301033020232720210925<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-4112-9739<NA><NA>서울특별시 성동구 성수동*가 ***-** 성수역 현대테라스타워서울특별시 성동구 연무장*가길 *, 성수역 현대테라스타워 이동 *층 ***호 (성수동*가)04782주식회사 우알롱2021-09-27 15:14:22U2021-09-29 02:40:00.0의류/패션/잡화/뷰티204702.224732449076.199238000인터넷
214393030000202230301033020022220220126<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-2051-7290<NA><NA>서울특별시 성동구 성수동*가 ***-** 우리큐브서울특별시 성동구 연무장**길 **, 우리큐브 *층 ***호 (성수동*가)04783주식회사 트루에드2022-01-26 13:55:29I2022-01-28 00:22:39.0의류/패션/잡화/뷰티205014.436192448961.20833000인터넷
220053030000202230301033020079320220413<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7954-2308<NA><NA>서울특별시 성동구 행당동 *-***서울특별시 성동구 마조로**길 *(행당동)04759두빗2022-04-13 17:33:37I2021-12-03 23:05:00.0종합몰203565.059442451069.681828<NA><NA><NA><NA>
167153030000202030301033020123020200521<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성동구 성수동*가 ***번지 ***호서울특별시 성동구 아차산로*길 ** (성수동*가)04788임언니네2020-05-21 14:34:15I2020-05-23 00:23:19.0의류/패션/잡화/뷰티203973.668724449647.706806<NA><NA><NA>인터넷
21253030000200630301033020199520060123<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 3409 1430<NA><NA>서울특별시 성동구 성수동*가 ***-** 성수아카데미타워 ****호<NA><NA>(주)미래ASP2009-11-30 17:16:58I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>