Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells74805
Missing cells (%)25.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory251.0 B

Variable types

Categorical9
Numeric5
DateTime8
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author중랑구
URLhttps://data.seoul.go.kr/dataList/OA-18808/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (67.3%)Imbalance
부채총액 is highly imbalanced (67.3%)Imbalance
자본금 is highly imbalanced (67.3%)Imbalance
판매방식명 is highly imbalanced (72.6%)Imbalance
인허가취소일자 has 9995 (> 99.9%) missing valuesMissing
폐업일자 has 7089 (70.9%) missing valuesMissing
휴업시작일자 has 9966 (99.7%) missing valuesMissing
휴업종료일자 has 9966 (99.7%) missing valuesMissing
재개업일자 has 9991 (99.9%) missing valuesMissing
전화번호 has 3891 (38.9%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8252 (82.5%) missing valuesMissing
지번주소 has 575 (5.8%) missing valuesMissing
도로명주소 has 959 (9.6%) missing valuesMissing
도로명우편번호 has 2311 (23.1%) missing valuesMissing
좌표정보(X) has 905 (9.0%) missing valuesMissing
좌표정보(Y) has 905 (9.0%) missing valuesMissing
관리번호 is highly skewed (γ1 = -65.02225017)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-17 20:17:01.634165
Analysis finished2024-04-17 20:17:03.121221
Duration1.49 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3060000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3060000
2nd row3060000
3rd row3060000
4th row3060000
5th row3060000

Common Values

ValueCountFrequency (%)
3060000 10000
100.0%

Length

2024-04-18T05:17:03.168844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:03.237252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3060000 10000
100.0%

관리번호
Real number (ℝ)

SKEWED  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0162935 × 1018
Minimum2.005306 × 1017
Maximum2.024306 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-18T05:17:03.317669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.005306 × 1017
5-th percentile2.005306 × 1018
Q12.012306 × 1018
median2.018306 × 1018
Q32.021306 × 1018
95-th percentile2.023306 × 1018
Maximum2.024306 × 1018
Range1.8237754 × 1018
Interquartile range (IQR)9.0000081 × 1015

Descriptive statistics

Standard deviation2.6410398 × 1016
Coefficient of variation (CV)0.013098489
Kurtosis4468.6865
Mean2.0162935 × 1018
Median Absolute Deviation (MAD)4 × 1015
Skewness-65.02225
Sum6.4332476 × 1017
Variance6.9750911 × 1032
MonotonicityNot monotonic
2024-04-18T05:17:03.428168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2021306017630200022 1
 
< 0.1%
2004306009530200493 1
 
< 0.1%
2019306017630200940 1
 
< 0.1%
2022306017630200807 1
 
< 0.1%
2017306014530200588 1
 
< 0.1%
2020306017630201579 1
 
< 0.1%
2021306017630200753 1
 
< 0.1%
2016306014530200535 1
 
< 0.1%
2022306017630201905 1
 
< 0.1%
2023306020230200356 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
200530600953020018 1
< 0.1%
200530600953020030 1
< 0.1%
1993306009530200001 1
< 0.1%
1996306009530200051 1
< 0.1%
1996306009530200346 1
< 0.1%
1997306009530200539 1
< 0.1%
1997306009530200585 1
< 0.1%
1997306009530200610 1
< 0.1%
1997306009530200778 1
< 0.1%
1998306009530201142 1
< 0.1%
ValueCountFrequency (%)
2024306020230200769 1
< 0.1%
2024306020230200767 1
< 0.1%
2024306020230200763 1
< 0.1%
2024306020230200761 1
< 0.1%
2024306020230200759 1
< 0.1%
2024306020230200758 1
< 0.1%
2024306020230200757 1
< 0.1%
2024306020230200756 1
< 0.1%
2024306020230200754 1
< 0.1%
2024306020230200752 1
< 0.1%
Distinct3854
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1993-04-20 00:00:00
Maximum2024-04-16 00:00:00
2024-04-18T05:17:03.537234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:03.637206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

MISSING 

Distinct5
Distinct (%)100.0%
Missing9995
Missing (%)> 99.9%
Memory size156.2 KiB
Minimum2007-07-13 00:00:00
Maximum2023-03-14 00:00:00
2024-04-18T05:17:03.717744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:03.798295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=5)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5582 
3
2553 
4
1453 
5
 
384
2
 
28

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row1
4th row1
5th row5

Common Values

ValueCountFrequency (%)
1 5582
55.8%
3 2553
25.5%
4 1453
 
14.5%
5 384
 
3.8%
2 28
 
0.3%

Length

2024-04-18T05:17:03.892954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:03.978619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5582
55.8%
3 2553
25.5%
4 1453
 
14.5%
5 384
 
3.8%
2 28
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
5582 
폐업
2553 
취소/말소/만료/정지/중지
1453 
제외/삭제/전출
 
384
휴업
 
28

Length

Max length14
Median length5
Mean length5.6486
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row폐업
3rd row영업/정상
4th row영업/정상
5th row제외/삭제/전출

Common Values

ValueCountFrequency (%)
영업/정상 5582
55.8%
폐업 2553
25.5%
취소/말소/만료/정지/중지 1453
 
14.5%
제외/삭제/전출 384
 
3.8%
휴업 28
 
0.3%

Length

2024-04-18T05:17:04.065589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:04.147113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 5582
55.8%
폐업 2553
25.5%
취소/말소/만료/정지/중지 1453
 
14.5%
제외/삭제/전출 384
 
3.8%
휴업 28
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.5379
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-18T05:17:04.218264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.1190594
Coefficient of variation (CV)0.83496567
Kurtosis0.069788711
Mean2.5379
Median Absolute Deviation (MAD)0
Skewness1.2006408
Sum25379
Variance4.4904126
MonotonicityNot monotonic
2024-04-18T05:17:04.295310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 5579
55.8%
3 2553
25.5%
7 1445
 
14.4%
5 384
 
3.8%
2 28
 
0.3%
4 8
 
0.1%
6 3
 
< 0.1%
ValueCountFrequency (%)
1 5579
55.8%
2 28
 
0.3%
3 2553
25.5%
4 8
 
0.1%
5 384
 
3.8%
6 3
 
< 0.1%
7 1445
 
14.4%
ValueCountFrequency (%)
7 1445
 
14.4%
6 3
 
< 0.1%
5 384
 
3.8%
4 8
 
0.1%
3 2553
25.5%
2 28
 
0.3%
1 5579
55.8%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
5579 
폐업처리
2553 
직권말소
1445 
타시군구이관
 
384
휴업처리
 
28
Other values (2)
 
11

Length

Max length6
Median length4
Mean length4.0774
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row폐업처리
3rd row정상영업
4th row정상영업
5th row타시군구이관

Common Values

ValueCountFrequency (%)
정상영업 5579
55.8%
폐업처리 2553
25.5%
직권말소 1445
 
14.4%
타시군구이관 384
 
3.8%
휴업처리 28
 
0.3%
직권취소 8
 
0.1%
타시군구전입 3
 
< 0.1%

Length

2024-04-18T05:17:04.410288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:04.497510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 5579
55.8%
폐업처리 2553
25.5%
직권말소 1445
 
14.4%
타시군구이관 384
 
3.8%
휴업처리 28
 
0.3%
직권취소 8
 
0.1%
타시군구전입 3
 
< 0.1%

폐업일자
Date

MISSING 

Distinct1858
Distinct (%)63.8%
Missing7089
Missing (%)70.9%
Memory size156.2 KiB
Minimum1996-12-16 00:00:00
Maximum2024-04-17 00:00:00
2024-04-18T05:17:04.593001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:04.706546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct33
Distinct (%)97.1%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-06-01 00:00:00
Maximum2024-04-17 00:00:00
2024-04-18T05:17:04.796321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:04.890586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

휴업종료일자
Date

MISSING 

Distinct33
Distinct (%)97.1%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-11-30 00:00:00
Maximum2028-12-14 00:00:00
2024-04-18T05:17:05.230612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:05.321194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

재개업일자
Date

MISSING 

Distinct9
Distinct (%)100.0%
Missing9991
Missing (%)99.9%
Memory size156.2 KiB
Minimum2003-01-20 00:00:00
Maximum2023-08-23 00:00:00
2024-04-18T05:17:05.418177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:05.509444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)

전화번호
Text

MISSING 

Distinct3211
Distinct (%)52.6%
Missing3891
Missing (%)38.9%
Memory size156.2 KiB
2024-04-18T05:17:05.767605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length6.5696513
Min length1

Characters and Unicode

Total characters40134
Distinct characters15
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3150 ?
Unique (%)51.6%

Sample

1st row.
2nd row070-8119-9273
3rd row.
4th row02
5th row02
ValueCountFrequency (%)
2744
28.4%
02 1773
 
18.3%
070 194
 
2.0%
433 74
 
0.8%
432 71
 
0.7%
436 70
 
0.7%
434 66
 
0.7%
438 63
 
0.7%
435 62
 
0.6%
494 61
 
0.6%
Other values (3454) 4487
46.4%
2024-04-18T05:17:06.158211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6026
15.0%
2 5519
13.8%
3567
8.9%
4 3499
8.7%
3 2850
7.1%
. 2712
6.8%
7 2711
6.8%
- 2588
6.4%
9 2534
6.3%
8 2083
 
5.2%
Other values (5) 6045
15.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 31265
77.9%
Space Separator 3567
 
8.9%
Other Punctuation 2713
 
6.8%
Dash Punctuation 2588
 
6.4%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6026
19.3%
2 5519
17.7%
4 3499
11.2%
3 2850
9.1%
7 2711
8.7%
9 2534
8.1%
8 2083
 
6.7%
1 2045
 
6.5%
5 2021
 
6.5%
6 1977
 
6.3%
Other Punctuation
ValueCountFrequency (%)
. 2712
> 99.9%
' 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
3567
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2588
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40134
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6026
15.0%
2 5519
13.8%
3567
8.9%
4 3499
8.7%
3 2850
7.1%
. 2712
6.8%
7 2711
6.8%
- 2588
6.4%
9 2534
6.3%
8 2083
 
5.2%
Other values (5) 6045
15.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40134
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6026
15.0%
2 5519
13.8%
3567
8.9%
4 3499
8.7%
3 2850
7.1%
. 2712
6.8%
7 2711
6.8%
- 2588
6.4%
9 2534
6.3%
8 2083
 
5.2%
Other values (5) 6045
15.1%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct139
Distinct (%)8.0%
Missing8252
Missing (%)82.5%
Infinite0
Infinite (%)0.0%
Mean131580.71
Minimum130854
Maximum131883
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-18T05:17:06.288931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum130854
5-th percentile131120
Q1131200
median131807
Q3131841.25
95-th percentile131878
Maximum131883
Range1029
Interquartile range (IQR)641.25

Descriptive statistics

Standard deviation320.08338
Coefficient of variation (CV)0.0024326012
Kurtosis-1.7081405
Mean131580.71
Median Absolute Deviation (MAD)58
Skewness-0.48408017
Sum2.3000308 × 108
Variance102453.37
MonotonicityNot monotonic
2024-04-18T05:17:06.411843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
131200 189
 
1.9%
131120 100
 
1.0%
131230 99
 
1.0%
131140 73
 
0.7%
131130 53
 
0.5%
131220 50
 
0.5%
131814 44
 
0.4%
131802 33
 
0.3%
131809 32
 
0.3%
131811 27
 
0.3%
Other values (129) 1048
 
10.5%
(Missing) 8252
82.5%
ValueCountFrequency (%)
130854 1
 
< 0.1%
131120 100
1.0%
131121 5
 
0.1%
131122 6
 
0.1%
131123 1
 
< 0.1%
131130 53
0.5%
131131 12
 
0.1%
131132 3
 
< 0.1%
131140 73
0.7%
131141 15
 
0.1%
ValueCountFrequency (%)
131883 2
 
< 0.1%
131882 18
0.2%
131881 19
0.2%
131880 20
0.2%
131879 17
0.2%
131878 16
0.2%
131877 10
 
0.1%
131876 10
 
0.1%
131875 25
0.2%
131873 6
 
0.1%

지번주소
Text

MISSING 

Distinct3855
Distinct (%)40.9%
Missing575
Missing (%)5.8%
Memory size156.2 KiB
2024-04-18T05:17:06.625876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length45
Mean length28.640318
Min length3

Characters and Unicode

Total characters269935
Distinct characters503
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2973 ?
Unique (%)31.5%

Sample

1st row서울특별시 중랑구 면목동 ***-** 예성그린빌 ***호
2nd row서울특별시 중랑구 망우동 ***번지 *호
3rd row서울특별시 중랑구 신내동 ***번지 우남푸르미아 ***동 ***호
4th row서울특별시 중랑구 면목동 ***-* 오페라하우스 ***호
5th row서울특별시 중랑구 신내동 646-0 금강 425호
ValueCountFrequency (%)
서울특별시 9370
16.5%
중랑구 9367
16.5%
8137
14.3%
4939
8.7%
번지 4572
8.0%
면목동 3110
 
5.5%
2142
 
3.8%
1599
 
2.8%
신내동 1378
 
2.4%
묵동 1260
 
2.2%
Other values (2058) 11048
19.4%
2024-04-18T05:17:06.976796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 65417
24.2%
47674
17.7%
11966
 
4.4%
10653
 
3.9%
9565
 
3.5%
9468
 
3.5%
9445
 
3.5%
9414
 
3.5%
9391
 
3.5%
9372
 
3.5%
Other values (493) 77570
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 149172
55.3%
Other Punctuation 65541
24.3%
Space Separator 47674
 
17.7%
Dash Punctuation 4317
 
1.6%
Uppercase Letter 1429
 
0.5%
Decimal Number 1325
 
0.5%
Lowercase Letter 413
 
0.2%
Open Punctuation 29
 
< 0.1%
Close Punctuation 29
 
< 0.1%
Math Symbol 4
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11966
 
8.0%
10653
 
7.1%
9565
 
6.4%
9468
 
6.3%
9445
 
6.3%
9414
 
6.3%
9391
 
6.3%
9372
 
6.3%
9372
 
6.3%
8592
 
5.8%
Other values (426) 51934
34.8%
Uppercase Letter
ValueCountFrequency (%)
B 251
17.6%
A 231
16.2%
S 155
10.8%
E 92
 
6.4%
T 87
 
6.1%
O 84
 
5.9%
R 82
 
5.7%
G 79
 
5.5%
W 76
 
5.3%
K 67
 
4.7%
Other values (14) 225
15.7%
Lowercase Letter
ValueCountFrequency (%)
e 120
29.1%
r 61
14.8%
c 58
14.0%
n 57
13.8%
t 56
13.6%
b 16
 
3.9%
a 15
 
3.6%
s 6
 
1.5%
h 5
 
1.2%
i 3
 
0.7%
Other values (9) 16
 
3.9%
Decimal Number
ValueCountFrequency (%)
1 316
23.8%
0 196
14.8%
2 184
13.9%
3 140
10.6%
4 124
 
9.4%
5 109
 
8.2%
6 76
 
5.7%
9 63
 
4.8%
8 60
 
4.5%
7 57
 
4.3%
Other Punctuation
ValueCountFrequency (%)
* 65417
99.8%
, 80
 
0.1%
. 28
 
< 0.1%
@ 12
 
< 0.1%
/ 2
 
< 0.1%
& 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 3
75.0%
= 1
 
25.0%
Space Separator
ValueCountFrequency (%)
47674
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4317
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 149170
55.3%
Common 118920
44.1%
Latin 1843
 
0.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11966
 
8.0%
10653
 
7.1%
9565
 
6.4%
9468
 
6.3%
9445
 
6.3%
9414
 
6.3%
9391
 
6.3%
9372
 
6.3%
9372
 
6.3%
8592
 
5.8%
Other values (425) 51932
34.8%
Latin
ValueCountFrequency (%)
B 251
13.6%
A 231
 
12.5%
S 155
 
8.4%
e 120
 
6.5%
E 92
 
5.0%
T 87
 
4.7%
O 84
 
4.6%
R 82
 
4.4%
G 79
 
4.3%
W 76
 
4.1%
Other values (34) 586
31.8%
Common
ValueCountFrequency (%)
* 65417
55.0%
47674
40.1%
- 4317
 
3.6%
1 316
 
0.3%
0 196
 
0.2%
2 184
 
0.2%
3 140
 
0.1%
4 124
 
0.1%
5 109
 
0.1%
, 80
 
0.1%
Other values (13) 363
 
0.3%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 149168
55.3%
ASCII 120762
44.7%
CJK 2
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 65417
54.2%
47674
39.5%
- 4317
 
3.6%
1 316
 
0.3%
B 251
 
0.2%
A 231
 
0.2%
0 196
 
0.2%
2 184
 
0.2%
S 155
 
0.1%
3 140
 
0.1%
Other values (56) 1881
 
1.6%
Hangul
ValueCountFrequency (%)
11966
 
8.0%
10653
 
7.1%
9565
 
6.4%
9468
 
6.3%
9445
 
6.3%
9414
 
6.3%
9391
 
6.3%
9372
 
6.3%
9372
 
6.3%
8592
 
5.8%
Other values (423) 51930
34.8%
CJK
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct4539
Distinct (%)50.2%
Missing959
Missing (%)9.6%
Memory size156.2 KiB
2024-04-18T05:17:07.169207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length54
Mean length35.210043
Min length20

Characters and Unicode

Total characters318334
Distinct characters484
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3297 ?
Unique (%)36.5%

Sample

1st row서울특별시 중랑구 동일로***길 **, ***호 (면목동, 예성그린빌)
2nd row서울특별시 중랑구 용마산로**가길 * (면목동)
3rd row서울특별시 중랑구 양원역로 **-* (망우동)
4th row서울특별시 중랑구 봉화산로**길 **, ***동 ***호 (신내동,우남푸르미아)
5th row서울특별시 중랑구 용마산로**길 **, ***호 (면목동, 오페라하우스)
ValueCountFrequency (%)
서울특별시 9037
15.0%
중랑구 9028
15.0%
8922
14.8%
5021
 
8.3%
면목동 2890
 
4.8%
2436
 
4.0%
1735
 
2.9%
신내동 1214
 
2.0%
상봉동 1150
 
1.9%
묵동 1134
 
1.9%
Other values (2163) 17668
29.3%
2024-04-18T05:17:07.493924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 59856
18.8%
51254
16.1%
13380
 
4.2%
11228
 
3.5%
, 10777
 
3.4%
9848
 
3.1%
9254
 
2.9%
9128
 
2.9%
9094
 
2.9%
9086
 
2.9%
Other values (474) 125429
39.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 173359
54.5%
Other Punctuation 70655
22.2%
Space Separator 51254
 
16.1%
Close Punctuation 9058
 
2.8%
Open Punctuation 9058
 
2.8%
Dash Punctuation 2363
 
0.7%
Uppercase Letter 1436
 
0.5%
Decimal Number 781
 
0.2%
Lowercase Letter 362
 
0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13380
 
7.7%
11228
 
6.5%
9848
 
5.7%
9254
 
5.3%
9128
 
5.3%
9094
 
5.2%
9086
 
5.2%
9054
 
5.2%
9040
 
5.2%
9039
 
5.2%
Other values (412) 75208
43.4%
Uppercase Letter
ValueCountFrequency (%)
B 277
19.3%
A 233
16.2%
S 153
10.7%
E 94
 
6.5%
T 89
 
6.2%
O 84
 
5.8%
R 84
 
5.8%
W 78
 
5.4%
G 78
 
5.4%
K 59
 
4.1%
Other values (14) 207
14.4%
Lowercase Letter
ValueCountFrequency (%)
e 111
30.7%
c 53
14.6%
n 53
14.6%
r 52
14.4%
t 51
14.1%
b 15
 
4.1%
a 7
 
1.9%
s 6
 
1.7%
i 3
 
0.8%
h 3
 
0.8%
Other values (6) 8
 
2.2%
Decimal Number
ValueCountFrequency (%)
1 189
24.2%
0 129
16.5%
2 122
15.6%
4 66
 
8.5%
3 61
 
7.8%
5 55
 
7.0%
6 50
 
6.4%
9 44
 
5.6%
7 38
 
4.9%
8 27
 
3.5%
Other Punctuation
ValueCountFrequency (%)
* 59856
84.7%
, 10777
 
15.3%
. 16
 
< 0.1%
/ 3
 
< 0.1%
& 2
 
< 0.1%
@ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
51254
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9058
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9058
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2363
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 173357
54.5%
Common 143176
45.0%
Latin 1799
 
0.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13380
 
7.7%
11228
 
6.5%
9848
 
5.7%
9254
 
5.3%
9128
 
5.3%
9094
 
5.2%
9086
 
5.2%
9054
 
5.2%
9040
 
5.2%
9039
 
5.2%
Other values (411) 75206
43.4%
Latin
ValueCountFrequency (%)
B 277
15.4%
A 233
13.0%
S 153
 
8.5%
e 111
 
6.2%
E 94
 
5.2%
T 89
 
4.9%
O 84
 
4.7%
R 84
 
4.7%
W 78
 
4.3%
G 78
 
4.3%
Other values (31) 518
28.8%
Common
ValueCountFrequency (%)
* 59856
41.8%
51254
35.8%
, 10777
 
7.5%
) 9058
 
6.3%
( 9058
 
6.3%
- 2363
 
1.7%
1 189
 
0.1%
0 129
 
0.1%
2 122
 
0.1%
4 66
 
< 0.1%
Other values (11) 304
 
0.2%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 173357
54.5%
ASCII 144974
45.5%
CJK 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 59856
41.3%
51254
35.4%
, 10777
 
7.4%
) 9058
 
6.2%
( 9058
 
6.2%
- 2363
 
1.6%
B 277
 
0.2%
A 233
 
0.2%
1 189
 
0.1%
S 153
 
0.1%
Other values (51) 1756
 
1.2%
Hangul
ValueCountFrequency (%)
13380
 
7.7%
11228
 
6.5%
9848
 
5.7%
9254
 
5.3%
9128
 
5.3%
9094
 
5.2%
9086
 
5.2%
9054
 
5.2%
9040
 
5.2%
9039
 
5.2%
Other values (411) 75206
43.4%
CJK
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct431
Distinct (%)5.6%
Missing2311
Missing (%)23.1%
Memory size156.2 KiB
2024-04-18T05:17:07.752301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1515152
Min length5

Characters and Unicode

Total characters39610
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)0.7%

Sample

1st row02130
2nd row131811
3rd row02064
4th row02208
5th row02213
ValueCountFrequency (%)
02076 278
 
3.6%
02055 132
 
1.7%
02122 90
 
1.2%
02054 76
 
1.0%
02262 73
 
0.9%
02007 70
 
0.9%
02057 65
 
0.8%
02175 63
 
0.8%
131230 61
 
0.8%
131200 58
 
0.8%
Other values (421) 6723
87.4%
2024-04-18T05:17:08.122023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11142
28.1%
2 9881
24.9%
1 6268
15.8%
3 2818
 
7.1%
5 1829
 
4.6%
8 1802
 
4.5%
7 1723
 
4.3%
6 1558
 
3.9%
4 1472
 
3.7%
9 1067
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39560
99.9%
Dash Punctuation 50
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11142
28.2%
2 9881
25.0%
1 6268
15.8%
3 2818
 
7.1%
5 1829
 
4.6%
8 1802
 
4.6%
7 1723
 
4.4%
6 1558
 
3.9%
4 1472
 
3.7%
9 1067
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 39610
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11142
28.1%
2 9881
24.9%
1 6268
15.8%
3 2818
 
7.1%
5 1829
 
4.6%
8 1802
 
4.5%
7 1723
 
4.3%
6 1558
 
3.9%
4 1472
 
3.7%
9 1067
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39610
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11142
28.1%
2 9881
24.9%
1 6268
15.8%
3 2818
 
7.1%
5 1829
 
4.6%
8 1802
 
4.5%
7 1723
 
4.3%
6 1558
 
3.9%
4 1472
 
3.7%
9 1067
 
2.7%
Distinct9765
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-18T05:17:08.437346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length41
Mean length6.2667
Min length1

Characters and Unicode

Total characters62667
Distinct characters1101
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9551 ?
Unique (%)95.5%

Sample

1st row제이디인터내셔널
2nd row마싯게도 냠냠(masitgedo yumyum)
3rd row햄비가
4th row치마바람
5th row바라래컴퍼니
ValueCountFrequency (%)
주식회사 348
 
2.9%
컴퍼니 52
 
0.4%
41
 
0.3%
company 38
 
0.3%
19
 
0.2%
코리아 14
 
0.1%
디자인 13
 
0.1%
12
 
0.1%
korea 12
 
0.1%
스튜디오 12
 
0.1%
Other values (10676) 11573
95.4%
2024-04-18T05:17:08.860891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2397
 
3.8%
2137
 
3.4%
1868
 
3.0%
( 1584
 
2.5%
) 1583
 
2.5%
1061
 
1.7%
865
 
1.4%
791
 
1.3%
728
 
1.2%
706
 
1.1%
Other values (1091) 48947
78.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45590
72.7%
Lowercase Letter 5906
 
9.4%
Uppercase Letter 4889
 
7.8%
Space Separator 2137
 
3.4%
Open Punctuation 1586
 
2.5%
Close Punctuation 1586
 
2.5%
Decimal Number 564
 
0.9%
Other Punctuation 318
 
0.5%
Dash Punctuation 64
 
0.1%
Connector Punctuation 17
 
< 0.1%
Other values (2) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2397
 
5.3%
1868
 
4.1%
1061
 
2.3%
865
 
1.9%
791
 
1.7%
728
 
1.6%
706
 
1.5%
696
 
1.5%
647
 
1.4%
602
 
1.3%
Other values (1004) 35229
77.3%
Lowercase Letter
ValueCountFrequency (%)
e 704
11.9%
o 587
 
9.9%
a 545
 
9.2%
n 454
 
7.7%
i 447
 
7.6%
r 362
 
6.1%
t 340
 
5.8%
l 322
 
5.5%
s 261
 
4.4%
m 239
 
4.0%
Other values (16) 1645
27.9%
Uppercase Letter
ValueCountFrequency (%)
O 389
 
8.0%
A 368
 
7.5%
E 334
 
6.8%
S 313
 
6.4%
N 310
 
6.3%
C 293
 
6.0%
M 271
 
5.5%
I 257
 
5.3%
L 247
 
5.1%
T 226
 
4.6%
Other values (16) 1881
38.5%
Other Punctuation
ValueCountFrequency (%)
. 154
48.4%
& 81
25.5%
, 33
 
10.4%
' 21
 
6.6%
? 10
 
3.1%
# 6
 
1.9%
/ 3
 
0.9%
: 2
 
0.6%
! 2
 
0.6%
@ 2
 
0.6%
Other values (3) 4
 
1.3%
Decimal Number
ValueCountFrequency (%)
2 121
21.5%
1 104
18.4%
0 69
12.2%
4 62
11.0%
9 45
 
8.0%
5 44
 
7.8%
3 38
 
6.7%
6 29
 
5.1%
7 29
 
5.1%
8 23
 
4.1%
Close Punctuation
ValueCountFrequency (%)
) 1583
99.8%
] 2
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1584
99.9%
[ 2
 
0.1%
Other Symbol
ValueCountFrequency (%)
7
87.5%
° 1
 
12.5%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
2137
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45581
72.7%
Latin 10795
 
17.2%
Common 6275
 
10.0%
Han 14
 
< 0.1%
Hiragana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2397
 
5.3%
1868
 
4.1%
1061
 
2.3%
865
 
1.9%
791
 
1.7%
728
 
1.6%
706
 
1.5%
696
 
1.5%
647
 
1.4%
602
 
1.3%
Other values (990) 35220
77.3%
Latin
ValueCountFrequency (%)
e 704
 
6.5%
o 587
 
5.4%
a 545
 
5.0%
n 454
 
4.2%
i 447
 
4.1%
O 389
 
3.6%
A 368
 
3.4%
r 362
 
3.4%
t 340
 
3.1%
E 334
 
3.1%
Other values (42) 6265
58.0%
Common
ValueCountFrequency (%)
2137
34.1%
( 1584
25.2%
) 1583
25.2%
. 154
 
2.5%
2 121
 
1.9%
1 104
 
1.7%
& 81
 
1.3%
0 69
 
1.1%
- 64
 
1.0%
4 62
 
1.0%
Other values (24) 316
 
5.0%
Han
ValueCountFrequency (%)
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (3) 3
21.4%
Hiragana
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45574
72.7%
ASCII 17065
 
27.2%
CJK 14
 
< 0.1%
None 12
 
< 0.1%
Hiragana 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2397
 
5.3%
1868
 
4.1%
1061
 
2.3%
865
 
1.9%
791
 
1.7%
728
 
1.6%
706
 
1.5%
696
 
1.5%
647
 
1.4%
602
 
1.3%
Other values (989) 35213
77.3%
ASCII
ValueCountFrequency (%)
2137
 
12.5%
( 1584
 
9.3%
) 1583
 
9.3%
e 704
 
4.1%
o 587
 
3.4%
a 545
 
3.2%
n 454
 
2.7%
i 447
 
2.6%
O 389
 
2.3%
A 368
 
2.2%
Other values (72) 8267
48.4%
None
ValueCountFrequency (%)
7
58.3%
2
 
16.7%
1
 
8.3%
° 1
 
8.3%
1
 
8.3%
CJK
ValueCountFrequency (%)
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (3) 3
21.4%
Hiragana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct9470
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-21 01:10:49
Maximum2024-04-16 16:27:03
2024-04-18T05:17:08.974741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:09.081380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7252 
U
2748 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowU

Common Values

ValueCountFrequency (%)
I 7252
72.5%
U 2748
 
27.5%

Length

2024-04-18T05:17:09.175947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:09.246091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7252
72.5%
u 2748
 
27.5%
Distinct1565
Distinct (%)15.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:07:00
2024-04-18T05:17:09.323814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:17:09.424209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct417
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-18T05:17:09.581039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.6985
Min length1

Characters and Unicode

Total characters86985
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique264 ?
Unique (%)2.6%

Sample

1st row의류/패션/잡화/뷰티
2nd row건강/식품
3rd row의류/패션/잡화/뷰티
4th row의류/패션/잡화/뷰티
5th row컴퓨터/사무용품 가구/수납용품 건강/식품
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4211
31.7%
종합몰 2872
21.6%
기타 1853
14.0%
건강/식품 850
 
6.4%
763
 
5.8%
교육/도서/완구/오락 558
 
4.2%
컴퓨터/사무용품 527
 
4.0%
가구/수납용품 467
 
3.5%
가전 465
 
3.5%
자동차/자동차용품 301
 
2.3%
Other values (3) 401
 
3.0%
2024-04-18T05:17:09.827234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 17052
19.6%
4211
 
4.8%
4211
 
4.8%
4211
 
4.8%
4211
 
4.8%
4211
 
4.8%
4211
 
4.8%
4211
 
4.8%
4211
 
4.8%
3268
 
3.8%
Other values (41) 32977
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65902
75.8%
Other Punctuation 17052
 
19.6%
Space Separator 3268
 
3.8%
Dash Punctuation 763
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
2872
 
4.4%
2872
 
4.4%
Other values (38) 26470
40.2%
Other Punctuation
ValueCountFrequency (%)
/ 17052
100.0%
Space Separator
ValueCountFrequency (%)
3268
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 763
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65902
75.8%
Common 21083
 
24.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
2872
 
4.4%
2872
 
4.4%
Other values (38) 26470
40.2%
Common
ValueCountFrequency (%)
/ 17052
80.9%
3268
 
15.5%
- 763
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65902
75.8%
ASCII 21083
 
24.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 17052
80.9%
3268
 
15.5%
- 763
 
3.6%
Hangul
ValueCountFrequency (%)
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
4211
 
6.4%
2872
 
4.4%
2872
 
4.4%
Other values (38) 26470
40.2%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct4960
Distinct (%)54.5%
Missing905
Missing (%)9.0%
Infinite0
Infinite (%)0.0%
Mean207670.15
Minimum193201.64
Maximum227269.31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-18T05:17:09.925489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum193201.64
5-th percentile206502.54
Q1206974.1
median207605.32
Q3208294.26
95-th percentile209085.93
Maximum227269.31
Range34067.67
Interquartile range (IQR)1320.1615

Descriptive statistics

Standard deviation868.31653
Coefficient of variation (CV)0.0041812294
Kurtosis37.572309
Mean207670.15
Median Absolute Deviation (MAD)663.54153
Skewness1.170878
Sum1.88876 × 109
Variance753973.59
MonotonicityNot monotonic
2024-04-18T05:17:10.024554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
208295.099818379 254
 
2.5%
206974.095837709 69
 
0.7%
209085.926142332 59
 
0.6%
208842.713556665 58
 
0.6%
207163.791145804 53
 
0.5%
208294.257367707 51
 
0.5%
206595.707775401 48
 
0.5%
207966.140189875 46
 
0.5%
207423.898976834 46
 
0.5%
207982.810825453 45
 
0.4%
Other values (4950) 8366
83.7%
(Missing) 905
 
9.0%
ValueCountFrequency (%)
193201.644480588 1
 
< 0.1%
201508.726469982 1
 
< 0.1%
204283.264985792 1
 
< 0.1%
206204.782194072 1
 
< 0.1%
206214.154363545 1
 
< 0.1%
206216.81258777 2
< 0.1%
206218.223856652 1
 
< 0.1%
206221.262759294 2
< 0.1%
206221.462480638 1
 
< 0.1%
206221.834183749 3
< 0.1%
ValueCountFrequency (%)
227269.314671806 1
 
< 0.1%
216668.936475106 1
 
< 0.1%
210112.539634159 1
 
< 0.1%
210010.784397164 1
 
< 0.1%
209963.045464704 1
 
< 0.1%
209938.946080275 1
 
< 0.1%
209931.172836 36
0.4%
209929.109464013 1
 
< 0.1%
209921.255674935 2
 
< 0.1%
209874.832484197 2
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct4959
Distinct (%)54.5%
Missing905
Missing (%)9.0%
Infinite0
Infinite (%)0.0%
Mean454969.96
Minimum442771.62
Maximum461263.19
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-18T05:17:10.139289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum442771.62
5-th percentile452800.33
Q1454062.03
median454980.02
Q3455974.03
95-th percentile457113.64
Maximum461263.19
Range18491.576
Interquartile range (IQR)1912.0008

Descriptive statistics

Standard deviation1296.1705
Coefficient of variation (CV)0.0028489143
Kurtosis0.43006451
Mean454969.96
Median Absolute Deviation (MAD)943.77995
Skewness-0.1574477
Sum4.1379518 × 109
Variance1680057.8
MonotonicityNot monotonic
2024-04-18T05:17:10.254056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
456014.999180519 254
 
2.5%
454601.381554597 69
 
0.7%
457283.215342184 59
 
0.6%
455205.504339797 58
 
0.6%
454984.412728653 53
 
0.5%
456632.108607443 51
 
0.5%
453913.427011089 48
 
0.5%
456995.441709842 46
 
0.5%
457122.700436003 46
 
0.5%
454980.023827024 45
 
0.4%
Other values (4949) 8366
83.7%
(Missing) 905
 
9.0%
ValueCountFrequency (%)
442771.616373056 1
< 0.1%
445879.640829807 1
< 0.1%
451210.070892418 1
< 0.1%
451486.158412733 1
< 0.1%
452074.882858795 1
< 0.1%
452075.948613979 2
< 0.1%
452077.782812274 1
< 0.1%
452078.001484822 2
< 0.1%
452087.574394704 1
< 0.1%
452096.276883247 1
< 0.1%
ValueCountFrequency (%)
461263.19215372 1
 
< 0.1%
460743.124910745 1
 
< 0.1%
458123.69532995 1
 
< 0.1%
457702.631123 23
0.2%
457507.041329636 1
 
< 0.1%
457482.82473114 1
 
< 0.1%
457478.009625484 1
 
< 0.1%
457465.83066498 1
 
< 0.1%
457462.766652935 2
 
< 0.1%
457452.194521115 1
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9402 
0
 
598

Length

Max length4
Median length4
Mean length3.8206
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9402
94.0%
0 598
 
6.0%

Length

2024-04-18T05:17:10.356843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:10.426031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9402
94.0%
0 598
 
6.0%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9402 
0
 
598

Length

Max length4
Median length4
Mean length3.8206
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9402
94.0%
0 598
 
6.0%

Length

2024-04-18T05:17:10.500156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:10.572342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9402
94.0%
0 598
 
6.0%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9402 
0
 
598

Length

Max length4
Median length4
Mean length3.8206
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9402
94.0%
0 598
 
6.0%

Length

2024-04-18T05:17:10.647995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T05:17:10.721497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9402
94.0%
0 598
 
6.0%

판매방식명
Categorical

IMBALANCE 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5133 
인터넷
4663 
인터넷, 기타
 
73
기타
 
26
TV홈쇼핑, 인터넷
 
25
Other values (15)
 
80

Length

Max length26
Median length4
Mean length3.6276
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row인터넷
4th row인터넷
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5133
51.3%
인터넷 4663
46.6%
인터넷, 기타 73
 
0.7%
기타 26
 
0.3%
TV홈쇼핑, 인터넷 25
 
0.2%
TV홈쇼핑 18
 
0.2%
인터넷, 카다로그 18
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 8
 
0.1%
인터넷, 카다로그, 기타 7
 
0.1%
인터넷, 카다로그, 신문잡지 6
 
0.1%
Other values (10) 23
 
0.2%

Length

2024-04-18T05:17:10.797309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5133
50.3%
인터넷 4817
47.2%
기타 122
 
1.2%
tv홈쇼핑 59
 
0.6%
카다로그 54
 
0.5%
신문잡지 28
 
0.3%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
142953060000202130601763020002220210106<NA>1영업/정상1정상영업<NA><NA><NA><NA>.<NA><NA>서울특별시 중랑구 면목동 ***-** 예성그린빌 ***호서울특별시 중랑구 동일로***길 **, ***호 (면목동, 예성그린빌)02130제이디인터내셔널2021-01-06 17:45:27I2021-01-08 00:23:04.0의류/패션/잡화/뷰티206928.153526454104.449533<NA><NA><NA>인터넷
70843060000201430600953020076220141204<NA>3폐업3폐업처리20150701<NA><NA><NA>070-8119-9273<NA><NA><NA>서울특별시 중랑구 용마산로**가길 * (면목동)131811마싯게도 냠냠(masitgedo yumyum)2015-07-01 10:28:52I2018-08-31 23:59:59.0건강/식품208538.275192454070.9005<NA><NA><NA>인터넷
128493060000202030601763020073520200507<NA>1영업/정상1정상영업<NA><NA><NA><NA>.<NA><NA>서울특별시 중랑구 망우동 ***번지 *호서울특별시 중랑구 양원역로 **-* (망우동)02064햄비가2020-05-08 10:58:07I2020-05-10 00:23:20.0의류/패션/잡화/뷰티209454.475794455520.597493<NA><NA><NA>인터넷
18843060000200630600953020050020060907<NA>1영업/정상1정상영업<NA><NA><NA><NA>02<NA>131865서울특별시 중랑구 신내동 ***번지 우남푸르미아 ***동 ***호서울특별시 중랑구 봉화산로**길 **, ***동 ***호 (신내동,우남푸르미아)<NA>치마바람2008-05-08 13:40:34I2018-08-31 23:59:59.0의류/패션/잡화/뷰티208657.879149455987.655282<NA><NA><NA>인터넷
17090306000020223060176302007042022-04-29<NA>5제외/삭제/전출5타시군구이관2024-01-30<NA><NA><NA><NA><NA><NA>서울특별시 중랑구 면목동 ***-* 오페라하우스 ***호서울특별시 중랑구 용마산로**길 **, ***호 (면목동, 오페라하우스)02208바라래컴퍼니2024-01-30 14:40:05U2023-12-02 00:01:00.0컴퓨터/사무용품 가구/수납용품 건강/식품208029.268365453580.137315<NA><NA><NA><NA>
18553060000200630600953020046520060811<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02<NA><NA>서울특별시 중랑구 신내동 646-0 금강 425호<NA><NA>놀라운기술2022-12-15 10:13:34U2021-11-01 23:07:00.0종합몰<NA><NA><NA><NA><NA><NA>
179933060000202230601763020162120221018<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 중랑구 면목동 ***-* *층서울특별시 중랑구 면목로**나길 *-*, *층 (면목동)02213바우어2022-10-19 09:14:11I2021-10-30 22:01:00.0기타 교육/도서/완구/오락 상품권 레져/여행/공연207927.880397453231.666698<NA><NA><NA><NA>
167123060000202230601763020032320090408<NA>1영업/정상1정상영업<NA><NA><NA><NA>466-8259<NA><NA>서울특별시 중랑구 묵동 *** 묵동아이파크아파트 ***동 ****호서울특별시 중랑구 중랑천로 ***, ***동 ****호 (묵동, 묵동아이파크아파트)02007안전252022-02-22 16:05:44I2022-02-24 00:22:37.0기타206434.297984456466.677675000인터넷
172053060000202230601763020082220220524<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 중랑구 면목동 **** 면목현대아파트 ***동 ***호서울특별시 중랑구 용마산로 ***, ***동 ***호 (면목동, 면목현대아파트)02258메이드영원2022-05-24 14:30:02I2021-12-04 22:06:00.0의류/패션/잡화/뷰티207861.870603452666.306495<NA><NA><NA><NA>
106423060000201830601763020014720181114<NA>3폐업3폐업처리20220603<NA><NA><NA><NA><NA><NA>서울특별시 중랑구 망우동 ***번지 **호 *층서울특별시 중랑구 봉우재로 ***, *층 (망우동)02171MK상사2022-06-07 10:54:30U2021-12-06 00:09:00.0자동차/자동차용품208430.335949454750.415858<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
84663060000201630601453020066720160909<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>.<NA><NA>서울특별시 중랑구 중화동 **번지 *호서울특별시 중랑구 동일로***가길 *, *층 ***호 (중화동)02050제이제이 컴퍼니2022-05-06 14:18:05U2021-12-05 00:08:00.0의류/패션/잡화/뷰티207099.431229455615.471685<NA><NA><NA><NA>
10747306000020183060176302002592018-12-12<NA>5제외/삭제/전출5타시군구이관2023-08-07<NA><NA><NA><NA><NA><NA>서울특별시 중랑구 면목동 ***번지 **호 *층서울특별시 중랑구 면목로**길 **-*, *층 (면목동)02227늘봄인터내셔널2023-08-07 10:15:08U2022-12-08 00:09:00.0건강/식품207444.442837453373.703936<NA><NA><NA><NA>
11613060000200530600953020019420050614<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 438 1328<NA><NA>서울특별시 중랑구 면목동 ***-*** 리더스 빌딩*층<NA><NA>(주)아이엠 몰2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
18632306000020233060202302003382023-02-10<NA>3폐업3폐업처리2023-02-13<NA><NA><NA><NA><NA><NA>서울특별시 중랑구 묵동 ***-*** 에코팰리스 *층 ***호서울특별시 중랑구 동일로***길 **, *층 ***호 (묵동, 에코팰리스)02007행복한우리세상2023-02-13 14:58:37I2022-12-01 23:05:00.0종합몰206572.429842456391.751581<NA><NA><NA><NA>
47163060000201130600953020024920110413<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA>131878서울특별시 중랑구 중화동 ***번지 **호 **통 *반 성우그린맨션 B동 ***호서울특별시 중랑구 망우로**길 *, B동 ***호 (중화동,성우그린맨션)<NA>APOLLO2015-05-07 16:38:10I2018-08-31 23:59:59.0의류/패션/잡화/뷰티206231.444239454559.30644<NA><NA><NA>인터넷
18667306000020233060202302003772023-02-16<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 중랑구 상봉동 ** 건영*차아파트 ***동 ****호서울특별시 중랑구 신내로*나길 **, ***동 ****호 (상봉동, 건영*차아파트)02084셉트2023-02-17 09:10:19I2022-12-01 23:09:00.0의류/패션/잡화/뷰티208136.90873455433.855856<NA><NA><NA><NA>
48533060000201130600953020039720110616<NA>3폐업3폐업처리20181226<NA><NA><NA>02 433 5672<NA><NA>서울특별시 중랑구 신내동 ***번지 *호서울특별시 중랑구 신내로 ** (신내동)<NA>사회복지법인 유린보은동산2018-12-26 13:38:37U2018-12-28 02:40:00.0기타208408.995081455768.04157<NA><NA><NA>인터넷
66243060000201430600953020023120140407<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02-435-9225<NA><NA>서울특별시 중랑구 망우동 ***번지서울특별시 중랑구 용마산로***길 ** (망우동)131230제이디(J2D)2022-04-25 16:41:06U2021-12-03 22:07:00.0의류/패션/잡화/뷰티209185.183939455255.655186<NA><NA><NA><NA>
100013060000201830601453020036020140825<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6007-2151<NA><NA>서울특별시 중랑구 신내동 ***번지 디아뜨갤러리 ***호서울특별시 중랑구 신내로 ***, 디아뜨갤러리 ***호 (신내동)02024파라테라코리아 (주)2019-06-24 10:05:12I2021-12-03 22:02:00.0종합몰 의류/패션/잡화/뷰티207990.479257457183.637796<NA><NA><NA><NA>
64223060000201330600953020067420131205<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-491-5797<NA><NA><NA>서울특별시 중랑구 면목로**길 **, ***호 (면목동)131818실로암식품2013-12-05 12:54:34I2018-08-31 23:59:59.0건강/식품207675.82958453768.646549<NA><NA><NA>인터넷