Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells79030
Missing cells (%)27.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical9
Numeric7
DateTime5
Text7
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author송파구
URLhttps://data.seoul.go.kr/dataList/OA-18825/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (86.5%)Imbalance
부채총액 is highly imbalanced (86.5%)Imbalance
자본금 is highly imbalanced (86.5%)Imbalance
판매방식명 is highly imbalanced (71.2%)Imbalance
인허가취소일자 has 9989 (99.9%) missing valuesMissing
폐업일자 has 6417 (64.2%) missing valuesMissing
휴업시작일자 has 9967 (99.7%) missing valuesMissing
휴업종료일자 has 9967 (99.7%) missing valuesMissing
재개업일자 has 9985 (99.9%) missing valuesMissing
전화번호 has 3568 (35.7%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7998 (80.0%) missing valuesMissing
지번주소 has 1470 (14.7%) missing valuesMissing
도로명주소 has 1948 (19.5%) missing valuesMissing
도로명우편번호 has 3611 (36.1%) missing valuesMissing
좌표정보(X) has 2055 (20.5%) missing valuesMissing
좌표정보(Y) has 2055 (20.5%) missing valuesMissing
좌표정보(X) is highly skewed (γ1 = 39.20068538)Skewed
좌표정보(Y) is highly skewed (γ1 = -60.34409913)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 12:59:42.262890
Analysis finished2024-04-06 12:59:46.320834
Duration4.06 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3230000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3230000
2nd row3230000
3rd row3230000
4th row3230000
5th row3230000

Common Values

ValueCountFrequency (%)
3230000 10000
100.0%

Length

2024-04-06T21:59:46.440816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:59:46.635627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3230000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0132344 × 1018
Minimum1.996323 × 1018
Maximum2.020323 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T21:59:46.861212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.996323 × 1018
5-th percentile2.003323 × 1018
Q12.009323 × 1018
median2.014323 × 1018
Q32.018323 × 1018
95-th percentile2.020323 × 1018
Maximum2.020323 × 1018
Range2.4000013 × 1016
Interquartile range (IQR)9.0000126 × 1015

Descriptive statistics

Standard deviation5.5766402 × 1015
Coefficient of variation (CV)0.0027699905
Kurtosis-0.88582858
Mean2.0132344 × 1018
Median Absolute Deviation (MAD)4.0000067 × 1015
Skewness-0.47570174
Sum6.9464155 × 1018
Variance3.1098916 × 1031
MonotonicityNot monotonic
2024-04-06T21:59:47.152495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2012323019830200518 1
 
< 0.1%
2018323025730200978 1
 
< 0.1%
2019323026130200247 1
 
< 0.1%
2004323013130201998 1
 
< 0.1%
2005323013130203286 1
 
< 0.1%
2015323023130201057 1
 
< 0.1%
2014323019830200602 1
 
< 0.1%
2017323023130201697 1
 
< 0.1%
2009323013130205251 1
 
< 0.1%
2011323019830201724 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1996323013130200008 1
< 0.1%
1996323013130200063 1
< 0.1%
1996323013130200078 1
< 0.1%
1996323013130200135 1
< 0.1%
1996323013130200213 1
< 0.1%
1996323013130200318 1
< 0.1%
1996323013130200360 1
< 0.1%
1996323013130200361 1
< 0.1%
1996323013130200374 1
< 0.1%
1996323013130200381 1
< 0.1%
ValueCountFrequency (%)
2020323026130202987 1
< 0.1%
2020323026130202982 1
< 0.1%
2020323026130202981 1
< 0.1%
2020323026130202980 1
< 0.1%
2020323026130202979 1
< 0.1%
2020323026130202976 1
< 0.1%
2020323026130202973 1
< 0.1%
2020323026130202972 1
< 0.1%
2020323026130202970 1
< 0.1%
2020323026130202967 1
< 0.1%
Distinct3881
Distinct (%)38.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-07-22 00:00:00
Maximum2020-08-14 00:00:00
2024-04-06T21:59:47.457938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T21:59:47.754853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Real number (ℝ)

MISSING 

Distinct10
Distinct (%)90.9%
Missing9989
Missing (%)99.9%
Infinite0
Infinite (%)0.0%
Mean20149680
Minimum20080718
Maximum20200219
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T21:59:47.987561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20080718
5-th percentile20110468
Q120140265
median20160420
Q320160954
95-th percentile20180724
Maximum20200219
Range119501
Interquartile range (IQR)20689.5

Descriptive statistics

Standard deviation28778.066
Coefficient of variation (CV)0.0014282145
Kurtosis3.6516595
Mean20149680
Median Absolute Deviation (MAD)19804
Skewness-1.0009422
Sum2.2164648 × 108
Variance8.2817707 × 108
MonotonicityNot monotonic
2024-04-06T21:59:48.214803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
20140217 2
 
< 0.1%
20160624 1
 
< 0.1%
20161205 1
 
< 0.1%
20140313 1
 
< 0.1%
20160704 1
 
< 0.1%
20080718 1
 
< 0.1%
20200219 1
 
< 0.1%
20160420 1
 
< 0.1%
20161228 1
 
< 0.1%
20140616 1
 
< 0.1%
(Missing) 9989
99.9%
ValueCountFrequency (%)
20080718 1
< 0.1%
20140217 2
< 0.1%
20140313 1
< 0.1%
20140616 1
< 0.1%
20160420 1
< 0.1%
20160624 1
< 0.1%
20160704 1
< 0.1%
20161205 1
< 0.1%
20161228 1
< 0.1%
20200219 1
< 0.1%
ValueCountFrequency (%)
20200219 1
< 0.1%
20161228 1
< 0.1%
20161205 1
< 0.1%
20160704 1
< 0.1%
20160624 1
< 0.1%
20160420 1
< 0.1%
20140616 1
< 0.1%
20140313 1
< 0.1%
20140217 2
< 0.1%
20080718 1
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4628 
3
3063 
4
1733 
5
547 
2
 
29

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row1
3rd row1
4th row3
5th row3

Common Values

ValueCountFrequency (%)
1 4628
46.3%
3 3063
30.6%
4 1733
 
17.3%
5 547
 
5.5%
2 29
 
0.3%

Length

2024-04-06T21:59:48.470062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:59:48.660661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4628
46.3%
3 3063
30.6%
4 1733
 
17.3%
5 547
 
5.5%
2 29
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4628 
폐업
3063 
취소/말소/만료/정지/중지
1733 
제외/삭제/전출
547 
휴업
 
29

Length

Max length14
Median length8
Mean length5.7962
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row영업/정상
3rd row영업/정상
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
영업/정상 4628
46.3%
폐업 3063
30.6%
취소/말소/만료/정지/중지 1733
 
17.3%
제외/삭제/전출 547
 
5.5%
휴업 29
 
0.3%

Length

2024-04-06T21:59:48.894062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:59:49.079156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4628
46.3%
폐업 3063
30.6%
취소/말소/만료/정지/중지 1733
 
17.3%
제외/삭제/전출 547
 
5.5%
휴업 29
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8696
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T21:59:49.314574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.1924134
Coefficient of variation (CV)0.76401359
Kurtosis-0.54127024
Mean2.8696
Median Absolute Deviation (MAD)2
Skewness0.91232518
Sum28696
Variance4.8066765
MonotonicityNot monotonic
2024-04-06T21:59:49.614801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 4628
46.3%
3 3063
30.6%
7 1718
 
17.2%
5 547
 
5.5%
2 29
 
0.3%
4 15
 
0.1%
ValueCountFrequency (%)
1 4628
46.3%
2 29
 
0.3%
3 3063
30.6%
4 15
 
0.1%
5 547
 
5.5%
7 1718
 
17.2%
ValueCountFrequency (%)
7 1718
 
17.2%
5 547
 
5.5%
4 15
 
0.1%
3 3063
30.6%
2 29
 
0.3%
1 4628
46.3%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4628 
폐업처리
3063 
직권말소
1718 
타시군구이관
547 
휴업처리
 
29

Length

Max length6
Median length4
Mean length4.1094
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업처리
2nd row정상영업
3rd row정상영업
4th row폐업처리
5th row폐업처리

Common Values

ValueCountFrequency (%)
정상영업 4628
46.3%
폐업처리 3063
30.6%
직권말소 1718
 
17.2%
타시군구이관 547
 
5.5%
휴업처리 29
 
0.3%
직권취소 15
 
0.1%

Length

2024-04-06T21:59:49.835677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:59:50.033385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4628
46.3%
폐업처리 3063
30.6%
직권말소 1718
 
17.2%
타시군구이관 547
 
5.5%
휴업처리 29
 
0.3%
직권취소 15
 
0.1%

폐업일자
Text

MISSING 

Distinct2364
Distinct (%)66.0%
Missing6417
Missing (%)64.2%
Memory size156.2 KiB
2024-04-06T21:59:50.570490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length8.1417806
Min length8

Characters and Unicode

Total characters29172
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1542 ?
Unique (%)43.0%

Sample

1st row20210129
2nd row20121205
3rd row2023-03-24
4th row2023-03-23
5th row20140526
ValueCountFrequency (%)
20200115 9
 
0.3%
20201231 7
 
0.2%
20061229 7
 
0.2%
20191231 7
 
0.2%
20180119 6
 
0.2%
20200324 6
 
0.2%
20080714 6
 
0.2%
2024-01-12 5
 
0.1%
20191230 5
 
0.1%
20180117 5
 
0.1%
Other values (2354) 3520
98.2%
2024-04-06T21:59:51.425756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9129
31.3%
2 7391
25.3%
1 5253
18.0%
3 1418
 
4.9%
4 1018
 
3.5%
7 931
 
3.2%
9 911
 
3.1%
8 883
 
3.0%
6 870
 
3.0%
5 860
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 28664
98.3%
Dash Punctuation 508
 
1.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9129
31.8%
2 7391
25.8%
1 5253
18.3%
3 1418
 
4.9%
4 1018
 
3.6%
7 931
 
3.2%
9 911
 
3.2%
8 883
 
3.1%
6 870
 
3.0%
5 860
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 508
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29172
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9129
31.3%
2 7391
25.3%
1 5253
18.0%
3 1418
 
4.9%
4 1018
 
3.5%
7 931
 
3.2%
9 911
 
3.1%
8 883
 
3.0%
6 870
 
3.0%
5 860
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29172
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9129
31.3%
2 7391
25.3%
1 5253
18.0%
3 1418
 
4.9%
4 1018
 
3.5%
7 931
 
3.2%
9 911
 
3.1%
8 883
 
3.0%
6 870
 
3.0%
5 860
 
2.9%

휴업시작일자
Date

MISSING 

Distinct33
Distinct (%)100.0%
Missing9967
Missing (%)99.7%
Memory size156.2 KiB
Minimum2004-12-17 00:00:00
Maximum2024-03-27 00:00:00
2024-04-06T21:59:51.698920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T21:59:51.934773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

휴업종료일자
Date

MISSING 

Distinct33
Distinct (%)100.0%
Missing9967
Missing (%)99.7%
Memory size156.2 KiB
Minimum2005-12-06 00:00:00
Maximum2030-12-31 00:00:00
2024-04-06T21:59:52.171414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T21:59:52.406423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

재개업일자
Real number (ℝ)

MISSING 

Distinct15
Distinct (%)100.0%
Missing9985
Missing (%)99.9%
Infinite0
Infinite (%)0.0%
Mean20100087
Minimum20060217
Maximum20200928
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T21:59:52.619004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20060217
5-th percentile20060700
Q120070659
median20070914
Q320110717
95-th percentile20193643
Maximum20200928
Range140711
Interquartile range (IQR)40058

Descriptive statistics

Standard deviation50244.226
Coefficient of variation (CV)0.0024997019
Kurtosis0.11972965
Mean20100087
Median Absolute Deviation (MAD)10007
Skewness1.3046698
Sum3.015013 × 108
Variance2.5244822 × 109
MonotonicityNot monotonic
2024-04-06T21:59:52.842477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
20181221 1
 
< 0.1%
20070904 1
 
< 0.1%
20060217 1
 
< 0.1%
20070914 1
 
< 0.1%
20090909 1
 
< 0.1%
20070515 1
 
< 0.1%
20070803 1
 
< 0.1%
20080306 1
 
< 0.1%
20060914 1
 
< 0.1%
20060907 1
 
< 0.1%
Other values (5) 5
 
0.1%
(Missing) 9985
99.9%
ValueCountFrequency (%)
20060217 1
< 0.1%
20060907 1
< 0.1%
20060914 1
< 0.1%
20070515 1
< 0.1%
20070803 1
< 0.1%
20070809 1
< 0.1%
20070904 1
< 0.1%
20070914 1
< 0.1%
20080306 1
< 0.1%
20090909 1
< 0.1%
ValueCountFrequency (%)
20200928 1
< 0.1%
20190521 1
< 0.1%
20181221 1
< 0.1%
20130212 1
< 0.1%
20091222 1
< 0.1%
20090909 1
< 0.1%
20080306 1
< 0.1%
20070914 1
< 0.1%
20070904 1
< 0.1%
20070809 1
< 0.1%

전화번호
Text

MISSING 

Distinct6126
Distinct (%)95.2%
Missing3568
Missing (%)35.7%
Memory size156.2 KiB
2024-04-06T21:59:53.460979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length10.892568
Min length1

Characters and Unicode

Total characters70061
Distinct characters17
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6011 ?
Unique (%)93.5%

Sample

1st row2157-2006
2nd row070-7514-2500
3rd row02 4861 69
4th row070-7813-8746
5th row02-3210-1588
ValueCountFrequency (%)
02 1780
 
18.2%
140
 
1.4%
422 39
 
0.4%
413 36
 
0.4%
414 36
 
0.4%
412 34
 
0.3%
420 34
 
0.3%
3431 33
 
0.3%
421 32
 
0.3%
2202 32
 
0.3%
Other values (6522) 7593
77.6%
2024-04-06T21:59:54.316120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11828
16.9%
2 9656
13.8%
- 8127
11.6%
4 7019
10.0%
1 5055
7.2%
7 5027
7.2%
3 4319
 
6.2%
4269
 
6.1%
5 4106
 
5.9%
8 3962
 
5.7%
Other values (7) 6693
9.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 57623
82.2%
Dash Punctuation 8127
 
11.6%
Space Separator 4269
 
6.1%
Other Punctuation 32
 
< 0.1%
Math Symbol 7
 
< 0.1%
Close Punctuation 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11828
20.5%
2 9656
16.8%
4 7019
12.2%
1 5055
8.8%
7 5027
8.7%
3 4319
 
7.5%
5 4106
 
7.1%
8 3962
 
6.9%
6 3604
 
6.3%
9 3047
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 30
93.8%
, 1
 
3.1%
1
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 8127
100.0%
Space Separator
ValueCountFrequency (%)
4269
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 70061
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11828
16.9%
2 9656
13.8%
- 8127
11.6%
4 7019
10.0%
1 5055
7.2%
7 5027
7.2%
3 4319
 
6.2%
4269
 
6.1%
5 4106
 
5.9%
8 3962
 
5.7%
Other values (7) 6693
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70060
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11828
16.9%
2 9656
13.8%
- 8127
11.6%
4 7019
10.0%
1 5055
7.2%
7 5027
7.2%
3 4319
 
6.2%
4269
 
6.1%
5 4106
 
5.9%
8 3962
 
5.7%
Other values (6) 6692
9.6%
None
ValueCountFrequency (%)
1
100.0%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct173
Distinct (%)8.6%
Missing7998
Missing (%)80.0%
Infinite0
Infinite (%)0.0%
Mean139684.26
Minimum100032
Maximum609310
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T21:59:54.562070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100032
5-th percentile138050
Q1138160
median138190
Q3138220
95-th percentile138864
Maximum609310
Range509278
Interquartile range (IQR)60

Descriptive statistics

Standard deviation21820.166
Coefficient of variation (CV)0.15621063
Kurtosis253.33798
Mean139684.26
Median Absolute Deviation (MAD)30
Skewness15.590666
Sum2.7964789 × 108
Variance4.7611966 × 108
MonotonicityNot monotonic
2024-04-06T21:59:54.851171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
138160 228
 
2.3%
138200 201
 
2.0%
138220 179
 
1.8%
138050 163
 
1.6%
138170 132
 
1.3%
138190 124
 
1.2%
138130 85
 
0.9%
138240 81
 
0.8%
138180 76
 
0.8%
138110 70
 
0.7%
Other values (163) 663
 
6.6%
(Missing) 7998
80.0%
ValueCountFrequency (%)
100032 1
< 0.1%
120805 1
< 0.1%
121070 1
< 0.1%
130809 1
< 0.1%
130811 1
< 0.1%
130823 1
< 0.1%
133170 1
< 0.1%
134010 1
< 0.1%
134870 1
< 0.1%
135010 1
< 0.1%
ValueCountFrequency (%)
609310 1
< 0.1%
471010 1
< 0.1%
465010 1
< 0.1%
463050 1
< 0.1%
446913 1
< 0.1%
426837 1
< 0.1%
422230 1
< 0.1%
413835 1
< 0.1%
410380 1
< 0.1%
150096 1
< 0.1%

지번주소
Text

MISSING 

Distinct3907
Distinct (%)45.8%
Missing1470
Missing (%)14.7%
Memory size156.2 KiB
2024-04-06T21:59:55.297180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length75
Mean length27.341501
Min length8

Characters and Unicode

Total characters233223
Distinct characters505
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3161 ?
Unique (%)37.1%

Sample

1st row서울특별시 송파구 문정동 ***번지 가든파이브라이프
2nd row서울특별시 송파구 송파동 **번지
3rd row서울특별시 송파구 잠실동 **-* 갤러리아팰리스a동 ****호
4th row서울특별시 송파구 잠실동 **-* 잠실파인애플상가
5th row서울특별시 송파구 석촌동 ***번지 *호 ***
ValueCountFrequency (%)
서울특별시 8503
18.5%
송파구 8477
18.4%
번지 6105
13.3%
5053
11.0%
3043
 
6.6%
문정동 1641
 
3.6%
가락동 1174
 
2.5%
잠실동 961
 
2.1%
방이동 934
 
2.0%
739
 
1.6%
Other values (2334) 9422
20.5%
2024-04-06T21:59:56.171412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48358
20.7%
* 41424
17.8%
10717
 
4.6%
9492
 
4.1%
9119
 
3.9%
8818
 
3.8%
8546
 
3.7%
8542
 
3.7%
8530
 
3.7%
8506
 
3.6%
Other values (495) 71171
30.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 139406
59.8%
Space Separator 48361
 
20.7%
Other Punctuation 41568
 
17.8%
Dash Punctuation 2743
 
1.2%
Uppercase Letter 484
 
0.2%
Close Punctuation 194
 
0.1%
Open Punctuation 190
 
0.1%
Decimal Number 136
 
0.1%
Lowercase Letter 128
 
0.1%
Math Symbol 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10717
 
7.7%
9492
 
6.8%
9119
 
6.5%
8818
 
6.3%
8546
 
6.1%
8542
 
6.1%
8530
 
6.1%
8506
 
6.1%
8503
 
6.1%
6968
 
5.0%
Other values (429) 51665
37.1%
Uppercase Letter
ValueCountFrequency (%)
B 112
23.1%
A 49
10.1%
C 45
9.3%
S 39
 
8.1%
L 31
 
6.4%
T 28
 
5.8%
D 23
 
4.8%
K 20
 
4.1%
G 18
 
3.7%
F 15
 
3.1%
Other values (13) 104
21.5%
Lowercase Letter
ValueCountFrequency (%)
b 50
39.1%
a 17
 
13.3%
c 10
 
7.8%
e 6
 
4.7%
i 5
 
3.9%
k 5
 
3.9%
m 5
 
3.9%
h 4
 
3.1%
l 4
 
3.1%
p 3
 
2.3%
Other values (8) 19
 
14.8%
Decimal Number
ValueCountFrequency (%)
1 29
21.3%
6 22
16.2%
4 16
11.8%
2 16
11.8%
0 12
8.8%
5 12
8.8%
3 12
8.8%
9 8
 
5.9%
7 6
 
4.4%
8 3
 
2.2%
Other Punctuation
ValueCountFrequency (%)
* 41424
99.7%
, 86
 
0.2%
/ 31
 
0.1%
@ 14
 
< 0.1%
. 7
 
< 0.1%
? 4
 
< 0.1%
& 1
 
< 0.1%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
48358
> 99.9%
  3
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 2743
100.0%
Close Punctuation
ValueCountFrequency (%)
) 194
100.0%
Open Punctuation
ValueCountFrequency (%)
( 190
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 139406
59.8%
Common 93203
40.0%
Latin 614
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10717
 
7.7%
9492
 
6.8%
9119
 
6.5%
8818
 
6.3%
8546
 
6.1%
8542
 
6.1%
8530
 
6.1%
8506
 
6.1%
8503
 
6.1%
6968
 
5.0%
Other values (429) 51665
37.1%
Latin
ValueCountFrequency (%)
B 112
18.2%
b 50
 
8.1%
A 49
 
8.0%
C 45
 
7.3%
S 39
 
6.4%
L 31
 
5.0%
T 28
 
4.6%
D 23
 
3.7%
K 20
 
3.3%
G 18
 
2.9%
Other values (32) 199
32.4%
Common
ValueCountFrequency (%)
48358
51.9%
* 41424
44.4%
- 2743
 
2.9%
) 194
 
0.2%
( 190
 
0.2%
, 86
 
0.1%
/ 31
 
< 0.1%
1 29
 
< 0.1%
6 22
 
< 0.1%
4 16
 
< 0.1%
Other values (14) 110
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 139405
59.8%
ASCII 93812
40.2%
None 3
 
< 0.1%
Number Forms 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48358
51.5%
* 41424
44.2%
- 2743
 
2.9%
) 194
 
0.2%
( 190
 
0.2%
B 112
 
0.1%
, 86
 
0.1%
b 50
 
0.1%
A 49
 
0.1%
C 45
 
< 0.1%
Other values (54) 561
 
0.6%
Hangul
ValueCountFrequency (%)
10717
 
7.7%
9492
 
6.8%
9119
 
6.5%
8818
 
6.3%
8546
 
6.1%
8542
 
6.1%
8530
 
6.1%
8506
 
6.1%
8503
 
6.1%
6968
 
5.0%
Other values (428) 51664
37.1%
None
ValueCountFrequency (%)
  3
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct5869
Distinct (%)72.9%
Missing1948
Missing (%)19.5%
Memory size156.2 KiB
2024-04-06T21:59:57.098125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length73
Mean length37.597367
Min length21

Characters and Unicode

Total characters302734
Distinct characters514
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4913 ?
Unique (%)61.0%

Sample

1st row서울특별시 송파구 충민로 ** (문정동, 가든파이브라이프관T-****호)
2nd row서울특별시 송파구 충민로 **, 가든파이브라이프 *층 와이-****호 (문정동)
3rd row서울특별시 송파구 백제고분로**길 **, *층 (송파동)
4th row서울특별시 송파구 올림픽로 ***, 잠실파인애플상가 지*층 A**호 (잠실동)
5th row서울특별시 송파구 올림픽로 ***, ****호 (방이동, 대우유토피아오피스텔)
ValueCountFrequency (%)
8321
15.2%
서울특별시 8052
14.7%
송파구 8047
14.7%
4438
 
8.1%
2128
 
3.9%
문정동 1655
 
3.0%
1110
 
2.0%
백제고분로**길 925
 
1.7%
가락동 861
 
1.6%
잠실동 771
 
1.4%
Other values (3148) 18539
33.8%
2024-04-06T21:59:57.840513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 55425
18.3%
49527
16.4%
11549
 
3.8%
10614
 
3.5%
, 10556
 
3.5%
10099
 
3.3%
8358
 
2.8%
) 8169
 
2.7%
( 8163
 
2.7%
8092
 
2.7%
Other values (504) 122182
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 166704
55.1%
Other Punctuation 66014
 
21.8%
Space Separator 49530
 
16.4%
Close Punctuation 8169
 
2.7%
Open Punctuation 8163
 
2.7%
Dash Punctuation 2569
 
0.8%
Uppercase Letter 1211
 
0.4%
Decimal Number 248
 
0.1%
Lowercase Letter 79
 
< 0.1%
Math Symbol 44
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11549
 
6.9%
10614
 
6.4%
10099
 
6.1%
8358
 
5.0%
8092
 
4.9%
8088
 
4.9%
8081
 
4.8%
8067
 
4.8%
8052
 
4.8%
8052
 
4.8%
Other values (439) 77652
46.6%
Uppercase Letter
ValueCountFrequency (%)
B 358
29.6%
A 204
16.8%
C 142
 
11.7%
L 80
 
6.6%
T 70
 
5.8%
Y 53
 
4.4%
S 52
 
4.3%
F 36
 
3.0%
G 34
 
2.8%
D 31
 
2.6%
Other values (14) 151
12.5%
Lowercase Letter
ValueCountFrequency (%)
b 15
19.0%
e 10
12.7%
i 9
11.4%
l 9
11.4%
c 6
 
7.6%
t 6
 
7.6%
n 5
 
6.3%
u 4
 
5.1%
r 3
 
3.8%
a 3
 
3.8%
Other values (6) 9
11.4%
Decimal Number
ValueCountFrequency (%)
1 61
24.6%
0 41
16.5%
2 35
14.1%
6 25
10.1%
3 24
 
9.7%
4 15
 
6.0%
9 12
 
4.8%
5 12
 
4.8%
8 12
 
4.8%
7 11
 
4.4%
Other Punctuation
ValueCountFrequency (%)
* 55425
84.0%
, 10556
 
16.0%
/ 21
 
< 0.1%
? 5
 
< 0.1%
& 3
 
< 0.1%
. 3
 
< 0.1%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
49527
> 99.9%
  3
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 8169
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8163
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2569
100.0%
Math Symbol
ValueCountFrequency (%)
~ 44
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 166706
55.1%
Common 134737
44.5%
Latin 1291
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11549
 
6.9%
10614
 
6.4%
10099
 
6.1%
8358
 
5.0%
8092
 
4.9%
8088
 
4.9%
8081
 
4.8%
8067
 
4.8%
8052
 
4.8%
8052
 
4.8%
Other values (440) 77654
46.6%
Latin
ValueCountFrequency (%)
B 358
27.7%
A 204
15.8%
C 142
 
11.0%
L 80
 
6.2%
T 70
 
5.4%
Y 53
 
4.1%
S 52
 
4.0%
F 36
 
2.8%
G 34
 
2.6%
D 31
 
2.4%
Other values (31) 231
17.9%
Common
ValueCountFrequency (%)
* 55425
41.1%
49527
36.8%
, 10556
 
7.8%
) 8169
 
6.1%
( 8163
 
6.1%
- 2569
 
1.9%
1 61
 
< 0.1%
~ 44
 
< 0.1%
0 41
 
< 0.1%
2 35
 
< 0.1%
Other values (13) 147
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 166703
55.1%
ASCII 136024
44.9%
None 5
 
< 0.1%
Number Forms 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 55425
40.7%
49527
36.4%
, 10556
 
7.8%
) 8169
 
6.0%
( 8163
 
6.0%
- 2569
 
1.9%
B 358
 
0.3%
A 204
 
0.1%
C 142
 
0.1%
L 80
 
0.1%
Other values (52) 831
 
0.6%
Hangul
ValueCountFrequency (%)
11549
 
6.9%
10614
 
6.4%
10099
 
6.1%
8358
 
5.0%
8092
 
4.9%
8088
 
4.9%
8081
 
4.8%
8067
 
4.8%
8052
 
4.8%
8052
 
4.8%
Other values (438) 77651
46.6%
None
ValueCountFrequency (%)
  3
60.0%
2
40.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct534
Distinct (%)8.4%
Missing3611
Missing (%)36.1%
Memory size156.2 KiB
2024-04-06T21:59:58.470260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.2800125
Min length5

Characters and Unicode

Total characters33734
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)1.3%

Sample

1st row138200
2nd row05838
3rd row05667
4th row05501
5th row138-827
ValueCountFrequency (%)
05838 224
 
3.5%
05854 193
 
3.0%
05836 157
 
2.5%
05855 152
 
2.4%
138220 130
 
2.0%
138160 114
 
1.8%
138200 105
 
1.6%
05699 80
 
1.3%
138960 78
 
1.2%
05840 70
 
1.1%
Other values (524) 5086
79.6%
2024-04-06T21:59:59.345671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 7199
21.3%
0 6722
19.9%
8 4894
14.5%
1 3376
10.0%
3 3200
9.5%
6 2430
 
7.2%
7 1784
 
5.3%
2 1524
 
4.5%
4 1394
 
4.1%
9 1189
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 33712
99.9%
Dash Punctuation 22
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 7199
21.4%
0 6722
19.9%
8 4894
14.5%
1 3376
10.0%
3 3200
9.5%
6 2430
 
7.2%
7 1784
 
5.3%
2 1524
 
4.5%
4 1394
 
4.1%
9 1189
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 33734
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 7199
21.3%
0 6722
19.9%
8 4894
14.5%
1 3376
10.0%
3 3200
9.5%
6 2430
 
7.2%
7 1784
 
5.3%
2 1524
 
4.5%
4 1394
 
4.1%
9 1189
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 33734
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 7199
21.3%
0 6722
19.9%
8 4894
14.5%
1 3376
10.0%
3 3200
9.5%
6 2430
 
7.2%
7 1784
 
5.3%
2 1524
 
4.5%
4 1394
 
4.1%
9 1189
 
3.5%
Distinct9867
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T21:59:59.976420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length41
Mean length7.5867
Min length1

Characters and Unicode

Total characters75867
Distinct characters1087
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9743 ?
Unique (%)97.4%

Sample

1st row골드스타
2nd row주식회사 본랩코리아
3rd row엘레나(ELENA)
4th row아이엔씨티
5th row더블제이 컴퍼니
ValueCountFrequency (%)
주식회사 950
 
6.9%
288
 
2.1%
57
 
0.4%
41
 
0.3%
co 40
 
0.3%
company 38
 
0.3%
35
 
0.3%
코리아 33
 
0.2%
컴퍼니 33
 
0.2%
korea 31
 
0.2%
Other values (11265) 12292
88.8%
2024-04-06T22:00:00.886508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3853
 
5.1%
) 2703
 
3.6%
2678
 
3.5%
( 2663
 
3.5%
2310
 
3.0%
2213
 
2.9%
1376
 
1.8%
1203
 
1.6%
1095
 
1.4%
1057
 
1.4%
Other values (1077) 54716
72.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52258
68.9%
Lowercase Letter 7770
 
10.2%
Uppercase Letter 5551
 
7.3%
Space Separator 3855
 
5.1%
Close Punctuation 2704
 
3.6%
Open Punctuation 2664
 
3.5%
Decimal Number 424
 
0.6%
Other Punctuation 404
 
0.5%
Other Symbol 118
 
0.2%
Dash Punctuation 90
 
0.1%
Other values (5) 29
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2678
 
5.1%
2310
 
4.4%
2213
 
4.2%
1376
 
2.6%
1203
 
2.3%
1095
 
2.1%
1057
 
2.0%
1047
 
2.0%
871
 
1.7%
768
 
1.5%
Other values (989) 37640
72.0%
Lowercase Letter
ValueCountFrequency (%)
e 869
 
11.2%
o 799
 
10.3%
a 667
 
8.6%
n 643
 
8.3%
i 590
 
7.6%
t 476
 
6.1%
r 433
 
5.6%
l 428
 
5.5%
s 392
 
5.0%
c 270
 
3.5%
Other values (16) 2203
28.4%
Uppercase Letter
ValueCountFrequency (%)
A 440
 
7.9%
O 403
 
7.3%
S 380
 
6.8%
E 380
 
6.8%
N 336
 
6.1%
I 333
 
6.0%
L 326
 
5.9%
C 316
 
5.7%
T 313
 
5.6%
M 294
 
5.3%
Other values (16) 2030
36.6%
Other Punctuation
ValueCountFrequency (%)
. 228
56.4%
& 82
 
20.3%
, 55
 
13.6%
' 22
 
5.4%
/ 5
 
1.2%
! 3
 
0.7%
? 3
 
0.7%
: 2
 
0.5%
# 2
 
0.5%
@ 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 92
21.7%
1 79
18.6%
5 48
11.3%
4 47
11.1%
0 42
9.9%
3 40
9.4%
9 24
 
5.7%
7 22
 
5.2%
6 18
 
4.2%
8 12
 
2.8%
Math Symbol
ValueCountFrequency (%)
× 3
42.9%
+ 3
42.9%
= 1
 
14.3%
Space Separator
ValueCountFrequency (%)
3853
99.9%
  2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 2703
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2663
> 99.9%
[ 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
118
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 90
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%
Final Punctuation
ValueCountFrequency (%)
9
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52362
69.0%
Latin 13321
 
17.6%
Common 10170
 
13.4%
Han 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2678
 
5.1%
2310
 
4.4%
2213
 
4.2%
1376
 
2.6%
1203
 
2.3%
1095
 
2.1%
1057
 
2.0%
1047
 
2.0%
871
 
1.7%
768
 
1.5%
Other values (976) 37744
72.1%
Latin
ValueCountFrequency (%)
e 869
 
6.5%
o 799
 
6.0%
a 667
 
5.0%
n 643
 
4.8%
i 590
 
4.4%
t 476
 
3.6%
A 440
 
3.3%
r 433
 
3.3%
l 428
 
3.2%
O 403
 
3.0%
Other values (42) 7573
56.9%
Common
ValueCountFrequency (%)
3853
37.9%
) 2703
26.6%
( 2663
26.2%
. 228
 
2.2%
2 92
 
0.9%
- 90
 
0.9%
& 82
 
0.8%
1 79
 
0.8%
, 55
 
0.5%
5 48
 
0.5%
Other values (25) 277
 
2.7%
Han
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52244
68.9%
ASCII 23474
30.9%
None 124
 
0.2%
CJK 13
 
< 0.1%
Punctuation 11
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3853
 
16.4%
) 2703
 
11.5%
( 2663
 
11.3%
e 869
 
3.7%
o 799
 
3.4%
a 667
 
2.8%
n 643
 
2.7%
i 590
 
2.5%
t 476
 
2.0%
A 440
 
1.9%
Other values (72) 9771
41.6%
Hangul
ValueCountFrequency (%)
2678
 
5.1%
2310
 
4.4%
2213
 
4.2%
1376
 
2.6%
1203
 
2.3%
1095
 
2.1%
1057
 
2.0%
1047
 
2.0%
871
 
1.7%
768
 
1.5%
Other values (975) 37626
72.0%
None
ValueCountFrequency (%)
118
95.2%
× 3
 
2.4%
  2
 
1.6%
1
 
0.8%
Punctuation
ValueCountFrequency (%)
9
81.8%
2
 
18.2%
CJK
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct9983
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-14 15:14:58
Maximum2024-04-04 17:14:10
2024-04-06T22:00:01.108352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:00:01.357272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
8362 
U
1638 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowI
3rd rowI
4th rowI
5th rowU

Common Values

ValueCountFrequency (%)
I 8362
83.6%
U 1638
 
16.4%

Length

2024-04-06T22:00:01.546142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:00:01.699455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 8362
83.6%
u 1638
 
16.4%
Distinct1274
Distinct (%)12.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:06:00
2024-04-06T22:00:01.897121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:00:02.172905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct452
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T22:00:02.486580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length83
Mean length9.2333
Min length1

Characters and Unicode

Total characters92333
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique272 ?
Unique (%)2.7%

Sample

1st row종합몰
2nd row종합몰 의류/패션/잡화/뷰티 건강/식품
3rd row의류/패션/잡화/뷰티
4th row레져/여행/공연
5th row기타
ValueCountFrequency (%)
의류/패션/잡화/뷰티 3480
25.9%
종합몰 2310
17.2%
기타 2039
15.1%
건강/식품 1210
 
9.0%
자동차/자동차용품 1098
 
8.2%
교육/도서/완구/오락 700
 
5.2%
컴퓨터/사무용품 555
 
4.1%
가전 528
 
3.9%
성인/성인용품 504
 
3.7%
레져/여행/공연 471
 
3.5%
Other values (3) 565
 
4.2%
2024-04-06T22:00:03.050142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 17247
 
18.7%
3869
 
4.2%
3480
 
3.8%
3480
 
3.8%
3480
 
3.8%
3480
 
3.8%
3480
 
3.8%
3480
 
3.8%
3480
 
3.8%
3480
 
3.8%
Other values (41) 43377
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71563
77.5%
Other Punctuation 17247
 
18.7%
Space Separator 3460
 
3.7%
Dash Punctuation 63
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3869
 
5.4%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
2555
 
3.6%
Other values (38) 37299
52.1%
Other Punctuation
ValueCountFrequency (%)
/ 17247
100.0%
Space Separator
ValueCountFrequency (%)
3460
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71563
77.5%
Common 20770
 
22.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3869
 
5.4%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
2555
 
3.6%
Other values (38) 37299
52.1%
Common
ValueCountFrequency (%)
/ 17247
83.0%
3460
 
16.7%
- 63
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71563
77.5%
ASCII 20770
 
22.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 17247
83.0%
3460
 
16.7%
- 63
 
0.3%
Hangul
ValueCountFrequency (%)
3869
 
5.4%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
3480
 
4.9%
2555
 
3.6%
Other values (38) 37299
52.1%

좌표정보(X)
Real number (ℝ)

MISSING  SKEWED 

Distinct3895
Distinct (%)49.0%
Missing2055
Missing (%)20.5%
Infinite0
Infinite (%)0.0%
Mean210125.9
Minimum179708.83
Maximum389632.01
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T22:00:03.325675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum179708.83
5-th percentile207262.6
Q1209180.58
median210343.7
Q3211046.56
95-th percentile212752.01
Maximum389632.01
Range209923.18
Interquartile range (IQR)1865.9765

Descriptive statistics

Standard deviation2636.7356
Coefficient of variation (CV)0.012548361
Kurtosis2707.9251
Mean210125.9
Median Absolute Deviation (MAD)895.29105
Skewness39.200685
Sum1.6694502 × 109
Variance6952374.7
MonotonicityNot monotonic
2024-04-06T22:00:03.585852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
210986.460698452 248
 
2.5%
211053.119518 158
 
1.6%
209790.959909032 117
 
1.2%
210535.089616 110
 
1.1%
210627.0 78
 
0.8%
210480.282018702 73
 
0.7%
209394.231297346 60
 
0.6%
207750.499013776 54
 
0.5%
208233.926974585 51
 
0.5%
210749.0 49
 
0.5%
Other values (3885) 6947
69.5%
(Missing) 2055
 
20.5%
ValueCountFrequency (%)
179708.826286267 1
< 0.1%
186555.398800658 1
< 0.1%
189743.357842202 1
< 0.1%
190024.440927666 1
< 0.1%
192344.544098471 1
< 0.1%
194496.884633662 1
< 0.1%
196800.125758275 1
< 0.1%
199088.322667 1
< 0.1%
201753.303709664 1
< 0.1%
202191.715036688 1
< 0.1%
ValueCountFrequency (%)
389632.011167579 1
 
< 0.1%
217137.519410187 1
 
< 0.1%
214185.037966 2
 
< 0.1%
214104.196337 10
0.1%
213999.956783491 5
0.1%
213977.355937651 7
0.1%
213951.78963949 1
 
< 0.1%
213919.422177928 1
 
< 0.1%
213903.879311669 1
 
< 0.1%
213885.891267909 1
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct3884
Distinct (%)48.9%
Missing2055
Missing (%)20.5%
Infinite0
Infinite (%)0.0%
Mean444289.82
Minimum197424.23
Maximum461404.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T22:00:03.825535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum197424.23
5-th percentile441909.99
Q1443352.22
median444443.41
Q3445259.61
95-th percentile446426.81
Maximum461404.8
Range263980.57
Interquartile range (IQR)1907.3902

Descriptive statistics

Standard deviation3154.1069
Coefficient of variation (CV)0.0070992104
Kurtosis4724.9484
Mean444289.82
Median Absolute Deviation (MAD)962.20055
Skewness-60.344099
Sum3.5298826 × 109
Variance9948390.4
MonotonicityNot monotonic
2024-04-06T22:00:04.076841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
441725.293491662 248
 
2.5%
442026.988783 158
 
1.6%
443481.212174317 117
 
1.2%
442137.578674 110
 
1.1%
442875.0 78
 
0.8%
441833.16323518 73
 
0.7%
443951.236927017 60
 
0.6%
445435.257647797 51
 
0.5%
446156.529775445 50
 
0.5%
442560.0 49
 
0.5%
Other values (3874) 6951
69.5%
(Missing) 2055
 
20.5%
ValueCountFrequency (%)
197424.230768187 1
 
< 0.1%
424744.099057566 1
 
< 0.1%
431880.785795565 1
 
< 0.1%
440547.513165151 1
 
< 0.1%
440870.395232555 1
 
< 0.1%
441412.0 3
< 0.1%
441426.0 4
< 0.1%
441446.0 3
< 0.1%
441534.957797015 1
 
< 0.1%
441585.372783391 1
 
< 0.1%
ValueCountFrequency (%)
461404.803949418 1
< 0.1%
455806.456688572 1
< 0.1%
453192.362400812 1
< 0.1%
452602.700126854 1
< 0.1%
452002.278744736 1
< 0.1%
451477.658296453 1
< 0.1%
451230.610149193 1
< 0.1%
451002.807624223 1
< 0.1%
450968.166497544 1
< 0.1%
450158.241480494 1
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9811 
0
 
189

Length

Max length4
Median length4
Mean length3.9433
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9811
98.1%
0 189
 
1.9%

Length

2024-04-06T22:00:04.309776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:00:04.460334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9811
98.1%
0 189
 
1.9%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9811 
0
 
189

Length

Max length4
Median length4
Mean length3.9433
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9811
98.1%
0 189
 
1.9%

Length

2024-04-06T22:00:04.648821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:00:04.850564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9811
98.1%
0 189
 
1.9%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9811 
0
 
189

Length

Max length4
Median length4
Mean length3.9433
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9811
98.1%
0 189
 
1.9%

Length

2024-04-06T22:00:05.156661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:00:05.359784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9811
98.1%
0 189
 
1.9%

판매방식명
Categorical

IMBALANCE 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
5680 
<NA>
3890 
인터넷, 기타
 
131
기타
 
62
TV홈쇼핑, 인터넷
 
49
Other values (20)
 
188

Length

Max length26
Median length3
Mean length3.6927
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row<NA>
3rd row인터넷
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
인터넷 5680
56.8%
<NA> 3890
38.9%
인터넷, 기타 131
 
1.3%
기타 62
 
0.6%
TV홈쇼핑, 인터넷 49
 
0.5%
인터넷, 카다로그 31
 
0.3%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 31
 
0.3%
TV홈쇼핑 21
 
0.2%
TV홈쇼핑, 인터넷, 카다로그 17
 
0.2%
인터넷, 카다로그, 신문잡지, 기타 14
 
0.1%
Other values (15) 74
 
0.7%

Length

2024-04-06T22:00:05.608124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 6014
57.0%
na 3890
36.8%
기타 278
 
2.6%
tv홈쇼핑 155
 
1.5%
카다로그 140
 
1.3%
신문잡지 80
 
0.8%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
118713230000201232301983020051820120409<NA>3폐업3폐업처리20210129<NA><NA><NA>2157-2006<NA><NA><NA>서울특별시 송파구 충민로 ** (문정동, 가든파이브라이프관T-****호)138200골드스타2021-01-29 09:56:30U2021-01-31 02:40:00.0종합몰211053.119518442026.988783<NA><NA><NA>인터넷
256093230000201932302613020284820191114<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7514-2500<NA><NA>서울특별시 송파구 문정동 ***번지 가든파이브라이프서울특별시 송파구 충민로 **, 가든파이브라이프 *층 와이-****호 (문정동)05838주식회사 본랩코리아2019-11-14 15:19:23I2021-12-03 22:02:00.0종합몰 의류/패션/잡화/뷰티 건강/식품210986.460698441725.293492<NA><NA><NA><NA>
166333230000201532302313020144320150810<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 송파구 송파동 **번지서울특별시 송파구 백제고분로**길 **, *층 (송파동)05667엘레나(ELENA)2015-09-17 16:52:37I2018-08-31 23:59:59.0의류/패션/잡화/뷰티209663.79247444768.479054<NA><NA><NA>인터넷
51923230000200732301313020511720070223<NA>3폐업3폐업처리20121205<NA><NA><NA>02 4861 69<NA><NA>서울특별시 송파구 잠실동 **-* 갤러리아팰리스a동 ****호<NA><NA>아이엔씨티2012-12-05 14:05:22I2018-08-31 23:59:59.0레져/여행/공연<NA><NA><NA><NA><NA><NA>
28517323000020203230261302026722020-07-24<NA>3폐업3폐업처리2023-03-24<NA><NA><NA><NA><NA><NA>서울특별시 송파구 잠실동 **-* 잠실파인애플상가서울특별시 송파구 올림픽로 ***, 잠실파인애플상가 지*층 A**호 (잠실동)05501더블제이 컴퍼니2023-03-24 16:12:57U2022-12-02 22:06:00.0기타207373.500215445545.656024<NA><NA><NA><NA>
15473323000020153230231302000082015-01-05<NA>3폐업3폐업처리2023-03-23<NA><NA><NA>070-7813-8746<NA><NA><NA>서울특별시 송파구 올림픽로 ***, ****호 (방이동, 대우유토피아오피스텔)138-827테테2023-03-23 16:31:23U2022-12-04 22:06:00.0기타209430.450574445913.088912<NA><NA><NA><NA>
111063230000201132301983020141220111004<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA>138190서울특별시 송파구 석촌동 ***번지 *호 ***서울특별시 송파구 가락로 *** (석촌동,***)<NA>잭클라인닷컴2011-10-04 17:47:23I2018-08-31 23:59:59.0의류/패션/잡화/뷰티209449.06674444354.012159<NA><NA><NA>인터넷
200593230000201732302313020156120170817<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 송파구 석촌동 ***번지 *호서울특별시 송파구 송파대로**길 **, B***호 (석촌동)05610팩트 코스트2017-08-17 13:13:03I2018-08-31 23:59:59.0의류/패션/잡화/뷰티208898.581724444944.350336<NA><NA><NA>인터넷
240633230000201932302613020112820190425<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 송파구 문정동 *번지 문정래미안아파트서울특별시 송파구 문정로 **, ***동 ***호 (문정동, 문정래미안아파트)05794로에일 스튜디오2019-04-25 15:10:43I2019-04-27 02:20:31.0의류/패션/잡화/뷰티211453.050927443165.755656<NA><NA><NA>인터넷
130983230000201332301983020042520130405<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 송파구 삼전동 ***번지 **호서울특별시 송파구 백제고분로**길 **-**, *층 ***호 (삼전동)05604노띠끄2018-02-12 12:57:16I2018-08-31 23:59:59.0의류/패션/잡화/뷰티208219.105201444706.039882<NA><NA><NA>인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
25363230000200432301313020221120040826<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 408 8127<NA><NA>서울특별시 송파구 송파동 ***-*<NA><NA>(주)나나세상2014-02-17 12:21:31I2018-08-31 23:59:59.0자동차/자동차용품<NA><NA><NA><NA><NA><NA>
70683230000200832301313020104220081104<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>418-3538<NA>138180서울특별시 송파구 삼전동 **번지 *호 B**호서울특별시 송파구 삼전로*길 ** (삼전동,B**호)<NA>코리아 알파2014-02-17 10:55:02I2018-08-31 23:59:59.0종합몰207964.733669444728.223753<NA><NA><NA>인터넷
29233230000200532301313020268220050221<NA>3폐업3폐업처리20090828<NA><NA><NA>02 418 2474<NA><NA>서울특별시 송파구 석촌동 ***-**<NA><NA>9je42009-08-28 14:17:14I2018-08-31 23:59:59.0성인/성인용품<NA><NA><NA><NA><NA><NA>
14554323000020143230198302006732014-05-12<NA>3폐업3폐업처리2024-01-23<NA><NA><NA>031-944-0034<NA><NA><NA>서울특별시 송파구 올림픽로 ***, **호 (잠실동, 리센츠상가)138-911다봄안경원2024-01-22 15:42:58U2023-11-30 22:04:00.0의류/패션/잡화/뷰티 기타207624.424721445533.439634<NA><NA><NA><NA>
55393230000200732301313020548120070608<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 5138 33<NA><NA>서울특별시 송파구 가락동 **-* 익수빌딩 *층<NA><NA>주)뉴트라시너지2014-01-15 17:44:31I2018-08-31 23:59:59.0종합몰<NA><NA><NA><NA><NA><NA>
75353230000200932301313020477820090323<NA>5제외/삭제/전출5타시군구이관<NA><NA><NA><NA>416-0801<NA>138849서울특별시 송파구 송파*동 **번지 **호 *층서울특별시 송파구 백제고분로**길 **-* (송파동,*층)<NA>지오애드2010-02-24 10:55:22I2021-12-03 22:02:00.0기타 의류/패션/잡화/뷰티209581.072188445330.763478<NA><NA><NA><NA>
179953230000201632302313020121220160728<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-443-3379<NA><NA>서울특별시 송파구 방이동 ***-* 광남벨라스아파트서울특별시 송파구 백제고분로**길 **, *층 ***호 (방이동, 광남벨라스아파트)05630주식회사 에코티앤엘2022-05-11 14:06:59U2021-12-04 23:03:00.0가전 기타210214.552127445741.407076<NA><NA><NA><NA>
124113230000201232301983020114520120824<NA>3폐업3폐업처리20140102<NA><NA><NA><NA><NA><NA><NA>서울특별시 송파구 거마로 **, *층 (마천동)138120아베스타2014-01-02 16:30:03I2018-08-31 23:59:59.0종합몰212963.41936444120.722362<NA><NA><NA>TV홈쇼핑, 인터넷, 카다로그
34633230000200532301313020327820050902<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 415 7710<NA>138862서울특별시 송파구 잠실본동 ***번지 mbc아카데미사옥*층서울특별시 송파구 백제고분로*길 ** (잠실동,mbc아카데미사옥*층)<NA>(주)고고에듀2010-11-22 11:42:29I2018-08-31 23:59:59.0자동차/자동차용품207022.590546445203.586767<NA><NA><NA><NA>
155413230000201532302313020009120150114<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7563-0906<NA><NA><NA>서울특별시 송파구 마천로 **-*, *층 (오금동)138856주식회사 빌트바이 (BUILT BY architects Co. , Ltd)2015-01-14 16:32:46I2018-08-31 23:59:59.0기타211238.119544445021.3532<NA><NA><NA>인터넷