Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells71782
Missing cells (%)24.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical10
Numeric5
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author마포구
URLhttps://data.seoul.go.kr/dataList/OA-18815/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 is highly imbalanced (99.9%)Imbalance
자산규모 is highly imbalanced (85.1%)Imbalance
부채총액 is highly imbalanced (85.1%)Imbalance
자본금 is highly imbalanced (85.1%)Imbalance
판매방식명 is highly imbalanced (69.1%)Imbalance
폐업일자 has 6515 (65.1%) missing valuesMissing
휴업시작일자 has 9967 (99.7%) missing valuesMissing
휴업종료일자 has 9967 (99.7%) missing valuesMissing
재개업일자 has 9991 (99.9%) missing valuesMissing
전화번호 has 4210 (42.1%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7893 (78.9%) missing valuesMissing
지번주소 has 2671 (26.7%) missing valuesMissing
도로명주소 has 2309 (23.1%) missing valuesMissing
도로명우편번호 has 3927 (39.3%) missing valuesMissing
좌표정보(X) has 2166 (21.7%) missing valuesMissing
좌표정보(Y) has 2166 (21.7%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = 34.51360204)Skewed
좌표정보(X) is highly skewed (γ1 = 45.99934901)Skewed
좌표정보(Y) is highly skewed (γ1 = -74.01131973)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 19:49:28.777948
Analysis finished2024-04-29 19:49:30.636632
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3130000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3130000
2nd row3130000
3rd row3130000
4th row3130000
5th row3130000

Common Values

ValueCountFrequency (%)
3130000 10000
100.0%

Length

2024-04-30T04:49:30.696688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:30.770951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3130000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0129545 × 1018
Minimum1.996313 × 1018
Maximum2.020313 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:30.860690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.996313 × 1018
5-th percentile2.003313 × 1018
Q12.008313 × 1018
median2.013313 × 1018
Q32.018313 × 1018
95-th percentile2.020313 × 1018
Maximum2.020313 × 1018
Range2.4000011 × 1016
Interquartile range (IQR)1.0000008 × 1016

Descriptive statistics

Standard deviation5.5215097 × 1015
Coefficient of variation (CV)0.0027429878
Kurtosis-1.1962489
Mean2.0129545 × 1018
Median Absolute Deviation (MAD)5.0000083 × 1015
Skewness-0.27979148
Sum4.1473725 × 1018
Variance3.0487069 × 1031
MonotonicityNot monotonic
2024-04-30T04:49:30.980869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2018313020130200607 1
 
< 0.1%
2020313022530201731 1
 
< 0.1%
2017313020130201119 1
 
< 0.1%
2016313020130200835 1
 
< 0.1%
2018313020130202091 1
 
< 0.1%
2015313018030200763 1
 
< 0.1%
2017313020130201532 1
 
< 0.1%
2004313011830202281 1
 
< 0.1%
2016313020130200470 1
 
< 0.1%
2011313011830201296 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1996313011830200277 1
< 0.1%
1997313011830200974 1
< 0.1%
1998313011830200370 1
< 0.1%
1999313011830200326 1
< 0.1%
1999313018030200001 1
< 0.1%
2000313011830200148 1
< 0.1%
2000313011830200212 1
< 0.1%
2000313011830200230 1
< 0.1%
2000313011830200345 1
< 0.1%
2001313011830200127 1
< 0.1%
ValueCountFrequency (%)
2020313022530203514 1
< 0.1%
2020313022530203513 1
< 0.1%
2020313022530203509 1
< 0.1%
2020313022530203499 1
< 0.1%
2020313022530203498 1
< 0.1%
2020313022530203496 1
< 0.1%
2020313022530203488 1
< 0.1%
2020313022530203487 1
< 0.1%
2020313022530203482 1
< 0.1%
2020313022530203476 1
< 0.1%
Distinct3809
Distinct (%)38.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-12-04 00:00:00
Maximum2020-10-07 00:00:00
2024-04-30T04:49:31.096849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:31.232731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9999 
20130531
 
1

Length

Max length8
Median length4
Mean length4.0004
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9999
> 99.9%
20130531 1
 
< 0.1%

Length

2024-04-30T04:49:31.345281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:31.575472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9999
> 99.9%
20130531 1
 
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4267 
3
3804 
4
1249 
5
655 
2
 
25

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row1
4th row3
5th row3

Common Values

ValueCountFrequency (%)
1 4267
42.7%
3 3804
38.0%
4 1249
 
12.5%
5 655
 
6.6%
2 25
 
0.2%

Length

2024-04-30T04:49:31.676889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:31.767347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4267
42.7%
3 3804
38.0%
4 1249
 
12.5%
5 655
 
6.6%
2 25
 
0.2%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4267 
폐업
3804 
취소/말소/만료/정지/중지
1249 
제외/삭제/전출
655 
휴업
 
25

Length

Max length14
Median length8
Mean length5.1719
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row폐업
3rd row영업/정상
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
영업/정상 4267
42.7%
폐업 3804
38.0%
취소/말소/만료/정지/중지 1249
 
12.5%
제외/삭제/전출 655
 
6.6%
휴업 25
 
0.2%

Length

2024-04-30T04:49:31.864727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:31.952834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4267
42.7%
폐업 3804
38.0%
취소/말소/만료/정지/중지 1249
 
12.5%
제외/삭제/전출 655
 
6.6%
휴업 25
 
0.2%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7754
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:32.043908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.9796835
Coefficient of variation (CV)0.71329664
Kurtosis-0.028898469
Mean2.7754
Median Absolute Deviation (MAD)2
Skewness0.99969628
Sum27754
Variance3.9191468
MonotonicityNot monotonic
2024-04-30T04:49:32.124116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 4265
42.6%
3 3804
38.0%
7 1248
 
12.5%
5 655
 
6.6%
2 25
 
0.2%
6 2
 
< 0.1%
4 1
 
< 0.1%
ValueCountFrequency (%)
1 4265
42.6%
2 25
 
0.2%
3 3804
38.0%
4 1
 
< 0.1%
5 655
 
6.6%
6 2
 
< 0.1%
7 1248
 
12.5%
ValueCountFrequency (%)
7 1248
 
12.5%
6 2
 
< 0.1%
5 655
 
6.6%
4 1
 
< 0.1%
3 3804
38.0%
2 25
 
0.2%
1 4265
42.6%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4265 
폐업처리
3804 
직권말소
1248 
타시군구이관
655 
휴업처리
 
25
Other values (2)
 
3

Length

Max length6
Median length4
Mean length4.1314
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row정상영업
2nd row폐업처리
3rd row정상영업
4th row폐업처리
5th row폐업처리

Common Values

ValueCountFrequency (%)
정상영업 4265
42.6%
폐업처리 3804
38.0%
직권말소 1248
 
12.5%
타시군구이관 655
 
6.6%
휴업처리 25
 
0.2%
타시군구전입 2
 
< 0.1%
직권취소 1
 
< 0.1%

Length

2024-04-30T04:49:32.237015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:32.345166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4265
42.6%
폐업처리 3804
38.0%
직권말소 1248
 
12.5%
타시군구이관 655
 
6.6%
휴업처리 25
 
0.2%
타시군구전입 2
 
< 0.1%
직권취소 1
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2225
Distinct (%)63.8%
Missing6515
Missing (%)65.1%
Memory size156.2 KiB
Minimum2004-02-05 00:00:00
Maximum2024-04-25 00:00:00
2024-04-30T04:49:32.455725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:32.577058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct30
Distinct (%)90.9%
Missing9967
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-01-19 00:00:00
Maximum2023-08-16 00:00:00
2024-04-30T04:49:32.679197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:32.794813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)

휴업종료일자
Date

MISSING 

Distinct26
Distinct (%)78.8%
Missing9967
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-12-31 00:00:00
Maximum2068-07-01 00:00:00
2024-04-30T04:49:32.907641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:33.016279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)

재개업일자
Date

MISSING 

Distinct9
Distinct (%)100.0%
Missing9991
Missing (%)99.9%
Memory size156.2 KiB
Minimum2008-09-17 00:00:00
Maximum2024-04-12 00:00:00
2024-04-30T04:49:33.126122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:33.220074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)

전화번호
Text

MISSING 

Distinct5697
Distinct (%)98.4%
Missing4210
Missing (%)42.1%
Memory size156.2 KiB
2024-04-30T04:49:33.520647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length10.197927
Min length1

Characters and Unicode

Total characters59046
Distinct characters18
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5613 ?
Unique (%)96.9%

Sample

1st row02-6406-5969
2nd row718-5230
3rd row337 8645
4th row6377 1177
5th row070-8237-3641
ValueCountFrequency (%)
02 86
 
1.2%
322 55
 
0.7%
338 53
 
0.7%
325 49
 
0.7%
323 48
 
0.6%
3142 48
 
0.6%
3141 47
 
0.6%
334 46
 
0.6%
333 44
 
0.6%
336 44
 
0.6%
Other values (5906) 6903
93.0%
2024-04-30T04:49:33.986895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8526
14.4%
3 7159
12.1%
2 6690
11.3%
- 6553
11.1%
7 5873
9.9%
1 4591
7.8%
4 4069
6.9%
8 3772
6.4%
6 3719
6.3%
5 3681
6.2%
Other values (8) 4413
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 50725
85.9%
Dash Punctuation 6553
 
11.1%
Space Separator 1708
 
2.9%
Other Punctuation 30
 
0.1%
Math Symbol 20
 
< 0.1%
Close Punctuation 10
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8526
16.8%
3 7159
14.1%
2 6690
13.2%
7 5873
11.6%
1 4591
9.1%
4 4069
8.0%
8 3772
7.4%
6 3719
7.3%
5 3681
7.3%
9 2645
 
5.2%
Other Punctuation
ValueCountFrequency (%)
. 17
56.7%
, 6
 
20.0%
/ 5
 
16.7%
* 2
 
6.7%
Dash Punctuation
ValueCountFrequency (%)
- 6553
100.0%
Space Separator
ValueCountFrequency (%)
1708
100.0%
Math Symbol
ValueCountFrequency (%)
~ 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 59046
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8526
14.4%
3 7159
12.1%
2 6690
11.3%
- 6553
11.1%
7 5873
9.9%
1 4591
7.8%
4 4069
6.9%
8 3772
6.4%
6 3719
6.3%
5 3681
6.2%
Other values (8) 4413
7.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 59046
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8526
14.4%
3 7159
12.1%
2 6690
11.3%
- 6553
11.1%
7 5873
9.9%
1 4591
7.8%
4 4069
6.9%
8 3772
6.4%
6 3719
6.3%
5 3681
6.2%
Other values (8) 4413
7.5%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct173
Distinct (%)8.2%
Missing7893
Missing (%)78.9%
Infinite0
Infinite (%)0.0%
Mean121784.22
Minimum120110
Maximum604080
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:34.126682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum120110
5-th percentile121020
Q1121160
median121220
Q3121781
95-th percentile121884
Maximum604080
Range483970
Interquartile range (IQR)621

Descriptive statistics

Standard deviation12370.88
Coefficient of variation (CV)0.10158032
Kurtosis1247.4642
Mean121784.22
Median Absolute Deviation (MAD)110
Skewness34.513602
Sum2.5659935 × 108
Variance1.5303866 × 108
MonotonicityNot monotonic
2024-04-30T04:49:34.246004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
121210 303
 
3.0%
121250 132
 
1.3%
121200 127
 
1.3%
121220 120
 
1.2%
121040 107
 
1.1%
121020 78
 
0.8%
121230 75
 
0.8%
121270 62
 
0.6%
121240 61
 
0.6%
121050 55
 
0.5%
Other values (163) 987
 
9.9%
(Missing) 7893
78.9%
ValueCountFrequency (%)
120110 1
 
< 0.1%
120190 1
 
< 0.1%
121010 34
 
0.3%
121020 78
0.8%
121030 19
 
0.2%
121040 107
1.1%
121050 55
0.5%
121060 3
 
< 0.1%
121070 33
 
0.3%
121080 41
 
0.4%
ValueCountFrequency (%)
604080 1
< 0.1%
415020 1
< 0.1%
150855 1
< 0.1%
150103 1
< 0.1%
150101 1
< 0.1%
138170 1
< 0.1%
135080 1
< 0.1%
133091 1
< 0.1%
130821 1
< 0.1%
122010 1
< 0.1%

지번주소
Text

MISSING 

Distinct3650
Distinct (%)49.8%
Missing2671
Missing (%)26.7%
Memory size156.2 KiB
2024-04-30T04:49:34.444954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length48
Mean length27.02688
Min length13

Characters and Unicode

Total characters198080
Distinct characters515
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2854 ?
Unique (%)38.9%

Sample

1st row서울특별시 마포구 연남동 ***번지 **호
2nd row서울특별시 마포구 서교동 ***번지 *호 **통 *반
3rd row서울특별시 마포구 서교동 ***번지 **호
4th row서울특별시 마포구 노고산동 **번지 ***호 *층
5th row서울특별시 마포구 상수동 ***번지 래미안 밤섬리베뉴Ⅱ
ValueCountFrequency (%)
서울특별시 7327
17.9%
마포구 7314
17.8%
5017
12.2%
번지 4489
11.0%
2912
 
7.1%
서교동 1549
 
3.8%
927
 
2.3%
합정동 648
 
1.6%
성산동 602
 
1.5%
도화동 512
 
1.2%
Other values (2157) 9681
23.6%
2024-04-30T04:49:34.786119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 40605
20.5%
33826
17.1%
9109
 
4.6%
8127
 
4.1%
7822
 
3.9%
7737
 
3.9%
7460
 
3.8%
7426
 
3.7%
7410
 
3.7%
7345
 
3.7%
Other values (505) 61213
30.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 119574
60.4%
Other Punctuation 40714
 
20.6%
Space Separator 33826
 
17.1%
Dash Punctuation 2457
 
1.2%
Uppercase Letter 997
 
0.5%
Lowercase Letter 176
 
0.1%
Decimal Number 155
 
0.1%
Close Punctuation 61
 
< 0.1%
Open Punctuation 60
 
< 0.1%
Letter Number 29
 
< 0.1%
Other values (2) 31
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9109
 
7.6%
8127
 
6.8%
7822
 
6.5%
7737
 
6.5%
7460
 
6.2%
7426
 
6.2%
7410
 
6.2%
7345
 
6.1%
7342
 
6.1%
5314
 
4.4%
Other values (432) 44482
37.2%
Uppercase Letter
ValueCountFrequency (%)
B 147
14.7%
C 104
10.4%
D 85
 
8.5%
M 80
 
8.0%
A 72
 
7.2%
S 67
 
6.7%
G 57
 
5.7%
L 55
 
5.5%
T 43
 
4.3%
K 40
 
4.0%
Other values (15) 247
24.8%
Lowercase Letter
ValueCountFrequency (%)
e 26
14.8%
i 23
13.1%
s 13
 
7.4%
t 12
 
6.8%
o 10
 
5.7%
y 10
 
5.7%
p 9
 
5.1%
r 9
 
5.1%
u 9
 
5.1%
v 8
 
4.5%
Other values (13) 47
26.7%
Decimal Number
ValueCountFrequency (%)
2 25
16.1%
1 21
13.5%
6 19
12.3%
4 19
12.3%
3 16
10.3%
5 14
9.0%
7 12
7.7%
8 11
7.1%
0 10
 
6.5%
9 8
 
5.2%
Other Punctuation
ValueCountFrequency (%)
* 40605
99.7%
, 68
 
0.2%
/ 26
 
0.1%
. 6
 
< 0.1%
? 5
 
< 0.1%
& 4
 
< 0.1%
Letter Number
ValueCountFrequency (%)
25
86.2%
3
 
10.3%
1
 
3.4%
Space Separator
ValueCountFrequency (%)
33826
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2457
100.0%
Close Punctuation
ValueCountFrequency (%)
) 61
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Other Symbol
ValueCountFrequency (%)
28
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 119600
60.4%
Common 77276
39.0%
Latin 1202
 
0.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9109
 
7.6%
8127
 
6.8%
7822
 
6.5%
7737
 
6.5%
7460
 
6.2%
7426
 
6.2%
7410
 
6.2%
7345
 
6.1%
7342
 
6.1%
5314
 
4.4%
Other values (431) 44508
37.2%
Latin
ValueCountFrequency (%)
B 147
 
12.2%
C 104
 
8.7%
D 85
 
7.1%
M 80
 
6.7%
A 72
 
6.0%
S 67
 
5.6%
G 57
 
4.7%
L 55
 
4.6%
T 43
 
3.6%
K 40
 
3.3%
Other values (41) 452
37.6%
Common
ValueCountFrequency (%)
* 40605
52.5%
33826
43.8%
- 2457
 
3.2%
, 68
 
0.1%
) 61
 
0.1%
( 60
 
0.1%
/ 26
 
< 0.1%
2 25
 
< 0.1%
1 21
 
< 0.1%
6 19
 
< 0.1%
Other values (11) 108
 
0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 119572
60.4%
ASCII 78449
39.6%
Number Forms 29
 
< 0.1%
None 28
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 40605
51.8%
33826
43.1%
- 2457
 
3.1%
B 147
 
0.2%
C 104
 
0.1%
D 85
 
0.1%
M 80
 
0.1%
A 72
 
0.1%
, 68
 
0.1%
S 67
 
0.1%
Other values (59) 938
 
1.2%
Hangul
ValueCountFrequency (%)
9109
 
7.6%
8127
 
6.8%
7822
 
6.5%
7737
 
6.5%
7460
 
6.2%
7426
 
6.2%
7410
 
6.2%
7345
 
6.1%
7342
 
6.1%
5314
 
4.4%
Other values (430) 44480
37.2%
None
ValueCountFrequency (%)
28
100.0%
Number Forms
ValueCountFrequency (%)
25
86.2%
3
 
10.3%
1
 
3.4%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

도로명주소
Text

MISSING 

Distinct5353
Distinct (%)69.6%
Missing2309
Missing (%)23.1%
Memory size156.2 KiB
2024-04-30T04:49:35.035449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length57
Mean length35.267716
Min length19

Characters and Unicode

Total characters271244
Distinct characters535
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4321 ?
Unique (%)56.2%

Sample

1st row서울특별시 마포구 동교로**길 **, ***호 (연남동)
2nd row서울특별시 마포구 월드컵로**길 **-* (서교동)
3rd row서울특별시 마포구 홍익로 **-**, 지층 *호 (서교동)
4th row서울특별시 마포구 마포대로 ***, ***동 ***호 (아현동, 마포 래미안 푸르지오)
5th row서울특별시 마포구 고산길 * (노고산동,*층)
ValueCountFrequency (%)
7784
15.1%
서울특별시 7690
14.9%
마포구 7679
14.9%
4191
 
8.1%
2691
 
5.2%
서교동 1250
 
2.4%
816
 
1.6%
합정동 571
 
1.1%
성산동 563
 
1.1%
마포대로 539
 
1.0%
Other values (2811) 17816
34.5%
2024-04-30T04:49:35.437151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 45763
16.9%
43913
 
16.2%
10048
 
3.7%
, 9864
 
3.6%
9572
 
3.5%
9140
 
3.4%
9006
 
3.3%
7998
 
2.9%
7833
 
2.9%
7800
 
2.9%
Other values (525) 110307
40.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 152679
56.3%
Other Punctuation 55654
 
20.5%
Space Separator 43913
 
16.2%
Open Punctuation 7728
 
2.8%
Close Punctuation 7728
 
2.8%
Dash Punctuation 1641
 
0.6%
Uppercase Letter 1359
 
0.5%
Decimal Number 251
 
0.1%
Lowercase Letter 249
 
0.1%
Letter Number 30
 
< 0.1%
Other values (2) 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10048
 
6.6%
9572
 
6.3%
9140
 
6.0%
9006
 
5.9%
7998
 
5.2%
7833
 
5.1%
7800
 
5.1%
7715
 
5.1%
7710
 
5.0%
7402
 
4.8%
Other values (451) 68455
44.8%
Uppercase Letter
ValueCountFrequency (%)
B 265
19.5%
C 163
12.0%
A 132
9.7%
D 124
9.1%
M 122
9.0%
S 95
 
7.0%
G 56
 
4.1%
T 52
 
3.8%
L 52
 
3.8%
E 46
 
3.4%
Other values (16) 252
18.5%
Lowercase Letter
ValueCountFrequency (%)
b 83
33.3%
e 32
 
12.9%
t 16
 
6.4%
i 16
 
6.4%
o 10
 
4.0%
y 10
 
4.0%
s 9
 
3.6%
a 9
 
3.6%
r 9
 
3.6%
p 8
 
3.2%
Other values (13) 47
18.9%
Decimal Number
ValueCountFrequency (%)
1 55
21.9%
0 40
15.9%
2 36
14.3%
4 31
12.4%
3 21
 
8.4%
9 16
 
6.4%
6 15
 
6.0%
5 15
 
6.0%
8 15
 
6.0%
7 7
 
2.8%
Other Punctuation
ValueCountFrequency (%)
* 45763
82.2%
, 9864
 
17.7%
/ 11
 
< 0.1%
. 8
 
< 0.1%
? 4
 
< 0.1%
& 4
 
< 0.1%
Letter Number
ValueCountFrequency (%)
25
83.3%
4
 
13.3%
1
 
3.3%
Space Separator
ValueCountFrequency (%)
43913
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7728
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7728
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1641
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 152679
56.3%
Common 116927
43.1%
Latin 1638
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10048
 
6.6%
9572
 
6.3%
9140
 
6.0%
9006
 
5.9%
7998
 
5.2%
7833
 
5.1%
7800
 
5.1%
7715
 
5.1%
7710
 
5.0%
7402
 
4.8%
Other values (451) 68455
44.8%
Latin
ValueCountFrequency (%)
B 265
16.2%
C 163
 
10.0%
A 132
 
8.1%
D 124
 
7.6%
M 122
 
7.4%
S 95
 
5.8%
b 83
 
5.1%
G 56
 
3.4%
T 52
 
3.2%
L 52
 
3.2%
Other values (42) 494
30.2%
Common
ValueCountFrequency (%)
* 45763
39.1%
43913
37.6%
, 9864
 
8.4%
( 7728
 
6.6%
) 7728
 
6.6%
- 1641
 
1.4%
1 55
 
< 0.1%
0 40
 
< 0.1%
2 36
 
< 0.1%
4 31
 
< 0.1%
Other values (12) 128
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 152678
56.3%
ASCII 118535
43.7%
Number Forms 30
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 45763
38.6%
43913
37.0%
, 9864
 
8.3%
( 7728
 
6.5%
) 7728
 
6.5%
- 1641
 
1.4%
B 265
 
0.2%
C 163
 
0.1%
A 132
 
0.1%
D 124
 
0.1%
Other values (61) 1214
 
1.0%
Hangul
ValueCountFrequency (%)
10048
 
6.6%
9572
 
6.3%
9140
 
6.0%
9006
 
5.9%
7998
 
5.2%
7833
 
5.1%
7800
 
5.1%
7715
 
5.1%
7710
 
5.0%
7402
 
4.8%
Other values (450) 68454
44.8%
Number Forms
ValueCountFrequency (%)
25
83.3%
4
 
13.3%
1
 
3.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct529
Distinct (%)8.7%
Missing3927
Missing (%)39.3%
Memory size156.2 KiB
2024-04-30T04:49:35.712030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.29096
Min length5

Characters and Unicode

Total characters32132
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)1.3%

Sample

1st row03983
2nd row04040
3rd row04129
4th row04077
5th row03911
ValueCountFrequency (%)
04072 126
 
2.1%
04089 94
 
1.5%
04053 84
 
1.4%
03925 56
 
0.9%
04017 54
 
0.9%
04071 53
 
0.9%
04168 52
 
0.9%
04167 50
 
0.8%
03993 49
 
0.8%
04049 47
 
0.8%
Other values (519) 5408
89.0%
2024-04-30T04:49:36.133269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7648
23.8%
1 5372
16.7%
4 4373
13.6%
2 2997
 
9.3%
9 2785
 
8.7%
8 2554
 
7.9%
3 2484
 
7.7%
7 1456
 
4.5%
5 1257
 
3.9%
6 1146
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 32072
99.8%
Dash Punctuation 60
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7648
23.8%
1 5372
16.7%
4 4373
13.6%
2 2997
 
9.3%
9 2785
 
8.7%
8 2554
 
8.0%
3 2484
 
7.7%
7 1456
 
4.5%
5 1257
 
3.9%
6 1146
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32132
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7648
23.8%
1 5372
16.7%
4 4373
13.6%
2 2997
 
9.3%
9 2785
 
8.7%
8 2554
 
7.9%
3 2484
 
7.7%
7 1456
 
4.5%
5 1257
 
3.9%
6 1146
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7648
23.8%
1 5372
16.7%
4 4373
13.6%
2 2997
 
9.3%
9 2785
 
8.7%
8 2554
 
7.9%
3 2484
 
7.7%
7 1456
 
4.5%
5 1257
 
3.9%
6 1146
 
3.6%
Distinct9896
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T04:49:36.430682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length38
Mean length8.0275
Min length1

Characters and Unicode

Total characters80275
Distinct characters1090
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9796 ?
Unique (%)98.0%

Sample

1st row보배언니
2nd row다크프린스(Dark Prince)
3rd row주식회사 토리프리즘
4th row욜로짱의 한일이야기
5th row(주)씨아트
ValueCountFrequency (%)
주식회사 1126
 
8.2%
136
 
1.0%
co 42
 
0.3%
스튜디오 37
 
0.3%
co.,ltd 32
 
0.2%
ltd 30
 
0.2%
도서출판 29
 
0.2%
inc 29
 
0.2%
korea 28
 
0.2%
company 27
 
0.2%
Other values (11171) 12142
88.9%
2024-04-30T04:49:36.887206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3670
 
4.6%
( 2726
 
3.4%
) 2725
 
3.4%
2655
 
3.3%
2273
 
2.8%
2225
 
2.8%
1526
 
1.9%
1230
 
1.5%
1203
 
1.5%
1192
 
1.5%
Other values (1080) 58850
73.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52757
65.7%
Lowercase Letter 9289
 
11.6%
Uppercase Letter 7453
 
9.3%
Space Separator 3670
 
4.6%
Open Punctuation 2727
 
3.4%
Close Punctuation 2726
 
3.4%
Other Symbol 548
 
0.7%
Decimal Number 531
 
0.7%
Other Punctuation 486
 
0.6%
Dash Punctuation 74
 
0.1%
Other values (2) 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2655
 
5.0%
2273
 
4.3%
2225
 
4.2%
1526
 
2.9%
1230
 
2.3%
1203
 
2.3%
1192
 
2.3%
1111
 
2.1%
895
 
1.7%
767
 
1.5%
Other values (997) 37680
71.4%
Lowercase Letter
ValueCountFrequency (%)
e 1098
11.8%
o 1049
11.3%
a 784
 
8.4%
n 734
 
7.9%
i 661
 
7.1%
t 563
 
6.1%
r 537
 
5.8%
l 512
 
5.5%
s 428
 
4.6%
m 363
 
3.9%
Other values (16) 2560
27.6%
Uppercase Letter
ValueCountFrequency (%)
A 574
 
7.7%
O 547
 
7.3%
E 546
 
7.3%
S 489
 
6.6%
N 449
 
6.0%
L 443
 
5.9%
T 438
 
5.9%
I 428
 
5.7%
C 424
 
5.7%
R 374
 
5.0%
Other values (16) 2741
36.8%
Other Punctuation
ValueCountFrequency (%)
. 281
57.8%
& 99
 
20.4%
, 72
 
14.8%
' 19
 
3.9%
# 4
 
0.8%
? 4
 
0.8%
: 3
 
0.6%
/ 2
 
0.4%
; 1
 
0.2%
% 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 96
18.1%
1 94
17.7%
0 68
12.8%
4 56
10.5%
3 56
10.5%
9 48
9.0%
5 32
 
6.0%
7 29
 
5.5%
6 28
 
5.3%
8 24
 
4.5%
Open Punctuation
ValueCountFrequency (%)
( 2726
> 99.9%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2725
> 99.9%
] 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
547
99.8%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
+ 4
80.0%
= 1
 
20.0%
Space Separator
ValueCountFrequency (%)
3670
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53272
66.4%
Latin 16742
 
20.9%
Common 10229
 
12.7%
Han 32
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2655
 
5.0%
2273
 
4.3%
2225
 
4.2%
1526
 
2.9%
1230
 
2.3%
1203
 
2.3%
1192
 
2.2%
1111
 
2.1%
895
 
1.7%
767
 
1.4%
Other values (970) 38195
71.7%
Latin
ValueCountFrequency (%)
e 1098
 
6.6%
o 1049
 
6.3%
a 784
 
4.7%
n 734
 
4.4%
i 661
 
3.9%
A 574
 
3.4%
t 563
 
3.4%
O 547
 
3.3%
E 546
 
3.3%
r 537
 
3.2%
Other values (42) 9649
57.6%
Common
ValueCountFrequency (%)
3670
35.9%
( 2726
26.6%
) 2725
26.6%
. 281
 
2.7%
& 99
 
1.0%
2 96
 
0.9%
1 94
 
0.9%
- 74
 
0.7%
, 72
 
0.7%
0 68
 
0.7%
Other values (20) 324
 
3.2%
Han
ValueCountFrequency (%)
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (18) 18
56.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52723
65.7%
ASCII 26970
33.6%
None 547
 
0.7%
CJK 29
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3670
 
13.6%
( 2726
 
10.1%
) 2725
 
10.1%
e 1098
 
4.1%
o 1049
 
3.9%
a 784
 
2.9%
n 734
 
2.7%
i 661
 
2.5%
A 574
 
2.1%
t 563
 
2.1%
Other values (71) 12386
45.9%
Hangul
ValueCountFrequency (%)
2655
 
5.0%
2273
 
4.3%
2225
 
4.2%
1526
 
2.9%
1230
 
2.3%
1203
 
2.3%
1192
 
2.3%
1111
 
2.1%
895
 
1.7%
767
 
1.5%
Other values (967) 37646
71.4%
None
ValueCountFrequency (%)
547
100.0%
CJK
ValueCountFrequency (%)
2
 
6.9%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (15) 15
51.7%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct8880
Distinct (%)88.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-06 16:09:24
Maximum2024-04-25 16:06:31
2024-04-30T04:49:37.016756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:37.157858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
8151 
U
1849 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 8151
81.5%
U 1849
 
18.5%

Length

2024-04-30T04:49:37.271718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:37.350366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 8151
81.5%
u 1849
 
18.5%
Distinct1317
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:07:00
2024-04-30T04:49:37.445274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:37.591395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct479
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T04:49:37.731192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.2618
Min length1

Characters and Unicode

Total characters82618
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique296 ?
Unique (%)3.0%

Sample

1st row의류/패션/잡화/뷰티
2nd row의류/패션/잡화/뷰티
3rd row의류/패션/잡화/뷰티
4th row의류/패션/잡화/뷰티 레져/여행/공연
5th row기타
ValueCountFrequency (%)
의류/패션/잡화/뷰티 3440
25.7%
2121
15.8%
기타 2044
15.3%
종합몰 1675
12.5%
교육/도서/완구/오락 992
 
7.4%
건강/식품 799
 
6.0%
컴퓨터/사무용품 566
 
4.2%
레져/여행/공연 518
 
3.9%
가전 418
 
3.1%
가구/수납용품 408
 
3.0%
Other values (3) 406
 
3.0%
2024-04-30T04:49:38.234619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16405
19.9%
3440
 
4.2%
3440
 
4.2%
3440
 
4.2%
3440
 
4.2%
3440
 
4.2%
3440
 
4.2%
3440
 
4.2%
3440
 
4.2%
3387
 
4.1%
Other values (41) 35306
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60705
73.5%
Other Punctuation 16405
 
19.9%
Space Separator 3387
 
4.1%
Dash Punctuation 2121
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
2179
 
3.6%
2044
 
3.4%
Other values (38) 28962
47.7%
Other Punctuation
ValueCountFrequency (%)
/ 16405
100.0%
Space Separator
ValueCountFrequency (%)
3387
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2121
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60705
73.5%
Common 21913
 
26.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
2179
 
3.6%
2044
 
3.4%
Other values (38) 28962
47.7%
Common
ValueCountFrequency (%)
/ 16405
74.9%
3387
 
15.5%
- 2121
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60705
73.5%
ASCII 21913
 
26.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16405
74.9%
3387
 
15.5%
- 2121
 
9.7%
Hangul
ValueCountFrequency (%)
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
3440
 
5.7%
2179
 
3.6%
2044
 
3.4%
Other values (38) 28962
47.7%

좌표정보(X)
Real number (ℝ)

MISSING  SKEWED 

Distinct4020
Distinct (%)51.3%
Missing2166
Missing (%)21.7%
Infinite0
Infinite (%)0.0%
Mean193240.45
Minimum187902
Maximum382137.72
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:38.365851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum187902
5-th percentile190609.35
Q1192171.65
median192987.46
Q3194440.49
95-th percentile195852.52
Maximum382137.72
Range194235.72
Interquartile range (IQR)2268.8437

Descriptive statistics

Standard deviation2655.3567
Coefficient of variation (CV)0.013741206
Kurtosis3269.3559
Mean193240.45
Median Absolute Deviation (MAD)1057.7965
Skewness45.999349
Sum1.5138457 × 109
Variance7050919.1
MonotonicityNot monotonic
2024-04-30T04:49:38.482292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
192652.282023976 101
 
1.0%
194301.304865904 95
 
0.9%
193166.430144679 61
 
0.6%
195703.425296812 55
 
0.5%
191843.829548101 48
 
0.5%
191263.451931121 44
 
0.4%
194974.587674037 42
 
0.4%
195324.793653981 39
 
0.4%
193286.152946388 38
 
0.4%
191920.248175227 35
 
0.4%
Other values (4010) 7276
72.8%
(Missing) 2166
 
21.7%
ValueCountFrequency (%)
187902.0 1
 
< 0.1%
189192.411433557 1
 
< 0.1%
189212.708916184 1
 
< 0.1%
189212.737535822 4
< 0.1%
189315.310470024 1
 
< 0.1%
189315.370584751 1
 
< 0.1%
189333.273693129 2
< 0.1%
189378.0 3
< 0.1%
189392.864151 3
< 0.1%
189392.975995366 3
< 0.1%
ValueCountFrequency (%)
382137.722569182 1
< 0.1%
202685.344146881 1
< 0.1%
202600.132889759 1
< 0.1%
196693.578355085 1
< 0.1%
196686.244500916 1
< 0.1%
196674.057781198 1
< 0.1%
196670.515295427 1
< 0.1%
196651.146604466 1
< 0.1%
196634.553138166 1
< 0.1%
196630.25802617 1
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct4022
Distinct (%)51.3%
Missing2166
Missing (%)21.7%
Infinite0
Infinite (%)0.0%
Mean450180.87
Minimum180519.3
Maximum456953.25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:38.595436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum180519.3
5-th percentile448677.8
Q1449463.14
median450143.02
Q3450699.53
95-th percentile452513.7
Maximum456953.25
Range276433.94
Interquartile range (IQR)1236.3842

Descriptive statistics

Standard deviation3233.7825
Coefficient of variation (CV)0.0071832961
Kurtosis6174.9143
Mean450180.87
Median Absolute Deviation (MAD)646.47272
Skewness-74.01132
Sum3.5267169 × 109
Variance10457349
MonotonicityNot monotonic
2024-04-30T04:49:38.720218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
449483.723207355 101
 
1.0%
449388.053900946 95
 
0.9%
450425.852790862 61
 
0.6%
449330.800717721 55
 
0.5%
451282.5581689 48
 
0.5%
452207.500653521 44
 
0.4%
448229.063825491 42
 
0.4%
448885.430093289 39
 
0.4%
449527.466294041 38
 
0.4%
448644.311298762 35
 
0.4%
Other values (4012) 7276
72.8%
(Missing) 2166
 
21.7%
ValueCountFrequency (%)
180519.302939129 1
 
< 0.1%
442370.143700207 1
 
< 0.1%
443497.668910998 1
 
< 0.1%
445725.710868321 1
 
< 0.1%
448116.639953919 5
0.1%
448125.391869079 3
< 0.1%
448161.548233627 1
 
< 0.1%
448196.660960383 2
 
< 0.1%
448201.92884513 5
0.1%
448203.634013692 1
 
< 0.1%
ValueCountFrequency (%)
456953.246288161 1
 
< 0.1%
454307.338278 1
 
< 0.1%
454178.7168 2
 
< 0.1%
454136.901035 5
 
0.1%
453923.20042427 1
 
< 0.1%
453923.18626299 1
 
< 0.1%
453913.815947 24
0.2%
453879.760075 3
 
< 0.1%
453872.42578746 8
 
0.1%
453797.100407641 2
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9787 
0
 
213

Length

Max length4
Median length4
Mean length3.9361
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9787
97.9%
0 213
 
2.1%

Length

2024-04-30T04:49:38.833724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:38.916142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9787
97.9%
0 213
 
2.1%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9787 
0
 
213

Length

Max length4
Median length4
Mean length3.9361
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9787
97.9%
0 213
 
2.1%

Length

2024-04-30T04:49:39.007609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:39.104617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9787
97.9%
0 213
 
2.1%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9787 
0
 
213

Length

Max length4
Median length4
Mean length3.9361
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9787
97.9%
0 213
 
2.1%

Length

2024-04-30T04:49:39.192498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:39.274888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9787
97.9%
0 213
 
2.1%

판매방식명
Categorical

IMBALANCE 

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
5065 
<NA>
4454 
인터넷, 기타
 
162
기타
 
106
TV홈쇼핑, 인터넷
 
53
Other values (17)
 
160

Length

Max length26
Median length3
Mean length3.7318
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row인터넷
4th row<NA>
5th row인터넷

Common Values

ValueCountFrequency (%)
인터넷 5065
50.6%
<NA> 4454
44.5%
인터넷, 기타 162
 
1.6%
기타 106
 
1.1%
TV홈쇼핑, 인터넷 53
 
0.5%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 28
 
0.3%
인터넷, 카다로그 23
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 16
 
0.2%
인터넷, 카다로그, 기타 15
 
0.1%
TV홈쇼핑 13
 
0.1%
Other values (12) 65
 
0.7%

Length

2024-04-30T04:49:39.363289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 5418
51.4%
na 4454
42.2%
기타 340
 
3.2%
tv홈쇼핑 134
 
1.3%
카다로그 116
 
1.1%
신문잡지 85
 
0.8%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
224143130000201831302013020060720180330<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 연남동 ***번지 **호서울특별시 마포구 동교로**길 **, ***호 (연남동)03983보배언니2019-07-19 11:06:09U2019-07-21 02:40:00.0의류/패션/잡화/뷰티193219.830163451329.635216<NA><NA><NA>인터넷
73243130000200831301183020032220080313<NA>3폐업3폐업처리20090220<NA><NA><NA><NA><NA>121842서울특별시 마포구 서교동 ***번지 *호 **통 *반서울특별시 마포구 월드컵로**길 **-* (서교동)<NA>다크프린스(Dark Prince)2009-02-20 11:10:27I2018-08-31 23:59:59.0의류/패션/잡화/뷰티192214.056191450572.286707<NA><NA><NA>인터넷
240173130000201931302013020005620190106<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6406-5969<NA><NA>서울특별시 마포구 서교동 ***번지 **호서울특별시 마포구 홍익로 **-**, 지층 *호 (서교동)04040주식회사 토리프리즘2019-01-06 11:39:16I2019-01-08 02:20:42.0의류/패션/잡화/뷰티193118.235209450105.147932<NA><NA><NA>인터넷
213873130000201731302013020171720170911<NA>3폐업3폐업처리20171031<NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 마포대로 ***, ***동 ***호 (아현동, 마포 래미안 푸르지오)04129욜로짱의 한일이야기2017-10-31 15:28:00I2021-12-03 22:02:00.0의류/패션/잡화/뷰티 레져/여행/공연195818.0449944.0<NA><NA><NA><NA>
72113130000200831301183020020320080213<NA>3폐업3폐업처리20110208<NA><NA><NA>718-5230<NA>121100서울특별시 마포구 노고산동 **번지 ***호 *층서울특별시 마포구 고산길 * (노고산동,*층)<NA>(주)씨아트2011-02-09 16:11:40I2018-08-31 23:59:59.0기타194427.887518450105.683141<NA><NA><NA>인터넷
219333130000201831302013020004120180108<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 상수동 ***번지 래미안 밤섬리베뉴Ⅱ서울특별시 마포구 독막로**나길 **, ***동 ****호 (상수동, 래미안 밤섬리베뉴Ⅱ)04077수지컴퍼니2018-01-08 10:53:55I2018-08-31 23:59:59.0종합몰193498.185371449374.198256<NA><NA><NA>인터넷, 기타
4133130000200231301183020042920020916<NA>1영업/정상1정상영업<NA><NA><NA><NA>337 8645<NA><NA>서울특별시 마포구 성산동 **-*<NA><NA>랄프몰2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
5313130000200231301183020055220021102<NA>3폐업3폐업처리<NA><NA><NA><NA>6377 1177<NA><NA>서울특별시 마포구 공덕*동 *** 삼부르네상스 ***<NA><NA>퓨어옥시즌코리아㈜2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
17808313000020153130180302014242015-09-02<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 매봉산로 **, ***-*호 (상암동, 마포창업복지관)0391160.1(식스티포인트원)2023-01-06 14:26:47U2022-12-04 22:06:00.0의류/패션/잡화/뷰티190204.923594452434.679035<NA><NA><NA><NA>
126653130000201131301183020168020111110<NA>3폐업3폐업처리20140701<NA><NA><NA>070-8237-3641<NA><NA><NA>서울특별시 마포구 마포대로 ***-**, ***동 ***호 (공덕동, 공덕래미안*차아파트)121020(주)테라커피2014-07-01 15:37:56I2018-08-31 23:59:59.0건강/식품195704.773542449831.361976<NA><NA><NA>인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
129493130000201231301183020032820120307<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 독막로 **-*, ***호 (상수동)121829플래티넘아로와나2015-05-20 10:10:03I2018-08-31 23:59:59.0의류/패션/잡화/뷰티192935.776364449539.348857<NA><NA><NA>인터넷
36213130000200531301183020366420050914<NA>3폐업3폐업처리<NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 합정동 ***-** 신우빌딩 ***호<NA><NA>월드CNM2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
59533130000200731301183020603320070327<NA>3폐업3폐업처리20091029<NA><NA><NA>3260752<NA><NA>서울특별시 마포구 성산동 ***-*<NA><NA>헬로 배드민턴2009-10-29 15:03:58I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
144673130000201331301183020065520130521<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-3272-9756<NA><NA>서울특별시 마포구 신수동 **-**서울특별시 마포구 독막로**길 **, *동 ***호 (신수동)04096선스튜디오아트앤디자인 (SUN STUDIO)2021-01-07 15:05:22U2021-01-09 02:40:00.0의류/패션/잡화/뷰티194552.035357449516.972967<NA><NA><NA>인터넷, 기타
175743130000201531301803020113720050419<NA>3폐업3폐업처리20170207<NA><NA><NA>070-8692-4926<NA><NA><NA>서울특별시 마포구 잔다리로 **, ***호 (서교동)121893마이홈이야기2017-02-07 09:34:40I2021-12-03 22:02:00.0-192744.720507450034.025927<NA><NA><NA><NA>
41633130000200631301183020421420060207<NA>3폐업3폐업처리20100518<NA><NA><NA><NA><NA><NA>서울특별시 마포구 도화동 *** 마스터즈빌딩 ****<NA><NA>㈜휴라이프2010-05-18 10:47:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
163673130000201431301183020142220141013<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-325-1100<NA><NA><NA>서울특별시 마포구 성산로*길 **, *층 (성산동, 한성빌딩)121848팀버울프코리아 주식회사2019-09-06 15:51:23U2019-09-08 02:40:00.0기타191540.894421451197.970014<NA><NA><NA>인터넷
274333130000202031302253020072220200304<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 중동 **번지 **호 영빈빌라서울특별시 마포구 월드컵북로**가길 **, A동 *층 ***호 (중동, 영빈빌라)03941수노코2020-03-04 21:13:12I2020-03-06 00:23:23.0기타191601.311604452085.842863<NA><NA><NA>인터넷
212873130000201731302013020160420170822<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 신촌로**가길 **, *층 (아현동, *층)04114성보2017-08-22 23:59:29I2021-12-03 22:02:00.0종합몰 교육/도서/완구/오락 컴퓨터/사무용품 가구/수납용품 건강/식품 의류/패션/잡화/뷰티 레져/여행/공연195644.401219450497.850705<NA><NA><NA><NA>
214633130000201731302013020180720170922<NA>5제외/삭제/전출5타시군구이관20210726<NA><NA><NA><NA><NA><NA><NA>서울특별시 마포구 월드컵로 ***, ****호 (성산동, 이안상암)03938오클린코리아2021-07-26 10:56:27I2021-12-03 22:02:00.0종합몰 의류/패션/잡화/뷰티 건강/식품191284.394535451460.537435<NA><NA><NA><NA>