Overview

Dataset statistics

Number of variables: 29
Number of observations: 10000
Missing cells: 76422
Missing cells (%): 26.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 MiB
Average record size in memory: 250.0 B
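
The missing-cell percentage can be cross-checked against the other figures in this table. The following is a minimal sketch, assuming the percentage is simply missing cells divided by (variables × observations); the variable names are illustrative and not taken from the profiling tool.

```python
# Figures copied from the Dataset statistics table.
n_variables = 29
n_observations = 10_000
missing_cells = 76_422

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result rounds to the 26.4% shown in the table.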

Variable types

Categorical: 9
Numeric: 5
DateTime: 8
Text: 6
Unsupported: 1

Dataset

Description: 개방자치단체코드, 관리번호, 인허가일자, 인허가취소일자, 영업상태코드, 영업상태명, 상세영업상태코드, 상세영업상태명, 폐업일자, 휴업시작일자, 휴업종료일자, 재개업일자, 전화번호, 소재지면적, 소재지우편번호, 지번주소, 도로명주소, 도로명우편번호, 사업장명, 최종수정일자, 데이터갱신구분, 데이터갱신일자, 업태구분명, 좌표정보(X), 좌표정보(Y), 자산규모, 부채총액, 자본금, 판매방식명
Author: 강남구
URL: https://data.seoul.go.kr/dataList/OA-18824/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value "3220000" [Constant]
영업상태코드 is highly imbalanced (57.0%) [Imbalance]
영업상태명 is highly imbalanced (57.0%) [Imbalance]
상세영업상태명 is highly imbalanced (61.1%) [Imbalance]
자산규모 is highly imbalanced (52.0%) [Imbalance]
부채총액 is highly imbalanced (52.0%) [Imbalance]
자본금 is highly imbalanced (52.0%) [Imbalance]
판매방식명 is highly imbalanced (72.3%) [Imbalance]
인허가취소일자 has 9998 (> 99.9%) missing values [Missing]
폐업일자 has 8263 (82.6%) missing values [Missing]
휴업시작일자 has 9985 (99.9%) missing values [Missing]
휴업종료일자 has 9985 (99.9%) missing values [Missing]
재개업일자 has 9990 (99.9%) missing values [Missing]
전화번호 has 6923 (69.2%) missing values [Missing]
소재지면적 has 10000 (100.0%) missing values [Missing]
소재지우편번호 has 9908 (99.1%) missing values [Missing]
지번주소 has 112 (1.1%) missing values [Missing]
도로명주소 has 270 (2.7%) missing values [Missing]
도로명우편번호 has 346 (3.5%) missing values [Missing]
좌표정보(X) has 321 (3.2%) missing values [Missing]
좌표정보(Y) has 321 (3.2%) missing values [Missing]
좌표정보(Y) is highly skewed (γ1 = -64.90092493) [Skewed]
관리번호 has unique values [Unique]
소재지면적 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
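
The imbalance percentages in these alerts are consistent with one minus the normalized Shannon entropy of the category counts. That reading is an assumption checked against the counts printed in this report, not a statement about the tool's internals; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform column, closer to 1 as one class dominates."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # convention chosen for this sketch only
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

status_code_counts = [8034, 915, 823, 215, 13]        # 영업상태코드
detail_status_counts = [8034, 915, 823, 202, 13, 13]  # 상세영업상태명

print(f"{imbalance_score(status_code_counts):.1%}")    # 57.0%
print(f"{imbalance_score(detail_status_counts):.1%}")  # 61.1%
```

Both results agree with the alert values for 영업상태코드 and 상세영업상태명, which supports the assumed definition for this report.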

Reproduction

Analysis started: 2024-05-11 05:43:59.590433
Analysis finished: 2024-05-11 05:44:02.679938
Duration: 3.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
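
The reported duration follows directly from the two timestamps. A small sketch using only the Python standard library; the format string is an assumption about how the timestamps are written in this report.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed timestamp layout
started = datetime.strptime("2024-05-11 05:43:59.590433", FMT)
finished = datetime.strptime("2024-05-11 05:44:02.679938", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 3.09 seconds
```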

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

3220000: 10000

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3220000
2nd row: 3220000
3rd row: 3220000
4th row: 3220000
5th row: 3220000

Common Values

Value    Count  Frequency (%)
3220000  10000  100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

관리번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0208584 × 10^18
Minimum: 1.997322 × 10^18
Maximum: 2.024322 × 10^18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1.997322 × 10^18
5-th percentile: 2.013322 × 10^18
Q1: 2.020322 × 10^18
median: 2.022322 × 10^18
Q3: 2.023322 × 10^18
95-th percentile: 2.024322 × 10^18
Maximum: 2.024322 × 10^18
Range: 2.7000017 × 10^16
Interquartile range (IQR): 3.0000013 × 10^15

Descriptive statistics

Standard deviation: 3.930146 × 10^15
Coefficient of variation (CV): 0.0019447904
Kurtosis: 7.5835039
Mean: 2.0208584 × 10^18
Median Absolute Deviation (MAD): 1 × 10^15
Skewness: -2.5593848
Sum: -9.0472731 × 10^18
Variance: 1.5446048 × 10^31
Monotonicity: Not monotonic
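
The negative Sum is not a plausible total for 10000 positive identifiers. It is consistent with signed 64-bit integer wrap-around, although the report does not state the cause. The sketch below assumes the column is stored as int64; `wrap_int64` is a hypothetical helper, and the mean is the rounded figure from this report rather than the exact one.

```python
INT64_MAX = 2**63 - 1

def wrap_int64(value):
    """Reduce an arbitrary Python int to the value a signed 64-bit integer would hold."""
    return (value + 2**63) % 2**64 - 2**63

n_observations = 10_000
approx_mean = 2_020_858_400_000_000_000  # 2.0208584 × 10^18, rounded as reported

true_sum = n_observations * approx_mean  # roughly 2.02 × 10^22
print(true_sum > INT64_MAX)              # True: the exact sum cannot fit in int64
print(wrap_int64(2**63))                 # -9223372036854775808: the first overflow wraps negative
```

Because 관리번호 is an identifier rather than a quantity, reading it as a string column would likely avoid the issue; that is a suggestion, not something the report prescribes.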
Histogram with fixed size bins (bins=50)
Values sorted by frequency:
Value                Count  Frequency (%)
2020322023630203286  1      < 0.1%
2019322023630202720  1      < 0.1%
2023322024930201272  1      < 0.1%
2020322024930201301  1      < 0.1%
2023322024930202446  1      < 0.1%
2022322024930203920  1      < 0.1%
2024322024930201517  1      < 0.1%
2023322024930203848  1      < 0.1%
2009322012730201330  1      < 0.1%
2020322023630202639  1      < 0.1%
Other values (9990)  9990   99.9%

Values in ascending order:
Value                Count  Frequency (%)
1997322008330200597  1      < 0.1%
1997322008330201069  1      < 0.1%
1998322008330201202  1      < 0.1%
1998322008330201240  1      < 0.1%
1998322008330201326  1      < 0.1%
1998322008330201361  1      < 0.1%
1999322008330201958  1      < 0.1%
1999322008330202430  1      < 0.1%
1999322008330202443  1      < 0.1%
2000322008330202738  1      < 0.1%

Values in descending order:
Value                Count  Frequency (%)
2024322024930203089  1      < 0.1%
2024322024930203088  1      < 0.1%
2024322024930203083  1      < 0.1%
2024322024930203080  1      < 0.1%
2024322024930203077  1      < 0.1%
2024322024930203076  1      < 0.1%
2024322024930203073  1      < 0.1%
2024322024930203069  1      < 0.1%
2024322024930203068  1      < 0.1%
2024322024930203067  1      < 0.1%

인허가일자
Date

Distinct: 2619
Distinct (%): 26.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 1997-02-28 00:00:00
Maximum: 2024-05-09 00:00:00
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

MISSING 

Distinct: 2
Distinct (%): 100.0%
Missing: 9998
Missing (%): > 99.9%
Memory size: 156.2 KiB
Minimum: 2023-04-27 00:00:00
Maximum: 2024-03-07 00:00:00
Histogram with fixed size bins (bins=2)

영업상태코드
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

1: 8034
3: 915
5: 823
4: 215
2: 13

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value  Count  Frequency (%)
1      8034   80.3%
3      915    9.2%
5      823    8.2%
4      215    2.1%
2      13     0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

영업상태명
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

영업/정상: 8034
폐업: 915
제외/삭제/전출: 823
취소/말소/만료/정지/중지: 215
휴업: 13

Length

Max length: 14
Median length: 5
Mean length: 5.162
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업/정상
2nd row: 영업/정상
3rd row: 영업/정상
4th row: 영업/정상
5th row: 영업/정상

Common Values

Value                     Count  Frequency (%)
영업/정상                 8034   80.3%
폐업                      915    9.2%
제외/삭제/전출            823    8.2%
취소/말소/만료/정지/중지  215    2.1%
휴업                      13     0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]
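
The length statistics for 영업상태명 can be reproduced from the category counts alone. A minimal sketch, assuming length means the number of Unicode code points in each label (precomposed Hangul syllables count as one character each):

```python
# Category counts copied from the Common Values table for 영업상태명.
counts = {
    "영업/정상": 8034,
    "폐업": 915,
    "제외/삭제/전출": 823,
    "취소/말소/만료/정지/중지": 215,
    "휴업": 13,
}

total = sum(counts.values())
# Count-weighted mean of the label lengths.
mean_length = sum(len(label) * n for label, n in counts.items()) / total
max_length = max(len(label) for label in counts)
min_length = min(len(label) for label in counts)

print(total, mean_length, max_length, min_length)
```

With these counts the mean, maximum, and minimum lengths come out at 5.162, 14, and 2, matching the Length figures reported for this variable.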

상세영업상태코드
Real number (ℝ)

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6386
Minimum: 1
Maximum: 7
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 1
Q3: 1
95-th percentile: 5
Maximum: 7
Range: 6
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.4196449
Coefficient of variation (CV): 0.86637671
Kurtosis: 3.8199885
Mean: 1.6386
Median Absolute Deviation (MAD): 0
Skewness: 2.1947736
Sum: 16386
Variance: 2.0153916
Monotonicity: Not monotonic
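
Because 상세영업상태코드 takes only six distinct values, its descriptive statistics can be recomputed from the value counts listed for this variable. The sketch assumes the variance is the sample variance (divisor n - 1); that convention is inferred from the reported figures, not documented in the report.

```python
import math

# Value -> count, copied from the frequency table for 상세영업상태코드.
freq = {1: 8034, 2: 13, 3: 915, 4: 13, 5: 823, 7: 202}

n = sum(freq.values())
total = sum(v * c for v, c in freq.items())
mean = total / n
# Assumed convention: sample variance with divisor n - 1.
variance = sum(c * (v - mean) ** 2 for v, c in freq.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(total, mean, round(variance, 7), round(std, 7), round(cv, 8))
```

Under that assumption the sum, mean, variance, standard deviation, and CV agree with the reported values to the printed precision.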
Histogram with fixed size bins (bins=6)
Values sorted by frequency:
Value  Count  Frequency (%)
1      8034   80.3%
3      915    9.2%
5      823    8.2%
7      202    2.0%
2      13     0.1%
4      13     0.1%

Values in ascending order:
Value  Count  Frequency (%)
1      8034   80.3%
2      13     0.1%
3      915    9.2%
4      13     0.1%
5      823    8.2%
7      202    2.0%

Values in descending order:
Value  Count  Frequency (%)
7      202    2.0%
5      823    8.2%
4      13     0.1%
3      915    9.2%
2      13     0.1%
1      8034   80.3%

상세영업상태명
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

정상영업: 8034
폐업처리: 915
타시군구이관: 823
직권말소: 202
휴업처리: 13

Length

Max length: 6
Median length: 4
Mean length: 4.1646
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 정상영업
2nd row: 정상영업
3rd row: 정상영업
4th row: 정상영업
5th row: 정상영업

Common Values

Value         Count  Frequency (%)
정상영업      8034   80.3%
폐업처리      915    9.2%
타시군구이관  823    8.2%
직권말소      202    2.0%
휴업처리      13     0.1%
직권취소      13     0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated above]

폐업일자
Date

MISSING 

Distinct: 577
Distinct (%): 33.2%
Missing: 8263
Missing (%): 82.6%
Memory size: 156.2 KiB
Minimum: 2003-01-29 00:00:00
Maximum: 2024-05-09 00:00:00
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct: 14
Distinct (%): 93.3%
Missing: 9985
Missing (%): 99.9%
Memory size: 156.2 KiB
Minimum: 2010-10-11 00:00:00
Maximum: 2024-04-30 00:00:00
Histogram with fixed size bins (bins=14)

휴업종료일자
Date

MISSING 

Distinct: 13
Distinct (%): 86.7%
Missing: 9985
Missing (%): 99.9%
Memory size: 156.2 KiB
Minimum: 2011-01-18 00:00:00
Maximum: 2099-12-31 00:00:00
Histogram with fixed size bins (bins=13)

재개업일자
Date

MISSING 

Distinct: 10
Distinct (%): 100.0%
Missing: 9990
Missing (%): 99.9%
Memory size: 156.2 KiB
Minimum: 2007-07-13 00:00:00
Maximum: 2024-03-11 00:00:00
Histogram with fixed size bins (bins=10)

전화번호
Text

MISSING 

Distinct: 3001
Distinct (%): 97.5%
Missing: 6923
Missing (%): 69.2%
Memory size: 156.2 KiB

Length

Max length: 20
Median length: 17
Mean length: 11.877803
Min length: 1

Characters and Unicode

Total characters: 36548
Distinct characters: 17
Distinct categories: 6
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2964
Unique (%): 96.3%

Sample

1st row: 561-6848
2nd row: 02-6956-1778
3rd row: 02-546-8288
4th row: 02-511-0917
5th row: 02-416-2001
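
The sample rows mix numbers with and without an area code, and the character statistics for this variable show both hyphens and spaces as separators. One way to make such values comparable is sketched below. `normalize_phone` is a hypothetical helper, and the default area code "02" for short numbers is an assumption based on the sample rows, not something the dataset guarantees.

```python
import re

def normalize_phone(raw, default_area_code="02"):
    """Return a hyphen-separated number, or None if the value does not look like a phone number."""
    digits = re.sub(r"\D", "", raw)  # keep digits only
    if len(digits) in (7, 8):
        # Local number without area code: prepend the assumed default.
        return f"{default_area_code}-{digits[:-4]}-{digits[-4:]}"
    if digits.startswith("02") and len(digits) in (9, 10):
        # Two-digit area code followed by a 7- or 8-digit local number.
        return f"02-{digits[2:-4]}-{digits[-4:]}"
    if digits.startswith("0") and len(digits) in (10, 11):
        # Three-digit prefix such as 070.
        return f"{digits[:3]}-{digits[3:-4]}-{digits[-4:]}"
    return None

for value in ["561-6848", "02-6956-1778", "02 546 8288", "1"]:
    print(repr(value), "->", normalize_phone(value))
```

Values that do not fit any of the patterns come back as None so they can be reviewed by hand rather than silently rewritten.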
ValueCountFrequency (%)
705
 
14.1%
02 517
 
10.4%
070 107
 
2.1%
517 13
 
0.3%
545 12
 
0.2%
511 12
 
0.2%
512 11
 
0.2%
566 11
 
0.2%
3445 9
 
0.2%
540 9
 
0.2%
Other values (3258) 3588
71.8%

Most occurring characters

ValueCountFrequency (%)
0 5990
16.4%
- 5182
14.2%
2 4297
11.8%
5 3369
9.2%
7 2618
7.2%
4 2485
6.8%
1 2469
6.8%
2238
 
6.1%
6 2157
 
5.9%
8 2155
 
5.9%
Other values (7) 3588
9.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29105
79.6%
Dash Punctuation 5182
 
14.2%
Space Separator 2238
 
6.1%
Other Punctuation 21
 
0.1%
Math Symbol 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5990
20.6%
2 4297
14.8%
5 3369
11.6%
7 2618
9.0%
4 2485
8.5%
1 2469
8.5%
6 2157
 
7.4%
8 2155
 
7.4%
3 2041
 
7.0%
9 1524
 
5.2%
Other Punctuation
ValueCountFrequency (%)
. 19
90.5%
/ 1
 
4.8%
, 1
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 5182
100.0%
Space Separator
ValueCountFrequency (%)
2238
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36548
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5990
16.4%
- 5182
14.2%
2 4297
11.8%
5 3369
9.2%
7 2618
7.2%
4 2485
6.8%
1 2469
6.8%
2238
 
6.1%
6 2157
 
5.9%
8 2155
 
5.9%
Other values (7) 3588
9.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36548
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5990
16.4%
- 5182
14.2%
2 4297
11.8%
5 3369
9.2%
7 2618
7.2%
4 2485
6.8%
1 2469
6.8%
2238
 
6.1%
6 2157
 
5.9%
8 2155
 
5.9%
Other values (7) 3588
9.8%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct: 65
Distinct (%): 70.7%
Missing: 9908
Missing (%): 99.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 138036.01
Minimum: 135010
Maximum: 339014
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 135010
5-th percentile: 135080
Q1: 135280
median: 135821
Q3: 135897
95-th percentile: 135961.8
Maximum: 339014
Range: 204004
Interquartile range (IQR): 617

Descriptive statistics

Standard deviation: 21262.185
Coefficient of variation (CV): 0.15403361
Kurtosis: 90.605717
Mean: 138036.01
Median Absolute Deviation (MAD): 118.5
Skewness: 9.489196
Sum: 12699313
Variance: 4.520805 × 10^8
Monotonicity: Not monotonic
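
Only 92 of the 10000 rows carry a value here, so a single unusual postal code moves the mean noticeably. The sketch below recomputes the mean from the reported Sum and shows the effect of leaving out the largest value (339014, listed once among the extreme values for this variable). Treating that value as an outlier is an assumption made for illustration.

```python
n_observations = 10_000
n_missing = 9_908
reported_sum = 12_699_313

n_present = n_observations - n_missing
mean = reported_sum / n_present
print(n_present, round(mean, 2))  # 92 138036.01

# Leave out the single largest value to see how far it pulls the mean.
largest_value = 339_014
mean_without_largest = (reported_sum - largest_value) / (n_present - 1)
print(round(mean_without_largest, 2))  # 135827.46
```

Postal codes are labels rather than quantities, so statistics such as mean and skewness have limited meaning for this column; storing it as text may be the more natural choice.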
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
135897 5
 
0.1%
135080 5
 
0.1%
135280 4
 
< 0.1%
135010 3
 
< 0.1%
135120 3
 
< 0.1%
135887 2
 
< 0.1%
135955 2
 
< 0.1%
135895 2
 
< 0.1%
135960 2
 
< 0.1%
135893 2
 
< 0.1%
Other values (55) 62
 
0.6%
(Missing) 9908
99.1%
ValueCountFrequency (%)
135010 3
< 0.1%
135011 1
 
< 0.1%
135080 5
0.1%
135081 1
 
< 0.1%
135090 2
 
< 0.1%
135100 2
 
< 0.1%
135120 3
< 0.1%
135190 1
 
< 0.1%
135200 1
 
< 0.1%
135240 1
 
< 0.1%
ValueCountFrequency (%)
339014 1
< 0.1%
152855 1
< 0.1%
135971 1
< 0.1%
135965 1
< 0.1%
135964 1
< 0.1%
135960 2
< 0.1%
135955 2
< 0.1%
135954 1
< 0.1%
135948 1
< 0.1%
135943 1
< 0.1%

지번주소
Text

MISSING 

Distinct: 2026
Distinct (%): 20.5%
Missing: 112
Missing (%): 1.1%
Memory size: 156.2 KiB

Length

Max length: 48
Median length: 43
Mean length: 22.175971
Min length: 15

Characters and Unicode

Total characters: 219276
Distinct characters: 515
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5

Unique

Unique: 1433
Unique (%): 14.5%

Sample

1st row: 서울특별시 강남구 역삼동 ***번지 **호
2nd row: 서울특별시 강남구 대치동 ***-**
3rd row: 서울특별시 강남구 개포동 ****-**
4th row: 서울특별시 강남구 삼성동 **-**
5th row: 서울특별시 강남구 청담동 ***번지 *호 *-*층
ValueCountFrequency (%)
서울특별시 9885
21.6%
강남구 9878
21.6%
7614
16.6%
역삼동 2396
 
5.2%
번지 2226
 
4.9%
1920
 
4.2%
논현동 1908
 
4.2%
삼성동 1553
 
3.4%
대치동 1056
 
2.3%
신사동 988
 
2.2%
Other values (1789) 6376
13.9%

Most occurring characters

ValueCountFrequency (%)
* 41803
19.1%
36192
16.5%
10299
 
4.7%
10136
 
4.6%
10080
 
4.6%
10075
 
4.6%
9989
 
4.6%
9944
 
4.5%
9903
 
4.5%
9886
 
4.5%
Other values (505) 60969
27.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 132761
60.5%
Other Punctuation 41827
 
19.1%
Space Separator 36192
 
16.5%
Dash Punctuation 6095
 
2.8%
Decimal Number 1308
 
0.6%
Uppercase Letter 782
 
0.4%
Lowercase Letter 225
 
0.1%
Open Punctuation 39
 
< 0.1%
Close Punctuation 39
 
< 0.1%
Letter Number 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10299
 
7.8%
10136
 
7.6%
10080
 
7.6%
10075
 
7.6%
9989
 
7.5%
9944
 
7.5%
9903
 
7.5%
9886
 
7.4%
9886
 
7.4%
4186
 
3.2%
Other values (437) 38377
28.9%
Uppercase Letter
ValueCountFrequency (%)
L 69
 
8.8%
H 63
 
8.1%
S 62
 
7.9%
I 60
 
7.7%
A 56
 
7.2%
O 56
 
7.2%
R 45
 
5.8%
E 45
 
5.8%
W 41
 
5.2%
B 41
 
5.2%
Other values (16) 244
31.2%
Lowercase Letter
ValueCountFrequency (%)
e 28
12.4%
o 26
11.6%
n 21
9.3%
s 18
 
8.0%
a 17
 
7.6%
l 15
 
6.7%
i 15
 
6.7%
t 13
 
5.8%
r 12
 
5.3%
k 11
 
4.9%
Other values (10) 49
21.8%
Decimal Number
ValueCountFrequency (%)
1 262
20.0%
2 189
14.4%
4 143
10.9%
6 127
9.7%
5 117
8.9%
3 114
8.7%
7 109
8.3%
8 92
 
7.0%
0 85
 
6.5%
9 70
 
5.4%
Other Punctuation
ValueCountFrequency (%)
* 41803
99.9%
, 10
 
< 0.1%
. 6
 
< 0.1%
/ 4
 
< 0.1%
& 3
 
< 0.1%
; 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
36192
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6095
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132759
60.5%
Common 85500
39.0%
Latin 1015
 
0.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10299
 
7.8%
10136
 
7.6%
10080
 
7.6%
10075
 
7.6%
9989
 
7.5%
9944
 
7.5%
9903
 
7.5%
9886
 
7.4%
9886
 
7.4%
4186
 
3.2%
Other values (435) 38375
28.9%
Latin
ValueCountFrequency (%)
L 69
 
6.8%
H 63
 
6.2%
S 62
 
6.1%
I 60
 
5.9%
A 56
 
5.5%
O 56
 
5.5%
R 45
 
4.4%
E 45
 
4.4%
W 41
 
4.0%
B 41
 
4.0%
Other values (38) 477
47.0%
Common
ValueCountFrequency (%)
* 41803
48.9%
36192
42.3%
- 6095
 
7.1%
1 262
 
0.3%
2 189
 
0.2%
4 143
 
0.2%
6 127
 
0.1%
5 117
 
0.1%
3 114
 
0.1%
7 109
 
0.1%
Other values (10) 349
 
0.4%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132758
60.5%
ASCII 86507
39.5%
Number Forms 8
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 41803
48.3%
36192
41.8%
- 6095
 
7.0%
1 262
 
0.3%
2 189
 
0.2%
4 143
 
0.2%
6 127
 
0.1%
5 117
 
0.1%
3 114
 
0.1%
7 109
 
0.1%
Other values (56) 1356
 
1.6%
Hangul
ValueCountFrequency (%)
10299
 
7.8%
10136
 
7.6%
10080
 
7.6%
10075
 
7.6%
9989
 
7.5%
9944
 
7.5%
9903
 
7.5%
9886
 
7.4%
9886
 
7.4%
4186
 
3.2%
Other values (434) 38374
28.9%
Number Forms
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

도로명주소
Text

MISSING 

Distinct: 5860
Distinct (%): 60.2%
Missing: 270
Missing (%): 2.7%
Memory size: 156.2 KiB

Length

Max length: 77
Median length: 58
Mean length: 37.125488
Min length: 22

Characters and Unicode

Total characters: 361231
Distinct characters: 591
Distinct categories: 12
Distinct scripts: 4
Distinct blocks: 5

Unique

Unique: 4713
Unique (%): 48.4%

Sample

1st row: 서울특별시 강남구 역삼로*길 *, 고명빌딩 B*층 (역삼동)
2nd row: 서울특별시 강남구 영동대로**길 **, *층 ***호 (대치동)
3rd row: 서울특별시 강남구 논현로*길 **, *층 ****호 (개포동)
4th row: 서울특별시 강남구 봉은사로 ***, *층 위즈빌딩 (삼성동)
5th row: 서울특별시 강남구 도산대로 ***, **층 (청담동, 디올메디컬허브빌딩)
ValueCountFrequency (%)
9781
13.9%
서울특별시 9730
13.9%
강남구 9723
13.9%
5588
 
8.0%
5325
 
7.6%
역삼동 2352
 
3.4%
논현동 1865
 
2.7%
삼성동 1641
 
2.3%
대치동 1041
 
1.5%
테헤란로**길 991
 
1.4%
Other values (3292) 22131
31.5%

Most occurring characters

ValueCountFrequency (%)
* 68221
18.9%
60598
16.8%
12506
 
3.5%
, 11705
 
3.2%
10972
 
3.0%
10824
 
3.0%
10199
 
2.8%
10001
 
2.8%
9867
 
2.7%
9831
 
2.7%
Other values (581) 146507
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 195289
54.1%
Other Punctuation 79956
22.1%
Space Separator 60598
 
16.8%
Open Punctuation 9801
 
2.7%
Close Punctuation 9800
 
2.7%
Uppercase Letter 1963
 
0.5%
Decimal Number 1827
 
0.5%
Dash Punctuation 1520
 
0.4%
Lowercase Letter 445
 
0.1%
Letter Number 17
 
< 0.1%
Other values (2) 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12506
 
6.4%
10972
 
5.6%
10824
 
5.5%
10199
 
5.2%
10001
 
5.1%
9867
 
5.1%
9831
 
5.0%
9770
 
5.0%
9735
 
5.0%
9730
 
5.0%
Other values (508) 91854
47.0%
Uppercase Letter
ValueCountFrequency (%)
B 476
24.2%
A 242
12.3%
S 109
 
5.6%
L 98
 
5.0%
C 97
 
4.9%
H 93
 
4.7%
E 88
 
4.5%
K 81
 
4.1%
I 71
 
3.6%
W 69
 
3.5%
Other values (16) 539
27.5%
Lowercase Letter
ValueCountFrequency (%)
e 47
10.6%
n 46
10.3%
o 37
 
8.3%
b 35
 
7.9%
t 32
 
7.2%
s 31
 
7.0%
a 29
 
6.5%
g 26
 
5.8%
r 26
 
5.8%
k 24
 
5.4%
Other values (14) 112
25.2%
Decimal Number
ValueCountFrequency (%)
1 407
22.3%
2 284
15.5%
0 218
11.9%
3 184
10.1%
4 157
 
8.6%
5 150
 
8.2%
7 133
 
7.3%
6 127
 
7.0%
9 87
 
4.8%
8 80
 
4.4%
Other Punctuation
ValueCountFrequency (%)
* 68221
85.3%
, 11705
 
14.6%
. 18
 
< 0.1%
& 8
 
< 0.1%
/ 4
 
< 0.1%
Letter Number
ValueCountFrequency (%)
16
94.1%
1
 
5.9%
Space Separator
ValueCountFrequency (%)
60598
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9801
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9800
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1520
100.0%
Math Symbol
ValueCountFrequency (%)
~ 14
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 195287
54.1%
Common 163517
45.3%
Latin 2425
 
0.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12506
 
6.4%
10972
 
5.6%
10824
 
5.5%
10199
 
5.2%
10001
 
5.1%
9867
 
5.1%
9831
 
5.0%
9770
 
5.0%
9735
 
5.0%
9730
 
5.0%
Other values (506) 91852
47.0%
Latin
ValueCountFrequency (%)
B 476
19.6%
A 242
 
10.0%
S 109
 
4.5%
L 98
 
4.0%
C 97
 
4.0%
H 93
 
3.8%
E 88
 
3.6%
K 81
 
3.3%
I 71
 
2.9%
W 69
 
2.8%
Other values (42) 1001
41.3%
Common
ValueCountFrequency (%)
* 68221
41.7%
60598
37.1%
, 11705
 
7.2%
( 9801
 
6.0%
) 9800
 
6.0%
- 1520
 
0.9%
1 407
 
0.2%
2 284
 
0.2%
0 218
 
0.1%
3 184
 
0.1%
Other values (11) 779
 
0.5%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 195286
54.1%
ASCII 165925
45.9%
Number Forms 17
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 68221
41.1%
60598
36.5%
, 11705
 
7.1%
( 9801
 
5.9%
) 9800
 
5.9%
- 1520
 
0.9%
B 476
 
0.3%
1 407
 
0.2%
2 284
 
0.2%
A 242
 
0.1%
Other values (61) 2871
 
1.7%
Hangul
ValueCountFrequency (%)
12506
 
6.4%
10972
 
5.6%
10824
 
5.5%
10199
 
5.2%
10001
 
5.1%
9867
 
5.1%
9831
 
5.0%
9770
 
5.0%
9735
 
5.0%
9730
 
5.0%
Other values (505) 91851
47.0%
Number Forms
ValueCountFrequency (%)
16
94.1%
1
 
5.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

도로명우편번호
Text

MISSING 

Distinct: 509
Distinct (%): 5.3%
Missing: 346
Missing (%): 3.5%
Memory size: 156.2 KiB

Length

Max length: 7
Median length: 5
Mean length: 5.0264139
Min length: 5

Characters and Unicode

Total characters: 48525
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 109
Unique (%): 1.1%

Sample

1st row: 06243
2nd row: 06180
3rd row: 06313
4th row: 06097
5th row: 06012
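
The length range of 5 to 7 characters and the presence of hyphens suggest that this column mixes more than one postal code layout. The sketch below sorts values into candidate layouts. The three patterns, including the hyphenated 3-3 form, are assumptions inferred from the length and character statistics, and `classify_postal_code` is a hypothetical helper.

```python
import re

# Candidate layouts (assumed, not confirmed by the report).
PATTERNS = {
    "5-digit": re.compile(r"\d{5}"),
    "6-digit": re.compile(r"\d{6}"),
    "hyphenated 3-3": re.compile(r"\d{3}-\d{3}"),
}

def classify_postal_code(value):
    """Return the name of the first layout the whole value matches, else 'other'."""
    text = value.strip()
    for name, pattern in PATTERNS.items():
        if pattern.fullmatch(text):
            return name
    return "other"

for value in ["06243", "135-080", "135080", "n/a"]:
    print(value, "->", classify_postal_code(value))
```

Counting the labels over the full column would show how many rows use each layout before deciding on a single normalized form.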
ValueCountFrequency (%)
06083 289
 
3.0%
06159 223
 
2.3%
06178 200
 
2.1%
06061 179
 
1.9%
06197 162
 
1.7%
06313 145
 
1.5%
06049 139
 
1.4%
06037 136
 
1.4%
06052 108
 
1.1%
06134 104
 
1.1%
Other values (499) 7969
82.5%

Most occurring characters

ValueCountFrequency (%)
0 14089
29.0%
6 11180
23.0%
1 5624
 
11.6%
3 4066
 
8.4%
2 3836
 
7.9%
5 2089
 
4.3%
4 2053
 
4.2%
9 2024
 
4.2%
7 1862
 
3.8%
8 1639
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 48462
99.9%
Dash Punctuation 63
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 14089
29.1%
6 11180
23.1%
1 5624
 
11.6%
3 4066
 
8.4%
2 3836
 
7.9%
5 2089
 
4.3%
4 2053
 
4.2%
9 2024
 
4.2%
7 1862
 
3.8%
8 1639
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 48525
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 14089
29.0%
6 11180
23.0%
1 5624
 
11.6%
3 4066
 
8.4%
2 3836
 
7.9%
5 2089
 
4.3%
4 2053
 
4.2%
9 2024
 
4.2%
7 1862
 
3.8%
8 1639
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48525
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14089
29.0%
6 11180
23.0%
1 5624
 
11.6%
3 4066
 
8.4%
2 3836
 
7.9%
5 2089
 
4.3%
4 2053
 
4.2%
9 2024
 
4.2%
7 1862
 
3.8%
8 1639
 
3.4%

사업장명
Text

Distinct: 9952
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 55
Median length: 45
Mean length: 9.1853
Min length: 1

Characters and Unicode

Total characters: 91853
Distinct characters: 1081
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 4

Unique

Unique: 9908
Unique (%): 99.1%

Sample

1st row: 라임스퀘어
2nd row: 베러퓨처
3rd row: 지투지투컴퍼니
4th row: 농업회사법인 애그리얼라이언스 주식회사(Agri Alliance Inc.)
5th row: 황후연
ValueCountFrequency (%)
주식회사 2666
 
16.7%
330
 
2.1%
inc 120
 
0.8%
co.,ltd 97
 
0.6%
co 83
 
0.5%
ltd 83
 
0.5%
유한회사 75
 
0.5%
korea 44
 
0.3%
41
 
0.3%
스튜디오 35
 
0.2%
Other values (11461) 12412
77.6%

Most occurring characters

ValueCountFrequency (%)
6002
 
6.5%
3829
 
4.2%
3144
 
3.4%
3084
 
3.4%
2942
 
3.2%
2785
 
3.0%
) 2733
 
3.0%
( 2731
 
3.0%
2507
 
2.7%
1266
 
1.4%
Other values (1071) 60830
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62741
68.3%
Lowercase Letter 8880
 
9.7%
Uppercase Letter 7241
 
7.9%
Space Separator 6002
 
6.5%
Close Punctuation 2736
 
3.0%
Open Punctuation 2735
 
3.0%
Other Punctuation 909
 
1.0%
Decimal Number 540
 
0.6%
Dash Punctuation 45
 
< 0.1%
Connector Punctuation 14
 
< 0.1%
Other values (3) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3829
 
6.1%
3144
 
5.0%
3084
 
4.9%
2942
 
4.7%
2785
 
4.4%
2507
 
4.0%
1266
 
2.0%
1052
 
1.7%
1037
 
1.7%
1004
 
1.6%
Other values (987) 40091
63.9%
Lowercase Letter
ValueCountFrequency (%)
e 997
11.2%
o 971
10.9%
a 772
 
8.7%
n 753
 
8.5%
i 667
 
7.5%
t 639
 
7.2%
r 553
 
6.2%
l 483
 
5.4%
d 395
 
4.4%
s 379
 
4.3%
Other values (16) 2271
25.6%
Uppercase Letter
ValueCountFrequency (%)
A 561
 
7.7%
C 555
 
7.7%
E 541
 
7.5%
O 528
 
7.3%
L 517
 
7.1%
I 498
 
6.9%
S 435
 
6.0%
T 415
 
5.7%
N 401
 
5.5%
M 351
 
4.8%
Other values (16) 2439
33.7%
Decimal Number
ValueCountFrequency (%)
1 107
19.8%
2 95
17.6%
3 72
13.3%
0 56
10.4%
5 44
8.1%
4 44
8.1%
7 36
 
6.7%
9 30
 
5.6%
8 29
 
5.4%
6 27
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 607
66.8%
, 185
 
20.4%
& 73
 
8.0%
' 17
 
1.9%
? 8
 
0.9%
/ 7
 
0.8%
: 5
 
0.6%
# 5
 
0.6%
! 2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 2731
99.9%
[ 3
 
0.1%
{ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 3
60.0%
< 1
 
20.0%
> 1
 
20.0%
Close Punctuation
ValueCountFrequency (%)
) 2733
99.9%
] 3
 
0.1%
Space Separator
ValueCountFrequency (%)
6002
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 14
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62733
68.3%
Latin 16121
 
17.6%
Common 12987
 
14.1%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3829
 
6.1%
3144
 
5.0%
3084
 
4.9%
2942
 
4.7%
2785
 
4.4%
2507
 
4.0%
1266
 
2.0%
1052
 
1.7%
1037
 
1.7%
1004
 
1.6%
Other values (977) 40083
63.9%
Latin
ValueCountFrequency (%)
e 997
 
6.2%
o 971
 
6.0%
a 772
 
4.8%
n 753
 
4.7%
i 667
 
4.1%
t 639
 
4.0%
A 561
 
3.5%
C 555
 
3.4%
r 553
 
3.4%
E 541
 
3.4%
Other values (42) 9112
56.5%
Common
ValueCountFrequency (%)
6002
46.2%
) 2733
21.0%
( 2731
21.0%
. 607
 
4.7%
, 185
 
1.4%
1 107
 
0.8%
2 95
 
0.7%
& 73
 
0.6%
3 72
 
0.6%
0 56
 
0.4%
Other values (21) 326
 
2.5%
Han
ValueCountFrequency (%)
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62729
68.3%
ASCII 29108
31.7%
CJK 12
 
< 0.1%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6002
20.6%
) 2733
 
9.4%
( 2731
 
9.4%
e 997
 
3.4%
o 971
 
3.3%
a 772
 
2.7%
n 753
 
2.6%
i 667
 
2.3%
t 639
 
2.2%
. 607
 
2.1%
Other values (73) 12236
42.0%
Hangul
ValueCountFrequency (%)
3829
 
6.1%
3144
 
5.0%
3084
 
4.9%
2942
 
4.7%
2785
 
4.4%
2507
 
4.0%
1266
 
2.0%
1052
 
1.7%
1037
 
1.7%
1004
 
1.6%
Other values (976) 40079
63.9%
None
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%

최종수정일자
Date

Distinct: 9959
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2007-07-25 11:24:02
Maximum: 2024-05-09 20:33:50
Histogram with fixed size bins (bins=50)

데이터갱신구분
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

I: 5279
U: 4721

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: U
2nd row: I
3rd row: I
4th row: I
5th row: I

Common Values

Value  Count  Frequency (%)
I      5279   52.8%
U      4721   47.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Value  Count  Frequency (%)
i      5279   52.8%
u      4721   47.2%

데이터갱신일자
Date

Distinct: 1581
Distinct (%): 15.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2018-08-31 23:59:59
Maximum: 2023-12-05 00:09:00
Histogram with fixed size bins (bins=50)

업태구분명
Text

Distinct: 462
Distinct (%): 4.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 87
Median length: 84
Mean length: 8.7863
Min length: 1

Characters and Unicode

Total characters: 87863
Distinct characters: 51
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 282
Unique (%): 2.8%

Sample

1st row: 기타
2nd row: 교육/도서/완구/오락
3rd row: 종합몰
4th row: 건강/식품
5th row: 기타
ValueCountFrequency (%)
종합몰 3853
27.3%
의류/패션/잡화/뷰티 3242
22.9%
기타 2375
16.8%
건강/식품 1251
 
8.9%
교육/도서/완구/오락 872
 
6.2%
가구/수납용품 526
 
3.7%
컴퓨터/사무용품 499
 
3.5%
레져/여행/공연 439
 
3.1%
가전 408
 
2.9%
자동차/자동차용품 313
 
2.2%
Other values (3) 349
 
2.5%

Most occurring characters

ValueCountFrequency (%)
/ 15876
 
18.1%
4127
 
4.7%
3853
 
4.4%
3853
 
4.4%
3853
 
4.4%
3242
 
3.7%
3242
 
3.7%
3242
 
3.7%
3242
 
3.7%
3242
 
3.7%
Other values (41) 40091
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67752
77.1%
Other Punctuation 15876
 
18.1%
Space Separator 4127
 
4.7%
Dash Punctuation 108
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3853
 
5.7%
3853
 
5.7%
3853
 
5.7%
3242
 
4.8%
3242
 
4.8%
3242
 
4.8%
3242
 
4.8%
3242
 
4.8%
3242
 
4.8%
3242
 
4.8%
Other values (38) 33499
49.4%
Other Punctuation
ValueCountFrequency (%)
/ 15876
100.0%
Space Separator
ValueCountFrequency (%)
4127
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 108
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 67752 | 77.1%
Common | 20111 | 22.9%

Most frequent character per script

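Hangul
Value | Count | Frequency (%)
(character missing) | 3853 | 5.7%
(character missing) | 3853 | 5.7%
(character missing) | 3853 | 5.7%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
Other values (38) | 33499 | 49.4%

Common
Value | Count | Frequency (%)
/ | 15876 | 78.9%
(space) | 4127 | 20.5%
- | 108 | 0.5%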

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 67752 | 77.1%
ASCII | 20111 | 22.9%

Most frequent character per block

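ASCII
Value | Count | Frequency (%)
/ | 15876 | 78.9%
(space) | 4127 | 20.5%
- | 108 | 0.5%

Hangul
Value | Count | Frequency (%)
(character missing) | 3853 | 5.7%
(character missing) | 3853 | 5.7%
(character missing) | 3853 | 5.7%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
(character missing) | 3242 | 4.8%
Other values (38) | 33499 | 49.4%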

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct4376
Distinct (%)45.2%
Missing321
Missing (%)3.2%
Infinite0
Infinite (%)0.0%
Mean203920.29
Minimum188241.23
Maximum211087.68
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum188241.23
5-th percentile202019.45
Q1202776.93
median203607.07
Q3204715.15
95-th percentile207243.08
Maximum211087.68
Range22846.446
Interquartile range (IQR)1938.2158

Descriptive statistics

Standard deviation1593.3523
Coefficient of variation (CV)0.0078136033
Kurtosis4.3545609
Mean203920.29
Median Absolute Deviation (MAD)974.62317
Skewness1.0920309
Sum1.9737445 × 109
Variance2538771.5
MonotonicityNot monotonic
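
Several of the figures above are derived from others, so they can be cross-checked by hand. The sketch below recomputes range, IQR, coefficient of variation, and variance from the values displayed in this report. The inputs are the rounded figures as printed, so small differences in the last digits are expected; the variable names are illustrative only.

```python
import math

# Values copied from the report for 좌표정보(X), rounded as displayed.
minimum, maximum = 188241.23, 211087.68
q1, q3 = 202776.93, 204715.15
mean, std = 203920.29, 1593.3523

value_range = maximum - minimum  # report shows Range 22846.446
iqr = q3 - q1                    # report shows IQR 1938.2158
cv = std / mean                  # report shows CV 0.0078136033
variance = std ** 2              # report shows Variance 2538771.5

for name, value in [("range", value_range), ("iqr", iqr),
                    ("cv", cv), ("variance", variance)]:
    print(f"{name}: {value:.10g}")

# Each recomputed figure should agree with the report up to rounding.
assert math.isclose(cv, 0.0078136033, rel_tol=1e-6)
```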
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
205271.275273064 | 276 | 2.8%
205079.418138107 | 182 | 1.8%
204691.655 | 169 | 1.7%
203125.055 | 158 | 1.6%
204596.386898687 | 133 | 1.3%
204767.377525939 | 128 | 1.3%
202124.262977683 | 110 | 1.1%
202668.065422549 | 93 | 0.9%
203588.048190661 | 82 | 0.8%
202724.214297488 | 74 | 0.7%
Other values (4366) | 8274 | 82.7%
(Missing) | 321 | 3.2%

Minimum 10 values

Value | Count | Frequency (%)
188241.230161722 | 1 | < 0.1%
189681.679972842 | 1 | < 0.1%
192204.400962 | 1 | < 0.1%
193326.661927575 | 1 | < 0.1%
199197.79507181 | 1 | < 0.1%
199926.293186604 | 1 | < 0.1%
201509.712645065 | 1 | < 0.1%
201537.291178087 | 1 | < 0.1%
201543.962547061 | 4 | < 0.1%
201595.925382148 | 3 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
211087.676151765 | 1 | < 0.1%
210442.413455881 | 5 | 0.1%
210409.444142629 | 6 | 0.1%
210050.902574 | 4 | < 0.1%
209906.371368442 | 2 | < 0.1%
209897.807021555 | 10 | 0.1%
209802.490272781 | 1 | < 0.1%
209646.114645275 | 6 | 0.1%
209631.0 | 2 | < 0.1%
209630.187801567 | 1 | < 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct4372
Distinct (%)45.2%
Missing321
Missing (%)3.2%
Infinite0
Infinite (%)0.0%
Mean444800.3
Minimum182787
Maximum454458.49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum182787
5-th percentile441599.66
Q1444024.18
median444945.1
Q3445945.2
95-th percentile446935.65
Maximum454458.49
Range271671.49
Interquartile range (IQR)1921.0211

Descriptive statistics

Standard deviation3060.7986
Coefficient of variation (CV)0.0068812871
Kurtosis5549.0502
Mean444800.3
Median Absolute Deviation (MAD)970.51131
Skewness-64.900925
Sum4.3052222 × 109
Variance9368488
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
445852.638814088 | 276 | 2.8%
444869.834721154 | 182 | 1.8%
444968.62 | 169 | 1.7%
446027.47 | 158 | 1.6%
444623.044946341 | 133 | 1.3%
441349.074287866 | 128 | 1.3%
446374.647464072 | 110 | 1.1%
446332.171860773 | 93 | 0.9%
446183.368328818 | 82 | 0.8%
445894.305854116 | 74 | 0.7%
Other values (4362) | 8274 | 82.7%
(Missing) | 321 | 3.2%

Minimum 10 values

Value | Count | Frequency (%)
182786.999825 | 1 | < 0.1%
439796.044686133 | 9 | 0.1%
440091.83370207 | 1 | < 0.1%
440183.87349816 | 4 | < 0.1%
440224.26115069 | 1 | < 0.1%
440226.898146447 | 6 | 0.1%
440233.388928253 | 3 | < 0.1%
440459.565 | 3 | < 0.1%
440468.096921167 | 3 | < 0.1%
440475.774156274 | 2 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
454458.493979132 | 1 | < 0.1%
451234.452170761 | 1 | < 0.1%
450698.570810848 | 1 | < 0.1%
449052.003743497 | 1 | < 0.1%
448521.484487285 | 1 | < 0.1%
447864.763737276 | 8 | 0.1%
447800.663176567 | 4 | < 0.1%
447748.161018109 | 2 | < 0.1%
447703.870744411 | 1 | < 0.1%
447663.86257666 | 9 | 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Value | Count
<NA> | 8964
0 | 1036

Length

Max length4
Median length4
Mean length3.6892
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row<NA>
2nd row0
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 8964 | 89.6%
0 | 1036 | 10.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
na | 8964 | 89.6%
0 | 1036 | 10.4%
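
The Alerts section reports this variable as highly imbalanced (52.0%). A plausible reading, stated here as an assumption rather than a confirmed description of the tool's internals, is that the score is one minus the Shannon entropy of the category frequencies normalized by the maximum entropy for that number of categories. The sketch below applies that formula to the two counts above; `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """1 - H / H_max, where H is the Shannon entropy (base 2) of the
    class frequencies and H_max = log2(number of classes).
    0 means perfectly balanced, 1 means a single class dominates fully."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total)
                   for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Counts for "<NA>" and "0" as shown in the Common Values table.
score = imbalance_score([8964, 1036])
print(f"imbalance: {score:.1%}")
```

With these counts the formula gives a value that rounds to the 52.0% shown in the alert, which supports the assumption without proving it.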

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Value | Count
<NA> | 8964
0 | 1036

Length

Max length4
Median length4
Mean length3.6892
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row<NA>
2nd row0
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 8964 | 89.6%
0 | 1036 | 10.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
na | 8964 | 89.6%
0 | 1036 | 10.4%
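
Note that this variable reports Missing 0 even though 8964 rows hold the value `<NA>`, and the Mean length of 3.6892 only works out if `<NA>` is counted as a four-character string. That suggests the placeholder was stored as literal text. The sketch below reproduces the length figure and shows one way to turn the placeholder into a real missing value before analysis. It uses plain Python lists rebuilt from the reported counts, not the actual dataset, and the names `NA_TOKEN` and `cleaned` are illustrative.

```python
NA_TOKEN = "<NA>"  # literal placeholder string, as it appears in the report

# Rebuild the column from the reported counts: 8964 placeholders, 1036 zeros.
values = [NA_TOKEN] * 8964 + ["0"] * 1036

# Treating "<NA>" as text gives (8964 * 4 + 1036 * 1) / 10000.
mean_length = sum(len(v) for v in values) / len(values)
print(f"mean length: {mean_length}")

# Replace the placeholder with None and parse the remaining values as ints.
cleaned = [None if v == NA_TOKEN else int(v) for v in values]
missing = sum(v is None for v in cleaned)
print(f"missing after cleaning: {missing} of {len(cleaned)}")
```

After cleaning, the column would be numeric with 89.6% missing and a single observed value of 0, which may be a more accurate picture than a two-level categorical.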

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Value | Count
<NA> | 8964
0 | 1036

Length

Max length4
Median length4
Mean length3.6892
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row<NA>
2nd row0
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 8964 | 89.6%
0 | 1036 | 10.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
na | 8964 | 89.6%
0 | 1036 | 10.4%

판매방식명
Categorical

IMBALANCE 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Value | Count
<NA> | 6937
인터넷 (internet) | 2620
기타 (other) | 136
인터넷, 기타 (internet, other) | 125
TV홈쇼핑, 인터넷 (TV home shopping, internet) | 59
Other values (15) | 123

Length

Max length26
Median length4
Mean length3.9431
Min length2

Unique

Unique3
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row<NA>
4th row<NA>
5th row기타

Common Values

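Value | Count | Frequency (%)
<NA> | 6937 | 69.4%
인터넷 (internet) | 2620 | 26.2%
기타 (other) | 136 | 1.4%
인터넷, 기타 (internet, other) | 125 | 1.2%
TV홈쇼핑, 인터넷 (TV home shopping, internet) | 59 | 0.6%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 (TV home shopping, internet, catalog, newspapers/magazines, other) | 33 | 0.3%
인터넷, 카다로그 (internet, catalog) | 19 | 0.2%
TV홈쇼핑, 인터넷, 기타 (TV home shopping, internet, other) | 14 | 0.1%
TV홈쇼핑, 인터넷, 카다로그 (TV home shopping, internet, catalog) | 13 | 0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 (TV home shopping, internet, catalog, newspapers/magazines) | 8 | 0.1%
Other values (10) | 36 | 0.4%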

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
na | 6937 | 66.2%
인터넷 (internet) | 2919 | 27.9%
기타 (other) | 326 | 3.1%
tv홈쇼핑 (TV home shopping) | 140 | 1.3%
카다로그 (catalog) | 93 | 0.9%
신문잡지 (newspapers/magazines) | 62 | 0.6%
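
Because 판매방식명 stores several sales channels in one comma-separated string, per-channel counts like those in the token table above require splitting each value first. The sketch below does this for the nine named combinations in the Common Values table. It is an approximation: the ten combinations grouped under "Other values (10)" are not listed in the report, so the totals here are lower bounds, and the report's own tokenizer may work differently. `token_counts` is a hypothetical helper.

```python
from collections import Counter


def token_counts(value_counts):
    """Split comma-separated values into lowercase tokens and sum the
    row counts of every value in which each token appears."""
    tokens = Counter()
    for value, count in value_counts.items():
        for token in value.split(","):
            tokens[token.strip().lower()] += count
    return tokens


# Named combinations and counts copied from the Common Values table.
common_values = {
    "인터넷": 2620,
    "기타": 136,
    "인터넷, 기타": 125,
    "TV홈쇼핑, 인터넷": 59,
    "TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타": 33,
    "인터넷, 카다로그": 19,
    "TV홈쇼핑, 인터넷, 기타": 14,
    "TV홈쇼핑, 인터넷, 카다로그": 13,
    "TV홈쇼핑, 인터넷, 카다로그, 신문잡지": 8,
}

tokens = token_counts(common_values)
for token, count in tokens.most_common():
    print(token, count)
```

The resulting counts fall slightly below the reported token totals (for example, 인터넷 is reported as 2919), and the gap is consistent with the 36 rows hidden under "Other values (10)".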

Sample

(index) | 개방자치단체코드 | 관리번호 | 인허가일자 | 인허가취소일자 | 영업상태코드 | 영업상태명 | 상세영업상태코드 | 상세영업상태명 | 폐업일자 | 휴업시작일자 | 휴업종료일자 | 재개업일자 | 전화번호 | 소재지면적 | 소재지우편번호 | 지번주소 | 도로명주소 | 도로명우편번호 | 사업장명 | 최종수정일자 | 데이터갱신구분 | 데이터갱신일자 | 업태구분명 | 좌표정보(X) | 좌표정보(Y) | 자산규모 | 부채총액 | 자본금 | 판매방식명
25701 | 3220000 | 2020322023630203286 | 20200603 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 역삼동 ***번지 **호 | 서울특별시 강남구 역삼로*길 *, 고명빌딩 B*층 (역삼동) | 06243 | 라임스퀘어 | 2020-06-09 13:56:40 | U | 2020-06-11 02:40:00.0 | 기타 | 202777.041481 | 443552.088628 | <NA> | <NA> | <NA> | 인터넷
28064 | 3220000 | 2021322024930205517 | 20210916 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 대치동 ***-** | 서울특별시 강남구 영동대로**길 **, *층 ***호 (대치동) | 06180 | 베러퓨처 | 2021-09-16 09:38:55 | I | 2021-09-18 00:22:49.0 | 교육/도서/완구/오락 | 205314.473811 | 444975.112614 | 0 | 0 | 0 | 인터넷
9004 | 3220000 | 2024322024930202408 | 2024-04-03 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 개포동 ****-** | 서울특별시 강남구 논현로*길 **, *층 ****호 (개포동) | 06313 | 지투지투컴퍼니 | 2024-04-03 14:41:51 | I | 2023-12-04 00:05:00.0 | 종합몰 | 204767.377526 | 441349.074288 | <NA> | <NA> | <NA> | <NA>
15095 | 3220000 | 2022322024930204692 | 20220824 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 삼성동 **-** | 서울특별시 강남구 봉은사로 ***, *층 위즈빌딩 (삼성동) | 06097 | 농업회사법인 애그리얼라이언스 주식회사(Agri Alliance Inc.) | 2022-08-24 10:35:06 | I | 2021-12-07 22:06:00.0 | 건강/식품 | 204070.238359 | 445477.407575 | <NA> | <NA> | <NA> | <NA>
18694 | 3220000 | 2014322016230202029 | 20140717 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | 561-6848 | <NA> | <NA> | 서울특별시 강남구 청담동 ***번지 *호 *-*층 | 서울특별시 강남구 도산대로 ***, **층 (청담동, 디올메디컬허브빌딩) | 06012 | 황후연 | 2016-05-17 14:56:49 | I | 2018-08-31 23:59:59.0 | 기타 | 204256.531483 | 446946.912933 | <NA> | <NA> | <NA> | 기타
4475 | 3220000 | 2023322024930204001 | 2022-08-02 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 개포동 ****-* 지성빌딩 | 서울특별시 강남구 논현로 **, *층 *-***호 지성빌딩 (개포동) | 06307 | 챕터 | 2023-07-25 13:59:27 | I | 2022-12-06 22:07:00.0 | 기타 | 203952.774672 | 441675.495466 | <NA> | <NA> | <NA> | <NA>
10772 | 3220000 | 2024322024930202125 | 2024-03-21 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 역삼동 ***-** | 서울특별시 강남구 봉은사로**길 *, *층 ****호 (역삼동) | 06135 | (유)홍콩직구(Hongkong Jikgu Limited Company) | 2024-03-21 10:21:26 | I | 2023-12-02 22:03:00.0 | 종합몰 의류/패션/잡화/뷰티 | 203149.125584 | 445032.355574 | <NA> | <NA> | <NA> | <NA>
13663 | 3220000 | 2022322024930202968 | 20220525 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 신사동 *** | 서울특별시 강남구 언주로***길 **, *층 (신사동) | 06017 | 라이언뷰티(주) | 2022-05-30 18:12:37 | U | 2021-12-06 00:08:00.0 | 의류/패션/잡화/뷰티 | 203025.084706 | 447173.224016 | <NA> | <NA> | <NA> | <NA>
17179 | 3220000 | 2023322024930200083 | 20230103 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 역삼동 696-43 | 서울특별시 강남구 선릉로 517, 6층 601호 A02 (역삼동) | 06149 | 디비로드커머셜 | 2023-01-03 10:30:19 | I | 2022-12-01 00:05:00.0 | 종합몰 | 204187.366878 | 444832.565758 | <NA> | <NA> | <NA> | <NA>
6622 | 3220000 | 2023322024930205872 | 2021-03-05 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | 02-6956-1778 | <NA> | <NA> | 서울특별시 강남구 논현동 ***-* 성우빌딩 | 서울특별시 강남구 학동로**길 *, 성우빌딩 *층 ***호 (논현동) | 06061 | 주식회사 스패너(Xpanner lnc.) | 2023-11-09 16:10:18 | I | 2022-10-31 23:01:00.0 | 자동차/자동차용품 기타 | 203271.271836 | 446032.23695 | <NA> | <NA> | <NA> | <NA>
(index) | 개방자치단체코드 | 관리번호 | 인허가일자 | 인허가취소일자 | 영업상태코드 | 영업상태명 | 상세영업상태코드 | 상세영업상태명 | 폐업일자 | 휴업시작일자 | 휴업종료일자 | 재개업일자 | 전화번호 | 소재지면적 | 소재지우편번호 | 지번주소 | 도로명주소 | 도로명우편번호 | 사업장명 | 최종수정일자 | 데이터갱신구분 | 데이터갱신일자 | 업태구분명 | 좌표정보(X) | 좌표정보(Y) | 자산규모 | 부채총액 | 자본금 | 판매방식명
9290 | 3220000 | 2024322024930200333 | 2024-01-12 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 역삼동 ***-** | 서울특별시 강남구 봉은사로**길 *, *층 ****호 (역삼동) | 06135 | 무한상점 | 2024-01-12 11:35:22 | I | 2023-11-30 23:04:00.0 | 종합몰 | 203149.125584 | 445032.355574 | <NA> | <NA> | <NA> | <NA>
17861 | 3220000 | 2022322024930206723 | 20221222 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 삼성동 107 | 서울특별시 강남구 영동대로 602, 6층 엔290호 (삼성동, 삼성동 미켈란 107) | 06083 | 주식회사 아페라 (APERA CO.,LTD) | 2022-12-22 13:41:36 | I | 2021-11-01 22:04:00.0 | 종합몰 교육/도서/완구/오락 가전 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티 자동차/자동차용품 | 205271.275273 | 445852.638814 | <NA> | <NA> | <NA> | <NA>
24372 | 3220000 | 2022322024930200478 | 20220119 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 청담동 **-** | 서울특별시 강남구 영동대로***길 **, *층 ***호 (청담동, 청담동테라스) | 06072 | 디아망 필라테스 | 2022-01-19 10:27:13 | I | 2022-01-21 00:22:39.0 | 레져/여행/공연 | 204806.320436 | 446482.840887 | 0 | 0 | 0 | 인터넷
25034 | 3220000 | 2020322023630202469 | 20190523 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | 02-546-3934 | <NA> | <NA> | 서울특별시 강남구 논현동 ***번지 **호 | 서울특별시 강남구 봉은사로**길 *-*, 비* **호 (논현동) | 06109 | (주)갤럭시인터내셔널 | 2021-11-03 15:06:59 | U | 2021-11-05 02:40:00.0 | 의류/패션/잡화/뷰티 | 203124.417767 | 445144.50744 | 0 | 0 | 0 | 인터넷
1317 | 3220000 | 2023322024930201880 | 2017-01-02 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | 02-336-9626 | <NA> | <NA> | 서울특별시 강남구 논현동 ** | 서울특별시 강남구 도산대로*길 **, *층 (논현동) | 06039 | 주빈(ZUVIN) | 2023-04-06 11:08:16 | U | 2022-12-04 00:08:00.0 | 의류/패션/잡화/뷰티 | 201903.695735 | 446003.638073 | <NA> | <NA> | <NA> | <NA>
14540 | 3220000 | 2022322024930203269 | 20220615 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 청담동 ***-** 대우리츠카운티 | 서울특별시 강남구 도산대로**길 **, ***호 (청담동, 대우리츠카운티) | 06011 | 주식회사 민티스트클럽 | 2022-06-15 15:00:02 | I | 2021-12-05 23:07:00.0 | 종합몰 | 204219.574446 | 447161.296316 | <NA> | <NA> | <NA> | <NA>
15394 | 3220000 | 2022322024930204337 | 20220805 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 수서동 *** | 서울특별시 강남구 광평로**길 **, ****동 ****호 (수서동, 까치마을아파트) | 06354 | 베니앤코(Bennie & Co) | 2022-08-05 14:54:07 | I | 2021-12-08 00:07:00.0 | 종합몰 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티 자동차/자동차용품 | 207679.579279 | 442508.006384 | <NA> | <NA> | <NA> | <NA>
10591 | 3220000 | 2024322024930202635 | 2024-04-17 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 역삼동 ***-* 역삼현대벤쳐텔 | 서울특별시 강남구 테헤란로**길 **, 역삼현대벤쳐텔 *층 ***호 (역삼동) | 06132 | 프리미에라커피 | 2024-04-17 10:13:29 | I | 2023-12-03 23:09:00.0 | 건강/식품 | 203081.161709 | 444381.101104 | <NA> | <NA> | <NA> | <NA>
24382 | 3220000 | 2022322024930200017 | 20220103 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 청담동 **-* | 서울특별시 강남구 선릉로 ***, *층 ***호 유진빌딩 (청담동) | 06065 | 스터닝뷰티 | 2022-01-03 12:26:47 | I | 2022-01-05 00:22:41.0 | 의류/패션/잡화/뷰티 | 203543.046428 | 446328.53261 | 0 | 0 | 0 | 인터넷
25129 | 3220000 | 2020322023630202247 | 20200416 | <NA> | 1 | 영업/정상 | 1 | 정상영업 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 서울특별시 강남구 신사동 ***번지 *호 | 서울특별시 강남구 강남대로***길 **-*, *층 (신사동) | 06028 | 마이 미아(mai mia) | 2020-05-15 14:42:13 | U | 2020-05-17 02:40:00.0 | 의류/패션/잡화/뷰티 | 201845.879627 | 446487.104143 | <NA> | <NA> | <NA> | 인터넷