Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells83694
Missing cells (%)28.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical9
Numeric6
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author서초구
URLhttps://data.seoul.go.kr/dataList/OA-18823/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
상세영업상태명 is highly imbalanced (52.1%)Imbalance
자산규모 is highly imbalanced (87.2%)Imbalance
부채총액 is highly imbalanced (87.2%)Imbalance
자본금 is highly imbalanced (87.2%)Imbalance
판매방식명 is highly imbalanced (69.5%)Imbalance
인허가취소일자 has 9994 (99.9%) missing valuesMissing
폐업일자 has 6942 (69.4%) missing valuesMissing
휴업시작일자 has 9975 (99.8%) missing valuesMissing
휴업종료일자 has 9975 (99.8%) missing valuesMissing
재개업일자 has 9993 (99.9%) missing valuesMissing
전화번호 has 1857 (18.6%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7445 (74.5%) missing valuesMissing
지번주소 has 2923 (29.2%) missing valuesMissing
도로명주소 has 3144 (31.4%) missing valuesMissing
도로명우편번호 has 5360 (53.6%) missing valuesMissing
좌표정보(X) has 3043 (30.4%) missing valuesMissing
좌표정보(Y) has 3043 (30.4%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = 50.4684232)Skewed
좌표정보(Y) is highly skewed (γ1 = -62.39053251)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 10:47:16.866909
Analysis finished2024-04-06 10:47:21.036107
Duration4.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3210000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3210000
2nd row3210000
3rd row3210000
4th row3210000
5th row3210000

Common Values

ValueCountFrequency (%)
3210000 10000
100.0%

Length

2024-04-06T19:47:21.149665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:21.331162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3210000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0104905 × 1018
Minimum1.996321 × 1018
Maximum2.019321 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T19:47:21.536023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.996321 × 1018
5-th percentile2.002321 × 1018
Q12.006321 × 1018
median2.010321 × 1018
Q32.015321 × 1018
95-th percentile2.018321 × 1018
Maximum2.019321 × 1018
Range2.3000008 × 1016
Interquartile range (IQR)9.0000077 × 1015

Descriptive statistics

Standard deviation5.3254892 × 1015
Coefficient of variation (CV)0.0026488507
Kurtosis-1.0415712
Mean2.0104905 × 1018
Median Absolute Deviation (MAD)5.0000032 × 1015
Skewness-0.13622108
Sum-2.0459246 × 1018
Variance2.8360835 × 1031
MonotonicityNot monotonic
2024-04-06T19:47:21.773786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2003321007630202975 1
 
< 0.1%
2015321015330201365 1
 
< 0.1%
2004321007630204434 1
 
< 0.1%
2011321012130200158 1
 
< 0.1%
2000321007630200420 1
 
< 0.1%
2004321007630203671 1
 
< 0.1%
2010321012130200977 1
 
< 0.1%
2009321012130200579 1
 
< 0.1%
2011321012130201599 1
 
< 0.1%
2016321015330200937 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1996321007630200001 1
< 0.1%
1996321007630200004 1
< 0.1%
1996321007630200005 1
< 0.1%
1996321007630200006 1
< 0.1%
1996321007630200009 1
< 0.1%
1996321007630200010 1
< 0.1%
1996321007630200013 1
< 0.1%
1996321007630200018 1
< 0.1%
1996321007630200019 1
< 0.1%
1996321007630200020 1
< 0.1%
ValueCountFrequency (%)
2019321015330200455 1
< 0.1%
2019321015330200454 1
< 0.1%
2019321015330200452 1
< 0.1%
2019321015330200451 1
< 0.1%
2019321015330200445 1
< 0.1%
2019321015330200444 1
< 0.1%
2019321015330200443 1
< 0.1%
2019321015330200440 1
< 0.1%
2019321015330200438 1
< 0.1%
2019321015330200425 1
< 0.1%
Distinct3769
Distinct (%)37.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-07-29 00:00:00
Maximum2019-02-07 00:00:00
2024-04-06T19:47:21.987114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:22.230822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Real number (ℝ)

MISSING 

Distinct6
Distinct (%)100.0%
Missing9994
Missing (%)99.9%
Infinite0
Infinite (%)0.0%
Mean20100631
Minimum20071115
Maximum20130206
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T19:47:22.451529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20071115
5-th percentile20075962
Q120090508
median20100372
Q320110968
95-th percentile20125459
Maximum20130206
Range59091
Interquartile range (IQR)20460.25

Descriptive statistics

Standard deviation20758.233
Coefficient of variation (CV)0.0010327155
Kurtosis-0.34624901
Mean20100631
Median Absolute Deviation (MAD)10359
Skewness0.0051392176
Sum1.2060378 × 108
Variance4.3090424 × 108
MonotonicityNot monotonic
2024-04-06T19:47:22.719581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
20090501 1
 
< 0.1%
20130206 1
 
< 0.1%
20071115 1
 
< 0.1%
20111219 1
 
< 0.1%
20090528 1
 
< 0.1%
20110215 1
 
< 0.1%
(Missing) 9994
99.9%
ValueCountFrequency (%)
20071115 1
< 0.1%
20090501 1
< 0.1%
20090528 1
< 0.1%
20110215 1
< 0.1%
20111219 1
< 0.1%
20130206 1
< 0.1%
ValueCountFrequency (%)
20130206 1
< 0.1%
20111219 1
< 0.1%
20110215 1
< 0.1%
20090528 1
< 0.1%
20090501 1
< 0.1%
20071115 1
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5281 
3
3858 
5
567 
4
 
279
2
 
15

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row1
3rd row1
4th row1
5th row4

Common Values

ValueCountFrequency (%)
1 5281
52.8%
3 3858
38.6%
5 567
 
5.7%
4 279
 
2.8%
2 15
 
0.1%

Length

2024-04-06T19:47:22.942243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:23.152142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5281
52.8%
3 3858
38.6%
5 567
 
5.7%
4 279
 
2.8%
2 15
 
0.1%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
5281 
폐업
3858 
제외/삭제/전출
567 
취소/말소/만료/정지/중지
 
279
휴업
 
15

Length

Max length14
Median length5
Mean length4.2593
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row취소/말소/만료/정지/중지
2nd row영업/정상
3rd row영업/정상
4th row영업/정상
5th row취소/말소/만료/정지/중지

Common Values

ValueCountFrequency (%)
영업/정상 5281
52.8%
폐업 3858
38.6%
제외/삭제/전출 567
 
5.7%
취소/말소/만료/정지/중지 279
 
2.8%
휴업 15
 
0.1%

Length

2024-04-06T19:47:23.389333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:23.598146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 5281
52.8%
폐업 3858
38.6%
제외/삭제/전출 567
 
5.7%
취소/말소/만료/정지/중지 279
 
2.8%
휴업 15
 
0.1%

상세영업상태코드
Real number (ℝ)

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1456
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T19:47:23.807074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile5
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.394057
Coefficient of variation (CV)0.64972829
Kurtosis1.31647
Mean2.1456
Median Absolute Deviation (MAD)0
Skewness1.1616948
Sum21456
Variance1.943395
MonotonicityNot monotonic
2024-04-06T19:47:24.057497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 5278
52.8%
3 3858
38.6%
5 567
 
5.7%
7 201
 
2.0%
4 78
 
0.8%
2 15
 
0.1%
6 2
 
< 0.1%
8 1
 
< 0.1%
ValueCountFrequency (%)
1 5278
52.8%
2 15
 
0.1%
3 3858
38.6%
4 78
 
0.8%
5 567
 
5.7%
6 2
 
< 0.1%
7 201
 
2.0%
8 1
 
< 0.1%
ValueCountFrequency (%)
8 1
 
< 0.1%
7 201
 
2.0%
6 2
 
< 0.1%
5 567
 
5.7%
4 78
 
0.8%
3 3858
38.6%
2 15
 
0.1%
1 5278
52.8%

상세영업상태명
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
5278 
폐업처리
3858 
타시군구이관
567 
직권말소
 
201
직권취소
 
78
Other values (3)
 
18

Length

Max length6
Median length4
Mean length4.1138
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row직권말소
2nd row정상영업
3rd row정상영업
4th row정상영업
5th row직권취소

Common Values

ValueCountFrequency (%)
정상영업 5278
52.8%
폐업처리 3858
38.6%
타시군구이관 567
 
5.7%
직권말소 201
 
2.0%
직권취소 78
 
0.8%
휴업처리 15
 
0.1%
타시군구전입 2
 
< 0.1%
영업재개 1
 
< 0.1%

Length

2024-04-06T19:47:24.349412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:24.608387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 5278
52.8%
폐업처리 3858
38.6%
타시군구이관 567
 
5.7%
직권말소 201
 
2.0%
직권취소 78
 
0.8%
휴업처리 15
 
0.1%
타시군구전입 2
 
< 0.1%
영업재개 1
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2052
Distinct (%)67.1%
Missing6942
Missing (%)69.4%
Memory size156.2 KiB
Minimum2004-12-01 00:00:00
Maximum2024-04-03 00:00:00
2024-04-06T19:47:24.809257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:25.117288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct25
Distinct (%)100.0%
Missing9975
Missing (%)99.8%
Memory size156.2 KiB
Minimum2007-11-15 00:00:00
Maximum2024-02-27 00:00:00
2024-04-06T19:47:25.465286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:25.694644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)

휴업종료일자
Date

MISSING 

Distinct24
Distinct (%)96.0%
Missing9975
Missing (%)99.8%
Memory size156.2 KiB
Minimum2008-10-30 00:00:00
Maximum2099-12-31 00:00:00
2024-04-06T19:47:25.898636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:26.114027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)

재개업일자
Date

MISSING 

Distinct7
Distinct (%)100.0%
Missing9993
Missing (%)99.9%
Memory size156.2 KiB
Minimum2012-07-27 00:00:00
Maximum2023-01-28 00:00:00
2024-04-06T19:47:26.302983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:26.453545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

전화번호
Text

MISSING 

Distinct7843
Distinct (%)96.3%
Missing1857
Missing (%)18.6%
Memory size156.2 KiB
2024-04-06T19:47:27.056010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length11.482132
Min length1

Characters and Unicode

Total characters93499
Distinct characters18
Distinct categories7 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7738 ?
Unique (%)95.0%

Sample

1st row02 2055 1114
2nd row02 535 1535
3rd row02 583 4405
4th row02-556-5611
5th row070-4035-8082
ValueCountFrequency (%)
02 3300
 
22.0%
471
 
3.1%
525 81
 
0.5%
581 67
 
0.4%
523 67
 
0.4%
533 66
 
0.4%
070 63
 
0.4%
521 62
 
0.4%
588 59
 
0.4%
598 59
 
0.4%
Other values (7667) 10681
71.3%
2024-04-06T19:47:27.924708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 13929
14.9%
2 12615
13.5%
10731
11.5%
5 9288
9.9%
7 6894
7.4%
3 6233
6.7%
- 6220
6.7%
4 5967
6.4%
8 5891
6.3%
1 5416
 
5.8%
Other values (8) 10315
11.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 75138
80.4%
Space Separator 10731
 
11.5%
Dash Punctuation 6220
 
6.7%
Other Punctuation 1387
 
1.5%
Math Symbol 17
 
< 0.1%
Close Punctuation 5
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 13929
18.5%
2 12615
16.8%
5 9288
12.4%
7 6894
9.2%
3 6233
8.3%
4 5967
7.9%
8 5891
7.8%
1 5416
 
7.2%
6 4732
 
6.3%
9 4173
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 1377
99.3%
/ 6
 
0.4%
, 4
 
0.3%
Space Separator
ValueCountFrequency (%)
10731
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6220
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 93499
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 13929
14.9%
2 12615
13.5%
10731
11.5%
5 9288
9.9%
7 6894
7.4%
3 6233
6.7%
- 6220
6.7%
4 5967
6.4%
8 5891
6.3%
1 5416
 
5.8%
Other values (8) 10315
11.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 93499
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 13929
14.9%
2 12615
13.5%
10731
11.5%
5 9288
9.9%
7 6894
7.4%
3 6233
6.7%
- 6220
6.7%
4 5967
6.4%
8 5891
6.3%
1 5416
 
5.8%
Other values (8) 10315
11.0%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct140
Distinct (%)5.5%
Missing7445
Missing (%)74.5%
Infinite0
Infinite (%)0.0%
Mean137292.71
Minimum137030
Maximum502858
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T19:47:28.171306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum137030
5-th percentile137030
Q1137060
median137070
Q3137130
95-th percentile137856
Maximum502858
Range365828
Interquartile range (IQR)70

Descriptive statistics

Standard deviation7238.7708
Coefficient of variation (CV)0.052725093
Kurtosis2549.6971
Mean137292.71
Median Absolute Deviation (MAD)10
Skewness50.468423
Sum3.5078287 × 108
Variance52399803
MonotonicityNot monotonic
2024-04-06T19:47:28.412065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
137070 952
 
9.5%
137060 423
 
4.2%
137130 353
 
3.5%
137040 228
 
2.3%
137030 204
 
2.0%
137071 30
 
0.3%
137180 21
 
0.2%
137131 19
 
0.2%
137140 18
 
0.2%
137061 15
 
0.1%
Other values (130) 292
 
2.9%
(Missing) 7445
74.5%
ValueCountFrequency (%)
137030 204
 
2.0%
137040 228
 
2.3%
137041 13
 
0.1%
137060 423
4.2%
137061 15
 
0.1%
137069 1
 
< 0.1%
137070 952
9.5%
137071 30
 
0.3%
137072 1
 
< 0.1%
137073 1
 
< 0.1%
ValueCountFrequency (%)
502858 1
 
< 0.1%
137953 1
 
< 0.1%
137942 1
 
< 0.1%
137940 1
 
< 0.1%
137937 1
 
< 0.1%
137934 2
< 0.1%
137932 3
< 0.1%
137930 4
< 0.1%
137927 1
 
< 0.1%
137921 1
 
< 0.1%

지번주소
Text

MISSING 

Distinct4599
Distinct (%)65.0%
Missing2923
Missing (%)29.2%
Memory size156.2 KiB
2024-04-06T19:47:28.871963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length29.218454
Min length9

Characters and Unicode

Total characters206779
Distinct characters517
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3933 ?
Unique (%)55.6%

Sample

1st row서울특별시 서초구 서초동 ****-*
2nd row서울특별시 서초구 방배동 ***번지 **호 신진빌딩 *층 ***호
3rd row서울특별시 서초구 방배동 ***번지 *호
4th row서울특별시 서초구 방배동 ***-* 금동연립 라동 ***호
5th row서울특별시 서초구 서초동 ****번지 서산빌딩
ValueCountFrequency (%)
서울특별시 7076
16.9%
서초구 7076
16.9%
5868
14.0%
번지 3804
9.1%
3431
8.2%
서초동 2885
6.9%
1371
 
3.3%
방배동 1334
 
3.2%
양재동 1189
 
2.8%
반포동 711
 
1.7%
Other values (2864) 7155
17.1%
2024-04-06T19:47:29.545572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 46135
22.3%
35101
17.0%
17625
 
8.5%
10350
 
5.0%
7644
 
3.7%
7223
 
3.5%
7120
 
3.4%
7106
 
3.4%
7083
 
3.4%
7076
 
3.4%
Other values (507) 54316
26.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120485
58.3%
Other Punctuation 46278
 
22.4%
Space Separator 35101
 
17.0%
Dash Punctuation 3587
 
1.7%
Uppercase Letter 608
 
0.3%
Decimal Number 331
 
0.2%
Open Punctuation 178
 
0.1%
Close Punctuation 178
 
0.1%
Lowercase Letter 20
 
< 0.1%
Math Symbol 9
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17625
14.6%
10350
 
8.6%
7644
 
6.3%
7223
 
6.0%
7120
 
5.9%
7106
 
5.9%
7083
 
5.9%
7076
 
5.9%
6418
 
5.3%
4376
 
3.6%
Other values (450) 38464
31.9%
Uppercase Letter
ValueCountFrequency (%)
B 213
35.0%
A 133
21.9%
S 32
 
5.3%
G 31
 
5.1%
L 25
 
4.1%
T 23
 
3.8%
K 21
 
3.5%
C 20
 
3.3%
E 18
 
3.0%
R 15
 
2.5%
Other values (12) 77
 
12.7%
Decimal Number
ValueCountFrequency (%)
1 82
24.8%
3 50
15.1%
2 41
12.4%
5 27
 
8.2%
6 27
 
8.2%
4 26
 
7.9%
0 24
 
7.3%
7 21
 
6.3%
8 20
 
6.0%
9 13
 
3.9%
Lowercase Letter
ValueCountFrequency (%)
e 6
30.0%
b 3
15.0%
p 2
 
10.0%
w 2
 
10.0%
a 2
 
10.0%
c 1
 
5.0%
d 1
 
5.0%
i 1
 
5.0%
r 1
 
5.0%
o 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
* 46135
99.7%
, 122
 
0.3%
/ 8
 
< 0.1%
. 4
 
< 0.1%
& 3
 
< 0.1%
? 3
 
< 0.1%
@ 3
 
< 0.1%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
35101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3587
100.0%
Open Punctuation
ValueCountFrequency (%)
( 178
100.0%
Close Punctuation
ValueCountFrequency (%)
) 178
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 120485
58.3%
Common 85663
41.4%
Latin 631
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17625
14.6%
10350
 
8.6%
7644
 
6.3%
7223
 
6.0%
7120
 
5.9%
7106
 
5.9%
7083
 
5.9%
7076
 
5.9%
6418
 
5.3%
4376
 
3.6%
Other values (450) 38464
31.9%
Latin
ValueCountFrequency (%)
B 213
33.8%
A 133
21.1%
S 32
 
5.1%
G 31
 
4.9%
L 25
 
4.0%
T 23
 
3.6%
K 21
 
3.3%
C 20
 
3.2%
E 18
 
2.9%
R 15
 
2.4%
Other values (24) 100
15.8%
Common
ValueCountFrequency (%)
* 46135
53.9%
35101
41.0%
- 3587
 
4.2%
( 178
 
0.2%
) 178
 
0.2%
, 122
 
0.1%
1 82
 
0.1%
3 50
 
0.1%
2 41
 
< 0.1%
5 27
 
< 0.1%
Other values (13) 162
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 120484
58.3%
ASCII 86290
41.7%
Number Forms 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 46135
53.5%
35101
40.7%
- 3587
 
4.2%
B 213
 
0.2%
( 178
 
0.2%
) 178
 
0.2%
A 133
 
0.2%
, 122
 
0.1%
1 82
 
0.1%
3 50
 
0.1%
Other values (44) 511
 
0.6%
Hangul
ValueCountFrequency (%)
17625
14.6%
10350
 
8.6%
7644
 
6.3%
7223
 
6.0%
7120
 
5.9%
7106
 
5.9%
7083
 
5.9%
7076
 
5.9%
6418
 
5.3%
4376
 
3.6%
Other values (449) 38463
31.9%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
´ 1
100.0%

도로명주소
Text

MISSING 

Distinct5361
Distinct (%)78.2%
Missing3144
Missing (%)31.4%
Memory size156.2 KiB
2024-04-06T19:47:30.028583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length51
Mean length36.092036
Min length18

Characters and Unicode

Total characters247447
Distinct characters543
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4620 ?
Unique (%)67.4%

Sample

1st row서울특별시 서초구 방배로 **, ***호 (방배동,신진빌딩 *층)
2nd row서울특별시 서초구 청두곶**길 **, ***호 (방배동)
3rd row서울특별시 서초구 사평대로**길 **, ***호 (반포동)
4th row서울특별시 서초구 강남대로 ***, 서산빌딩 *,*층 (서초동)
5th row서울특별시 서초구 강남대로 ***, *층 ***호 (서초동, 송남빌딩)
ValueCountFrequency (%)
7149
15.8%
서울특별시 6856
15.1%
서초구 6832
15.1%
3735
 
8.2%
2158
 
4.8%
서초동 1951
 
4.3%
방배동 1071
 
2.4%
양재동 802
 
1.8%
564
 
1.2%
반포동 541
 
1.2%
Other values (3428) 13701
30.2%
2024-04-06T19:47:30.883982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 42612
17.2%
38533
15.6%
18497
 
7.5%
11198
 
4.5%
, 9737
 
3.9%
8214
 
3.3%
7009
 
2.8%
6911
 
2.8%
6892
 
2.8%
( 6875
 
2.8%
Other values (533) 90969
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 140260
56.7%
Other Punctuation 52374
 
21.2%
Space Separator 38533
 
15.6%
Open Punctuation 6875
 
2.8%
Close Punctuation 6874
 
2.8%
Dash Punctuation 1402
 
0.6%
Uppercase Letter 704
 
0.3%
Decimal Number 366
 
0.1%
Lowercase Letter 38
 
< 0.1%
Math Symbol 20
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18497
 
13.2%
11198
 
8.0%
8214
 
5.9%
7009
 
5.0%
6911
 
4.9%
6892
 
4.9%
6870
 
4.9%
6856
 
4.9%
6608
 
4.7%
4307
 
3.1%
Other values (467) 56898
40.6%
Uppercase Letter
ValueCountFrequency (%)
B 205
29.1%
A 130
18.5%
S 44
 
6.2%
L 38
 
5.4%
E 29
 
4.1%
G 26
 
3.7%
R 23
 
3.3%
K 23
 
3.3%
T 22
 
3.1%
C 19
 
2.7%
Other values (14) 145
20.6%
Lowercase Letter
ValueCountFrequency (%)
e 7
18.4%
t 4
10.5%
i 3
 
7.9%
o 3
 
7.9%
a 3
 
7.9%
d 2
 
5.3%
s 2
 
5.3%
u 2
 
5.3%
l 2
 
5.3%
r 2
 
5.3%
Other values (8) 8
21.1%
Decimal Number
ValueCountFrequency (%)
1 94
25.7%
0 47
12.8%
3 47
12.8%
5 40
10.9%
2 32
 
8.7%
4 30
 
8.2%
6 23
 
6.3%
8 21
 
5.7%
7 18
 
4.9%
9 14
 
3.8%
Other Punctuation
ValueCountFrequency (%)
* 42612
81.4%
, 9737
 
18.6%
. 7
 
< 0.1%
/ 7
 
< 0.1%
& 5
 
< 0.1%
@ 3
 
< 0.1%
? 2
 
< 0.1%
' 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
38533
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6875
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6874
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1402
100.0%
Math Symbol
ValueCountFrequency (%)
~ 20
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 140260
56.7%
Common 106444
43.0%
Latin 743
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18497
 
13.2%
11198
 
8.0%
8214
 
5.9%
7009
 
5.0%
6911
 
4.9%
6892
 
4.9%
6870
 
4.9%
6856
 
4.9%
6608
 
4.7%
4307
 
3.1%
Other values (467) 56898
40.6%
Latin
ValueCountFrequency (%)
B 205
27.6%
A 130
17.5%
S 44
 
5.9%
L 38
 
5.1%
E 29
 
3.9%
G 26
 
3.5%
R 23
 
3.1%
K 23
 
3.1%
T 22
 
3.0%
C 19
 
2.6%
Other values (33) 184
24.8%
Common
ValueCountFrequency (%)
* 42612
40.0%
38533
36.2%
, 9737
 
9.1%
( 6875
 
6.5%
) 6874
 
6.5%
- 1402
 
1.3%
1 94
 
0.1%
0 47
 
< 0.1%
3 47
 
< 0.1%
5 40
 
< 0.1%
Other values (13) 183
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 140260
56.7%
ASCII 107186
43.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 42612
39.8%
38533
35.9%
, 9737
 
9.1%
( 6875
 
6.4%
) 6874
 
6.4%
- 1402
 
1.3%
B 205
 
0.2%
A 130
 
0.1%
1 94
 
0.1%
0 47
 
< 0.1%
Other values (55) 677
 
0.6%
Hangul
ValueCountFrequency (%)
18497
 
13.2%
11198
 
8.0%
8214
 
5.9%
7009
 
5.0%
6911
 
4.9%
6892
 
4.9%
6870
 
4.9%
6856
 
4.9%
6608
 
4.7%
4307
 
3.1%
Other values (467) 56898
40.6%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct509
Distinct (%)11.0%
Missing5360
Missing (%)53.6%
Memory size156.2 KiB
2024-04-06T19:47:31.498591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.4446121
Min length5

Characters and Unicode

Total characters25263
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)1.9%

Sample

1st row06677
2nd row06575
3rd row06612
4th row06730
5th row137140
ValueCountFrequency (%)
137071 278
 
6.0%
137061 131
 
2.8%
137131 111
 
2.4%
06627 97
 
2.1%
137041 85
 
1.8%
137030 63
 
1.4%
06621 61
 
1.3%
137060 46
 
1.0%
06628 44
 
0.9%
06626 44
 
0.9%
Other values (499) 3680
79.3%
2024-04-06T19:47:32.458024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 4618
18.3%
0 4236
16.8%
7 4066
16.1%
1 3453
13.7%
3 2877
11.4%
8 1634
 
6.5%
5 1467
 
5.8%
4 988
 
3.9%
2 985
 
3.9%
9 860
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 25184
99.7%
Dash Punctuation 79
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 4618
18.3%
0 4236
16.8%
7 4066
16.1%
1 3453
13.7%
3 2877
11.4%
8 1634
 
6.5%
5 1467
 
5.8%
4 988
 
3.9%
2 985
 
3.9%
9 860
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 79
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25263
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 4618
18.3%
0 4236
16.8%
7 4066
16.1%
1 3453
13.7%
3 2877
11.4%
8 1634
 
6.5%
5 1467
 
5.8%
4 988
 
3.9%
2 985
 
3.9%
9 860
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25263
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 4618
18.3%
0 4236
16.8%
7 4066
16.1%
1 3453
13.7%
3 2877
11.4%
8 1634
 
6.5%
5 1467
 
5.8%
4 988
 
3.9%
2 985
 
3.9%
9 860
 
3.4%
Distinct9887
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T19:47:33.124917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length36
Mean length8.0047
Min length1

Characters and Unicode

Total characters80047
Distinct characters1023
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9777 ?
Unique (%)97.8%

Sample

1st row(주)네오핑
2nd row(주)아이엘비
3rd row에버그린
4th row샵베이빌론
5th row러블리유
ValueCountFrequency (%)
주식회사 870
 
6.7%
233
 
1.8%
company 25
 
0.2%
korea 24
 
0.2%
co.,ltd 23
 
0.2%
co 20
 
0.2%
inc 18
 
0.1%
컴퍼니 18
 
0.1%
법무법인 17
 
0.1%
17
 
0.1%
Other values (11016) 11704
90.2%
2024-04-06T19:47:34.048035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 4170
 
5.2%
) 4170
 
5.2%
3915
 
4.9%
2982
 
3.7%
2896
 
3.6%
2415
 
3.0%
1339
 
1.7%
1198
 
1.5%
1195
 
1.5%
992
 
1.2%
Other values (1013) 54775
68.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56982
71.2%
Lowercase Letter 6128
 
7.7%
Uppercase Letter 4763
 
6.0%
Open Punctuation 4171
 
5.2%
Close Punctuation 4171
 
5.2%
Space Separator 2982
 
3.7%
Decimal Number 404
 
0.5%
Other Punctuation 374
 
0.5%
Dash Punctuation 60
 
0.1%
Other Symbol 6
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3915
 
6.9%
2896
 
5.1%
2415
 
4.2%
1339
 
2.3%
1198
 
2.1%
1195
 
2.1%
992
 
1.7%
992
 
1.7%
971
 
1.7%
875
 
1.5%
Other values (933) 40194
70.5%
Lowercase Letter
ValueCountFrequency (%)
e 726
11.8%
o 629
 
10.3%
a 561
 
9.2%
n 507
 
8.3%
i 475
 
7.8%
r 387
 
6.3%
t 360
 
5.9%
l 337
 
5.5%
s 295
 
4.8%
m 221
 
3.6%
Other values (16) 1630
26.6%
Uppercase Letter
ValueCountFrequency (%)
A 361
 
7.6%
E 345
 
7.2%
O 324
 
6.8%
S 318
 
6.7%
C 313
 
6.6%
I 280
 
5.9%
L 271
 
5.7%
N 265
 
5.6%
T 263
 
5.5%
M 222
 
4.7%
Other values (16) 1801
37.8%
Decimal Number
ValueCountFrequency (%)
1 110
27.2%
2 83
20.5%
0 47
11.6%
3 34
 
8.4%
4 30
 
7.4%
9 25
 
6.2%
5 25
 
6.2%
8 23
 
5.7%
7 14
 
3.5%
6 13
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 198
52.9%
& 97
25.9%
, 47
 
12.6%
' 20
 
5.3%
/ 4
 
1.1%
: 4
 
1.1%
! 2
 
0.5%
? 2
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 4170
> 99.9%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 4170
> 99.9%
] 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
2982
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56979
71.2%
Common 12168
 
15.2%
Latin 10891
 
13.6%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3915
 
6.9%
2896
 
5.1%
2415
 
4.2%
1339
 
2.3%
1198
 
2.1%
1195
 
2.1%
992
 
1.7%
992
 
1.7%
971
 
1.7%
875
 
1.5%
Other values (925) 40191
70.5%
Latin
ValueCountFrequency (%)
e 726
 
6.7%
o 629
 
5.8%
a 561
 
5.2%
n 507
 
4.7%
i 475
 
4.4%
r 387
 
3.6%
A 361
 
3.3%
t 360
 
3.3%
E 345
 
3.2%
l 337
 
3.1%
Other values (42) 6203
57.0%
Common
ValueCountFrequency (%)
( 4170
34.3%
) 4170
34.3%
2982
24.5%
. 198
 
1.6%
1 110
 
0.9%
& 97
 
0.8%
2 83
 
0.7%
- 60
 
0.5%
, 47
 
0.4%
0 47
 
0.4%
Other values (17) 204
 
1.7%
Han
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56973
71.2%
ASCII 23059
28.8%
CJK 9
 
< 0.1%
None 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 4170
18.1%
) 4170
18.1%
2982
 
12.9%
e 726
 
3.1%
o 629
 
2.7%
a 561
 
2.4%
n 507
 
2.2%
i 475
 
2.1%
r 387
 
1.7%
A 361
 
1.6%
Other values (69) 8091
35.1%
Hangul
ValueCountFrequency (%)
3915
 
6.9%
2896
 
5.1%
2415
 
4.2%
1339
 
2.4%
1198
 
2.1%
1195
 
2.1%
992
 
1.7%
992
 
1.7%
971
 
1.7%
875
 
1.5%
Other values (924) 40185
70.5%
None
ValueCountFrequency (%)
6
100.0%
CJK
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Distinct7710
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-16 09:42:39
Maximum2024-04-03 17:14:14
2024-04-06T19:47:34.423158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:34.680627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
8811 
U
1189 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 8811
88.1%
U 1189
 
11.9%

Length

2024-04-06T19:47:34.965224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:35.160641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 8811
88.1%
u 1189
 
11.9%
Distinct759
Distinct (%)7.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:05:00
2024-04-06T19:47:35.349419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T19:47:35.641993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct432
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T19:47:35.972571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length6.8191
Min length1

Characters and Unicode

Total characters68191
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique272 ?
Unique (%)2.7%

Sample

1st row기타
2nd row의류/패션/잡화/뷰티 기타
3rd row의류/패션/잡화/뷰티
4th row종합몰 가구/수납용품
5th row-
ValueCountFrequency (%)
3477
26.6%
의류/패션/잡화/뷰티 2470
18.9%
기타 2125
16.2%
종합몰 1276
 
9.7%
건강/식품 828
 
6.3%
교육/도서/완구/오락 776
 
5.9%
컴퓨터/사무용품 529
 
4.0%
가전 433
 
3.3%
레져/여행/공연 388
 
3.0%
가구/수납용품 345
 
2.6%
Other values (3) 447
 
3.4%
2024-04-06T19:47:37.135382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 12525
 
18.4%
- 3477
 
5.1%
3094
 
4.5%
2470
 
3.6%
2470
 
3.6%
2470
 
3.6%
2470
 
3.6%
2470
 
3.6%
2470
 
3.6%
2470
 
3.6%
Other values (41) 31805
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49095
72.0%
Other Punctuation 12525
 
18.4%
Dash Punctuation 3477
 
5.1%
Space Separator 3094
 
4.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2149
 
4.4%
2125
 
4.3%
Other values (38) 25061
51.0%
Other Punctuation
ValueCountFrequency (%)
/ 12525
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3477
100.0%
Space Separator
ValueCountFrequency (%)
3094
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49095
72.0%
Common 19096
 
28.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2149
 
4.4%
2125
 
4.3%
Other values (38) 25061
51.0%
Common
ValueCountFrequency (%)
/ 12525
65.6%
- 3477
 
18.2%
3094
 
16.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49095
72.0%
ASCII 19096
 
28.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 12525
65.6%
- 3477
 
18.2%
3094
 
16.2%
Hangul
ValueCountFrequency (%)
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2470
 
5.0%
2149
 
4.4%
2125
 
4.3%
Other values (38) 25061
51.0%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct3590
Distinct (%)51.6%
Missing3043
Missing (%)30.4%
Infinite0
Infinite (%)0.0%
Mean201427.37
Minimum186938.68
Maximum207931.21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T19:47:37.436788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum186938.68
5-th percentile198641.72
Q1200358.95
median201501.67
Q3202501.6
95-th percentile203872.86
Maximum207931.21
Range20992.527
Interquartile range (IQR)2142.6532

Descriptive statistics

Standard deviation1647.4425
Coefficient of variation (CV)0.0081788415
Kurtosis0.68580205
Mean201427.37
Median Absolute Deviation (MAD)1022.4233
Skewness-0.039253774
Sum1.4013302 × 109
Variance2714066.8
MonotonicityNot monotonic
2024-04-06T19:47:37.684178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201501.672660443 116
 
1.2%
202638.948768564 73
 
0.7%
202501.602561199 69
 
0.7%
202654.181045911 54
 
0.5%
201213.64069933 37
 
0.4%
202623.297219041 35
 
0.4%
202459.257859185 35
 
0.4%
203392.793460583 34
 
0.3%
200554.74395758 32
 
0.3%
202524.096001449 31
 
0.3%
Other values (3580) 6441
64.4%
(Missing) 3043
30.4%
ValueCountFrequency (%)
186938.682994 1
 
< 0.1%
198345.279900118 1
 
< 0.1%
198351.955 2
< 0.1%
198353.824006002 1
 
< 0.1%
198356.006840205 1
 
< 0.1%
198357.319178572 2
< 0.1%
198358.525854925 1
 
< 0.1%
198359.032400861 1
 
< 0.1%
198361.94579798 2
< 0.1%
198362.476944652 4
< 0.1%
ValueCountFrequency (%)
207931.209977276 1
< 0.1%
207861.845236575 1
< 0.1%
207801.233093604 1
< 0.1%
207767.186825091 1
< 0.1%
207731.30060147 2
< 0.1%
207684.687091186 1
< 0.1%
207576.384821489 1
< 0.1%
207505.445913882 1
< 0.1%
207422.577439638 1
< 0.1%
207409.528119388 1
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct3590
Distinct (%)51.6%
Missing3043
Missing (%)30.4%
Infinite0
Infinite (%)0.0%
Mean442973.49
Minimum180707.7
Maximum446565.03
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T19:47:37.931863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum180707.7
5-th percentile440676.38
Q1442199.27
median442911.3
Q3443820.81
95-th percentile445598.27
Maximum446565.03
Range265857.33
Interquartile range (IQR)1621.5378

Descriptive statistics

Standard deviation3464.3757
Coefficient of variation (CV)0.0078207292
Kurtosis4722.9705
Mean442973.49
Median Absolute Deviation (MAD)802.87993
Skewness-62.390533
Sum3.0817666 × 109
Variance12001899
MonotonicityNot monotonic
2024-04-06T19:47:38.187459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
442505.573142545 116
 
1.2%
443109.509930091 73
 
0.7%
443226.434038556 69
 
0.7%
442602.079075345 54
 
0.5%
442739.647017252 37
 
0.4%
443154.964346798 35
 
0.4%
443620.263219246 35
 
0.4%
440676.379919661 34
 
0.3%
444811.364826199 32
 
0.3%
443449.99298641 31
 
0.3%
Other values (3580) 6441
64.4%
(Missing) 3043
30.4%
ValueCountFrequency (%)
180707.703056 1
< 0.1%
436741.854568773 1
< 0.1%
436967.525291467 1
< 0.1%
436987.528628409 1
< 0.1%
437000.170737922 1
< 0.1%
437087.666487114 1
< 0.1%
437093.686819253 1
< 0.1%
437119.595971567 1
< 0.1%
437768.498681368 1
< 0.1%
437864.710924867 1
< 0.1%
ValueCountFrequency (%)
446565.028672992 3
 
< 0.1%
446439.134713956 1
 
< 0.1%
446421.477235361 1
 
< 0.1%
446331.380046658 4
 
< 0.1%
446312.298040264 4
 
< 0.1%
446311.188098686 1
 
< 0.1%
446206.354902748 14
0.1%
446205.621646883 1
 
< 0.1%
446202.471850496 4
 
< 0.1%
446183.502793135 19
0.2%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9824 
0
 
176

Length

Max length4
Median length4
Mean length3.9472
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9824
98.2%
0 176
 
1.8%

Length

2024-04-06T19:47:38.418465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:38.571129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9824
98.2%
0 176
 
1.8%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9824 
0
 
176

Length

Max length4
Median length4
Mean length3.9472
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9824
98.2%
0 176
 
1.8%

Length

2024-04-06T19:47:38.761399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:38.955766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9824
98.2%
0 176
 
1.8%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9824 
0
 
176

Length

Max length4
Median length4
Mean length3.9472
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9824
98.2%
0 176
 
1.8%

Length

2024-04-06T19:47:39.160992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T19:47:39.343441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9824
98.2%
0 176
 
1.8%

판매방식명
Categorical

IMBALANCE 

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5384 
인터넷
4053 
기타
 
154
인터넷, 기타
 
120
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타
 
48
Other values (21)
 
241

Length

Max length26
Median length4
Mean length3.9241
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row기타
2nd row<NA>
3rd rowTV홈쇼핑
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5384
53.8%
인터넷 4053
40.5%
기타 154
 
1.5%
인터넷, 기타 120
 
1.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 48
 
0.5%
TV홈쇼핑, 인터넷 45
 
0.4%
인터넷, 카다로그 33
 
0.3%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 32
 
0.3%
인터넷, 카다로그, 기타 21
 
0.2%
인터넷, 카다로그, 신문잡지, 기타 17
 
0.2%
Other values (16) 93
 
0.9%

Length

2024-04-06T19:47:39.532062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5384
50.3%
인터넷 4430
41.4%
기타 385
 
3.6%
카다로그 194
 
1.8%
tv홈쇼핑 175
 
1.6%
신문잡지 143
 
1.3%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
31113210000200332100763020297520030709<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 2055 1114<NA><NA>서울특별시 서초구 서초동 ****-*<NA><NA>(주)네오핑2021-11-25 13:56:28I2021-11-27 00:22:44.0기타<NA><NA>000기타
74363210000200632100763020737420060125<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 535 1535<NA>137060서울특별시 서초구 방배동 ***번지 **호 신진빌딩 *층 ***호서울특별시 서초구 방배로 **, ***호 (방배동,신진빌딩 *층)<NA>(주)아이엘비2008-01-11 09:40:02I2021-12-03 22:02:00.0의류/패션/잡화/뷰티 기타199637.538673442248.511632<NA><NA><NA><NA>
281523210000201832101533020021920180125<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 서초구 방배동 ***번지 *호서울특별시 서초구 청두곶**길 **, ***호 (방배동)06677에버그린2018-05-14 11:15:05I2018-08-31 23:59:59.0의류/패션/잡화/뷰티198582.759345442150.525229<NA><NA><NA>TV홈쇼핑
251713210000201632101533020133620160729<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 서초구 사평대로**길 **, ***호 (반포동)06575샵베이빌론2016-07-29 10:45:31I2021-12-03 22:02:00.0종합몰 가구/수납용품199438.694473443948.824511<NA><NA><NA><NA>
67063210000200532100763020662820050914<NA>4취소/말소/만료/정지/중지4직권취소<NA><NA><NA><NA>02 583 4405<NA><NA>서울특별시 서초구 방배동 ***-* 금동연립 라동 ***호<NA><NA>러블리유2009-01-12 17:17:18I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
301953210000201832101533020261520181213<NA>5제외/삭제/전출5타시군구이관20220829<NA><NA><NA>02-556-5611<NA><NA>서울특별시 서초구 서초동 ****번지 서산빌딩서울특별시 서초구 강남대로 ***, 서산빌딩 *,*층 (서초동)06612더조은컴퓨터아카데미2022-08-29 16:26:38U2021-12-07 21:01:00.0교육/도서/완구/오락202156.76444388.655<NA><NA><NA><NA>
26993321000020173210153302012732014-08-05<NA>3폐업3폐업처리2023-01-21<NA><NA><NA>070-4035-8082<NA><NA><NA>서울특별시 서초구 강남대로 ***, *층 ***호 (서초동, 송남빌딩)06730주식회사 와이즈원2023-01-24 10:53:49U2022-12-04 22:06:00.0기타202758.71151442815.347406<NA><NA><NA><NA>
196393210000201332101213020044420130325<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-577-4510<NA><NA><NA>서울특별시 서초구 태봉로*길 *, ***호 (우면동, 네이쳐힐아파트***호***동)137140정보통신번역사 제이피북, 킴, 일본서적2013-03-25 10:48:42I2018-08-31 23:59:59.0기타202288.798438440395.714734<NA><NA><NA>인터넷
70863210000200532100763020701520051123<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 579 4265<NA><NA>서울특별시 서초구 서초동 ****-** 신명스카이뷰멤버스 ***호<NA><NA>제이커스텀2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
225683210000201532101533020030920150213<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-535-0761<NA><NA><NA>서울특별시 서초구 방배로**길 **, *층 (방배동)137834주식회사 더에이치알2016-07-22 14:27:36I2018-08-31 23:59:59.0기타199157.8807443067.540295<NA><NA><NA>인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
137443210000200932101213020140420091012<NA>1영업/정상1정상영업<NA><NA><NA><NA>555-5911<NA><NA>서울특별시 서초구 서초동 ****-* 대호빌딩서울특별시 서초구 효령로 ***, 대호빌딩 *층 ***호 (서초동)06728파코국제특허법률사무소2021-10-13 12:49:46U2021-10-15 02:40:00.0기타202490.750689442918.903668000인터넷
34633210000200332100763020333220031013<NA>3폐업3폐업처리<NA><NA><NA><NA>02 588 2607<NA><NA>서울특별시 서초구 서초동 ****-* 동아빌라트 *타운 비* ***호<NA><NA>에네셀글로벌(주)2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
180193210000201232101213020042020120308<NA>3폐업3폐업처리20160205<NA><NA><NA><NA><NA><NA><NA>서울특별시 서초구 명달로 **, ***호 (방배동,서초타운)137850손종수2016-02-05 10:05:33I2018-08-31 23:59:59.0종합몰200447.074441896.6045<NA><NA><NA>인터넷
51803210000200432100763020507620041116<NA>3폐업3폐업처리<NA><NA><NA><NA>02 529 2167<NA><NA>서울특별시 서초구 양재동 ***-* 천혜빌딩 ***호<NA><NA>투투2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
24273210000200332100763020228420030211<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 589 0432<NA><NA>서울특별시 서초구 양재동 ***-* 삼호물산 A-****<NA><NA>(주)엠피텍시스템2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
101873210000200732101213021016320070702<NA>1영업/정상1정상영업<NA><NA><NA><NA>0220595852<NA>137130서울특별시 서초구 양재동 ***번지 서울오트갤러리금관 ***호서울특별시 서초구 양재대로**길 **, ***호 (양재동,서울오트갤러리금관)<NA>(주)파인킴트레이딩2008-06-19 16:26:43I2018-08-31 23:59:59.0기타203059.646701439960.533114<NA><NA><NA>인터넷
17768321000020123210121302001412012-01-18<NA>1영업/정상1정상영업<NA><NA><NA><NA>025714749<NA><NA>서울특별시 서초구 양재동 *-** 자강빌딩서울특별시 서초구 남부순환로***길 **, 자강빌딩 (양재동)06739(주)알레마나2024-01-18 14:16:27U2023-11-30 22:00:00.0의류/패션/잡화/뷰티203469.989134442487.967126<NA><NA><NA><NA>
296733210000201832101533020200820180920<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 서초구 서초동 ****번지 서초*차현대아파트서울특별시 서초구 사임당로**길 **, 서초*차현대아파트 ***동 *층 ***호 (서초동, 서초*차현대아파트)06636아우어(our)2018-09-20 12:49:44U2018-09-20 23:59:59.0의류/패션/잡화/뷰티201814.718133443373.676915<NA><NA><NA>인터넷
199393210000201332101213020081320120406<NA>5제외/삭제/전출5타시군구이관20150817<NA><NA><NA>02-566-1144<NA><NA><NA>서울특별시 서초구 바우뫼로**길 **-*, *층 (양재동, 미래빌딩)137888(주)휴레크2015-08-17 10:04:33I2021-12-03 22:02:00.0기타 교육/도서/완구/오락203378.224922442042.330103<NA><NA><NA><NA>
109653210000200832101213020015220080125<NA>1영업/정상1정상영업<NA><NA><NA><NA>0262078868<NA>137030서울특별시 서초구 잠원동 **번지 *호 신반포*차아파트 ***동 ****호서울특별시 서초구 신반포로 ***, ***동 ****호 (잠원동,신반포*차아파트)<NA>곰베2008-11-03 20:00:55I2018-08-31 23:59:59.0의류/패션/잡화/뷰티200071.558447444999.898059<NA><NA><NA>인터넷