Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells81239
Missing cells (%)28.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory250.0 B

Variable types

Categorical9
Numeric5
Text7
DateTime7
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author강동구
URLhttps://data.seoul.go.kr/dataList/OA-18826/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
판매방식명 is highly imbalanced (69.6%)Imbalance
인허가취소일자 has 9998 (> 99.9%) missing valuesMissing
폐업일자 has 7155 (71.5%) missing valuesMissing
휴업시작일자 has 9948 (99.5%) missing valuesMissing
휴업종료일자 has 9948 (99.5%) missing valuesMissing
재개업일자 has 9988 (99.9%) missing valuesMissing
전화번호 has 7240 (72.4%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8423 (84.2%) missing valuesMissing
지번주소 has 1631 (16.3%) missing valuesMissing
도로명주소 has 2421 (24.2%) missing valuesMissing
도로명우편번호 has 2421 (24.2%) missing valuesMissing
좌표정보(X) has 1033 (10.3%) missing valuesMissing
좌표정보(Y) has 1033 (10.3%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = 39.44139088)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 05:22:49.805693
Analysis finished2024-05-11 05:22:52.927125
Duration3.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3240000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3240000
2nd row3240000
3rd row3240000
4th row3240000
5th row3240000

Common Values

ValueCountFrequency (%)
3240000 10000
100.0%

Length

2024-05-11T14:22:53.034408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:22:53.166512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3240000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0170748 × 1018
Minimum2.000324 × 1018
Maximum2.024324 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T14:22:53.330904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.000324 × 1018
5-th percentile2.006324 × 1018
Q12.012324 × 1018
median2.019324 × 1018
Q32.022324 × 1018
95-th percentile2.023324 × 1018
Maximum2.024324 × 1018
Range2.4000015 × 1016
Interquartile range (IQR)1.0000011 × 1016

Descriptive statistics

Standard deviation5.8290785 × 1015
Coefficient of variation (CV)0.0028898673
Kurtosis-0.67722051
Mean2.0170748 × 1018
Median Absolute Deviation (MAD)4.0000047 × 1015
Skewness-0.71335506
Sum8.4569422 × 1018
Variance3.3978156 × 1031
MonotonicityNot monotonic
2024-05-11T14:22:53.549824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2011324013930200059 1
 
< 0.1%
2022324028130200258 1
 
< 0.1%
2017324018930200227 1
 
< 0.1%
2023324028930202569 1
 
< 0.1%
2005324013930201519 1
 
< 0.1%
2012324017530200811 1
 
< 0.1%
2020324023630201098 1
 
< 0.1%
2019324023630201069 1
 
< 0.1%
2019324023630201377 1
 
< 0.1%
2008324013930200386 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2000324013930200136 1
< 0.1%
2001324013930200087 1
< 0.1%
2001324013930200100 1
< 0.1%
2001324013930200102 1
< 0.1%
2001324013930200108 1
< 0.1%
2001324013930200109 1
< 0.1%
2001324013930200116 1
< 0.1%
2001324013930200134 1
< 0.1%
2001324013930200138 1
< 0.1%
2002324013930200001 1
< 0.1%
ValueCountFrequency (%)
2024324028930200970 1
< 0.1%
2024324028930200969 1
< 0.1%
2024324028930200968 1
< 0.1%
2024324028930200965 1
< 0.1%
2024324028930200962 1
< 0.1%
2024324028930200958 1
< 0.1%
2024324028930200957 1
< 0.1%
2024324028930200952 1
< 0.1%
2024324028930200950 1
< 0.1%
2024324028930200945 1
< 0.1%
Distinct4449
Distinct (%)44.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T14:22:54.019406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length8.4462
Min length8

Characters and Unicode

Total characters84462
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2051 ?
Unique (%)20.5%

Sample

1st row2011-01-19
2nd row2023-07-24
3rd row20200402
4th row20210204
5th row20200513
ValueCountFrequency (%)
2024-03-19 16
 
0.2%
20200818 13
 
0.1%
20210105 13
 
0.1%
20210111 12
 
0.1%
20210406 12
 
0.1%
2023-08-23 12
 
0.1%
20210319 12
 
0.1%
2024-01-05 12
 
0.1%
20211123 11
 
0.1%
20220411 11
 
0.1%
Other values (4439) 9876
98.8%
2024-05-11T14:22:54.751118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 25572
30.3%
2 21273
25.2%
1 13921
16.5%
- 4462
 
5.3%
3 3952
 
4.7%
4 2802
 
3.3%
7 2622
 
3.1%
8 2567
 
3.0%
9 2541
 
3.0%
6 2463
 
2.9%
Other values (2) 2287
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79999
94.7%
Dash Punctuation 4462
 
5.3%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 25572
32.0%
2 21273
26.6%
1 13921
17.4%
3 3952
 
4.9%
4 2802
 
3.5%
7 2622
 
3.3%
8 2567
 
3.2%
9 2541
 
3.2%
6 2463
 
3.1%
5 2286
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 4462
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 84462
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 25572
30.3%
2 21273
25.2%
1 13921
16.5%
- 4462
 
5.3%
3 3952
 
4.7%
4 2802
 
3.3%
7 2622
 
3.1%
8 2567
 
3.0%
9 2541
 
3.0%
6 2463
 
2.9%
Other values (2) 2287
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 84462
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 25572
30.3%
2 21273
25.2%
1 13921
16.5%
- 4462
 
5.3%
3 3952
 
4.7%
4 2802
 
3.3%
7 2622
 
3.1%
8 2567
 
3.0%
9 2541
 
3.0%
6 2463
 
2.9%
Other values (2) 2287
 
2.7%

인허가취소일자
Date

MISSING 

Distinct2
Distinct (%)100.0%
Missing9998
Missing (%)> 99.9%
Memory size156.2 KiB
Minimum2020-03-06 00:00:00
Maximum2023-03-09 00:00:00
2024-05-11T14:22:54.971425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:22:55.155398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
6501 
3
2536 
4
 
611
5
 
309
2
 
43

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 6501
65.0%
3 2536
 
25.4%
4 611
 
6.1%
5 309
 
3.1%
2 43
 
0.4%

Length

2024-05-11T14:22:55.400157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:22:55.578679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6501
65.0%
3 2536
 
25.4%
4 611
 
6.1%
5 309
 
3.1%
2 43
 
0.4%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
6501 
폐업
2536 
취소/말소/만료/정지/중지
 
611
제외/삭제/전출
 
309
휴업
 
43

Length

Max length14
Median length5
Mean length4.8689
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row영업/정상
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 6501
65.0%
폐업 2536
 
25.4%
취소/말소/만료/정지/중지 611
 
6.1%
제외/삭제/전출 309
 
3.1%
휴업 43
 
0.4%

Length

2024-05-11T14:22:55.790848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:22:55.964858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 6501
65.0%
폐업 2536
 
25.4%
취소/말소/만료/정지/중지 611
 
6.1%
제외/삭제/전출 309
 
3.1%
휴업 43
 
0.4%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0011
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T14:22:56.112730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.6448007
Coefficient of variation (CV)0.82194828
Kurtosis2.6645975
Mean2.0011
Median Absolute Deviation (MAD)0
Skewness1.808036
Sum20011
Variance2.7053693
MonotonicityNot monotonic
2024-05-11T14:22:56.293649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 6501
65.0%
3 2536
 
25.4%
7 609
 
6.1%
5 309
 
3.1%
2 43
 
0.4%
4 2
 
< 0.1%
ValueCountFrequency (%)
1 6501
65.0%
2 43
 
0.4%
3 2536
 
25.4%
4 2
 
< 0.1%
5 309
 
3.1%
7 609
 
6.1%
ValueCountFrequency (%)
7 609
 
6.1%
5 309
 
3.1%
4 2
 
< 0.1%
3 2536
 
25.4%
2 43
 
0.4%
1 6501
65.0%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
6501 
폐업처리
2536 
직권말소
 
609
타시군구이관
 
309
휴업처리
 
43

Length

Max length6
Median length4
Mean length4.0618
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 6501
65.0%
폐업처리 2536
 
25.4%
직권말소 609
 
6.1%
타시군구이관 309
 
3.1%
휴업처리 43
 
0.4%
직권취소 2
 
< 0.1%

Length

2024-05-11T14:22:56.495798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:22:56.682766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 6501
65.0%
폐업처리 2536
 
25.4%
직권말소 609
 
6.1%
타시군구이관 309
 
3.1%
휴업처리 43
 
0.4%
직권취소 2
 
< 0.1%

폐업일자
Date

MISSING 

Distinct1736
Distinct (%)61.0%
Missing7155
Missing (%)71.5%
Memory size156.2 KiB
Minimum2006-06-30 00:00:00
Maximum2024-05-09 00:00:00
2024-05-11T14:22:56.909953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:22:57.151840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct52
Distinct (%)100.0%
Missing9948
Missing (%)99.5%
Memory size156.2 KiB
Minimum2009-03-16 00:00:00
Maximum2024-04-02 00:00:00
2024-05-11T14:22:57.413534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:22:57.603479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업종료일자
Date

MISSING 

Distinct50
Distinct (%)96.2%
Missing9948
Missing (%)99.5%
Memory size156.2 KiB
Minimum2009-07-31 00:00:00
Maximum2099-12-31 00:00:00
2024-05-11T14:22:57.810831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:22:58.016268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

재개업일자
Date

MISSING 

Distinct12
Distinct (%)100.0%
Missing9988
Missing (%)99.9%
Memory size156.2 KiB
Minimum2009-08-18 00:00:00
Maximum2024-02-27 00:00:00
2024-05-11T14:22:58.177251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:22:58.319740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)

전화번호
Text

MISSING 

Distinct2684
Distinct (%)97.2%
Missing7240
Missing (%)72.4%
Memory size156.2 KiB
2024-05-11T14:22:58.598771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length10.499638
Min length1

Characters and Unicode

Total characters28979
Distinct characters17
Distinct categories7 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2629 ?
Unique (%)95.3%

Sample

1st row442-7545
2nd row0
3rd row070-7630-4044
4th row441-5092
5th row070-4589-5746
ValueCountFrequency (%)
02 141
 
4.6%
0 17
 
0.6%
14
 
0.5%
470 11
 
0.4%
477 10
 
0.3%
475 8
 
0.3%
478 7
 
0.2%
474 7
 
0.2%
485 7
 
0.2%
483 7
 
0.2%
Other values (2722) 2831
92.5%
2024-05-11T14:22:59.190158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4249
14.7%
- 4192
14.5%
4 3275
11.3%
2 3272
11.3%
7 2956
10.2%
8 2360
8.1%
1 1803
6.2%
5 1780
6.1%
6 1705
5.9%
3 1655
 
5.7%
Other values (7) 1732
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24337
84.0%
Dash Punctuation 4192
 
14.5%
Space Separator 432
 
1.5%
Other Punctuation 10
 
< 0.1%
Close Punctuation 4
 
< 0.1%
Math Symbol 3
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 4249
17.5%
4 3275
13.5%
2 3272
13.4%
7 2956
12.1%
8 2360
9.7%
1 1803
7.4%
5 1780
7.3%
6 1705
7.0%
3 1655
 
6.8%
9 1282
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 8
80.0%
/ 2
 
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 4192
100.0%
Space Separator
ValueCountFrequency (%)
432
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 28979
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 4249
14.7%
- 4192
14.5%
4 3275
11.3%
2 3272
11.3%
7 2956
10.2%
8 2360
8.1%
1 1803
6.2%
5 1780
6.1%
6 1705
5.9%
3 1655
 
5.7%
Other values (7) 1732
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28979
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4249
14.7%
- 4192
14.5%
4 3275
11.3%
2 3272
11.3%
7 2956
10.2%
8 2360
8.1%
1 1803
6.2%
5 1780
6.1%
6 1705
5.9%
3 1655
 
5.7%
Other values (7) 1732
6.0%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct152
Distinct (%)9.6%
Missing8423
Missing (%)84.2%
Infinite0
Infinite (%)0.0%
Mean134616.56
Minimum121885
Maximum472848
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T14:22:59.408907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum121885
5-th percentile134010
Q1134030
median134080
Q3134838
95-th percentile134868
Maximum472848
Range350963
Interquartile range (IQR)808

Descriptive statistics

Standard deviation8541.9297
Coefficient of variation (CV)0.063453779
Kurtosis1562.7722
Mean134616.56
Median Absolute Deviation (MAD)70
Skewness39.441391
Sum2.1229032 × 108
Variance72964563
MonotonicityNot monotonic
2024-05-11T14:22:59.617878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
134020 174
 
1.7%
134030 173
 
1.7%
134010 106
 
1.1%
134050 80
 
0.8%
134070 67
 
0.7%
134090 40
 
0.4%
134060 30
 
0.3%
134868 29
 
0.3%
134021 27
 
0.3%
134864 25
 
0.2%
Other values (142) 826
 
8.3%
(Missing) 8423
84.2%
ValueCountFrequency (%)
121885 1
 
< 0.1%
134010 106
1.1%
134011 11
 
0.1%
134012 1
 
< 0.1%
134020 174
1.7%
134021 27
 
0.3%
134022 4
 
< 0.1%
134023 11
 
0.1%
134024 3
 
< 0.1%
134030 173
1.7%
ValueCountFrequency (%)
472848 1
 
< 0.1%
143130 1
 
< 0.1%
138160 1
 
< 0.1%
138040 2
 
< 0.1%
134890 13
0.1%
134888 1
 
< 0.1%
134884 12
0.1%
134883 1
 
< 0.1%
134882 4
 
< 0.1%
134880 1
 
< 0.1%

지번주소
Text

MISSING 

Distinct3279
Distinct (%)39.2%
Missing1631
Missing (%)16.3%
Memory size156.2 KiB
2024-05-11T14:22:59.956199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length45
Mean length25.650137
Min length11

Characters and Unicode

Total characters214666
Distinct characters497
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2457 ?
Unique (%)29.4%

Sample

1st row서울특별시 강동구 길동 ***-* 광남벨라스아파트
2nd row서울특별시 강동구 암사동 ***-**
3rd row서울특별시 강동구 명일동 ***번지 래미안 솔베뉴
4th row서울특별시 강동구 천호동 ***-**
5th row서울특별시 강동구 상일동 ***번지 고덕아르테온아파트
ValueCountFrequency (%)
강동구 8360
18.8%
서울특별시 7638
17.2%
4954
11.2%
3691
 
8.3%
번지 3259
 
7.3%
천호동 1863
 
4.2%
성내동 1683
 
3.8%
암사동 1038
 
2.3%
길동 980
 
2.2%
명일동 859
 
1.9%
Other values (1958) 10055
22.7%
2024-05-11T14:23:00.522873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 40896
19.1%
37308
17.4%
17799
 
8.3%
9330
 
4.3%
8392
 
3.9%
7824
 
3.6%
7679
 
3.6%
7658
 
3.6%
7638
 
3.6%
7638
 
3.6%
Other values (487) 62504
29.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 130926
61.0%
Other Punctuation 41055
 
19.1%
Space Separator 37308
 
17.4%
Dash Punctuation 3863
 
1.8%
Decimal Number 532
 
0.2%
Uppercase Letter 464
 
0.2%
Close Punctuation 189
 
0.1%
Open Punctuation 188
 
0.1%
Lowercase Letter 131
 
0.1%
Letter Number 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17799
 
13.6%
9330
 
7.1%
8392
 
6.4%
7824
 
6.0%
7679
 
5.9%
7658
 
5.8%
7638
 
5.8%
7638
 
5.8%
5945
 
4.5%
3858
 
2.9%
Other values (422) 47165
36.0%
Uppercase Letter
ValueCountFrequency (%)
B 101
21.8%
A 71
15.3%
D 43
9.3%
S 32
 
6.9%
K 31
 
6.7%
G 30
 
6.5%
R 30
 
6.5%
I 27
 
5.8%
P 22
 
4.7%
M 14
 
3.0%
Other values (13) 63
13.6%
Lowercase Letter
ValueCountFrequency (%)
e 33
25.2%
i 20
15.3%
l 10
 
7.6%
r 10
 
7.6%
s 8
 
6.1%
w 8
 
6.1%
v 8
 
6.1%
k 8
 
6.1%
b 7
 
5.3%
c 4
 
3.1%
Other values (8) 15
11.5%
Decimal Number
ValueCountFrequency (%)
4 119
22.4%
3 80
15.0%
1 76
14.3%
5 70
13.2%
6 42
 
7.9%
2 36
 
6.8%
9 33
 
6.2%
7 28
 
5.3%
0 27
 
5.1%
8 21
 
3.9%
Other Punctuation
ValueCountFrequency (%)
* 40896
99.6%
. 60
 
0.1%
/ 40
 
0.1%
, 33
 
0.1%
@ 21
 
0.1%
& 4
 
< 0.1%
? 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
37308
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3863
100.0%
Close Punctuation
ValueCountFrequency (%)
) 189
100.0%
Open Punctuation
ValueCountFrequency (%)
( 188
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 130915
61.0%
Common 83137
38.7%
Latin 603
 
0.3%
Han 11
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17799
 
13.6%
9330
 
7.1%
8392
 
6.4%
7824
 
6.0%
7679
 
5.9%
7658
 
5.8%
7638
 
5.8%
7638
 
5.8%
5945
 
4.5%
3858
 
2.9%
Other values (421) 47154
36.0%
Latin
ValueCountFrequency (%)
B 101
16.7%
A 71
 
11.8%
D 43
 
7.1%
e 33
 
5.5%
S 32
 
5.3%
K 31
 
5.1%
G 30
 
5.0%
R 30
 
5.0%
I 27
 
4.5%
P 22
 
3.6%
Other values (33) 183
30.3%
Common
ValueCountFrequency (%)
* 40896
49.2%
37308
44.9%
- 3863
 
4.6%
) 189
 
0.2%
( 188
 
0.2%
4 119
 
0.1%
3 80
 
0.1%
1 76
 
0.1%
5 70
 
0.1%
. 60
 
0.1%
Other values (12) 288
 
0.3%
Han
ValueCountFrequency (%)
11
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 130912
61.0%
ASCII 83732
39.0%
CJK 11
 
< 0.1%
Number Forms 8
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 40896
48.8%
37308
44.6%
- 3863
 
4.6%
) 189
 
0.2%
( 188
 
0.2%
4 119
 
0.1%
B 101
 
0.1%
3 80
 
0.1%
1 76
 
0.1%
A 71
 
0.1%
Other values (53) 841
 
1.0%
Hangul
ValueCountFrequency (%)
17799
 
13.6%
9330
 
7.1%
8392
 
6.4%
7824
 
6.0%
7679
 
5.9%
7658
 
5.8%
7638
 
5.8%
7638
 
5.8%
5945
 
4.5%
3858
 
2.9%
Other values (418) 47151
36.0%
CJK
ValueCountFrequency (%)
11
100.0%
Number Forms
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

도로명주소
Text

MISSING 

Distinct4041
Distinct (%)53.3%
Missing2421
Missing (%)24.2%
Memory size156.2 KiB
2024-05-11T14:23:00.869244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length54
Mean length37.69996
Min length15

Characters and Unicode

Total characters285728
Distinct characters502
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2980 ?
Unique (%)39.3%

Sample

1st row서울특별시 강동구 천중로**가길 **, ***호 (길동, 광남벨라스아파트)
2nd row서울특별시 강동구 고덕로**길 **, ***호 (암사동)
3rd row서울특별시 강동구 양재대로 ****, ***동 ****호 (명일동, 래미안 솔베뉴)
4th row서울특별시 강동구 올림픽로**길 **, ***호 (천호동)
5th row서울특별시 강동구 고덕로 ***, ***동 ***호 (상일동, 고덕아르테온아파트)
ValueCountFrequency (%)
서울특별시 7578
14.0%
강동구 7564
14.0%
7460
13.8%
5413
 
10.0%
2548
 
4.7%
1795
 
3.3%
천호동 1710
 
3.2%
성내동 1525
 
2.8%
암사동 995
 
1.8%
길동 925
 
1.7%
Other values (2048) 16709
30.8%
2024-05-11T14:23:01.744967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 55390
19.4%
46653
16.3%
18181
 
6.4%
, 10315
 
3.6%
8642
 
3.0%
8617
 
3.0%
8074
 
2.8%
7752
 
2.7%
7625
 
2.7%
( 7615
 
2.7%
Other values (492) 106864
37.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 154752
54.2%
Other Punctuation 65721
23.0%
Space Separator 46653
 
16.3%
Open Punctuation 7615
 
2.7%
Close Punctuation 7615
 
2.7%
Dash Punctuation 1528
 
0.5%
Decimal Number 1008
 
0.4%
Uppercase Letter 685
 
0.2%
Lowercase Letter 141
 
< 0.1%
Letter Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18181
 
11.7%
8642
 
5.6%
8617
 
5.6%
8074
 
5.2%
7752
 
5.0%
7625
 
4.9%
7609
 
4.9%
7578
 
4.9%
7578
 
4.9%
7448
 
4.8%
Other values (425) 65648
42.4%
Uppercase Letter
ValueCountFrequency (%)
B 170
24.8%
A 159
23.2%
K 93
13.6%
R 58
 
8.5%
S 31
 
4.5%
I 29
 
4.2%
G 26
 
3.8%
P 23
 
3.4%
M 17
 
2.5%
C 14
 
2.0%
Other values (14) 65
 
9.5%
Lowercase Letter
ValueCountFrequency (%)
e 35
24.8%
i 22
15.6%
r 13
 
9.2%
l 12
 
8.5%
w 11
 
7.8%
v 9
 
6.4%
a 8
 
5.7%
b 7
 
5.0%
n 4
 
2.8%
d 4
 
2.8%
Other values (10) 16
11.3%
Decimal Number
ValueCountFrequency (%)
1 222
22.0%
0 152
15.1%
2 141
14.0%
5 105
10.4%
3 104
10.3%
4 69
 
6.8%
7 63
 
6.2%
9 60
 
6.0%
8 46
 
4.6%
6 46
 
4.6%
Other Punctuation
ValueCountFrequency (%)
* 55390
84.3%
, 10315
 
15.7%
. 8
 
< 0.1%
/ 5
 
< 0.1%
& 3
 
< 0.1%
Letter Number
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
46653
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7615
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7615
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1528
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 154740
54.2%
Common 130143
45.5%
Latin 833
 
0.3%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18181
 
11.7%
8642
 
5.6%
8617
 
5.6%
8074
 
5.2%
7752
 
5.0%
7625
 
4.9%
7609
 
4.9%
7578
 
4.9%
7578
 
4.9%
7448
 
4.8%
Other values (424) 65636
42.4%
Latin
ValueCountFrequency (%)
B 170
20.4%
A 159
19.1%
K 93
11.2%
R 58
 
7.0%
e 35
 
4.2%
S 31
 
3.7%
I 29
 
3.5%
G 26
 
3.1%
P 23
 
2.8%
i 22
 
2.6%
Other values (37) 187
22.4%
Common
ValueCountFrequency (%)
* 55390
42.6%
46653
35.8%
, 10315
 
7.9%
( 7615
 
5.9%
) 7615
 
5.9%
- 1528
 
1.2%
1 222
 
0.2%
0 152
 
0.1%
2 141
 
0.1%
5 105
 
0.1%
Other values (10) 407
 
0.3%
Han
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 154740
54.2%
ASCII 130969
45.8%
CJK 12
 
< 0.1%
Number Forms 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 55390
42.3%
46653
35.6%
, 10315
 
7.9%
( 7615
 
5.8%
) 7615
 
5.8%
- 1528
 
1.2%
1 222
 
0.2%
B 170
 
0.1%
A 159
 
0.1%
0 152
 
0.1%
Other values (54) 1150
 
0.9%
Hangul
ValueCountFrequency (%)
18181
 
11.7%
8642
 
5.6%
8617
 
5.6%
8074
 
5.2%
7752
 
5.0%
7625
 
4.9%
7609
 
4.9%
7578
 
4.9%
7578
 
4.9%
7448
 
4.8%
Other values (424) 65636
42.4%
CJK
ValueCountFrequency (%)
12
100.0%
Number Forms
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%

도로명우편번호
Text

MISSING 

Distinct371
Distinct (%)4.9%
Missing2421
Missing (%)24.2%
Memory size156.2 KiB
2024-05-11T14:23:02.155882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1223117
Min length5

Characters and Unicode

Total characters38822
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)0.7%

Sample

1st row05354
2nd row05251
3rd row05266
4th row05245
5th row05274
ValueCountFrequency (%)
05295 279
 
3.7%
05238 156
 
2.1%
05314 121
 
1.6%
05224 97
 
1.3%
05355 96
 
1.3%
05354 94
 
1.2%
05376 90
 
1.2%
05303 84
 
1.1%
05248 84
 
1.1%
05353 82
 
1.1%
Other values (361) 6396
84.4%
2024-05-11T14:23:02.768215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 8542
22.0%
0 8343
21.5%
3 5661
14.6%
2 4292
11.1%
4 3062
 
7.9%
1 2415
 
6.2%
8 2112
 
5.4%
6 1492
 
3.8%
7 1477
 
3.8%
9 1389
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38785
99.9%
Dash Punctuation 37
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 8542
22.0%
0 8343
21.5%
3 5661
14.6%
2 4292
11.1%
4 3062
 
7.9%
1 2415
 
6.2%
8 2112
 
5.4%
6 1492
 
3.8%
7 1477
 
3.8%
9 1389
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38822
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 8542
22.0%
0 8343
21.5%
3 5661
14.6%
2 4292
11.1%
4 3062
 
7.9%
1 2415
 
6.2%
8 2112
 
5.4%
6 1492
 
3.8%
7 1477
 
3.8%
9 1389
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38822
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 8542
22.0%
0 8343
21.5%
3 5661
14.6%
2 4292
11.1%
4 3062
 
7.9%
1 2415
 
6.2%
8 2112
 
5.4%
6 1492
 
3.8%
7 1477
 
3.8%
9 1389
 
3.6%
Distinct9830
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T14:23:03.220547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length6.6144
Min length1

Characters and Unicode

Total characters66144
Distinct characters1103
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9667 ?
Unique (%)96.7%

Sample

1st row블루베이
2nd row쿠우쿠
3rd row빙고스튜디오
4th row두두(DEUXDEUX)
5th row밍구망구
ValueCountFrequency (%)
주식회사 410
 
3.3%
60
 
0.5%
컴퍼니 29
 
0.2%
27
 
0.2%
스튜디오 21
 
0.2%
인셀덤 17
 
0.1%
company 17
 
0.1%
코리아 17
 
0.1%
스토어 15
 
0.1%
international 15
 
0.1%
Other values (10914) 11707
94.9%
2024-05-11T14:23:03.863959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2542
 
3.8%
2349
 
3.6%
2062
 
3.1%
( 1799
 
2.7%
) 1798
 
2.7%
1168
 
1.8%
996
 
1.5%
903
 
1.4%
774
 
1.2%
e 769
 
1.2%
Other values (1093) 50984
77.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47813
72.3%
Lowercase Letter 6542
 
9.9%
Uppercase Letter 4671
 
7.1%
Space Separator 2349
 
3.6%
Open Punctuation 1801
 
2.7%
Close Punctuation 1800
 
2.7%
Decimal Number 622
 
0.9%
Other Punctuation 319
 
0.5%
Other Symbol 142
 
0.2%
Dash Punctuation 68
 
0.1%
Other values (3) 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2542
 
5.3%
2062
 
4.3%
1168
 
2.4%
996
 
2.1%
903
 
1.9%
774
 
1.6%
758
 
1.6%
661
 
1.4%
647
 
1.4%
586
 
1.2%
Other values (1010) 36716
76.8%
Lowercase Letter
ValueCountFrequency (%)
e 769
11.8%
o 650
 
9.9%
a 591
 
9.0%
i 534
 
8.2%
n 519
 
7.9%
l 422
 
6.5%
t 372
 
5.7%
r 367
 
5.6%
s 330
 
5.0%
m 238
 
3.6%
Other values (16) 1750
26.8%
Uppercase Letter
ValueCountFrequency (%)
O 356
 
7.6%
S 351
 
7.5%
A 338
 
7.2%
E 326
 
7.0%
M 277
 
5.9%
N 271
 
5.8%
L 246
 
5.3%
I 231
 
4.9%
C 230
 
4.9%
T 230
 
4.9%
Other values (16) 1815
38.9%
Other Punctuation
ValueCountFrequency (%)
. 156
48.9%
& 93
29.2%
, 24
 
7.5%
' 21
 
6.6%
? 10
 
3.1%
# 5
 
1.6%
/ 3
 
0.9%
: 3
 
0.9%
! 2
 
0.6%
; 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 113
18.2%
0 113
18.2%
1 113
18.2%
4 53
8.5%
5 50
8.0%
3 48
7.7%
6 44
 
7.1%
7 32
 
5.1%
9 32
 
5.1%
8 24
 
3.9%
Open Punctuation
ValueCountFrequency (%)
( 1799
99.9%
[ 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1798
99.9%
] 2
 
0.1%
Space Separator
ValueCountFrequency (%)
2349
100.0%
Other Symbol
ValueCountFrequency (%)
142
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47943
72.5%
Latin 11213
 
17.0%
Common 6976
 
10.5%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2542
 
5.3%
2062
 
4.3%
1168
 
2.4%
996
 
2.1%
903
 
1.9%
774
 
1.6%
758
 
1.6%
661
 
1.4%
647
 
1.3%
586
 
1.2%
Other values (999) 36846
76.9%
Latin
ValueCountFrequency (%)
e 769
 
6.9%
o 650
 
5.8%
a 591
 
5.3%
i 534
 
4.8%
n 519
 
4.6%
l 422
 
3.8%
t 372
 
3.3%
r 367
 
3.3%
O 356
 
3.2%
S 351
 
3.1%
Other values (42) 6282
56.0%
Common
ValueCountFrequency (%)
2349
33.7%
( 1799
25.8%
) 1798
25.8%
. 156
 
2.2%
2 113
 
1.6%
0 113
 
1.6%
1 113
 
1.6%
& 93
 
1.3%
- 68
 
1.0%
4 53
 
0.8%
Other values (20) 321
 
4.6%
Han
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47800
72.3%
ASCII 18188
 
27.5%
None 143
 
0.2%
CJK 11
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2542
 
5.3%
2062
 
4.3%
1168
 
2.4%
996
 
2.1%
903
 
1.9%
774
 
1.6%
758
 
1.6%
661
 
1.4%
647
 
1.4%
586
 
1.2%
Other values (997) 36703
76.8%
ASCII
ValueCountFrequency (%)
2349
 
12.9%
( 1799
 
9.9%
) 1798
 
9.9%
e 769
 
4.2%
o 650
 
3.6%
a 591
 
3.2%
i 534
 
2.9%
n 519
 
2.9%
l 422
 
2.3%
t 372
 
2.0%
Other values (71) 8385
46.1%
None
ValueCountFrequency (%)
142
99.3%
1
 
0.7%
CJK
ValueCountFrequency (%)
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct9828
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-06-30 17:53:25
Maximum2024-05-09 11:23:15
2024-05-11T14:23:04.037201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:23:04.222468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7549 
U
2451 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7549
75.5%
U 2451
 
24.5%

Length

2024-05-11T14:23:04.507428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:23:04.673871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7549
75.5%
u 2451
 
24.5%
Distinct1516
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T14:23:04.843652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T14:23:05.054907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct456
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T14:23:05.282834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.6457
Min length1

Characters and Unicode

Total characters86457
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique286 ?
Unique (%)2.9%

Sample

1st row기타
2nd row의류/패션/잡화/뷰티
3rd row의류/패션/잡화/뷰티
4th row건강/식품
5th row교육/도서/완구/오락 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티 레져/여행/공연 자동차/자동차용품
ValueCountFrequency (%)
의류/패션/잡화/뷰티 3520
25.6%
종합몰 3405
24.7%
기타 1881
13.7%
건강/식품 1104
 
8.0%
교육/도서/완구/오락 720
 
5.2%
711
 
5.2%
가구/수납용품 561
 
4.1%
컴퓨터/사무용품 530
 
3.9%
가전 461
 
3.4%
레져/여행/공연 383
 
2.8%
Other values (3) 484
 
3.5%
2024-05-11T14:23:05.739218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16074
18.6%
3760
 
4.3%
3520
 
4.1%
3520
 
4.1%
3520
 
4.1%
3520
 
4.1%
3520
 
4.1%
3520
 
4.1%
3520
 
4.1%
3520
 
4.1%
Other values (41) 38463
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65912
76.2%
Other Punctuation 16074
 
18.6%
Space Separator 3760
 
4.3%
Dash Punctuation 711
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3405
 
5.2%
3405
 
5.2%
Other values (38) 30942
46.9%
Other Punctuation
ValueCountFrequency (%)
/ 16074
100.0%
Space Separator
ValueCountFrequency (%)
3760
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 711
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65912
76.2%
Common 20545
 
23.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3405
 
5.2%
3405
 
5.2%
Other values (38) 30942
46.9%
Common
ValueCountFrequency (%)
/ 16074
78.2%
3760
 
18.3%
- 711
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65912
76.2%
ASCII 20545
 
23.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16074
78.2%
3760
 
18.3%
- 711
 
3.5%
Hangul
ValueCountFrequency (%)
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3520
 
5.3%
3405
 
5.2%
3405
 
5.2%
Other values (38) 30942
46.9%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct4176
Distinct (%)46.6%
Missing1033
Missing (%)10.3%
Infinite0
Infinite (%)0.0%
Mean212259.73
Minimum192081.84
Maximum226417.65
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T14:23:05.928414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum192081.84
5-th percentile210812.02
Q1211411.32
median212022.34
Q3212679.85
95-th percentile215097.64
Maximum226417.65
Range34335.804
Interquartile range (IQR)1268.5279

Descriptive statistics

Standard deviation1246.3101
Coefficient of variation (CV)0.0058716276
Kurtosis10.450733
Mean212259.73
Median Absolute Deviation (MAD)638.75704
Skewness0.70802196
Sum1.903333 × 109
Variance1553288.9
MonotonicityNot monotonic
2024-05-11T14:23:06.148228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
212631.80093407 258
 
2.6%
211458.648093395 139
 
1.4%
212481.618347928 118
 
1.2%
211047.878697259 76
 
0.8%
212480.463676316 70
 
0.7%
211976.269176583 65
 
0.7%
212926.089912845 57
 
0.6%
212143.778930005 55
 
0.5%
213048.692179757 54
 
0.5%
211542.314116941 51
 
0.5%
Other values (4166) 8024
80.2%
(Missing) 1033
 
10.3%
ValueCountFrequency (%)
192081.84194947 1
 
< 0.1%
204511.0 1
 
< 0.1%
204586.181660577 1
 
< 0.1%
206219.841921018 1
 
< 0.1%
210128.950458944 1
 
< 0.1%
210501.030257012 1
 
< 0.1%
210522.681981372 1
 
< 0.1%
210543.133561209 1
 
< 0.1%
210550.777941681 1
 
< 0.1%
210553.290352325 3
< 0.1%
ValueCountFrequency (%)
226417.645595042 1
 
< 0.1%
216029.388021 7
 
0.1%
216018.463725432 1
 
< 0.1%
215966.729045494 4
 
< 0.1%
215945.480276364 1
 
< 0.1%
215927.688523964 10
0.1%
215908.25673494 1
 
< 0.1%
215901.594590118 3
 
< 0.1%
215898.113091143 19
0.2%
215888.898816 6
 
0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct4173
Distinct (%)46.5%
Missing1033
Missing (%)10.3%
Infinite0
Infinite (%)0.0%
Mean449008.94
Minimum441257.22
Maximum469653
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T14:23:06.360522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum441257.22
5-th percentile447240.66
Q1448106.44
median448993.71
Q3449832.19
95-th percentile450919.11
Maximum469653
Range28395.785
Interquartile range (IQR)1725.7492

Descriptive statistics

Standard deviation1195.7718
Coefficient of variation (CV)0.0026631358
Kurtosis10.45652
Mean449008.94
Median Absolute Deviation (MAD)878.9038
Skewness0.89832759
Sum4.0262632 × 109
Variance1429870.2
MonotonicityNot monotonic
2024-05-11T14:23:06.581816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
449475.1682321 258
 
2.6%
450316.255472825 139
 
1.4%
449133.956765955 118
 
1.2%
450021.504957301 76
 
0.8%
450472.374955204 70
 
0.7%
447462.077236492 65
 
0.7%
450124.105734971 62
 
0.6%
450751.524331915 57
 
0.6%
448048.253831401 55
 
0.5%
448273.114107211 51
 
0.5%
Other values (4163) 8016
80.2%
(Missing) 1033
 
10.3%
ValueCountFrequency (%)
441257.215350061 1
 
< 0.1%
443318.264543412 1
 
< 0.1%
446281.080793211 1
 
< 0.1%
446519.434702727 1
 
< 0.1%
446598.591776331 3
< 0.1%
446680.303498539 5
0.1%
446692.439727784 1
 
< 0.1%
446702.384306431 1
 
< 0.1%
446742.800284862 1
 
< 0.1%
446748.493341691 1
 
< 0.1%
ValueCountFrequency (%)
469653.0 1
 
< 0.1%
460419.891464186 1
 
< 0.1%
452785.109305238 1
 
< 0.1%
452758.558082703 1
 
< 0.1%
452632.154359492 1
 
< 0.1%
452344.444357981 1
 
< 0.1%
452309.711718 6
 
0.1%
452309.337081125 1
 
< 0.1%
452305.723682264 30
0.3%
452233.685440247 3
 
< 0.1%

자산규모
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6608 
<NA>
3392 

Length

Max length4
Median length1
Mean length2.0176
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 6608
66.1%
<NA> 3392
33.9%

Length

2024-05-11T14:23:06.801076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:23:06.954702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6608
66.1%
na 3392
33.9%

부채총액
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6608 
<NA>
3392 

Length

Max length4
Median length1
Mean length2.0176
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 6608
66.1%
<NA> 3392
33.9%

Length

2024-05-11T14:23:07.120015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:23:07.280636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6608
66.1%
na 3392
33.9%

자본금
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6608 
<NA>
3392 

Length

Max length4
Median length1
Mean length2.0176
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 6608
66.1%
<NA> 3392
33.9%

Length

2024-05-11T14:23:07.427763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:23:07.587204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6608
66.1%
na 3392
33.9%

판매방식명
Categorical

IMBALANCE 

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
5378 
<NA>
4059 
인터넷, 기타
 
191
기타
 
75
TV홈쇼핑, 인터넷
 
60
Other values (21)
 
237

Length

Max length26
Median length3
Mean length3.8281
Min length2

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row인터넷
4th row인터넷
5th row인터넷

Common Values

ValueCountFrequency (%)
인터넷 5378
53.8%
<NA> 4059
40.6%
인터넷, 기타 191
 
1.9%
기타 75
 
0.8%
TV홈쇼핑, 인터넷 60
 
0.6%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 45
 
0.4%
인터넷, 카다로그 32
 
0.3%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 24
 
0.2%
TV홈쇼핑, 인터넷, 카다로그 23
 
0.2%
TV홈쇼핑, 인터넷, 기타 23
 
0.2%
Other values (16) 90
 
0.9%

Length

2024-05-11T14:23:07.748601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 5840
54.2%
na 4059
37.6%
기타 385
 
3.6%
tv홈쇼핑 205
 
1.9%
카다로그 176
 
1.6%
신문잡지 116
 
1.1%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
4576324000020113240139302000592011-01-19<NA>1영업/정상1정상영업<NA><NA><NA><NA>442-7545<NA><NA>서울특별시 강동구 길동 ***-* 광남벨라스아파트서울특별시 강동구 천중로**가길 **, ***호 (길동, 광남벨라스아파트)05354블루베이2023-11-14 17:27:00U2022-10-31 23:06:00.0기타212013.378659448447.414642<NA><NA><NA><NA>
21867324000020233240289302017332023-07-24<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 암사동 ***-**서울특별시 강동구 고덕로**길 **, ***호 (암사동)05251쿠우쿠2023-07-24 13:35:33I2022-12-06 22:06:00.0의류/패션/잡화/뷰티211373.306956450065.370312<NA><NA><NA><NA>
133383240000202032402363020061620200402<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 명일동 ***번지 래미안 솔베뉴서울특별시 강동구 양재대로 ****, ***동 ****호 (명일동, 래미안 솔베뉴)05266빙고스튜디오2020-04-02 17:15:15I2020-04-04 00:23:22.0의류/패션/잡화/뷰티<NA><NA>000인터넷
155423240000202132402363020045520210204<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 천호동 ***-**서울특별시 강동구 올림픽로**길 **, ***호 (천호동)05245두두(DEUXDEUX)2021-02-04 16:25:55I2021-02-06 00:23:02.0건강/식품210921.329227449216.808923000인터넷
136183240000202032402363020091220200513<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 상일동 ***번지 고덕아르테온아파트서울특별시 강동구 고덕로 ***, ***동 ***호 (상일동, 고덕아르테온아파트)05274밍구망구2020-05-13 14:42:02I2022-01-26 00:22:40.0교육/도서/완구/오락 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티 레져/여행/공연 자동차/자동차용품214830.790079450337.049756000인터넷
121953240000201932402363020084620190723<NA>3폐업3폐업처리20220228<NA><NA><NA><NA><NA><NA>서울특별시 강동구 고덕동 ***번지 래미안힐스테이트고덕서울특별시 강동구 아리수로**길 **, ***동 **층 ****호 (고덕동, 래미안힐스테이트고덕)05229해피원Mall2022-03-10 14:06:26U2022-03-12 02:40:00.0종합몰212926.089913450751.524332000인터넷
31983240000200932401393020038220090615<NA>3폐업3폐업처리20150227<NA><NA><NA>0<NA>134021서울특별시 강동구 천호*동 ***번지 **호 부광아파트 ***호<NA><NA>오키샵2015-02-27 14:50:14I2018-08-31 23:59:59.0의류/패션/잡화/뷰티211000.524197449514.207107000인터넷
62333240000201332401893020009020130206<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7630-4044<NA>134020서울특별시 강동구 천호동 ***번지 *호<NA><NA>미로아르테2016-06-27 12:53:47I2018-08-31 23:59:59.0의류/패션/잡화/뷰티210760.824358448613.061756000인터넷
57803240000201232401753020041620120608<NA>3폐업3폐업처리20170705<NA><NA><NA><NA><NA>134020서울특별시 강동구 천호동 ***번지 **호서울특별시 강동구 구천면로**길 ** (천호동)134874마루미르2017-07-05 11:56:22I2018-08-31 23:59:59.0가구/수납용품210543.133561448784.8691000인터넷, 카다로그
35053240000200932401393020071320091015<NA>3폐업3폐업처리20211210<NA><NA><NA><NA><NA><NA>서울특별시 강동구 암사동 ***번지 **호 **통 *반<NA><NA>아이사랑2021-12-03 14:02:36U2021-12-07 02:40:00.0의류/패션/잡화/뷰티212180.209219449961.175248000인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
139933240000202032402363020130320200707<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 둔촌동 *** 둔촌푸르지오아파트서울특별시 강동구 명일로 ***, ***동 **층 ****호 (둔촌동, 둔촌푸르지오아파트)05360럭스화이트2020-07-07 09:39:06I2020-07-09 00:23:16.0종합몰212880.468341447998.456215000인터넷
23605324000020243240289302008092021-03-03<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 고덕동 *** 아남아파트서울특별시 강동구 양재대로 ****, 상가동 지하층 *** (G-***)호 (고덕동, 아남아파트)05230티에이치주얼리2024-04-08 10:04:23I2023-12-03 23:00:00.0의류/패션/잡화/뷰티212828.316146450633.424646<NA><NA><NA><NA>
95693240000201732401893020024220170317<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 천호대로***길 **, 지하*층 (길동)05351위토피아상사2022-11-22 17:40:55U2021-10-31 22:04:00.0가전 기타212548.250708448272.521845<NA><NA><NA><NA>
58693240000201232401753020051520120716<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 암사길 **, ***호 (암사동)134852라루체-프리마리(La luce-primary)2012-07-16 14:09:59I2018-08-31 23:59:59.0의류/패션/잡화/뷰티211606.095985449933.639817000인터넷
172863240000202132402363020225920211026<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 천호동 **-** 드림리치빌서울특별시 강동구 상암로**길 *, *층 ***호 (천호동, 드림리치빌)05307로로케이스2021-10-26 10:53:37I2021-10-28 00:22:56.0의류/패션/잡화/뷰티212350.971937449335.68175000인터넷
105613240000201832401893020022520180312<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 길동 ***번지 *호서울특별시 강동구 천중로**길 **, ***호 (길동)05344케이제이컴퍼니2018-03-12 10:12:08I2018-08-31 23:59:59.0기타212827.348923448527.016653000인터넷, 기타
23763240000200832401393020018920080321<NA>3폐업3폐업처리20080814<NA><NA><NA>477-3968<NA>134011서울특별시 강동구 길동 ***번지 *호 일성빌딩 ****호<NA><NA>우기닷컴2008-08-14 10:00:39I2022-01-26 00:22:40.0가전 컴퓨터/사무용품 의류/패션/잡화/뷰티211980.248326448111.060278000인터넷
21521324000020233240289302013862023-06-08<NA>3폐업3폐업처리2023-07-01<NA><NA><NA><NA><NA><NA>서울특별시 강동구 강일동 *** 강일리버파크*단지아파트서울특별시 강동구 아리수로**가길 **, ***동 ***호 (강일동, 강일리버파크*단지아파트)05211삼일공2023-07-03 15:05:10U2022-12-07 00:05:00.0기타215125.286683451722.369441<NA><NA><NA><NA>
81853240000201532401893020067520150810<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 성내동 ***번지 **호 정원빌딩서울특별시 강동구 양재대로**길 **, 정원빌딩 *층 ***호 (성내동)05403미장원 by 우리2020-02-28 14:53:42U2020-03-01 02:40:00.0의류/패션/잡화/뷰티211606.341465447388.053626000인터넷
184243240000202232402813020081820220426<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강동구 천호동 ** 용명브릿지아파트서울특별시 강동구 양재대로***길 **, ***동 ***호 (천호동, 용명브릿지아파트)05307허군상회2022-04-26 14:14:47I2021-12-03 22:08:00.0종합몰 의류/패션/잡화/뷰티212345.434584449497.61944<NA><NA><NA><NA>