Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells65657
Missing cells (%)22.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical10
Numeric5
DateTime6
Text7
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author광진구
URLhttps://data.seoul.go.kr/dataList/OA-18806/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 is highly imbalanced (99.9%)Imbalance
자산규모 is highly imbalanced (60.6%)Imbalance
부채총액 is highly imbalanced (60.6%)Imbalance
자본금 is highly imbalanced (60.6%)Imbalance
판매방식명 is highly imbalanced (72.4%)Imbalance
폐업일자 has 6941 (69.4%) missing valuesMissing
휴업시작일자 has 9946 (99.5%) missing valuesMissing
휴업종료일자 has 9946 (99.5%) missing valuesMissing
재개업일자 has 9988 (99.9%) missing valuesMissing
전화번호 has 6076 (60.8%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7587 (75.9%) missing valuesMissing
도로명주소 has 956 (9.6%) missing valuesMissing
도로명우편번호 has 2326 (23.3%) missing valuesMissing
좌표정보(X) has 931 (9.3%) missing valuesMissing
좌표정보(Y) has 931 (9.3%) missing valuesMissing
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 08:45:57.134468
Analysis finished2024-05-11 08:46:02.643723
Duration5.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3040000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3040000
2nd row3040000
3rd row3040000
4th row3040000
5th row3040000

Common Values

ValueCountFrequency (%)
3040000 10000
100.0%

Length

2024-05-11T08:46:02.960067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:03.294616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3040000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0165342 × 1018
Minimum1.997304 × 1018
Maximum2.032304 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:46:04.017820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.997304 × 1018
5-th percentile2.005304 × 1018
Q12.011304 × 1018
median2.018304 × 1018
Q32.021304 × 1018
95-th percentile2.023304 × 1018
Maximum2.032304 × 1018
Range3.5000009 × 1016
Interquartile range (IQR)1.0000004 × 1016

Descriptive statistics

Standard deviation6.0292357 × 1015
Coefficient of variation (CV)0.0029899
Kurtosis-0.78195328
Mean2.0165342 × 1018
Median Absolute Deviation (MAD)4 × 1015
Skewness-0.63940149
Sum3.0508927 × 1018
Variance3.6351683 × 1031
MonotonicityNot monotonic
2024-05-11T08:46:04.604730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2022304019030200284 1
 
< 0.1%
2019304019030200125 1
 
< 0.1%
2007304010330203606 1
 
< 0.1%
2010304010330200018 1
 
< 0.1%
2016304019030200358 1
 
< 0.1%
2016304019030200124 1
 
< 0.1%
2007304010330203814 1
 
< 0.1%
2009304010330200436 1
 
< 0.1%
2015304016830200547 1
 
< 0.1%
2017304019030201194 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1997304010330200797 1
< 0.1%
1997304010330200835 1
< 0.1%
1998304010330200104 1
< 0.1%
1999304010330201892 1
< 0.1%
1999304010330201912 1
< 0.1%
1999304010330201929 1
< 0.1%
1999304010330202248 1
< 0.1%
1999304010330202336 1
< 0.1%
2000304010330200029 1
< 0.1%
2000304010330202906 1
< 0.1%
ValueCountFrequency (%)
2032304019030200005 1
< 0.1%
2032304019030200003 1
< 0.1%
2024304019030200854 1
< 0.1%
2024304019030200851 1
< 0.1%
2024304019030200850 1
< 0.1%
2024304019030200847 1
< 0.1%
2024304019030200846 1
< 0.1%
2024304019030200845 1
< 0.1%
2024304019030200842 1
< 0.1%
2024304019030200841 1
< 0.1%
Distinct4113
Distinct (%)41.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1900-01-01 00:00:00
Maximum2024-05-09 00:00:00
2024-05-11T08:46:05.053442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:46:05.533077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9999 
20070529
 
1

Length

Max length8
Median length4
Mean length4.0004
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9999
> 99.9%
20070529 1
 
< 0.1%

Length

2024-05-11T08:46:06.017697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:06.447080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9999
> 99.9%
20070529 1
 
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4647 
3
2482 
4
2260 
5
577 
2
 
34

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 4647
46.5%
3 2482
24.8%
4 2260
22.6%
5 577
 
5.8%
2 34
 
0.3%

Length

2024-05-11T08:46:06.806388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:07.136484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4647
46.5%
3 2482
24.8%
4 2260
22.6%
5 577
 
5.8%
2 34
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4647 
폐업
2482 
취소/말소/만료/정지/중지
2260 
제외/삭제/전출
577 
휴업
 
34

Length

Max length14
Median length8
Mean length6.4523
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row영업/정상
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 4647
46.5%
폐업 2482
24.8%
취소/말소/만료/정지/중지 2260
22.6%
제외/삭제/전출 577
 
5.8%
휴업 34
 
0.3%

Length

2024-05-11T08:46:07.348801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:07.667359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4647
46.5%
폐업 2482
24.8%
취소/말소/만료/정지/중지 2260
22.6%
제외/삭제/전출 577
 
5.8%
휴업 34
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.086
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:46:07.977120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q35
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.3872943
Coefficient of variation (CV)0.77358855
Kurtosis-1.0731758
Mean3.086
Median Absolute Deviation (MAD)2
Skewness0.71470846
Sum30860
Variance5.6991739
MonotonicityNot monotonic
2024-05-11T08:46:08.321643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 4647
46.5%
3 2482
24.8%
7 2258
22.6%
5 577
 
5.8%
2 34
 
0.3%
4 2
 
< 0.1%
ValueCountFrequency (%)
1 4647
46.5%
2 34
 
0.3%
3 2482
24.8%
4 2
 
< 0.1%
5 577
 
5.8%
7 2258
22.6%
ValueCountFrequency (%)
7 2258
22.6%
5 577
 
5.8%
4 2
 
< 0.1%
3 2482
24.8%
2 34
 
0.3%
1 4647
46.5%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4647 
폐업처리
2482 
직권말소
2258 
타시군구이관
577 
휴업처리
 
34

Length

Max length6
Median length4
Mean length4.1154
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 4647
46.5%
폐업처리 2482
24.8%
직권말소 2258
22.6%
타시군구이관 577
 
5.8%
휴업처리 34
 
0.3%
직권취소 2
 
< 0.1%

Length

2024-05-11T08:46:08.746062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:09.101297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4647
46.5%
폐업처리 2482
24.8%
직권말소 2258
22.6%
타시군구이관 577
 
5.8%
휴업처리 34
 
0.3%
직권취소 2
 
< 0.1%

폐업일자
Text

MISSING 

Distinct2023
Distinct (%)66.1%
Missing6941
Missing (%)69.4%
Memory size156.2 KiB
2024-05-11T08:46:09.839384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length8.3579601
Min length7

Characters and Unicode

Total characters25567
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1346 ?
Unique (%)44.0%

Sample

1st row20201031
2nd row20120626
3rd row20191203
4th row20220504
5th row20080619
ValueCountFrequency (%)
20201231 9
 
0.3%
20171229 7
 
0.2%
2024-01-31 7
 
0.2%
20221220 7
 
0.2%
2023-12-30 7
 
0.2%
20230116 6
 
0.2%
2024-02-01 6
 
0.2%
2024-01-23 6
 
0.2%
2023-12-31 6
 
0.2%
2024-01-30 6
 
0.2%
Other values (2013) 2992
97.8%
2024-05-11T08:46:11.002716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7344
28.7%
2 6909
27.0%
1 4441
17.4%
3 1403
 
5.5%
- 1096
 
4.3%
4 907
 
3.5%
8 779
 
3.0%
9 772
 
3.0%
7 670
 
2.6%
5 644
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24471
95.7%
Dash Punctuation 1096
 
4.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7344
30.0%
2 6909
28.2%
1 4441
18.1%
3 1403
 
5.7%
4 907
 
3.7%
8 779
 
3.2%
9 772
 
3.2%
7 670
 
2.7%
5 644
 
2.6%
6 602
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 1096
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25567
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7344
28.7%
2 6909
27.0%
1 4441
17.4%
3 1403
 
5.5%
- 1096
 
4.3%
4 907
 
3.5%
8 779
 
3.0%
9 772
 
3.0%
7 670
 
2.6%
5 644
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25567
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7344
28.7%
2 6909
27.0%
1 4441
17.4%
3 1403
 
5.5%
- 1096
 
4.3%
4 907
 
3.5%
8 779
 
3.0%
9 772
 
3.0%
7 670
 
2.6%
5 644
 
2.5%

휴업시작일자
Date

MISSING 

Distinct52
Distinct (%)96.3%
Missing9946
Missing (%)99.5%
Memory size156.2 KiB
Minimum2007-06-20 00:00:00
Maximum2024-03-27 00:00:00
2024-05-11T08:46:11.449794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:46:11.950181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업종료일자
Date

MISSING 

Distinct49
Distinct (%)90.7%
Missing9946
Missing (%)99.5%
Memory size156.2 KiB
Minimum2007-12-31 00:00:00
Maximum2030-04-10 00:00:00
2024-05-11T08:46:12.390427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:46:12.915408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)

재개업일자
Date

MISSING 

Distinct12
Distinct (%)100.0%
Missing9988
Missing (%)99.9%
Memory size156.2 KiB
Minimum2008-08-22 00:00:00
Maximum2023-08-28 00:00:00
2024-05-11T08:46:13.371593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:46:13.884060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)

전화번호
Text

MISSING 

Distinct3400
Distinct (%)86.6%
Missing6076
Missing (%)60.8%
Memory size156.2 KiB
2024-05-11T08:46:14.625931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length10.843272
Min length1

Characters and Unicode

Total characters42549
Distinct characters20
Distinct categories8 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3305 ?
Unique (%)84.2%

Sample

1st row02-3445-1445
2nd row02-436-3126
3rd row02 2209 19
4th row02-420-3671
5th row1577-2947
ValueCountFrequency (%)
02 258
 
5.9%
123
 
2.8%
000-0000 94
 
2.1%
1 64
 
1.5%
00 58
 
1.3%
0 34
 
0.8%
031 27
 
0.6%
070 22
 
0.5%
02-0000-0000 20
 
0.5%
2201 10
 
0.2%
Other values (3517) 3697
83.9%
2024-05-11T08:46:16.124154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7815
18.4%
- 6594
15.5%
2 5250
12.3%
4 4405
10.4%
7 3270
7.7%
5 2831
 
6.7%
3 2590
 
6.1%
6 2552
 
6.0%
1 2383
 
5.6%
8 2328
 
5.5%
Other values (10) 2531
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 35190
82.7%
Dash Punctuation 6594
 
15.5%
Space Separator 703
 
1.7%
Other Punctuation 37
 
0.1%
Close Punctuation 9
 
< 0.1%
Math Symbol 8
 
< 0.1%
Open Punctuation 7
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7815
22.2%
2 5250
14.9%
4 4405
12.5%
7 3270
9.3%
5 2831
 
8.0%
3 2590
 
7.4%
6 2552
 
7.3%
1 2383
 
6.8%
8 2328
 
6.6%
9 1766
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 31
83.8%
/ 3
 
8.1%
, 2
 
5.4%
* 1
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 6594
100.0%
Space Separator
ValueCountFrequency (%)
703
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 42549
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7815
18.4%
- 6594
15.5%
2 5250
12.3%
4 4405
10.4%
7 3270
7.7%
5 2831
 
6.7%
3 2590
 
6.1%
6 2552
 
6.0%
1 2383
 
5.6%
8 2328
 
5.5%
Other values (10) 2531
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42549
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7815
18.4%
- 6594
15.5%
2 5250
12.3%
4 4405
10.4%
7 3270
7.7%
5 2831
 
6.7%
3 2590
 
6.1%
6 2552
 
6.0%
1 2383
 
5.6%
8 2328
 
5.5%
Other values (10) 2531
 
5.9%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct149
Distinct (%)6.2%
Missing7587
Missing (%)75.9%
Infinite0
Infinite (%)0.0%
Mean143449.34
Minimum130851
Maximum153023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:46:16.838411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum130851
5-th percentile143150
Q1143200
median143220
Q3143837
95-th percentile143910.8
Maximum153023
Range22172
Interquartile range (IQR)637

Descriptive statistics

Standard deviation544.38943
Coefficient of variation (CV)0.0037949944
Kurtosis239.05206
Mean143449.34
Median Absolute Deviation (MAD)40
Skewness-7.6962534
Sum3.4614325 × 108
Variance296359.85
MonotonicityNot monotonic
2024-05-11T08:46:17.400743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
143200 458
 
4.6%
143220 315
 
3.1%
143190 304
 
3.0%
143130 104
 
1.0%
143150 77
 
0.8%
143210 67
 
0.7%
143721 66
 
0.7%
143180 61
 
0.6%
143838 31
 
0.3%
143915 27
 
0.3%
Other values (139) 903
 
9.0%
(Missing) 7587
75.9%
ValueCountFrequency (%)
130851 1
 
< 0.1%
133120 1
 
< 0.1%
135280 1
 
< 0.1%
137040 1
 
< 0.1%
143130 104
 
1.0%
143140 6
 
0.1%
143150 77
 
0.8%
143180 61
 
0.6%
143190 304
3.0%
143191 2
 
< 0.1%
ValueCountFrequency (%)
153023 1
 
< 0.1%
143962 7
 
0.1%
143960 5
 
0.1%
143959 3
 
< 0.1%
143926 5
 
0.1%
143917 21
0.2%
143916 15
0.1%
143915 27
0.3%
143914 19
0.2%
143912 18
0.2%
Distinct4175
Distinct (%)41.9%
Missing29
Missing (%)0.3%
Memory size156.2 KiB
2024-05-11T08:46:18.306412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length71
Mean length26.917461
Min length13

Characters and Unicode

Total characters268394
Distinct characters516
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3339 ?
Unique (%)33.5%

Sample

1st row서울특별시 광진구 중곡동 ***-** 우진빌딩
2nd row서울특별시 광진구 자양동 ***번지 *호 *층 ***호
3rd row서울특별시 광진구 중곡동 ***-* 미트빌
4th row서울특별시 광진구 화양동 **-** ***호
5th row서울특별시 광진구 중곡동 **-* 세명더클래스
ValueCountFrequency (%)
광진구 9963
17.8%
서울특별시 9129
16.3%
7354
13.1%
번지 5139
9.2%
4334
7.7%
구의동 2546
 
4.5%
중곡동 2328
 
4.1%
자양동 2218
 
4.0%
2004
 
3.6%
화양동 1025
 
1.8%
Other values (2155) 10064
17.9%
2024-05-11T08:46:20.088521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 59480
22.2%
47737
17.8%
12689
 
4.7%
11138
 
4.1%
10842
 
4.0%
10216
 
3.8%
10097
 
3.8%
10066
 
3.8%
10012
 
3.7%
9132
 
3.4%
Other values (506) 76985
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 153999
57.4%
Other Punctuation 59811
 
22.3%
Space Separator 47737
 
17.8%
Dash Punctuation 4969
 
1.9%
Uppercase Letter 1129
 
0.4%
Decimal Number 445
 
0.2%
Lowercase Letter 182
 
0.1%
Close Punctuation 50
 
< 0.1%
Open Punctuation 49
 
< 0.1%
Math Symbol 11
 
< 0.1%
Other values (2) 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12689
 
8.2%
11138
 
7.2%
10842
 
7.0%
10216
 
6.6%
10097
 
6.6%
10066
 
6.5%
10012
 
6.5%
9132
 
5.9%
9129
 
5.9%
7901
 
5.1%
Other values (434) 52777
34.3%
Uppercase Letter
ValueCountFrequency (%)
B 387
34.3%
A 196
17.4%
C 154
 
13.6%
D 107
 
9.5%
S 43
 
3.8%
T 37
 
3.3%
M 30
 
2.7%
K 26
 
2.3%
Y 23
 
2.0%
I 20
 
1.8%
Other values (14) 106
 
9.4%
Lowercase Letter
ValueCountFrequency (%)
e 39
21.4%
r 18
9.9%
a 17
9.3%
o 14
 
7.7%
c 13
 
7.1%
t 13
 
7.1%
w 11
 
6.0%
i 11
 
6.0%
l 9
 
4.9%
s 8
 
4.4%
Other values (11) 29
15.9%
Decimal Number
ValueCountFrequency (%)
3 76
17.1%
2 71
16.0%
1 67
15.1%
4 49
11.0%
5 45
10.1%
6 32
7.2%
7 31
7.0%
9 27
 
6.1%
0 27
 
6.1%
8 20
 
4.5%
Other Punctuation
ValueCountFrequency (%)
* 59480
99.4%
, 284
 
0.5%
/ 19
 
< 0.1%
@ 15
 
< 0.1%
& 6
 
< 0.1%
. 6
 
< 0.1%
? 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
36.4%
3
27.3%
2
18.2%
2
18.2%
Space Separator
ValueCountFrequency (%)
47737
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4969
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 154000
57.4%
Common 113072
42.1%
Latin 1322
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12689
 
8.2%
11138
 
7.2%
10842
 
7.0%
10216
 
6.6%
10097
 
6.6%
10066
 
6.5%
10012
 
6.5%
9132
 
5.9%
9129
 
5.9%
7901
 
5.1%
Other values (435) 52778
34.3%
Latin
ValueCountFrequency (%)
B 387
29.3%
A 196
14.8%
C 154
 
11.6%
D 107
 
8.1%
S 43
 
3.3%
e 39
 
3.0%
T 37
 
2.8%
M 30
 
2.3%
K 26
 
2.0%
Y 23
 
1.7%
Other values (39) 280
21.2%
Common
ValueCountFrequency (%)
* 59480
52.6%
47737
42.2%
- 4969
 
4.4%
, 284
 
0.3%
3 76
 
0.1%
2 71
 
0.1%
1 67
 
0.1%
) 50
 
< 0.1%
( 49
 
< 0.1%
4 49
 
< 0.1%
Other values (12) 240
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 153999
57.4%
ASCII 114383
42.6%
Number Forms 11
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 59480
52.0%
47737
41.7%
- 4969
 
4.3%
B 387
 
0.3%
, 284
 
0.2%
A 196
 
0.2%
C 154
 
0.1%
D 107
 
0.1%
3 76
 
0.1%
2 71
 
0.1%
Other values (57) 922
 
0.8%
Hangul
ValueCountFrequency (%)
12689
 
8.2%
11138
 
7.2%
10842
 
7.0%
10216
 
6.6%
10097
 
6.6%
10066
 
6.5%
10012
 
6.5%
9132
 
5.9%
9129
 
5.9%
7901
 
5.1%
Other values (434) 52777
34.3%
Number Forms
ValueCountFrequency (%)
4
36.4%
3
27.3%
2
18.2%
2
18.2%
None
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct4971
Distinct (%)55.0%
Missing956
Missing (%)9.6%
Memory size156.2 KiB
2024-05-11T08:46:20.752484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length54
Mean length34.679345
Min length16

Characters and Unicode

Total characters313640
Distinct characters514
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3782 ?
Unique (%)41.8%

Sample

1st row서울특별시 광진구 능동로 ***, 우진빌딩 *층 ***호 (중곡동)
2nd row서울특별시 광진구 자양로*길 **, *층 ***호 (자양동)
3rd row서울특별시 광진구 용마산로**길 **, ***호 (중곡동, 미트빌)
4th row서울특별시 광진구 동일로**길 **-*, ***호 (화양동)
5th row서울특별시 광진구 긴고랑로**길 *-**, ***호 (중곡동, 세명더클래스)
ValueCountFrequency (%)
서울특별시 9053
15.2%
9039
15.2%
광진구 9034
15.2%
4700
 
7.9%
3040
 
5.1%
중곡동 2030
 
3.4%
구의동 1922
 
3.2%
자양동 1875
 
3.2%
화양동 906
 
1.5%
광나루로**길 832
 
1.4%
Other values (2417) 16971
28.6%
2024-05-11T08:46:22.133502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 58157
18.5%
50399
16.1%
12761
 
4.1%
11787
 
3.8%
11021
 
3.5%
, 10295
 
3.3%
9284
 
3.0%
9169
 
2.9%
9135
 
2.9%
9121
 
2.9%
Other values (504) 122511
39.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 171970
54.8%
Other Punctuation 68478
 
21.8%
Space Separator 50399
 
16.1%
Close Punctuation 9082
 
2.9%
Open Punctuation 9077
 
2.9%
Dash Punctuation 2558
 
0.8%
Uppercase Letter 1230
 
0.4%
Decimal Number 595
 
0.2%
Lowercase Letter 198
 
0.1%
Math Symbol 42
 
< 0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12761
 
7.4%
11787
 
6.9%
11021
 
6.4%
9284
 
5.4%
9169
 
5.3%
9135
 
5.3%
9121
 
5.3%
9084
 
5.3%
9059
 
5.3%
9055
 
5.3%
Other values (433) 72494
42.2%
Uppercase Letter
ValueCountFrequency (%)
B 455
37.0%
A 277
22.5%
C 146
 
11.9%
D 111
 
9.0%
S 41
 
3.3%
Y 23
 
1.9%
K 23
 
1.9%
T 21
 
1.7%
I 18
 
1.5%
E 18
 
1.5%
Other values (14) 97
 
7.9%
Lowercase Letter
ValueCountFrequency (%)
e 39
19.7%
r 18
9.1%
o 17
8.6%
a 16
8.1%
b 15
 
7.6%
s 14
 
7.1%
c 13
 
6.6%
t 12
 
6.1%
w 11
 
5.6%
i 9
 
4.5%
Other values (11) 34
17.2%
Decimal Number
ValueCountFrequency (%)
1 126
21.2%
2 95
16.0%
3 75
12.6%
0 69
11.6%
4 59
9.9%
5 52
8.7%
6 42
 
7.1%
7 37
 
6.2%
8 25
 
4.2%
9 15
 
2.5%
Other Punctuation
ValueCountFrequency (%)
* 58157
84.9%
, 10295
 
15.0%
/ 10
 
< 0.1%
. 7
 
< 0.1%
& 6
 
< 0.1%
@ 3
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
40.0%
3
30.0%
2
20.0%
1
 
10.0%
Space Separator
ValueCountFrequency (%)
50399
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9082
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9077
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2558
100.0%
Math Symbol
ValueCountFrequency (%)
~ 42
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 171971
54.8%
Common 140231
44.7%
Latin 1438
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12761
 
7.4%
11787
 
6.9%
11021
 
6.4%
9284
 
5.4%
9169
 
5.3%
9135
 
5.3%
9121
 
5.3%
9084
 
5.3%
9059
 
5.3%
9055
 
5.3%
Other values (434) 72495
42.2%
Latin
ValueCountFrequency (%)
B 455
31.6%
A 277
19.3%
C 146
 
10.2%
D 111
 
7.7%
S 41
 
2.9%
e 39
 
2.7%
Y 23
 
1.6%
K 23
 
1.6%
T 21
 
1.5%
r 18
 
1.3%
Other values (39) 284
19.7%
Common
ValueCountFrequency (%)
* 58157
41.5%
50399
35.9%
, 10295
 
7.3%
) 9082
 
6.5%
( 9077
 
6.5%
- 2558
 
1.8%
1 126
 
0.1%
2 95
 
0.1%
3 75
 
0.1%
0 69
 
< 0.1%
Other values (11) 298
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 171970
54.8%
ASCII 141659
45.2%
Number Forms 10
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 58157
41.1%
50399
35.6%
, 10295
 
7.3%
) 9082
 
6.4%
( 9077
 
6.4%
- 2558
 
1.8%
B 455
 
0.3%
A 277
 
0.2%
C 146
 
0.1%
1 126
 
0.1%
Other values (56) 1087
 
0.8%
Hangul
ValueCountFrequency (%)
12761
 
7.4%
11787
 
6.9%
11021
 
6.4%
9284
 
5.4%
9169
 
5.3%
9135
 
5.3%
9121
 
5.3%
9084
 
5.3%
9059
 
5.3%
9055
 
5.3%
Other values (433) 72494
42.2%
Number Forms
ValueCountFrequency (%)
4
40.0%
3
30.0%
2
20.0%
1
 
10.0%
None
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct377
Distinct (%)4.9%
Missing2326
Missing (%)23.3%
Memory size156.2 KiB
2024-05-11T08:46:23.054601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.170185
Min length5

Characters and Unicode

Total characters39676
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)0.5%

Sample

1st row04928
2nd row05112
3rd row04938
4th row05019
5th row04945
ValueCountFrequency (%)
05116 264
 
3.4%
05015 190
 
2.5%
05001 141
 
1.8%
04969 120
 
1.6%
04941 115
 
1.5%
143721 104
 
1.4%
04996 74
 
1.0%
05026 73
 
1.0%
05041 73
 
1.0%
05022 72
 
0.9%
Other values (367) 6448
84.0%
2024-05-11T08:46:24.447659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10870
27.4%
4 5574
14.0%
5 4934
12.4%
9 4525
11.4%
1 4461
11.2%
3 2585
 
6.5%
8 2075
 
5.2%
2 1588
 
4.0%
6 1578
 
4.0%
7 1473
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39663
> 99.9%
Dash Punctuation 13
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10870
27.4%
4 5574
14.1%
5 4934
12.4%
9 4525
11.4%
1 4461
11.2%
3 2585
 
6.5%
8 2075
 
5.2%
2 1588
 
4.0%
6 1578
 
4.0%
7 1473
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 39676
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10870
27.4%
4 5574
14.0%
5 4934
12.4%
9 4525
11.4%
1 4461
11.2%
3 2585
 
6.5%
8 2075
 
5.2%
2 1588
 
4.0%
6 1578
 
4.0%
7 1473
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 39676
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10870
27.4%
4 5574
14.0%
5 4934
12.4%
9 4525
11.4%
1 4461
11.2%
3 2585
 
6.5%
8 2075
 
5.2%
2 1588
 
4.0%
6 1578
 
4.0%
7 1473
 
3.7%
Distinct9868
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T08:46:25.428663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length38
Mean length7.0922
Min length1

Characters and Unicode

Total characters70922
Distinct characters1107
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9742 ?
Unique (%)97.4%

Sample

1st row(주)에이퍼스
2nd row브라이트휠(B&W)
3rd row소이따
4th row위대한 겨자씨(Great little seeds)
5th row민들레꽃차
ValueCountFrequency (%)
주식회사 657
 
5.1%
98
 
0.8%
컴퍼니 41
 
0.3%
company 35
 
0.3%
스튜디오 29
 
0.2%
korea 20
 
0.2%
co 18
 
0.1%
코리아 16
 
0.1%
co.,ltd 16
 
0.1%
유한회사 14
 
0.1%
Other values (11078) 11959
92.7%
2024-05-11T08:46:27.119860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2907
 
4.1%
2479
 
3.5%
( 2293
 
3.2%
) 2293
 
3.2%
2069
 
2.9%
1505
 
2.1%
1089
 
1.5%
1024
 
1.4%
e 961
 
1.4%
910
 
1.3%
Other values (1097) 53392
75.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48324
68.1%
Lowercase Letter 7775
 
11.0%
Uppercase Letter 6268
 
8.8%
Space Separator 2907
 
4.1%
Close Punctuation 2295
 
3.2%
Open Punctuation 2294
 
3.2%
Decimal Number 534
 
0.8%
Other Punctuation 419
 
0.6%
Dash Punctuation 75
 
0.1%
Connector Punctuation 20
 
< 0.1%
Other values (3) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2479
 
5.1%
2069
 
4.3%
1505
 
3.1%
1089
 
2.3%
1024
 
2.1%
910
 
1.9%
786
 
1.6%
753
 
1.6%
743
 
1.5%
692
 
1.4%
Other values (1013) 36274
75.1%
Lowercase Letter
ValueCountFrequency (%)
e 961
12.4%
o 777
 
10.0%
a 662
 
8.5%
n 576
 
7.4%
i 558
 
7.2%
r 485
 
6.2%
t 476
 
6.1%
l 404
 
5.2%
s 392
 
5.0%
m 306
 
3.9%
Other values (16) 2178
28.0%
Uppercase Letter
ValueCountFrequency (%)
A 515
 
8.2%
O 481
 
7.7%
E 476
 
7.6%
S 410
 
6.5%
C 374
 
6.0%
N 365
 
5.8%
I 363
 
5.8%
T 335
 
5.3%
L 326
 
5.2%
M 310
 
4.9%
Other values (16) 2313
36.9%
Other Punctuation
ValueCountFrequency (%)
. 217
51.8%
& 94
22.4%
, 53
 
12.6%
' 25
 
6.0%
# 8
 
1.9%
? 8
 
1.9%
: 6
 
1.4%
/ 5
 
1.2%
! 1
 
0.2%
; 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 107
20.0%
2 105
19.7%
0 70
13.1%
3 56
10.5%
5 42
 
7.9%
9 42
 
7.9%
4 41
 
7.7%
7 34
 
6.4%
8 21
 
3.9%
6 16
 
3.0%
Open Punctuation
ValueCountFrequency (%)
( 2293
> 99.9%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2293
99.9%
] 2
 
0.1%
Other Symbol
ValueCountFrequency (%)
1
50.0%
° 1
50.0%
Space Separator
ValueCountFrequency (%)
2907
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 20
100.0%
Math Symbol
ValueCountFrequency (%)
+ 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48306
68.1%
Latin 14043
 
19.8%
Common 8554
 
12.1%
Han 19
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2479
 
5.1%
2069
 
4.3%
1505
 
3.1%
1089
 
2.3%
1024
 
2.1%
910
 
1.9%
786
 
1.6%
753
 
1.6%
743
 
1.5%
692
 
1.4%
Other values (995) 36256
75.1%
Latin
ValueCountFrequency (%)
e 961
 
6.8%
o 777
 
5.5%
a 662
 
4.7%
n 576
 
4.1%
i 558
 
4.0%
A 515
 
3.7%
r 485
 
3.5%
O 481
 
3.4%
t 476
 
3.4%
E 476
 
3.4%
Other values (42) 8076
57.5%
Common
ValueCountFrequency (%)
2907
34.0%
( 2293
26.8%
) 2293
26.8%
. 217
 
2.5%
1 107
 
1.3%
2 105
 
1.2%
& 94
 
1.1%
- 75
 
0.9%
0 70
 
0.8%
3 56
 
0.7%
Other values (21) 337
 
3.9%
Han
ValueCountFrequency (%)
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (9) 9
47.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48304
68.1%
ASCII 22595
31.9%
CJK 19
 
< 0.1%
None 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2907
 
12.9%
( 2293
 
10.1%
) 2293
 
10.1%
e 961
 
4.3%
o 777
 
3.4%
a 662
 
2.9%
n 576
 
2.5%
i 558
 
2.5%
A 515
 
2.3%
r 485
 
2.1%
Other values (71) 10568
46.8%
Hangul
ValueCountFrequency (%)
2479
 
5.1%
2069
 
4.3%
1505
 
3.1%
1089
 
2.3%
1024
 
2.1%
910
 
1.9%
786
 
1.6%
753
 
1.6%
743
 
1.5%
692
 
1.4%
Other values (993) 36254
75.1%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
° 1
33.3%
CJK
ValueCountFrequency (%)
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (9) 9
47.4%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct9865
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-03 10:15:39
Maximum2024-05-09 16:07:20
2024-05-11T08:46:27.698219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:46:28.246323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7670 
U
2330 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7670
76.7%
U 2330
 
23.3%

Length

2024-05-11T08:46:28.741296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:29.015052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7670
76.7%
u 2330
 
23.3%
Distinct1459
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T08:46:29.331425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T08:46:29.908486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct513
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T08:46:30.491488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length9.1832
Min length1

Characters and Unicode

Total characters91832
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique325 ?
Unique (%)3.2%

Sample

1st row가전 컴퓨터/사무용품 기타
2nd row종합몰 교육/도서/완구/오락 가전 컴퓨터/사무용품 건강/식품 기타 상품권 레져/여행/공연 가구/수납용품 자동차/자동차용품
3rd row종합몰
4th row기타
5th row기타
ValueCountFrequency (%)
의류/패션/잡화/뷰티 3990
28.3%
종합몰 3005
21.3%
기타 1855
13.1%
건강/식품 1015
 
7.2%
808
 
5.7%
교육/도서/완구/오락 722
 
5.1%
컴퓨터/사무용품 681
 
4.8%
가전 635
 
4.5%
가구/수납용품 512
 
3.6%
레져/여행/공연 389
 
2.8%
Other values (3) 510
 
3.6%
2024-05-11T08:46:31.505822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 17531
19.1%
4122
 
4.5%
3990
 
4.3%
3990
 
4.3%
3990
 
4.3%
3990
 
4.3%
3990
 
4.3%
3990
 
4.3%
3990
 
4.3%
3990
 
4.3%
Other values (41) 38259
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69371
75.5%
Other Punctuation 17531
 
19.1%
Space Separator 4122
 
4.5%
Dash Punctuation 808
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3005
 
4.3%
3005
 
4.3%
Other values (38) 31441
45.3%
Other Punctuation
ValueCountFrequency (%)
/ 17531
100.0%
Space Separator
ValueCountFrequency (%)
4122
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 808
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69371
75.5%
Common 22461
 
24.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3005
 
4.3%
3005
 
4.3%
Other values (38) 31441
45.3%
Common
ValueCountFrequency (%)
/ 17531
78.1%
4122
 
18.4%
- 808
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69371
75.5%
ASCII 22461
 
24.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 17531
78.1%
4122
 
18.4%
- 808
 
3.6%
Hangul
ValueCountFrequency (%)
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3990
 
5.8%
3005
 
4.3%
3005
 
4.3%
Other values (38) 31441
45.3%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct5063
Distinct (%)55.8%
Missing931
Missing (%)9.3%
Infinite0
Infinite (%)0.0%
Mean207231.67
Minimum189089.93
Maximum217951.29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:46:32.534872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum189089.93
5-th percentile205846.81
Q1206562.36
median207214.48
Q3207846.15
95-th percentile208598.55
Maximum217951.29
Range28861.366
Interquartile range (IQR)1283.7912

Descriptive statistics

Standard deviation934.12033
Coefficient of variation (CV)0.0045076137
Kurtosis17.403387
Mean207231.67
Median Absolute Deviation (MAD)640.64599
Skewness-0.43727774
Sum1.879384 × 109
Variance872580.78
MonotonicityNot monotonic
2024-05-11T08:46:33.116214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
208394.416382167 477
 
4.8%
206015.280335652 157
 
1.6%
206232.014507331 122
 
1.2%
207641.17491047 79
 
0.8%
208558.286631653 68
 
0.7%
207388.308848516 54
 
0.5%
206349.675048285 50
 
0.5%
208403.746065593 44
 
0.4%
206792.027736463 38
 
0.4%
209734.218553133 36
 
0.4%
Other values (5053) 7944
79.4%
(Missing) 931
 
9.3%
ValueCountFrequency (%)
189089.927764903 1
< 0.1%
200542.258243379 1
< 0.1%
202376.541635724 1
< 0.1%
204286.367631321 1
< 0.1%
204325.143969001 1
< 0.1%
205275.890261202 1
< 0.1%
205281.473683434 1
< 0.1%
205287.059755932 1
< 0.1%
205290.335639401 1
< 0.1%
205310.627685744 2
< 0.1%
ValueCountFrequency (%)
217951.293581483 1
 
< 0.1%
209775.267831522 3
 
< 0.1%
209734.218553133 36
0.4%
209682.485117912 2
 
< 0.1%
209679.58447051 11
 
0.1%
209668.90120464 14
 
0.1%
209663.422431459 1
 
< 0.1%
209653.285186637 11
 
0.1%
209639.359596423 1
 
< 0.1%
209638.865966592 5
 
0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct5063
Distinct (%)55.8%
Missing931
Missing (%)9.3%
Infinite0
Infinite (%)0.0%
Mean449396.57
Minimum442569.3
Maximum465956.24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T08:46:33.699571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum442569.3
5-th percentile447725.92
Q1448353.91
median449180.41
Q3450398.16
95-th percentile451512.48
Maximum465956.24
Range23386.943
Interquartile range (IQR)2044.2509

Descriptive statistics

Standard deviation1225.5605
Coefficient of variation (CV)0.0027271248
Kurtosis2.7412537
Mean449396.57
Median Absolute Deviation (MAD)1015.1285
Skewness0.55630512
Sum4.0755775 × 109
Variance1501998.6
MonotonicityNot monotonic
2024-05-11T08:46:34.272969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
448165.279999905 477
 
4.8%
449180.408499373 157
 
1.6%
450174.133962364 122
 
1.2%
451208.296961305 79
 
0.8%
448353.905156369 68
 
0.7%
448313.517465433 54
 
0.5%
448396.939704285 50
 
0.5%
448490.239179557 44
 
0.4%
450583.054957622 38
 
0.4%
449933.547744527 36
 
0.4%
Other values (5053) 7944
79.4%
(Missing) 931
 
9.3%
ValueCountFrequency (%)
442569.300676147 1
 
< 0.1%
444689.095230482 1
 
< 0.1%
445340.688938637 1
 
< 0.1%
447218.339550141 1
 
< 0.1%
447286.464742664 1
 
< 0.1%
447291.017509694 1
 
< 0.1%
447309.792408824 1
 
< 0.1%
447322.516325441 5
0.1%
447325.407152379 10
0.1%
447325.749899489 1
 
< 0.1%
ValueCountFrequency (%)
465956.244058608 1
 
< 0.1%
453421.056687561 1
 
< 0.1%
452394.714714 1
 
< 0.1%
452140.330430016 1
 
< 0.1%
452134.45276901 1
 
< 0.1%
452131.060476514 1
 
< 0.1%
452123.458782283 2
< 0.1%
452121.121578344 1
 
< 0.1%
452119.675279713 3
< 0.1%
452113.336508229 1
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9223 
0
 
777

Length

Max length4
Median length4
Mean length3.7669
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9223
92.2%
0 777
 
7.8%

Length

2024-05-11T08:46:34.875883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:35.216040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9223
92.2%
0 777
 
7.8%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9223 
0
 
777

Length

Max length4
Median length4
Mean length3.7669
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9223
92.2%
0 777
 
7.8%

Length

2024-05-11T08:46:35.648074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:36.250529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9223
92.2%
0 777
 
7.8%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9223 
0
 
777

Length

Max length4
Median length4
Mean length3.7669
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9223
92.2%
0 777
 
7.8%

Length

2024-05-11T08:46:36.850892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T08:46:37.282958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9223
92.2%
0 777
 
7.8%

판매방식명
Categorical

IMBALANCE 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
4910 
인터넷
4764 
인터넷, 기타
 
110
기타
 
56
TV홈쇼핑, 인터넷
 
28
Other values (20)
 
132

Length

Max length26
Median length22
Mean length3.6863
Min length2

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row인터넷, 기타
2nd row<NA>
3rd row<NA>
4th row인터넷
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4910
49.1%
인터넷 4764
47.6%
인터넷, 기타 110
 
1.1%
기타 56
 
0.6%
TV홈쇼핑, 인터넷 28
 
0.3%
인터넷, 카다로그 21
 
0.2%
TV홈쇼핑 20
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 18
 
0.2%
인터넷, 카다로그, 기타 13
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 11
 
0.1%
Other values (15) 49
 
0.5%

Length

2024-05-11T08:46:37.653499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 4999
48.2%
na 4910
47.4%
기타 217
 
2.1%
카다로그 91
 
0.9%
tv홈쇼핑 88
 
0.8%
신문잡지 63
 
0.6%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
193743040000202230401903020028420121121<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-3445-1445<NA><NA>서울특별시 광진구 중곡동 ***-** 우진빌딩서울특별시 광진구 능동로 ***, 우진빌딩 *층 ***호 (중곡동)04928(주)에이퍼스2022-03-13 16:57:56U2022-03-15 02:40:00.0가전 컴퓨터/사무용품 기타207062.687268450747.18141000인터넷, 기타
138823040000201930401903020144320191112<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 자양동 ***번지 *호 *층 ***호서울특별시 광진구 자양로*길 **, *층 ***호 (자양동)05112브라이트휠(B&W)2019-11-18 11:04:15I2021-12-03 22:02:00.0종합몰 교육/도서/완구/오락 가전 컴퓨터/사무용품 건강/식품 기타 상품권 레져/여행/공연 가구/수납용품 자동차/자동차용품207610.321392447789.538814<NA><NA><NA><NA>
213493040000202230401903020226220221103<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-436-3126<NA><NA>서울특별시 광진구 중곡동 ***-* 미트빌서울특별시 광진구 용마산로**길 **, ***호 (중곡동, 미트빌)04938소이따2022-11-07 14:10:25I2021-11-01 00:09:00.0종합몰207717.744036452031.747004<NA><NA><NA><NA>
167533040000202130401903020019320210118<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 화양동 **-** ***호서울특별시 광진구 동일로**길 **-*, ***호 (화양동)05019위대한 겨자씨(Great little seeds)2021-01-18 13:59:05I2021-01-20 00:23:04.0기타205900.221448980.164848<NA><NA><NA>인터넷
210713040000202230401903020198420220926<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 중곡동 **-* 세명더클래스서울특별시 광진구 긴고랑로**길 *-**, ***호 (중곡동, 세명더클래스)04945민들레꽃차2022-09-29 10:56:14I2021-10-31 00:01:00.0기타207841.657779450712.061474<NA><NA><NA><NA>
101333040000201630401903020027520160920<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 자양동 ***번지 **호 양자빌딩 ***호서울특별시 광진구 자양로*길 **, ***호 (자양동, 양자빌딩)05112프리썸2018-05-08 11:03:07I2018-08-31 23:59:59.0의류/패션/잡화/뷰티207624.830311447727.902711<NA><NA><NA>인터넷
154343040000202030401903020145520200722<NA>3폐업3폐업처리20201031<NA><NA><NA><NA><NA><NA>서울특별시 광진구 중곡동 ***-** *층 ***호서울특별시 광진구 용마산로**길 **-*, *층 ***호 (중곡동)04938더 글라스샵2020-11-02 13:12:43U2020-11-04 02:40:00.0의류/패션/잡화/뷰티207627.052747451934.155638<NA><NA><NA>인터넷
21363040000200730401033020321720070102<NA>3폐업3폐업처리20120626<NA><NA><NA>02 2209 19<NA><NA>서울시 광진구 중곡동***-*<NA><NA>주식회사이콘도텔2012-06-27 16:54:24I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
201813040000202230401903020109220220526<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 중곡동 ***-** 성웅빌딩서울특별시 광진구 용마산로 ***, 성웅빌딩 *층 (중곡동)04941제이인터내셔널2022-05-30 09:44:03I2021-12-06 00:04:00.0종합몰 기타207642.618403451170.391662<NA><NA><NA><NA>
217133040000202330401903020008320170619<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-420-3671<NA><NA>서울특별시 광진구 중곡동 ***-** 고씨중앙종문회 빌딩서울특별시 광진구 천호대로 ***, 고씨중앙종문회 빌딩 *층 (중곡동)04931(주) 하임코퍼레이션2023-01-11 09:28:37U2022-11-30 23:03:00.0종합몰 가전 가구/수납용품 의류/패션/잡화/뷰티 건강/식품 기타207568.156752450289.706551<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
104813040000201730401903020011920170124<NA>5제외/삭제/전출5타시군구이관20170627<NA><NA><NA>02-3436-1330<NA><NA>서울특별시 광진구 구의동 ***번지 **호 대영트윈.투 *층 ***호서울특별시 광진구 아차산로**길 **-**, *층 ***호 (구의동, 대영트윈.투)05047주식회사 안전아이티씨2017-06-27 17:12:03I2021-12-03 22:02:00.0레져/여행/공연 상품권208154.57191448512.071261<NA><NA><NA><NA>
133803040000201930401903020089120190703<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 광장동 ***번지 *호 광장극동아파트 *동 ****호서울특별시 광진구 아차산로 ***, *동 ****호 (광장동, 광장극동아파트)04971쨈앤빠다2019-07-05 08:12:05I2019-07-07 02:21:30.0의류/패션/잡화/뷰티209138.047265448993.925427<NA><NA><NA>인터넷
16975304000020213040190302004292021-02-09<NA>3폐업3폐업처리2023-05-08<NA><NA><NA><NA><NA><NA>서울특별시 광진구 중곡동 ***-* 바론채 ***호서울특별시 광진구 면목로 ***, ***호 (중곡동, 바론채)04911페레코코2023-05-08 13:28:17U2022-12-04 23:00:00.0종합몰207003.769645451311.662974<NA><NA><NA><NA>
156193040000202030401903020165520200814<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 중곡동 ***-** *층 ***호서울특별시 광진구 천호대로***길 **, *층 ***호 (중곡동)04920사소한잡화점2020-08-14 09:54:12I2020-08-16 00:23:14.0종합몰206656.706098450828.189723<NA><NA><NA>인터넷
177223040000202130401903020120820210520<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 군자동 ***-* 레이나 B서울특별시 광진구 능동로**길 **-**, ***호 (군자동, 레이나 B)04998부티크인서울2021-05-25 11:15:57I2021-05-27 00:22:55.0종합몰206606.797852450010.664285<NA><NA><NA>인터넷
15143040000200630401033020244920051214<NA>3폐업3폐업처리20160322<NA><NA><NA><NA><NA><NA>서울시 광진구 중곡동***-*<NA><NA>파인드폼2016-03-22 16:12:16I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
62553040000201130401553020046320111213<NA>3폐업3폐업처리20120327<NA><NA><NA>02-3424-3123<NA>143200서울특별시 광진구 구의동 ***번지 *호 테크노마트 *층 A-**,**서울특별시 광진구 광나루로**길 **, *층 A-**,**호 (구의동, 테크노마트)143721현대상사2012-03-28 13:23:00I2018-08-31 23:59:59.0가전208394.416382448165.28<NA><NA><NA>인터넷
183303040000202130401903020182920210817<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 군자동 ***-**서울특별시 광진구 군자로**길 **-*, ***호 (군자동)05005캠팍스2021-08-19 11:17:32I2021-08-21 00:22:50.0의류/패션/잡화/뷰티206298.038516449690.601286000인터넷
14320304000020203040190302002502017-04-06<NA>5제외/삭제/전출5타시군구이관2023-06-20<NA><NA><NA><NA><NA><NA>서울특별시 광진구 구의동 ***번지 **호 *층 ***호서울특별시 광진구 자양로 ***-**, *층 ***호 (구의동)05038낭만그라피2023-06-20 09:20:31U2022-12-05 22:02:00.0기타207459.965012449001.544191<NA><NA><NA><NA>
167493040000202130401903020018920210118<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 광진구 중곡동 ***-* ***호서울특별시 광진구 면목로 ***, ***호 (중곡동)04902미르 컴퍼니2021-01-18 13:57:51I2021-01-20 00:23:04.0종합몰207274.528489452076.858609<NA><NA><NA>인터넷