Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells76287
Missing cells (%)26.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory250.0 B

Variable types

Numeric5
DateTime8
Categorical8
Text7
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-16100/S/1/datasetView.do

Alerts

자산규모 is highly imbalanced (96.2%)Imbalance
부채총액 is highly imbalanced (96.2%)Imbalance
자본금 is highly imbalanced (96.2%)Imbalance
판매방식명 is highly imbalanced (99.3%)Imbalance
인허가취소일자 has 9959 (99.6%) missing valuesMissing
폐업일자 has 7344 (73.4%) missing valuesMissing
휴업시작일자 has 9967 (99.7%) missing valuesMissing
휴업종료일자 has 9967 (99.7%) missing valuesMissing
재개업일자 has 9983 (99.8%) missing valuesMissing
전화번호 has 8371 (83.7%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 9998 (> 99.9%) missing valuesMissing
도로명우편번호 has 174 (1.7%) missing valuesMissing
좌표정보(X) has 173 (1.7%) missing valuesMissing
좌표정보(Y) has 173 (1.7%) missing valuesMissing
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 03:21:44.776767
Analysis finished2024-05-11 03:21:50.506036
Duration5.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3135979
Minimum3000000
Maximum3240000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T03:21:50.742507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000000
5-th percentile3010000
Q13070000
median3150000
Q33210000
95-th percentile3230000
Maximum3240000
Range240000
Interquartile range (IQR)140000

Descriptive statistics

Standard deviation73292.965
Coefficient of variation (CV)0.023371638
Kurtosis-1.1477802
Mean3135979
Median Absolute Deviation (MAD)60000
Skewness-0.32214279
Sum3.135979 × 1010
Variance5.3718587 × 109
MonotonicityNot monotonic
2024-05-11T03:21:51.285572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
3220000 1001
 
10.0%
3130000 611
 
6.1%
3230000 595
 
5.9%
3210000 572
 
5.7%
3150000 565
 
5.7%
3120000 506
 
5.1%
3170000 496
 
5.0%
3180000 457
 
4.6%
3160000 414
 
4.1%
3010000 392
 
3.9%
Other values (15) 4391
43.9%
ValueCountFrequency (%)
3000000 283
2.8%
3010000 392
3.9%
3020000 320
3.2%
3030000 376
3.8%
3040000 332
3.3%
3050000 306
3.1%
3060000 318
3.2%
3070000 274
2.7%
3080000 188
1.9%
3090000 211
2.1%
ValueCountFrequency (%)
3240000 385
 
3.9%
3230000 595
5.9%
3220000 1001
10.0%
3210000 572
5.7%
3200000 381
 
3.8%
3190000 220
 
2.2%
3180000 457
4.6%
3170000 496
5.0%
3160000 414
4.1%
3150000 565
5.7%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0219947 × 1018
Minimum1.998322 × 1018
Maximum2.032304 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T03:21:51.795810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.998322 × 1018
5-th percentile2.016315 × 1018
Q12.0223 × 1018
median2.023307 × 1018
Q32.023318 × 1018
95-th percentile2.023324 × 1018
Maximum2.032304 × 1018
Range3.3982011 × 1016
Interquartile range (IQR)1.0180052 × 1015

Descriptive statistics

Standard deviation3.0778216 × 1015
Coefficient of variation (CV)0.001522171
Kurtosis13.825312
Mean2.0219947 × 1018
Median Absolute Deviation (MAD)1.4995 × 1013
Skewness-3.4352261
Sum2.3156841 × 1018
Variance9.472986 × 1030
MonotonicityNot monotonic
2024-05-11T03:21:52.274713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2023323029130200364 1
 
< 0.1%
2013301013030200165 1
 
< 0.1%
2023305021030201772 1
 
< 0.1%
2023316015930202259 1
 
< 0.1%
2021324023630201961 1
 
< 0.1%
2023316015930200566 1
 
< 0.1%
2020321015330204009 1
 
< 0.1%
2023323029130201376 1
 
< 0.1%
2023321015330201196 1
 
< 0.1%
2023314016730200724 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1998322008330201109 1
< 0.1%
2000301010030200041 1
< 0.1%
2001301010030200011 1
< 0.1%
2002301010030200160 1
< 0.1%
2002301010030200453 1
< 0.1%
2002301010030200607 1
< 0.1%
2002301010030200671 1
< 0.1%
2002301010030200733 1
< 0.1%
2002301010030200793 1
< 0.1%
2002301010030200849 1
< 0.1%
ValueCountFrequency (%)
2032304019030200007 1
< 0.1%
2032304019030200006 1
< 0.1%
2024324028930200729 1
< 0.1%
2024324028930200727 1
< 0.1%
2024324028930200725 1
< 0.1%
2024324028930200721 1
< 0.1%
2024324028930200720 1
< 0.1%
2024324028930200429 1
< 0.1%
2024324028930200428 1
< 0.1%
2024324028930200425 1
< 0.1%
Distinct2207
Distinct (%)22.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1998-01-23 00:00:00
Maximum2024-05-09 00:00:00
2024-05-11T03:21:52.847470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:21:53.459173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Date

MISSING 

Distinct7
Distinct (%)17.1%
Missing9959
Missing (%)99.6%
Memory size156.2 KiB
Minimum2008-06-03 00:00:00
Maximum2023-04-10 00:00:00
2024-05-11T03:21:53.916620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:21:54.324779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
6871 
3
1604 
5
1054 
4
 
439
2
 
32

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row4
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 6871
68.7%
3 1604
 
16.0%
5 1054
 
10.5%
4 439
 
4.4%
2 32
 
0.3%

Length

2024-05-11T03:21:54.732793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:21:55.094332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6871
68.7%
3 1604
 
16.0%
5 1054
 
10.5%
4 439
 
4.4%
2 32
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
6871 
폐업
1604 
제외/삭제/전출
1054 
취소/말소/만료/정지/중지
 
439
휴업
 
32

Length

Max length14
Median length5
Mean length5.2205
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row취소/말소/만료/정지/중지
3rd row영업/정상
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 6871
68.7%
폐업 1604
 
16.0%
제외/삭제/전출 1054
 
10.5%
취소/말소/만료/정지/중지 439
 
4.4%
휴업 32
 
0.3%

Length

2024-05-11T03:21:55.594827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:21:56.019496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 6871
68.7%
폐업 1604
 
16.0%
제외/삭제/전출 1054
 
10.5%
취소/말소/만료/정지/중지 439
 
4.4%
휴업 32
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9958
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T03:21:56.508086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile5
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.673757
Coefficient of variation (CV)0.83863964
Kurtosis1.3682072
Mean1.9958
Median Absolute Deviation (MAD)0
Skewness1.5630933
Sum19958
Variance2.8014625
MonotonicityNot monotonic
2024-05-11T03:21:56.877685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 6871
68.7%
3 1604
 
16.0%
5 1054
 
10.5%
7 395
 
4.0%
4 44
 
0.4%
2 32
 
0.3%
ValueCountFrequency (%)
1 6871
68.7%
2 32
 
0.3%
3 1604
 
16.0%
4 44
 
0.4%
5 1054
 
10.5%
7 395
 
4.0%
ValueCountFrequency (%)
7 395
 
4.0%
5 1054
 
10.5%
4 44
 
0.4%
3 1604
 
16.0%
2 32
 
0.3%
1 6871
68.7%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
6871 
폐업처리
1604 
타시군구이관
1054 
직권말소
 
395
직권취소
 
44

Length

Max length6
Median length4
Mean length4.2108
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업처리
2nd row직권말소
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 6871
68.7%
폐업처리 1604
 
16.0%
타시군구이관 1054
 
10.5%
직권말소 395
 
4.0%
직권취소 44
 
0.4%
휴업처리 32
 
0.3%

Length

2024-05-11T03:21:57.334018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:21:57.706330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 6871
68.7%
폐업처리 1604
 
16.0%
타시군구이관 1054
 
10.5%
직권말소 395
 
4.0%
직권취소 44
 
0.4%
휴업처리 32
 
0.3%

폐업일자
Date

MISSING 

Distinct372
Distinct (%)14.0%
Missing7344
Missing (%)73.4%
Memory size156.2 KiB
Minimum2006-06-08 00:00:00
Maximum2024-05-08 00:00:00
2024-05-11T03:21:58.077468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:21:58.507180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct31
Distinct (%)93.9%
Missing9967
Missing (%)99.7%
Memory size156.2 KiB
Minimum2022-04-19 00:00:00
Maximum2024-04-17 00:00:00
2024-05-11T03:21:58.885706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:21:59.271885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)

휴업종료일자
Date

MISSING 

Distinct30
Distinct (%)90.9%
Missing9967
Missing (%)99.7%
Memory size156.2 KiB
Minimum2023-04-18 00:00:00
Maximum2099-12-31 00:00:00
2024-05-11T03:21:59.662172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:22:00.121912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)

재개업일자
Date

MISSING 

Distinct16
Distinct (%)94.1%
Missing9983
Missing (%)99.8%
Memory size156.2 KiB
Minimum2023-03-07 00:00:00
Maximum2024-05-08 00:00:00
2024-05-11T03:22:00.479086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:22:00.861655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)

전화번호
Text

MISSING 

Distinct1568
Distinct (%)96.3%
Missing8371
Missing (%)83.7%
Memory size156.2 KiB
2024-05-11T03:22:01.402797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length12.027624
Min length1

Characters and Unicode

Total characters19593
Distinct characters16
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1508 ?
Unique (%)92.6%

Sample

1st row02-2004-8213
2nd row02-1600-2018
3rd row02-2269-8923
4th row070-7706-1709
5th row070-4327-1816
ValueCountFrequency (%)
327
 
13.3%
02 188
 
7.6%
070 65
 
2.6%
050 4
 
0.2%
7954 4
 
0.2%
02-444-9004 3
 
0.1%
5006 3
 
0.1%
775 3
 
0.1%
518 3
 
0.1%
3398 3
 
0.1%
Other values (1774) 1862
75.5%
2024-05-11T03:22:02.589303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3299
16.8%
- 2846
14.5%
2 2506
12.8%
7 1641
8.4%
4 1257
 
6.4%
5 1256
 
6.4%
3 1236
 
6.3%
8 1233
 
6.3%
1 1202
 
6.1%
6 1169
 
6.0%
Other values (6) 1948
9.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15771
80.5%
Dash Punctuation 2846
 
14.5%
Space Separator 951
 
4.9%
Other Punctuation 17
 
0.1%
Close Punctuation 6
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3299
20.9%
2 2506
15.9%
7 1641
10.4%
4 1257
 
8.0%
5 1256
 
8.0%
3 1236
 
7.8%
8 1233
 
7.8%
1 1202
 
7.6%
6 1169
 
7.4%
9 972
 
6.2%
Other Punctuation
ValueCountFrequency (%)
. 13
76.5%
/ 4
 
23.5%
Dash Punctuation
ValueCountFrequency (%)
- 2846
100.0%
Space Separator
ValueCountFrequency (%)
951
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19593
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3299
16.8%
- 2846
14.5%
2 2506
12.8%
7 1641
8.4%
4 1257
 
6.4%
5 1256
 
6.4%
3 1236
 
6.3%
8 1233
 
6.3%
1 1202
 
6.1%
6 1169
 
6.0%
Other values (6) 1948
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19593
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3299
16.8%
- 2846
14.5%
2 2506
12.8%
7 1641
8.4%
4 1257
 
6.4%
5 1256
 
6.4%
3 1236
 
6.3%
8 1233
 
6.3%
1 1202
 
6.1%
6 1169
 
6.0%
Other values (6) 1948
9.9%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing9998
Missing (%)> 99.9%
Memory size156.2 KiB
2024-05-11T03:22:02.912371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters14
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row100-746
2nd row135-936
ValueCountFrequency (%)
100-746 1
50.0%
135-936 1
50.0%
2024-05-11T03:22:03.702791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2
14.3%
0 2
14.3%
- 2
14.3%
6 2
14.3%
3 2
14.3%
7 1
7.1%
4 1
7.1%
5 1
7.1%
9 1
7.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12
85.7%
Dash Punctuation 2
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2
16.7%
0 2
16.7%
6 2
16.7%
3 2
16.7%
7 1
8.3%
4 1
8.3%
5 1
8.3%
9 1
8.3%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2
14.3%
0 2
14.3%
- 2
14.3%
6 2
14.3%
3 2
14.3%
7 1
7.1%
4 1
7.1%
5 1
7.1%
9 1
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2
14.3%
0 2
14.3%
- 2
14.3%
6 2
14.3%
3 2
14.3%
7 1
7.1%
4 1
7.1%
5 1
7.1%
9 1
7.1%
Distinct5566
Distinct (%)56.1%
Missing80
Missing (%)0.8%
Memory size156.2 KiB
2024-05-11T03:22:04.350831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length46
Mean length24.739214
Min length14

Characters and Unicode

Total characters245413
Distinct characters641
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4214 ?
Unique (%)42.5%

Sample

1st row서울특별시 송파구 삼전동 ***-* 서이빌
2nd row서울특별시 금천구 가산동 ***번지 **호
3rd row서울특별시 종로구 종로*가 ***-* 동대문종합시장
4th row서울특별시 강남구 역삼동 ***-** 강남역두산위브센티움
5th row서울특별시 종로구 운니동 **번지 *호 월드오피스텔 ***호
ValueCountFrequency (%)
서울특별시 9914
20.2%
9050
 
18.5%
1322
 
2.7%
강남구 994
 
2.0%
번지 918
 
1.9%
마포구 607
 
1.2%
송파구 594
 
1.2%
서초구 568
 
1.2%
강서구 556
 
1.1%
서대문구 504
 
1.0%
Other values (4344) 24013
49.0%
2024-05-11T03:22:05.493018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 45524
18.5%
39353
16.0%
12103
 
4.9%
11846
 
4.8%
10740
 
4.4%
10282
 
4.2%
10022
 
4.1%
9918
 
4.0%
9914
 
4.0%
- 6719
 
2.7%
Other values (631) 78992
32.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 152121
62.0%
Other Punctuation 45591
 
18.6%
Space Separator 39353
 
16.0%
Dash Punctuation 6719
 
2.7%
Uppercase Letter 1124
 
0.5%
Lowercase Letter 334
 
0.1%
Open Punctuation 51
 
< 0.1%
Close Punctuation 51
 
< 0.1%
Letter Number 47
 
< 0.1%
Decimal Number 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12103
 
8.0%
11846
 
7.8%
10740
 
7.1%
10282
 
6.8%
10022
 
6.6%
9918
 
6.5%
9914
 
6.5%
2392
 
1.6%
2252
 
1.5%
1809
 
1.2%
Other values (555) 70843
46.6%
Uppercase Letter
ValueCountFrequency (%)
S 121
 
10.8%
K 105
 
9.3%
A 88
 
7.8%
T 70
 
6.2%
C 68
 
6.0%
E 64
 
5.7%
B 58
 
5.2%
I 55
 
4.9%
D 51
 
4.5%
M 47
 
4.2%
Other values (15) 397
35.3%
Lowercase Letter
ValueCountFrequency (%)
e 101
30.2%
n 39
 
11.7%
r 32
 
9.6%
t 31
 
9.3%
c 30
 
9.0%
a 15
 
4.5%
l 14
 
4.2%
s 14
 
4.2%
i 12
 
3.6%
o 8
 
2.4%
Other values (12) 38
 
11.4%
Decimal Number
ValueCountFrequency (%)
1 4
21.1%
2 3
15.8%
3 3
15.8%
7 3
15.8%
4 2
10.5%
6 1
 
5.3%
9 1
 
5.3%
8 1
 
5.3%
5 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
* 45524
99.9%
, 41
 
0.1%
. 13
 
< 0.1%
& 7
 
< 0.1%
? 2
 
< 0.1%
/ 2
 
< 0.1%
@ 1
 
< 0.1%
' 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
24
51.1%
16
34.0%
6
 
12.8%
1
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 49
96.1%
[ 2
 
3.9%
Close Punctuation
ValueCountFrequency (%)
) 49
96.1%
] 2
 
3.9%
Math Symbol
ValueCountFrequency (%)
~ 2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
39353
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6719
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 152121
62.0%
Common 91787
37.4%
Latin 1505
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12103
 
8.0%
11846
 
7.8%
10740
 
7.1%
10282
 
6.8%
10022
 
6.6%
9918
 
6.5%
9914
 
6.5%
2392
 
1.6%
2252
 
1.5%
1809
 
1.2%
Other values (555) 70843
46.6%
Latin
ValueCountFrequency (%)
S 121
 
8.0%
K 105
 
7.0%
e 101
 
6.7%
A 88
 
5.8%
T 70
 
4.7%
C 68
 
4.5%
E 64
 
4.3%
B 58
 
3.9%
I 55
 
3.7%
D 51
 
3.4%
Other values (41) 724
48.1%
Common
ValueCountFrequency (%)
* 45524
49.6%
39353
42.9%
- 6719
 
7.3%
( 49
 
0.1%
) 49
 
0.1%
, 41
 
< 0.1%
. 13
 
< 0.1%
& 7
 
< 0.1%
1 4
 
< 0.1%
2 3
 
< 0.1%
Other values (15) 25
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 152121
62.0%
ASCII 93244
38.0%
Number Forms 47
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 45524
48.8%
39353
42.2%
- 6719
 
7.2%
S 121
 
0.1%
K 105
 
0.1%
e 101
 
0.1%
A 88
 
0.1%
T 70
 
0.1%
C 68
 
0.1%
E 64
 
0.1%
Other values (61) 1031
 
1.1%
Hangul
ValueCountFrequency (%)
12103
 
8.0%
11846
 
7.8%
10740
 
7.1%
10282
 
6.8%
10022
 
6.6%
9918
 
6.5%
9914
 
6.5%
2392
 
1.6%
2252
 
1.5%
1809
 
1.2%
Other values (555) 70843
46.6%
Number Forms
ValueCountFrequency (%)
24
51.1%
16
34.0%
6
 
12.8%
1
 
2.1%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct8543
Distinct (%)86.3%
Missing98
Missing (%)1.0%
Memory size156.2 KiB
2024-05-11T03:22:06.337638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length57
Mean length38.530499
Min length21

Characters and Unicode

Total characters381529
Distinct characters699
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7719 ?
Unique (%)78.0%

Sample

1st row서울특별시 송파구 백제고분로**길 **-*, ***호 (삼전동, 서이빌)
2nd row서울특별시 금천구 가산디지털*로 ***, A동 ***-*호 (가산동, 우림라이온스밸리)
3rd row서울특별시 종로구 종로 ***, 동대문종합시장 B동 *층 B-**호 (종로*가)
4th row서울특별시 강남구 테헤란로**길 *, 강남역두산위브센티움 ***호 (역삼동)
5th row서울특별시 종로구 율곡로*길 **, ***호 (운니동, 월드오피스텔)
ValueCountFrequency (%)
서울특별시 9900
 
13.7%
9878
 
13.7%
6912
 
9.6%
4003
 
5.6%
2003
 
2.8%
강남구 996
 
1.4%
마포구 606
 
0.8%
송파구 595
 
0.8%
서초구 570
 
0.8%
강서구 561
 
0.8%
Other values (6683) 36075
50.0%
2024-05-11T03:22:08.367366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 73296
19.2%
62292
16.3%
14853
 
3.9%
, 12814
 
3.4%
12568
 
3.3%
10858
 
2.8%
10785
 
2.8%
10398
 
2.7%
10056
 
2.6%
( 9989
 
2.6%
Other values (689) 153620
40.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 208077
54.5%
Other Punctuation 86146
22.6%
Space Separator 62292
 
16.3%
Open Punctuation 9991
 
2.6%
Close Punctuation 9991
 
2.6%
Dash Punctuation 2497
 
0.7%
Uppercase Letter 1968
 
0.5%
Lowercase Letter 467
 
0.1%
Letter Number 43
 
< 0.1%
Math Symbol 28
 
< 0.1%
Other values (2) 29
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14853
 
7.1%
12568
 
6.0%
10858
 
5.2%
10785
 
5.2%
10398
 
5.0%
10056
 
4.8%
9906
 
4.8%
9900
 
4.8%
8157
 
3.9%
6084
 
2.9%
Other values (609) 104512
50.2%
Uppercase Letter
ValueCountFrequency (%)
B 428
21.7%
A 339
17.2%
C 137
 
7.0%
S 134
 
6.8%
K 108
 
5.5%
T 81
 
4.1%
E 79
 
4.0%
D 73
 
3.7%
I 62
 
3.2%
R 60
 
3.0%
Other values (15) 467
23.7%
Lowercase Letter
ValueCountFrequency (%)
e 116
24.8%
n 47
10.1%
b 41
 
8.8%
t 40
 
8.6%
c 40
 
8.6%
r 34
 
7.3%
a 25
 
5.4%
s 18
 
3.9%
i 17
 
3.6%
l 14
 
3.0%
Other values (13) 75
16.1%
Decimal Number
ValueCountFrequency (%)
2 5
17.9%
1 5
17.9%
3 4
14.3%
8 3
10.7%
7 3
10.7%
6 3
10.7%
0 2
 
7.1%
9 1
 
3.6%
4 1
 
3.6%
5 1
 
3.6%
Other Punctuation
ValueCountFrequency (%)
* 73296
85.1%
, 12814
 
14.9%
. 16
 
< 0.1%
& 9
 
< 0.1%
/ 6
 
< 0.1%
? 2
 
< 0.1%
' 1
 
< 0.1%
: 1
 
< 0.1%
# 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
23
53.5%
14
32.6%
5
 
11.6%
1
 
2.3%
Open Punctuation
ValueCountFrequency (%)
( 9989
> 99.9%
[ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 9989
> 99.9%
] 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 27
96.4%
1
 
3.6%
Space Separator
ValueCountFrequency (%)
62292
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2497
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 208077
54.5%
Common 170974
44.8%
Latin 2478
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14853
 
7.1%
12568
 
6.0%
10858
 
5.2%
10785
 
5.2%
10398
 
5.0%
10056
 
4.8%
9906
 
4.8%
9900
 
4.8%
8157
 
3.9%
6084
 
2.9%
Other values (609) 104512
50.2%
Latin
ValueCountFrequency (%)
B 428
17.3%
A 339
 
13.7%
C 137
 
5.5%
S 134
 
5.4%
e 116
 
4.7%
K 108
 
4.4%
T 81
 
3.3%
E 79
 
3.2%
D 73
 
2.9%
I 62
 
2.5%
Other values (42) 921
37.2%
Common
ValueCountFrequency (%)
* 73296
42.9%
62292
36.4%
, 12814
 
7.5%
( 9989
 
5.8%
) 9989
 
5.8%
- 2497
 
1.5%
~ 27
 
< 0.1%
. 16
 
< 0.1%
& 9
 
< 0.1%
/ 6
 
< 0.1%
Other values (18) 39
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 208076
54.5%
ASCII 173408
45.5%
Number Forms 43
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 73296
42.3%
62292
35.9%
, 12814
 
7.4%
( 9989
 
5.8%
) 9989
 
5.8%
- 2497
 
1.4%
B 428
 
0.2%
A 339
 
0.2%
C 137
 
0.1%
S 134
 
0.1%
Other values (65) 1493
 
0.9%
Hangul
ValueCountFrequency (%)
14853
 
7.1%
12568
 
6.0%
10858
 
5.2%
10785
 
5.2%
10398
 
5.0%
10056
 
4.8%
9906
 
4.8%
9900
 
4.8%
8157
 
3.9%
6084
 
2.9%
Other values (608) 104511
50.2%
Number Forms
ValueCountFrequency (%)
23
53.5%
14
32.6%
5
 
11.6%
1
 
2.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct3778
Distinct (%)38.4%
Missing174
Missing (%)1.7%
Memory size156.2 KiB
2024-05-11T03:22:10.031369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0216772
Min length5

Characters and Unicode

Total characters49343
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1622 ?
Unique (%)16.5%

Sample

1st row05587
2nd row153-786
3rd row03198
4th row06234
5th row03131
ValueCountFrequency (%)
03785 58
 
0.6%
06083 50
 
0.5%
08503 38
 
0.4%
06037 35
 
0.4%
06159 33
 
0.3%
07256 32
 
0.3%
05854 31
 
0.3%
04057 31
 
0.3%
06178 29
 
0.3%
06061 24
 
0.2%
Other values (3768) 9465
96.3%
2024-05-11T03:22:12.239856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 13164
26.7%
7 4700
 
9.5%
6 4600
 
9.3%
5 4552
 
9.2%
3 4398
 
8.9%
4 3975
 
8.1%
8 3903
 
7.9%
2 3859
 
7.8%
1 3773
 
7.6%
9 2318
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 49242
99.8%
Dash Punctuation 101
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 13164
26.7%
7 4700
 
9.5%
6 4600
 
9.3%
5 4552
 
9.2%
3 4398
 
8.9%
4 3975
 
8.1%
8 3903
 
7.9%
2 3859
 
7.8%
1 3773
 
7.7%
9 2318
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 101
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 49343
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 13164
26.7%
7 4700
 
9.5%
6 4600
 
9.3%
5 4552
 
9.2%
3 4398
 
8.9%
4 3975
 
8.1%
8 3903
 
7.9%
2 3859
 
7.8%
1 3773
 
7.6%
9 2318
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49343
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 13164
26.7%
7 4700
 
9.5%
6 4600
 
9.3%
5 4552
 
9.2%
3 4398
 
8.9%
4 3975
 
8.1%
8 3903
 
7.9%
2 3859
 
7.8%
1 3773
 
7.6%
9 2318
 
4.7%
Distinct9803
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T03:22:12.989902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length41
Mean length7.4618
Min length1

Characters and Unicode

Total characters74618
Distinct characters1119
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9612 ?
Unique (%)96.1%

Sample

1st row오늘의모든것
2nd row하게
3rd row원스냅
4th row유한회사 에이원인터네셔날(A-ONE International Ltd.)
5th row주식회사 현대불교신문사
ValueCountFrequency (%)
주식회사 1301
 
9.5%
107
 
0.8%
인셀덤 77
 
0.6%
유한회사 56
 
0.4%
inc 46
 
0.3%
스튜디오 44
 
0.3%
co 38
 
0.3%
컴퍼니 37
 
0.3%
ltd 36
 
0.3%
company 33
 
0.2%
Other values (11043) 11974
87.1%
2024-05-11T03:22:14.385755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3758
 
5.0%
2720
 
3.6%
2112
 
2.8%
( 1962
 
2.6%
) 1962
 
2.6%
1894
 
2.5%
1733
 
2.3%
1496
 
2.0%
1394
 
1.9%
1203
 
1.6%
Other values (1109) 54384
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53110
71.2%
Lowercase Letter 7238
 
9.7%
Uppercase Letter 5428
 
7.3%
Space Separator 3758
 
5.0%
Open Punctuation 1965
 
2.6%
Close Punctuation 1965
 
2.6%
Decimal Number 541
 
0.7%
Other Punctuation 474
 
0.6%
Other Symbol 59
 
0.1%
Dash Punctuation 57
 
0.1%
Other values (3) 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2720
 
5.1%
2112
 
4.0%
1894
 
3.6%
1733
 
3.3%
1496
 
2.8%
1394
 
2.6%
1203
 
2.3%
873
 
1.6%
825
 
1.6%
762
 
1.4%
Other values (1028) 38098
71.7%
Lowercase Letter
ValueCountFrequency (%)
e 855
11.8%
o 772
10.7%
a 674
 
9.3%
n 566
 
7.8%
i 502
 
6.9%
r 473
 
6.5%
t 462
 
6.4%
l 392
 
5.4%
s 340
 
4.7%
d 275
 
3.8%
Other values (16) 1927
26.6%
Uppercase Letter
ValueCountFrequency (%)
A 404
 
7.4%
E 404
 
7.4%
O 396
 
7.3%
I 365
 
6.7%
L 334
 
6.2%
N 321
 
5.9%
T 306
 
5.6%
C 303
 
5.6%
S 300
 
5.5%
M 286
 
5.3%
Other values (16) 2009
37.0%
Decimal Number
ValueCountFrequency (%)
1 93
17.2%
2 93
17.2%
0 75
13.9%
3 74
13.7%
4 42
7.8%
5 42
7.8%
9 35
 
6.5%
6 34
 
6.3%
7 28
 
5.2%
8 25
 
4.6%
Other Punctuation
ValueCountFrequency (%)
. 263
55.5%
, 87
 
18.4%
& 74
 
15.6%
' 21
 
4.4%
? 12
 
2.5%
: 10
 
2.1%
/ 3
 
0.6%
# 2
 
0.4%
! 2
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 1962
99.8%
[ 3
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1962
99.8%
] 3
 
0.2%
Space Separator
ValueCountFrequency (%)
3758
100.0%
Other Symbol
ValueCountFrequency (%)
59
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 19
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53155
71.2%
Latin 12666
 
17.0%
Common 8783
 
11.8%
Han 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2720
 
5.1%
2112
 
4.0%
1894
 
3.6%
1733
 
3.3%
1496
 
2.8%
1394
 
2.6%
1203
 
2.3%
873
 
1.6%
825
 
1.6%
762
 
1.4%
Other values (1017) 38143
71.8%
Latin
ValueCountFrequency (%)
e 855
 
6.8%
o 772
 
6.1%
a 674
 
5.3%
n 566
 
4.5%
i 502
 
4.0%
r 473
 
3.7%
t 462
 
3.6%
A 404
 
3.2%
E 404
 
3.2%
O 396
 
3.1%
Other values (42) 7158
56.5%
Common
ValueCountFrequency (%)
3758
42.8%
( 1962
22.3%
) 1962
22.3%
. 263
 
3.0%
1 93
 
1.1%
2 93
 
1.1%
, 87
 
1.0%
0 75
 
0.9%
& 74
 
0.8%
3 74
 
0.8%
Other values (18) 342
 
3.9%
Han
ValueCountFrequency (%)
2
14.3%
2
14.3%
貿 1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53093
71.2%
ASCII 21448
28.7%
None 59
 
0.1%
CJK 14
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3758
 
17.5%
( 1962
 
9.1%
) 1962
 
9.1%
e 855
 
4.0%
o 772
 
3.6%
a 674
 
3.1%
n 566
 
2.6%
i 502
 
2.3%
r 473
 
2.2%
t 462
 
2.2%
Other values (69) 9462
44.1%
Hangul
ValueCountFrequency (%)
2720
 
5.1%
2112
 
4.0%
1894
 
3.6%
1733
 
3.3%
1496
 
2.8%
1394
 
2.6%
1203
 
2.3%
873
 
1.6%
825
 
1.6%
762
 
1.4%
Other values (1014) 38081
71.7%
None
ValueCountFrequency (%)
59
100.0%
CJK
ValueCountFrequency (%)
2
14.3%
2
14.3%
貿 1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct9973
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-11-05 11:23:30
Maximum2024-05-09 13:59:35
2024-05-11T03:22:14.961999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:22:15.441817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
5644 
U
4356 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowU
3rd rowI
4th rowI
5th rowU

Common Values

ValueCountFrequency (%)
I 5644
56.4%
U 4356
43.6%

Length

2024-05-11T03:22:15.949085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:22:16.273036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 5644
56.4%
u 4356
43.6%
Distinct390
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T03:22:16.859305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T03:22:17.340738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct560
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-11T03:22:17.749971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length9.6357
Min length1

Characters and Unicode

Total characters96357
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique348 ?
Unique (%)3.5%

Sample

1st row종합몰
2nd row기타
3rd row의류/패션/잡화/뷰티
4th row의류/패션/잡화/뷰티 레져/여행/공연
5th row종합몰
ValueCountFrequency (%)
종합몰 4834
31.8%
의류/패션/잡화/뷰티 3384
22.2%
기타 1807
 
11.9%
건강/식품 1200
 
7.9%
교육/도서/완구/오락 920
 
6.0%
가구/수납용품 715
 
4.7%
컴퓨터/사무용품 690
 
4.5%
가전 550
 
3.6%
레져/여행/공연 448
 
2.9%
자동차/자동차용품 376
 
2.5%
Other values (3) 286
 
1.9%
2024-05-11T03:22:18.956754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16881
 
17.5%
5210
 
5.4%
4834
 
5.0%
4834
 
5.0%
4834
 
5.0%
3384
 
3.5%
3384
 
3.5%
3384
 
3.5%
3384
 
3.5%
3384
 
3.5%
Other values (41) 42844
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74200
77.0%
Other Punctuation 16881
 
17.5%
Space Separator 5210
 
5.4%
Dash Punctuation 66
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4834
 
6.5%
4834
 
6.5%
4834
 
6.5%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
Other values (38) 36010
48.5%
Other Punctuation
ValueCountFrequency (%)
/ 16881
100.0%
Space Separator
ValueCountFrequency (%)
5210
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74200
77.0%
Common 22157
 
23.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4834
 
6.5%
4834
 
6.5%
4834
 
6.5%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
Other values (38) 36010
48.5%
Common
ValueCountFrequency (%)
/ 16881
76.2%
5210
 
23.5%
- 66
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74200
77.0%
ASCII 22157
 
23.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16881
76.2%
5210
 
23.5%
- 66
 
0.3%
Hangul
ValueCountFrequency (%)
4834
 
6.5%
4834
 
6.5%
4834
 
6.5%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
3384
 
4.6%
Other values (38) 36010
48.5%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct7180
Distinct (%)73.1%
Missing173
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean198848.49
Minimum169420.7
Maximum215966.73
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T03:22:19.533191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum169420.7
5-th percentile186331.24
Q1192411.78
median200284.08
Q3204691.65
95-th percentile211069.04
Maximum215966.73
Range46546.032
Interquartile range (IQR)12279.874

Descriptive statistics

Standard deviation7569.4108
Coefficient of variation (CV)0.038066222
Kurtosis-0.96997639
Mean198848.49
Median Absolute Deviation (MAD)6302.3387
Skewness-0.07973631
Sum1.9540841 × 109
Variance57295980
MonotonicityNot monotonic
2024-05-11T03:22:20.069246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
193712.476983727 56
 
0.6%
205271.275273064 48
 
0.5%
202124.262977683 33
 
0.3%
190586.062540107 31
 
0.3%
193870.417173466 26
 
0.3%
205079.418138107 25
 
0.2%
204691.655 25
 
0.2%
208295.099818379 23
 
0.2%
200949.551476636 22
 
0.2%
212631.80093407 22
 
0.2%
Other values (7170) 9516
95.2%
(Missing) 173
 
1.7%
ValueCountFrequency (%)
169420.696936678 1
< 0.1%
180824.823988101 1
< 0.1%
181881.234660314 1
< 0.1%
182086.388313239 1
< 0.1%
182141.205465089 1
< 0.1%
182443.109448908 1
< 0.1%
182742.537963411 1
< 0.1%
182857.528466005 1
< 0.1%
182859.059692916 1
< 0.1%
182872.579435573 1
< 0.1%
ValueCountFrequency (%)
215966.729045494 2
< 0.1%
215901.594590118 1
 
< 0.1%
215898.113091143 2
< 0.1%
215523.83400078 2
< 0.1%
215505.695374616 2
< 0.1%
215496.149626989 2
< 0.1%
215422.746119624 1
 
< 0.1%
215366.063352066 4
< 0.1%
215278.711932608 1
 
< 0.1%
215264.585458612 1
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct7180
Distinct (%)73.1%
Missing173
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean448787.74
Minimum436903.24
Maximum465025.25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-11T03:22:20.659173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum436903.24
5-th percentile441447.3
Q1444432.96
median448741.81
Q3451897.58
95-th percentile458962.16
Maximum465025.25
Range28122.006
Interquartile range (IQR)7464.6202

Descriptive statistics

Standard deviation5350.0256
Coefficient of variation (CV)0.01192106
Kurtosis-0.16608943
Mean448787.74
Median Absolute Deviation (MAD)3825.6135
Skewness0.51490042
Sum4.4102372 × 109
Variance28622774
MonotonicityNot monotonic
2024-05-11T03:22:21.156939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
450637.010875398 56
 
0.6%
445852.638814088 48
 
0.5%
446374.647464072 33
 
0.3%
447216.925498911 31
 
0.3%
450441.587738525 26
 
0.3%
444869.834721154 25
 
0.2%
444968.62 25
 
0.2%
456014.999180519 23
 
0.2%
443483.146383375 22
 
0.2%
449475.1682321 22
 
0.2%
Other values (7170) 9516
95.2%
(Missing) 173
 
1.7%
ValueCountFrequency (%)
436903.240679787 1
< 0.1%
437062.738869505 1
< 0.1%
437085.180151344 1
< 0.1%
437099.172901614 1
< 0.1%
437554.475708821 1
< 0.1%
437736.112150933 1
< 0.1%
437769.972920132 1
< 0.1%
437799.809089881 1
< 0.1%
437831.130359287 1
< 0.1%
437837.919221179 1
< 0.1%
ValueCountFrequency (%)
465025.246646886 1
 
< 0.1%
464909.633026206 1
 
< 0.1%
464814.717432497 3
< 0.1%
464725.202550965 1
 
< 0.1%
464709.524245203 1
 
< 0.1%
464620.146666666 1
 
< 0.1%
464602.891647099 1
 
< 0.1%
464552.486156478 1
 
< 0.1%
464514.893929523 1
 
< 0.1%
464463.403607965 1
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9959 
0
 
41

Length

Max length4
Median length4
Mean length3.9877
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9959
99.6%
0 41
 
0.4%

Length

2024-05-11T03:22:21.853857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:22:22.311096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9959
99.6%
0 41
 
0.4%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9959 
0
 
41

Length

Max length4
Median length4
Mean length3.9877
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9959
99.6%
0 41
 
0.4%

Length

2024-05-11T03:22:22.844187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:22:23.341923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9959
99.6%
0 41
 
0.4%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9959 
0
 
41

Length

Max length4
Median length4
Mean length3.9877
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9959
99.6%
0 41
 
0.4%

Length

2024-05-11T03:22:23.890347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:22:24.304474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9959
99.6%
0 41
 
0.4%

판매방식명
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9994 
인터넷
 
6

Length

Max length4
Median length4
Mean length3.9994
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9994
99.9%
인터넷 6
 
0.1%

Length

2024-05-11T03:22:24.706530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T03:22:25.107207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9994
99.9%
인터넷 6
 
0.1%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
19809323000020233230291302003642023-01-19<NA>3폐업3폐업처리2023-05-16<NA><NA><NA><NA><NA><NA>서울특별시 송파구 삼전동 ***-* 서이빌서울특별시 송파구 백제고분로**길 **-*, ***호 (삼전동, 서이빌)05587오늘의모든것2023-05-16 16:34:39U2022-12-04 23:08:00.0종합몰208362.338737444224.203232<NA><NA><NA><NA>
9168317000020143170142302003022014-03-24<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 금천구 가산동 ***번지 **호서울특별시 금천구 가산디지털*로 ***, A동 ***-*호 (가산동, 우림라이온스밸리)153-786하게2023-08-10 17:23:57U2022-12-07 23:02:00.0기타189538.020936441982.427935<NA><NA><NA><NA>
9098300000020243000245302001102024-01-11<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 종로*가 ***-* 동대문종합시장서울특별시 종로구 종로 ***, 동대문종합시장 B동 *층 B-**호 (종로*가)03198원스냅2024-01-15 14:18:18I2023-11-30 23:07:00.0의류/패션/잡화/뷰티200542.258243452057.12686<NA><NA><NA><NA>
15691322000020233220249302016642023-03-17<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 역삼동 ***-** 강남역두산위브센티움서울특별시 강남구 테헤란로**길 *, 강남역두산위브센티움 ***호 (역삼동)06234유한회사 에이원인터네셔날(A-ONE International Ltd.)2023-03-17 11:51:39I2022-12-02 23:09:00.0의류/패션/잡화/뷰티 레져/여행/공연202842.375594444049.698784<NA><NA><NA><NA>
4385300000020033000101302005332003-01-14<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-2004-8213<NA><NA>서울특별시 종로구 운니동 **번지 *호 월드오피스텔 ***호서울특별시 종로구 율곡로*길 **, ***호 (운니동, 월드오피스텔)03131주식회사 현대불교신문사2023-04-12 09:47:22U2022-12-03 23:04:00.0종합몰198966.466796452608.062544<NA><NA><NA><NA>
6216321000020223210153302019132022-07-11<NA>5제외/삭제/전출5타시군구이관2023-05-11<NA><NA><NA><NA><NA><NA>서울특별시 서초구 양재동 ***-*서울특별시 서초구 마방로*길 **-**, *층, 지하*층 (양재동)06777바이루나2023-05-11 10:03:27U2022-12-04 23:03:00.0의류/패션/잡화/뷰티203672.716127441272.691177<NA><NA><NA><NA>
7429312000020233120219302004662023-03-24<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 서대문구 충정로*가 ***-*서울특별시 서대문구 경기대로 **, *층 ***호 (충정로*가)03752주식회사 에너지라이프2023-03-24 17:03:10I2022-12-02 22:06:00.0종합몰196606.081101451281.610055<NA><NA><NA><NA>
558321000020233210153302007142023-02-28<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 서초구 서초동 ****-* 서초오피스텔서울특별시 서초구 서운로 ***, 서초오피스텔 ***호 (서초동)06609원메디컬2023-02-28 17:43:03I2022-12-03 00:03:00.0종합몰 기타201853.889909444549.81239<NA><NA><NA><NA>
8498312000020203120192302015502020-10-05<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 서대문구 홍은동 ***-** 대한레드빌서울특별시 서대문구 증가로*길 **-*, ***호 (홍은동, 대한레드빌)03663정쓰픽2023-03-24 17:49:12U2022-12-02 22:06:00.0종합몰193492.621716453001.74595<NA><NA><NA><NA>
14427322000020233220249302022592023-04-17<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 삼성동 ***-* 승광빌딩서울특별시 강남구 테헤란로**길 **, 승광빌딩 *층 (삼성동)06159인셀덤 선릉대리점2023-04-17 12:19:34I2022-12-03 23:09:00.0종합몰204718.327234445034.184824<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
12780300000020233000202302004942023-03-28<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 숭인동 ***-** 숭인상가아파트서울특별시 종로구 청계천로 ***, *층 **, **호 (숭인동, 숭인상가아파트)03117쓰리스타지퍼(123상사)2023-03-29 15:38:21I2022-12-02 21:01:00.0종합몰 의류/패션/잡화/뷰티 기타201747.387869452204.768497<NA><NA><NA><NA>
17208302000020083020095302007912008-07-16<NA>3폐업3폐업처리2023-03-23<NA><NA><NA>02-790-3357<NA><NA>서울특별시 용산구 한강로*가 ***번지 래미안용산 더 센트럴서울특별시 용산구 한강대로 **, 지하*층 ***호 (한강로*가, 래미안용산 더 센트럴)04378주식회사 용산데이터복구샵2023-03-31 17:00:23U2022-12-04 00:02:00.0종합몰 컴퓨터/사무용품197011.737529447430.100251<NA><NA><NA><NA>
7715312000020213120192302009992021-05-24<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 서대문구 홍제동 ***-** 광산아파트서울특별시 서대문구 모래내로 ***, A동 ***호 (홍제동, 광산아파트)03646파라요2023-03-24 17:16:48U2022-12-02 22:06:00.0종합몰195108.783924453701.815692<NA><NA><NA><NA>
21731322000020243220249302030642024-05-08<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 대치동 ***-**서울특별시 강남구 테헤란로**길 **, 디아이타워 ****호 (대치동)06178네발바닥2024-05-08 17:36:51I2023-12-04 23:00:00.0종합몰205079.418138444869.834721<NA><NA><NA><NA>
16575300000020213000202302015802021-08-17<NA>5제외/삭제/전출5타시군구이관2023-04-18<NA><NA><NA><NA><NA><NA>서울특별시 종로구 명륜*가 * 어젤리아 명륜*서울특별시 종로구 창경궁로 ***-*, *층 **호 (명륜*가, 어젤리아 명륜*)03075무드모아젤(MOODmoajell)2023-04-18 11:13:26U2022-12-03 22:00:00.0의류/패션/잡화/뷰티199884.924611453572.123373<NA><NA><NA><NA>
13361317000020193170190302015392019-09-17<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7566-8477<NA><NA>서울특별시 금천구 가산동 ***-*서울특별시 금천구 가산디지털*로 **, ***호 (가산동)08592주식회사 엔에스생활건강2023-03-29 15:16:13U2022-12-02 21:01:00.0종합몰189622.332386441057.177316<NA><NA><NA><NA>
12571322000020183220162302012592018-03-27<NA>3폐업3폐업처리2023-06-05<NA><NA><NA><NA><NA><NA>서울특별시 강남구 역삼동 ***번지 **호서울특별시 강남구 언주로**길 **, ****호 (역삼동, 한화 진넥스빌)06210(주) 펫앤브이2023-06-05 10:43:05U2022-12-06 00:08:00.0교육/도서/완구/오락 기타203842.339919444416.798167<NA><NA><NA><NA>
8073312000020243120219302000872024-01-11<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 서대문구 창천동 ***서울특별시 서대문구 연희로*안길 *, ***호 (창천동)03784케이스타일난다2024-01-15 17:50:49U2023-11-30 23:07:00.0종합몰 의류/패션/잡화/뷰티193499.308069450922.334073<NA><NA><NA><NA>
10319308000020213080169302009542021-07-05<NA>3폐업3폐업처리2023-04-21<NA><NA><NA><NA><NA><NA>서울특별시 강북구 미아동 ****-***서울특별시 강북구 삼양로 ***-*, *층 (미아동)01208무이비엔플라워2023-04-21 17:56:19U2022-12-03 22:03:00.0기타201910.781039457031.739323<NA><NA><NA><NA>
6288316000020233160159302008572023-04-04<NA>3폐업3폐업처리2023-08-11<NA><NA><NA><NA><NA><NA>서울특별시 구로구 가리봉동 **-***서울특별시 구로구 디지털로**길 ***, *층 ***호 (가리봉동)08385꼼꼼수사장2023-08-10 15:27:57U2022-12-07 23:02:00.0종합몰190033.709684442492.459647<NA><NA><NA><NA>