Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells77719
Missing cells (%)26.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical9
Numeric6
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author양천구
URLhttps://data.seoul.go.kr/dataList/OA-18816/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (61.0%)Imbalance
부채총액 is highly imbalanced (61.0%)Imbalance
자본금 is highly imbalanced (61.0%)Imbalance
판매방식명 is highly imbalanced (71.7%)Imbalance
인허가취소일자 has 9620 (96.2%) missing valuesMissing
폐업일자 has 6416 (64.2%) missing valuesMissing
휴업시작일자 has 9958 (99.6%) missing valuesMissing
휴업종료일자 has 9956 (99.6%) missing valuesMissing
재개업일자 has 9989 (99.9%) missing valuesMissing
전화번호 has 3859 (38.6%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8516 (85.2%) missing valuesMissing
지번주소 has 1739 (17.4%) missing valuesMissing
도로명주소 has 1575 (15.8%) missing valuesMissing
도로명우편번호 has 2875 (28.7%) missing valuesMissing
좌표정보(X) has 1608 (16.1%) missing valuesMissing
좌표정보(Y) has 1608 (16.1%) missing valuesMissing
관리번호 is highly skewed (γ1 = -64.29892365)Skewed
좌표정보(Y) is highly skewed (γ1 = -57.91129544)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 11:32:42.899230
Analysis finished2024-04-06 11:32:45.835462
Duration2.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3140000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3140000
2nd row3140000
3rd row3140000
4th row3140000
5th row3140000

Common Values

ValueCountFrequency (%)
3140000 10000
100.0%

Length

2024-04-06T20:32:46.001061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:32:46.177869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3140000 10000
100.0%

관리번호
Real number (ℝ)

SKEWED  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0152834 × 1018
Minimum1.996314 × 1017
Maximum2.024314 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:32:46.376246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.996314 × 1017
5-th percentile2.003314 × 1018
Q12.010314 × 1018
median2.017314 × 1018
Q32.021314 × 1018
95-th percentile2.023314 × 1018
Maximum2.024314 × 1018
Range1.8246826 × 1018
Interquartile range (IQR)1.1000005 × 1016

Descriptive statistics

Standard deviation2.6506539 × 1016
Coefficient of variation (CV)0.01315276
Kurtosis4402.6009
Mean2.0152834 × 1018
Median Absolute Deviation (MAD)5 × 1015
Skewness-64.298924
Sum8.9893548 × 1018
Variance7.0259661 × 1032
MonotonicityNot monotonic
2024-04-06T20:32:46.638220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2018314016730200252 1
 
< 0.1%
2023314016730200960 1
 
< 0.1%
2018314016730200385 1
 
< 0.1%
2022314016730201565 1
 
< 0.1%
2019314016730200308 1
 
< 0.1%
2015314016730200512 1
 
< 0.1%
2021314016730202033 1
 
< 0.1%
2006314011430202231 1
 
< 0.1%
2020314016730201888 1
 
< 0.1%
2017314016730200499 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
199631401143020161 1
< 0.1%
199731401143020655 1
< 0.1%
1997314011430201055 1
< 0.1%
1999314011430201843 1
< 0.1%
1999314011430201937 1
< 0.1%
1999314011430202341 1
< 0.1%
2000314011430202771 1
< 0.1%
2000314011430203145 1
< 0.1%
2000314011430203267 1
< 0.1%
2000314011430203284 1
< 0.1%
ValueCountFrequency (%)
2024314016730200513 1
< 0.1%
2024314016730200511 1
< 0.1%
2024314016730200506 1
< 0.1%
2024314016730200504 1
< 0.1%
2024314016730200502 1
< 0.1%
2024314016730200501 1
< 0.1%
2024314016730200500 1
< 0.1%
2024314016730200499 1
< 0.1%
2024314016730200497 1
< 0.1%
2024314016730200496 1
< 0.1%
Distinct4007
Distinct (%)40.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-08-02 00:00:00
Maximum2024-04-04 00:00:00
2024-04-06T20:32:46.890382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:47.120244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Real number (ℝ)

MISSING 

Distinct40
Distinct (%)10.5%
Missing9620
Missing (%)96.2%
Infinite0
Infinite (%)0.0%
Mean20086012
Minimum20030923
Maximum20201222
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:32:47.339888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20030923
5-th percentile20080523
Q120080605
median20081013
Q320091226
95-th percentile20091226
Maximum20201222
Range170299
Interquartile range (IQR)10621

Descriptive statistics

Standard deviation17796.522
Coefficient of variation (CV)0.00088601572
Kurtosis23.753159
Mean20086012
Median Absolute Deviation (MAD)486
Skewness3.4511143
Sum7.6326844 × 109
Variance3.1671619 × 108
MonotonicityNot monotonic
2024-04-06T20:32:47.551605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
20091226 131
 
1.3%
20080605 61
 
0.6%
20081009 55
 
0.5%
20081013 43
 
0.4%
20080527 16
 
0.2%
20080523 12
 
0.1%
20090506 10
 
0.1%
20080528 7
 
0.1%
20080530 6
 
0.1%
20031021 5
 
0.1%
Other values (30) 34
 
0.3%
(Missing) 9620
96.2%
ValueCountFrequency (%)
20030923 2
 
< 0.1%
20031021 5
 
0.1%
20041013 2
 
< 0.1%
20070809 1
 
< 0.1%
20071008 1
 
< 0.1%
20071009 1
 
< 0.1%
20080311 1
 
< 0.1%
20080523 12
0.1%
20080524 1
 
< 0.1%
20080527 16
0.2%
ValueCountFrequency (%)
20201222 1
< 0.1%
20201221 1
< 0.1%
20201216 1
< 0.1%
20201029 1
< 0.1%
20180628 1
< 0.1%
20180418 1
< 0.1%
20171023 1
< 0.1%
20140117 1
< 0.1%
20130103 1
< 0.1%
20121113 1
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5599 
3
3078 
4
765 
5
 
521
2
 
37

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row1
4th row1
5th row3

Common Values

ValueCountFrequency (%)
1 5599
56.0%
3 3078
30.8%
4 765
 
7.6%
5 521
 
5.2%
2 37
 
0.4%

Length

2024-04-06T20:32:47.764587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:32:47.921888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5599
56.0%
3 3078
30.8%
4 765
 
7.6%
5 521
 
5.2%
2 37
 
0.4%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
5599 
폐업
3078 
취소/말소/만료/정지/중지
765 
제외/삭제/전출
 
521
휴업
 
37

Length

Max length14
Median length5
Mean length4.9103
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row폐업
3rd row영업/정상
4th row영업/정상
5th row폐업

Common Values

ValueCountFrequency (%)
영업/정상 5599
56.0%
폐업 3078
30.8%
취소/말소/만료/정지/중지 765
 
7.6%
제외/삭제/전출 521
 
5.2%
휴업 37
 
0.4%

Length

2024-04-06T20:32:48.104675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:32:48.286426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 5599
56.0%
폐업 3078
30.8%
취소/말소/만료/정지/중지 765
 
7.6%
제외/삭제/전출 521
 
5.2%
휴업 37
 
0.4%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.171
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:32:48.456974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile5
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.5520309
Coefficient of variation (CV)0.71489216
Kurtosis1.4117167
Mean2.171
Median Absolute Deviation (MAD)0
Skewness1.32675
Sum21710
Variance2.4087999
MonotonicityNot monotonic
2024-04-06T20:32:48.631603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 5598
56.0%
3 3078
30.8%
5 521
 
5.2%
4 388
 
3.9%
7 377
 
3.8%
2 37
 
0.4%
8 1
 
< 0.1%
ValueCountFrequency (%)
1 5598
56.0%
2 37
 
0.4%
3 3078
30.8%
4 388
 
3.9%
5 521
 
5.2%
7 377
 
3.8%
8 1
 
< 0.1%
ValueCountFrequency (%)
8 1
 
< 0.1%
7 377
 
3.8%
5 521
 
5.2%
4 388
 
3.9%
3 3078
30.8%
2 37
 
0.4%
1 5598
56.0%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
5598 
폐업처리
3078 
타시군구이관
 
521
직권취소
 
388
직권말소
 
377
Other values (2)
 
38

Length

Max length6
Median length4
Mean length4.1042
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row정상영업
2nd row폐업처리
3rd row정상영업
4th row정상영업
5th row폐업처리

Common Values

ValueCountFrequency (%)
정상영업 5598
56.0%
폐업처리 3078
30.8%
타시군구이관 521
 
5.2%
직권취소 388
 
3.9%
직권말소 377
 
3.8%
휴업처리 37
 
0.4%
영업재개 1
 
< 0.1%

Length

2024-04-06T20:32:48.838670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:32:49.020365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 5598
56.0%
폐업처리 3078
30.8%
타시군구이관 521
 
5.2%
직권취소 388
 
3.9%
직권말소 377
 
3.8%
휴업처리 37
 
0.4%
영업재개 1
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2227
Distinct (%)62.1%
Missing6416
Missing (%)64.2%
Memory size156.2 KiB
Minimum2000-02-18 00:00:00
Maximum2024-04-27 00:00:00
2024-04-06T20:32:49.245591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:49.503455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct42
Distinct (%)100.0%
Missing9958
Missing (%)99.6%
Memory size156.2 KiB
Minimum2008-01-01 00:00:00
Maximum2024-03-25 00:00:00
2024-04-06T20:32:49.751174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:49.998248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)

휴업종료일자
Date

MISSING 

Distinct42
Distinct (%)95.5%
Missing9956
Missing (%)99.6%
Memory size156.2 KiB
Minimum2004-07-01 00:00:00
Maximum2030-10-01 00:00:00
2024-04-06T20:32:50.198847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:50.382861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)

재개업일자
Date

MISSING 

Distinct11
Distinct (%)100.0%
Missing9989
Missing (%)99.9%
Memory size156.2 KiB
Minimum2006-10-18 00:00:00
Maximum2024-02-26 00:00:00
2024-04-06T20:32:50.888333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:51.131479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)

전화번호
Text

MISSING 

Distinct3862
Distinct (%)62.9%
Missing3859
Missing (%)38.6%
Memory size156.2 KiB
2024-04-06T20:32:51.557889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length8.2185312
Min length1

Characters and Unicode

Total characters50470
Distinct characters18
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3769 ?
Unique (%)61.4%

Sample

1st row02-3282-3813
2nd row02-
3rd row02 26974333
4th row02-2691-0809
5th row3219-8428
ValueCountFrequency (%)
02 3597
46.3%
26
 
0.3%
651 10
 
0.1%
699 9
 
0.1%
0000 7
 
0.1%
655 7
 
0.1%
653 7
 
0.1%
696 6
 
0.1%
691 6
 
0.1%
654 6
 
0.1%
Other values (3944) 4080
52.6%
2024-04-06T20:32:52.144870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 9662
19.1%
0 9084
18.0%
- 6384
12.6%
6 5072
10.0%
4 2942
 
5.8%
7 2928
 
5.8%
5 2717
 
5.4%
1 2622
 
5.2%
9 2518
 
5.0%
3 2451
 
4.9%
Other values (8) 4090
8.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 42428
84.1%
Dash Punctuation 6384
 
12.6%
Space Separator 1627
 
3.2%
Other Punctuation 17
 
< 0.1%
Math Symbol 7
 
< 0.1%
Close Punctuation 7
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 9662
22.8%
0 9084
21.4%
6 5072
12.0%
4 2942
 
6.9%
7 2928
 
6.9%
5 2717
 
6.4%
1 2622
 
6.2%
9 2518
 
5.9%
3 2451
 
5.8%
8 2432
 
5.7%
Other Punctuation
ValueCountFrequency (%)
. 13
76.5%
* 2
 
11.8%
, 1
 
5.9%
' 1
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 6384
100.0%
Space Separator
ValueCountFrequency (%)
1627
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 50470
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 9662
19.1%
0 9084
18.0%
- 6384
12.6%
6 5072
10.0%
4 2942
 
5.8%
7 2928
 
5.8%
5 2717
 
5.4%
1 2622
 
5.2%
9 2518
 
5.0%
3 2451
 
4.9%
Other values (8) 4090
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50470
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 9662
19.1%
0 9084
18.0%
- 6384
12.6%
6 5072
10.0%
4 2942
 
5.8%
7 2928
 
5.8%
5 2717
 
5.4%
1 2622
 
5.2%
9 2518
 
5.0%
3 2451
 
4.9%
Other values (8) 4090
8.1%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct100
Distinct (%)6.7%
Missing8516
Missing (%)85.2%
Infinite0
Infinite (%)0.0%
Mean160218.07
Minimum110410
Maximum704080
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:32:52.369691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum110410
5-th percentile158050
Q1158050
median158070
Q3158090
95-th percentile158733.95
Maximum704080
Range593670
Interquartile range (IQR)40

Descriptive statistics

Standard deviation28458.995
Coefficient of variation (CV)0.17762662
Kurtosis169.1113
Mean160218.07
Median Absolute Deviation (MAD)20
Skewness12.317288
Sum2.3776362 × 108
Variance8.099144 × 108
MonotonicityNot monotonic
2024-04-06T20:32:52.592478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
158070 473
 
4.7%
158050 452
 
4.5%
158090 363
 
3.6%
158861 13
 
0.1%
158091 13
 
0.1%
158051 12
 
0.1%
158073 7
 
0.1%
158075 6
 
0.1%
158053 6
 
0.1%
158071 5
 
0.1%
Other values (90) 134
 
1.3%
(Missing) 8516
85.2%
ValueCountFrequency (%)
110410 1
< 0.1%
110552 1
< 0.1%
120050 1
< 0.1%
121220 1
< 0.1%
122060 1
< 0.1%
134031 1
< 0.1%
135010 1
< 0.1%
135080 1
< 0.1%
136130 1
< 0.1%
137040 1
< 0.1%
ValueCountFrequency (%)
704080 1
< 0.1%
560741 1
< 0.1%
464861 1
< 0.1%
462130 1
< 0.1%
449851 1
< 0.1%
422090 1
< 0.1%
421190 1
< 0.1%
420120 1
< 0.1%
415020 1
< 0.1%
411320 1
< 0.1%

지번주소
Text

MISSING 

Distinct3296
Distinct (%)39.9%
Missing1739
Missing (%)17.4%
Memory size156.2 KiB
2024-04-06T20:32:53.084389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length44
Mean length26.429488
Min length5

Characters and Unicode

Total characters218334
Distinct characters506
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2605 ?
Unique (%)31.5%

Sample

1st row서울특별시 양천구 신월동 ***번지 *호 이스트힐
2nd row서울특별시 구로구 구로동 ***번지 **호 이스페이스 ***호
3rd row서울특별시 양천구 신월동 ***-* 행복하우스
4th row서울특별시 양천구 목동 ***-*
5th row서울특별시 양천구 신월동 **-* 신월빌딩 지하*층
ValueCountFrequency (%)
서울특별시 8210
18.7%
양천구 8174
18.6%
5201
11.8%
3659
8.3%
번지 3182
 
7.2%
목동 3082
 
7.0%
신정동 3017
 
6.9%
신월동 2181
 
5.0%
466
 
1.1%
295
 
0.7%
Other values (2380) 6470
14.7%
2024-04-06T20:32:53.826625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 44650
20.5%
35817
16.4%
9998
 
4.6%
8874
 
4.1%
8363
 
3.8%
8325
 
3.8%
8288
 
3.8%
8281
 
3.8%
8259
 
3.8%
8212
 
3.8%
Other values (496) 69267
31.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 131719
60.3%
Other Punctuation 44755
 
20.5%
Space Separator 35817
 
16.4%
Dash Punctuation 4329
 
2.0%
Decimal Number 1101
 
0.5%
Uppercase Letter 314
 
0.1%
Open Punctuation 128
 
0.1%
Close Punctuation 128
 
0.1%
Lowercase Letter 29
 
< 0.1%
Math Symbol 8
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9998
 
7.6%
8874
 
6.7%
8363
 
6.3%
8325
 
6.3%
8288
 
6.3%
8281
 
6.3%
8259
 
6.3%
8212
 
6.2%
8210
 
6.2%
6289
 
4.8%
Other values (435) 48620
36.9%
Uppercase Letter
ValueCountFrequency (%)
B 77
24.5%
A 49
15.6%
S 26
 
8.3%
K 20
 
6.4%
M 19
 
6.1%
C 15
 
4.8%
O 14
 
4.5%
E 11
 
3.5%
T 11
 
3.5%
D 9
 
2.9%
Other values (15) 63
20.1%
Decimal Number
ValueCountFrequency (%)
1 264
24.0%
2 132
12.0%
0 127
11.5%
3 114
10.4%
9 110
10.0%
7 86
 
7.8%
4 85
 
7.7%
8 63
 
5.7%
6 62
 
5.6%
5 58
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
e 9
31.0%
l 4
13.8%
a 4
13.8%
b 4
13.8%
i 2
 
6.9%
s 2
 
6.9%
k 1
 
3.4%
r 1
 
3.4%
d 1
 
3.4%
v 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
* 44650
99.8%
@ 49
 
0.1%
, 38
 
0.1%
/ 8
 
< 0.1%
. 5
 
< 0.1%
& 3
 
< 0.1%
? 1
 
< 0.1%
' 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
35817
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4329
100.0%
Open Punctuation
ValueCountFrequency (%)
( 128
100.0%
Close Punctuation
ValueCountFrequency (%)
) 128
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 131712
60.3%
Common 86266
39.5%
Latin 347
 
0.2%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9998
 
7.6%
8874
 
6.7%
8363
 
6.3%
8325
 
6.3%
8288
 
6.3%
8281
 
6.3%
8259
 
6.3%
8212
 
6.2%
8210
 
6.2%
6289
 
4.8%
Other values (433) 48613
36.9%
Latin
ValueCountFrequency (%)
B 77
22.2%
A 49
14.1%
S 26
 
7.5%
K 20
 
5.8%
M 19
 
5.5%
C 15
 
4.3%
O 14
 
4.0%
E 11
 
3.2%
T 11
 
3.2%
e 9
 
2.6%
Other values (26) 96
27.7%
Common
ValueCountFrequency (%)
* 44650
51.8%
35817
41.5%
- 4329
 
5.0%
1 264
 
0.3%
2 132
 
0.2%
( 128
 
0.1%
) 128
 
0.1%
0 127
 
0.1%
3 114
 
0.1%
9 110
 
0.1%
Other values (14) 467
 
0.5%
Han
ValueCountFrequency (%)
3
33.3%
3
33.3%
3
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 131710
60.3%
ASCII 86608
39.7%
CJK 9
 
< 0.1%
Number Forms 4
 
< 0.1%
None 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 44650
51.6%
35817
41.4%
- 4329
 
5.0%
1 264
 
0.3%
2 132
 
0.2%
( 128
 
0.1%
) 128
 
0.1%
0 127
 
0.1%
3 114
 
0.1%
9 110
 
0.1%
Other values (48) 809
 
0.9%
Hangul
ValueCountFrequency (%)
9998
 
7.6%
8874
 
6.7%
8363
 
6.3%
8325
 
6.3%
8288
 
6.3%
8281
 
6.3%
8259
 
6.3%
8212
 
6.2%
8210
 
6.2%
6289
 
4.8%
Other values (432) 48611
36.9%
Number Forms
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
3
33.3%
3
33.3%
3
33.3%
None
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct4941
Distinct (%)58.6%
Missing1575
Missing (%)15.8%
Memory size156.2 KiB
2024-04-06T20:32:54.264021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length52
Mean length37.833116
Min length18

Characters and Unicode

Total characters318744
Distinct characters528
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3841 ?
Unique (%)45.6%

Sample

1st row서울특별시 양천구 남부순환로**길 **, ***동 ***호 (신월동, 이스트힐)
2nd row서울특별시 구로구 디지털로**길 **, ***호 (구로동,이스페이스)
3rd row서울특별시 양천구 남부순환로**길 **, *동 ***호 (신월동, 행복하우스)
4th row서울특별시 양천구 목동중앙남로 **, *층 (목동)
5th row서울특별시 양천구 오목로 *** (신정동)
ValueCountFrequency (%)
서울특별시 8416
14.3%
양천구 8372
14.3%
8358
14.2%
5788
 
9.9%
신정동 2793
 
4.8%
목동 2660
 
4.5%
2239
 
3.8%
신월동 2056
 
3.5%
1887
 
3.2%
목동동로 732
 
1.2%
Other values (2962) 15428
26.3%
2024-04-06T20:32:54.997479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 58367
18.3%
50354
 
15.8%
17221
 
5.4%
, 12052
 
3.8%
9380
 
2.9%
9173
 
2.9%
8817
 
2.8%
8813
 
2.8%
8654
 
2.7%
8520
 
2.7%
Other values (518) 127393
40.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 177227
55.6%
Other Punctuation 70446
 
22.1%
Space Separator 50354
 
15.8%
Close Punctuation 8478
 
2.7%
Open Punctuation 8475
 
2.7%
Dash Punctuation 2007
 
0.6%
Decimal Number 1082
 
0.3%
Uppercase Letter 587
 
0.2%
Lowercase Letter 60
 
< 0.1%
Math Symbol 19
 
< 0.1%
Other values (2) 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17221
 
9.7%
9380
 
5.3%
9173
 
5.2%
8817
 
5.0%
8813
 
5.0%
8654
 
4.9%
8520
 
4.8%
8473
 
4.8%
8459
 
4.8%
8418
 
4.7%
Other values (453) 81299
45.9%
Uppercase Letter
ValueCountFrequency (%)
B 258
44.0%
A 99
 
16.9%
S 32
 
5.5%
C 29
 
4.9%
K 25
 
4.3%
M 23
 
3.9%
O 14
 
2.4%
D 12
 
2.0%
E 12
 
2.0%
T 11
 
1.9%
Other values (15) 72
 
12.3%
Lowercase Letter
ValueCountFrequency (%)
b 24
40.0%
a 9
 
15.0%
e 7
 
11.7%
c 5
 
8.3%
l 4
 
6.7%
s 3
 
5.0%
i 2
 
3.3%
d 2
 
3.3%
n 1
 
1.7%
r 1
 
1.7%
Other values (2) 2
 
3.3%
Decimal Number
ValueCountFrequency (%)
1 254
23.5%
0 180
16.6%
2 155
14.3%
3 119
11.0%
4 90
 
8.3%
5 68
 
6.3%
6 61
 
5.6%
8 56
 
5.2%
7 56
 
5.2%
9 43
 
4.0%
Other Punctuation
ValueCountFrequency (%)
* 58367
82.9%
, 12052
 
17.1%
. 10
 
< 0.1%
@ 9
 
< 0.1%
/ 3
 
< 0.1%
& 2
 
< 0.1%
# 1
 
< 0.1%
? 1
 
< 0.1%
' 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 18
94.7%
1
 
5.3%
Letter Number
ValueCountFrequency (%)
6
85.7%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
50354
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8478
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8475
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2007
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 177220
55.6%
Common 140861
44.2%
Latin 654
 
0.2%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17221
 
9.7%
9380
 
5.3%
9173
 
5.2%
8817
 
5.0%
8813
 
5.0%
8654
 
4.9%
8520
 
4.8%
8473
 
4.8%
8459
 
4.8%
8418
 
4.8%
Other values (451) 81292
45.9%
Latin
ValueCountFrequency (%)
B 258
39.4%
A 99
 
15.1%
S 32
 
4.9%
C 29
 
4.4%
K 25
 
3.8%
b 24
 
3.7%
M 23
 
3.5%
O 14
 
2.1%
D 12
 
1.8%
E 12
 
1.8%
Other values (29) 126
19.3%
Common
ValueCountFrequency (%)
* 58367
41.4%
50354
35.7%
, 12052
 
8.6%
) 8478
 
6.0%
( 8475
 
6.0%
- 2007
 
1.4%
1 254
 
0.2%
0 180
 
0.1%
2 155
 
0.1%
3 119
 
0.1%
Other values (15) 420
 
0.3%
Han
ValueCountFrequency (%)
3
33.3%
3
33.3%
3
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 177218
55.6%
ASCII 141507
44.4%
CJK 9
 
< 0.1%
Number Forms 7
 
< 0.1%
None 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 58367
41.2%
50354
35.6%
, 12052
 
8.5%
) 8478
 
6.0%
( 8475
 
6.0%
- 2007
 
1.4%
B 258
 
0.2%
1 254
 
0.2%
0 180
 
0.1%
2 155
 
0.1%
Other values (51) 927
 
0.7%
Hangul
ValueCountFrequency (%)
17221
 
9.7%
9380
 
5.3%
9173
 
5.2%
8817
 
5.0%
8813
 
5.0%
8654
 
4.9%
8520
 
4.8%
8473
 
4.8%
8459
 
4.8%
8418
 
4.8%
Other values (450) 81290
45.9%
Number Forms
ValueCountFrequency (%)
6
85.7%
1
 
14.3%
CJK
ValueCountFrequency (%)
3
33.3%
3
33.3%
3
33.3%
None
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct374
Distinct (%)5.2%
Missing2875
Missing (%)28.7%
Memory size156.2 KiB
2024-04-06T20:32:55.471721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1671579
Min length5

Characters and Unicode

Total characters36816
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)0.8%

Sample

1st row07915
2nd row07928
3rd row07963
4th row158859
5th row158718
ValueCountFrequency (%)
158050 168
 
2.4%
158070 139
 
2.0%
158090 121
 
1.7%
07938 109
 
1.5%
08026 103
 
1.4%
08023 101
 
1.4%
07997 99
 
1.4%
07946 84
 
1.2%
07965 71
 
1.0%
07983 70
 
1.0%
Other values (364) 6060
85.1%
2024-04-06T20:32:56.196718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10771
29.3%
8 5519
15.0%
7 4902
13.3%
9 4648
12.6%
1 2669
 
7.2%
5 2597
 
7.1%
2 1562
 
4.2%
6 1412
 
3.8%
4 1354
 
3.7%
3 1352
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36786
99.9%
Dash Punctuation 30
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10771
29.3%
8 5519
15.0%
7 4902
13.3%
9 4648
12.6%
1 2669
 
7.3%
5 2597
 
7.1%
2 1562
 
4.2%
6 1412
 
3.8%
4 1354
 
3.7%
3 1352
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36816
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10771
29.3%
8 5519
15.0%
7 4902
13.3%
9 4648
12.6%
1 2669
 
7.2%
5 2597
 
7.1%
2 1562
 
4.2%
6 1412
 
3.8%
4 1354
 
3.7%
3 1352
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36816
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10771
29.3%
8 5519
15.0%
7 4902
13.3%
9 4648
12.6%
1 2669
 
7.2%
5 2597
 
7.1%
2 1562
 
4.2%
6 1412
 
3.8%
4 1354
 
3.7%
3 1352
 
3.7%
Distinct9791
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T20:32:56.820773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length6.6532
Min length1

Characters and Unicode

Total characters66532
Distinct characters1108
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9597 ?
Unique (%)96.0%

Sample

1st row주선이옷가게
2nd row(주)시맥스디앤티
3rd row제롬 리(jerome lee)
4th row샤르망
5th row예루살렘 기독교 백화점
ValueCountFrequency (%)
주식회사 413
 
3.3%
37
 
0.3%
25
 
0.2%
목동점 24
 
0.2%
company 23
 
0.2%
21
 
0.2%
컴퍼니 20
 
0.2%
스튜디오 16
 
0.1%
인셀덤 16
 
0.1%
international 16
 
0.1%
Other values (10855) 11741
95.1%
2024-04-06T20:32:57.713269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2511
 
3.8%
2363
 
3.6%
1967
 
3.0%
) 1964
 
3.0%
( 1919
 
2.9%
1188
 
1.8%
1152
 
1.7%
927
 
1.4%
791
 
1.2%
e 743
 
1.1%
Other values (1098) 51007
76.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47568
71.5%
Lowercase Letter 6461
 
9.7%
Uppercase Letter 5401
 
8.1%
Space Separator 2363
 
3.6%
Close Punctuation 1965
 
3.0%
Open Punctuation 1920
 
2.9%
Decimal Number 427
 
0.6%
Other Punctuation 317
 
0.5%
Dash Punctuation 69
 
0.1%
Other Symbol 20
 
< 0.1%
Other values (3) 21
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2511
 
5.3%
1967
 
4.1%
1188
 
2.5%
1152
 
2.4%
927
 
1.9%
791
 
1.7%
693
 
1.5%
641
 
1.3%
601
 
1.3%
578
 
1.2%
Other values (1014) 36519
76.8%
Lowercase Letter
ValueCountFrequency (%)
e 743
11.5%
o 670
 
10.4%
a 565
 
8.7%
n 479
 
7.4%
i 466
 
7.2%
r 413
 
6.4%
l 398
 
6.2%
t 362
 
5.6%
s 346
 
5.4%
u 250
 
3.9%
Other values (16) 1769
27.4%
Uppercase Letter
ValueCountFrequency (%)
O 445
 
8.2%
A 402
 
7.4%
E 388
 
7.2%
S 381
 
7.1%
N 320
 
5.9%
I 291
 
5.4%
T 287
 
5.3%
M 285
 
5.3%
C 282
 
5.2%
L 270
 
5.0%
Other values (16) 2050
38.0%
Other Punctuation
ValueCountFrequency (%)
. 146
46.1%
& 102
32.2%
, 20
 
6.3%
' 18
 
5.7%
? 10
 
3.2%
# 6
 
1.9%
/ 5
 
1.6%
! 4
 
1.3%
3
 
0.9%
2
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 91
21.3%
1 87
20.4%
0 48
11.2%
4 46
10.8%
3 41
9.6%
9 38
8.9%
7 25
 
5.9%
5 23
 
5.4%
6 17
 
4.0%
8 11
 
2.6%
Close Punctuation
ValueCountFrequency (%)
) 1964
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1919
99.9%
[ 1
 
0.1%
Other Symbol
ValueCountFrequency (%)
18
90.0%
2
 
10.0%
Space Separator
ValueCountFrequency (%)
2363
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 18
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47578
71.5%
Latin 11862
 
17.8%
Common 7084
 
10.6%
Han 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2511
 
5.3%
1967
 
4.1%
1188
 
2.5%
1152
 
2.4%
927
 
1.9%
791
 
1.7%
693
 
1.5%
641
 
1.3%
601
 
1.3%
578
 
1.2%
Other values (1008) 36529
76.8%
Latin
ValueCountFrequency (%)
e 743
 
6.3%
o 670
 
5.6%
a 565
 
4.8%
n 479
 
4.0%
i 466
 
3.9%
O 445
 
3.8%
r 413
 
3.5%
A 402
 
3.4%
l 398
 
3.4%
E 388
 
3.3%
Other values (42) 6893
58.1%
Common
ValueCountFrequency (%)
2363
33.4%
) 1964
27.7%
( 1919
27.1%
. 146
 
2.1%
& 102
 
1.4%
2 91
 
1.3%
1 87
 
1.2%
- 69
 
1.0%
0 48
 
0.7%
4 46
 
0.6%
Other values (21) 249
 
3.5%
Han
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47560
71.5%
ASCII 18939
 
28.5%
None 23
 
< 0.1%
CJK 8
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2511
 
5.3%
1967
 
4.1%
1188
 
2.5%
1152
 
2.4%
927
 
1.9%
791
 
1.7%
693
 
1.5%
641
 
1.3%
601
 
1.3%
578
 
1.2%
Other values (1007) 36511
76.8%
ASCII
ValueCountFrequency (%)
2363
 
12.5%
) 1964
 
10.4%
( 1919
 
10.1%
e 743
 
3.9%
o 670
 
3.5%
a 565
 
3.0%
n 479
 
2.5%
i 466
 
2.5%
O 445
 
2.3%
r 413
 
2.2%
Other values (70) 8912
47.1%
None
ValueCountFrequency (%)
18
78.3%
3
 
13.0%
2
 
8.7%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Distinct9199
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-27 14:42:22
Maximum2024-04-04 20:40:55
2024-04-06T20:32:57.992650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:58.290080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7902 
U
2098 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7902
79.0%
U 2098
 
21.0%

Length

2024-04-06T20:32:58.540119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:32:58.686037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7902
79.0%
u 2098
 
21.0%
Distinct1481
Distinct (%)14.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:06:00
2024-04-06T20:32:58.848610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T20:32:59.072605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct523
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T20:32:59.352142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length83
Mean length8.6394
Min length1

Characters and Unicode

Total characters86394
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique343 ?
Unique (%)3.4%

Sample

1st row의류/패션/잡화/뷰티
2nd row컴퓨터/사무용품
3rd row종합몰
4th row종합몰
5th row-
ValueCountFrequency (%)
의류/패션/잡화/뷰티 3452
24.5%
종합몰 2985
21.2%
기타 2202
15.6%
1172
 
8.3%
건강/식품 941
 
6.7%
교육/도서/완구/오락 799
 
5.7%
컴퓨터/사무용품 611
 
4.3%
가구/수납용품 542
 
3.9%
가전 520
 
3.7%
레져/여행/공연 357
 
2.5%
Other values (3) 491
 
3.5%
2024-04-06T20:32:59.873839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 15961
18.5%
4072
 
4.7%
3452
 
4.0%
3452
 
4.0%
3452
 
4.0%
3452
 
4.0%
3452
 
4.0%
3452
 
4.0%
3452
 
4.0%
3452
 
4.0%
Other values (41) 38745
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65189
75.5%
Other Punctuation 15961
 
18.5%
Space Separator 4072
 
4.7%
Dash Punctuation 1172
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
2985
 
4.6%
2985
 
4.6%
Other values (38) 31603
48.5%
Other Punctuation
ValueCountFrequency (%)
/ 15961
100.0%
Space Separator
ValueCountFrequency (%)
4072
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1172
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65189
75.5%
Common 21205
 
24.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
2985
 
4.6%
2985
 
4.6%
Other values (38) 31603
48.5%
Common
ValueCountFrequency (%)
/ 15961
75.3%
4072
 
19.2%
- 1172
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65189
75.5%
ASCII 21205
 
24.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 15961
75.3%
4072
 
19.2%
- 1172
 
5.5%
Hangul
ValueCountFrequency (%)
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
3452
 
5.3%
2985
 
4.6%
2985
 
4.6%
Other values (38) 31603
48.5%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct3973
Distinct (%)47.3%
Missing1608
Missing (%)16.1%
Infinite0
Infinite (%)0.0%
Mean187327.97
Minimum174456.9
Maximum228441
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:33:00.089320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum174456.9
5-th percentile184854.66
Q1186046.45
median187676.46
Q3188425.76
95-th percentile189144.49
Maximum228441
Range53984.096
Interquartile range (IQR)2379.3124

Descriptive statistics

Standard deviation1736.2375
Coefficient of variation (CV)0.0092684373
Kurtosis82.952501
Mean187327.97
Median Absolute Deviation (MAD)952.95667
Skewness4.6628212
Sum1.5720563 × 109
Variance3014520.7
MonotonicityNot monotonic
2024-04-06T20:33:00.332305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
188953.066831076 130
 
1.3%
188467.215363052 80
 
0.8%
189151.208015925 67
 
0.7%
188012.804745161 65
 
0.7%
188584.345447275 65
 
0.7%
185263.865468449 63
 
0.6%
187652.903985452 58
 
0.6%
187995.261631804 54
 
0.5%
186852.153851552 53
 
0.5%
188248.45432693 49
 
0.5%
Other values (3963) 7708
77.1%
(Missing) 1608
 
16.1%
ValueCountFrequency (%)
174456.903556299 1
 
< 0.1%
180507.645356271 1
 
< 0.1%
184242.730019702 12
0.1%
184260.782196 1
 
< 0.1%
184298.438528197 1
 
< 0.1%
184359.032127833 1
 
< 0.1%
184359.248011837 1
 
< 0.1%
184373.07604881 5
0.1%
184384.354361539 1
 
< 0.1%
184388.747620342 1
 
< 0.1%
ValueCountFrequency (%)
228440.999432171 1
< 0.1%
221362.162194 1
< 0.1%
212298.540864675 1
< 0.1%
210750.209556247 1
< 0.1%
210495.880373 1
< 0.1%
210400.171224239 1
< 0.1%
210064.371102607 1
< 0.1%
207487.643232117 1
< 0.1%
206469.619424915 1
< 0.1%
203463.676635622 1
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct3975
Distinct (%)47.4%
Missing1608
Missing (%)16.1%
Infinite0
Infinite (%)0.0%
Mean447286.57
Minimum258147.9
Maximum464794.34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T20:33:00.668402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum258147.9
5-th percentile445546.56
Q1446522
median447143.55
Q3448200.07
95-th percentile449302.22
Maximum464794.34
Range206646.43
Interquartile range (IQR)1678.0677

Descriptive statistics

Standard deviation2406.7056
Coefficient of variation (CV)0.0053806793
Kurtosis4548.3567
Mean447286.57
Median Absolute Deviation (MAD)837.65326
Skewness-57.911295
Sum3.7536289 × 109
Variance5792231.8
MonotonicityNot monotonic
2024-04-06T20:33:01.019425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
447333.569187997 130
 
1.3%
446003.941216623 80
 
0.8%
448200.067716589 67
 
0.7%
445428.447103333 65
 
0.7%
447255.070457495 65
 
0.7%
446199.492398091 63
 
0.6%
445655.04439912 58
 
0.6%
445782.650926649 54
 
0.5%
446714.974104065 53
 
0.5%
447406.301288366 49
 
0.5%
Other values (3965) 7708
77.1%
(Missing) 1608
 
16.1%
ValueCountFrequency (%)
258147.901215 1
< 0.1%
423548.771052 1
< 0.1%
428986.033205408 1
< 0.1%
437130.014867827 1
< 0.1%
442403.568114284 1
< 0.1%
442596.203831363 1
< 0.1%
443131.777183599 1
< 0.1%
443186.686256059 1
< 0.1%
443511.30566781 1
< 0.1%
443610.086800382 1
< 0.1%
ValueCountFrequency (%)
464794.33591471 1
< 0.1%
459813.317652333 1
< 0.1%
459775.992830125 1
< 0.1%
456528.476331771 1
< 0.1%
456206.883711284 1
< 0.1%
452421.333601368 1
< 0.1%
452132.648343993 1
< 0.1%
451638.706934364 1
< 0.1%
450756.598824259 1
< 0.1%
450562.020225978 1
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9233 
0
 
767

Length

Max length4
Median length4
Mean length3.7699
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9233
92.3%
0 767
 
7.7%

Length

2024-04-06T20:33:01.286120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:33:01.458484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9233
92.3%
0 767
 
7.7%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9233 
0
 
767

Length

Max length4
Median length4
Mean length3.7699
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9233
92.3%
0 767
 
7.7%

Length

2024-04-06T20:33:01.632404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:33:01.820136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9233
92.3%
0 767
 
7.7%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9233 
0
 
767

Length

Max length4
Median length4
Mean length3.7699
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9233
92.3%
0 767
 
7.7%

Length

2024-04-06T20:33:01.971152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:33:02.118976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9233
92.3%
0 767
 
7.7%

판매방식명
Categorical

IMBALANCE 

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5019 
인터넷
4676 
인터넷, 기타
 
91
기타
 
55
TV홈쇼핑, 인터넷
 
41
Other values (17)
 
118

Length

Max length26
Median length4
Mean length3.7105
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row인터넷
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5019
50.2%
인터넷 4676
46.8%
인터넷, 기타 91
 
0.9%
기타 55
 
0.5%
TV홈쇼핑, 인터넷 41
 
0.4%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 26
 
0.3%
TV홈쇼핑 14
 
0.1%
인터넷, 카다로그 14
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 12
 
0.1%
TV홈쇼핑, 인터넷, 카다로그 11
 
0.1%
Other values (12) 41
 
0.4%

Length

2024-04-06T20:33:02.332222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5019
48.3%
인터넷 4904
47.2%
기타 202
 
1.9%
tv홈쇼핑 115
 
1.1%
카다로그 88
 
0.8%
신문잡지 55
 
0.5%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
105933140000201831401673020025220180323<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 양천구 신월동 ***번지 *호 이스트힐서울특별시 양천구 남부순환로**길 **, ***동 ***호 (신월동, 이스트힐)07915주선이옷가게2018-03-23 16:39:17I2018-08-31 23:59:59.0의류/패션/잡화/뷰티184841.340767447553.18493<NA><NA><NA>인터넷
35153140000200731401143020334320071029<NA>3폐업3폐업처리20081020<NA><NA><NA>02-3282-3813<NA>152050서울특별시 구로구 구로동 ***번지 **호 이스페이스 ***호서울특별시 구로구 디지털로**길 **, ***호 (구로동,이스페이스)<NA>(주)시맥스디앤티2008-10-20 10:43:30I2018-08-31 23:59:59.0컴퓨터/사무용품190423.216683442403.568114<NA><NA><NA>인터넷
164653140000202131401673020203020211221<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-<NA><NA>서울특별시 양천구 신월동 ***-* 행복하우스서울특별시 양천구 남부순환로**길 **, *동 ***호 (신월동, 행복하우스)07928제롬 리(jerome lee)2021-12-21 09:02:31I2021-12-23 00:22:42.0종합몰185680.572273446654.09364000인터넷
18780314000020233140167302005362023-03-16<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 양천구 목동 ***-*서울특별시 양천구 목동중앙남로 **, *층 (목동)07963샤르망2023-03-16 14:22:22I2022-12-02 23:08:00.0종합몰188258.405933448190.04636<NA><NA><NA><NA>
19143140000200531401143020165720050504<NA>3폐업3폐업처리20080529<NA><NA><NA>02 26974333<NA><NA>서울특별시 양천구 신월동 **-* 신월빌딩 지하*층<NA><NA>예루살렘 기독교 백화점2008-05-29 15:02:27I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
80993140000201531401673020010820150209<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-2691-0809<NA><NA><NA>서울특별시 양천구 오목로 *** (신정동)158859하와이 헬스클럽2015-02-09 17:06:40I2018-08-31 23:59:59.0종합몰186992.734329446925.412722<NA><NA><NA>기타
75863140000201431401673020029720101209<NA>3폐업3폐업처리20211223<NA><NA><NA>3219-8428<NA><NA><NA>서울특별시 양천구 목동동로 ***-*, ****-*호 (목동, 드림타워)158718주식회사 스포츠조선헬스앤드라이프케어2021-12-23 09:11:45U2021-12-25 02:40:00.0종합몰188584.345447447255.070457000인터넷
11693140000200431401143020064620040727<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 26075142<NA><NA>서울특별시 양천구 신정동 ***-* 해룡빌딩 *층 ***호<NA><NA>(주)지엔씨글로벌2008-02-21 00:00:00I2018-08-31 23:59:59.0의류/패션/잡화/뷰티<NA><NA><NA><NA><NA>인터넷
150383140000202131401673020056620210318<NA>2휴업2휴업처리<NA>2022122820230630<NA>02-<NA><NA>서울특별시 양천구 신정동 1259 신트리1단지아파트서울특별시 양천구 신정로 293, 105동 903호 (신정동, 신트리1단지아파트)08078톡톡프렌즈2022-12-27 10:01:22U2021-11-01 22:09:00.0종합몰 교육/도서/완구/오락 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티 자동차/자동차용품187047.81011445755.514912<NA><NA><NA><NA>
96283140000201731401673020012620170214<NA>3폐업3폐업처리20210225<NA><NA><NA>02-2693-0383<NA><NA><NA>서울특별시 양천구 곰달래로**길 ** (신월동, 지하)07925영성2021-02-25 20:31:34U2021-02-27 02:40:00.0의류/패션/잡화/뷰티185653.733022447339.122448<NA><NA><NA>인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
159113140000202131401673020146520210823<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-<NA><NA>서울특별시 양천구 신월동 ***-* 대림아파트서울특별시 양천구 신월로**길 *, *층 ***호 (신월동, 대림아파트)08065이모네별난곱창2021-08-23 09:15:17I2021-08-25 00:22:50.0건강/식품185821.926012446035.819636000인터넷
46003140000200931401143020052020090918<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>070-8248-0983<NA>158090서울특별시 양천구 신월동 ***번지 *호 세아쎄닛빌아파트 ***동 ***호서울특별시 양천구 남부순환로 ***, ***동 ***호 (신월동,세아쎄닛빌아파트)<NA>폴로아이(polo4i)2014-01-20 09:49:23I2018-08-31 23:59:59.0의류/패션/잡화/뷰티185738.892375446234.935085<NA><NA><NA>인터넷
98393140000201731401673020036720170208<NA>3폐업3폐업처리20170706<NA><NA><NA><NA><NA><NA><NA>서울특별시 양천구 지양로**길 **, 가동 *층 *호 (신월동, 영진주택)08034꼬미엘2017-07-06 16:33:43I2018-08-31 23:59:59.0의류/패션/잡화/뷰티185322.653273446888.304508<NA><NA><NA>인터넷
47803140000201031401143020000620100104<NA>1영업/정상1정상영업<NA><NA><NA><NA>425-0841<NA>158070서울특별시 양천구 신정동 ***번지 *호 ***호서울특별시 양천구 신정중앙로**길 **, ***호 (신정동)<NA>(주)피떠블유 종합상사2010-01-04 16:25:47I2021-12-03 22:02:00.0종합몰 기타187751.802267447222.877254<NA><NA><NA><NA>
4413140000200231401143020836220020828<NA>3폐업3폐업처리20020827<NA><NA><NA>02 652 6456<NA><NA>서울특별시 양천구 목동 ***-** *층<NA><NA>호주건강식품2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
66943140000201231401143020069320121109<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02-2038-0016<NA><NA>서울특별시 양천구 목동 ***번지 *호 현대**타워 ****호서울특별시 양천구 목동동로 ***, ****호 (목동, 현대**타워)158050주식회사 지티아이에스2021-11-22 16:42:30I2021-11-24 00:22:44.0종합몰188953.066831447333.569188000인터넷
17252314000020223140167302007732022-06-02<NA>3폐업3폐업처리2023-02-15<NA><NA><NA><NA><NA><NA>서울특별시 양천구 신정동 ***-* 신정종합사회복지관서울특별시 양천구 신정중앙로 **, 신정종합사회복지관 비*호 (신정동)07945한끼스토리협동조합2023-02-15 09:49:32U2022-12-01 23:07:00.0건강/식품187598.593699447147.978956<NA><NA><NA><NA>
88513140000201631401673020015320160314<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 양천구 남부순환로**길 *-*, ***호 (신월동, 가온빌)07900싱긋투유2016-03-15 09:26:09I2018-08-31 23:59:59.0건강/식품184570.548574448688.911289<NA><NA><NA>인터넷
183243140000202331401673020006520230104<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 양천구 목동 772-5서울특별시 양천구 목동중앙남로 5(목동)07964이리지온(IRIJION)2023-01-04 13:55:15I2022-12-01 00:06:00.0의류/패션/잡화/뷰티188365.90011448097.2945<NA><NA><NA><NA>
15603140000200431401143020128920040825<NA>3폐업3폐업처리20041206<NA><NA><NA>02 60917455<NA><NA>서울특별시 양천구 목동 ***-** 목동트윈빌 ***호<NA><NA>design ZETA2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>