Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells69742
Missing cells (%)24.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory251.0 B

Variable types

Categorical10
Numeric5
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author노원구
URLhttps://data.seoul.go.kr/dataList/OA-18812/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 is highly imbalanced (99.9%)Imbalance
판매방식명 is highly imbalanced (72.7%)Imbalance
폐업일자 has 6984 (69.8%) missing valuesMissing
휴업시작일자 has 9959 (99.6%) missing valuesMissing
휴업종료일자 has 9959 (99.6%) missing valuesMissing
재개업일자 has 9965 (99.7%) missing valuesMissing
전화번호 has 6770 (67.7%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8780 (87.8%) missing valuesMissing
지번주소 has 1978 (19.8%) missing valuesMissing
도로명주소 has 1260 (12.6%) missing valuesMissing
도로명우편번호 has 2221 (22.2%) missing valuesMissing
좌표정보(X) has 933 (9.3%) missing valuesMissing
좌표정보(Y) has 933 (9.3%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = -30.85013208)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 19:49:11.037056
Analysis finished2024-04-29 19:49:12.826127
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3100000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3100000
2nd row3100000
3rd row3100000
4th row3100000
5th row3100000

Common Values

ValueCountFrequency (%)
3100000 10000
100.0%

Length

2024-04-30T04:49:12.890184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:12.968910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3100000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0169517 × 1018
Minimum1.99731 × 1018
Maximum2.02431 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:13.070578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.99731 × 1018
5-th percentile2.00631 × 1018
Q12.01231 × 1018
median2.01831 × 1018
Q32.02131 × 1018
95-th percentile2.02331 × 1018
Maximum2.02431 × 1018
Range2.7000007 × 1016
Interquartile range (IQR)9 × 1015

Descriptive statistics

Standard deviation5.6591651 × 1015
Coefficient of variation (CV)0.002805801
Kurtosis-0.64134462
Mean2.0169517 × 1018
Median Absolute Deviation (MAD)4 × 1015
Skewness-0.70203044
Sum7.2258981 × 1018
Variance3.202615 × 1031
MonotonicityNot monotonic
2024-04-30T04:49:13.197172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2022310018430200957 1
 
< 0.1%
2021310018430200708 1
 
< 0.1%
2017310018430200036 1
 
< 0.1%
2021310018430200872 1
 
< 0.1%
2003310010930200755 1
 
< 0.1%
2017310018430200487 1
 
< 0.1%
2016310018430200990 1
 
< 0.1%
2007310010930202714 1
 
< 0.1%
2020310018430201907 1
 
< 0.1%
2019310018430201096 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1997310010930200798 1
< 0.1%
2000310010930203048 1
< 0.1%
2000310010930204188 1
< 0.1%
2001310010930200103 1
< 0.1%
2001310010930200106 1
< 0.1%
2001310010930200128 1
< 0.1%
2001310010930200155 1
< 0.1%
2001310010930204536 1
< 0.1%
2001310010930204898 1
< 0.1%
2001310010930205225 1
< 0.1%
ValueCountFrequency (%)
2024310018430200689 1
< 0.1%
2024310018430200688 1
< 0.1%
2024310018430200687 1
< 0.1%
2024310018430200686 1
< 0.1%
2024310018430200683 1
< 0.1%
2024310018430200681 1
< 0.1%
2024310018430200679 1
< 0.1%
2024310018430200676 1
< 0.1%
2024310018430200674 1
< 0.1%
2024310018430200673 1
< 0.1%
Distinct3656
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1997-06-13 00:00:00
Maximum2024-04-24 00:00:00
2024-04-30T04:49:13.314826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:13.416753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9999 
20090130
 
1

Length

Max length8
Median length4
Mean length4.0004
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9999
> 99.9%
20090130 1
 
< 0.1%

Length

2024-04-30T04:49:13.524974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:13.627956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9999
> 99.9%
20090130 1
 
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4083 
4
2882 
3
2524 
5
494 
2
 
17

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row4
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 4083
40.8%
4 2882
28.8%
3 2524
25.2%
5 494
 
4.9%
2 17
 
0.2%

Length

2024-04-30T04:49:13.719495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:13.798369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4083
40.8%
4 2882
28.8%
3 2524
25.2%
5 494
 
4.9%
2 17
 
0.2%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4083 
취소/말소/만료/정지/중지
2882 
폐업
2524 
제외/삭제/전출
494 
휴업
 
17

Length

Max length14
Median length8
Mean length6.9797
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row취소/말소/만료/정지/중지
3rd row영업/정상
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 4083
40.8%
취소/말소/만료/정지/중지 2882
28.8%
폐업 2524
25.2%
제외/삭제/전출 494
 
4.9%
휴업 17
 
0.2%

Length

2024-04-30T04:49:13.893158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:13.980126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4083
40.8%
취소/말소/만료/정지/중지 2882
28.8%
폐업 2524
25.2%
제외/삭제/전출 494
 
4.9%
휴업 17
 
0.2%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4345
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:14.063590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q37
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)6

Descriptive statistics

Standard deviation2.5011068
Coefficient of variation (CV)0.72823026
Kurtosis-1.4353717
Mean3.4345
Median Absolute Deviation (MAD)2
Skewness0.46932828
Sum34345
Variance6.2555353
MonotonicityNot monotonic
2024-04-30T04:49:14.163119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 4080
40.8%
7 2881
28.8%
3 2524
25.2%
5 494
 
4.9%
2 17
 
0.2%
6 3
 
< 0.1%
4 1
 
< 0.1%
ValueCountFrequency (%)
1 4080
40.8%
2 17
 
0.2%
3 2524
25.2%
4 1
 
< 0.1%
5 494
 
4.9%
6 3
 
< 0.1%
7 2881
28.8%
ValueCountFrequency (%)
7 2881
28.8%
6 3
 
< 0.1%
5 494
 
4.9%
4 1
 
< 0.1%
3 2524
25.2%
2 17
 
0.2%
1 4080
40.8%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4080 
직권말소
2881 
폐업처리
2524 
타시군구이관
494 
휴업처리
 
17
Other values (2)
 
4

Length

Max length6
Median length4
Mean length4.0994
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row정상영업
2nd row직권말소
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 4080
40.8%
직권말소 2881
28.8%
폐업처리 2524
25.2%
타시군구이관 494
 
4.9%
휴업처리 17
 
0.2%
타시군구전입 3
 
< 0.1%
직권취소 1
 
< 0.1%

Length

2024-04-30T04:49:14.444470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:14.556249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4080
40.8%
직권말소 2881
28.8%
폐업처리 2524
25.2%
타시군구이관 494
 
4.9%
휴업처리 17
 
0.2%
타시군구전입 3
 
< 0.1%
직권취소 1
 
< 0.1%

폐업일자
Date

MISSING 

Distinct1856
Distinct (%)61.5%
Missing6984
Missing (%)69.8%
Memory size156.2 KiB
Minimum2005-07-13 00:00:00
Maximum2024-12-30 00:00:00
2024-04-30T04:49:14.653134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:14.755396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct39
Distinct (%)95.1%
Missing9959
Missing (%)99.6%
Memory size156.2 KiB
Minimum2002-11-29 00:00:00
Maximum2024-01-26 00:00:00
2024-04-30T04:49:14.863811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:14.966286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)

휴업종료일자
Date

MISSING 

Distinct38
Distinct (%)92.7%
Missing9959
Missing (%)99.6%
Memory size156.2 KiB
Minimum2003-05-23 00:00:00
Maximum2030-01-22 00:00:00
2024-04-30T04:49:15.080185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:15.194468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)

재개업일자
Date

MISSING 

Distinct31
Distinct (%)88.6%
Missing9965
Missing (%)99.7%
Memory size156.2 KiB
Minimum2007-05-04 00:00:00
Maximum2024-04-17 00:00:00
2024-04-30T04:49:15.305483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:15.412472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)

전화번호
Text

MISSING 

Distinct3012
Distinct (%)93.3%
Missing6770
Missing (%)67.7%
Memory size156.2 KiB
2024-04-30T04:49:15.570410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length10.150464
Min length1

Characters and Unicode

Total characters32786
Distinct characters17
Distinct categories7 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2962 ?
Unique (%)91.7%

Sample

1st row02936 4211
2nd row0262230660
3rd row02-932-5200
4th row3296 1180
5th row939-0609
ValueCountFrequency (%)
02 159
 
3.9%
34
 
0.8%
02951 24
 
0.6%
070 24
 
0.6%
02939 24
 
0.6%
02972 23
 
0.6%
02933 23
 
0.6%
02937 21
 
0.5%
02936 21
 
0.5%
02930 20
 
0.5%
Other values (3181) 3746
90.9%
2024-04-30T04:49:15.858928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5130
15.6%
2 4061
12.4%
- 3729
11.4%
9 3368
10.3%
7 2979
9.1%
3 2673
8.2%
1 2152
6.6%
5 2024
 
6.2%
4 1932
 
5.9%
8 1919
 
5.9%
Other values (7) 2819
8.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 28144
85.8%
Dash Punctuation 3729
 
11.4%
Space Separator 891
 
2.7%
Math Symbol 8
 
< 0.1%
Other Punctuation 7
 
< 0.1%
Close Punctuation 4
 
< 0.1%
Open Punctuation 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5130
18.2%
2 4061
14.4%
9 3368
12.0%
7 2979
10.6%
3 2673
9.5%
1 2152
7.6%
5 2024
 
7.2%
4 1932
 
6.9%
8 1919
 
6.8%
6 1906
 
6.8%
Other Punctuation
ValueCountFrequency (%)
. 6
85.7%
/ 1
 
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 3729
100.0%
Space Separator
ValueCountFrequency (%)
891
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32786
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5130
15.6%
2 4061
12.4%
- 3729
11.4%
9 3368
10.3%
7 2979
9.1%
3 2673
8.2%
1 2152
6.6%
5 2024
 
6.2%
4 1932
 
5.9%
8 1919
 
5.9%
Other values (7) 2819
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32786
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5130
15.6%
2 4061
12.4%
- 3729
11.4%
9 3368
10.3%
7 2979
9.1%
3 2673
8.2%
1 2152
6.6%
5 2024
 
6.2%
4 1932
 
5.9%
8 1919
 
5.9%
Other values (7) 2819
8.6%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct164
Distinct (%)13.4%
Missing8780
Missing (%)87.8%
Infinite0
Infinite (%)0.0%
Mean139502.41
Minimum100450
Maximum139959
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:15.979803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100450
5-th percentile139050
Q1139203.75
median139758.5
Q3139828.5
95-th percentile139909.15
Maximum139959
Range39509
Interquartile range (IQR)624.75

Descriptive statistics

Standard deviation1166.3582
Coefficient of variation (CV)0.008360846
Kurtosis1033.1125
Mean139502.41
Median Absolute Deviation (MAD)156
Skewness-30.850132
Sum1.7019294 × 108
Variance1360391.4
MonotonicityNot monotonic
2024-04-30T04:49:16.092203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
139200 207
 
2.1%
139240 106
 
1.1%
139050 63
 
0.6%
139220 53
 
0.5%
139764 41
 
0.4%
139230 32
 
0.3%
139838 21
 
0.2%
139771 18
 
0.2%
139837 17
 
0.2%
139051 17
 
0.2%
Other values (154) 645
 
6.5%
(Missing) 8780
87.8%
ValueCountFrequency (%)
100450 1
 
< 0.1%
136831 1
 
< 0.1%
139050 63
 
0.6%
139051 17
 
0.2%
139052 1
 
< 0.1%
139053 1
 
< 0.1%
139054 1
 
< 0.1%
139200 207
2.1%
139201 11
 
0.1%
139202 1
 
< 0.1%
ValueCountFrequency (%)
139959 3
 
< 0.1%
139957 5
0.1%
139956 10
0.1%
139955 4
 
< 0.1%
139947 2
 
< 0.1%
139945 1
 
< 0.1%
139942 3
 
< 0.1%
139940 2
 
< 0.1%
139939 6
0.1%
139938 1
 
< 0.1%

지번주소
Text

MISSING 

Distinct2852
Distinct (%)35.6%
Missing1978
Missing (%)19.8%
Memory size156.2 KiB
2024-04-30T04:49:16.297306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length47
Mean length27.931065
Min length11

Characters and Unicode

Total characters224063
Distinct characters444
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2044 ?
Unique (%)25.5%

Sample

1st row서울특별시 노원구 상계동 **** 성림아파트
2nd row서울특별시 노원구 상계동 **** 불암현대아파트
3rd row서울시 노원구 상계동***-* 거성빌라C동 ***호
4th row서울특별시 노원구 상계동 *** 상계주공*단지아파트
5th row서울시 노원구 하계동*-* 청솔@***-***
ValueCountFrequency (%)
노원구 8016
18.4%
서울특별시 7284
16.7%
4211
9.7%
상계동 3272
 
7.5%
번지 3139
 
7.2%
2660
 
6.1%
공릉동 1641
 
3.8%
중계동 1285
 
2.9%
월계동 1053
 
2.4%
756
 
1.7%
Other values (1651) 10252
23.5%
2024-04-30T04:49:16.653128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 43586
19.5%
35593
15.9%
9308
 
4.2%
8296
 
3.7%
8200
 
3.7%
8187
 
3.7%
8139
 
3.6%
8101
 
3.6%
8098
 
3.6%
7994
 
3.6%
Other values (434) 78561
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 140249
62.6%
Other Punctuation 44306
 
19.8%
Space Separator 35593
 
15.9%
Dash Punctuation 3007
 
1.3%
Decimal Number 480
 
0.2%
Uppercase Letter 228
 
0.1%
Open Punctuation 71
 
< 0.1%
Close Punctuation 71
 
< 0.1%
Lowercase Letter 53
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9308
 
6.6%
8296
 
5.9%
8200
 
5.8%
8187
 
5.8%
8139
 
5.8%
8101
 
5.8%
8098
 
5.8%
7994
 
5.7%
7290
 
5.2%
7284
 
5.2%
Other values (379) 59352
42.3%
Uppercase Letter
ValueCountFrequency (%)
B 55
24.1%
S 39
17.1%
E 27
11.8%
A 27
11.8%
K 16
 
7.0%
I 12
 
5.3%
G 7
 
3.1%
L 6
 
2.6%
D 6
 
2.6%
C 6
 
2.6%
Other values (11) 27
11.8%
Lowercase Letter
ValueCountFrequency (%)
n 23
43.4%
h 6
 
11.3%
e 4
 
7.5%
b 3
 
5.7%
w 3
 
5.7%
o 2
 
3.8%
r 2
 
3.8%
c 2
 
3.8%
a 2
 
3.8%
g 2
 
3.8%
Other values (4) 4
 
7.5%
Decimal Number
ValueCountFrequency (%)
1 84
17.5%
5 62
12.9%
2 55
11.5%
3 52
10.8%
7 47
9.8%
6 43
9.0%
4 43
9.0%
9 37
7.7%
0 29
 
6.0%
8 28
 
5.8%
Other Punctuation
ValueCountFrequency (%)
* 43586
98.4%
, 386
 
0.9%
@ 259
 
0.6%
. 58
 
0.1%
/ 17
 
< 0.1%
Space Separator
ValueCountFrequency (%)
35593
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3007
100.0%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Close Punctuation
ValueCountFrequency (%)
) 71
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 140249
62.6%
Common 83533
37.3%
Latin 281
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9308
 
6.6%
8296
 
5.9%
8200
 
5.8%
8187
 
5.8%
8139
 
5.8%
8101
 
5.8%
8098
 
5.8%
7994
 
5.7%
7290
 
5.2%
7284
 
5.2%
Other values (379) 59352
42.3%
Latin
ValueCountFrequency (%)
B 55
19.6%
S 39
13.9%
E 27
9.6%
A 27
9.6%
n 23
 
8.2%
K 16
 
5.7%
I 12
 
4.3%
G 7
 
2.5%
h 6
 
2.1%
L 6
 
2.1%
Other values (25) 63
22.4%
Common
ValueCountFrequency (%)
* 43586
52.2%
35593
42.6%
- 3007
 
3.6%
, 386
 
0.5%
@ 259
 
0.3%
1 84
 
0.1%
( 71
 
0.1%
) 71
 
0.1%
5 62
 
0.1%
. 58
 
0.1%
Other values (10) 356
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 140248
62.6%
ASCII 83814
37.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 43586
52.0%
35593
42.5%
- 3007
 
3.6%
, 386
 
0.5%
@ 259
 
0.3%
1 84
 
0.1%
( 71
 
0.1%
) 71
 
0.1%
5 62
 
0.1%
. 58
 
0.1%
Other values (45) 637
 
0.8%
Hangul
ValueCountFrequency (%)
9308
 
6.6%
8296
 
5.9%
8200
 
5.8%
8187
 
5.8%
8139
 
5.8%
8101
 
5.8%
8098
 
5.8%
7994
 
5.7%
7290
 
5.2%
7284
 
5.2%
Other values (378) 59351
42.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct4151
Distinct (%)47.5%
Missing1260
Missing (%)12.6%
Memory size156.2 KiB
2024-04-30T04:49:16.873183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length56
Mean length41.36167
Min length21

Characters and Unicode

Total characters361501
Distinct characters477
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3076 ?
Unique (%)35.2%

Sample

1st row서울특별시 노원구 덕릉로***길 **, 상가동 *층 ***-***호 (상계동, 성림아파트)
2nd row서울특별시 노원구 덕릉로***길 **, ***동 ***호 (상계동, 불암현대아파트)
3rd row서울특별시 노원구 동일로***길 **, ***동 ***호 (상계동, 상계주공*단지아파트)
4th row서울특별시 노원구 동일로 ****, ***동 ***호 (상계동, 상계주공*단지아파트)
5th row서울특별시 노원구 공릉로**나길 **-**, 야쿠르트 *층 ***호 (하계동)
ValueCountFrequency (%)
서울특별시 8739
13.4%
노원구 8728
13.4%
8659
13.3%
6924
 
10.6%
4661
 
7.2%
상계동 3284
 
5.0%
1906
 
2.9%
동일로***길 1663
 
2.6%
공릉동 1637
 
2.5%
중계동 1255
 
1.9%
Other values (2265) 17636
27.1%
2024-04-30T04:49:17.235802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 75539
20.9%
56366
 
15.6%
16959
 
4.7%
, 14621
 
4.0%
9908
 
2.7%
9778
 
2.7%
9705
 
2.7%
8927
 
2.5%
8926
 
2.5%
8877
 
2.5%
Other values (467) 141895
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 194308
53.8%
Other Punctuation 90197
25.0%
Space Separator 56366
 
15.6%
Open Punctuation 8783
 
2.4%
Close Punctuation 8783
 
2.4%
Dash Punctuation 1368
 
0.4%
Decimal Number 1036
 
0.3%
Uppercase Letter 573
 
0.2%
Lowercase Letter 76
 
< 0.1%
Math Symbol 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16959
 
8.7%
9908
 
5.1%
9778
 
5.0%
9705
 
5.0%
8927
 
4.6%
8926
 
4.6%
8877
 
4.6%
8831
 
4.5%
8781
 
4.5%
8750
 
4.5%
Other values (407) 94866
48.8%
Uppercase Letter
ValueCountFrequency (%)
B 179
31.2%
A 121
21.1%
S 53
 
9.2%
E 34
 
5.9%
D 24
 
4.2%
I 23
 
4.0%
K 21
 
3.7%
C 21
 
3.7%
T 16
 
2.8%
F 15
 
2.6%
Other values (14) 66
 
11.5%
Lowercase Letter
ValueCountFrequency (%)
n 24
31.6%
b 9
 
11.8%
c 8
 
10.5%
o 8
 
10.5%
e 5
 
6.6%
i 4
 
5.3%
t 3
 
3.9%
s 2
 
2.6%
h 2
 
2.6%
m 2
 
2.6%
Other values (5) 9
 
11.8%
Decimal Number
ValueCountFrequency (%)
1 247
23.8%
0 163
15.7%
2 160
15.4%
3 108
10.4%
4 105
10.1%
7 58
 
5.6%
5 57
 
5.5%
6 52
 
5.0%
9 47
 
4.5%
8 39
 
3.8%
Other Punctuation
ValueCountFrequency (%)
* 75539
83.7%
, 14621
 
16.2%
@ 20
 
< 0.1%
: 7
 
< 0.1%
. 6
 
< 0.1%
/ 4
 
< 0.1%
Space Separator
ValueCountFrequency (%)
56366
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8783
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8783
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1368
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 194307
53.8%
Common 166544
46.1%
Latin 649
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16959
 
8.7%
9908
 
5.1%
9778
 
5.0%
9705
 
5.0%
8927
 
4.6%
8926
 
4.6%
8877
 
4.6%
8831
 
4.5%
8781
 
4.5%
8750
 
4.5%
Other values (406) 94865
48.8%
Latin
ValueCountFrequency (%)
B 179
27.6%
A 121
18.6%
S 53
 
8.2%
E 34
 
5.2%
n 24
 
3.7%
D 24
 
3.7%
I 23
 
3.5%
K 21
 
3.2%
C 21
 
3.2%
T 16
 
2.5%
Other values (29) 133
20.5%
Common
ValueCountFrequency (%)
* 75539
45.4%
56366
33.8%
, 14621
 
8.8%
( 8783
 
5.3%
) 8783
 
5.3%
- 1368
 
0.8%
1 247
 
0.1%
0 163
 
0.1%
2 160
 
0.1%
3 108
 
0.1%
Other values (11) 406
 
0.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 194307
53.8%
ASCII 167193
46.2%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 75539
45.2%
56366
33.7%
, 14621
 
8.7%
( 8783
 
5.3%
) 8783
 
5.3%
- 1368
 
0.8%
1 247
 
0.1%
B 179
 
0.1%
0 163
 
0.1%
2 160
 
0.1%
Other values (50) 984
 
0.6%
Hangul
ValueCountFrequency (%)
16959
 
8.7%
9908
 
5.1%
9778
 
5.0%
9705
 
5.0%
8927
 
4.6%
8926
 
4.6%
8877
 
4.6%
8831
 
4.5%
8781
 
4.5%
8750
 
4.5%
Other values (406) 94865
48.8%
CJK
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct498
Distinct (%)6.4%
Missing2221
Missing (%)22.2%
Memory size156.2 KiB
2024-04-30T04:49:17.532802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.149248
Min length5

Characters and Unicode

Total characters40056
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)0.8%

Sample

1st row01634
2nd row01646
3rd row01764
4th row139757
5th row01809
ValueCountFrequency (%)
01634 141
 
1.8%
01849 135
 
1.7%
01909 97
 
1.2%
01695 94
 
1.2%
01913 88
 
1.1%
01675 80
 
1.0%
139200 80
 
1.0%
01663 78
 
1.0%
01690 74
 
1.0%
01776 71
 
0.9%
Other values (488) 6841
87.9%
2024-04-30T04:49:17.943025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 9458
23.6%
0 8391
20.9%
8 3927
9.8%
6 3891
9.7%
7 3776
 
9.4%
9 3139
 
7.8%
3 2719
 
6.8%
4 1698
 
4.2%
5 1587
 
4.0%
2 1451
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40037
> 99.9%
Dash Punctuation 19
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 9458
23.6%
0 8391
21.0%
8 3927
9.8%
6 3891
9.7%
7 3776
 
9.4%
9 3139
 
7.8%
3 2719
 
6.8%
4 1698
 
4.2%
5 1587
 
4.0%
2 1451
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40056
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 9458
23.6%
0 8391
20.9%
8 3927
9.8%
6 3891
9.7%
7 3776
 
9.4%
9 3139
 
7.8%
3 2719
 
6.8%
4 1698
 
4.2%
5 1587
 
4.0%
2 1451
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40056
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 9458
23.6%
0 8391
20.9%
8 3927
9.8%
6 3891
9.7%
7 3776
 
9.4%
9 3139
 
7.8%
3 2719
 
6.8%
4 1698
 
4.2%
5 1587
 
4.0%
2 1451
 
3.6%
Distinct9762
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T04:49:18.241557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length38
Mean length6.6826
Min length1

Characters and Unicode

Total characters66826
Distinct characters1111
Distinct categories14 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9546 ?
Unique (%)95.5%

Sample

1st row걸스 넘버원 (GIRLS NO.1)
2nd row베이글램
3rd row케이구제
4th row라멜스토리
5th row영지유통
ValueCountFrequency (%)
주식회사 353
 
2.8%
57
 
0.5%
company 35
 
0.3%
컴퍼니 34
 
0.3%
스튜디오 23
 
0.2%
22
 
0.2%
코리아 19
 
0.2%
co 19
 
0.2%
the 18
 
0.1%
16
 
0.1%
Other values (10987) 12068
95.3%
2024-04-30T04:49:18.674589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2665
 
4.0%
2473
 
3.7%
1974
 
3.0%
) 1872
 
2.8%
( 1869
 
2.8%
1100
 
1.6%
e 892
 
1.3%
867
 
1.3%
o 804
 
1.2%
780
 
1.2%
Other values (1101) 51530
77.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45599
68.2%
Lowercase Letter 7959
 
11.9%
Uppercase Letter 5828
 
8.7%
Space Separator 2665
 
4.0%
Close Punctuation 1874
 
2.8%
Open Punctuation 1871
 
2.8%
Decimal Number 566
 
0.8%
Other Punctuation 363
 
0.5%
Dash Punctuation 75
 
0.1%
Connector Punctuation 20
 
< 0.1%
Other values (4) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2473
 
5.4%
1974
 
4.3%
1100
 
2.4%
867
 
1.9%
780
 
1.7%
687
 
1.5%
673
 
1.5%
662
 
1.5%
629
 
1.4%
618
 
1.4%
Other values (1017) 35136
77.1%
Lowercase Letter
ValueCountFrequency (%)
e 892
 
11.2%
o 804
 
10.1%
a 711
 
8.9%
n 610
 
7.7%
i 567
 
7.1%
r 478
 
6.0%
l 459
 
5.8%
t 448
 
5.6%
s 403
 
5.1%
m 314
 
3.9%
Other values (16) 2273
28.6%
Uppercase Letter
ValueCountFrequency (%)
O 465
 
8.0%
A 455
 
7.8%
S 417
 
7.2%
E 389
 
6.7%
N 337
 
5.8%
M 332
 
5.7%
L 323
 
5.5%
I 313
 
5.4%
C 284
 
4.9%
T 281
 
4.8%
Other values (16) 2232
38.3%
Other Punctuation
ValueCountFrequency (%)
. 193
53.2%
& 92
25.3%
, 30
 
8.3%
' 17
 
4.7%
: 7
 
1.9%
? 7
 
1.9%
/ 7
 
1.9%
# 6
 
1.7%
2
 
0.6%
! 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 106
18.7%
1 104
18.4%
0 89
15.7%
3 50
8.8%
9 48
8.5%
4 43
7.6%
7 36
 
6.4%
5 35
 
6.2%
8 30
 
5.3%
6 25
 
4.4%
Close Punctuation
ValueCountFrequency (%)
) 1872
99.9%
] 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1869
99.9%
[ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
2665
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 20
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Other Symbol
ValueCountFrequency (%)
° 1
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45576
68.2%
Latin 13787
 
20.6%
Common 7440
 
11.1%
Han 18
 
< 0.1%
Hiragana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2473
 
5.4%
1974
 
4.3%
1100
 
2.4%
867
 
1.9%
780
 
1.7%
687
 
1.5%
673
 
1.5%
662
 
1.5%
629
 
1.4%
618
 
1.4%
Other values (996) 35113
77.0%
Latin
ValueCountFrequency (%)
e 892
 
6.5%
o 804
 
5.8%
a 711
 
5.2%
n 610
 
4.4%
i 567
 
4.1%
r 478
 
3.5%
O 465
 
3.4%
l 459
 
3.3%
A 455
 
3.3%
t 448
 
3.2%
Other values (42) 7898
57.3%
Common
ValueCountFrequency (%)
2665
35.8%
) 1872
25.2%
( 1869
25.1%
. 193
 
2.6%
2 106
 
1.4%
1 104
 
1.4%
& 92
 
1.2%
0 89
 
1.2%
- 75
 
1.0%
3 50
 
0.7%
Other values (22) 325
 
4.4%
Han
ValueCountFrequency (%)
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (7) 7
38.9%
Hiragana
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45576
68.2%
ASCII 21223
31.8%
CJK 18
 
< 0.1%
Hiragana 5
 
< 0.1%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2665
 
12.6%
) 1872
 
8.8%
( 1869
 
8.8%
e 892
 
4.2%
o 804
 
3.8%
a 711
 
3.4%
n 610
 
2.9%
i 567
 
2.7%
r 478
 
2.3%
O 465
 
2.2%
Other values (71) 10290
48.5%
Hangul
ValueCountFrequency (%)
2473
 
5.4%
1974
 
4.3%
1100
 
2.4%
867
 
1.9%
780
 
1.7%
687
 
1.5%
673
 
1.5%
662
 
1.5%
629
 
1.4%
618
 
1.4%
Other values (996) 35113
77.0%
Hiragana
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
CJK
ValueCountFrequency (%)
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (7) 7
38.9%
None
ValueCountFrequency (%)
2
50.0%
° 1
25.0%
1
25.0%
Distinct9879
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-03 11:14:10
Maximum2024-04-25 16:44:39
2024-04-30T04:49:18.794538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:18.926518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
6125 
U
3875 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowU
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 6125
61.3%
U 3875
38.8%

Length

2024-04-30T04:49:19.042548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:19.124563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 6125
61.3%
u 3875
38.8%
Distinct1410
Distinct (%)14.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:07:00
2024-04-30T04:49:19.208415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:19.329827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct440
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T04:49:19.487519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length9.1249
Min length1

Characters and Unicode

Total characters91249
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique260 ?
Unique (%)2.6%

Sample

1st row의류/패션/잡화/뷰티
2nd row의류/패션/잡화/뷰티
3rd row-
4th row종합몰
5th row-
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4013
28.8%
종합몰 3080
22.1%
기타 1848
13.3%
건강/식품 941
 
6.8%
교육/도서/완구/오락 809
 
5.8%
754
 
5.4%
컴퓨터/사무용품 577
 
4.1%
가구/수납용품 560
 
4.0%
가전 521
 
3.7%
자동차/자동차용품 335
 
2.4%
Other values (3) 475
 
3.4%
2024-04-30T04:49:19.777198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 17590
19.3%
4013
 
4.4%
4013
 
4.4%
4013
 
4.4%
4013
 
4.4%
4013
 
4.4%
4013
 
4.4%
4013
 
4.4%
4013
 
4.4%
3913
 
4.3%
Other values (41) 37642
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68992
75.6%
Other Punctuation 17590
 
19.3%
Space Separator 3913
 
4.3%
Dash Punctuation 754
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
3080
 
4.5%
3080
 
4.5%
Other values (38) 30728
44.5%
Other Punctuation
ValueCountFrequency (%)
/ 17590
100.0%
Space Separator
ValueCountFrequency (%)
3913
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 754
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68992
75.6%
Common 22257
 
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
3080
 
4.5%
3080
 
4.5%
Other values (38) 30728
44.5%
Common
ValueCountFrequency (%)
/ 17590
79.0%
3913
 
17.6%
- 754
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68992
75.6%
ASCII 22257
 
24.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 17590
79.0%
3913
 
17.6%
- 754
 
3.4%
Hangul
ValueCountFrequency (%)
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
4013
 
5.8%
3080
 
4.5%
3080
 
4.5%
Other values (38) 30728
44.5%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct2390
Distinct (%)26.4%
Missing933
Missing (%)9.3%
Infinite0
Infinite (%)0.0%
Mean205939.14
Minimum192272.4
Maximum209779.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:20.088474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum192272.4
5-th percentile204609.04
Q1205276.46
median205953.04
Q3206603.92
95-th percentile207261.78
Maximum209779.1
Range17506.7
Interquartile range (IQR)1327.4599

Descriptive statistics

Standard deviation862.86013
Coefficient of variation (CV)0.0041898792
Kurtosis6.8198901
Mean205939.14
Median Absolute Deviation (MAD)659.88948
Skewness-0.32335078
Sum1.8672502 × 109
Variance744527.6
MonotonicityNot monotonic
2024-04-30T04:49:20.226248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
205984.15211292 120
 
1.2%
204670.573678123 102
 
1.0%
205264.637477558 101
 
1.0%
204846.017230075 99
 
1.0%
205227.342971697 95
 
0.9%
205705.336876035 89
 
0.9%
206708.878294585 86
 
0.9%
205601.066015629 85
 
0.9%
204589.830805772 84
 
0.8%
205644.260618089 81
 
0.8%
Other values (2380) 8125
81.2%
(Missing) 933
 
9.3%
ValueCountFrequency (%)
192272.402458233 1
 
< 0.1%
200912.763957462 1
 
< 0.1%
203516.862893076 1
 
< 0.1%
203719.161728968 10
0.1%
203721.865206618 1
 
< 0.1%
203728.002090361 1
 
< 0.1%
203758.344662437 2
 
< 0.1%
203764.305574578 1
 
< 0.1%
203764.913539913 1
 
< 0.1%
203768.475646501 1
 
< 0.1%
ValueCountFrequency (%)
209779.102480245 2
 
< 0.1%
209752.492743427 1
 
< 0.1%
209696.960166997 9
0.1%
209358.946417968 1
 
< 0.1%
209315.772806876 1
 
< 0.1%
209288.472624641 6
0.1%
209234.634090964 1
 
< 0.1%
209221.923150049 3
 
< 0.1%
208567.645570191 1
 
< 0.1%
208347.884966182 2
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct2385
Distinct (%)26.3%
Missing933
Missing (%)9.3%
Infinite0
Infinite (%)0.0%
Mean460399.3
Minimum449334.17
Maximum465253.15
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:20.358815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum449334.17
5-th percentile457417.73
Q1458381.27
median460516.39
Q3462143.75
95-th percentile463593.96
Maximum465253.15
Range15918.982
Interquartile range (IQR)3762.483

Descriptive statistics

Standard deviation2070.3262
Coefficient of variation (CV)0.0044968056
Kurtosis-1.0845493
Mean460399.3
Median Absolute Deviation (MAD)1852.8715
Skewness0.034756177
Sum4.1744405 × 109
Variance4286250.4
MonotonicityNot monotonic
2024-04-30T04:49:20.490470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
457275.799282625 120
 
1.2%
462933.316362013 102
 
1.0%
461827.579520796 101
 
1.0%
462293.140879558 99
 
1.0%
462298.014627148 95
 
0.9%
458028.977530331 89
 
0.9%
461058.836528245 88
 
0.9%
457477.154924921 86
 
0.9%
459233.675551946 85
 
0.9%
461270.63619789 81
 
0.8%
Other values (2375) 8121
81.2%
(Missing) 933
 
9.3%
ValueCountFrequency (%)
449334.170317351 1
 
< 0.1%
450078.855010335 1
 
< 0.1%
452706.897879436 1
 
< 0.1%
455127.209244063 1
 
< 0.1%
456908.206379034 1
 
< 0.1%
456916.117836191 1
 
< 0.1%
456962.088212392 2
< 0.1%
456968.030446131 1
 
< 0.1%
456990.07335337 3
< 0.1%
457003.886641127 1
 
< 0.1%
ValueCountFrequency (%)
465253.152004 1
 
< 0.1%
465103.755134816 12
0.1%
465011.262691591 3
 
< 0.1%
464959.058464501 1
 
< 0.1%
464947.538929422 1
 
< 0.1%
464922.213107238 14
0.1%
464849.968985063 4
 
< 0.1%
464650.949131666 1
 
< 0.1%
464636.126839194 1
 
< 0.1%
464589.376201942 2
 
< 0.1%

자산규모
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8828 
0
1172 

Length

Max length4
Median length4
Mean length3.6484
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8828
88.3%
0 1172
 
11.7%

Length

2024-04-30T04:49:20.608037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:20.683285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8828
88.3%
0 1172
 
11.7%

부채총액
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8828 
0
1172 

Length

Max length4
Median length4
Mean length3.6484
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8828
88.3%
0 1172
 
11.7%

Length

2024-04-30T04:49:20.766794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:20.841431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8828
88.3%
0 1172
 
11.7%

자본금
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8828 
0
1172 

Length

Max length4
Median length4
Mean length3.6484
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8828
88.3%
0 1172
 
11.7%

Length

2024-04-30T04:49:20.924331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:21.008662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8828
88.3%
0 1172
 
11.7%

판매방식명
Categorical

IMBALANCE 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5654 
인터넷
4124 
인터넷, 기타
 
91
TV홈쇼핑, 인터넷
 
27
기타
 
20
Other values (15)
 
84

Length

Max length26
Median length4
Mean length3.7239
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5654
56.5%
인터넷 4124
41.2%
인터넷, 기타 91
 
0.9%
TV홈쇼핑, 인터넷 27
 
0.3%
기타 20
 
0.2%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 15
 
0.1%
인터넷, 카다로그 13
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 10
 
0.1%
TV홈쇼핑 8
 
0.1%
인터넷, 카다로그, 신문잡지, 기타 7
 
0.1%
Other values (10) 31
 
0.3%

Length

2024-04-30T04:49:21.103289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5654
54.9%
인터넷 4316
41.9%
기타 147
 
1.4%
tv홈쇼핑 73
 
0.7%
카다로그 62
 
0.6%
신문잡지 44
 
0.4%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
169633100000202231001843020095720220526<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 상계동 **** 성림아파트서울특별시 노원구 덕릉로***길 **, 상가동 *층 ***-***호 (상계동, 성림아파트)01634걸스 넘버원 (GIRLS NO.1)2022-05-26 15:25:34I2021-12-04 22:08:00.0의류/패션/잡화/뷰티207098.659537463287.582708<NA><NA><NA><NA>
138373100000202031001843020211320201125<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 상계동 **** 불암현대아파트서울특별시 노원구 덕릉로***길 **, ***동 ***호 (상계동, 불암현대아파트)01646베이글램2022-10-26 17:19:56U2021-10-30 22:08:00.0의류/패션/잡화/뷰티207024.694846462864.131706<NA><NA><NA><NA>
9423100000200631001093020191620060222<NA>1영업/정상1정상영업<NA><NA><NA><NA>02936 4211<NA><NA>서울시 노원구 상계동***-* 거성빌라C동 ***호<NA><NA>케이구제2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
178103100000202231001843020180720221019<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 상계동 *** 상계주공*단지아파트서울특별시 노원구 동일로***길 **, ***동 ***호 (상계동, 상계주공*단지아파트)01764라멜스토리2022-10-19 13:06:49I2021-10-30 22:01:00.0종합몰205094.69721460821.487932<NA><NA><NA><NA>
8043100000200531001093020175220051130<NA>1영업/정상1정상영업<NA><NA><NA><NA>0262230660<NA><NA>서울시 노원구 하계동*-* 청솔@***-***<NA><NA>영지유통2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
67223100000201431001843020081320141202<NA>3폐업3폐업처리20210610<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 동일로 ****, ***동 ***호 (상계동, 상계주공*단지아파트)139757핑거프랜드 (FIF)2021-06-09 16:22:00I2021-12-03 22:02:00.0건강/식품 성인/성인용품205264.637478461827.579521<NA><NA><NA><NA>
122003100000202031001843020038020200304<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 하계동 ***번지 **호 천지하우스서울특별시 노원구 공릉로**나길 **-**, 야쿠르트 *층 ***호 (하계동)01809연어킹2020-03-04 10:13:34I2020-03-06 00:23:23.0종합몰206470.3304459334.606994<NA><NA><NA>인터넷
38713100000201031001563020069220101209<NA>3폐업3폐업처리20110422<NA><NA><NA>02-932-5200<NA>139200서울특별시 노원구 상계동 ***번지 *호 현대빌딩 ***서울특별시 노원구 동일로 **** (상계동,현대빌딩 ***)<NA>코코호도 노원점2011-04-22 16:24:40I2018-08-31 23:59:59.0건강/식품205324.316308460953.875995<NA><NA><NA>인터넷
18423100000200731001093020306020071017<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>3296 1180<NA><NA>서울특별시 노원구 공릉동 *** 번지 **호 연승빌딩 ***호서울특별시 노원구 동일로***가길 ** (공릉동,연승빌딩 ***호)<NA>(주)미단라임2022-10-25 11:43:28U2021-10-30 22:07:00.0의류/패션/잡화/뷰티206321.030385457731.027184<NA><NA><NA><NA>
18786310000020233100184302006862023-03-24<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 상계동 *** 보람아파트*단지서울특별시 노원구 한글비석로 ***, ***동 ****호 (상계동, 보람아파트*단지)01671꽃맘플라워2023-03-24 11:29:59I2022-12-02 22:06:00.0종합몰205660.986625462645.285472<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
24363100000200831001093020051320081007<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-7582-2051<NA><NA><NA>서울특별시 노원구 광운로*나길 **, *동 ***호 (월계동, 동신아파트)01900캠핑하이몰2017-02-27 10:23:12I2018-08-31 23:59:59.0레져/여행/공연205349.717862457258.26775<NA><NA><NA>인터넷
152193100000202131001843020130320190513<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 상계동 *** 상계주공**단지아파트서울특별시 노원구 한글비석로 ***, ****동 ****호 (상계동, 상계주공**단지아파트)01673오비홀딩스2021-06-28 14:38:26I2021-12-03 22:02:00.0컴퓨터/사무용품 기타205164.287941462643.124806<NA><NA><NA><NA>
86873100000201731001843020033120170404<NA>3폐업3폐업처리20180808<NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 중계로 ***, ***동 ***호 (중계동, 중계*단지주공아파트)01713푸른(blue)2018-08-13 10:05:14I2021-12-03 22:02:00.0의류/패션/잡화/뷰티 건강/식품206675.0968461337.048251<NA><NA><NA><NA>
10433100000200631001093020204420060419<NA>3폐업3폐업처리20100302<NA><NA><NA>02936 2070<NA><NA>서울시 노원구 상계동****-**<NA><NA>스카이테크2012-06-21 17:16:20I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
89253100000201731001843020060820170704<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 노원구 공릉로 *** (공릉동, 서울과학기술대학 제*창업보육센터)01811주식회사 로휠(RoWheel Inc.)2022-11-02 11:02:38U2021-11-01 00:04:00.0교육/도서/완구/오락 가전207169.585532459519.457804<NA><NA><NA><NA>
58303100000201331001843020050420130809<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>1688-3281<NA><NA><NA>서울특별시 노원구 동일로***가길 **, ***동 ***호 (상계동, 은빛아파트)139891핑크홀릭 룸2021-12-03 21:52:53U2021-12-07 02:40:00.0의류/패션/잡화/뷰티204650.437591464174.15842000인터넷
40763100000201131001843020017420110310<NA>3폐업3폐업처리20120224<NA><NA><NA>02-937-4141<NA>139200서울특별시 노원구 상계동 ***번지 *호 동익빌딩 *층서울특별시 노원구 상계로 ** (상계동,동익빌딩 *층)<NA>노신사2012-08-28 09:31:47I2018-08-31 23:59:59.0의류/패션/잡화/뷰티205356.625413461535.575794<NA><NA><NA>인터넷
29413100000200931001093020040020090713<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA>139848서울특별시 노원구 월계*동 ***번지 *호 **통 *반<NA><NA>썬텔레콤2015-02-10 14:24:36I2018-08-31 23:59:59.0기타204616.273335459746.873731<NA><NA><NA>인터넷
25763100000200931001093020001320090109<NA>1영업/정상1정상영업<NA><NA><NA><NA>0505 829 9797<NA>139050서울특별시 노원구 월계동 ***번지 **통 *반 건양노블레스아파트 ****호서울특별시 노원구 화랑로**길 **-**, ****호 (월계동,건양노블레스아파트)<NA>엄지공주2009-01-09 14:01:04I2018-08-31 23:59:59.0기타205796.70716457209.345176<NA><NA><NA>인터넷
14381310000020213100184302004242021-02-19<NA>3폐업3폐업처리2024-01-12<NA><NA><NA><NA><NA><NA>서울특별시 노원구 상계동 ****-*서울특별시 노원구 동일로***바길 **-*, 지하*층, *층 (상계동)01627카페71792024-01-11 15:38:02U2023-11-30 23:03:00.0기타205303.555782463551.180596<NA><NA><NA><NA>