Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells67283
Missing cells (%)23.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory250.0 B

Variable types

Categorical10
Numeric5
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author성북구
URLhttps://data.seoul.go.kr/dataList/OA-18809/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 is highly imbalanced (97.5%)Imbalance
자산규모 is highly imbalanced (67.8%)Imbalance
부채총액 is highly imbalanced (67.8%)Imbalance
자본금 is highly imbalanced (67.8%)Imbalance
판매방식명 is highly imbalanced (72.2%)Imbalance
폐업일자 has 6394 (63.9%) missing valuesMissing
휴업시작일자 has 9957 (99.6%) missing valuesMissing
휴업종료일자 has 9957 (99.6%) missing valuesMissing
재개업일자 has 9981 (99.8%) missing valuesMissing
전화번호 has 3269 (32.7%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8662 (86.6%) missing valuesMissing
지번주소 has 1969 (19.7%) missing valuesMissing
도로명주소 has 2489 (24.9%) missing valuesMissing
도로명우편번호 has 2491 (24.9%) missing valuesMissing
좌표정보(X) has 1057 (10.6%) missing valuesMissing
좌표정보(Y) has 1057 (10.6%) missing valuesMissing
좌표정보(Y) is highly skewed (γ1 = -51.54678945)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 09:52:19.357289
Analysis finished2024-04-06 09:52:23.234880
Duration3.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3070000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3070000
2nd row3070000
3rd row3070000
4th row3070000
5th row3070000

Common Values

ValueCountFrequency (%)
3070000 10000
100.0%

Length

2024-04-06T18:52:23.387805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:23.542223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3070000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0163613 × 1018
Minimum1.997307 × 1018
Maximum2.024307 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T18:52:23.947429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.997307 × 1018
5-th percentile2.006307 × 1018
Q12.011307 × 1018
median2.018307 × 1018
Q32.021307 × 1018
95-th percentile2.023307 × 1018
Maximum2.024307 × 1018
Range2.7000016 × 1016
Interquartile range (IQR)1.000001 × 1016

Descriptive statistics

Standard deviation5.8804487 × 1015
Coefficient of variation (CV)0.0029163665
Kurtosis-0.79438112
Mean2.0163613 × 1018
Median Absolute Deviation (MAD)4.000007 × 1015
Skewness-0.61176091
Sum1.3219524 × 1018
Variance3.4579676 × 1031
MonotonicityNot monotonic
2024-04-06T18:52:24.272754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2020307028630201273 1
 
< 0.1%
2024307029930200277 1
 
< 0.1%
2022307028630200048 1
 
< 0.1%
2011307018930200936 1
 
< 0.1%
2017307021630201151 1
 
< 0.1%
2022307028630201102 1
 
< 0.1%
2011307018930200618 1
 
< 0.1%
2023307029930200049 1
 
< 0.1%
2024307029930200211 1
 
< 0.1%
2007307013430200025 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1997307013430201076 1
< 0.1%
1997307013430201077 1
< 0.1%
1998307013430201305 1
< 0.1%
1998307013430201367 1
< 0.1%
1999307013430202470 1
< 0.1%
2000307013430202964 1
< 0.1%
2000307013430203381 1
< 0.1%
2000307013430204093 1
< 0.1%
2000307013430204429 1
< 0.1%
2001307013430204659 1
< 0.1%
ValueCountFrequency (%)
2024307029930200508 1
< 0.1%
2024307029930200507 1
< 0.1%
2024307029930200504 1
< 0.1%
2024307029930200497 1
< 0.1%
2024307029930200496 1
< 0.1%
2024307029930200492 1
< 0.1%
2024307029930200491 1
< 0.1%
2024307029930200490 1
< 0.1%
2024307029930200489 1
< 0.1%
2024307029930200485 1
< 0.1%
Distinct3891
Distinct (%)38.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1997-12-27 00:00:00
Maximum2024-04-03 00:00:00
2024-04-06T18:52:24.626488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:24.962222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Categorical

IMBALANCE 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9911 
20080618
 
54
20080731
 
16
20080722
 
6
20200625
 
4
Other values (7)
 
9

Length

Max length10
Median length4
Mean length4.0358
Min length4

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9911
99.1%
20080618 54
 
0.5%
20080731 16
 
0.2%
20080722 6
 
0.1%
20200625 4
 
< 0.1%
20200929 3
 
< 0.1%
20081217 1
 
< 0.1%
20200727 1
 
< 0.1%
2023-02-08 1
 
< 0.1%
20201224 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Length

2024-04-06T18:52:25.258090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 9911
99.1%
20080618 54
 
0.5%
20080731 16
 
0.2%
20080722 6
 
0.1%
20200625 4
 
< 0.1%
20200929 3
 
< 0.1%
20081217 1
 
< 0.1%
20200727 1
 
< 0.1%
2023-02-08 1
 
< 0.1%
20201224 1
 
< 0.1%
Other values (2) 2
 
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4724 
3
3167 
4
1639 
5
 
439
2
 
31

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row4
3rd row5
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 4724
47.2%
3 3167
31.7%
4 1639
 
16.4%
5 439
 
4.4%
2 31
 
0.3%

Length

2024-04-06T18:52:25.529080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:25.779483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4724
47.2%
3 3167
31.7%
4 1639
 
16.4%
5 439
 
4.4%
2 31
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4724 
폐업
3167 
취소/말소/만료/정지/중지
1639 
제외/삭제/전출
 
439
휴업
 
31

Length

Max length14
Median length8
Mean length5.6474
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row취소/말소/만료/정지/중지
3rd row제외/삭제/전출
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 4724
47.2%
폐업 3167
31.7%
취소/말소/만료/정지/중지 1639
 
16.4%
제외/삭제/전출 439
 
4.4%
휴업 31
 
0.3%

Length

2024-04-06T18:52:25.979939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:26.151035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4724
47.2%
폐업 3167
31.7%
취소/말소/만료/정지/중지 1639
 
16.4%
제외/삭제/전출 439
 
4.4%
휴업 31
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.767
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T18:52:26.333911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.1199907
Coefficient of variation (CV)0.76616938
Kurtosis-0.26159892
Mean2.767
Median Absolute Deviation (MAD)2
Skewness1.0093174
Sum27670
Variance4.4943604
MonotonicityNot monotonic
2024-04-06T18:52:26.528645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 4724
47.2%
3 3167
31.7%
7 1544
 
15.4%
5 439
 
4.4%
4 95
 
0.9%
2 31
 
0.3%
ValueCountFrequency (%)
1 4724
47.2%
2 31
 
0.3%
3 3167
31.7%
4 95
 
0.9%
5 439
 
4.4%
7 1544
 
15.4%
ValueCountFrequency (%)
7 1544
 
15.4%
5 439
 
4.4%
4 95
 
0.9%
3 3167
31.7%
2 31
 
0.3%
1 4724
47.2%
Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4724 
폐업처리
3167 
직권말소
1544 
타시군구이관
 
439
직권취소
 
95

Length

Max length6
Median length4
Mean length4.0878
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row직권말소
3rd row타시군구이관
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 4724
47.2%
폐업처리 3167
31.7%
직권말소 1544
 
15.4%
타시군구이관 439
 
4.4%
직권취소 95
 
0.9%
휴업처리 31
 
0.3%

Length

2024-04-06T18:52:26.766742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:26.955162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4724
47.2%
폐업처리 3167
31.7%
직권말소 1544
 
15.4%
타시군구이관 439
 
4.4%
직권취소 95
 
0.9%
휴업처리 31
 
0.3%

폐업일자
Date

MISSING 

Distinct1933
Distinct (%)53.6%
Missing6394
Missing (%)63.9%
Memory size156.2 KiB
Minimum2007-03-06 00:00:00
Maximum2024-04-26 00:00:00
2024-04-06T18:52:27.157481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:27.429113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct43
Distinct (%)100.0%
Missing9957
Missing (%)99.6%
Memory size156.2 KiB
Minimum2007-05-16 00:00:00
Maximum2024-02-06 00:00:00
2024-04-06T18:52:27.678392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:27.905489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)

휴업종료일자
Date

MISSING 

Distinct39
Distinct (%)90.7%
Missing9957
Missing (%)99.6%
Memory size156.2 KiB
Minimum2007-09-16 00:00:00
Maximum2032-12-31 00:00:00
2024-04-06T18:52:28.154305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:28.404134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)

재개업일자
Date

MISSING 

Distinct17
Distinct (%)89.5%
Missing9981
Missing (%)99.8%
Memory size156.2 KiB
Minimum2007-09-27 00:00:00
Maximum2024-03-18 00:00:00
2024-04-06T18:52:28.615512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:28.846689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)

전화번호
Text

MISSING 

Distinct3295
Distinct (%)49.0%
Missing3269
Missing (%)32.7%
Memory size156.2 KiB
2024-04-06T18:52:29.188132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length6.83316
Min length1

Characters and Unicode

Total characters45994
Distinct characters17
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3229 ?
Unique (%)48.0%

Sample

1st row02-
2nd row029170923
3rd row02-6247-7400
4th row3296-3854
5th row02-
ValueCountFrequency (%)
02 3397
48.9%
115
 
1.7%
070 18
 
0.3%
8064 7
 
0.1%
921 3
 
< 0.1%
02-928-8146 3
 
< 0.1%
070-8744-1468 3
 
< 0.1%
02-591-3871 3
 
< 0.1%
7954 3
 
< 0.1%
029271006 2
 
< 0.1%
Other values (3325) 3390
48.8%
2024-04-06T18:52:30.158964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9245
20.1%
2 8724
19.0%
- 7528
16.4%
9 3525
 
7.7%
7 2942
 
6.4%
1 2587
 
5.6%
4 2378
 
5.2%
6 2240
 
4.9%
5 2230
 
4.8%
3 2202
 
4.8%
Other values (7) 2393
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38225
83.1%
Dash Punctuation 7528
 
16.4%
Space Separator 220
 
0.5%
Other Punctuation 14
 
< 0.1%
Math Symbol 5
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9245
24.2%
2 8724
22.8%
9 3525
 
9.2%
7 2942
 
7.7%
1 2587
 
6.8%
4 2378
 
6.2%
6 2240
 
5.9%
5 2230
 
5.8%
3 2202
 
5.8%
8 2152
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 12
85.7%
, 1
 
7.1%
/ 1
 
7.1%
Dash Punctuation
ValueCountFrequency (%)
- 7528
100.0%
Space Separator
ValueCountFrequency (%)
220
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 45994
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9245
20.1%
2 8724
19.0%
- 7528
16.4%
9 3525
 
7.7%
7 2942
 
6.4%
1 2587
 
5.6%
4 2378
 
5.2%
6 2240
 
4.9%
5 2230
 
4.8%
3 2202
 
4.8%
Other values (7) 2393
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45994
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9245
20.1%
2 8724
19.0%
- 7528
16.4%
9 3525
 
7.7%
7 2942
 
6.4%
1 2587
 
5.6%
4 2378
 
5.2%
6 2240
 
4.9%
5 2230
 
4.8%
3 2202
 
4.8%
Other values (7) 2393
 
5.2%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct190
Distinct (%)14.2%
Missing8662
Missing (%)86.6%
Infinite0
Infinite (%)0.0%
Mean137310.42
Minimum100400
Maximum487823
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T18:52:30.455612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100400
5-th percentile136034
Q1136075
median136130
Q3136817
95-th percentile136865
Maximum487823
Range387423
Interquartile range (IQR)742

Descriptive statistics

Standard deviation17616.864
Coefficient of variation (CV)0.12829954
Kurtosis323.90329
Mean137310.42
Median Absolute Deviation (MAD)86
Skewness17.598846
Sum1.8372134 × 108
Variance3.1035389 × 108
MonotonicityNot monotonic
2024-04-06T18:52:30.724356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
136130 53
 
0.5%
136100 50
 
0.5%
136110 47
 
0.5%
136140 43
 
0.4%
136090 36
 
0.4%
136150 32
 
0.3%
136053 30
 
0.3%
136101 29
 
0.3%
136120 26
 
0.3%
136087 26
 
0.3%
Other values (180) 966
 
9.7%
(Missing) 8662
86.6%
ValueCountFrequency (%)
100400 1
< 0.1%
110524 1
< 0.1%
110771 1
< 0.1%
110826 1
< 0.1%
110847 1
< 0.1%
120100 1
< 0.1%
130861 1
< 0.1%
133030 1
< 0.1%
133050 1
< 0.1%
133170 1
< 0.1%
ValueCountFrequency (%)
487823 1
< 0.1%
476822 1
< 0.1%
464873 1
< 0.1%
325852 1
< 0.1%
302748 1
< 0.1%
158070 1
< 0.1%
152050 1
< 0.1%
143200 1
< 0.1%
139810 1
< 0.1%
138200 1
< 0.1%

지번주소
Text

MISSING 

Distinct3677
Distinct (%)45.8%
Missing1969
Missing (%)19.7%
Memory size156.2 KiB
2024-04-06T18:52:31.371241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length46
Mean length27.46831
Min length13

Characters and Unicode

Total characters220598
Distinct characters482
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2707 ?
Unique (%)33.7%

Sample

1st row서울특별시 성북구 성북동 ***-** 금강주택
2nd row서울특별시 성북구 장위동***-*** 해주빌라트 B-***
3rd row서울특별시 성북구 동소문동*가 ***번지
4th row서울특별시 성북구 안암동*가 *** 래미안 안암
5th row서울특별시 성북구 정릉동 ***-* 정릉*동새마을금고
ValueCountFrequency (%)
서울특별시 8022
18.4%
성북구 7999
18.3%
4231
 
9.7%
3593
 
8.2%
번지 3485
 
8.0%
정릉동 1181
 
2.7%
970
 
2.2%
장위동 960
 
2.2%
847
 
1.9%
하월곡동 693
 
1.6%
Other values (1929) 11731
26.8%
2024-04-06T18:52:32.210953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 47381
21.5%
35971
16.3%
10407
 
4.7%
8658
 
3.9%
8402
 
3.8%
8134
 
3.7%
8071
 
3.7%
8064
 
3.7%
8040
 
3.6%
8023
 
3.6%
Other values (472) 69447
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 132538
60.1%
Other Punctuation 47503
 
21.5%
Space Separator 35971
 
16.3%
Dash Punctuation 3235
 
1.5%
Decimal Number 562
 
0.3%
Uppercase Letter 452
 
0.2%
Close Punctuation 121
 
0.1%
Open Punctuation 117
 
0.1%
Lowercase Letter 90
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10407
 
7.9%
8658
 
6.5%
8402
 
6.3%
8134
 
6.1%
8071
 
6.1%
8064
 
6.1%
8040
 
6.1%
8023
 
6.1%
8022
 
6.1%
4548
 
3.4%
Other values (406) 52169
39.4%
Uppercase Letter
ValueCountFrequency (%)
B 118
26.1%
S 53
11.7%
K 42
 
9.3%
H 40
 
8.8%
A 34
 
7.5%
I 26
 
5.8%
V 21
 
4.6%
E 21
 
4.6%
T 15
 
3.3%
W 15
 
3.3%
Other values (15) 67
14.8%
Lowercase Letter
ValueCountFrequency (%)
e 34
37.8%
i 8
 
8.9%
r 6
 
6.7%
w 6
 
6.7%
o 5
 
5.6%
b 5
 
5.6%
l 5
 
5.6%
u 4
 
4.4%
m 3
 
3.3%
s 2
 
2.2%
Other values (8) 12
 
13.3%
Decimal Number
ValueCountFrequency (%)
1 129
23.0%
2 93
16.5%
0 83
14.8%
3 62
11.0%
4 48
 
8.5%
6 38
 
6.8%
8 30
 
5.3%
5 29
 
5.2%
7 27
 
4.8%
9 23
 
4.1%
Other Punctuation
ValueCountFrequency (%)
* 47381
99.7%
@ 65
 
0.1%
, 40
 
0.1%
? 8
 
< 0.1%
. 5
 
< 0.1%
/ 4
 
< 0.1%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
35971
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3235
100.0%
Close Punctuation
ValueCountFrequency (%)
) 121
100.0%
Open Punctuation
ValueCountFrequency (%)
( 117
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132531
60.1%
Common 87514
39.7%
Latin 546
 
0.2%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10407
 
7.9%
8658
 
6.5%
8402
 
6.3%
8134
 
6.1%
8071
 
6.1%
8064
 
6.1%
8040
 
6.1%
8023
 
6.1%
8022
 
6.1%
4548
 
3.4%
Other values (401) 52162
39.4%
Latin
ValueCountFrequency (%)
B 118
21.6%
S 53
 
9.7%
K 42
 
7.7%
H 40
 
7.3%
e 34
 
6.2%
A 34
 
6.2%
I 26
 
4.8%
V 21
 
3.8%
E 21
 
3.8%
T 15
 
2.7%
Other values (35) 142
26.0%
Common
ValueCountFrequency (%)
* 47381
54.1%
35971
41.1%
- 3235
 
3.7%
1 129
 
0.1%
) 121
 
0.1%
( 117
 
0.1%
2 93
 
0.1%
0 83
 
0.1%
@ 65
 
0.1%
3 62
 
0.1%
Other values (11) 257
 
0.3%
Han
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132531
60.1%
ASCII 88056
39.9%
CJK 7
 
< 0.1%
Number Forms 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 47381
53.8%
35971
40.9%
- 3235
 
3.7%
1 129
 
0.1%
) 121
 
0.1%
B 118
 
0.1%
( 117
 
0.1%
2 93
 
0.1%
0 83
 
0.1%
@ 65
 
0.1%
Other values (54) 743
 
0.8%
Hangul
ValueCountFrequency (%)
10407
 
7.9%
8658
 
6.5%
8402
 
6.3%
8134
 
6.1%
8071
 
6.1%
8064
 
6.1%
8040
 
6.1%
8023
 
6.1%
8022
 
6.1%
4548
 
3.4%
Other values (401) 52162
39.4%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%

도로명주소
Text

MISSING 

Distinct4372
Distinct (%)58.2%
Missing2489
Missing (%)24.9%
Memory size156.2 KiB
2024-04-06T18:52:32.687965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length51
Mean length37.5227
Min length20

Characters and Unicode

Total characters281833
Distinct characters500
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3372 ?
Unique (%)44.9%

Sample

1st row서울특별시 성북구 성북로 **-**, ***호 (성북동, 금강주택)
2nd row서울특별시 성북구 동소문로*길 *-**, *층 (동소문동*가)
3rd row서울특별시 성북구 고려대로**가길 **, ***동 ****호 (안암동*가, 래미안 안암)
4th row서울특별시 성북구 보국문로 *, 정릉*동새마을금고 *층 (정릉동)
5th row서울특별시 성북구 숭인로*길 **, ***동 ****호 (길음동, 롯데캐슬 클라시아)
ValueCountFrequency (%)
서울특별시 7511
14.4%
성북구 7492
14.4%
7283
14.0%
4505
 
8.7%
2236
 
4.3%
2159
 
4.1%
정릉동 1153
 
2.2%
장위동 897
 
1.7%
하월곡동 645
 
1.2%
종암동 574
 
1.1%
Other values (2246) 17600
33.8%
2024-04-06T18:52:33.361995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 53547
19.0%
44592
 
15.8%
12059
 
4.3%
, 9924
 
3.5%
8572
 
3.0%
8555
 
3.0%
7699
 
2.7%
7619
 
2.7%
) 7539
 
2.7%
( 7538
 
2.7%
Other values (490) 114189
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 155282
55.1%
Other Punctuation 63498
22.5%
Space Separator 44592
 
15.8%
Close Punctuation 7539
 
2.7%
Open Punctuation 7538
 
2.7%
Dash Punctuation 1885
 
0.7%
Decimal Number 777
 
0.3%
Uppercase Letter 612
 
0.2%
Lowercase Letter 102
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12059
 
7.8%
8572
 
5.5%
8555
 
5.5%
7699
 
5.0%
7619
 
4.9%
7536
 
4.9%
7533
 
4.9%
7512
 
4.8%
7511
 
4.8%
7443
 
4.8%
Other values (425) 73243
47.2%
Uppercase Letter
ValueCountFrequency (%)
B 194
31.7%
A 83
13.6%
S 53
 
8.7%
H 44
 
7.2%
K 43
 
7.0%
E 27
 
4.4%
C 26
 
4.2%
I 26
 
4.2%
V 25
 
4.1%
W 18
 
2.9%
Other values (15) 73
 
11.9%
Lowercase Letter
ValueCountFrequency (%)
e 43
42.2%
b 12
 
11.8%
i 7
 
6.9%
l 6
 
5.9%
r 6
 
5.9%
w 6
 
5.9%
o 4
 
3.9%
m 3
 
2.9%
u 3
 
2.9%
t 3
 
2.9%
Other values (7) 9
 
8.8%
Decimal Number
ValueCountFrequency (%)
1 174
22.4%
2 141
18.1%
0 129
16.6%
3 77
9.9%
4 64
 
8.2%
5 49
 
6.3%
8 40
 
5.1%
6 40
 
5.1%
7 33
 
4.2%
9 30
 
3.9%
Other Punctuation
ValueCountFrequency (%)
* 53547
84.3%
, 9924
 
15.6%
? 9
 
< 0.1%
. 9
 
< 0.1%
/ 6
 
< 0.1%
@ 3
 
< 0.1%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
44592
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7539
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7538
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1885
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 155274
55.1%
Common 125833
44.6%
Latin 718
 
0.3%
Han 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12059
 
7.8%
8572
 
5.5%
8555
 
5.5%
7699
 
5.0%
7619
 
4.9%
7536
 
4.9%
7533
 
4.9%
7512
 
4.8%
7511
 
4.8%
7443
 
4.8%
Other values (419) 73235
47.2%
Latin
ValueCountFrequency (%)
B 194
27.0%
A 83
11.6%
S 53
 
7.4%
H 44
 
6.1%
e 43
 
6.0%
K 43
 
6.0%
E 27
 
3.8%
C 26
 
3.6%
I 26
 
3.6%
V 25
 
3.5%
Other values (34) 154
21.4%
Common
ValueCountFrequency (%)
* 53547
42.6%
44592
35.4%
, 9924
 
7.9%
) 7539
 
6.0%
( 7538
 
6.0%
- 1885
 
1.5%
1 174
 
0.1%
2 141
 
0.1%
0 129
 
0.1%
3 77
 
0.1%
Other values (11) 287
 
0.2%
Han
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 155274
55.1%
ASCII 126547
44.9%
CJK 8
 
< 0.1%
Number Forms 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 53547
42.3%
44592
35.2%
, 9924
 
7.8%
) 7539
 
6.0%
( 7538
 
6.0%
- 1885
 
1.5%
B 194
 
0.2%
1 174
 
0.1%
2 141
 
0.1%
0 129
 
0.1%
Other values (53) 884
 
0.7%
Hangul
ValueCountFrequency (%)
12059
 
7.8%
8572
 
5.5%
8555
 
5.5%
7699
 
5.0%
7619
 
4.9%
7536
 
4.9%
7533
 
4.9%
7512
 
4.8%
7511
 
4.8%
7443
 
4.8%
Other values (419) 73235
47.2%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

도로명우편번호
Text

MISSING 

Distinct375
Distinct (%)5.0%
Missing2491
Missing (%)24.9%
Memory size156.2 KiB
2024-04-06T18:52:33.949798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1566121
Min length5

Characters and Unicode

Total characters38721
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)0.7%

Sample

1st row02835
2nd row02860
3rd row02843
4th row02718
5th row02730
ValueCountFrequency (%)
02831 132
 
1.8%
02829 114
 
1.5%
02826 102
 
1.4%
02845 101
 
1.3%
02872 91
 
1.2%
02832 87
 
1.2%
02844 86
 
1.1%
02781 85
 
1.1%
02754 84
 
1.1%
02880 81
 
1.1%
Other values (365) 6546
87.2%
2024-04-06T18:52:34.938392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8481
21.9%
2 7956
20.5%
7 4677
12.1%
8 4636
12.0%
1 3289
 
8.5%
3 2829
 
7.3%
6 2416
 
6.2%
5 1676
 
4.3%
4 1667
 
4.3%
9 1070
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38697
99.9%
Dash Punctuation 24
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8481
21.9%
2 7956
20.6%
7 4677
12.1%
8 4636
12.0%
1 3289
 
8.5%
3 2829
 
7.3%
6 2416
 
6.2%
5 1676
 
4.3%
4 1667
 
4.3%
9 1070
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38721
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8481
21.9%
2 7956
20.5%
7 4677
12.1%
8 4636
12.0%
1 3289
 
8.5%
3 2829
 
7.3%
6 2416
 
6.2%
5 1676
 
4.3%
4 1667
 
4.3%
9 1070
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38721
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8481
21.9%
2 7956
20.5%
7 4677
12.1%
8 4636
12.0%
1 3289
 
8.5%
3 2829
 
7.3%
6 2416
 
6.2%
5 1676
 
4.3%
4 1667
 
4.3%
9 1070
 
2.8%
Distinct9802
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T18:52:35.566252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length36
Mean length6.9198
Min length1

Characters and Unicode

Total characters69198
Distinct characters1110
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9622 ?
Unique (%)96.2%

Sample

1st row혜윰
2nd row허브몰
3rd row오마이푸드
4th row헬스프로
5th row엘포도르
ValueCountFrequency (%)
주식회사 412
 
3.2%
45
 
0.4%
33
 
0.3%
스튜디오 27
 
0.2%
company 26
 
0.2%
the 26
 
0.2%
컴퍼니 21
 
0.2%
19
 
0.1%
co 18
 
0.1%
international 16
 
0.1%
Other values (11070) 12058
94.9%
2024-04-06T18:52:36.616100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2701
 
3.9%
2415
 
3.5%
) 2175
 
3.1%
( 2172
 
3.1%
1918
 
2.8%
1116
 
1.6%
e 1041
 
1.5%
o 909
 
1.3%
888
 
1.3%
835
 
1.2%
Other values (1100) 53028
76.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45679
66.0%
Lowercase Letter 8754
 
12.7%
Uppercase Letter 6691
 
9.7%
Space Separator 2701
 
3.9%
Close Punctuation 2175
 
3.1%
Open Punctuation 2172
 
3.1%
Decimal Number 526
 
0.8%
Other Punctuation 394
 
0.6%
Dash Punctuation 73
 
0.1%
Connector Punctuation 16
 
< 0.1%
Other values (2) 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2415
 
5.3%
1918
 
4.2%
1116
 
2.4%
888
 
1.9%
835
 
1.8%
730
 
1.6%
664
 
1.5%
659
 
1.4%
650
 
1.4%
602
 
1.3%
Other values (1018) 35202
77.1%
Lowercase Letter
ValueCountFrequency (%)
e 1041
11.9%
o 909
 
10.4%
a 743
 
8.5%
n 658
 
7.5%
i 621
 
7.1%
r 528
 
6.0%
l 520
 
5.9%
t 500
 
5.7%
s 418
 
4.8%
m 331
 
3.8%
Other values (16) 2485
28.4%
Uppercase Letter
ValueCountFrequency (%)
A 539
 
8.1%
O 519
 
7.8%
E 489
 
7.3%
S 467
 
7.0%
N 426
 
6.4%
T 382
 
5.7%
I 375
 
5.6%
L 370
 
5.5%
M 342
 
5.1%
C 309
 
4.6%
Other values (16) 2473
37.0%
Other Punctuation
ValueCountFrequency (%)
. 203
51.5%
& 79
 
20.1%
, 46
 
11.7%
' 28
 
7.1%
? 15
 
3.8%
: 11
 
2.8%
# 5
 
1.3%
/ 4
 
1.0%
; 1
 
0.3%
1
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 111
21.1%
1 100
19.0%
0 65
12.4%
3 49
9.3%
4 47
8.9%
8 41
 
7.8%
9 37
 
7.0%
7 29
 
5.5%
5 27
 
5.1%
6 20
 
3.8%
Math Symbol
ValueCountFrequency (%)
~ 2
50.0%
+ 1
25.0%
= 1
25.0%
Space Separator
ValueCountFrequency (%)
2701
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2175
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 16
100.0%
Other Symbol
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45673
66.0%
Latin 15445
 
22.3%
Common 8061
 
11.6%
Han 19
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2415
 
5.3%
1918
 
4.2%
1116
 
2.4%
888
 
1.9%
835
 
1.8%
730
 
1.6%
664
 
1.5%
659
 
1.4%
650
 
1.4%
602
 
1.3%
Other values (1001) 35196
77.1%
Latin
ValueCountFrequency (%)
e 1041
 
6.7%
o 909
 
5.9%
a 743
 
4.8%
n 658
 
4.3%
i 621
 
4.0%
A 539
 
3.5%
r 528
 
3.4%
l 520
 
3.4%
O 519
 
3.4%
t 500
 
3.2%
Other values (42) 8867
57.4%
Common
ValueCountFrequency (%)
2701
33.5%
) 2175
27.0%
( 2172
26.9%
. 203
 
2.5%
2 111
 
1.4%
1 100
 
1.2%
& 79
 
1.0%
- 73
 
0.9%
0 65
 
0.8%
3 49
 
0.6%
Other values (19) 333
 
4.1%
Han
ValueCountFrequency (%)
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (8) 8
42.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45659
66.0%
ASCII 23505
34.0%
CJK 18
 
< 0.1%
None 14
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2701
 
11.5%
) 2175
 
9.3%
( 2172
 
9.2%
e 1041
 
4.4%
o 909
 
3.9%
a 743
 
3.2%
n 658
 
2.8%
i 621
 
2.6%
A 539
 
2.3%
r 528
 
2.2%
Other values (70) 11418
48.6%
Hangul
ValueCountFrequency (%)
2415
 
5.3%
1918
 
4.2%
1116
 
2.4%
888
 
1.9%
835
 
1.8%
730
 
1.6%
664
 
1.5%
659
 
1.4%
650
 
1.4%
602
 
1.3%
Other values (999) 35182
77.1%
None
ValueCountFrequency (%)
13
92.9%
1
 
7.1%
CJK
ValueCountFrequency (%)
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (7) 7
38.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct9957
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-11 14:27:32
Maximum2024-04-04 11:23:44
2024-04-06T18:52:36.858058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:37.122205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7047 
U
2953 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7047
70.5%
U 2953
29.5%

Length

2024-04-06T18:52:37.388453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:37.560663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7047
70.5%
u 2953
29.5%
Distinct1539
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:06:00
2024-04-06T18:52:37.714377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:52:37.951609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct392
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T18:52:38.209126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length9.4726
Min length1

Characters and Unicode

Total characters94726
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)2.5%

Sample

1st row종합몰
2nd row의류/패션/잡화/뷰티
3rd row건강/식품 의류/패션/잡화/뷰티
4th row건강/식품
5th row의류/패션/잡화/뷰티
ValueCountFrequency (%)
의류/패션/잡화/뷰티 5175
40.1%
종합몰 2644
20.5%
기타 1621
 
12.6%
건강/식품 827
 
6.4%
교육/도서/완구/오락 694
 
5.4%
가구/수납용품 423
 
3.3%
컴퓨터/사무용품 421
 
3.3%
가전 368
 
2.9%
레져/여행/공연 272
 
2.1%
자동차/자동차용품 232
 
1.8%
Other values (3) 214
 
1.7%
2024-04-06T18:52:38.713538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 20109
21.2%
5175
 
5.5%
5175
 
5.5%
5175
 
5.5%
5175
 
5.5%
5175
 
5.5%
5175
 
5.5%
5175
 
5.5%
5175
 
5.5%
2891
 
3.1%
Other values (41) 30326
32.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71614
75.6%
Other Punctuation 20109
 
21.2%
Space Separator 2891
 
3.1%
Dash Punctuation 112
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
2644
 
3.7%
2644
 
3.7%
Other values (38) 24926
34.8%
Other Punctuation
ValueCountFrequency (%)
/ 20109
100.0%
Space Separator
ValueCountFrequency (%)
2891
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71614
75.6%
Common 23112
 
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
2644
 
3.7%
2644
 
3.7%
Other values (38) 24926
34.8%
Common
ValueCountFrequency (%)
/ 20109
87.0%
2891
 
12.5%
- 112
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71614
75.6%
ASCII 23112
 
24.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 20109
87.0%
2891
 
12.5%
- 112
 
0.5%
Hangul
ValueCountFrequency (%)
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
5175
 
7.2%
2644
 
3.7%
2644
 
3.7%
Other values (38) 24926
34.8%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct4749
Distinct (%)53.1%
Missing1057
Missing (%)10.6%
Infinite0
Infinite (%)0.0%
Mean202258.04
Minimum170501.31
Maximum231289.95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T18:52:38.895269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum170501.31
5-th percentile200232.81
Q1201112.45
median201832.75
Q3203338.78
95-th percentile205213.56
Maximum231289.95
Range60788.648
Interquartile range (IQR)2226.3355

Descriptive statistics

Standard deviation1677.7974
Coefficient of variation (CV)0.0082953313
Kurtosis36.984424
Mean202258.04
Median Absolute Deviation (MAD)991.02097
Skewness0.25997402
Sum1.8087936 × 109
Variance2815004.2
MonotonicityNot monotonic
2024-04-06T18:52:39.195815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200841.726990037 173
 
1.7%
203318.127254872 88
 
0.9%
201665.346470935 75
 
0.8%
205996.717928956 66
 
0.7%
201356.035379848 53
 
0.5%
200575.420521471 47
 
0.5%
202593.37574683 45
 
0.4%
203186.220809685 45
 
0.4%
203196.800963466 43
 
0.4%
201989.271689499 41
 
0.4%
Other values (4739) 8267
82.7%
(Missing) 1057
 
10.6%
ValueCountFrequency (%)
170501.305940536 1
 
< 0.1%
180349.707871 1
 
< 0.1%
186442.664671788 1
 
< 0.1%
186834.838338426 1
 
< 0.1%
189849.063862731 1
 
< 0.1%
192744.720506849 1
 
< 0.1%
194112.552458867 1
 
< 0.1%
196896.329082976 1
 
< 0.1%
198841.170137572 1
 
< 0.1%
198888.209911699 27
0.3%
ValueCountFrequency (%)
231289.953621861 1
 
< 0.1%
229805.118611595 1
 
< 0.1%
212832.457188149 1
 
< 0.1%
210778.377127675 1
 
< 0.1%
207358.331271995 1
 
< 0.1%
207008.092994104 1
 
< 0.1%
206112.457605113 2
 
< 0.1%
205996.717928956 66
0.7%
205858.920626619 1
 
< 0.1%
205844.74660078 1
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct4751
Distinct (%)53.1%
Missing1057
Missing (%)10.6%
Infinite0
Infinite (%)0.0%
Mean455367.96
Minimum281901.16
Maximum469112.75
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T18:52:39.439015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum281901.16
5-th percentile453340.61
Q1454382.54
median455500.57
Q3456399.16
95-th percentile457227.13
Maximum469112.75
Range187211.59
Interquartile range (IQR)2016.6167

Descriptive statistics

Standard deviation2251.0758
Coefficient of variation (CV)0.0049434216
Kurtosis3947.0585
Mean455367.96
Median Absolute Deviation (MAD)978.62259
Skewness-51.546789
Sum4.0723557 × 109
Variance5067342.2
MonotonicityNot monotonic
2024-04-06T18:52:39.675457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
454721.505180141 137
 
1.4%
456078.982123486 107
 
1.1%
456704.522324447 66
 
0.7%
456637.258124599 61
 
0.6%
455429.846270094 54
 
0.5%
456606.133621913 53
 
0.5%
457360.617372111 47
 
0.5%
455407.749060476 45
 
0.4%
456457.110608708 45
 
0.4%
455190.095862939 41
 
0.4%
Other values (4741) 8287
82.9%
(Missing) 1057
 
10.6%
ValueCountFrequency (%)
281901.163471 1
< 0.1%
427908.251964165 1
< 0.1%
434530.760644996 1
< 0.1%
442988.093174501 1
< 0.1%
444468.235460681 1
< 0.1%
445016.961835846 1
< 0.1%
445105.25340517 1
< 0.1%
445444.226905118 1
< 0.1%
446121.978254554 1
< 0.1%
446245.710231288 1
< 0.1%
ValueCountFrequency (%)
469112.749916422 1
 
< 0.1%
462930.041768876 1
 
< 0.1%
458230.719875 4
 
< 0.1%
458127.923477 2
 
< 0.1%
457988.509545404 1
 
< 0.1%
457873.129406188 12
0.1%
457844.348010616 12
0.1%
457819.802526758 1
 
< 0.1%
457803.264646008 5
0.1%
457791.927618039 1
 
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9413 
0
 
587

Length

Max length4
Median length4
Mean length3.8239
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row0
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9413
94.1%
0 587
 
5.9%

Length

2024-04-06T18:52:39.910358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:40.071394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9413
94.1%
0 587
 
5.9%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9413 
0
 
587

Length

Max length4
Median length4
Mean length3.8239
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row0
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9413
94.1%
0 587
 
5.9%

Length

2024-04-06T18:52:40.276192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:40.473915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9413
94.1%
0 587
 
5.9%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9413 
0
 
587

Length

Max length4
Median length4
Mean length3.8239
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row0
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9413
94.1%
0 587
 
5.9%

Length

2024-04-06T18:52:40.661038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:52:40.854021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9413
94.1%
0 587
 
5.9%

판매방식명
Categorical

IMBALANCE 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
6075 
<NA>
3701 
인터넷, 기타
 
96
TV홈쇼핑, 인터넷
 
27
기타
 
16
Other values (13)
 
85

Length

Max length26
Median length3
Mean length3.5267
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인터넷
2nd row인터넷
3rd row<NA>
4th row인터넷
5th row<NA>

Common Values

ValueCountFrequency (%)
인터넷 6075
60.8%
<NA> 3701
37.0%
인터넷, 기타 96
 
1.0%
TV홈쇼핑, 인터넷 27
 
0.3%
기타 16
 
0.2%
인터넷, 카다로그 13
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 12
 
0.1%
인터넷, 카다로그, 기타 11
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 10
 
0.1%
TV홈쇼핑 7
 
0.1%
Other values (8) 32
 
0.3%

Length

2024-04-06T18:52:41.039954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 6272
60.9%
na 3701
35.9%
기타 152
 
1.5%
tv홈쇼핑 67
 
0.7%
카다로그 65
 
0.6%
신문잡지 39
 
0.4%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
147123070000202030702863020127320200717<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-<NA><NA>서울특별시 성북구 성북동 ***-** 금강주택서울특별시 성북구 성북로 **-**, ***호 (성북동, 금강주택)02835혜윰2020-07-18 10:56:23I2020-07-21 00:23:30.0종합몰200108.380146454571.525948<NA><NA><NA>인터넷
20293070000200730701343020028920070412<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>029170923<NA><NA>서울특별시 성북구 장위동***-*** 해주빌라트 B-***<NA><NA>허브몰2015-04-18 13:31:05I2018-08-31 23:59:59.0의류/패션/잡화/뷰티<NA><NA><NA><NA><NA>인터넷
86973070000201630702163020000220160105<NA>5제외/삭제/전출5타시군구이관20160913<NA><NA><NA><NA><NA><NA>서울특별시 성북구 동소문동*가 ***번지서울특별시 성북구 동소문로*길 *-**, *층 (동소문동*가)02860오마이푸드2016-09-13 11:07:54I2021-12-03 22:02:00.0건강/식품 의류/패션/잡화/뷰티200695.579197454101.019961<NA><NA><NA><NA>
169853070000202130702863020134020210708<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6247-7400<NA><NA>서울특별시 성북구 안암동*가 *** 래미안 안암서울특별시 성북구 고려대로**가길 **, ***동 ****호 (안암동*가, 래미안 안암)02843헬스프로2021-07-08 14:01:19I2021-07-10 00:22:52.0건강/식품202027.961999454111.099649000인터넷
21981307000020243070299302003722024-03-07<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성북구 정릉동 ***-* 정릉*동새마을금고서울특별시 성북구 보국문로 *, 정릉*동새마을금고 *층 (정릉동)02718엘포도르2024-03-07 16:50:22I2023-12-03 00:09:00.0의류/패션/잡화/뷰티200960.373918455895.41455<NA><NA><NA><NA>
27913070000200830701343020026120080410<NA>3폐업3폐업처리20150310<NA><NA><NA>3296-3854<NA>136824서울특별시 성북구 성북동 ***번지 *호<NA><NA>풍경속으로2015-03-10 20:31:17I2018-08-31 23:59:59.0의류/패션/잡화/뷰티199614.50981454499.782802<NA><NA><NA>인터넷
196323070000202230702863020190520221122<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성북구 길음동 **** 롯데캐슬 클라시아 ***동 ****호서울특별시 성북구 숭인로*길 **, ***동 ****호 (길음동, 롯데캐슬 클라시아)02730제이엠스토어2022-11-22 16:19:46I2021-10-31 22:04:00.0종합몰202321.560603456539.659508<NA><NA><NA><NA>
156903070000202030702863020230020201224<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-<NA><NA>서울특별시 성북구 하월곡동 *** 샹그레빌아파트서울특별시 성북구 회기로*길 ***, ***호 (하월곡동, 샹그레빌아파트)02794튜브하이파이 (TUBE HiFi)2020-12-24 09:54:07I2021-12-03 22:02:00.0가전 자동차/자동차용품 기타203585.984321454902.907206<NA><NA><NA><NA>
178963070000202230702863020016720220118<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-<NA><NA>서울특별시 성북구 돈암동 *-** ***호서울특별시 성북구 정릉로**길 **, ***호 (돈암동)02801원더풀 투데이2022-01-21 07:37:56I2022-01-23 00:22:51.0종합몰 교육/도서/완구/오락 컴퓨터/사무용품 가구/수납용품 건강/식품 의류/패션/잡화/뷰티 레져/여행/공연 자동차/자동차용품 가전202474.287488455594.11067000인터넷
133143070000201930702163020139020170620<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성북구 보문동*가 **번지 *호서울특별시 성북구 보문로 **, ***호 (보문동*가)02873바이시블루2022-11-10 16:12:35U2021-10-31 23:02:00.0의류/패션/잡화/뷰티201826.002331453162.376312<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
20744307000020233070299302010032023-06-13<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성북구 하월곡동 *** 월곡두산위브아파트 ***동 ***호서울특별시 성북구 오패산로 **, ***동 ***호 (하월곡동, 월곡두산위브아파트)02750도트니젠2023-06-13 16:35:57I2022-12-05 23:05:00.0가구/수납용품 의류/패션/잡화/뷰티203318.127255456078.982123<NA><NA><NA><NA>
82993070000201530702163020057320150623<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6339-5559<NA><NA>서울특별시 성북구 동선동*가 ***번지 *호서울특별시 성북구 동소문로**길 **, *층 (동선동*가)02829넛쏘굿(nut so good)2018-11-02 17:45:07U2018-11-04 02:36:14.0건강/식품201380.014181454818.774252<NA><NA><NA>인터넷
9633070000200530701343020048020050926<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>029148138<NA><NA>서울특별시 성북구 종암동***-* 에스케이아파트***-****<NA><NA>시티지2015-04-20 13:27:12I2018-08-31 23:59:59.0의류/패션/잡화/뷰티<NA><NA><NA><NA><NA>인터넷
21569307000020233070299302018292023-12-05<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 - 928 - 6176<NA><NA>서울특별시 성북구 정릉동 **** 길음뉴타운**단지 B*층 ***호서울특별시 성북구 정릉로 ***, B*층 ***호 (정릉동, 길음뉴타운**단지)02719(주)누룩코리아2023-12-05 16:41:35I2022-11-02 00:07:00.0의류/패션/잡화/뷰티201548.495616455659.817852<NA><NA><NA><NA>
20484307000020233070299302007432023-04-24<NA>3폐업3폐업처리2023-09-11<NA><NA><NA><NA><NA><NA>서울특별시 성북구 종암동 *** 종암우림카이저팰리스 ***동 ****호서울특별시 성북구 종암로 ***, ***동 **층 ****호 (종암동, 종암우림카이저팰리스)02797편리한 해외구매대행2023-09-11 15:58:24U2022-12-08 23:03:00.0종합몰202887.735294455516.204265<NA><NA><NA><NA>
18913070000200730701343020013620070214<NA>3폐업3폐업처리20090701<NA><NA><NA>029887017<NA><NA>서울특별시 성북구 길음동*-* 동부센트레빌(아) ***-***<NA><NA>내츄럴티셔츠클럽2009-07-03 16:14:38I2018-08-31 23:59:59.0의류/패션/잡화/뷰티<NA><NA><NA><NA><NA>인터넷
21047307000020233070299302013072023-08-11<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성북구 정릉동 ***-** 정릉연립 *동 ***호서울특별시 성북구 정릉로**라길 **-*, *동 ***호 (정릉동, 정릉연립)02812쏘피타민2023-08-11 15:47:46I2022-12-07 23:03:00.0의류/패션/잡화/뷰티199870.488605455864.584712<NA><NA><NA><NA>
170163070000202130702863020137120160106<NA>3폐업3폐업처리20220422<NA><NA><NA>-<NA><NA>서울특별시 성북구 정릉동 ***-*** 삼성홈타운서울특별시 성북구 서경로**가길 *, ***호 (정릉동, 삼성홈타운)02717서연2022-04-22 16:08:55U2021-12-03 22:04:00.0교육/도서/완구/오락201019.556688456655.205042<NA><NA><NA><NA>
22203070000200730701343020054820070704<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02<NA>136140서울특별시 성북구 장위동 **번지 **호<NA><NA>HFC2015-04-22 14:05:50I2018-08-31 23:59:59.0의류/패션/잡화/뷰티204893.970144456797.938626<NA><NA><NA>인터넷
189073070000202230702863020117920220718<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 성북구 삼선동*가 *** 삼선 SK VIEW 아파트 ***동 ***호서울특별시 성북구 삼선교로**길 **, ***동 ***호 (삼선동*가, 삼선 SK VIEW 아파트)028635KSEOUL(오케이서울)2022-07-19 16:53:00I2021-12-06 22:01:00.0의류/패션/잡화/뷰티200958.804988453889.56215<NA><NA><NA><NA>