Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells75482
Missing cells (%)26.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical10
Numeric4
DateTime7
Unsupported2
Text6

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author종로구
URLhttps://data.seoul.go.kr/dataList/OA-18802/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (70.2%)Imbalance
부채총액 is highly imbalanced (70.2%)Imbalance
자본금 is highly imbalanced (70.2%)Imbalance
판매방식명 is highly imbalanced (70.4%)Imbalance
인허가취소일자 has 10000 (100.0%) missing valuesMissing
폐업일자 has 6639 (66.4%) missing valuesMissing
휴업시작일자 has 9945 (99.5%) missing valuesMissing
휴업종료일자 has 9946 (99.5%) missing valuesMissing
재개업일자 has 9992 (99.9%) missing valuesMissing
전화번호 has 3558 (35.6%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7580 (75.8%) missing valuesMissing
지번주소 has 413 (4.1%) missing valuesMissing
도로명주소 has 1545 (15.4%) missing valuesMissing
도로명우편번호 has 3044 (30.4%) missing valuesMissing
좌표정보(X) has 1410 (14.1%) missing valuesMissing
좌표정보(Y) has 1410 (14.1%) missing valuesMissing
관리번호 is highly skewed (γ1 = -54.48254578)Skewed
소재지우편번호 is highly skewed (γ1 = 47.87069454)Skewed
관리번호 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 13:06:28.968637
Analysis finished2024-04-06 13:06:33.037132
Duration4.07 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3000000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3000000
2nd row3000000
3rd row3000000
4th row3000000
5th row3000000

Common Values

ValueCountFrequency (%)
3000000 10000
100.0%

Length

2024-04-06T22:06:33.208692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:33.396543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3000000 10000
100.0%

관리번호
Real number (ℝ)

SKEWED  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0145717 × 1018
Minimum2.0073 × 1017
Maximum2.0243 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T22:06:33.622623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0073 × 1017
5-th percentile2.0043 × 1018
Q12.0093 × 1018
median2.0163 × 1018
Q32.0203 × 1018
95-th percentile2.0233 × 1018
Maximum2.0243 × 1018
Range1.82357 × 1018
Interquartile range (IQR)1.1000007 × 1016

Descriptive statistics

Standard deviation3.2033162 × 1016
Coefficient of variation (CV)0.01590073
Kurtosis3083.1985
Mean2.0145717 × 1018
Median Absolute Deviation (MAD)5.0000033 × 1015
Skewness-54.482546
Sum1.8729247 × 1018
Variance1.0261235 × 1033
MonotonicityNot monotonic
2024-04-06T22:06:33.921374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2005300010130202625 1
 
< 0.1%
2007300010130204631 1
 
< 0.1%
2021300020230201861 1
 
< 0.1%
2007300012930205332 1
 
< 0.1%
2021300020230200636 1
 
< 0.1%
2009300012930200164 1
 
< 0.1%
2012300012930201076 1
 
< 0.1%
2023300020230200389 1
 
< 0.1%
2022300020230201194 1
 
< 0.1%
2002300010130213585 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
200730001013020419 1
< 0.1%
200730001293020530 1
< 0.1%
200730001293020534 1
< 0.1%
1996300010130200051 1
< 0.1%
1996300010130200081 1
< 0.1%
1996300010130200083 1
< 0.1%
1996300010130200329 1
< 0.1%
1996300010130200408 1
< 0.1%
1996300010130200457 1
< 0.1%
1997300010130200447 1
< 0.1%
ValueCountFrequency (%)
2024300024530200513 1
< 0.1%
2024300024530200511 1
< 0.1%
2024300024530200508 1
< 0.1%
2024300024530200502 1
< 0.1%
2024300024530200501 1
< 0.1%
2024300024530200497 1
< 0.1%
2024300024530200491 1
< 0.1%
2024300024530200490 1
< 0.1%
2024300024530200489 1
< 0.1%
2024300024530200486 1
< 0.1%
Distinct4349
Distinct (%)43.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-08-30 00:00:00
Maximum2024-04-03 00:00:00
2024-04-06T22:06:34.208813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:34.493255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4837 
3
2771 
4
1773 
5
590 
2
 
29

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row1
3rd row1
4th row3
5th row1

Common Values

ValueCountFrequency (%)
1 4837
48.4%
3 2771
27.7%
4 1773
 
17.7%
5 590
 
5.9%
2 29
 
0.3%

Length

2024-04-06T22:06:34.724832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:34.896817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4837
48.4%
3 2771
27.7%
4 1773
 
17.7%
5 590
 
5.9%
2 29
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
4837 
폐업
2771 
취소/말소/만료/정지/중지
1773 
제외/삭제/전출
590 
휴업
 
29

Length

Max length14
Median length8
Mean length5.9327
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row영업/정상
3rd row영업/정상
4th row폐업
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 4837
48.4%
폐업 2771
27.7%
취소/말소/만료/정지/중지 1773
 
17.7%
제외/삭제/전출 590
 
5.9%
휴업 29
 
0.3%

Length

2024-04-06T22:06:35.079943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:35.309072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 4837
48.4%
폐업 2771
27.7%
취소/말소/만료/정지/중지 1773
 
17.7%
제외/삭제/전출 590
 
5.9%
휴업 29
 
0.3%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4837 
3
2771 
7
1773 
5
590 
2
 
29

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row1
3rd row1
4th row3
5th row1

Common Values

ValueCountFrequency (%)
1 4837
48.4%
3 2771
27.7%
7 1773
 
17.7%
5 590
 
5.9%
2 29
 
0.3%

Length

2024-04-06T22:06:35.556911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:35.782049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4837
48.4%
3 2771
27.7%
7 1773
 
17.7%
5 590
 
5.9%
2 29
 
0.3%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
4837 
폐업처리
2771 
직권말소
1773 
타시군구이관
590 
휴업처리
 
29

Length

Max length6
Median length4
Mean length4.118
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업처리
2nd row정상영업
3rd row정상영업
4th row폐업처리
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 4837
48.4%
폐업처리 2771
27.7%
직권말소 1773
 
17.7%
타시군구이관 590
 
5.9%
휴업처리 29
 
0.3%

Length

2024-04-06T22:06:36.042887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:36.258338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 4837
48.4%
폐업처리 2771
27.7%
직권말소 1773
 
17.7%
타시군구이관 590
 
5.9%
휴업처리 29
 
0.3%

폐업일자
Date

MISSING 

Distinct1946
Distinct (%)57.9%
Missing6639
Missing (%)66.4%
Memory size156.2 KiB
Minimum2002-08-31 00:00:00
Maximum2024-04-03 00:00:00
2024-04-06T22:06:36.490325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:36.748770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct52
Distinct (%)94.5%
Missing9945
Missing (%)99.5%
Memory size156.2 KiB
Minimum2006-09-04 00:00:00
Maximum2024-02-22 00:00:00
2024-04-06T22:06:37.032180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:37.281268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업종료일자
Date

MISSING 

Distinct47
Distinct (%)87.0%
Missing9946
Missing (%)99.5%
Memory size156.2 KiB
Minimum2006-12-31 00:00:00
Maximum2099-12-31 00:00:00
2024-04-06T22:06:37.509608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:37.706434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)

재개업일자
Date

MISSING 

Distinct8
Distinct (%)100.0%
Missing9992
Missing (%)99.9%
Memory size156.2 KiB
Minimum2002-09-03 00:00:00
Maximum2020-09-02 00:00:00
2024-04-06T22:06:37.868441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:38.011586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)

전화번호
Text

MISSING 

Distinct5397
Distinct (%)83.8%
Missing3558
Missing (%)35.6%
Memory size156.2 KiB
2024-04-06T22:06:38.322084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length9.9282831
Min length1

Characters and Unicode

Total characters63958
Distinct characters16
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5289 ?
Unique (%)82.1%

Sample

1st row02 720 3170
2nd row02-783-1779
3rd row02-720-6202
4th row02 741 7920
5th row-
ValueCountFrequency (%)
02 1237
 
13.6%
908
 
10.0%
747 67
 
0.7%
741 46
 
0.5%
765 42
 
0.5%
745 42
 
0.5%
766 40
 
0.4%
762 37
 
0.4%
720 36
 
0.4%
737 36
 
0.4%
Other values (5504) 6615
72.6%
2024-04-06T22:06:38.935183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 9600
15.0%
0 9326
14.6%
- 8317
13.0%
7 6973
10.9%
3 4565
7.1%
4271
6.7%
6 4176
6.5%
4 3995
6.2%
5 3537
 
5.5%
1 3360
 
5.3%
Other values (6) 5838
9.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 51317
80.2%
Dash Punctuation 8317
 
13.0%
Space Separator 4271
 
6.7%
Other Punctuation 31
 
< 0.1%
Math Symbol 22
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 9600
18.7%
0 9326
18.2%
7 6973
13.6%
3 4565
8.9%
6 4176
8.1%
4 3995
7.8%
5 3537
 
6.9%
1 3360
 
6.5%
8 3190
 
6.2%
9 2595
 
5.1%
Other Punctuation
ValueCountFrequency (%)
. 18
58.1%
/ 7
 
22.6%
, 6
 
19.4%
Dash Punctuation
ValueCountFrequency (%)
- 8317
100.0%
Space Separator
ValueCountFrequency (%)
4271
100.0%
Math Symbol
ValueCountFrequency (%)
~ 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 63958
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 9600
15.0%
0 9326
14.6%
- 8317
13.0%
7 6973
10.9%
3 4565
7.1%
4271
6.7%
6 4176
6.5%
4 3995
6.2%
5 3537
 
5.5%
1 3360
 
5.3%
Other values (6) 5838
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 63958
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 9600
15.0%
0 9326
14.6%
- 8317
13.0%
7 6973
10.9%
3 4565
7.1%
4271
6.7%
6 4176
6.5%
4 3995
6.2%
5 3537
 
5.5%
1 3360
 
5.3%
Other values (6) 5838
9.1%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct184
Distinct (%)7.6%
Missing7580
Missing (%)75.8%
Infinite0
Infinite (%)0.0%
Mean110617.8
Minimum110010
Maximum407769
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T22:06:39.152884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum110010
5-th percentile110041
Q1110240
median110521
Q3110786
95-th percentile110862
Maximum407769
Range297759
Interquartile range (IQR)546

Descriptive statistics

Standard deviation6102.8644
Coefficient of variation (CV)0.055170726
Kurtosis2326.9731
Mean110617.8
Median Absolute Deviation (MAD)275
Skewness47.870695
Sum2.6769508 × 108
Variance37244954
MonotonicityNot monotonic
2024-04-06T22:06:39.694307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
110540 209
 
2.1%
110550 161
 
1.6%
110320 60
 
0.6%
110430 56
 
0.6%
110841 52
 
0.5%
110390 51
 
0.5%
110126 50
 
0.5%
110842 44
 
0.4%
110070 43
 
0.4%
110410 43
 
0.4%
Other values (174) 1651
 
16.5%
(Missing) 7580
75.8%
ValueCountFrequency (%)
110010 6
 
0.1%
110011 12
0.1%
110012 23
0.2%
110020 9
 
0.1%
110021 13
0.1%
110030 13
0.1%
110031 1
 
< 0.1%
110032 8
 
0.1%
110033 3
 
< 0.1%
110034 4
 
< 0.1%
ValueCountFrequency (%)
407769 1
 
< 0.1%
150090 1
 
< 0.1%
110999 10
0.1%
110901 3
 
< 0.1%
110890 23
0.2%
110888 24
0.2%
110877 2
 
< 0.1%
110876 16
0.2%
110873 5
 
0.1%
110872 5
 
0.1%

지번주소
Text

MISSING 

Distinct5657
Distinct (%)59.0%
Missing413
Missing (%)4.1%
Memory size156.2 KiB
2024-04-06T22:06:40.166801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length50
Mean length26.983832
Min length13

Characters and Unicode

Total characters258694
Distinct characters539
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4407 ?
Unique (%)46.0%

Sample

1st row서울특별시 종로구 수송동 **번지 두산위브파빌리온 ***호
2nd row서울특별시 종로구 종로*가 *번지 *층 *호
3rd row서울특별시 종로구 신문로*가 **번지 *층 ***호
4th row서울특별시 종로구 와룡동 ***번지 *층
5th row서울특별시 종로구 명륜*가 **번지 동숭비즈니스센터 *층 ***호
ValueCountFrequency (%)
종로구 9588
17.8%
서울특별시 9586
17.8%
6418
11.9%
번지 5258
 
9.8%
3297
 
6.1%
2376
 
4.4%
창신동 1267
 
2.4%
숭인동 980
 
1.8%
종로*가 958
 
1.8%
명륜*가 295
 
0.5%
Other values (2458) 13818
25.7%
2024-04-06T22:06:40.916527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 51975
20.1%
44506
17.2%
11152
 
4.3%
11103
 
4.3%
10244
 
4.0%
9803
 
3.8%
9728
 
3.8%
9722
 
3.8%
9651
 
3.7%
9612
 
3.7%
Other values (529) 81198
31.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 156425
60.5%
Other Punctuation 52230
 
20.2%
Space Separator 44506
 
17.2%
Dash Punctuation 3903
 
1.5%
Uppercase Letter 1021
 
0.4%
Decimal Number 363
 
0.1%
Lowercase Letter 103
 
< 0.1%
Close Punctuation 52
 
< 0.1%
Open Punctuation 52
 
< 0.1%
Math Symbol 31
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11152
 
7.1%
11103
 
7.1%
10244
 
6.5%
9803
 
6.3%
9728
 
6.2%
9722
 
6.2%
9651
 
6.2%
9612
 
6.1%
9593
 
6.1%
6986
 
4.5%
Other values (460) 58831
37.6%
Uppercase Letter
ValueCountFrequency (%)
B 240
23.5%
A 189
18.5%
C 91
 
8.9%
D 90
 
8.8%
S 50
 
4.9%
I 39
 
3.8%
J 33
 
3.2%
K 30
 
2.9%
E 30
 
2.9%
G 28
 
2.7%
Other values (15) 201
19.7%
Lowercase Letter
ValueCountFrequency (%)
c 19
18.4%
i 13
12.6%
n 11
10.7%
o 10
9.7%
a 8
7.8%
b 7
 
6.8%
r 6
 
5.8%
d 5
 
4.9%
s 4
 
3.9%
k 4
 
3.9%
Other values (7) 16
15.5%
Decimal Number
ValueCountFrequency (%)
1 95
26.2%
2 54
14.9%
3 39
10.7%
0 31
 
8.5%
5 29
 
8.0%
6 29
 
8.0%
4 24
 
6.6%
8 23
 
6.3%
9 21
 
5.8%
7 18
 
5.0%
Other Punctuation
ValueCountFrequency (%)
* 51975
99.5%
, 233
 
0.4%
. 10
 
< 0.1%
/ 7
 
< 0.1%
& 5
 
< 0.1%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Close Punctuation
ValueCountFrequency (%)
) 50
96.2%
] 2
 
3.8%
Open Punctuation
ValueCountFrequency (%)
( 50
96.2%
[ 2
 
3.8%
Space Separator
ValueCountFrequency (%)
44506
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3903
100.0%
Math Symbol
ValueCountFrequency (%)
~ 31
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
˚ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 156425
60.5%
Common 101139
39.1%
Latin 1130
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11152
 
7.1%
11103
 
7.1%
10244
 
6.5%
9803
 
6.3%
9728
 
6.2%
9722
 
6.2%
9651
 
6.2%
9612
 
6.1%
9593
 
6.1%
6986
 
4.5%
Other values (460) 58831
37.6%
Latin
ValueCountFrequency (%)
B 240
21.2%
A 189
16.7%
C 91
 
8.1%
D 90
 
8.0%
S 50
 
4.4%
I 39
 
3.5%
J 33
 
2.9%
K 30
 
2.7%
E 30
 
2.7%
G 28
 
2.5%
Other values (35) 310
27.4%
Common
ValueCountFrequency (%)
* 51975
51.4%
44506
44.0%
- 3903
 
3.9%
, 233
 
0.2%
1 95
 
0.1%
2 54
 
0.1%
) 50
 
< 0.1%
( 50
 
< 0.1%
3 39
 
< 0.1%
~ 31
 
< 0.1%
Other values (14) 203
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 156424
60.5%
ASCII 102262
39.5%
Number Forms 6
 
< 0.1%
Modifier Letters 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 51975
50.8%
44506
43.5%
- 3903
 
3.8%
B 240
 
0.2%
, 233
 
0.2%
A 189
 
0.2%
1 95
 
0.1%
C 91
 
0.1%
D 90
 
0.1%
2 54
 
0.1%
Other values (55) 886
 
0.9%
Hangul
ValueCountFrequency (%)
11152
 
7.1%
11103
 
7.1%
10244
 
6.5%
9803
 
6.3%
9728
 
6.2%
9722
 
6.2%
9651
 
6.2%
9612
 
6.1%
9593
 
6.1%
6986
 
4.5%
Other values (459) 58830
37.6%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Modifier Letters
ValueCountFrequency (%)
˚ 1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct5930
Distinct (%)70.1%
Missing1545
Missing (%)15.4%
Memory size156.2 KiB
2024-04-06T22:06:41.342927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length57
Mean length34.339799
Min length20

Characters and Unicode

Total characters290343
Distinct characters536
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4735 ?
Unique (%)56.0%

Sample

1st row서울특별시 종로구 삼봉로 **, ***호 (수송동,두산위브파빌리온)
2nd row서울특별시 종로구 종로 ***, *층 *호 (종로*가)
3rd row서울특별시 종로구 새문안로 **, *층 ***호 (신문로*가)
4th row서울특별시 종로구 돈화문로 **, *층 (와룡동)
5th row서울특별시 종로구 성균관로*길 **, *층 ***호 (명륜*가, 동숭비즈니스센터)
ValueCountFrequency (%)
서울특별시 8455
14.9%
종로구 8454
14.9%
8358
14.7%
4312
 
7.6%
3494
 
6.2%
창신동 989
 
1.7%
종로 956
 
1.7%
종로*가 726
 
1.3%
숭인동 704
 
1.2%
종로**길 569
 
1.0%
Other values (2875) 19731
34.8%
2024-04-06T22:06:42.168603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 49098
16.9%
48330
16.6%
16887
 
5.8%
11731
 
4.0%
, 9762
 
3.4%
9522
 
3.3%
9063
 
3.1%
8714
 
3.0%
8585
 
3.0%
) 8516
 
2.9%
Other values (526) 110135
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 162129
55.8%
Other Punctuation 58881
 
20.3%
Space Separator 48330
 
16.6%
Close Punctuation 8518
 
2.9%
Open Punctuation 8517
 
2.9%
Dash Punctuation 2061
 
0.7%
Uppercase Letter 1228
 
0.4%
Decimal Number 516
 
0.2%
Lowercase Letter 117
 
< 0.1%
Math Symbol 39
 
< 0.1%
Other values (3) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16887
 
10.4%
11731
 
7.2%
9522
 
5.9%
9063
 
5.6%
8714
 
5.4%
8585
 
5.3%
8516
 
5.3%
8482
 
5.2%
8462
 
5.2%
5218
 
3.2%
Other values (458) 66949
41.3%
Uppercase Letter
ValueCountFrequency (%)
B 314
25.6%
A 262
21.3%
D 120
 
9.8%
C 99
 
8.1%
S 50
 
4.1%
E 40
 
3.3%
I 39
 
3.2%
J 37
 
3.0%
G 36
 
2.9%
F 30
 
2.4%
Other values (15) 201
16.4%
Lowercase Letter
ValueCountFrequency (%)
i 20
17.1%
c 18
15.4%
b 15
12.8%
a 11
9.4%
o 11
9.4%
n 10
8.5%
d 6
 
5.1%
r 5
 
4.3%
t 4
 
3.4%
u 3
 
2.6%
Other values (6) 14
12.0%
Decimal Number
ValueCountFrequency (%)
1 145
28.1%
2 73
14.1%
0 57
 
11.0%
3 44
 
8.5%
5 43
 
8.3%
4 35
 
6.8%
9 34
 
6.6%
6 32
 
6.2%
8 31
 
6.0%
7 22
 
4.3%
Other Punctuation
ValueCountFrequency (%)
* 49098
83.4%
, 9762
 
16.6%
. 11
 
< 0.1%
/ 6
 
< 0.1%
& 4
 
< 0.1%
Letter Number
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 8516
> 99.9%
] 2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 8515
> 99.9%
[ 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
48330
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2061
100.0%
Math Symbol
ValueCountFrequency (%)
~ 39
100.0%
Modifier Symbol
ValueCountFrequency (%)
˚ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 162127
55.8%
Common 126864
43.7%
Latin 1350
 
0.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16887
 
10.4%
11731
 
7.2%
9522
 
5.9%
9063
 
5.6%
8714
 
5.4%
8585
 
5.3%
8516
 
5.3%
8482
 
5.2%
8462
 
5.2%
5218
 
3.2%
Other values (456) 66947
41.3%
Latin
ValueCountFrequency (%)
B 314
23.3%
A 262
19.4%
D 120
 
8.9%
C 99
 
7.3%
S 50
 
3.7%
E 40
 
3.0%
I 39
 
2.9%
J 37
 
2.7%
G 36
 
2.7%
F 30
 
2.2%
Other values (34) 323
23.9%
Common
ValueCountFrequency (%)
* 49098
38.7%
48330
38.1%
, 9762
 
7.7%
) 8516
 
6.7%
( 8515
 
6.7%
- 2061
 
1.6%
1 145
 
0.1%
2 73
 
0.1%
0 57
 
< 0.1%
3 44
 
< 0.1%
Other values (14) 263
 
0.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 162126
55.8%
ASCII 128208
44.2%
Number Forms 5
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Modifier Letters 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 49098
38.3%
48330
37.7%
, 9762
 
7.6%
) 8516
 
6.6%
( 8515
 
6.6%
- 2061
 
1.6%
B 314
 
0.2%
A 262
 
0.2%
1 145
 
0.1%
D 120
 
0.1%
Other values (54) 1085
 
0.8%
Hangul
ValueCountFrequency (%)
16887
 
10.4%
11731
 
7.2%
9522
 
5.9%
9063
 
5.6%
8714
 
5.4%
8585
 
5.3%
8516
 
5.3%
8482
 
5.2%
8462
 
5.2%
5218
 
3.2%
Other values (455) 66946
41.3%
Number Forms
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Modifier Letters
ValueCountFrequency (%)
˚ 1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

도로명우편번호
Text

MISSING 

Distinct360
Distinct (%)5.2%
Missing3044
Missing (%)30.4%
Memory size156.2 KiB
2024-04-06T22:06:42.757064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1942208
Min length5

Characters and Unicode

Total characters36131
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)0.6%

Sample

1st row03139
2nd row03182
3rd row03131
4th row03074
5th row03081
ValueCountFrequency (%)
03121 260
 
3.7%
03138 214
 
3.1%
03198 194
 
2.8%
03134 151
 
2.2%
03182 125
 
1.8%
03119 104
 
1.5%
03139 95
 
1.4%
03086 91
 
1.3%
03114 84
 
1.2%
03193 78
 
1.1%
Other values (350) 5560
79.9%
2024-04-06T22:06:43.580875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10378
28.7%
1 8019
22.2%
3 7432
20.6%
8 2081
 
5.8%
2 1731
 
4.8%
4 1541
 
4.3%
9 1512
 
4.2%
7 1224
 
3.4%
5 1136
 
3.1%
6 1053
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36107
99.9%
Dash Punctuation 24
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10378
28.7%
1 8019
22.2%
3 7432
20.6%
8 2081
 
5.8%
2 1731
 
4.8%
4 1541
 
4.3%
9 1512
 
4.2%
7 1224
 
3.4%
5 1136
 
3.1%
6 1053
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36131
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10378
28.7%
1 8019
22.2%
3 7432
20.6%
8 2081
 
5.8%
2 1731
 
4.8%
4 1541
 
4.3%
9 1512
 
4.2%
7 1224
 
3.4%
5 1136
 
3.1%
6 1053
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36131
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10378
28.7%
1 8019
22.2%
3 7432
20.6%
8 2081
 
5.8%
2 1731
 
4.8%
4 1541
 
4.3%
9 1512
 
4.2%
7 1224
 
3.4%
5 1136
 
3.1%
6 1053
 
2.9%
Distinct9838
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T22:06:44.221087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length41
Mean length7.5253
Min length1

Characters and Unicode

Total characters75253
Distinct characters1100
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9683 ?
Unique (%)96.8%

Sample

1st row(주)골드스톤코리아
2nd row신골드(신캐스팅)
3rd row주식회사 국제회계세무교육원
4th row오마뎅 창덕궁점
5th row(재)전문무용수지원센터
ValueCountFrequency (%)
주식회사 895
 
6.6%
305
 
2.2%
co.,ltd 44
 
0.3%
company 35
 
0.3%
29
 
0.2%
유한회사 28
 
0.2%
co 28
 
0.2%
ltd 24
 
0.2%
korea 21
 
0.2%
스튜디오 21
 
0.2%
Other values (11191) 12172
89.5%
2024-04-06T22:06:45.080134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3607
 
4.8%
) 2775
 
3.7%
( 2740
 
3.6%
2344
 
3.1%
2303
 
3.1%
1893
 
2.5%
1637
 
2.2%
1203
 
1.6%
1081
 
1.4%
1038
 
1.4%
Other values (1090) 54632
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51346
68.2%
Lowercase Letter 7318
 
9.7%
Uppercase Letter 6416
 
8.5%
Space Separator 3607
 
4.8%
Close Punctuation 2778
 
3.7%
Open Punctuation 2743
 
3.6%
Other Punctuation 474
 
0.6%
Decimal Number 473
 
0.6%
Dash Punctuation 80
 
0.1%
Connector Punctuation 10
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2344
 
4.6%
2303
 
4.5%
1893
 
3.7%
1637
 
3.2%
1203
 
2.3%
1081
 
2.1%
1038
 
2.0%
983
 
1.9%
762
 
1.5%
680
 
1.3%
Other values (1006) 37422
72.9%
Lowercase Letter
ValueCountFrequency (%)
e 871
11.9%
o 775
 
10.6%
a 691
 
9.4%
n 548
 
7.5%
i 507
 
6.9%
r 441
 
6.0%
t 435
 
5.9%
l 421
 
5.8%
s 354
 
4.8%
d 271
 
3.7%
Other values (16) 2004
27.4%
Uppercase Letter
ValueCountFrequency (%)
A 531
 
8.3%
E 499
 
7.8%
L 430
 
6.7%
S 425
 
6.6%
O 415
 
6.5%
T 395
 
6.2%
I 357
 
5.6%
N 356
 
5.5%
C 349
 
5.4%
M 339
 
5.3%
Other values (16) 2320
36.2%
Other Punctuation
ValueCountFrequency (%)
. 283
59.7%
, 84
 
17.7%
& 74
 
15.6%
' 14
 
3.0%
: 6
 
1.3%
? 5
 
1.1%
/ 4
 
0.8%
# 2
 
0.4%
1
 
0.2%
1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 105
22.2%
2 66
14.0%
0 63
13.3%
3 44
9.3%
9 40
 
8.5%
5 39
 
8.2%
7 35
 
7.4%
4 32
 
6.8%
8 30
 
6.3%
6 19
 
4.0%
Other Symbol
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Close Punctuation
ValueCountFrequency (%)
) 2775
99.9%
] 3
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2740
99.9%
[ 3
 
0.1%
Space Separator
ValueCountFrequency (%)
3607
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51318
68.2%
Latin 13735
 
18.3%
Common 10168
 
13.5%
Han 32
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2344
 
4.6%
2303
 
4.5%
1893
 
3.7%
1637
 
3.2%
1203
 
2.3%
1081
 
2.1%
1038
 
2.0%
983
 
1.9%
762
 
1.5%
680
 
1.3%
Other values (977) 37394
72.9%
Latin
ValueCountFrequency (%)
e 871
 
6.3%
o 775
 
5.6%
a 691
 
5.0%
n 548
 
4.0%
A 531
 
3.9%
i 507
 
3.7%
E 499
 
3.6%
r 441
 
3.2%
t 435
 
3.2%
L 430
 
3.1%
Other values (43) 8007
58.3%
Common
ValueCountFrequency (%)
3607
35.5%
) 2775
27.3%
( 2740
26.9%
. 283
 
2.8%
1 105
 
1.0%
, 84
 
0.8%
- 80
 
0.8%
& 74
 
0.7%
2 66
 
0.6%
0 63
 
0.6%
Other values (20) 291
 
2.9%
Han
ValueCountFrequency (%)
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (20) 20
62.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51310
68.2%
ASCII 23898
31.8%
CJK 30
 
< 0.1%
None 6
 
< 0.1%
Compat Jamo 4
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
Number Forms 1
 
< 0.1%
Misc Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3607
 
15.1%
) 2775
 
11.6%
( 2740
 
11.5%
e 871
 
3.6%
o 775
 
3.2%
a 691
 
2.9%
n 548
 
2.3%
A 531
 
2.2%
i 507
 
2.1%
E 499
 
2.1%
Other values (68) 10354
43.3%
Hangul
ValueCountFrequency (%)
2344
 
4.6%
2303
 
4.5%
1893
 
3.7%
1637
 
3.2%
1203
 
2.3%
1081
 
2.1%
1038
 
2.0%
983
 
1.9%
762
 
1.5%
680
 
1.3%
Other values (973) 37386
72.9%
None
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
CJK
ValueCountFrequency (%)
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (18) 18
60.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Distinct9805
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-23 16:59:27
Maximum2024-04-04 15:09:41
2024-04-06T22:06:45.328925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:45.586128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7950 
U
2050 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowU
3rd rowU
4th rowU
5th rowI

Common Values

ValueCountFrequency (%)
I 7950
79.5%
U 2050
 
20.5%

Length

2024-04-06T22:06:45.808679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:45.981871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7950
79.5%
u 2050
 
20.5%
Distinct1559
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:06:00
2024-04-06T22:06:46.180042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T22:06:46.435877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct357
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-06T22:06:46.689603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.274
Min length1

Characters and Unicode

Total characters82740
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique218 ?
Unique (%)2.2%

Sample

1st row레져/여행/공연
2nd row의류/패션/잡화/뷰티
3rd row교육/도서/완구/오락
4th row기타
5th row기타
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4083
32.0%
기타 2219
17.4%
종합몰 1758
13.8%
1413
 
11.1%
교육/도서/완구/오락 731
 
5.7%
건강/식품 708
 
5.5%
레져/여행/공연 664
 
5.2%
가구/수납용품 315
 
2.5%
컴퓨터/사무용품 307
 
2.4%
가전 292
 
2.3%
Other values (3) 269
 
2.1%
2024-04-06T22:06:47.260639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 17279
20.9%
4083
 
4.9%
4083
 
4.9%
4083
 
4.9%
4083
 
4.9%
4083
 
4.9%
4083
 
4.9%
4083
 
4.9%
4083
 
4.9%
2759
 
3.3%
Other values (41) 30038
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 61289
74.1%
Other Punctuation 17279
 
20.9%
Space Separator 2759
 
3.3%
Dash Punctuation 1413
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
2219
 
3.6%
2219
 
3.6%
Other values (38) 24187
39.5%
Other Punctuation
ValueCountFrequency (%)
/ 17279
100.0%
Space Separator
ValueCountFrequency (%)
2759
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1413
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 61289
74.1%
Common 21451
 
25.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
2219
 
3.6%
2219
 
3.6%
Other values (38) 24187
39.5%
Common
ValueCountFrequency (%)
/ 17279
80.6%
2759
 
12.9%
- 1413
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 61289
74.1%
ASCII 21451
 
25.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 17279
80.6%
2759
 
12.9%
- 1413
 
6.6%
Hangul
ValueCountFrequency (%)
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
4083
 
6.7%
2219
 
3.6%
2219
 
3.6%
Other values (38) 24187
39.5%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct3811
Distinct (%)44.4%
Missing1410
Missing (%)14.1%
Infinite0
Infinite (%)0.0%
Mean199385.44
Minimum176377.97
Maximum201974.15
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T22:06:47.504410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum176377.97
5-th percentile196820.33
Q1198269.33
median199357.47
Q3200656.26
95-th percentile201671.7
Maximum201974.15
Range25596.175
Interquartile range (IQR)2386.9309

Descriptive statistics

Standard deviation1555.3912
Coefficient of variation (CV)0.0078009266
Kurtosis4.480688
Mean199385.44
Median Absolute Deviation (MAD)1184.7875
Skewness-0.56482522
Sum1.712721 × 109
Variance2419241.8
MonotonicityNot monotonic
2024-04-06T22:06:47.788707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201273.887781838 324
 
3.2%
200542.258243379 187
 
1.9%
198324.653631679 95
 
0.9%
197607.398336321 93
 
0.9%
198150.300374121 93
 
0.9%
199812.609633255 89
 
0.9%
201359.53353905 73
 
0.7%
201258.062657124 68
 
0.7%
199509.962000656 61
 
0.6%
197567.747824202 52
 
0.5%
Other values (3801) 7455
74.6%
(Missing) 1410
 
14.1%
ValueCountFrequency (%)
176377.972946702 1
< 0.1%
195784.339558492 1
< 0.1%
195792.852713222 1
< 0.1%
195792.871766006 1
< 0.1%
195877.645568351 2
< 0.1%
195903.852646231 1
< 0.1%
195935.488990947 1
< 0.1%
195982.475968192 1
< 0.1%
195998.732099539 1
< 0.1%
196005.232881324 1
< 0.1%
ValueCountFrequency (%)
201974.148316461 1
 
< 0.1%
201963.938724209 2
 
< 0.1%
201962.62904341 4
 
< 0.1%
201960.951300031 30
0.3%
201960.304346071 2
 
< 0.1%
201959.524603547 1
 
< 0.1%
201949.750623535 1
 
< 0.1%
201949.65163583 2
 
< 0.1%
201949.270716491 8
 
0.1%
201946.963231249 1
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct3809
Distinct (%)44.3%
Missing1410
Missing (%)14.1%
Infinite0
Infinite (%)0.0%
Mean452665.99
Minimum442117.04
Maximum457064.07
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-06T22:06:48.004124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum442117.04
5-th percentile451921.6
Q1452120.03
median452357.01
Q3452807.18
95-th percentile454877.07
Maximum457064.07
Range14947.026
Interquartile range (IQR)687.15569

Descriptive statistics

Standard deviation939.72201
Coefficient of variation (CV)0.0020759722
Kurtosis7.9235887
Mean452665.99
Median Absolute Deviation (MAD)299.88743
Skewness2.2270393
Sum3.8884009 × 109
Variance883077.46
MonotonicityNot monotonic
2024-04-06T22:06:48.226702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
452163.292543039 324
 
3.2%
452057.126859583 187
 
1.9%
452252.812389497 95
 
0.9%
452019.212642931 93
 
0.9%
452120.026024435 93
 
0.9%
452132.648343993 89
 
0.9%
452395.659064842 73
 
0.7%
452787.132831042 68
 
0.7%
451888.555001545 61
 
0.6%
452347.973860449 52
 
0.5%
Other values (3799) 7455
74.6%
(Missing) 1410
 
14.1%
ValueCountFrequency (%)
442117.043689711 1
 
< 0.1%
451543.629004831 1
 
< 0.1%
451580.22699738 1
 
< 0.1%
451596.900364676 1
 
< 0.1%
451658.76871749 1
 
< 0.1%
451670.158825331 3
< 0.1%
451682.186997114 1
 
< 0.1%
451718.36932938 1
 
< 0.1%
451733.540405524 1
 
< 0.1%
451759.857202136 1
 
< 0.1%
ValueCountFrequency (%)
457064.069468618 1
< 0.1%
457056.172712929 1
< 0.1%
457022.886371969 1
< 0.1%
456992.820036366 1
< 0.1%
456937.191304131 1
< 0.1%
456926.240434731 1
< 0.1%
456924.2263248 1
< 0.1%
456911.801056534 1
< 0.1%
456908.959142753 1
< 0.1%
456905.265647387 1
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9473 
0
 
527

Length

Max length4
Median length4
Mean length3.8419
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9473
94.7%
0 527
 
5.3%

Length

2024-04-06T22:06:48.477536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:48.643293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9473
94.7%
0 527
 
5.3%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9473 
0
 
527

Length

Max length4
Median length4
Mean length3.8419
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9473
94.7%
0 527
 
5.3%

Length

2024-04-06T22:06:48.813251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:48.968355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9473
94.7%
0 527
 
5.3%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9473 
0
 
527

Length

Max length4
Median length4
Mean length3.8419
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9473
94.7%
0 527
 
5.3%

Length

2024-04-06T22:06:49.129365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T22:06:49.298321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9473
94.7%
0 527
 
5.3%

판매방식명
Categorical

IMBALANCE 

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
5014 
<NA>
4605 
인터넷, 기타
 
134
기타
 
50
TV홈쇼핑, 인터넷
 
35
Other values (17)
 
162

Length

Max length26
Median length3
Mean length3.7266
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row인터넷
4th row<NA>
5th row인터넷

Common Values

ValueCountFrequency (%)
인터넷 5014
50.1%
<NA> 4605
46.1%
인터넷, 기타 134
 
1.3%
기타 50
 
0.5%
TV홈쇼핑, 인터넷 35
 
0.4%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 34
 
0.3%
인터넷, 카다로그 27
 
0.3%
인터넷, 신문잡지 16
 
0.2%
인터넷, 신문잡지, 기타 12
 
0.1%
인터넷, 카다로그, 신문잡지, 기타 11
 
0.1%
Other values (12) 62
 
0.6%

Length

2024-04-06T22:06:49.477565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 5323
50.7%
na 4605
43.8%
기타 263
 
2.5%
카다로그 115
 
1.1%
tv홈쇼핑 103
 
1.0%
신문잡지 93
 
0.9%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
17553000000200530001013020262520050608<NA>3폐업3폐업처리20170418<NA><NA><NA>02 720 3170<NA>110858서울특별시 종로구 수송동 **번지 두산위브파빌리온 ***호서울특별시 종로구 삼봉로 **, ***호 (수송동,두산위브파빌리온)<NA>(주)골드스톤코리아2017-04-19 09:42:35I2018-08-31 23:59:59.0레져/여행/공연198324.653632452252.812389<NA><NA><NA>인터넷
175533000000201930002023020152820191028<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 종로*가 *번지 *층 *호서울특별시 종로구 종로 ***, *층 *호 (종로*가)03139신골드(신캐스팅)2019-11-21 15:17:29U2019-11-23 02:40:00.0의류/패션/잡화/뷰티199029.582981452031.423022<NA><NA><NA>인터넷
174873000000201930002023020145720191008<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-783-1779<NA><NA>서울특별시 종로구 신문로*가 **번지 *층 ***호서울특별시 종로구 새문안로 **, *층 ***호 (신문로*가)03182주식회사 국제회계세무교육원2019-11-13 18:36:23U2019-11-15 02:40:00.0교육/도서/완구/오락197680.78008452016.543891<NA><NA><NA>인터넷
163303000000201930002023020016320190125<NA>3폐업3폐업처리20221102<NA><NA><NA><NA><NA><NA>서울특별시 종로구 와룡동 ***번지 *층서울특별시 종로구 돈화문로 **, *층 (와룡동)03131오마뎅 창덕궁점2022-11-01 14:14:01U2021-11-01 00:03:00.0기타199074.670635452648.894407<NA><NA><NA><NA>
141783000000201730001693020063920170602<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-720-6202<NA><NA>서울특별시 종로구 명륜*가 **번지 동숭비즈니스센터 *층 ***호서울특별시 종로구 성균관로*길 **, *층 ***호 (명륜*가, 동숭비즈니스센터)03074(재)전문무용수지원센터2017-06-05 10:23:32I2018-08-31 23:59:59.0기타199781.667014453645.757969<NA><NA><NA>인터넷
8333000000200330001013020116720031024<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 741 7920<NA><NA>서울특별시 종로구 종로*가***-* 신흥빌딩 *층<NA><NA>K.LINE2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
229583000000202230002023020060520220425<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 원남동 *-*서울특별시 종로구 창경궁로**길 **-*, *층 (원남동)03081이베르솝2022-04-26 10:38:52I2021-12-03 22:08:00.0의류/패션/잡화/뷰티199782.151586452700.900679<NA><NA><NA><NA>
57793000000200930001293020017220090217<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>-<NA>110054서울특별시 종로구 사직동 *번지 광화문풍림스페이스본 ***동 ***호서울특별시 종로구 사직로*길 *, ***동 ***호 (사직동,광화문풍림스페이스본)<NA>숙영이네2016-11-28 12:00:52I2018-08-31 23:59:59.0의류/패션/잡화/뷰티197181.393302452458.826652<NA><NA><NA>인터넷
13581300000020163000169302012852016-12-01<NA>5제외/삭제/전출5타시군구이관2023-02-13<NA><NA><NA>02-757-9957<NA><NA>서울특별시 종로구 종로*가 **번지 *호 대보귀금속상가 ***호서울특별시 종로구 종로 ***-*, ***호 (종로*가, 대보귀금속상가)03138링크 (LINK)2023-02-13 10:14:10U2022-12-01 23:05:00.0의류/패션/잡화/뷰티199368.555549452053.574019<NA><NA><NA><NA>
222683000000202130002023020214420211209<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 혜화동 **-* 성연아트홀서울특별시 종로구 창경궁로**길 **, 성연아트홀 지하*층 (혜화동)03076탁 앙상블(Takk)2021-12-10 13:17:41I2021-12-14 00:22:42.0종합몰200118.064136453848.810955000인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
89213000000201230001293020002320120103<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>-<NA><NA>서울특별시 종로구 연건동 ***번지 *호서울특별시 종로구 율곡로**가길 **-* (연건동)110460마블링2019-06-19 16:39:36U2019-06-21 02:40:00.0의류/패션/잡화/뷰티199854.418499452696.808188<NA><NA><NA>인터넷
24654300000020233000202302005412023-04-05<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 명륜*가 **-** 영은타운서울특별시 종로구 혜화로*길 **, ***호 (명륜*가, 영은타운)03068화목백화2023-04-06 19:28:01I2022-12-04 00:08:00.0종합몰199901.475401453888.559593<NA><NA><NA><NA>
52543000000200830001293020091920080808<NA>3폐업3폐업처리20101130<NA><NA><NA>02-1544-3238<NA>110753서울특별시 종로구 종로*가 *번지 Y M C A ***호서울특별시 종로구 종로 **, ***호 (종로*가,Y M C A)<NA>두솔에듀케이션2010-11-30 16:47:22I2018-08-31 23:59:59.0기타198630.796271452035.412647<NA><NA><NA>인터넷
27123000000200630001013020389820060607<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 2232 6180<NA><NA>서울특별시 종로구 숭인동***-**<NA><NA>H2J2008-02-21 00:00:00I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
115993000000201530001693020007020150112<NA>3폐업3폐업처리20160127<NA><NA><NA>-<NA>110829서울특별시 종로구 숭인동 ****번지 *호 한양립스 ****호서울특별시 종로구 난계로**가길 **, ****호 (숭인동, 한양립스)110829폴앤다이드 (FALL & DYED)2016-01-27 11:30:05I2018-08-31 23:59:59.0의류/패션/잡화/뷰티201906.610017452369.83658<NA><NA><NA>인터넷
33953000000200730001013020466520070206<NA>3폐업3폐업처리20100128<NA><NA><NA>02 2266 2877<NA><NA>서울특별시 종로구 종로*가***-** *층<NA><NA>엘제이스토어2010-01-28 13:54:12I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
99353000000201330001293020017920130225<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02-723-0082<NA><NA>서울특별시 종로구 견지동 **번지 *호 S&S빌딩 ***호서울특별시 종로구 우정국로 **, S&S빌딩 ***호 (견지동)03145주식회사 뚜르드월드2022-03-28 19:37:37U2021-12-02 21:00:00.0레져/여행/공연198457.208292452339.530425<NA><NA><NA><NA>
157413000000201830001693020106520180828<NA>3폐업3폐업처리20220809<NA><NA><NA>02-735-1122<NA><NA>서울특별시 종로구 수송동 **번지 두산위브파빌리온 ***호서울특별시 종로구 삼봉로 **, 두산위브파빌리온 ***호 (수송동)03150(주)헬로하이난2022-08-11 09:37:23U2021-12-07 23:03:00.0레져/여행/공연198324.653632452252.812389<NA><NA><NA><NA>
216523000000202130002023020151720210803<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 숭인동 ***-** 숭인상가아파트서울특별시 종로구 청계천로 ***, ***호 (숭인동, 숭인상가아파트)03117직구쌤2021-08-05 15:33:31I2021-08-07 00:22:51.0종합몰201747.387869452204.768497000인터넷
122823000000201530001693020089620150911<NA>3폐업3폐업처리20170216<NA><NA><NA>-<NA><NA>서울특별시 종로구 원서동 **번지서울특별시 종로구 창덕궁길 ** (원서동)03057쿠에른 CUEREN2017-02-16 10:32:01I2018-08-31 23:59:59.0의류/패션/잡화/뷰티198969.268095453191.821713<NA><NA><NA>인터넷