Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells78002
Missing cells (%)26.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory250.0 B

Variable types

Categorical9
Numeric6
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author중구
URLhttps://data.seoul.go.kr/dataList/OA-18803/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
판매방식명 is highly imbalanced (69.5%)Imbalance
인허가취소일자 has 9512 (95.1%) missing valuesMissing
폐업일자 has 6447 (64.5%) missing valuesMissing
휴업시작일자 has 9961 (99.6%) missing valuesMissing
휴업종료일자 has 9961 (99.6%) missing valuesMissing
재개업일자 has 9975 (99.8%) missing valuesMissing
전화번호 has 1940 (19.4%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7923 (79.2%) missing valuesMissing
지번주소 has 823 (8.2%) missing valuesMissing
도로명주소 has 2480 (24.8%) missing valuesMissing
도로명우편번호 has 4310 (43.1%) missing valuesMissing
좌표정보(X) has 2335 (23.4%) missing valuesMissing
좌표정보(Y) has 2335 (23.4%) missing valuesMissing
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-22 00:02:08.867191
Analysis finished2024-04-22 00:02:10.744878
Duration1.88 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3010000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3010000
2nd row3010000
3rd row3010000
4th row3010000
5th row3010000

Common Values

ValueCountFrequency (%)
3010000 10000
100.0%

Length

2024-04-22T09:02:10.798969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:10.879515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3010000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0126221 × 1018
Minimum1.996301 × 1018
Maximum2.021301 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:02:10.979292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.996301 × 1018
5-th percentile2.003301 × 1018
Q12.007301 × 1018
median2.012301 × 1018
Q32.018301 × 1018
95-th percentile2.021301 × 1018
Maximum2.021301 × 1018
Range2.5000007 × 1016
Interquartile range (IQR)1.1000003 × 1016

Descriptive statistics

Standard deviation6.0649733 × 1015
Coefficient of variation (CV)0.0030134685
Kurtosis-1.2141843
Mean2.0126221 × 1018
Median Absolute Deviation (MAD)6 × 1015
Skewness-0.17537436
Sum8.2333855 × 1017
Variance3.6783901 × 1031
MonotonicityNot monotonic
2024-04-22T09:02:11.102036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2021301016530201547 1
 
< 0.1%
2016301013030200375 1
 
< 0.1%
2020301016530201078 1
 
< 0.1%
2021301016530200595 1
 
< 0.1%
2013301013030201057 1
 
< 0.1%
2005301010030201565 1
 
< 0.1%
2017301013030201420 1
 
< 0.1%
2004301010030201061 1
 
< 0.1%
2005301010030201261 1
 
< 0.1%
2008301010030201043 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1996301010030200001 1
< 0.1%
1996301010030200003 1
< 0.1%
1996301010030200006 1
< 0.1%
1996301010030200009 1
< 0.1%
1996301010030200011 1
< 0.1%
1996301010030200012 1
< 0.1%
1996301010030200016 1
< 0.1%
1996301010030200018 1
< 0.1%
1996301010030200025 1
< 0.1%
1996301010030200030 1
< 0.1%
ValueCountFrequency (%)
2021301016530201717 1
< 0.1%
2021301016530201716 1
< 0.1%
2021301016530201715 1
< 0.1%
2021301016530201713 1
< 0.1%
2021301016530201711 1
< 0.1%
2021301016530201708 1
< 0.1%
2021301016530201707 1
< 0.1%
2021301016530201704 1
< 0.1%
2021301016530201698 1
< 0.1%
2021301016530201695 1
< 0.1%
Distinct3819
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1996-07-19 00:00:00
Maximum2021-07-07 00:00:00
2024-04-22T09:02:11.220391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:11.357893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Real number (ℝ)

MISSING 

Distinct66
Distinct (%)13.5%
Missing9512
Missing (%)95.1%
Infinite0
Infinite (%)0.0%
Mean20100225
Minimum20080228
Maximum20220125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:02:11.487915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20080228
5-th percentile20080603
Q120091130
median20091130
Q320101119
95-th percentile20200106
Maximum20220125
Range139897
Interquartile range (IQR)9989

Descriptive statistics

Standard deviation28911.371
Coefficient of variation (CV)0.0014383605
Kurtosis8.1106078
Mean20100225
Median Absolute Deviation (MAD)0
Skewness3.062265
Sum9.80891 × 109
Variance8.3586736 × 108
MonotonicityNot monotonic
2024-04-22T09:02:11.612555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20091130 272
 
2.7%
20101119 91
 
0.9%
20080603 37
 
0.4%
20090105 19
 
0.2%
20200806 3
 
< 0.1%
20101231 3
 
< 0.1%
20081229 2
 
< 0.1%
20081103 2
 
< 0.1%
20080723 2
 
< 0.1%
20200512 1
 
< 0.1%
Other values (56) 56
 
0.6%
(Missing) 9512
95.1%
ValueCountFrequency (%)
20080228 1
 
< 0.1%
20080313 1
 
< 0.1%
20080603 37
0.4%
20080723 2
 
< 0.1%
20080930 1
 
< 0.1%
20081008 1
 
< 0.1%
20081015 1
 
< 0.1%
20081022 1
 
< 0.1%
20081103 2
 
< 0.1%
20081107 1
 
< 0.1%
ValueCountFrequency (%)
20220125 1
< 0.1%
20211231 1
< 0.1%
20210722 1
< 0.1%
20210713 1
< 0.1%
20210330 1
< 0.1%
20210323 1
< 0.1%
20210304 1
< 0.1%
20210122 1
< 0.1%
20210115 1
< 0.1%
20210111 1
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
3853 
3
3446 
4
2466 
5
 
207
2
 
28

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row4
4th row3
5th row3

Common Values

ValueCountFrequency (%)
1 3853
38.5%
3 3446
34.5%
4 2466
24.7%
5 207
 
2.1%
2 28
 
0.3%

Length

2024-04-22T09:02:11.733884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:11.824475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 3853
38.5%
3 3446
34.5%
4 2466
24.7%
5 207
 
2.1%
2 28
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
3853 
폐업
3446 
취소/말소/만료/정지/중지
2466 
제외/삭제/전출
 
207
휴업
 
28

Length

Max length14
Median length8
Mean length6.2393
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row취소/말소/만료/정지/중지
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
영업/정상 3853
38.5%
폐업 3446
34.5%
취소/말소/만료/정지/중지 2466
24.7%
제외/삭제/전출 207
 
2.1%
휴업 28
 
0.3%

Length

2024-04-22T09:02:11.927378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:12.017180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 3853
38.5%
폐업 3446
34.5%
취소/말소/만료/정지/중지 2466
24.7%
제외/삭제/전출 207
 
2.1%
휴업 28
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1079
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:02:12.102892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q34
95-th percentile7
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.1968023
Coefficient of variation (CV)0.70684458
Kurtosis-0.71100593
Mean3.1079
Median Absolute Deviation (MAD)2
Skewness0.77560761
Sum31079
Variance4.8259402
MonotonicityNot monotonic
2024-04-22T09:02:12.198956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 3851
38.5%
3 3446
34.5%
7 1973
19.7%
4 493
 
4.9%
5 207
 
2.1%
2 28
 
0.3%
8 2
 
< 0.1%
ValueCountFrequency (%)
1 3851
38.5%
2 28
 
0.3%
3 3446
34.5%
4 493
 
4.9%
5 207
 
2.1%
7 1973
19.7%
8 2
 
< 0.1%
ValueCountFrequency (%)
8 2
 
< 0.1%
7 1973
19.7%
5 207
 
2.1%
4 493
 
4.9%
3 3446
34.5%
2 28
 
0.3%
1 3851
38.5%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
3851 
폐업처리
3446 
직권말소
1973 
직권취소
493 
타시군구이관
 
207
Other values (2)
 
30

Length

Max length6
Median length4
Mean length4.0414
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row직권말소
4th row폐업처리
5th row폐업처리

Common Values

ValueCountFrequency (%)
정상영업 3851
38.5%
폐업처리 3446
34.5%
직권말소 1973
19.7%
직권취소 493
 
4.9%
타시군구이관 207
 
2.1%
휴업처리 28
 
0.3%
영업재개 2
 
< 0.1%

Length

2024-04-22T09:02:12.322916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:12.442211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 3851
38.5%
폐업처리 3446
34.5%
직권말소 1973
19.7%
직권취소 493
 
4.9%
타시군구이관 207
 
2.1%
휴업처리 28
 
0.3%
영업재개 2
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2231
Distinct (%)62.8%
Missing6447
Missing (%)64.5%
Memory size156.2 KiB
Minimum1996-11-21 00:00:00
Maximum2024-12-31 00:00:00
2024-04-22T09:02:12.559319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:12.671087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct37
Distinct (%)94.9%
Missing9961
Missing (%)99.6%
Memory size156.2 KiB
Minimum2007-06-05 00:00:00
Maximum2024-01-17 00:00:00
2024-04-22T09:02:12.983086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:13.105843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)

휴업종료일자
Date

MISSING 

Distinct35
Distinct (%)89.7%
Missing9961
Missing (%)99.6%
Memory size156.2 KiB
Minimum2008-03-31 00:00:00
Maximum2030-12-31 00:00:00
2024-04-22T09:02:13.209332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:13.320458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)

재개업일자
Date

MISSING 

Distinct17
Distinct (%)68.0%
Missing9975
Missing (%)99.8%
Memory size156.2 KiB
Minimum2008-08-05 00:00:00
Maximum2021-12-02 00:00:00
2024-04-22T09:02:13.429088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:13.523616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)

전화번호
Text

MISSING 

Distinct6843
Distinct (%)84.9%
Missing1940
Missing (%)19.4%
Memory size156.2 KiB
2024-04-22T09:02:13.802032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length10.393921
Min length1

Characters and Unicode

Total characters83775
Distinct characters18
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6746 ?
Unique (%)83.7%

Sample

1st row.
2nd row070-4624-8182
3rd row02-319-8866
4th row070-4286-3701
5th row02 757 2231
ValueCountFrequency (%)
02 2787
 
20.0%
1472
 
10.6%
2233 78
 
0.6%
2232 73
 
0.5%
070 68
 
0.5%
2231 68
 
0.5%
2253 67
 
0.5%
2252 67
 
0.5%
777 66
 
0.5%
00 64
 
0.5%
Other values (6859) 9124
65.5%
2024-04-22T09:02:14.214292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 15585
18.6%
0 10983
13.1%
9569
11.4%
7 7443
8.9%
- 7411
8.8%
3 6090
 
7.3%
5 4873
 
5.8%
6 4697
 
5.6%
8 4551
 
5.4%
1 4389
 
5.2%
Other values (8) 8184
9.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 65930
78.7%
Space Separator 9569
 
11.4%
Dash Punctuation 7411
 
8.8%
Other Punctuation 841
 
1.0%
Math Symbol 22
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 15585
23.6%
0 10983
16.7%
7 7443
11.3%
3 6090
 
9.2%
5 4873
 
7.4%
6 4697
 
7.1%
8 4551
 
6.9%
1 4389
 
6.7%
4 3813
 
5.8%
9 3506
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 756
89.9%
, 74
 
8.8%
/ 11
 
1.3%
Math Symbol
ValueCountFrequency (%)
~ 21
95.5%
+ 1
 
4.5%
Space Separator
ValueCountFrequency (%)
9569
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7411
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 83775
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 15585
18.6%
0 10983
13.1%
9569
11.4%
7 7443
8.9%
- 7411
8.8%
3 6090
 
7.3%
5 4873
 
5.8%
6 4697
 
5.6%
8 4551
 
5.4%
1 4389
 
5.2%
Other values (8) 8184
9.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 83775
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 15585
18.6%
0 10983
13.1%
9569
11.4%
7 7443
8.9%
- 7411
8.8%
3 6090
 
7.3%
5 4873
 
5.8%
6 4697
 
5.6%
8 4551
 
5.4%
1 4389
 
5.2%
Other values (8) 8184
9.8%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct158
Distinct (%)7.6%
Missing7923
Missing (%)79.2%
Infinite0
Infinite (%)0.0%
Mean100436.1
Minimum100011
Maximum140012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:02:14.350771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100011
5-th percentile100015
Q1100193
median100340
Q3100450
95-th percentile100824
Maximum140012
Range40001
Interquartile range (IQR)257

Descriptive statistics

Standard deviation1850.3104
Coefficient of variation (CV)0.018422763
Kurtosis370.24344
Mean100436.1
Median Absolute Deviation (MAD)110
Skewness18.969625
Sum2.0860578 × 108
Variance3423648.7
MonotonicityNot monotonic
2024-04-22T09:02:14.508307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100450 473
 
4.7%
100196 113
 
1.1%
100060 98
 
1.0%
100440 64
 
0.6%
100195 43
 
0.4%
100310 43
 
0.4%
100430 40
 
0.4%
100272 38
 
0.4%
100330 36
 
0.4%
100193 35
 
0.4%
Other values (148) 1094
 
10.9%
(Missing) 7923
79.2%
ValueCountFrequency (%)
100011 20
0.2%
100012 22
0.2%
100013 23
0.2%
100014 21
0.2%
100015 19
0.2%
100021 1
 
< 0.1%
100022 15
0.1%
100031 1
 
< 0.1%
100032 17
0.2%
100041 9
 
0.1%
ValueCountFrequency (%)
140012 1
 
< 0.1%
139200 1
 
< 0.1%
138160 1
 
< 0.1%
133093 1
 
< 0.1%
132030 1
 
< 0.1%
121210 1
 
< 0.1%
100951 3
< 0.1%
100899 4
< 0.1%
100891 2
< 0.1%
100890 2
< 0.1%

지번주소
Text

MISSING 

Distinct4592
Distinct (%)50.0%
Missing823
Missing (%)8.2%
Memory size156.2 KiB
2024-04-22T09:02:14.725563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length71
Mean length29.97886
Min length12

Characters and Unicode

Total characters275116
Distinct characters507
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3693 ?
Unique (%)40.2%

Sample

1st row서울특별시 중구 신당동 *** 광희시장
2nd row서울특별시 중구 신당동 ***번지 제일평화시장
3rd row서울특별시 중구 신당동 ***번지 **호
4th row서울특별시 중구 태평로*가 **-**
5th row서울특별시 중구 신당동 **번지 **호 우진빌딩B***
ValueCountFrequency (%)
서울특별시 9163
19.6%
중구 9158
19.6%
번지 5380
11.5%
4690
10.0%
신당동 2637
 
5.6%
2497
 
5.3%
을지로*가 1146
 
2.4%
638
 
1.4%
남창동 637
 
1.4%
충무로*가 466
 
1.0%
Other values (2871) 10381
22.2%
2024-04-22T09:02:15.107798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
82723
30.1%
* 46359
16.9%
9567
 
3.5%
9445
 
3.4%
9323
 
3.4%
9293
 
3.4%
9233
 
3.4%
9187
 
3.3%
9172
 
3.3%
7693
 
2.8%
Other values (497) 73121
26.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138690
50.4%
Space Separator 82723
30.1%
Other Punctuation 46689
 
17.0%
Dash Punctuation 3521
 
1.3%
Decimal Number 1777
 
0.6%
Uppercase Letter 795
 
0.3%
Lowercase Letter 612
 
0.2%
Close Punctuation 127
 
< 0.1%
Open Punctuation 126
 
< 0.1%
Math Symbol 50
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9567
 
6.9%
9445
 
6.8%
9323
 
6.7%
9293
 
6.7%
9233
 
6.7%
9187
 
6.6%
9172
 
6.6%
7693
 
5.5%
7225
 
5.2%
6007
 
4.3%
Other values (427) 52545
37.9%
Uppercase Letter
ValueCountFrequency (%)
B 163
20.5%
A 116
14.6%
C 85
10.7%
D 56
 
7.0%
F 48
 
6.0%
M 48
 
6.0%
P 45
 
5.7%
E 35
 
4.4%
T 32
 
4.0%
I 24
 
3.0%
Other values (13) 143
18.0%
Lowercase Letter
ValueCountFrequency (%)
e 113
18.5%
n 99
16.2%
i 77
12.6%
a 65
10.6%
s 37
 
6.0%
o 35
 
5.7%
r 34
 
5.6%
c 33
 
5.4%
t 32
 
5.2%
h 21
 
3.4%
Other values (10) 66
10.8%
Decimal Number
ValueCountFrequency (%)
1 403
22.7%
2 309
17.4%
3 187
10.5%
0 168
9.5%
4 155
 
8.7%
5 134
 
7.5%
6 112
 
6.3%
8 107
 
6.0%
7 106
 
6.0%
9 96
 
5.4%
Other Punctuation
ValueCountFrequency (%)
* 46359
99.3%
, 232
 
0.5%
/ 50
 
0.1%
. 38
 
0.1%
& 7
 
< 0.1%
@ 2
 
< 0.1%
¡ 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 126
99.2%
] 1
 
0.8%
Space Separator
ValueCountFrequency (%)
82723
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3521
100.0%
Open Punctuation
ValueCountFrequency (%)
( 126
100.0%
Math Symbol
ValueCountFrequency (%)
~ 50
100.0%
Other Symbol
ValueCountFrequency (%)
° 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138690
50.4%
Common 135014
49.1%
Latin 1412
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9567
 
6.9%
9445
 
6.8%
9323
 
6.7%
9293
 
6.7%
9233
 
6.7%
9187
 
6.6%
9172
 
6.6%
7693
 
5.5%
7225
 
5.2%
6007
 
4.3%
Other values (427) 52545
37.9%
Latin
ValueCountFrequency (%)
B 163
 
11.5%
A 116
 
8.2%
e 113
 
8.0%
n 99
 
7.0%
C 85
 
6.0%
i 77
 
5.5%
a 65
 
4.6%
D 56
 
4.0%
F 48
 
3.4%
M 48
 
3.4%
Other values (36) 542
38.4%
Common
ValueCountFrequency (%)
82723
61.3%
* 46359
34.3%
- 3521
 
2.6%
1 403
 
0.3%
2 309
 
0.2%
, 232
 
0.2%
3 187
 
0.1%
0 168
 
0.1%
4 155
 
0.1%
5 134
 
0.1%
Other values (14) 823
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138690
50.4%
ASCII 136419
49.6%
Number Forms 5
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
82723
60.6%
* 46359
34.0%
- 3521
 
2.6%
1 403
 
0.3%
2 309
 
0.2%
, 232
 
0.2%
3 187
 
0.1%
0 168
 
0.1%
B 163
 
0.1%
4 155
 
0.1%
Other values (55) 2199
 
1.6%
Hangul
ValueCountFrequency (%)
9567
 
6.9%
9445
 
6.8%
9323
 
6.7%
9293
 
6.7%
9233
 
6.7%
9187
 
6.6%
9172
 
6.6%
7693
 
5.5%
7225
 
5.2%
6007
 
4.3%
Other values (427) 52545
37.9%
Number Forms
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
None
ValueCountFrequency (%)
° 1
50.0%
¡ 1
50.0%

도로명주소
Text

MISSING 

Distinct5781
Distinct (%)76.9%
Missing2480
Missing (%)24.8%
Memory size156.2 KiB
2024-04-22T09:02:15.380095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length62
Mean length35.326064
Min length19

Characters and Unicode

Total characters265652
Distinct characters528
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4961 ?
Unique (%)66.0%

Sample

1st row서울특별시 중구 마장로*길 **, 광희패션몰내 퀸즈스퀘어 *층 ***호 (신당동)
2nd row서울특별시 중구 마장로 **, 제일평화시장 *층 **,**호 (신당동)
3rd row서울특별시 중구 청계천로 ***, ***동 ****호 (황학동, 롯데캐슬베네치아)
4th row서울특별시 중구 장충단로 ***-** (장충동*가)
5th row서울특별시 중구 명동길 **, **층 (명동*가, 태흥빌딩)
ValueCountFrequency (%)
7620
15.0%
서울특별시 7520
14.8%
중구 7507
14.8%
3536
 
7.0%
2836
 
5.6%
신당동 1712
 
3.4%
을지로*가 659
 
1.3%
퇴계로**길 588
 
1.2%
퇴계로 561
 
1.1%
다산로**길 505
 
1.0%
Other values (3495) 17602
34.8%
2024-04-22T09:02:15.790618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 47650
17.9%
43357
 
16.3%
, 9184
 
3.5%
8655
 
3.3%
8279
 
3.1%
8128
 
3.1%
7958
 
3.0%
7829
 
2.9%
7729
 
2.9%
7640
 
2.9%
Other values (518) 109243
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 144884
54.5%
Other Punctuation 56937
 
21.4%
Space Separator 43357
 
16.3%
Close Punctuation 7637
 
2.9%
Open Punctuation 7636
 
2.9%
Dash Punctuation 1779
 
0.7%
Decimal Number 1731
 
0.7%
Uppercase Letter 1131
 
0.4%
Lowercase Letter 475
 
0.2%
Math Symbol 78
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8655
 
6.0%
8279
 
5.7%
8128
 
5.6%
7958
 
5.5%
7829
 
5.4%
7729
 
5.3%
7640
 
5.3%
7551
 
5.2%
7533
 
5.2%
5269
 
3.6%
Other values (445) 68313
47.2%
Uppercase Letter
ValueCountFrequency (%)
B 263
23.3%
A 218
19.3%
C 101
 
8.9%
E 77
 
6.8%
D 64
 
5.7%
M 49
 
4.3%
T 42
 
3.7%
I 40
 
3.5%
P 40
 
3.5%
F 38
 
3.4%
Other values (15) 199
17.6%
Lowercase Letter
ValueCountFrequency (%)
e 90
18.9%
a 70
14.7%
n 61
12.8%
i 50
10.5%
s 31
 
6.5%
c 27
 
5.7%
t 24
 
5.1%
r 22
 
4.6%
p 20
 
4.2%
l 18
 
3.8%
Other values (12) 62
13.1%
Decimal Number
ValueCountFrequency (%)
1 353
20.4%
2 305
17.6%
0 215
12.4%
3 202
11.7%
4 147
8.5%
6 130
 
7.5%
5 117
 
6.8%
7 91
 
5.3%
8 90
 
5.2%
9 81
 
4.7%
Other Punctuation
ValueCountFrequency (%)
* 47650
83.7%
, 9184
 
16.1%
/ 73
 
0.1%
. 23
 
< 0.1%
& 4
 
< 0.1%
# 2
 
< 0.1%
@ 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
43357
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7637
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7636
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1779
100.0%
Math Symbol
ValueCountFrequency (%)
~ 78
100.0%
Other Number
ValueCountFrequency (%)
½ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 144884
54.5%
Common 119156
44.9%
Latin 1612
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8655
 
6.0%
8279
 
5.7%
8128
 
5.6%
7958
 
5.5%
7829
 
5.4%
7729
 
5.3%
7640
 
5.3%
7551
 
5.2%
7533
 
5.2%
5269
 
3.6%
Other values (445) 68313
47.2%
Latin
ValueCountFrequency (%)
B 263
16.3%
A 218
 
13.5%
C 101
 
6.3%
e 90
 
5.6%
E 77
 
4.8%
a 70
 
4.3%
D 64
 
4.0%
n 61
 
3.8%
i 50
 
3.1%
M 49
 
3.0%
Other values (40) 569
35.3%
Common
ValueCountFrequency (%)
* 47650
40.0%
43357
36.4%
, 9184
 
7.7%
) 7637
 
6.4%
( 7636
 
6.4%
- 1779
 
1.5%
1 353
 
0.3%
2 305
 
0.3%
0 215
 
0.2%
3 202
 
0.2%
Other values (13) 838
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 144884
54.5%
ASCII 120761
45.5%
Number Forms 6
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 47650
39.5%
43357
35.9%
, 9184
 
7.6%
) 7637
 
6.3%
( 7636
 
6.3%
- 1779
 
1.5%
1 353
 
0.3%
2 305
 
0.3%
B 263
 
0.2%
A 218
 
0.2%
Other values (59) 2379
 
2.0%
Hangul
ValueCountFrequency (%)
8655
 
6.0%
8279
 
5.7%
8128
 
5.6%
7958
 
5.5%
7829
 
5.4%
7729
 
5.3%
7640
 
5.3%
7551
 
5.2%
7533
 
5.2%
5269
 
3.6%
Other values (445) 68313
47.2%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
None
ValueCountFrequency (%)
½ 1
100.0%

도로명우편번호
Text

MISSING 

Distinct386
Distinct (%)6.8%
Missing4310
Missing (%)43.1%
Memory size156.2 KiB
2024-04-22T09:02:16.073381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.2711775
Min length5

Characters and Unicode

Total characters29993
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)1.6%

Sample

1st row04567
2nd row04567
3rd row100-440
4th row04606
5th row04538
ValueCountFrequency (%)
04529 237
 
4.2%
04563 181
 
3.2%
04546 121
 
2.1%
04567 111
 
2.0%
04566 110
 
1.9%
04547 105
 
1.8%
04528 100
 
1.8%
04570 95
 
1.7%
04568 94
 
1.7%
04572 84
 
1.5%
Other values (376) 4452
78.2%
2024-04-22T09:02:16.600421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8368
27.9%
4 5565
18.6%
5 4609
15.4%
1 2687
 
9.0%
6 2244
 
7.5%
8 1492
 
5.0%
2 1380
 
4.6%
7 1272
 
4.2%
3 1189
 
4.0%
9 1084
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29890
99.7%
Dash Punctuation 103
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8368
28.0%
4 5565
18.6%
5 4609
15.4%
1 2687
 
9.0%
6 2244
 
7.5%
8 1492
 
5.0%
2 1380
 
4.6%
7 1272
 
4.3%
3 1189
 
4.0%
9 1084
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 103
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29993
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8368
27.9%
4 5565
18.6%
5 4609
15.4%
1 2687
 
9.0%
6 2244
 
7.5%
8 1492
 
5.0%
2 1380
 
4.6%
7 1272
 
4.2%
3 1189
 
4.0%
9 1084
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29993
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8368
27.9%
4 5565
18.6%
5 4609
15.4%
1 2687
 
9.0%
6 2244
 
7.5%
8 1492
 
5.0%
2 1380
 
4.6%
7 1272
 
4.2%
3 1189
 
4.0%
9 1084
 
3.6%
Distinct9757
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-22T09:02:16.956804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length38
Mean length7.1001
Min length1

Characters and Unicode

Total characters71001
Distinct characters1067
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9537 ?
Unique (%)95.4%

Sample

1st row썬데이무드(SUNDAY MOOD)
2nd row덤앤더머(DUMB&DUMBER)
3rd row물반고기반
4th row고상하
5th row필스테이 명동점
ValueCountFrequency (%)
주식회사 734
 
5.8%
132
 
1.0%
co 27
 
0.2%
company 26
 
0.2%
ltd 25
 
0.2%
22
 
0.2%
21
 
0.2%
co.,ltd 18
 
0.1%
the 16
 
0.1%
korea 15
 
0.1%
Other values (10757) 11632
91.8%
2024-04-22T09:02:17.421287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 2698
 
3.8%
( 2697
 
3.8%
2674
 
3.8%
2158
 
3.0%
2137
 
3.0%
1863
 
2.6%
1340
 
1.9%
977
 
1.4%
890
 
1.3%
872
 
1.2%
Other values (1057) 52695
74.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48141
67.8%
Lowercase Letter 7337
 
10.3%
Uppercase Letter 6297
 
8.9%
Close Punctuation 2698
 
3.8%
Open Punctuation 2697
 
3.8%
Space Separator 2674
 
3.8%
Other Punctuation 520
 
0.7%
Decimal Number 474
 
0.7%
Dash Punctuation 100
 
0.1%
Other Symbol 44
 
0.1%
Other values (3) 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2158
 
4.5%
2137
 
4.4%
1863
 
3.9%
1340
 
2.8%
977
 
2.0%
890
 
1.8%
872
 
1.8%
820
 
1.7%
670
 
1.4%
651
 
1.4%
Other values (973) 35763
74.3%
Lowercase Letter
ValueCountFrequency (%)
e 825
11.2%
o 773
 
10.5%
a 629
 
8.6%
n 595
 
8.1%
i 555
 
7.6%
r 472
 
6.4%
t 446
 
6.1%
l 412
 
5.6%
s 376
 
5.1%
c 255
 
3.5%
Other values (16) 1999
27.2%
Uppercase Letter
ValueCountFrequency (%)
O 507
 
8.1%
E 468
 
7.4%
A 455
 
7.2%
S 391
 
6.2%
I 363
 
5.8%
N 362
 
5.7%
T 359
 
5.7%
C 358
 
5.7%
L 341
 
5.4%
M 332
 
5.3%
Other values (16) 2361
37.5%
Other Punctuation
ValueCountFrequency (%)
. 299
57.5%
& 101
 
19.4%
, 62
 
11.9%
' 23
 
4.4%
: 11
 
2.1%
? 8
 
1.5%
/ 7
 
1.3%
# 5
 
1.0%
2
 
0.4%
* 2
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 117
24.7%
2 82
17.3%
0 55
11.6%
5 42
 
8.9%
4 41
 
8.6%
9 41
 
8.6%
3 37
 
7.8%
7 21
 
4.4%
8 21
 
4.4%
6 17
 
3.6%
Math Symbol
ValueCountFrequency (%)
+ 3
42.9%
= 2
28.6%
> 1
 
14.3%
< 1
 
14.3%
Letter Number
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 2698
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2697
100.0%
Space Separator
ValueCountFrequency (%)
2674
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 100
100.0%
Other Symbol
ValueCountFrequency (%)
44
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48169
67.8%
Latin 13638
 
19.2%
Common 9178
 
12.9%
Han 16
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2158
 
4.5%
2137
 
4.4%
1863
 
3.9%
1340
 
2.8%
977
 
2.0%
890
 
1.8%
872
 
1.8%
820
 
1.7%
670
 
1.4%
651
 
1.4%
Other values (960) 35791
74.3%
Latin
ValueCountFrequency (%)
e 825
 
6.0%
o 773
 
5.7%
a 629
 
4.6%
n 595
 
4.4%
i 555
 
4.1%
O 507
 
3.7%
r 472
 
3.5%
E 468
 
3.4%
A 455
 
3.3%
t 446
 
3.3%
Other values (44) 7913
58.0%
Common
ValueCountFrequency (%)
) 2698
29.4%
( 2697
29.4%
2674
29.1%
. 299
 
3.3%
1 117
 
1.3%
& 101
 
1.1%
- 100
 
1.1%
2 82
 
0.9%
, 62
 
0.7%
0 55
 
0.6%
Other values (19) 293
 
3.2%
Han
ValueCountFrequency (%)
3
18.8%
1
 
6.2%
1
 
6.2%
1
 
6.2%
貿 1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (4) 4
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48123
67.8%
ASCII 22810
32.1%
None 46
 
0.1%
CJK 16
 
< 0.1%
Number Forms 4
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 2698
 
11.8%
( 2697
 
11.8%
2674
 
11.7%
e 825
 
3.6%
o 773
 
3.4%
a 629
 
2.8%
n 595
 
2.6%
i 555
 
2.4%
O 507
 
2.2%
r 472
 
2.1%
Other values (70) 10385
45.5%
Hangul
ValueCountFrequency (%)
2158
 
4.5%
2137
 
4.4%
1863
 
3.9%
1340
 
2.8%
977
 
2.0%
890
 
1.8%
872
 
1.8%
820
 
1.7%
670
 
1.4%
651
 
1.4%
Other values (957) 35745
74.3%
None
ValueCountFrequency (%)
44
95.7%
2
 
4.3%
Number Forms
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK
ValueCountFrequency (%)
3
18.8%
1
 
6.2%
1
 
6.2%
1
 
6.2%
貿 1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (4) 4
25.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct9136
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-09 20:04:13
Maximum2024-04-16 09:10:34
2024-04-22T09:02:17.540743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:17.659875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7271 
U
2729 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowU
4th rowU
5th rowU

Common Values

ValueCountFrequency (%)
I 7271
72.7%
U 2729
 
27.3%

Length

2024-04-22T09:02:17.768748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:17.849058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7271
72.7%
u 2729
 
27.3%
Distinct1375
Distinct (%)13.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:07:00
2024-04-22T09:02:17.937706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T09:02:18.068065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct347
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-22T09:02:18.222270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length7.7496
Min length1

Characters and Unicode

Total characters77496
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique225 ?
Unique (%)2.2%

Sample

1st row의류/패션/잡화/뷰티
2nd row의류/패션/잡화/뷰티
3rd row기타
4th row의류/패션/잡화/뷰티
5th row종합몰 기타
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4342
35.7%
기타 2113
17.4%
1980
16.3%
종합몰 1224
 
10.1%
건강/식품 510
 
4.2%
레져/여행/공연 465
 
3.8%
교육/도서/완구/오락 461
 
3.8%
컴퓨터/사무용품 288
 
2.4%
가전 274
 
2.3%
가구/수납용품 257
 
2.1%
Other values (3) 252
 
2.1%
2024-04-22T09:02:18.494225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16564
21.4%
4342
 
5.6%
4342
 
5.6%
4342
 
5.6%
4342
 
5.6%
4342
 
5.6%
4342
 
5.6%
4342
 
5.6%
4342
 
5.6%
2166
 
2.8%
Other values (41) 24030
31.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56786
73.3%
Other Punctuation 16564
 
21.4%
Space Separator 2166
 
2.8%
Dash Punctuation 1980
 
2.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
2113
 
3.7%
2113
 
3.7%
Other values (38) 17824
31.4%
Other Punctuation
ValueCountFrequency (%)
/ 16564
100.0%
Space Separator
ValueCountFrequency (%)
2166
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1980
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56786
73.3%
Common 20710
 
26.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
2113
 
3.7%
2113
 
3.7%
Other values (38) 17824
31.4%
Common
ValueCountFrequency (%)
/ 16564
80.0%
2166
 
10.5%
- 1980
 
9.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56786
73.3%
ASCII 20710
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16564
80.0%
2166
 
10.5%
- 1980
 
9.6%
Hangul
ValueCountFrequency (%)
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
4342
 
7.6%
2113
 
3.7%
2113
 
3.7%
Other values (38) 17824
31.4%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct3026
Distinct (%)39.5%
Missing2335
Missing (%)23.4%
Infinite0
Infinite (%)0.0%
Mean199926.37
Minimum192644.9
Maximum210357.38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:02:18.609383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum192644.9
5-th percentile197624.29
Q1198757.67
median200143.05
Q3201070.64
95-th percentile201823.91
Maximum210357.38
Range17712.471
Interquartile range (IQR)2312.9697

Descriptive statistics

Standard deviation1388.6386
Coefficient of variation (CV)0.00694575
Kurtosis-0.4613755
Mean199926.37
Median Absolute Deviation (MAD)1027.8831
Skewness-0.33834613
Sum1.5324357 × 109
Variance1928317.2
MonotonicityNot monotonic
2024-04-22T09:02:18.730842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201823.908977364 116
 
1.2%
200750.455125653 96
 
1.0%
200664.582936542 73
 
0.7%
200002.952736046 72
 
0.7%
200613.510297669 65
 
0.7%
200573.641273539 64
 
0.6%
200015.499568016 62
 
0.6%
197831.002724157 61
 
0.6%
201236.031514508 60
 
0.6%
200703.625559248 59
 
0.6%
Other values (3016) 6937
69.4%
(Missing) 2335
 
23.4%
ValueCountFrequency (%)
192644.903847315 1
< 0.1%
196546.97612053 2
< 0.1%
196599.772120416 2
< 0.1%
196608.671568845 1
< 0.1%
196643.03972807 1
< 0.1%
196649.902016661 1
< 0.1%
196653.18104988 1
< 0.1%
196663.892558606 1
< 0.1%
196676.041712704 1
< 0.1%
196681.00970118 1
< 0.1%
ValueCountFrequency (%)
210357.375077208 1
 
< 0.1%
206273.793438841 1
 
< 0.1%
205496.541710755 1
 
< 0.1%
202287.637174612 1
 
< 0.1%
202275.274635685 1
 
< 0.1%
202246.956538881 1
 
< 0.1%
202222.143510841 6
0.1%
202215.563754519 1
 
< 0.1%
202200.924494567 1
 
< 0.1%
202192.066980455 1
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct3021
Distinct (%)39.4%
Missing2335
Missing (%)23.4%
Infinite0
Infinite (%)0.0%
Mean451216.19
Minimum444083.72
Maximum462171.52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T09:02:18.851448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum444083.72
5-th percentile450276.69
Q1450878.21
median451273.36
Q3451650.44
95-th percentile451878.23
Maximum462171.52
Range18087.807
Interquartile range (IQR)772.22668

Descriptive statistics

Standard deviation558.19759
Coefficient of variation (CV)0.0012370957
Kurtosis36.167692
Mean451216.19
Median Absolute Deviation (MAD)383.11348
Skewness0.70309978
Sum3.4585721 × 109
Variance311584.55
MonotonicityNot monotonic
2024-04-22T09:02:18.972490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
452076.818664092 116
 
1.2%
449638.824308081 96
 
1.0%
451781.8683954 73
 
0.7%
451823.532462914 72
 
0.7%
451817.515366883 65
 
0.7%
451621.036467544 64
 
0.6%
451485.349952584 62
 
0.6%
450794.31483169 61
 
0.6%
451872.409388295 60
 
0.6%
451836.458256618 59
 
0.6%
Other values (3011) 6937
69.4%
(Missing) 2335
 
23.4%
ValueCountFrequency (%)
444083.716311538 1
 
< 0.1%
445992.442199323 1
 
< 0.1%
447582.719757807 1
 
< 0.1%
449540.893733201 4
< 0.1%
449591.616910619 1
 
< 0.1%
449600.503188527 1
 
< 0.1%
449603.401016136 1
 
< 0.1%
449607.607625784 1
 
< 0.1%
449611.221496994 4
< 0.1%
449612.715102676 1
 
< 0.1%
ValueCountFrequency (%)
462171.5231343 1
 
< 0.1%
461050.787587037 1
 
< 0.1%
452413.412894 18
 
0.2%
452183.455758 1
 
< 0.1%
452130.812943 1
 
< 0.1%
452076.818664092 116
1.2%
452026.206096556 1
 
< 0.1%
452016.629661945 1
 
< 0.1%
452008.363110583 3
 
< 0.1%
452005.950500534 1
 
< 0.1%

자산규모
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
8849 
<NA>
1151 

Length

Max length4
Median length1
Mean length1.3453
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8849
88.5%
<NA> 1151
 
11.5%

Length

2024-04-22T09:02:19.103275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:19.221332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8849
88.5%
na 1151
 
11.5%

부채총액
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
8849 
<NA>
1151 

Length

Max length4
Median length1
Mean length1.3453
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8849
88.5%
<NA> 1151
 
11.5%

Length

2024-04-22T09:02:19.319568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:19.700927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8849
88.5%
na 1151
 
11.5%

자본금
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
8849 
<NA>
1151 

Length

Max length4
Median length1
Mean length1.3453
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8849
88.5%
<NA> 1151
 
11.5%

Length

2024-04-22T09:02:19.790321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T09:02:19.873187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8849
88.5%
na 1151
 
11.5%

판매방식명
Categorical

IMBALANCE 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
6312 
<NA>
3079 
인터넷, 기타
 
162
기타
 
99
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타
 
65
Other values (19)
 
283

Length

Max length26
Median length3
Mean length3.7944
Min length2

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row<NA>
4th row인터넷
5th row인터넷

Common Values

ValueCountFrequency (%)
인터넷 6312
63.1%
<NA> 3079
30.8%
인터넷, 기타 162
 
1.6%
기타 99
 
1.0%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 65
 
0.7%
인터넷, 카다로그 49
 
0.5%
TV홈쇼핑, 인터넷 49
 
0.5%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 34
 
0.3%
인터넷, 카다로그, 신문잡지, 기타 28
 
0.3%
인터넷, 카다로그, 기타 25
 
0.2%
Other values (14) 98
 
1.0%

Length

2024-04-22T09:02:19.964031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 6791
62.4%
na 3079
28.3%
기타 408
 
3.7%
카다로그 242
 
2.2%
tv홈쇼핑 211
 
1.9%
신문잡지 159
 
1.5%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
286293010000202130101653020154720210614<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 중구 신당동 *** 광희시장서울특별시 중구 마장로*길 **, 광희패션몰내 퀸즈스퀘어 *층 ***호 (신당동)04567썬데이무드(SUNDAY MOOD)2021-06-14 16:09:28I2021-06-16 00:22:54.0의류/패션/잡화/뷰티200982.709077451817.384039000인터넷
236463010000201930101653020160220190819<NA>1영업/정상1정상영업<NA><NA><NA><NA>.<NA><NA>서울특별시 중구 신당동 ***번지 제일평화시장서울특별시 중구 마장로 **, 제일평화시장 *층 **,**호 (신당동)04567덤앤더머(DUMB&DUMBER)2019-08-19 10:56:38I2019-08-21 02:22:20.0의류/패션/잡화/뷰티200938.528137451798.003086000인터넷
15154301000020133010130302005892013-07-30<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>070-4624-8182<NA><NA><NA>서울특별시 중구 청계천로 ***, ***동 ****호 (황학동, 롯데캐슬베네치아)100-440물반고기반2022-12-19 18:18:52U2022-12-04 22:06:00.0기타201823.908977452076.818664<NA><NA><NA><NA>
194273010000201730101303020031720170222<NA>3폐업3폐업처리20200915<NA><NA><NA><NA><NA><NA><NA>서울특별시 중구 장충단로 ***-** (장충동*가)04606고상하2020-09-15 09:50:59U2020-09-17 02:40:00.0의류/패션/잡화/뷰티200497.722674450969.792016000인터넷
195383010000201730101303020045220170317<NA>3폐업3폐업처리20220103<NA><NA><NA>02-319-8866<NA><NA><NA>서울특별시 중구 명동길 **, **층 (명동*가, 태흥빌딩)04538필스테이 명동점2022-01-03 17:37:13U2022-01-05 02:40:00.0종합몰 기타198643.162278451312.338669000인터넷
175123010000201530101303020119620150916<NA>3폐업3폐업처리20180427<NA><NA><NA>070-4286-3701<NA><NA>서울특별시 중구 신당동 ***번지 **호서울특별시 중구 청구로 ***, *층 (신당동)046135.52018-05-02 17:45:22I2018-08-31 23:59:59.0의류/패션/잡화/뷰티200865.344334451075.929095000TV홈쇼핑, 인터넷, 기타
37033010000200530101003020068420050520<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>02 757 2231<NA><NA>서울특별시 중구 태평로*가 **-**<NA><NA>세진무역2012-03-01 11:52:51I2021-12-03 00:22:43.0-<NA><NA>000<NA>
76313010000200730101003020794420071022<NA>1영업/정상1정상영업<NA><NA><NA><NA>2231-4039<NA><NA>서울특별시 중구 신당동 **번지 **호 우진빌딩B***서울특별시 중구 다산로**길 ** (신당동,우진빌딩B***)<NA>제로큐브(zerocube)2007-10-22 15:16:16I2018-08-31 23:59:59.0의류/패션/잡화/뷰티201430.56661451272.466477000인터넷
169593010000201530101303020049420150408<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-4112-1685<NA>100823서울특별시 중구 신당동 ***번지 **호서울특별시 중구 청구로**길 **-*, *층 (신당동)100823이에이치테크(EHTech)2015-04-13 19:09:50I2018-08-31 23:59:59.0컴퓨터/사무용품201128.726349451112.971559000인터넷
142213010000201230101003020108720120830<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-4606-7934<NA><NA><NA>서울특별시 중구 퇴계로 **-*, ***호 (남창동, 대윤빌딩)100060유나이티드코리아2014-03-18 14:22:41I2018-08-31 23:59:59.0의류/패션/잡화/뷰티198012.718463450745.12666000인터넷
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
12217301000020113010100302002322011-02-11<NA>5제외/삭제/전출5타시군구이관2024-03-05<NA><NA><NA>02-2268-4842<NA><NA>서울특별시 중구 필동*가 **번지 *호서울특별시 중구 서애로*길 **-**, *층 ***호 (필동*가, 수송빌딩)04623ADJUN (애드준)2024-03-05 16:22:48U2023-12-03 00:07:00.0기타199674.250968450736.389411<NA><NA><NA><NA>
96693010000200930101003020052820090414<NA>3폐업3폐업처리20201030<NA><NA><NA>2263-3353<NA><NA>서울특별시 중구 주교동 **번지 *호 방산종합시장A동*층***호서울특별시 중구 동호로**길 ** (주교동,방산종합시장A동*층***호)<NA>정우상사2020-10-30 15:05:28U2020-11-01 02:40:00.0건강/식품200002.952736451823.532463000인터넷
76733010000200730101003020799120071101<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>2238-3538<NA><NA>서울특별시 중구 신당*동 ***번지 *호 남산타운아파트*-****서울특별시 중구 다산로 **, *동 ****호 (신당동, 남산타운아파트)100754sucrere(슈크에르)2021-12-02 08:03:22U2021-12-04 02:40:00.0의류/패션/잡화/뷰티200984.713668449703.239756000인터넷
100733010000200930101003020103120090722<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA>756-9002<NA><NA>서울특별시 중구 회현동1가 138번지 1호서울특별시 중구 퇴계로12길 79 (회현동1가)<NA>일리피아2022-12-21 11:48:14U2021-11-01 22:03:00.0의류/패션/잡화/뷰티198342.901358450444.593743<NA><NA><NA><NA>
19453010000200330101003020083520030926<NA>1영업/정상1정상영업<NA><NA><NA><NA>2080-6984<NA><NA>서울특별시 중구 충정로*가 **번지<NA><NA>농협중앙회신용협동조합2018-03-20 12:36:45I2018-08-31 23:59:59.0종합몰<NA><NA>000인터넷
285163010000202130101653020142920210528<NA>3폐업3폐업처리20220114<NA><NA><NA><NA><NA><NA>서울특별시 중구 신당동 ***-* 대승빌딩서울특별시 중구 다산로**길 *, 대승빌딩 *층 (신당동)04585르뤼에르(re luere)2022-01-17 11:17:15U2022-01-20 02:40:00.0의류/패션/잡화/뷰티201382.115923451287.858645000인터넷
237113010000201930101653020167920190903<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-3445-3062<NA><NA>서울특별시 중구 흥인동 ***번지서울특별시 중구 퇴계로**길 **, *tree *층 ***호 (흥인동)04569베티르(VETIR)2019-09-03 17:55:35I2019-09-05 02:22:19.0의류/패션/잡화/뷰티201187.990064451558.419121000인터넷
21643010000200330101003020108320031223200901054취소/말소/만료/정지/중지4직권취소<NA><NA><NA><NA>02 2232 2940<NA><NA>서울특별시 중구 신당동 ***-**건주빌딩***호<NA><NA>(주)모아엔터라인2009-01-06 14:26:27I2021-12-03 00:22:43.0-<NA><NA>000<NA>
243533010000202030101653020007120200106<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 - 6956 - 4787<NA><NA>서울특별시 중구 저동*가 **번지 대신파이낸스센터(Daishin Finance Center)서울특별시 중구 삼일대로 ***, 대신파이낸스센터(Daishin Finance Center) *층 ***호 (저동*가)04538프로그레스 골프 코리아(Progress Golf Korea)2020-01-06 16:18:22I2020-01-08 00:23:39.0기타198786.599773451426.811563000인터넷
26493010000200430101003020054820040609200911304취소/말소/만료/정지/중지4직권취소<NA><NA><NA><NA>02 364 7743<NA><NA>서울특별시 중구 중림동 ***-*삼성상가*동***호<NA><NA>(주)비엠티에스2009-12-02 13:26:10I2021-12-03 00:22:43.0-<NA><NA>000<NA>