Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells76399
Missing cells (%)26.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical9
Numeric6
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author강북구
URLhttps://data.seoul.go.kr/dataList/OA-18810/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
자산규모 is highly imbalanced (63.9%)Imbalance
부채총액 is highly imbalanced (63.9%)Imbalance
자본금 is highly imbalanced (63.9%)Imbalance
판매방식명 is highly imbalanced (71.6%)Imbalance
인허가취소일자 has 9879 (98.8%) missing valuesMissing
폐업일자 has 7117 (71.2%) missing valuesMissing
휴업시작일자 has 9966 (99.7%) missing valuesMissing
휴업종료일자 has 9966 (99.7%) missing valuesMissing
재개업일자 has 9983 (99.8%) missing valuesMissing
전화번호 has 6904 (69.0%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 8267 (82.7%) missing valuesMissing
지번주소 has 558 (5.6%) missing valuesMissing
도로명주소 has 686 (6.9%) missing valuesMissing
도로명우편번호 has 1895 (18.9%) missing valuesMissing
좌표정보(X) has 589 (5.9%) missing valuesMissing
좌표정보(Y) has 589 (5.9%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = 30.55603684)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 19:48:53.922688
Analysis finished2024-04-29 19:48:55.663033
Duration1.74 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3080000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3080000
2nd row3080000
3rd row3080000
4th row3080000
5th row3080000

Common Values

ValueCountFrequency (%)
3080000 10000
100.0%

Length

2024-04-30T04:48:55.730788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:48:55.815244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3080000 10000
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0172107 × 1018
Minimum2.000308 × 1018
Maximum2.024308 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:48:55.904282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.000308 × 1018
5-th percentile2.007308 × 1018
Q12.013308 × 1018
median2.019308 × 1018
Q32.021308 × 1018
95-th percentile2.023308 × 1018
Maximum2.024308 × 1018
Range2.400001 × 1016
Interquartile range (IQR)8.000003 × 1015

Descriptive statistics

Standard deviation5.3412842 × 1015
Coefficient of variation (CV)0.0026478563
Kurtosis-0.63369341
Mean2.0172107 × 1018
Median Absolute Deviation (MAD)3.000003 × 1015
Skewness-0.69228333
Sum-8.6308647 × 1018
Variance2.8529317 × 1031
MonotonicityNot monotonic
2024-04-30T04:48:56.044076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2024308019030200110 1
 
< 0.1%
2016308013930200042 1
 
< 0.1%
2021308016930201464 1
 
< 0.1%
2020308016930200433 1
 
< 0.1%
2021308016930201011 1
 
< 0.1%
2017308013930200408 1
 
< 0.1%
2016308013930200419 1
 
< 0.1%
2022308016930201120 1
 
< 0.1%
2022308016930200236 1
 
< 0.1%
2020308016930200574 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2000308009230200007 1
< 0.1%
2001308009230200010 1
< 0.1%
2001308009230200014 1
< 0.1%
2001308009230200041 1
< 0.1%
2001308009230200046 1
< 0.1%
2002308009230200022 1
< 0.1%
2002308009230200045 1
< 0.1%
2002308009230200205 1
< 0.1%
2002308009230200206 1
< 0.1%
2002308009230200211 1
< 0.1%
ValueCountFrequency (%)
2024308019030200474 1
< 0.1%
2024308019030200473 1
< 0.1%
2024308019030200472 1
< 0.1%
2024308019030200470 1
< 0.1%
2024308019030200469 1
< 0.1%
2024308019030200467 1
< 0.1%
2024308019030200466 1
< 0.1%
2024308019030200464 1
< 0.1%
2024308019030200459 1
< 0.1%
2024308019030200458 1
< 0.1%
Distinct3594
Distinct (%)35.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1997-08-29 00:00:00
Maximum2024-04-25 00:00:00
2024-04-30T04:48:56.175010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:48:56.304145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Real number (ℝ)

MISSING 

Distinct16
Distinct (%)13.2%
Missing9879
Missing (%)98.8%
Infinite0
Infinite (%)0.0%
Mean20082811
Minimum20071119
Maximum20090129
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:48:56.419387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20071119
5-th percentile20071119
Q120080209
median20080709
Q320090129
95-th percentile20090129
Maximum20090129
Range19010
Interquartile range (IQR)9920

Descriptive statistics

Standard deviation5924.4659
Coefficient of variation (CV)0.00029500182
Kurtosis-0.64130881
Mean20082811
Median Absolute Deviation (MAD)502
Skewness-0.1601815
Sum2.4300202 × 109
Variance35099296
MonotonicityNot monotonic
2024-04-30T04:48:56.538794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
20090129 41
 
0.4%
20080209 26
 
0.3%
20080529 12
 
0.1%
20071119 9
 
0.1%
20080725 8
 
0.1%
20080709 7
 
0.1%
20080125 5
 
0.1%
20071206 3
 
< 0.1%
20080402 2
 
< 0.1%
20081227 2
 
< 0.1%
Other values (6) 6
 
0.1%
(Missing) 9879
98.8%
ValueCountFrequency (%)
20071119 9
 
0.1%
20071206 3
 
< 0.1%
20080125 5
 
0.1%
20080209 26
0.3%
20080402 2
 
< 0.1%
20080430 1
 
< 0.1%
20080527 1
 
< 0.1%
20080529 12
0.1%
20080709 7
 
0.1%
20080725 8
 
0.1%
ValueCountFrequency (%)
20090129 41
0.4%
20081227 2
 
< 0.1%
20081211 1
 
< 0.1%
20081031 1
 
< 0.1%
20080818 1
 
< 0.1%
20080728 1
 
< 0.1%
20080725 8
 
0.1%
20080709 7
 
0.1%
20080529 12
 
0.1%
20080527 1
 
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
6354 
3
2457 
4
738 
5
 
426
2
 
25

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row5
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 6354
63.5%
3 2457
 
24.6%
4 738
 
7.4%
5 426
 
4.3%
2 25
 
0.2%

Length

2024-04-30T04:48:56.665219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:48:56.758267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6354
63.5%
3 2457
 
24.6%
4 738
 
7.4%
5 426
 
4.3%
2 25
 
0.2%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
6354 
폐업
2457 
취소/말소/만료/정지/중지
738 
제외/삭제/전출
 
426
휴업
 
25

Length

Max length14
Median length5
Mean length5.0474
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row폐업
3rd row제외/삭제/전출
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 6354
63.5%
폐업 2457
 
24.6%
취소/말소/만료/정지/중지 738
 
7.4%
제외/삭제/전출 426
 
4.3%
휴업 25
 
0.2%

Length

2024-04-30T04:48:56.856357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:48:56.948137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 6354
63.5%
폐업 2457
 
24.6%
취소/말소/만료/정지/중지 738
 
7.4%
제외/삭제/전출 426
 
4.3%
휴업 25
 
0.2%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0718
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:48:57.027355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.6892395
Coefficient of variation (CV)0.81534873
Kurtosis2.0086212
Mean2.0718
Median Absolute Deviation (MAD)0
Skewness1.6534736
Sum20718
Variance2.8535301
MonotonicityNot monotonic
2024-04-30T04:48:57.252385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 6352
63.5%
3 2457
 
24.6%
7 617
 
6.2%
5 426
 
4.3%
4 121
 
1.2%
2 25
 
0.2%
6 2
 
< 0.1%
ValueCountFrequency (%)
1 6352
63.5%
2 25
 
0.2%
3 2457
 
24.6%
4 121
 
1.2%
5 426
 
4.3%
6 2
 
< 0.1%
7 617
 
6.2%
ValueCountFrequency (%)
7 617
 
6.2%
6 2
 
< 0.1%
5 426
 
4.3%
4 121
 
1.2%
3 2457
 
24.6%
2 25
 
0.2%
1 6352
63.5%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
6352 
폐업처리
2457 
직권말소
 
617
타시군구이관
 
426
직권취소
 
121
Other values (2)
 
27

Length

Max length6
Median length4
Mean length4.0856
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row폐업처리
3rd row타시군구이관
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 6352
63.5%
폐업처리 2457
 
24.6%
직권말소 617
 
6.2%
타시군구이관 426
 
4.3%
직권취소 121
 
1.2%
휴업처리 25
 
0.2%
타시군구전입 2
 
< 0.1%

Length

2024-04-30T04:48:57.370925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:48:57.473715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 6352
63.5%
폐업처리 2457
 
24.6%
직권말소 617
 
6.2%
타시군구이관 426
 
4.3%
직권취소 121
 
1.2%
휴업처리 25
 
0.2%
타시군구전입 2
 
< 0.1%

폐업일자
Date

MISSING 

Distinct1797
Distinct (%)62.3%
Missing7117
Missing (%)71.2%
Memory size156.2 KiB
Minimum2002-08-30 00:00:00
Maximum2024-04-25 00:00:00
2024-04-30T04:48:57.576165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:48:57.707223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct33
Distinct (%)97.1%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2007-11-27 00:00:00
Maximum2024-04-12 00:00:00
2024-04-30T04:48:57.817587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:48:57.917006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

휴업종료일자
Date

MISSING 

Distinct33
Distinct (%)97.1%
Missing9966
Missing (%)99.7%
Memory size156.2 KiB
Minimum2008-05-27 00:00:00
Maximum2030-01-01 00:00:00
2024-04-30T04:48:58.017600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:48:58.128419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

재개업일자
Date

MISSING 

Distinct15
Distinct (%)88.2%
Missing9983
Missing (%)99.8%
Memory size156.2 KiB
Minimum2007-11-19 00:00:00
Maximum2024-01-02 00:00:00
2024-04-30T04:48:58.223842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:48:58.317794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)

전화번호
Text

MISSING 

Distinct2963
Distinct (%)95.7%
Missing6904
Missing (%)69.0%
Memory size156.2 KiB
2024-04-30T04:48:58.501171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length10.552003
Min length1

Characters and Unicode

Total characters32669
Distinct characters16
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2877 ?
Unique (%)92.9%

Sample

1st row02-1688-0236
2nd row02-963-1217
3rd row02 6409 5168
4th row02-6369-9200
5th row070-8961-6031
ValueCountFrequency (%)
02 492
 
12.2%
33
 
0.8%
987 30
 
0.7%
945 26
 
0.6%
900 21
 
0.5%
980 20
 
0.5%
988 19
 
0.5%
986 18
 
0.4%
999 17
 
0.4%
993 17
 
0.4%
Other values (3038) 3352
82.9%
2024-04-30T04:48:58.840654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5433
16.6%
9 3768
11.5%
2 3725
11.4%
- 3432
10.5%
8 2747
8.4%
7 2505
7.7%
4 1986
 
6.1%
5 1961
 
6.0%
1 1944
 
6.0%
6 1892
 
5.8%
Other values (6) 3276
10.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 27744
84.9%
Dash Punctuation 3432
 
10.5%
Space Separator 1472
 
4.5%
Other Punctuation 14
 
< 0.1%
Math Symbol 6
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5433
19.6%
9 3768
13.6%
2 3725
13.4%
8 2747
9.9%
7 2505
9.0%
4 1986
 
7.2%
5 1961
 
7.1%
1 1944
 
7.0%
6 1892
 
6.8%
3 1783
 
6.4%
Other Punctuation
ValueCountFrequency (%)
, 8
57.1%
. 6
42.9%
Dash Punctuation
ValueCountFrequency (%)
- 3432
100.0%
Space Separator
ValueCountFrequency (%)
1472
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32669
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5433
16.6%
9 3768
11.5%
2 3725
11.4%
- 3432
10.5%
8 2747
8.4%
7 2505
7.7%
4 1986
 
6.1%
5 1961
 
6.0%
1 1944
 
6.0%
6 1892
 
5.8%
Other values (6) 3276
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32669
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5433
16.6%
9 3768
11.5%
2 3725
11.4%
- 3432
10.5%
8 2747
8.4%
7 2505
7.7%
4 1986
 
6.1%
5 1961
 
6.0%
1 1944
 
6.0%
6 1892
 
5.8%
Other values (6) 3276
10.0%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct124
Distinct (%)7.2%
Missing8267
Missing (%)82.7%
Infinite0
Infinite (%)0.0%
Mean142781.67
Minimum121200
Maximum604772
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:48:58.979690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum121200
5-th percentile142061
Q1142071
median142100
Q3142820
95-th percentile142885
Maximum604772
Range483572
Interquartile range (IQR)749

Descriptive statistics

Standard deviation13537.581
Coefficient of variation (CV)0.094813154
Kurtosis962.23573
Mean142781.67
Median Absolute Deviation (MAD)39
Skewness30.556037
Sum2.4744064 × 108
Variance1.8326609 × 108
MonotonicityNot monotonic
2024-04-30T04:48:59.105628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
142100 414
 
4.1%
142070 186
 
1.9%
142071 132
 
1.3%
142061 98
 
1.0%
142060 75
 
0.8%
142820 36
 
0.4%
142886 34
 
0.3%
142090 31
 
0.3%
142878 26
 
0.3%
142876 23
 
0.2%
Other values (114) 678
 
6.8%
(Missing) 8267
82.7%
ValueCountFrequency (%)
121200 1
< 0.1%
122200 1
< 0.1%
131120 1
< 0.1%
132030 1
< 0.1%
134010 1
< 0.1%
136020 1
< 0.1%
136110 1
< 0.1%
136111 1
< 0.1%
136140 1
< 0.1%
136825 1
< 0.1%
ValueCountFrequency (%)
604772 1
 
< 0.1%
462130 1
 
< 0.1%
150102 1
 
< 0.1%
142892 12
 
0.1%
142891 4
 
< 0.1%
142890 12
 
0.1%
142889 1
 
< 0.1%
142888 10
 
0.1%
142887 9
 
0.1%
142886 34
0.3%

지번주소
Text

MISSING 

Distinct3753
Distinct (%)39.7%
Missing558
Missing (%)5.6%
Memory size156.2 KiB
2024-04-30T04:48:59.355039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length46
Mean length26.468333
Min length7

Characters and Unicode

Total characters249914
Distinct characters496
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3028 ?
Unique (%)32.1%

Sample

1st row서울특별시 강북구 수유동 ***-** 예움하우스 B**호
2nd row서울특별시 강북구 미아동 ***번지 **호 신화빌딩*층
3rd row서울특별시 강북구 수유동 **번지 **호
4th row서울특별시 강북구 수유동 *** 수유현대아파트 ***동 ****호
5th row서울특별시 강북구 번동 *** 번동*단지주공아파트 ***동 ****호
ValueCountFrequency (%)
강북구 9422
18.2%
서울특별시 8951
17.3%
5841
11.3%
번지 4516
8.7%
4442
8.6%
미아동 3740
 
7.2%
수유동 3229
 
6.2%
번동 1497
 
2.9%
605
 
1.2%
536
 
1.0%
Other values (2527) 9096
17.5%
2024-04-30T04:48:59.740615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 52420
21.0%
42595
17.0%
10680
 
4.3%
9776
 
3.9%
9725
 
3.9%
9479
 
3.8%
9465
 
3.8%
9457
 
3.8%
9456
 
3.8%
8956
 
3.6%
Other values (486) 77905
31.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 146627
58.7%
Other Punctuation 52548
 
21.0%
Space Separator 42595
 
17.0%
Dash Punctuation 3998
 
1.6%
Decimal Number 2957
 
1.2%
Uppercase Letter 966
 
0.4%
Lowercase Letter 93
 
< 0.1%
Close Punctuation 60
 
< 0.1%
Open Punctuation 60
 
< 0.1%
Letter Number 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10680
 
7.3%
9776
 
6.7%
9725
 
6.6%
9479
 
6.5%
9465
 
6.5%
9457
 
6.4%
9456
 
6.4%
8956
 
6.1%
8951
 
6.1%
6631
 
4.5%
Other values (421) 54051
36.9%
Uppercase Letter
ValueCountFrequency (%)
S 243
25.2%
K 230
23.8%
B 163
16.9%
A 137
14.2%
D 30
 
3.1%
H 20
 
2.1%
R 18
 
1.9%
M 13
 
1.3%
I 13
 
1.3%
O 13
 
1.3%
Other values (13) 86
 
8.9%
Lowercase Letter
ValueCountFrequency (%)
s 16
17.2%
k 15
16.1%
b 14
15.1%
e 10
10.8%
a 8
8.6%
n 5
 
5.4%
h 4
 
4.3%
o 4
 
4.3%
m 4
 
4.3%
u 3
 
3.2%
Other values (8) 10
10.8%
Decimal Number
ValueCountFrequency (%)
1 619
20.9%
2 415
14.0%
3 368
12.4%
4 315
10.7%
0 315
10.7%
5 221
 
7.5%
6 200
 
6.8%
7 198
 
6.7%
8 174
 
5.9%
9 132
 
4.5%
Other Punctuation
ValueCountFrequency (%)
* 52420
99.8%
, 94
 
0.2%
@ 23
 
< 0.1%
/ 4
 
< 0.1%
. 4
 
< 0.1%
& 3
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 3
60.0%
+ 1
 
20.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
42595
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3998
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Letter Number
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 146622
58.7%
Common 102223
40.9%
Latin 1064
 
0.4%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10680
 
7.3%
9776
 
6.7%
9725
 
6.6%
9479
 
6.5%
9465
 
6.5%
9457
 
6.4%
9456
 
6.4%
8956
 
6.1%
8951
 
6.1%
6631
 
4.5%
Other values (416) 54046
36.9%
Latin
ValueCountFrequency (%)
S 243
22.8%
K 230
21.6%
B 163
15.3%
A 137
12.9%
D 30
 
2.8%
H 20
 
1.9%
R 18
 
1.7%
s 16
 
1.5%
k 15
 
1.4%
b 14
 
1.3%
Other values (32) 178
16.7%
Common
ValueCountFrequency (%)
* 52420
51.3%
42595
41.7%
- 3998
 
3.9%
1 619
 
0.6%
2 415
 
0.4%
3 368
 
0.4%
4 315
 
0.3%
0 315
 
0.3%
5 221
 
0.2%
6 200
 
0.2%
Other values (13) 757
 
0.7%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 146622
58.7%
ASCII 103281
41.3%
Number Forms 5
 
< 0.1%
CJK 5
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 52420
50.8%
42595
41.2%
- 3998
 
3.9%
1 619
 
0.6%
2 415
 
0.4%
3 368
 
0.4%
4 315
 
0.3%
0 315
 
0.3%
S 243
 
0.2%
K 230
 
0.2%
Other values (53) 1763
 
1.7%
Hangul
ValueCountFrequency (%)
10680
 
7.3%
9776
 
6.7%
9725
 
6.6%
9479
 
6.5%
9465
 
6.5%
9457
 
6.4%
9456
 
6.4%
8956
 
6.1%
8951
 
6.1%
6631
 
4.5%
Other values (416) 54046
36.9%
Number Forms
ValueCountFrequency (%)
5
100.0%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct4986
Distinct (%)53.5%
Missing686
Missing (%)6.9%
Memory size156.2 KiB
2024-04-30T04:48:59.950749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length51
Mean length35.287954
Min length16

Characters and Unicode

Total characters328672
Distinct characters510
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3863 ?
Unique (%)41.5%

Sample

1st row서울특별시 강북구 삼양로**길 **-*, 지층 B**호 (수유동, 예움하우스)
2nd row서울특별시 강북구 도봉로**길 *, *층 (미아동, 신화빌딩)
3rd row서울특별시 강북구 노해로 **, *층 (수유동, 유성빌딩)
4th row서울특별시 강북구 한천로 ****, ***동 ****호 (수유동, 수유현대아파트)
5th row서울특별시 강북구 도봉로**길 **, ***호 (미아동)
ValueCountFrequency (%)
서울특별시 9313
15.0%
강북구 9289
14.9%
8808
14.2%
4902
 
7.9%
미아동 3638
 
5.9%
수유동 3233
 
5.2%
2503
 
4.0%
1509
 
2.4%
번동 1457
 
2.3%
도봉로**길 929
 
1.5%
Other values (2959) 16606
26.7%
2024-04-30T04:49:00.292631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 59896
18.2%
52891
 
16.1%
11794
 
3.6%
, 11517
 
3.5%
9645
 
2.9%
9606
 
2.9%
9350
 
2.8%
9349
 
2.8%
9344
 
2.8%
9338
 
2.8%
Other values (500) 135942
41.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 178624
54.3%
Other Punctuation 71547
21.8%
Space Separator 52891
 
16.1%
Close Punctuation 9320
 
2.8%
Open Punctuation 9320
 
2.8%
Decimal Number 2764
 
0.8%
Dash Punctuation 2699
 
0.8%
Uppercase Letter 1378
 
0.4%
Lowercase Letter 113
 
< 0.1%
Math Symbol 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11794
 
6.6%
9645
 
5.4%
9606
 
5.4%
9350
 
5.2%
9349
 
5.2%
9344
 
5.2%
9338
 
5.2%
9319
 
5.2%
9313
 
5.2%
9310
 
5.2%
Other values (433) 82256
46.0%
Uppercase Letter
ValueCountFrequency (%)
B 373
27.1%
A 256
18.6%
S 250
18.1%
K 238
17.3%
R 45
 
3.3%
D 41
 
3.0%
C 28
 
2.0%
H 20
 
1.5%
E 20
 
1.5%
J 16
 
1.2%
Other values (13) 91
 
6.6%
Lowercase Letter
ValueCountFrequency (%)
b 21
18.6%
e 16
14.2%
a 13
11.5%
n 9
8.0%
m 7
 
6.2%
o 7
 
6.2%
t 6
 
5.3%
r 5
 
4.4%
s 5
 
4.4%
h 4
 
3.5%
Other values (9) 20
17.7%
Decimal Number
ValueCountFrequency (%)
1 694
25.1%
0 391
14.1%
2 377
13.6%
3 295
10.7%
4 240
 
8.7%
5 205
 
7.4%
7 166
 
6.0%
9 140
 
5.1%
6 135
 
4.9%
8 121
 
4.4%
Other Punctuation
ValueCountFrequency (%)
* 59896
83.7%
, 11517
 
16.1%
. 125
 
0.2%
@ 6
 
< 0.1%
& 3
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 8
72.7%
+ 2
 
18.2%
1
 
9.1%
Close Punctuation
ValueCountFrequency (%)
) 9319
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 9319
> 99.9%
[ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
52891
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2699
100.0%
Letter Number
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 178619
54.3%
Common 148552
45.2%
Latin 1496
 
0.5%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11794
 
6.6%
9645
 
5.4%
9606
 
5.4%
9350
 
5.2%
9349
 
5.2%
9344
 
5.2%
9338
 
5.2%
9319
 
5.2%
9313
 
5.2%
9310
 
5.2%
Other values (428) 82251
46.0%
Latin
ValueCountFrequency (%)
B 373
24.9%
A 256
17.1%
S 250
16.7%
K 238
15.9%
R 45
 
3.0%
D 41
 
2.7%
C 28
 
1.9%
b 21
 
1.4%
H 20
 
1.3%
E 20
 
1.3%
Other values (33) 204
13.6%
Common
ValueCountFrequency (%)
* 59896
40.3%
52891
35.6%
, 11517
 
7.8%
) 9319
 
6.3%
( 9319
 
6.3%
- 2699
 
1.8%
1 694
 
0.5%
0 391
 
0.3%
2 377
 
0.3%
3 295
 
0.2%
Other values (14) 1154
 
0.8%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 178619
54.3%
ASCII 150042
45.7%
Number Forms 5
 
< 0.1%
CJK 5
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 59896
39.9%
52891
35.3%
, 11517
 
7.7%
) 9319
 
6.2%
( 9319
 
6.2%
- 2699
 
1.8%
1 694
 
0.5%
0 391
 
0.3%
2 377
 
0.3%
B 373
 
0.2%
Other values (55) 2566
 
1.7%
Hangul
ValueCountFrequency (%)
11794
 
6.6%
9645
 
5.4%
9606
 
5.4%
9350
 
5.2%
9349
 
5.2%
9344
 
5.2%
9338
 
5.2%
9319
 
5.2%
9313
 
5.2%
9310
 
5.2%
Other values (428) 82251
46.0%
Number Forms
ValueCountFrequency (%)
5
100.0%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct327
Distinct (%)4.0%
Missing1895
Missing (%)18.9%
Memory size156.2 KiB
2024-04-30T04:49:00.603894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1584207
Min length5

Characters and Unicode

Total characters41809
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)0.4%

Sample

1st row01086
2nd row142100
3rd row01077
4th row01039
5th row142100
ValueCountFrequency (%)
142100 406
 
5.0%
142071 227
 
2.8%
01192 197
 
2.4%
01170 175
 
2.2%
01197 141
 
1.7%
01062 133
 
1.6%
142061 117
 
1.4%
01054 103
 
1.3%
01191 86
 
1.1%
01114 81
 
1.0%
Other values (317) 6439
79.4%
2024-04-30T04:49:01.020893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 13266
31.7%
0 12621
30.2%
2 3887
 
9.3%
4 2695
 
6.4%
7 2139
 
5.1%
6 1613
 
3.9%
9 1516
 
3.6%
5 1385
 
3.3%
3 1337
 
3.2%
8 1335
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 41794
> 99.9%
Dash Punctuation 15
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 13266
31.7%
0 12621
30.2%
2 3887
 
9.3%
4 2695
 
6.4%
7 2139
 
5.1%
6 1613
 
3.9%
9 1516
 
3.6%
5 1385
 
3.3%
3 1337
 
3.2%
8 1335
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 41809
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 13266
31.7%
0 12621
30.2%
2 3887
 
9.3%
4 2695
 
6.4%
7 2139
 
5.1%
6 1613
 
3.9%
9 1516
 
3.6%
5 1385
 
3.3%
3 1337
 
3.2%
8 1335
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41809
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 13266
31.7%
0 12621
30.2%
2 3887
 
9.3%
4 2695
 
6.4%
7 2139
 
5.1%
6 1613
 
3.9%
9 1516
 
3.6%
5 1385
 
3.3%
3 1337
 
3.2%
8 1335
 
3.2%
Distinct9718
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T04:49:01.341530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length37
Mean length6.6044
Min length1

Characters and Unicode

Total characters66044
Distinct characters1112
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9460 ?
Unique (%)94.6%

Sample

1st row아트제스페이스
2nd row신선이야기
3rd row(주) 보라웹
4th row제이에스 글로벌
5th row더힙컴퍼니
ValueCountFrequency (%)
주식회사 331
 
2.7%
컴퍼니 38
 
0.3%
company 25
 
0.2%
25
 
0.2%
스튜디오 22
 
0.2%
22
 
0.2%
korea 17
 
0.1%
the 15
 
0.1%
co 14
 
0.1%
코리아 13
 
0.1%
Other values (10840) 11897
95.8%
2024-04-30T04:49:01.801790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2419
 
3.7%
2398
 
3.6%
( 1875
 
2.8%
) 1875
 
2.8%
1869
 
2.8%
1088
 
1.6%
e 894
 
1.4%
882
 
1.3%
o 765
 
1.2%
716
 
1.1%
Other values (1102) 51263
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45286
68.6%
Lowercase Letter 7499
 
11.4%
Uppercase Letter 5851
 
8.9%
Space Separator 2419
 
3.7%
Open Punctuation 1879
 
2.8%
Close Punctuation 1879
 
2.8%
Other Punctuation 610
 
0.9%
Decimal Number 500
 
0.8%
Dash Punctuation 77
 
0.1%
Other Symbol 21
 
< 0.1%
Other values (4) 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2398
 
5.3%
1869
 
4.1%
1088
 
2.4%
882
 
1.9%
716
 
1.6%
657
 
1.5%
649
 
1.4%
638
 
1.4%
634
 
1.4%
593
 
1.3%
Other values (1016) 35162
77.6%
Lowercase Letter
ValueCountFrequency (%)
e 894
11.9%
o 765
 
10.2%
a 680
 
9.1%
i 562
 
7.5%
n 543
 
7.2%
r 483
 
6.4%
l 419
 
5.6%
t 387
 
5.2%
s 373
 
5.0%
m 284
 
3.8%
Other values (16) 2109
28.1%
Uppercase Letter
ValueCountFrequency (%)
O 461
 
7.9%
A 448
 
7.7%
E 430
 
7.3%
S 420
 
7.2%
N 363
 
6.2%
T 334
 
5.7%
M 302
 
5.2%
L 296
 
5.1%
I 291
 
5.0%
C 288
 
4.9%
Other values (16) 2218
37.9%
Other Punctuation
ValueCountFrequency (%)
? 303
49.7%
. 134
22.0%
& 105
 
17.2%
, 28
 
4.6%
' 19
 
3.1%
: 8
 
1.3%
! 5
 
0.8%
/ 3
 
0.5%
; 2
 
0.3%
# 2
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 101
20.2%
1 89
17.8%
0 64
12.8%
4 43
8.6%
5 43
8.6%
3 42
8.4%
9 40
 
8.0%
7 38
 
7.6%
8 23
 
4.6%
6 17
 
3.4%
Math Symbol
ValueCountFrequency (%)
< 2
40.0%
> 2
40.0%
+ 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 1875
99.8%
[ 4
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1875
99.8%
] 4
 
0.2%
Space Separator
ValueCountFrequency (%)
2419
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%
Other Symbol
ValueCountFrequency (%)
21
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45282
68.6%
Latin 13350
 
20.2%
Common 7387
 
11.2%
Han 25
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2398
 
5.3%
1869
 
4.1%
1088
 
2.4%
882
 
1.9%
716
 
1.6%
657
 
1.5%
649
 
1.4%
638
 
1.4%
634
 
1.4%
593
 
1.3%
Other values (997) 35158
77.6%
Latin
ValueCountFrequency (%)
e 894
 
6.7%
o 765
 
5.7%
a 680
 
5.1%
i 562
 
4.2%
n 543
 
4.1%
r 483
 
3.6%
O 461
 
3.5%
A 448
 
3.4%
E 430
 
3.2%
S 420
 
3.1%
Other values (42) 7664
57.4%
Common
ValueCountFrequency (%)
2419
32.7%
( 1875
25.4%
) 1875
25.4%
? 303
 
4.1%
. 134
 
1.8%
& 105
 
1.4%
2 101
 
1.4%
1 89
 
1.2%
- 77
 
1.0%
0 64
 
0.9%
Other values (23) 345
 
4.7%
Han
ValueCountFrequency (%)
3
 
12.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (10) 10
40.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45261
68.5%
ASCII 20735
31.4%
CJK 24
 
< 0.1%
None 21
 
< 0.1%
Punctuation 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2419
 
11.7%
( 1875
 
9.0%
) 1875
 
9.0%
e 894
 
4.3%
o 765
 
3.7%
a 680
 
3.3%
i 562
 
2.7%
n 543
 
2.6%
r 483
 
2.3%
O 461
 
2.2%
Other values (74) 10178
49.1%
Hangul
ValueCountFrequency (%)
2398
 
5.3%
1869
 
4.1%
1088
 
2.4%
882
 
1.9%
716
 
1.6%
657
 
1.5%
649
 
1.4%
638
 
1.4%
634
 
1.4%
593
 
1.3%
Other values (996) 35137
77.6%
None
ValueCountFrequency (%)
21
100.0%
CJK
ValueCountFrequency (%)
3
 
12.5%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (9) 9
37.5%
Punctuation
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct9951
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-02 15:07:36
Maximum2024-04-25 16:17:30
2024-04-30T04:49:01.920532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:02.034333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7510 
U
2490 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowU
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7510
75.1%
U 2490
 
24.9%

Length

2024-04-30T04:49:02.138158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:02.221889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7510
75.1%
u 2490
 
24.9%
Distinct1521
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:07:00
2024-04-30T04:49:02.321055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T04:49:02.446284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct436
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T04:49:02.598202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length9.2505
Min length1

Characters and Unicode

Total characters92505
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique269 ?
Unique (%)2.7%

Sample

1st row의류/패션/잡화/뷰티
2nd row의류/패션/잡화/뷰티 기타
3rd row기타
4th row종합몰
5th row의류/패션/잡화/뷰티
ValueCountFrequency (%)
의류/패션/잡화/뷰티 4629
34.3%
종합몰 2868
21.3%
기타 1853
13.7%
건강/식품 869
 
6.4%
교육/도서/완구/오락 682
 
5.1%
컴퓨터/사무용품 526
 
3.9%
가전 464
 
3.4%
461
 
3.4%
가구/수납용품 443
 
3.3%
레져/여행/공연 278
 
2.1%
Other values (3) 406
 
3.0%
2024-04-30T04:49:02.864763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 18649
20.2%
4629
 
5.0%
4629
 
5.0%
4629
 
5.0%
4629
 
5.0%
4629
 
5.0%
4629
 
5.0%
4629
 
5.0%
4629
 
5.0%
3479
 
3.8%
Other values (41) 33345
36.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69916
75.6%
Other Punctuation 18649
 
20.2%
Space Separator 3479
 
3.8%
Dash Punctuation 461
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
2868
 
4.1%
2868
 
4.1%
Other values (38) 27148
38.8%
Other Punctuation
ValueCountFrequency (%)
/ 18649
100.0%
Space Separator
ValueCountFrequency (%)
3479
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 461
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69916
75.6%
Common 22589
 
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
2868
 
4.1%
2868
 
4.1%
Other values (38) 27148
38.8%
Common
ValueCountFrequency (%)
/ 18649
82.6%
3479
 
15.4%
- 461
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69916
75.6%
ASCII 22589
 
24.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 18649
82.6%
3479
 
15.4%
- 461
 
2.0%
Hangul
ValueCountFrequency (%)
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
4629
 
6.6%
2868
 
4.1%
2868
 
4.1%
Other values (38) 27148
38.8%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct5183
Distinct (%)55.1%
Missing589
Missing (%)5.9%
Infinite0
Infinite (%)0.0%
Mean201957.08
Minimum175646.27
Maximum211779.01
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:02.983047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum175646.27
5-th percentile200890.92
Q1201398.83
median201952.47
Q3202432.85
95-th percentile203211.4
Maximum211779.01
Range36132.735
Interquartile range (IQR)1034.0255

Descriptive statistics

Standard deviation824.44753
Coefficient of variation (CV)0.0040822908
Kurtosis158.26389
Mean201957.08
Median Absolute Deviation (MAD)522.87686
Skewness-4.9126657
Sum1.9006181 × 109
Variance679713.74
MonotonicityNot monotonic
2024-04-30T04:49:03.120591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201045.611312321 234
 
2.3%
202228.431717714 122
 
1.2%
202155.401317068 112
 
1.1%
201606.445832012 104
 
1.0%
200886.333781015 96
 
1.0%
201598.511376716 86
 
0.9%
201530.926516903 84
 
0.8%
204089.673173372 68
 
0.7%
201425.248301789 63
 
0.6%
201731.684416461 63
 
0.6%
Other values (5173) 8379
83.8%
(Missing) 589
 
5.9%
ValueCountFrequency (%)
175646.274164252 1
 
< 0.1%
180929.0 1
 
< 0.1%
193166.430144679 1
 
< 0.1%
195051.53369717 1
 
< 0.1%
199973.0134989 1
 
< 0.1%
200199.670226051 1
 
< 0.1%
200261.348771565 1
 
< 0.1%
200265.862283018 3
< 0.1%
200300.602259809 2
< 0.1%
200305.645883789 1
 
< 0.1%
ValueCountFrequency (%)
211779.009484005 1
 
< 0.1%
206528.213989031 1
 
< 0.1%
206092.082758783 1
 
< 0.1%
205234.814028864 1
 
< 0.1%
204089.673173372 68
0.7%
204001.173587299 1
 
< 0.1%
203999.650902849 1
 
< 0.1%
203994.671015804 1
 
< 0.1%
203985.983038714 1
 
< 0.1%
203981.35 3
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct5183
Distinct (%)55.1%
Missing589
Missing (%)5.9%
Infinite0
Infinite (%)0.0%
Mean458770.27
Minimum436874.89
Maximum463373.45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T04:49:03.439818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum436874.89
5-th percentile456967.72
Q1457759.87
median458828.6
Q3459703.74
95-th percentile460563.05
Maximum463373.45
Range26498.564
Interquartile range (IQR)1943.8663

Descriptive statistics

Standard deviation1277.7595
Coefficient of variation (CV)0.0027851837
Kurtosis18.414191
Mean458770.27
Median Absolute Deviation (MAD)950.1139
Skewness-1.0608695
Sum4.317487 × 109
Variance1632669.3
MonotonicityNot monotonic
2024-04-30T04:49:03.552655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
457292.961457264 234
 
2.3%
458158.105668456 122
 
1.2%
459411.940883021 112
 
1.1%
457645.184347648 96
 
1.0%
460001.632075829 93
 
0.9%
457357.306972319 86
 
0.9%
456987.839409342 84
 
0.8%
458041.237609025 68
 
0.7%
457219.335594234 63
 
0.6%
457269.520249946 63
 
0.6%
Other values (5173) 8390
83.9%
(Missing) 589
 
5.9%
ValueCountFrequency (%)
436874.887994225 1
< 0.1%
438119.0 1
< 0.1%
444098.946985341 1
< 0.1%
448575.956518546 1
< 0.1%
448918.838738238 1
< 0.1%
450425.852790862 1
< 0.1%
453688.318721139 1
< 0.1%
454136.52802116 1
< 0.1%
454145.894169522 1
< 0.1%
455072.414926747 1
< 0.1%
ValueCountFrequency (%)
463373.452311656 1
< 0.1%
463346.007634503 1
< 0.1%
463240.16478942 1
< 0.1%
463237.199751839 1
< 0.1%
463148.411483786 1
< 0.1%
462373.950921999 1
< 0.1%
462336.911564584 1
< 0.1%
462325.511670607 1
< 0.1%
462315.103274932 1
< 0.1%
462314.795271739 1
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9314 
0
 
686

Length

Max length4
Median length4
Mean length3.7942
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9314
93.1%
0 686
 
6.9%

Length

2024-04-30T04:49:03.663182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:03.740341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9314
93.1%
0 686
 
6.9%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9314 
0
 
686

Length

Max length4
Median length4
Mean length3.7942
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9314
93.1%
0 686
 
6.9%

Length

2024-04-30T04:49:03.820125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:03.895278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9314
93.1%
0 686
 
6.9%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9314 
0
 
686

Length

Max length4
Median length4
Mean length3.7942
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9314
93.1%
0 686
 
6.9%

Length

2024-04-30T04:49:03.977505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T04:49:04.052420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9314
93.1%
0 686
 
6.9%

판매방식명
Categorical

IMBALANCE 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
5078 
<NA>
4684 
인터넷, 기타
 
84
기타
 
57
TV홈쇼핑, 인터넷
 
21
Other values (14)
 
76

Length

Max length26
Median length3
Mean length3.5882
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row인터넷

Common Values

ValueCountFrequency (%)
인터넷 5078
50.8%
<NA> 4684
46.8%
인터넷, 기타 84
 
0.8%
기타 57
 
0.6%
TV홈쇼핑, 인터넷 21
 
0.2%
인터넷, 카다로그 15
 
0.1%
TV홈쇼핑 11
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 9
 
0.1%
TV홈쇼핑, 인터넷, 기타 8
 
0.1%
인터넷, 카다로그, 기타 7
 
0.1%
Other values (9) 26
 
0.3%

Length

2024-04-30T04:49:04.134693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 5240
51.2%
na 4684
45.8%
기타 174
 
1.7%
tv홈쇼핑 60
 
0.6%
카다로그 51
 
0.5%
신문잡지 26
 
0.3%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
14383308000020243080190302001102024-01-19<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 수유동 ***-** 예움하우스 B**호서울특별시 강북구 삼양로**길 **-*, 지층 B**호 (수유동, 예움하우스)01086아트제스페이스2024-01-19 17:17:12I2023-11-30 22:01:00.0의류/패션/잡화/뷰티201405.648333459633.290712<NA><NA><NA><NA>
44293080000201430801393020051220141124<NA>3폐업3폐업처리20200813<NA><NA><NA><NA><NA><NA>서울특별시 강북구 미아동 ***번지 **호 신화빌딩*층서울특별시 강북구 도봉로**길 *, *층 (미아동, 신화빌딩)142100신선이야기2020-08-13 15:16:31I2021-12-03 22:02:00.0의류/패션/잡화/뷰티 기타202481.081333457573.044203<NA><NA><NA><NA>
853308000020073080092302019522007-07-16<NA>5제외/삭제/전출5타시군구이관2023-08-08<NA><NA><NA>02-1688-0236<NA><NA>서울특별시 강북구 수유동 **번지 **호서울특별시 강북구 노해로 **, *층 (수유동, 유성빌딩)01077(주) 보라웹2023-08-08 09:38:55U2022-12-07 23:00:00.0기타201893.32323459873.841564<NA><NA><NA><NA>
13470308000020233080190302006402023-05-19<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 수유동 *** 수유현대아파트 ***동 ****호서울특별시 강북구 한천로 ****, ***동 ****호 (수유동, 수유현대아파트)01039제이에스 글로벌2023-05-19 16:17:59I2022-12-04 22:01:00.0종합몰201567.676982460338.95785<NA><NA><NA><NA>
35283080000201330801393020004220130118<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-963-1217<NA><NA><NA>서울특별시 강북구 도봉로**길 **, ***호 (미아동)142100더힙컴퍼니2013-01-22 12:47:11I2018-08-31 23:59:59.0의류/패션/잡화/뷰티202128.317513457853.045989<NA><NA><NA>인터넷
13450308000020233080190302006202023-05-15<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 번동 *** 번동*단지주공아파트 ***동 ****호서울특별시 강북구 한천로***길 **, ***동 ****호 (번동, 번동*단지주공아파트)01229그린웨이브2023-05-18 15:24:05U2022-12-04 22:00:00.0종합몰203932.991518458247.431674<NA><NA><NA><NA>
4113080000200630800923020143420060410<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 6409 5168<NA>142070서울특별시 강북구 수유동 ***번지 **호 정원시티빌 ***호서울특별시 강북구 *.**로 **-*, ***호 (수유동, 정원시티빌)142070키즈베이2015-01-05 17:07:31I2018-08-31 23:59:59.0의류/패션/잡화/뷰티200987.456391460556.500266<NA><NA><NA>인터넷
38623080000201330801393020042920130916<NA>3폐업3폐업처리20170117<NA><NA><NA>02-6369-9200<NA><NA><NA>서울특별시 강북구 수유로**나길 *-* (수유동)142890훈브로트레이드2017-01-17 11:00:29I2018-08-31 23:59:59.0의류/패션/잡화/뷰티201588.97204459418.337027<NA><NA><NA>인터넷
14331308000020243080190302000582020-10-20<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 수유동 ***-** 극동아파트 **동 ***호서울특별시 강북구 인수봉로**길 **, 극동아파트 **동 *층 ***호 (수유동)01032에즈문2024-01-15 13:10:54U2023-11-30 23:07:00.0의류/패션/잡화/뷰티200996.748132459945.147477<NA><NA><NA><NA>
47973080000201530801393020036020150709<NA>5제외/삭제/전출5타시군구이관20150819<NA><NA><NA>070-8961-6031<NA>142100서울특별시 강북구 미아동 ***번지 미아동부센트레빌아파트 ***동 ****호서울특별시 강북구 숭인로*가길 **, ***동 ****호 (미아동, 미아동부센트레빌아파트)142100유그래픽스2015-08-19 15:34:32I2021-12-03 22:02:00.0컴퓨터/사무용품 기타202200.319246456816.004356<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
107243080000202130801693020083720210604<NA>4취소/말소/만료/정지/중지7직권말소<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 수유동 535-159서울특별시 강북구 인수봉로79가길 38 (수유동)01016에스피엠이씨2022-12-26 13:18:54U2021-11-01 22:08:00.0컴퓨터/사무용품 가구/수납용품200574.095883460177.506772<NA><NA><NA><NA>
119753080000202230801693020048920220422<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 수유동 ***-** 대경 센트레빌서울특별시 강북구 삼양로**길 **, ***호 (수유동, 대경 센트레빌)01086치유몰2022-04-29 14:12:25U2021-12-05 00:03:00.0종합몰201396.233729459619.002629<NA><NA><NA><NA>
14017308000020233080190302011882023-10-06<NA>5제외/삭제/전출5타시군구이관2024-01-24<NA><NA><NA><NA><NA><NA>서울특별시 강북구 미아동 ***-* 위브 테라스 파크 ***-A호서울특별시 강북구 삼양로**길 **, 위브 테라스 파크 *층 ***-A**호 (미아동)01197하나슬2024-01-24 17:47:19U2023-11-30 22:06:00.0종합몰201598.511377457357.306972<NA><NA><NA><NA>
101613080000202130801693020025420210209<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 수유동 ***-** 극동아파트서울특별시 강북구 인수봉로**길 *, ***동 ***호 (수유동, 극동아파트)01032행복공감2021-02-09 10:51:16I2021-12-03 22:02:00.0종합몰 교육/도서/완구/오락 가전 컴퓨터/사무용품 건강/식품 의류/패션/잡화/뷰티 자동차/자동차용품200996.748132459945.147477<NA><NA><NA><NA>
56573080000201630801393020069820161123<NA>3폐업3폐업처리20170613<NA><NA><NA><NA><NA><NA>서울특별시 강북구 미아동 ***번지 **호서울특별시 강북구 솔샘로**길 **-** (미아동)01205간지스타일2017-06-13 17:32:45I2018-08-31 23:59:59.0의류/패션/잡화/뷰티202422.351843457123.549454<NA><NA><NA>인터넷
46443080000201530801393020018020150325<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA>142100서울특별시 강북구 미아동 ***번지 **호서울특별시 강북구 솔매로**길 **, ***호 (미아동)142100애플마트2015-03-27 13:46:20I2018-08-31 23:59:59.0기타202146.713366458342.085286<NA><NA><NA>인터넷
14647308000020243080190302003852024-03-28<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 미아동 ***-** 명가시티빌 ***호서울특별시 강북구 도봉로**길 **-*, ***호 (미아동, 명가시티빌)01164냥희네가게2024-03-28 13:40:31I2023-12-02 21:00:00.0종합몰202616.273676457589.583223<NA><NA><NA><NA>
124173080000202230801693020093220220816<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 번동 ***-** 신진빌라 나동 ***호서울특별시 강북구 덕릉로**길 **-*, 나동 ***호 (번동, 신진빌라)01061모티박스2022-08-16 11:41:42I2021-12-07 23:08:00.0종합몰203032.322338459371.184516<NA><NA><NA><NA>
100873080000202130801693020017620210128<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강북구 미아동 ***-**서울특별시 강북구 도봉로 ***, *층 **호 (미아동)01165리아프로젝트2021-01-28 09:45:25I2021-12-03 22:02:00.0종합몰 가구/수납용품202562.189143457313.277082<NA><NA><NA><NA>
28973080000201130801393020044520110927<NA>3폐업3폐업처리20130202<NA><NA><NA><NA><NA>142888서울특별시 강북구 수유*동 ***번지 **호서울특별시 강북구 노해로 *** (수유동)<NA>라 프리마(La Prima)2013-02-02 14:33:40I2018-08-31 23:59:59.0의류/패션/잡화/뷰티202059.436434460373.882349<NA><NA><NA>인터넷