Overview

Dataset statistics

Number of variables29
Number of observations10000
Missing cells65963
Missing cells (%)22.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 MiB
Average record size in memory252.0 B

Variable types

Categorical10
Numeric5
DateTime7
Text6
Unsupported1

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자산규모,부채총액,자본금,판매방식명
Author구로구
URLhttps://data.seoul.go.kr/dataList/OA-18818/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 is highly imbalanced (93.5%)Imbalance
자산규모 is highly imbalanced (65.5%)Imbalance
부채총액 is highly imbalanced (65.5%)Imbalance
자본금 is highly imbalanced (65.5%)Imbalance
판매방식명 is highly imbalanced (69.0%)Imbalance
폐업일자 has 6697 (67.0%) missing valuesMissing
휴업시작일자 has 9945 (99.5%) missing valuesMissing
휴업종료일자 has 9945 (99.5%) missing valuesMissing
재개업일자 has 9988 (99.9%) missing valuesMissing
전화번호 has 5270 (52.7%) missing valuesMissing
소재지면적 has 10000 (100.0%) missing valuesMissing
소재지우편번호 has 7865 (78.6%) missing valuesMissing
지번주소 has 1711 (17.1%) missing valuesMissing
도로명주소 has 778 (7.8%) missing valuesMissing
도로명우편번호 has 2228 (22.3%) missing valuesMissing
좌표정보(X) has 768 (7.7%) missing valuesMissing
좌표정보(Y) has 768 (7.7%) missing valuesMissing
관리번호 is highly skewed (γ1 = -87.12760568)Skewed
좌표정보(X) is highly skewed (γ1 = 34.74460914)Skewed
좌표정보(Y) is highly skewed (γ1 = -56.72691846)Skewed
관리번호 has unique valuesUnique
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-21 10:15:14.189607
Analysis finished2024-04-21 10:15:17.047374
Duration2.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3160000
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3160000
2nd row3160000
3rd row3160000
4th row3160000
5th row3160000

Common Values

ValueCountFrequency (%)
3160000 10000
100.0%

Length

2024-04-21T19:15:17.152736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:17.315064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3160000 10000
100.0%

관리번호
Real number (ℝ)

SKEWED  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0161191 × 1018
Minimum2.007316 × 1017
Maximum2.023316 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T19:15:17.497078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.007316 × 1017
5-th percentile2.006316 × 1018
Q12.012066 × 1018
median2.017316 × 1018
Q32.021316 × 1018
95-th percentile2.023316 × 1018
Maximum2.023316 × 1018
Range1.8225844 × 1018
Interquartile range (IQR)9.25 × 1015

Descriptive statistics

Standard deviation1.9010083 × 1016
Coefficient of variation (CV)0.0094290476
Kurtosis8319.3542
Mean2.0161191 × 1018
Median Absolute Deviation (MAD)4 × 1015
Skewness-87.127606
Sum-1.100706 × 1018
Variance3.6138324 × 1032
MonotonicityNot monotonic
2024-04-21T19:15:17.758508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2013316015930201169 1
 
< 0.1%
2018316015930200576 1
 
< 0.1%
2016316015930201130 1
 
< 0.1%
2022316015930202604 1
 
< 0.1%
2020316015930201769 1
 
< 0.1%
2018316015930200561 1
 
< 0.1%
2020316015930201400 1
 
< 0.1%
2020316015930201019 1
 
< 0.1%
2019316015930201125 1
 
< 0.1%
2018316015930200080 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
200731601173020378 1
< 0.1%
1997316011730200012 1
< 0.1%
1998316011730200003 1
< 0.1%
1998316011730200016 1
< 0.1%
1998316011730200018 1
< 0.1%
1998316011730200022 1
< 0.1%
1998316011730200024 1
< 0.1%
1998316011730200025 1
< 0.1%
1999316011730200032 1
< 0.1%
1999316011730200033 1
< 0.1%
ValueCountFrequency (%)
2023316015930202081 1
< 0.1%
2023316015930202077 1
< 0.1%
2023316015930202075 1
< 0.1%
2023316015930202073 1
< 0.1%
2023316015930202070 1
< 0.1%
2023316015930202069 1
< 0.1%
2023316015930202062 1
< 0.1%
2023316015930202058 1
< 0.1%
2023316015930202057 1
< 0.1%
2023316015930202056 1
< 0.1%
Distinct3701
Distinct (%)37.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum1997-04-02 00:00:00
Maximum2023-09-22 00:00:00
2024-04-21T19:15:18.012363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:18.408864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9836 
20091127
 
145
20091130
 
18
20211019
 
1

Length

Max length8
Median length4
Mean length4.0656
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9836
98.4%
20091127 145
 
1.5%
20091130 18
 
0.2%
20211019 1
 
< 0.1%

Length

2024-04-21T19:15:18.637098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:18.826930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9836
98.4%
20091127 145
 
1.5%
20091130 18
 
0.2%
20211019 1
 
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5563 
3
2592 
4
1101 
5
711 
2
 
33

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row3
5th row1

Common Values

ValueCountFrequency (%)
1 5563
55.6%
3 2592
25.9%
4 1101
 
11.0%
5 711
 
7.1%
2 33
 
0.3%

Length

2024-04-21T19:15:19.005563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:19.182317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5563
55.6%
3 2592
25.9%
4 1101
 
11.0%
5 711
 
7.1%
2 33
 
0.3%

영업상태명
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업/정상
5563 
폐업
2592 
취소/말소/만료/정지/중지
1101 
제외/삭제/전출
711 
휴업
 
33

Length

Max length14
Median length5
Mean length5.4167
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row영업/정상
4th row폐업
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 5563
55.6%
폐업 2592
25.9%
취소/말소/만료/정지/중지 1101
 
11.0%
제외/삭제/전출 711
 
7.1%
휴업 33
 
0.3%

Length

2024-04-21T19:15:19.383305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:19.572995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 5563
55.6%
폐업 2592
25.9%
취소/말소/만료/정지/중지 1101
 
11.0%
제외/삭제/전출 711
 
7.1%
휴업 33
 
0.3%

상세영업상태코드
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.4171
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T19:15:19.738666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.9196604
Coefficient of variation (CV)0.79419983
Kurtosis0.44481859
Mean2.4171
Median Absolute Deviation (MAD)0
Skewness1.2378992
Sum24171
Variance3.6850961
MonotonicityNot monotonic
2024-04-21T19:15:19.914979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 5562
55.6%
3 2592
25.9%
7 934
 
9.3%
5 711
 
7.1%
4 167
 
1.7%
2 33
 
0.3%
6 1
 
< 0.1%
ValueCountFrequency (%)
1 5562
55.6%
2 33
 
0.3%
3 2592
25.9%
4 167
 
1.7%
5 711
 
7.1%
6 1
 
< 0.1%
7 934
 
9.3%
ValueCountFrequency (%)
7 934
 
9.3%
6 1
 
< 0.1%
5 711
 
7.1%
4 167
 
1.7%
3 2592
25.9%
2 33
 
0.3%
1 5562
55.6%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정상영업
5562 
폐업처리
2592 
직권말소
934 
타시군구이관
711 
직권취소
 
167
Other values (2)
 
34

Length

Max length6
Median length4
Mean length4.1424
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row정상영업
2nd row정상영업
3rd row정상영업
4th row폐업처리
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 5562
55.6%
폐업처리 2592
25.9%
직권말소 934
 
9.3%
타시군구이관 711
 
7.1%
직권취소 167
 
1.7%
휴업처리 33
 
0.3%
타시군구전입 1
 
< 0.1%

Length

2024-04-21T19:15:20.138304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:20.345461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 5562
55.6%
폐업처리 2592
25.9%
직권말소 934
 
9.3%
타시군구이관 711
 
7.1%
직권취소 167
 
1.7%
휴업처리 33
 
0.3%
타시군구전입 1
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2039
Distinct (%)61.7%
Missing6697
Missing (%)67.0%
Memory size156.2 KiB
Minimum2003-02-24 00:00:00
Maximum2024-04-17 00:00:00
2024-04-21T19:15:20.563522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:20.797844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct55
Distinct (%)100.0%
Missing9945
Missing (%)99.5%
Memory size156.2 KiB
Minimum2008-05-03 00:00:00
Maximum2024-04-04 00:00:00
2024-04-21T19:15:21.043581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:21.307898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업종료일자
Date

MISSING 

Distinct49
Distinct (%)89.1%
Missing9945
Missing (%)99.5%
Memory size156.2 KiB
Minimum2008-08-31 00:00:00
Maximum2099-12-31 00:00:00
2024-04-21T19:15:21.562135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:21.969151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)

재개업일자
Date

MISSING 

Distinct12
Distinct (%)100.0%
Missing9988
Missing (%)99.9%
Memory size156.2 KiB
Minimum2008-09-04 00:00:00
Maximum2024-02-23 00:00:00
2024-04-21T19:15:22.322317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:22.677736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)

전화번호
Text

MISSING 

Distinct4576
Distinct (%)96.7%
Missing5270
Missing (%)52.7%
Memory size156.2 KiB
2024-04-21T19:15:23.602417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length11.378647
Min length1

Characters and Unicode

Total characters53821
Distinct characters18
Distinct categories7 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4499 ?
Unique (%)95.1%

Sample

1st row02-2688-4846
2nd row070-4651-5200
3rd row070-4768-1912
4th row02-1522-5320
5th row02-6499-0306
ValueCountFrequency (%)
02 1227
 
17.0%
070 33
 
0.5%
2108 30
 
0.4%
2060 23
 
0.3%
855 23
 
0.3%
851 23
 
0.3%
858 22
 
0.3%
6679 22
 
0.3%
2689 21
 
0.3%
2066 21
 
0.3%
Other values (4749) 5757
79.9%
2024-04-21T19:15:25.012289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8632
16.0%
2 7255
13.5%
- 5706
10.6%
6 4554
8.5%
8 4520
8.4%
7 4101
7.6%
1 3683
6.8%
3607
6.7%
5 3458
6.4%
3 3087
 
5.7%
Other values (8) 5218
9.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44474
82.6%
Dash Punctuation 5706
 
10.6%
Space Separator 3607
 
6.7%
Other Punctuation 16
 
< 0.1%
Math Symbol 14
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8632
19.4%
2 7255
16.3%
6 4554
10.2%
8 4520
10.2%
7 4101
9.2%
1 3683
8.3%
5 3458
7.8%
3 3087
 
6.9%
4 2738
 
6.2%
9 2446
 
5.5%
Other Punctuation
ValueCountFrequency (%)
. 13
81.2%
/ 2
 
12.5%
, 1
 
6.2%
Dash Punctuation
ValueCountFrequency (%)
- 5706
100.0%
Space Separator
ValueCountFrequency (%)
3607
100.0%
Math Symbol
ValueCountFrequency (%)
~ 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 53821
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8632
16.0%
2 7255
13.5%
- 5706
10.6%
6 4554
8.5%
8 4520
8.4%
7 4101
7.6%
1 3683
6.8%
3607
6.7%
5 3458
6.4%
3 3087
 
5.7%
Other values (8) 5218
9.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 53821
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8632
16.0%
2 7255
13.5%
- 5706
10.6%
6 4554
8.5%
8 4520
8.4%
7 4101
7.6%
1 3683
6.8%
3607
6.7%
5 3458
6.4%
3 3087
 
5.7%
Other values (8) 5218
9.7%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct240
Distinct (%)11.2%
Missing7865
Missing (%)78.6%
Infinite0
Infinite (%)0.0%
Mean156321.41
Minimum110540
Maximum660983
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T19:15:25.253549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum110540
5-th percentile152050
Q1152050
median152719
Q3152838
95-th percentile152892
Maximum660983
Range550443
Interquartile range (IQR)788

Descriptive statistics

Standard deviation35295.129
Coefficient of variation (CV)0.22578563
Kurtosis86.009611
Mean156321.41
Median Absolute Deviation (MAD)579
Skewness8.9744485
Sum3.3374622 × 108
Variance1.2457462 × 109
MonotonicityNot monotonic
2024-04-21T19:15:25.487116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
152050 647
 
6.5%
152090 76
 
0.8%
152100 75
 
0.8%
152080 71
 
0.7%
152848 43
 
0.4%
152070 40
 
0.4%
152790 31
 
0.3%
152719 29
 
0.3%
152880 27
 
0.3%
152842 24
 
0.2%
Other values (230) 1072
 
10.7%
(Missing) 7865
78.6%
ValueCountFrequency (%)
110540 1
 
< 0.1%
120110 2
< 0.1%
121070 1
 
< 0.1%
121190 1
 
< 0.1%
121210 3
< 0.1%
122090 1
 
< 0.1%
130842 1
 
< 0.1%
133100 1
 
< 0.1%
135120 2
< 0.1%
135513 1
 
< 0.1%
ValueCountFrequency (%)
660983 1
< 0.1%
614030 1
< 0.1%
602091 1
< 0.1%
530390 1
< 0.1%
446520 1
< 0.1%
441360 1
< 0.1%
440210 1
< 0.1%
437080 1
< 0.1%
426180 1
< 0.1%
423070 1
< 0.1%

지번주소
Text

MISSING 

Distinct3646
Distinct (%)44.0%
Missing1711
Missing (%)17.1%
Memory size156.2 KiB
2024-04-21T19:15:26.286046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length50
Mean length29.573049
Min length13

Characters and Unicode

Total characters245131
Distinct characters477
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2659 ?
Unique (%)32.1%

Sample

1st row서울특별시 구로구 고척동 *** 신원프라자
2nd row서울특별시 구로구 구로동 ***번지 ***호 ***호
3rd row서울특별시 구로구 가리봉동 ***-** 온누리
4th row서울특별시 구로구 구로동 ***-**
5th row서울특별시 구로구 구로동 ***번지 신도림현대아파트 ***동 ***호
ValueCountFrequency (%)
서울특별시 8252
17.4%
구로구 8200
17.3%
5117
10.8%
구로동 4547
9.6%
4400
9.3%
번지 4042
 
8.5%
개봉동 827
 
1.7%
오류동 661
 
1.4%
고척동 602
 
1.3%
신도림동 484
 
1.0%
Other values (2048) 10272
21.7%
2024-04-21T19:15:27.365967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 48859
19.9%
40612
16.6%
21831
 
8.9%
13509
 
5.5%
9337
 
3.8%
8372
 
3.4%
8330
 
3.4%
8303
 
3.4%
8256
 
3.4%
8252
 
3.4%
Other values (467) 69470
28.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 150023
61.2%
Other Punctuation 49033
 
20.0%
Space Separator 40612
 
16.6%
Dash Punctuation 3823
 
1.6%
Uppercase Letter 617
 
0.3%
Decimal Number 533
 
0.2%
Lowercase Letter 119
 
< 0.1%
Open Punctuation 117
 
< 0.1%
Close Punctuation 117
 
< 0.1%
Letter Number 99
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21831
14.6%
13509
 
9.0%
9337
 
6.2%
8372
 
5.6%
8330
 
5.6%
8303
 
5.5%
8256
 
5.5%
8252
 
5.5%
5559
 
3.7%
5244
 
3.5%
Other values (400) 53030
35.3%
Uppercase Letter
ValueCountFrequency (%)
B 94
15.2%
K 72
11.7%
S 62
10.0%
T 61
9.9%
I 57
9.2%
A 50
8.1%
C 30
 
4.9%
G 26
 
4.2%
E 24
 
3.9%
R 20
 
3.2%
Other values (15) 121
19.6%
Lowercase Letter
ValueCountFrequency (%)
e 44
37.0%
b 11
 
9.2%
a 10
 
8.4%
t 8
 
6.7%
i 8
 
6.7%
o 7
 
5.9%
w 5
 
4.2%
s 5
 
4.2%
r 4
 
3.4%
n 3
 
2.5%
Other values (7) 14
 
11.8%
Decimal Number
ValueCountFrequency (%)
1 120
22.5%
2 96
18.0%
5 48
 
9.0%
3 45
 
8.4%
6 43
 
8.1%
4 41
 
7.7%
7 39
 
7.3%
8 37
 
6.9%
0 35
 
6.6%
9 29
 
5.4%
Other Punctuation
ValueCountFrequency (%)
* 48859
99.6%
, 147
 
0.3%
. 15
 
< 0.1%
/ 6
 
< 0.1%
@ 3
 
< 0.1%
& 3
 
< 0.1%
Letter Number
ValueCountFrequency (%)
45
45.5%
45
45.5%
8
 
8.1%
1
 
1.0%
Space Separator
ValueCountFrequency (%)
40612
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3823
100.0%
Open Punctuation
ValueCountFrequency (%)
( 117
100.0%
Close Punctuation
ValueCountFrequency (%)
) 117
100.0%
Math Symbol
ValueCountFrequency (%)
~ 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 150023
61.2%
Common 94273
38.5%
Latin 835
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21831
14.6%
13509
 
9.0%
9337
 
6.2%
8372
 
5.6%
8330
 
5.6%
8303
 
5.5%
8256
 
5.5%
8252
 
5.5%
5559
 
3.7%
5244
 
3.5%
Other values (400) 53030
35.3%
Latin
ValueCountFrequency (%)
B 94
 
11.3%
K 72
 
8.6%
S 62
 
7.4%
T 61
 
7.3%
I 57
 
6.8%
A 50
 
6.0%
45
 
5.4%
45
 
5.4%
e 44
 
5.3%
C 30
 
3.6%
Other values (36) 275
32.9%
Common
ValueCountFrequency (%)
* 48859
51.8%
40612
43.1%
- 3823
 
4.1%
, 147
 
0.2%
1 120
 
0.1%
( 117
 
0.1%
) 117
 
0.1%
2 96
 
0.1%
5 48
 
0.1%
3 45
 
< 0.1%
Other values (11) 289
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 150023
61.2%
ASCII 95009
38.8%
Number Forms 99
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 48859
51.4%
40612
42.7%
- 3823
 
4.0%
, 147
 
0.2%
1 120
 
0.1%
( 117
 
0.1%
) 117
 
0.1%
2 96
 
0.1%
B 94
 
0.1%
K 72
 
0.1%
Other values (53) 952
 
1.0%
Hangul
ValueCountFrequency (%)
21831
14.6%
13509
 
9.0%
9337
 
6.2%
8372
 
5.6%
8330
 
5.6%
8303
 
5.5%
8256
 
5.5%
8252
 
5.5%
5559
 
3.7%
5244
 
3.5%
Other values (400) 53030
35.3%
Number Forms
ValueCountFrequency (%)
45
45.5%
45
45.5%
8
 
8.1%
1
 
1.0%

도로명주소
Text

MISSING 

Distinct5366
Distinct (%)58.2%
Missing778
Missing (%)7.8%
Memory size156.2 KiB
2024-04-21T19:15:28.231951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length59
Mean length40.171872
Min length21

Characters and Unicode

Total characters370465
Distinct characters498
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4076 ?
Unique (%)44.2%

Sample

1st row서울특별시 구로구 중앙로*길 **-**, *층 ***호 (고척동, 신원프라자)
2nd row서울특별시 구로구 구로동로**길 **-*, ***호 (구로동)
3rd row서울특별시 구로구 경인로 ***, ***동 ****호 (신도림동, 신도림에스케이뷰)
4th row서울특별시 구로구 디지털로**길 ***, ***호 (구로동, 제이앤케이디지털타워)
5th row서울특별시 구로구 남부순환로***길 ***, 온누리 ***호 (가리봉동)
ValueCountFrequency (%)
9298
14.1%
서울특별시 9199
13.9%
구로구 9149
13.8%
6928
 
10.5%
구로동 4625
 
7.0%
2171
 
3.3%
2040
 
3.1%
디지털로**길 1695
 
2.6%
개봉동 858
 
1.3%
오류동 662
 
1.0%
Other values (2799) 19449
29.4%
2024-04-21T19:15:29.365138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 70529
19.0%
56862
15.3%
25585
 
6.9%
25275
 
6.8%
, 13200
 
3.6%
12893
 
3.5%
9443
 
2.5%
9382
 
2.5%
( 9278
 
2.5%
) 9278
 
2.5%
Other values (488) 128740
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 206504
55.7%
Other Punctuation 83762
22.6%
Space Separator 56862
 
15.3%
Open Punctuation 9278
 
2.5%
Close Punctuation 9278
 
2.5%
Dash Punctuation 2303
 
0.6%
Uppercase Letter 1097
 
0.3%
Decimal Number 1069
 
0.3%
Lowercase Letter 127
 
< 0.1%
Letter Number 122
 
< 0.1%
Other values (2) 63
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25585
 
12.4%
25275
 
12.2%
12893
 
6.2%
9443
 
4.6%
9382
 
4.5%
9262
 
4.5%
9209
 
4.5%
9199
 
4.5%
7728
 
3.7%
5953
 
2.9%
Other values (417) 82575
40.0%
Uppercase Letter
ValueCountFrequency (%)
B 276
25.2%
A 142
12.9%
K 82
 
7.5%
S 75
 
6.8%
T 73
 
6.7%
I 61
 
5.6%
C 53
 
4.8%
P 44
 
4.0%
G 35
 
3.2%
E 34
 
3.1%
Other values (16) 222
20.2%
Lowercase Letter
ValueCountFrequency (%)
e 47
37.0%
b 13
 
10.2%
s 8
 
6.3%
i 7
 
5.5%
o 7
 
5.5%
a 7
 
5.5%
w 6
 
4.7%
z 5
 
3.9%
c 4
 
3.1%
n 4
 
3.1%
Other values (9) 19
15.0%
Decimal Number
ValueCountFrequency (%)
1 276
25.8%
0 169
15.8%
2 159
14.9%
3 110
 
10.3%
5 77
 
7.2%
8 65
 
6.1%
4 63
 
5.9%
7 57
 
5.3%
9 47
 
4.4%
6 46
 
4.3%
Other Punctuation
ValueCountFrequency (%)
* 70529
84.2%
, 13200
 
15.8%
. 22
 
< 0.1%
/ 8
 
< 0.1%
& 2
 
< 0.1%
# 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
58
47.5%
56
45.9%
7
 
5.7%
1
 
0.8%
Space Separator
ValueCountFrequency (%)
56862
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9278
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9278
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2303
100.0%
Math Symbol
ValueCountFrequency (%)
~ 61
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 206504
55.7%
Common 162615
43.9%
Latin 1346
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25585
 
12.4%
25275
 
12.2%
12893
 
6.2%
9443
 
4.6%
9382
 
4.5%
9262
 
4.5%
9209
 
4.5%
9199
 
4.5%
7728
 
3.7%
5953
 
2.9%
Other values (417) 82575
40.0%
Latin
ValueCountFrequency (%)
B 276
20.5%
A 142
 
10.5%
K 82
 
6.1%
S 75
 
5.6%
T 73
 
5.4%
I 61
 
4.5%
58
 
4.3%
56
 
4.2%
C 53
 
3.9%
e 47
 
3.5%
Other values (39) 423
31.4%
Common
ValueCountFrequency (%)
* 70529
43.4%
56862
35.0%
, 13200
 
8.1%
( 9278
 
5.7%
) 9278
 
5.7%
- 2303
 
1.4%
1 276
 
0.2%
0 169
 
0.1%
2 159
 
0.1%
3 110
 
0.1%
Other values (12) 451
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 206504
55.7%
ASCII 163839
44.2%
Number Forms 122
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 70529
43.0%
56862
34.7%
, 13200
 
8.1%
( 9278
 
5.7%
) 9278
 
5.7%
- 2303
 
1.4%
B 276
 
0.2%
1 276
 
0.2%
0 169
 
0.1%
2 159
 
0.1%
Other values (57) 1509
 
0.9%
Hangul
ValueCountFrequency (%)
25585
 
12.4%
25275
 
12.2%
12893
 
6.2%
9443
 
4.6%
9382
 
4.5%
9262
 
4.5%
9209
 
4.5%
9199
 
4.5%
7728
 
3.7%
5953
 
2.9%
Other values (417) 82575
40.0%
Number Forms
ValueCountFrequency (%)
58
47.5%
56
45.9%
7
 
5.7%
1
 
0.8%

도로명우편번호
Text

MISSING 

Distinct386
Distinct (%)5.0%
Missing2228
Missing (%)22.3%
Memory size156.2 KiB
2024-04-21T19:15:30.655500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1872105
Min length5

Characters and Unicode

Total characters40315
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)0.7%

Sample

1st row08227
2nd row08317
3rd row152749
4th row152848
5th row08386
ValueCountFrequency (%)
08390 337
 
4.3%
08378 237
 
3.0%
08377 189
 
2.4%
08381 187
 
2.4%
08393 160
 
2.1%
08375 152
 
2.0%
08389 140
 
1.8%
08217 139
 
1.8%
08376 119
 
1.5%
08271 116
 
1.5%
Other values (376) 5996
77.1%
2024-04-21T19:15:32.405821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 9078
22.5%
0 8092
20.1%
2 5597
13.9%
3 4736
11.7%
1 3100
 
7.7%
5 2836
 
7.0%
7 2716
 
6.7%
9 1829
 
4.5%
6 1217
 
3.0%
4 1090
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40291
99.9%
Dash Punctuation 24
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 9078
22.5%
0 8092
20.1%
2 5597
13.9%
3 4736
11.8%
1 3100
 
7.7%
5 2836
 
7.0%
7 2716
 
6.7%
9 1829
 
4.5%
6 1217
 
3.0%
4 1090
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40315
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 9078
22.5%
0 8092
20.1%
2 5597
13.9%
3 4736
11.7%
1 3100
 
7.7%
5 2836
 
7.0%
7 2716
 
6.7%
9 1829
 
4.5%
6 1217
 
3.0%
4 1090
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40315
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 9078
22.5%
0 8092
20.1%
2 5597
13.9%
3 4736
11.7%
1 3100
 
7.7%
5 2836
 
7.0%
7 2716
 
6.7%
9 1829
 
4.5%
6 1217
 
3.0%
4 1090
 
2.7%
Distinct9840
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T19:15:33.794504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length37
Mean length7.3271
Min length1

Characters and Unicode

Total characters73271
Distinct characters1085
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9691 ?
Unique (%)96.9%

Sample

1st row(주) 윈골프
2nd row아로인
3rd row신영테크툴
4th row(주) 메인테크오브에이네트웍스
5th row루미너스
ValueCountFrequency (%)
주식회사 1234
 
9.1%
307
 
2.3%
30
 
0.2%
co.,ltd 26
 
0.2%
co 25
 
0.2%
컴퍼니 24
 
0.2%
company 24
 
0.2%
korea 24
 
0.2%
22
 
0.2%
inc 21
 
0.2%
Other values (10919) 11775
87.1%
2024-04-21T19:15:35.850900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3516
 
4.8%
2764
 
3.8%
2596
 
3.5%
) 2457
 
3.4%
( 2455
 
3.4%
2114
 
2.9%
1644
 
2.2%
1348
 
1.8%
1321
 
1.8%
1140
 
1.6%
Other values (1075) 51916
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53216
72.6%
Lowercase Letter 5698
 
7.8%
Uppercase Letter 4954
 
6.8%
Space Separator 3516
 
4.8%
Close Punctuation 2460
 
3.4%
Open Punctuation 2458
 
3.4%
Other Punctuation 411
 
0.6%
Decimal Number 379
 
0.5%
Other Symbol 97
 
0.1%
Dash Punctuation 66
 
0.1%
Other values (4) 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2764
 
5.2%
2596
 
4.9%
2114
 
4.0%
1644
 
3.1%
1348
 
2.5%
1321
 
2.5%
1140
 
2.1%
986
 
1.9%
834
 
1.6%
832
 
1.6%
Other values (991) 37637
70.7%
Lowercase Letter
ValueCountFrequency (%)
e 687
12.1%
o 597
 
10.5%
a 489
 
8.6%
n 463
 
8.1%
i 394
 
6.9%
r 381
 
6.7%
t 356
 
6.2%
l 308
 
5.4%
s 288
 
5.1%
m 216
 
3.8%
Other values (16) 1519
26.7%
Uppercase Letter
ValueCountFrequency (%)
A 388
 
7.8%
E 344
 
6.9%
O 331
 
6.7%
S 315
 
6.4%
N 297
 
6.0%
T 290
 
5.9%
I 285
 
5.8%
L 281
 
5.7%
C 279
 
5.6%
M 259
 
5.2%
Other values (16) 1885
38.1%
Decimal Number
ValueCountFrequency (%)
2 75
19.8%
3 51
13.5%
1 48
12.7%
0 40
10.6%
4 36
9.5%
5 34
9.0%
9 27
 
7.1%
7 25
 
6.6%
8 22
 
5.8%
6 21
 
5.5%
Other Punctuation
ValueCountFrequency (%)
. 217
52.8%
& 88
21.4%
, 63
 
15.3%
' 21
 
5.1%
? 6
 
1.5%
: 5
 
1.2%
/ 5
 
1.2%
# 4
 
1.0%
2
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 2457
99.9%
] 3
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2455
99.9%
[ 3
 
0.1%
Modifier Symbol
ValueCountFrequency (%)
` 2
66.7%
˚ 1
33.3%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
= 1
50.0%
Space Separator
ValueCountFrequency (%)
3516
100.0%
Other Symbol
ValueCountFrequency (%)
97
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53271
72.7%
Latin 10653
 
14.5%
Common 9305
 
12.7%
Han 42
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2764
 
5.2%
2596
 
4.9%
2114
 
4.0%
1644
 
3.1%
1348
 
2.5%
1321
 
2.5%
1140
 
2.1%
986
 
1.9%
834
 
1.6%
832
 
1.6%
Other values (953) 37692
70.8%
Latin
ValueCountFrequency (%)
e 687
 
6.4%
o 597
 
5.6%
a 489
 
4.6%
n 463
 
4.3%
i 394
 
3.7%
A 388
 
3.6%
r 381
 
3.6%
t 356
 
3.3%
E 344
 
3.2%
O 331
 
3.1%
Other values (43) 6223
58.4%
Han
ValueCountFrequency (%)
3
 
7.1%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (29) 29
69.0%
Common
ValueCountFrequency (%)
3516
37.8%
) 2457
26.4%
( 2455
26.4%
. 217
 
2.3%
& 88
 
0.9%
2 75
 
0.8%
- 66
 
0.7%
, 63
 
0.7%
3 51
 
0.5%
1 48
 
0.5%
Other values (20) 269
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53174
72.6%
ASCII 19954
 
27.2%
None 99
 
0.1%
CJK 41
 
0.1%
CJK Compat Ideographs 1
 
< 0.1%
Number Forms 1
 
< 0.1%
Modifier Letters 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3516
17.6%
) 2457
 
12.3%
( 2455
 
12.3%
e 687
 
3.4%
o 597
 
3.0%
a 489
 
2.5%
n 463
 
2.3%
i 394
 
2.0%
A 388
 
1.9%
r 381
 
1.9%
Other values (70) 8127
40.7%
Hangul
ValueCountFrequency (%)
2764
 
5.2%
2596
 
4.9%
2114
 
4.0%
1644
 
3.1%
1348
 
2.5%
1321
 
2.5%
1140
 
2.1%
986
 
1.9%
834
 
1.6%
832
 
1.6%
Other values (952) 37595
70.7%
None
ValueCountFrequency (%)
97
98.0%
2
 
2.0%
CJK
ValueCountFrequency (%)
3
 
7.3%
2
 
4.9%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (28) 28
68.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Modifier Letters
ValueCountFrequency (%)
˚ 1
100.0%
Distinct9982
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-07-09 18:20:51
Maximum2024-04-17 13:10:07
2024-04-21T19:15:36.087993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:36.339434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7971 
U
2029 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7971
79.7%
U 2029
 
20.3%

Length

2024-04-21T19:15:36.557319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:36.718267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7971
79.7%
u 2029
 
20.3%
Distinct1483
Distinct (%)14.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-04 00:07:00
2024-04-21T19:15:36.903834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:15:37.147794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct638
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T19:15:37.688623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length80
Mean length9.21
Min length1

Characters and Unicode

Total characters92100
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique406 ?
Unique (%)4.1%

Sample

1st row레져/여행/공연
2nd row의류/패션/잡화/뷰티
3rd row기타
4th row의류/패션/잡화/뷰티
5th row종합몰 의류/패션/잡화/뷰티
ValueCountFrequency (%)
종합몰 3132
20.6%
의류/패션/잡화/뷰티 3123
20.5%
기타 2821
18.6%
건강/식품 1052
 
6.9%
교육/도서/완구/오락 920
 
6.1%
컴퓨터/사무용품 918
 
6.0%
가전 779
 
5.1%
754
 
5.0%
가구/수납용품 558
 
3.7%
레져/여행/공연 434
 
2.9%
Other values (3) 713
 
4.7%
2024-04-21T19:15:38.489798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 16054
 
17.4%
5204
 
5.7%
3241
 
3.5%
3132
 
3.4%
3132
 
3.4%
3132
 
3.4%
3123
 
3.4%
3123
 
3.4%
3123
 
3.4%
3123
 
3.4%
Other values (41) 45713
49.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 70088
76.1%
Other Punctuation 16054
 
17.4%
Space Separator 5204
 
5.7%
Dash Punctuation 754
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3241
 
4.6%
3132
 
4.5%
3132
 
4.5%
3132
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
Other values (38) 38713
55.2%
Other Punctuation
ValueCountFrequency (%)
/ 16054
100.0%
Space Separator
ValueCountFrequency (%)
5204
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 754
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 70088
76.1%
Common 22012
 
23.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3241
 
4.6%
3132
 
4.5%
3132
 
4.5%
3132
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
Other values (38) 38713
55.2%
Common
ValueCountFrequency (%)
/ 16054
72.9%
5204
 
23.6%
- 754
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 70088
76.1%
ASCII 22012
 
23.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 16054
72.9%
5204
 
23.6%
- 754
 
3.4%
Hangul
ValueCountFrequency (%)
3241
 
4.6%
3132
 
4.5%
3132
 
4.5%
3132
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
3123
 
4.5%
Other values (38) 38713
55.2%

좌표정보(X)
Real number (ℝ)

MISSING  SKEWED 

Distinct2829
Distinct (%)30.6%
Missing768
Missing (%)7.7%
Infinite0
Infinite (%)0.0%
Mean188984.52
Minimum170220.63
Maximum392606.38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T19:15:38.725310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum170220.63
5-th percentile185061.05
Q1187292.34
median189718.13
Q3190479.9
95-th percentile190951.33
Maximum392606.38
Range222385.74
Interquartile range (IQR)3187.5641

Descriptive statistics

Standard deviation3732.8924
Coefficient of variation (CV)0.019752372
Kurtosis1837.8409
Mean188984.52
Median Absolute Deviation (MAD)930.31666
Skewness34.744609
Sum1.744705 × 109
Variance13934485
MonotonicityNot monotonic
2024-04-21T19:15:38.972389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
188840.515865617 244
 
2.4%
190680.536850936 193
 
1.9%
190232.524534335 172
 
1.7%
191020.264430044 130
 
1.3%
190609.413674522 122
 
1.2%
190447.24866829 107
 
1.1%
190845.231594802 91
 
0.9%
190671.436719919 82
 
0.8%
190005.132500398 81
 
0.8%
190967.257864999 80
 
0.8%
Other values (2819) 7930
79.3%
(Missing) 768
 
7.7%
ValueCountFrequency (%)
170220.632809728 1
< 0.1%
174070.673905156 1
< 0.1%
176389.234645983 1
< 0.1%
179485.511830407 1
< 0.1%
179671.854476984 1
< 0.1%
180356.979271 1
< 0.1%
180480.72536639 1
< 0.1%
182367.999646453 1
< 0.1%
183524.022411706 1
< 0.1%
183685.662788976 1
< 0.1%
ValueCountFrequency (%)
392606.376131233 1
< 0.1%
387201.162928924 1
< 0.1%
264139.728394955 1
< 0.1%
213748.350236302 2
< 0.1%
210932.067547437 1
< 0.1%
210439.151506898 1
< 0.1%
209250.77343435 1
< 0.1%
208589.363343145 1
< 0.1%
206981.454072644 1
< 0.1%
206090.434292925 1
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING  SKEWED 

Distinct2826
Distinct (%)30.6%
Missing768
Missing (%)7.7%
Infinite0
Infinite (%)0.0%
Mean443450.72
Minimum185607.1
Maximum488061.03
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T19:15:39.214793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum185607.1
5-th percentile442154.99
Q1442598.14
median443530.15
Q3444306.77
95-th percentile445024.87
Maximum488061.03
Range302453.93
Interquartile range (IQR)1708.6263

Descriptive statistics

Standard deviation3967.597
Coefficient of variation (CV)0.0089470978
Kurtosis3644.5374
Mean443450.72
Median Absolute Deviation (MAD)867.72672
Skewness-56.726918
Sum4.0939371 × 109
Variance15741826
MonotonicityNot monotonic
2024-04-21T19:15:39.461761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
444306.771104527 244
 
2.4%
442392.645303533 193
 
1.9%
444978.682746138 172
 
1.7%
442460.919021593 130
 
1.3%
442782.030555155 122
 
1.2%
442558.310755613 107
 
1.1%
442639.420821243 91
 
0.9%
442219.137493629 82
 
0.8%
445250.538868558 81
 
0.8%
442305.216740774 80
 
0.8%
Other values (2816) 7930
79.3%
(Missing) 768
 
7.7%
ValueCountFrequency (%)
185607.095959414 1
< 0.1%
193352.89036403 1
< 0.1%
417038.654300737 1
< 0.1%
420986.113614344 1
< 0.1%
421224.243577552 1
< 0.1%
428430.919641025 1
< 0.1%
428612.04035631 1
< 0.1%
430562.873904469 1
< 0.1%
434589.992328292 1
< 0.1%
438010.980172246 2
< 0.1%
ValueCountFrequency (%)
488061.029180565 1
< 0.1%
464930.038022176 1
< 0.1%
462496.438646084 1
< 0.1%
461661.737947627 1
< 0.1%
460112.950775803 1
< 0.1%
458960.471391303 1
< 0.1%
458687.136639426 1
< 0.1%
458656.25777626 1
< 0.1%
457641.33756859 1
< 0.1%
453162.844280292 1
< 0.1%

자산규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9354 
0
 
646

Length

Max length4
Median length4
Mean length3.8062
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9354
93.5%
0 646
 
6.5%

Length

2024-04-21T19:15:39.699706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:39.867399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9354
93.5%
0 646
 
6.5%

부채총액
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9354 
0
 
646

Length

Max length4
Median length4
Mean length3.8062
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9354
93.5%
0 646
 
6.5%

Length

2024-04-21T19:15:40.045107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:40.211488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9354
93.5%
0 646
 
6.5%

자본금
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9354 
0
 
646

Length

Max length4
Median length4
Mean length3.8062
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9354
93.5%
0 646
 
6.5%

Length

2024-04-21T19:15:40.388983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:15:40.556492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9354
93.5%
0 646
 
6.5%

판매방식명
Categorical

IMBALANCE 

Distinct23
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인터넷
4858 
<NA>
4633 
인터넷, 기타
 
141
기타
 
133
TV홈쇼핑, 인터넷
 
36
Other values (18)
 
199

Length

Max length26
Median length22
Mean length3.7583
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row인터넷
2nd row인터넷
3rd row인터넷
4th row인터넷
5th row<NA>

Common Values

ValueCountFrequency (%)
인터넷 4858
48.6%
<NA> 4633
46.3%
인터넷, 기타 141
 
1.4%
기타 133
 
1.3%
TV홈쇼핑, 인터넷 36
 
0.4%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 36
 
0.4%
인터넷, 카다로그 36
 
0.4%
TV홈쇼핑 23
 
0.2%
인터넷, 카다로그, 신문잡지, 기타 15
 
0.1%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지 12
 
0.1%
Other values (13) 77
 
0.8%

Length

2024-04-21T19:15:40.741438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 5194
49.2%
na 4633
43.9%
기타 358
 
3.4%
카다로그 145
 
1.4%
tv홈쇼핑 133
 
1.3%
신문잡지 97
 
0.9%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
87753160000201331601593020116920131227<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-2688-4846<NA><NA>서울특별시 구로구 고척동 *** 신원프라자서울특별시 구로구 중앙로*길 **-**, *층 ***호 (고척동, 신원프라자)08227(주) 윈골프2021-04-30 16:59:33U2021-05-02 02:40:00.0레져/여행/공연187870.47084444127.308634<NA><NA><NA>인터넷
147143160000201831601593020085220180621<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 구로동 ***번지 ***호 ***호서울특별시 구로구 구로동로**길 **-*, ***호 (구로동)08317아로인2018-06-22 17:18:24I2018-08-31 23:59:59.0의류/패션/잡화/뷰티189404.391795442877.728512<NA><NA><NA>인터넷
86173160000201331601593020096520131028<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 경인로 ***, ***동 ****호 (신도림동, 신도림에스케이뷰)152749신영테크툴2013-10-29 09:23:09I2018-08-31 23:59:59.0기타189894.636867444982.548166<NA><NA><NA>인터넷
77403160000201231601593020116920121115<NA>3폐업3폐업처리20121121<NA><NA><NA>070-4651-5200<NA><NA><NA>서울특별시 구로구 디지털로**길 ***, ***호 (구로동, 제이앤케이디지털타워)152848(주) 메인테크오브에이네트웍스2012-11-21 14:22:49I2018-08-31 23:59:59.0의류/패션/잡화/뷰티190822.851419442253.132896<NA><NA><NA>인터넷
191053160000202031601593020196520200828<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 가리봉동 ***-** 온누리서울특별시 구로구 남부순환로***길 ***, 온누리 ***호 (가리봉동)08386루미너스2020-08-28 18:32:26I2021-12-03 22:02:00.0종합몰 의류/패션/잡화/뷰티189976.628554442128.997805<NA><NA><NA><NA>
21559316000020213160159302015412021-02-08<NA>3폐업3폐업처리2023-06-09<NA><NA><NA><NA><NA><NA>서울특별시 구로구 구로동 ***-**서울특별시 구로구 도림로*길 **-*, 삼주빌라 ***호 (구로동)08374씬하이2023-06-09 12:34:56U2022-12-05 23:01:00.0의류/패션/잡화/뷰티190036.966591442552.347315<NA><NA><NA><NA>
148263160000201831601593020097820180719<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 구로동 ***번지 신도림현대아파트 ***동 ***호서울특별시 구로구 새말로*길 **, ***동 *층 ***호 (구로동, 신도림현대아파트)08291제이티글로벌(JT.Global)2018-07-23 10:03:55I2021-12-03 22:02:00.0교육/도서/완구/오락 가전 컴퓨터/사무용품 의류/패션/잡화/뷰티189836.89383444739.3026<NA><NA><NA><NA>
149963160000201831601593020117720180904<NA>1영업/정상1정상영업<NA><NA><NA><NA>070-4768-1912<NA><NA>서울특별시 구로구 구로동 ****번지 *호 파트너스타워*차 ****호서울특별시 구로구 디지털로**가길 **, 파트너스타워*차 ****호 ***실 (구로동)08393골드웨이브2018-09-05 13:45:43U2018-09-05 23:59:59.0종합몰190967.257865442305.216741<NA><NA><NA>인터넷
25590316000020223160159302026902021-11-30<NA>3폐업3폐업처리2024-03-20<NA><NA><NA>02-1522-5320<NA><NA>서울특별시 구로구 구로동 ***-* 코오롱디지털타워빌란트Ⅱ서울특별시 구로구 디지털로**길 **, 코오롱디지털타워빌란트Ⅱ ****호 (구로동)08390주식회사 바로여기2024-03-20 20:20:24U2023-12-02 22:02:00.0종합몰190734.812272442305.23086<NA><NA><NA><NA>
129303160000201731601593020052020161214<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6499-0306<NA><NA><NA>서울특별시 구로구 구로동로**길 **, *층 (구로동)08317투비콘텐츠2017-04-11 14:23:16I2021-12-03 22:02:00.0종합몰 가전 자동차/자동차용품 상품권189466.312802442847.547034<NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자산규모부채총액자본금판매방식명
5593160000200431601173020144120040521200911274취소/말소/만료/정지/중지4직권취소<NA><NA><NA><NA>02 2107 8531<NA><NA>서울특별시 구로구 구로동 ***-** 벽산***<NA><NA>(주)유양모피2009-12-02 17:05:55I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
227713160000202131601593020278520211206<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 구로동 ***-** 한국현대아파트서울특별시 구로구 구일로*길 ***, *동 *층 **호 (구로동, 한국현대아파트)08323호제이2021-12-06 17:12:44I2021-12-08 00:22:43.0건강/식품 의류/패션/잡화/뷰티188909.995638443966.206319000인터넷
26162316000020233160159302005552023-02-28<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 고척동 **-** 서봉빌딩서울특별시 구로구 중앙로*길 **, 서봉빌딩 ***호 (고척동)08225㈜ 정보아이엔에스2023-02-28 11:20:18I2022-12-03 00:03:00.0기타187811.16784444233.836063<NA><NA><NA><NA>
55243160000201031601593020094820101015<NA>5제외/삭제/전출5타시군구이관20111122<NA><NA><NA>2107-8111<NA>152050서울특별시 구로구 구로동 ***번지 **호 벽산디지털밸리*차 ***호서울특별시 구로구 디지털로**길 **, ***호 (구로동,벽산디지털밸리*차)<NA>주식회사 엘티스코리아2011-12-04 14:22:12I2021-12-03 22:02:00.0컴퓨터/사무용품 기타190461.790473442368.090413<NA><NA><NA><NA>
198123160000202031601593020270420201126<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-3280-2407<NA><NA>서울특별시 구로구 구로동 ***-** 이스페이스서울특별시 구로구 디지털로**길 **, 이스페이스 *층 ***-*호 (구로동)08381엠티에스 게임즈 (MTS Games)2020-11-26 10:11:16I2021-12-03 22:02:00.0교육/도서/완구/오락 기타190423.216683442403.568114<NA><NA><NA><NA>
29843160000200831601173020012920080204<NA>3폐업3폐업처리20110303<NA><NA><NA><NA><NA>152050서울특별시 구로구 구로동 ***번지 *호 성호주상복합 ***호서울특별시 구로구 가마산로 *** (구로동,성호주상복합 ***호)<NA>베이비몽소2011-03-03 10:40:55I2018-08-31 23:59:59.0의류/패션/잡화/뷰티190324.45505443860.544984<NA><NA><NA>인터넷
3263160000200331601173020092820030623200911274취소/말소/만료/정지/중지4직권취소<NA><NA><NA><NA>02 2688 9437<NA><NA>서울특별시 구로구 고척동 **-** 서봉빌딩***<NA><NA>(주)가나에스엠2009-12-01 10:41:52I2021-12-03 22:02:00.0-<NA><NA><NA><NA><NA><NA>
122563160000201631601593020135720161027<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 오리로**길 **, ***호 (궁동, 현대그린빌라)08257혜성커뮤니케이션2016-10-27 15:58:18I2021-12-03 22:02:00.0교육/도서/완구/오락 레져/여행/공연 기타184784.585328443994.046222<NA><NA><NA><NA>
90323160000201431601593020031920120515<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 구로구 가마산로 ***, ***호 (구로동, 성원상떼뷰)152840케이씨 (K. C)2014-03-12 18:06:16I2021-12-03 22:02:00.0가전 가구/수납용품189925.846221443503.181948<NA><NA><NA><NA>
231293160000202231601593020021820190216<NA>3폐업3폐업처리20220913<NA><NA><NA>02-853-2779<NA><NA>서울특별시 구로구 구로동 *** 신도림현대아파트서울특별시 구로구 새말로*길 **, ***동 ****호 (구로동, 신도림현대아파트)08291위드더제이2022-09-13 08:40:33U2021-12-08 23:05:00.0종합몰189836.89383444739.3026<NA><NA><NA><NA>