Overview

Dataset statistics

Number of variables26
Number of observations34
Missing cells309
Missing cells (%)35.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.6 KiB
Average record size in memory228.9 B

Variable types

Numeric6
DateTime3
Unsupported7
Categorical7
Text3

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),설비규격
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-16082/S/1/datasetView.do

Alerts

휴업시작일자 is highly imbalanced (80.9%)Imbalance
휴업종료일자 is highly imbalanced (80.9%)Imbalance
인허가취소일자 has 34 (100.0%) missing valuesMissing
폐업일자 has 20 (58.8%) missing valuesMissing
재개업일자 has 34 (100.0%) missing valuesMissing
전화번호 has 34 (100.0%) missing valuesMissing
소재지면적 has 34 (100.0%) missing valuesMissing
소재지우편번호 has 34 (100.0%) missing valuesMissing
지번주소 has 1 (2.9%) missing valuesMissing
도로명주소 has 16 (47.1%) missing valuesMissing
도로명우편번호 has 28 (82.4%) missing valuesMissing
업태구분명 has 34 (100.0%) missing valuesMissing
좌표정보(X) has 3 (8.8%) missing valuesMissing
좌표정보(Y) has 3 (8.8%) missing valuesMissing
설비규격 has 34 (100.0%) missing valuesMissing
관리번호 has unique valuesUnique
인허가일자 has unique valuesUnique
최종수정일자 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전화번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지우편번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
설비규격 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 07:12:09.422013
Analysis finished2024-05-11 07:12:09.711362
Duration0.29 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Real number (ℝ)

Distinct13
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3104705.9
Minimum3000000
Maximum3230000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-05-11T16:12:09.761783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000000
5-th percentile3000000
Q13022500
median3150000
Q33170000
95-th percentile3217000
Maximum3230000
Range230000
Interquartile range (IQR)147500

Descriptive statistics

Standard deviation79857.27
Coefficient of variation (CV)0.025721364
Kurtosis-1.6373282
Mean3104705.9
Median Absolute Deviation (MAD)60000
Skewness-0.14759674
Sum1.0556 × 108
Variance6.3771836 × 109
MonotonicityNot monotonic
2024-05-11T16:12:09.868848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
3170000 6
17.6%
3150000 6
17.6%
3000000 5
14.7%
3030000 4
11.8%
3010000 3
8.8%
3180000 2
 
5.9%
3230000 2
 
5.9%
3020000 1
 
2.9%
3210000 1
 
2.9%
3050000 1
 
2.9%
Other values (3) 3
8.8%
ValueCountFrequency (%)
3000000 5
14.7%
3010000 3
8.8%
3020000 1
 
2.9%
3030000 4
11.8%
3050000 1
 
2.9%
3090000 1
 
2.9%
3140000 1
 
2.9%
3150000 6
17.6%
3160000 1
 
2.9%
3170000 6
17.6%
ValueCountFrequency (%)
3230000 2
 
5.9%
3210000 1
 
2.9%
3180000 2
 
5.9%
3170000 6
17.6%
3160000 1
 
2.9%
3150000 6
17.6%
3140000 1
 
2.9%
3090000 1
 
2.9%
3050000 1
 
2.9%
3030000 4
11.8%

관리번호
Real number (ℝ)

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0092811 × 1018
Minimum1.989305 × 1018
Maximum2.022301 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-05-11T16:12:09.988929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.989305 × 1018
5-th percentile1.9986007 × 1018
Q12.007317 × 1018
median2.0093015 × 1018
Q32.0113175 × 1018
95-th percentile2.0177114 × 1018
Maximum2.022301 × 1018
Range3.2996008 × 1016
Interquartile range (IQR)4.0005078 × 1015

Descriptive statistics

Standard deviation5.9877558 × 1015
Coefficient of variation (CV)0.0029800489
Kurtosis3.5053663
Mean2.0092811 × 1018
Median Absolute Deviation (MAD)1.9935016 × 1015
Skewness-0.92518526
Sum-5.4714199 × 1018
Variance3.585322 × 1031
MonotonicityNot monotonic
2024-05-11T16:12:10.101590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
1999300012916500001 1
 
2.9%
2008315010016500001 1
 
2.9%
2011309013516500000 1
 
2.9%
2008314011416500036 1
 
2.9%
2016315015116500001 1
 
2.9%
2007315010016514511 1
 
2.9%
2007315010016514526 1
 
2.9%
2007315010016514559 1
 
2.9%
2009315012316500001 1
 
2.9%
2010303010316511112 1
 
2.9%
Other values (24) 24
70.6%
ValueCountFrequency (%)
1989305010016500029 1
2.9%
1997302012216500002 1
2.9%
1999300012916500001 1
2.9%
2005300012916500001 1
2.9%
2006317015016500001 1
2.9%
2007301010016500001 1
2.9%
2007315010016514511 1
2.9%
2007315010016514526 1
2.9%
2007315010016514559 1
2.9%
2007323013116500000 1
2.9%
ValueCountFrequency (%)
2022301017716500001 1
2.9%
2020301017716500001 1
2.9%
2016317015016500001 1
2.9%
2016315015116500001 1
2.9%
2014317015016500001 1
2.9%
2014300017016500001 1
2.9%
2013317015016500001 1
2.9%
2013300017016500001 1
2.9%
2011318019016500001 1
2.9%
2011316017416500001 1
2.9%

인허가일자
Date

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
Minimum1989-01-07 00:00:00
Maximum2022-12-19 00:00:00
2024-05-11T16:12:10.214906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:12:10.324184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B
Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
1
19 
3
14 
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row1
2nd row3
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 19
55.9%
3 14
41.2%
2 1
 
2.9%

Length

2024-05-11T16:12:10.430086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:10.517124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 19
55.9%
3 14
41.2%
2 1
 
2.9%

영업상태명
Categorical

Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
영업/정상
19 
폐업
14 
휴업
 
1

Length

Max length5
Median length5
Mean length3.6764706
Min length2

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row영업/정상
2nd row폐업
3rd row폐업
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 19
55.9%
폐업 14
41.2%
휴업 1
 
2.9%

Length

2024-05-11T16:12:10.614318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:10.710388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 19
55.9%
폐업 14
41.2%
휴업 1
 
2.9%
Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
1
19 
3
14 
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row1
2nd row3
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 19
55.9%
3 14
41.2%
2 1
 
2.9%

Length

2024-05-11T16:12:10.810182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:10.899021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 19
55.9%
3 14
41.2%
2 1
 
2.9%
Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
영업중
19 
폐업
14 
휴업
 
1

Length

Max length3
Median length3
Mean length2.5588235
Min length2

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row영업중
2nd row폐업
3rd row폐업
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 19
55.9%
폐업 14
41.2%
휴업 1
 
2.9%

Length

2024-05-11T16:12:10.999568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:11.095846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 19
55.9%
폐업 14
41.2%
휴업 1
 
2.9%

폐업일자
Real number (ℝ)

MISSING 

Distinct14
Distinct (%)100.0%
Missing20
Missing (%)58.8%
Infinite0
Infinite (%)0.0%
Mean20145641
Minimum20080731
Maximum20200819
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-05-11T16:12:11.183736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20080731
5-th percentile20093333
Q120113036
median20145873
Q320185539
95-th percentile20200625
Maximum20200819
Range120088
Interquartile range (IQR)72503

Descriptive statistics

Standard deviation40441.351
Coefficient of variation (CV)0.0020074492
Kurtosis-1.3126292
Mean20145641
Median Absolute Deviation (MAD)40099.5
Skewness0.0096827866
Sum2.8203898 × 108
Variance1.6355029 × 109
MonotonicityNot monotonic
2024-05-11T16:12:11.300104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
20100121 1
 
2.9%
20121212 1
 
2.9%
20190820 1
 
2.9%
20200819 1
 
2.9%
20151214 1
 
2.9%
20200520 1
 
2.9%
20080731 1
 
2.9%
20170627 1
 
2.9%
20151127 1
 
2.9%
20100119 1
 
2.9%
Other values (4) 4
 
11.8%
(Missing) 20
58.8%
ValueCountFrequency (%)
20080731 1
2.9%
20100119 1
2.9%
20100121 1
2.9%
20110311 1
2.9%
20121212 1
2.9%
20130228 1
2.9%
20140619 1
2.9%
20151127 1
2.9%
20151214 1
2.9%
20170627 1
2.9%
ValueCountFrequency (%)
20200819 1
2.9%
20200520 1
2.9%
20190820 1
2.9%
20190510 1
2.9%
20170627 1
2.9%
20151214 1
2.9%
20151127 1
2.9%
20140619 1
2.9%
20130228 1
2.9%
20121212 1
2.9%

휴업시작일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
<NA>
33 
20120101
 
1

Length

Max length8
Median length4
Mean length4.1176471
Min length4

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 33
97.1%
20120101 1
 
2.9%

Length

2024-05-11T16:12:11.423961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:11.518610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 33
97.1%
20120101 1
 
2.9%

휴업종료일자
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
<NA>
33 
20121231
 
1

Length

Max length8
Median length4
Mean length4.1176471
Min length4

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 33
97.1%
20121231 1
 
2.9%

Length

2024-05-11T16:12:11.613306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:11.707762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 33
97.1%
20121231 1
 
2.9%

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B

전화번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B

소재지우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B

지번주소
Text

MISSING 

Distinct32
Distinct (%)97.0%
Missing1
Missing (%)2.9%
Memory size404.0 B
2024-05-11T16:12:12.114817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length26
Mean length22.030303
Min length17

Characters and Unicode

Total characters727
Distinct characters106
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)93.9%

Sample

1st row서울특별시 종로구 계동 140-2 현대건설빌딩
2nd row서울특별시 성동구 도선동 58
3rd row서울특별시 금천구 독산동 999-10
4th row서울특별시 중구 장교동 1 한화빌딩
5th row서울특별시 종로구 청진동 70 그랑서울
ValueCountFrequency (%)
서울특별시 33
22.6%
금천구 6
 
4.1%
강서구 5
 
3.4%
종로구 5
 
3.4%
성동구 4
 
2.7%
가산동 3
 
2.1%
중구 3
 
2.1%
성수동2가 2
 
1.4%
풍납동 2
 
1.4%
영등포구 2
 
1.4%
Other values (74) 81
55.5%
2024-05-11T16:12:12.473293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
141
19.4%
41
 
5.6%
38
 
5.2%
35
 
4.8%
34
 
4.7%
34
 
4.7%
33
 
4.5%
33
 
4.5%
1 30
 
4.1%
- 22
 
3.0%
Other values (96) 286
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 424
58.3%
Space Separator 141
 
19.4%
Decimal Number 136
 
18.7%
Dash Punctuation 22
 
3.0%
Uppercase Letter 2
 
0.3%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
9.7%
38
 
9.0%
35
 
8.3%
34
 
8.0%
34
 
8.0%
33
 
7.8%
33
 
7.8%
10
 
2.4%
9
 
2.1%
7
 
1.7%
Other values (81) 150
35.4%
Decimal Number
ValueCountFrequency (%)
1 30
22.1%
3 20
14.7%
0 17
12.5%
6 14
10.3%
2 14
10.3%
9 10
 
7.4%
5 9
 
6.6%
8 9
 
6.6%
4 9
 
6.6%
7 4
 
2.9%
Space Separator
ValueCountFrequency (%)
141
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 424
58.3%
Common 301
41.4%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
9.7%
38
 
9.0%
35
 
8.3%
34
 
8.0%
34
 
8.0%
33
 
7.8%
33
 
7.8%
10
 
2.4%
9
 
2.1%
7
 
1.7%
Other values (81) 150
35.4%
Common
ValueCountFrequency (%)
141
46.8%
1 30
 
10.0%
- 22
 
7.3%
3 20
 
6.6%
0 17
 
5.6%
6 14
 
4.7%
2 14
 
4.7%
9 10
 
3.3%
5 9
 
3.0%
8 9
 
3.0%
Other values (4) 15
 
5.0%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 424
58.3%
ASCII 303
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
141
46.5%
1 30
 
9.9%
- 22
 
7.3%
3 20
 
6.6%
0 17
 
5.6%
6 14
 
4.6%
2 14
 
4.6%
9 10
 
3.3%
5 9
 
3.0%
8 9
 
3.0%
Other values (5) 17
 
5.6%
Hangul
ValueCountFrequency (%)
41
 
9.7%
38
 
9.0%
35
 
8.3%
34
 
8.0%
34
 
8.0%
33
 
7.8%
33
 
7.8%
10
 
2.4%
9
 
2.1%
7
 
1.7%
Other values (81) 150
35.4%

도로명주소
Text

MISSING 

Distinct18
Distinct (%)100.0%
Missing16
Missing (%)47.1%
Memory size404.0 B
2024-05-11T16:12:12.682513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length34
Mean length27.944444
Min length21

Characters and Unicode

Total characters503
Distinct characters100
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)100.0%

Sample

1st row서울특별시 종로구 율곡로 75 (계동)
2nd row서울특별시 금천구 범안로11길 45 (독산동)
3rd row서울특별시 중구 청계천로 86, 한화빌딩 (장교동)
4th row서울특별시 종로구 종로 33, 그랑서울 (청진동)
5th row서울특별시 금천구 서부샛길 648, 대륭테크노타운6차 104호 (가산동)
ValueCountFrequency (%)
서울특별시 18
 
17.6%
종로구 5
 
4.9%
금천구 4
 
3.9%
풍납동 2
 
2.0%
88 2
 
2.0%
올림픽로43길 2
 
2.0%
송파구 2
 
2.0%
b동 2
 
2.0%
율곡로 2
 
2.0%
75 2
 
2.0%
Other values (55) 61
59.8%
2024-05-11T16:12:12.996938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
 
17.5%
24
 
4.8%
23
 
4.6%
21
 
4.2%
19
 
3.8%
) 19
 
3.8%
( 19
 
3.8%
18
 
3.6%
18
 
3.6%
18
 
3.6%
Other values (90) 236
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 307
61.0%
Space Separator 88
 
17.5%
Decimal Number 61
 
12.1%
Close Punctuation 19
 
3.8%
Open Punctuation 19
 
3.8%
Other Punctuation 7
 
1.4%
Uppercase Letter 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
7.8%
23
 
7.5%
21
 
6.8%
19
 
6.2%
18
 
5.9%
18
 
5.9%
18
 
5.9%
18
 
5.9%
8
 
2.6%
7
 
2.3%
Other values (75) 133
43.3%
Decimal Number
ValueCountFrequency (%)
4 10
16.4%
1 9
14.8%
8 7
11.5%
5 7
11.5%
3 6
9.8%
6 6
9.8%
7 5
8.2%
2 4
 
6.6%
0 4
 
6.6%
9 3
 
4.9%
Space Separator
ValueCountFrequency (%)
88
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 307
61.0%
Common 194
38.6%
Latin 2
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
7.8%
23
 
7.5%
21
 
6.8%
19
 
6.2%
18
 
5.9%
18
 
5.9%
18
 
5.9%
18
 
5.9%
8
 
2.6%
7
 
2.3%
Other values (75) 133
43.3%
Common
ValueCountFrequency (%)
88
45.4%
) 19
 
9.8%
( 19
 
9.8%
4 10
 
5.2%
1 9
 
4.6%
8 7
 
3.6%
, 7
 
3.6%
5 7
 
3.6%
3 6
 
3.1%
6 6
 
3.1%
Other values (4) 16
 
8.2%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 307
61.0%
ASCII 196
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
88
44.9%
) 19
 
9.7%
( 19
 
9.7%
4 10
 
5.1%
1 9
 
4.6%
8 7
 
3.6%
, 7
 
3.6%
5 7
 
3.6%
3 6
 
3.1%
6 6
 
3.1%
Other values (5) 18
 
9.2%
Hangul
ValueCountFrequency (%)
24
 
7.8%
23
 
7.5%
21
 
6.8%
19
 
6.2%
18
 
5.9%
18
 
5.9%
18
 
5.9%
18
 
5.9%
8
 
2.6%
7
 
2.3%
Other values (75) 133
43.3%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct6
Distinct (%)100.0%
Missing28
Missing (%)82.4%
Infinite0
Infinite (%)0.0%
Mean4512.5
Minimum3142
Maximum8504
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-05-11T16:12:13.104668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3142
5-th percentile3146.25
Q13164.5
median3861
Q34546.25
95-th percentile7515
Maximum8504
Range5362
Interquartile range (IQR)1381.75

Descriptive statistics

Standard deviation2069.6507
Coefficient of variation (CV)0.45864835
Kurtosis3.8165397
Mean4512.5
Median Absolute Deviation (MAD)694.5
Skewness1.9010827
Sum27075
Variance4283453.9
MonotonicityNot monotonic
2024-05-11T16:12:13.205339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
4541 1
 
2.9%
3159 1
 
2.9%
8504 1
 
2.9%
3181 1
 
2.9%
4548 1
 
2.9%
3142 1
 
2.9%
(Missing) 28
82.4%
ValueCountFrequency (%)
3142 1
2.9%
3159 1
2.9%
3181 1
2.9%
4541 1
2.9%
4548 1
2.9%
8504 1
2.9%
ValueCountFrequency (%)
8504 1
2.9%
4548 1
2.9%
4541 1
2.9%
3181 1
2.9%
3159 1
2.9%
3142 1
2.9%
Distinct33
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-05-11T16:12:13.390612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length8.7352941
Min length4

Characters and Unicode

Total characters297
Distinct characters104
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)94.1%

Sample

1st row현대건설(주)
2nd row전풍기업
3rd row한국공조기술개발(주)
4th row(주)한화
5th row지에스건설(주)
ValueCountFrequency (%)
서울아산병원 2
 
5.0%
주식회사 2
 
5.0%
주)현대이엔지 1
 
2.5%
세일정기주식회사 1
 
2.5%
한국신용유통 1
 
2.5%
주)이랜드리테일홈에버목동점 1
 
2.5%
의)성삼의료재단 1
 
2.5%
미즈메디병원 1
 
2.5%
주)대한항공 1
 
2.5%
기내식사업소 1
 
2.5%
Other values (28) 28
70.0%
2024-05-11T16:12:13.694624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
7.4%
) 19
 
6.4%
( 19
 
6.4%
9
 
3.0%
8
 
2.7%
8
 
2.7%
7
 
2.4%
7
 
2.4%
7
 
2.4%
6
 
2.0%
Other values (94) 185
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 251
84.5%
Close Punctuation 19
 
6.4%
Open Punctuation 19
 
6.4%
Space Separator 6
 
2.0%
Other Symbol 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
8.8%
9
 
3.6%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (90) 167
66.5%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 253
85.2%
Common 44
 
14.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
8.7%
9
 
3.6%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (91) 169
66.8%
Common
ValueCountFrequency (%)
) 19
43.2%
( 19
43.2%
6
 
13.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 251
84.5%
ASCII 44
 
14.8%
None 2
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
8.8%
9
 
3.6%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
5
 
2.0%
5
 
2.0%
Other values (90) 167
66.5%
ASCII
ValueCountFrequency (%)
) 19
43.2%
( 19
43.2%
6
 
13.6%
None
ValueCountFrequency (%)
2
100.0%

최종수정일자
Date

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
Minimum2007-07-31 14:30:38
Maximum2024-04-17 15:07:18
2024-05-11T16:12:13.812896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:12:13.952334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
I
24 
U
10 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowI
3rd rowI
4th rowU
5th rowU

Common Values

ValueCountFrequency (%)
I 24
70.6%
U 10
29.4%

Length

2024-05-11T16:12:14.068749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:12:14.180141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 24
70.6%
u 10
29.4%
Distinct11
Distinct (%)32.4%
Missing0
Missing (%)0.0%
Memory size404.0 B
Minimum2018-08-31 23:59:59
Maximum2023-12-03 23:09:00
2024-05-11T16:12:14.261514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:12:14.346042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)

업태구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct29
Distinct (%)93.5%
Missing3
Missing (%)8.8%
Infinite0
Infinite (%)0.0%
Mean195631.11
Minimum182914.6
Maximum209607.59
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-05-11T16:12:14.455197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum182914.6
5-th percentile184534.58
Q1189917.49
median197542.32
Q3200942.13
95-th percentile207846.11
Maximum209607.59
Range26692.992
Interquartile range (IQR)11024.639

Descriptive statistics

Standard deviation7719.707
Coefficient of variation (CV)0.039460528
Kurtosis-1.0735122
Mean195631.11
Median Absolute Deviation (MAD)7273.8724
Skewness0.14237007
Sum6064564.5
Variance59593876
MonotonicityNot monotonic
2024-05-11T16:12:14.559313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
189917.493331104 2
 
5.9%
209607.590372037 2
 
5.9%
198286.311784052 1
 
2.9%
205557.941680639 1
 
2.9%
198176.159239663 1
 
2.9%
190050.499085001 1
 
2.9%
198744.433228989 1
 
2.9%
183644.168046816 1
 
2.9%
186237.636869491 1
 
2.9%
182914.598086861 1
 
2.9%
Other values (19) 19
55.9%
(Missing) 3
 
8.8%
ValueCountFrequency (%)
182914.598086861 1
2.9%
183644.168046816 1
2.9%
185424.984404321 1
2.9%
186237.636869491 1
2.9%
187256.952950432 1
2.9%
188729.190478822 1
2.9%
188968.189711073 1
2.9%
189917.493331104 2
5.9%
190031.474991519 1
2.9%
190050.499085001 1
2.9%
ValueCountFrequency (%)
209607.590372037 2
5.9%
206084.625088881 1
2.9%
205557.941680639 1
2.9%
204614.272744322 1
2.9%
204528.315643675 1
2.9%
202478.967119055 1
2.9%
202236.514618153 1
2.9%
199647.750199088 1
2.9%
199214.10239741 1
2.9%
198817.289034611 1
2.9%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct29
Distinct (%)93.5%
Missing3
Missing (%)8.8%
Infinite0
Infinite (%)0.0%
Mean448359.21
Minimum439603.12
Maximum461177.27
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-05-11T16:12:14.669638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum439603.12
5-th percentile440885.38
Q1445089.1
median449473.46
Q3451581.74
95-th percentile452738.23
Maximum461177.27
Range21574.151
Interquartile range (IQR)6492.6439

Descriptive statistics

Standard deviation4727.9404
Coefficient of variation (CV)0.010544983
Kurtosis0.47991744
Mean448359.21
Median Absolute Deviation (MAD)2352.8866
Skewness0.044809849
Sum13899135
Variance22353421
MonotonicityNot monotonic
2024-05-11T16:12:14.773309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
442031.656022304 2
 
5.9%
447120.577229325 2
 
5.9%
452091.444332546 1
 
2.9%
449473.46380541 1
 
2.9%
452574.67151361 1
 
2.9%
441065.917079449 1
 
2.9%
451677.065126229 1
 
2.9%
443559.728870219 1
 
2.9%
450556.127049871 1
 
2.9%
450966.559529824 1
 
2.9%
Other values (19) 19
55.9%
(Missing) 3
 
8.8%
ValueCountFrequency (%)
439603.120857528 1
2.9%
440704.844749009 1
2.9%
441065.917079449 1
2.9%
442031.656022304 2
5.9%
442119.780901928 1
2.9%
443559.728870219 1
2.9%
443757.994255551 1
2.9%
446420.19757582 1
2.9%
447120.577229325 2
5.9%
447262.223484619 1
2.9%
ValueCountFrequency (%)
461177.27144294 1
2.9%
452901.786745273 1
2.9%
452574.67151361 1
2.9%
452091.444332546 1
2.9%
452039.262262313 1
2.9%
451934.193821509 1
2.9%
451677.065126229 1
2.9%
451596.900364676 1
2.9%
451566.579232622 1
2.9%
451194.398505245 1
2.9%

설비규격
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)100.0%
Memory size438.0 B

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)설비규격
03000000199930001291650000119991229<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 계동 140-2 현대건설빌딩서울특별시 종로구 율곡로 75 (계동)<NA>현대건설(주)2021-04-13 16:38:41U2021-04-15 02:40:00.0<NA>198817.289035452901.786745<NA>
13030000201030301031651111120100121<NA>3폐업3폐업20100121<NA><NA><NA><NA><NA><NA>서울특별시 성동구 도선동 58<NA><NA>전풍기업2010-01-21 17:19:38I2018-08-31 23:59:59.0<NA><NA><NA><NA>
23170000200831701121652007120070205<NA>3폐업3폐업20121212<NA><NA><NA><NA><NA><NA>서울특별시 금천구 독산동 999-10서울특별시 금천구 범안로11길 45 (독산동)<NA>한국공조기술개발(주)2012-12-12 10:30:57I2018-08-31 23:59:59.0<NA>190268.447161440704.844749<NA>
3301000020223010177165000012022-12-19<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 중구 장교동 1 한화빌딩서울특별시 중구 청계천로 86, 한화빌딩 (장교동)4541(주)한화2023-04-25 09:00:04U2022-12-03 22:07:00.0<NA>198744.433229451677.065126<NA>
4300000020143000170165000012014-03-03<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 청진동 70 그랑서울서울특별시 종로구 종로 33, 그랑서울 (청진동)3159지에스건설(주)2024-04-17 15:07:18U2023-12-03 23:09:00.0<NA>198286.311784452091.444333<NA>
53170000201331701501650000120130612<NA>3폐업3폐업20190820<NA><NA><NA><NA><NA><NA>서울특별시 금천구 가산동 493-6 대륭테크노타운6차서울특별시 금천구 서부샛길 648, 대륭테크노타운6차 104호 (가산동)8504(주)센트라공조2019-08-21 09:34:29U2019-08-23 02:40:00.0<NA>188968.189711442119.780902<NA>
63170000201631701501650000120160901<NA>3폐업3폐업20200819<NA><NA><NA><NA><NA><NA>서울특별시 금천구 시흥동 113-15 5동 508호<NA><NA>(주)티원엔지니어링2020-08-19 20:07:12U2020-08-21 02:40:00.0<NA>190916.469867439603.120858<NA>
73180000200931801171650000220030217<NA>3폐업3폐업20151214<NA><NA><NA><NA><NA><NA>서울특별시 영등포구 양평동3가 56-6<NA><NA>건민산업(주)2015-12-14 17:34:41I2018-08-31 23:59:59.0<NA>190031.474992447262.223485<NA>
83000000200530001291650000120051118<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 평동 222 디타워(돈의문)서울특별시 종로구 통일로 134, 디타워(돈의문) (평동)3181디엘이엔씨 주식회사2021-02-01 19:26:09U2021-02-03 02:40:00.0<NA>196955.166566451596.900365<NA>
93000000200930001291650000120090402<NA>3폐업3폐업20200520<NA><NA><NA><NA><NA><NA>서울특별시 종로구 신문로1가 금호아시아나1관서울특별시 종로구 새문안로 75 (신문로1가)<NA>주식회사 대우건설2021-03-02 10:39:55U2021-03-04 02:40:00.0<NA>197542.319572452039.262262<NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)설비규격
243150000200731501001651452619891010<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 등촌동 641-1<NA><NA>세일정기주식회사2007-08-10 16:53:30I2018-08-31 23:59:59.0<NA>187256.95295450525.741428<NA>
253150000200731501001651455920070710<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 공항동 150<NA><NA>한국공항공사 서울지역본부2007-07-31 14:30:38I2018-08-31 23:59:59.0<NA>182914.598087450966.55953<NA>
263150000200831501001650000120080916<NA>3폐업3폐업20140619<NA><NA><NA><NA><NA><NA>서울특별시 강서구 등촌동 630-15<NA><NA>(주)다코스지2014-06-19 18:11:32I2018-08-31 23:59:59.0<NA><NA><NA><NA>
273150000200931501231650000120090408<NA>3폐업3폐업20110311<NA><NA><NA><NA><NA><NA>서울특별시 강서구 내발산동 119<NA><NA>(주)한국가스기술공사2011-03-11 16:53:31I2018-08-31 23:59:59.0<NA>186237.636869450556.12705<NA>
283160000201131601741650000120110331<NA>3폐업3폐업20190510<NA><NA><NA><NA><NA><NA>서울특별시 구로구 온수동 100-59<NA><NA>대가파우더시스템(주)2019-05-23 17:16:25U2019-05-25 02:40:00.0<NA>183644.168047443559.72887<NA>
293170000200631701501650000120060327<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 금천구 가산동 60-4 코오롱테크노밸리 B동 104호서울특별시 금천구 디지털로9길 56, B동 104호 (가산동, 코오롱테크노밸리)<NA>㈜현대이엔지2011-11-28 11:33:49I2018-08-31 23:59:59.0<NA>189917.493331442031.656022<NA>
303170000201031701121650500120051228<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 금천구 독산동 336-10서울특별시 금천구 가산로3길 129 (독산동)<NA>라텍스코리아시스템2011-11-28 11:20:38I2018-08-31 23:59:59.0<NA>190050.499085441065.917079<NA>
313170000201431701501650000120140214<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 금천구 가산동 60-4<NA><NA>(주)현대이엔지2015-12-08 17:54:08I2018-08-31 23:59:59.0<NA>189917.493331442031.656022<NA>
323230000200732301311650000120020419<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 송파구 풍납동 388-1서울특별시 송파구 올림픽로43길 88 (풍납동)<NA>서울아산병원2007-10-12 17:36:51I2018-08-31 23:59:59.0<NA>209607.590372447120.577229<NA>
333000000201330001701650000120130820<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 종로구 중학동 14 트윈 트리 빌딩 B동서울특별시 종로구 율곡로 6, 트윈 트리 빌딩 B동 (중학동)3142에스케이에코엔지니어링(주)2022-12-27 16:05:37U2021-11-01 22:09:00.0<NA>198176.15924452574.671514<NA>