Overview

Dataset statistics

Number of variables27
Number of observations480
Missing cells4068
Missing cells (%)31.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory106.5 KiB
Average record size in memory227.3 B

Variable types

Categorical10
Numeric4
DateTime7
Unsupported3
Text3

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),제조구분명,사업장부지용도구분명
Author강남구
URLhttps://data.seoul.go.kr/dataList/OA-19024/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
소재지면적 is highly imbalanced (95.9%)Imbalance
업태구분명 is highly imbalanced (77.5%)Imbalance
인허가취소일자 has 480 (100.0%) missing valuesMissing
폐업일자 has 356 (74.2%) missing valuesMissing
휴업시작일자 has 473 (98.5%) missing valuesMissing
휴업종료일자 has 473 (98.5%) missing valuesMissing
재개업일자 has 468 (97.5%) missing valuesMissing
전화번호 has 480 (100.0%) missing valuesMissing
소재지우편번호 has 480 (100.0%) missing valuesMissing
지번주소 has 9 (1.9%) missing valuesMissing
도로명주소 has 353 (73.5%) missing valuesMissing
도로명우편번호 has 404 (84.2%) missing valuesMissing
좌표정보(X) has 46 (9.6%) missing valuesMissing
좌표정보(Y) has 46 (9.6%) missing valuesMissing
관리번호 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전화번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지우편번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 07:05:16.933913
Analysis finished2024-05-11 07:05:17.690635
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
3220000
480 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3220000
2nd row3220000
3rd row3220000
4th row3220000
5th row3220000

Common Values

ValueCountFrequency (%)
3220000 480
100.0%

Length

2024-05-11T16:05:17.785644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:17.938368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3220000 480
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct480
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0017512 × 1018
Minimum1.900322 × 1018
Maximum2.024322 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-05-11T16:05:18.087220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.900322 × 1018
5-th percentile1.984322 × 1018
Q11.991322 × 1018
median2.002322 × 1018
Q32.010322 × 1018
95-th percentile2.021322 × 1018
Maximum2.024322 × 1018
Range1.24 × 1017
Interquartile range (IQR)1.9000007 × 1016

Descriptive statistics

Standard deviation1.28818 × 1016
Coefficient of variation (CV)0.0064352653
Kurtosis6.9723859
Mean2.0017512 × 1018
Median Absolute Deviation (MAD)1.1 × 1016
Skewness-1.0021906
Sum1.6098737 × 1018
Variance1.6594077 × 1032
MonotonicityStrictly increasing
2024-05-11T16:05:18.305608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1900322012702200114 1
 
0.2%
2002322008302210001 1
 
0.2%
2008322012702220081 1
 
0.2%
2008322012702210008 1
 
0.2%
2008322012702200415 1
 
0.2%
2008322012702200162 1
 
0.2%
2008322012702200161 1
 
0.2%
2008322012702200123 1
 
0.2%
2008322012702200013 1
 
0.2%
2008322012702200012 1
 
0.2%
Other values (470) 470
97.9%
ValueCountFrequency (%)
1900322012702200114 1
0.2%
1950322008302200172 1
0.2%
1979322008302200003 1
0.2%
1980322008302200011 1
0.2%
1980322008302200012 1
0.2%
1981322008302200014 1
0.2%
1981322008302200015 1
0.2%
1981322008302200016 1
0.2%
1983322008302200046 1
0.2%
1983322008302200053 1
0.2%
ValueCountFrequency (%)
2024322017602100001 1
0.2%
2023322017602200007 1
0.2%
2023322017602200005 1
0.2%
2023322017602200004 1
0.2%
2023322017602200003 1
0.2%
2023322017602200002 1
0.2%
2023322017602200001 1
0.2%
2023322017602100003 1
0.2%
2023322017602100002 1
0.2%
2023322017602100001 1
0.2%
Distinct388
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
Minimum1950-05-16 00:00:00
Maximum2024-02-28 00:00:00
2024-05-11T16:05:18.484437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:18.671847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing480
Missing (%)100.0%
Memory size4.3 KiB
Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
1
339 
3
133 
2
 
8

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 339
70.6%
3 133
 
27.7%
2 8
 
1.7%

Length

2024-05-11T16:05:19.123476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:19.225075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 339
70.6%
3 133
 
27.7%
2 8
 
1.7%

영업상태명
Categorical

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
영업/정상
339 
폐업
133 
휴업
 
8

Length

Max length5
Median length5
Mean length4.11875
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row폐업
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 339
70.6%
폐업 133
 
27.7%
휴업 8
 
1.7%

Length

2024-05-11T16:05:19.357325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:19.495444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 339
70.6%
폐업 133
 
27.7%
휴업 8
 
1.7%
Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
1
339 
3
133 
2
 
8

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 339
70.6%
3 133
 
27.7%
2 8
 
1.7%

Length

2024-05-11T16:05:19.681295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:19.841805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 339
70.6%
3 133
 
27.7%
2 8
 
1.7%
Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
영업중
339 
폐업
133 
휴업
 
8

Length

Max length3
Median length3
Mean length2.70625
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row폐업
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 339
70.6%
폐업 133
 
27.7%
휴업 8
 
1.7%

Length

2024-05-11T16:05:19.982087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:20.146022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 339
70.6%
폐업 133
 
27.7%
휴업 8
 
1.7%

폐업일자
Date

MISSING 

Distinct117
Distinct (%)94.4%
Missing356
Missing (%)74.2%
Memory size3.9 KiB
Minimum2007-10-12 00:00:00
Maximum2024-03-19 00:00:00
2024-05-11T16:05:20.298374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:20.463841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct7
Distinct (%)100.0%
Missing473
Missing (%)98.5%
Memory size3.9 KiB
Minimum2007-06-29 00:00:00
Maximum2022-10-31 00:00:00
2024-05-11T16:05:20.588819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:20.709651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

휴업종료일자
Date

MISSING 

Distinct7
Distinct (%)100.0%
Missing473
Missing (%)98.5%
Memory size3.9 KiB
Minimum2007-06-29 00:00:00
Maximum2024-06-19 00:00:00
2024-05-11T16:05:20.834950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:20.964007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

재개업일자
Date

MISSING 

Distinct10
Distinct (%)83.3%
Missing468
Missing (%)97.5%
Memory size3.9 KiB
Minimum2009-08-31 00:00:00
Maximum2023-12-21 00:00:00
2024-05-11T16:05:21.081911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:21.206817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)

전화번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing480
Missing (%)100.0%
Memory size4.3 KiB

소재지면적
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
<NA>
476 
1
 
2
0
 
1
10000
 
1

Length

Max length5
Median length4
Mean length3.9833333
Min length1

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 476
99.2%
1 2
 
0.4%
0 1
 
0.2%
10000 1
 
0.2%

Length

2024-05-11T16:05:21.374632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:21.527332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 476
99.2%
1 2
 
0.4%
0 1
 
0.2%
10000 1
 
0.2%

소재지우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing480
Missing (%)100.0%
Memory size4.3 KiB

지번주소
Text

MISSING 

Distinct424
Distinct (%)90.0%
Missing9
Missing (%)1.9%
Memory size3.9 KiB
2024-05-11T16:05:21.905455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length21.244161
Min length16

Characters and Unicode

Total characters10006
Distinct characters201
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique388 ?
Unique (%)82.4%

Sample

1st row서울특별시 강남구 삼성동 78-3
2nd row서울특별시 강남구 신사동 543
3rd row서울특별시 강남구 논현동 6
4th row서울특별시 강남구 논현동 18-3
5th row서울특별시 강남구 논현동 204-6
ValueCountFrequency (%)
서울특별시 470
23.2%
강남구 470
23.2%
역삼동 112
 
5.5%
논현동 87
 
4.3%
삼성동 66
 
3.3%
대치동 65
 
3.2%
신사동 47
 
2.3%
청담동 30
 
1.5%
개포동 23
 
1.1%
도곡동 22
 
1.1%
Other values (469) 635
31.3%
2024-05-11T16:05:22.523418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1988
19.9%
487
 
4.9%
480
 
4.8%
478
 
4.8%
477
 
4.8%
476
 
4.8%
475
 
4.7%
472
 
4.7%
470
 
4.7%
470
 
4.7%
Other values (191) 3733
37.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5746
57.4%
Space Separator 1988
 
19.9%
Decimal Number 1880
 
18.8%
Dash Punctuation 342
 
3.4%
Uppercase Letter 38
 
0.4%
Close Punctuation 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
487
 
8.5%
480
 
8.4%
478
 
8.3%
477
 
8.3%
476
 
8.3%
475
 
8.3%
472
 
8.2%
470
 
8.2%
470
 
8.2%
185
 
3.2%
Other values (165) 1276
22.2%
Uppercase Letter
ValueCountFrequency (%)
S 7
18.4%
E 6
15.8%
T 5
13.2%
O 4
10.5%
G 3
7.9%
W 3
7.9%
R 3
7.9%
C 2
 
5.3%
L 2
 
5.3%
A 2
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 367
19.5%
2 205
10.9%
7 194
10.3%
6 173
9.2%
9 172
9.1%
4 166
8.8%
8 155
8.2%
5 152
8.1%
0 152
8.1%
3 144
 
7.7%
Space Separator
ValueCountFrequency (%)
1988
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 342
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5746
57.4%
Common 4222
42.2%
Latin 38
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
487
 
8.5%
480
 
8.4%
478
 
8.3%
477
 
8.3%
476
 
8.3%
475
 
8.3%
472
 
8.2%
470
 
8.2%
470
 
8.2%
185
 
3.2%
Other values (165) 1276
22.2%
Common
ValueCountFrequency (%)
1988
47.1%
1 367
 
8.7%
- 342
 
8.1%
2 205
 
4.9%
7 194
 
4.6%
6 173
 
4.1%
9 172
 
4.1%
4 166
 
3.9%
8 155
 
3.7%
5 152
 
3.6%
Other values (5) 308
 
7.3%
Latin
ValueCountFrequency (%)
S 7
18.4%
E 6
15.8%
T 5
13.2%
O 4
10.5%
G 3
7.9%
W 3
7.9%
R 3
7.9%
C 2
 
5.3%
L 2
 
5.3%
A 2
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5746
57.4%
ASCII 4260
42.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1988
46.7%
1 367
 
8.6%
- 342
 
8.0%
2 205
 
4.8%
7 194
 
4.6%
6 173
 
4.1%
9 172
 
4.0%
4 166
 
3.9%
8 155
 
3.6%
5 152
 
3.6%
Other values (16) 346
 
8.1%
Hangul
ValueCountFrequency (%)
487
 
8.5%
480
 
8.4%
478
 
8.3%
477
 
8.3%
476
 
8.3%
475
 
8.3%
472
 
8.2%
470
 
8.2%
470
 
8.2%
185
 
3.2%
Other values (165) 1276
22.2%

도로명주소
Text

MISSING 

Distinct105
Distinct (%)82.7%
Missing353
Missing (%)73.5%
Memory size3.9 KiB
2024-05-11T16:05:22.889667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length38
Mean length28.346457
Min length22

Characters and Unicode

Total characters3600
Distinct characters170
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)71.7%

Sample

1st row서울특별시 강남구 도산대로 168 (논현동)
2nd row서울특별시 강남구 도산대로 524, 청담빌딩 (청담동)
3rd row서울특별시 강남구 강남대로 584 (논현동)
4th row서울특별시 강남구 논현로 531 (역삼동)
5th row서울특별시 강남구 테헤란로92길 27 (대치동)
ValueCountFrequency (%)
서울특별시 127
17.5%
강남구 127
17.5%
테헤란로 33
 
4.5%
역삼동 28
 
3.9%
삼성동 27
 
3.7%
대치동 22
 
3.0%
논현동 17
 
2.3%
영동대로 16
 
2.2%
513 9
 
1.2%
언주로 9
 
1.2%
Other values (181) 312
42.9%
2024-05-11T16:05:23.508536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
600
 
16.7%
153
 
4.2%
137
 
3.8%
134
 
3.7%
134
 
3.7%
131
 
3.6%
129
 
3.6%
) 129
 
3.6%
129
 
3.6%
( 129
 
3.6%
Other values (160) 1795
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2265
62.9%
Space Separator 600
 
16.7%
Decimal Number 403
 
11.2%
Close Punctuation 129
 
3.6%
Open Punctuation 129
 
3.6%
Other Punctuation 67
 
1.9%
Uppercase Letter 6
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
153
 
6.8%
137
 
6.0%
134
 
5.9%
134
 
5.9%
131
 
5.8%
129
 
5.7%
129
 
5.7%
128
 
5.7%
127
 
5.6%
127
 
5.6%
Other values (141) 936
41.3%
Decimal Number
ValueCountFrequency (%)
1 82
20.3%
4 58
14.4%
2 54
13.4%
3 48
11.9%
0 42
10.4%
5 41
10.2%
6 32
 
7.9%
7 23
 
5.7%
8 13
 
3.2%
9 10
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
E 2
33.3%
T 2
33.3%
C 1
16.7%
S 1
16.7%
Space Separator
ValueCountFrequency (%)
600
100.0%
Close Punctuation
ValueCountFrequency (%)
) 129
100.0%
Open Punctuation
ValueCountFrequency (%)
( 129
100.0%
Other Punctuation
ValueCountFrequency (%)
, 67
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2265
62.9%
Common 1329
36.9%
Latin 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
153
 
6.8%
137
 
6.0%
134
 
5.9%
134
 
5.9%
131
 
5.8%
129
 
5.7%
129
 
5.7%
128
 
5.7%
127
 
5.6%
127
 
5.6%
Other values (141) 936
41.3%
Common
ValueCountFrequency (%)
600
45.1%
) 129
 
9.7%
( 129
 
9.7%
1 82
 
6.2%
, 67
 
5.0%
4 58
 
4.4%
2 54
 
4.1%
3 48
 
3.6%
0 42
 
3.2%
5 41
 
3.1%
Other values (5) 79
 
5.9%
Latin
ValueCountFrequency (%)
E 2
33.3%
T 2
33.3%
C 1
16.7%
S 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2265
62.9%
ASCII 1335
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
600
44.9%
) 129
 
9.7%
( 129
 
9.7%
1 82
 
6.1%
, 67
 
5.0%
4 58
 
4.3%
2 54
 
4.0%
3 48
 
3.6%
0 42
 
3.1%
5 41
 
3.1%
Other values (9) 85
 
6.4%
Hangul
ValueCountFrequency (%)
153
 
6.8%
137
 
6.0%
134
 
5.9%
134
 
5.9%
131
 
5.8%
129
 
5.7%
129
 
5.7%
128
 
5.7%
127
 
5.6%
127
 
5.6%
Other values (141) 936
41.3%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct47
Distinct (%)61.8%
Missing404
Missing (%)84.2%
Infinite0
Infinite (%)0.0%
Mean6180.2763
Minimum6017
Maximum6362
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-05-11T16:05:23.684598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6017
5-th percentile6035.25
Q16130.5
median6173
Q36221
95-th percentile6331.5
Maximum6362
Range345
Interquartile range (IQR)90.5

Descriptive statistics

Standard deviation84.163587
Coefficient of variation (CV)0.013618094
Kurtosis-0.14135949
Mean6180.2763
Median Absolute Deviation (MAD)48
Skewness0.26876874
Sum469701
Variance7083.5093
MonotonicityNot monotonic
2024-05-11T16:05:23.894101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
6164 9
 
1.9%
6194 5
 
1.0%
6221 4
 
0.8%
6331 4
 
0.8%
6110 3
 
0.6%
6132 2
 
0.4%
6181 2
 
0.4%
6058 2
 
0.4%
6236 2
 
0.4%
6174 2
 
0.4%
Other values (37) 41
 
8.5%
(Missing) 404
84.2%
ValueCountFrequency (%)
6017 1
0.2%
6026 2
0.4%
6027 1
0.2%
6038 1
0.2%
6048 1
0.2%
6058 2
0.4%
6071 1
0.2%
6088 1
0.2%
6098 1
0.2%
6109 2
0.4%
ValueCountFrequency (%)
6362 1
 
0.2%
6353 1
 
0.2%
6351 1
 
0.2%
6333 1
 
0.2%
6331 4
0.8%
6330 1
 
0.2%
6294 1
 
0.2%
6287 1
 
0.2%
6261 1
 
0.2%
6249 1
 
0.2%
Distinct430
Distinct (%)89.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2024-05-11T16:05:24.215115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length24
Mean length9
Min length2

Characters and Unicode

Total characters4320
Distinct characters376
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique394 ?
Unique (%)82.1%

Sample

1st row원일빌딩
2nd row삼대양개발(주)
3rd row용창산업(주) 영동관광호텔
4th row영창빌딩
5th row한국페인트잉크협동조합
ValueCountFrequency (%)
주식회사 24
 
4.1%
새서울철도 6
 
1.0%
한국철도공사 6
 
1.0%
사)한국무역협회전시장 5
 
0.9%
재단법인 4
 
0.7%
9호선 4
 
0.7%
개포자이프레지던스아파트 4
 
0.7%
이지스제210호전문투자형사모부동산투자회사 4
 
0.7%
삼성서울병원 3
 
0.5%
포스코홀딩스㈜ 3
 
0.5%
Other values (476) 524
89.3%
2024-05-11T16:05:24.801979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 220
 
5.1%
( 219
 
5.1%
212
 
4.9%
107
 
2.5%
105
 
2.4%
101
 
2.3%
101
 
2.3%
81
 
1.9%
70
 
1.6%
64
 
1.5%
Other values (366) 3040
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3670
85.0%
Close Punctuation 220
 
5.1%
Open Punctuation 219
 
5.1%
Space Separator 107
 
2.5%
Decimal Number 57
 
1.3%
Uppercase Letter 22
 
0.5%
Other Symbol 13
 
0.3%
Lowercase Letter 5
 
0.1%
Other Punctuation 4
 
0.1%
Letter Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
212
 
5.8%
105
 
2.9%
101
 
2.8%
101
 
2.8%
81
 
2.2%
70
 
1.9%
64
 
1.7%
59
 
1.6%
59
 
1.6%
57
 
1.6%
Other values (332) 2761
75.2%
Uppercase Letter
ValueCountFrequency (%)
S 6
27.3%
J 3
13.6%
L 2
 
9.1%
N 2
 
9.1%
K 2
 
9.1%
I 1
 
4.5%
G 1
 
4.5%
C 1
 
4.5%
M 1
 
4.5%
B 1
 
4.5%
Other values (2) 2
 
9.1%
Decimal Number
ValueCountFrequency (%)
2 13
22.8%
1 9
15.8%
3 9
15.8%
9 7
12.3%
4 6
10.5%
0 5
 
8.8%
7 4
 
7.0%
8 3
 
5.3%
6 1
 
1.8%
Lowercase Letter
ValueCountFrequency (%)
s 2
40.0%
t 1
20.0%
x 1
20.0%
j 1
20.0%
Other Punctuation
ValueCountFrequency (%)
? 2
50.0%
. 1
25.0%
, 1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 220
100.0%
Open Punctuation
ValueCountFrequency (%)
( 219
100.0%
Space Separator
ValueCountFrequency (%)
107
100.0%
Other Symbol
ValueCountFrequency (%)
13
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3683
85.3%
Common 608
 
14.1%
Latin 29
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
212
 
5.8%
105
 
2.9%
101
 
2.7%
101
 
2.7%
81
 
2.2%
70
 
1.9%
64
 
1.7%
59
 
1.6%
59
 
1.6%
57
 
1.5%
Other values (333) 2774
75.3%
Latin
ValueCountFrequency (%)
S 6
20.7%
J 3
10.3%
s 2
 
6.9%
L 2
 
6.9%
N 2
 
6.9%
2
 
6.9%
K 2
 
6.9%
t 1
 
3.4%
x 1
 
3.4%
I 1
 
3.4%
Other values (7) 7
24.1%
Common
ValueCountFrequency (%)
) 220
36.2%
( 219
36.0%
107
17.6%
2 13
 
2.1%
1 9
 
1.5%
3 9
 
1.5%
9 7
 
1.2%
4 6
 
1.0%
0 5
 
0.8%
7 4
 
0.7%
Other values (6) 9
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3670
85.0%
ASCII 635
 
14.7%
None 13
 
0.3%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 220
34.6%
( 219
34.5%
107
16.9%
2 13
 
2.0%
1 9
 
1.4%
3 9
 
1.4%
9 7
 
1.1%
S 6
 
0.9%
4 6
 
0.9%
0 5
 
0.8%
Other values (22) 34
 
5.4%
Hangul
ValueCountFrequency (%)
212
 
5.8%
105
 
2.9%
101
 
2.8%
101
 
2.8%
81
 
2.2%
70
 
1.9%
64
 
1.7%
59
 
1.6%
59
 
1.6%
57
 
1.6%
Other values (332) 2761
75.2%
None
ValueCountFrequency (%)
13
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Distinct455
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
Minimum2002-04-01 00:00:00
Maximum2024-05-09 16:05:10
2024-05-11T16:05:25.015034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:25.232953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
I
285 
U
195 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowU
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 285
59.4%
U 195
40.6%

Length

2024-05-11T16:05:25.460156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:25.586320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 285
59.4%
u 195
40.6%
Distinct170
Distinct (%)35.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T16:05:25.729956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:05:25.941375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업태구분명
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
제조
454 
저장소
 
16
판매
 
10

Length

Max length3
Median length2
Mean length2.0333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제조
2nd row제조
3rd row제조
4th row제조
5th row제조

Common Values

ValueCountFrequency (%)
제조 454
94.6%
저장소 16
 
3.3%
판매 10
 
2.1%

Length

2024-05-11T16:05:26.121906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:26.248209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조 454
94.6%
저장소 16
 
3.3%
판매 10
 
2.1%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct339
Distinct (%)78.1%
Missing46
Missing (%)9.6%
Infinite0
Infinite (%)0.0%
Mean203976.38
Minimum201588
Maximum209063.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-05-11T16:05:26.412080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201588
5-th percentile202046.19
Q1202774.59
median203745.3
Q3205039.87
95-th percentile206426.99
Maximum209063.7
Range7475.6973
Interquartile range (IQR)2265.2786

Descriptive statistics

Standard deviation1468.8936
Coefficient of variation (CV)0.0072012925
Kurtosis0.50098984
Mean203976.38
Median Absolute Deviation (MAD)1076.8306
Skewness0.76829663
Sum88525749
Variance2157648.3
MonotonicityNot monotonic
2024-05-11T16:05:26.651059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
205130.591678902 10
 
2.1%
204895.310005801 7
 
1.5%
205060.813210698 5
 
1.0%
205710.170088377 5
 
1.0%
207534.49462606 5
 
1.0%
205340.631121567 5
 
1.0%
204213.643236507 4
 
0.8%
204669.543366778 4
 
0.8%
203016.183843955 4
 
0.8%
206239.623997375 4
 
0.8%
Other values (329) 381
79.4%
(Missing) 46
 
9.6%
ValueCountFrequency (%)
201588.001969935 2
0.4%
201595.925382148 1
0.2%
201646.385389914 1
0.2%
201658.416590861 1
0.2%
201730.153319017 1
0.2%
201750.935 1
0.2%
201757.105695785 1
0.2%
201763.662103756 1
0.2%
201774.365077827 2
0.4%
201828.756638827 1
0.2%
ValueCountFrequency (%)
209063.699252024 1
 
0.2%
209052.072465426 1
 
0.2%
208937.760652081 1
 
0.2%
208825.514283308 2
 
0.4%
208184.247157358 1
 
0.2%
207834.156216454 1
 
0.2%
207715.942548592 1
 
0.2%
207534.49462606 5
1.0%
207399.55165484 1
 
0.2%
207254.144850327 3
0.6%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct339
Distinct (%)78.1%
Missing46
Missing (%)9.6%
Infinite0
Infinite (%)0.0%
Mean444997.36
Minimum441112.7
Maximum447402.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-05-11T16:05:26.903524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum441112.7
5-th percentile442715.08
Q1444137.25
median445022.71
Q3445828.89
95-th percentile446987.79
Maximum447402.3
Range6289.6026
Interquartile range (IQR)1691.6438

Descriptive statistics

Standard deviation1278.2134
Coefficient of variation (CV)0.0028724068
Kurtosis-0.38888396
Mean444997.36
Median Absolute Deviation (MAD)850.65143
Skewness-0.2645835
Sum1.9312886 × 108
Variance1633829.6
MonotonicityNot monotonic
2024-05-11T16:05:27.142527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
445590.096837802 10
 
2.1%
444842.923410013 7
 
1.5%
445022.708339772 5
 
1.0%
444934.283205797 5
 
1.0%
442943.860447203 5
 
1.0%
445354.571117445 5
 
1.0%
444113.028210915 4
 
0.8%
443873.621189048 4
 
0.8%
444168.205988118 4
 
0.8%
442598.51298973 4
 
0.8%
Other values (329) 381
79.4%
(Missing) 46
 
9.6%
ValueCountFrequency (%)
441112.696875025 1
 
0.2%
441557.179656218 1
 
0.2%
441814.435 1
 
0.2%
441905.92831718 2
0.4%
442131.941663523 3
0.6%
442205.362043985 1
 
0.2%
442344.083634873 1
 
0.2%
442426.399301013 1
 
0.2%
442456.005 1
 
0.2%
442521.07233718 1
 
0.2%
ValueCountFrequency (%)
447402.299460543 1
0.2%
447392.192148188 1
0.2%
447379.84918057 1
0.2%
447359.687525534 1
0.2%
447335.760650218 1
0.2%
447287.490354431 1
0.2%
447271.99 2
0.4%
447221.69432852 1
0.2%
447206.956641768 1
0.2%
447196.182046397 1
0.2%

제조구분명
Categorical

Distinct5
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
냉동
278 
<NA>
118 
일반
70 
특정
 
10
충전
 
4

Length

Max length4
Median length2
Mean length2.4916667
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row냉동
2nd row냉동
3rd row냉동
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
냉동 278
57.9%
<NA> 118
24.6%
일반 70
 
14.6%
특정 10
 
2.1%
충전 4
 
0.8%

Length

2024-05-11T16:05:27.373862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:05:27.515373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
냉동 278
57.9%
na 118
24.6%
일반 70
 
14.6%
특정 10
 
2.1%
충전 4
 
0.8%
Distinct13
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
기타
211 
<NA>
168 
상업.업무용
55 
업무용
 
14
상업용
 
12
Other values (8)
 
20

Length

Max length6
Median length4
Mean length3.3291667
Min length2

Unique

Unique5 ?
Unique (%)1.0%

Sample

1st row<NA>
2nd row기타
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 211
44.0%
<NA> 168
35.0%
상업.업무용 55
 
11.5%
업무용 14
 
2.9%
상업용 12
 
2.5%
지정되지않음 9
 
1.9%
상업기타 4
 
0.8%
주거용 2
 
0.4%
주거기타 1
 
0.2%
공업용 1
 
0.2%
Other values (3) 3
 
0.6%

Length

2024-05-11T16:05:28.013265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
기타 211
44.0%
na 168
35.0%
상업.업무용 55
 
11.5%
업무용 14
 
2.9%
상업용 12
 
2.5%
지정되지않음 9
 
1.9%
상업기타 4
 
0.8%
주거용 2
 
0.4%
주거기타 1
 
0.2%
공업용 1
 
0.2%
Other values (3) 3
 
0.6%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)제조구분명사업장부지용도구분명
03220000190032201270220011420080619<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 삼성동 78-3<NA><NA>원일빌딩2008-06-19 11:21:20I2018-08-31 23:59:59.0제조205060.318785446343.538486냉동<NA>
13220000195032200830220017219500516<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 신사동 543<NA><NA>삼대양개발(주)2006-03-06 00:00:00I2018-08-31 23:59:59.0제조202047.42204446436.905216냉동기타
23220000197932200830220000320090917<NA>3폐업3폐업20181231<NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 6<NA><NA>용창산업(주) 영동관광호텔2018-12-31 10:39:34U2019-01-02 02:40:00.0제조202108.433847446160.61824냉동기타
33220000198032200830220001120100115<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 18-3<NA><NA>영창빌딩2010-01-15 18:43:14I2018-08-31 23:59:59.0제조201750.935445774.145일반기타
43220000198032200830220001219800801<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 204-6<NA><NA>한국페인트잉크협동조합2016-06-24 10:45:27I2018-08-31 23:59:59.0제조202491.418046444881.11183일반기타
53220000198132200830220001419810710<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 130-9<NA><NA>(주)기린건축2005-03-28 00:00:00I2018-08-31 23:59:59.0제조<NA><NA>냉동기타
63220000198132200830220001520080721<NA>3폐업3폐업20080721<NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 199<NA><NA>주식회사 에이치케이상호저축은행2008-07-21 15:17:12I2018-08-31 23:59:59.0제조202077.942568444851.37383냉동기타
73220000198132200830220001619810720<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 대치동 27-1<NA><NA>대한도시가스(주)2011-08-29 10:18:10I2018-08-31 23:59:59.0제조206629.761344444129.159196일반기타
83220000198332200830220004620090731<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 역삼동 833<NA><NA>대려도2015-03-23 15:43:50I2018-08-31 23:59:59.0제조202763.197583443461.085901냉동기타
93220000198332200830220005319831126<NA>3폐업3폐업20210713<NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 213-5<NA><NA>JS빌딩2021-07-13 14:46:35U2021-07-15 02:40:00.0제조202827.319821445717.213521일반기타
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)제조구분명사업장부지용도구분명
470322000020233220176021000012023-02-07<NA>1영업/정상1영업중<NA><NA><NA>2023-04-17<NA><NA><NA>서울특별시 강남구 삼성동 159 코엑스인터콘티넨탈서울서울특별시 강남구 봉은사로 524, 코엑스인터콘티넨탈서울 지하5층 (삼성동)6164파르나스호텔㈜2023-04-14 17:51:22U2022-12-03 23:06:00.0제조205130.591679445590.096838<NA><NA>
471322000020233220176021000022023-02-07<NA>1영업/정상1영업중<NA><NA><NA>2023-06-08<NA><NA><NA>서울특별시 강남구 역삼동 646 코레이트 타워서울특별시 강남구 테헤란로 137, 코레이트 타워 지하7층 (역삼동)6132(주)코레이트타워위탁관리부동산투자회사2023-06-07 15:30:40U2022-12-06 00:09:00.0제조202935.293419444246.724506<NA><NA>
472322000020233220176021000032023-12-12<NA>1영업/정상1영업중<NA><NA><NA>2023-12-21<NA><NA><NA>서울특별시 강남구 역삼동 755 한솔필리아<NA><NA>주식회사 퍼시픽아레나2023-12-21 16:25:07U2022-11-01 22:03:00.0제조204213.643237444113.028211<NA><NA>
473322000020233220176022000012023-04-20<NA>1영업/정상1영업중<NA><NA><NA>2023-06-12<NA><NA><NA>서울특별시 강남구 역삼동 720-2 삼익 라비돌 빌딩서울특별시 강남구 테헤란로 234, 삼익 라비돌 빌딩 (역삼동)6221삼익라비돌빌딩2023-06-09 09:47:27U2022-12-05 23:01:00.0제조203628.379146444413.806441<NA><NA>
474322000020233220176022000022023-01-19<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 개포동 189 개포자이 프레지던스 407동서울특별시 강남구 삼성로 14 (개포동, 개포자이 프레지던스)6331개포자이프레지던스아파트 407동2023-08-04 10:21:39I2022-12-08 00:06:00.0제조206239.623997442598.51299<NA><NA>
475322000020233220176022000032023-01-19<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 개포동 189 개포자이 프레지던스 413동서울특별시 강남구 삼성로 14 (개포동, 개포자이 프레지던스)6331개포자이프레지던스아파트 413동2023-08-04 10:12:22I2022-12-08 00:06:00.0제조206239.623997442598.51299<NA><NA>
476322000020233220176022000042023-01-19<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 개포동 189 개포자이 프레지던스 427동서울특별시 강남구 삼성로 14 (개포동, 개포자이 프레지던스)6331개포자이프레지던스아파트 427동2023-08-04 10:16:38I2022-12-08 00:06:00.0제조206239.623997442598.51299<NA><NA>
477322000020233220176022000052023-01-19<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 개포동 189 개포자이 프레지던스 434동서울특별시 강남구 삼성로 14 (개포동, 개포자이 프레지던스)6331개포자이프레지던스아파트 434동2023-08-04 10:20:18I2022-12-08 00:06:00.0제조206239.623997442598.51299<NA><NA>
478322000020233220176022000072023-09-22<NA>1영업/정상1영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 수서동 728서울특별시 강남구 밤고개로 76-2 (수서동)6362국가철도공단(수도권본부)2023-09-25 11:07:27I2022-12-08 22:07:00.0제조208825.514283442900.219169<NA><NA>
479322000020243220176021000012024-02-28<NA>2휴업2휴업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 논현동 248-7 임피리얼 팰리스호텔서울특별시 강남구 언주로 640, 임피리얼 팰리스호텔 (논현동)6098(주)태승이십일2024-02-28 17:35:08I2023-12-03 00:01:00.0제조203102.074986445751.652028<NA><NA>