Overview

Dataset statistics

Number of variables34
Number of observations361
Missing cells3792
Missing cells (%)30.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory102.4 KiB
Average record size in memory290.4 B

Variable types

Categorical12
Text6
DateTime4
Unsupported8
Numeric4

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),문화체육업종명,공사립구분명,보험가입여부코드,지도자수,건축물동수,건축물연면적,회원모집총인원,세부업종명,법인명
Author강서구
URLhttps://data.seoul.go.kr/dataList/OA-19867/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
보험가입여부코드 is highly imbalanced (64.7%)Imbalance
회원모집총인원 is highly imbalanced (58.7%)Imbalance
인허가취소일자 has 361 (100.0%) missing valuesMissing
폐업일자 has 248 (68.7%) missing valuesMissing
휴업시작일자 has 361 (100.0%) missing valuesMissing
휴업종료일자 has 361 (100.0%) missing valuesMissing
재개업일자 has 361 (100.0%) missing valuesMissing
전화번호 has 138 (38.2%) missing valuesMissing
소재지면적 has 361 (100.0%) missing valuesMissing
소재지우편번호 has 235 (65.1%) missing valuesMissing
도로명주소 has 4 (1.1%) missing valuesMissing
도로명우편번호 has 78 (21.6%) missing valuesMissing
업태구분명 has 361 (100.0%) missing valuesMissing
좌표정보(X) has 16 (4.4%) missing valuesMissing
좌표정보(Y) has 16 (4.4%) missing valuesMissing
건축물연면적 has 169 (46.8%) missing valuesMissing
세부업종명 has 361 (100.0%) missing valuesMissing
법인명 has 361 (100.0%) missing valuesMissing
관리번호 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업시작일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업종료일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
세부업종명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
법인명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
건축물연면적 has 25 (6.9%) zerosZeros

Reproduction

Analysis started2024-05-11 07:03:52.086502
Analysis finished2024-05-11 07:03:53.327669
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
3150000
361 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3150000
2nd row3150000
3rd row3150000
4th row3150000
5th row3150000

Common Values

ValueCountFrequency (%)
3150000 361
100.0%

Length

2024-05-11T16:03:53.470919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:03:53.728061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3150000 361
100.0%

관리번호
Text

UNIQUE 

Distinct361
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-05-11T16:03:54.018673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length20
Mean length20
Min length20

Characters and Unicode

Total characters7220
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique361 ?
Unique (%)100.0%

Sample

1st rowCDFH3301061989000001
2nd rowCDFH3301061989000002
3rd rowCDFH3301061989000003
4th rowCDFH3301061989000004
5th rowCDFH3301061989000005
ValueCountFrequency (%)
cdfh3301061989000001 1
 
0.3%
cdfh3301062021000008 1
 
0.3%
cdfh3301062021000006 1
 
0.3%
cdfh3301062021000005 1
 
0.3%
cdfh3301062021000004 1
 
0.3%
cdfh3301062021000003 1
 
0.3%
cdfh3301062021000002 1
 
0.3%
cdfh3301062021000001 1
 
0.3%
cdfh3301062020000031 1
 
0.3%
cdfh3301062020000030 1
 
0.3%
Other values (351) 351
97.2%
2024-05-11T16:03:54.657890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2842
39.4%
3 825
 
11.4%
1 684
 
9.5%
2 628
 
8.7%
6 417
 
5.8%
C 361
 
5.0%
D 361
 
5.0%
F 361
 
5.0%
H 361
 
5.0%
9 125
 
1.7%
Other values (4) 255
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5776
80.0%
Uppercase Letter 1444
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2842
49.2%
3 825
 
14.3%
1 684
 
11.8%
2 628
 
10.9%
6 417
 
7.2%
9 125
 
2.2%
4 78
 
1.4%
5 64
 
1.1%
8 60
 
1.0%
7 53
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
C 361
25.0%
D 361
25.0%
F 361
25.0%
H 361
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5776
80.0%
Latin 1444
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2842
49.2%
3 825
 
14.3%
1 684
 
11.8%
2 628
 
10.9%
6 417
 
7.2%
9 125
 
2.2%
4 78
 
1.4%
5 64
 
1.1%
8 60
 
1.0%
7 53
 
0.9%
Latin
ValueCountFrequency (%)
C 361
25.0%
D 361
25.0%
F 361
25.0%
H 361
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7220
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2842
39.4%
3 825
 
11.4%
1 684
 
9.5%
2 628
 
8.7%
6 417
 
5.8%
C 361
 
5.0%
D 361
 
5.0%
F 361
 
5.0%
H 361
 
5.0%
9 125
 
1.7%
Other values (4) 255
 
3.5%
Distinct341
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum1989-12-06 00:00:00
Maximum2024-04-29 00:00:00
2024-05-11T16:03:54.931532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:03:55.203418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB
Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
1
248 
3
107 
4
 
6

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row4
3rd row3
4th row4
5th row1

Common Values

ValueCountFrequency (%)
1 248
68.7%
3 107
29.6%
4 6
 
1.7%

Length

2024-05-11T16:03:55.452333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:03:55.645283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 248
68.7%
3 107
29.6%
4 6
 
1.7%

영업상태명
Categorical

Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
영업/정상
248 
폐업
107 
취소/말소/만료/정지/중지
 
6

Length

Max length14
Median length5
Mean length4.2603878
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row취소/말소/만료/정지/중지
3rd row폐업
4th row취소/말소/만료/정지/중지
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 248
68.7%
폐업 107
29.6%
취소/말소/만료/정지/중지 6
 
1.7%

Length

2024-05-11T16:03:55.838852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:03:56.030365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 248
68.7%
폐업 107
29.6%
취소/말소/만료/정지/중지 6
 
1.7%
Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
13
248 
3
107 
35
 
6

Length

Max length2
Median length2
Mean length1.7036011
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row35
3rd row3
4th row35
5th row13

Common Values

ValueCountFrequency (%)
13 248
68.7%
3 107
29.6%
35 6
 
1.7%

Length

2024-05-11T16:03:56.217251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:03:56.390139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
13 248
68.7%
3 107
29.6%
35 6
 
1.7%
Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
영업중
248 
폐업
107 
직권말소
 
6

Length

Max length4
Median length3
Mean length2.7202216
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row직권말소
3rd row폐업
4th row직권말소
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 248
68.7%
폐업 107
29.6%
직권말소 6
 
1.7%

Length

2024-05-11T16:03:56.592550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:03:56.792715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 248
68.7%
폐업 107
29.6%
직권말소 6
 
1.7%

폐업일자
Date

MISSING 

Distinct106
Distinct (%)93.8%
Missing248
Missing (%)68.7%
Memory size2.9 KiB
Minimum1999-12-20 00:00:00
Maximum2024-05-03 00:00:00
2024-05-11T16:03:57.035779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:03:57.281399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

휴업종료일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

전화번호
Text

MISSING 

Distinct217
Distinct (%)97.3%
Missing138
Missing (%)38.2%
Memory size2.9 KiB
2024-05-11T16:03:57.719042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length9
Mean length9.9147982
Min length8

Characters and Unicode

Total characters2211
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique211 ?
Unique (%)94.6%

Sample

1st row662-2085
2nd row663-1064
3rd row663-7696
4th row2605-0088
5th row642-6993
ValueCountFrequency (%)
02-6741-1010 2
 
0.9%
2661-3500 2
 
0.9%
02-2662-0676 2
 
0.9%
02-2658-6177 2
 
0.9%
2606-2112 2
 
0.9%
2666-5454 2
 
0.9%
3664-2848 1
 
0.4%
702-7307 1
 
0.4%
2662-0676 1
 
0.4%
6339-7575 1
 
0.4%
Other values (207) 207
92.8%
2024-05-11T16:03:58.322286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 411
18.6%
2 308
13.9%
- 294
13.3%
0 247
11.2%
3 174
7.9%
1 148
 
6.7%
8 138
 
6.2%
5 137
 
6.2%
7 127
 
5.7%
9 117
 
5.3%
Other values (4) 110
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1913
86.5%
Dash Punctuation 294
 
13.3%
Close Punctuation 2
 
0.1%
Other Punctuation 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 411
21.5%
2 308
16.1%
0 247
12.9%
3 174
9.1%
1 148
 
7.7%
8 138
 
7.2%
5 137
 
7.2%
7 127
 
6.6%
9 117
 
6.1%
4 106
 
5.5%
Dash Punctuation
ValueCountFrequency (%)
- 294
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2211
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 411
18.6%
2 308
13.9%
- 294
13.3%
0 247
11.2%
3 174
7.9%
1 148
 
6.7%
8 138
 
6.2%
5 137
 
6.2%
7 127
 
5.7%
9 117
 
5.3%
Other values (4) 110
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2211
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 411
18.6%
2 308
13.9%
- 294
13.3%
0 247
11.2%
3 174
7.9%
1 148
 
6.7%
8 138
 
6.2%
5 137
 
6.2%
7 127
 
5.7%
9 117
 
5.3%
Other values (4) 110
 
5.0%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

소재지우편번호
Text

MISSING 

Distinct62
Distinct (%)49.2%
Missing235
Missing (%)65.1%
Memory size2.9 KiB
2024-05-11T16:03:58.676819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.0793651
Min length6

Characters and Unicode

Total characters766
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)28.6%

Sample

1st row157833
2nd row157853
3rd row157811
4th row157884
5th row157898
ValueCountFrequency (%)
157030 10
 
7.9%
157910 7
 
5.6%
157851 5
 
4.0%
157210 5
 
4.0%
157801 4
 
3.2%
157918 4
 
3.2%
157840 4
 
3.2%
157839 4
 
3.2%
157897 4
 
3.2%
157881 4
 
3.2%
Other values (52) 75
59.5%
2024-05-11T16:03:59.261799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 167
21.8%
7 151
19.7%
5 144
18.8%
8 92
12.0%
0 66
 
8.6%
9 41
 
5.4%
3 37
 
4.8%
2 24
 
3.1%
4 19
 
2.5%
6 15
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 756
98.7%
Dash Punctuation 10
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 167
22.1%
7 151
20.0%
5 144
19.0%
8 92
12.2%
0 66
 
8.7%
9 41
 
5.4%
3 37
 
4.9%
2 24
 
3.2%
4 19
 
2.5%
6 15
 
2.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 766
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 167
21.8%
7 151
19.7%
5 144
18.8%
8 92
12.0%
0 66
 
8.6%
9 41
 
5.4%
3 37
 
4.8%
2 24
 
3.1%
4 19
 
2.5%
6 15
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 766
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 167
21.8%
7 151
19.7%
5 144
18.8%
8 92
12.0%
0 66
 
8.6%
9 41
 
5.4%
3 37
 
4.8%
2 24
 
3.1%
4 19
 
2.5%
6 15
 
2.0%
Distinct354
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-05-11T16:03:59.691318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length45
Mean length28.495845
Min length17

Characters and Unicode

Total characters10287
Distinct characters267
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique347 ?
Unique (%)96.1%

Sample

1st row서울특별시 강서구 내발산동 701-1번지
2nd row서울특별시 강서구 방화동 619-7번지
3rd row서울특별시 강서구 공항동 22-24번지
4th row서울특별시 강서구 화곡동 373-14번지
5th row서울특별시 강서구 화곡동 359-54번지
ValueCountFrequency (%)
서울특별시 361
18.8%
강서구 361
18.8%
화곡동 101
 
5.3%
마곡동 78
 
4.1%
등촌동 67
 
3.5%
가양동 28
 
1.5%
염창동 26
 
1.4%
내발산동 25
 
1.3%
방화동 24
 
1.2%
3층 21
 
1.1%
Other values (645) 831
43.2%
2024-05-11T16:04:00.322048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1764
 
17.1%
734
 
7.1%
1 407
 
4.0%
389
 
3.8%
380
 
3.7%
367
 
3.6%
364
 
3.5%
361
 
3.5%
361
 
3.5%
361
 
3.5%
Other values (257) 4799
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5924
57.6%
Decimal Number 2146
 
20.9%
Space Separator 1764
 
17.1%
Dash Punctuation 306
 
3.0%
Uppercase Letter 71
 
0.7%
Other Punctuation 45
 
0.4%
Math Symbol 16
 
0.2%
Letter Number 8
 
0.1%
Lowercase Letter 3
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
734
 
12.4%
389
 
6.6%
380
 
6.4%
367
 
6.2%
364
 
6.1%
361
 
6.1%
361
 
6.1%
361
 
6.1%
210
 
3.5%
200
 
3.4%
Other values (221) 2197
37.1%
Uppercase Letter
ValueCountFrequency (%)
B 41
57.7%
A 10
 
14.1%
I 3
 
4.2%
H 3
 
4.2%
D 3
 
4.2%
J 2
 
2.8%
K 2
 
2.8%
V 1
 
1.4%
S 1
 
1.4%
P 1
 
1.4%
Other values (4) 4
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 407
19.0%
0 248
11.6%
3 247
11.5%
2 234
10.9%
7 232
10.8%
4 178
8.3%
6 176
8.2%
5 167
7.8%
9 137
 
6.4%
8 120
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
h 1
33.3%
e 1
33.3%
b 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 44
97.8%
/ 1
 
2.2%
Letter Number
ValueCountFrequency (%)
4
50.0%
4
50.0%
Space Separator
ValueCountFrequency (%)
1764
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 306
100.0%
Math Symbol
ValueCountFrequency (%)
~ 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5924
57.6%
Common 4281
41.6%
Latin 82
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
734
 
12.4%
389
 
6.6%
380
 
6.4%
367
 
6.2%
364
 
6.1%
361
 
6.1%
361
 
6.1%
361
 
6.1%
210
 
3.5%
200
 
3.4%
Other values (221) 2197
37.1%
Latin
ValueCountFrequency (%)
B 41
50.0%
A 10
 
12.2%
4
 
4.9%
4
 
4.9%
I 3
 
3.7%
H 3
 
3.7%
D 3
 
3.7%
J 2
 
2.4%
K 2
 
2.4%
V 1
 
1.2%
Other values (9) 9
 
11.0%
Common
ValueCountFrequency (%)
1764
41.2%
1 407
 
9.5%
- 306
 
7.1%
0 248
 
5.8%
3 247
 
5.8%
2 234
 
5.5%
7 232
 
5.4%
4 178
 
4.2%
6 176
 
4.1%
5 167
 
3.9%
Other values (7) 322
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5924
57.6%
ASCII 4355
42.3%
Number Forms 8
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1764
40.5%
1 407
 
9.3%
- 306
 
7.0%
0 248
 
5.7%
3 247
 
5.7%
2 234
 
5.4%
7 232
 
5.3%
4 178
 
4.1%
6 176
 
4.0%
5 167
 
3.8%
Other values (24) 396
 
9.1%
Hangul
ValueCountFrequency (%)
734
 
12.4%
389
 
6.6%
380
 
6.4%
367
 
6.2%
364
 
6.1%
361
 
6.1%
361
 
6.1%
361
 
6.1%
210
 
3.5%
200
 
3.4%
Other values (221) 2197
37.1%
Number Forms
ValueCountFrequency (%)
4
50.0%
4
50.0%

도로명주소
Text

MISSING 

Distinct354
Distinct (%)99.2%
Missing4
Missing (%)1.1%
Memory size2.9 KiB
2024-05-11T16:04:00.776788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length48
Mean length35.747899
Min length22

Characters and Unicode

Total characters12762
Distinct characters279
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique351 ?
Unique (%)98.3%

Sample

1st row서울특별시 강서구 강서로 299 (내발산동)
2nd row서울특별시 강서구 개화동로27가길 33 (방화동)
3rd row서울특별시 강서구 월정로30길 27 (화곡동)
4th row서울특별시 강서구 가로공원로76길 100 (화곡동)
5th row서울특별시 강서구 곰달래로53길 19 (화곡동)
ValueCountFrequency (%)
서울특별시 357
 
14.9%
강서구 357
 
14.9%
화곡동 88
 
3.7%
마곡동 76
 
3.2%
등촌동 59
 
2.5%
강서로 54
 
2.3%
공항대로 43
 
1.8%
양천로 35
 
1.5%
2층 28
 
1.2%
3층 28
 
1.2%
Other values (704) 1265
52.9%
2024-05-11T16:04:01.857291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2097
 
16.4%
815
 
6.4%
, 460
 
3.6%
458
 
3.6%
1 448
 
3.5%
403
 
3.2%
368
 
2.9%
363
 
2.8%
360
 
2.8%
) 358
 
2.8%
Other values (269) 6632
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7147
56.0%
Decimal Number 2141
 
16.8%
Space Separator 2097
 
16.4%
Other Punctuation 462
 
3.6%
Close Punctuation 358
 
2.8%
Open Punctuation 358
 
2.8%
Uppercase Letter 109
 
0.9%
Math Symbol 40
 
0.3%
Dash Punctuation 38
 
0.3%
Letter Number 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
815
 
11.4%
458
 
6.4%
403
 
5.6%
368
 
5.1%
363
 
5.1%
360
 
5.0%
357
 
5.0%
357
 
5.0%
357
 
5.0%
300
 
4.2%
Other values (232) 3009
42.1%
Uppercase Letter
ValueCountFrequency (%)
B 77
70.6%
A 11
 
10.1%
I 3
 
2.8%
H 3
 
2.8%
S 2
 
1.8%
J 2
 
1.8%
K 2
 
1.8%
D 2
 
1.8%
Z 1
 
0.9%
T 1
 
0.9%
Other values (5) 5
 
4.6%
Decimal Number
ValueCountFrequency (%)
1 448
20.9%
0 292
13.6%
3 283
13.2%
2 277
12.9%
4 192
9.0%
5 190
8.9%
6 165
 
7.7%
7 105
 
4.9%
8 95
 
4.4%
9 94
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
b 2
50.0%
e 1
25.0%
h 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 460
99.6%
/ 2
 
0.4%
Letter Number
ValueCountFrequency (%)
4
50.0%
4
50.0%
Space Separator
ValueCountFrequency (%)
2097
100.0%
Close Punctuation
ValueCountFrequency (%)
) 358
100.0%
Open Punctuation
ValueCountFrequency (%)
( 358
100.0%
Math Symbol
ValueCountFrequency (%)
~ 40
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7147
56.0%
Common 5494
43.0%
Latin 121
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
815
 
11.4%
458
 
6.4%
403
 
5.6%
368
 
5.1%
363
 
5.1%
360
 
5.0%
357
 
5.0%
357
 
5.0%
357
 
5.0%
300
 
4.2%
Other values (232) 3009
42.1%
Latin
ValueCountFrequency (%)
B 77
63.6%
A 11
 
9.1%
4
 
3.3%
4
 
3.3%
I 3
 
2.5%
H 3
 
2.5%
S 2
 
1.7%
J 2
 
1.7%
K 2
 
1.7%
b 2
 
1.7%
Other values (10) 11
 
9.1%
Common
ValueCountFrequency (%)
2097
38.2%
, 460
 
8.4%
1 448
 
8.2%
) 358
 
6.5%
( 358
 
6.5%
0 292
 
5.3%
3 283
 
5.2%
2 277
 
5.0%
4 192
 
3.5%
5 190
 
3.5%
Other values (7) 539
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7147
56.0%
ASCII 5607
43.9%
Number Forms 8
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2097
37.4%
, 460
 
8.2%
1 448
 
8.0%
) 358
 
6.4%
( 358
 
6.4%
0 292
 
5.2%
3 283
 
5.0%
2 277
 
4.9%
4 192
 
3.4%
5 190
 
3.4%
Other values (25) 652
 
11.6%
Hangul
ValueCountFrequency (%)
815
 
11.4%
458
 
6.4%
403
 
5.6%
368
 
5.1%
363
 
5.1%
360
 
5.0%
357
 
5.0%
357
 
5.0%
357
 
5.0%
300
 
4.2%
Other values (232) 3009
42.1%
Number Forms
ValueCountFrequency (%)
4
50.0%
4
50.0%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct122
Distinct (%)43.1%
Missing78
Missing (%)21.6%
Infinite0
Infinite (%)0.0%
Mean10853.438
Minimum7510
Maximum157937
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-11T16:04:02.111023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7510
5-th percentile7528.2
Q17581
median7651
Q37777.5
95-th percentile7806
Maximum157937
Range150427
Interquartile range (IQR)196.5

Descriptive statistics

Standard deviation21681.741
Coefficient of variation (CV)1.9976841
Kurtosis42.962962
Mean10853.438
Median Absolute Deviation (MAD)89
Skewness6.6826942
Sum3071523
Variance4.700979 × 108
MonotonicityNot monotonic
2024-05-11T16:04:02.339499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7803 17
 
4.7%
7788 8
 
2.2%
7599 8
 
2.2%
7807 8
 
2.2%
7802 7
 
1.9%
7801 6
 
1.7%
7714 6
 
1.7%
7715 6
 
1.7%
7774 6
 
1.7%
7806 5
 
1.4%
Other values (112) 206
57.1%
(Missing) 78
 
21.6%
ValueCountFrequency (%)
7510 1
 
0.3%
7516 1
 
0.3%
7517 1
 
0.3%
7519 1
 
0.3%
7525 2
0.6%
7526 4
1.1%
7527 2
0.6%
7528 3
0.8%
7530 2
0.6%
7532 2
0.6%
ValueCountFrequency (%)
157937 1
 
0.3%
157916 1
 
0.3%
157910 2
 
0.6%
157905 1
 
0.3%
157884 1
 
0.3%
7807 8
2.2%
7806 5
 
1.4%
7805 1
 
0.3%
7803 17
4.7%
7802 7
1.9%
Distinct355
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-05-11T16:04:02.846948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length19
Mean length7.7368421
Min length2

Characters and Unicode

Total characters2793
Distinct characters360
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique349 ?
Unique (%)96.7%

Sample

1st row코리아헬스크럽
2nd row국제헬스크럽
3rd row공항헬스크럽
4th row원헬스크럽
5th row백제헬스크럽
ValueCountFrequency (%)
휘트니스 27
 
4.3%
14
 
2.2%
pt 13
 
2.1%
gym 10
 
1.6%
크로스핏 8
 
1.3%
피트니스 7
 
1.1%
스튜디오 6
 
1.0%
에이블짐 5
 
0.8%
커브스 5
 
0.8%
스포애니 5
 
0.8%
Other values (450) 526
84.0%
2024-05-11T16:04:03.745894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
265
 
9.5%
209
 
7.5%
96
 
3.4%
71
 
2.5%
67
 
2.4%
61
 
2.2%
60
 
2.1%
54
 
1.9%
46
 
1.6%
41
 
1.5%
Other values (350) 1823
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2071
74.1%
Uppercase Letter 268
 
9.6%
Space Separator 265
 
9.5%
Lowercase Letter 100
 
3.6%
Decimal Number 44
 
1.6%
Close Punctuation 15
 
0.5%
Other Punctuation 14
 
0.5%
Open Punctuation 12
 
0.4%
Dash Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
209
 
10.1%
96
 
4.6%
71
 
3.4%
67
 
3.2%
61
 
2.9%
60
 
2.9%
54
 
2.6%
46
 
2.2%
41
 
2.0%
37
 
1.8%
Other values (288) 1329
64.2%
Uppercase Letter
ValueCountFrequency (%)
T 40
14.9%
P 31
11.6%
M 25
 
9.3%
Y 25
 
9.3%
G 22
 
8.2%
S 13
 
4.9%
R 11
 
4.1%
O 10
 
3.7%
A 10
 
3.7%
F 9
 
3.4%
Other values (13) 72
26.9%
Lowercase Letter
ValueCountFrequency (%)
n 12
12.0%
e 11
11.0%
i 10
10.0%
r 8
 
8.0%
a 8
 
8.0%
o 8
 
8.0%
t 7
 
7.0%
s 7
 
7.0%
u 5
 
5.0%
l 4
 
4.0%
Other values (9) 20
20.0%
Decimal Number
ValueCountFrequency (%)
2 11
25.0%
1 9
20.5%
0 7
15.9%
4 6
13.6%
8 4
 
9.1%
6 2
 
4.5%
5 2
 
4.5%
3 2
 
4.5%
9 1
 
2.3%
Other Punctuation
ValueCountFrequency (%)
. 5
35.7%
& 3
21.4%
: 2
 
14.3%
/ 1
 
7.1%
' 1
 
7.1%
, 1
 
7.1%
1
 
7.1%
Space Separator
ValueCountFrequency (%)
265
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2071
74.1%
Latin 368
 
13.2%
Common 354
 
12.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
209
 
10.1%
96
 
4.6%
71
 
3.4%
67
 
3.2%
61
 
2.9%
60
 
2.9%
54
 
2.6%
46
 
2.2%
41
 
2.0%
37
 
1.8%
Other values (288) 1329
64.2%
Latin
ValueCountFrequency (%)
T 40
 
10.9%
P 31
 
8.4%
M 25
 
6.8%
Y 25
 
6.8%
G 22
 
6.0%
S 13
 
3.5%
n 12
 
3.3%
e 11
 
3.0%
R 11
 
3.0%
O 10
 
2.7%
Other values (32) 168
45.7%
Common
ValueCountFrequency (%)
265
74.9%
) 15
 
4.2%
( 12
 
3.4%
2 11
 
3.1%
1 9
 
2.5%
0 7
 
2.0%
4 6
 
1.7%
. 5
 
1.4%
- 4
 
1.1%
8 4
 
1.1%
Other values (10) 16
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2071
74.1%
ASCII 721
 
25.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
265
36.8%
T 40
 
5.5%
P 31
 
4.3%
M 25
 
3.5%
Y 25
 
3.5%
G 22
 
3.1%
) 15
 
2.1%
S 13
 
1.8%
( 12
 
1.7%
n 12
 
1.7%
Other values (51) 261
36.2%
Hangul
ValueCountFrequency (%)
209
 
10.1%
96
 
4.6%
71
 
3.4%
67
 
3.2%
61
 
2.9%
60
 
2.9%
54
 
2.6%
46
 
2.2%
41
 
2.0%
37
 
1.8%
Other values (288) 1329
64.2%
None
ValueCountFrequency (%)
1
100.0%
Distinct349
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2003-04-18 11:50:37
Maximum2024-05-03 17:35:03
2024-05-11T16:04:04.118699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:04:04.359244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
I
216 
U
145 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowU

Common Values

ValueCountFrequency (%)
I 216
59.8%
U 145
40.2%

Length

2024-05-11T16:04:04.573674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:04.745028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 216
59.8%
u 145
40.2%
Distinct203
Distinct (%)56.2%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:05:00
2024-05-11T16:04:04.936584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T16:04:05.164773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업태구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct305
Distinct (%)88.4%
Missing16
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean185946.57
Minimum182924.51
Maximum189066.64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-11T16:04:05.417988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum182924.51
5-th percentile183280.68
Q1185163
median185894.51
Q3186920.98
95-th percentile188304.73
Maximum189066.64
Range6142.1376
Interquartile range (IQR)1757.9848

Descriptive statistics

Standard deviation1450.5221
Coefficient of variation (CV)0.0078007468
Kurtosis-0.42631397
Mean185946.57
Median Absolute Deviation (MAD)921.97667
Skewness-0.15011982
Sum64151567
Variance2104014.5
MonotonicityNot monotonic
2024-05-11T16:04:05.649614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
185660.0 4
 
1.1%
184368.686658363 4
 
1.1%
187119.948165892 4
 
1.1%
183897.0 3
 
0.8%
187952.560027898 3
 
0.8%
186501.233192961 3
 
0.8%
183280.682239227 3
 
0.8%
186899.241516278 3
 
0.8%
185894.50946257 2
 
0.6%
185590.150482304 2
 
0.6%
Other values (295) 314
87.0%
(Missing) 16
 
4.4%
ValueCountFrequency (%)
182924.505322144 1
0.3%
182959.976679854 1
0.3%
182965.529 1
0.3%
182974.850127567 1
0.3%
182983.398791677 1
0.3%
182984.891498757 1
0.3%
183039.924452506 1
0.3%
183046.486354734 1
0.3%
183090.805509245 1
0.3%
183116.294494255 1
0.3%
ValueCountFrequency (%)
189066.642961645 1
0.3%
189007.651555034 1
0.3%
188991.037609055 1
0.3%
188953.293071222 1
0.3%
188907.145870979 1
0.3%
188872.058097899 1
0.3%
188843.428976776 1
0.3%
188783.060728518 1
0.3%
188674.333972649 1
0.3%
188627.811125081 1
0.3%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct304
Distinct (%)88.1%
Missing16
Missing (%)4.4%
Infinite0
Infinite (%)0.0%
Mean450212.96
Minimum447316.21
Maximum452817.47
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-11T16:04:05.831403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum447316.21
5-th percentile447679.84
Q1449181.41
median450598.94
Q3451093.02
95-th percentile452116.45
Maximum452817.47
Range5501.2545
Interquartile range (IQR)1911.6162

Descriptive statistics

Standard deviation1358.6024
Coefficient of variation (CV)0.0030176884
Kurtosis-0.64041266
Mean450212.96
Median Absolute Deviation (MAD)860.11966
Skewness-0.52809073
Sum1.5532347 × 108
Variance1845800.6
MonotonicityNot monotonic
2024-05-11T16:04:06.031330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
450691.593173724 4
 
1.1%
450884.0 4
 
1.1%
451726.492157 4
 
1.1%
451681.0 3
 
0.8%
451485.251875922 3
 
0.8%
450983.093101522 3
 
0.8%
451093.022236256 3
 
0.8%
450938.0 3
 
0.8%
450562.020225978 3
 
0.8%
448782.56826907 2
 
0.6%
Other values (294) 313
86.7%
(Missing) 16
 
4.4%
ValueCountFrequency (%)
447316.214981355 1
0.3%
447363.383938393 1
0.3%
447377.82704763 1
0.3%
447378.904571819 1
0.3%
447380.297316673 1
0.3%
447381.645907113 1
0.3%
447385.475690118 1
0.3%
447468.836493019 1
0.3%
447507.871788551 1
0.3%
447532.217283307 1
0.3%
ValueCountFrequency (%)
452817.469477897 2
0.6%
452514.491608298 1
0.3%
452409.070679958 1
0.3%
452402.238297341 1
0.3%
452349.671029508 1
0.3%
452339.893234091 1
0.3%
452327.871674126 1
0.3%
452325.784336192 1
0.3%
452309.686583372 1
0.3%
452303.518832496 1
0.3%
Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
체력단련장업
224 
<NA>
137 

Length

Max length6
Median length6
Mean length5.2409972
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row체력단련장업
2nd row체력단련장업
3rd row체력단련장업
4th row체력단련장업
5th row체력단련장업

Common Values

ValueCountFrequency (%)
체력단련장업 224
62.0%
<NA> 137
38.0%

Length

2024-05-11T16:04:06.286050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:06.483245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체력단련장업 224
62.0%
na 137
38.0%
Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
사립
223 
<NA>
137 
공립
 
1

Length

Max length4
Median length2
Mean length2.7590028
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row사립
2nd row사립
3rd row사립
4th row사립
5th row사립

Common Values

ValueCountFrequency (%)
사립 223
61.8%
<NA> 137
38.0%
공립 1
 
0.3%

Length

2024-05-11T16:04:06.648668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:06.850620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사립 223
61.8%
na 137
38.0%
공립 1
 
0.3%

보험가입여부코드
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
<NA>
318 
0
41 
Y
 
2

Length

Max length4
Median length4
Mean length3.6426593
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row0
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 318
88.1%
0 41
 
11.4%
Y 2
 
0.6%

Length

2024-05-11T16:04:07.032033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:07.188362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 318
88.1%
0 41
 
11.4%
y 2
 
0.6%

지도자수
Categorical

Distinct4
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
<NA>
184 
1
128 
2
28 
0
21 

Length

Max length4
Median length4
Mean length2.5290859
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 184
51.0%
1 128
35.5%
2 28
 
7.8%
0 21
 
5.8%

Length

2024-05-11T16:04:07.354374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:07.519371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 184
51.0%
1 128
35.5%
2 28
 
7.8%
0 21
 
5.8%

건축물동수
Categorical

Distinct4
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
<NA>
265 
0
50 
1
44 
2
 
2

Length

Max length4
Median length4
Mean length3.2022161
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 265
73.4%
0 50
 
13.9%
1 44
 
12.2%
2 2
 
0.6%

Length

2024-05-11T16:04:07.653364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:07.795654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 265
73.4%
0 50
 
13.9%
1 44
 
12.2%
2 2
 
0.6%

건축물연면적
Real number (ℝ)

MISSING  ZEROS 

Distinct153
Distinct (%)79.7%
Missing169
Missing (%)46.8%
Infinite0
Infinite (%)0.0%
Mean13363.417
Minimum0
Maximum346224
Zeros25
Zeros (%)6.9%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-11T16:04:07.951035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1933.9825
median3194.09
Q314384.26
95-th percentile43857.008
Maximum346224
Range346224
Interquartile range (IQR)13450.278

Descriptive statistics

Standard deviation33382.606
Coefficient of variation (CV)2.4980592
Kurtosis55.032467
Mean13363.417
Median Absolute Deviation (MAD)2993.29
Skewness6.5083274
Sum2565776
Variance1.1143984 × 109
MonotonicityNot monotonic
2024-05-11T16:04:08.159504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 25
 
6.9%
2393.57 3
 
0.8%
14328.01 3
 
0.8%
20474.94 3
 
0.8%
132407.0 3
 
0.8%
23591.7 2
 
0.6%
6445.82 2
 
0.6%
659.83 2
 
0.6%
69465.86 2
 
0.6%
26217.02 2
 
0.6%
Other values (143) 145
40.2%
(Missing) 169
46.8%
ValueCountFrequency (%)
0.0 25
6.9%
99.36 1
 
0.3%
125.74 1
 
0.3%
307.72 1
 
0.3%
329.76 1
 
0.3%
385.99 1
 
0.3%
509.43 1
 
0.3%
563.97 1
 
0.3%
591.0 1
 
0.3%
631.59 1
 
0.3%
ValueCountFrequency (%)
346224.0 1
 
0.3%
151916.85 1
 
0.3%
132407.0 3
0.8%
106200.4 1
 
0.3%
69465.86 2
0.6%
59109.0 1
 
0.3%
44641.6 1
 
0.3%
43215.07 1
 
0.3%
38356.5 1
 
0.3%
31841.76 1
 
0.3%

회원모집총인원
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
<NA>
331 
0
 
30

Length

Max length4
Median length4
Mean length3.7506925
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 331
91.7%
0 30
 
8.3%

Length

2024-05-11T16:04:08.341105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T16:04:08.502313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 331
91.7%
0 30
 
8.3%

세부업종명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

법인명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing361
Missing (%)100.0%
Memory size3.3 KiB

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)문화체육업종명공사립구분명보험가입여부코드지도자수건축물동수건축물연면적회원모집총인원세부업종명법인명
03150000CDFH330106198900000119891206<NA>3폐업3폐업20120817<NA><NA><NA>662-2085<NA>157833서울특별시 강서구 내발산동 701-1번지서울특별시 강서구 강서로 299 (내발산동)<NA>코리아헬스크럽2012-08-17 14:58:18I2018-08-31 23:59:59.0<NA>185437.626964450042.953948체력단련장업사립<NA>000.0<NA><NA><NA>
13150000CDFH330106198900000219891211<NA>4취소/말소/만료/정지/중지35직권말소20130527<NA><NA><NA>663-1064<NA>157853서울특별시 강서구 방화동 619-7번지서울특별시 강서구 개화동로27가길 33 (방화동)<NA>국제헬스크럽2013-05-28 14:51:11I2018-08-31 23:59:59.0<NA>182924.505322451175.952499체력단련장업사립<NA><NA>00.0<NA><NA><NA>
23150000CDFH330106198900000319891219<NA>3폐업3폐업20130429<NA><NA><NA>663-7696<NA>157811서울특별시 강서구 공항동 22-24번지<NA><NA>공항헬스크럽2013-04-29 14:52:18I2018-08-31 23:59:59.0<NA>183245.773853451320.634145체력단련장업사립0<NA><NA><NA><NA><NA><NA>
33150000CDFH330106198900000419891230<NA>4취소/말소/만료/정지/중지35직권말소20130527<NA><NA><NA>2605-0088<NA>157884서울특별시 강서구 화곡동 373-14번지서울특별시 강서구 월정로30길 27 (화곡동)<NA>원헬스크럽2013-05-28 14:56:50I2018-08-31 23:59:59.0<NA>185565.926535448026.216712체력단련장업사립<NA><NA><NA><NA><NA><NA><NA>
43150000CDFH330106198900000519891230<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 화곡동 359-54번지서울특별시 강서구 가로공원로76길 100 (화곡동)7766백제헬스크럽2020-05-20 19:36:30U2020-05-22 02:40:00.0<NA>185684.370684447853.18391체력단련장업사립<NA><NA>00.0<NA><NA><NA>
53150000CDFH330106199100000119910614<NA>3폐업3폐업20090615<NA><NA><NA>642-6993<NA>157898서울특별시 강서구 화곡동 800-1번지서울특별시 강서구 곰달래로53길 19 (화곡동)<NA>올림피아헬스크럽2009-06-15 13:33:36I2018-08-31 23:59:59.0<NA>187515.980613447841.752021체력단련장업사립<NA>000.0<NA><NA><NA>
63150000CDFH330106199100000219910821<NA>1영업/정상13영업중<NA><NA><NA><NA>2602-9184<NA>157872서울특별시 강서구 화곡동 110-65 외2필지 3층서울특별시 강서구 화곡로 206, 3층 (화곡동)7721헬스와 필라테스 화곡점2021-05-04 10:32:17U2021-05-06 02:40:00.0<NA>186125.314508449082.323326체력단련장업사립<NA>1<NA>2272.6<NA><NA><NA>
73150000CDFH330106199200000119920422<NA>3폐업3폐업20010703<NA><NA><NA>663-7898<NA>157847서울특별시 강서구 방화동 285-14번지서울특별시 강서구 양천로24가길 18 (방화동)<NA>강동헬스2003-04-18 11:50:37I2018-08-31 23:59:59.0<NA>183548.293703452303.518832체력단련장업사립<NA>000.0<NA><NA><NA>
83150000CDFH330106199200000219920725<NA>3폐업3폐업20010702<NA><NA><NA>661-7539<NA>157210서울특별시 강서구 마곡동 359-3번지서울특별시 강서구 양천로30길 123 (마곡동)<NA>항우헬스크럽2003-04-18 11:50:37I2018-08-31 23:59:59.0<NA>184300.877643451745.818462체력단련장업사립<NA>000.0<NA><NA><NA>
93150000CDFH330106199300000119930726<NA>3폐업3폐업20050929<NA><NA><NA>666-3459<NA>157852서울특별시 강서구 방화동 609-30번지서울특별시 강서구 방화동로 56 (방화동)<NA>라이온헬스크럽2005-09-29 15:55:19I2018-08-31 23:59:59.0<NA>183346.939918451463.798073체력단련장업사립0<NA><NA><NA><NA><NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)문화체육업종명공사립구분명보험가입여부코드지도자수건축물동수건축물연면적회원모집총인원세부업종명법인명
3513150000CDFH33010620230000212023-12-27<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 화곡동 1032-23 3층서울특별시 강서구 강서로 199, 3층 (화곡동)7707밸런스 222023-12-27 15:17:08I2022-11-01 22:09:00.0<NA>185620.516917449062.149052<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3523150000CDFH33010620240000012024-01-18<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 등촌동 701 미주진로아파트서울특별시 강서구 공항대로 351-21, 지층 비101호 (등촌동, 미주진로아파트)7587스토리짐2024-01-18 08:07:15I2023-11-30 22:00:00.0<NA>186414.512867450851.460122<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3533150000CDFH33010620240000022024-01-23<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 등촌동 683 인방빌딩서울특별시 강서구 강서로68길 12, 인방빌딩 201호 (등촌동)7582운동독립스쿨 PT 스튜디오2024-01-23 17:39:42I2023-11-30 22:05:00.0<NA>185932.035421451560.536862<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3543150000CDFH33010620240000032024-02-27<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 염창동 252-16 대영빌딩서울특별시 강서구 양천로 713, 대영빌딩 4층 전체 및 5층 일부호 (염창동)7540바른헬스2024-02-27 18:47:06I2023-12-01 22:09:00.0<NA>188907.145871449738.825007<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3553150000CDFH33010620240000042024-03-11<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 화곡동 343-6 평인빌딩 지층서울특별시 강서구 강서로 47-8, 평인빌딩 지층 (화곡동)7774나홀로짐에 까치산역점2024-03-11 18:13:02I2023-12-02 23:03:00.0<NA>186308.028991447763.968747<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3563150000CDFH33010620240000052024-04-04<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 마곡동 800-5 210호서울특별시 강서구 공항대로 228, 210호 (마곡동)7806짐올레2024-04-04 08:44:44I2023-12-04 00:07:00.0<NA>185148.900295450719.380288<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3573150000CDFH33010620240000062024-04-12<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 화곡동 1102-9 성진빌딩 1층서울특별시 강서구 화곡로59길 73, 성진빌딩 1층 (화곡동)7650핏밀리 강서화곡점2024-04-12 11:45:53I2023-12-03 23:05:00.0<NA>186491.528412450167.375002<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3583150000CDFH33010620240000072024-04-12<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 염창동 295 강변성원아파트 상가동 B101호서울특별시 강서구 양천로69길 65, 상가동 B101호 (염창동, 강변성원아파트)7535골드마운틴 짐2024-04-12 13:20:51I2023-12-03 23:05:00.0<NA>188627.811125450375.417027<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3593150000CDFH33010620240000082024-04-22<NA>1영업/정상13영업중<NA><NA><NA><NA><NA><NA><NA>서울특별시 강서구 내발산동 647 DH647 B01호서울특별시 강서구 강서로52길 43, B01호 (내발산동, DH647)7646레디짐2024-04-22 16:02:52I2023-12-03 22:05:00.0<NA>185840.838378450639.667176<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3603150000CDFH33010620240000092024-04-29<NA>1영업/정상13영업중<NA><NA><NA><NA>02-3661-0420<NA><NA>서울특별시 강서구 염창동 312 센터스퀘어 B102호서울특별시 강서구 공항대로 543, B102~B106/B109~B111호 (염창동, 센터스퀘어)7562에이블짐 등촌역점2024-04-29 14:52:22I2023-12-05 00:01:00.0<NA>188123.970482449841.957613<NA><NA><NA><NA><NA><NA><NA><NA><NA>