Overview

Dataset statistics

Number of variables: 22
Number of observations: 7859
Missing cells: 56477
Missing cells (%): 32.7%
Duplicate rows: 3
Duplicate rows (%): < 0.1%
Total size in memory: 1.4 MiB
Average record size in memory: 184.0 B
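
The missing-cell percentage follows from the other figures in this table. A minimal sketch of that arithmetic, using only the numbers reported here (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 22
n_observations = 7859
missing_cells = 56477

# The total number of cells is rows times columns.
total_cells = n_variables * n_observations

# Share of cells that are missing, expressed as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

The result should agree with the reported "Missing cells (%)" value up to rounding.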

Variable types

Categorical: 7
Text: 7
DateTime: 1
Unsupported: 2
Numeric: 5

Alerts

Dataset has 3 (< 0.1%) duplicate rows [Duplicates]
영업상태구분코드 is highly imbalanced (73.7%) [Imbalance]
업태구분명정보 is highly imbalanced (76.4%) [Imbalance]
문화체육업종명 is highly imbalanced (99.5%) [Imbalance]
공사립구분명 is highly imbalanced (98.7%) [Imbalance]
보험가입여부코드 is highly imbalanced (98.0%) [Imbalance]
인허가취소일자 has 7859 (100.0%) missing values [Missing]
폐업일자 has 5085 (64.7%) missing values [Missing]
소재지시설전화번호 has 7322 (93.2%) missing values [Missing]
소재지면적정보 has 7859 (100.0%) missing values [Missing]
도로명우편번호 has 6828 (86.9%) missing values [Missing]
소재지도로명주소 has 436 (5.5%) missing values [Missing]
WGS84위도 has 261 (3.3%) missing values [Missing]
WGS84경도 has 261 (3.3%) missing values [Missing]
X좌표값 has 6831 (86.9%) missing values [Missing]
Y좌표값 has 6831 (86.9%) missing values [Missing]
회원모집총인원 has 6881 (87.6%) missing values [Missing]
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
소재지면적정보 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
회원모집총인원 has 969 (12.3%) zeros [Zeros]
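
The imbalance percentages in these alerts are consistent with a normalised-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of distinct classes. The sketch below assumes that definition (it is inferred from the reported figures, not quoted from the report) and applies it to the value counts listed in the Common Values tables for 영업상태구분코드 and 문화체육업종명. `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log2(k) for a list of class counts (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # A single class has no defined entropy ratio; treat it as balanced.
        return 0.0
    # Shannon entropy in bits over the observed class frequencies.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts copied from the report's value tables (6 classes and 2 classes).
status_code_counts = [6761, 957, 103, 34, 3, 1]
industry_counts = [7856, 3]

status_score = imbalance_score(status_code_counts)
industry_score = imbalance_score(industry_counts)
print(f"영업상태구분코드: {status_score:.3f}")
print(f"문화체육업종명: {industry_score:.3f}")
```

Under this assumption the two scores should land close to the 73.7% and 99.5% shown in the alerts. A score near 1 means a single class dominates the column.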

Reproduction

Analysis started: 2023-12-10 21:16:45.855258
Analysis finished: 2023-12-10 21:16:47.742852
Duration: 1.89 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군명
Categorical

Distinct: 31
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

고양시: 701
수원시: 660
용인시: 578
성남시: 550
부천시: 541
Other values (26): 4829

Length

Max length: 4
Median length: 3
Mean length: 3.1005217
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
고양시 | 701 | 8.9%
수원시 | 660 | 8.4%
용인시 | 578 | 7.4%
성남시 | 550 | 7.0%
부천시 | 541 | 6.9%
안산시 | 473 | 6.0%
남양주시 | 437 | 5.6%
화성시 | 436 | 5.5%
시흥시 | 320 | 4.1%
안양시 | 317 | 4.0%
Other values (21) | 2846 | 36.2%

Length

[Figure: Histogram of lengths of the category]
Distinct: 6436
Distinct (%): 81.9%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

Length

Max length: 55
Median length: 30
Mean length: 8.4115027
Min length: 1

Characters and Unicode

Total characters: 66106
Distinct characters: 655
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
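
As a rough illustration of that statement, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for one of the sample values shown in this report. Script and block properties are not available from the standard library, so only categories are counted, and `category_counts` is a hypothetical helper name.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories (for example 'Lo' or 'Zs') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample rows from this text variable.
sample = "해동검도 가평본관"
counts = category_counts(sample)

for category, n in counts.most_common():
    print(category, n)
```

Here `Lo` corresponds to the report's "Other Letter" category (which includes Hangul syllables) and `Zs` to "Space Separator".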

Unique

Unique: 5469
Unique (%): 69.6%

Sample

1st row: 크로스나인
2nd row: 가평유도체육관
3rd row: 충효체육관
4th row: 혜성운암태권도장
5th row: 해동검도 가평본관
Value | Count | Frequency (%)
태권도장 | 1030 | 8.1%
태권도 | 637 | 5.0%
용인대 | 521 | 4.1%
경희대 | 373 | 2.9%
체육관 | 231 | 1.8%
석사 | 122 | 1.0%
국가대표 | 98 | 0.8%
한국체대 | 94 | 0.7%
복싱 | 82 | 0.6%
복싱클럽 | 43 | 0.3%
Other values (5902) | 9429 | 74.5%

Most occurring characters

ValueCountFrequency (%)
6129
 
9.3%
5434
 
8.2%
5359
 
8.1%
4807
 
7.3%
2903
 
4.4%
2702
 
4.1%
2238
 
3.4%
1881
 
2.8%
1745
 
2.6%
1207
 
1.8%
Other values (645) 31701
48.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 58429 | 88.4%
Space Separator | 4807 | 7.3%
Uppercase Letter | 1717 | 2.6%
Lowercase Letter | 465 | 0.7%
Close Punctuation | 168 | 0.3%
Open Punctuation | 168 | 0.3%
Other Punctuation | 167 | 0.3%
Decimal Number | 146 | 0.2%
Dash Punctuation | 37 | 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6129
 
10.5%
5434
 
9.3%
5359
 
9.2%
2903
 
5.0%
2702
 
4.6%
2238
 
3.8%
1881
 
3.2%
1745
 
3.0%
1207
 
2.1%
1188
 
2.0%
Other values (573) 27643
47.3%
Uppercase Letter
ValueCountFrequency (%)
T 244
14.2%
M 211
12.3%
A 162
 
9.4%
K 146
 
8.5%
S 112
 
6.5%
G 89
 
5.2%
J 81
 
4.7%
Y 76
 
4.4%
C 63
 
3.7%
E 62
 
3.6%
Other values (15) 471
27.4%
Lowercase Letter
ValueCountFrequency (%)
s 52
11.2%
e 51
11.0%
i 45
9.7%
o 34
 
7.3%
m 32
 
6.9%
t 32
 
6.9%
r 32
 
6.9%
n 29
 
6.2%
a 29
 
6.2%
k 21
 
4.5%
Other values (12) 108
23.2%
Decimal Number
ValueCountFrequency (%)
2 56
38.4%
1 40
27.4%
3 24
16.4%
8 10
 
6.8%
7 5
 
3.4%
5 4
 
2.7%
4 3
 
2.1%
0 3
 
2.1%
9 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 65
38.9%
& 53
31.7%
, 21
 
12.6%
' 15
 
9.0%
6
 
3.6%
· 3
 
1.8%
: 3
 
1.8%
/ 1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 167
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 167
99.4%
[ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
4807
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58407
88.4%
Common 5495
 
8.3%
Latin 2182
 
3.3%
Han 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6129
 
10.5%
5434
 
9.3%
5359
 
9.2%
2903
 
5.0%
2702
 
4.6%
2238
 
3.8%
1881
 
3.2%
1745
 
3.0%
1207
 
2.1%
1188
 
2.0%
Other values (559) 27621
47.3%
Latin
ValueCountFrequency (%)
T 244
 
11.2%
M 211
 
9.7%
A 162
 
7.4%
K 146
 
6.7%
S 112
 
5.1%
G 89
 
4.1%
J 81
 
3.7%
Y 76
 
3.5%
C 63
 
2.9%
E 62
 
2.8%
Other values (37) 936
42.9%
Common
ValueCountFrequency (%)
4807
87.5%
) 167
 
3.0%
( 167
 
3.0%
. 65
 
1.2%
2 56
 
1.0%
& 53
 
1.0%
1 40
 
0.7%
- 37
 
0.7%
3 24
 
0.4%
, 21
 
0.4%
Other values (15) 58
 
1.1%
Han
ValueCountFrequency (%)
6
27.3%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (4) 4
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58407
88.4%
ASCII 7668
 
11.6%
CJK 22
 
< 0.1%
None 9
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6129
 
10.5%
5434
 
9.3%
5359
 
9.2%
2903
 
5.0%
2702
 
4.6%
2238
 
3.8%
1881
 
3.2%
1745
 
3.0%
1207
 
2.1%
1188
 
2.0%
Other values (559) 27621
47.3%
ASCII
ValueCountFrequency (%)
4807
62.7%
T 244
 
3.2%
M 211
 
2.8%
) 167
 
2.2%
( 167
 
2.2%
A 162
 
2.1%
K 146
 
1.9%
S 112
 
1.5%
G 89
 
1.2%
J 81
 
1.1%
Other values (60) 1482
 
19.3%
None
ValueCountFrequency (%)
6
66.7%
· 3
33.3%
CJK
ValueCountFrequency (%)
6
27.3%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (4) 4
18.2%
Distinct: 4341
Distinct (%): 55.2%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB
Minimum: 1979-08-06 00:00:00
Maximum: 2023-11-30 00:00:00
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 7859
Missing (%): 100.0%
Memory size: 69.2 KiB

영업상태구분코드
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

<NA>: 6761
13: 957
3: 103
35: 34
2: 3
3

Length

Max length: 4
Median length: 4
Mean length: 3.7070874
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 13
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 6761 | 86.0%
13 | 957 | 12.2%
3 | 103 | 1.3%
35 | 34 | 0.4%
2 | 3 | < 0.1%
15 | 1 | < 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

영업상태명
Categorical

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

운영중: 4111
폐업 등: 2641
영업중: 957
폐업: 103
직권말소: 34
Other values (3): 13

Length

Max length: 4
Median length: 3
Mean length: 3.3279043
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 영업중
2nd row: 운영중
3rd row: 운영중
4th row: 운영중
5th row: 운영중

Common Values

Value | Count | Frequency (%)
운영중 | 4111 | 52.3%
폐업 등 | 2641 | 33.6%
영업중 | 957 | 12.2%
폐업 | 103 | 1.3%
직권말소 | 34 | 0.4%
휴업 등 | 9 | 0.1%
휴업 | 3 | < 0.1%
전출 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
운영중 4111
39.1%
폐업 2744
26.1%
2650
25.2%
영업중 957
 
9.1%
직권말소 34
 
0.3%
휴업 12
 
0.1%
전출 1
 
< 0.1%

폐업일자
Text

MISSING 

Distinct: 1890
Distinct (%): 68.1%
Missing: 5085
Missing (%): 64.7%
Memory size: 61.5 KiB

Length

Max length: 10
Median length: 8
Mean length: 8.0994953
Min length: 8

Characters and Unicode

Total characters: 22468
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1361
Unique (%): 49.1%

Sample

1st row: 20170811
2nd row: 20130121
3rd row: 20170123
4th row: 20110803
5th row: 20180130
Value | Count | Frequency (%)
20180604 | 33 | 1.2%
20100201 | 21 | 0.8%
20160425 | 17 | 0.6%
20100311 | 12 | 0.4%
2023-04-28 | 12 | 0.4%
20160215 | 11 | 0.4%
20030524 | 11 | 0.4%
20130625 | 10 | 0.4%
20180523 | 10 | 0.4%
2023-08-16 | 10 | 0.4%
Other values (1880) | 2627 | 94.7%
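
The frequency table for 폐업일자 mixes two date spellings, compact `YYYYMMDD` and dashed `YYYY-MM-DD`, which is presumably why the column is profiled as Text rather than DateTime. A minimal sketch of normalising both forms with the standard library follows. `parse_closure_date` is a hypothetical helper, and the assumption that these are the only two formats rests on the length and character statistics reported for this variable (lengths 8 to 10, digits and dashes only).

```python
from datetime import date, datetime

# Assumed formats: compact and dashed year-month-day.
FORMATS = ("%Y%m%d", "%Y-%m-%d")


def parse_closure_date(raw):
    """Parse a closure-date string in either assumed format, or return None."""
    if not raw:
        return None
    for fmt in FORMATS:
        try:
            return datetime.strptime(raw, fmt).date()
        except ValueError:
            # Not this format; try the next one.
            continue
    return None


# Values taken from the frequency table for this variable.
parsed = [parse_closure_date(v) for v in ("20180604", "2023-04-28")]
print(parsed)
```

Returning `None` for unparseable input keeps missing and malformed values distinguishable from valid dates without raising.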

Most occurring characters

ValueCountFrequency (%)
0 7395
32.9%
2 4690
20.9%
1 4121
18.3%
3 1087
 
4.8%
6 875
 
3.9%
4 852
 
3.8%
9 802
 
3.6%
7 794
 
3.5%
8 788
 
3.5%
5 788
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22192
98.8%
Dash Punctuation 276
 
1.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7395
33.3%
2 4690
21.1%
1 4121
18.6%
3 1087
 
4.9%
6 875
 
3.9%
4 852
 
3.8%
9 802
 
3.6%
7 794
 
3.6%
8 788
 
3.6%
5 788
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 276
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22468
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7395
32.9%
2 4690
20.9%
1 4121
18.3%
3 1087
 
4.8%
6 875
 
3.9%
4 852
 
3.8%
9 802
 
3.6%
7 794
 
3.5%
8 788
 
3.5%
5 788
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22468
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7395
32.9%
2 4690
20.9%
1 4121
18.3%
3 1087
 
4.8%
6 875
 
3.9%
4 852
 
3.8%
9 802
 
3.6%
7 794
 
3.5%
8 788
 
3.5%
5 788
 
3.5%
Distinct: 531
Distinct (%): 98.9%
Missing: 7322
Missing (%): 93.2%
Memory size: 61.5 KiB

Length

Max length17
Median length12
Mean length11.242086
Min length7

Characters and Unicode

Total characters6037
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique525 ?
Unique (%)97.8%

Sample

1st row969-8123
2nd row962-8722
3rd row031-917-5401
4th row031-964-1120
5th row031-974-6520
ValueCountFrequency (%)
02-476-8666 2
 
0.4%
031-206-2229 2
 
0.4%
031-382-2941 2
 
0.4%
031-577-6322 2
 
0.4%
031-447-7203 2
 
0.4%
02-354-1191 2
 
0.4%
031-616-7777 1
 
0.2%
419-7987 1
 
0.2%
031-407-1376 1
 
0.2%
031-485-9695 1
 
0.2%
Other values (521) 521
97.0%

Most occurring characters

ValueCountFrequency (%)
- 944
15.6%
3 831
13.8%
0 787
13.0%
1 773
12.8%
2 475
7.9%
7 474
7.9%
6 378
6.3%
8 371
 
6.1%
5 370
 
6.1%
9 342
 
5.7%
Other values (4) 292
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5087
84.3%
Dash Punctuation 944
 
15.6%
Close Punctuation 4
 
0.1%
Other Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 831
16.3%
0 787
15.5%
1 773
15.2%
2 475
9.3%
7 474
9.3%
6 378
7.4%
8 371
7.3%
5 370
7.3%
9 342
6.7%
4 286
 
5.6%
Dash Punctuation
ValueCountFrequency (%)
- 944
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6037
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 944
15.6%
3 831
13.8%
0 787
13.0%
1 773
12.8%
2 475
7.9%
7 474
7.9%
6 378
6.3%
8 371
 
6.1%
5 370
 
6.1%
9 342
 
5.7%
Other values (4) 292
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6037
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 944
15.6%
3 831
13.8%
0 787
13.0%
1 773
12.8%
2 475
7.9%
7 474
7.9%
6 378
6.3%
8 371
 
6.1%
5 370
 
6.1%
9 342
 
5.7%
Other values (4) 292
 
4.8%

소재지면적정보
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 7859
Missing (%): 100.0%
Memory size: 69.2 KiB

도로명우편번호
Text

MISSING 

Distinct: 738
Distinct (%): 71.6%
Missing: 6828
Missing (%): 86.9%
Memory size: 61.5 KiB

Length

Max length7
Median length5
Mean length5.0368574
Min length5

Characters and Unicode

Total characters5193
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique556 ?
Unique (%)53.9%

Sample

1st row12437
2nd row10275
3rd row10508
4th row10497
5th row10387
ValueCountFrequency (%)
15010 9
 
0.9%
10551 9
 
0.9%
10546 6
 
0.6%
11473 6
 
0.6%
12771 6
 
0.6%
11444 5
 
0.5%
16706 5
 
0.5%
12248 5
 
0.5%
10888 5
 
0.5%
18472 4
 
0.4%
Other values (728) 971
94.2%

Most occurring characters

ValueCountFrequency (%)
1 1398
26.9%
0 512
 
9.9%
4 448
 
8.6%
8 446
 
8.6%
2 430
 
8.3%
6 421
 
8.1%
5 416
 
8.0%
3 405
 
7.8%
7 401
 
7.7%
9 297
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5174
99.6%
Dash Punctuation 19
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1398
27.0%
0 512
 
9.9%
4 448
 
8.7%
8 446
 
8.6%
2 430
 
8.3%
6 421
 
8.1%
5 416
 
8.0%
3 405
 
7.8%
7 401
 
7.8%
9 297
 
5.7%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5193
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1398
26.9%
0 512
 
9.9%
4 448
 
8.6%
8 446
 
8.6%
2 430
 
8.3%
6 421
 
8.1%
5 416
 
8.0%
3 405
 
7.8%
7 401
 
7.7%
9 297
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5193
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1398
26.9%
0 512
 
9.9%
4 448
 
8.6%
8 446
 
8.6%
2 430
 
8.3%
6 421
 
8.1%
5 416
 
8.0%
3 405
 
7.8%
7 401
 
7.7%
9 297
 
5.7%
Distinct: 6820
Distinct (%): 91.9%
Missing: 436
Missing (%): 5.5%
Memory size: 61.5 KiB

Length

Max length68
Median length56
Mean length31.409403
Min length13

Characters and Unicode

Total characters233152
Distinct characters599
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6246 ?
Unique (%)84.1%

Sample

1st row경기도 가평군 조종면 청군로 1292, 2층
2nd row경기도 가평군 가평읍 오리나무길 33
3rd row경기도 가평군 가평읍 석봉로 168
4th row경기도 가평군 가평읍 석봉로153번길 12
5th row경기도 가평군 가평읍 보납로 1
ValueCountFrequency (%)
경기도 7423
 
15.4%
2층 717
 
1.5%
고양시 663
 
1.4%
수원시 634
 
1.3%
용인시 545
 
1.1%
성남시 534
 
1.1%
3층 524
 
1.1%
부천시 523
 
1.1%
안산시 443
 
0.9%
남양주시 426
 
0.9%
Other values (8061) 35641
74.1%

Most occurring characters

ValueCountFrequency (%)
42195
 
18.1%
7945
 
3.4%
7885
 
3.4%
7829
 
3.4%
7775
 
3.3%
7730
 
3.3%
7090
 
3.0%
, 6929
 
3.0%
1 6913
 
3.0%
( 6536
 
2.8%
Other values (589) 124325
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 130725
56.1%
Space Separator 42195
 
18.1%
Decimal Number 38398
 
16.5%
Other Punctuation 6991
 
3.0%
Open Punctuation 6536
 
2.8%
Close Punctuation 6536
 
2.8%
Dash Punctuation 1124
 
0.5%
Uppercase Letter 364
 
0.2%
Math Symbol 217
 
0.1%
Lowercase Letter 58
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7945
 
6.1%
7885
 
6.0%
7829
 
6.0%
7775
 
5.9%
7730
 
5.9%
7090
 
5.4%
3483
 
2.7%
2991
 
2.3%
2828
 
2.2%
2707
 
2.1%
Other values (523) 72462
55.4%
Uppercase Letter
ValueCountFrequency (%)
B 107
29.4%
A 63
17.3%
C 17
 
4.7%
S 17
 
4.7%
G 16
 
4.4%
L 16
 
4.4%
E 16
 
4.4%
P 15
 
4.1%
K 15
 
4.1%
T 13
 
3.6%
Other values (13) 69
19.0%
Lowercase Letter
ValueCountFrequency (%)
e 24
41.4%
l 9
 
15.5%
a 4
 
6.9%
h 4
 
6.9%
p 3
 
5.2%
t 3
 
5.2%
s 2
 
3.4%
o 2
 
3.4%
b 1
 
1.7%
k 1
 
1.7%
Other values (5) 5
 
8.6%
Decimal Number
ValueCountFrequency (%)
1 6913
18.0%
2 6420
16.7%
0 5033
13.1%
3 4926
12.8%
4 3792
9.9%
5 3138
8.2%
6 2403
 
6.3%
7 2126
 
5.5%
8 1918
 
5.0%
9 1729
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 6929
99.1%
. 41
 
0.6%
@ 9
 
0.1%
& 6
 
0.1%
/ 3
 
< 0.1%
· 2
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
50.0%
2
25.0%
1
 
12.5%
1
 
12.5%
Math Symbol
ValueCountFrequency (%)
~ 214
98.6%
2
 
0.9%
+ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
42195
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6536
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6536
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1124
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 130724
56.1%
Common 101997
43.7%
Latin 430
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7945
 
6.1%
7885
 
6.0%
7829
 
6.0%
7775
 
5.9%
7730
 
5.9%
7090
 
5.4%
3483
 
2.7%
2991
 
2.3%
2828
 
2.2%
2707
 
2.1%
Other values (522) 72461
55.4%
Latin
ValueCountFrequency (%)
B 107
24.9%
A 63
14.7%
e 24
 
5.6%
C 17
 
4.0%
S 17
 
4.0%
G 16
 
3.7%
L 16
 
3.7%
E 16
 
3.7%
P 15
 
3.5%
K 15
 
3.5%
Other values (32) 124
28.8%
Common
ValueCountFrequency (%)
42195
41.4%
, 6929
 
6.8%
1 6913
 
6.8%
( 6536
 
6.4%
) 6536
 
6.4%
2 6420
 
6.3%
0 5033
 
4.9%
3 4926
 
4.8%
4 3792
 
3.7%
5 3138
 
3.1%
Other values (14) 9579
 
9.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 130724
56.1%
ASCII 102414
43.9%
Number Forms 8
 
< 0.1%
None 3
 
< 0.1%
Math Operators 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42195
41.2%
, 6929
 
6.8%
1 6913
 
6.8%
( 6536
 
6.4%
) 6536
 
6.4%
2 6420
 
6.3%
0 5033
 
4.9%
3 4926
 
4.8%
4 3792
 
3.7%
5 3138
 
3.1%
Other values (49) 9996
 
9.8%
Hangul
ValueCountFrequency (%)
7945
 
6.1%
7885
 
6.0%
7829
 
6.0%
7775
 
5.9%
7730
 
5.9%
7090
 
5.4%
3483
 
2.7%
2991
 
2.3%
2828
 
2.2%
2707
 
2.1%
Other values (522) 72461
55.4%
Number Forms
ValueCountFrequency (%)
4
50.0%
2
25.0%
1
 
12.5%
1
 
12.5%
Math Operators
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct: 7557
Distinct (%): 96.2%
Missing: 1
Missing (%): < 0.1%
Memory size: 61.5 KiB

Length

Max length60
Median length52
Mean length26.880122
Min length10

Characters and Unicode

Total characters211224
Distinct characters562
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7278 ?
Unique (%)92.6%

Sample

1st row경기도 가평군 조종면 현리 296-8
2nd row경기도 가평군 가평읍 대곡리 173-7번지
3rd row경기도 가평군 가평읍 읍내리 493-6번지
4th row경기도 가평군 가평읍 대곡리 285-9번지
5th row경기도 가평군 가평읍 읍내리 506-6번지
ValueCountFrequency (%)
경기도 7858
 
17.6%
고양시 701
 
1.6%
2층 663
 
1.5%
수원시 660
 
1.5%
용인시 578
 
1.3%
3층 554
 
1.2%
성남시 550
 
1.2%
부천시 541
 
1.2%
안산시 473
 
1.1%
남양주시 437
 
1.0%
Other values (9699) 31612
70.8%

Most occurring characters

ValueCountFrequency (%)
37670
 
17.8%
8221
 
3.9%
8208
 
3.9%
8126
 
3.8%
8038
 
3.8%
7919
 
3.7%
1 7730
 
3.7%
7665
 
3.6%
6577
 
3.1%
2 6432
 
3.0%
Other values (552) 104638
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122775
58.1%
Decimal Number 43410
 
20.6%
Space Separator 37670
 
17.8%
Dash Punctuation 5643
 
2.7%
Other Punctuation 774
 
0.4%
Uppercase Letter 378
 
0.2%
Close Punctuation 192
 
0.1%
Open Punctuation 192
 
0.1%
Math Symbol 135
 
0.1%
Lowercase Letter 47
 
< 0.1%
Other values (2) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8221
 
6.7%
8208
 
6.7%
8126
 
6.6%
8038
 
6.5%
7919
 
6.5%
7665
 
6.2%
6577
 
5.4%
3575
 
2.9%
2799
 
2.3%
2459
 
2.0%
Other values (487) 59188
48.2%
Uppercase Letter
ValueCountFrequency (%)
B 100
26.5%
A 84
22.2%
L 24
 
6.3%
S 23
 
6.1%
P 23
 
6.1%
T 17
 
4.5%
K 17
 
4.5%
C 15
 
4.0%
G 13
 
3.4%
M 10
 
2.6%
Other values (15) 52
13.8%
Lowercase Letter
ValueCountFrequency (%)
e 17
36.2%
l 8
17.0%
a 8
17.0%
c 4
 
8.5%
b 2
 
4.3%
s 2
 
4.3%
p 1
 
2.1%
o 1
 
2.1%
z 1
 
2.1%
h 1
 
2.1%
Other values (2) 2
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 7730
17.8%
2 6432
14.8%
3 5234
12.1%
0 5173
11.9%
4 4152
9.6%
5 3758
8.7%
6 3100
7.1%
7 2963
 
6.8%
8 2565
 
5.9%
9 2303
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 662
85.5%
. 57
 
7.4%
@ 38
 
4.9%
/ 10
 
1.3%
& 4
 
0.5%
· 2
 
0.3%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Math Symbol
ValueCountFrequency (%)
~ 133
98.5%
2
 
1.5%
Space Separator
ValueCountFrequency (%)
37670
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5643
100.0%
Close Punctuation
ValueCountFrequency (%)
) 192
100.0%
Open Punctuation
ValueCountFrequency (%)
( 192
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122774
58.1%
Common 88017
41.7%
Latin 432
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8221
 
6.7%
8208
 
6.7%
8126
 
6.6%
8038
 
6.5%
7919
 
6.5%
7665
 
6.2%
6577
 
5.4%
3575
 
2.9%
2799
 
2.3%
2459
 
2.0%
Other values (486) 59187
48.2%
Latin
ValueCountFrequency (%)
B 100
23.1%
A 84
19.4%
L 24
 
5.6%
S 23
 
5.3%
P 23
 
5.3%
T 17
 
3.9%
e 17
 
3.9%
K 17
 
3.9%
C 15
 
3.5%
G 13
 
3.0%
Other values (31) 99
22.9%
Common
ValueCountFrequency (%)
37670
42.8%
1 7730
 
8.8%
2 6432
 
7.3%
- 5643
 
6.4%
3 5234
 
5.9%
0 5173
 
5.9%
4 4152
 
4.7%
5 3758
 
4.3%
6 3100
 
3.5%
7 2963
 
3.4%
Other values (14) 6162
 
7.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 122773
58.1%
ASCII 88436
41.9%
Number Forms 7
 
< 0.1%
None 3
 
< 0.1%
Math Operators 2
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37670
42.6%
1 7730
 
8.7%
2 6432
 
7.3%
- 5643
 
6.4%
3 5234
 
5.9%
0 5173
 
5.8%
4 4152
 
4.7%
5 3758
 
4.2%
6 3100
 
3.5%
7 2963
 
3.4%
Other values (47) 6581
 
7.4%
Hangul
ValueCountFrequency (%)
8221
 
6.7%
8208
 
6.7%
8126
 
6.6%
8038
 
6.5%
7919
 
6.5%
7665
 
6.2%
6577
 
5.4%
3575
 
2.9%
2799
 
2.3%
2459
 
2.0%
Other values (485) 59186
48.2%
Number Forms
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Math Operators
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct: 3237
Distinct (%): 41.3%
Missing: 22
Missing (%): 0.3%
Memory size: 61.5 KiB

Length

Max length7
Median length6
Mean length5.6627536
Min length5

Characters and Unicode

Total characters44379
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1618 ?
Unique (%)20.6%

Sample

1st row12437
2nd row477804
3rd row477801
4th row477804
5th row12413
ValueCountFrequency (%)
445160 31
 
0.4%
410831 27
 
0.3%
445360 25
 
0.3%
472901 24
 
0.3%
482060 21
 
0.3%
15010 21
 
0.3%
415060 21
 
0.3%
482050 19
 
0.2%
472865 18
 
0.2%
443400 18
 
0.2%
Other values (3227) 7612
97.1%

Most occurring characters

ValueCountFrequency (%)
4 8497
19.1%
1 6775
15.3%
8 5380
12.1%
0 4956
11.2%
2 4455
10.0%
3 3430
7.7%
6 3072
 
6.9%
5 3042
 
6.9%
7 2599
 
5.9%
9 1957
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44163
99.5%
Dash Punctuation 216
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 8497
19.2%
1 6775
15.3%
8 5380
12.2%
0 4956
11.2%
2 4455
10.1%
3 3430
7.8%
6 3072
 
7.0%
5 3042
 
6.9%
7 2599
 
5.9%
9 1957
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 216
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 44379
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 8497
19.1%
1 6775
15.3%
8 5380
12.1%
0 4956
11.2%
2 4455
10.0%
3 3430
7.7%
6 3072
 
6.9%
5 3042
 
6.9%
7 2599
 
5.9%
9 1957
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44379
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 8497
19.1%
1 6775
15.3%
8 5380
12.1%
0 4956
11.2%
2 4455
10.0%
3 3430
7.7%
6 3072
 
6.9%
5 3042
 
6.9%
7 2599
 
5.9%
9 1957
 
4.4%

WGS84위도
Real number (ℝ)

MISSING 

Distinct: 5817
Distinct (%): 76.6%
Missing: 261
Missing (%): 3.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.439537
Minimum: 36.95861
Maximum: 38.158096
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.2 KiB

Quantile statistics

Minimum: 36.95861
5-th percentile: 37.103301
Q1: 37.293316
median: 37.405476
Q3: 37.623491
95-th percentile: 37.77472
Maximum: 38.158096
Range: 1.1994864
Interquartile range (IQR): 0.33017546

Descriptive statistics

Standard deviation: 0.2099033
Coefficient of variation (CV): 0.005606461
Kurtosis: -0.44984632
Mean: 37.439537
Median Absolute Deviation (MAD): 0.1354893
Skewness: 0.17691656
Sum: 284465.6
Variance: 0.044059397
Monotonicity: Not monotonic
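
Several of the derived statistics for WGS84위도 can be checked from the reported primary values. The sketch below recomputes the range, interquartile range, coefficient of variation, and variance from the rounded figures in this section, so the results agree with the report only up to rounding. The variable names are illustrative.

```python
# Primary values copied from the WGS84위도 statistics.
minimum, maximum = 36.95861, 38.158096
q1, q3 = 37.293316, 37.623491
mean, std = 37.439537, 0.2099033

value_range = maximum - minimum  # report lists 1.1994864
iqr = q3 - q1                    # report lists 0.33017546
cv = std / mean                  # report lists 0.005606461
variance = std ** 2              # report lists 0.044059397

print(f"range={value_range:.7f} iqr={iqr:.7f}")
print(f"cv={cv:.9f} variance={variance:.9f}")
```

The same relationships (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared) apply to the other numeric variables in the report.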
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.2933002148 6
 
0.1%
37.3660200932 6
 
0.1%
37.5880541645 6
 
0.1%
37.1563738117 5
 
0.1%
37.5109766434 5
 
0.1%
37.2971310229 5
 
0.1%
37.625367764 5
 
0.1%
37.6719368348 5
 
0.1%
37.2373925781 5
 
0.1%
37.3318965835 5
 
0.1%
Other values (5807) 7545
96.0%
(Missing) 261
 
3.3%
ValueCountFrequency (%)
36.9586098016 2
< 0.1%
36.9605410794 1
< 0.1%
36.9632453036 1
< 0.1%
36.9643269427 1
< 0.1%
36.9643606434 2
< 0.1%
36.9646954649 1
< 0.1%
36.9653633742 1
< 0.1%
36.9662549734 1
< 0.1%
36.9768403552 1
< 0.1%
36.9776918612 1
< 0.1%
ValueCountFrequency (%)
38.1580962294 1
< 0.1%
38.0994814482 2
< 0.1%
38.0981274274 2
< 0.1%
38.0922736597 2
< 0.1%
38.0344901657 1
< 0.1%
38.0327840029 1
< 0.1%
38.0322174448 1
< 0.1%
38.0320288045 1
< 0.1%
38.0305162095 1
< 0.1%
38.0302311623 1
< 0.1%

WGS84경도
Real number (ℝ)

MISSING 

Distinct: 5817
Distinct (%): 76.6%
Missing: 261
Missing (%): 3.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.99925
Minimum: 126.58183
Maximum: 127.71417
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.2 KiB

Quantile statistics

Minimum: 126.58183
5-th percentile: 126.7393
Q1: 126.83233
median: 127.02408
Q3: 127.12401
95-th percentile: 127.28416
Maximum: 127.71417
Range: 1.1323392
Interquartile range (IQR): 0.2916784

Descriptive statistics

Standard deviation: 0.18814716
Coefficient of variation (CV): 0.0014814824
Kurtosis: 0.27249252
Mean: 126.99925
Median Absolute Deviation (MAD): 0.138488
Skewness: 0.47621372
Sum: 964940.34
Variance: 0.035399356
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.8648009663 6
 
0.1%
126.9642912126 6
 
0.1%
127.2132623546 6
 
0.1%
127.0779658056 5
 
0.1%
126.7618393072 5
 
0.1%
126.9937659478 5
 
0.1%
126.7026546756 5
 
0.1%
126.7589777549 5
 
0.1%
127.0582923263 5
 
0.1%
127.1269472958 5
 
0.1%
Other values (5807) 7545
96.0%
(Missing) 261
 
3.3%
ValueCountFrequency (%)
126.5818311267 1
< 0.1%
126.5829862284 2
< 0.1%
126.5837814903 1
< 0.1%
126.5845685345 1
< 0.1%
126.5847887606 1
< 0.1%
126.5862207582 1
< 0.1%
126.5864818952 2
< 0.1%
126.5937185429 1
< 0.1%
126.5943919211 1
< 0.1%
126.5951969768 1
< 0.1%
ValueCountFrequency (%)
127.7141703278 1
< 0.1%
127.7083286196 1
< 0.1%
127.6809393215 1
< 0.1%
127.6808831865 1
< 0.1%
127.6612332091 1
< 0.1%
127.6472836094 1
< 0.1%
127.6464884895 1
< 0.1%
127.6453885949 1
< 0.1%
127.6451846932 1
< 0.1%
127.6450606674 1
< 0.1%

업태구분명정보
Categorical

IMBALANCE 

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

<NA>: 6905
태권도: 685
권투: 126
합기도: 54
유도: 47
Other values (3): 42

Length

Max length: 4
Median length: 4
Mean length: 3.8522713
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 권투
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 6905 | 87.9%
태권도 | 685 | 8.7%
권투 | 126 | 1.6%
합기도 | 54 | 0.7%
유도 | 47 | 0.6%
검도 | 27 | 0.3%
레슬링 | 8 | 0.1%
우슈 | 7 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

X좌표값
Real number (ℝ)

MISSING 

Distinct: 971
Distinct (%): 94.5%
Missing: 6831
Missing (%): 86.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 202166.03
Minimum: 164545.23
Maximum: 260281.34
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.2 KiB

Quantile statistics

Minimum: 164545.23
5-th percentile: 178219.34
Q1: 187059.74
median: 204766.06
Q3: 213051.43
95-th percentile: 225697.25
Maximum: 260281.34
Range: 95736.116
Interquartile range (IQR): 25991.684

Descriptive statistics

Standard deviation: 16367.623
Coefficient of variation (CV): 0.080961292
Kurtosis: -0.1128158
Mean: 202166.03
Median Absolute Deviation (MAD): 11977.535
Skewness: 0.22943269
Sum: 2.0782668 × 10^8
Variance: 2.6789909 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
206861.153230815 | 3 | < 0.1%
210903.455875022 | 3 | < 0.1%
198203.689226267 | 3 | < 0.1%
218760.550709581 | 3 | < 0.1%
186034.79271903 | 2 | < 0.1%
243648.330380255 | 2 | < 0.1%
223383.419849915 | 2 | < 0.1%
224104.747601001 | 2 | < 0.1%
187136.009514412 | 2 | < 0.1%
219627.938465481 | 2 | < 0.1%
Other values (961) | 1004 | 12.8%
(Missing) | 6831 | 86.9%

Value | Count | Frequency (%)
164545.227217254 | 1 | < 0.1%
166403.504343211 | 1 | < 0.1%
167059.98854799 | 1 | < 0.1%
167402.90994765 | 1 | < 0.1%
167595.119391588 | 1 | < 0.1%
170606.757261863 | 2 | < 0.1%
171140.484585133 | 1 | < 0.1%
171546.892467746 | 1 | < 0.1%
172052.968340156 | 1 | < 0.1%
172237.38941878 | 1 | < 0.1%

Value | Count | Frequency (%)
260281.343107359 | 1 | < 0.1%
257033.94588556 | 1 | < 0.1%
256432.82354579 | 1 | < 0.1%
256302.431981475 | 1 | < 0.1%
255704.301127036 | 1 | < 0.1%
255494.621747268 | 1 | < 0.1%
248540.565961813 | 1 | < 0.1%
243648.330380255 | 2 | < 0.1%
243580.175259869 | 1 | < 0.1%
243472.036026616 | 1 | < 0.1%

Y좌표값 (Y coordinate value)
Real number (ℝ)

MISSING

Distinct: 971
Distinct (%): 94.5%
Missing: 6831
Missing (%): 86.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 435303.64
Minimum: 384114.18
Maximum: 510588.37
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.2 KiB

Quantile statistics

Minimum: 384114.18
5-th percentile: 394666.86
Q1: 418891.23
median: 433361.12
Q3: 455443
95-th percentile: 471315.86
Maximum: 510588.37
Range: 126474.19
Interquartile range (IQR): 36551.776

Descriptive statistics

Standard deviation: 22988.114
Coefficient of variation (CV): 0.052809378
Kurtosis: -0.45917343
Mean: 435303.64
Median Absolute Deviation (MAD): 15970.971
Skewness: 0.11621781
Sum: 4.4749214 × 10^8
Variance: 5.284534 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
406048.57444542 | 3 | < 0.1%
408880.438725047 | 3 | < 0.1%
433837.219052849 | 3 | < 0.1%
453992.29486238 | 3 | < 0.1%
424864.389594433 | 2 | < 0.1%
417433.164352394 | 2 | < 0.1%
389748.386645421 | 2 | < 0.1%
433013.1479701 | 2 | < 0.1%
422384.557429755 | 2 | < 0.1%
452831.349534158 | 2 | < 0.1%
Other values (961) | 1004 | 12.8%
(Missing) | 6831 | 86.9%

Value | Count | Frequency (%)
384114.177788394 | 1 | < 0.1%
386342.101094516 | 1 | < 0.1%
386429.688791162 | 1 | < 0.1%
386883.994102439 | 1 | < 0.1%
387044.546796059 | 1 | < 0.1%
387321.388077925 | 1 | < 0.1%
387434.553103312 | 1 | < 0.1%
387597.597422515 | 1 | < 0.1%
387601.9284226 | 1 | < 0.1%
387703.716885502 | 2 | < 0.1%

Value | Count | Frequency (%)
510588.365461489 | 1 | < 0.1%
509970.433245136 | 1 | < 0.1%
502922.508176665 | 1 | < 0.1%
494464.353733618 | 1 | < 0.1%
494441.352570986 | 1 | < 0.1%
484247.169548502 | 1 | < 0.1%
484206.721230221 | 1 | < 0.1%
484122.806396564 | 1 | < 0.1%
483456.165626772 | 1 | < 0.1%
482535.74364315 | 1 | < 0.1%

문화체육업종명 (culture and sports business type name)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

Value | Count
체육도장업 (martial arts gym business) | 7856
<NA> | 3

Length

Max length: 5
Median length: 5
Mean length: 4.9996183
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 체육도장업
2nd row: 체육도장업
3rd row: 체육도장업
4th row: 체육도장업
5th row: 체육도장업

Common Values

Value | Count | Frequency (%)
체육도장업 | 7856 | > 99.9%
<NA> | 3 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
체육도장업 | 7856 | > 99.9%
na | 3 | < 0.1%

공사립구분명 (public/private classification name)
Categorical

IMBALANCE

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

Value | Count
사립 (private) | 7844
공립 (public) | 12
<NA> | 3

Length

Max length: 4
Median length: 2
Mean length: 2.0007635
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사립
2nd row: 사립
3rd row: 사립
4th row: 사립
5th row: 사립

Common Values

Value | Count | Frequency (%)
사립 | 7844 | 99.8%
공립 | 12 | 0.2%
<NA> | 3 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사립 | 7844 | 99.8%
공립 | 12 | 0.2%
na | 3 | < 0.1%

보험가입여부코드 (insurance enrollment status code)
Categorical

IMBALANCE

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.5 KiB

Value | Count
<NA> | 7836
Y | 18
0 | 5

Length

Max length: 4
Median length: 4
Mean length: 3.9912203
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 7836 | 99.7%
Y | 18 | 0.2%
0 | 5 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 7836 | 99.7%
y | 18 | 0.2%
0 | 5 | 0.1%
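
The report lists 0 missing values for 보험가입여부코드 even though `<NA>` accounts for 7836 of 7859 rows, and the printed mean length (3.9912203) matches what you get when `<NA>` is counted as a four-character string. That suggests the placeholder is being treated as an ordinary category. The sketch below shows one way to turn such placeholders into true missing values before profiling. It assumes the placeholder is literal text; `PLACEHOLDERS` and `to_missing` are hypothetical names, not part of ydata-profiling.

```python
from collections import Counter

# Value counts copied from the report; "<NA>" is assumed to be a literal
# placeholder string rather than a true missing value.
reported_counts = {"<NA>": 7836, "Y": 18, "0": 5}
values = [v for v, n in reported_counts.items() for _ in range(n)]

PLACEHOLDERS = {"<NA>", "na", ""}  # hypothetical set of missing markers

def to_missing(value):
    """Return None for placeholder strings, otherwise the value unchanged."""
    return None if value in PLACEHOLDERS else value

cleaned = [to_missing(v) for v in values]
missing = sum(v is None for v in cleaned)
missing_pct = 100 * missing / len(cleaned)
observed = Counter(v for v in cleaned if v is not None)

# Mean string length when the placeholder is counted as text.
mean_length = sum(len(v) for v in values) / len(values)

print(f"missing: {missing} ({missing_pct:.1f}%)")
print(f"observed: {dict(observed)}")
print(f"mean length with placeholder counted: {mean_length:.7f}")
```

After this cleanup only 23 rows carry a real value (`Y` or `0`), which is worth knowing before treating the column as a usable feature.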

회원모집총인원 (total number of members recruited)
Real number (ℝ)

MISSING, ZEROS

Distinct: 8
Distinct (%): 0.8%
Missing: 6881
Missing (%): 87.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.34560327
Minimum: 0
Maximum: 100
Zeros: 969
Zeros (%): 12.3%
Negative: 0
Negative (%): 0.0%
Memory size: 69.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 0
Maximum: 100
Range: 100
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 4.5979821
Coefficient of variation (CV): 13.30422
Kurtosis: 285.73615
Mean: 0.34560327
Median Absolute Deviation (MAD): 0
Skewness: 15.940817
Sum: 338
Variance: 21.141439
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)
Value | Count | Frequency (%)
0 | 969 | 12.3%
50 | 2 | < 0.1%
1 | 2 | < 0.1%
17 | 1 | < 0.1%
20 | 1 | < 0.1%
40 | 1 | < 0.1%
59 | 1 | < 0.1%
100 | 1 | < 0.1%
(Missing) | 6881 | 87.6%

Value | Count | Frequency (%)
0 | 969 | 12.3%
1 | 2 | < 0.1%
17 | 1 | < 0.1%
20 | 1 | < 0.1%
40 | 1 | < 0.1%
50 | 2 | < 0.1%
59 | 1 | < 0.1%
100 | 1 | < 0.1%

Value | Count | Frequency (%)
100 | 1 | < 0.1%
59 | 1 | < 0.1%
50 | 2 | < 0.1%
40 | 1 | < 0.1%
20 | 1 | < 0.1%
17 | 1 | < 0.1%
1 | 2 | < 0.1%
0 | 969 | 12.3%
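
Because 회원모집총인원 has only eight distinct values, its summary statistics can be reproduced directly from the frequency table. The sketch below rebuilds the non-missing values from the printed counts. It assumes the reported variance is the sample variance (n - 1 denominator), which is what matches the printed figure.

```python
import statistics

# Non-missing value counts copied from the report for 회원모집총인원.
value_counts = {0: 969, 1: 2, 17: 1, 20: 1, 40: 1, 50: 2, 59: 1, 100: 1}
values = [v for v, n in value_counts.items() for _ in range(n)]

n = len(values)                         # non-missing rows: 7859 - 6881
total = sum(values)                     # reported Sum: 338
mean = statistics.mean(values)          # reported Mean: 0.34560327
variance = statistics.variance(values)  # sample variance, n - 1 denominator
zero_share = value_counts[0] / n        # zeros among non-missing rows only

print(n, total, round(mean, 8), round(variance, 6), round(zero_share, 3))
```

Note that the 12.3% zero rate in the report is relative to all 7859 rows. Among the 978 rows that actually have a value, zeros make up about 99% of entries, so the column carries very little information.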

Sample

 | 시군명 | 사업장명 | 인허가일자 | 인허가취소일자 | 영업상태구분코드 | 영업상태명 | 폐업일자 | 소재지시설전화번호 | 소재지면적정보 | 도로명우편번호 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | 업태구분명정보 | X좌표값 | Y좌표값 | 문화체육업종명 | 공사립구분명 | 보험가입여부코드 | 회원모집총인원
0 | 가평군 | 크로스나인 | 2023-06-19 | <NA> | 13 | 영업중 | <NA> | <NA> | <NA> | 12437 | 경기도 가평군 조종면 청군로 1292, 2층 | 경기도 가평군 조종면 현리 296-8 | 12437 | 37.819549 | 127.34643 | 권투 | 230435.903152 | 479724.122057 | 체육도장업 | 사립 | <NA> | 0
1 | 가평군 | 가평유도체육관 | 20150625 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 가평읍 오리나무길 33 | 경기도 가평군 가평읍 대곡리 173-7번지 | 477804 | 37.824479 | 127.514158 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
2 | 가평군 | 충효체육관 | 20020226 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 가평읍 석봉로 168 | 경기도 가평군 가평읍 읍내리 493-6번지 | 477801 | 37.830122 | 127.510977 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
3 | 가평군 | 혜성운암태권도장 | 20040325 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 가평읍 석봉로153번길 12 | 경기도 가평군 가평읍 대곡리 285-9번지 | 477804 | 37.828734 | 127.50975 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
4 | 가평군 | 해동검도 가평본관 | 20160304 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 가평읍 보납로 1 | 경기도 가평군 가평읍 읍내리 506-6번지 | 12413 | 37.831465 | 127.510671 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
5 | 가평군 | 설악현무도장 | 19990208 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 설악면 자잠로 28-8 | 경기도 가평군 설악면 신천리 443-7번지 | 477853 | 37.679627 | 127.48968 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
6 | 가평군 | 동원 태권 스쿨 | 20130124 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 하면 조종새싹로 46 | 경기도 가평군 하면 현리 231-17번지 | 477832 | 37.822824 | 127.349219 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7 | 가평군 | 한양대 송라 태권도장 | 20111013 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 청평면 경춘로 807-18 | 경기도 가평군 청평면 청평리 470-13번지 | 477815 | 37.738122 | 127.417318 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
8 | 가평군 | 가평검도관 | 20031004 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 가평읍 보납로6번길 8 | 경기도 가평군 가평읍 읍내리 492-4번지 | 477801 | 37.830527 | 127.511114 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
9 | 가평군 | 튼튼체육관 | 20010804 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 가평군 가평읍 석봉로 146 | 경기도 가평군 가평읍 읍내리 329-21번지 | 477801 | 37.828148 | 127.511625 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>

 | 시군명 | 사업장명 | 인허가일자 | 인허가취소일자 | 영업상태구분코드 | 영업상태명 | 폐업일자 | 소재지시설전화번호 | 소재지면적정보 | 도로명우편번호 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | 업태구분명정보 | X좌표값 | Y좌표값 | 문화체육업종명 | 공사립구분명 | 보험가입여부코드 | 회원모집총인원
7849 | 화성시 | 경희대상무태권도장 | 20071203 | <NA> | <NA> | 폐업 등 | 20161031 | <NA> | <NA> | <NA> | <NA> | 경기도 화성시 반송동 50-2블럭 가희프자라2 6층(604호 605호) | 445160 | <NA> | <NA> | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7850 | 화성시 | 진우 고려 태권도 | 20071206 | <NA> | <NA> | 폐업 등 | 20100317 | <NA> | <NA> | <NA> | <NA> | 경기도 화성시 팔탄면 가재리 691번지 진우아파트상가 A동 309~312호 | 445911 | 37.159943 | 126.929019 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7851 | 화성시 | 경희대기안태권도 | 20071221 | <NA> | <NA> | 폐업 등 | 20160405 | <NA> | <NA> | <NA> | 경기도 화성시 효행로291번길 24 | 경기도 화성시 기안동 910 신일아파트상가 203,204,205호 | 445310 | 37.223123 | 126.976451 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7852 | 화성시 | 경희대 최강 태권도 | 20071228 | <NA> | <NA> | 폐업 등 | 20101004 | <NA> | <NA> | <NA> | <NA> | 경기도 화성시 봉담읍 동화리 봉담택지개발지구 씨블럭 11롯트 쌍용프라자 501~502 | 445893 | <NA> | <NA> | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7853 | 화성시 | 경희대 아이짐 태권도&특공무술 | 20080123 | <NA> | <NA> | 폐업 등 | 20150914 | <NA> | <NA> | <NA> | 경기도 화성시 봉담읍 동화길 89, 601호 | 경기도 화성시 봉담읍 동화리 599-3번지 601호 | 445893 | 37.216716 | 126.958676 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7854 | 화성시 | 용인대석사태권도 | 20080421 | <NA> | <NA> | 폐업 등 | 20090120 | <NA> | <NA> | <NA> | 경기도 화성시 동탄지성로 143 | 경기도 화성시 능동 1114-2 에버스타 701호,702호 | 445320 | 37.209535 | 127.059604 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7855 | 화성시 | 반송경희체육관 | 20080626 | <NA> | <NA> | 폐업 등 | 20091005 | <NA> | <NA> | <NA> | 경기도 화성시 동탄중앙로 76 | 경기도 화성시 반송동 221-2 서건프라자 5층 502호 | 445160 | 37.193195 | 127.07251 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7856 | 화성시 | 도원체육관 | 19921026 | <NA> | <NA> | 폐업 등 | 19951130 | <NA> | <NA> | <NA> | 경기도 화성시 봉담읍 참샘길 15 | 경기도 화성시 봉담읍 와우리 65-10번지 | 445897 | 37.214417 | 126.976591 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7857 | 화성시 | 화산무림태권도체육관 | 19930427 | <NA> | <NA> | 폐업 등 | 20111230 | <NA> | <NA> | <NA> | 경기도 화성시 화산중앙로64번길 3, 303호 (송산동,삼신빌딩 3층) | 경기도 화성시 송산동 165-1번지 삼신빌딩 3층 303호 | 445370 | 37.209708 | 127.011796 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
7858 | 화성시 | 남양체육관 | 19940812 | <NA> | <NA> | 휴업 등 | <NA> | <NA> | <NA> | <NA> | 경기도 화성시 남양읍 남양시장로45번길 12, 3층 | 경기도 화성시 남양읍 남양리 651-6번지 | 18258 | 37.209669 | 126.814597 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA>
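
The sample rows show 인허가일자 in two layouts: `2023-06-19` in row 0 and `20150625` in the other rows. The sketch below normalizes both. It assumes these are the only two formats and that `<NA>` marks a missing date; `parse_license_date` is a hypothetical helper name.

```python
from datetime import datetime

def parse_license_date(text):
    """Parse 'YYYY-MM-DD' or 'YYYYMMDD'; return None for '<NA>' or bad input."""
    if text is None or text.strip() in {"", "<NA>"}:
        return None
    for fmt in ("%Y-%m-%d", "%Y%m%d"):
        try:
            return datetime.strptime(text.strip(), fmt).date()
        except ValueError:
            continue  # try the next known layout
    return None

samples = ["2023-06-19", "20150625", "<NA>"]
parsed = [parse_license_date(s) for s in samples]
print(parsed)
```

Returning `None` for unparseable input keeps bad records visible as missing values instead of raising partway through a load.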

Duplicate rows

Most frequently occurring

 | 시군명 | 사업장명 | 인허가일자 | 영업상태구분코드 | 영업상태명 | 폐업일자 | 소재지시설전화번호 | 도로명우편번호 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | 업태구분명정보 | X좌표값 | Y좌표값 | 문화체육업종명 | 공사립구분명 | 보험가입여부코드 | 회원모집총인원 | # duplicates
0 | 양주시 | 비룡무술스포츠타운 | 19950316 | <NA> | 폐업 등 | 19990114 | <NA> | <NA> | 경기도 양주시 평화로1429번길 85 (덕계동) | 경기도 양주시 덕계동 656-2번지 | 482050 | 37.82164 | 127.04346 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA> | 2
1 | 용인시 | 둥지 태권도 체육관 | 20030417 | <NA> | 폐업 등 | 20051114 | <NA> | <NA> | 경기도 용인시 처인구 양지면 양지로143번길 3 | 경기도 용인시 처인구 양지면 양지리 593-3번지 | 449823 | 37.235009 | 127.284164 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA> | 2
2 | 용인시 | 신갈검도관 | 20030509 | <NA> | 폐업 등 | 20051031 | <NA> | <NA> | 경기도 용인시 기흥구 신갈로84번길 9 (신갈동) | 경기도 용인시 기흥구 신갈동 40-19번지 | 446596 | 37.274295 | 127.106771 | <NA> | <NA> | <NA> | 체육도장업 | 사립 | <NA> | <NA> | 2
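
Each of the three rows listed here occurs twice in the dataset. The sketch below shows one way to find and drop exact duplicates. It uses reduced records (시군명, 사업장명, 인허가일자) taken from this report for illustration only, plus one unique row from the sample for contrast; a real check would compare every column.

```python
from collections import Counter

# Reduced records (시군명, 사업장명, 인허가일자); the first three pairs mirror
# the duplicate rows listed in the report, the last row is unique.
rows = [
    ("양주시", "비룡무술스포츠타운", "19950316"),
    ("양주시", "비룡무술스포츠타운", "19950316"),
    ("용인시", "둥지 태권도 체육관", "20030417"),
    ("용인시", "둥지 태권도 체육관", "20030417"),
    ("용인시", "신갈검도관", "20030509"),
    ("용인시", "신갈검도관", "20030509"),
    ("가평군", "가평검도관", "20031004"),
]

counts = Counter(rows)
duplicated = {row: n for row, n in counts.items() if n > 1}

# dict.fromkeys keeps the first occurrence of each row and preserves order.
deduplicated = list(dict.fromkeys(rows))

print(len(duplicated), len(deduplicated))
```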