Overview

Dataset statistics

Number of variables19
Number of observations10000
Missing cells53237
Missing cells (%)28.0%
Duplicate rows9
Duplicate rows (%)0.1%
Total size in memory1.6 MiB
Average record size in memory169.0 B

Variable types

Categorical2
Text6
DateTime2
Unsupported3
Numeric6

Alerts

Dataset has 9 (0.1%) duplicate rowsDuplicates
영업상태구분코드 is highly overall correlated with 영업상태명High correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with X좌표값 and 1 other fieldsHigh correlation
X좌표값 is highly overall correlated with WGS84경도 and 1 other fieldsHigh correlation
Y좌표값 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
시군명 is highly overall correlated with 소재지우편번호 and 4 other fieldsHigh correlation
영업상태명 is highly overall correlated with 영업상태구분코드High correlation
영업상태명 is highly imbalanced (64.5%)Imbalance
인허가취소일자 has 10000 (100.0%) missing valuesMissing
폐업일자 has 3970 (39.7%) missing valuesMissing
소재지시설전화번호 has 3798 (38.0%) missing valuesMissing
소재지면적정보 has 10000 (100.0%) missing valuesMissing
도로명우편번호 has 4228 (42.3%) missing valuesMissing
소재지도로명주소 has 718 (7.2%) missing valuesMissing
소재지지번주소 has 241 (2.4%) missing valuesMissing
소재지우편번호 has 2643 (26.4%) missing valuesMissing
WGS84위도 has 2691 (26.9%) missing valuesMissing
WGS84경도 has 2691 (26.9%) missing valuesMissing
업태구분명정보 has 10000 (100.0%) missing valuesMissing
X좌표값 has 1082 (10.8%) missing valuesMissing
Y좌표값 has 1082 (10.8%) missing valuesMissing
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적정보 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명정보 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 21:41:45.735568
Analysis finished2023-12-10 21:41:52.282724
Duration6.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct32
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고양시
908 
부천시
885 
수원시
878 
안산시
692 
성남시
629 
Other values (27)
6008 

Length

Max length4
Median length3
Mean length3.0921
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row포천시
2nd row수원시
3rd row성남시
4th row안양시
5th row이천시

Common Values

ValueCountFrequency (%)
고양시 908
 
9.1%
부천시 885
 
8.8%
수원시 878
 
8.8%
안산시 692
 
6.9%
성남시 629
 
6.3%
남양주시 485
 
4.9%
안양시 478
 
4.8%
시흥시 412
 
4.1%
용인시 407
 
4.1%
화성시 398
 
4.0%
Other values (22) 3828
38.3%

Length

2023-12-11T06:41:52.337176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고양시 908
 
9.1%
부천시 885
 
8.8%
수원시 878
 
8.8%
안산시 692
 
6.9%
성남시 629
 
6.3%
남양주시 485
 
4.9%
안양시 478
 
4.8%
시흥시 412
 
4.1%
용인시 407
 
4.1%
화성시 398
 
4.0%
Other values (22) 3828
38.3%
Distinct7821
Distinct (%)78.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T06:41:52.594221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length23
Mean length5.7711
Min length1

Characters and Unicode

Total characters57711
Distinct characters779
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6566 ?
Unique (%)65.7%

Sample

1st row엘리온
2nd row플러스디자인
3rd row뉴한빛광고
4th row(주)제일광고기획
5th row뉴아트
ValueCountFrequency (%)
주식회사 411
 
3.6%
디자인 136
 
1.2%
광고 49
 
0.4%
45
 
0.4%
광고기획 43
 
0.4%
기획 28
 
0.2%
광고나라 20
 
0.2%
미래광고 20
 
0.2%
제일광고 17
 
0.2%
애드 16
 
0.1%
Other values (7917) 10485
93.0%
2023-12-11T06:41:52.985932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3065
 
5.3%
2964
 
5.1%
2261
 
3.9%
2258
 
3.9%
1975
 
3.4%
1878
 
3.3%
) 1771
 
3.1%
( 1729
 
3.0%
1461
 
2.5%
1316
 
2.3%
Other values (769) 37033
64.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51408
89.1%
Close Punctuation 1771
 
3.1%
Open Punctuation 1729
 
3.0%
Space Separator 1316
 
2.3%
Uppercase Letter 851
 
1.5%
Decimal Number 269
 
0.5%
Lowercase Letter 223
 
0.4%
Other Punctuation 113
 
0.2%
Dash Punctuation 26
 
< 0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3065
 
6.0%
2964
 
5.8%
2261
 
4.4%
2258
 
4.4%
1975
 
3.8%
1878
 
3.7%
1461
 
2.8%
1218
 
2.4%
1204
 
2.3%
1153
 
2.2%
Other values (698) 31971
62.2%
Uppercase Letter
ValueCountFrequency (%)
S 95
 
11.2%
D 81
 
9.5%
A 57
 
6.7%
N 53
 
6.2%
I 52
 
6.1%
E 49
 
5.8%
C 49
 
5.8%
G 47
 
5.5%
R 45
 
5.3%
M 44
 
5.2%
Other values (16) 279
32.8%
Lowercase Letter
ValueCountFrequency (%)
s 31
13.9%
e 30
13.5%
n 29
13.0%
i 24
10.8%
g 19
8.5%
o 14
 
6.3%
a 12
 
5.4%
r 11
 
4.9%
d 8
 
3.6%
t 7
 
3.1%
Other values (11) 38
17.0%
Decimal Number
ValueCountFrequency (%)
1 81
30.1%
2 67
24.9%
0 43
16.0%
3 21
 
7.8%
5 14
 
5.2%
4 14
 
5.2%
8 10
 
3.7%
6 9
 
3.3%
9 7
 
2.6%
7 3
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 68
60.2%
& 33
29.2%
, 5
 
4.4%
/ 2
 
1.8%
· 2
 
1.8%
: 1
 
0.9%
' 1
 
0.9%
# 1
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 1771
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1729
100.0%
Space Separator
ValueCountFrequency (%)
1316
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51411
89.1%
Common 5225
 
9.1%
Latin 1074
 
1.9%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3065
 
6.0%
2964
 
5.8%
2261
 
4.4%
2258
 
4.4%
1975
 
3.8%
1878
 
3.7%
1461
 
2.8%
1218
 
2.4%
1204
 
2.3%
1153
 
2.2%
Other values (698) 31974
62.2%
Latin
ValueCountFrequency (%)
S 95
 
8.8%
D 81
 
7.5%
A 57
 
5.3%
N 53
 
4.9%
I 52
 
4.8%
E 49
 
4.6%
C 49
 
4.6%
G 47
 
4.4%
R 45
 
4.2%
M 44
 
4.1%
Other values (37) 502
46.7%
Common
ValueCountFrequency (%)
) 1771
33.9%
( 1729
33.1%
1316
25.2%
1 81
 
1.6%
. 68
 
1.3%
2 67
 
1.3%
0 43
 
0.8%
& 33
 
0.6%
- 26
 
0.5%
3 21
 
0.4%
Other values (13) 70
 
1.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51406
89.1%
ASCII 6297
 
10.9%
None 6
 
< 0.1%
CJK 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3065
 
6.0%
2964
 
5.8%
2261
 
4.4%
2258
 
4.4%
1975
 
3.8%
1878
 
3.7%
1461
 
2.8%
1218
 
2.4%
1204
 
2.3%
1153
 
2.2%
Other values (696) 31969
62.2%
ASCII
ValueCountFrequency (%)
) 1771
28.1%
( 1729
27.5%
1316
20.9%
S 95
 
1.5%
D 81
 
1.3%
1 81
 
1.3%
. 68
 
1.1%
2 67
 
1.1%
A 57
 
0.9%
N 53
 
0.8%
Other values (59) 979
15.5%
None
ValueCountFrequency (%)
4
66.7%
· 2
33.3%
CJK
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct5468
Distinct (%)54.7%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
Minimum1882-06-04 00:00:00
Maximum2023-12-01 00:00:00
2023-12-11T06:41:53.106304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:53.213355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

영업상태구분코드
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.3953
Minimum10
Maximum90
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:41:53.296812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile20
Q120
median40
Q340
95-th percentile40
Maximum90
Range80
Interquartile range (IQR)20

Descriptive statistics

Standard deviation10.707441
Coefficient of variation (CV)0.3206272
Kurtosis0.50873918
Mean33.3953
Median Absolute Deviation (MAD)0
Skewness0.1931503
Sum333953
Variance114.6493
MonotonicityNot monotonic
2023-12-11T06:41:53.383621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
40 6160
61.6%
20 3493
34.9%
70 169
 
1.7%
30 155
 
1.6%
71 7
 
0.1%
50 6
 
0.1%
10 5
 
0.1%
72 3
 
< 0.1%
90 1
 
< 0.1%
60 1
 
< 0.1%
ValueCountFrequency (%)
10 5
 
0.1%
20 3493
34.9%
30 155
 
1.6%
40 6160
61.6%
50 6
 
0.1%
60 1
 
< 0.1%
70 169
 
1.7%
71 7
 
0.1%
72 3
 
< 0.1%
90 1
 
< 0.1%
ValueCountFrequency (%)
90 1
 
< 0.1%
72 3
 
< 0.1%
71 7
 
0.1%
70 169
 
1.7%
60 1
 
< 0.1%
50 6
 
0.1%
40 6160
61.6%
30 155
 
1.6%
20 3493
34.9%
10 5
 
0.1%

영업상태명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
폐업
6160 
정상
3493 
취소
 
169
휴업
 
155
신청취소(취하)
 
7
Other values (5)
 
16

Length

Max length8
Median length2
Mean length2.0086
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row폐업
2nd row정상
3rd row폐업
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
폐업 6160
61.6%
정상 3493
34.9%
취소 169
 
1.7%
휴업 155
 
1.6%
신청취소(취하) 7
 
0.1%
영업정지 6
 
0.1%
설립신청 5
 
0.1%
신청취소(반려) 3
 
< 0.1%
등록신청 1
 
< 0.1%
부분가동 1
 
< 0.1%

Length

2023-12-11T06:41:53.492030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:41:53.588922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 6160
61.6%
정상 3493
34.9%
취소 169
 
1.7%
휴업 155
 
1.6%
신청취소(취하 7
 
0.1%
영업정지 6
 
0.1%
설립신청 5
 
< 0.1%
신청취소(반려 3
 
< 0.1%
등록신청 1
 
< 0.1%
부분가동 1
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2788
Distinct (%)46.2%
Missing3970
Missing (%)39.7%
Memory size156.2 KiB
Minimum1991-12-31 00:00:00
Maximum2023-12-04 00:00:00
2023-12-11T06:41:53.702738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:53.813693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct5785
Distinct (%)93.3%
Missing3798
Missing (%)38.0%
Memory size156.2 KiB
2023-12-11T06:41:54.084328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length10.969203
Min length3

Characters and Unicode

Total characters68031
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5455 ?
Unique (%)88.0%

Sample

1st row7541379
2nd row031 474 0024
3rd row032 6539390
4th row031 8446283
5th row0318665322
ValueCountFrequency (%)
031 3669
28.0%
032 680
 
5.2%
02 312
 
2.4%
0344 142
 
1.1%
0343 38
 
0.3%
0331 31
 
0.2%
070 28
 
0.2%
675 23
 
0.2%
652 17
 
0.1%
653 16
 
0.1%
Other values (6140) 8153
62.2%
2023-12-11T06:41:54.520356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9868
14.5%
3 9656
14.2%
1 8232
12.1%
7162
10.5%
2 5723
8.4%
4 5380
7.9%
5 4793
7.0%
6 4779
7.0%
7 4678
6.9%
8 3958
5.8%
Other values (2) 3802
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60867
89.5%
Space Separator 7162
 
10.5%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9868
16.2%
3 9656
15.9%
1 8232
13.5%
2 5723
9.4%
4 5380
8.8%
5 4793
7.9%
6 4779
7.9%
7 4678
7.7%
8 3958
6.5%
9 3800
 
6.2%
Space Separator
ValueCountFrequency (%)
7162
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 68031
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9868
14.5%
3 9656
14.2%
1 8232
12.1%
7162
10.5%
2 5723
8.4%
4 5380
7.9%
5 4793
7.0%
6 4779
7.0%
7 4678
6.9%
8 3958
5.8%
Other values (2) 3802
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 68031
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9868
14.5%
3 9656
14.2%
1 8232
12.1%
7162
10.5%
2 5723
8.4%
4 5380
7.9%
5 4793
7.0%
6 4779
7.0%
7 4678
6.9%
8 3958
5.8%
Other values (2) 3802
 
5.6%

소재지면적정보
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

도로명우편번호
Text

MISSING 

Distinct2791
Distinct (%)48.4%
Missing4228
Missing (%)42.3%
Memory size156.2 KiB
2023-12-11T06:41:54.817946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.229903
Min length5

Characters and Unicode

Total characters30187
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1569 ?
Unique (%)27.2%

Sample

1st row16610
2nd row14057
3rd row17406
4th row10256
5th row11667
ValueCountFrequency (%)
12918 31
 
0.5%
11184 19
 
0.3%
10863 17
 
0.3%
12128 17
 
0.3%
430818 16
 
0.3%
16564 16
 
0.3%
15431 15
 
0.3%
430833 14
 
0.2%
12770 14
 
0.2%
16479 14
 
0.2%
Other values (2781) 5599
97.0%
2023-12-11T06:41:55.199755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 7073
23.4%
4 3548
11.8%
0 3261
10.8%
2 2937
9.7%
6 2740
 
9.1%
8 2520
 
8.3%
5 2371
 
7.9%
3 2329
 
7.7%
7 1743
 
5.8%
9 1588
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30110
99.7%
Dash Punctuation 77
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 7073
23.5%
4 3548
11.8%
0 3261
10.8%
2 2937
9.8%
6 2740
 
9.1%
8 2520
 
8.4%
5 2371
 
7.9%
3 2329
 
7.7%
7 1743
 
5.8%
9 1588
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30187
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 7073
23.4%
4 3548
11.8%
0 3261
10.8%
2 2937
9.7%
6 2740
 
9.1%
8 2520
 
8.3%
5 2371
 
7.9%
3 2329
 
7.7%
7 1743
 
5.8%
9 1588
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30187
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 7073
23.4%
4 3548
11.8%
0 3261
10.8%
2 2937
9.7%
6 2740
 
9.1%
8 2520
 
8.3%
5 2371
 
7.9%
3 2329
 
7.7%
7 1743
 
5.8%
9 1588
 
5.3%
Distinct7276
Distinct (%)78.4%
Missing718
Missing (%)7.2%
Memory size156.2 KiB
2023-12-11T06:41:55.475555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length56
Mean length28.368455
Min length14

Characters and Unicode

Total characters263316
Distinct characters642
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6070 ?
Unique (%)65.4%

Sample

1st row경기도 수원시 권선구 효탑로**번길 **-**, ***호 (탑동)
2nd row경기도 성남시 중원구 광명로***번길 * (금광동)
3rd row경기도 안양시 동안구 시민대로***번길 ** (관양동)
4th row경기도 이천시 모가면 사실로***번길 **-*
5th row경기도 고양시 일산동구 지영로 *** (지영동)
ValueCountFrequency (%)
9358
 
16.8%
경기도 9253
 
16.6%
1803
 
3.2%
910
 
1.6%
고양시 856
 
1.5%
수원시 829
 
1.5%
부천시 819
 
1.5%
안산시 677
 
1.2%
성남시 618
 
1.1%
남양주시 458
 
0.8%
Other values (5270) 30056
54.0%
2023-12-11T06:41:55.908977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48343
18.4%
* 42538
16.2%
9785
 
3.7%
9728
 
3.7%
9616
 
3.7%
9610
 
3.6%
9555
 
3.6%
8476
 
3.2%
( 7843
 
3.0%
) 7843
 
3.0%
Other values (632) 99979
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 149795
56.9%
Space Separator 48343
 
18.4%
Other Punctuation 46223
 
17.6%
Open Punctuation 7843
 
3.0%
Close Punctuation 7843
 
3.0%
Dash Punctuation 2371
 
0.9%
Uppercase Letter 727
 
0.3%
Lowercase Letter 150
 
0.1%
Math Symbol 16
 
< 0.1%
Letter Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9785
 
6.5%
9728
 
6.5%
9616
 
6.4%
9610
 
6.4%
9555
 
6.4%
8476
 
5.7%
4335
 
2.9%
4256
 
2.8%
3307
 
2.2%
3069
 
2.0%
Other values (572) 78058
52.1%
Uppercase Letter
ValueCountFrequency (%)
B 142
19.5%
A 109
15.0%
T 51
 
7.0%
E 44
 
6.1%
I 42
 
5.8%
C 39
 
5.4%
F 38
 
5.2%
S 37
 
5.1%
K 36
 
5.0%
R 25
 
3.4%
Other values (16) 164
22.6%
Lowercase Letter
ValueCountFrequency (%)
e 37
24.7%
r 20
13.3%
n 19
12.7%
t 18
12.0%
c 17
11.3%
w 7
 
4.7%
o 6
 
4.0%
i 5
 
3.3%
k 4
 
2.7%
a 4
 
2.7%
Other values (9) 13
 
8.7%
Other Punctuation
ValueCountFrequency (%)
* 42538
92.0%
, 3658
 
7.9%
. 10
 
< 0.1%
& 9
 
< 0.1%
/ 6
 
< 0.1%
' 1
 
< 0.1%
@ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 12
75.0%
4
 
25.0%
Space Separator
ValueCountFrequency (%)
48343
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7843
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7843
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2371
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 149795
56.9%
Common 112641
42.8%
Latin 880
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9785
 
6.5%
9728
 
6.5%
9616
 
6.4%
9610
 
6.4%
9555
 
6.4%
8476
 
5.7%
4335
 
2.9%
4256
 
2.8%
3307
 
2.2%
3069
 
2.0%
Other values (572) 78058
52.1%
Latin
ValueCountFrequency (%)
B 142
16.1%
A 109
 
12.4%
T 51
 
5.8%
E 44
 
5.0%
I 42
 
4.8%
C 39
 
4.4%
F 38
 
4.3%
e 37
 
4.2%
S 37
 
4.2%
K 36
 
4.1%
Other values (36) 305
34.7%
Common
ValueCountFrequency (%)
48343
42.9%
* 42538
37.8%
( 7843
 
7.0%
) 7843
 
7.0%
, 3658
 
3.2%
- 2371
 
2.1%
~ 12
 
< 0.1%
. 10
 
< 0.1%
& 9
 
< 0.1%
/ 6
 
< 0.1%
Other values (4) 8
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 149795
56.9%
ASCII 113514
43.1%
Math Operators 4
 
< 0.1%
Number Forms 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48343
42.6%
* 42538
37.5%
( 7843
 
6.9%
) 7843
 
6.9%
, 3658
 
3.2%
- 2371
 
2.1%
B 142
 
0.1%
A 109
 
0.1%
T 51
 
< 0.1%
E 44
 
< 0.1%
Other values (48) 572
 
0.5%
Hangul
ValueCountFrequency (%)
9785
 
6.5%
9728
 
6.5%
9616
 
6.4%
9610
 
6.4%
9555
 
6.4%
8476
 
5.7%
4335
 
2.9%
4256
 
2.8%
3307
 
2.2%
3069
 
2.0%
Other values (572) 78058
52.1%
Math Operators
ValueCountFrequency (%)
4
100.0%
Number Forms
ValueCountFrequency (%)
3
100.0%

소재지지번주소
Text

MISSING 

Distinct5383
Distinct (%)55.2%
Missing241
Missing (%)2.4%
Memory size156.2 KiB
2023-12-11T06:41:56.214631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length47
Mean length23.234245
Min length10

Characters and Unicode

Total characters226743
Distinct characters564
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3982 ?
Unique (%)40.8%

Sample

1st row경기도 포천시 소흘읍 이동교리 ***-*번지
2nd row경기도 수원시 권선구 탑동 ***-* ***호
3rd row경기도 성남시 중원구 금광동 ****번지
4th row경기도 안양시 동안구 관양동 ***-*
5th row경기도 이천시 모가면 두미리 ***-*
ValueCountFrequency (%)
경기도 9690
19.8%
번지 5616
 
11.5%
4182
 
8.6%
1190
 
2.4%
고양시 879
 
1.8%
수원시 876
 
1.8%
부천시 872
 
1.8%
안산시 681
 
1.4%
성남시 580
 
1.2%
안양시 473
 
1.0%
Other values (2684) 23850
48.8%
2023-12-11T06:41:56.661691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 45316
20.0%
43086
19.0%
10063
 
4.4%
9958
 
4.4%
9913
 
4.4%
9738
 
4.3%
9403
 
4.1%
- 7920
 
3.5%
6191
 
2.7%
5630
 
2.5%
Other values (554) 69525
30.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 129502
57.1%
Other Punctuation 45401
 
20.0%
Space Separator 43086
 
19.0%
Dash Punctuation 7920
 
3.5%
Uppercase Letter 532
 
0.2%
Lowercase Letter 137
 
0.1%
Close Punctuation 77
 
< 0.1%
Open Punctuation 77
 
< 0.1%
Math Symbol 8
 
< 0.1%
Letter Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10063
 
7.8%
9958
 
7.7%
9913
 
7.7%
9738
 
7.5%
9403
 
7.3%
6191
 
4.8%
5630
 
4.3%
4305
 
3.3%
2962
 
2.3%
2356
 
1.8%
Other values (495) 58983
45.5%
Uppercase Letter
ValueCountFrequency (%)
B 88
16.5%
A 64
12.0%
E 43
 
8.1%
T 40
 
7.5%
S 35
 
6.6%
K 34
 
6.4%
I 29
 
5.5%
C 27
 
5.1%
R 23
 
4.3%
D 20
 
3.8%
Other values (16) 129
24.2%
Lowercase Letter
ValueCountFrequency (%)
e 35
25.5%
n 18
13.1%
t 17
12.4%
r 17
12.4%
c 16
11.7%
i 6
 
4.4%
w 6
 
4.4%
o 4
 
2.9%
l 4
 
2.9%
a 3
 
2.2%
Other values (9) 11
 
8.0%
Other Punctuation
ValueCountFrequency (%)
* 45316
99.8%
, 59
 
0.1%
/ 11
 
< 0.1%
. 8
 
< 0.1%
& 3
 
< 0.1%
@ 3
 
< 0.1%
' 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
4
50.0%
~ 4
50.0%
Space Separator
ValueCountFrequency (%)
43086
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7920
100.0%
Close Punctuation
ValueCountFrequency (%)
) 77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129502
57.1%
Common 96569
42.6%
Latin 672
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10063
 
7.8%
9958
 
7.7%
9913
 
7.7%
9738
 
7.5%
9403
 
7.3%
6191
 
4.8%
5630
 
4.3%
4305
 
3.3%
2962
 
2.3%
2356
 
1.8%
Other values (495) 58983
45.5%
Latin
ValueCountFrequency (%)
B 88
 
13.1%
A 64
 
9.5%
E 43
 
6.4%
T 40
 
6.0%
e 35
 
5.2%
S 35
 
5.2%
K 34
 
5.1%
I 29
 
4.3%
C 27
 
4.0%
R 23
 
3.4%
Other values (36) 254
37.8%
Common
ValueCountFrequency (%)
* 45316
46.9%
43086
44.6%
- 7920
 
8.2%
) 77
 
0.1%
( 77
 
0.1%
, 59
 
0.1%
/ 11
 
< 0.1%
. 8
 
< 0.1%
4
 
< 0.1%
~ 4
 
< 0.1%
Other values (3) 7
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129502
57.1%
ASCII 97234
42.9%
Math Operators 4
 
< 0.1%
Number Forms 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 45316
46.6%
43086
44.3%
- 7920
 
8.1%
B 88
 
0.1%
) 77
 
0.1%
( 77
 
0.1%
A 64
 
0.1%
, 59
 
0.1%
E 43
 
< 0.1%
T 40
 
< 0.1%
Other values (47) 464
 
0.5%
Hangul
ValueCountFrequency (%)
10063
 
7.8%
9958
 
7.7%
9913
 
7.7%
9738
 
7.5%
9403
 
7.3%
6191
 
4.8%
5630
 
4.3%
4305
 
3.3%
2962
 
2.3%
2356
 
1.8%
Other values (495) 58983
45.5%
Math Operators
ValueCountFrequency (%)
4
100.0%
Number Forms
ValueCountFrequency (%)
3
100.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2717
Distinct (%)36.9%
Missing2643
Missing (%)26.4%
Infinite0
Infinite (%)0.0%
Mean14077.089
Minimum2201
Maximum59511
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:41:56.789496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2201
5-th percentile10316
Q112032
median14212
Q315841
95-th percentile18118
Maximum59511
Range57310
Interquartile range (IQR)3809

Descriptive statistics

Standard deviation2634.933
Coefficient of variation (CV)0.18717883
Kurtosis22.600263
Mean14077.089
Median Absolute Deviation (MAD)2034
Skewness1.6774727
Sum1.0356514 × 108
Variance6942872
MonotonicityNot monotonic
2023-12-11T06:41:56.915384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15119 41
 
0.4%
11184 31
 
0.3%
15431 24
 
0.2%
12770 23
 
0.2%
12774 21
 
0.2%
10324 19
 
0.2%
10317 19
 
0.2%
14948 19
 
0.2%
18469 18
 
0.2%
11932 18
 
0.2%
Other values (2707) 7124
71.2%
(Missing) 2643
 
26.4%
ValueCountFrequency (%)
2201 1
< 0.1%
3174 1
< 0.1%
3397 1
< 0.1%
3646 1
< 0.1%
3769 1
< 0.1%
4367 1
< 0.1%
4373 1
< 0.1%
4964 1
< 0.1%
5317 1
< 0.1%
5510 1
< 0.1%
ValueCountFrequency (%)
59511 1
< 0.1%
47856 1
< 0.1%
46703 1
< 0.1%
44935 1
< 0.1%
39830 1
< 0.1%
35255 1
< 0.1%
32722 1
< 0.1%
27462 2
< 0.1%
22664 1
< 0.1%
22174 1
< 0.1%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct6563
Distinct (%)89.8%
Missing2691
Missing (%)26.9%
Infinite0
Infinite (%)0.0%
Mean37.451668
Minimum34.680416
Maximum38.184878
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:41:57.053076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34.680416
5-th percentile37.073858
Q137.307426
median37.435967
Q337.622354
95-th percentile37.795943
Maximum38.184878
Range3.5044614
Interquartile range (IQR)0.3149283

Descriptive statistics

Standard deviation0.2189752
Coefficient of variation (CV)0.0058468745
Kurtosis7.163205
Mean37.451668
Median Absolute Deviation (MAD)0.14887433
Skewness-0.62559382
Sum273734.24
Variance0.047950139
MonotonicityNot monotonic
2023-12-11T06:41:57.189159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.327696 10
 
0.1%
37.3715256 8
 
0.1%
37.5526520126 6
 
0.1%
37.3240283 6
 
0.1%
37.439938 6
 
0.1%
37.650156 5
 
0.1%
37.7497611 5
 
0.1%
37.677682 5
 
0.1%
37.4882899 5
 
0.1%
37.3263705 5
 
0.1%
Other values (6553) 7248
72.5%
(Missing) 2691
 
26.9%
ValueCountFrequency (%)
34.6804163 1
< 0.1%
35.1996033 1
< 0.1%
35.2097628665 1
< 0.1%
35.613427346 1
< 0.1%
36.0017489 1
< 0.1%
36.1031105123 1
< 0.1%
36.3445533 1
< 0.1%
36.942255 1
< 0.1%
36.94673 1
< 0.1%
36.9585802 1
< 0.1%
ValueCountFrequency (%)
38.1848777 1
< 0.1%
38.1365183 2
< 0.1%
38.103027 1
< 0.1%
38.102776 1
< 0.1%
38.1020538 1
< 0.1%
38.0970829227 1
< 0.1%
38.0967769 1
< 0.1%
38.095708977 1
< 0.1%
38.0947765 1
< 0.1%
38.092027 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct6563
Distinct (%)89.8%
Missing2691
Missing (%)26.9%
Infinite0
Infinite (%)0.0%
Mean126.99224
Minimum126.52952
Maximum129.12314
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:41:57.597759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.52952
5-th percentile126.73627
Q1126.80754
median126.98219
Q3127.13449
95-th percentile127.32988
Maximum129.12314
Range2.5936263
Interquartile range (IQR)0.32695232

Descriptive statistics

Standard deviation0.20900635
Coefficient of variation (CV)0.0016458199
Kurtosis4.1894289
Mean126.99224
Median Absolute Deviation (MAD)0.1618225
Skewness1.0221627
Sum928186.25
Variance0.043683654
MonotonicityNot monotonic
2023-12-11T06:41:57.752300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.6942604 10
 
0.1%
126.9520679 8
 
0.1%
126.787603 6
 
0.1%
127.1827170463 6
 
0.1%
126.858463 5
 
0.1%
126.7949031 5
 
0.1%
126.7674828 5
 
0.1%
126.7822931 5
 
0.1%
126.7592911 5
 
0.1%
127.1776552 5
 
0.1%
Other values (6553) 7249
72.5%
(Missing) 2691
 
26.9%
ValueCountFrequency (%)
126.5295165 1
< 0.1%
126.547429 1
< 0.1%
126.5546186 1
< 0.1%
126.5582773 1
< 0.1%
126.5655067 1
< 0.1%
126.568635 1
< 0.1%
126.5691885 1
< 0.1%
126.5700155 1
< 0.1%
126.5701437 1
< 0.1%
126.5709841 1
< 0.1%
ValueCountFrequency (%)
129.1231428243 1
< 0.1%
129.0623133 1
< 0.1%
128.9810001665 1
< 0.1%
128.3891533 1
< 0.1%
127.8098774 1
< 0.1%
127.8090426 1
< 0.1%
127.8079126 1
< 0.1%
127.8065452 1
< 0.1%
127.731106 1
< 0.1%
127.6895662382 1
< 0.1%

업태구분명정보
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

X좌표값
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct7716
Distinct (%)86.5%
Missing1082
Missing (%)10.8%
Infinite0
Infinite (%)0.0%
Mean199489.59
Minimum155002.68
Maximum392327.14
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:41:57.881148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum155002.68
5-th percentile176704.96
Q1183642.97
median200233.08
Q3211806.21
95-th percentile227872.12
Maximum392327.14
Range237324.46
Interquartile range (IQR)28163.241

Descriptive statistics

Standard deviation17818.233
Coefficient of variation (CV)0.089319113
Kurtosis3.224814
Mean199489.59
Median Absolute Deviation (MAD)13930.816
Skewness0.8275917
Sum1.7790482 × 109
Variance3.1748943 × 108
MonotonicityNot monotonic
2023-12-11T06:41:58.019829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
181108.608529259 23
 
0.2%
191287.508236468 17
 
0.2%
181727.683200933 11
 
0.1%
175751.992022502 10
 
0.1%
195688.283833074 10
 
0.1%
181835.358338529 8
 
0.1%
181258.71259879 8
 
0.1%
180878.941038538 7
 
0.1%
197536.460660019 7
 
0.1%
175470.687063644 7
 
0.1%
Other values (7706) 8810
88.1%
(Missing) 1082
 
10.8%
ValueCountFrequency (%)
155002.676889436 1
< 0.1%
159230.139923355 1
< 0.1%
159910.371904607 1
< 0.1%
160836.025727836 1
< 0.1%
161104.454753353 1
< 0.1%
161346.986707785 1
< 0.1%
161501.932383243 1
< 0.1%
161671.216496793 1
< 0.1%
161723.449144973 1
< 0.1%
161790.051139887 2
< 0.1%
ValueCountFrequency (%)
392327.140005794 1
< 0.1%
387750.774666143 1
< 0.1%
325183.754117655 1
< 0.1%
271978.802251 1
< 0.1%
271751.9004 1
< 0.1%
264905.603427 1
< 0.1%
260872.453305388 1
< 0.1%
260631.13864 1
< 0.1%
258914.046410087 1
< 0.1%
258873.653366293 1
< 0.1%

Y좌표값
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct7715
Distinct (%)86.5%
Missing1082
Missing (%)10.8%
Infinite0
Infinite (%)0.0%
Mean438618.23
Minimum131397.5
Maximum520541.9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:41:58.163153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum131397.5
5-th percentile397425.25
Q1422204.71
median436584.69
Q3458276.34
95-th percentile476360.36
Maximum520541.9
Range389144.4
Interquartile range (IQR)36071.625

Descriptive statistics

Standard deviation23958.724
Coefficient of variation (CV)0.054623184
Kurtosis6.151774
Mean438618.23
Median Absolute Deviation (MAD)16523.994
Skewness-0.52753658
Sum3.9115974 × 109
Variance5.7402046 × 108
MonotonicityNot monotonic
2023-12-11T06:41:58.293062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
424698.61951891 23
 
0.2%
460746.546568849 17
 
0.2%
460855.679685362 11
 
0.1%
429949.927814167 10
 
0.1%
425940.901471747 10
 
0.1%
460894.742344229 8
 
0.1%
424761.274007584 8
 
0.1%
432133.905985364 7
 
0.1%
424612.422892591 7
 
0.1%
426164.742756383 7
 
0.1%
Other values (7705) 8810
88.1%
(Missing) 1082
 
10.8%
ValueCountFrequency (%)
131397.500991 1
< 0.1%
183352.869277213 1
< 0.1%
190883.177447059 1
< 0.1%
236938.165923925 1
< 0.1%
278829.032892769 1
< 0.1%
289310.002689 1
< 0.1%
316050.827105 1
< 0.1%
369954.462640583 1
< 0.1%
380242.0 1
< 0.1%
382781.618927894 1
< 0.1%
ValueCountFrequency (%)
520541.903573 1
< 0.1%
517141.250309034 1
< 0.1%
510763.069095262 1
< 0.1%
510588.365461489 1
< 0.1%
510498.842355602 1
< 0.1%
510450.462276381 1
< 0.1%
510226.176378482 1
< 0.1%
509955.893915324 1
< 0.1%
509911.282875648 1
< 0.1%
509799.506532021 1
< 0.1%
Distinct2628
Distinct (%)26.5%
Missing86
Missing (%)0.9%
Memory size156.2 KiB
2023-12-11T06:41:58.516764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length49
Mean length9.0092798
Min length1

Characters and Unicode

Total characters89318
Distinct characters411
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2128 ?
Unique (%)21.5%

Sample

1st row간판제작,전기재료
2nd row광고물 제작 및 설치
3rd row옥외광고물제작 및 시공
4th row광고물제작
5th row옥외광고업
ValueCountFrequency (%)
제작 2653
 
13.0%
옥외광고물 1819
 
8.9%
1639
 
8.0%
광고물 1107
 
5.4%
광고물제작 1094
 
5.4%
설치 975
 
4.8%
옥외광고업 917
 
4.5%
간판 854
 
4.2%
옥외광고물제작 514
 
2.5%
간판제작 465
 
2.3%
Other values (1811) 8408
41.1%
2023-12-11T06:41:58.888740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10589
 
11.9%
8464
 
9.5%
8415
 
9.4%
7347
 
8.2%
6476
 
7.3%
6170
 
6.9%
4658
 
5.2%
4576
 
5.1%
, 3228
 
3.6%
2692
 
3.0%
Other values (401) 26703
29.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 73947
82.8%
Space Separator 10589
 
11.9%
Other Punctuation 3602
 
4.0%
Uppercase Letter 373
 
0.4%
Close Punctuation 338
 
0.4%
Open Punctuation 323
 
0.4%
Decimal Number 82
 
0.1%
Lowercase Letter 46
 
0.1%
Dash Punctuation 16
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8464
11.4%
8415
11.4%
7347
 
9.9%
6476
 
8.8%
6170
 
8.3%
4658
 
6.3%
4576
 
6.2%
2692
 
3.6%
2314
 
3.1%
2150
 
2.9%
Other values (340) 20685
28.0%
Uppercase Letter
ValueCountFrequency (%)
D 96
25.7%
L 92
24.7%
E 90
24.1%
C 22
 
5.9%
P 16
 
4.3%
V 11
 
2.9%
O 9
 
2.4%
T 8
 
2.1%
A 5
 
1.3%
S 5
 
1.3%
Other values (8) 19
 
5.1%
Lowercase Letter
ValueCountFrequency (%)
n 7
15.2%
p 7
15.2%
e 5
10.9%
s 4
8.7%
g 3
 
6.5%
o 3
 
6.5%
i 3
 
6.5%
d 2
 
4.3%
c 2
 
4.3%
t 2
 
4.3%
Other values (6) 8
17.4%
Other Punctuation
ValueCountFrequency (%)
, 3228
89.6%
. 198
 
5.5%
/ 93
 
2.6%
· 60
 
1.7%
* 16
 
0.4%
' 2
 
0.1%
: 2
 
0.1%
1
 
< 0.1%
1
 
< 0.1%
& 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 22
26.8%
1 20
24.4%
2 18
22.0%
4 6
 
7.3%
3 6
 
7.3%
8 5
 
6.1%
7 3
 
3.7%
9 1
 
1.2%
5 1
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 337
99.7%
] 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 322
99.7%
[ 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
10589
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73946
82.8%
Common 14952
 
16.7%
Latin 419
 
0.5%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8464
11.4%
8415
11.4%
7347
 
9.9%
6476
 
8.8%
6170
 
8.3%
4658
 
6.3%
4576
 
6.2%
2692
 
3.6%
2314
 
3.1%
2150
 
2.9%
Other values (339) 20684
28.0%
Latin
ValueCountFrequency (%)
D 96
22.9%
L 92
22.0%
E 90
21.5%
C 22
 
5.3%
P 16
 
3.8%
V 11
 
2.6%
O 9
 
2.1%
T 8
 
1.9%
n 7
 
1.7%
p 7
 
1.7%
Other values (24) 61
14.6%
Common
ValueCountFrequency (%)
10589
70.8%
, 3228
 
21.6%
) 337
 
2.3%
( 322
 
2.2%
. 198
 
1.3%
/ 93
 
0.6%
· 60
 
0.4%
0 22
 
0.1%
1 20
 
0.1%
2 18
 
0.1%
Other values (17) 65
 
0.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73933
82.8%
ASCII 15309
 
17.1%
None 61
 
0.1%
Compat Jamo 13
 
< 0.1%
Punctuation 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10589
69.2%
, 3228
 
21.1%
) 337
 
2.2%
( 322
 
2.1%
. 198
 
1.3%
D 96
 
0.6%
/ 93
 
0.6%
L 92
 
0.6%
E 90
 
0.6%
0 22
 
0.1%
Other values (48) 242
 
1.6%
Hangul
ValueCountFrequency (%)
8464
11.4%
8415
11.4%
7347
 
9.9%
6476
 
8.8%
6170
 
8.3%
4658
 
6.3%
4576
 
6.2%
2692
 
3.6%
2314
 
3.1%
2150
 
2.9%
Other values (338) 20671
28.0%
None
ValueCountFrequency (%)
· 60
98.4%
1
 
1.6%
Compat Jamo
ValueCountFrequency (%)
13
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-11T06:41:50.854555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.174809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.646678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.123435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.670954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.275131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.977703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.259941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.730286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.209141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.761899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.388958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:51.065771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.335340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.817352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.289541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.858503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.506998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:51.340837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.413842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.895861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.378717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.967438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.591130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:51.428315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.491806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.970161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.459232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.087908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.676645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:51.513897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:48.566773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.045891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:49.550491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.167077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:41:50.757133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:41:58.974739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태구분코드영업상태명소재지우편번호WGS84위도WGS84경도X좌표값Y좌표값
시군명1.0000.3820.3750.8780.9750.9490.9410.938
영업상태구분코드0.3821.0001.0000.0510.1110.1930.0780.101
영업상태명0.3751.0001.0000.0540.1510.1290.0790.067
소재지우편번호0.8780.0510.0541.0000.9760.8120.7840.953
WGS84위도0.9750.1110.1510.9761.0000.8120.7770.999
WGS84경도0.9490.1930.1290.8120.8121.0000.9940.779
X좌표값0.9410.0780.0790.7840.7770.9941.0000.774
Y좌표값0.9380.1010.0670.9530.9990.7790.7741.000
2023-12-11T06:41:59.066728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업상태명시군명
영업상태명1.0000.141
시군명0.1411.000
2023-12-11T06:41:59.144976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업상태구분코드소재지우편번호WGS84위도WGS84경도X좌표값Y좌표값시군명영업상태명
영업상태구분코드1.000-0.0010.020-0.105-0.060-0.0050.1581.000
소재지우편번호-0.0011.000-0.8980.0670.005-0.8950.7050.017
WGS84위도0.020-0.8981.000-0.169-0.1330.9980.8940.049
WGS84경도-0.1050.067-0.1691.0000.997-0.1310.8000.068
X좌표값-0.0600.005-0.1330.9971.000-0.1530.7780.040
Y좌표값-0.005-0.8950.998-0.131-0.1531.0000.7720.028
시군명0.1580.7050.8940.8000.7780.7721.0000.141
영업상태명1.0000.0170.0490.0680.0400.0280.1411.000

Missing values

2023-12-11T06:41:51.667740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:41:51.926462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:41:52.141837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값영업내용
9975포천시엘리온20071029<NA>40폐업20101028<NA><NA><NA><NA>경기도 포천시 소흘읍 이동교리 ***-*번지1118437.803009127.13126<NA>211494.912783477846.875711간판제작,전기재료
4996수원시플러스디자인20180413<NA>20정상<NA><NA><NA>16610경기도 수원시 권선구 효탑로**번길 **-**, ***호 (탑동)경기도 수원시 권선구 탑동 ***-* ***호<NA><NA><NA><NA>197820.150021418394.279623광고물 제작 및 설치
4688성남시뉴한빛광고20000408<NA>40폐업200812197541379<NA><NA>경기도 성남시 중원구 광명로***번길 * (금광동)경기도 성남시 중원구 금광동 ****번지1317537.44794127.170452<NA>215017.031782438444.143983옥외광고물제작 및 시공
7041안양시(주)제일광고기획20210924<NA>20정상<NA>031 474 0024<NA>14057경기도 안양시 동안구 시민대로***번길 ** (관양동)경기도 안양시 동안구 관양동 ***-*<NA><NA><NA><NA>197587.203824432793.712338광고물제작
8959이천시뉴아트20160407<NA>20정상<NA><NA><NA>17406경기도 이천시 모가면 사실로***번길 **-*경기도 이천시 모가면 두미리 ***-*17406<NA><NA><NA>238721.352202407255.104611옥외광고업
1040고양시마루커뮤니케이션2018-09-17<NA>30휴업<NA><NA><NA>10256경기도 고양시 일산동구 지영로 *** (지영동)경기도 고양시 일산동구 지영동 ***-*<NA><NA><NA><NA>184499.95933467969.319835옥외광고물 제작 , 설치
3293부천시죠이풀 에드벌룬20000901<NA>40폐업20001213032 6539390<NA><NA>경기도 부천시 성지로 ** *동 ***호 (원종동,경인아파트)경기도 부천시 원종동 ***-*번지 경인아파트 *동 ***호1441837.524731126.810716<NA>183202.429608446971.461575에어캐릭터.에어광고물제작 및 설치
8922의정부시더블디애드20190411<NA>40폐업20200303<NA><NA>11667경기도 의정부시 호국로****번길 *, *층 (가능동)경기도 의정부시 가능동 ***-*번지1166737.74362127.036352<NA>203138.987142471249.355883간판제작,설치등
2237김포시더블유 디자인20220502<NA>20정상<NA><NA><NA>10068경기도 김포시 김포한강*로***번길 ***, ***호 (마산동)<NA><NA><NA><NA><NA>168482.0459202.0간판제작 및 시공
8779의정부시솜씨방19910415<NA>40폐업20041111031 8446283<NA><NA><NA>경기도 의정부시 가능동 ***-*번지<NA>37.743738127.03254<NA><NA><NA>옥외광고물제작
시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값영업내용
8308용인시사인팟20160108<NA>40폐업20161231<NA><NA>17033경기도 용인시 처인구 모현면 외개일로***번길 **경기도 용인시 처인구 모현읍 일산리 **-**번지1703337.346008127.244147<NA>221567.710569427143.1537옥외광고업
5514수원시(주)동방아트기획20121011<NA>40폐업20200129031 238 7911<NA>16480경기도 수원시 팔달구 경수대로 *** (인계동,창덕빌딩 *층)경기도 수원시 팔달구 인계동 ***-* 창덕빌딩 *층1648037.264062127.025047<NA>202151.379086418012.185544옥외광고물 제작업
322고양시마루에이디20130521<NA>20정상<NA>0319226111<NA>10251경기도 고양시 일산동구 고봉로***번길 ** (설문동)경기도 고양시 일산동구 설문동 ***-*번지1025137.717211126.792629<NA>181650.146356468336.250185옥외광고물 제작,표시,설치,대행
6926안성시디자인씨20091009<NA>20정상<NA>031 618 1203<NA>456820경기도 안성시 공도읍 고무다리길 **경기도 안성시 공도읍 승두리 ***-**번지1756236.992555127.172133<NA>215255.973818387901.725681광고업제작
2944남양주시보람하우징2023-03-07<NA>30휴업<NA><NA><NA>12136경기도 남양주시 진건읍 사릉로***번길 **, *층<NA><NA><NA><NA><NA>216065.633855461610.547984옥외광고 대행(설치,시공,디자인)
2118군포시가득기획20080509<NA>40폐업20090914<NA><NA><NA>경기도 군포시 광정로 **, ***동 *호 (산본동,신산본빌딩)경기도 군포시 산본동 ****-*번지 신산본빌딩 ***동 *호1586537.358423126.930924<NA>193814.926225428497.29983옥외광고물 제작 및 옥외광고물 대행
7780양평군뿔라광고20140820<NA>40폐업20171215<NA><NA>476913경기도 양평군 강상면 강남로 ***경기도 양평군 강상면 교평리 ***-*번지1257137.485498127.481704<NA>242538.381532442706.860188옥외광고물제작설치
8870의정부시21세기 광고기획20140314<NA>40폐업20200110<NA><NA>11747경기도 의정부시 장금로**번길 ** (신곡동)경기도 의정부시 신곡동 ***-**번지1174737.741988127.062128<NA>205412.499198471066.865353옥외광고업(간판,현수막,실사출력,허가대행,광고설치 등)
2234김포시해오름솔루션 주식회사2023-07-11<NA>20정상<NA><NA><NA>10027경기도 김포시 대곶면 대곶로 ***-**경기도 김포시 대곶면 송마리 ***-*<NA><NA><NA><NA>160836.025728462681.692378시트지 인쇄 및 제품 제작
5143수원시디홀20170109<NA>40폐업20210713<NA><NA>16436경기도 수원시 팔달구 화양로**번길 * (화서동)경기도 수원시 팔달구 화서동 ***-**<NA><NA><NA><NA>199588.075838420049.715299옥외광고물 제작업

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태구분코드영업상태명폐업일자소재지시설전화번호도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도X좌표값Y좌표값영업내용# duplicates
6성남시광고진2000011740폐업20021113<NA><NA>경기도 성남시 중원구 희망로 ***-* (금광동,통보*차 상가 *)경기도 성남시 중원구 금광동 **번지 통보*차 상가 *1318137.44383127.158872213994.852019437984.05559옥외광고물 제작 및 시공3
0고양시(주)고려종합광고기획2006011970취소20111223<NA><NA><NA>경기도 고양시 덕양구 향동동 **번지<NA>37.608775126.896594190803.570295456286.725313옥외광고업2
1고양시일오공사2002032240폐업20100803031921 1504<NA>경기도 고양시 일산동구 무궁화로***번길 ** (정발산동)경기도 고양시 일산동구 정발산동 ****-*번지1035837.672267126.78082180598.007158463351.615808옥외광고업2
2광주시본디자인2008052940폐업<NA>031 7621246<NA>경기도 광주시 중앙로 *** (송정동)경기도 광주시 송정동 ***-****번지1273937.418748127.256557222645.739428435221.444528광고물제작2
3군포시보라콤2002010440폐업20020822031 4525058<NA>경기도 군포시 번영로***번길 ** (금정동)경기도 군포시 금정동 ***-*번지1582837.361499126.936486<NA><NA>옥외광고매체 대행 및 제조2
4동두천시이조광고2001122240폐업<NA>031 8663891483030경기도 동두천시 중앙로***번길 ** (생연동)경기도 동두천시 생연동 ***-***번지1132937.906468127.055426204830.850294489332.959654옥외광고물제작2
5성남시(주)에솔애드1999082040폐업20041130<NA>462807경기도 성남시 중원구 사기막골로***번길 ** (상대원동)경기도 성남시 중원구 상대원동 ***-*번지1320137.442135127.172406215192.368657437799.151917옥외광고물제작 및 시공2
7성남시중앙애드타워2021-10-1540폐업2023-04-24031764203213504경기도 성남시 분당구 벌말로 **, 일심조합상가 ***-*호 (야탑동)경기도 성남시 분당구 야탑동 ***-* 일심조합상가<NA><NA><NA>212139.179239434331.375255옥외광고물제작시공2
8용인시이지플랜2002061140폐업20071010<NA><NA><NA>경기도 용인시 수지구 성복동 **번지<NA>37.313512127.081938<NA><NA>옥외광고물 제작 및 설치2