Dataset statistics
Number of variables | 22 |
---|---|
Number of observations | 10000 |
Missing cells | 96808 |
Missing cells (%) | 44.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.8 MiB |
Average record size in memory | 193.0 B |
Variable types
Categorical | 5 |
---|---|
Text | 7 |
DateTime | 2 |
Unsupported | 3 |
Numeric | 5 |
Dataset
Description | 쓰레기종량제 봉투판매업체 현황 |
---|---|
Author | 행정안전부 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=2GTL6EM012XZ8855K86E174924&infSeq=1 |
영업상태구분코드 is highly imbalanced (85.6%) | Imbalance |
영업상태명 is highly imbalanced (70.7%) | Imbalance |
업소구분명정보 is highly imbalanced (77.9%) | Imbalance |
항목값정보 is highly imbalanced (67.7%) | Imbalance |
인허가일자 has 1092 (10.9%) missing values | Missing |
인허가취소일자 has 10000 (100.0%) missing values | Missing |
폐업일자 has 8722 (87.2%) missing values | Missing |
소재지시설전화번호 has 9927 (99.3%) missing values | Missing |
소재지면적정보 has 10000 (100.0%) missing values | Missing |
도로명우편번호 has 9418 (94.2%) missing values | Missing |
소재지도로명주소 has 2632 (26.3%) missing values | Missing |
소재지우편번호 has 349 (3.5%) missing values | Missing |
WGS84위도 has 1736 (17.4%) missing values | Missing |
WGS84경도 has 1736 (17.4%) missing values | Missing |
업태구분명정보 has 10000 (100.0%) missing values | Missing |
X좌표값 has 9432 (94.3%) missing values | Missing |
Y좌표값 has 9432 (94.3%) missing values | Missing |
소재지주소 has 9481 (94.8%) missing values | Missing |
신청일자 has 2824 (28.2%) missing values | Missing |
신청일자 is highly skewed (γ1 = -33.4011233) | Skewed |
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
소재지면적정보 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
업태구분명정보 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-10 21:06:58.009284 |
---|---|
Analysis finished | 2023-12-10 21:06:59.971465 |
Duration | 1.96 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시군명
Categorical
Distinct | 30 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
고양시 | |
---|---|
용인시 | |
의정부시 | |
평택시 | |
김포시 | |
Other values (25) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.1193 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 남양주시 |
---|---|
2nd row | 안양시 |
3rd row | 김포시 |
4th row | 의왕시 |
5th row | 동두천시 |
Common Values
Value | Count | Frequency (%) |
고양시 | 1277 | |
용인시 | 949 | 9.5% |
의정부시 | 829 | 8.3% |
평택시 | 819 | 8.2% |
김포시 | 790 | 7.9% |
파주시 | 679 | 6.8% |
화성시 | 637 | 6.4% |
안산시 | 591 | 5.9% |
여주시 | 343 | 3.4% |
광명시 | 331 | 3.3% |
Other values (20) | 2755 |
Length
Value | Count | Frequency (%) |
고양시 | 1277 | |
용인시 | 949 | 9.5% |
의정부시 | 829 | 8.3% |
평택시 | 819 | 8.2% |
김포시 | 790 | 7.9% |
파주시 | 679 | 6.8% |
화성시 | 637 | 6.4% |
안산시 | 591 | 5.9% |
여주시 | 343 | 3.4% |
광명시 | 331 | 3.3% |
Other values (20) | 2755 |
사업장명
Text
Distinct | 7750 |
---|---|
Distinct (%) | 77.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
씨유 | 356 | 2.9% |
gs25 | 321 | 2.6% |
세븐일레븐 | 256 | 2.1% |
훼미리마트 | 105 | 0.8% |
이마트24 | 78 | 0.6% |
지에스25 | 75 | 0.6% |
미니스톱 | 65 | 0.5% |
주)코리아세븐 | 62 | 0.5% |
현대슈퍼 | 46 | 0.4% |
위드미 | 45 | 0.4% |
Other values (7820) | 11008 |
Most occurring characters
Value | Count | Frequency (%) |
마 | 2834 | 4.2% |
점 | 2829 | 4.2% |
트 | 2593 | 3.9% |
퍼 | 2475 | 3.7% |
2425 | 3.6% | |
슈 | 2253 | 3.4% |
2 | 1098 | 1.6% |
리 | 1066 | 1.6% |
) | 1013 | 1.5% |
( | 1010 | 1.5% |
Other values (741) | 47576 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 57653 | |
Decimal Number | 2544 | 3.8% |
Space Separator | 2425 | 3.6% |
Uppercase Letter | 2123 | 3.2% |
Close Punctuation | 1013 | 1.5% |
Open Punctuation | 1010 | 1.5% |
Lowercase Letter | 297 | 0.4% |
Other Punctuation | 55 | 0.1% |
Dash Punctuation | 41 | 0.1% |
Other Symbol | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
마 | 2834 | 4.9% |
점 | 2829 | 4.9% |
트 | 2593 | 4.5% |
퍼 | 2475 | 4.3% |
슈 | 2253 | 3.9% |
리 | 1066 | 1.8% |
스 | 842 | 1.5% |
주 | 842 | 1.5% |
유 | 838 | 1.5% |
지 | 781 | 1.4% |
Other values (671) | 40300 |
Uppercase Letter
Value | Count | Frequency (%) |
G | 671 | |
S | 627 | |
C | 142 | 6.7% |
L | 113 | 5.3% |
U | 95 | 4.5% |
K | 74 | 3.5% |
D | 53 | 2.5% |
A | 51 | 2.4% |
I | 44 | 2.1% |
M | 38 | 1.8% |
Other values (15) | 215 | 10.1% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 34 | 11.4% |
a | 32 | 10.8% |
t | 24 | 8.1% |
y | 22 | 7.4% |
s | 20 | 6.7% |
u | 20 | 6.7% |
m | 18 | 6.1% |
g | 16 | 5.4% |
k | 13 | 4.4% |
r | 13 | 4.4% |
Other values (12) | 85 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1098 | |
5 | 832 | |
4 | 189 | 7.4% |
1 | 145 | 5.7% |
3 | 90 | 3.5% |
6 | 58 | 2.3% |
9 | 47 | 1.8% |
0 | 31 | 1.2% |
7 | 28 | 1.1% |
8 | 26 | 1.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 29 | |
, | 11 | 20.0% |
/ | 6 | 10.9% |
@ | 4 | 7.3% |
· | 3 | 5.5% |
& | 1 | 1.8% |
\ | 1 | 1.8% |
Space Separator
Value | Count | Frequency (%) |
2425 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1013 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1010 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 41 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 10 |
Math Symbol
Value | Count | Frequency (%) |
∥ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 57663 | |
Common | 7089 | 10.6% |
Latin | 2420 | 3.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
마 | 2834 | 4.9% |
점 | 2829 | 4.9% |
트 | 2593 | 4.5% |
퍼 | 2475 | 4.3% |
슈 | 2253 | 3.9% |
리 | 1066 | 1.8% |
스 | 842 | 1.5% |
주 | 842 | 1.5% |
유 | 838 | 1.5% |
지 | 781 | 1.4% |
Other values (672) | 40310 |
Latin
Value | Count | Frequency (%) |
G | 671 | |
S | 627 | |
C | 142 | 5.9% |
L | 113 | 4.7% |
U | 95 | 3.9% |
K | 74 | 3.1% |
D | 53 | 2.2% |
A | 51 | 2.1% |
I | 44 | 1.8% |
M | 38 | 1.6% |
Other values (37) | 512 |
Common
Value | Count | Frequency (%) |
2425 | ||
2 | 1098 | |
) | 1013 | |
( | 1010 | |
5 | 832 | 11.7% |
4 | 189 | 2.7% |
1 | 145 | 2.0% |
3 | 90 | 1.3% |
6 | 58 | 0.8% |
9 | 47 | 0.7% |
Other values (12) | 182 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 57653 | |
ASCII | 9505 | 14.2% |
None | 13 | < 0.1% |
Math Operators | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
마 | 2834 | 4.9% |
점 | 2829 | 4.9% |
트 | 2593 | 4.5% |
퍼 | 2475 | 4.3% |
슈 | 2253 | 3.9% |
리 | 1066 | 1.8% |
스 | 842 | 1.5% |
주 | 842 | 1.5% |
유 | 838 | 1.5% |
지 | 781 | 1.4% |
Other values (671) | 40300 |
ASCII
Value | Count | Frequency (%) |
2425 | ||
2 | 1098 | |
) | 1013 | |
( | 1010 | |
5 | 832 | 8.8% |
G | 671 | 7.1% |
S | 627 | 6.6% |
4 | 189 | 2.0% |
1 | 145 | 1.5% |
C | 142 | 1.5% |
Other values (57) | 1353 |
None
Value | Count | Frequency (%) |
㈜ | 10 | |
· | 3 | 23.1% |
Math Operators
Value | Count | Frequency (%) |
∥ | 1 |
인허가일자
Date
MISSING
 
Distinct | 3827 |
---|---|
Distinct (%) | 43.0% |
Missing | 1092 |
Missing (%) | 10.9% |
Memory size | 156.2 KiB |
Minimum | 1901-07-25 00:00:00 |
---|---|
Maximum | 2023-12-05 00:00:00 |
인허가취소일자
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
영업상태구분코드
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
11 | 484 |
2 | 93 |
5 | 6 |
4 | 4 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8714 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9410 | |
11 | 484 | 4.8% |
2 | 93 | 0.9% |
5 | 6 | 0.1% |
4 | 4 | < 0.1% |
0 | 3 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9410 | |
11 | 484 | 4.8% |
2 | 93 | 0.9% |
5 | 6 | 0.1% |
4 | 4 | < 0.1% |
0 | 3 | < 0.1% |
영업상태명
Categorical
IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
운영중 | |
---|---|
폐업 등 | |
영업 | 484 |
폐업 | 93 |
제외사항 | 6 |
Other values (3) | 8 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0592 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 운영중 |
---|---|
2nd row | 운영중 |
3rd row | 운영중 |
4th row | 운영중 |
5th row | 운영중 |
Common Values
Value | Count | Frequency (%) |
운영중 | 8246 | |
폐업 등 | 1163 | 11.6% |
영업 | 484 | 4.8% |
폐업 | 93 | 0.9% |
제외사항 | 6 | 0.1% |
폐쇄 | 4 | < 0.1% |
<NA> | 3 | < 0.1% |
휴업 등 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
운영중 | 8246 | |
폐업 | 1256 | 11.3% |
등 | 1164 | 10.4% |
영업 | 484 | 4.3% |
제외사항 | 6 | 0.1% |
폐쇄 | 4 | < 0.1% |
na | 3 | < 0.1% |
휴업 | 1 | < 0.1% |
폐업일자
Date
MISSING
 
Distinct | 508 |
---|---|
Distinct (%) | 39.7% |
Missing | 8722 |
Missing (%) | 87.2% |
Memory size | 156.2 KiB |
Minimum | 1994-12-30 00:00:00 |
---|---|
Maximum | 2023-12-04 00:00:00 |
소재지시설전화번호
Text
MISSING
 
Distinct | 68 |
---|---|
Distinct (%) | 93.2% |
Missing | 9927 |
Missing (%) | 99.3% |
Memory size | 156.2 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 10.849315 |
Min length | 7 |
Characters and Unicode
Total characters | 792 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 66 ? |
---|---|
Unique (%) | 90.4% |
Sample
1st row | 031-932-5400 |
---|---|
2nd row | 0314971519 |
3rd row | 0317624086 |
4th row | 1577-0711 |
5th row | 529-5999 |
Value | Count | Frequency (%) |
1577-0711 | 4 | 5.5% |
02-2290-5937 | 3 | 4.1% |
02-6954-0301 | 1 | 1.4% |
03151893770 | 1 | 1.4% |
031-550-8774 | 1 | 1.4% |
02-381-0084 | 1 | 1.4% |
02-6332-9000 | 1 | 1.4% |
031-289-0604 | 1 | 1.4% |
070-8950-3063 | 1 | 1.4% |
031-462-4449 | 1 | 1.4% |
Other values (58) | 58 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 126 | |
- | 107 | |
1 | 91 | |
3 | 82 | |
9 | 72 | |
2 | 63 | |
7 | 62 | |
5 | 52 | |
8 | 47 | 5.9% |
6 | 47 | 5.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 685 | |
Dash Punctuation | 107 | 13.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 126 | |
1 | 91 | |
3 | 82 | |
9 | 72 | |
2 | 63 | |
7 | 62 | |
5 | 52 | |
8 | 47 | 6.9% |
6 | 47 | 6.9% |
4 | 43 | 6.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 107 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 792 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 126 | |
- | 107 | |
1 | 91 | |
3 | 82 | |
9 | 72 | |
2 | 63 | |
7 | 62 | |
5 | 52 | |
8 | 47 | 5.9% |
6 | 47 | 5.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 792 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 126 | |
- | 107 | |
1 | 91 | |
3 | 82 | |
9 | 72 | |
2 | 63 | |
7 | 62 | |
5 | 52 | |
8 | 47 | 5.9% |
6 | 47 | 5.9% |
소재지면적정보
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
도로명우편번호
Text
MISSING
 
Distinct | 419 |
---|---|
Distinct (%) | 72.0% |
Missing | 9418 |
Missing (%) | 94.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
10071 | 7 | 1.2% |
11813 | 6 | 1.0% |
18478 | 4 | 0.7% |
12248 | 4 | 0.7% |
12473 | 4 | 0.7% |
10111 | 4 | 0.7% |
12438 | 4 | 0.7% |
10584 | 4 | 0.7% |
10362 | 4 | 0.7% |
11812 | 4 | 0.7% |
Other values (409) | 537 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 803 | |
0 | 433 | |
4 | 352 | |
2 | 319 | 10.8% |
8 | 271 | 9.1% |
5 | 189 | 6.4% |
3 | 170 | 5.7% |
7 | 161 | 5.4% |
6 | 143 | 4.8% |
9 | 95 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2936 | |
Dash Punctuation | 26 | 0.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 803 | |
0 | 433 | |
4 | 352 | |
2 | 319 | 10.9% |
8 | 271 | 9.2% |
5 | 189 | 6.4% |
3 | 170 | 5.8% |
7 | 161 | 5.5% |
6 | 143 | 4.9% |
9 | 95 | 3.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 26 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2962 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 803 | |
0 | 433 | |
4 | 352 | |
2 | 319 | 10.8% |
8 | 271 | 9.1% |
5 | 189 | 6.4% |
3 | 170 | 5.7% |
7 | 161 | 5.4% |
6 | 143 | 4.8% |
9 | 95 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2962 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 803 | |
0 | 433 | |
4 | 352 | |
2 | 319 | 10.8% |
8 | 271 | 9.1% |
5 | 189 | 6.4% |
3 | 170 | 5.7% |
7 | 161 | 5.4% |
6 | 143 | 4.8% |
9 | 95 | 3.2% |
소재지도로명주소
Text
MISSING
 
Distinct | 6914 |
---|---|
Distinct (%) | 93.8% |
Missing | 2632 |
Missing (%) | 26.3% |
Memory size | 156.2 KiB |
Length
Max length | 80 |
---|---|
Median length | 64 |
Mean length | 25.283116 |
Min length | 13 |
Characters and Unicode
Total characters | 186286 |
---|---|
Distinct characters | 577 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 6494 ? |
---|---|
Unique (%) | 88.1% |
Sample
1st row | 경기도 김포시 풍년로 9, 1층 104호 (사우동, 풍년마을삼보아파트 상가) |
---|---|
2nd row | 경기도 용인시 수지구 신봉2로14번길 8, 106호 (신봉동,백산빌딩 가동 1층) |
3rd row | 경기도 고양시 일산서구 성저로 47 (대화동,성저마을) |
4th row | 경기도 의정부시 본원로46번길 21 (녹양동) |
5th row | 경기도 안산시 상록구 본오로 66 (본오동) |
Value | Count | Frequency (%) |
경기도 | 7363 | 18.3% |
고양시 | 1024 | 2.5% |
의정부시 | 769 | 1.9% |
용인시 | 683 | 1.7% |
김포시 | 610 | 1.5% |
평택시 | 608 | 1.5% |
1층 | 578 | 1.4% |
화성시 | 559 | 1.4% |
일산동구 | 505 | 1.3% |
안산시 | 482 | 1.2% |
Other values (6910) | 27125 |
Most occurring characters
Value | Count | Frequency (%) |
34103 | 18.3% | |
1 | 8411 | 4.5% |
기 | 7782 | 4.2% |
경 | 7555 | 4.1% |
도 | 7554 | 4.1% |
시 | 7389 | 4.0% |
로 | 6650 | 3.6% |
동 | 5285 | 2.8% |
2 | 4341 | 2.3% |
) | 3410 | 1.8% |
Other values (567) | 93806 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 109339 | |
Space Separator | 34103 | 18.3% |
Decimal Number | 31499 | 16.9% |
Close Punctuation | 3410 | 1.8% |
Open Punctuation | 3409 | 1.8% |
Other Punctuation | 2824 | 1.5% |
Dash Punctuation | 1415 | 0.8% |
Uppercase Letter | 224 | 0.1% |
Math Symbol | 45 | < 0.1% |
Lowercase Letter | 11 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 7782 | 7.1% |
경 | 7555 | 6.9% |
도 | 7554 | 6.9% |
시 | 7389 | 6.8% |
로 | 6650 | 6.1% |
동 | 5285 | 4.8% |
길 | 3288 | 3.0% |
구 | 2667 | 2.4% |
양 | 2319 | 2.1% |
번 | 2283 | 2.1% |
Other values (515) | 56567 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 63 | |
A | 30 | |
S | 16 | 7.1% |
C | 15 | 6.7% |
I | 12 | 5.4% |
L | 12 | 5.4% |
D | 10 | 4.5% |
M | 9 | 4.0% |
R | 8 | 3.6% |
G | 7 | 3.1% |
Other values (13) | 42 |
Decimal Number
Value | Count | Frequency (%) |
1 | 8411 | |
2 | 4341 | |
3 | 3117 | 9.9% |
0 | 3098 | 9.8% |
4 | 2522 | 8.0% |
5 | 2374 | 7.5% |
6 | 2186 | 6.9% |
7 | 1965 | 6.2% |
8 | 1854 | 5.9% |
9 | 1631 | 5.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2791 | |
. | 16 | 0.6% |
· | 6 | 0.2% |
& | 4 | 0.1% |
@ | 4 | 0.1% |
/ | 2 | 0.1% |
# | 1 | < 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 4 | |
a | 2 | |
b | 2 | |
c | 2 | |
k | 1 | 9.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 5 | |
Ⅱ | 2 | 28.6% |
Space Separator
Value | Count | Frequency (%) |
34103 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3410 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3409 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1415 |
Math Symbol
Value | Count | Frequency (%) |
~ | 45 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 109339 | |
Common | 76705 | |
Latin | 242 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 7782 | 7.1% |
경 | 7555 | 6.9% |
도 | 7554 | 6.9% |
시 | 7389 | 6.8% |
로 | 6650 | 6.1% |
동 | 5285 | 4.8% |
길 | 3288 | 3.0% |
구 | 2667 | 2.4% |
양 | 2319 | 2.1% |
번 | 2283 | 2.1% |
Other values (515) | 56567 |
Latin
Value | Count | Frequency (%) |
B | 63 | |
A | 30 | |
S | 16 | 6.6% |
C | 15 | 6.2% |
I | 12 | 5.0% |
L | 12 | 5.0% |
D | 10 | 4.1% |
M | 9 | 3.7% |
R | 8 | 3.3% |
G | 7 | 2.9% |
Other values (20) | 60 |
Common
Value | Count | Frequency (%) |
34103 | ||
1 | 8411 | 11.0% |
2 | 4341 | 5.7% |
) | 3410 | 4.4% |
( | 3409 | 4.4% |
3 | 3117 | 4.1% |
0 | 3098 | 4.0% |
, | 2791 | 3.6% |
4 | 2522 | 3.3% |
5 | 2374 | 3.1% |
Other values (12) | 9129 | 11.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 109339 | |
ASCII | 76934 | |
Number Forms | 7 | < 0.1% |
None | 6 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
34103 | ||
1 | 8411 | 10.9% |
2 | 4341 | 5.6% |
) | 3410 | 4.4% |
( | 3409 | 4.4% |
3 | 3117 | 4.1% |
0 | 3098 | 4.0% |
, | 2791 | 3.6% |
4 | 2522 | 3.3% |
5 | 2374 | 3.1% |
Other values (39) | 9358 | 12.2% |
Hangul
Value | Count | Frequency (%) |
기 | 7782 | 7.1% |
경 | 7555 | 6.9% |
도 | 7554 | 6.9% |
시 | 7389 | 6.8% |
로 | 6650 | 6.1% |
동 | 5285 | 4.8% |
길 | 3288 | 3.0% |
구 | 2667 | 2.4% |
양 | 2319 | 2.1% |
번 | 2283 | 2.1% |
Other values (515) | 56567 |
None
Value | Count | Frequency (%) |
· | 6 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 5 | |
Ⅱ | 2 | 28.6% |
소재지지번주소
Text
Distinct | 9395 |
---|---|
Distinct (%) | 94.2% |
Missing | 27 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Length
Max length | 66 |
---|---|
Median length | 53 |
Mean length | 24.273338 |
Min length | 10 |
Characters and Unicode
Total characters | 242078 |
---|---|
Distinct characters | 570 |
Distinct categories | 12 ? |
Distinct scripts | 4 ? |
Distinct blocks | 6 ? |
Unique
Unique | 8983 ? |
---|---|
Unique (%) | 90.1% |
Sample
1st row | 경기도 남양주시 오남읍 양지리 95-3 번지 ,4 |
---|---|
2nd row | 경기도 안양시 동안구 관양동 1463-1 번지 |
3rd row | 경기도 김포시 사우동 856번지 풍년마을삼보아파트 상가 1층 104호 |
4th row | 경기도 의왕시 삼동 150-10번지 |
5th row | 경기도 동두천시 생연동 790-2 번지 |
Value | Count | Frequency (%) |
경기도 | 9968 | 19.0% |
번지 | 1471 | 2.8% |
고양시 | 1277 | 2.4% |
용인시 | 944 | 1.8% |
의정부시 | 822 | 1.6% |
평택시 | 819 | 1.6% |
김포시 | 788 | 1.5% |
파주시 | 678 | 1.3% |
화성시 | 635 | 1.2% |
안산시 | 591 | 1.1% |
Other values (11081) | 34540 |
Most occurring characters
Value | Count | Frequency (%) |
45529 | ||
1 | 10847 | 4.5% |
기 | 10378 | 4.3% |
도 | 10261 | 4.2% |
경 | 9999 | 4.1% |
시 | 9702 | 4.0% |
동 | 9562 | 3.9% |
지 | 8958 | 3.7% |
번 | 7486 | 3.1% |
- | 6635 | 2.7% |
Other values (560) | 112721 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 143425 | |
Space Separator | 45529 | 18.8% |
Decimal Number | 45462 | 18.8% |
Dash Punctuation | 6635 | 2.7% |
Uppercase Letter | 435 | 0.2% |
Other Punctuation | 332 | 0.1% |
Open Punctuation | 79 | < 0.1% |
Close Punctuation | 79 | < 0.1% |
Math Symbol | 58 | < 0.1% |
Lowercase Letter | 37 | < 0.1% |
Other values (2) | 7 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 10378 | 7.2% |
도 | 10261 | 7.2% |
경 | 9999 | 7.0% |
시 | 9702 | 6.8% |
동 | 9562 | 6.7% |
지 | 8958 | 6.2% |
번 | 7486 | 5.2% |
구 | 3425 | 2.4% |
양 | 3070 | 2.1% |
산 | 2790 | 1.9% |
Other values (507) | 67794 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 127 | |
A | 95 | |
L | 30 | 6.9% |
C | 25 | 5.7% |
I | 19 | 4.4% |
S | 17 | 3.9% |
D | 16 | 3.7% |
T | 15 | 3.4% |
G | 12 | 2.8% |
P | 11 | 2.5% |
Other values (13) | 68 |
Decimal Number
Value | Count | Frequency (%) |
1 | 10847 | |
2 | 5164 | |
0 | 4781 | |
3 | 4288 | 9.4% |
4 | 4100 | 9.0% |
5 | 3618 | 8.0% |
6 | 3484 | 7.7% |
7 | 3423 | 7.5% |
8 | 3163 | 7.0% |
9 | 2594 | 5.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 235 | |
@ | 40 | 12.0% |
. | 32 | 9.6% |
/ | 14 | 4.2% |
· | 6 | 1.8% |
& | 5 | 1.5% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 20 | |
c | 9 | |
e | 4 | 10.8% |
l | 2 | 5.4% |
p | 1 | 2.7% |
b | 1 | 2.7% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 3 | |
Ⅰ | 3 |
Space Separator
Value | Count | Frequency (%) |
45529 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6635 |
Open Punctuation
Value | Count | Frequency (%) |
( | 79 |
Close Punctuation
Value | Count | Frequency (%) |
) | 79 |
Math Symbol
Value | Count | Frequency (%) |
~ | 58 |
Other Symbol
Value | Count | Frequency (%) |
ⓐ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 143424 | |
Common | 98175 | |
Latin | 478 | 0.2% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 10378 | 7.2% |
도 | 10261 | 7.2% |
경 | 9999 | 7.0% |
시 | 9702 | 6.8% |
동 | 9562 | 6.7% |
지 | 8958 | 6.2% |
번 | 7486 | 5.2% |
구 | 3425 | 2.4% |
양 | 3070 | 2.1% |
산 | 2790 | 1.9% |
Other values (506) | 67793 |
Latin
Value | Count | Frequency (%) |
B | 127 | |
A | 95 | |
L | 30 | 6.3% |
C | 25 | 5.2% |
a | 20 | 4.2% |
I | 19 | 4.0% |
S | 17 | 3.6% |
D | 16 | 3.3% |
T | 15 | 3.1% |
G | 12 | 2.5% |
Other values (21) | 102 |
Common
Value | Count | Frequency (%) |
45529 | ||
1 | 10847 | 11.0% |
- | 6635 | 6.8% |
2 | 5164 | 5.3% |
0 | 4781 | 4.9% |
3 | 4288 | 4.4% |
4 | 4100 | 4.2% |
5 | 3618 | 3.7% |
6 | 3484 | 3.5% |
7 | 3423 | 3.5% |
Other values (12) | 6306 | 6.4% |
Han
Value | Count | Frequency (%) |
蘭 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 143424 | |
ASCII | 98640 | |
None | 6 | < 0.1% |
Number Forms | 6 | < 0.1% |
CJK | 1 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
45529 | ||
1 | 10847 | 11.0% |
- | 6635 | 6.7% |
2 | 5164 | 5.2% |
0 | 4781 | 4.8% |
3 | 4288 | 4.3% |
4 | 4100 | 4.2% |
5 | 3618 | 3.7% |
6 | 3484 | 3.5% |
7 | 3423 | 3.5% |
Other values (39) | 6771 | 6.9% |
Hangul
Value | Count | Frequency (%) |
기 | 10378 | 7.2% |
도 | 10261 | 7.2% |
경 | 9999 | 7.0% |
시 | 9702 | 6.8% |
동 | 9562 | 6.7% |
지 | 8958 | 6.2% |
번 | 7486 | 5.2% |
구 | 3425 | 2.4% |
양 | 3070 | 2.1% |
산 | 2790 | 1.9% |
Other values (506) | 67793 |
None
Value | Count | Frequency (%) |
· | 6 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 3 | |
Ⅰ | 3 |
CJK
Value | Count | Frequency (%) |
蘭 | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓐ | 1 |
소재지우편번호
Text
MISSING
 
Distinct | 2387 |
---|---|
Distinct (%) | 24.7% |
Missing | 349 |
Missing (%) | 3.5% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
469800 | 106 | 1.1% |
459010 | 86 | 0.9% |
467010 | 85 | 0.9% |
445160 | 67 | 0.7% |
412210 | 58 | 0.6% |
449840 | 57 | 0.6% |
425030 | 54 | 0.6% |
447010 | 54 | 0.6% |
450152 | 50 | 0.5% |
447060 | 50 | 0.5% |
Other values (2377) | 8984 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 9959 | |
4 | 9882 | |
1 | 9422 | |
8 | 5371 | |
2 | 4185 | |
5 | 3729 | 6.8% |
6 | 3419 | 6.2% |
3 | 3406 | 6.2% |
7 | 2848 | 5.2% |
9 | 2483 | 4.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 54704 | |
Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 9959 | |
4 | 9882 | |
1 | 9422 | |
8 | 5371 | |
2 | 4185 | |
5 | 3729 | 6.8% |
6 | 3419 | 6.2% |
3 | 3406 | 6.2% |
7 | 2848 | 5.2% |
9 | 2483 | 4.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 54723 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 9959 | |
4 | 9882 | |
1 | 9422 | |
8 | 5371 | |
2 | 4185 | |
5 | 3729 | 6.8% |
6 | 3419 | 6.2% |
3 | 3406 | 6.2% |
7 | 2848 | 5.2% |
9 | 2483 | 4.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 54723 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 9959 | |
4 | 9882 | |
1 | 9422 | |
8 | 5371 | |
2 | 4185 | |
5 | 3729 | 6.8% |
6 | 3419 | 6.2% |
3 | 3406 | 6.2% |
7 | 2848 | 5.2% |
9 | 2483 | 4.5% |
WGS84위도
Real number (ℝ)
MISSING
 
Distinct | 7218 |
---|---|
Distinct (%) | 87.3% |
Missing | 1736 |
Missing (%) | 17.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.468268 |
Minimum | 36.921743 |
---|---|
Maximum | 38.213767 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 36.921743 |
---|---|
5-th percentile | 37.012879 |
Q1 | 37.279743 |
median | 37.476311 |
Q3 | 37.67931 |
95-th percentile | 37.819146 |
Maximum | 38.213767 |
Range | 1.2920243 |
Interquartile range (IQR) | 0.39956628 |
Descriptive statistics
Standard deviation | 0.25477914 |
---|---|
Coefficient of variation (CV) | 0.0067998644 |
Kurtosis | -0.97002232 |
Mean | 37.468268 |
Median Absolute Deviation (MAD) | 0.19975365 |
Skewness | -0.13127901 |
Sum | 309637.77 |
Variance | 0.064912412 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.6465071802 | 8 | 0.1% |
37.7286592752 | 6 | 0.1% |
37.6667902747 | 5 | 0.1% |
37.6814913143 | 5 | 0.1% |
37.4718872149 | 5 | 0.1% |
37.3356619393 | 5 | 0.1% |
37.0682181092 | 4 | < 0.1% |
37.732655477 | 4 | < 0.1% |
37.3217382997 | 4 | < 0.1% |
37.3097701653 | 4 | < 0.1% |
Other values (7208) | 8214 | |
(Missing) | 1736 | 17.4% |
Value | Count | Frequency (%) |
36.9217429025 | 1 | |
36.9364498364 | 1 | |
36.9397947684 | 1 | |
36.9445114765 | 1 | |
36.9459060522 | 1 | |
36.9468639658 | 2 | |
36.9494228962 | 1 | |
36.9500175829 | 1 | |
36.9554012954 | 1 | |
36.9570704965 | 1 |
Value | Count | Frequency (%) |
38.2137672111 | 1 | |
38.2125136135 | 1 | |
38.1872547744 | 1 | |
38.1864225091 | 1 | |
38.1862829672 | 1 | |
38.1856598096 | 1 | |
38.1855182519 | 1 | |
38.1839826981 | 1 | |
38.1795199944 | 1 | |
38.1663245972 | 1 |
WGS84경도
Real number (ℝ)
MISSING
 
Distinct | 7218 |
---|---|
Distinct (%) | 87.3% |
Missing | 1736 |
Missing (%) | 17.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.99784 |
Minimum | 126.53018 |
---|---|
Maximum | 127.75716 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 126.53018 |
---|---|
5-th percentile | 126.68339 |
Q1 | 126.82983 |
median | 127.03429 |
Q3 | 127.09153 |
95-th percentile | 127.48 |
Maximum | 127.75716 |
Range | 1.2269789 |
Interquartile range (IQR) | 0.26170129 |
Descriptive statistics
Standard deviation | 0.2233501 |
---|---|
Coefficient of variation (CV) | 0.0017586921 |
Kurtosis | 0.74165296 |
Mean | 126.99784 |
Median Absolute Deviation (MAD) | 0.15905649 |
Skewness | 0.77968843 |
Sum | 1049510.2 |
Variance | 0.049885269 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
126.6833852938 | 8 | 0.1% |
127.0599639933 | 6 | 0.1% |
126.7661835316 | 5 | 0.1% |
126.7808951307 | 5 | 0.1% |
126.8529110316 | 5 | 0.1% |
127.0933785874 | 5 | 0.1% |
127.0614238428 | 4 | < 0.1% |
127.0857586661 | 4 | < 0.1% |
126.9550798261 | 4 | < 0.1% |
127.0870405665 | 4 | < 0.1% |
Other values (7208) | 8214 | |
(Missing) | 1736 | 17.4% |
Value | Count | Frequency (%) |
126.5301839126 | 1 | |
126.5328963829 | 1 | |
126.542553238 | 1 | |
126.5464865874 | 1 | |
126.5469890908 | 1 | |
126.5485079113 | 2 | |
126.5507753674 | 1 | |
126.5522819959 | 1 | |
126.5530273259 | 1 | |
126.5537542192 | 1 |
Value | Count | Frequency (%) |
127.7571627832 | 1 | |
127.754000888 | 1 | |
127.752734759 | 1 | |
127.7482733147 | 1 | |
127.7459308727 | 1 | |
127.7284008767 | 1 | |
127.7279077557 | 1 | |
127.7271861579 | 1 | |
127.7155378772 | 1 | |
127.7107033595 | 1 |
업태구분명정보
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
X좌표값
Real number (ℝ)
MISSING
 
Distinct | 518 |
---|---|
Distinct (%) | 91.2% |
Missing | 9432 |
Missing (%) | 94.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 200137.8 |
Minimum | 158529.9 |
---|---|
Maximum | 248758.93 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 158529.9 |
---|---|
5-th percentile | 166978.51 |
Q1 | 184043.9 |
median | 202469.05 |
Q3 | 210950.76 |
95-th percentile | 241953.8 |
Maximum | 248758.93 |
Range | 90229.032 |
Interquartile range (IQR) | 26906.866 |
Descriptive statistics
Standard deviation | 20985.873 |
---|---|
Coefficient of variation (CV) | 0.10485712 |
Kurtosis | -0.48169571 |
Mean | 200137.8 |
Median Absolute Deviation (MAD) | 15117.515 |
Skewness | 0.29653354 |
Sum | 1.1367827 × 108 |
Variance | 4.4040686 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
206953.271798898 | 3 | < 0.1% |
208320.773270613 | 3 | < 0.1% |
204989.468555169 | 2 | < 0.1% |
205376.075847502 | 2 | < 0.1% |
209555.742274039 | 2 | < 0.1% |
203914.161110681 | 2 | < 0.1% |
172128.333251763 | 2 | < 0.1% |
197619.003545834 | 2 | < 0.1% |
198049.964400548 | 2 | < 0.1% |
175471.691562495 | 2 | < 0.1% |
Other values (508) | 546 | 5.5% |
(Missing) | 9432 |
Value | Count | Frequency (%) |
158529.902137014 | 1 | |
160073.337441912 | 2 | |
160285.557245716 | 1 | |
160617.867680121 | 2 | |
160675.575212779 | 1 | |
161893.250712491 | 1 | |
162944.463230413 | 1 | |
163269.819119279 | 1 | |
164172.739483319 | 1 | |
164214.507122961 | 1 |
Value | Count | Frequency (%) |
248758.933943455 | 1 | |
248260.454500232 | 1 | |
247822.828445397 | 1 | |
246514.51695601 | 1 | |
246073.966899104 | 1 | |
245923.165371386 | 1 | |
245748.034805118 | 1 | |
245278.796427534 | 1 | |
245224.951984457 | 1 | |
245016.247594061 | 1 |
Y좌표값
Real number (ℝ)
MISSING
 
Distinct | 518 |
---|---|
Distinct (%) | 91.2% |
Missing | 9432 |
Missing (%) | 94.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 450525.43 |
Minimum | 394238.02 |
---|---|
Maximum | 505790.49 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 394238.02 |
---|---|
5-th percentile | 408174.29 |
Q1 | 433585.47 |
median | 459903.15 |
Q3 | 466919.17 |
95-th percentile | 479433.6 |
Maximum | 505790.49 |
Range | 111552.47 |
Interquartile range (IQR) | 33333.697 |
Descriptive statistics
Standard deviation | 24051.628 |
---|---|
Coefficient of variation (CV) | 0.053385728 |
Kurtosis | -0.59253095 |
Mean | 450525.43 |
Median Absolute Deviation (MAD) | 9687.8893 |
Skewness | -0.67377123 |
Sum | 2.5589844 × 108 |
Variance | 5.784808 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
412679.702211695 | 3 | < 0.1% |
406869.617158721 | 3 | < 0.1% |
412292.831983603 | 2 | < 0.1% |
411861.905551312 | 2 | < 0.1% |
408766.503953602 | 2 | < 0.1% |
469339.759820934 | 2 | < 0.1% |
461262.941306652 | 2 | < 0.1% |
430753.794512672 | 2 | < 0.1% |
434683.369863805 | 2 | < 0.1% |
457750.555549167 | 2 | < 0.1% |
Other values (508) | 546 | 5.5% |
(Missing) | 9432 |
Value | Count | Frequency (%) |
394238.019163015 | 1 | |
397543.51793425 | 1 | |
397598.627891452 | 1 | |
397646.937377135 | 1 | |
398166.523340534 | 1 | |
398247.172626513 | 1 | |
399522.497859014 | 1 | |
400379.944425591 | 1 | |
400689.52302517 | 1 | |
403165.019234282 | 1 |
Value | Count | Frequency (%) |
505790.48767371 | 1 | |
503103.512139261 | 2 | |
502827.430494496 | 1 | |
502755.806654713 | 1 | |
501338.618428324 | 1 | |
498852.373327364 | 1 | |
494007.96624481 | 1 | |
492662.316413614 | 1 | |
491678.04678242 | 1 | |
488380.74573579 | 1 |
업소구분명정보
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
지정 | 536 |
종료 | 54 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.882 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9410 | |
지정 | 536 | 5.4% |
종료 | 54 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9410 | |
지정 | 536 | 5.4% |
종료 | 54 | 0.5% |
소재지주소
Text
MISSING
 
Distinct | 500 |
---|---|
Distinct (%) | 96.3% |
Missing | 9481 |
Missing (%) | 94.8% |
Memory size | 156.2 KiB |
Length
Max length | 55 |
---|---|
Median length | 43 |
Mean length | 24.271676 |
Min length | 11 |
Characters and Unicode
Total characters | 12597 |
---|---|
Distinct characters | 337 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 482 ? |
---|---|
Unique (%) | 92.9% |
Sample
1st row | 경기도 고양시 일산동구 정발산동 1224-4 |
---|---|
2nd row | 경기도 남양주시 다산동 691-1 1층 |
3rd row | 경기도 남양주시 와부읍 덕소리 195-1 KT덕소지점 |
4th row | 경기도 화성시 송산면 지화리 685-4 |
5th row | 경기도 고양시 덕양구 강매동 260 |
Value | Count | Frequency (%) |
경기도 | 516 | 18.3% |
화성시 | 114 | 4.0% |
고양시 | 92 | 3.3% |
김포시 | 77 | 2.7% |
남양주시 | 74 | 2.6% |
가평군 | 66 | 2.3% |
덕양구 | 48 | 1.7% |
일산동구 | 44 | 1.6% |
광명시 | 25 | 0.9% |
의정부시 | 21 | 0.7% |
Other values (1020) | 1742 |
Most occurring characters
Value | Count | Frequency (%) |
2300 | 18.3% | |
도 | 562 | 4.5% |
기 | 525 | 4.2% |
경 | 517 | 4.1% |
1 | 510 | 4.0% |
시 | 476 | 3.8% |
동 | 450 | 3.6% |
- | 293 | 2.3% |
2 | 268 | 2.1% |
양 | 258 | 2.0% |
Other values (327) | 6438 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7588 | |
Decimal Number | 2333 | 18.5% |
Space Separator | 2300 | 18.3% |
Dash Punctuation | 293 | 2.3% |
Uppercase Letter | 57 | 0.5% |
Other Punctuation | 21 | 0.2% |
Math Symbol | 3 | < 0.1% |
Letter Number | 1 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
도 | 562 | 7.4% |
기 | 525 | 6.9% |
경 | 517 | 6.8% |
시 | 476 | 6.3% |
동 | 450 | 5.9% |
양 | 258 | 3.4% |
리 | 204 | 2.7% |
지 | 155 | 2.0% |
화 | 137 | 1.8% |
성 | 129 | 1.7% |
Other values (293) | 4175 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 11 | |
D | 7 | |
M | 6 | |
C | 6 | |
S | 5 | |
A | 4 | 7.0% |
B | 4 | 7.0% |
K | 2 | 3.5% |
T | 2 | 3.5% |
R | 2 | 3.5% |
Other values (7) | 8 |
Decimal Number
Value | Count | Frequency (%) |
1 | 510 | |
2 | 268 | |
0 | 242 | |
3 | 220 | |
6 | 218 | |
4 | 210 | |
5 | 192 | 8.2% |
7 | 183 | 7.8% |
8 | 155 | 6.6% |
9 | 135 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 15 | |
. | 6 | 28.6% |
Space Separator
Value | Count | Frequency (%) |
2300 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 293 |
Math Symbol
Value | Count | Frequency (%) |
~ | 3 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7588 | |
Common | 4950 | |
Latin | 59 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
도 | 562 | 7.4% |
기 | 525 | 6.9% |
경 | 517 | 6.8% |
시 | 476 | 6.3% |
동 | 450 | 5.9% |
양 | 258 | 3.4% |
리 | 204 | 2.7% |
지 | 155 | 2.0% |
화 | 137 | 1.8% |
성 | 129 | 1.7% |
Other values (293) | 4175 |
Latin
Value | Count | Frequency (%) |
I | 11 | |
D | 7 | |
M | 6 | |
C | 6 | |
S | 5 | |
A | 4 | 6.8% |
B | 4 | 6.8% |
K | 2 | 3.4% |
T | 2 | 3.4% |
R | 2 | 3.4% |
Other values (9) | 10 |
Common
Value | Count | Frequency (%) |
2300 | ||
1 | 510 | 10.3% |
- | 293 | 5.9% |
2 | 268 | 5.4% |
0 | 242 | 4.9% |
3 | 220 | 4.4% |
6 | 218 | 4.4% |
4 | 210 | 4.2% |
5 | 192 | 3.9% |
7 | 183 | 3.7% |
Other values (5) | 314 | 6.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7588 | |
ASCII | 5008 | |
Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2300 | ||
1 | 510 | 10.2% |
- | 293 | 5.9% |
2 | 268 | 5.4% |
0 | 242 | 4.8% |
3 | 220 | 4.4% |
6 | 218 | 4.4% |
4 | 210 | 4.2% |
5 | 192 | 3.8% |
7 | 183 | 3.7% |
Other values (23) | 372 | 7.4% |
Hangul
Value | Count | Frequency (%) |
도 | 562 | 7.4% |
기 | 525 | 6.9% |
경 | 517 | 6.8% |
시 | 476 | 6.3% |
동 | 450 | 5.9% |
양 | 258 | 3.4% |
리 | 204 | 2.7% |
지 | 155 | 2.0% |
화 | 137 | 1.8% |
성 | 129 | 1.7% |
Other values (293) | 4175 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 1 |
신청일자
Real number (ℝ)
MISSING
  SKEWED
 
Distinct | 3337 |
---|---|
Distinct (%) | 46.5% |
Missing | 2824 |
Missing (%) | 28.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20051002 |
Minimum | 1996 |
---|---|
Maximum | 22020731 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1996 |
---|---|
5-th percentile | 19941219 |
Q1 | 20000401 |
median | 20070102 |
Q3 | 20140225 |
95-th percentile | 20211116 |
Maximum | 22020731 |
Range | 22018735 |
Interquartile range (IQR) | 139824 |
Descriptive statistics
Standard deviation | 574628.06 |
---|---|
Coefficient of variation (CV) | 0.028658321 |
Kurtosis | 1142.3223 |
Mean | 20051002 |
Median Absolute Deviation (MAD) | 69800 |
Skewness | -33.401123 |
Sum | 1.4388599 × 1011 |
Variance | 3.3019741 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19950101 | 263 | 2.6% |
19941201 | 219 | 2.2% |
19941220 | 190 | 1.9% |
19990101 | 172 | 1.7% |
20020731 | 76 | 0.8% |
20020802 | 64 | 0.6% |
19941219 | 60 | 0.6% |
19941102 | 55 | 0.5% |
20000401 | 45 | 0.4% |
20020805 | 41 | 0.4% |
Other values (3327) | 5991 | |
(Missing) | 2824 |
Value | Count | Frequency (%) |
1996 | 2 | |
199810 | 2 | |
199906 | 1 | |
1999010 | 1 | |
19010725 | 1 | |
19940103 | 1 | |
19940310 | 1 | |
19940612 | 1 | |
19940802 | 1 | |
19941020 | 1 |
Value | Count | Frequency (%) |
22020731 | 1 | < 0.1% |
20231205 | 2 | < 0.1% |
20231201 | 1 | < 0.1% |
20231130 | 1 | < 0.1% |
20231128 | 2 | < 0.1% |
20231124 | 5 | |
20231123 | 2 | < 0.1% |
20231122 | 1 | < 0.1% |
20231121 | 2 | < 0.1% |
20231120 | 1 | < 0.1% |
항목값정보
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
관급봉투 | 590 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9410 | |
관급봉투 | 590 | 5.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9410 | |
관급봉투 | 590 | 5.9% |
시군명 | 사업장명 | 인허가일자 | 인허가취소일자 | 영업상태구분코드 | 영업상태명 | 폐업일자 | 소재지시설전화번호 | 소재지면적정보 | 도로명우편번호 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | 업태구분명정보 | X좌표값 | Y좌표값 | 업소구분명정보 | 소재지주소 | 신청일자 | 항목값정보 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
5648 | 남양주시 | 현대화마트 | 19990512 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 남양주시 오남읍 양지리 95-3 번지 ,4 | 12036 | 37.697653 | 127.204231 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7638 | 안양시 | 영수슈퍼 | <NA> | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 안양시 동안구 관양동 1463-1 번지 | 431062 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 19941201 | <NA> |
4894 | 김포시 | 이마트24(김포풍년마을점) | 20170801 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 김포시 풍년로 9, 1층 104호 (사우동, 풍년마을삼보아파트 상가) | 경기도 김포시 사우동 856번지 풍년마을삼보아파트 상가 1층 104호 | 10111 | 37.624909 | 126.724383 | <NA> | <NA> | <NA> | <NA> | <NA> | 20170726 | <NA> |
11335 | 의왕시 | 풍성슈퍼 | 19990524 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 의왕시 삼동 150-10번지 | 16095 | 37.317184 | 126.950643 | <NA> | <NA> | <NA> | <NA> | <NA> | 19990524 | <NA> |
5873 | 동두천시 | 한우리슈퍼 | 20040910 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 동두천시 생연동 790-2 번지 | 483032 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
10689 | 용인시 | LG마트 | 20040117 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 용인시 수지구 신봉2로14번길 8, 106호 (신봉동,백산빌딩 가동 1층) | 경기도 용인시 수지구 신봉동 43번지 백산빌딩 가동 1층 106호 | 449150 | 37.323857 | 127.078724 | <NA> | <NA> | <NA> | <NA> | <NA> | 20040116 | <NA> |
1691 | 고양시 | 나이스 데이 | 20001122 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 고양시 일산서구 성저로 47 (대화동,성저마을) | 경기도 고양시 일산서구 대화동 2081번지 성저마을 | 411410 | 37.684187 | 126.752051 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
11704 | 의정부시 | 이마트24 녹양누리점 | 20170913 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 의정부시 본원로46번길 21 (녹양동) | 경기도 의정부시 녹양동 403-11번지 | 11605 | 37.762962 | 127.040585 | <NA> | <NA> | <NA> | <NA> | <NA> | 20170913 | <NA> |
7227 | 안성시 | 이천슈퍼 | 20020731 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 안성시 봉산동 33번지 | 456030 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 20020731 | <NA> |
6648 | 안산시 | 한아름슈퍼 | 19980724 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 안산시 상록구 본오로 66 (본오동) | 경기도 안산시 상록구 본오동 854번지 | 426180 | 37.290894 | 126.86657 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
시군명 | 사업장명 | 인허가일자 | 인허가취소일자 | 영업상태구분코드 | 영업상태명 | 폐업일자 | 소재지시설전화번호 | 소재지면적정보 | 도로명우편번호 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | 업태구분명정보 | X좌표값 | Y좌표값 | 업소구분명정보 | 소재지주소 | 신청일자 | 항목값정보 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
14778 | 평택시 | 가람수퍼 | 19970530 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 평택시 지산동 835-35번지 | 459110 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3967 | 군포시 | GS25군포번영점 | 20070508 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 군포시 금산로 1 | 경기도 군포시 금정동 722-7 아산나부빌 상가동 101 | 15827 | 37.362767 | 126.941051 | <NA> | <NA> | <NA> | <NA> | <NA> | 20070508 | <NA> |
11511 | 의왕시 | 판다팜 | 20180810 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 의왕시 부곡시장길 26-3, 아울렛D.C마트 (삼동) | 경기도 의왕시 삼동 166-32번지 아울렛D.C마트 | 16095 | 37.318758 | 126.951783 | <NA> | <NA> | <NA> | <NA> | <NA> | 20180810 | <NA> |
4422 | 김포시 | (주)GS수퍼 김포감정점 | 20101126 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 김포시 감정동 676번지 | 415010 | 37.626466 | 126.699821 | <NA> | <NA> | <NA> | <NA> | <NA> | 20101126 | <NA> |
12474 | 의정부시 | 성공마트 | 20080506 | <NA> | <NA> | 폐업 등 | 20180124 | <NA> | <NA> | <NA> | 경기도 의정부시 평화로 220 (호원동,브랜드상설매장 1층 112호) | 경기도 의정부시 호원동 455-3번지 브랜드상설매장 1층 112호 | 480856 | 37.711579 | 127.04805 | <NA> | <NA> | <NA> | <NA> | <NA> | 20080506 | <NA> |
5387 | 김포시 | 자연드림김포생협(장기점) | 20130531 | <NA> | <NA> | 폐업 등 | 20160629 | <NA> | <NA> | <NA> | 경기도 김포시 김포한강4로 118, 105호 (장기동) | 경기도 김포시 장기동 1851번지 | 415060 | 37.644709 | 126.668522 | <NA> | <NA> | <NA> | <NA> | <NA> | 20140423 | <NA> |
12791 | 의정부시 | 위드미 가능흥선로점 | 20161214 | <NA> | <NA> | 폐업 등 | 20180126 | <NA> | <NA> | <NA> | 경기도 의정부시 가능로7번길 19, 1층 (가능동) | 경기도 의정부시 가능동 687-7번지 1층 | 11675 | 37.747233 | 127.031148 | <NA> | <NA> | <NA> | <NA> | <NA> | 20161214 | <NA> |
5658 | 남양주시 | 오남상회 | 19990728 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 남양주시 오남읍 오남리 732-1 번지 | 12041 | 37.695144 | 127.206498 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3095 | 광명시 | 흥부철물그릇 | <NA> | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | <NA> | 경기도 광명시 하안동 204번지 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7151 | 안성시 | 무 | 20020805 | <NA> | <NA> | 운영중 | <NA> | <NA> | <NA> | <NA> | 경기도 안성시 고삼면 고삼호수로 360 | 경기도 안성시 고삼면 쌍지리 932-2번지 | 456921 | 37.097463 | 127.289832 | <NA> | <NA> | <NA> | <NA> | <NA> | 20020805 | <NA> |