Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 26334 |
Missing cells (%) | 37.6% |
Duplicate rows | 1130 |
Duplicate rows (%) | 11.3% |
Total size in memory | 625.0 KiB |
Average record size in memory | 64.0 B |
Variable types
Text | 5 |
---|---|
Categorical | 1 |
Boolean | 1 |
Dataset
Description | 사업번호,업체분류명,업체명,주소,전화번호,홈페이지주소,계약해지여부 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-2255/S/1/datasetView.do |
Reproduction
Analysis started | 2024-03-13 07:43:30.061327 |
---|---|
Analysis finished | 2024-03-13 07:43:30.945146 |
Duration | 0.88 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사업번호
Text
Distinct | 563 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Characters and Unicode
Total characters | 150000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 53 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | 11230-900000737 |
---|---|
2nd row | 11560-100003012 |
3rd row | 11380-100001057 |
4th row | 11650-100002018 |
5th row | 11680-900000942 |
Value | Count | Frequency (%) |
11710-100002008 | 136 | 1.4% |
11230-100006043 | 114 | 1.1% |
11290-100016009 | 101 | 1.0% |
11380-100001045 | 92 | 0.9% |
11290-100015000 | 85 | 0.9% |
11110-100003002 | 84 | 0.8% |
11620-100004002 | 84 | 0.8% |
11650-900000161 | 80 | 0.8% |
11740-100000049 | 76 | 0.8% |
11590-100002006 | 76 | 0.8% |
Other values (553) | 9072 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 66200 | |
1 | 35498 | |
- | 10000 | 6.7% |
2 | 6815 | 4.5% |
9 | 5312 | 3.5% |
6 | 5030 | 3.4% |
5 | 4965 | 3.3% |
3 | 4861 | 3.2% |
4 | 4742 | 3.2% |
7 | 3681 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 140000 | |
Dash Punctuation | 10000 | 6.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 66200 | |
1 | 35498 | |
2 | 6815 | 4.9% |
9 | 5312 | 3.8% |
6 | 5030 | 3.6% |
5 | 4965 | 3.5% |
3 | 4861 | 3.5% |
4 | 4742 | 3.4% |
7 | 3681 | 2.6% |
8 | 2896 | 2.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 150000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 66200 | |
1 | 35498 | |
- | 10000 | 6.7% |
2 | 6815 | 4.5% |
9 | 5312 | 3.5% |
6 | 5030 | 3.4% |
5 | 4965 | 3.3% |
3 | 4861 | 3.2% |
4 | 4742 | 3.2% |
7 | 3681 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 150000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 66200 | |
1 | 35498 | |
- | 10000 | 6.7% |
2 | 6815 | 4.5% |
9 | 5312 | 3.5% |
6 | 5030 | 3.4% |
5 | 4965 | 3.3% |
3 | 4861 | 3.2% |
4 | 4742 | 3.2% |
7 | 3681 | 2.5% |
업체분류명
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
기타 | |
---|---|
설계자 | 491 |
정비업자 | 400 |
시공자 | 251 |
철거업자 | 63 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.1668 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 설계자 |
---|---|
2nd row | 기타 |
3rd row | 기타 |
4th row | 기타 |
5th row | 기타 |
Common Values
Value | Count | Frequency (%) |
기타 | 8795 | |
설계자 | 491 | 4.9% |
정비업자 | 400 | 4.0% |
시공자 | 251 | 2.5% |
철거업자 | 63 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
기타 | 8795 | |
설계자 | 491 | 4.9% |
정비업자 | 400 | 4.0% |
시공자 | 251 | 2.5% |
철거업자 | 63 | 0.6% |
업체명
Text
Distinct | 4269 |
---|---|
Distinct (%) | 42.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 35 |
---|---|
Median length | 21 |
Mean length | 9.2611 |
Min length | 2 |
Characters and Unicode
Total characters | 92611 |
---|---|
Distinct characters | 587 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 2753 ? |
---|---|
Unique (%) | 27.5% |
Sample
1st row | (주)그룹환경종합건축사사무소 |
---|---|
2nd row | (주)개신건설 |
3rd row | (주) 나라감정평가법인 |
4th row | 법무법인엘케이비앤파트너스 |
5th row | (주)주성 시.엠.시 |
Value | Count | Frequency (%) |
주식회사 | 957 | 7.3% |
법무법인 | 902 | 6.9% |
주 | 227 | 1.7% |
법률사무소 | 176 | 1.3% |
을지 | 85 | 0.6% |
법무법인(유한 | 76 | 0.6% |
산하 | 75 | 0.6% |
정비 | 74 | 0.6% |
법무법인산하 | 68 | 0.5% |
법무법인을지 | 58 | 0.4% |
Other values (4028) | 10432 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 5689 | 6.1% |
법 | 4914 | 5.3% |
사 | 4716 | 5.1% |
) | 4651 | 5.0% |
( | 4640 | 5.0% |
무 | 3676 | 4.0% |
3226 | 3.5% | |
인 | 2995 | 3.2% |
이 | 2026 | 2.2% |
건 | 1868 | 2.0% |
Other values (577) | 54210 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 79555 | |
Close Punctuation | 4657 | 5.0% |
Open Punctuation | 4646 | 5.0% |
Space Separator | 3226 | 3.5% |
Uppercase Letter | 340 | 0.4% |
Lowercase Letter | 86 | 0.1% |
Other Punctuation | 65 | 0.1% |
Decimal Number | 35 | < 0.1% |
Other Symbol | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 5689 | 7.2% |
법 | 4914 | 6.2% |
사 | 4716 | 5.9% |
무 | 3676 | 4.6% |
인 | 2995 | 3.8% |
이 | 2026 | 2.5% |
건 | 1868 | 2.3% |
소 | 1754 | 2.2% |
지 | 1695 | 2.1% |
회 | 1624 | 2.0% |
Other values (521) | 48598 |
Uppercase Letter
Value | Count | Frequency (%) |
H | 49 | |
P | 47 | |
C | 43 | |
S | 40 | |
G | 28 | |
M | 18 | 5.3% |
K | 16 | 4.7% |
N | 15 | 4.4% |
T | 15 | 4.4% |
D | 13 | 3.8% |
Other values (13) | 56 |
Lowercase Letter
Value | Count | Frequency (%) |
s | 13 | |
o | 10 | |
n | 8 | |
t | 7 | |
c | 6 | 7.0% |
d | 6 | 7.0% |
r | 5 | 5.8% |
i | 5 | 5.8% |
e | 5 | 5.8% |
h | 4 | 4.7% |
Other values (7) | 17 |
Decimal Number
Value | Count | Frequency (%) |
1 | 13 | |
2 | 12 | |
0 | 3 | 8.6% |
5 | 3 | 8.6% |
6 | 2 | 5.7% |
3 | 2 | 5.7% |
Other Punctuation
Value | Count | Frequency (%) |
. | 51 | |
, | 7 | 10.8% |
& | 6 | 9.2% |
' | 1 | 1.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 4651 | |
) | 6 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 4640 | |
( | 6 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
3226 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 79556 | |
Common | 12629 | 13.6% |
Latin | 426 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 5689 | 7.2% |
법 | 4914 | 6.2% |
사 | 4716 | 5.9% |
무 | 3676 | 4.6% |
인 | 2995 | 3.8% |
이 | 2026 | 2.5% |
건 | 1868 | 2.3% |
소 | 1754 | 2.2% |
지 | 1695 | 2.1% |
회 | 1624 | 2.0% |
Other values (522) | 48599 |
Latin
Value | Count | Frequency (%) |
H | 49 | 11.5% |
P | 47 | 11.0% |
C | 43 | 10.1% |
S | 40 | 9.4% |
G | 28 | 6.6% |
M | 18 | 4.2% |
K | 16 | 3.8% |
N | 15 | 3.5% |
T | 15 | 3.5% |
D | 13 | 3.1% |
Other values (30) | 142 |
Common
Value | Count | Frequency (%) |
) | 4651 | |
( | 4640 | |
3226 | ||
. | 51 | 0.4% |
1 | 13 | 0.1% |
2 | 12 | 0.1% |
, | 7 | 0.1% |
) | 6 | < 0.1% |
( | 6 | < 0.1% |
& | 6 | < 0.1% |
Other values (5) | 11 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 79555 | |
ASCII | 13043 | 14.1% |
None | 13 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 5689 | 7.2% |
법 | 4914 | 6.2% |
사 | 4716 | 5.9% |
무 | 3676 | 4.6% |
인 | 2995 | 3.8% |
이 | 2026 | 2.5% |
건 | 1868 | 2.3% |
소 | 1754 | 2.2% |
지 | 1695 | 2.1% |
회 | 1624 | 2.0% |
Other values (521) | 48598 |
ASCII
Value | Count | Frequency (%) |
) | 4651 | |
( | 4640 | |
3226 | ||
. | 51 | 0.4% |
H | 49 | 0.4% |
P | 47 | 0.4% |
C | 43 | 0.3% |
S | 40 | 0.3% |
G | 28 | 0.2% |
M | 18 | 0.1% |
Other values (43) | 250 | 1.9% |
None
Value | Count | Frequency (%) |
) | 6 | |
( | 6 | |
㈜ | 1 | 7.7% |
주소
Text
MISSING
 
Distinct | 794 |
---|---|
Distinct (%) | 46.5% |
Missing | 8292 |
Missing (%) | 82.9% |
Memory size | 156.2 KiB |
Length
Max length | 48 |
---|---|
Median length | 41 |
Mean length | 27.883489 |
Min length | 20 |
Characters and Unicode
Total characters | 47625 |
---|---|
Distinct characters | 430 |
Distinct categories | 12 ? |
Distinct scripts | 4 ? |
Distinct blocks | 6 ? |
Unique
Unique | 495 ? |
---|---|
Unique (%) | 29.0% |
Sample
1st row | 서울특별시 강남구 봉은사로 179 (논현동,H-TOWER) |
---|---|
2nd row | 경기도 안양시 만안구 만안로 49 (안양동) |
3rd row | 서울특별시 서초구 서초대로8길 62 (방배동) |
4th row | 인천광역시 계양구 서운로 33-8 (서운동) |
5th row | 서울특별시 서초구 서초대로 253 (서초동,지천빌딩) |
Value | Count | Frequency (%) |
서울특별시 | 1490 | 16.9% |
서초구 | 475 | 5.4% |
강남구 | 297 | 3.4% |
송파구 | 181 | 2.0% |
경기도 | 164 | 1.9% |
서초동 | 104 | 1.2% |
마포구 | 94 | 1.1% |
문정동 | 90 | 1.0% |
서초대로 | 86 | 1.0% |
역삼동 | 82 | 0.9% |
Other values (1582) | 5776 |
Most occurring characters
Value | Count | Frequency (%) |
7141 | 15.0% | |
서 | 2675 | 5.6% |
동 | 1992 | 4.2% |
로 | 1864 | 3.9% |
구 | 1731 | 3.6% |
시 | 1725 | 3.6% |
) | 1724 | 3.6% |
( | 1724 | 3.6% |
울 | 1531 | 3.2% |
별 | 1498 | 3.1% |
Other values (420) | 24020 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30021 | |
Space Separator | 7141 | 15.0% |
Decimal Number | 5800 | 12.2% |
Close Punctuation | 1724 | 3.6% |
Open Punctuation | 1724 | 3.6% |
Other Punctuation | 753 | 1.6% |
Uppercase Letter | 264 | 0.6% |
Dash Punctuation | 171 | 0.4% |
Letter Number | 10 | < 0.1% |
Lowercase Letter | 9 | < 0.1% |
Other values (2) | 8 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 2675 | 8.9% |
동 | 1992 | 6.6% |
로 | 1864 | 6.2% |
구 | 1731 | 5.8% |
시 | 1725 | 5.7% |
울 | 1531 | 5.1% |
별 | 1498 | 5.0% |
특 | 1496 | 5.0% |
초 | 1045 | 3.5% |
길 | 729 | 2.4% |
Other values (371) | 13735 |
Uppercase Letter
Value | Count | Frequency (%) |
R | 39 | |
I | 33 | |
T | 24 | |
E | 20 | 7.6% |
S | 17 | 6.4% |
O | 17 | 6.4% |
W | 17 | 6.4% |
A | 16 | 6.1% |
K | 15 | 5.7% |
D | 13 | 4.9% |
Other values (12) | 53 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1253 | |
2 | 927 | |
3 | 640 | |
4 | 562 | |
6 | 545 | |
5 | 505 | |
0 | 391 | 6.7% |
7 | 379 | 6.5% |
8 | 337 | 5.8% |
9 | 261 | 4.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 741 | |
& | 5 | 0.7% |
. | 4 | 0.5% |
/ | 2 | 0.3% |
, | 1 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
f | 2 | |
e | 2 | |
v | 2 | |
i | 2 | |
n | 1 |
Space Separator
Value | Count | Frequency (%) |
7141 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1724 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1724 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 171 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 10 |
Other Symbol
Value | Count | Frequency (%) |
ⓝ | 6 |
Math Symbol
Value | Count | Frequency (%) |
~ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30013 | |
Common | 17321 | |
Latin | 283 | 0.6% |
Han | 8 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 2675 | 8.9% |
동 | 1992 | 6.6% |
로 | 1864 | 6.2% |
구 | 1731 | 5.8% |
시 | 1725 | 5.7% |
울 | 1531 | 5.1% |
별 | 1498 | 5.0% |
특 | 1496 | 5.0% |
초 | 1045 | 3.5% |
길 | 729 | 2.4% |
Other values (369) | 13727 |
Latin
Value | Count | Frequency (%) |
R | 39 | |
I | 33 | |
T | 24 | 8.5% |
E | 20 | 7.1% |
S | 17 | 6.0% |
O | 17 | 6.0% |
W | 17 | 6.0% |
A | 16 | 5.7% |
K | 15 | 5.3% |
D | 13 | 4.6% |
Other values (18) | 72 |
Common
Value | Count | Frequency (%) |
7141 | ||
) | 1724 | 10.0% |
( | 1724 | 10.0% |
1 | 1253 | 7.2% |
2 | 927 | 5.4% |
, | 741 | 4.3% |
3 | 640 | 3.7% |
4 | 562 | 3.2% |
6 | 545 | 3.1% |
5 | 505 | 2.9% |
Other values (11) | 1559 | 9.0% |
Han
Value | Count | Frequency (%) |
成 | 4 | |
寶 | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30013 | |
ASCII | 17587 | |
Number Forms | 10 | < 0.1% |
CJK | 8 | < 0.1% |
Enclosed Alphanum | 6 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
7141 | ||
) | 1724 | 9.8% |
( | 1724 | 9.8% |
1 | 1253 | 7.1% |
2 | 927 | 5.3% |
, | 741 | 4.2% |
3 | 640 | 3.6% |
4 | 562 | 3.2% |
6 | 545 | 3.1% |
5 | 505 | 2.9% |
Other values (36) | 1825 | 10.4% |
Hangul
Value | Count | Frequency (%) |
서 | 2675 | 8.9% |
동 | 1992 | 6.6% |
로 | 1864 | 6.2% |
구 | 1731 | 5.8% |
시 | 1725 | 5.7% |
울 | 1531 | 5.1% |
별 | 1498 | 5.0% |
특 | 1496 | 5.0% |
초 | 1045 | 3.5% |
길 | 729 | 2.4% |
Other values (369) | 13727 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 10 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓝ | 6 |
CJK
Value | Count | Frequency (%) |
成 | 4 | |
寶 | 4 |
None
Value | Count | Frequency (%) |
, | 1 |
전화번호
Text
MISSING
 
Distinct | 808 |
---|---|
Distinct (%) | 52.2% |
Missing | 8451 |
Missing (%) | 84.5% |
Memory size | 156.2 KiB |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 11.342156 |
Min length | 9 |
Characters and Unicode
Total characters | 17569 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 541 ? |
---|---|
Unique (%) | 34.9% |
Sample
1st row | 02-518-8818 |
---|---|
2nd row | 031-6909-5912 |
3rd row | 02-521-1122 |
4th row | 02-536-5805 |
5th row | 02-525-2733 |
Value | Count | Frequency (%) |
02-537-3322 | 38 | 2.5% |
02-2055-1919 | 33 | 2.1% |
02-455-5503 | 18 | 1.2% |
02-536-5805 | 17 | 1.1% |
02-584-2581 | 17 | 1.1% |
02-587-2130 | 16 | 1.0% |
02-533-8505 | 13 | 0.8% |
02-3019-1200 | 13 | 0.8% |
02-6346-2287 | 13 | 0.8% |
02-599-5333 | 13 | 0.8% |
Other values (798) | 1358 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3046 | |
- | 2893 | |
2 | 2711 | |
3 | 1523 | |
5 | 1513 | |
1 | 1289 | |
4 | 1075 | 6.1% |
7 | 971 | 5.5% |
8 | 877 | 5.0% |
9 | 844 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 14676 | |
Dash Punctuation | 2893 | 16.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3046 | |
2 | 2711 | |
3 | 1523 | |
5 | 1513 | |
1 | 1289 | |
4 | 1075 | 7.3% |
7 | 971 | 6.6% |
8 | 877 | 6.0% |
9 | 844 | 5.8% |
6 | 827 | 5.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2893 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 17569 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3046 | |
- | 2893 | |
2 | 2711 | |
3 | 1523 | |
5 | 1513 | |
1 | 1289 | |
4 | 1075 | 6.1% |
7 | 971 | 5.5% |
8 | 877 | 5.0% |
9 | 844 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 17569 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3046 | |
- | 2893 | |
2 | 2711 | |
3 | 1523 | |
5 | 1513 | |
1 | 1289 | |
4 | 1075 | 6.1% |
7 | 971 | 5.5% |
8 | 877 | 5.0% |
9 | 844 | 4.8% |
홈페이지주소
Text
MISSING
 
Distinct | 252 |
---|---|
Distinct (%) | 61.6% |
Missing | 9591 |
Missing (%) | 95.9% |
Memory size | 156.2 KiB |
Length
Max length | 63 |
---|---|
Median length | 41 |
Mean length | 23.848411 |
Min length | 16 |
Characters and Unicode
Total characters | 9754 |
---|---|
Distinct characters | 54 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 180 ? |
---|---|
Unique (%) | 44.0% |
Sample
1st row | http://www.eagar.co.kr |
---|---|
2nd row | http://www.grouphan.co.kr |
3rd row | http://www.msapp.co.kr/ |
4th row | http://www.donginlaw.co.kr |
5th row | http://www.sanhalaw.co.kr/ |
Value | Count | Frequency (%) |
http://www.ulchi.co.kr | 14 | 3.4% |
http://www.sanhalaw.co.kr | 12 | 2.9% |
http://www.jblaw.kr | 12 | 2.9% |
http://www.dh2002.co.kr | 7 | 1.7% |
http://www.leeko.com:8080 | 6 | 1.5% |
http://nksbar@gmail.com | 6 | 1.5% |
http://www.eagar.co.kr | 6 | 1.5% |
http://www.bkl.co.kr | 6 | 1.5% |
http://www.eagroup.co.kr | 5 | 1.2% |
http://www.cmeia.co.kr | 5 | 1.2% |
Other values (222) | 330 |
Most occurring characters
Value | Count | Frequency (%) |
w | 1090 | |
. | 1039 | 10.7% |
/ | 1028 | 10.5% |
t | 905 | 9.3% |
o | 560 | 5.7% |
h | 554 | 5.7% |
c | 547 | 5.6% |
p | 504 | 5.2% |
: | 415 | 4.3% |
r | 401 | 4.1% |
Other values (44) | 2711 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 7062 | |
Other Punctuation | 2507 | 25.7% |
Decimal Number | 145 | 1.5% |
Dash Punctuation | 20 | 0.2% |
Uppercase Letter | 14 | 0.1% |
Math Symbol | 4 | < 0.1% |
Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
w | 1090 | |
t | 905 | |
o | 560 | 7.9% |
h | 554 | 7.8% |
c | 547 | 7.7% |
p | 504 | 7.1% |
r | 401 | 5.7% |
a | 380 | 5.4% |
k | 363 | 5.1% |
n | 280 | 4.0% |
Other values (15) | 1478 |
Decimal Number
Value | Count | Frequency (%) |
2 | 38 | |
0 | 36 | |
8 | 18 | |
1 | 15 | 10.3% |
4 | 10 | 6.9% |
3 | 9 | 6.2% |
5 | 8 | 5.5% |
7 | 8 | 5.5% |
9 | 2 | 1.4% |
6 | 1 | 0.7% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1039 | |
/ | 1028 | |
: | 415 | 16.6% |
@ | 15 | 0.6% |
# | 4 | 0.2% |
? | 4 | 0.2% |
& | 1 | < 0.1% |
? | 1 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
R | 3 | |
W | 3 | |
K | 2 | |
A | 2 | |
L | 1 | 7.1% |
N | 1 | 7.1% |
I | 1 | 7.1% |
C | 1 | 7.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 20 |
Math Symbol
Value | Count | Frequency (%) |
= | 4 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 7076 | |
Common | 2678 | 27.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
w | 1090 | |
t | 905 | |
o | 560 | 7.9% |
h | 554 | 7.8% |
c | 547 | 7.7% |
p | 504 | 7.1% |
r | 401 | 5.7% |
a | 380 | 5.4% |
k | 363 | 5.1% |
n | 280 | 4.0% |
Other values (23) | 1492 |
Common
Value | Count | Frequency (%) |
. | 1039 | |
/ | 1028 | |
: | 415 | 15.5% |
2 | 38 | 1.4% |
0 | 36 | 1.3% |
- | 20 | 0.7% |
8 | 18 | 0.7% |
@ | 15 | 0.6% |
1 | 15 | 0.6% |
4 | 10 | 0.4% |
Other values (11) | 44 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9753 | |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
w | 1090 | |
. | 1039 | 10.7% |
/ | 1028 | 10.5% |
t | 905 | 9.3% |
o | 560 | 5.7% |
h | 554 | 5.7% |
c | 547 | 5.6% |
p | 504 | 5.2% |
: | 415 | 4.3% |
r | 401 | 4.1% |
Other values (43) | 2710 |
None
Value | Count | Frequency (%) |
? | 1 |
계약해지여부
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
False | |
---|---|
True | 221 |
Value | Count | Frequency (%) |
False | 9779 | |
True | 221 | 2.2% |
업체분류명 | 계약해지여부 | |
---|---|---|
업체분류명 | 1.000 | 0.093 |
계약해지여부 | 0.093 | 1.000 |
계약해지여부 | 업체분류명 | |
---|---|---|
계약해지여부 | 1.000 | 0.114 |
업체분류명 | 0.114 | 1.000 |
업체분류명 | 계약해지여부 | |
---|---|---|
업체분류명 | 1.000 | 0.114 |
계약해지여부 | 0.114 | 1.000 |
사업번호 | 업체분류명 | 업체명 | 주소 | 전화번호 | 홈페이지주소 | 계약해지여부 | |
---|---|---|---|---|---|---|---|
16131 | 11230-900000737 | 설계자 | (주)그룹환경종합건축사사무소 | <NA> | <NA> | <NA> | N |
18736 | 11560-100003012 | 기타 | (주)개신건설 | <NA> | <NA> | <NA> | N |
31978 | 11380-100001057 | 기타 | (주) 나라감정평가법인 | <NA> | <NA> | <NA> | N |
5011 | 11650-100002018 | 기타 | 법무법인엘케이비앤파트너스 | <NA> | <NA> | <NA> | N |
2738 | 11680-900000942 | 기타 | (주)주성 시.엠.시 | <NA> | <NA> | <NA> | N |
25440 | 11590-900000481 | 기타 | 유안타증권(주) | <NA> | <NA> | <NA> | N |
27405 | 11380-100001048 | 기타 | 대한지적공사 | <NA> | <NA> | <NA> | N |
10558 | 11620-100004000 | 기타 | 주식회사 코리아이앤씨 | <NA> | <NA> | <NA> | N |
5148 | 11290-100016005 | 기타 | 법무법인(유한) 시그니처 | <NA> | <NA> | <NA> | N |
8227 | 11740-900000149 | 기타 | 한방유비스 | <NA> | <NA> | <NA> | N |
사업번호 | 업체분류명 | 업체명 | 주소 | 전화번호 | 홈페이지주소 | 계약해지여부 | |
---|---|---|---|---|---|---|---|
28910 | 11440-100006006 | 기타 | (주)우현 엔지니어링 | <NA> | <NA> | <NA> | N |
27762 | 11380-100001060 | 기타 | 주식회사 풍익개발 | <NA> | <NA> | <NA> | N |
17523 | 11230-100006042 | 기타 | (주)씨엠닉스 | <NA> | <NA> | <NA> | N |
24654 | 11140-100002004 | 기타 | 건화종합건축사사무소 | <NA> | <NA> | <NA> | N |
3056 | 11650-900000553 | 기타 | 주식회사 플로우컴퍼니 | <NA> | <NA> | <NA> | N |
25499 | 11680-900000694 | 기타 | (주)다올하우징 | <NA> | <NA> | <NA> | N |
26438 | 11290-100016009 | 기타 | (주)삼호기술개발공사 | 서울특별시 서초구 서초대로22길 11-7 (방배동) | 031-426-3966 | <NA> | N |
31676 | 11290-900000106 | 기타 | (주)제이앤비코퍼레이션 | <NA> | <NA> | <NA> | N |
20069 | 11260-100002011 | 기타 | (주)도울기획 | <NA> | <NA> | <NA> | N |
1324 | 11710-100002008 | 기타 | 주식회사 백야시스템 | <NA> | <NA> | <NA> | N |
Most frequently occurring
사업번호 | 업체분류명 | 업체명 | 주소 | 전화번호 | 홈페이지주소 | 계약해지여부 | # duplicates | |
---|---|---|---|---|---|---|---|---|
226 | 11230-100012000 | 기타 | 법률사무소정비정경아 | <NA> | <NA> | <NA> | N | 45 |
796 | 11620-100004002 | 기타 | 국토속기사사무소 | <NA> | <NA> | <NA> | N | 35 |
438 | 11380-100001045 | 기타 | 법무법인산하 | <NA> | <NA> | <NA> | N | 31 |
726 | 11590-100001010 | 기타 | 변호사남기송법률사무소 | <NA> | <NA> | <NA> | N | 29 |
90 | 11200-100002000 | 기타 | 법무법인 혜안 | <NA> | <NA> | <NA> | N | 22 |
980 | 11680-900000542 | 기타 | 법무법인로드맵 | <NA> | <NA> | <NA> | N | 20 |
1047 | 11710-900000431 | 기타 | 법무법인케이씨엘 | <NA> | <NA> | <NA> | N | 20 |
249 | 11260-100003045 | 기타 | 중원종합법률사무소 | <NA> | <NA> | <NA> | N | 18 |
140 | 11215-100002008 | 기타 | 법무법인(유한)대륙아주 | <NA> | <NA> | <NA> | N | 17 |
507 | 11380-900000142 | 기타 | 법률사무소 정비 | <NA> | <NA> | <NA> | N | 17 |