Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 1283 |
Missing cells | 6432 |
Missing cells (%) | 35.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 141.7 KiB |
Average record size in memory | 113.1 B |
Variable types
Text | 11 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset
Description | 환경신기술 인검증 신청 회원 업체 정보(2020.10.26. 기준, 회원구분, 회사업종, 업태, 주소, 홈페이지, 사업분야 등) |
---|---|
Author | 한국환경산업기술원 |
URL | https://www.data.go.kr/data/15071519/fileData.do |
(회사)기준년도 is highly overall correlated with 회원구분 and 1 other fields | High correlation |
회원구분 is highly overall correlated with (회사)기준년도 | High correlation |
(회사)업태 is highly overall correlated with (회사)기준년도 | High correlation |
회원구분 is highly imbalanced (64.4%) | Imbalance |
사업자등록번호 has 137 (10.7%) missing values | Missing |
(회사)업종 has 353 (27.5%) missing values | Missing |
회사 대표자 has 118 (9.2%) missing values | Missing |
(회사)주소1 has 47 (3.7%) missing values | Missing |
(회사)주소2 has 1046 (81.5%) missing values | Missing |
(회사)전화 has 339 (26.4%) missing values | Missing |
(회사)홈페이지 has 793 (61.8%) missing values | Missing |
(회사)기준년도 has 1271 (99.1%) missing values | Missing |
기업명 영문 has 1172 (91.3%) missing values | Missing |
회사 사업 분야 has 1156 (90.1%) missing values | Missing |
회사번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 00:03:31.437300 |
---|---|
Analysis finished | 2023-12-12 00:03:33.086925 |
Duration | 1.65 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
회사번호
Text
UNIQUE
 
Distinct | 1283 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 12830 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1283 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | CP00000875 |
---|---|
2nd row | CP00000876 |
3rd row | CP00000877 |
4th row | CP00000878 |
5th row | CP00000884 |
Value | Count | Frequency (%) |
cp00000875 | 1 | 0.1% |
cp00000347 | 1 | 0.1% |
cp00000039 | 1 | 0.1% |
cp00000038 | 1 | 0.1% |
cp00000037 | 1 | 0.1% |
cp00000036 | 1 | 0.1% |
cp00000034 | 1 | 0.1% |
cp00000033 | 1 | 0.1% |
cp00000032 | 1 | 0.1% |
cp00000040 | 1 | 0.1% |
Other values (1273) | 1273 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 6038 | |
C | 1283 | 10.0% |
P | 1283 | 10.0% |
1 | 784 | 6.1% |
3 | 684 | 5.3% |
2 | 602 | 4.7% |
7 | 381 | 3.0% |
6 | 376 | 2.9% |
4 | 372 | 2.9% |
8 | 360 | 2.8% |
Other values (2) | 667 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10264 | |
Uppercase Letter | 2566 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 6038 | |
1 | 784 | 7.6% |
3 | 684 | 6.7% |
2 | 602 | 5.9% |
7 | 381 | 3.7% |
6 | 376 | 3.7% |
4 | 372 | 3.6% |
8 | 360 | 3.5% |
5 | 359 | 3.5% |
9 | 308 | 3.0% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1283 | |
P | 1283 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10264 | |
Latin | 2566 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 6038 | |
1 | 784 | 7.6% |
3 | 684 | 6.7% |
2 | 602 | 5.9% |
7 | 381 | 3.7% |
6 | 376 | 3.7% |
4 | 372 | 3.6% |
8 | 360 | 3.5% |
5 | 359 | 3.5% |
9 | 308 | 3.0% |
Latin
Value | Count | Frequency (%) |
C | 1283 | |
P | 1283 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12830 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 6038 | |
C | 1283 | 10.0% |
P | 1283 | 10.0% |
1 | 784 | 6.1% |
3 | 684 | 5.3% |
2 | 602 | 4.7% |
7 | 381 | 3.0% |
6 | 376 | 2.9% |
4 | 372 | 2.9% |
8 | 360 | 2.8% |
Other values (2) | 667 | 5.2% |
회원구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 13 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.2 KiB |
중소기업 | |
---|---|
대기업 | 82 |
벤쳐 | 62 |
기타 | 37 |
<NA> | 29 |
Other values (8) | 58 |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 3.7911146 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 중소기업 |
---|---|
2nd row | 중소기업 |
3rd row | 중소기업 |
4th row | 중소기업 |
5th row | 중소기업 |
Common Values
Value | Count | Frequency (%) |
중소기업 | 1015 | |
대기업 | 82 | 6.4% |
벤쳐 | 62 | 4.8% |
기타 | 37 | 2.9% |
<NA> | 29 | 2.3% |
중견기업 | 19 | 1.5% |
대학 | 16 | 1.2% |
중소기업연구소 | 7 | 0.5% |
출연연구기관 | 5 | 0.4% |
공공기관 | 5 | 0.4% |
Other values (3) | 6 | 0.5% |
Length
Value | Count | Frequency (%) |
중소기업 | 1015 | |
대기업 | 82 | 6.4% |
벤쳐 | 62 | 4.8% |
기타 | 37 | 2.9% |
na | 29 | 2.3% |
중견기업 | 19 | 1.5% |
대학 | 16 | 1.2% |
중소기업연구소 | 7 | 0.5% |
출연연구기관 | 5 | 0.4% |
공공기관 | 5 | 0.4% |
Other values (3) | 6 | 0.5% |
(회사)명
Text
Distinct | 1259 |
---|---|
Distinct (%) | 98.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.2 KiB |
Value | Count | Frequency (%) |
주식회사 | 103 | 7.2% |
산학협력단 | 12 | 0.8% |
주 | 6 | 0.4% |
주)포스코엔지니어링 | 3 | 0.2% |
유한회사 | 3 | 0.2% |
서울특별시 | 2 | 0.1% |
서울대학교 | 2 | 0.1% |
그린환경 | 2 | 0.1% |
㈜부강테크 | 2 | 0.1% |
이에스지케이 | 2 | 0.1% |
Other values (1263) | 1284 |
Most occurring characters
Value | Count | Frequency (%) |
주 | 741 | 7.7% |
) | 610 | 6.3% |
( | 609 | 6.3% |
㈜ | 392 | 4.1% |
이 | 306 | 3.2% |
경 | 216 | 2.2% |
환 | 195 | 2.0% |
사 | 192 | 2.0% |
엔 | 180 | 1.9% |
산 | 177 | 1.8% |
Other values (414) | 6012 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7767 | |
Close Punctuation | 610 | 6.3% |
Open Punctuation | 609 | 6.3% |
Other Symbol | 392 | 4.1% |
Space Separator | 138 | 1.4% |
Uppercase Letter | 93 | 1.0% |
Decimal Number | 10 | 0.1% |
Other Punctuation | 5 | 0.1% |
Lowercase Letter | 5 | 0.1% |
Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 741 | 9.5% |
이 | 306 | 3.9% |
경 | 216 | 2.8% |
환 | 195 | 2.5% |
사 | 192 | 2.5% |
엔 | 180 | 2.3% |
산 | 177 | 2.3% |
회 | 162 | 2.1% |
업 | 159 | 2.0% |
식 | 154 | 2.0% |
Other values (379) | 5285 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 17 | |
G | 14 | |
S | 11 | |
T | 11 | |
N | 8 | |
I | 7 | |
C | 6 | 6.5% |
K | 3 | 3.2% |
V | 3 | 3.2% |
A | 2 | 2.2% |
Other values (9) | 11 |
Lowercase Letter
Value | Count | Frequency (%) |
x | 1 | |
i | 1 | |
n | 1 | |
o | 1 | |
e | 1 |
Decimal Number
Value | Count | Frequency (%) |
1 | 5 | |
2 | 2 | 20.0% |
8 | 2 | 20.0% |
4 | 1 | 10.0% |
Other Punctuation
Value | Count | Frequency (%) |
& | 3 | |
. | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 610 |
Open Punctuation
Value | Count | Frequency (%) |
( | 609 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 392 |
Space Separator
Value | Count | Frequency (%) |
138 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8157 | |
Common | 1373 | 14.3% |
Latin | 98 | 1.0% |
Han | 2 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 741 | 9.1% |
㈜ | 392 | 4.8% |
이 | 306 | 3.8% |
경 | 216 | 2.6% |
환 | 195 | 2.4% |
사 | 192 | 2.4% |
엔 | 180 | 2.2% |
산 | 177 | 2.2% |
회 | 162 | 2.0% |
업 | 159 | 1.9% |
Other values (378) | 5437 |
Latin
Value | Count | Frequency (%) |
E | 17 | |
G | 14 | |
S | 11 | |
T | 11 | |
N | 8 | |
I | 7 | |
C | 6 | 6.1% |
K | 3 | 3.1% |
V | 3 | 3.1% |
A | 2 | 2.0% |
Other values (14) | 16 |
Common
Value | Count | Frequency (%) |
) | 610 | |
( | 609 | |
138 | 10.1% | |
1 | 5 | 0.4% |
& | 3 | 0.2% |
. | 2 | 0.1% |
2 | 2 | 0.1% |
8 | 2 | 0.1% |
4 | 1 | 0.1% |
_ | 1 | 0.1% |
Han
Value | Count | Frequency (%) |
境 | 1 | |
環 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7765 | |
ASCII | 1471 | 15.3% |
None | 392 | 4.1% |
CJK | 2 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
주 | 741 | 9.5% |
이 | 306 | 3.9% |
경 | 216 | 2.8% |
환 | 195 | 2.5% |
사 | 192 | 2.5% |
엔 | 180 | 2.3% |
산 | 177 | 2.3% |
회 | 162 | 2.1% |
업 | 159 | 2.0% |
식 | 154 | 2.0% |
Other values (377) | 5283 |
ASCII
Value | Count | Frequency (%) |
) | 610 | |
( | 609 | |
138 | 9.4% | |
E | 17 | 1.2% |
G | 14 | 1.0% |
S | 11 | 0.7% |
T | 11 | 0.7% |
N | 8 | 0.5% |
I | 7 | 0.5% |
C | 6 | 0.4% |
Other values (24) | 40 | 2.7% |
None
Value | Count | Frequency (%) |
㈜ | 392 |
CJK
Value | Count | Frequency (%) |
境 | 1 | |
環 | 1 |
사업자등록번호
Text
MISSING
 
Distinct | 1098 |
---|---|
Distinct (%) | 95.8% |
Missing | 137 |
Missing (%) | 10.7% |
Memory size | 10.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 13752 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1054 ? |
---|---|
Unique (%) | 92.0% |
Sample
1st row | 124-81-69018 |
---|---|
2nd row | 313-81-01199 |
3rd row | 215-86-53364 |
4th row | 128-81-85378 |
5th row | 136-81-13652 |
Value | Count | Frequency (%) |
123-81-62292 | 3 | 0.3% |
107-82-14534 | 3 | 0.3% |
312-81-34493 | 3 | 0.3% |
136-81-13652 | 3 | 0.3% |
137-85-01837 | 2 | 0.2% |
134-81-80567 | 2 | 0.2% |
318-81-01331 | 2 | 0.2% |
116-81-25566 | 2 | 0.2% |
111-11-11111 | 2 | 0.2% |
120-81-46916 | 2 | 0.2% |
Other values (1088) | 1122 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2462 | |
- | 2292 | |
8 | 1766 | |
0 | 1253 | |
2 | 1233 | |
3 | 946 | 6.9% |
6 | 895 | 6.5% |
4 | 881 | 6.4% |
5 | 754 | 5.5% |
7 | 679 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 11460 | |
Dash Punctuation | 2292 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2462 | |
8 | 1766 | |
0 | 1253 | |
2 | 1233 | |
3 | 946 | 8.3% |
6 | 895 | 7.8% |
4 | 881 | 7.7% |
5 | 754 | 6.6% |
7 | 679 | 5.9% |
9 | 591 | 5.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2292 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 13752 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2462 | |
- | 2292 | |
8 | 1766 | |
0 | 1253 | |
2 | 1233 | |
3 | 946 | 6.9% |
6 | 895 | 6.5% |
4 | 881 | 6.4% |
5 | 754 | 5.5% |
7 | 679 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13752 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2462 | |
- | 2292 | |
8 | 1766 | |
0 | 1253 | |
2 | 1233 | |
3 | 946 | 6.9% |
6 | 895 | 6.5% |
4 | 881 | 6.4% |
5 | 754 | 5.5% |
7 | 679 | 4.9% |
(회사)업종
Text
MISSING
 
Distinct | 566 |
---|---|
Distinct (%) | 60.9% |
Missing | 353 |
Missing (%) | 27.5% |
Memory size | 10.2 KiB |
Value | Count | Frequency (%) |
제조업 | 108 | 6.7% |
제조 | 81 | 5.0% |
건설업 | 67 | 4.1% |
건설폐기물중간처리업 | 56 | 3.5% |
서비스 | 55 | 3.4% |
및 | 53 | 3.3% |
외 | 47 | 2.9% |
건설 | 41 | 2.5% |
건설폐기물 | 30 | 1.9% |
중간처리업 | 19 | 1.2% |
Other values (740) | 1059 |
Most occurring characters
Value | Count | Frequency (%) |
688 | 6.9% | |
, | 616 | 6.2% |
업 | 544 | 5.5% |
설 | 491 | 4.9% |
기 | 391 | 3.9% |
제 | 359 | 3.6% |
건 | 359 | 3.6% |
조 | 331 | 3.3% |
비 | 229 | 2.3% |
물 | 224 | 2.3% |
Other values (318) | 5701 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8469 | |
Space Separator | 688 | 6.9% |
Other Punctuation | 657 | 6.6% |
Uppercase Letter | 52 | 0.5% |
Open Punctuation | 30 | 0.3% |
Close Punctuation | 30 | 0.3% |
Decimal Number | 4 | < 0.1% |
Dash Punctuation | 2 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 544 | 6.4% |
설 | 491 | 5.8% |
기 | 391 | 4.6% |
제 | 359 | 4.2% |
건 | 359 | 4.2% |
조 | 331 | 3.9% |
비 | 229 | 2.7% |
물 | 224 | 2.6% |
리 | 202 | 2.4% |
폐 | 177 | 2.1% |
Other values (297) | 5162 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 15 | |
T | 9 | |
V | 8 | |
E | 6 | 11.5% |
P | 3 | 5.8% |
W | 2 | 3.8% |
H | 2 | 3.8% |
N | 2 | 3.8% |
G | 2 | 3.8% |
S | 2 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 616 | |
/ | 22 | 3.3% |
. | 19 | 2.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
2 | 1 | 25.0% |
Space Separator
Value | Count | Frequency (%) |
688 |
Open Punctuation
Value | Count | Frequency (%) |
( | 30 |
Close Punctuation
Value | Count | Frequency (%) |
) | 30 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Lowercase Letter
Value | Count | Frequency (%) |
w | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8468 | |
Common | 1411 | 14.2% |
Latin | 53 | 0.5% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 544 | 6.4% |
설 | 491 | 5.8% |
기 | 391 | 4.6% |
제 | 359 | 4.2% |
건 | 359 | 4.2% |
조 | 331 | 3.9% |
비 | 229 | 2.7% |
물 | 224 | 2.6% |
리 | 202 | 2.4% |
폐 | 177 | 2.1% |
Other values (296) | 5161 |
Latin
Value | Count | Frequency (%) |
C | 15 | |
T | 9 | |
V | 8 | |
E | 6 | 11.3% |
P | 3 | 5.7% |
W | 2 | 3.8% |
H | 2 | 3.8% |
N | 2 | 3.8% |
G | 2 | 3.8% |
S | 2 | 3.8% |
Other values (2) | 2 | 3.8% |
Common
Value | Count | Frequency (%) |
688 | ||
, | 616 | |
( | 30 | 2.1% |
) | 30 | 2.1% |
/ | 22 | 1.6% |
. | 19 | 1.3% |
1 | 3 | 0.2% |
- | 2 | 0.1% |
2 | 1 | 0.1% |
Han
Value | Count | Frequency (%) |
外 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8468 | |
ASCII | 1464 | 14.7% |
CJK | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
688 | ||
, | 616 | |
( | 30 | 2.0% |
) | 30 | 2.0% |
/ | 22 | 1.5% |
. | 19 | 1.3% |
C | 15 | 1.0% |
T | 9 | 0.6% |
V | 8 | 0.5% |
E | 6 | 0.4% |
Other values (11) | 21 | 1.4% |
Hangul
Value | Count | Frequency (%) |
업 | 544 | 6.4% |
설 | 491 | 5.8% |
기 | 391 | 4.6% |
제 | 359 | 4.2% |
건 | 359 | 4.2% |
조 | 331 | 3.9% |
비 | 229 | 2.7% |
물 | 224 | 2.6% |
리 | 202 | 2.4% |
폐 | 177 | 2.1% |
Other values (296) | 5161 |
CJK
Value | Count | Frequency (%) |
外 | 1 |
(회사)업태
Categorical
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.2 KiB |
<NA> | |
---|---|
기타 | |
일반건설업 | |
전문건설업 | |
기술용역 | 50 |
Other values (3) | 44 |
Length
Max length | 6 |
---|---|
Median length | 5 |
Mean length | 3.664848 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 기타 |
3rd row | <NA> |
4th row | 전문건설업 |
5th row | 기타 |
Common Values
Value | Count | Frequency (%) |
<NA> | 579 | |
기타 | 341 | |
일반건설업 | 137 | 10.7% |
전문건설업 | 132 | 10.3% |
기술용역 | 50 | 3.9% |
연구소 | 23 | 1.8% |
정부투자기관 | 12 | 0.9% |
개인 | 9 | 0.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 579 | |
기타 | 341 | |
일반건설업 | 137 | 10.7% |
전문건설업 | 132 | 10.3% |
기술용역 | 50 | 3.9% |
연구소 | 23 | 1.8% |
정부투자기관 | 12 | 0.9% |
개인 | 9 | 0.7% |
회사 대표자
Text
MISSING
 
Distinct | 1100 |
---|---|
Distinct (%) | 94.4% |
Missing | 118 |
Missing (%) | 9.2% |
Memory size | 10.2 KiB |
Value | Count | Frequency (%) |
김 | 5 | 0.4% |
신영균 | 3 | 0.2% |
조성광 | 3 | 0.2% |
정일호 | 3 | 0.2% |
박용기 | 3 | 0.2% |
최형기 | 2 | 0.2% |
권오현 | 2 | 0.2% |
정창화 | 2 | 0.2% |
안성국 | 2 | 0.2% |
이용현 | 2 | 0.2% |
Other values (1130) | 1190 |
Most occurring characters
Value | Count | Frequency (%) |
김 | 233 | 6.2% |
이 | 183 | 4.8% |
영 | 108 | 2.9% |
정 | 105 | 2.8% |
박 | 103 | 2.7% |
성 | 77 | 2.0% |
수 | 68 | 1.8% |
최 | 64 | 1.7% |
재 | 63 | 1.7% |
58 | 1.5% | |
Other values (215) | 2713 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3652 | |
Space Separator | 58 | 1.5% |
Other Punctuation | 55 | 1.5% |
Uppercase Letter | 9 | 0.2% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
김 | 233 | 6.4% |
이 | 183 | 5.0% |
영 | 108 | 3.0% |
정 | 105 | 2.9% |
박 | 103 | 2.8% |
성 | 77 | 2.1% |
수 | 68 | 1.9% |
최 | 64 | 1.8% |
재 | 63 | 1.7% |
종 | 58 | 1.6% |
Other values (204) | 2590 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 2 | |
D | 2 | |
G | 1 | |
I | 1 | |
E | 1 | |
N | 1 | |
A | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 53 | |
/ | 2 | 3.6% |
Space Separator
Value | Count | Frequency (%) |
58 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3652 | |
Common | 114 | 3.0% |
Latin | 9 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
김 | 233 | 6.4% |
이 | 183 | 5.0% |
영 | 108 | 3.0% |
정 | 105 | 2.9% |
박 | 103 | 2.8% |
성 | 77 | 2.1% |
수 | 68 | 1.9% |
최 | 64 | 1.8% |
재 | 63 | 1.7% |
종 | 58 | 1.6% |
Other values (204) | 2590 |
Latin
Value | Count | Frequency (%) |
L | 2 | |
D | 2 | |
G | 1 | |
I | 1 | |
E | 1 | |
N | 1 | |
A | 1 |
Common
Value | Count | Frequency (%) |
58 | ||
, | 53 | |
/ | 2 | 1.8% |
- | 1 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3652 | |
ASCII | 123 | 3.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
김 | 233 | 6.4% |
이 | 183 | 5.0% |
영 | 108 | 3.0% |
정 | 105 | 2.9% |
박 | 103 | 2.8% |
성 | 77 | 2.1% |
수 | 68 | 1.9% |
최 | 64 | 1.8% |
재 | 63 | 1.7% |
종 | 58 | 1.6% |
Other values (204) | 2590 |
ASCII
Value | Count | Frequency (%) |
58 | ||
, | 53 | |
L | 2 | 1.6% |
D | 2 | 1.6% |
/ | 2 | 1.6% |
G | 1 | 0.8% |
I | 1 | 0.8% |
E | 1 | 0.8% |
N | 1 | 0.8% |
A | 1 | 0.8% |
(회사)주소1
Text
MISSING
 
Distinct | 1207 |
---|---|
Distinct (%) | 97.7% |
Missing | 47 |
Missing (%) | 3.7% |
Memory size | 10.2 KiB |
Length
Max length | 49 |
---|---|
Median length | 41 |
Mean length | 22.231392 |
Min length | 9 |
Characters and Unicode
Total characters | 27478 |
---|---|
Distinct characters | 469 |
Distinct categories | 12 ? |
Distinct scripts | 3 ? |
Distinct blocks | 5 ? |
Unique
Unique | 1182 ? |
---|---|
Unique (%) | 95.6% |
Sample
1st row | 경기도 연천군 전곡읍 늘목리 61-4 |
---|---|
2nd row | 경기도 화성시 정문송산로93번길 10-27 |
3rd row | 충남 천안시 직산면 자은가리 82-2 |
4th row | 충청남도 보령시 남포면 평촌밤섬길 218-191 |
5th row | 서울 송파구 방이1동 165-3 |
Value | Count | Frequency (%) |
경기도 | 275 | 4.4% |
서울 | 126 | 2.0% |
경기 | 109 | 1.8% |
서울특별시 | 108 | 1.7% |
서울시 | 49 | 0.8% |
화성시 | 43 | 0.7% |
충남 | 41 | 0.7% |
성남시 | 40 | 0.6% |
안양시 | 39 | 0.6% |
경남 | 38 | 0.6% |
Other values (2945) | 5327 |
Most occurring characters
Value | Count | Frequency (%) |
5004 | 18.2% | |
1 | 1067 | 3.9% |
시 | 968 | 3.5% |
구 | 787 | 2.9% |
2 | 772 | 2.8% |
동 | 765 | 2.8% |
3 | 613 | 2.2% |
로 | 585 | 2.1% |
- | 573 | 2.1% |
경 | 545 | 2.0% |
Other values (459) | 15799 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 16295 | |
Decimal Number | 5194 | 18.9% |
Space Separator | 5004 | 18.2% |
Dash Punctuation | 573 | 2.1% |
Uppercase Letter | 121 | 0.4% |
Close Punctuation | 108 | 0.4% |
Open Punctuation | 108 | 0.4% |
Other Punctuation | 34 | 0.1% |
Lowercase Letter | 24 | 0.1% |
Math Symbol | 10 | < 0.1% |
Other values (2) | 7 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 968 | 5.9% |
구 | 787 | 4.8% |
동 | 765 | 4.7% |
로 | 585 | 3.6% |
경 | 545 | 3.3% |
도 | 525 | 3.2% |
서 | 458 | 2.8% |
기 | 437 | 2.7% |
남 | 371 | 2.3% |
산 | 351 | 2.2% |
Other values (405) | 10503 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 20 | |
T | 17 | |
S | 11 | |
K | 10 | |
A | 10 | |
I | 9 | 7.4% |
C | 7 | 5.8% |
E | 5 | 4.1% |
D | 5 | 4.1% |
L | 4 | 3.3% |
Other values (11) | 23 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 4 | |
w | 3 | |
n | 3 | |
o | 3 | |
r | 2 | |
s | 2 | |
i | 2 | |
h | 1 | 4.2% |
c | 1 | 4.2% |
k | 1 | 4.2% |
Other values (2) | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1067 | |
2 | 772 | |
3 | 613 | |
5 | 474 | |
0 | 465 | |
4 | 398 | 7.7% |
6 | 397 | 7.6% |
8 | 353 | 6.8% |
7 | 341 | 6.6% |
9 | 314 | 6.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 29 | |
/ | 4 | 11.8% |
& | 1 | 2.9% |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 5 | |
ⓝ | 1 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
5004 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 573 |
Close Punctuation
Value | Count | Frequency (%) |
) | 108 |
Open Punctuation
Value | Count | Frequency (%) |
( | 108 |
Math Symbol
Value | Count | Frequency (%) |
~ | 10 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 16300 | |
Common | 11032 | |
Latin | 146 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 968 | 5.9% |
구 | 787 | 4.8% |
동 | 765 | 4.7% |
로 | 585 | 3.6% |
경 | 545 | 3.3% |
도 | 525 | 3.2% |
서 | 458 | 2.8% |
기 | 437 | 2.7% |
남 | 371 | 2.3% |
산 | 351 | 2.2% |
Other values (406) | 10508 |
Latin
Value | Count | Frequency (%) |
B | 20 | |
T | 17 | 11.6% |
S | 11 | 7.5% |
K | 10 | 6.8% |
A | 10 | 6.8% |
I | 9 | 6.2% |
C | 7 | 4.8% |
E | 5 | 3.4% |
D | 5 | 3.4% |
L | 4 | 2.7% |
Other values (24) | 48 |
Common
Value | Count | Frequency (%) |
5004 | ||
1 | 1067 | 9.7% |
2 | 772 | 7.0% |
3 | 613 | 5.6% |
- | 573 | 5.2% |
5 | 474 | 4.3% |
0 | 465 | 4.2% |
4 | 398 | 3.6% |
6 | 397 | 3.6% |
8 | 353 | 3.2% |
Other values (9) | 916 | 8.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 16295 | |
ASCII | 11175 | |
None | 6 | < 0.1% |
Number Forms | 1 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5004 | ||
1 | 1067 | 9.5% |
2 | 772 | 6.9% |
3 | 613 | 5.5% |
- | 573 | 5.1% |
5 | 474 | 4.2% |
0 | 465 | 4.2% |
4 | 398 | 3.6% |
6 | 397 | 3.6% |
8 | 353 | 3.2% |
Other values (40) | 1059 | 9.5% |
Hangul
Value | Count | Frequency (%) |
시 | 968 | 5.9% |
구 | 787 | 4.8% |
동 | 765 | 4.7% |
로 | 585 | 3.6% |
경 | 545 | 3.3% |
도 | 525 | 3.2% |
서 | 458 | 2.8% |
기 | 437 | 2.7% |
남 | 371 | 2.3% |
산 | 351 | 2.2% |
Other values (405) | 10503 |
None
Value | Count | Frequency (%) |
㈜ | 5 | |
& | 1 | 16.7% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓝ | 1 |
(회사)주소2
Text
MISSING
 
Distinct | 216 |
---|---|
Distinct (%) | 91.1% |
Missing | 1046 |
Missing (%) | 81.5% |
Memory size | 10.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 20 |
Mean length | 8.9704641 |
Min length | 1 |
Characters and Unicode
Total characters | 2126 |
---|---|
Distinct characters | 277 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 205 ? |
---|---|
Unique (%) | 86.5% |
Sample
1st row | 2401호(영덕동, 유-타워) |
---|---|
2nd row | 1110호 |
3rd row | 1007호 |
4th row | 시티플러스 702 |
5th row | 12층(퍼스트타워) |
Value | Count | Frequency (%) |
2층 | 13 | 3.5% |
3층 | 10 | 2.7% |
4층 | 7 | 1.9% |
6층 | 6 | 1.6% |
1층 | 6 | 1.6% |
a동 | 5 | 1.4% |
405호 | 4 | 1.1% |
202호 | 4 | 1.1% |
3 | 0.8% | |
b동 | 3 | 0.8% |
Other values (292) | 309 |
Most occurring characters
Value | Count | Frequency (%) |
134 | 6.3% | |
1 | 122 | 5.7% |
0 | 111 | 5.2% |
호 | 105 | 4.9% |
2 | 87 | 4.1% |
동 | 87 | 4.1% |
( | 79 | 3.7% |
) | 79 | 3.7% |
층 | 64 | 3.0% |
3 | 44 | 2.1% |
Other values (267) | 1214 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1213 | |
Decimal Number | 519 | |
Space Separator | 134 | 6.3% |
Open Punctuation | 79 | 3.7% |
Close Punctuation | 79 | 3.7% |
Uppercase Letter | 37 | 1.7% |
Other Punctuation | 33 | 1.6% |
Dash Punctuation | 23 | 1.1% |
Lowercase Letter | 6 | 0.3% |
Math Symbol | 2 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 105 | 8.7% |
동 | 87 | 7.2% |
층 | 64 | 5.3% |
이 | 32 | 2.6% |
주 | 27 | 2.2% |
빌 | 27 | 2.2% |
딩 | 26 | 2.1% |
스 | 25 | 2.1% |
대 | 21 | 1.7% |
경 | 19 | 1.6% |
Other values (226) | 780 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 11 | |
B | 8 | |
T | 4 | 10.8% |
R | 2 | 5.4% |
I | 2 | 5.4% |
N | 1 | 2.7% |
C | 1 | 2.7% |
X | 1 | 2.7% |
J | 1 | 2.7% |
D | 1 | 2.7% |
Other values (5) | 5 |
Decimal Number
Value | Count | Frequency (%) |
1 | 122 | |
0 | 111 | |
2 | 87 | |
3 | 44 | 8.5% |
6 | 39 | 7.5% |
4 | 37 | 7.1% |
5 | 30 | 5.8% |
7 | 23 | 4.4% |
8 | 19 | 3.7% |
9 | 7 | 1.3% |
Lowercase Letter
Value | Count | Frequency (%) |
w | 1 | |
e | 1 | |
b | 1 | |
a | 1 | |
m | 1 | |
p | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 30 | |
. | 1 | 3.0% |
& | 1 | 3.0% |
; | 1 | 3.0% |
Space Separator
Value | Count | Frequency (%) |
134 |
Open Punctuation
Value | Count | Frequency (%) |
( | 79 |
Close Punctuation
Value | Count | Frequency (%) |
) | 79 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 23 |
Math Symbol
Value | Count | Frequency (%) |
~ | 2 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1214 | |
Common | 869 | |
Latin | 43 | 2.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 105 | 8.6% |
동 | 87 | 7.2% |
층 | 64 | 5.3% |
이 | 32 | 2.6% |
주 | 27 | 2.2% |
빌 | 27 | 2.2% |
딩 | 26 | 2.1% |
스 | 25 | 2.1% |
대 | 21 | 1.7% |
경 | 19 | 1.6% |
Other values (227) | 781 |
Latin
Value | Count | Frequency (%) |
A | 11 | |
B | 8 | |
T | 4 | 9.3% |
R | 2 | 4.7% |
I | 2 | 4.7% |
w | 1 | 2.3% |
e | 1 | 2.3% |
N | 1 | 2.3% |
C | 1 | 2.3% |
b | 1 | 2.3% |
Other values (11) | 11 |
Common
Value | Count | Frequency (%) |
134 | ||
1 | 122 | |
0 | 111 | |
2 | 87 | |
( | 79 | |
) | 79 | |
3 | 44 | 5.1% |
6 | 39 | 4.5% |
4 | 37 | 4.3% |
5 | 30 | 3.5% |
Other values (9) | 107 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1213 | |
ASCII | 912 | |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
134 | ||
1 | 122 | |
0 | 111 | |
2 | 87 | |
( | 79 | |
) | 79 | |
3 | 44 | 4.8% |
6 | 39 | 4.3% |
4 | 37 | 4.1% |
5 | 30 | 3.3% |
Other values (30) | 150 |
Hangul
Value | Count | Frequency (%) |
호 | 105 | 8.7% |
동 | 87 | 7.2% |
층 | 64 | 5.3% |
이 | 32 | 2.6% |
주 | 27 | 2.2% |
빌 | 27 | 2.2% |
딩 | 26 | 2.1% |
스 | 25 | 2.1% |
대 | 21 | 1.7% |
경 | 19 | 1.6% |
Other values (226) | 780 |
None
Value | Count | Frequency (%) |
㈜ | 1 |
(회사)전화
Text
MISSING
 
Distinct | 904 |
---|---|
Distinct (%) | 95.8% |
Missing | 339 |
Missing (%) | 26.4% |
Memory size | 10.2 KiB |
Length
Max length | 14 |
---|---|
Median length | 12 |
Mean length | 11.907839 |
Min length | 11 |
Characters and Unicode
Total characters | 11241 |
---|---|
Distinct characters | 13 |
Distinct categories | 4 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 867 ? |
---|---|
Unique (%) | 91.8% |
Sample
1st row | 031-832-0011 |
---|---|
2nd row | 041-584-7007 |
3rd row | 041-931-1425 |
4th row | 02-417-4150 |
5th row | 031-906-3223 |
Value | Count | Frequency (%) |
032-562-1658 | 3 | 0.3% |
043-855-7901 | 3 | 0.3% |
053-526-4377 | 3 | 0.3% |
031-495-0574 | 2 | 0.2% |
041-357-5100 | 2 | 0.2% |
02-2008-9841 | 2 | 0.2% |
055-932-9200 | 2 | 0.2% |
062-383-6040 | 2 | 0.2% |
031-382-7907 | 2 | 0.2% |
02-745-2111 | 2 | 0.2% |
Other values (894) | 921 |
Most occurring characters
Value | Count | Frequency (%) |
- | 1887 | |
0 | 1768 | |
3 | 1187 | |
2 | 1132 | |
1 | 1025 | |
5 | 958 | |
4 | 766 | |
6 | 732 | 6.5% |
7 | 729 | 6.5% |
8 | 642 | 5.7% |
Other values (3) | 415 | 3.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 9351 | |
Dash Punctuation | 1887 | 16.8% |
Math Symbol | 2 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 1768 | |
3 | 1187 | |
2 | 1132 | |
1 | 1025 | |
5 | 958 | |
4 | 766 | |
6 | 732 | |
7 | 729 | |
8 | 642 | 6.9% |
9 | 412 | 4.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1887 |
Math Symbol
Value | Count | Frequency (%) |
~ | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 11241 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 1887 | |
0 | 1768 | |
3 | 1187 | |
2 | 1132 | |
1 | 1025 | |
5 | 958 | |
4 | 766 | |
6 | 732 | 6.5% |
7 | 729 | 6.5% |
8 | 642 | 5.7% |
Other values (3) | 415 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 11241 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 1887 | |
0 | 1768 | |
3 | 1187 | |
2 | 1132 | |
1 | 1025 | |
5 | 958 | |
4 | 766 | |
6 | 732 | 6.5% |
7 | 729 | 6.5% |
8 | 642 | 5.7% |
Other values (3) | 415 | 3.7% |
(회사)홈페이지
Text
MISSING
 
Distinct | 472 |
---|---|
Distinct (%) | 96.3% |
Missing | 793 |
Missing (%) | 61.8% |
Memory size | 10.2 KiB |
Length
Max length | 47 |
---|---|
Median length | 36 |
Mean length | 17.777551 |
Min length | 1 |
Characters and Unicode
Total characters | 8711 |
---|---|
Distinct characters | 85 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 457 ? |
---|---|
Unique (%) | 93.3% |
Sample
1st row | ww.kbec.co.kr |
---|---|
2nd row | www.krsys.kr |
3rd row | www.wellture.com |
4th row | www.ilsong.co.kr |
5th row | http://www.goldrecycle.co.kr/ |
Value | Count | Frequency (%) |
7 | 1.4% | |
www.forcebel.co.kr | 3 | 0.6% |
www.insun.com | 2 | 0.4% |
www.thewillsystem.com | 2 | 0.4% |
www.janghyung.co.kr | 2 | 0.4% |
www.lh.or.kr | 2 | 0.4% |
www.ktr.or.kr | 2 | 0.4% |
www.taeyoung.com | 2 | 0.4% |
www.hansoleme.com | 2 | 0.4% |
www.tscne.net | 2 | 0.4% |
Other values (461) | 466 |
Most occurring characters
Value | Count | Frequency (%) |
w | 1330 | |
. | 1191 | |
o | 719 | 8.3% |
c | 632 | 7.3% |
r | 464 | 5.3% |
e | 461 | 5.3% |
t | 445 | 5.1% |
k | 406 | 4.7% |
n | 378 | 4.3% |
/ | 291 | 3.3% |
Other values (75) | 2394 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 6908 | |
Other Punctuation | 1616 | 18.6% |
Decimal Number | 93 | 1.1% |
Other Letter | 40 | 0.5% |
Dash Punctuation | 37 | 0.4% |
Uppercase Letter | 10 | 0.1% |
Space Separator | 2 | < 0.1% |
Close Punctuation | 2 | < 0.1% |
Open Punctuation | 2 | < 0.1% |
Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
수 | 3 | 7.5% |
국 | 2 | 5.0% |
월 | 2 | 5.0% |
문 | 2 | 5.0% |
삼 | 2 | 5.0% |
기 | 2 | 5.0% |
덮 | 1 | 2.5% |
성 | 1 | 2.5% |
상 | 1 | 2.5% |
중 | 1 | 2.5% |
Other values (23) | 23 |
Lowercase Letter
Value | Count | Frequency (%) |
w | 1330 | |
o | 719 | |
c | 632 | |
r | 464 | 6.7% |
e | 461 | 6.7% |
t | 445 | 6.4% |
k | 406 | 5.9% |
n | 378 | 5.5% |
h | 272 | 3.9% |
a | 260 | 3.8% |
Other values (16) | 1541 |
Decimal Number
Value | Count | Frequency (%) |
1 | 26 | |
2 | 22 | |
0 | 19 | |
8 | 7 | 7.5% |
3 | 6 | 6.5% |
7 | 5 | 5.4% |
9 | 3 | 3.2% |
4 | 3 | 3.2% |
6 | 1 | 1.1% |
5 | 1 | 1.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1191 | |
/ | 291 | 18.0% |
: | 118 | 7.3% |
@ | 8 | 0.5% |
, | 7 | 0.4% |
; | 1 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
W | 6 | |
O | 1 | 10.0% |
M | 1 | 10.0% |
C | 1 | 10.0% |
D | 1 | 10.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 37 |
Space Separator
Value | Count | Frequency (%) |
2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6918 | |
Common | 1753 | 20.1% |
Hangul | 40 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
수 | 3 | 7.5% |
국 | 2 | 5.0% |
월 | 2 | 5.0% |
문 | 2 | 5.0% |
삼 | 2 | 5.0% |
기 | 2 | 5.0% |
덮 | 1 | 2.5% |
성 | 1 | 2.5% |
상 | 1 | 2.5% |
중 | 1 | 2.5% |
Other values (23) | 23 |
Latin
Value | Count | Frequency (%) |
w | 1330 | |
o | 719 | |
c | 632 | |
r | 464 | 6.7% |
e | 461 | 6.7% |
t | 445 | 6.4% |
k | 406 | 5.9% |
n | 378 | 5.5% |
h | 272 | 3.9% |
a | 260 | 3.8% |
Other values (21) | 1551 |
Common
Value | Count | Frequency (%) |
. | 1191 | |
/ | 291 | 16.6% |
: | 118 | 6.7% |
- | 37 | 2.1% |
1 | 26 | 1.5% |
2 | 22 | 1.3% |
0 | 19 | 1.1% |
@ | 8 | 0.5% |
8 | 7 | 0.4% |
, | 7 | 0.4% |
Other values (11) | 27 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8671 | |
Hangul | 40 | 0.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
w | 1330 | |
. | 1191 | |
o | 719 | 8.3% |
c | 632 | 7.3% |
r | 464 | 5.4% |
e | 461 | 5.3% |
t | 445 | 5.1% |
k | 406 | 4.7% |
n | 378 | 4.4% |
/ | 291 | 3.4% |
Other values (42) | 2354 |
Hangul
Value | Count | Frequency (%) |
수 | 3 | 7.5% |
국 | 2 | 5.0% |
월 | 2 | 5.0% |
문 | 2 | 5.0% |
삼 | 2 | 5.0% |
기 | 2 | 5.0% |
덮 | 1 | 2.5% |
성 | 1 | 2.5% |
상 | 1 | 2.5% |
중 | 1 | 2.5% |
Other values (23) | 23 |
(회사)기준년도
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 50.0% |
Missing | 1271 |
Missing (%) | 99.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2007.4167 |
Minimum | 1961 |
---|---|
Maximum | 2016 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.4 KiB |
Quantile statistics
Minimum | 1961 |
---|---|
5-th percentile | 1981.9 |
Q1 | 2004.25 |
median | 2015 |
Q3 | 2015 |
95-th percentile | 2016 |
Maximum | 2016 |
Range | 55 |
Interquartile range (IQR) | 10.75 |
Descriptive statistics
Standard deviation | 15.819771 |
---|---|
Coefficient of variation (CV) | 0.0078806613 |
Kurtosis | 7.7372052 |
Mean | 2007.4167 |
Median Absolute Deviation (MAD) | 0.5 |
Skewness | -2.6698106 |
Sum | 24089 |
Variance | 250.26515 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2015 | 6 | 0.5% |
2016 | 2 | 0.2% |
1961 | 1 | 0.1% |
2002 | 1 | 0.1% |
2005 | 1 | 0.1% |
1999 | 1 | 0.1% |
(Missing) | 1271 |
Value | Count | Frequency (%) |
1961 | 1 | 0.1% |
1999 | 1 | 0.1% |
2002 | 1 | 0.1% |
2005 | 1 | 0.1% |
2015 | 6 | |
2016 | 2 | 0.2% |
Value | Count | Frequency (%) |
2016 | 2 | 0.2% |
2015 | 6 | |
2005 | 1 | 0.1% |
2002 | 1 | 0.1% |
1999 | 1 | 0.1% |
1961 | 1 | 0.1% |
기업명 영문
Text
MISSING
 
Distinct | 110 |
---|---|
Distinct (%) | 99.1% |
Missing | 1172 |
Missing (%) | 91.3% |
Memory size | 10.2 KiB |
Length
Max length | 46 |
---|---|
Median length | 29 |
Mean length | 20.108108 |
Min length | 4 |
Characters and Unicode
Total characters | 2232 |
---|---|
Distinct characters | 69 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 109 ? |
---|---|
Unique (%) | 98.2% |
Sample
1st row | Sacheon Environment co |
---|---|
2nd row | Tobang E&E |
3rd row | TAECHANG NIKKEI |
4th row | SAMIL CHEMICAL Co., Ltd. |
5th row | Forcebel Global Co., Ltd. |
Value | Count | Frequency (%) |
co | 29 | 9.2% |
ltd | 29 | 9.2% |
co.,ltd | 18 | 5.7% |
environment | 10 | 3.2% |
industry | 7 | 2.2% |
co.ltd | 6 | 1.9% |
korea | 5 | 1.6% |
inc | 5 | 1.6% |
construction | 5 | 1.6% |
engineering | 4 | 1.3% |
Other values (170) | 197 |
Most occurring characters
Value | Count | Frequency (%) |
206 | 9.2% | |
o | 159 | 7.1% |
n | 155 | 6.9% |
e | 109 | 4.9% |
t | 105 | 4.7% |
. | 99 | 4.4% |
C | 91 | 4.1% |
L | 77 | 3.4% |
i | 71 | 3.2% |
r | 67 | 3.0% |
Other values (59) | 1093 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1122 | |
Uppercase Letter | 719 | |
Space Separator | 206 | 9.2% |
Other Punctuation | 161 | 7.2% |
Other Letter | 13 | 0.6% |
Dash Punctuation | 5 | 0.2% |
Decimal Number | 2 | 0.1% |
Close Punctuation | 2 | 0.1% |
Open Punctuation | 2 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 159 | |
n | 155 | |
e | 109 | |
t | 105 | |
i | 71 | 6.3% |
r | 67 | 6.0% |
d | 62 | 5.5% |
a | 53 | 4.7% |
c | 49 | 4.4% |
g | 42 | 3.7% |
Other values (14) | 250 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 91 | |
L | 77 | |
E | 64 | 8.9% |
O | 53 | 7.4% |
N | 47 | 6.5% |
T | 47 | 6.5% |
D | 42 | 5.8% |
I | 41 | 5.7% |
S | 34 | 4.7% |
A | 33 | 4.6% |
Other values (14) | 190 |
Other Letter
Value | Count | Frequency (%) |
주 | 2 | |
기 | 1 | |
성 | 1 | |
한 | 1 | |
식 | 1 | |
부 | 1 | |
회 | 1 | |
웅 | 1 | |
환 | 1 | |
경 | 1 | |
Other values (2) | 2 |
Other Punctuation
Value | Count | Frequency (%) |
. | 99 | |
, | 49 | |
& | 7 | 4.3% |
; | 6 | 3.7% |
Space Separator
Value | Count | Frequency (%) |
206 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5 |
Decimal Number
Value | Count | Frequency (%) |
2 | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1841 | |
Common | 378 | 16.9% |
Hangul | 13 | 0.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 159 | 8.6% |
n | 155 | 8.4% |
e | 109 | 5.9% |
t | 105 | 5.7% |
C | 91 | 4.9% |
L | 77 | 4.2% |
i | 71 | 3.9% |
r | 67 | 3.6% |
E | 64 | 3.5% |
d | 62 | 3.4% |
Other values (38) | 881 |
Hangul
Value | Count | Frequency (%) |
주 | 2 | |
기 | 1 | |
성 | 1 | |
한 | 1 | |
식 | 1 | |
부 | 1 | |
회 | 1 | |
웅 | 1 | |
환 | 1 | |
경 | 1 | |
Other values (2) | 2 |
Common
Value | Count | Frequency (%) |
206 | ||
. | 99 | |
, | 49 | 13.0% |
& | 7 | 1.9% |
; | 6 | 1.6% |
- | 5 | 1.3% |
2 | 2 | 0.5% |
) | 2 | 0.5% |
( | 2 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2219 | |
Hangul | 13 | 0.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
206 | 9.3% | |
o | 159 | 7.2% |
n | 155 | 7.0% |
e | 109 | 4.9% |
t | 105 | 4.7% |
. | 99 | 4.5% |
C | 91 | 4.1% |
L | 77 | 3.5% |
i | 71 | 3.2% |
r | 67 | 3.0% |
Other values (47) | 1080 |
Hangul
Value | Count | Frequency (%) |
주 | 2 | |
기 | 1 | |
성 | 1 | |
한 | 1 | |
식 | 1 | |
부 | 1 | |
회 | 1 | |
웅 | 1 | |
환 | 1 | |
경 | 1 | |
Other values (2) | 2 |
회사 사업 분야
Text
MISSING
 
Distinct | 91 |
---|---|
Distinct (%) | 71.7% |
Missing | 1156 |
Missing (%) | 90.1% |
Memory size | 10.2 KiB |
Length
Max length | 100 |
---|---|
Median length | 47 |
Mean length | 13.889764 |
Min length | 2 |
Characters and Unicode
Total characters | 1764 |
---|---|
Distinct characters | 177 |
Distinct categories | 6 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 79 ? |
---|---|
Unique (%) | 62.2% |
Sample
1st row | 조경, 환경복원, 생태복원 |
---|---|
2nd row | 건설폐기물(폐콘/폐아스콘) 중간 처리 |
3rd row | 농업, 어업, 광업, 임업 |
4th row | 석유, 화학, 에너지 |
5th row | 토양/지하수정화, 토목공사업, 엔지니어링, 전문광해방지 |
Value | Count | Frequency (%) |
건설 | 26 | 6.7% |
토목 | 17 | 4.4% |
건축 | 17 | 4.4% |
환경 | 14 | 3.6% |
시공 | 11 | 2.8% |
및 | 10 | 2.6% |
제조 | 10 | 2.6% |
폐기물 | 8 | 2.1% |
처리업 | 8 | 2.1% |
상하수도 | 7 | 1.8% |
Other values (180) | 258 |
Most occurring characters
Value | Count | Frequency (%) |
259 | 14.7% | |
, | 131 | 7.4% |
설 | 79 | 4.5% |
기 | 68 | 3.9% |
건 | 64 | 3.6% |
공 | 46 | 2.6% |
업 | 46 | 2.6% |
경 | 38 | 2.2% |
수 | 37 | 2.1% |
환 | 36 | 2.0% |
Other values (167) | 960 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1359 | |
Space Separator | 259 | 14.7% |
Other Punctuation | 138 | 7.8% |
Uppercase Letter | 4 | 0.2% |
Open Punctuation | 2 | 0.1% |
Close Punctuation | 2 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
설 | 79 | 5.8% |
기 | 68 | 5.0% |
건 | 64 | 4.7% |
공 | 46 | 3.4% |
업 | 46 | 3.4% |
경 | 38 | 2.8% |
수 | 37 | 2.7% |
환 | 36 | 2.6% |
제 | 35 | 2.6% |
사 | 34 | 2.5% |
Other values (158) | 876 |
Other Punctuation
Value | Count | Frequency (%) |
, | 131 | |
/ | 5 | 3.6% |
· | 2 | 1.4% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 2 | |
R | 1 | |
B | 1 |
Space Separator
Value | Count | Frequency (%) |
259 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1359 | |
Common | 401 | 22.7% |
Latin | 4 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
설 | 79 | 5.8% |
기 | 68 | 5.0% |
건 | 64 | 4.7% |
공 | 46 | 3.4% |
업 | 46 | 3.4% |
경 | 38 | 2.8% |
수 | 37 | 2.7% |
환 | 36 | 2.6% |
제 | 35 | 2.6% |
사 | 34 | 2.5% |
Other values (158) | 876 |
Common
Value | Count | Frequency (%) |
259 | ||
, | 131 | |
/ | 5 | 1.2% |
· | 2 | 0.5% |
( | 2 | 0.5% |
) | 2 | 0.5% |
Latin
Value | Count | Frequency (%) |
S | 2 | |
R | 1 | |
B | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1359 | |
ASCII | 403 | 22.8% |
None | 2 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
259 | ||
, | 131 | |
/ | 5 | 1.2% |
( | 2 | 0.5% |
) | 2 | 0.5% |
S | 2 | 0.5% |
R | 1 | 0.2% |
B | 1 | 0.2% |
Hangul
Value | Count | Frequency (%) |
설 | 79 | 5.8% |
기 | 68 | 5.0% |
건 | 64 | 4.7% |
공 | 46 | 3.4% |
업 | 46 | 3.4% |
경 | 38 | 2.8% |
수 | 37 | 2.7% |
환 | 36 | 2.6% |
제 | 35 | 2.6% |
사 | 34 | 2.5% |
Other values (158) | 876 |
None
Value | Count | Frequency (%) |
· | 2 |
회원구분 | (회사)업태 | (회사)기준년도 | 회사 사업 분야 | |
---|---|---|---|---|
회원구분 | 1.000 | 0.585 | 0.872 | 0.426 |
(회사)업태 | 0.585 | 1.000 | 0.435 | 0.864 |
(회사)기준년도 | 0.872 | 0.435 | 1.000 | NaN |
회사 사업 분야 | 0.426 | 0.864 | NaN | 1.000 |
회원구분 | (회사)업태 | |
---|---|---|
회원구분 | 1.000 | 0.337 |
(회사)업태 | 0.337 | 1.000 |
(회사)기준년도 | 회원구분 | (회사)업태 | |
---|---|---|---|
(회사)기준년도 | 1.000 | 0.701 | 0.575 |
회원구분 | 0.701 | 1.000 | 0.337 |
(회사)업태 | 0.575 | 0.337 | 1.000 |
회사번호 | 회원구분 | (회사)명 | 사업자등록번호 | (회사)업종 | (회사)업태 | 회사 대표자 | (회사)주소1 | (회사)주소2 | (회사)전화 | (회사)홈페이지 | (회사)기준년도 | 기업명 영문 | 회사 사업 분야 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | CP00000875 | 중소기업 | 신잔토개발㈜ | <NA> | <NA> | <NA> | <NA> | 경기도 연천군 전곡읍 늘목리 61-4 | <NA> | 031-832-0011 | <NA> | <NA> | <NA> | <NA> |
1 | CP00000876 | 중소기업 | (주)진흥중공업 | 124-81-69018 | 폐기물수집 및 처리업 | 기타 | 박찬양 | 경기도 화성시 정문송산로93번길 10-27 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | CP00000877 | 중소기업 | 도솔환경산업㈜ | <NA> | <NA> | <NA> | <NA> | 충남 천안시 직산면 자은가리 82-2 | <NA> | 041-584-7007 | <NA> | <NA> | <NA> | <NA> |
3 | CP00000878 | 중소기업 | 삼원환경산업(주) | 313-81-01199 | 서비스 | 전문건설업 | 남궁훈 | 충청남도 보령시 남포면 평촌밤섬길 218-191 | <NA> | 041-931-1425 | <NA> | <NA> | <NA> | <NA> |
4 | CP00000884 | 중소기업 | ㈜케이벡코리아 | 215-86-53364 | 환경.토목엔지니어링 | 기타 | 송테드 | 서울 송파구 방이1동 165-3 | <NA> | 02-417-4150 | ww.kbec.co.kr | <NA> | <NA> | <NA> |
5 | CP00000890 | 중소기업 | 동원이앤텍㈜ | 128-81-85378 | 환경시설 | 기타 | 최광진 | 경기 고양시 일산동구 백석동 | <NA> | 031-906-3223 | <NA> | <NA> | <NA> | <NA> |
6 | CP00000896 | 중소기업 | ㈜장형기업 | 136-81-13652 | <NA> | <NA> | <NA> | 인천광역시 서구 오류동 410-472 | <NA> | 032-562-1658 | <NA> | <NA> | <NA> | <NA> |
7 | CP00000897 | 중소기업 | ㈜경진엔지니어링 | 122-81-84819 | 상하수도공사 | 전문건설업 | 최철현 | 인천 계양구 서운동 148-84 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8 | CP00000899 | 대학 | 경남과학기술대학교 산학협력단 | 613-82-09900 | <NA> | <NA> | <NA> | 경남 진주시 칠암동 150 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | CP00000900 | 중소기업 | 청정환경설비 | 137-02-76975 | <NA> | <NA> | <NA> | 경기도 이천시 모가면 소고리 96-14 | <NA> | 031-574-6305 | <NA> | <NA> | <NA> | <NA> |
회사번호 | 회원구분 | (회사)명 | 사업자등록번호 | (회사)업종 | (회사)업태 | 회사 대표자 | (회사)주소1 | (회사)주소2 | (회사)전화 | (회사)홈페이지 | (회사)기준년도 | 기업명 영문 | 회사 사업 분야 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1273 | CP00003345 | 중소기업 | 케이알컨소시엄주식회사 | 114-86-67395 | 환경컨설팅및엔지니어링, 기타무역업, 환경민에너지연구개발, 경영컨설팅, 환경정화및복원사업 | <NA> | 이영민 | 서울특별시 서초구 서초대로 46 (방배동) | (방배동, 극동빌딩) | <NA> | <NA> | <NA> | KR Consortium Co., Ltd. | <NA> |
1274 | CP00003347 | 중소기업 | (주)대산엘이디전기조명 | 514-81-95612 | 엘이디, 경관조명장치, 조명기구및 제어장치, 철제, 스텐리스가로등주, 전기공사 | <NA> | 이종규 | 대구광역시 북구 유통단지로 103 (산격동) | 건축자재관 1층 18호 | <NA> | <NA> | <NA> | <NA> | <NA> |
1275 | CP00003350 | 중소기업 | 케이퓨전테크놀로지 | 332-87-00795 | 플라즈마 발생장치, 수소 발생장치 | <NA> | 곽헌길 | 경기도 안산시 상록구 한양대학로 55 (사동) | 한양대 에리카 창업보육센터 213호 | 031-400-3815 | <NA> | <NA> | K-fusion Technology, inc | <NA> |
1276 | CP00003352 | 중견기업 | 성신양회 주식회사 | 101-81-18194 | 제조업 | <NA> | 김상규 | 서울특별시 종로구 인사동5길 29 (인사동) | 7층 | 02-3782-7000 | http://www.sungshincement.co.kr/ | <NA> | <NA> | <NA> |
1277 | CP00003353 | 중소기업 | (주)삼진야드 | 105-81-77837 | 냉동탑, 특장차, 항만장비 | <NA> | 신성수 | 부산광역시 강서구 미음산단6로 56 (미음동) | (미음동) | 051-831-7525 | <NA> | <NA> | samjinyard.co.LTD | <NA> |
1278 | CP00003354 | 중소기업 | 이앤켐솔루션 | 206-86-19800 | 흡착제 제조, 연구개발업 | <NA> | 김신동 | 경기도 포천시 군내면 용정경제로1길 94-38 | 이앤켐솔루션 | <NA> | <NA> | <NA> | E & Chem Solution Corp. | <NA> |
1279 | CP00003348 | 중소기업 | 주식회사 시원 | 446-86-01418 | 수전금구,욕실부자재 | <NA> | 이시원 | 경기도 김포시 하성면 월하로705번길 54-3 | 주식회사 시원 | 031-982-5227 | <NA> | <NA> | <NA> | <NA> |
1280 | CP00003349 | 중소기업 | 한국환경시스템주식회사 | 814-81-00000 | 제조, 도소매 | <NA> | 박재갑 | 경기도 고양시 일산서구 구산로69번길 23-17 (구산동) | 한국환경시스템(주) | 031-912-0815 | http;//www.uhdkes.co.kr | <NA> | Korea Environmental System Co., Ltd. | <NA> |
1281 | CP00003346 | 중소기업 | (주)호생환경 | 606-81-17346 | 비금속광물제품제조업 | <NA> | 황 준 | 부산광역시 사상구 낙동대로 665 (엄궁동) | <NA> | 051-327-1333 | <NA> | <NA> | <NA> | <NA> |
1282 | CP00003351 | 공공기관 | 재단법인 철원플라즈마 산업기술연구원 | 127-82-15110 | 플라즈마 연구및개발업, 플라즈마 응용제품 등 | <NA> | 이현종 | 강원도 철원군 갈말읍 호국로 4620 | 철원플라즈마산업기술연구원 | 033-452-9709 | http://www.cpri.re.kr/ | <NA> | Cheorwon Plasma Research Institute | <NA> |