Overview

Dataset statistics

Number of variables: 14
Number of observations: 1283
Missing cells: 6432
Missing cells (%): 35.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 141.7 KiB
Average record size in memory: 113.1 B
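
The derived figures in this block can be rechecked from the raw counts. The sketch below is only a consistency check on the numbers printed above; it assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes. The variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 14
n_observations = 1283
missing_cells = 6432
total_size_kib = 141.7

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both derived values agree with the report to one decimal place.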

Variable types

Text: 11
Categorical: 2
Numeric: 1

Dataset

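Description: Information on member companies that applied for New Environmental Technology certification and verification (as of 2020-10-26; includes member type, company industry, business type, address, homepage, business field, etc.)
Author: Korea Environmental Industry and Technology Institute (한국환경산업기술원)
URL: https://www.data.go.kr/data/15071519/fileData.do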

Alerts

(회사)기준년도 is highly overall correlated with 회원구분 and 1 other field [High correlation]
회원구분 is highly overall correlated with (회사)기준년도 [High correlation]
(회사)업태 is highly overall correlated with (회사)기준년도 [High correlation]
회원구분 is highly imbalanced (64.4%) [Imbalance]
사업자등록번호 has 137 (10.7%) missing values [Missing]
(회사)업종 has 353 (27.5%) missing values [Missing]
회사 대표자 has 118 (9.2%) missing values [Missing]
(회사)주소1 has 47 (3.7%) missing values [Missing]
(회사)주소2 has 1046 (81.5%) missing values [Missing]
(회사)전화 has 339 (26.4%) missing values [Missing]
(회사)홈페이지 has 793 (61.8%) missing values [Missing]
(회사)기준년도 has 1271 (99.1%) missing values [Missing]
기업명 영문 has 1172 (91.3%) missing values [Missing]
회사 사업 분야 has 1156 (90.1%) missing values [Missing]
회사번호 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 00:03:31.437300
Analysis finished: 2023-12-12 00:03:33.086925
Duration: 1.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

회사번호
Text

UNIQUE 

Distinct: 1283
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 10.2 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 12830
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
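
As a rough illustration of the character properties mentioned here, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses the five 회사번호 sample values shown in this report as input; the helper name `category_counts` is made up for the example, and this is not necessarily how ydata-profiling computes its figures. The standard library exposes general categories (such as `Lu` for Uppercase Letter and `Nd` for Decimal Number) but not scripts or blocks, so only categories are covered.

```python
import unicodedata
from collections import Counter

# Sample values copied from the report's "Sample" rows for 회사번호.
samples = ["CP00000875", "CP00000876", "CP00000877", "CP00000878", "CP00000884"]

def category_counts(values):
    """Count Unicode general categories (e.g. 'Lu', 'Nd') over all characters."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts

counts = category_counts(samples)
total = sum(counts.values())
for category, n in counts.most_common():
    print(category, n, f"{100 * n / total:.1f}%")
```

Each sample has two uppercase letters and eight digits, which is the same 20% / 80% split the report gives for the full column.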

Unique

Unique: 1283
Unique (%): 100.0%

Sample

1st row: CP00000875
2nd row: CP00000876
3rd row: CP00000877
4th row: CP00000878
5th row: CP00000884

Value                Count  Frequency (%)
cp00000875           1      0.1%
cp00000347           1      0.1%
cp00000039           1      0.1%
cp00000038           1      0.1%
cp00000037           1      0.1%
cp00000036           1      0.1%
cp00000034           1      0.1%
cp00000033           1      0.1%
cp00000032           1      0.1%
cp00000040           1      0.1%
Other values (1273)  1273   99.2%

Most occurring characters

Value             Count  Frequency (%)
0                 6038   47.1%
C                 1283   10.0%
P                 1283   10.0%
1                 784    6.1%
3                 684    5.3%
2                 602    4.7%
7                 381    3.0%
6                 376    2.9%
4                 372    2.9%
8                 360    2.8%
Other values (2)  667    5.2%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    10264  80.0%
Uppercase Letter  2566   20.0%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      6038   58.8%
1      784    7.6%
3      684    6.7%
2      602    5.9%
7      381    3.7%
6      376    3.7%
4      372    3.6%
8      360    3.5%
5      359    3.5%
9      308    3.0%

Uppercase Letter
Value  Count  Frequency (%)
C      1283   50.0%
P      1283   50.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  10264  80.0%
Latin   2566   20.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      6038   58.8%
1      784    7.6%
3      684    6.7%
2      602    5.9%
7      381    3.7%
6      376    3.7%
4      372    3.6%
8      360    3.5%
5      359    3.5%
9      308    3.0%

Latin
Value  Count  Frequency (%)
C      1283   50.0%
P      1283   50.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  12830  100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 6038   47.1%
C                 1283   10.0%
P                 1283   10.0%
1                 784    6.1%
3                 684    5.3%
2                 602    4.7%
7                 381    3.0%
6                 376    2.9%
4                 372    2.9%
8                 360    2.8%
Other values (2)  667    5.2%

회원구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 13
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 10.2 KiB

Length

Max length: 7
Median length: 4
Mean length: 3.7911146
Min length: 2

Unique

Unique: 2
Unique (%): 0.2%

Sample

1st row: 중소기업
2nd row: 중소기업
3rd row: 중소기업
4th row: 중소기업
5th row: 중소기업

Common Values

Value             Count  Frequency (%)
중소기업          1015   79.1%
대기업            82     6.4%
벤쳐              62     4.8%
기타              37     2.9%
<NA>              29     2.3%
중견기업          19     1.5%
대학              16     1.2%
중소기업연구소    7      0.5%
출연연구기관      5      0.4%
공공기관          5      0.4%
Other values (3)  6      0.5%
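
The 64.4% imbalance reported for 회원구분 can be reproduced from these counts with one minus the normalized Shannon entropy. The sketch below is a hedged reconstruction: that ydata-profiling uses exactly this formula is an assumption, and the function name `imbalance_score` is illustrative. The three categories folded into "Other values (3)" are taken to have counts 4, 1 and 1, which is the split implied by their total of 6 together with the reported "Unique: 2".

```python
import math

# Category counts from the "Common Values" table for 회원구분.
# Assumption: the three categories under "Other values (3)" have counts
# 4, 1 and 1 (total 6, two of them occurring exactly once).
counts = [1015, 82, 62, 37, 29, 19, 16, 7, 5, 5, 4, 1, 1]

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

score = imbalance_score(counts)
print(f"imbalance: {100 * score:.1f}%")
```

A score of 0 would mean all 13 categories are equally frequent and a score of 1 would mean a single category holds every row, so a value near 0.64 reflects the dominance of 중소기업 (79.1% of rows).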

Length

Histogram of lengths of the category

Value             Count  Frequency (%)
중소기업          1015   79.1%
대기업            82     6.4%
벤쳐              62     4.8%
기타              37     2.9%
na                29     2.3%
중견기업          19     1.5%
대학              16     1.2%
중소기업연구소    7      0.5%
출연연구기관      5      0.4%
공공기관          5      0.4%
Other values (3)  6      0.5%
Distinct: 1259
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.2 KiB

Length

Max length: 19
Median length: 17
Mean length: 7.5058457
Min length: 3

Characters and Unicode

Total characters: 9630
Distinct characters: 424
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4

Unique

Unique: 1236
Unique (%): 96.3%

Sample

1st row: 신잔토개발㈜
2nd row: (주)진흥중공업
3rd row: 도솔환경산업㈜
4th row: 삼원환경산업(주)
5th row: ㈜케이벡코리아
ValueCountFrequency (%)
주식회사 103
 
7.2%
산학협력단 12
 
0.8%
6
 
0.4%
주)포스코엔지니어링 3
 
0.2%
유한회사 3
 
0.2%
서울특별시 2
 
0.1%
서울대학교 2
 
0.1%
그린환경 2
 
0.1%
㈜부강테크 2
 
0.1%
이에스지케이 2
 
0.1%
Other values (1263) 1284
90.4%

Most occurring characters

ValueCountFrequency (%)
741
 
7.7%
) 610
 
6.3%
( 609
 
6.3%
392
 
4.1%
306
 
3.2%
216
 
2.2%
195
 
2.0%
192
 
2.0%
180
 
1.9%
177
 
1.8%
Other values (414) 6012
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7767
80.7%
Close Punctuation 610
 
6.3%
Open Punctuation 609
 
6.3%
Other Symbol 392
 
4.1%
Space Separator 138
 
1.4%
Uppercase Letter 93
 
1.0%
Decimal Number 10
 
0.1%
Other Punctuation 5
 
0.1%
Lowercase Letter 5
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
741
 
9.5%
306
 
3.9%
216
 
2.8%
195
 
2.5%
192
 
2.5%
180
 
2.3%
177
 
2.3%
162
 
2.1%
159
 
2.0%
154
 
2.0%
Other values (379) 5285
68.0%
Uppercase Letter
ValueCountFrequency (%)
E 17
18.3%
G 14
15.1%
S 11
11.8%
T 11
11.8%
N 8
8.6%
I 7
7.5%
C 6
 
6.5%
K 3
 
3.2%
V 3
 
3.2%
A 2
 
2.2%
Other values (9) 11
11.8%
Lowercase Letter
ValueCountFrequency (%)
x 1
20.0%
i 1
20.0%
n 1
20.0%
o 1
20.0%
e 1
20.0%
Decimal Number
ValueCountFrequency (%)
1 5
50.0%
2 2
 
20.0%
8 2
 
20.0%
4 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
& 3
60.0%
. 2
40.0%
Close Punctuation
ValueCountFrequency (%)
) 610
100.0%
Open Punctuation
ValueCountFrequency (%)
( 609
100.0%
Other Symbol
ValueCountFrequency (%)
392
100.0%
Space Separator
ValueCountFrequency (%)
138
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8157
84.7%
Common 1373
 
14.3%
Latin 98
 
1.0%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
741
 
9.1%
392
 
4.8%
306
 
3.8%
216
 
2.6%
195
 
2.4%
192
 
2.4%
180
 
2.2%
177
 
2.2%
162
 
2.0%
159
 
1.9%
Other values (378) 5437
66.7%
Latin
ValueCountFrequency (%)
E 17
17.3%
G 14
14.3%
S 11
11.2%
T 11
11.2%
N 8
8.2%
I 7
7.1%
C 6
 
6.1%
K 3
 
3.1%
V 3
 
3.1%
A 2
 
2.0%
Other values (14) 16
16.3%
Common
ValueCountFrequency (%)
) 610
44.4%
( 609
44.4%
138
 
10.1%
1 5
 
0.4%
& 3
 
0.2%
. 2
 
0.1%
2 2
 
0.1%
8 2
 
0.1%
4 1
 
0.1%
_ 1
 
0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7765
80.6%
ASCII 1471
 
15.3%
None 392
 
4.1%
CJK 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
741
 
9.5%
306
 
3.9%
216
 
2.8%
195
 
2.5%
192
 
2.5%
180
 
2.3%
177
 
2.3%
162
 
2.1%
159
 
2.0%
154
 
2.0%
Other values (377) 5283
68.0%
ASCII
ValueCountFrequency (%)
) 610
41.5%
( 609
41.4%
138
 
9.4%
E 17
 
1.2%
G 14
 
1.0%
S 11
 
0.7%
T 11
 
0.7%
N 8
 
0.5%
I 7
 
0.5%
C 6
 
0.4%
Other values (24) 40
 
2.7%
None
ValueCountFrequency (%)
392
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

사업자등록번호
Text

MISSING 

Distinct: 1098
Distinct (%): 95.8%
Missing: 137
Missing (%): 10.7%
Memory size: 10.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 13752
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 1054
Unique (%): 92.0%
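
"Distinct" and "Unique" measure different things in this report: the figures are consistent with Distinct counting the different values present and Unique counting the values that occur exactly once, both as percentages of non-missing rows. The sketch below illustrates that reading on a toy column and rechecks the two percentages for 사업자등록번호. The toy rows and the helper `distinct_and_unique` are hypothetical, and the interpretation is inferred from the numbers rather than taken from the ydata-profiling source.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique, present) over the non-missing values.

    distinct: number of different values
    unique:   number of values that occur exactly once
    present:  number of non-missing entries
    """
    present = [v for v in values if v is not None]
    freq = Counter(present)
    distinct = len(freq)
    unique = sum(1 for n in freq.values() if n == 1)
    return distinct, unique, len(present)

# Toy column (hypothetical rows, not the real dataset): one repeated number,
# two numbers seen once, and one missing entry.
toy = ["123-81-62292", "123-81-62292", "124-81-69018", "313-81-01199", None]
distinct, unique, present = distinct_and_unique(toy)
print(distinct, unique, present)

# Percentages reported for 사업자등록번호, recomputed against non-missing rows.
non_missing = 1283 - 137
distinct_pct = 100 * 1098 / non_missing
unique_pct = 100 * 1054 / non_missing
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```

The gap between the two counts (1098 distinct against 1054 unique) means 44 registration numbers appear more than once.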

Sample

1st row: 124-81-69018
2nd row: 313-81-01199
3rd row: 215-86-53364
4th row: 128-81-85378
5th row: 136-81-13652

Value                Count  Frequency (%)
123-81-62292         3      0.3%
107-82-14534         3      0.3%
312-81-34493         3      0.3%
136-81-13652         3      0.3%
137-85-01837         2      0.2%
134-81-80567         2      0.2%
318-81-01331         2      0.2%
116-81-25566         2      0.2%
111-11-11111         2      0.2%
120-81-46916         2      0.2%
Other values (1088)  1122   97.9%

Most occurring characters

Value  Count  Frequency (%)
1      2462   17.9%
-      2292   16.7%
8      1766   12.8%
0      1253   9.1%
2      1233   9.0%
3      946    6.9%
6      895    6.5%
4      881    6.4%
5      754    5.5%
7      679    4.9%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    11460  83.3%
Dash Punctuation  2292   16.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
1      2462   21.5%
8      1766   15.4%
0      1253   10.9%
2      1233   10.8%
3      946    8.3%
6      895    7.8%
4      881    7.7%
5      754    6.6%
7      679    5.9%
9      591    5.2%

Dash Punctuation
Value  Count  Frequency (%)
-      2292   100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13752  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
1      2462   17.9%
-      2292   16.7%
8      1766   12.8%
0      1253   9.1%
2      1233   9.0%
3      946    6.9%
6      895    6.5%
4      881    6.4%
5      754    5.5%
7      679    4.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  13752  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
1      2462   17.9%
-      2292   16.7%
8      1766   12.8%
0      1253   9.1%
2      1233   9.0%
3      946    6.9%
6      895    6.5%
4      881    6.4%
5      754    5.5%
7      679    4.9%

(회사)업종
Text

MISSING 

Distinct: 566
Distinct (%): 60.9%
Missing: 353
Missing (%): 27.5%
Memory size: 10.2 KiB

Length

Max length: 81
Median length: 66
Mean length: 10.680645
Min length: 2

Characters and Unicode

Total characters: 9933
Distinct characters: 328
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3

Unique

Unique: 511
Unique (%): 54.9%

Sample

1st row: 폐기물수집 및 처리업
2nd row: 서비스
3rd row: 환경.토목엔지니어링
4th row: 환경시설
5th row: 상하수도공사
ValueCountFrequency (%)
제조업 108
 
6.7%
제조 81
 
5.0%
건설업 67
 
4.1%
건설폐기물중간처리업 56
 
3.5%
서비스 55
 
3.4%
53
 
3.3%
47
 
2.9%
건설 41
 
2.5%
건설폐기물 30
 
1.9%
중간처리업 19
 
1.2%
Other values (740) 1059
65.5%

Most occurring characters

ValueCountFrequency (%)
688
 
6.9%
, 616
 
6.2%
544
 
5.5%
491
 
4.9%
391
 
3.9%
359
 
3.6%
359
 
3.6%
331
 
3.3%
229
 
2.3%
224
 
2.3%
Other values (318) 5701
57.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8469
85.3%
Space Separator 688
 
6.9%
Other Punctuation 657
 
6.6%
Uppercase Letter 52
 
0.5%
Open Punctuation 30
 
0.3%
Close Punctuation 30
 
0.3%
Decimal Number 4
 
< 0.1%
Dash Punctuation 2
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
544
 
6.4%
491
 
5.8%
391
 
4.6%
359
 
4.2%
359
 
4.2%
331
 
3.9%
229
 
2.7%
224
 
2.6%
202
 
2.4%
177
 
2.1%
Other values (297) 5162
61.0%
Uppercase Letter
ValueCountFrequency (%)
C 15
28.8%
T 9
17.3%
V 8
15.4%
E 6
 
11.5%
P 3
 
5.8%
W 2
 
3.8%
H 2
 
3.8%
N 2
 
3.8%
G 2
 
3.8%
S 2
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 616
93.8%
/ 22
 
3.3%
. 19
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 3
75.0%
2 1
 
25.0%
Space Separator
ValueCountFrequency (%)
688
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
w 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8468
85.3%
Common 1411
 
14.2%
Latin 53
 
0.5%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
544
 
6.4%
491
 
5.8%
391
 
4.6%
359
 
4.2%
359
 
4.2%
331
 
3.9%
229
 
2.7%
224
 
2.6%
202
 
2.4%
177
 
2.1%
Other values (296) 5161
60.9%
Latin
ValueCountFrequency (%)
C 15
28.3%
T 9
17.0%
V 8
15.1%
E 6
 
11.3%
P 3
 
5.7%
W 2
 
3.8%
H 2
 
3.8%
N 2
 
3.8%
G 2
 
3.8%
S 2
 
3.8%
Other values (2) 2
 
3.8%
Common
ValueCountFrequency (%)
688
48.8%
, 616
43.7%
( 30
 
2.1%
) 30
 
2.1%
/ 22
 
1.6%
. 19
 
1.3%
1 3
 
0.2%
- 2
 
0.1%
2 1
 
0.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8468
85.3%
ASCII 1464
 
14.7%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
688
47.0%
, 616
42.1%
( 30
 
2.0%
) 30
 
2.0%
/ 22
 
1.5%
. 19
 
1.3%
C 15
 
1.0%
T 9
 
0.6%
V 8
 
0.5%
E 6
 
0.4%
Other values (11) 21
 
1.4%
Hangul
ValueCountFrequency (%)
544
 
6.4%
491
 
5.8%
391
 
4.6%
359
 
4.2%
359
 
4.2%
331
 
3.9%
229
 
2.7%
224
 
2.6%
202
 
2.4%
177
 
2.1%
Other values (296) 5161
60.9%
CJK
ValueCountFrequency (%)
1
100.0%

(회사)업태
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 10.2 KiB

Length

Max length: 6
Median length: 5
Mean length: 3.664848
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: 기타
3rd row: <NA>
4th row: 전문건설업
5th row: 기타

Common Values

Value         Count  Frequency (%)
<NA>          579    45.1%
기타          341    26.6%
일반건설업    137    10.7%
전문건설업    132    10.3%
기술용역      50     3.9%
연구소        23     1.8%
정부투자기관  12     0.9%
개인          9      0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value         Count  Frequency (%)
na            579    45.1%
기타          341    26.6%
일반건설업    137    10.7%
전문건설업    132    10.3%
기술용역      50     3.9%
연구소        23     1.8%
정부투자기관  12     0.9%
개인          9      0.7%

회사 대표자
Text

MISSING 

Distinct: 1100
Distinct (%): 94.4%
Missing: 118
Missing (%): 9.2%
Memory size: 10.2 KiB

Length

Max length: 14
Median length: 3
Mean length: 3.2403433
Min length: 1

Characters and Unicode

Total characters: 3775
Distinct characters: 225
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 1039
Unique (%): 89.2%

Sample

1st row: 박찬양
2nd row: 남궁훈
3rd row: 송테드
4th row: 최광진
5th row: 최철현
ValueCountFrequency (%)
5
 
0.4%
신영균 3
 
0.2%
조성광 3
 
0.2%
정일호 3
 
0.2%
박용기 3
 
0.2%
최형기 2
 
0.2%
권오현 2
 
0.2%
정창화 2
 
0.2%
안성국 2
 
0.2%
이용현 2
 
0.2%
Other values (1130) 1190
97.8%

Most occurring characters

ValueCountFrequency (%)
233
 
6.2%
183
 
4.8%
108
 
2.9%
105
 
2.8%
103
 
2.7%
77
 
2.0%
68
 
1.8%
64
 
1.7%
63
 
1.7%
58
 
1.5%
Other values (215) 2713
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3652
96.7%
Space Separator 58
 
1.5%
Other Punctuation 55
 
1.5%
Uppercase Letter 9
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
233
 
6.4%
183
 
5.0%
108
 
3.0%
105
 
2.9%
103
 
2.8%
77
 
2.1%
68
 
1.9%
64
 
1.8%
63
 
1.7%
58
 
1.6%
Other values (204) 2590
70.9%
Uppercase Letter
ValueCountFrequency (%)
L 2
22.2%
D 2
22.2%
G 1
11.1%
I 1
11.1%
E 1
11.1%
N 1
11.1%
A 1
11.1%
Other Punctuation
ValueCountFrequency (%)
, 53
96.4%
/ 2
 
3.6%
Space Separator
ValueCountFrequency (%)
58
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3652
96.7%
Common 114
 
3.0%
Latin 9
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
233
 
6.4%
183
 
5.0%
108
 
3.0%
105
 
2.9%
103
 
2.8%
77
 
2.1%
68
 
1.9%
64
 
1.8%
63
 
1.7%
58
 
1.6%
Other values (204) 2590
70.9%
Latin
ValueCountFrequency (%)
L 2
22.2%
D 2
22.2%
G 1
11.1%
I 1
11.1%
E 1
11.1%
N 1
11.1%
A 1
11.1%
Common
ValueCountFrequency (%)
58
50.9%
, 53
46.5%
/ 2
 
1.8%
- 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3652
96.7%
ASCII 123
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
233
 
6.4%
183
 
5.0%
108
 
3.0%
105
 
2.9%
103
 
2.8%
77
 
2.1%
68
 
1.9%
64
 
1.8%
63
 
1.7%
58
 
1.6%
Other values (204) 2590
70.9%
ASCII
ValueCountFrequency (%)
58
47.2%
, 53
43.1%
L 2
 
1.6%
D 2
 
1.6%
/ 2
 
1.6%
G 1
 
0.8%
I 1
 
0.8%
E 1
 
0.8%
N 1
 
0.8%
A 1
 
0.8%

(회사)주소1
Text

MISSING 

Distinct: 1207
Distinct (%): 97.7%
Missing: 47
Missing (%): 3.7%
Memory size: 10.2 KiB

Length

Max length: 49
Median length: 41
Mean length: 22.231392
Min length: 9

Characters and Unicode

Total characters: 27478
Distinct characters: 469
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 5

Unique

Unique: 1182
Unique (%): 95.6%

Sample

1st row: 경기도 연천군 전곡읍 늘목리 61-4
2nd row: 경기도 화성시 정문송산로93번길 10-27
3rd row: 충남 천안시 직산면 자은가리 82-2
4th row: 충청남도 보령시 남포면 평촌밤섬길 218-191
5th row: 서울 송파구 방이1동 165-3
ValueCountFrequency (%)
경기도 275
 
4.4%
서울 126
 
2.0%
경기 109
 
1.8%
서울특별시 108
 
1.7%
서울시 49
 
0.8%
화성시 43
 
0.7%
충남 41
 
0.7%
성남시 40
 
0.6%
안양시 39
 
0.6%
경남 38
 
0.6%
Other values (2945) 5327
86.0%

Most occurring characters

ValueCountFrequency (%)
5004
 
18.2%
1 1067
 
3.9%
968
 
3.5%
787
 
2.9%
2 772
 
2.8%
765
 
2.8%
3 613
 
2.2%
585
 
2.1%
- 573
 
2.1%
545
 
2.0%
Other values (459) 15799
57.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16295
59.3%
Decimal Number 5194
 
18.9%
Space Separator 5004
 
18.2%
Dash Punctuation 573
 
2.1%
Uppercase Letter 121
 
0.4%
Close Punctuation 108
 
0.4%
Open Punctuation 108
 
0.4%
Other Punctuation 34
 
0.1%
Lowercase Letter 24
 
0.1%
Math Symbol 10
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
968
 
5.9%
787
 
4.8%
765
 
4.7%
585
 
3.6%
545
 
3.3%
525
 
3.2%
458
 
2.8%
437
 
2.7%
371
 
2.3%
351
 
2.2%
Other values (405) 10503
64.5%
Uppercase Letter
ValueCountFrequency (%)
B 20
16.5%
T 17
14.0%
S 11
9.1%
K 10
8.3%
A 10
8.3%
I 9
 
7.4%
C 7
 
5.8%
E 5
 
4.1%
D 5
 
4.1%
L 4
 
3.3%
Other values (11) 23
19.0%
Lowercase Letter
ValueCountFrequency (%)
e 4
16.7%
w 3
12.5%
n 3
12.5%
o 3
12.5%
r 2
8.3%
s 2
8.3%
i 2
8.3%
h 1
 
4.2%
c 1
 
4.2%
k 1
 
4.2%
Other values (2) 2
8.3%
Decimal Number
ValueCountFrequency (%)
1 1067
20.5%
2 772
14.9%
3 613
11.8%
5 474
9.1%
0 465
9.0%
4 398
 
7.7%
6 397
 
7.6%
8 353
 
6.8%
7 341
 
6.6%
9 314
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 29
85.3%
/ 4
 
11.8%
1
 
2.9%
Other Symbol
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
5004
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 573
100.0%
Close Punctuation
ValueCountFrequency (%)
) 108
100.0%
Open Punctuation
ValueCountFrequency (%)
( 108
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16300
59.3%
Common 11032
40.1%
Latin 146
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
968
 
5.9%
787
 
4.8%
765
 
4.7%
585
 
3.6%
545
 
3.3%
525
 
3.2%
458
 
2.8%
437
 
2.7%
371
 
2.3%
351
 
2.2%
Other values (406) 10508
64.5%
Latin
ValueCountFrequency (%)
B 20
13.7%
T 17
 
11.6%
S 11
 
7.5%
K 10
 
6.8%
A 10
 
6.8%
I 9
 
6.2%
C 7
 
4.8%
E 5
 
3.4%
D 5
 
3.4%
L 4
 
2.7%
Other values (24) 48
32.9%
Common
ValueCountFrequency (%)
5004
45.4%
1 1067
 
9.7%
2 772
 
7.0%
3 613
 
5.6%
- 573
 
5.2%
5 474
 
4.3%
0 465
 
4.2%
4 398
 
3.6%
6 397
 
3.6%
8 353
 
3.2%
Other values (9) 916
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16295
59.3%
ASCII 11175
40.7%
None 6
 
< 0.1%
Number Forms 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5004
44.8%
1 1067
 
9.5%
2 772
 
6.9%
3 613
 
5.5%
- 573
 
5.1%
5 474
 
4.2%
0 465
 
4.2%
4 398
 
3.6%
6 397
 
3.6%
8 353
 
3.2%
Other values (40) 1059
 
9.5%
Hangul
ValueCountFrequency (%)
968
 
5.9%
787
 
4.8%
765
 
4.7%
585
 
3.6%
545
 
3.3%
525
 
3.2%
458
 
2.8%
437
 
2.7%
371
 
2.3%
351
 
2.2%
Other values (405) 10503
64.5%
None
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Number Forms
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

(회사)주소2
Text

MISSING 

Distinct: 216
Distinct (%): 91.1%
Missing: 1046
Missing (%): 81.5%
Memory size: 10.2 KiB

Length

Max length: 26
Median length: 20
Mean length: 8.9704641
Min length: 1

Characters and Unicode

Total characters: 2126
Distinct characters: 277
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 205
Unique (%): 86.5%

Sample

1st row: 2401호(영덕동, 유-타워)
2nd row: 1110호
3rd row: 1007호
4th row: 시티플러스 702
5th row: 12층(퍼스트타워)
ValueCountFrequency (%)
2층 13
 
3.5%
3층 10
 
2.7%
4층 7
 
1.9%
6층 6
 
1.6%
1층 6
 
1.6%
a동 5
 
1.4%
405호 4
 
1.1%
202호 4
 
1.1%
3
 
0.8%
b동 3
 
0.8%
Other values (292) 309
83.5%

Most occurring characters

ValueCountFrequency (%)
134
 
6.3%
1 122
 
5.7%
0 111
 
5.2%
105
 
4.9%
2 87
 
4.1%
87
 
4.1%
( 79
 
3.7%
) 79
 
3.7%
64
 
3.0%
3 44
 
2.1%
Other values (267) 1214
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1213
57.1%
Decimal Number 519
24.4%
Space Separator 134
 
6.3%
Open Punctuation 79
 
3.7%
Close Punctuation 79
 
3.7%
Uppercase Letter 37
 
1.7%
Other Punctuation 33
 
1.6%
Dash Punctuation 23
 
1.1%
Lowercase Letter 6
 
0.3%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
8.7%
87
 
7.2%
64
 
5.3%
32
 
2.6%
27
 
2.2%
27
 
2.2%
26
 
2.1%
25
 
2.1%
21
 
1.7%
19
 
1.6%
Other values (226) 780
64.3%
Uppercase Letter
ValueCountFrequency (%)
A 11
29.7%
B 8
21.6%
T 4
 
10.8%
R 2
 
5.4%
I 2
 
5.4%
N 1
 
2.7%
C 1
 
2.7%
X 1
 
2.7%
J 1
 
2.7%
D 1
 
2.7%
Other values (5) 5
13.5%
Decimal Number
ValueCountFrequency (%)
1 122
23.5%
0 111
21.4%
2 87
16.8%
3 44
 
8.5%
6 39
 
7.5%
4 37
 
7.1%
5 30
 
5.8%
7 23
 
4.4%
8 19
 
3.7%
9 7
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
w 1
16.7%
e 1
16.7%
b 1
16.7%
a 1
16.7%
m 1
16.7%
p 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 30
90.9%
. 1
 
3.0%
& 1
 
3.0%
; 1
 
3.0%
Space Separator
ValueCountFrequency (%)
134
100.0%
Open Punctuation
ValueCountFrequency (%)
( 79
100.0%
Close Punctuation
ValueCountFrequency (%)
) 79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1214
57.1%
Common 869
40.9%
Latin 43
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
8.6%
87
 
7.2%
64
 
5.3%
32
 
2.6%
27
 
2.2%
27
 
2.2%
26
 
2.1%
25
 
2.1%
21
 
1.7%
19
 
1.6%
Other values (227) 781
64.3%
Latin
ValueCountFrequency (%)
A 11
25.6%
B 8
18.6%
T 4
 
9.3%
R 2
 
4.7%
I 2
 
4.7%
w 1
 
2.3%
e 1
 
2.3%
N 1
 
2.3%
C 1
 
2.3%
b 1
 
2.3%
Other values (11) 11
25.6%
Common
ValueCountFrequency (%)
134
15.4%
1 122
14.0%
0 111
12.8%
2 87
10.0%
( 79
9.1%
) 79
9.1%
3 44
 
5.1%
6 39
 
4.5%
4 37
 
4.3%
5 30
 
3.5%
Other values (9) 107
12.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1213
57.1%
ASCII 912
42.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
134
14.7%
1 122
13.4%
0 111
12.2%
2 87
9.5%
( 79
8.7%
) 79
8.7%
3 44
 
4.8%
6 39
 
4.3%
4 37
 
4.1%
5 30
 
3.3%
Other values (30) 150
16.4%
Hangul
ValueCountFrequency (%)
105
 
8.7%
87
 
7.2%
64
 
5.3%
32
 
2.6%
27
 
2.2%
27
 
2.2%
26
 
2.1%
25
 
2.1%
21
 
1.7%
19
 
1.6%
Other values (226) 780
64.3%
None
ValueCountFrequency (%)
1
100.0%

(회사)전화
Text

MISSING 

Distinct: 904
Distinct (%): 95.8%
Missing: 339
Missing (%): 26.4%
Memory size: 10.2 KiB

Length

Max length: 14
Median length: 12
Mean length: 11.907839
Min length: 11

Characters and Unicode

Total characters: 11241
Distinct characters: 13
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 867
Unique (%): 91.8%

Sample

1st row: 031-832-0011
2nd row: 041-584-7007
3rd row: 041-931-1425
4th row: 02-417-4150
5th row: 031-906-3223

Value               Count  Frequency (%)
032-562-1658        3      0.3%
043-855-7901        3      0.3%
053-526-4377        3      0.3%
031-495-0574        2      0.2%
041-357-5100        2      0.2%
02-2008-9841        2      0.2%
055-932-9200        2      0.2%
062-383-6040        2      0.2%
031-382-7907        2      0.2%
02-745-2111         2      0.2%
Other values (894)  921    97.6%

Most occurring characters

Value             Count  Frequency (%)
-                 1887   16.8%
0                 1768   15.7%
3                 1187   10.6%
2                 1132   10.1%
1                 1025   9.1%
5                 958    8.5%
4                 766    6.8%
6                 732    6.5%
7                 729    6.5%
8                 642    5.7%
Other values (3)  415    3.7%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     9351   83.2%
Dash Punctuation   1887   16.8%
Math Symbol        2      < 0.1%
Close Punctuation  1      < 0.1%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      1768   18.9%
3      1187   12.7%
2      1132   12.1%
1      1025   11.0%
5      958    10.2%
4      766    8.2%
6      732    7.8%
7      729    7.8%
8      642    6.9%
9      412    4.4%

Dash Punctuation
Value  Count  Frequency (%)
-      1887   100.0%

Math Symbol
Value  Count  Frequency (%)
~      2      100.0%

Close Punctuation
Value  Count  Frequency (%)
)      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  11241  100.0%

Most frequent character per script

Common
Value             Count  Frequency (%)
-                 1887   16.8%
0                 1768   15.7%
3                 1187   10.6%
2                 1132   10.1%
1                 1025   9.1%
5                 958    8.5%
4                 766    6.8%
6                 732    6.5%
7                 729    6.5%
8                 642    5.7%
Other values (3)  415    3.7%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  11241  100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
-                 1887   16.8%
0                 1768   15.7%
3                 1187   10.6%
2                 1132   10.1%
1                 1025   9.1%
5                 958    8.5%
4                 766    6.8%
6                 732    6.5%
7                 729    6.5%
8                 642    5.7%
Other values (3)  415    3.7%

(회사)홈페이지
Text

MISSING 

Distinct: 472
Distinct (%): 96.3%
Missing: 793
Missing (%): 61.8%
Memory size: 10.2 KiB

Length

Max length: 47
Median length: 36
Mean length: 17.777551
Min length: 1

Characters and Unicode

Total characters: 8711
Distinct characters: 85
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 457
Unique (%): 93.3%

Sample

1st row: ww.kbec.co.kr
2nd row: www.krsys.kr
3rd row: www.wellture.com
4th row: www.ilsong.co.kr
5th row: http://www.goldrecycle.co.kr/
ValueCountFrequency (%)
7
 
1.4%
www.forcebel.co.kr 3
 
0.6%
www.insun.com 2
 
0.4%
www.thewillsystem.com 2
 
0.4%
www.janghyung.co.kr 2
 
0.4%
www.lh.or.kr 2
 
0.4%
www.ktr.or.kr 2
 
0.4%
www.taeyoung.com 2
 
0.4%
www.hansoleme.com 2
 
0.4%
www.tscne.net 2
 
0.4%
Other values (461) 466
94.7%

Most occurring characters

ValueCountFrequency (%)
w 1330
15.3%
. 1191
13.7%
o 719
 
8.3%
c 632
 
7.3%
r 464
 
5.3%
e 461
 
5.3%
t 445
 
5.1%
k 406
 
4.7%
n 378
 
4.3%
/ 291
 
3.3%
Other values (75) 2394
27.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6908
79.3%
Other Punctuation 1616
 
18.6%
Decimal Number 93
 
1.1%
Other Letter 40
 
0.5%
Dash Punctuation 37
 
0.4%
Uppercase Letter 10
 
0.1%
Space Separator 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
1
 
2.5%
1
 
2.5%
1
 
2.5%
Other values (23) 23
57.5%
Lowercase Letter
ValueCountFrequency (%)
w 1330
19.3%
o 719
10.4%
c 632
9.1%
r 464
 
6.7%
e 461
 
6.7%
t 445
 
6.4%
k 406
 
5.9%
n 378
 
5.5%
h 272
 
3.9%
a 260
 
3.8%
Other values (16) 1541
22.3%
Decimal Number
ValueCountFrequency (%)
1 26
28.0%
2 22
23.7%
0 19
20.4%
8 7
 
7.5%
3 6
 
6.5%
7 5
 
5.4%
9 3
 
3.2%
4 3
 
3.2%
6 1
 
1.1%
5 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 1191
73.7%
/ 291
 
18.0%
: 118
 
7.3%
@ 8
 
0.5%
, 7
 
0.4%
; 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
W 6
60.0%
O 1
 
10.0%
M 1
 
10.0%
C 1
 
10.0%
D 1
 
10.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 6918
79.4%
Common 1753
 
20.1%
Hangul 40
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
1
 
2.5%
1
 
2.5%
1
 
2.5%
Other values (23) 23
57.5%
Latin
ValueCountFrequency (%)
w 1330
19.2%
o 719
10.4%
c 632
9.1%
r 464
 
6.7%
e 461
 
6.7%
t 445
 
6.4%
k 406
 
5.9%
n 378
 
5.5%
h 272
 
3.9%
a 260
 
3.8%
Other values (21) 1551
22.4%
Common
ValueCountFrequency (%)
. 1191
67.9%
/ 291
 
16.6%
: 118
 
6.7%
- 37
 
2.1%
1 26
 
1.5%
2 22
 
1.3%
0 19
 
1.1%
@ 8
 
0.5%
8 7
 
0.4%
, 7
 
0.4%
Other values (11) 27
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8671
99.5%
Hangul 40
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 1330
15.3%
. 1191
13.7%
o 719
 
8.3%
c 632
 
7.3%
r 464
 
5.4%
e 461
 
5.3%
t 445
 
5.1%
k 406
 
4.7%
n 378
 
4.4%
/ 291
 
3.4%
Other values (42) 2354
27.1%
Hangul
ValueCountFrequency (%)
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
1
 
2.5%
1
 
2.5%
1
 
2.5%
Other values (23) 23
57.5%

(회사)기준년도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 6
Distinct (%): 50.0%
Missing: 1271
Missing (%): 99.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2007.4167
Minimum: 1961
Maximum: 2016
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.4 KiB

Quantile statistics

Minimum: 1961
5-th percentile: 1981.9
Q1: 2004.25
Median: 2015
Q3: 2015
95-th percentile: 2016
Maximum: 2016
Range: 55
Interquartile range (IQR): 10.75

Descriptive statistics

Standard deviation: 15.819771
Coefficient of variation (CV): 0.0078806613
Kurtosis: 7.7372052
Mean: 2007.4167
Median Absolute Deviation (MAD): 0.5
Skewness: -2.6698106
Sum: 24089
Variance: 250.26515
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=6)
Value      Count  Frequency (%)
2015       6      0.5%
2016       2      0.2%
1961       1      0.1%
2002       1      0.1%
2005       1      0.1%
1999       1      0.1%
(Missing)  1271   99.1%

Value  Count  Frequency (%)
1961   1      0.1%
1999   1      0.1%
2002   1      0.1%
2005   1      0.1%
2015   6      0.5%
2016   2      0.2%

Value  Count  Frequency (%)
2016   2      0.2%
2015   6      0.5%
2005   1      0.1%
2002   1      0.1%
1999   1      0.1%
1961   1      0.1%
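
Because only 12 rows of (회사)기준년도 are non-missing, its summary statistics can be recomputed by hand from the value counts. The sketch below does this with Python's `statistics` module. It assumes the report uses the sample variance (n − 1 denominator) and linear interpolation between order statistics for quantiles; those choices reproduce the printed figures but are inferred rather than confirmed, and the `quantile` helper is written for this example.

```python
import statistics

# Non-missing values of (회사)기준년도, expanded from the value counts.
years = [2015] * 6 + [2016] * 2 + [1961, 2002, 2005, 1999]

def quantile(data, q):
    """Quantile with linear interpolation between order statistics."""
    ordered = sorted(data)
    pos = q * (len(ordered) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (pos - lo) * (ordered[hi] - ordered[lo])

mean = statistics.mean(years)
variance = statistics.variance(years)  # sample variance (n - 1 denominator)
std = statistics.stdev(years)
cv = std / mean                        # coefficient of variation
median = statistics.median(years)
mad = statistics.median(abs(y - median) for y in years)
q1, q3 = quantile(years, 0.25), quantile(years, 0.75)

print(f"sum={sum(years)} mean={mean:.4f} variance={variance:.5f} std={std:.6f}")
print(f"median={median} MAD={mad} Q1={q1} Q3={q3} IQR={q3 - q1}")
```

With so few observations, the single 1961 entry drives most of the variance and the strong negative skew, so these statistics say little about the column as a whole.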

기업명 영문
Text

MISSING 

Distinct110
Distinct (%)99.1%
Missing1172
Missing (%)91.3%
Memory size10.2 KiB
2023-12-12T09:03:40.439663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length: 46
Median length: 29
Mean length: 20.108108
Min length: 4

Characters and Unicode

Total characters: 2232
Distinct characters: 69
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
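
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below classifies one of this variable's sample values; the two-letter codes correspond to the category names used in this report (`Lu` Uppercase Letter, `Ll` Lowercase Letter, `Zs` Space Separator, `Po` Other Punctuation, `Lo` Other Letter). How the profiling tool itself derives its counts is not shown here.

```python
import unicodedata
from collections import Counter

# One of the sample values listed for this variable.
text = "SAMIL CHEMICAL Co., Ltd."

# Tally the Unicode general category of every character.
categories = Counter(unicodedata.category(ch) for ch in text)

print(categories["Lu"], categories["Ll"], categories["Zs"], categories["Po"])
# 15 3 3 3

# Hangul syllables fall under "Other Letter".
print(unicodedata.category("환"))  # Lo
```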

Unique

Unique: 109
Unique (%): 98.2%
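
Distinct counts how many different values occur, whereas Unique counts values that occur exactly once. With 1283 - 1172 = 111 non-missing entries, 110 distinct values and 109 unique ones, exactly one name must appear twice. The sketch below shows the distinction on a hypothetical toy column; the names in it are invented for illustration.

```python
from collections import Counter

# Hypothetical toy column; None marks a missing entry.
names = ["Alpha Co", "Beta Ltd", "Alpha Co", None, "Gamma Inc"]

present = [v for v in names if v is not None]
counts = Counter(present)

n_distinct = len(counts)                              # different values
n_unique = sum(1 for c in counts.values() if c == 1)  # seen exactly once

print(n_distinct, n_unique)                       # 3 2
print(round(100 * n_distinct / len(present), 1))  # 75.0
```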

Sample

1st row: Sacheon Environment co
2nd row: Tobang E&amp;E
3rd row: TAECHANG NIKKEI
4th row: SAMIL CHEMICAL Co., Ltd.
5th row: Forcebel Global Co., Ltd.
Value | Count | Frequency (%)
co | 29 | 9.2%
ltd | 29 | 9.2%
co.,ltd | 18 | 5.7%
environment | 10 | 3.2%
industry | 7 | 2.2%
co.ltd | 6 | 1.9%
korea | 5 | 1.6%
inc | 5 | 1.6%
construction | 5 | 1.6%
engineering | 4 | 1.3%
Other values (170) | 197 | 62.5%

Most occurring characters

ValueCountFrequency (%)
206
 
9.2%
o 159
 
7.1%
n 155
 
6.9%
e 109
 
4.9%
t 105
 
4.7%
. 99
 
4.4%
C 91
 
4.1%
L 77
 
3.4%
i 71
 
3.2%
r 67
 
3.0%
Other values (59) 1093
49.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 1122 | 50.3%
Uppercase Letter | 719 | 32.2%
Space Separator | 206 | 9.2%
Other Punctuation | 161 | 7.2%
Other Letter | 13 | 0.6%
Dash Punctuation | 5 | 0.2%
Decimal Number | 2 | 0.1%
Close Punctuation | 2 | 0.1%
Open Punctuation | 2 | 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 159
14.2%
n 155
13.8%
e 109
9.7%
t 105
9.4%
i 71
 
6.3%
r 67
 
6.0%
d 62
 
5.5%
a 53
 
4.7%
c 49
 
4.4%
g 42
 
3.7%
Other values (14) 250
22.3%
Uppercase Letter
ValueCountFrequency (%)
C 91
12.7%
L 77
10.7%
E 64
 
8.9%
O 53
 
7.4%
N 47
 
6.5%
T 47
 
6.5%
D 42
 
5.8%
I 41
 
5.7%
S 34
 
4.7%
A 33
 
4.6%
Other values (14) 190
26.4%
Other Letter
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%
Other Punctuation
ValueCountFrequency (%)
. 99
61.5%
, 49
30.4%
& 7
 
4.3%
; 6
 
3.7%
Space Separator
ValueCountFrequency (%)
206
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 1841 | 82.5%
Common | 378 | 16.9%
Hangul | 13 | 0.6%
0.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 159
 
8.6%
n 155
 
8.4%
e 109
 
5.9%
t 105
 
5.7%
C 91
 
4.9%
L 77
 
4.2%
i 71
 
3.9%
r 67
 
3.6%
E 64
 
3.5%
d 62
 
3.4%
Other values (38) 881
47.9%
Hangul
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%
Common
ValueCountFrequency (%)
206
54.5%
. 99
26.2%
, 49
 
13.0%
& 7
 
1.9%
; 6
 
1.6%
- 5
 
1.3%
2 2
 
0.5%
) 2
 
0.5%
( 2
 
0.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2219 | 99.4%
Hangul | 13 | 0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
206
 
9.3%
o 159
 
7.2%
n 155
 
7.0%
e 109
 
4.9%
t 105
 
4.7%
. 99
 
4.5%
C 91
 
4.1%
L 77
 
3.5%
i 71
 
3.2%
r 67
 
3.0%
Other values (47) 1080
48.7%
Hangul
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%

회사 사업 분야
Text

MISSING 

Distinct: 91
Distinct (%): 71.7%
Missing: 1156
Missing (%): 90.1%
Memory size: 10.2 KiB

Length

Max length: 100
Median length: 47
Mean length: 13.889764
Min length: 2

Characters and Unicode

Total characters: 1764
Distinct characters: 177
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 79
Unique (%): 62.2%

Sample

1st row: 조경, 환경복원, 생태복원
2nd row: 건설폐기물(폐콘/폐아스콘) 중간 처리
3rd row: 농업, 어업, 광업, 임업
4th row: 석유, 화학, 에너지
5th row: 토양/지하수정화, 토목공사업, 엔지니어링, 전문광해방지
ValueCountFrequency (%)
건설 26
 
6.7%
토목 17
 
4.4%
건축 17
 
4.4%
환경 14
 
3.6%
시공 11
 
2.8%
10
 
2.6%
제조 10
 
2.6%
폐기물 8
 
2.1%
처리업 8
 
2.1%
상하수도 7
 
1.8%
Other values (180) 258
66.8%

Most occurring characters

ValueCountFrequency (%)
259
 
14.7%
, 131
 
7.4%
79
 
4.5%
68
 
3.9%
64
 
3.6%
46
 
2.6%
46
 
2.6%
38
 
2.2%
37
 
2.1%
36
 
2.0%
Other values (167) 960
54.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1359 | 77.0%
Space Separator | 259 | 14.7%
Other Punctuation | 138 | 7.8%
Uppercase Letter | 4 | 0.2%
Open Punctuation | 2 | 0.1%
Close Punctuation | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
5.8%
68
 
5.0%
64
 
4.7%
46
 
3.4%
46
 
3.4%
38
 
2.8%
37
 
2.7%
36
 
2.6%
35
 
2.6%
34
 
2.5%
Other values (158) 876
64.5%
Other Punctuation
ValueCountFrequency (%)
, 131
94.9%
/ 5
 
3.6%
· 2
 
1.4%
Uppercase Letter
ValueCountFrequency (%)
S 2
50.0%
R 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
259
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1359 | 77.0%
Common | 401 | 22.7%
Latin | 4 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
5.8%
68
 
5.0%
64
 
4.7%
46
 
3.4%
46
 
3.4%
38
 
2.8%
37
 
2.7%
36
 
2.6%
35
 
2.6%
34
 
2.5%
Other values (158) 876
64.5%
Common
ValueCountFrequency (%)
259
64.6%
, 131
32.7%
/ 5
 
1.2%
· 2
 
0.5%
( 2
 
0.5%
) 2
 
0.5%
Latin
ValueCountFrequency (%)
S 2
50.0%
R 1
25.0%
B 1
25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1359 | 77.0%
ASCII | 403 | 22.8%
None | 2 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
259
64.3%
, 131
32.5%
/ 5
 
1.2%
( 2
 
0.5%
) 2
 
0.5%
S 2
 
0.5%
R 1
 
0.2%
B 1
 
0.2%
Hangul
ValueCountFrequency (%)
79
 
5.8%
68
 
5.0%
64
 
4.7%
46
 
3.4%
46
 
3.4%
38
 
2.8%
37
 
2.7%
36
 
2.6%
35
 
2.6%
34
 
2.5%
Other values (158) 876
64.5%
None
ValueCountFrequency (%)
· 2
100.0%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 회원구분 | (회사)업태 | (회사)기준년도 | 회사 사업 분야
회원구분 | 1.000 | 0.585 | 0.872 | 0.426
(회사)업태 | 0.585 | 1.000 | 0.435 | 0.864
(회사)기준년도 | 0.872 | 0.435 | 1.000 | NaN
회사 사업 분야 | 0.426 | 0.864 | NaN | 1.000

[Figure: correlation heatmap]

Variable | 회원구분 | (회사)업태
회원구분 | 1.000 | 0.337
(회사)업태 | 0.337 | 1.000

[Figure: correlation heatmap]

Variable | (회사)기준년도 | 회원구분 | (회사)업태
(회사)기준년도 | 1.000 | 0.701 | 0.575
회원구분 | 0.701 | 1.000 | 0.337
(회사)업태 | 0.575 | 0.337 | 1.000

Missing values

[Figure: bar chart] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
[Figure: nullity correlation heatmap] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
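
One common way to quantify nullity correlation is the Pearson correlation between per-column missingness indicators (1 if a value is missing, 0 otherwise). The sketch below takes that approach on a hypothetical four-row table; whether this report's heatmap uses exactly this formula is an assumption, and the column names and values are invented for illustration.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical rows; None marks a missing cell.
rows = [
    {"phone": "031-000-0000", "homepage": "example.co.kr", "english_name": None},
    {"phone": None, "homepage": None, "english_name": "Example Co"},
    {"phone": "02-000-0000", "homepage": "example.com", "english_name": None},
    {"phone": None, "homepage": None, "english_name": None},
]

def missing_indicator(column):
    """1 where the column is missing, 0 where it is present."""
    return [1 if row[column] is None else 0 for row in rows]

same = pearson(missing_indicator("phone"), missing_indicator("homepage"))
mixed = pearson(missing_indicator("phone"), missing_indicator("english_name"))

print(round(same, 3))   # 1.0
print(round(mixed, 3))  # -0.577
```

A value near 1 means two columns tend to be missing together, a value near -1 means one tends to be present when the other is missing, and a value near 0 means no linear relationship.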

Sample

Index | 회사번호 | 회원구분 | (회사)명 | 사업자등록번호 | (회사)업종 | (회사)업태 | 회사 대표자 | (회사)주소1 | (회사)주소2 | (회사)전화 | (회사)홈페이지 | (회사)기준년도 | 기업명 영문 | 회사 사업 분야
0 | CP00000875 | 중소기업 | 신잔토개발㈜ | <NA> | <NA> | <NA> | <NA> | 경기도 연천군 전곡읍 늘목리 61-4 | <NA> | 031-832-0011 | <NA> | <NA> | <NA> | <NA>
1 | CP00000876 | 중소기업 | (주)진흥중공업 | 124-81-69018 | 폐기물수집 및 처리업 | 기타 | 박찬양 | 경기도 화성시 정문송산로93번길 10-27 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
2 | CP00000877 | 중소기업 | 도솔환경산업㈜ | <NA> | <NA> | <NA> | <NA> | 충남 천안시 직산면 자은가리 82-2 | <NA> | 041-584-7007 | <NA> | <NA> | <NA> | <NA>
3 | CP00000878 | 중소기업 | 삼원환경산업(주) | 313-81-01199 | 서비스 | 전문건설업 | 남궁훈 | 충청남도 보령시 남포면 평촌밤섬길 218-191 | <NA> | 041-931-1425 | <NA> | <NA> | <NA> | <NA>
4 | CP00000884 | 중소기업 | ㈜케이벡코리아 | 215-86-53364 | 환경.토목엔지니어링 | 기타 | 송테드 | 서울 송파구 방이1동 165-3 | <NA> | 02-417-4150 | ww.kbec.co.kr | <NA> | <NA> | <NA>
5 | CP00000890 | 중소기업 | 동원이앤텍㈜ | 128-81-85378 | 환경시설 | 기타 | 최광진 | 경기 고양시 일산동구 백석동 | <NA> | 031-906-3223 | <NA> | <NA> | <NA> | <NA>
6 | CP00000896 | 중소기업 | ㈜장형기업 | 136-81-13652 | <NA> | <NA> | <NA> | 인천광역시 서구 오류동 410-472 | <NA> | 032-562-1658 | <NA> | <NA> | <NA> | <NA>
7 | CP00000897 | 중소기업 | ㈜경진엔지니어링 | 122-81-84819 | 상하수도공사 | 전문건설업 | 최철현 | 인천 계양구 서운동 148-84 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
8 | CP00000899 | 대학 | 경남과학기술대학교 산학협력단 | 613-82-09900 | <NA> | <NA> | <NA> | 경남 진주시 칠암동 150 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
9 | CP00000900 | 중소기업 | 청정환경설비 | 137-02-76975 | <NA> | <NA> | <NA> | 경기도 이천시 모가면 소고리 96-14 | <NA> | 031-574-6305 | <NA> | <NA> | <NA> | <NA>

Index | 회사번호 | 회원구분 | (회사)명 | 사업자등록번호 | (회사)업종 | (회사)업태 | 회사 대표자 | (회사)주소1 | (회사)주소2 | (회사)전화 | (회사)홈페이지 | (회사)기준년도 | 기업명 영문 | 회사 사업 분야
1273 | CP00003345 | 중소기업 | 케이알컨소시엄주식회사 | 114-86-67395 | 환경컨설팅및엔지니어링, 기타무역업, 환경민에너지연구개발, 경영컨설팅, 환경정화및복원사업 | <NA> | 이영민 | 서울특별시 서초구 서초대로 46 (방배동) | (방배동, 극동빌딩) | <NA> | <NA> | <NA> | KR Consortium Co., Ltd. | <NA>
1274 | CP00003347 | 중소기업 | (주)대산엘이디전기조명 | 514-81-95612 | 엘이디, 경관조명장치, 조명기구및 제어장치, 철제, 스텐리스가로등주, 전기공사 | <NA> | 이종규 | 대구광역시 북구 유통단지로 103 (산격동) | 건축자재관 1층 18호 | <NA> | <NA> | <NA> | <NA> | <NA>
1275 | CP00003350 | 중소기업 | 케이퓨전테크놀로지 | 332-87-00795 | 플라즈마 발생장치, 수소 발생장치 | <NA> | 곽헌길 | 경기도 안산시 상록구 한양대학로 55 (사동) | 한양대 에리카 창업보육센터 213호 | 031-400-3815 | <NA> | <NA> | K-fusion Technology, inc | <NA>
1276 | CP00003352 | 중견기업 | 성신양회 주식회사 | 101-81-18194 | 제조업 | <NA> | 김상규 | 서울특별시 종로구 인사동5길 29 (인사동) | 7층 | 02-3782-7000 | http://www.sungshincement.co.kr/ | <NA> | <NA> | <NA>
1277 | CP00003353 | 중소기업 | (주)삼진야드 | 105-81-77837 | 냉동탑, 특장차, 항만장비 | <NA> | 신성수 | 부산광역시 강서구 미음산단6로 56 (미음동) | (미음동) | 051-831-7525 | <NA> | <NA> | samjinyard.co.LTD | <NA>
1278 | CP00003354 | 중소기업 | 이앤켐솔루션 | 206-86-19800 | 흡착제 제조, 연구개발업 | <NA> | 김신동 | 경기도 포천시 군내면 용정경제로1길 94-38 | 이앤켐솔루션 | <NA> | <NA> | <NA> | E &amp; Chem Solution Corp. | <NA>
1279 | CP00003348 | 중소기업 | 주식회사 시원 | 446-86-01418 | 수전금구,욕실부자재 | <NA> | 이시원 | 경기도 김포시 하성면 월하로705번길 54-3 | 주식회사 시원 | 031-982-5227 | <NA> | <NA> | <NA> | <NA>
1280 | CP00003349 | 중소기업 | 한국환경시스템주식회사 | 814-81-00000 | 제조, 도소매 | <NA> | 박재갑 | 경기도 고양시 일산서구 구산로69번길 23-17 (구산동) | 한국환경시스템(주) | 031-912-0815 | http;//www.uhdkes.co.kr | <NA> | Korea Environmental System Co., Ltd. | <NA>
1281 | CP00003346 | 중소기업 | (주)호생환경 | 606-81-17346 | 비금속광물제품제조업 | <NA> | 황 준 | 부산광역시 사상구 낙동대로 665 (엄궁동) | <NA> | 051-327-1333 | <NA> | <NA> | <NA> | <NA>
1282 | CP00003351 | 공공기관 | 재단법인 철원플라즈마 산업기술연구원 | 127-82-15110 | 플라즈마 연구및개발업, 플라즈마 응용제품 등 | <NA> | 이현종 | 강원도 철원군 갈말읍 호국로 4620 | 철원플라즈마산업기술연구원 | 033-452-9709 | http://www.cpri.re.kr/ | <NA> | Cheorwon Plasma Research Institute | <NA>