Overview

Dataset statistics

Number of variables: 8
Number of observations: 738
Missing cells: 880
Missing cells (%): 14.9%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 46.3 KiB
Average record size in memory: 64.2 B
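
The derived figures in this table follow from the raw counts. A small sketch that recomputes them, using only the numbers printed above (the variable names are illustrative, not part of the report):

```python
# Raw figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 738
missing_cells = 880
duplicate_rows = 1
total_size_kib = 46.3

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
# 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The rounded results agree with the reported 14.9%, 0.1%, and 64.2 B.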

Variable types

Text: 6
Categorical: 2

Dataset

Description: Information on manufacturers in Geumsan-gun, Chungcheongnam-do, including company name, representative name, company address, company phone number, and fax number.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=395&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034906

Alerts

데이터기준일자 (data reference date) has constant value "2020-02-10" [Constant]
Dataset has 1 (0.1%) duplicate rows [Duplicates]
전화번호 (phone number) has 156 (21.1%) missing values [Missing]
팩스번호 (fax number) has 724 (98.1%) missing values [Missing]
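
The two Missing alerts account for all missing cells in the dataset, since every other variable reports 0 missing values. A short sketch of that check, using only counts from this report:

```python
n_observations = 738

# Per-column missing counts from the alerts; all other columns report 0.
missing_by_column = {"전화번호": 156, "팩스번호": 724}

for column, missing in missing_by_column.items():
    share = 100 * missing / n_observations
    print(f"{column}: {missing} missing ({share:.1f}%)")

# Should equal the "Missing cells" figure in the dataset statistics.
total_missing = sum(missing_by_column.values())
print(f"total missing cells: {total_missing}")
```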

Reproduction

Analysis started: 2024-01-09 22:58:34.139834
Analysis finished: 2024-01-09 22:58:35.173105
Duration: 1.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
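
A report like this one can be regenerated along the following lines. This is a sketch, not the exact invocation used here: the CSV path, output path, and report title are hypothetical, and the settings stored in config.json are not reproduced.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even without the third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # dtype=str keeps phone and fax numbers as text, preserving leading zeros.
    # The file encoding is an assumption; pass encoding=... if the default fails.
    df = pd.read_csv(csv_path, dtype=str)

    # The title is a placeholder, not the title of the original report.
    report = ProfileReport(df, title="Geumsan-gun manufacturers")
    report.to_file(html_path)
```

Calling `build_report("manufacturers.csv", "report.html")` would write the HTML report, assuming both packages are installed and the hypothetical input file exists.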

Variables

회사명
Text

Distinct: 665
Distinct (%): 90.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Length

Max length: 20
Median length: 17
Mean length: 7.2601626
Min length: 2

Characters and Unicode

Total characters: 5358
Distinct characters: 378
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
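
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The input is the five sample company names from this report; the mapping from two-letter codes to the report's category labels is written out by hand and covers only the codes that occur here.

```python
import unicodedata
from collections import Counter

# The five sample values of the company-name column.
names = [
    "(사)충청남도 장애인부모회 금산지회",
    "(유)다함",
    "(유)신화상사",
    "(유)자연의길",
    "(주)BDC",
]

# Two-letter general category codes and the labels used in this report.
LABELS = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
}

category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, count in category_counts.most_common():
    print(f"{LABELS.get(code, code)}: {count}")
```

Hangul syllables fall under Other Letter (`Lo`), which is why that category dominates the tables that follow. The enclosed symbol ㈜, which appears in some addresses, is classified as Other Symbol (`So`).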

Unique

Unique: 595
Unique (%): 80.6%

Sample

1st row: (사)충청남도 장애인부모회 금산지회
2nd row: (유)다함
3rd row: (유)신화상사
4th row: (유)자연의길
5th row: (주)BDC

Value | Count | Frequency (%)
농업회사법인 | 11 | 1.4%
제2공장 | 9 | 1.1%
영농조합법인 | 6 | 0.7%
금산공장 | 6 | 0.7%
진우산업 | 4 | 0.5%
주)휴온스네이처 | 4 | 0.5%
주)에스코알티에스 | 3 | 0.4%
중부대학교 | 3 | 0.4%
농업회사법인(주 | 3 | 0.4%
제1공장 | 3 | 0.4%
Other values (667) | 753 | 93.5%

Most occurring characters

ValueCountFrequency (%)
407
 
7.6%
( 402
 
7.5%
) 402
 
7.5%
178
 
3.3%
161
 
3.0%
109
 
2.0%
98
 
1.8%
97
 
1.8%
80
 
1.5%
80
 
1.5%
Other values (368) 3344
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4401
82.1%
Open Punctuation 402
 
7.5%
Close Punctuation 402
 
7.5%
Space Separator 67
 
1.3%
Uppercase Letter 32
 
0.6%
Decimal Number 28
 
0.5%
Other Symbol 20
 
0.4%
Dash Punctuation 3
 
0.1%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
407
 
9.2%
178
 
4.0%
161
 
3.7%
109
 
2.5%
98
 
2.2%
97
 
2.2%
80
 
1.8%
80
 
1.8%
77
 
1.7%
76
 
1.7%
Other values (345) 3038
69.0%
Uppercase Letter
ValueCountFrequency (%)
E 5
15.6%
I 4
12.5%
D 4
12.5%
B 3
9.4%
C 3
9.4%
G 3
9.4%
P 2
 
6.2%
N 2
 
6.2%
H 2
 
6.2%
T 1
 
3.1%
Other values (3) 3
9.4%
Decimal Number
ValueCountFrequency (%)
2 20
71.4%
1 7
 
25.0%
3 1
 
3.6%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
, 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 402
100.0%
Close Punctuation
ValueCountFrequency (%)
) 402
100.0%
Space Separator
ValueCountFrequency (%)
67
100.0%
Other Symbol
ValueCountFrequency (%)
20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4421
82.5%
Common 905
 
16.9%
Latin 32
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
407
 
9.2%
178
 
4.0%
161
 
3.6%
109
 
2.5%
98
 
2.2%
97
 
2.2%
80
 
1.8%
80
 
1.8%
77
 
1.7%
76
 
1.7%
Other values (346) 3058
69.2%
Latin
ValueCountFrequency (%)
E 5
15.6%
I 4
12.5%
D 4
12.5%
B 3
9.4%
C 3
9.4%
G 3
9.4%
P 2
 
6.2%
N 2
 
6.2%
H 2
 
6.2%
T 1
 
3.1%
Other values (3) 3
9.4%
Common
ValueCountFrequency (%)
( 402
44.4%
) 402
44.4%
67
 
7.4%
2 20
 
2.2%
1 7
 
0.8%
- 3
 
0.3%
& 2
 
0.2%
, 1
 
0.1%
3 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4401
82.1%
ASCII 937
 
17.5%
None 20
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
407
 
9.2%
178
 
4.0%
161
 
3.7%
109
 
2.5%
98
 
2.2%
97
 
2.2%
80
 
1.8%
80
 
1.8%
77
 
1.7%
76
 
1.7%
Other values (345) 3038
69.0%
ASCII
ValueCountFrequency (%)
( 402
42.9%
) 402
42.9%
67
 
7.2%
2 20
 
2.1%
1 7
 
0.7%
E 5
 
0.5%
I 4
 
0.4%
D 4
 
0.4%
- 3
 
0.3%
B 3
 
0.3%
Other values (12) 20
 
2.1%
None
ValueCountFrequency (%)
20
100.0%

주소
Text

Distinct: 682
Distinct (%): 92.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Length

Max length: 49
Median length: 42
Mean length: 18.132791
Min length: 9

Characters and Unicode

Total characters: 13382
Distinct characters: 288
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 638
Unique (%): 86.4%

Sample

1st row: 금성면 금산로 2044, (마수리 204-3)
2nd row: 금산읍 후곤천길 112, 102동 103호
3rd row: 남이면 강변길 49-21, (흑암리 915)
4th row: 금성면 금성공단로 19-18, (하신리 770)
5th row: 추부면 신평공단1로 62

Value | Count | Frequency (%)
추부면 | 300 | 10.0%
복수면 | 162 | 5.4%
금산군 | 123 | 4.1%
충청남도 | 122 | 4.1%
금성면 | 78 | 2.6%
추풍로 | 58 | 1.9%
다복로 | 55 | 1.8%
진산면 | 54 | 1.8%
군북면 | 47 | 1.6%
금산읍 | 46 | 1.5%
Other values (849) | 1944 | 65.0%

Most occurring characters

ValueCountFrequency (%)
2404
 
18.0%
697
 
5.2%
1 540
 
4.0%
2 418
 
3.1%
407
 
3.0%
364
 
2.7%
349
 
2.6%
328
 
2.5%
( 313
 
2.3%
310
 
2.3%
Other values (278) 7252
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7096
53.0%
Decimal Number 2779
 
20.8%
Space Separator 2404
 
18.0%
Open Punctuation 313
 
2.3%
Close Punctuation 310
 
2.3%
Dash Punctuation 274
 
2.0%
Other Punctuation 136
 
1.0%
Uppercase Letter 46
 
0.3%
Other Symbol 24
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
697
 
9.8%
407
 
5.7%
364
 
5.1%
349
 
4.9%
328
 
4.6%
310
 
4.4%
280
 
3.9%
244
 
3.4%
234
 
3.3%
231
 
3.3%
Other values (248) 3652
51.5%
Uppercase Letter
ValueCountFrequency (%)
C 8
17.4%
E 7
15.2%
M 5
10.9%
S 4
8.7%
H 4
8.7%
R 3
 
6.5%
J 3
 
6.5%
T 3
 
6.5%
O 2
 
4.3%
I 2
 
4.3%
Other values (3) 5
10.9%
Decimal Number
ValueCountFrequency (%)
1 540
19.4%
2 418
15.0%
5 282
10.1%
3 280
10.1%
4 271
9.8%
6 223
8.0%
0 217
7.8%
7 195
 
7.0%
8 186
 
6.7%
9 167
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 135
99.3%
& 1
 
0.7%
Space Separator
ValueCountFrequency (%)
2404
100.0%
Open Punctuation
ValueCountFrequency (%)
( 313
100.0%
Close Punctuation
ValueCountFrequency (%)
) 310
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 274
100.0%
Other Symbol
ValueCountFrequency (%)
24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7120
53.2%
Common 6216
46.5%
Latin 46
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
697
 
9.8%
407
 
5.7%
364
 
5.1%
349
 
4.9%
328
 
4.6%
310
 
4.4%
280
 
3.9%
244
 
3.4%
234
 
3.3%
231
 
3.2%
Other values (249) 3676
51.6%
Common
ValueCountFrequency (%)
2404
38.7%
1 540
 
8.7%
2 418
 
6.7%
( 313
 
5.0%
) 310
 
5.0%
5 282
 
4.5%
3 280
 
4.5%
- 274
 
4.4%
4 271
 
4.4%
6 223
 
3.6%
Other values (6) 901
 
14.5%
Latin
ValueCountFrequency (%)
C 8
17.4%
E 7
15.2%
M 5
10.9%
S 4
8.7%
H 4
8.7%
R 3
 
6.5%
J 3
 
6.5%
T 3
 
6.5%
O 2
 
4.3%
I 2
 
4.3%
Other values (3) 5
10.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7096
53.0%
ASCII 6262
46.8%
None 24
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2404
38.4%
1 540
 
8.6%
2 418
 
6.7%
( 313
 
5.0%
) 310
 
5.0%
5 282
 
4.5%
3 280
 
4.5%
- 274
 
4.4%
4 271
 
4.3%
6 223
 
3.6%
Other values (19) 947
 
15.1%
Hangul
ValueCountFrequency (%)
697
 
9.8%
407
 
5.7%
364
 
5.1%
349
 
4.9%
328
 
4.6%
310
 
4.4%
280
 
3.9%
244
 
3.4%
234
 
3.3%
231
 
3.3%
Other values (248) 3652
51.5%
None
ValueCountFrequency (%)
24
100.0%

전화번호
Text

MISSING 

Distinct: 514
Distinct (%): 88.3%
Missing: 156
Missing (%): 21.1%
Memory size: 5.9 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.027491
Min length: 9

Characters and Unicode

Total characters: 7000
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 454
Unique (%): 78.0%
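
Distinct and Unique measure different things: a value is distinct if it occurs at least once and unique if it occurs exactly once. The sketch below illustrates this with the five sample values of this column. It also recomputes the two percentages on the assumption that they are taken over non-missing rows, which the report does not state but which matches the printed figures.

```python
from collections import Counter

# The five sample phone numbers of this column; the last two are identical.
phones = [
    "041-753-7887",
    "042-335-9092",
    "041-753-7979",
    "041-751-6262",
    "041-751-6262",
]

counts = Counter(phones)
distinct = len(counts)                                   # values seen at least once
unique = sum(1 for n in counts.values() if n == 1)       # values seen exactly once
print(f"distinct={distinct}, unique={unique}")

# Assumption: percentages use the non-missing row count as the denominator.
non_missing = 738 - 156
distinct_pct = 100 * 514 / non_missing
unique_pct = 100 * 454 / non_missing
print(f"distinct: {distinct_pct:.1f}%, unique: {unique_pct:.1f}%")
```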

Sample

1st row: 041-753-7887
2nd row: 042-335-9092
3rd row: 041-753-7979
4th row: 041-751-6262
5th row: 041-751-6262

Value | Count | Frequency (%)
041-752-9945 | 4 | 0.7%
041-754-5421 | 4 | 0.7%
041-751-6870 | 3 | 0.5%
041-754-0072 | 3 | 0.5%
041-753-7327 | 3 | 0.5%
041-752-3243 | 3 | 0.5%
041-752-0872 | 2 | 0.3%
041-753-6102 | 2 | 0.3%
041-753-7685 | 2 | 0.3%
041-751-8091 | 2 | 0.3%
Other values (504) | 554 | 95.2%

Most occurring characters

ValueCountFrequency (%)
- 1157
16.5%
0 979
14.0%
1 923
13.2%
4 897
12.8%
7 778
11.1%
5 747
10.7%
2 394
 
5.6%
3 392
 
5.6%
8 265
 
3.8%
9 234
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5843
83.5%
Dash Punctuation 1157
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 979
16.8%
1 923
15.8%
4 897
15.4%
7 778
13.3%
5 747
12.8%
2 394
6.7%
3 392
6.7%
8 265
 
4.5%
9 234
 
4.0%
6 234
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 1157
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1157
16.5%
0 979
14.0%
1 923
13.2%
4 897
12.8%
7 778
11.1%
5 747
10.7%
2 394
 
5.6%
3 392
 
5.6%
8 265
 
3.8%
9 234
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1157
16.5%
0 979
14.0%
1 923
13.2%
4 897
12.8%
7 778
11.1%
5 747
10.7%
2 394
 
5.6%
3 392
 
5.6%
8 265
 
3.8%
9 234
 
3.3%

팩스번호
Text

MISSING 

Distinct: 14
Distinct (%): 100.0%
Missing: 724
Missing (%): 98.1%
Memory size: 5.9 KiB

Length

Max length: 13
Median length: 13
Mean length: 12.857143
Min length: 12

Characters and Unicode

Total characters: 180
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 14
Unique (%): 100.0%

Sample

1st row: 041-753-2980
2nd row: 041-0753-4142
3rd row: 041-0754-3529
4th row: 041-0753-8658
5th row: 041-0753-3395

Value | Count | Frequency (%)
041-753-2980 | 1 | 7.1%
041-0753-4142 | 1 | 7.1%
041-0754-3529 | 1 | 7.1%
041-0753-8658 | 1 | 7.1%
041-0753-3395 | 1 | 7.1%
041-0753-8711 | 1 | 7.1%
041-0753-7688 | 1 | 7.1%
041-0753-6785 | 1 | 7.1%
041-0752-9228 | 1 | 7.1%
041-0752-9278 | 1 | 7.1%
Other values (4) | 4 | 28.6%
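
The values above mix two layouts: `041-753-2980`, and entries such as `041-0753-4142` whose four-digit middle group starts with 0. A minimal sketch of how such rows might be flagged for review follows. It assumes, without confirmation from the report, that the leading zero in the middle group is a data-entry artifact; the pattern and the helper name `looks_suspect` are illustrative.

```python
import re

# Area code, exchange, and subscriber groups separated by hyphens.
PHONE_RE = re.compile(r"^(0\d{1,2})-(\d{3,4})-(\d{4})$")


def looks_suspect(number: str) -> bool:
    """Return True if the number does not parse or its middle group starts with 0."""
    match = PHONE_RE.match(number)
    if match is None:
        return True
    return match.group(2).startswith("0")


# The five sample fax numbers of this column.
fax_numbers = [
    "041-753-2980",
    "041-0753-4142",
    "041-0754-3529",
    "041-0753-8658",
    "041-0753-3395",
]

suspect = [n for n in fax_numbers if looks_suspect(n)]
print(suspect)
```

Under this assumption, four of the five sample values would be flagged; whether they should be corrected is a decision for the data owner.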

Most occurring characters

ValueCountFrequency (%)
0 29
16.1%
- 28
15.6%
4 22
12.2%
7 22
12.2%
5 19
10.6%
1 18
10.0%
2 11
 
6.1%
8 11
 
6.1%
3 10
 
5.6%
9 5
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 152
84.4%
Dash Punctuation 28
 
15.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 29
19.1%
4 22
14.5%
7 22
14.5%
5 19
12.5%
1 18
11.8%
2 11
 
7.2%
8 11
 
7.2%
3 10
 
6.6%
9 5
 
3.3%
6 5
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 29
16.1%
- 28
15.6%
4 22
12.2%
7 22
12.2%
5 19
10.6%
1 18
10.0%
2 11
 
6.1%
8 11
 
6.1%
3 10
 
5.6%
9 5
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 29
16.1%
- 28
15.6%
4 22
12.2%
7 22
12.2%
5 19
10.6%
1 18
10.0%
2 11
 
6.1%
8 11
 
6.1%
3 10
 
5.6%
9 5
 
2.8%

업종
Categorical

Distinct: 24
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Value | Count
음식료 | 215
석유·화학 | 87
목재종이출판 | 85
기계 | 85
철강 | 63
Other values (19) | 203

Length

Max length: 25
Median length: 16
Mean length: 3.798103
Min length: 1

Unique

Unique: 10
Unique (%): 1.4%

Sample

1st row: 목재종이출판
2nd row: 전기·전자
3rd row: 목재종이출판
4th row: 음식료
5th row: 전기·전자

Common Values

Value | Count | Frequency (%)
음식료 | 215 | 29.1%
석유·화학 | 87 | 11.8%
목재종이출판 | 85 | 11.5%
기계 | 85 | 11.5%
철강 | 63 | 8.5%
비금속소재 | 59 | 8.0%
전기·전자 | 40 | 5.4%
기타 | 35 | 4.7%
운송장비 | 20 | 2.7%
섬유의복 | 15 | 2.0%
Other values (14) | 34 | 4.6%
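
The frequencies in this table are each count divided by the 738 observations. A brief sketch that recomputes them from the counts above and confirms that the rows sum to the full dataset:

```python
# Counts copied from the Common Values table of the industry (업종) column.
counts = {
    "음식료": 215,
    "석유·화학": 87,
    "목재종이출판": 85,
    "기계": 85,
    "철강": 63,
    "비금속소재": 59,
    "전기·전자": 40,
    "기타": 35,
    "운송장비": 20,
    "섬유의복": 15,
    "Other values (14)": 34,
}

total = sum(counts.values())
shares = {value: round(100 * n / total, 1) for value, n in counts.items()}

print(f"total observations: {total}")
for value, share in shares.items():
    print(f"{value}: {share}%")
```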

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
음식료 | 215 | 28.1%
석유·화학 | 87 | 11.4%
목재종이출판 | 85 | 11.1%
기계 | 85 | 11.1%
철강 | 63 | 8.2%
비금속소재 | 59 | 7.7%
전기·전자 | 40 | 5.2%
기타 | 35 | 4.6%
운송장비 | 21 | 2.7%
섬유의복 | 15 | 2.0%
Other values (29) | 61 | 8.0%

생산품
Text

Distinct: 582
Distinct (%): 78.9%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Length

Max length: 40
Median length: 22
Mean length: 7.8102981
Min length: 1

Characters and Unicode

Total characters: 5764
Distinct characters: 480
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 483
Unique (%): 65.4%

Sample

1st row: 롤화장지
2nd row: 방송장비
3rd row: 재제목
4th row: 홍삼, 백삼
5th row: 강관, 방송장비, 조명장치 등

Value | Count | Frequency (%)
 | 32 | 2.6%
홍삼액 | 27 | 2.2%
홍삼 | 15 | 1.2%
플라스틱 | 15 | 1.2%
인삼식품 | 10 | 0.8%
철구조물 | 9 | 0.7%
 | 8 | 0.7%
홍삼절편 | 8 | 0.7%
알루미늄 | 7 | 0.6%
홍삼정과 | 7 | 0.6%
Other values (785) | 1077 | 88.6%

Most occurring characters

ValueCountFrequency (%)
481
 
8.3%
, 318
 
5.5%
174
 
3.0%
130
 
2.3%
123
 
2.1%
117
 
2.0%
103
 
1.8%
98
 
1.7%
94
 
1.6%
71
 
1.2%
Other values (470) 4055
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4730
82.1%
Space Separator 481
 
8.3%
Other Punctuation 343
 
6.0%
Uppercase Letter 117
 
2.0%
Open Punctuation 38
 
0.7%
Close Punctuation 38
 
0.7%
Lowercase Letter 8
 
0.1%
Dash Punctuation 6
 
0.1%
Decimal Number 2
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
174
 
3.7%
130
 
2.7%
123
 
2.6%
117
 
2.5%
103
 
2.2%
98
 
2.1%
94
 
2.0%
71
 
1.5%
66
 
1.4%
65
 
1.4%
Other values (434) 3689
78.0%
Uppercase Letter
ValueCountFrequency (%)
P 18
15.4%
E 18
15.4%
C 13
11.1%
D 10
8.5%
L 9
7.7%
V 8
6.8%
H 8
6.8%
T 7
 
6.0%
R 7
 
6.0%
F 5
 
4.3%
Other values (8) 14
12.0%
Lowercase Letter
ValueCountFrequency (%)
d 2
25.0%
o 1
12.5%
c 1
12.5%
l 1
12.5%
a 1
12.5%
e 1
12.5%
r 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 318
92.7%
. 22
 
6.4%
· 2
 
0.6%
/ 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
481
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4730
82.1%
Common 909
 
15.8%
Latin 125
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
174
 
3.7%
130
 
2.7%
123
 
2.6%
117
 
2.5%
103
 
2.2%
98
 
2.1%
94
 
2.0%
71
 
1.5%
66
 
1.4%
65
 
1.4%
Other values (434) 3689
78.0%
Latin
ValueCountFrequency (%)
P 18
14.4%
E 18
14.4%
C 13
10.4%
D 10
8.0%
L 9
7.2%
V 8
 
6.4%
H 8
 
6.4%
T 7
 
5.6%
R 7
 
5.6%
F 5
 
4.0%
Other values (15) 22
17.6%
Common
ValueCountFrequency (%)
481
52.9%
, 318
35.0%
( 38
 
4.2%
) 38
 
4.2%
. 22
 
2.4%
- 6
 
0.7%
· 2
 
0.2%
2 1
 
0.1%
4 1
 
0.1%
` 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4730
82.1%
ASCII 1032
 
17.9%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
481
46.6%
, 318
30.8%
( 38
 
3.7%
) 38
 
3.7%
. 22
 
2.1%
P 18
 
1.7%
E 18
 
1.7%
C 13
 
1.3%
D 10
 
1.0%
L 9
 
0.9%
Other values (25) 67
 
6.5%
Hangul
ValueCountFrequency (%)
174
 
3.7%
130
 
2.7%
123
 
2.6%
117
 
2.5%
103
 
2.2%
98
 
2.1%
94
 
2.0%
71
 
1.5%
66
 
1.4%
65
 
1.4%
Other values (434) 3689
78.0%
None
ValueCountFrequency (%)
· 2
100.0%

대표명
Text

Distinct: 620
Distinct (%): 84.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Length

Max length: 8
Median length: 3
Mean length: 3.101626
Min length: 2

Characters and Unicode

Total characters: 2289
Distinct characters: 196
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 514
Unique (%): 69.6%

Sample

1st row: 최성분
2nd row: 박성화
3rd row: 조경배
4th row: 길호철
5th row: 송운용

Value | Count | Frequency (%)
김우정 | 4 | 0.5%
천청운 | 4 | 0.5%
심순택 | 3 | 0.4%
손동이 | 3 | 0.4%
이승현 | 3 | 0.4%
박병달 | 3 | 0.4%
유태식 | 3 | 0.4%
최원문 | 3 | 0.4%
조민행 | 3 | 0.4%
조성호 | 3 | 0.4%
Other values (621) | 723 | 95.8%

Most occurring characters

ValueCountFrequency (%)
157
 
6.9%
101
 
4.4%
76
 
3.3%
61
 
2.7%
57
 
2.5%
40
 
1.7%
40
 
1.7%
40
 
1.7%
39
 
1.7%
36
 
1.6%
Other values (186) 1642
71.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2253
98.4%
Space Separator 17
 
0.7%
Other Punctuation 17
 
0.7%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
157
 
7.0%
101
 
4.5%
76
 
3.4%
61
 
2.7%
57
 
2.5%
40
 
1.8%
40
 
1.8%
40
 
1.8%
39
 
1.7%
36
 
1.6%
Other values (183) 1606
71.3%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2253
98.4%
Common 36
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
157
 
7.0%
101
 
4.5%
76
 
3.4%
61
 
2.7%
57
 
2.5%
40
 
1.8%
40
 
1.8%
40
 
1.8%
39
 
1.7%
36
 
1.6%
Other values (183) 1606
71.3%
Common
ValueCountFrequency (%)
17
47.2%
, 17
47.2%
1 2
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2253
98.4%
ASCII 36
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
157
 
7.0%
101
 
4.5%
76
 
3.4%
61
 
2.7%
57
 
2.5%
40
 
1.8%
40
 
1.8%
40
 
1.8%
39
 
1.7%
36
 
1.6%
Other values (183) 1606
71.3%
ASCII
ValueCountFrequency (%)
17
47.2%
, 17
47.2%
1 2
 
5.6%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Value | Count
2020-02-10 | 738

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020-02-10
2nd row: 2020-02-10
3rd row: 2020-02-10
4th row: 2020-02-10
5th row: 2020-02-10

Common Values

Value | Count | Frequency (%)
2020-02-10 | 738 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2020-02-10 | 738 | 100.0%

Correlations

Variable | 팩스번호 | 업종
팩스번호 | 1.000 | 1.000
업종 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
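
To make the idea of nullity concrete, the sketch below builds a per-column missing count and a small nullity matrix in plain Python. The rows are the first five records of this report's Sample table, restricted to three columns for brevity; `None` stands for the `<NA>` marker. This is an illustration, not the code that drew the plots.

```python
# First five sample rows, three columns only; None stands for <NA>.
rows = [
    {"회사명": "(사)충청남도 장애인부모회 금산지회", "전화번호": "041-753-7887", "팩스번호": None},
    {"회사명": "(유)다함", "전화번호": "042-335-9092", "팩스번호": None},
    {"회사명": "(유)신화상사", "전화번호": None, "팩스번호": None},
    {"회사명": "(유)자연의길", "전화번호": "041-753-7979", "팩스번호": None},
    {"회사명": "(주)BDC", "전화번호": "041-751-6262", "팩스번호": None},
]
columns = ["회사명", "전화번호", "팩스번호"]

# Count of missing values in each column.
missing_per_column = {c: sum(r[c] is None for r in rows) for c in columns}

# One row per record, True where a value is present.
nullity_matrix = [[r[c] is not None for c in columns] for r in rows]

print(missing_per_column)
for flags in nullity_matrix:
    # '#' marks a present value, '.' a missing one.
    print("".join("#" if present else "." for present in flags))
```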

Sample

Index | 회사명 | 주소 | 전화번호 | 팩스번호 | 업종 | 생산품 | 대표명 | 데이터기준일자
0 | (사)충청남도 장애인부모회 금산지회 | 금성면 금산로 2044, (마수리 204-3) | 041-753-7887 | <NA> | 목재종이출판 | 롤화장지 | 최성분 | 2020-02-10
1 | (유)다함 | 금산읍 후곤천길 112, 102동 103호 | 042-335-9092 | <NA> | 전기·전자 | 방송장비 | 박성화 | 2020-02-10
2 | (유)신화상사 | 남이면 강변길 49-21, (흑암리 915) | <NA> | <NA> | 목재종이출판 | 재제목 | 조경배 | 2020-02-10
3 | (유)자연의길 | 금성면 금성공단로 19-18, (하신리 770) | 041-753-7979 | <NA> | 음식료 | 홍삼, 백삼 | 길호철 | 2020-02-10
4 | (주)BDC | 추부면 신평공단1로 62 | 041-751-6262 | <NA> | 전기·전자 | 강관, 방송장비, 조명장치 등 | 송운용 | 2020-02-10
5 | (주)BDC 제2공장 | 추부면 신평공단1로 85 | 041-751-6262 | <NA> | 전기·전자 | 무선통신용강관주 | 송운용 | 2020-02-10
6 | (주)EG | 추부면 서대산로 459 (㈜EG) | 041-750-7777 | <NA> | 석유·화학 | 산화철 | 박지만 | 2020-02-10
7 | (주)갈산 | 복수면 복수공단길 37, (용진리 115-18) (풍국타올) | 070-7602-7895 | <NA> | 비금속소재 | 우레탄 수지매트 | 설진웅 | 2020-02-10
8 | (주)고구려식품 | 추부면 서대산로 180 | 041-753-6311 | <NA> | 음식료 | 돈육 | 손준호 | 2020-02-10
9 | (주)광성화학 | 추부면 미삭길 20 | 041-752-6000 | <NA> | 비금속소재 | 중질탄산칼슘 | 김광래 | 2020-02-10

Index | 회사명 | 주소 | 전화번호 | 팩스번호 | 업종 | 생산품 | 대표명 | 데이터기준일자
728 | 현창 I&D | 충청남도 금산군 복수면 복수공단길 64, (용진리 115-31) ((주)우리금풍) | 041-753-1310 | <NA> | 비금속소재 | 인조대리석 | 김현일 | 2020-02-10
729 | 홍도고려홍삼공사 | 충청남도 금산군 남일면 홍도1길 83 | <NA> | <NA> | 음식료 | 홍삼액, 홍삼절편 | 김호 | 2020-02-10
730 | 홍삼랜드 | 충청남도 금산군 금산읍 와정길 58 (청정인삼) | <NA> | <NA> | 음식료 | 홍삼정과, 홍삼액 | 강원구 | 2020-02-10
731 | 홍원바이오아그로 | 충청남도 금산군 추부면 추풍로 344 (유원물류창고) | <NA> | <NA> | 음식료 | 동물사료및조제식품 | 김정화 | 2020-02-10
732 | 화림산업사 | 충청남도 금산군 추부면 자부리 177번지 | 041-754-7870 | <NA> | 섬유의복 | 자수제품,자수용재료 | 강흠구 | 2020-02-10
733 | 효성산업 | 충청남도 금산군 복수면 구례리 332-4번지 외 2필지 | <NA> | <NA> | 석유·화학 | PE 비닐 | 이원우 | 2020-02-10
734 | 효성하나로(주) | 충청남도 금산군 추부면 동당말길 13 | 041-752-9955 | <NA> | 기타 | 프라이팬 | 김수진 | 2020-02-10
735 | 효원엔지니어링 | 충청남도 금산군 추부면 다복로 721-5, (마전리 784번지) | 042-670-6900 | <NA> | 기계 | 가공공작기계 | 안종철 | 2020-02-10
736 | 후드코리아 | 충청남도 금산군 복수면 학평길 23-5 | <NA> | <NA> | 음식료 | 한약제엑기스, 배.사과과즙 | 전문길 | 2020-02-10
737 | 흥농산업 | 충청남도 금산군 금성면 상가리 252 번지 | 041-753-8906 | <NA> | 섬유의복 | 차광망 | 김광임 | 2020-02-10

Duplicate rows

Most frequently occurring

Index | 회사명 | 주소 | 전화번호 | 팩스번호 | 업종 | 생산품 | 대표명 | 데이터기준일자 | # duplicates
0 | 지앤비패키지 | 충청남도 금산군 복수면 복수공단길 40-6, (용진리 115-8) (대륙화학공업물류창고) | 041-753-7327 | <NA> | 목재종이출판 | 종이상자 | 최원문 | 2020-02-10 | 2
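
A row counts as a duplicate when every field matches another row. The sketch below shows one way to find such rows with the standard library. The toy list repeats the duplicated record reported above and adds one other record from the sample, each reduced to three fields for brevity; it is an illustration, not the profiler's implementation.

```python
from collections import Counter

# Each row is a tuple so it can be used as a dictionary key.
rows = [
    ("지앤비패키지", "041-753-7327", "종이상자"),
    ("지앤비패키지", "041-753-7327", "종이상자"),
    ("효성하나로(주)", "041-752-9955", "프라이팬"),
]

row_counts = Counter(rows)

# Keep only the rows that occur more than once, with their occurrence count.
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, "occurs", n, "times")
```

This matches the report's reading: one distinct row is duplicated, and it occurs 2 times.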