Overview

Dataset statistics

Number of variables6
Number of observations5913
Missing cells419
Missing cells (%)1.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory283.1 KiB
Average record size in memory49.0 B

Variable types

Text5
Numeric1

Dataset

Description(재)장애인기업종합지원센터 장애인기업 등록 현황 정보 * 공공구매종합정보망(www.smpp.go.kr)에서 데이터 추출 ** 제공 자료: 업체명, 사업자등록번호, 업종, 주요생산품목, 주소, 회사연락처
Author(재)장애인기업종합지원센터
URLhttps://www.data.go.kr/data/15035258/fileData.do

Alerts

주요 생산품목 has 138 (2.3%) missing valuesMissing
소재지 has 143 (2.4%) missing valuesMissing
전화번호 has 138 (2.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 21:30:17.647717
Analysis finished2023-12-12 21:30:19.582874
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct5804
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size46.3 KiB
2023-12-13T06:30:19.883175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length7.3649586
Min length1

Characters and Unicode

Total characters43549
Distinct characters827
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5704 ?
Unique (%)96.5%

Sample

1st row(주)용봉
2nd row수창커튼
3rd row(주)고원공간정보
4th row(주)기원전자
5th row(주)지정건설
ValueCountFrequency (%)
주식회사 936
 
12.7%
59
 
0.8%
유한회사 45
 
0.6%
농업회사법인 26
 
0.4%
건축사사무소 20
 
0.3%
코리아 7
 
0.1%
시스템 6
 
0.1%
6
 
0.1%
tech 6
 
0.1%
이레플랫폼 5
 
0.1%
Other values (6044) 6262
84.9%
2023-12-13T06:30:20.418425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2970
 
6.8%
1904
 
4.4%
) 1865
 
4.3%
( 1842
 
4.2%
1505
 
3.5%
1398
 
3.2%
1299
 
3.0%
1201
 
2.8%
901
 
2.1%
531
 
1.2%
Other values (817) 28133
64.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36655
84.2%
Close Punctuation 1867
 
4.3%
Open Punctuation 1844
 
4.2%
Space Separator 1505
 
3.5%
Uppercase Letter 852
 
2.0%
Lowercase Letter 628
 
1.4%
Other Punctuation 88
 
0.2%
Decimal Number 87
 
0.2%
Dash Punctuation 21
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2970
 
8.1%
1904
 
5.2%
1398
 
3.8%
1299
 
3.5%
1201
 
3.3%
901
 
2.5%
531
 
1.4%
530
 
1.4%
490
 
1.3%
487
 
1.3%
Other values (744) 24944
68.1%
Lowercase Letter
ValueCountFrequency (%)
e 84
13.4%
o 55
 
8.8%
a 49
 
7.8%
t 49
 
7.8%
n 48
 
7.6%
c 45
 
7.2%
i 36
 
5.7%
r 36
 
5.7%
s 33
 
5.3%
l 26
 
4.1%
Other values (15) 167
26.6%
Uppercase Letter
ValueCountFrequency (%)
E 81
 
9.5%
N 73
 
8.6%
C 64
 
7.5%
S 63
 
7.4%
A 56
 
6.6%
T 52
 
6.1%
O 49
 
5.8%
L 45
 
5.3%
M 45
 
5.3%
G 44
 
5.2%
Other values (15) 280
32.9%
Decimal Number
ValueCountFrequency (%)
1 22
25.3%
2 18
20.7%
3 12
13.8%
5 11
12.6%
4 8
 
9.2%
0 7
 
8.0%
8 3
 
3.4%
6 3
 
3.4%
9 3
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 45
51.1%
& 37
42.0%
/ 3
 
3.4%
· 2
 
2.3%
@ 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 1865
99.9%
] 1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1842
99.9%
[ 1
 
0.1%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
1505
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36651
84.2%
Common 5412
 
12.4%
Latin 1480
 
3.4%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2970
 
8.1%
1904
 
5.2%
1398
 
3.8%
1299
 
3.5%
1201
 
3.3%
901
 
2.5%
531
 
1.4%
530
 
1.4%
490
 
1.3%
487
 
1.3%
Other values (739) 24940
68.0%
Latin
ValueCountFrequency (%)
e 84
 
5.7%
E 81
 
5.5%
N 73
 
4.9%
C 64
 
4.3%
S 63
 
4.3%
A 56
 
3.8%
o 55
 
3.7%
T 52
 
3.5%
a 49
 
3.3%
O 49
 
3.3%
Other values (40) 854
57.7%
Common
ValueCountFrequency (%)
) 1865
34.5%
( 1842
34.0%
1505
27.8%
. 45
 
0.8%
& 37
 
0.7%
1 22
 
0.4%
- 21
 
0.4%
2 18
 
0.3%
3 12
 
0.2%
5 11
 
0.2%
Other values (12) 34
 
0.6%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36649
84.2%
ASCII 6888
 
15.8%
None 6
 
< 0.1%
CJK 6
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2970
 
8.1%
1904
 
5.2%
1398
 
3.8%
1299
 
3.5%
1201
 
3.3%
901
 
2.5%
531
 
1.4%
530
 
1.4%
490
 
1.3%
487
 
1.3%
Other values (738) 24938
68.0%
ASCII
ValueCountFrequency (%)
) 1865
27.1%
( 1842
26.7%
1505
21.8%
e 84
 
1.2%
E 81
 
1.2%
N 73
 
1.1%
C 64
 
0.9%
S 63
 
0.9%
A 56
 
0.8%
o 55
 
0.8%
Other values (59) 1200
17.4%
None
ValueCountFrequency (%)
2
33.3%
· 2
33.3%
1
16.7%
1
16.7%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

사업자등록번호
Real number (ℝ)

Distinct5834
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.9569398 × 109
Minimum1.0101918 × 109
Maximum8.9942003 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size52.1 KiB
2023-12-13T06:30:20.539367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0101918 × 109
5-th percentile1.1402584 × 109
Q12.0219618 × 109
median3.8986017 × 109
Q35.7818009 × 109
95-th percentile8.1093614 × 109
Maximum8.9942003 × 109
Range7.9840085 × 109
Interquartile range (IQR)3.7598391 × 109

Descriptive statistics

Standard deviation2.2236806 × 109
Coefficient of variation (CV)0.56196979
Kurtosis-0.8988995
Mean3.9569398 × 109
Median Absolute Deviation (MAD)1.8777497 × 109
Skewness0.40792053
Sum2.3397385 × 1013
Variance4.9447555 × 1018
MonotonicityNot monotonic
2023-12-13T06:30:20.652740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4160225874 2
 
< 0.1%
2404100595 2
 
< 0.1%
1348161444 2
 
< 0.1%
4588602123 2
 
< 0.1%
5488100194 2
 
< 0.1%
5038183417 2
 
< 0.1%
1298146325 2
 
< 0.1%
3974300347 2
 
< 0.1%
4128117251 2
 
< 0.1%
1891500587 2
 
< 0.1%
Other values (5824) 5893
99.7%
ValueCountFrequency (%)
1010191776 1
< 0.1%
1010524085 1
< 0.1%
1010546852 1
< 0.1%
1010633223 1
< 0.1%
1010771068 1
< 0.1%
1010953822 1
< 0.1%
1011041337 1
< 0.1%
1011053740 1
< 0.1%
1011194511 1
< 0.1%
1011561809 1
< 0.1%
ValueCountFrequency (%)
8994200278 1
< 0.1%
8988601550 1
< 0.1%
8987500138 1
< 0.1%
8984300044 1
< 0.1%
8983700311 1
< 0.1%
8980101927 1
< 0.1%
8978801323 1
< 0.1%
8978800760 1
< 0.1%
8978700273 1
< 0.1%
8978601877 1
< 0.1%
Distinct657
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size46.3 KiB
2023-12-13T06:30:20.909366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length225
Median length140
Mean length11.006765
Min length2

Characters and Unicode

Total characters65083
Distinct characters186
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique481 ?
Unique (%)8.1%

Sample

1st row도매 및 소매업
2nd row도매 및 소매업
3rd row전문 과학 및 기술 서비스업
4th row그 밖의 제품 제조업
5th row건설업
ValueCountFrequency (%)
3337
18.0%
제조업 1801
 
9.7%
소매업 1118
 
6.0%
도매 1098
 
5.9%
건설업 988
 
5.3%
서비스업 922
 
5.0%
기타 305
 
1.6%
기술 274
 
1.5%
밖의 270
 
1.5%
223
 
1.2%
Other values (573) 8201
44.2%
2023-12-13T06:30:21.264078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12625
19.4%
7927
 
12.2%
3337
 
5.1%
3286
 
5.0%
2993
 
4.6%
2417
 
3.7%
1609
 
2.5%
1506
 
2.3%
1467
 
2.3%
1455
 
2.2%
Other values (176) 26461
40.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 50644
77.8%
Space Separator 12625
 
19.4%
Math Symbol 1288
 
2.0%
Other Punctuation 173
 
0.3%
Open Punctuation 169
 
0.3%
Close Punctuation 169
 
0.3%
Decimal Number 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7927
 
15.7%
3337
 
6.6%
3286
 
6.5%
2993
 
5.9%
2417
 
4.8%
1609
 
3.2%
1506
 
3.0%
1467
 
2.9%
1455
 
2.9%
1417
 
2.8%
Other values (168) 23230
45.9%
Other Punctuation
ValueCountFrequency (%)
; 99
57.2%
/ 70
40.5%
, 4
 
2.3%
Space Separator
ValueCountFrequency (%)
12625
100.0%
Math Symbol
ValueCountFrequency (%)
| 1288
100.0%
Open Punctuation
ValueCountFrequency (%)
( 169
100.0%
Close Punctuation
ValueCountFrequency (%)
) 169
100.0%
Decimal Number
ValueCountFrequency (%)
1 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 50518
77.6%
Common 14439
 
22.2%
Han 126
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7927
 
15.7%
3337
 
6.6%
3286
 
6.5%
2993
 
5.9%
2417
 
4.8%
1609
 
3.2%
1506
 
3.0%
1467
 
2.9%
1455
 
2.9%
1417
 
2.8%
Other values (166) 23104
45.7%
Common
ValueCountFrequency (%)
12625
87.4%
| 1288
 
8.9%
( 169
 
1.2%
) 169
 
1.2%
; 99
 
0.7%
/ 70
 
0.5%
1 15
 
0.1%
, 4
 
< 0.1%
Han
ValueCountFrequency (%)
63
50.0%
63
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 50472
77.6%
ASCII 14439
 
22.2%
CJK 126
 
0.2%
Compat Jamo 46
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12625
87.4%
| 1288
 
8.9%
( 169
 
1.2%
) 169
 
1.2%
; 99
 
0.7%
/ 70
 
0.5%
1 15
 
0.1%
, 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
7927
 
15.7%
3337
 
6.6%
3286
 
6.5%
2993
 
5.9%
2417
 
4.8%
1609
 
3.2%
1506
 
3.0%
1467
 
2.9%
1455
 
2.9%
1417
 
2.8%
Other values (165) 23058
45.7%
CJK
ValueCountFrequency (%)
63
50.0%
63
50.0%
Compat Jamo
ValueCountFrequency (%)
46
100.0%

주요 생산품목
Text

MISSING 

Distinct3202
Distinct (%)55.4%
Missing138
Missing (%)2.3%
Memory size46.3 KiB
2023-12-13T06:30:21.458532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length94
Median length54
Mean length8.1428571
Min length1

Characters and Unicode

Total characters47025
Distinct characters738
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2681 ?
Unique (%)46.4%

Sample

1st row도.소매
2nd row커튼 및 천막
3rd row측량
4th row컨트롤러 센서 스위치 외 전장품
5th row건설업
ValueCountFrequency (%)
284
 
3.5%
건물청소서비스 124
 
1.5%
기타인쇄물 121
 
1.5%
전기공사 82
 
1.0%
건축공사 68
 
0.8%
도소매 65
 
0.8%
건설업 63
 
0.8%
간판 55
 
0.7%
시설물유지관리공사 52
 
0.6%
46
 
0.6%
Other values (3795) 7215
88.3%
2023-12-13T06:30:21.777478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2476
 
5.3%
1669
 
3.5%
1215
 
2.6%
1200
 
2.6%
1112
 
2.4%
1056
 
2.2%
1003
 
2.1%
945
 
2.0%
837
 
1.8%
827
 
1.8%
Other values (728) 34685
73.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43217
91.9%
Space Separator 2476
 
5.3%
Uppercase Letter 442
 
0.9%
Other Punctuation 401
 
0.9%
Lowercase Letter 155
 
0.3%
Open Punctuation 107
 
0.2%
Close Punctuation 105
 
0.2%
Decimal Number 105
 
0.2%
Dash Punctuation 9
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1669
 
3.9%
1215
 
2.8%
1200
 
2.8%
1112
 
2.6%
1056
 
2.4%
1003
 
2.3%
945
 
2.2%
837
 
1.9%
827
 
1.9%
779
 
1.8%
Other values (660) 32574
75.4%
Lowercase Letter
ValueCountFrequency (%)
c 22
14.2%
t 19
12.3%
v 10
 
6.5%
r 10
 
6.5%
a 9
 
5.8%
u 9
 
5.8%
s 9
 
5.8%
o 8
 
5.2%
i 8
 
5.2%
e 7
 
4.5%
Other values (14) 44
28.4%
Uppercase Letter
ValueCountFrequency (%)
D 94
21.3%
E 82
18.6%
L 81
18.3%
P 28
 
6.3%
C 28
 
6.3%
R 14
 
3.2%
S 14
 
3.2%
V 13
 
2.9%
I 12
 
2.7%
T 12
 
2.7%
Other values (10) 64
14.5%
Decimal Number
ValueCountFrequency (%)
1 30
28.6%
2 19
18.1%
3 17
16.2%
0 12
 
11.4%
7 8
 
7.6%
9 7
 
6.7%
5 5
 
4.8%
8 3
 
2.9%
6 3
 
2.9%
4 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 210
52.4%
/ 153
38.2%
· 31
 
7.7%
& 3
 
0.7%
, 2
 
0.5%
' 1
 
0.2%
% 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
| 6
85.7%
+ 1
 
14.3%
Space Separator
ValueCountFrequency (%)
2476
100.0%
Open Punctuation
ValueCountFrequency (%)
( 107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 105
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43212
91.9%
Common 3211
 
6.8%
Latin 597
 
1.3%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1669
 
3.9%
1215
 
2.8%
1200
 
2.8%
1112
 
2.6%
1056
 
2.4%
1003
 
2.3%
945
 
2.2%
837
 
1.9%
827
 
1.9%
779
 
1.8%
Other values (657) 32569
75.4%
Latin
ValueCountFrequency (%)
D 94
15.7%
E 82
13.7%
L 81
13.6%
P 28
 
4.7%
C 28
 
4.7%
c 22
 
3.7%
t 19
 
3.2%
R 14
 
2.3%
S 14
 
2.3%
V 13
 
2.2%
Other values (34) 202
33.8%
Common
ValueCountFrequency (%)
2476
77.1%
. 210
 
6.5%
/ 153
 
4.8%
( 107
 
3.3%
) 105
 
3.3%
· 31
 
1.0%
1 30
 
0.9%
2 19
 
0.6%
3 17
 
0.5%
0 12
 
0.4%
Other values (14) 51
 
1.6%
Han
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43210
91.9%
ASCII 3777
 
8.0%
None 31
 
0.1%
CJK 5
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2476
65.6%
. 210
 
5.6%
/ 153
 
4.1%
( 107
 
2.8%
) 105
 
2.8%
D 94
 
2.5%
E 82
 
2.2%
L 81
 
2.1%
1 30
 
0.8%
P 28
 
0.7%
Other values (57) 411
 
10.9%
Hangul
ValueCountFrequency (%)
1669
 
3.9%
1215
 
2.8%
1200
 
2.8%
1112
 
2.6%
1056
 
2.4%
1003
 
2.3%
945
 
2.2%
837
 
1.9%
827
 
1.9%
779
 
1.8%
Other values (655) 32567
75.4%
None
ValueCountFrequency (%)
· 31
100.0%
CJK
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

소재지
Text

MISSING 

Distinct5738
Distinct (%)99.4%
Missing143
Missing (%)2.4%
Memory size46.3 KiB
2023-12-13T06:30:22.081709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length53
Mean length28.286482
Min length12

Characters and Unicode

Total characters163213
Distinct characters764
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5711 ?
Unique (%)99.0%

Sample

1st row광주광역시 북구 저불로73번길 6-3앞쪽정문 (용봉동)
2nd row광주광역시 서구 천변좌로 243양동 나2-51 (양동)
3rd row전라북도 전주시 덕진구 무삼지로 214층
4th row전라북도 익산시 보석로6길 109(신흥동 740-33)
5th row전라북도 전주시 완산구 선너머3길 9
ValueCountFrequency (%)
경기도 1287
 
4.3%
서울특별시 837
 
2.8%
경상북도 395
 
1.3%
전라북도 306
 
1.0%
인천광역시 278
 
0.9%
충청남도 275
 
0.9%
전라남도 273
 
0.9%
경상남도 272
 
0.9%
부산광역시 266
 
0.9%
강원도 245
 
0.8%
Other values (12853) 25245
85.1%
2023-12-13T06:30:22.557226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24198
 
14.8%
1 7174
 
4.4%
5382
 
3.3%
4937
 
3.0%
2 4840
 
3.0%
4246
 
2.6%
3815
 
2.3%
3661
 
2.2%
3 3626
 
2.2%
0 3470
 
2.1%
Other values (754) 97864
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 99869
61.2%
Decimal Number 31303
 
19.2%
Space Separator 24198
 
14.8%
Close Punctuation 2726
 
1.7%
Open Punctuation 2724
 
1.7%
Dash Punctuation 1744
 
1.1%
Uppercase Letter 458
 
0.3%
Lowercase Letter 106
 
0.1%
Other Punctuation 65
 
< 0.1%
Math Symbol 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5382
 
5.4%
4937
 
4.9%
4246
 
4.3%
3815
 
3.8%
3661
 
3.7%
2858
 
2.9%
2305
 
2.3%
2262
 
2.3%
1995
 
2.0%
1867
 
1.9%
Other values (683) 66541
66.6%
Uppercase Letter
ValueCountFrequency (%)
B 111
24.2%
A 72
15.7%
C 33
 
7.2%
T 23
 
5.0%
S 23
 
5.0%
I 20
 
4.4%
L 20
 
4.4%
D 19
 
4.1%
E 18
 
3.9%
F 18
 
3.9%
Other values (14) 101
22.1%
Lowercase Letter
ValueCountFrequency (%)
b 13
12.3%
t 12
11.3%
n 9
8.5%
e 9
8.5%
i 9
8.5%
o 8
 
7.5%
c 7
 
6.6%
s 6
 
5.7%
a 6
 
5.7%
y 5
 
4.7%
Other values (10) 22
20.8%
Decimal Number
ValueCountFrequency (%)
1 7174
22.9%
2 4840
15.5%
3 3626
11.6%
0 3470
11.1%
4 2696
 
8.6%
5 2438
 
7.8%
6 2087
 
6.7%
7 1808
 
5.8%
8 1665
 
5.3%
9 1499
 
4.8%
Other Punctuation
ValueCountFrequency (%)
. 40
61.5%
/ 7
 
10.8%
7
 
10.8%
· 5
 
7.7%
& 2
 
3.1%
# 2
 
3.1%
: 1
 
1.5%
' 1
 
1.5%
Close Punctuation
ValueCountFrequency (%)
) 2725
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2723
> 99.9%
[ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 18
94.7%
= 1
 
5.3%
Space Separator
ValueCountFrequency (%)
24198
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1744
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 99869
61.2%
Common 62779
38.5%
Latin 565
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5382
 
5.4%
4937
 
4.9%
4246
 
4.3%
3815
 
3.8%
3661
 
3.7%
2858
 
2.9%
2305
 
2.3%
2262
 
2.3%
1995
 
2.0%
1867
 
1.9%
Other values (683) 66541
66.6%
Latin
ValueCountFrequency (%)
B 111
19.6%
A 72
 
12.7%
C 33
 
5.8%
T 23
 
4.1%
S 23
 
4.1%
I 20
 
3.5%
L 20
 
3.5%
D 19
 
3.4%
E 18
 
3.2%
F 18
 
3.2%
Other values (35) 208
36.8%
Common
ValueCountFrequency (%)
24198
38.5%
1 7174
 
11.4%
2 4840
 
7.7%
3 3626
 
5.8%
0 3470
 
5.5%
) 2725
 
4.3%
( 2723
 
4.3%
4 2696
 
4.3%
5 2438
 
3.9%
6 2087
 
3.3%
Other values (16) 6802
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 99868
61.2%
ASCII 63331
38.8%
None 12
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24198
38.2%
1 7174
 
11.3%
2 4840
 
7.6%
3 3626
 
5.7%
0 3470
 
5.5%
) 2725
 
4.3%
( 2723
 
4.3%
4 2696
 
4.3%
5 2438
 
3.8%
6 2087
 
3.3%
Other values (58) 7354
 
11.6%
Hangul
ValueCountFrequency (%)
5382
 
5.4%
4937
 
4.9%
4246
 
4.3%
3815
 
3.8%
3661
 
3.7%
2858
 
2.9%
2305
 
2.3%
2262
 
2.3%
1995
 
2.0%
1867
 
1.9%
Other values (682) 66540
66.6%
None
ValueCountFrequency (%)
7
58.3%
· 5
41.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct5581
Distinct (%)96.6%
Missing138
Missing (%)2.3%
Memory size46.3 KiB
2023-12-13T06:30:22.802019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.997056
Min length9

Characters and Unicode

Total characters69283
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5410 ?
Unique (%)93.7%

Sample

1st row062-266-4556
2nd row062-368-9653
3rd row063-243-0329
4th row063-833-0111
5th row063-273-9122
ValueCountFrequency (%)
070-7721-4442 6
 
0.1%
064-732-7500 5
 
0.1%
054-751-9666 5
 
0.1%
042-255-2200 4
 
0.1%
070-7422-0550 4
 
0.1%
054-275-9926 3
 
0.1%
063-221-9956 3
 
0.1%
053-801-8600 3
 
0.1%
052-292-6150 3
 
0.1%
031-926-2790 3
 
0.1%
Other values (5571) 5736
99.3%
2023-12-13T06:30:23.216022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 11540
16.7%
0 10444
15.1%
3 6885
9.9%
2 6473
9.3%
5 6043
8.7%
1 5781
8.3%
4 5415
7.8%
7 4839
7.0%
6 4799
6.9%
8 4002
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 57743
83.3%
Dash Punctuation 11540
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10444
18.1%
3 6885
11.9%
2 6473
11.2%
5 6043
10.5%
1 5781
10.0%
4 5415
9.4%
7 4839
8.4%
6 4799
8.3%
8 4002
 
6.9%
9 3062
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 11540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 69283
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 11540
16.7%
0 10444
15.1%
3 6885
9.9%
2 6473
9.3%
5 6043
8.7%
1 5781
8.3%
4 5415
7.8%
7 4839
7.0%
6 4799
6.9%
8 4002
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 69283
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 11540
16.7%
0 10444
15.1%
3 6885
9.9%
2 6473
9.3%
5 6043
8.7%
1 5781
8.3%
4 5415
7.8%
7 4839
7.0%
6 4799
6.9%
8 4002
 
5.8%

Interactions

2023-12-13T06:30:19.108813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T06:30:19.267606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:30:19.380400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T06:30:19.506536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명사업자등록번호주업종주요 생산품목소재지전화번호
0(주)용봉4098612070도매 및 소매업도.소매광주광역시 북구 저불로73번길 6-3앞쪽정문 (용봉동)062-266-4556
1수창커튼4100815875도매 및 소매업커튼 및 천막광주광역시 서구 천변좌로 243양동 나2-51 (양동)062-368-9653
2(주)고원공간정보1238137487전문 과학 및 기술 서비스업측량전라북도 전주시 덕진구 무삼지로 214층063-243-0329
3(주)기원전자4038101231그 밖의 제품 제조업컨트롤러 센서 스위치 외 전장품전라북도 익산시 보석로6길 109(신흥동 740-33)063-833-0111
4(주)지정건설5108122010건설업건설업전라북도 전주시 완산구 선너머3길 9063-273-9122
5노나건축설비2932500513전기 가스 증기 및 수도사업가스시공업상하수도공사업인테리어창호공사업경기도 시흥시 수인로3335번길 14-1지층031-404-6704
6에스휴먼 주식회사2148735623사업시설관리 및 사업지원 서비스업아웃소싱 컨설팅경기도 평택시 서정북로 872층031-668-9114
7예진산업1281473394제조업목재가구 및 부품경기도 파주시 파평면 파산서원길 320나동031-944-3040
8이수공영4059860516보건업 및 사회복지 서비스업장례및묘지관련용역전라북도 익산시 오산면 선화2로 7676070-7765-1117
9(유)비엔에스테크4028154910건설업정보통신공사전라북도 완주군 상관면 춘향로 4845-20063-241-6037
업체명사업자등록번호주업종주요 생산품목소재지전화번호
5903주식회사 세이프퀴슬1388156619전자부품 컴퓨터 영상 음향 및 통신장비 제조업<NA><NA><NA>
5904서원당2241660018제조업<NA><NA><NA>
5905JUNEandSL2220869930생활/주방<NA><NA><NA>
5906늘푸름4220200989기타<NA><NA><NA>
5907알파에프엔텍1330127311전기기계 제조업<NA><NA><NA>
5908홀그레인호밀농장3758700619식품/생필품<NA><NA><NA>
5909근우테크주식회사5158132906그 밖의 기계 및 장비 제조업<NA><NA><NA>
5910참빛파워텍(주)6178602756건설업|전기장비 제조업<NA><NA><NA>
5911이지엠테크(주)1348161444제조업<NA><NA><NA>
5912휴먼디자인1293864652제조업<NA><NA><NA>