Overview

Dataset statistics

Number of variables8
Number of observations8723
Missing cells6401
Missing cells (%)9.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory553.8 KiB
Average record size in memory65.0 B

Variable types

Numeric1
Categorical2
Text5

Dataset

Description전북특별자치도 시군별 제조업체 현황입니다. (지역, 사업체명, 대표자 성명, 본사 소재지, 업종코드 2자리, 주생산품, 주소 등)
Author전북특별자치도
URLhttps://www.data.go.kr/data/15010992/fileData.do

Alerts

연번 is highly overall correlated with 지역High correlation
지역 is highly overall correlated with 연번High correlation
본사소재지 is highly imbalanced (84.9%)Imbalance
홈페이지 has 6396 (73.3%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 12:38:32.280423
Analysis finished2024-03-14 12:38:36.045864
Duration3.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct8723
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4362
Minimum1
Maximum8723
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size76.8 KiB
2024-03-14T21:38:36.267926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile437.1
Q12181.5
median4362
Q36542.5
95-th percentile8286.9
Maximum8723
Range8722
Interquartile range (IQR)4361

Descriptive statistics

Standard deviation2518.2575
Coefficient of variation (CV)0.57731718
Kurtosis-1.2
Mean4362
Median Absolute Deviation (MAD)2181
Skewness0
Sum38049726
Variance6341621
MonotonicityStrictly increasing
2024-03-14T21:38:36.698580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
5820 1
 
< 0.1%
5814 1
 
< 0.1%
5815 1
 
< 0.1%
5816 1
 
< 0.1%
5817 1
 
< 0.1%
5818 1
 
< 0.1%
5819 1
 
< 0.1%
5821 1
 
< 0.1%
5727 1
 
< 0.1%
Other values (8713) 8713
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
8723 1
< 0.1%
8722 1
< 0.1%
8721 1
< 0.1%
8720 1
< 0.1%
8719 1
< 0.1%
8718 1
< 0.1%
8717 1
< 0.1%
8716 1
< 0.1%
8715 1
< 0.1%
8714 1
< 0.1%

지역
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size68.3 KiB
전주
1818 
익산
1706 
군산
1447 
완주
894 
김제
841 
Other values (9)
2017 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전주
2nd row전주
3rd row전주
4th row전주
5th row전주

Common Values

ValueCountFrequency (%)
전주 1818
20.8%
익산 1706
19.6%
군산 1447
16.6%
완주 894
10.2%
김제 841
9.6%
정읍 569
 
6.5%
남원 362
 
4.1%
고창 244
 
2.8%
부안 238
 
2.7%
진안 161
 
1.8%
Other values (4) 443
 
5.1%

Length

2024-03-14T21:38:37.098860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전주 1818
20.8%
익산 1706
19.6%
군산 1447
16.6%
완주 894
10.2%
김제 841
9.6%
정읍 569
 
6.5%
남원 362
 
4.1%
고창 244
 
2.8%
부안 238
 
2.7%
진안 161
 
1.8%
Other values (4) 443
 
5.1%
Distinct8494
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size68.3 KiB
2024-03-14T21:38:38.367212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length23
Mean length6.9291528
Min length1

Characters and Unicode

Total characters60443
Distinct characters839
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8283 ?
Unique (%)95.0%

Sample

1st row켈리의푸드부띠끄
2nd row샤뽀
3rd row㈜신산이
4th row서동푸드협동조합
5th row가온디자인
ValueCountFrequency (%)
주식회사 293
 
2.9%
농업회사법인 209
 
2.1%
유한회사 146
 
1.4%
영농조합법인 73
 
0.7%
2공장 24
 
0.2%
18
 
0.2%
제2공장 16
 
0.2%
익산공장 16
 
0.2%
15
 
0.1%
농업회사법인(유 14
 
0.1%
Other values (8692) 9256
91.8%
2024-03-14T21:38:40.290875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2950
 
4.9%
) 1898
 
3.1%
( 1896
 
3.1%
1706
 
2.8%
1637
 
2.7%
1511
 
2.5%
1447
 
2.4%
1431
 
2.4%
1188
 
2.0%
1158
 
1.9%
Other values (829) 43621
72.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51095
84.5%
Other Symbol 2950
 
4.9%
Close Punctuation 1898
 
3.1%
Open Punctuation 1896
 
3.1%
Space Separator 1511
 
2.5%
Uppercase Letter 470
 
0.8%
Decimal Number 261
 
0.4%
Connector Punctuation 153
 
0.3%
Lowercase Letter 68
 
0.1%
Other Punctuation 55
 
0.1%
Other values (2) 86
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1706
 
3.3%
1637
 
3.2%
1447
 
2.8%
1431
 
2.8%
1188
 
2.3%
1158
 
2.3%
1146
 
2.2%
1105
 
2.2%
1001
 
2.0%
894
 
1.7%
Other values (764) 38382
75.1%
Uppercase Letter
ValueCountFrequency (%)
E 53
11.3%
G 53
11.3%
N 50
 
10.6%
S 42
 
8.9%
C 33
 
7.0%
J 24
 
5.1%
K 23
 
4.9%
M 22
 
4.7%
T 21
 
4.5%
B 18
 
3.8%
Other values (15) 131
27.9%
Lowercase Letter
ValueCountFrequency (%)
e 8
11.8%
t 7
10.3%
k 6
 
8.8%
s 5
 
7.4%
h 5
 
7.4%
c 5
 
7.4%
n 5
 
7.4%
a 4
 
5.9%
i 4
 
5.9%
l 3
 
4.4%
Other values (8) 16
23.5%
Decimal Number
ValueCountFrequency (%)
2 141
54.0%
1 65
24.9%
3 26
 
10.0%
6 7
 
2.7%
8 7
 
2.7%
5 5
 
1.9%
4 4
 
1.5%
0 4
 
1.5%
9 2
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 29
52.7%
& 17
30.9%
, 5
 
9.1%
/ 2
 
3.6%
# 1
 
1.8%
* 1
 
1.8%
Other Symbol
ValueCountFrequency (%)
2950
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1898
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1896
100.0%
Space Separator
ValueCountFrequency (%)
1511
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 153
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%
Control
ValueCountFrequency (%)
33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54045
89.4%
Common 5860
 
9.7%
Latin 538
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2950
 
5.5%
1706
 
3.2%
1637
 
3.0%
1447
 
2.7%
1431
 
2.6%
1188
 
2.2%
1158
 
2.1%
1146
 
2.1%
1105
 
2.0%
1001
 
1.9%
Other values (765) 39276
72.7%
Latin
ValueCountFrequency (%)
E 53
 
9.9%
G 53
 
9.9%
N 50
 
9.3%
S 42
 
7.8%
C 33
 
6.1%
J 24
 
4.5%
K 23
 
4.3%
M 22
 
4.1%
T 21
 
3.9%
B 18
 
3.3%
Other values (33) 199
37.0%
Common
ValueCountFrequency (%)
) 1898
32.4%
( 1896
32.4%
1511
25.8%
_ 153
 
2.6%
2 141
 
2.4%
1 65
 
1.1%
- 53
 
0.9%
33
 
0.6%
. 29
 
0.5%
3 26
 
0.4%
Other values (11) 55
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51095
84.5%
ASCII 6398
 
10.6%
None 2950
 
4.9%

Most frequent character per block

None
ValueCountFrequency (%)
2950
100.0%
ASCII
ValueCountFrequency (%)
) 1898
29.7%
( 1896
29.6%
1511
23.6%
_ 153
 
2.4%
2 141
 
2.2%
1 65
 
1.0%
- 53
 
0.8%
E 53
 
0.8%
G 53
 
0.8%
N 50
 
0.8%
Other values (54) 525
 
8.2%
Hangul
ValueCountFrequency (%)
1706
 
3.3%
1637
 
3.2%
1447
 
2.8%
1431
 
2.8%
1188
 
2.3%
1158
 
2.3%
1146
 
2.2%
1105
 
2.2%
1001
 
2.0%
894
 
1.7%
Other values (764) 38382
75.1%
Distinct7011
Distinct (%)80.4%
Missing0
Missing (%)0.0%
Memory size68.3 KiB
2024-03-14T21:38:41.652812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length3
Mean length3.1388284
Min length2

Characters and Unicode

Total characters27380
Distinct characters349
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5814 ?
Unique (%)66.7%

Sample

1st row이해영
2nd row조현종
3rd row김수석
4th row정병호
5th row송훈배
ValueCountFrequency (%)
김정숙 11
 
0.1%
김성훈 9
 
0.1%
김경희 7
 
0.1%
이현주 7
 
0.1%
이성민 7
 
0.1%
박상규 7
 
0.1%
유태호 6
 
0.1%
김민수 6
 
0.1%
김정희 6
 
0.1%
김정호 6
 
0.1%
Other values (7021) 8769
99.2%
2024-03-14T21:38:43.466329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1908
 
7.0%
1306
 
4.8%
929
 
3.4%
704
 
2.6%
681
 
2.5%
597
 
2.2%
500
 
1.8%
482
 
1.8%
465
 
1.7%
459
 
1.7%
Other values (339) 19349
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26863
98.1%
Space Separator 189
 
0.7%
Other Punctuation 187
 
0.7%
Control 64
 
0.2%
Close Punctuation 30
 
0.1%
Open Punctuation 30
 
0.1%
Lowercase Letter 9
 
< 0.1%
Decimal Number 6
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1908
 
7.1%
1306
 
4.9%
929
 
3.5%
704
 
2.6%
681
 
2.5%
597
 
2.2%
500
 
1.9%
482
 
1.8%
465
 
1.7%
459
 
1.7%
Other values (323) 18832
70.1%
Lowercase Letter
ValueCountFrequency (%)
n 2
22.2%
a 2
22.2%
g 2
22.2%
d 1
11.1%
r 1
11.1%
e 1
11.1%
Other Punctuation
ValueCountFrequency (%)
, 181
96.8%
/ 5
 
2.7%
. 1
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
J 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
189
100.0%
Control
ValueCountFrequency (%)
64
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Decimal Number
ValueCountFrequency (%)
1 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26863
98.1%
Common 506
 
1.8%
Latin 11
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1908
 
7.1%
1306
 
4.9%
929
 
3.5%
704
 
2.6%
681
 
2.5%
597
 
2.2%
500
 
1.9%
482
 
1.8%
465
 
1.7%
459
 
1.7%
Other values (323) 18832
70.1%
Common
ValueCountFrequency (%)
189
37.4%
, 181
35.8%
64
 
12.6%
) 30
 
5.9%
( 30
 
5.9%
1 6
 
1.2%
/ 5
 
1.0%
. 1
 
0.2%
Latin
ValueCountFrequency (%)
n 2
18.2%
a 2
18.2%
g 2
18.2%
J 1
9.1%
d 1
9.1%
r 1
9.1%
e 1
9.1%
B 1
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26863
98.1%
ASCII 517
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1908
 
7.1%
1306
 
4.9%
929
 
3.5%
704
 
2.6%
681
 
2.5%
597
 
2.2%
500
 
1.9%
482
 
1.8%
465
 
1.7%
459
 
1.7%
Other values (323) 18832
70.1%
ASCII
ValueCountFrequency (%)
189
36.6%
, 181
35.0%
64
 
12.4%
) 30
 
5.8%
( 30
 
5.8%
1 6
 
1.2%
/ 5
 
1.0%
n 2
 
0.4%
a 2
 
0.4%
g 2
 
0.4%
Other values (6) 6
 
1.2%

본사소재지
Categorical

IMBALANCE 

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size68.3 KiB
전북
8063 
서울
 
196
경기
 
184
인천
 
47
광주
 
37
Other values (12)
 
196

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row전북
2nd row전북
3rd row전북
4th row전북
5th row전북

Common Values

ValueCountFrequency (%)
전북 8063
92.4%
서울 196
 
2.2%
경기 184
 
2.1%
인천 47
 
0.5%
광주 37
 
0.4%
충남 37
 
0.4%
전남 31
 
0.4%
대전 28
 
0.3%
경남 22
 
0.3%
경북 18
 
0.2%
Other values (7) 60
 
0.7%

Length

2024-03-14T21:38:43.692414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전북 8063
92.4%
서울 196
 
2.2%
경기 184
 
2.1%
인천 47
 
0.5%
광주 37
 
0.4%
충남 37
 
0.4%
전남 31
 
0.4%
대전 28
 
0.3%
경남 22
 
0.3%
경북 18
 
0.2%
Other values (7) 60
 
0.7%
Distinct6184
Distinct (%)70.9%
Missing5
Missing (%)0.1%
Memory size68.3 KiB
2024-03-14T21:38:44.763090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length37
Mean length7.6533609
Min length1

Characters and Unicode

Total characters66722
Distinct characters835
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5381 ?
Unique (%)61.7%

Sample

1st row고추장,간장,된장,과일청
2nd row모자
3rd row신재생에너지
4th row전통한과
5th row주방용 및 음식점용 목재가구
ValueCountFrequency (%)
308
 
2.3%
자동차부품 176
 
1.3%
제조 110
 
0.8%
제조업 87
 
0.7%
간판 83
 
0.6%
부품 75
 
0.6%
광고물 72
 
0.5%
레미콘 70
 
0.5%
곡물도정 70
 
0.5%
인쇄 66
 
0.5%
Other values (6647) 12010
91.5%
2024-03-14T21:38:46.100949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4490
 
6.7%
, 3443
 
5.2%
1845
 
2.8%
1557
 
2.3%
1491
 
2.2%
1363
 
2.0%
1159
 
1.7%
1044
 
1.6%
964
 
1.4%
931
 
1.4%
Other values (825) 48435
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56448
84.6%
Space Separator 4490
 
6.7%
Other Punctuation 3482
 
5.2%
Uppercase Letter 1022
 
1.5%
Lowercase Letter 568
 
0.9%
Open Punctuation 304
 
0.5%
Close Punctuation 303
 
0.5%
Decimal Number 68
 
0.1%
Control 25
 
< 0.1%
Dash Punctuation 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1845
 
3.3%
1557
 
2.8%
1491
 
2.6%
1363
 
2.4%
1159
 
2.1%
1044
 
1.8%
964
 
1.7%
931
 
1.6%
889
 
1.6%
865
 
1.5%
Other values (755) 44340
78.6%
Uppercase Letter
ValueCountFrequency (%)
E 139
13.6%
C 138
13.5%
D 116
11.4%
L 113
11.1%
P 108
10.6%
V 70
6.8%
T 61
 
6.0%
S 46
 
4.5%
R 34
 
3.3%
O 28
 
2.7%
Other values (16) 169
16.5%
Lowercase Letter
ValueCountFrequency (%)
c 90
15.8%
p 65
11.4%
v 48
8.5%
t 47
 
8.3%
e 47
 
8.3%
o 45
 
7.9%
r 33
 
5.8%
s 29
 
5.1%
l 24
 
4.2%
d 22
 
3.9%
Other values (13) 118
20.8%
Decimal Number
ValueCountFrequency (%)
1 19
27.9%
4 12
17.6%
3 9
13.2%
2 9
13.2%
0 7
 
10.3%
8 3
 
4.4%
6 3
 
4.4%
9 3
 
4.4%
7 2
 
2.9%
5 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 3443
98.9%
. 23
 
0.7%
/ 13
 
0.4%
* 1
 
< 0.1%
· 1
 
< 0.1%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
4490
100.0%
Open Punctuation
ValueCountFrequency (%)
( 304
100.0%
Close Punctuation
ValueCountFrequency (%)
) 303
100.0%
Control
ValueCountFrequency (%)
25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56447
84.6%
Common 8684
 
13.0%
Latin 1590
 
2.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1845
 
3.3%
1557
 
2.8%
1491
 
2.6%
1363
 
2.4%
1159
 
2.1%
1044
 
1.8%
964
 
1.7%
931
 
1.6%
889
 
1.6%
865
 
1.5%
Other values (754) 44339
78.5%
Latin
ValueCountFrequency (%)
E 139
 
8.7%
C 138
 
8.7%
D 116
 
7.3%
L 113
 
7.1%
P 108
 
6.8%
c 90
 
5.7%
V 70
 
4.4%
p 65
 
4.1%
T 61
 
3.8%
v 48
 
3.0%
Other values (39) 642
40.4%
Common
ValueCountFrequency (%)
4490
51.7%
, 3443
39.6%
( 304
 
3.5%
) 303
 
3.5%
25
 
0.3%
. 23
 
0.3%
1 19
 
0.2%
/ 13
 
0.1%
4 12
 
0.1%
- 12
 
0.1%
Other values (11) 40
 
0.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56446
84.6%
ASCII 10273
 
15.4%
None 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4490
43.7%
, 3443
33.5%
( 304
 
3.0%
) 303
 
2.9%
E 139
 
1.4%
C 138
 
1.3%
D 116
 
1.1%
L 113
 
1.1%
P 108
 
1.1%
c 90
 
0.9%
Other values (59) 1029
 
10.0%
Hangul
ValueCountFrequency (%)
1845
 
3.3%
1557
 
2.8%
1491
 
2.6%
1363
 
2.4%
1159
 
2.1%
1044
 
1.8%
964
 
1.7%
931
 
1.6%
889
 
1.6%
865
 
1.5%
Other values (753) 44338
78.5%
None
ValueCountFrequency (%)
· 1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct7773
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size68.3 KiB
2024-03-14T21:38:46.848796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length41
Mean length13.775651
Min length7

Characters and Unicode

Total characters120165
Distinct characters500
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7091 ?
Unique (%)81.3%

Sample

1st row전주시완산구우전1길65-9
2nd row전주시완산구전동성당길8
3rd row전주시덕진구비석날로59-11(여의동)
4th row전주시완산구장승배기8길10-10
5th row전주시완산구바람쇠는길131-9
ValueCountFrequency (%)
완주군봉동읍완주산단6로224 10
 
0.1%
군산시동가도길50 9
 
0.1%
군산시외항로864 7
 
0.1%
군산시서수면상장곤윗길66 7
 
0.1%
전주시덕진구상리로45 6
 
0.1%
완주군봉동읍완주산단6로266 6
 
0.1%
익산시석암로1길41 6
 
0.1%
전주시덕진구유상로67 6
 
0.1%
김제시백산면대동공단2길34-32 6
 
0.1%
진안군진안읍거북바위로3길15-31 6
 
0.1%
Other values (7767) 8658
99.2%
2024-03-14T21:38:47.902056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6890
 
5.7%
1 6886
 
5.7%
5865
 
4.9%
5054
 
4.2%
2 4876
 
4.1%
4032
 
3.4%
3 3626
 
3.0%
3495
 
2.9%
3256
 
2.7%
3027
 
2.5%
Other values (490) 73158
60.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83109
69.2%
Decimal Number 31907
 
26.6%
Dash Punctuation 2973
 
2.5%
Other Punctuation 823
 
0.7%
Close Punctuation 626
 
0.5%
Open Punctuation 624
 
0.5%
Uppercase Letter 75
 
0.1%
Other Symbol 11
 
< 0.1%
Math Symbol 8
 
< 0.1%
Space Separator 4
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6890
 
8.3%
5865
 
7.1%
5054
 
6.1%
4032
 
4.9%
3495
 
4.2%
3256
 
3.9%
3027
 
3.6%
2360
 
2.8%
2047
 
2.5%
2005
 
2.4%
Other values (452) 45078
54.2%
Uppercase Letter
ValueCountFrequency (%)
B 26
34.7%
A 17
22.7%
F 12
16.0%
C 4
 
5.3%
G 3
 
4.0%
T 3
 
4.0%
I 2
 
2.7%
L 2
 
2.7%
D 2
 
2.7%
M 1
 
1.3%
Other values (3) 3
 
4.0%
Decimal Number
ValueCountFrequency (%)
1 6886
21.6%
2 4876
15.3%
3 3626
11.4%
4 2953
9.3%
5 2713
 
8.5%
6 2471
 
7.7%
0 2387
 
7.5%
7 2204
 
6.9%
8 1937
 
6.1%
9 1854
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 798
97.0%
. 17
 
2.1%
/ 6
 
0.7%
: 2
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 7
87.5%
> 1
 
12.5%
Lowercase Letter
ValueCountFrequency (%)
b 2
66.7%
a 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 2973
100.0%
Close Punctuation
ValueCountFrequency (%)
) 626
100.0%
Open Punctuation
ValueCountFrequency (%)
( 624
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83120
69.2%
Common 36967
30.8%
Latin 78
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6890
 
8.3%
5865
 
7.1%
5054
 
6.1%
4032
 
4.9%
3495
 
4.2%
3256
 
3.9%
3027
 
3.6%
2360
 
2.8%
2047
 
2.5%
2005
 
2.4%
Other values (453) 45089
54.2%
Common
ValueCountFrequency (%)
1 6886
18.6%
2 4876
13.2%
3 3626
9.8%
- 2973
8.0%
4 2953
8.0%
5 2713
 
7.3%
6 2471
 
6.7%
0 2387
 
6.5%
7 2204
 
6.0%
8 1937
 
5.2%
Other values (12) 3941
10.7%
Latin
ValueCountFrequency (%)
B 26
33.3%
A 17
21.8%
F 12
15.4%
C 4
 
5.1%
G 3
 
3.8%
T 3
 
3.8%
I 2
 
2.6%
L 2
 
2.6%
D 2
 
2.6%
b 2
 
2.6%
Other values (5) 5
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83107
69.2%
ASCII 37045
30.8%
None 11
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6890
 
8.3%
5865
 
7.1%
5054
 
6.1%
4032
 
4.9%
3495
 
4.2%
3256
 
3.9%
3027
 
3.6%
2360
 
2.8%
2047
 
2.5%
2005
 
2.4%
Other values (450) 45076
54.2%
ASCII
ValueCountFrequency (%)
1 6886
18.6%
2 4876
13.2%
3 3626
9.8%
- 2973
8.0%
4 2953
8.0%
5 2713
 
7.3%
6 2471
 
6.7%
0 2387
 
6.4%
7 2204
 
5.9%
8 1937
 
5.2%
Other values (27) 4019
10.8%
None
ValueCountFrequency (%)
11
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

홈페이지
Text

MISSING 

Distinct2068
Distinct (%)88.9%
Missing6396
Missing (%)73.3%
Memory size68.3 KiB
2024-03-14T21:38:48.651169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length33
Mean length14.784272
Min length1

Characters and Unicode

Total characters34403
Distinct characters274
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1860 ?
Unique (%)79.9%

Sample

1st rowwww.luielle.com
2nd rowwww.sinsane.com
3rd rowwww.interbita.com
4th rowdscontrol.co.kr
5th rowastabio.co.kr
ValueCountFrequency (%)
harim.com 6
 
0.3%
horyong.co.kr 6
 
0.3%
www.auk.co.kr 5
 
0.2%
www.aceglass.co.kr 5
 
0.2%
www.samwonship.com 4
 
0.2%
www.chamgoeul.com 4
 
0.2%
미응답 4
 
0.2%
www.em-solar.co.kr 3
 
0.1%
aisttech.co.kr 3
 
0.1%
leiko.net 3
 
0.1%
Other values (2057) 2282
98.2%
2024-03-14T21:38:49.963724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 4426
12.9%
o 3682
 
10.7%
w 3295
 
9.6%
c 2688
 
7.8%
r 2090
 
6.1%
e 1716
 
5.0%
n 1695
 
4.9%
k 1694
 
4.9%
m 1679
 
4.9%
a 1572
 
4.6%
Other values (264) 9866
28.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 28304
82.3%
Other Punctuation 4701
 
13.7%
Decimal Number 576
 
1.7%
Other Letter 491
 
1.4%
Dash Punctuation 176
 
0.5%
Uppercase Letter 111
 
0.3%
Space Separator 27
 
0.1%
Control 15
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
3.3%
11
 
2.2%
10
 
2.0%
9
 
1.8%
8
 
1.6%
8
 
1.6%
7
 
1.4%
7
 
1.4%
7
 
1.4%
7
 
1.4%
Other values (196) 401
81.7%
Lowercase Letter
ValueCountFrequency (%)
o 3682
13.0%
w 3295
11.6%
c 2688
 
9.5%
r 2090
 
7.4%
e 1716
 
6.1%
n 1695
 
6.0%
k 1694
 
6.0%
m 1679
 
5.9%
a 1572
 
5.6%
s 1057
 
3.7%
Other values (16) 7136
25.2%
Uppercase Letter
ValueCountFrequency (%)
C 15
13.5%
O 14
12.6%
E 9
 
8.1%
K 9
 
8.1%
S 7
 
6.3%
T 7
 
6.3%
R 7
 
6.3%
A 7
 
6.3%
N 6
 
5.4%
M 6
 
5.4%
Other values (11) 24
21.6%
Decimal Number
ValueCountFrequency (%)
0 107
18.6%
1 97
16.8%
2 89
15.5%
3 62
10.8%
6 40
 
6.9%
7 40
 
6.9%
4 39
 
6.8%
5 35
 
6.1%
9 35
 
6.1%
8 32
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 4426
94.2%
/ 202
 
4.3%
: 44
 
0.9%
@ 13
 
0.3%
, 8
 
0.2%
? 7
 
0.1%
* 1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 176
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%
Control
ValueCountFrequency (%)
15
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 28415
82.6%
Common 5497
 
16.0%
Hangul 491
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
3.3%
11
 
2.2%
10
 
2.0%
9
 
1.8%
8
 
1.6%
8
 
1.6%
7
 
1.4%
7
 
1.4%
7
 
1.4%
7
 
1.4%
Other values (196) 401
81.7%
Latin
ValueCountFrequency (%)
o 3682
13.0%
w 3295
11.6%
c 2688
 
9.5%
r 2090
 
7.4%
e 1716
 
6.0%
n 1695
 
6.0%
k 1694
 
6.0%
m 1679
 
5.9%
a 1572
 
5.5%
s 1057
 
3.7%
Other values (37) 7247
25.5%
Common
ValueCountFrequency (%)
. 4426
80.5%
/ 202
 
3.7%
- 176
 
3.2%
0 107
 
1.9%
1 97
 
1.8%
2 89
 
1.6%
3 62
 
1.1%
: 44
 
0.8%
6 40
 
0.7%
7 40
 
0.7%
Other values (11) 214
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 33912
98.6%
Hangul 491
 
1.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 4426
13.1%
o 3682
 
10.9%
w 3295
 
9.7%
c 2688
 
7.9%
r 2090
 
6.2%
e 1716
 
5.1%
n 1695
 
5.0%
k 1694
 
5.0%
m 1679
 
5.0%
a 1572
 
4.6%
Other values (58) 9375
27.6%
Hangul
ValueCountFrequency (%)
16
 
3.3%
11
 
2.2%
10
 
2.0%
9
 
1.8%
8
 
1.6%
8
 
1.6%
7
 
1.4%
7
 
1.4%
7
 
1.4%
7
 
1.4%
Other values (196) 401
81.7%

Interactions

2024-03-14T21:38:34.817524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T21:38:50.220908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역본사소재지
연번1.0000.9270.162
지역0.9271.0000.148
본사소재지0.1620.1481.000
2024-03-14T21:38:50.457497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본사소재지지역
본사소재지1.0000.051
지역0.0511.000
2024-03-14T21:38:50.688670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역본사소재지
연번1.0000.7280.063
지역0.7281.0000.051
본사소재지0.0630.0511.000

Missing values

2024-03-14T21:38:35.159647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T21:38:35.583354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T21:38:35.895657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번지역사업체명대표자 성명본사소재지주생산품주소홈페이지
01전주켈리의푸드부띠끄이해영전북고추장,간장,된장,과일청전주시완산구우전1길65-9<NA>
12전주샤뽀조현종전북모자전주시완산구전동성당길8www.luielle.com
23전주㈜신산이김수석전북신재생에너지전주시덕진구비석날로59-11(여의동)www.sinsane.com
34전주서동푸드협동조합정병호전북전통한과전주시완산구장승배기8길10-10<NA>
45전주가온디자인송훈배전북주방용 및 음식점용 목재가구전주시완산구바람쇠는길131-9<NA>
56전주한일판넬전형기전북금속제창문전주시덕진구초포다리로169<NA>
67전주유비월드정선주전북인조손톱전주시덕진구청복길90<NA>
78전주인터비타김정숙전북리모컨, 키보드전주시완산구천잠로303,ㅡ벤처창업관314호www.interbita.com
89전주동서콘트롤㈜박춘경전북전장부품전주시덕진구구렛들1길20dscontrol.co.kr
910전주㈜모아에코텍유원석전북LID저영향기법개발전주시덕진구명주6길15<NA>
연번지역사업체명대표자 성명본사소재지주생산품주소홈페이지
87138714부안농업회사법인(유)도원미곡김준영전북백미,잡곡부안군동진면간척길127<NA>
87148715부안케이헬스푸드 주식회사김형조전북홍삼액기스등부안군행안면옥여길32-22<NA>
87158716부안비제이산업최윤하전북콘크리트공사용 옹벽블록부안군보안면남포리337-1<NA>
87168717부안떡내음가득 시루떡이윤정전북떡류부안군진서면청자로955<NA>
87178718부안유한회사 한길수산정치영전북생선가공품부안군줄포면신리로313-12<NA>
87188719부안농업회사법인 유한회사 슬지제빵소김슬지전북팥앙금.빵류,만두부안군진서면청자로1076zzinbbang.kr
87198720부안미구가강미구전북양파즙,사과즙,배즙부안군행안면행안중앙로93miguga.com
87208721부안농업회사법인 케이유팜스김의숙전북무청가공,표고버섯부안군상서면유정길18-41<NA>
87218722부안(주)이예스산업이현주전북디자인형울타리부안군보안면농공단지길33-9<NA>
87228723부안농업회사법인유한회사피오레문요환전북빵,과자류부안군주산면주산동로88