Overview

Dataset statistics

Number of variables: 7
Number of observations: 763
Missing cells: 128
Missing cells (%): 2.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 42.6 KiB
Average record size in memory: 57.2 B
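The percentage and per-record figures in this table can be rechecked from the raw counts. The following is a minimal sketch, assuming that "Missing cells (%)" is taken over all cells (rows times variables) and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics table.
n_variables = 7
n_observations = 763
missing_cells = 128
total_size_kib = 42.6

# Missing cells (%) = missing cells / (rows x variables).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size = total size in bytes / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Both derived values round to the figures the report prints (2.4% and 57.2 B).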

Variable types

Text: 6
Numeric: 1

Dataset

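Description: Provides data on the status of factory registrations within Yeoju City. The fields provided are company name (회사명), factory main address by road name (공장대표주소(도로명)), factory main address by lot number (공장대표주소(지번)), industry code (업종코드), industry name (업종명), phone number (전화번호), and products (생산품).
Author: Yeoju City, Gyeonggi Province (경기도 여주시)
URL: https://www.data.go.kr/data/3065498/fileData.do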

Alerts

전화번호 (phone number) has 127 (16.6%) missing values (alert type: Missing)

Reproduction

Analysis started: 2024-04-20 20:10:04.894126
Analysis finished: 2024-04-20 20:10:06.911964
Duration: 2.02 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

회사명
Text

Distinct: 752
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB

Length

Max length: 21
Median length: 17
Mean length: 6.8754915
Min length: 1
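As a rough illustration of how such length statistics arise, the sketch below computes them for five company names taken from this report's sample rows. It assumes that length means the number of Unicode code points, which is what Python's `len()` returns; it also rechecks the reported mean from the reported total character count (5246) and row count (763).

```python
from statistics import mean, median

# Five values from the sample rows of this variable.
sample = ["주식회사 털보농산", "영진공업사", "힐링하우징", "청원농산", "(주)삼영텍스타일"]

# len() counts code points, so each precomposed Hangul syllable is one character.
lengths = [len(s) for s in sample]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(lengths, stats)

# The reported mean length equals total characters divided by the number of rows.
reported_mean = 5246 / 763
print(round(reported_mean, 7))
```

The five-row figures differ from the full-column figures, as expected for such a small subset.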

Characters and Unicode

Total characters: 5246
Distinct characters: 439
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
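A minimal sketch of this kind of character analysis, using only Python's standard `unicodedata` module on one company name from this report. The standard library exposes the general category but not the script or block properties, so the Hangul check below uses the Hangul Syllables code point range (U+AC00 to U+D7A3) as a stand-in; this is an illustration, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

text = "(주)삼영텍스타일"  # one value from this report's sample rows

# General category per code point: "Lo" = Other Letter,
# "Ps" = Open Punctuation, "Pe" = Close Punctuation.
categories = Counter(unicodedata.category(ch) for ch in text)

# Stand-in for a block lookup: count characters in the Hangul Syllables range.
hangul_syllables = sum(1 for ch in text if "\uac00" <= ch <= "\ud7a3")

print(dict(categories))
print("Hangul syllables:", hangul_syllables)
```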

Unique

Unique: 741
Unique (%): 97.1%
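"Distinct" and "Unique" are different counts. The sketch below assumes that distinct means the number of different values and unique means the number of values that occur exactly once; the helper name `distinct_and_unique` and the toy list are made up for illustration. Under that assumption, the reported figures for this variable (763 rows, 752 distinct, 741 unique, no missing values) are consistent with 11 values that each appear twice.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy input, made up for illustration only.
toy = ["A", "B", "B", "C", "C", "D"]
toy_distinct, toy_unique = distinct_and_unique(toy)
print(toy_distinct, toy_unique)

# Figures reported for this variable.
n_rows, n_distinct, n_unique = 763, 752, 741
repeated_values = n_distinct - n_unique  # values that occur more than once
rows_in_repeats = n_rows - n_unique      # rows taken up by those values
print(repeated_values, rows_in_repeats)
```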

Sample

1st row: 주식회사 털보농산
2nd row: 영진공업사
3rd row: 힐링하우징
4th row: 청원농산
5th row: (주)삼영텍스타일
Value | Count | Frequency (%)
주식회사 | 40 | 4.7%
농업회사법인 | 20 | 2.3%
여주공장 | 5 | 0.6%
주)수리산업 | 3 | 0.4%
여주지점 | 3 | 0.4%
맹가노니시스템 | 2 | 0.2%
제3공장 | 2 | 0.2%
산림조합중앙회 | 2 | 0.2%
성진요업 | 2 | 0.2%
케이엠씨밸브(주 | 2 | 0.2%
Other values (765) | 775 | 90.5%

Most occurring characters

ValueCountFrequency (%)
444
 
8.5%
) 372
 
7.1%
( 372
 
7.1%
154
 
2.9%
122
 
2.3%
100
 
1.9%
100
 
1.9%
94
 
1.8%
94
 
1.8%
87
 
1.7%
Other values (429) 3307
63.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4322 | 82.4%
Close Punctuation | 372 | 7.1%
Open Punctuation | 372 | 7.1%
Space Separator | 94 | 1.8%
Uppercase Letter | 46 | 0.9%
Lowercase Letter | 16 | 0.3%
Other Punctuation | 13 | 0.2%
Decimal Number | 10 | 0.2%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
444
 
10.3%
154
 
3.6%
122
 
2.8%
100
 
2.3%
100
 
2.3%
94
 
2.2%
87
 
2.0%
86
 
2.0%
71
 
1.6%
61
 
1.4%
Other values (389) 3003
69.5%
Uppercase Letter
ValueCountFrequency (%)
A 6
13.0%
P 4
 
8.7%
R 3
 
6.5%
D 3
 
6.5%
N 3
 
6.5%
G 3
 
6.5%
S 3
 
6.5%
B 3
 
6.5%
H 3
 
6.5%
F 2
 
4.3%
Other values (9) 13
28.3%
Lowercase Letter
ValueCountFrequency (%)
e 5
31.2%
a 2
 
12.5%
b 1
 
6.2%
l 1
 
6.2%
z 1
 
6.2%
i 1
 
6.2%
j 1
 
6.2%
v 1
 
6.2%
w 1
 
6.2%
n 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
. 8
61.5%
& 4
30.8%
, 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 4
40.0%
2 4
40.0%
3 2
20.0%
Close Punctuation
ValueCountFrequency (%)
) 372
100.0%
Open Punctuation
ValueCountFrequency (%)
( 372
100.0%
Space Separator
ValueCountFrequency (%)
94
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4322 | 82.4%
Common | 862 | 16.4%
Latin | 62 | 1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
444
 
10.3%
154
 
3.6%
122
 
2.8%
100
 
2.3%
100
 
2.3%
94
 
2.2%
87
 
2.0%
86
 
2.0%
71
 
1.6%
61
 
1.4%
Other values (389) 3003
69.5%
Latin
ValueCountFrequency (%)
A 6
 
9.7%
e 5
 
8.1%
P 4
 
6.5%
R 3
 
4.8%
D 3
 
4.8%
N 3
 
4.8%
G 3
 
4.8%
S 3
 
4.8%
B 3
 
4.8%
H 3
 
4.8%
Other values (20) 26
41.9%
Common
ValueCountFrequency (%)
) 372
43.2%
( 372
43.2%
94
 
10.9%
. 8
 
0.9%
1 4
 
0.5%
2 4
 
0.5%
& 4
 
0.5%
3 2
 
0.2%
, 1
 
0.1%
- 1
 
0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4322 | 82.4%
ASCII | 924 | 17.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
444
 
10.3%
154
 
3.6%
122
 
2.8%
100
 
2.3%
100
 
2.3%
94
 
2.2%
87
 
2.0%
86
 
2.0%
71
 
1.6%
61
 
1.4%
Other values (389) 3003
69.5%
ASCII
ValueCountFrequency (%)
) 372
40.3%
( 372
40.3%
94
 
10.2%
. 8
 
0.9%
A 6
 
0.6%
e 5
 
0.5%
1 4
 
0.4%
P 4
 
0.4%
2 4
 
0.4%
& 4
 
0.4%
Other values (30) 51
 
5.5%
공장대표주소(도로명)
Text

Distinct: 713
Distinct (%): 93.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB

Length

Max length: 38
Median length: 26
Mean length: 19.353866
Min length: 13

Characters and Unicode

Total characters: 14767
Distinct characters: 179
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 667
Unique (%): 87.4%

Sample

1st row: 경기도 여주시 가남읍 가남로 105
2nd row: 경기도 여주시 가남읍 가남로 114
3rd row: 경기도 여주시 가남읍 가남로 153
4th row: 경기도 여주시 가남읍 가남로 230
5th row: 경기도 여주시 가남읍 가남로 298
Value | Count | Frequency (%)
경기도 | 763 | 21.1%
여주시 | 763 | 21.1%
가남읍 | 147 | 4.1%
북내면 | 101 | 2.8%
점동면 | 69 | 1.9%
세종대왕면 | 62 | 1.7%
대신면 | 57 | 1.6%
흥천면 | 46 | 1.3%
여양로 | 43 | 1.2%
강천면 | 40 | 1.1%
Other values (737) | 1528 | 42.2%

Most occurring characters

ValueCountFrequency (%)
2856
19.3%
938
 
6.4%
825
 
5.6%
806
 
5.5%
783
 
5.3%
773
 
5.2%
763
 
5.2%
1 536
 
3.6%
489
 
3.3%
408
 
2.8%
Other values (169) 5590
37.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8905 | 60.3%
Space Separator | 2856 | 19.3%
Decimal Number | 2690 | 18.2%
Dash Punctuation | 291 | 2.0%
Other Punctuation | 8 | 0.1%
Open Punctuation | 6 | < 0.1%
Close Punctuation | 6 | < 0.1%
Uppercase Letter | 3 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
938
 
10.5%
825
 
9.3%
806
 
9.1%
783
 
8.8%
773
 
8.7%
763
 
8.6%
489
 
5.5%
408
 
4.6%
275
 
3.1%
220
 
2.5%
Other values (151) 2625
29.5%
Decimal Number
ValueCountFrequency (%)
1 536
19.9%
2 356
13.2%
3 312
11.6%
4 279
10.4%
5 245
9.1%
6 218
8.1%
8 209
 
7.8%
7 195
 
7.2%
0 192
 
7.1%
9 148
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 2
66.7%
A 1
33.3%
Space Separator
ValueCountFrequency (%)
2856
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 291
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8905 | 60.3%
Common | 5859 | 39.7%
Latin | 3 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
938
 
10.5%
825
 
9.3%
806
 
9.1%
783
 
8.8%
773
 
8.7%
763
 
8.6%
489
 
5.5%
408
 
4.6%
275
 
3.1%
220
 
2.5%
Other values (151) 2625
29.5%
Common
ValueCountFrequency (%)
2856
48.7%
1 536
 
9.1%
2 356
 
6.1%
3 312
 
5.3%
- 291
 
5.0%
4 279
 
4.8%
5 245
 
4.2%
6 218
 
3.7%
8 209
 
3.6%
7 195
 
3.3%
Other values (6) 362
 
6.2%
Latin
ValueCountFrequency (%)
B 2
66.7%
A 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8905 | 60.3%
ASCII | 5862 | 39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2856
48.7%
1 536
 
9.1%
2 356
 
6.1%
3 312
 
5.3%
- 291
 
5.0%
4 279
 
4.8%
5 245
 
4.2%
6 218
 
3.7%
8 209
 
3.6%
7 195
 
3.3%
Other values (8) 365
 
6.2%
Hangul
ValueCountFrequency (%)
938
 
10.5%
825
 
9.3%
806
 
9.1%
783
 
8.8%
773
 
8.7%
763
 
8.6%
489
 
5.5%
408
 
4.6%
275
 
3.1%
220
 
2.5%
Other values (151) 2625
29.5%
공장대표주소(지번)
Text

Distinct: 708
Distinct (%): 92.8%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB

Length

Max length: 24
Median length: 23
Mean length: 19.627785
Min length: 14

Characters and Unicode

Total characters: 14976
Distinct characters: 132
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 657
Unique (%): 86.1%

Sample

1st row: 경기도 여주시 가남읍 하귀리 351-4
2nd row: 경기도 여주시 가남읍 하귀리 346
3rd row: 경기도 여주시 가남읍 하귀리 241-3
4th row: 경기도 여주시 가남읍 하귀리 산 43-2
5th row: 경기도 여주시 가남읍 양귀리 146
Value | Count | Frequency (%)
경기도 | 763 | 21.1%
여주시 | 763 | 21.1%
가남읍 | 147 | 4.1%
북내면 | 101 | 2.8%
점동면 | 69 | 1.9%
세종대왕면 | 62 | 1.7%
대신면 | 57 | 1.6%
흥천면 | 46 | 1.3%
현암동 | 45 | 1.2%
강천면 | 40 | 1.1%
Other values (795) | 1526 | 42.2%

Most occurring characters

ValueCountFrequency (%)
2856
19.1%
770
 
5.1%
767
 
5.1%
763
 
5.1%
763
 
5.1%
763
 
5.1%
763
 
5.1%
- 609
 
4.1%
555
 
3.7%
1 497
 
3.3%
Other values (122) 5870
39.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8638 | 57.7%
Decimal Number | 2873 | 19.2%
Space Separator | 2856 | 19.1%
Dash Punctuation | 609 | 4.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
770
 
8.9%
767
 
8.9%
763
 
8.8%
763
 
8.8%
763
 
8.8%
763
 
8.8%
555
 
6.4%
408
 
4.7%
277
 
3.2%
178
 
2.1%
Other values (110) 2631
30.5%
Decimal Number
ValueCountFrequency (%)
1 497
17.3%
2 434
15.1%
3 387
13.5%
5 296
10.3%
4 273
9.5%
6 256
8.9%
7 217
7.6%
8 194
 
6.8%
0 165
 
5.7%
9 154
 
5.4%
Space Separator
ValueCountFrequency (%)
2856
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 609
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8638 | 57.7%
Common | 6338 | 42.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
770
 
8.9%
767
 
8.9%
763
 
8.8%
763
 
8.8%
763
 
8.8%
763
 
8.8%
555
 
6.4%
408
 
4.7%
277
 
3.2%
178
 
2.1%
Other values (110) 2631
30.5%
Common
ValueCountFrequency (%)
2856
45.1%
- 609
 
9.6%
1 497
 
7.8%
2 434
 
6.8%
3 387
 
6.1%
5 296
 
4.7%
4 273
 
4.3%
6 256
 
4.0%
7 217
 
3.4%
8 194
 
3.1%
Other values (2) 319
 
5.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8638 | 57.7%
ASCII | 6338 | 42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2856
45.1%
- 609
 
9.6%
1 497
 
7.8%
2 434
 
6.8%
3 387
 
6.1%
5 296
 
4.7%
4 273
 
4.3%
6 256
 
4.0%
7 217
 
3.4%
8 194
 
3.1%
Other values (2) 319
 
5.0%
Hangul
ValueCountFrequency (%)
770
 
8.9%
767
 
8.9%
763
 
8.8%
763
 
8.8%
763
 
8.8%
763
 
8.8%
555
 
6.4%
408
 
4.7%
277
 
3.2%
178
 
2.1%
Other values (110) 2631
30.5%

대표업종번호
Real number (ℝ)

Distinct: 203
Distinct (%): 26.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22054.379
Minimum: 10121
Maximum: 38311
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.8 KiB

Quantile statistics

Minimum: 10121
5th percentile: 10611
Q1: 20317.5
Median: 23221
Q3: 25112
95th percentile: 32011
Maximum: 38311
Range: 28190
Interquartile range (IQR): 4794.5
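The two derived rows in this table follow directly from the others: the range is the maximum minus the minimum, and the IQR is Q3 minus Q1. A small sketch with the reported values (the variable names are illustrative):

```python
# Quantiles reported for 대표업종번호.
minimum, q1, q3, maximum = 10121, 20317.5, 25112, 38311

value_range = maximum - minimum  # spread between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values

print(f"range = {value_range}, IQR = {iqr}")
```

Because this column holds industry classification codes rather than measurements, these spreads describe how the codes are distributed numerically and should be read with that in mind.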

Descriptive statistics

Standard deviation: 6145.8501
Coefficient of variation (CV): 0.27866802
Kurtosis: -0.17610896
Mean: 22054.379
Median Absolute Deviation (MAD): 1892
Skewness: -0.53188519
Sum: 16827491
Variance: 37771473
Monotonicity: Not monotonic
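Several of these statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below rechecks the reported values under those standard definitions; small differences are expected because the report rounds to about eight significant figures.

```python
# Reported values for 대표업종번호 (763 observations, none missing).
n = 763
total = 16827491
std = 6145.8501

mean = total / n      # arithmetic mean
cv = std / mean       # coefficient of variation
variance = std ** 2   # variance as the square of the standard deviation

print(f"mean     = {mean:.3f}")
print(f"CV       = {cv:.8f}")
print(f"variance = {variance:.0f}")
```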
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
23221 | 173 | 22.7%
23325 | 23 | 3.0%
23324 | 20 | 2.6%
25113 | 17 | 2.2%
10611 | 17 | 2.2%
25111 | 14 | 1.8%
23229 | 14 | 1.8%
22223 | 13 | 1.7%
10742 | 11 | 1.4%
23322 | 11 | 1.4%
Other values (193) | 450 | 59.0%

Minimum 10 values

Value | Count | Frequency (%)
10121 | 2 | 0.3%
10122 | 2 | 0.3%
10129 | 2 | 0.3%
10219 | 1 | 0.1%
10220 | 2 | 0.3%
10301 | 5 | 0.7%
10302 | 1 | 0.1%
10309 | 10 | 1.3%
10402 | 1 | 0.1%
10403 | 2 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
38311 | 2 | 0.3%
34011 | 2 | 0.3%
33999 | 4 | 0.5%
33993 | 1 | 0.1%
33992 | 1 | 0.1%
33933 | 3 | 0.4%
33932 | 1 | 0.1%
33910 | 3 | 0.4%
33309 | 2 | 0.3%
33302 | 3 | 0.4%

업종명
Text

Distinct: 310
Distinct (%): 40.6%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB

Length

Max length: 33
Median length: 29
Mean length: 17.480996
Min length: 5

Characters and Unicode

Total characters: 13338
Distinct characters: 271
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 205
Unique (%): 26.9%

Sample

1st row: 김치류 제조업 외 1 종
2nd row: 탭, 밸브 및 유사장치 제조업
3rd row: 기타 구조용 금속제품 제조업
4th row: 천막, 텐트 및 유사 제품 제조업 외 1 종
5th row: 편조원단 제조업
ValueCountFrequency (%)
제조업 691
16.3%
434
 
10.2%
291
 
6.8%
252
 
5.9%
도자기 193
 
4.5%
가정용 178
 
4.2%
기타 175
 
4.1%
장식용 173
 
4.1%
1 153
 
3.6%
콘크리트 71
 
1.7%
Other values (379) 1641
38.6%

Most occurring characters

ValueCountFrequency (%)
3489
26.2%
880
 
6.6%
813
 
6.1%
790
 
5.9%
521
 
3.9%
499
 
3.7%
435
 
3.3%
297
 
2.2%
256
 
1.9%
254
 
1.9%
Other values (261) 5104
38.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9452 | 70.9%
Space Separator | 3489 | 26.2%
Decimal Number | 261 | 2.0%
Other Punctuation | 128 | 1.0%
Close Punctuation | 4 | < 0.1%
Open Punctuation | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
880
 
9.3%
813
 
8.6%
790
 
8.4%
521
 
5.5%
499
 
5.3%
435
 
4.6%
297
 
3.1%
256
 
2.7%
254
 
2.7%
246
 
2.6%
Other values (247) 4461
47.2%
Decimal Number
ValueCountFrequency (%)
1 162
62.1%
2 43
 
16.5%
3 23
 
8.8%
4 12
 
4.6%
5 7
 
2.7%
9 5
 
1.9%
7 4
 
1.5%
6 3
 
1.1%
8 2
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 125
97.7%
. 3
 
2.3%
Space Separator
ValueCountFrequency (%)
3489
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9452 | 70.9%
Common | 3886 | 29.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
880
 
9.3%
813
 
8.6%
790
 
8.4%
521
 
5.5%
499
 
5.3%
435
 
4.6%
297
 
3.1%
256
 
2.7%
254
 
2.7%
246
 
2.6%
Other values (247) 4461
47.2%
Common
ValueCountFrequency (%)
3489
89.8%
1 162
 
4.2%
, 125
 
3.2%
2 43
 
1.1%
3 23
 
0.6%
4 12
 
0.3%
5 7
 
0.2%
9 5
 
0.1%
) 4
 
0.1%
( 4
 
0.1%
Other values (4) 12
 
0.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9441 | 70.8%
ASCII | 3886 | 29.1%
Compat Jamo | 11 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3489
89.8%
1 162
 
4.2%
, 125
 
3.2%
2 43
 
1.1%
3 23
 
0.6%
4 12
 
0.3%
5 7
 
0.2%
9 5
 
0.1%
) 4
 
0.1%
( 4
 
0.1%
Other values (4) 12
 
0.3%
Hangul
ValueCountFrequency (%)
880
 
9.3%
813
 
8.6%
790
 
8.4%
521
 
5.5%
499
 
5.3%
435
 
4.6%
297
 
3.1%
256
 
2.7%
254
 
2.7%
246
 
2.6%
Other values (246) 4450
47.1%
Compat Jamo
ValueCountFrequency (%)
11
100.0%

전화번호
Text

MISSING 

Distinct: 604
Distinct (%): 95.0%
Missing: 127
Missing (%): 16.6%
Memory size: 6.1 KiB
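The percentages for this variable use two different denominators. Judging from the reported figures alone, the missing percentage is taken over all 763 rows, while the distinct percentage appears to be taken over the 636 non-missing rows; that reading is an inference from the numbers, not something the report states. A sketch of the arithmetic, which also rechecks the reported mean length from the total character count (7621):

```python
# Reported figures for 전화번호.
n_rows = 763
n_missing = 127
n_distinct = 604
total_characters = 7621

n_present = n_rows - n_missing               # rows that have a phone number
missing_pct = 100 * n_missing / n_rows       # share of all rows
distinct_pct = 100 * n_distinct / n_present  # share of non-missing rows (inferred)
mean_length = total_characters / n_present   # average characters per present value

print(n_present, f"{missing_pct:.1f}%", f"{distinct_pct:.1f}%", f"{mean_length:.6f}")
```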

Length

Max length: 13
Median length: 12
Mean length: 11.982704
Min length: 9

Characters and Unicode

Total characters: 7621
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 576
Unique (%): 90.6%

Sample

1st row: 031-886-8975
2nd row: 031-883-5394
3rd row: 031-882-2520
4th row: 031-884-3432
5th row: 031-883-6321
Value | Count | Frequency (%)
031-884-2366 | 3 | 0.5%
031-883-0117 | 3 | 0.5%
031-883-8781 | 3 | 0.5%
031-886-3100 | 3 | 0.5%
031-885-8057 | 2 | 0.3%
031-884-6538 | 2 | 0.3%
031-884-1658 | 2 | 0.3%
031-884-8229 | 2 | 0.3%
031-881-2013 | 2 | 0.3%
031-885-1552 | 2 | 0.3%
Other values (594) | 612 | 96.2%

Most occurring characters

ValueCountFrequency (%)
8 1318
17.3%
- 1270
16.7%
1 1035
13.6%
0 1000
13.1%
3 948
12.4%
5 399
 
5.2%
2 395
 
5.2%
6 363
 
4.8%
4 354
 
4.6%
7 319
 
4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 6351 | 83.3%
Dash Punctuation | 1270 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 1318
20.8%
1 1035
16.3%
0 1000
15.7%
3 948
14.9%
5 399
 
6.3%
2 395
 
6.2%
6 363
 
5.7%
4 354
 
5.6%
7 319
 
5.0%
9 220
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 1270
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 7621 | 100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
8 1318
17.3%
- 1270
16.7%
1 1035
13.6%
0 1000
13.1%
3 948
12.4%
5 399
 
5.2%
2 395
 
5.2%
6 363
 
4.8%
4 354
 
4.6%
7 319
 
4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 7621 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 1318
17.3%
- 1270
16.7%
1 1035
13.6%
0 1000
13.1%
3 948
12.4%
5 399
 
5.2%
2 395
 
5.2%
6 363
 
4.8%
4 354
 
4.6%
7 319
 
4.2%
생산품
Text

Distinct: 554
Distinct (%): 72.7%
Missing: 1
Missing (%): 0.1%
Memory size: 6.1 KiB

Length

Max length: 66
Median length: 47
Mean length: 8.2874016
Min length: 1

Characters and Unicode

Total characters: 6315
Distinct characters: 505
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 508
Unique (%): 66.7%

Sample

1st row: 단무지
2nd row: 밸브
3rd row: 농막
4th row: 차광망
5th row: 원단
ValueCountFrequency (%)
도자기 143
 
10.7%
28
 
2.1%
25
 
1.9%
창호 18
 
1.4%
플라스틱 17
 
1.3%
콘크리트 11
 
0.8%
레미콘 11
 
0.8%
이동식 6
 
0.5%
생활도자기 6
 
0.5%
컨테이너 5
 
0.4%
Other values (829) 1062
79.7%

Most occurring characters

ValueCountFrequency (%)
572
 
9.1%
, 405
 
6.4%
331
 
5.2%
234
 
3.7%
190
 
3.0%
104
 
1.6%
94
 
1.5%
79
 
1.3%
78
 
1.2%
76
 
1.2%
Other values (495) 4152
65.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5034 | 79.7%
Space Separator | 572 | 9.1%
Other Punctuation | 416 | 6.6%
Uppercase Letter | 106 | 1.7%
Lowercase Letter | 64 | 1.0%
Open Punctuation | 60 | 1.0%
Close Punctuation | 60 | 1.0%
Decimal Number | 2 | < 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
331
 
6.6%
234
 
4.6%
190
 
3.8%
104
 
2.1%
94
 
1.9%
79
 
1.6%
78
 
1.5%
76
 
1.5%
69
 
1.4%
64
 
1.3%
Other values (444) 3715
73.8%
Uppercase Letter
ValueCountFrequency (%)
P 17
16.0%
C 13
12.3%
E 7
 
6.6%
H 7
 
6.6%
B 7
 
6.6%
S 6
 
5.7%
D 6
 
5.7%
L 5
 
4.7%
R 4
 
3.8%
F 4
 
3.8%
Other values (11) 30
28.3%
Lowercase Letter
ValueCountFrequency (%)
p 12
18.8%
t 8
12.5%
c 7
10.9%
e 6
9.4%
s 5
7.8%
a 4
 
6.2%
v 3
 
4.7%
i 3
 
4.7%
f 2
 
3.1%
g 2
 
3.1%
Other values (10) 12
18.8%
Other Punctuation
ValueCountFrequency (%)
, 405
97.4%
/ 6
 
1.4%
. 4
 
1.0%
· 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
9 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
572
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5034 | 79.7%
Common | 1111 | 17.6%
Latin | 170 | 2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
331
 
6.6%
234
 
4.6%
190
 
3.8%
104
 
2.1%
94
 
1.9%
79
 
1.6%
78
 
1.5%
76
 
1.5%
69
 
1.4%
64
 
1.3%
Other values (444) 3715
73.8%
Latin
ValueCountFrequency (%)
P 17
 
10.0%
C 13
 
7.6%
p 12
 
7.1%
t 8
 
4.7%
E 7
 
4.1%
c 7
 
4.1%
H 7
 
4.1%
B 7
 
4.1%
S 6
 
3.5%
e 6
 
3.5%
Other values (31) 80
47.1%
Common
ValueCountFrequency (%)
572
51.5%
, 405
36.5%
( 60
 
5.4%
) 60
 
5.4%
/ 6
 
0.5%
. 4
 
0.4%
9 1
 
0.1%
4 1
 
0.1%
- 1
 
0.1%
· 1
 
0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5034 | 79.7%
ASCII | 1280 | 20.3%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
572
44.7%
, 405
31.6%
( 60
 
4.7%
) 60
 
4.7%
P 17
 
1.3%
C 13
 
1.0%
p 12
 
0.9%
t 8
 
0.6%
E 7
 
0.5%
c 7
 
0.5%
Other values (40) 119
 
9.3%
Hangul
ValueCountFrequency (%)
331
 
6.6%
234
 
4.6%
190
 
3.8%
104
 
2.1%
94
 
1.9%
79
 
1.6%
78
 
1.5%
76
 
1.5%
69
 
1.4%
64
 
1.3%
Other values (444) 3715
73.8%
None
ValueCountFrequency (%)
· 1
100.0%

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Index | 회사명 | 공장대표주소(도로명) | 공장대표주소(지번) | 대표업종번호 | 업종명 | 전화번호 | 생산품
0 | 주식회사 털보농산 | 경기도 여주시 가남읍 가남로 105 | 경기도 여주시 가남읍 하귀리 351-4 | 10301 | 김치류 제조업 외 1 종 | 031-886-8975 | 단무지
1 | 영진공업사 | 경기도 여주시 가남읍 가남로 114 | 경기도 여주시 가남읍 하귀리 346 | 29133 | 탭, 밸브 및 유사장치 제조업 | 031-883-5394 | 밸브
2 | 힐링하우징 | 경기도 여주시 가남읍 가남로 153 | 경기도 여주시 가남읍 하귀리 241-3 | 25119 | 기타 구조용 금속제품 제조업 | 031-882-2520 | 농막
3 | 청원농산 | 경기도 여주시 가남읍 가남로 230 | 경기도 여주시 가남읍 하귀리 산 43-2 | 13224 | 천막, 텐트 및 유사 제품 제조업 외 1 종 | 031-884-3432 | 차광망
4 | (주)삼영텍스타일 | 경기도 여주시 가남읍 가남로 298 | 경기도 여주시 가남읍 양귀리 146 | 13300 | 편조원단 제조업 | 031-883-6321 | 원단
5 | 주식회사 와이에스피전자 | 경기도 여주시 가남읍 가남로 34 | 경기도 여주시 가남읍 삼군리 358-3 | 26299 | 그 외 기타 전자부품 제조업 외 2 종 | 031-883-7095 | 전자판 회로(PCB ASSY), 기타 자외선 살균기
6 | (주)제이비이엔엘 | 경기도 여주시 가남읍 가남로 389 | 경기도 여주시 가남읍 양귀리 462-2 | 28122 | 전기회로 접속장치 제조업 외 1 종 | <NA> | 콘센트, 스위치
7 | 삼한식품 | 경기도 여주시 가남읍 가남로 40 | 경기도 여주시 가남읍 삼군리 356-3 | 10743 | 장류 제조업 | 031-882-2873 | 장류제조업
8 | (주)서진강재 | 경기도 여주시 가남읍 가남로 409-5 | 경기도 여주시 가남읍 양귀리 461-8 | 25111 | 금속 문, 창, 셔터 및 관련제품 제조업 | <NA> | 샷시보강재
9 | (주)씨테크 | 경기도 여주시 가남읍 가남로 421 | 경기도 여주시 가남읍 양귀리 511-1 | 26121 | 발광 다이오드 제조업 외 1 종 | 031-881-2160 | saw filter(이동통신 중계기)

Last rows

Index | 회사명 | 공장대표주소(도로명) | 공장대표주소(지번) | 대표업종번호 | 업종명 | 전화번호 | 생산품
753 | (주)포이닉스 | 경기도 여주시 흥천면 흥천로 259 | 경기도 여주시 흥천면 대당리 25-5 | 23991 | 아스팔트 콘크리트 및 혼합제품 제조업 | 031-888-5530 | 아스콘
754 | 바른 | 경기도 여주시 흥천면 흥천로 308-102 | 경기도 여주시 흥천면 율극리 313-9 | 16229 | 기타 건축용 나무제품 제조업 외 1 종 | <NA> | 이동식 화장실
755 | (주)극동플러스 | 경기도 여주시 흥천면 흥천로 308-110 | 경기도 여주시 흥천면 귀백리 58-13 | 22251 | 폴리스티렌 발포 성형제품 제조업 외 1 종 | 02-579-0488 | 우레탄 발포 성형품, 단열볼트
756 | 대양이에스 | 경기도 여주시 흥천면 흥천로 308-110 | 경기도 여주시 흥천면 귀백리 58-13 | 23911 | 건설용 석제품 제조업 외 1 종 | <NA> | 석제품
757 | (주)다성레미콘 여주지점 | 경기도 여주시 흥천면 흥천로 308-85 | 경기도 여주시 흥천면 효지리 454 | 23322 | 레미콘 제조업 | 031-632-3048 | 레디믹스트 콘크리트
758 | 영농조합법인서흥티.엠.알 | 경기도 여주시 흥천면 흥천로 423-5 | 경기도 여주시 흥천면 효지리 182-5 | 10801 | 배합 사료 제조업 외 1 종 | 031-883-1210 | 사료(티엠알)
759 | (주)창 | 경기도 여주시 흥천면 흥천로 61 | 경기도 여주시 흥천면 신근리 724-11 | 22223 | 플라스틱 창호 제조업 | <NA> | 플라스틱 창호
760 | R-엔지니어링 | 경기도 여주시 흥천면 흥천로 708 | 경기도 여주시 흥천면 다대리 22 | 29199 | 그 외 기타 일반목적용 기계 제조업 | <NA> | 유리제조 설비용 기계
761 | 일산교역 | 경기도 여주시 흥천면 흥천로 708 | 경기도 여주시 흥천면 다대리 22 | 16291 | 목재 도구 및 주방용 나무제품 제조업 | 031-884-1261 | 목각밥상
762 | (주)지성 | 경기도 여주시 흥천면 흥천로 75 | 경기도 여주시 흥천면 신근리 724 | 25111 | 금속 문, 창, 셔터 및 관련제품 제조업 외 1 종 | 031-884-8223 | 파이프이음쇠