Overview

Dataset statistics

Number of variables16
Number of observations780
Missing cells630
Missing cells (%)5.0%
Duplicate rows2
Duplicate rows (%)0.3%
Total size in memory99.9 KiB
Average record size in memory131.2 B

Variable types

Text11
Categorical2
Numeric3

Dataset

Description공공구매 영업지원 희망 사회적기업 목록
Author경기도일자리재단
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=KBF4WJP0TZXNYQS4X5NJ32201037&infSeq=1

Alerts

Dataset has 2 (0.3%) duplicate rowsDuplicates
정제우편번호 is highly overall correlated with 정제WGS84위도High correlation
정제WGS84위도 is highly overall correlated with 정제우편번호High correlation
지정정보처리장치 is highly imbalanced (61.7%)Imbalance
연락처 has 167 (21.4%) missing valuesMissing
정제도로명주소 has 54 (6.9%) missing valuesMissing
홈페이지 has 241 (30.9%) missing valuesMissing
면허및특허 has 147 (18.8%) missing valuesMissing

Reproduction

Analysis started2024-04-11 03:07:35.148004
Analysis finished2024-04-11 03:07:39.187196
Duration4.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct72
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2024-04-11T12:07:39.281379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length36
Mean length7.1692308
Min length4

Characters and Unicode

Total characters5592
Distinct characters50
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)5.8%

Sample

1st row사회적협동조합
2nd row사회적협동조합
3rd row마을기업
4th row사회적협동조합
5th row사회적기업
ValueCountFrequency (%)
사회적기업 280
28.9%
자활기업 188
19.4%
예비사회적기업 177
18.3%
사회적협동조합 93
 
9.6%
협동조합 63
 
6.5%
마을기업 45
 
4.6%
여성기업 38
 
3.9%
청년기업 18
 
1.9%
장애인기업 10
 
1.0%
중증장애인생산시설 8
 
0.8%
Other values (28) 48
 
5.0%
2024-04-11T12:07:39.543087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
817
14.6%
806
14.4%
576
10.3%
569
10.2%
569
10.2%
, 207
 
3.7%
190
 
3.4%
190
 
3.4%
188
 
3.4%
183
 
3.3%
Other values (40) 1297
23.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5190
92.8%
Other Punctuation 208
 
3.7%
Space Separator 188
 
3.4%
Close Punctuation 2
 
< 0.1%
Dash Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
817
15.7%
806
15.5%
576
11.1%
569
11.0%
569
11.0%
190
 
3.7%
190
 
3.7%
183
 
3.5%
183
 
3.5%
163
 
3.1%
Other values (34) 944
18.2%
Other Punctuation
ValueCountFrequency (%)
, 207
99.5%
/ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
188
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5190
92.8%
Common 402
 
7.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
817
15.7%
806
15.5%
576
11.1%
569
11.0%
569
11.0%
190
 
3.7%
190
 
3.7%
183
 
3.5%
183
 
3.5%
163
 
3.1%
Other values (34) 944
18.2%
Common
ValueCountFrequency (%)
, 207
51.5%
188
46.8%
) 2
 
0.5%
- 2
 
0.5%
( 2
 
0.5%
/ 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5190
92.8%
ASCII 402
 
7.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
817
15.7%
806
15.5%
576
11.1%
569
11.0%
569
11.0%
190
 
3.7%
190
 
3.7%
183
 
3.5%
183
 
3.5%
163
 
3.1%
Other values (34) 944
18.2%
ASCII
ValueCountFrequency (%)
, 207
51.5%
188
46.8%
) 2
 
0.5%
- 2
 
0.5%
( 2
 
0.5%
/ 1
 
0.2%

분야
Text

Distinct54
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2024-04-11T12:07:39.717449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length7.0282051
Min length2

Characters and Unicode

Total characters5482
Distinct characters130
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)2.3%

Sample

1st row교육/상담
2nd row교육/상담
3rd row교육/상담
4th row기타서비스(용역)
5th row기타서비스(용역)
ValueCountFrequency (%)
농수축산물/식재료 115
14.7%
청소/시설관리?경비 80
 
10.3%
교육/상담 75
 
9.6%
생활용품 66
 
8.5%
인쇄/홍보물 41
 
5.3%
비품 40
 
5.1%
견학/체험 38
 
4.9%
행사/문화예술 29
 
3.7%
건축/설비?보수 29
 
3.7%
사무용품 25
 
3.2%
Other values (44) 242
31.0%
2024-04-11T12:07:40.026948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 447
 
8.2%
201
 
3.7%
186
 
3.4%
176
 
3.2%
165
 
3.0%
165
 
3.0%
) 149
 
2.7%
( 149
 
2.7%
144
 
2.6%
137
 
2.5%
Other values (120) 3563
65.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4570
83.4%
Other Punctuation 609
 
11.1%
Close Punctuation 149
 
2.7%
Open Punctuation 149
 
2.7%
Dash Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
201
 
4.4%
186
 
4.1%
176
 
3.9%
165
 
3.6%
165
 
3.6%
144
 
3.2%
137
 
3.0%
130
 
2.8%
124
 
2.7%
117
 
2.6%
Other values (114) 3025
66.2%
Other Punctuation
ValueCountFrequency (%)
/ 447
73.4%
? 109
 
17.9%
, 53
 
8.7%
Close Punctuation
ValueCountFrequency (%)
) 149
100.0%
Open Punctuation
ValueCountFrequency (%)
( 149
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4570
83.4%
Common 912
 
16.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
201
 
4.4%
186
 
4.1%
176
 
3.9%
165
 
3.6%
165
 
3.6%
144
 
3.2%
137
 
3.0%
130
 
2.8%
124
 
2.7%
117
 
2.6%
Other values (114) 3025
66.2%
Common
ValueCountFrequency (%)
/ 447
49.0%
) 149
 
16.3%
( 149
 
16.3%
? 109
 
12.0%
, 53
 
5.8%
- 5
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4514
82.3%
ASCII 912
 
16.6%
Compat Jamo 56
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 447
49.0%
) 149
 
16.3%
( 149
 
16.3%
? 109
 
12.0%
, 53
 
5.8%
- 5
 
0.5%
Hangul
ValueCountFrequency (%)
201
 
4.5%
186
 
4.1%
176
 
3.9%
165
 
3.7%
165
 
3.7%
144
 
3.2%
137
 
3.0%
130
 
2.9%
124
 
2.7%
117
 
2.6%
Other values (113) 2969
65.8%
Compat Jamo
ValueCountFrequency (%)
56
100.0%
Distinct740
Distinct (%)94.9%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2024-04-11T12:07:40.265281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length19
Mean length8.9230769
Min length2

Characters and Unicode

Total characters6960
Distinct characters536
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique701 ?
Unique (%)89.9%

Sample

1st row사회적협동조합 배움과나눔
2nd row새싹코딩협동조합
3rd row새터마을협동조합
4th row마을미디어인스토리 협동
5th row사단법인희망나눔플러스
ValueCountFrequency (%)
주식회사 109
 
10.4%
사회적협동조합 59
 
5.6%
협동조합 16
 
1.5%
농업회사법인 12
 
1.1%
사단법인 6
 
0.6%
사회복지법인 6
 
0.6%
4
 
0.4%
씨유 4
 
0.4%
유한회사 4
 
0.4%
에듀가든 3
 
0.3%
Other values (782) 828
78.8%
2024-04-11T12:07:40.621669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
310
 
4.5%
295
 
4.2%
279
 
4.0%
271
 
3.9%
247
 
3.5%
238
 
3.4%
238
 
3.4%
232
 
3.3%
) 167
 
2.4%
( 165
 
2.4%
Other values (526) 4518
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6226
89.5%
Space Separator 271
 
3.9%
Close Punctuation 167
 
2.4%
Open Punctuation 165
 
2.4%
Other Symbol 77
 
1.1%
Uppercase Letter 31
 
0.4%
Decimal Number 15
 
0.2%
Other Punctuation 6
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
310
 
5.0%
295
 
4.7%
279
 
4.5%
247
 
4.0%
238
 
3.8%
238
 
3.8%
232
 
3.7%
139
 
2.2%
112
 
1.8%
100
 
1.6%
Other values (495) 4036
64.8%
Uppercase Letter
ValueCountFrequency (%)
C 6
19.4%
S 4
12.9%
U 3
9.7%
G 2
 
6.5%
B 2
 
6.5%
O 2
 
6.5%
A 2
 
6.5%
M 1
 
3.2%
D 1
 
3.2%
K 1
 
3.2%
Other values (7) 7
22.6%
Decimal Number
ValueCountFrequency (%)
2 4
26.7%
5 4
26.7%
8 3
20.0%
1 3
20.0%
9 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 2
33.3%
, 2
33.3%
/ 1
16.7%
& 1
16.7%
Space Separator
ValueCountFrequency (%)
271
100.0%
Close Punctuation
ValueCountFrequency (%)
) 167
100.0%
Open Punctuation
ValueCountFrequency (%)
( 165
100.0%
Other Symbol
ValueCountFrequency (%)
77
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6303
90.6%
Common 626
 
9.0%
Latin 31
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
310
 
4.9%
295
 
4.7%
279
 
4.4%
247
 
3.9%
238
 
3.8%
238
 
3.8%
232
 
3.7%
139
 
2.2%
112
 
1.8%
100
 
1.6%
Other values (496) 4113
65.3%
Latin
ValueCountFrequency (%)
C 6
19.4%
S 4
12.9%
U 3
9.7%
G 2
 
6.5%
B 2
 
6.5%
O 2
 
6.5%
A 2
 
6.5%
M 1
 
3.2%
D 1
 
3.2%
K 1
 
3.2%
Other values (7) 7
22.6%
Common
ValueCountFrequency (%)
271
43.3%
) 167
26.7%
( 165
26.4%
2 4
 
0.6%
5 4
 
0.6%
8 3
 
0.5%
1 3
 
0.5%
. 2
 
0.3%
, 2
 
0.3%
- 2
 
0.3%
Other values (3) 3
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6226
89.5%
ASCII 657
 
9.4%
None 77
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
310
 
5.0%
295
 
4.7%
279
 
4.5%
247
 
4.0%
238
 
3.8%
238
 
3.8%
232
 
3.7%
139
 
2.2%
112
 
1.8%
100
 
1.6%
Other values (495) 4036
64.8%
ASCII
ValueCountFrequency (%)
271
41.2%
) 167
25.4%
( 165
25.1%
C 6
 
0.9%
2 4
 
0.6%
5 4
 
0.6%
S 4
 
0.6%
8 3
 
0.5%
U 3
 
0.5%
1 3
 
0.5%
Other values (20) 27
 
4.1%
None
ValueCountFrequency (%)
77
100.0%
Distinct697
Distinct (%)89.4%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2024-04-11T12:07:40.903196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length3.2884615
Min length2

Characters and Unicode

Total characters2565
Distinct characters212
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique629 ?
Unique (%)80.6%

Sample

1st row천경배
2nd row박은경
3rd row온영란
4th row유증종
5th row박홍진
ValueCountFrequency (%)
1명 6
 
0.7%
6
 
0.7%
김은숙 4
 
0.5%
이순이 4
 
0.5%
송수진 3
 
0.4%
김은희 3
 
0.4%
이정희 3
 
0.4%
박영춘 3
 
0.4%
김주원 3
 
0.4%
김경숙 3
 
0.4%
Other values (716) 786
95.4%
2024-04-11T12:07:41.309415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
186
 
7.3%
138
 
5.4%
94
 
3.7%
74
 
2.9%
68
 
2.7%
61
 
2.4%
60
 
2.3%
55
 
2.1%
54
 
2.1%
45
 
1.8%
Other values (202) 1730
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2462
96.0%
Space Separator 44
 
1.7%
Other Punctuation 32
 
1.2%
Uppercase Letter 15
 
0.6%
Decimal Number 10
 
0.4%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
186
 
7.6%
138
 
5.6%
94
 
3.8%
74
 
3.0%
68
 
2.8%
61
 
2.5%
60
 
2.4%
55
 
2.2%
54
 
2.2%
45
 
1.8%
Other values (185) 1627
66.1%
Uppercase Letter
ValueCountFrequency (%)
I 3
20.0%
A 2
13.3%
Y 2
13.3%
O 2
13.3%
K 2
13.3%
B 1
 
6.7%
S 1
 
6.7%
H 1
 
6.7%
M 1
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 6
60.0%
2 3
30.0%
4 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 29
90.6%
/ 3
 
9.4%
Space Separator
ValueCountFrequency (%)
44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2462
96.0%
Common 88
 
3.4%
Latin 15
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
186
 
7.6%
138
 
5.6%
94
 
3.8%
74
 
3.0%
68
 
2.8%
61
 
2.5%
60
 
2.4%
55
 
2.2%
54
 
2.2%
45
 
1.8%
Other values (185) 1627
66.1%
Latin
ValueCountFrequency (%)
I 3
20.0%
A 2
13.3%
Y 2
13.3%
O 2
13.3%
K 2
13.3%
B 1
 
6.7%
S 1
 
6.7%
H 1
 
6.7%
M 1
 
6.7%
Common
ValueCountFrequency (%)
44
50.0%
, 29
33.0%
1 6
 
6.8%
2 3
 
3.4%
/ 3
 
3.4%
4 1
 
1.1%
( 1
 
1.1%
) 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2462
96.0%
ASCII 103
 
4.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
186
 
7.6%
138
 
5.6%
94
 
3.8%
74
 
3.0%
68
 
2.8%
61
 
2.5%
60
 
2.4%
55
 
2.2%
54
 
2.2%
45
 
1.8%
Other values (185) 1627
66.1%
ASCII
ValueCountFrequency (%)
44
42.7%
, 29
28.2%
1 6
 
5.8%
I 3
 
2.9%
2 3
 
2.9%
/ 3
 
2.9%
A 2
 
1.9%
Y 2
 
1.9%
O 2
 
1.9%
K 2
 
1.9%
Other values (7) 7
 
6.8%

연락처
Text

MISSING 

Distinct555
Distinct (%)90.5%
Missing167
Missing (%)21.4%
Memory size6.2 KiB
2024-04-11T12:07:41.516980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.006525
Min length9

Characters and Unicode

Total characters7360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique506 ?
Unique (%)82.5%

Sample

1st row02-6015-1690
2nd row031-321-0870
3rd row070-4652-5138
4th row031-769-1345
5th row070-8866-7065
ValueCountFrequency (%)
02-2612-0453 4
 
0.7%
031-635-0200 3
 
0.5%
031-945-7500 3
 
0.5%
031-313-2733 3
 
0.5%
031-232-0179 3
 
0.5%
031-879-2733 3
 
0.5%
031-569-1919 3
 
0.5%
032-323-9946 3
 
0.5%
031-761-5758 2
 
0.3%
031-221-9238 2
 
0.3%
Other values (545) 584
95.3%
2024-04-11T12:07:41.826757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1207
16.4%
0 1044
14.2%
3 931
12.6%
1 925
12.6%
2 527
7.2%
7 524
7.1%
5 482
 
6.5%
8 447
 
6.1%
4 444
 
6.0%
9 423
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6153
83.6%
Dash Punctuation 1207
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1044
17.0%
3 931
15.1%
1 925
15.0%
2 527
8.6%
7 524
8.5%
5 482
7.8%
8 447
7.3%
4 444
7.2%
9 423
6.9%
6 406
 
6.6%
Dash Punctuation
ValueCountFrequency (%)
- 1207
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1207
16.4%
0 1044
14.2%
3 931
12.6%
1 925
12.6%
2 527
7.2%
7 524
7.1%
5 482
 
6.5%
8 447
 
6.1%
4 444
 
6.0%
9 423
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1207
16.4%
0 1044
14.2%
3 931
12.6%
1 925
12.6%
2 527
7.2%
7 524
7.1%
5 482
 
6.5%
8 447
 
6.1%
4 444
 
6.0%
9 423
 
5.7%

정제도로명주소
Text

MISSING 

Distinct628
Distinct (%)86.5%
Missing54
Missing (%)6.9%
Memory size6.2 KiB
2024-04-11T12:07:42.111238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length19.778237
Min length13

Characters and Unicode

Total characters14359
Distinct characters271
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique558 ?
Unique (%)76.9%

Sample

1st row경기도 남양주시 평내로 167
2nd row경기도 용인시 기흥구 동백5로 22
3rd row경기도 양주시 은현면 화합로691번길 10-14
4th row경기도 의정부시 오목로205번길 36
5th row경기도 포천시 내촌면 금강로 2975-23
ValueCountFrequency (%)
경기도 726
 
21.7%
남양주시 57
 
1.7%
양주시 52
 
1.6%
수원시 50
 
1.5%
부천시 50
 
1.5%
성남시 39
 
1.2%
파주시 39
 
1.2%
시흥시 39
 
1.2%
안산시 39
 
1.2%
원미구 33
 
1.0%
Other values (1063) 2229
66.5%
2024-04-11T12:07:42.515951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2627
18.3%
757
 
5.3%
753
 
5.2%
751
 
5.2%
750
 
5.2%
650
 
4.5%
1 506
 
3.5%
2 415
 
2.9%
335
 
2.3%
3 307
 
2.1%
Other values (261) 6508
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8850
61.6%
Decimal Number 2707
 
18.9%
Space Separator 2627
 
18.3%
Dash Punctuation 175
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
757
 
8.6%
753
 
8.5%
751
 
8.5%
750
 
8.5%
650
 
7.3%
335
 
3.8%
275
 
3.1%
254
 
2.9%
209
 
2.4%
176
 
2.0%
Other values (249) 3940
44.5%
Decimal Number
ValueCountFrequency (%)
1 506
18.7%
2 415
15.3%
3 307
11.3%
4 240
8.9%
5 235
8.7%
6 213
7.9%
7 206
7.6%
0 204
7.5%
9 204
7.5%
8 177
 
6.5%
Space Separator
ValueCountFrequency (%)
2627
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 175
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8850
61.6%
Common 5509
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
757
 
8.6%
753
 
8.5%
751
 
8.5%
750
 
8.5%
650
 
7.3%
335
 
3.8%
275
 
3.1%
254
 
2.9%
209
 
2.4%
176
 
2.0%
Other values (249) 3940
44.5%
Common
ValueCountFrequency (%)
2627
47.7%
1 506
 
9.2%
2 415
 
7.5%
3 307
 
5.6%
4 240
 
4.4%
5 235
 
4.3%
6 213
 
3.9%
7 206
 
3.7%
0 204
 
3.7%
9 204
 
3.7%
Other values (2) 352
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8850
61.6%
ASCII 5509
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2627
47.7%
1 506
 
9.2%
2 415
 
7.5%
3 307
 
5.6%
4 240
 
4.4%
5 235
 
4.3%
6 213
 
3.9%
7 206
 
3.7%
0 204
 
3.7%
9 204
 
3.7%
Other values (2) 352
 
6.4%
Hangul
ValueCountFrequency (%)
757
 
8.6%
753
 
8.5%
751
 
8.5%
750
 
8.5%
650
 
7.3%
335
 
3.8%
275
 
3.1%
254
 
2.9%
209
 
2.4%
176
 
2.0%
Other values (249) 3940
44.5%
Distinct680
Distinct (%)87.2%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
2024-04-11T12:07:42.792588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length37
Mean length21.974359
Min length15

Characters and Unicode

Total characters17140
Distinct characters295
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique607 ?
Unique (%)77.8%

Sample

1st row경기도 남양주시 평내동 597-2번지
2nd row경기도 광명시 철산동 241번지 철산13단지주공아파트
3rd row경기도 광명시 광명동 732번지 광명중앙하이츠아파트
4th row경기도 용인시 기흥구 중동 838번지
5th row경기도 양주시 은현면 운암리 466-7번지
ValueCountFrequency (%)
경기도 778
 
21.2%
남양주시 62
 
1.7%
부천시 57
 
1.6%
수원시 53
 
1.4%
양주시 53
 
1.4%
파주시 43
 
1.2%
성남시 42
 
1.1%
안산시 41
 
1.1%
시흥시 40
 
1.1%
원미구 35
 
1.0%
Other values (1211) 2472
67.2%
2024-04-11T12:07:43.190233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2896
 
16.9%
807
 
4.7%
806
 
4.7%
803
 
4.7%
800
 
4.7%
781
 
4.6%
777
 
4.5%
630
 
3.7%
1 588
 
3.4%
- 568
 
3.3%
Other values (285) 7684
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10527
61.4%
Decimal Number 3119
 
18.2%
Space Separator 2896
 
16.9%
Dash Punctuation 568
 
3.3%
Uppercase Letter 21
 
0.1%
Other Punctuation 3
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
807
 
7.7%
806
 
7.7%
803
 
7.6%
800
 
7.6%
781
 
7.4%
777
 
7.4%
630
 
6.0%
296
 
2.8%
225
 
2.1%
221
 
2.1%
Other values (257) 4381
41.6%
Decimal Number
ValueCountFrequency (%)
1 588
18.9%
2 374
12.0%
4 339
10.9%
3 328
10.5%
5 315
10.1%
8 273
8.8%
6 254
8.1%
7 238
7.6%
0 217
 
7.0%
9 193
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
L 4
19.0%
I 3
14.3%
H 2
9.5%
K 2
9.5%
R 2
9.5%
A 2
9.5%
P 2
9.5%
D 2
9.5%
J 1
 
4.8%
N 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
@ 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
h 1
50.0%
l 1
50.0%
Space Separator
ValueCountFrequency (%)
2896
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 568
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10527
61.4%
Common 6590
38.4%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
807
 
7.7%
806
 
7.7%
803
 
7.6%
800
 
7.6%
781
 
7.4%
777
 
7.4%
630
 
6.0%
296
 
2.8%
225
 
2.1%
221
 
2.1%
Other values (257) 4381
41.6%
Common
ValueCountFrequency (%)
2896
43.9%
1 588
 
8.9%
- 568
 
8.6%
2 374
 
5.7%
4 339
 
5.1%
3 328
 
5.0%
5 315
 
4.8%
8 273
 
4.1%
6 254
 
3.9%
7 238
 
3.6%
Other values (6) 417
 
6.3%
Latin
ValueCountFrequency (%)
L 4
17.4%
I 3
13.0%
H 2
8.7%
K 2
8.7%
R 2
8.7%
A 2
8.7%
P 2
8.7%
D 2
8.7%
J 1
 
4.3%
N 1
 
4.3%
Other values (2) 2
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10527
61.4%
ASCII 6613
38.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2896
43.8%
1 588
 
8.9%
- 568
 
8.6%
2 374
 
5.7%
4 339
 
5.1%
3 328
 
5.0%
5 315
 
4.8%
8 273
 
4.1%
6 254
 
3.8%
7 238
 
3.6%
Other values (18) 440
 
6.7%
Hangul
ValueCountFrequency (%)
807
 
7.7%
806
 
7.7%
803
 
7.6%
800
 
7.6%
781
 
7.4%
777
 
7.4%
630
 
6.0%
296
 
2.8%
225
 
2.1%
221
 
2.1%
Other values (257) 4381
41.6%

홈페이지
Text

MISSING 

Distinct478
Distinct (%)88.7%
Missing241
Missing (%)30.9%
Memory size6.2 KiB
2024-04-11T12:07:43.392443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length202
Median length56
Mean length25.105751
Min length1

Characters and Unicode

Total characters13532
Distinct characters170
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique441 ?
Unique (%)81.8%

Sample

1st rowhttps://baeumnanum.org/
2nd rowhttps://cafe.naver.com/codinglab2
3rd rowblog.naver.com/darl
4th rowhttps://instory.tv/media
5th rowhttp://www.hsplus.or.kr/document/main/main.php
ValueCountFrequency (%)
24
 
4.4%
www.ilnanum.com 6
 
1.1%
3
 
0.6%
http://edugarden.co.kr/shop/display/home.php?mode=home 3
 
0.6%
www.icleancare.com 2
 
0.4%
www.ashomecare.or.kr 2
 
0.4%
https://www.aspf.or.kr/triple/common/organization.html?cate_no=44 2
 
0.4%
http://ptjahwal.or.kr 2
 
0.4%
www.jjh.or.kr 2
 
0.4%
https://smartstore.naver.com/siheungpyogo 2
 
0.4%
Other values (468) 497
91.2%
2024-04-11T12:07:43.755195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 1100
 
8.1%
o 1038
 
7.7%
/ 965
 
7.1%
t 898
 
6.6%
w 853
 
6.3%
r 653
 
4.8%
a 617
 
4.6%
c 558
 
4.1%
e 535
 
4.0%
p 515
 
3.8%
Other values (160) 5800
42.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9684
71.6%
Other Punctuation 2566
 
19.0%
Decimal Number 793
 
5.9%
Uppercase Letter 232
 
1.7%
Other Letter 133
 
1.0%
Dash Punctuation 75
 
0.6%
Math Symbol 26
 
0.2%
Connector Punctuation 16
 
0.1%
Space Separator 6
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
3.8%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (86) 105
78.9%
Lowercase Letter
ValueCountFrequency (%)
o 1038
 
10.7%
t 898
 
9.3%
w 853
 
8.8%
r 653
 
6.7%
a 617
 
6.4%
c 558
 
5.8%
e 535
 
5.5%
p 515
 
5.3%
h 499
 
5.2%
m 491
 
5.1%
Other values (16) 3027
31.3%
Uppercase Letter
ValueCountFrequency (%)
C 46
19.8%
E 38
16.4%
D 38
16.4%
B 20
8.6%
A 16
 
6.9%
N 9
 
3.9%
P 7
 
3.0%
M 7
 
3.0%
G 6
 
2.6%
K 5
 
2.2%
Other values (15) 40
17.2%
Decimal Number
ValueCountFrequency (%)
1 108
13.6%
0 98
12.4%
8 87
11.0%
3 86
10.8%
2 80
10.1%
9 79
10.0%
4 77
9.7%
7 71
9.0%
5 58
7.3%
6 49
6.2%
Other Punctuation
ValueCountFrequency (%)
. 1100
42.9%
/ 965
37.6%
: 300
 
11.7%
% 135
 
5.3%
* 36
 
1.4%
? 23
 
0.9%
& 6
 
0.2%
@ 1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Math Symbol
ValueCountFrequency (%)
= 26
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 16
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9916
73.3%
Common 3483
 
25.7%
Hangul 133
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
3.8%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (86) 105
78.9%
Latin
ValueCountFrequency (%)
o 1038
 
10.5%
t 898
 
9.1%
w 853
 
8.6%
r 653
 
6.6%
a 617
 
6.2%
c 558
 
5.6%
e 535
 
5.4%
p 515
 
5.2%
h 499
 
5.0%
m 491
 
5.0%
Other values (41) 3259
32.9%
Common
ValueCountFrequency (%)
. 1100
31.6%
/ 965
27.7%
: 300
 
8.6%
% 135
 
3.9%
1 108
 
3.1%
0 98
 
2.8%
8 87
 
2.5%
3 86
 
2.5%
2 80
 
2.3%
9 79
 
2.3%
Other values (13) 445
12.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13399
99.0%
Hangul 133
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 1100
 
8.2%
o 1038
 
7.7%
/ 965
 
7.2%
t 898
 
6.7%
w 853
 
6.4%
r 653
 
4.9%
a 617
 
4.6%
c 558
 
4.2%
e 535
 
4.0%
p 515
 
3.8%
Other values (64) 5667
42.3%
Hangul
ValueCountFrequency (%)
5
 
3.8%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (86) 105
78.9%
Distinct284
Distinct (%)36.7%
Missing7
Missing (%)0.9%
Memory size6.2 KiB
2024-04-11T12:07:43.965012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length41
Mean length8.9573092
Min length1

Characters and Unicode

Total characters6924
Distinct characters321
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique209 ?
Unique (%)27.0%

Sample

1st row교육?상담
2nd row교육?상담
3rd row교육?상담
4th row영상물제작
5th row운동장 인조잔디 철거 용역 서비스
ValueCountFrequency (%)
기타식품류 59
 
4.6%
청소?소독?방역 47
 
3.7%
교육?상담 39
 
3.1%
37
 
2.9%
견학?체험 28
 
2.2%
기타설치물 24
 
1.9%
유통ㆍ판매(편의점 20
 
1.6%
배송ㆍ운전(양곡택배 20
 
1.6%
19
 
1.5%
집수리 18
 
1.4%
Other values (503) 961
75.6%
2024-04-11T12:07:44.511998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
499
 
7.2%
? 280
 
4.0%
, 250
 
3.6%
201
 
2.9%
197
 
2.8%
197
 
2.8%
175
 
2.5%
141
 
2.0%
) 133
 
1.9%
( 133
 
1.9%
Other values (311) 4718
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5519
79.7%
Other Punctuation 580
 
8.4%
Space Separator 499
 
7.2%
Close Punctuation 133
 
1.9%
Open Punctuation 133
 
1.9%
Uppercase Letter 53
 
0.8%
Dash Punctuation 6
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
201
 
3.6%
197
 
3.6%
197
 
3.6%
175
 
3.2%
141
 
2.6%
122
 
2.2%
111
 
2.0%
111
 
2.0%
110
 
2.0%
110
 
2.0%
Other values (290) 4044
73.3%
Uppercase Letter
ValueCountFrequency (%)
I 11
20.8%
T 10
18.9%
C 7
13.2%
D 6
11.3%
P 4
 
7.5%
E 3
 
5.7%
L 3
 
5.7%
Y 3
 
5.7%
V 2
 
3.8%
S 2
 
3.8%
Other values (2) 2
 
3.8%
Other Punctuation
ValueCountFrequency (%)
? 280
48.3%
, 250
43.1%
/ 48
 
8.3%
· 2
 
0.3%
Space Separator
ValueCountFrequency (%)
499
100.0%
Close Punctuation
ValueCountFrequency (%)
) 133
100.0%
Open Punctuation
ValueCountFrequency (%)
( 133
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5519
79.7%
Common 1352
 
19.5%
Latin 53
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
201
 
3.6%
197
 
3.6%
197
 
3.6%
175
 
3.2%
141
 
2.6%
122
 
2.2%
111
 
2.0%
111
 
2.0%
110
 
2.0%
110
 
2.0%
Other values (290) 4044
73.3%
Latin
ValueCountFrequency (%)
I 11
20.8%
T 10
18.9%
C 7
13.2%
D 6
11.3%
P 4
 
7.5%
E 3
 
5.7%
L 3
 
5.7%
Y 3
 
5.7%
V 2
 
3.8%
S 2
 
3.8%
Other values (2) 2
 
3.8%
Common
ValueCountFrequency (%)
499
36.9%
? 280
20.7%
, 250
18.5%
) 133
 
9.8%
( 133
 
9.8%
/ 48
 
3.6%
- 6
 
0.4%
· 2
 
0.1%
3 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5427
78.4%
ASCII 1403
 
20.3%
Compat Jamo 92
 
1.3%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
499
35.6%
? 280
20.0%
, 250
17.8%
) 133
 
9.5%
( 133
 
9.5%
/ 48
 
3.4%
I 11
 
0.8%
T 10
 
0.7%
C 7
 
0.5%
D 6
 
0.4%
Other values (10) 26
 
1.9%
Hangul
ValueCountFrequency (%)
201
 
3.7%
197
 
3.6%
197
 
3.6%
175
 
3.2%
141
 
2.6%
122
 
2.2%
111
 
2.0%
111
 
2.0%
110
 
2.0%
110
 
2.0%
Other values (289) 3952
72.8%
Compat Jamo
ValueCountFrequency (%)
92
100.0%
None
ValueCountFrequency (%)
· 2
100.0%

면허및특허
Text

MISSING 

Distinct62
Distinct (%)9.8%
Missing147
Missing (%)18.8%
Memory size6.2 KiB
2024-04-11T12:07:44.763256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length7
Mean length8.0521327
Min length2

Characters and Unicode

Total characters5097
Distinct characters206
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)8.4%

Sample

1st row사전 확인필요
2nd row사전 확인필요
3rd row사전 확인필요
4th row사전 확인필요
5th row사전 확인필요
ValueCountFrequency (%)
사전 546
41.9%
확인필요 546
41.9%
없음 15
 
1.2%
인증 9
 
0.7%
특허실용신안 5
 
0.4%
iso9001 5
 
0.4%
haccp 4
 
0.3%
iso인증 4
 
0.3%
친환경 3
 
0.2%
ks 3
 
0.2%
Other values (145) 164
 
12.6%
2024-04-11T12:07:45.118447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
671
13.2%
595
11.7%
556
10.9%
550
10.8%
549
10.8%
546
10.7%
546
10.7%
0 68
 
1.3%
, 51
 
1.0%
45
 
0.9%
Other values (196) 920
18.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4009
78.7%
Space Separator 671
 
13.2%
Decimal Number 204
 
4.0%
Uppercase Letter 117
 
2.3%
Other Punctuation 60
 
1.2%
Dash Punctuation 18
 
0.4%
Close Punctuation 9
 
0.2%
Open Punctuation 9
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
595
14.8%
556
13.9%
550
13.7%
549
13.7%
546
13.6%
546
13.6%
45
 
1.1%
26
 
0.6%
22
 
0.5%
22
 
0.5%
Other values (163) 552
13.8%
Uppercase Letter
ValueCountFrequency (%)
S 24
20.5%
I 18
15.4%
O 18
15.4%
C 15
12.8%
K 11
9.4%
A 7
 
6.0%
P 6
 
5.1%
H 5
 
4.3%
T 3
 
2.6%
G 3
 
2.6%
Other values (5) 7
 
6.0%
Decimal Number
ValueCountFrequency (%)
0 68
33.3%
1 40
19.6%
2 27
 
13.2%
9 14
 
6.9%
3 13
 
6.4%
5 13
 
6.4%
4 12
 
5.9%
7 8
 
3.9%
6 6
 
2.9%
8 3
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 51
85.0%
/ 4
 
6.7%
· 4
 
6.7%
: 1
 
1.7%
Space Separator
ValueCountFrequency (%)
671
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4009
78.7%
Common 971
 
19.1%
Latin 117
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
595
14.8%
556
13.9%
550
13.7%
549
13.7%
546
13.6%
546
13.6%
45
 
1.1%
26
 
0.6%
22
 
0.5%
22
 
0.5%
Other values (163) 552
13.8%
Common
ValueCountFrequency (%)
671
69.1%
0 68
 
7.0%
, 51
 
5.3%
1 40
 
4.1%
2 27
 
2.8%
- 18
 
1.9%
9 14
 
1.4%
3 13
 
1.3%
5 13
 
1.3%
4 12
 
1.2%
Other values (8) 44
 
4.5%
Latin
ValueCountFrequency (%)
S 24
20.5%
I 18
15.4%
O 18
15.4%
C 15
12.8%
K 11
9.4%
A 7
 
6.0%
P 6
 
5.1%
H 5
 
4.3%
T 3
 
2.6%
G 3
 
2.6%
Other values (5) 7
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4009
78.7%
ASCII 1084
 
21.3%
None 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
671
61.9%
0 68
 
6.3%
, 51
 
4.7%
1 40
 
3.7%
2 27
 
2.5%
S 24
 
2.2%
- 18
 
1.7%
I 18
 
1.7%
O 18
 
1.7%
C 15
 
1.4%
Other values (22) 134
 
12.4%
Hangul
ValueCountFrequency (%)
595
14.8%
556
13.9%
550
13.7%
549
13.7%
546
13.6%
546
13.6%
45
 
1.1%
26
 
0.6%
22
 
0.5%
22
 
0.5%
Other values (163) 552
13.8%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
사전 확인필요
445 
191 
<NA>
144 

Length

Max length7
Median length7
Mean length4.9769231
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사전 확인필요
2nd row사전 확인필요
3rd row사전 확인필요
4th row사전 확인필요
5th row

Common Values

ValueCountFrequency (%)
사전 확인필요 445
57.1%
191
24.5%
<NA> 144
 
18.5%

Length

2024-04-11T12:07:45.236191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-11T12:07:45.325674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사전 445
36.3%
확인필요 445
36.3%
191
15.6%
na 144
 
11.8%

지정정보처리장치
Categorical

IMBALANCE 

Distinct11
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
<NA>
581 
나라장터
120 
그 외
 
35
나라장터, 학교장터, 그 외
 
10
나라장터, 그 외
 
7
Other values (6)
 
27

Length

Max length15
Median length4
Mean length4.3102564
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 581
74.5%
나라장터 120
 
15.4%
그 외 35
 
4.5%
나라장터, 학교장터, 그 외 10
 
1.3%
나라장터, 그 외 7
 
0.9%
나라장터,학교장터 7
 
0.9%
학교장터 6
 
0.8%
나라장터,학교장터,그 외 6
 
0.8%
나라장터, 학교장터 5
 
0.6%
학교장터, 그 외 2
 
0.3%

Length

2024-04-11T12:07:45.422496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 581
66.4%
나라장터 143
 
16.3%
60
 
6.9%
55
 
6.3%
학교장터 23
 
2.6%
나라장터,학교장터 7
 
0.8%
나라장터,학교장터,그 6
 
0.7%
Distinct639
Distinct (%)82.1%
Missing2
Missing (%)0.3%
Memory size6.2 KiB
2024-04-11T12:07:45.702816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length193
Median length59
Mean length17.627249
Min length2

Characters and Unicode

Total characters13714
Distinct characters577
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique571 ?
Unique (%)73.4%

Sample

1st row학교밖 청소년 및 비진학 청년 대상 교육사업
2nd row창의융합교육 서비스
3rd row방과후 교육, 자격증
4th row산업 단체,영상콘텐츠 제작,미디어교육
5th row인조잔디 철거
ValueCountFrequency (%)
126
 
4.5%
66
 
2.4%
교육 36
 
1.3%
기타 33
 
1.2%
청소 30
 
1.1%
판매 29
 
1.0%
교육/행사/문화예술 21
 
0.7%
일반 21
 
0.7%
그외 21
 
0.7%
체험활동서비스 20
 
0.7%
Other values (1413) 2403
85.6%
2024-04-11T12:07:46.150377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2028
 
14.8%
, 985
 
7.2%
261
 
1.9%
239
 
1.7%
213
 
1.6%
174
 
1.3%
167
 
1.2%
/ 165
 
1.2%
161
 
1.2%
157
 
1.1%
Other values (567) 9164
66.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10157
74.1%
Space Separator 2028
 
14.8%
Other Punctuation 1169
 
8.5%
Uppercase Letter 201
 
1.5%
Close Punctuation 70
 
0.5%
Open Punctuation 69
 
0.5%
Decimal Number 11
 
0.1%
Lowercase Letter 6
 
< 0.1%
Math Symbol 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
261
 
2.6%
239
 
2.4%
213
 
2.1%
174
 
1.7%
167
 
1.6%
161
 
1.6%
157
 
1.5%
154
 
1.5%
151
 
1.5%
144
 
1.4%
Other values (523) 8336
82.1%
Uppercase Letter
ValueCountFrequency (%)
C 32
15.9%
D 31
15.4%
L 26
12.9%
E 25
12.4%
T 16
8.0%
V 13
6.5%
P 12
 
6.0%
I 11
 
5.5%
A 7
 
3.5%
S 5
 
2.5%
Other values (12) 23
11.4%
Other Punctuation
ValueCountFrequency (%)
, 985
84.3%
/ 165
 
14.1%
. 10
 
0.9%
& 4
 
0.3%
· 3
 
0.3%
: 1
 
0.1%
? 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 3
27.3%
3 3
27.3%
0 2
18.2%
2 2
18.2%
4 1
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
t 2
33.3%
i 2
33.3%
k 1
16.7%
o 1
16.7%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
2028
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10157
74.1%
Common 3350
 
24.4%
Latin 207
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
261
 
2.6%
239
 
2.4%
213
 
2.1%
174
 
1.7%
167
 
1.6%
161
 
1.6%
157
 
1.5%
154
 
1.5%
151
 
1.5%
144
 
1.4%
Other values (523) 8336
82.1%
Latin
ValueCountFrequency (%)
C 32
15.5%
D 31
15.0%
L 26
12.6%
E 25
12.1%
T 16
7.7%
V 13
6.3%
P 12
 
5.8%
I 11
 
5.3%
A 7
 
3.4%
S 5
 
2.4%
Other values (16) 29
14.0%
Common
ValueCountFrequency (%)
2028
60.5%
, 985
29.4%
/ 165
 
4.9%
) 70
 
2.1%
( 69
 
2.1%
. 10
 
0.3%
& 4
 
0.1%
1 3
 
0.1%
· 3
 
0.1%
3 3
 
0.1%
Other values (8) 10
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10157
74.1%
ASCII 3554
 
25.9%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2028
57.1%
, 985
27.7%
/ 165
 
4.6%
) 70
 
2.0%
( 69
 
1.9%
C 32
 
0.9%
D 31
 
0.9%
L 26
 
0.7%
E 25
 
0.7%
T 16
 
0.5%
Other values (33) 107
 
3.0%
Hangul
ValueCountFrequency (%)
261
 
2.6%
239
 
2.4%
213
 
2.1%
174
 
1.7%
167
 
1.6%
161
 
1.6%
157
 
1.5%
154
 
1.5%
151
 
1.5%
144
 
1.4%
Other values (523) 8336
82.1%
None
ValueCountFrequency (%)
· 3
100.0%

정제우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct573
Distinct (%)73.7%
Missing2
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean13965.089
Minimum10009
Maximum18614
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.0 KiB
2024-04-11T12:07:46.282683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10009
5-th percentile10463.55
Q111670
median14033
Q315886.75
95-th percentile18127.2
Maximum18614
Range8605
Interquartile range (IQR)4216.75

Descriptive statistics

Standard deviation2484.8698
Coefficient of variation (CV)0.17793441
Kurtosis-1.1517949
Mean13965.089
Median Absolute Deviation (MAD)2228.5
Skewness0.23529233
Sum10864839
Variance6174578
MonotonicityNot monotonic
2024-04-11T12:07:46.398741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16591 6
 
0.8%
12918 5
 
0.6%
17385 5
 
0.6%
15258 5
 
0.6%
11444 5
 
0.6%
17006 5
 
0.6%
12237 5
 
0.6%
12234 4
 
0.5%
13320 4
 
0.5%
15494 4
 
0.5%
Other values (563) 730
93.6%
ValueCountFrequency (%)
10009 3
0.4%
10013 1
 
0.1%
10017 1
 
0.1%
10020 1
 
0.1%
10025 1
 
0.1%
10037 1
 
0.1%
10048 2
0.3%
10057 1
 
0.1%
10060 1
 
0.1%
10079 1
 
0.1%
ValueCountFrequency (%)
18614 4
0.5%
18606 1
 
0.1%
18598 2
0.3%
18592 1
 
0.1%
18589 1
 
0.1%
18582 1
 
0.1%
18577 3
0.4%
18565 1
 
0.1%
18550 1
 
0.1%
18533 2
0.3%

정제WGS84위도
Real number (ℝ)

HIGH CORRELATION 

Distinct672
Distinct (%)86.7%
Missing5
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean37.485775
Minimum36.946474
Maximum38.158161
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.0 KiB
2024-04-11T12:07:46.516433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.946474
5-th percentile37.094073
Q137.305811
median37.455532
Q337.677974
95-th percentile37.871853
Maximum38.158161
Range1.2116869
Interquartile range (IQR)0.37216248

Descriptive statistics

Standard deviation0.24272197
Coefficient of variation (CV)0.006475042
Kurtosis-0.63474692
Mean37.485775
Median Absolute Deviation (MAD)0.1808691
Skewness0.11136268
Sum29051.476
Variance0.058913953
MonotonicityNot monotonic
2024-04-11T12:07:46.639975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.2637909377 6
 
0.8%
37.8168156964 4
 
0.5%
37.2913537558 4
 
0.5%
37.3350967217 4
 
0.5%
37.0940731389 4
 
0.5%
37.4316578307 4
 
0.5%
37.6356196123 4
 
0.5%
37.4355707194 4
 
0.5%
37.7716397207 3
 
0.4%
37.754599675 3
 
0.4%
Other values (662) 735
94.2%
(Missing) 5
 
0.6%
ValueCountFrequency (%)
36.9464737997 1
0.1%
36.9581089825 1
0.1%
36.9678021518 1
0.1%
36.9712290894 1
0.1%
36.9819183732 1
0.1%
36.9890470648 1
0.1%
36.9901906598 2
0.3%
36.990890878 1
0.1%
36.991379514 1
0.1%
36.992789064 1
0.1%
ValueCountFrequency (%)
38.1581607278 1
0.1%
38.1398873332 1
0.1%
38.1033659522 1
0.1%
38.049487645 1
0.1%
38.0374529628 1
0.1%
38.0324078062 1
0.1%
38.0293870693 1
0.1%
38.027920798 1
0.1%
38.0236526286 1
0.1%
38.0235862307 1
0.1%

정제WGS84경도
Real number (ℝ)

Distinct672
Distinct (%)86.7%
Missing5
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean127.00887
Minimum126.53768
Maximum127.79248
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.0 KiB
2024-04-11T12:07:46.769121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.53768
5-th percentile126.73718
Q1126.83748
median127.01872
Q3127.1405
95-th percentile127.32963
Maximum127.79248
Range1.2547941
Interquartile range (IQR)0.30301597

Descriptive statistics

Standard deviation0.19851799
Coefficient of variation (CV)0.0015630246
Kurtosis0.37412146
Mean127.00887
Median Absolute Deviation (MAD)0.14917507
Skewness0.4774235
Sum98431.874
Variance0.039409393
MonotonicityNot monotonic
2024-04-11T12:07:46.896415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.0028311044 6
 
0.8%
126.9882041974 4
 
0.5%
127.1481097299 4
 
0.5%
126.8448748891 4
 
0.5%
126.9011823221 4
 
0.5%
127.1745928428 4
 
0.5%
127.2064946291 4
 
0.5%
127.1290105785 4
 
0.5%
126.7335489255 3
 
0.4%
126.7813346334 3
 
0.4%
Other values (662) 735
94.2%
(Missing) 5
 
0.6%
ValueCountFrequency (%)
126.5376816212 1
0.1%
126.5560224786 1
0.1%
126.593945233 1
0.1%
126.5994491576 1
0.1%
126.6022922219 1
0.1%
126.604522595 1
0.1%
126.6061541746 1
0.1%
126.6172480699 1
0.1%
126.6201461202 1
0.1%
126.6290954267 1
0.1%
ValueCountFrequency (%)
127.7924757138 1
0.1%
127.7473549942 1
0.1%
127.6578722646 1
0.1%
127.6328039113 1
0.1%
127.6311245415 1
0.1%
127.6135400446 1
0.1%
127.6114108505 2
0.3%
127.5989166973 1
0.1%
127.5935364827 1
0.1%
127.5156458516 1
0.1%

Interactions

2024-04-11T12:07:38.492913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:37.965020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.250911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.568708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.090950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.329735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.652096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.171793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T12:07:38.408708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-11T12:07:46.981868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업유형분야면허및특허직접생산유무지정정보처리장치정제우편번호정제WGS84위도정제WGS84경도
기업유형1.0000.0000.6140.4140.8090.2540.3870.170
분야0.0001.0000.2760.3390.3440.3650.0000.316
면허및특허0.6140.2761.0000.2580.3290.3540.1530.000
직접생산유무0.4140.3390.2581.0000.2650.2580.2800.134
지정정보처리장치0.8090.3440.3290.2651.0000.0000.0000.160
정제우편번호0.2540.3650.3540.2580.0001.0000.9330.845
정제WGS84위도0.3870.0000.1530.2800.0000.9331.0000.615
정제WGS84경도0.1700.3160.0000.1340.1600.8450.6151.000
2024-04-11T12:07:47.076921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
직접생산유무지정정보처리장치
직접생산유무1.0000.199
지정정보처리장치0.1991.000
2024-04-11T12:07:47.153153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정제우편번호정제WGS84위도정제WGS84경도직접생산유무지정정보처리장치
정제우편번호1.000-0.9250.0720.1990.000
정제WGS84위도-0.9251.000-0.0640.2130.000
정제WGS84경도0.072-0.0641.0000.1020.071
직접생산유무0.1990.2130.1021.0000.199
지정정보처리장치0.0000.0000.0710.1991.000

Missing values

2024-04-11T12:07:38.771527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-11T12:07:38.939481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-11T12:07:39.089525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기업유형분야기업명대표자명연락처정제도로명주소정제지번주소홈페이지주요상품면허및특허직접생산유무지정정보처리장치주요품목정제우편번호정제WGS84위도정제WGS84경도
0사회적협동조합교육/상담사회적협동조합 배움과나눔천경배<NA>경기도 남양주시 평내로 167경기도 남양주시 평내동 597-2번지https://baeumnanum.org/교육?상담사전 확인필요사전 확인필요<NA>학교밖 청소년 및 비진학 청년 대상 교육사업1221937.647495127.243459
1사회적협동조합교육/상담새싹코딩협동조합박은경<NA><NA>경기도 광명시 철산동 241번지 철산13단지주공아파트https://cafe.naver.com/codinglab2교육?상담사전 확인필요사전 확인필요<NA>창의융합교육 서비스1423237.477629126.869532
2마을기업교육/상담새터마을협동조합온영란02-6015-1690<NA>경기도 광명시 광명동 732번지 광명중앙하이츠아파트blog.naver.com/darl교육?상담사전 확인필요사전 확인필요<NA>방과후 교육, 자격증1427837.469309126.854437
3사회적협동조합기타서비스(용역)마을미디어인스토리 협동유증종031-321-0870경기도 용인시 기흥구 동백5로 22경기도 용인시 기흥구 중동 838번지https://instory.tv/media영상물제작사전 확인필요사전 확인필요<NA>산업 단체,영상콘텐츠 제작,미디어교육1700637.275904127.151272
4사회적기업기타서비스(용역)사단법인희망나눔플러스박홍진<NA>경기도 양주시 은현면 화합로691번길 10-14경기도 양주시 은현면 운암리 466-7번지http://www.hsplus.or.kr/document/main/main.php운동장 인조잔디 철거 용역 서비스사전 확인필요<NA>인조잔디 철거1142637.871391127.007463
5청년기업기타서비스(용역)사회적협동조합 블루링크이동규<NA>경기도 의정부시 오목로205번길 36경기도 의정부시 민락동 835-1번지http://www.blcoop.or.kr/bbs/board.php?bo_table=bl_user장애인 보호작업장 운영, 물품 포장 등 사업사전 확인필요사전 확인필요<NA>장애인식개선 프로그램, 직무기능향상훈련 등1181337.745737127.097002
6사회적기업기타서비스(용역)㈜리맨구자덕, 장만호070-4652-5138경기도 포천시 내촌면 금강로 2975-23경기도 포천시 내촌면 소학리 12번지http://www.remann.co.kr/디지털 제품 재제조, IT 자산 일괄 처리ISO9001, 14001, 27001 인증, 마이크로소프트 MAR 인증<NA>태블릿, 노트북, 컴퓨터, 모니터, CCTV 등1118837.827929127.263415
7사회적기업농수축산물/식재료(주)떡찌니석지현031-769-1345경기도 광주시 곤지암읍 가마을길 203경기도 광주시 곤지암읍 부항리 368-1번지www.dduckzziny.co.kr기타식품류사전 확인필요사전 확인필요그 외떡볶이, 답례떡, 전통음료1272337.359604127.392248
8예비사회적기업농수축산물/식재료(주)애플하우스김경숙070-8866-7065경기도 부천시 원미구 원미로 205경기도 부천시 원미구 춘의동 193-4번지www.applehouse.or.kr화훼류사전 확인필요사전 확인필요<NA>친환경 화분, 친환경 연필1455637.500821126.79313
9사회적기업농수축산물/식재료(주)해피트리김용택031-835-0879경기도 연천군 청산면 학담로 177경기도 연천군 청산면 초성리 289-5번지https://www.instagram.com/bluericecakeofficial/기타식품류사전 확인필요사전 확인필요나라장터떡, 케이크, 빵류1102338.005123127.07399
기업유형분야기업명대표자명연락처정제도로명주소정제지번주소홈페이지주요상품면허및특허직접생산유무지정정보처리장치주요품목정제우편번호정제WGS84위도정제WGS84경도
770협동조합농수축산물/식재료경기두레소비자생활협동조합황홍순032-321-2378경기도 부천시 원미구 송내대로265번길 43경기도 부천시 원미구 상동 537-1번지www.bsdure.or.kr기타식품류사전 확인필요사전 확인필요<NA>친환경 농수산물/ 친환경생활용품/ 공정무역 제품/ 안전먹거리 교육/공정무역 교육 강사단1454237.507058126.756368
771사회적기업, 여성기업, 장애인기업생활용품주식회사 빛누리이진철, 류경미<NA>경기도 김포시 통진읍 고척로 203경기도 김포시 통진읍 고정리 268번지www.빛누리.comLED조명기구녹색기술 인증, KS인증, ISO9001<NA>LED평판등, 다운라이트, 가로등, 보안등1000937.719572126.593945
772예비사회적기업생활용품주식회사 해밀펫푸드이병길031-862-7582경기도 양주시 칠봉산로228번길 84-25경기도 양주시 봉양동 215-3번지<NA>사료 및 간식 제조 판매사전 확인필요사전 확인필요<NA>반려동물 간식1144837.856543127.081247
773협동조합, 마을기업생활용품청살림 영농협동조합최석봉031-585-3768경기도 가평군 설악면 미사리로 49경기도 가평군 설악면 선촌리 127번지blog.naver.com/csbsjs체험, 교육, 교육재료사전 확인필요사전 확인필요<NA>체험, 교육, 교육재료1246637.686117127.498435
774예비사회적기업, 여성기업, 소셜벤처생활용품하늘빚다최정임<NA>경기도 군포시 산본로431번안길 3-3경기도 군포시 산본동 1104번지https://shopping.naver.com/living/handmade/stores/1000006163생활용품사전 확인필요<NA>친환경 화분 제작1582037.367165126.925953
775사회적기업교육/상담주식회사 잇츠박은영031-378-6729경기도 오산시 밀머리로 64경기도 오산시 원동 578-4번지https://eats.quv.kr/교육?상담사전 확인필요<NA>1인 미디어 교육, 행사공연 물품대여, 문화예술 강사교육1814437.137099127.077127
776예비사회적기업견학/체험(주)한복문화연구소 한땀김경미031-402-7965경기도 안산시 단원구 중앙대로 784경기도 안산시 단원구 고잔동 453-53번지https://hanbokhanttam.modoo.at/견학?체험사전 확인필요사전 확인필요<NA>교육/행사/문화예술, 체험활동서비스, 의류1535637.316913126.822032
777사회적기업, 중증장애인생산시설, 녹색제품 인증비품사회복지법인 성만원공성옥<NA>경기도 용인시 처인구 백암면 죽양대로912번길 78경기도 용인시 처인구 백암면 백봉리 640-1번지<NA>비품사전 확인필요<NA>LED 조명기구1718037.14555127.3953
778사회적기업비품사회적협동조합 어우리오성록031-362-5727경기도 시흥시 공단1대로 341경기도 시흥시 정왕동 1289-6번지www.eouri.co.kr기타설치물정보통신공사업(제311754호)그 외CCTV, 표식장비, 멀티탭1507837.331616126.737828
779사회적기업비품아름다운창 주식회사윤중현031-665-9691경기도 평택시 팔용당길 66-52경기도 평택시 도일동 177-4번지http://www.xn--2j1bq9j21kenbp8m.com/기타설치물사전 확인필요나라장터커튼, 블라인드, 전동시스템, 강당커튼,무대막1772437.046619127.118575

Duplicate rows

Most frequently occurring

기업유형분야기업명대표자명연락처정제도로명주소정제지번주소홈페이지주요상품면허및특허직접생산유무지정정보처리장치주요품목정제우편번호정제WGS84위도정제WGS84경도# duplicates
0사회적기업, 자활기업기타서비스(용역)주식회사 컴윈정연철031-351-4576경기도 화성시 팔탄면 온천로 122-9경기도 화성시 팔탄면 덕우리 187-5번지COMWIN.CO.KRIT사전 확인필요나라장터IT, 불용자산 처리1857737.135133126.8649552
1예비사회적기업, 자활기업피복(주)휴먼컨스홍순임031-223-6996경기도 수원시 팔달구 세지로152번길 28경기도 수원시 팔달구 인계동 1008-24번지https://human007.modoo.at/의류사전 확인필요사전 확인필요<NA>홈패션1647937.264493127.0223832