Overview

Dataset statistics

Number of variables5
Number of observations552
Missing cells0
Missing cells (%)0.0%
Duplicate rows79
Duplicate rows (%)14.3%
Total size in memory22.2 KiB
Average record size in memory41.2 B

Variable types

Categorical3
Text2

Dataset

Description한국동서발전의 사회적기업지원현황 정보를 제공합니다. 사회적기업지원현황은 년도, 기관명, 사회적기업명, 기업소개(구매내용), 지원내용의 항목으로 구성됩니다.
URLhttps://www.data.go.kr/data/15065330/fileData.do

Alerts

년도 has constant value ""Constant
기관명 has constant value ""Constant
Dataset has 79 (14.3%) duplicate rowsDuplicates
지원내용 is highly imbalanced (74.2%)Imbalance

Reproduction

Analysis started2023-12-12 09:29:52.278426
Analysis finished2023-12-12 09:29:52.749007
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2022
552 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 552
100.0%

Length

2023-12-12T18:29:52.827278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:29:52.969909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 552
100.0%

기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
한국동서발전㈜
552 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국동서발전㈜
2nd row한국동서발전㈜
3rd row한국동서발전㈜
4th row한국동서발전㈜
5th row한국동서발전㈜

Common Values

ValueCountFrequency (%)
한국동서발전㈜ 552
100.0%

Length

2023-12-12T18:29:53.098924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:29:53.207348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국동서발전㈜ 552
100.0%
Distinct109
Distinct (%)19.7%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T18:29:53.490064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length9.6956522
Min length2

Characters and Unicode

Total characters5352
Distinct characters224
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)6.2%

Sample

1st row사회적협동조합 일원
2nd row주식회사 두루행복나눔터
3rd row(주)그린주의
4th row(주)두루행복한세상
5th row주식회사 맑은기업
ValueCountFrequency (%)
주식회사 89
 
11.0%
사회적협동조합 64
 
7.9%
일원 48
 
5.9%
주)다정플러스 31
 
3.8%
주)대영하이텍 25
 
3.1%
사단법인 24
 
3.0%
주)행복큐산업 24
 
3.0%
주)열린세상 23
 
2.8%
협동조합 21
 
2.6%
드림장애인보호작업장 19
 
2.3%
Other values (115) 442
54.6%
2023-12-12T18:29:53.973924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
329
 
6.1%
268
 
5.0%
258
 
4.8%
) 243
 
4.5%
( 242
 
4.5%
230
 
4.3%
166
 
3.1%
142
 
2.7%
130
 
2.4%
121
 
2.3%
Other values (214) 3223
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4590
85.8%
Space Separator 258
 
4.8%
Close Punctuation 243
 
4.5%
Open Punctuation 242
 
4.5%
Uppercase Letter 18
 
0.3%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
329
 
7.2%
268
 
5.8%
230
 
5.0%
166
 
3.6%
142
 
3.1%
130
 
2.8%
121
 
2.6%
106
 
2.3%
103
 
2.2%
102
 
2.2%
Other values (201) 2893
63.0%
Uppercase Letter
ValueCountFrequency (%)
S 4
22.2%
I 3
16.7%
C 3
16.7%
G 2
11.1%
B 2
11.1%
T 1
 
5.6%
O 1
 
5.6%
M 1
 
5.6%
R 1
 
5.6%
Space Separator
ValueCountFrequency (%)
258
100.0%
Close Punctuation
ValueCountFrequency (%)
) 243
100.0%
Open Punctuation
ValueCountFrequency (%)
( 242
100.0%
Decimal Number
ValueCountFrequency (%)
7 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4590
85.8%
Common 744
 
13.9%
Latin 18
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
329
 
7.2%
268
 
5.8%
230
 
5.0%
166
 
3.6%
142
 
3.1%
130
 
2.8%
121
 
2.6%
106
 
2.3%
103
 
2.2%
102
 
2.2%
Other values (201) 2893
63.0%
Latin
ValueCountFrequency (%)
S 4
22.2%
I 3
16.7%
C 3
16.7%
G 2
11.1%
B 2
11.1%
T 1
 
5.6%
O 1
 
5.6%
M 1
 
5.6%
R 1
 
5.6%
Common
ValueCountFrequency (%)
258
34.7%
) 243
32.7%
( 242
32.5%
7 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4590
85.8%
ASCII 762
 
14.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
329
 
7.2%
268
 
5.8%
230
 
5.0%
166
 
3.6%
142
 
3.1%
130
 
2.8%
121
 
2.6%
106
 
2.3%
103
 
2.2%
102
 
2.2%
Other values (201) 2893
63.0%
ASCII
ValueCountFrequency (%)
258
33.9%
) 243
31.9%
( 242
31.8%
S 4
 
0.5%
I 3
 
0.4%
C 3
 
0.4%
G 2
 
0.3%
B 2
 
0.3%
7 1
 
0.1%
T 1
 
0.1%
Other values (3) 3
 
0.4%
Distinct215
Distinct (%)38.9%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T18:29:54.367370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length6.7300725
Min length2

Characters and Unicode

Total characters3715
Distinct characters243
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique156 ?
Unique (%)28.3%

Sample

1st row사무용 소모품
2nd row홍보기념품
3rd row청소용품
4th row동결방지용 물품
5th row분말소화기 구매
ValueCountFrequency (%)
구매 359
32.4%
소모품 86
 
7.8%
비품 60
 
5.4%
청소용품 41
 
3.7%
방역소독 30
 
2.7%
사무용 24
 
2.2%
지급 23
 
2.1%
사급자재 16
 
1.4%
비품구매 15
 
1.4%
기념품 12
 
1.1%
Other values (236) 442
39.9%
2023-12-12T18:29:54.942907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
556
 
15.0%
395
 
10.6%
381
 
10.3%
315
 
8.5%
175
 
4.7%
163
 
4.4%
105
 
2.8%
87
 
2.3%
77
 
2.1%
57
 
1.5%
Other values (233) 1404
37.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3095
83.3%
Space Separator 556
 
15.0%
Uppercase Letter 55
 
1.5%
Decimal Number 3
 
0.1%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
395
 
12.8%
381
 
12.3%
315
 
10.2%
175
 
5.7%
163
 
5.3%
105
 
3.4%
87
 
2.8%
77
 
2.5%
57
 
1.8%
48
 
1.6%
Other values (220) 1292
41.7%
Uppercase Letter
ValueCountFrequency (%)
C 14
25.5%
L 9
16.4%
E 9
16.4%
D 9
16.4%
V 7
12.7%
T 7
12.7%
Decimal Number
ValueCountFrequency (%)
9 1
33.3%
1 1
33.3%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
556
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3095
83.3%
Common 565
 
15.2%
Latin 55
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
395
 
12.8%
381
 
12.3%
315
 
10.2%
175
 
5.7%
163
 
5.3%
105
 
3.4%
87
 
2.8%
77
 
2.5%
57
 
1.8%
48
 
1.6%
Other values (220) 1292
41.7%
Common
ValueCountFrequency (%)
556
98.4%
) 2
 
0.4%
( 2
 
0.4%
, 2
 
0.4%
9 1
 
0.2%
1 1
 
0.2%
2 1
 
0.2%
Latin
ValueCountFrequency (%)
C 14
25.5%
L 9
16.4%
E 9
16.4%
D 9
16.4%
V 7
12.7%
T 7
12.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3095
83.3%
ASCII 620
 
16.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
556
89.7%
C 14
 
2.3%
L 9
 
1.5%
E 9
 
1.5%
D 9
 
1.5%
V 7
 
1.1%
T 7
 
1.1%
) 2
 
0.3%
( 2
 
0.3%
, 2
 
0.3%
Other values (3) 3
 
0.5%
Hangul
ValueCountFrequency (%)
395
 
12.8%
381
 
12.3%
315
 
10.2%
175
 
5.7%
163
 
5.3%
105
 
3.4%
87
 
2.8%
77
 
2.5%
57
 
1.8%
48
 
1.6%
Other values (220) 1292
41.7%

지원내용
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
물품
528 
용역
 
24

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row물품
2nd row물품
3rd row물품
4th row물품
5th row물품

Common Values

ValueCountFrequency (%)
물품 528
95.7%
용역 24
 
4.3%

Length

2023-12-12T18:29:55.118118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:29:55.261343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물품 528
95.7%
용역 24
 
4.3%

Missing values

2023-12-12T18:29:52.595413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:29:52.704499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도기관명사회적기업명기업소개(구매내용)지원내용
02022한국동서발전㈜사회적협동조합 일원사무용 소모품물품
12022한국동서발전㈜주식회사 두루행복나눔터홍보기념품물품
22022한국동서발전㈜(주)그린주의청소용품물품
32022한국동서발전㈜(주)두루행복한세상동결방지용 물품물품
42022한국동서발전㈜주식회사 맑은기업분말소화기 구매물품
52022한국동서발전㈜(주)대영하이텍소모품 구매물품
62022한국동서발전㈜더부러 주식회사현장점검 용품물품
72022한국동서발전㈜홍애원정전작용용 분전반 구매물품
82022한국동서발전㈜사단법인 한국사회적일자리협회침입감지시스템물품
92022한국동서발전㈜(주)일렉콤조명등 구매물품
년도기관명사회적기업명기업소개(구매내용)지원내용
5422022한국동서발전㈜(주)그린엔젤스소모품 구매물품
5432022한국동서발전㈜(주)다정플러스비품 구매물품
5442022한국동서발전㈜창익비품 구매물품
5452022한국동서발전㈜사회적협동조합 푸른하늘사급자재 구매물품
5462022한국동서발전㈜(주)성운통상방한용품 구매물품
5472022한국동서발전㈜주식회사 두루행복나눔터홍보기념품 구매물품
5482022한국동서발전㈜(주)대영하이텍소모품 구매물품
5492022한국동서발전㈜사회적협동조합 일원관리용품 구매물품
5502022한국동서발전㈜사단법인 한국장애인고용창출협회 나누리가구사업단비품구매물품
5512022한국동서발전㈜사단번인 한국장애인고용창출협회비품구매물품

Duplicate rows

Most frequently occurring

년도기관명사회적기업명기업소개(구매내용)지원내용# duplicates
552022한국동서발전㈜사회적협동조합 일원청소용품 구매물품11
402022한국동서발전㈜드림장애인보호작업장방역소독물품10
42022한국동서발전㈜(주)그린주의청소용품 구매물품8
142022한국동서발전㈜(주)동연디자인비품 구매물품8
302022한국동서발전㈜(주)행복큐산업소모품 구매물품8
122022한국동서발전㈜(주)대영하이텍소모품 구매물품7
422022한국동서발전㈜사단번인 한국장애인고용창출협회비품 구매물품7
612022한국동서발전㈜주식회사 다정플러스청소용품 구매물품7
222022한국동서발전㈜(주)열린세상소모품 구매물품6
352022한국동서발전㈜녹색마을 협동조합방역소독물품6