Overview

Dataset statistics

Number of variables4
Number of observations75
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory34.8 B

Variable types

Categorical2
Text2

Dataset

Description1.사업내용 : 대전 지역 우수 기업 홍보 동영상 제작 2.대상기업 : 고용우수인증 기업 및 좋은일터 인증 사업장 3.지원규모 : 연 15개 업체 내외 4.소요예산 : 30,000천원
Author대전광역시
URLhttps://www.data.go.kr/data/15079887/fileData.do

Alerts

연도 is highly overall correlated with 선정이유High correlation
선정이유 is highly overall correlated with 연도High correlation

Reproduction

Analysis started2023-12-12 19:40:13.970236
Analysis finished2023-12-12 19:40:14.456042
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size732.0 B
2019
30 
2020
16 
2021
15 
2022
14 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 30
40.0%
2020 16
21.3%
2021 15
20.0%
2022 14
18.7%

Length

2023-12-13T04:40:14.538367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:40:14.677379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 30
40.0%
2020 16
21.3%
2021 15
20.0%
2022 14
18.7%
Distinct74
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size732.0 B
2023-12-13T04:40:14.905893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length6.72
Min length3

Characters and Unicode

Total characters504
Distinct characters189
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)97.3%

Sample

1st row(사)대전광역시컨택센터협회
2nd row(사)삼진정밀
3rd row㈜옵트론텍
4th row㈜에르코스
5th row㈜알에프세미
ValueCountFrequency (%)
㈜솔탑 2
 
2.5%
마이크로닉시스템㈜ 1
 
1.3%
㈜래트론 1
 
1.3%
㈜광재상사 1
 
1.3%
㈜금강일보사 1
 
1.3%
의)유성선병원 1
 
1.3%
㈜호텔icc 1
 
1.3%
㈜그린푸드앤케어 1
 
1.3%
㈜w여성병원 1
 
1.3%
㈜jpc오토모티브 1
 
1.3%
Other values (68) 68
86.1%
2023-12-13T04:40:15.331159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
12.9%
21
 
4.2%
21
 
4.2%
10
 
2.0%
8
 
1.6%
( 7
 
1.4%
) 7
 
1.4%
7
 
1.4%
6
 
1.2%
6
 
1.2%
Other values (179) 346
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 410
81.3%
Other Symbol 65
 
12.9%
Uppercase Letter 11
 
2.2%
Open Punctuation 7
 
1.4%
Close Punctuation 7
 
1.4%
Space Separator 4
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
5.1%
21
 
5.1%
10
 
2.4%
8
 
2.0%
7
 
1.7%
6
 
1.5%
6
 
1.5%
6
 
1.5%
6
 
1.5%
6
 
1.5%
Other values (166) 313
76.3%
Uppercase Letter
ValueCountFrequency (%)
C 3
27.3%
K 1
 
9.1%
L 1
 
9.1%
P 1
 
9.1%
J 1
 
9.1%
S 1
 
9.1%
E 1
 
9.1%
W 1
 
9.1%
I 1
 
9.1%
Other Symbol
ValueCountFrequency (%)
65
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 475
94.2%
Common 18
 
3.6%
Latin 11
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
13.7%
21
 
4.4%
21
 
4.4%
10
 
2.1%
8
 
1.7%
7
 
1.5%
6
 
1.3%
6
 
1.3%
6
 
1.3%
6
 
1.3%
Other values (167) 319
67.2%
Latin
ValueCountFrequency (%)
C 3
27.3%
K 1
 
9.1%
L 1
 
9.1%
P 1
 
9.1%
J 1
 
9.1%
S 1
 
9.1%
E 1
 
9.1%
W 1
 
9.1%
I 1
 
9.1%
Common
ValueCountFrequency (%)
( 7
38.9%
) 7
38.9%
4
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 410
81.3%
None 65
 
12.9%
ASCII 29
 
5.8%

Most frequent character per block

None
ValueCountFrequency (%)
65
100.0%
Hangul
ValueCountFrequency (%)
21
 
5.1%
21
 
5.1%
10
 
2.4%
8
 
2.0%
7
 
1.7%
6
 
1.5%
6
 
1.5%
6
 
1.5%
6
 
1.5%
6
 
1.5%
Other values (166) 313
76.3%
ASCII
ValueCountFrequency (%)
( 7
24.1%
) 7
24.1%
4
13.8%
C 3
10.3%
K 1
 
3.4%
L 1
 
3.4%
P 1
 
3.4%
J 1
 
3.4%
S 1
 
3.4%
E 1
 
3.4%
Other values (2) 2
 
6.9%

선정이유
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size732.0 B
좋은일터선정 기업
24 
고용우수인증 기업
21 
21년 좋은일터선정 기업
14 
22년 좋은일터선정 기업
22년 청년친화 강소기업
Other values (3)

Length

Max length15
Median length9
Mean length10.6
Min length9

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row컨텍센터협회 선정
2nd row좋은일터선정 기업
3rd row좋은일터선정 기업
4th row좋은일터선정 기업
5th row좋은일터선정 기업

Common Values

ValueCountFrequency (%)
좋은일터선정 기업 24
32.0%
고용우수인증 기업 21
28.0%
21년 좋은일터선정 기업 14
18.7%
22년 좋은일터선정 기업 6
 
8.0%
22년 청년친화 강소기업 5
 
6.7%
19년 좋은일터선정 기업 2
 
2.7%
21년 일하기 좋은 중소기업 2
 
2.7%
컨텍센터협회 선정 1
 
1.3%

Length

2023-12-13T04:40:15.486593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:40:15.626578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기업 67
37.0%
좋은일터선정 46
25.4%
고용우수인증 21
 
11.6%
21년 16
 
8.8%
22년 11
 
6.1%
청년친화 5
 
2.8%
강소기업 5
 
2.8%
19년 2
 
1.1%
일하기 2
 
1.1%
좋은 2
 
1.1%
Other values (3) 4
 
2.2%
Distinct65
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Memory size732.0 B
2023-12-13T04:40:15.926730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.4666667
Min length2

Characters and Unicode

Total characters260
Distinct characters71
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)74.7%

Sample

1st row박*구
2nd row정*희
3rd row홍*관
4th row김*기
5th row이*효
ValueCountFrequency (%)
선*훈 3
 
3.7%
김*기 2
 
2.4%
정*호 2
 
2.4%
최*진 2
 
2.4%
사**보 2
 
2.4%
김*범 2
 
2.4%
김*주 2
 
2.4%
이*석 2
 
2.4%
천*석 2
 
2.4%
정*용 2
 
2.4%
Other values (61) 61
74.4%
2023-12-13T04:40:16.368081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 83
31.9%
19
 
7.3%
14
 
5.4%
10
 
3.8%
7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
, 6
 
2.3%
5
 
1.9%
Other values (61) 97
37.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 163
62.7%
Other Punctuation 89
34.2%
Space Separator 7
 
2.7%
Decimal Number 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
11.7%
14
 
8.6%
10
 
6.1%
7
 
4.3%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (57) 83
50.9%
Other Punctuation
ValueCountFrequency (%)
* 83
93.3%
, 6
 
6.7%
Space Separator
ValueCountFrequency (%)
7
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 163
62.7%
Common 97
37.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
11.7%
14
 
8.6%
10
 
6.1%
7
 
4.3%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (57) 83
50.9%
Common
ValueCountFrequency (%)
* 83
85.6%
7
 
7.2%
, 6
 
6.2%
5 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 163
62.7%
ASCII 97
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 83
85.6%
7
 
7.2%
, 6
 
6.2%
5 1
 
1.0%
Hangul
ValueCountFrequency (%)
19
 
11.7%
14
 
8.6%
10
 
6.1%
7
 
4.3%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (57) 83
50.9%

Correlations

2023-12-13T04:40:16.468711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도기업명선정이유대표자
연도1.0000.8720.9880.526
기업명0.8721.0001.0001.000
선정이유0.9881.0001.0000.899
대표자0.5261.0000.8991.000
2023-12-13T04:40:16.567546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도선정이유
연도1.0000.825
선정이유0.8251.000
2023-12-13T04:40:16.654279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도선정이유
연도1.0000.825
선정이유0.8251.000

Missing values

2023-12-13T04:40:14.302699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:40:14.412240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도기업명선정이유대표자
02019(사)대전광역시컨택센터협회컨텍센터협회 선정박*구
12019(사)삼진정밀좋은일터선정 기업정*희
22019㈜옵트론텍좋은일터선정 기업홍*관
32019㈜에르코스좋은일터선정 기업김*기
42019㈜알에프세미좋은일터선정 기업이*효
52019㈜에브릿좋은일터선정 기업정*수
62019㈜솔탑좋은일터선정 기업사**보
72019㈜바이오니아좋은일터선정 기업박*오
82019로쏘㈜성심당좋은일터선정 기업임*진
92019㈜한국엔지니어링웍스좋은일터선정 기업송*호
연도기업명선정이유대표자
652022㈜예람21년 일하기 좋은 중소기업강*돈
662022㈜해솔정보통신21년 일하기 좋은 중소기업조*현
672022㈜유니오텍22년 좋은일터선정 기업김*하
682022㈜인포스22년 좋은일터선정 기업최*진
692022㈜지엔소프트22년 청년친화 강소기업김*수
702022㈜하이브파트너스21년 좋은일터선정 기업조*호
712022㈜한살림대전22년 좋은일터선정 기업김*원
722022㈜레고켐 바이오사이언스22년 청년친화 강소기업김*주
732022㈜엑스엠더블유22년 청년친화 강소기업이*석
742022㈜디엔에프22년 청년친화 강소기업김*운