Overview

Dataset statistics

Number of variables: 5
Number of observations: 35
Missing cells: 30
Missing cells (%): 17.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 43.8 B
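
The percentages in this table follow from the raw counts. A minimal sketch of the arithmetic, using only the figures reported above (the variable names are illustrative, and the record size is only approximate because the reported 1.5 KiB is itself rounded):

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 35
missing_cells = 30
total_size_bytes = 1.5 * 1024  # 1.5 KiB as reported (rounded value)

# Every variable has one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate only: the input size is already rounded to one decimal.
avg_record_bytes = total_size_bytes / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"approximate record size: {avg_record_bytes:.1f} B")
```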

Variable types

Unsupported: 1
Text: 3
Categorical: 1

Dataset

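Description: Status of the companies selected as excellent job-creation enterprises by Gimhae-si, a program first run in 2019. The data consists of year, company name, location, representative, and industry.
Author: Gimhae-si, Gyeongsangnam-do (경상남도 김해시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15106079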

Alerts

김해시 일자리 우수기업 현황 has 30 (85.7%) missing values [Missing]
Unnamed: 1 has unique values [Unique]
김해시 일자리 우수기업 현황 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2023-12-10 23:40:32.546780
Analysis finished: 2023-12-10 23:40:32.951625
Duration: 0.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
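
A report like this one can usually be regenerated with a few lines of ydata-profiling. The sketch below is an assumption about how that might look, not the configuration actually used: the CSV path, the output file name, and the choice of `read_csv` (the source file may be an Excel sheet instead) are all hypothetical.

```python
def build_report(csv_path, out_path="report.html"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Hypothetical input: encoding and reader arguments may need adjusting.
    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="김해시 일자리 우수기업 현황")
    profile.to_file(out_path)
    return out_path
```

Calling `build_report("gimhae_jobs.csv")` (a made-up file name) would write `report.html` next to the script.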

Variables

김해시 일자리 우수기업 현황
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 30
Missing (%): 85.7%
Memory size: 412.0 B

Unnamed: 1
Text

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 12
Median length: 11
Mean length: 5.9142857
Min length: 3

Characters and Unicode

Total characters: 207
Distinct characters: 95
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
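
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses four company names from the Sample section; it is not the code the profiling tool runs. The codes `Lo`, `So`, and `Zs` correspond to the report's "Other Letter", "Other Symbol", and "Space Separator".

```python
import unicodedata
from collections import Counter

# Four values copied from the Sample section of this variable.
names = ["정아정밀㈜", "㈜에이치에스텍", "기득산업㈜", "㈜주노텍"]

def category_counts(values):
    """Count Unicode general categories over every character in values."""
    return Counter(unicodedata.category(ch) for v in values for ch in v)

counts = category_counts(names)
for category, count in counts.most_common():
    print(category, count)
```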

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 업체명
2nd row: 정아정밀㈜
3rd row: ㈜에이치에스텍
4th row: 기득산업㈜
5th row: ㈜주노텍

Value | Count | Frequency (%)
주식회사 | 8 | 17.8%
업체명 | 1 | 2.2%
이닉스 | 1 | 2.2%
㈜삼오 | 1 | 2.2%
세종플렉스 | 1 | 2.2%
대신기계 | 1 | 2.2%
대흥공업㈜ | 1 | 2.2%
신천기계공업㈜ | 1 | 2.2%
인큐스 | 1 | 2.2%
㈜엘앤지 | 1 | 2.2%
Other values (28) | 28 | 62.2%
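
The table above appears to count whitespace-separated words rather than whole values, since 주식회사 is listed on its own and the counts add up to more than the 35 observations. A small sketch of that kind of word count, assuming plain whitespace splitting (the tool's actual tokenizer may differ) and using four names from the Sample section:

```python
from collections import Counter

# Company names copied from the Sample section at the end of the report.
names = ["주식회사 인큐스", "주식회사 그린푸드밸리", "㈜월드튜브", "고모텍㈜"]

# Split each value on whitespace and count the resulting words.
words = Counter(word for name in names for word in name.split())
total = sum(words.values())

for word, count in words.most_common(3):
    print(word, count, f"{100 * count / total:.1f}%")
```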

Most occurring characters

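Value | Count | Frequency (%)
㈜ | 20 | 9.7%
(space) | 10 | 4.8%
(not recovered) | 9 | 4.3%
(not recovered) | 9 | 4.3%
(not recovered) | 8 | 3.9%
(not recovered) | 8 | 3.9%
(not recovered) | 7 | 3.4%
(not recovered) | 7 | 3.4%
(not recovered) | 6 | 2.9%
(not recovered) | 6 | 2.9%
Other values (85) | 117 | 56.5%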

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 177 | 85.5%
Other Symbol | 20 | 9.7%
Space Separator | 10 | 4.8%

Most frequent character per category

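Other Letter
Value | Count | Frequency (%)
(not recovered) | 9 | 5.1%
(not recovered) | 9 | 5.1%
(not recovered) | 8 | 4.5%
(not recovered) | 8 | 4.5%
(not recovered) | 7 | 4.0%
(not recovered) | 7 | 4.0%
(not recovered) | 6 | 3.4%
(not recovered) | 6 | 3.4%
(not recovered) | 5 | 2.8%
(not recovered) | 4 | 2.3%
Other values (83) | 108 | 61.0%

Other Symbol
Value | Count | Frequency (%)
㈜ | 20 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 10 | 100.0%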

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 197 | 95.2%
Common | 10 | 4.8%

Most frequent character per script

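Hangul
Value | Count | Frequency (%)
㈜ | 20 | 10.2%
(not recovered) | 9 | 4.6%
(not recovered) | 9 | 4.6%
(not recovered) | 8 | 4.1%
(not recovered) | 8 | 4.1%
(not recovered) | 7 | 3.6%
(not recovered) | 7 | 3.6%
(not recovered) | 6 | 3.0%
(not recovered) | 6 | 3.0%
(not recovered) | 5 | 2.5%
Other values (84) | 112 | 56.9%

Common
Value | Count | Frequency (%)
(space) | 10 | 100.0%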

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 177 | 85.5%
None | 20 | 9.7%
ASCII | 10 | 4.8%

Most frequent character per block

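None
Value | Count | Frequency (%)
㈜ | 20 | 100.0%

ASCII
Value | Count | Frequency (%)
(space) | 10 | 100.0%

Hangul
Value | Count | Frequency (%)
(not recovered) | 9 | 5.1%
(not recovered) | 9 | 5.1%
(not recovered) | 8 | 4.5%
(not recovered) | 8 | 4.5%
(not recovered) | 7 | 4.0%
(not recovered) | 7 | 4.0%
(not recovered) | 6 | 3.4%
(not recovered) | 6 | 3.4%
(not recovered) | 5 | 2.8%
(not recovered) | 4 | 2.3%
Other values (83) | 108 | 61.0%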

Unnamed: 2
Text

Distinct: 34
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 105
Distinct characters: 53
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 33
Unique (%): 94.3%
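
The gap between the distinct count (34) and the unique count (33) is consistent with reading "distinct" as the number of different values and "unique" as the number of values that occur exactly once: the name 공경열 appears twice, so it is distinct but not unique. A minimal sketch of that reading, using a shortened sample (the function name is illustrative):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Names from the Sample section; 공경열 occurs twice in the data.
sample = ["김용진", "김관락", "공경열", "김민석", "공경열"]
print(distinct_and_unique(sample))
```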

Sample

1st row: 대표자
2nd row: 김용진
3rd row: 김관락
4th row: 공경열
5th row: 김민석

Value | Count | Frequency (%)
공경열 | 2 | 5.7%
설경숙 | 1 | 2.9%
김상우 | 1 | 2.9%
이상호 | 1 | 2.9%
김종재 | 1 | 2.9%
장영탁 | 1 | 2.9%
윤일진 | 1 | 2.9%
강동호 | 1 | 2.9%
정길동 | 1 | 2.9%
김석조 | 1 | 2.9%
Other values (24) | 24 | 68.6%

Most occurring characters

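Value | Count | Frequency (%)
(not recovered) | 14 | 13.3%
(not recovered) | 7 | 6.7%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
Other values (43) | 56 | 53.3%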

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 105 | 100.0%

Most frequent character per category

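Other Letter
Value | Count | Frequency (%)
(not recovered) | 14 | 13.3%
(not recovered) | 7 | 6.7%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
Other values (43) | 56 | 53.3%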

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 105 | 100.0%

Most frequent character per script

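Hangul
Value | Count | Frequency (%)
(not recovered) | 14 | 13.3%
(not recovered) | 7 | 6.7%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
Other values (43) | 56 | 53.3%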

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 105 | 100.0%

Most frequent character per block

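Hangul
Value | Count | Frequency (%)
(not recovered) | 14 | 13.3%
(not recovered) | 7 | 6.7%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 4 | 3.8%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
(not recovered) | 3 | 2.9%
Other values (43) | 56 | 53.3%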

Unnamed: 3
Categorical

Distinct: 12
Distinct (%): 34.3%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 5
Median length: 3
Mean length: 3.0571429
Min length: 3

Unique

Unique: 6
Unique (%): 17.1%

Sample

1st row: 소재지
2nd row: 주촌면
3rd row: 생림면
4th row: 부곡동
5th row: 생림면

Common Values

Value | Count | Frequency (%)
진영읍 | 8 | 22.9%
주촌면 | 6 | 17.1%
생림면 | 6 | 17.1%
진례면 | 5 | 14.3%
부곡동 | 2 | 5.7%
한림면 | 2 | 5.7%
소재지 | 1 | 2.9%
안 동 | 1 | 2.9%
상동면 | 1 | 2.9%
지내동 | 1 | 2.9%
Other values (2) | 2 | 5.7%
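
The value "안 동" contains an inner space, which is why this variable's maximum length is 5 while most place names are 3 characters long. If the space is stray formatting (an assumption; the source does not say so), it can be removed before counting. The sketch below shows one way to do that; the second spelling, "안동", is added purely for illustration and does not appear in the data.

```python
import re
from collections import Counter

def normalize_place(value):
    """Remove all whitespace from a place name (assumes the spaces are stray)."""
    return re.sub(r"\s+", "", value)

# "안 동" is from the data; "안동" is a hypothetical second spelling.
raw = ["진영읍", "안 동", "주촌면", "안동"]
cleaned = Counter(normalize_place(v) for v in raw)
print(cleaned.most_common())
```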

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
진영읍 | 8 | 22.2%
주촌면 | 6 | 16.7%
생림면 | 6 | 16.7%
진례면 | 5 | 13.9%
부곡동 | 2 | 5.6%
한림면 | 2 | 5.6%
소재지 | 1 | 2.8%
(not recovered) | 1 | 2.8%
(not recovered) | 1 | 2.8%
상동면 | 1 | 2.8%
Other values (3) | 3 | 8.3%

Unnamed: 4
Text

Distinct: 29
Distinct (%): 82.9%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 23
Median length: 15
Mean length: 11.171429
Min length: 2

Characters and Unicode

Total characters: 391
Distinct characters: 88
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 26
Unique (%): 74.3%

Sample

1st row: 업종
2nd row: 제조(자동차부품)
3rd row: 제조(중장비부품)
4th row: 제조(산업기계,철강)
5th row: 제조(산업기계제작 등)

Value | Count | Frequency (%)
(not recovered) | 24 | 32.0%
제조(식육포장 | 4 | 5.3%
제조(자동차부품 | 4 | 5.3%
부품 | 3 | 4.0%
제조(자동차 | 3 | 4.0%
제조(파이프,튜브 | 1 | 1.3%
터빈 | 1 | 1.3%
내연기관 | 1 | 1.3%
제조(유기성 | 1 | 1.3%
폐기물 | 1 | 1.3%
Other values (32) | 32 | 42.7%

Most occurring characters

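Value | Count | Frequency (%)
(space) | 40 | 10.2%
(not recovered) | 36 | 9.2%
(not recovered) | 35 | 9.0%
( | 34 | 8.7%
) | 34 | 8.7%
(not recovered) | 24 | 6.1%
(not recovered) | 11 | 2.8%
(not recovered) | 10 | 2.6%
(not recovered) | 8 | 2.0%
(not recovered) | 8 | 2.0%
Other values (78) | 151 | 38.6%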

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 277 | 70.8%
Space Separator | 40 | 10.2%
Open Punctuation | 34 | 8.7%
Close Punctuation | 34 | 8.7%
Other Punctuation | 6 | 1.5%

Most frequent character per category

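Other Letter
Value | Count | Frequency (%)
(not recovered) | 36 | 13.0%
(not recovered) | 35 | 12.6%
(not recovered) | 24 | 8.7%
(not recovered) | 11 | 4.0%
(not recovered) | 10 | 3.6%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
Other values (74) | 121 | 43.7%

Space Separator
Value | Count | Frequency (%)
(space) | 40 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 34 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 34 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 6 | 100.0%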

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 277 | 70.8%
Common | 114 | 29.2%

Most frequent character per script

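Hangul
Value | Count | Frequency (%)
(not recovered) | 36 | 13.0%
(not recovered) | 35 | 12.6%
(not recovered) | 24 | 8.7%
(not recovered) | 11 | 4.0%
(not recovered) | 10 | 3.6%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
Other values (74) | 121 | 43.7%

Common
Value | Count | Frequency (%)
(space) | 40 | 35.1%
( | 34 | 29.8%
) | 34 | 29.8%
, | 6 | 5.3%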

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 277 | 70.8%
ASCII | 114 | 29.2%

Most frequent character per block

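ASCII
Value | Count | Frequency (%)
(space) | 40 | 35.1%
( | 34 | 29.8%
) | 34 | 29.8%
, | 6 | 5.3%

Hangul
Value | Count | Frequency (%)
(not recovered) | 36 | 13.0%
(not recovered) | 35 | 12.6%
(not recovered) | 24 | 8.7%
(not recovered) | 11 | 4.0%
(not recovered) | 10 | 3.6%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
(not recovered) | 8 | 2.9%
Other values (74) | 121 | 43.7%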

Correlations

 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4
Unnamed: 1 | 1.000 | 1.000 | 1.000 | 1.000
Unnamed: 2 | 1.000 | 1.000 | 1.000 | 0.964
Unnamed: 3 | 1.000 | 1.000 | 1.000 | 0.816
Unnamed: 4 | 1.000 | 0.964 | 0.816 | 1.000
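
The report does not name the association measure behind this matrix. For text and categorical columns, ydata-profiling's default setting appears to use Cramér's V, so the sketch below assumes that measure and implements the plain, uncorrected form; the tool may apply a bias correction that is omitted here. It also shows why values of 1.000 are expected: when one column has a different value in every row, the association is trivially perfect.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row = Counter(xs)
    col = Counter(ys)
    joint = Counter(zip(xs, ys))

    # Pearson chi-square statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row.items():
        for y, col_total in col.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))

# A column with all-unique values is perfectly associated with any other.
all_unique = ["a", "b", "c", "d"]
grouped = ["x", "x", "y", "y"]
print(cramers_v(all_unique, grouped))
```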

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
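
The numbers behind these nullity plots are simple per-column counts. A minimal sketch, assuming that `None` and float NaN both count as missing (the helper names are illustrative), applied to the first five values of the first column:

```python
import math

def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and math.isnan(value))

def missing_share(column):
    """Return (missing count, missing percentage) for a list of values."""
    missing = sum(1 for v in column if is_missing(v))
    return missing, 100 * missing / len(column)

# First five values of the first column, copied from the Sample section.
year_column = ["연도", "2019", float("nan"), float("nan"), float("nan")]
count, pct = missing_share(year_column)
print(count, f"{pct:.1f}%")
```

Applied to the full column, the same calculation gives the 30 missing values out of 35 (85.7%) reported in the Alerts section.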

Sample

 | 김해시 일자리 우수기업 현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4
0 | 연도 | 업체명 | 대표자 | 소재지 | 업종
1 | 2019 | 정아정밀㈜ | 김용진 | 주촌면 | 제조(자동차부품)
2 | NaN | ㈜에이치에스텍 | 김관락 | 생림면 | 제조(중장비부품)
3 | NaN | 기득산업㈜ | 공경열 | 부곡동 | 제조(산업기계,철강)
4 | NaN | ㈜주노텍 | 김민석 | 생림면 | 제조(산업기계제작 등)
5 | NaN | ㈜우진정밀 | 김철곤 | 생림면 | 제조(동력전달장치 등)
6 | 2020 | 대명산업기술㈜ | 김환기 | 진례면 | 제조(기계장비 등)
7 | NaN | 경원벤턱㈜ | 공경열 | 부곡동 | 제조(조선기자재 등)
8 | NaN | ㈜세계산업 | 전병안 | 안 동 | 제조(자동차부품 등)
9 | NaN | 남성정밀 | 박희망 | 상동면 | 제조(관이음쇠 등)

 | 김해시 일자리 우수기업 현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4
25 | NaN | 주식회사 인큐스 | 김명수 | 진영읍 | 제조(유기성 폐기물 소멸장치 재생원료 등)
26 | NaN | 주식회사 그린푸드밸리 | 정길동 | 삼안동 | 제조(식육포장)
27 | NaN | 오토워시 스토어 | 김민재 | 풍유동 | 제조(세차용품)
28 | 2022(상반기) | ㈜월드튜브 | 설경숙 | 진례면 | 제조(파이프,튜브)
29 | NaN | 고모텍㈜ | 윤일진 | 진례면 | 제조(소형가전, 냉장고 등)
30 | NaN | ㈜케이디에이 | 장영탁 | 진영읍 | 제조(자동차 부품 등)
31 | NaN | 주식회사 하이스텐 | 김종재 | 주촌면 | 제조(밸브, 관이음쇠 등)
32 | NaN | ㈜착한떡 | 이상호 | 진영읍 | 제조(식품가공)
33 | NaN | 주식회사 피트쿡 | 김상우 | 주촌면 | 제조(식육포장)
34 | NaN | ㈜타누스 | 이영기 | 진영읍 | 제조(고무, 타이어 등)
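
The sample shows that the real field names (연도, 업체명, 대표자, 소재지, 업종) sit in row 0, and that the year is written only on the first row of each group. A minimal cleaning sketch in plain Python follows. It assumes a blank year means "same year as the last one stated", which is typical of merged spreadsheet cells but is not confirmed by the source. The function name is illustrative.

```python
def promote_header_and_fill(rows):
    """Use the first row as field names and forward-fill the first column."""
    header, *body = rows
    records = []
    last_seen = None
    for row in body:
        # Carry the last stated year forward when the cell is empty.
        first = row[0] if row[0] is not None else last_seen
        last_seen = first
        records.append(dict(zip(header, [first, *row[1:]])))
    return records

# Rows 0 to 3 of the sample, with NaN written as None.
rows = [
    ["연도", "업체명", "대표자", "소재지", "업종"],
    ["2019", "정아정밀㈜", "김용진", "주촌면", "제조(자동차부품)"],
    [None, "㈜에이치에스텍", "김관락", "생림면", "제조(중장비부품)"],
    [None, "기득산업㈜", "공경열", "부곡동", "제조(산업기계,철강)"],
]
records = promote_header_and_fill(rows)
print(records[1]["연도"], records[1]["업체명"])
```

With pandas, a comparable result would likely come from passing `header=1` to the reader and calling `ffill()` on the year column. Re-profiling the cleaned table should then clear the Unsupported and Missing alerts for the first column.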