Overview

Dataset statistics

Number of variables6
Number of observations165
Missing cells165
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.0 KiB
Average record size in memory49.8 B

Variable types

Categorical1
Text3
DateTime1
Unsupported1

Dataset

Description경제통상국 일자리정책과 취업지원팀 고용우수 인증기업 현황
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2716

Alerts

비고 has 165 (100.0%) missing valuesMissing
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-09 21:15:12.301885
Analysis finished2024-01-09 21:15:12.782908
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군
Categorical

Distinct12
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
청주
59 
음성
30 
진천
21 
충주
19 
제천
10 
Other values (7)
26 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row음성
2nd row진천
3rd row제천
4th row음성
5th row충주

Common Values

ValueCountFrequency (%)
청주 59
35.8%
음성 30
18.2%
진천 21
 
12.7%
충주 19
 
11.5%
제천 10
 
6.1%
괴산 7
 
4.2%
옥천 5
 
3.0%
보은 4
 
2.4%
단양 4
 
2.4%
영동 3
 
1.8%
Other values (2) 3
 
1.8%

Length

2024-01-10T06:15:12.836180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
청주 59
35.8%
음성 30
18.2%
진천 21
 
12.7%
충주 19
 
11.5%
제천 10
 
6.1%
괴산 7
 
4.2%
옥천 5
 
3.0%
보은 4
 
2.4%
단양 4
 
2.4%
영동 3
 
1.8%
Other values (2) 3
 
1.8%
Distinct157
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-10T06:15:13.014979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length5.8181818
Min length3

Characters and Unicode

Total characters960
Distinct characters203
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique149 ?
Unique (%)90.3%

Sample

1st row㈜금아플로우
2nd row㈜)동하정밀
3rd row씨엔에스푸드시스템
4th row㈜엘시시
5th row㈜엠테크
ValueCountFrequency (%)
주식회사 3
 
1.7%
㈜대현하이텍 2
 
1.1%
㈜제론텍 2
 
1.1%
㈜원앤씨 2
 
1.1%
뉴그린창신㈜ 2
 
1.1%
㈜코아아이티 2
 
1.1%
㈜에스앤디 2
 
1.1%
㈜풍림푸드 2
 
1.1%
㈜투에이취켐 2
 
1.1%
증평지점 1
 
0.6%
Other values (154) 154
88.5%
2024-01-10T06:15:13.308626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
150
 
15.6%
40
 
4.2%
40
 
4.2%
28
 
2.9%
17
 
1.8%
14
 
1.5%
14
 
1.5%
13
 
1.4%
13
 
1.4%
12
 
1.2%
Other values (193) 619
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 787
82.0%
Other Symbol 150
 
15.6%
Space Separator 9
 
0.9%
Close Punctuation 6
 
0.6%
Open Punctuation 5
 
0.5%
Uppercase Letter 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
5.1%
40
 
5.1%
28
 
3.6%
17
 
2.2%
14
 
1.8%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.5%
12
 
1.5%
Other values (186) 584
74.2%
Uppercase Letter
ValueCountFrequency (%)
C 1
33.3%
P 1
33.3%
O 1
33.3%
Other Symbol
ValueCountFrequency (%)
150
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 937
97.6%
Common 20
 
2.1%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
150
 
16.0%
40
 
4.3%
40
 
4.3%
28
 
3.0%
17
 
1.8%
14
 
1.5%
14
 
1.5%
13
 
1.4%
13
 
1.4%
12
 
1.3%
Other values (187) 596
63.6%
Common
ValueCountFrequency (%)
9
45.0%
) 6
30.0%
( 5
25.0%
Latin
ValueCountFrequency (%)
C 1
33.3%
P 1
33.3%
O 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 787
82.0%
None 150
 
15.6%
ASCII 23
 
2.4%

Most frequent character per block

None
ValueCountFrequency (%)
150
100.0%
Hangul
ValueCountFrequency (%)
40
 
5.1%
40
 
5.1%
28
 
3.6%
17
 
2.2%
14
 
1.8%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.5%
12
 
1.5%
Other values (186) 584
74.2%
ASCII
ValueCountFrequency (%)
9
39.1%
) 6
26.1%
( 5
21.7%
C 1
 
4.3%
P 1
 
4.3%
O 1
 
4.3%
Distinct152
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-10T06:15:13.624182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length3
Mean length3.4484848
Min length2

Characters and Unicode

Total characters569
Distinct characters127
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique140 ?
Unique (%)84.8%

Sample

1st row김근배
2nd row신희증
3rd row심문구
4th row백성천
5th row김경희
ValueCountFrequency (%)
신동용 3
 
1.7%
김태훈 2
 
1.1%
이병구 2
 
1.1%
성문규 2
 
1.1%
이재진 2
 
1.1%
정연현 2
 
1.1%
김기환 2
 
1.1%
여경목 2
 
1.1%
이병욱 2
 
1.1%
진경복 2
 
1.1%
Other values (150) 153
87.9%
2024-01-10T06:15:14.025362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
5.8%
32
 
5.6%
20
 
3.5%
4 16
 
2.8%
12
 
2.1%
11
 
1.9%
11
 
1.9%
11
 
1.9%
11
 
1.9%
11
 
1.9%
Other values (117) 401
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 519
91.2%
Other Punctuation 24
 
4.2%
Decimal Number 17
 
3.0%
Space Separator 9
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
6.4%
32
 
6.2%
20
 
3.9%
12
 
2.3%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
Other values (111) 356
68.6%
Other Punctuation
ValueCountFrequency (%)
; 8
33.3%
# 8
33.3%
& 8
33.3%
Decimal Number
ValueCountFrequency (%)
4 16
94.1%
5 1
 
5.9%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 519
91.2%
Common 50
 
8.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
6.4%
32
 
6.2%
20
 
3.9%
12
 
2.3%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
Other values (111) 356
68.6%
Common
ValueCountFrequency (%)
4 16
32.0%
9
18.0%
; 8
16.0%
# 8
16.0%
& 8
16.0%
5 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 519
91.2%
ASCII 50
 
8.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33
 
6.4%
32
 
6.2%
20
 
3.9%
12
 
2.3%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
11
 
2.1%
Other values (111) 356
68.6%
ASCII
ValueCountFrequency (%)
4 16
32.0%
9
18.0%
; 8
16.0%
# 8
16.0%
& 8
16.0%
5 1
 
2.0%
Distinct160
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-01-10T06:15:14.298707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length18
Mean length10.29697
Min length2

Characters and Unicode

Total characters1699
Distinct characters287
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)93.9%

Sample

1st row자동차용 파이프
2nd row자동차 부품
3rd row핫도그, 탕수육
4th row손소독제 등 화장품
5th row철판가공, 중장비
ValueCountFrequency (%)
16
 
4.4%
제조업 16
 
4.4%
기타 10
 
2.8%
화장품 8
 
2.2%
7
 
1.9%
부품 6
 
1.7%
반도체 5
 
1.4%
가공 4
 
1.1%
플라스틱 4
 
1.1%
자동차 4
 
1.1%
Other values (249) 280
77.8%
2024-01-10T06:15:14.714135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
195
 
11.5%
4 90
 
5.3%
51
 
3.0%
48
 
2.8%
& 45
 
2.6%
; 45
 
2.6%
# 45
 
2.6%
39
 
2.3%
29
 
1.7%
28
 
1.6%
Other values (277) 1084
63.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1209
71.2%
Space Separator 195
 
11.5%
Other Punctuation 137
 
8.1%
Decimal Number 92
 
5.4%
Uppercase Letter 49
 
2.9%
Lowercase Letter 7
 
0.4%
Close Punctuation 5
 
0.3%
Open Punctuation 5
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
4.2%
48
 
4.0%
39
 
3.2%
29
 
2.4%
28
 
2.3%
27
 
2.2%
24
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.7%
Other values (245) 899
74.4%
Uppercase Letter
ValueCountFrequency (%)
S 5
10.2%
C 5
10.2%
P 5
10.2%
E 5
10.2%
T 5
10.2%
D 4
8.2%
L 4
8.2%
I 3
 
6.1%
W 3
 
6.1%
G 2
 
4.1%
Other values (7) 8
16.3%
Other Punctuation
ValueCountFrequency (%)
& 45
32.8%
; 45
32.8%
# 45
32.8%
/ 1
 
0.7%
· 1
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
s 2
28.6%
l 2
28.6%
a 1
14.3%
c 1
14.3%
d 1
14.3%
Decimal Number
ValueCountFrequency (%)
4 90
97.8%
2 2
 
2.2%
Space Separator
ValueCountFrequency (%)
195
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1209
71.2%
Common 434
 
25.5%
Latin 56
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
4.2%
48
 
4.0%
39
 
3.2%
29
 
2.4%
28
 
2.3%
27
 
2.2%
24
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.7%
Other values (245) 899
74.4%
Latin
ValueCountFrequency (%)
S 5
 
8.9%
C 5
 
8.9%
P 5
 
8.9%
E 5
 
8.9%
T 5
 
8.9%
D 4
 
7.1%
L 4
 
7.1%
I 3
 
5.4%
W 3
 
5.4%
s 2
 
3.6%
Other values (12) 15
26.8%
Common
ValueCountFrequency (%)
195
44.9%
4 90
20.7%
& 45
 
10.4%
; 45
 
10.4%
# 45
 
10.4%
) 5
 
1.2%
( 5
 
1.2%
2 2
 
0.5%
/ 1
 
0.2%
· 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1209
71.2%
ASCII 489
28.8%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
195
39.9%
4 90
18.4%
& 45
 
9.2%
; 45
 
9.2%
# 45
 
9.2%
) 5
 
1.0%
S 5
 
1.0%
( 5
 
1.0%
C 5
 
1.0%
P 5
 
1.0%
Other values (21) 44
 
9.0%
Hangul
ValueCountFrequency (%)
51
 
4.2%
48
 
4.0%
39
 
3.2%
29
 
2.4%
28
 
2.3%
27
 
2.2%
24
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.7%
Other values (245) 899
74.4%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct14
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum2010-10-21 09:20:12
Maximum2023-09-12 09:00:00
2024-01-10T06:15:14.819967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:15:14.912752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing165
Missing (%)100.0%
Memory size1.6 KiB

Correlations

2024-01-10T06:15:14.985284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군인증 기간
시군1.0000.195
인증 기간0.1951.000

Missing values

2024-01-10T06:15:12.670398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:15:12.748878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군기업체명대표자주요 생산품 업종인증 기간비고
0음성㈜금아플로우김근배자동차용 파이프2010-10-21 09:20:12<NA>
1진천㈜)동하정밀신희증자동차 부품2010-10-21 09:20:12<NA>
2제천씨엔에스푸드시스템심문구핫도그&#44; 탕수육2010-10-21 09:20:12<NA>
3음성㈜엘시시백성천손소독제 등 화장품2010-10-21 09:20:12<NA>
4충주㈜엠테크김경희철판가공&#44; 중장비2010-10-21 09:20:12<NA>
5청원㈜옵토팩김덕훈휴대폰 이미지 센서패키지2010-10-21 09:20:12<NA>
6보은㈜진미유 민김치2010-10-21 09:20:12<NA>
7청주㈜지디김명선LCD Glass2011-10-25 09:20:13<NA>
8진천㈜제니스월드남배송lcd&#44; LED반도체 장비부품2011-10-25 09:20:13<NA>
9진천㈜백산OPC김상화토너 레이져복사기 부품2011-10-25 09:20:13<NA>
시군기업체명대표자주요 생산품 업종인증 기간비고
155진천주식회사 우리델리카이우주기타 식사용 가공처리 조리식품 제오업2023-09-12 09:00:00<NA>
156진천청남공조김진홍산업용 송풍기 및 배기장치 제조업2023-09-12 09:00:00<NA>
157청주나손사이언스㈜박래리종물리&#44; 화학 및 생물학 연구개발업2023-09-12 09:00:00<NA>
158청주㈜신화아이티홍원희&#44; 손준혁그 외 기타 전자부품 제조업2023-09-12 09:00:00<NA>
159청주주식회사 제이에이치씨김병선접착제 및 젤라틴 제조업2023-09-12 09:00:00<NA>
160음성㈜송아퍼니처유태근주방용 및 음식점용 목재가구 제조업2023-09-12 09:00:00<NA>
161충주농업회사법인㈜비전레드이대로기타 발효주 제조업2023-09-12 09:00:00<NA>
162청주㈜에스엠오산박민규의학 및 약학 연구개발업2023-09-12 09:00:00<NA>
163음성㈜제론텍김진수플라스틱 적층&#44; 도포 및 기타표면 처리 제품 제조업2023-09-12 09:00:00<NA>
164음성㈜태우권상대그 외 기타 분류 안된 화학제품 제조업2023-09-12 09:00:00<NA>